; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0014683 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0014683
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionProtein FAR-RED ELONGATED HYPOCOTYL 3-like
Genome locationchr06:21954861..21955912
RNA-Seq ExpressionPay0014683
SyntenyPay0014683
Gene Ontology termsNA
InterPro domainsIPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039149.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Cucumis melo var. makuwa]3.0e-10664.33Show/hide
Query:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI
        MSKVDSNFALKVQ+MF++K LFKQC+Q I LRDNFQYV VKSNKEVMILQCAIEN KWSLRASCCIHGDRSLW+LT                 RQ TF +
Subjt:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI

Query:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC
        I DLIKNKIS  GS+LST K+IV FI  EHGLSISYQKAW A + ALDDIRGS +DSYKMLP+FAYIL+LNN G VVEYKVDA+ RFLY FMAL ASI  
Subjt:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC

Query:  WQHC----------HPNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR
        WQHC            NKYGGTLL++ST DAN QIFPLAF VVDS+NDS WTWF NQ+KRIIGG+NEVV VS+RHK                        
Subjt:  WQHC----------HPNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR

Query:  NLKLKNKRIVDIVFHACGKVFNIVDFKY
          +  + R+VD +FHAC K FNIV+FK+
Subjt:  NLKLKNKRIVDIVFHACGKVFNIVDFKY

XP_008449188.1 PREDICTED: uncharacterized protein LOC103491139 [Cucumis melo]2.4e-10070.59Show/hide
Query:  LFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILTRQTTFTIINDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQ
        L KQCVQ I LRDNFQYVTVKSNKEVMILQ     Y  SL     I  D  L    RQT F +INDL KNKIS VGS+LS  KDIV FI +EH LSISYQ
Subjt:  LFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILTRQTTFTIINDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQ

Query:  KAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFCWQHCHP--------NKYGGTLLTTSTPDANDQIFPLA
        KAW  REAA DD  GS +DSYKMLP+FA+IL+LNNPG VVEYKVDAN RFLYLFM L  SI  WQHC P         K  GTL + STPDANDQIF LA
Subjt:  KAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFCWQHCHP--------NKYGGTLLTTSTPDANDQIFPLA

Query:  FYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLRNLKLKNKRIVDIVFHACGKVFNIVDFKY
        F VVD ENDS WT FCNQLKRIIGG+NEVV VSD HKSICKVIEVVF NVLHCMCLVH LRNLKLK KRIVD VFH C K FNI+DF++
Subjt:  FYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLRNLKLKNKRIVDIVFHACGKVFNIVDFKY

XP_008452162.1 PREDICTED: uncharacterized protein LOC103493267 [Cucumis melo]7.1e-10062.5Show/hide
Query:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI
        MSKVDSNFALKVQ+MF+TKSLFK+CVQVIALRDNFQYVT+KSN EVMILQC IEN KWSL ASC I+GDRS+W+LT                 RQ TFT+
Subjt:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI

Query:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC
        I DLIKNKI   GS+LST KDIV FI  EHGLSI YQKAW AR AALDDIR               +L+      V+EYKVDA GRFLY FM L ASI  
Subjt:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC

Query:  WQHCH----------PNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR
         QHC            NKY GTLL+ STPD+ND IFPLAF VVDSENDS WTWFCNQLKRIIG +N+VV VSDRH+SICK I                 +
Subjt:  WQHCH----------PNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR

Query:  NLKLKNKRIVDIVFHACGKVFNIVDFKY
        NLKLK KRIVD VFH+CGK FNIVDF++
Subjt:  NLKLKNKRIVDIVFHACGKVFNIVDFKY

XP_008461035.1 PREDICTED: uncharacterized protein LOC103499740 [Cucumis melo]2.9e-10162.2Show/hide
Query:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI
        MSKVDS F+LKVQ+MF+TKSL K+CVQVIALRDNFQYVTVKSNKEVMILQCAIEN KWSLRASC IH DRS+W+LT                 +Q TFT+
Subjt:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI

Query:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC
        I DLIKNKI  VGSELST KDIV FI  EH LS                                         +V+EYKVDA+GRFLY FM L ASI  
Subjt:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC

Query:  WQHCHP----------NKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR
        WQHC P          NKYGGTLL+ STPD+NDQIF LAF VVDSENDS WTWFCNQLKRII GQN+VV VSDRH+SICK IEVVF NVLHC+CLV+ L+
Subjt:  WQHCHP----------NKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR

Query:  NLKLKNKRIVDIVFHACGKVFNIVDFKY
        NLKLK KRI+D +FH+C K FNIVDF++
Subjt:  NLKLKNKRIVDIVFHACGKVFNIVDFKY

XP_008463110.1 PREDICTED: uncharacterized protein LOC103501336 [Cucumis melo]6.4e-10171.38Show/hide
Query:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI
        MSKVDSNFALKVQ+MF++K LFKQC+Q I LRDNFQYV VKSNKEVMILQCAIEN KWSLRASCCIHGDRSLW+LT                 RQ TF +
Subjt:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI

Query:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC
        I DLIKNKIS  GS+LST K+IV FI  EHGLSISYQKAW A + ALDDIRGS +DSYKMLP+FAYIL+LNN G VVEYKVDA+ RFLY FMAL ASI  
Subjt:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC

Query:  WQHC----------HPNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHK
        WQHC            NKYGGTLL++ST DAN QIFPLAF VVDS+NDS WTWF NQ+KRIIGG+NEVV VS+RHK
Subjt:  WQHC----------HPNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHK

TrEMBL top hitse value%identityAlignment
A0A1S3BM35 uncharacterized protein LOC1034911391.2e-10070.59Show/hide
Query:  LFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILTRQTTFTIINDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQ
        L KQCVQ I LRDNFQYVTVKSNKEVMILQ     Y  SL     I  D  L    RQT F +INDL KNKIS VGS+LS  KDIV FI +EH LSISYQ
Subjt:  LFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILTRQTTFTIINDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQ

Query:  KAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFCWQHCHP--------NKYGGTLLTTSTPDANDQIFPLA
        KAW  REAA DD  GS +DSYKMLP+FA+IL+LNNPG VVEYKVDAN RFLYLFM L  SI  WQHC P         K  GTL + STPDANDQIF LA
Subjt:  KAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFCWQHCHP--------NKYGGTLLTTSTPDANDQIFPLA

Query:  FYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLRNLKLKNKRIVDIVFHACGKVFNIVDFKY
        F VVD ENDS WT FCNQLKRIIGG+NEVV VSD HKSICKVIEVVF NVLHCMCLVH LRNLKLK KRIVD VFH C K FNI+DF++
Subjt:  FYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLRNLKLKNKRIVDIVFHACGKVFNIVDFKY

A0A1S3BT85 uncharacterized protein LOC1034932673.4e-10062.5Show/hide
Query:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI
        MSKVDSNFALKVQ+MF+TKSLFK+CVQVIALRDNFQYVT+KSN EVMILQC IEN KWSL ASC I+GDRS+W+LT                 RQ TFT+
Subjt:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI

Query:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC
        I DLIKNKI   GS+LST KDIV FI  EHGLSI YQKAW AR AALDDIR               +L+      V+EYKVDA GRFLY FM L ASI  
Subjt:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC

Query:  WQHCH----------PNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR
         QHC            NKY GTLL+ STPD+ND IFPLAF VVDSENDS WTWFCNQLKRIIG +N+VV VSDRH+SICK I                 +
Subjt:  WQHCH----------PNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR

Query:  NLKLKNKRIVDIVFHACGKVFNIVDFKY
        NLKLK KRIVD VFH+CGK FNIVDF++
Subjt:  NLKLKNKRIVDIVFHACGKVFNIVDFKY

A0A1S3CF11 uncharacterized protein LOC1034997401.4e-10162.2Show/hide
Query:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI
        MSKVDS F+LKVQ+MF+TKSL K+CVQVIALRDNFQYVTVKSNKEVMILQCAIEN KWSLRASC IH DRS+W+LT                 +Q TFT+
Subjt:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI

Query:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC
        I DLIKNKI  VGSELST KDIV FI  EH LS                                         +V+EYKVDA+GRFLY FM L ASI  
Subjt:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC

Query:  WQHCHP----------NKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR
        WQHC P          NKYGGTLL+ STPD+NDQIF LAF VVDSENDS WTWFCNQLKRII GQN+VV VSDRH+SICK IEVVF NVLHC+CLV+ L+
Subjt:  WQHCHP----------NKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR

Query:  NLKLKNKRIVDIVFHACGKVFNIVDFKY
        NLKLK KRI+D +FH+C K FNIVDF++
Subjt:  NLKLKNKRIVDIVFHACGKVFNIVDFKY

A0A1S3CIV7 uncharacterized protein LOC1035013363.1e-10171.38Show/hide
Query:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI
        MSKVDSNFALKVQ+MF++K LFKQC+Q I LRDNFQYV VKSNKEVMILQCAIEN KWSLRASCCIHGDRSLW+LT                 RQ TF +
Subjt:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI

Query:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC
        I DLIKNKIS  GS+LST K+IV FI  EHGLSISYQKAW A + ALDDIRGS +DSYKMLP+FAYIL+LNN G VVEYKVDA+ RFLY FMAL ASI  
Subjt:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC

Query:  WQHC----------HPNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHK
        WQHC            NKYGGTLL++ST DAN QIFPLAF VVDS+NDS WTWF NQ+KRIIGG+NEVV VS+RHK
Subjt:  WQHC----------HPNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHK

A0A5A7TCZ3 Protein FAR-RED ELONGATED HYPOCOTYL 3-like1.4e-10664.33Show/hide
Query:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI
        MSKVDSNFALKVQ+MF++K LFKQC+Q I LRDNFQYV VKSNKEVMILQCAIEN KWSLRASCCIHGDRSLW+LT                 RQ TF +
Subjt:  MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILT-----------------RQTTFTI

Query:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC
        I DLIKNKIS  GS+LST K+IV FI  EHGLSISYQKAW A + ALDDIRGS +DSYKMLP+FAYIL+LNN G VVEYKVDA+ RFLY FMAL ASI  
Subjt:  INDLIKNKISSVGSELSTLKDIVPFIHIEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFC

Query:  WQHC----------HPNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR
        WQHC            NKYGGTLL++ST DAN QIFPLAF VVDS+NDS WTWF NQ+KRIIGG+NEVV VS+RHK                        
Subjt:  WQHC----------HPNKYGGTLLTTSTPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLR

Query:  NLKLKNKRIVDIVFHACGKVFNIVDFKY
          +  + R+VD +FHAC K FNIV+FK+
Subjt:  NLKLKNKRIVDIVFHACGKVFNIVDFKY

SwissProt top hitse value%identityAlignment
P19775 Transposase for insertion sequence element IS256 in transposon Tn40017.5e-0436Show/hide
Query:  TPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLRNL
        T D + +I  + F +   E++  WT F   LK   G Q   + +SD HK +   I   F+NV    C VHFLRN+
Subjt:  TPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLRNL

P59787 Transposase for insertion sequence element IS256 in transposon Tn40017.5e-0436Show/hide
Query:  TPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLRNL
        T D + +I  + F +   E++  WT F   LK   G Q   + +SD HK +   I   F+NV    C VHFLRN+
Subjt:  TPDANDQIFPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLRNL

Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase4.2e-1027.33Show/hide
Query:  AREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDA------NGRFLYLFMALYASIFCWQHCHP----------NKYGGTLLTTSTPDANDQI
        A+  A+    G    S++++PK   +L  +N G +V+++ D+      +  F  LF A   SI  +QHC P           KY   L+  S  DA +Q 
Subjt:  AREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDA------NGRFLYLFMALYASIFCWQHCHP----------NKYGGTLLTTSTPDANDQI

Query:  FPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSN-----VLHCMCLVHFLRNL
        FPLAF V    +   W WF  +++  +  +  +  +S     I  VI    S        H  CL H    L
Subjt:  FPLAFYVVDSENDSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSN-----VLHCMCLVHFLRNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAAGGTAGATTCTAATTTTGCATTGAAAGTCCAGAATATGTTTGCAACCAAATCTTTGTTCAAACAGTGCGTACAAGTCATTGCTCTTAGAGACAATTTTCAATA
TGTTACTGTGAAATCAAACAAGGAAGTTATGATTTTGCAATGCGCCATTGAAAATTACAAATGGTCCTTGCGTGCTTCTTGTTGTATTCATGGGGATAGATCATTATGGA
TTCTTACTAGACAAACCACCTTTACTATTATCAATGACCTAATTAAGAACAAAATATCTTCGGTTGGTTCTGAATTGTCAACTCTCAAAGATATAGTTCCTTTCATTCAT
ATTGAACATGGTTTAAGTATATCCTACCAAAAAGCATGGCATGCTCGTGAGGCTGCGTTGGATGATATTCGCGGATCTCCAAAGGACTCCTACAAGATGCTACCTAAATT
TGCATACATATTGAAACTCAACAACCCAGGTTTTGTTGTTGAATACAAAGTTGATGCCAATGGTAGATTTCTTTACTTATTCATGGCATTATATGCTTCCATCTTTTGTT
GGCAACATTGTCATCCAAATAAGTACGGTGGTACTTTGTTAACTACTTCAACACCTGATGCCAATGATCAAATTTTTCCACTAGCCTTTTATGTTGTCGATTCTGAGAAT
GATTCCTTATGGACCTGGTTTTGCAACCAACTGAAGAGAATTATAGGTGGCCAGAATGAGGTTGTCAAAGTATCTGATAGACATAAAAGTATATGCAAAGTCATAGAAGT
AGTATTTTCCAACGTTTTACATTGCATGTGCCTTGTTCATTTTCTTAGGAACCTAAAGTTGAAGAATAAAAGAATAGTCGATATTGTATTTCATGCATGTGGGAAGGTAT
TTAATATTGTAGACTTCAAATACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAAGGTAGATTCTAATTTTGCATTGAAAGTCCAGAATATGTTTGCAACCAAATCTTTGTTCAAACAGTGCGTACAAGTCATTGCTCTTAGAGACAATTTTCAATA
TGTTACTGTGAAATCAAACAAGGAAGTTATGATTTTGCAATGCGCCATTGAAAATTACAAATGGTCCTTGCGTGCTTCTTGTTGTATTCATGGGGATAGATCATTATGGA
TTCTTACTAGACAAACCACCTTTACTATTATCAATGACCTAATTAAGAACAAAATATCTTCGGTTGGTTCTGAATTGTCAACTCTCAAAGATATAGTTCCTTTCATTCAT
ATTGAACATGGTTTAAGTATATCCTACCAAAAAGCATGGCATGCTCGTGAGGCTGCGTTGGATGATATTCGCGGATCTCCAAAGGACTCCTACAAGATGCTACCTAAATT
TGCATACATATTGAAACTCAACAACCCAGGTTTTGTTGTTGAATACAAAGTTGATGCCAATGGTAGATTTCTTTACTTATTCATGGCATTATATGCTTCCATCTTTTGTT
GGCAACATTGTCATCCAAATAAGTACGGTGGTACTTTGTTAACTACTTCAACACCTGATGCCAATGATCAAATTTTTCCACTAGCCTTTTATGTTGTCGATTCTGAGAAT
GATTCCTTATGGACCTGGTTTTGCAACCAACTGAAGAGAATTATAGGTGGCCAGAATGAGGTTGTCAAAGTATCTGATAGACATAAAAGTATATGCAAAGTCATAGAAGT
AGTATTTTCCAACGTTTTACATTGCATGTGCCTTGTTCATTTTCTTAGGAACCTAAAGTTGAAGAATAAAAGAATAGTCGATATTGTATTTCATGCATGTGGGAAGGTAT
TTAATATTGTAGACTTCAAATACTAA
Protein sequenceShow/hide protein sequence
MSKVDSNFALKVQNMFATKSLFKQCVQVIALRDNFQYVTVKSNKEVMILQCAIENYKWSLRASCCIHGDRSLWILTRQTTFTIINDLIKNKISSVGSELSTLKDIVPFIH
IEHGLSISYQKAWHAREAALDDIRGSPKDSYKMLPKFAYILKLNNPGFVVEYKVDANGRFLYLFMALYASIFCWQHCHPNKYGGTLLTTSTPDANDQIFPLAFYVVDSEN
DSLWTWFCNQLKRIIGGQNEVVKVSDRHKSICKVIEVVFSNVLHCMCLVHFLRNLKLKNKRIVDIVFHACGKVFNIVDFKY