; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0026124 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0026124
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionRNA-directed DNA polymerase-like protein
Genome locationchr04:6719511..6727639
RNA-Seq ExpressionIVF0026124
SyntenyIVF0026124
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR025312 - Domain of unknown function DUF4216
IPR025452 - Domain of unknown function DUF4218
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038923.1 RNA-directed DNA polymerase-like protein [Cucumis melo var. makuwa]7.53e-18164.32Show/hide
Query:  MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEKCFDV-
        MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEKC D  
Subjt:  MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEKCFDV-

Query:  ------------------------------MNYFEE-----------------------------------------------------------RKRAP
                                      MNY +                                                            R+RAP
Subjt:  ------------------------------MNYFEE-----------------------------------------------------------RKRAP

Query:  AMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPNDLTNLPPNREIEFTVKLLSGTAPTSQAPY
        AMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPNDLTNLPPNREIEFTVKLLSGTAPTSQAPY
Subjt:  AMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPNDLTNLPPNREIEFTVKLLSGTAPTSQAPY

Query:  KMARSELKELKVQLQKLVDKGYIRHNVLRRRAPHFISFLHHFPSKLSEYFEERVKEEEEEEFRGLRLKKIGKSSCNCQPLVVLFGLSYLEGTKRPRGNRP
        KMARSELKELKVQLQKLVDKGYIRHNVLRRRA                                                    G    E    PRGNRP
Subjt:  KMARSELKELKVQLQKLVDKGYIRHNVLRRRAPHFISFLHHFPSKLSEYFEERVKEEEEEEFRGLRLKKIGKSSCNCQPLVVLFGLSYLEGTKRPRGNRP

Query:  SLQTGKLDKTVIFPLLTLSVLRDNDAVVSDGRAPLRQRRWECRVEIELLVSDTLPMSAESFESNSSTWLELYFESVHVEMFC
        SLQTGKLDKTVIFPLLTLSVLRDNDAVV                EIELLVSDTLPMSAESFESNSSTWLELYFESVHVEMFC
Subjt:  SLQTGKLDKTVIFPLLTLSVLRDNDAVVSDGRAPLRQRRWECRVEIELLVSDTLPMSAESFESNSSTWLELYFESVHVEMFC

KAA0046916.1 transposase [Cucumis melo var. makuwa]1.16e-9353.71Show/hide
Query:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL
        MVHL VHL+REIEFCGP HLRWMYPFERY+KVLK Y RNRNRPE CMVENYIVEE I+F  EFI GVSSIGLNSS+I ++SN+D+ LSASSFIR SKEQL
Subjt:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL

Query:  DQAHLYVIQN------------------------------------------------------------------------------------------
        DQAHLYVIQN                                                                                          
Subjt:  DQAHLYVIQN------------------------------------------------------------------------------------------

Query:  ------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQDSSNPE---------------------------
              EIWEIDYHQL FILFKCDWVDNRSGVK+D+LGFTI+DLKRIGHKS SFIL TQAKQVFYVQDS+NPE                           
Subjt:  ------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQDSSNPE---------------------------

Query:  C-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEK
        C Y TI+RMP+V+TPNETDDT STYIRHDCEGRWVEK
Subjt:  C-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEK

KAA0064110.1 transposase [Cucumis melo var. makuwa]5.68e-8949.19Show/hide
Query:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL
        MVHL VHL+REIEFCGP HLRWMYPFERY+KVLK Y RNRNRPE CMVENYIVEE I+F  EFI GVSSIGLNSS+I ++SN+DR LSASSFIR SKEQL
Subjt:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL

Query:  DQAHLYVIQN------------------------------------------------------------------------------------------
        DQAHLYVIQN                                                                                          
Subjt:  DQAHLYVIQN------------------------------------------------------------------------------------------

Query:  ---------------------------------------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQ
                                               EIWEIDYHQL FILFKCDWVDNRSGVK+D+LGFTI+DLKRIGHKS SFIL TQAKQVFYVQ
Subjt:  ---------------------------------------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQ

Query:  DSSNPE---------------------------C-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEK
        DS+NPE                           C Y TI+RMP+V+TPNETDDT STYIRHDCEGRWVEK
Subjt:  DSSNPE---------------------------C-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEK

KAA0067831.1 transposase [Cucumis melo var. makuwa]1.20e-9247.99Show/hide
Query:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL
        MVHL VHL+REIEFCGP HL WMYPFERY+KVLK Y RNR+RPE CMVENYIVEE I+F  EFI GV+SIGLNSS+I ++SNV+R +SASSFIR SKEQL
Subjt:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL

Query:  DQAHLYVIQN------------------------------------------------------------------------------------------
        DQAHLYVIQN                                                                                          
Subjt:  DQAHLYVIQN------------------------------------------------------------------------------------------

Query:  ---------------------------------------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQ
                                               EIWEIDYHQL FILFKCDWVDNRSGVK+D+LGFTI+DLKRIGHKS SFIL TQAKQVFYVQ
Subjt:  ---------------------------------------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQ

Query:  DSSNPE---------------------------C-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEKGVM
        DS+NPE                           C Y TI+RMP+V+TPNETDDT STYIRHDCEGRWVEK V+
Subjt:  DSSNPE---------------------------C-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEKGVM

TYJ98714.1 RNA-directed DNA polymerase-like protein [Cucumis melo var. makuwa]6.87e-18473.41Show/hide
Query:  MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEK-CFDV
        MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKE  N+  K    +
Subjt:  MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEK-CFDV

Query:  MNYFEE-----------------RKRAPAMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPND
           F                   R+RAPAMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPND
Subjt:  MNYFEE-----------------RKRAPAMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPND

Query:  LTNLPPNREIEFTVKLLSGTAPTSQAPYKMARSELKELKVQLQKLVDKGYIRHNVLRRRAPHFISFLHHFPSKLSEYFEERVKEEEEEEFRGLRLKKIGK
        LTNLPPNREIEFTVKLLSGTAPTSQAPYKMARSELKELKVQLQKLVDKGYIRHNVLRRRA                                        
Subjt:  LTNLPPNREIEFTVKLLSGTAPTSQAPYKMARSELKELKVQLQKLVDKGYIRHNVLRRRAPHFISFLHHFPSKLSEYFEERVKEEEEEEFRGLRLKKIGK

Query:  SSCNCQPLVVLFGLSYLEGTKRPRGNRPSLQTGKLDKTVIFPLLTLSVLRDNDAVVSDGRAPLRQRRWECRVEIELLVSDTLPMSAESFESNSSTWLELY
                    G    E    PRGNRPSLQTGKLDKTVIFPLLTLSVLRDNDAVV                EIELLVSDTLPMSAESFESNSSTWLELY
Subjt:  SSCNCQPLVVLFGLSYLEGTKRPRGNRPSLQTGKLDKTVIFPLLTLSVLRDNDAVVSDGRAPLRQRRWECRVEIELLVSDTLPMSAESFESNSSTWLELY

Query:  FESVHVEMFC
        FESVHVEMFC
Subjt:  FESVHVEMFC

TrEMBL top hitse value%identityAlignment
A0A5A7T7I6 RNA-directed DNA polymerase-like protein9.8e-14764.32Show/hide
Query:  MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEKCFDV-
        MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEKC D  
Subjt:  MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEKCFDV-

Query:  ------------------------------MNYFEE-----------------------------------------------------------RKRAP
                                      MNY +                                                            R+RAP
Subjt:  ------------------------------MNYFEE-----------------------------------------------------------RKRAP

Query:  AMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPNDLTNLPPNREIEFTVKLLSGTAPTSQAPY
        AMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPNDLTNLPPNREIEFTVKLLSGTAPTSQAPY
Subjt:  AMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPNDLTNLPPNREIEFTVKLLSGTAPTSQAPY

Query:  KMARSELKELKVQLQKLVDKGYIRHNVLRRRAPHFISFLHHFPSKLSEYFEERVKEEEEEEFRGLRLKKIGKSSCNCQPLVVLFGLSYLEGTKRPRGNRP
        KMARSELKELKVQLQKLVDKGYIRHNVLRRRA                                                    G    E    PRGNRP
Subjt:  KMARSELKELKVQLQKLVDKGYIRHNVLRRRAPHFISFLHHFPSKLSEYFEERVKEEEEEEFRGLRLKKIGKSSCNCQPLVVLFGLSYLEGTKRPRGNRP

Query:  SLQTGKLDKTVIFPLLTLSVLRDNDAVVSDGRAPLRQRRWECRVEIELLVSDTLPMSAESFESNSSTWLELYFESVHVEMFC
        SLQTGKLDKTVIFPLLTLSVLRDNDAV                VEIELLVSDTLPMSAESFESNSSTWLELYFESVHVEMFC
Subjt:  SLQTGKLDKTVIFPLLTLSVLRDNDAVVSDGRAPLRQRRWECRVEIELLVSDTLPMSAESFESNSSTWLELYFESVHVEMFC

A0A5A7TY60 Transposase2.6e-8353.71Show/hide
Query:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL
        MVHL VHL+REIEFCGP HLRWMYPFERY+KVLK Y RNRNRPE CMVENYIVEE I+F  EFI GVSSIGLNSS+I ++SN+D+ LSASSFIR SKEQL
Subjt:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL

Query:  DQAHLYVIQN------------------------------------------------------------------------------------------
        DQAHLYVIQN                                                                                          
Subjt:  DQAHLYVIQN------------------------------------------------------------------------------------------

Query:  ------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQDSSNP---------------------------E
              EIWEIDYHQL FILFKCDWVDNRSGVK+D+LGFTI+DLKRIGHKS SFIL TQAKQVFYVQDS+NP                           E
Subjt:  ------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQDSSNP---------------------------E

Query:  C-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEK
        C Y TI+RMP+V+TPNETDDT STYIRHDCEGRWVEK
Subjt:  C-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEK

A0A5A7VGQ2 Transposase7.9e-8049.19Show/hide
Query:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL
        MVHL VHL+REIEFCGP HLRWMYPFERY+KVLK Y RNRNRPE CMVENYIVEE I+F  EFI GVSSIGLNSS+I ++SN+DR LSASSFIR SKEQL
Subjt:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL

Query:  DQAHLYVIQN------------------------------------------------------------------------------------------
        DQAHLYVIQN                                                                                          
Subjt:  DQAHLYVIQN------------------------------------------------------------------------------------------

Query:  ---------------------------------------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQ
                                               EIWEIDYHQL FILFKCDWVDNRSGVK+D+LGFTI+DLKRIGHKS SFIL TQAKQVFYVQ
Subjt:  ---------------------------------------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQ

Query:  DSSNP---------------------------EC-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEK
        DS+NP                           EC Y TI+RMP+V+TPNETDDT STYIRHDCEGRWVEK
Subjt:  DSSNP---------------------------EC-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEK

A0A5D3BI39 RNA-directed DNA polymerase-like protein3.1e-14873.41Show/hide
Query:  MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEK-CFDV
        MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKE  N+  K    +
Subjt:  MPPRTSRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEK-CFDV

Query:  MNYFEE-----------------RKRAPAMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPND
           F                   R+RAPAMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPND
Subjt:  MNYFEE-----------------RKRAPAMQSTREFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPND

Query:  LTNLPPNREIEFTVKLLSGTAPTSQAPYKMARSELKELKVQLQKLVDKGYIRHNVLRRRAPHFISFLHHFPSKLSEYFEERVKEEEEEEFRGLRLKKIGK
        LTNLPPNREIEFTVKLLSGTAPTSQAPYKMARSELKELKVQLQKLVDKGYIRHNVLRRRA                                        
Subjt:  LTNLPPNREIEFTVKLLSGTAPTSQAPYKMARSELKELKVQLQKLVDKGYIRHNVLRRRAPHFISFLHHFPSKLSEYFEERVKEEEEEEFRGLRLKKIGK

Query:  SSCNCQPLVVLFGLSYLEGTKRPRGNRPSLQTGKLDKTVIFPLLTLSVLRDNDAVVSDGRAPLRQRRWECRVEIELLVSDTLPMSAESFESNSSTWLELY
                    G    E    PRGNRPSLQTGKLDKTVIFPLLTLSVLRDNDAV                VEIELLVSDTLPMSAESFESNSSTWLELY
Subjt:  SSCNCQPLVVLFGLSYLEGTKRPRGNRPSLQTGKLDKTVIFPLLTLSVLRDNDAVVSDGRAPLRQRRWECRVEIELLVSDTLPMSAESFESNSSTWLELY

Query:  FESVHVEMFC
        FESVHVEMFC
Subjt:  FESVHVEMFC

A0A5D3C2U9 Transposase7.9e-8049.19Show/hide
Query:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL
        MVHL VHL+REIEFCGP HLRWMYPFERY+KVLK Y RNRNRPE CMVENYIVEE I+F  EFI GVSSIGLNSS+I ++SN+DR LSASSFIR SKEQL
Subjt:  MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQL

Query:  DQAHLYVIQN------------------------------------------------------------------------------------------
        DQAHLYVIQN                                                                                          
Subjt:  DQAHLYVIQN------------------------------------------------------------------------------------------

Query:  ---------------------------------------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQ
                                               EIWEIDYHQL FILFKCDWVDNRSGVK+D+LGFTI+DLKRIGHKS SFIL TQAKQVFYVQ
Subjt:  ---------------------------------------EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQ

Query:  DSSNP---------------------------EC-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEK
        DS+NP                           EC Y TI+RMP+V+TPNETDDT STYIRHDCEGRWVEK
Subjt:  DSSNP---------------------------EC-YGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTACACCTTGCAGTGCATCTTATAAGAGAAATTGAGTTTTGTGGACCTGATCATTTAAGGTGGATGTATCCTTTTGAACGATATATAAAAGTCCTAAAAGCTTATGC
ACGAAATAGAAATCGACCAGAAGAATGTATGGTAGAAAATTATATTGTTGAAGAGGTTATAAAGTTCCGTTATGAATTTATTGTTGGAGTTAGTTCTATTGGACTTAACT
CATCTATAATTACAAGGGATTCAAACGTGGATAGAACATTATCGGCTTCTTCTTTTATCAGACTGAGTAAAGAGCAGTTAGATCAAGCTCATCTTTACGTCATTCAAAAT
GAGATATGGGAGATCGATTACCATCAATTATTATTTATTTTATTTAAGTGTGATTGGGTTGACAATAGAAGTGGAGTCAAAATGGACAAGCTTGGGTTTACCATCATGGA
TCTCAAACGTATTGGGCATAAATCAAAGTCATTTATTTTAGTCACTCAAGCAAAACAAGTATTCTATGTCCAAGATTCTTCAAATCCTGAATGTTATGGAACCATTCAAA
GGATGCCCGATGTCAATACACCTAATGAGACTGATGATACAACTTCAACTTATATTAGACATGATTGTGAGGGTAGATGGGTAGAAAAGGGAGTCATGCCACCACGTACT
AGTAGACGACTCAGACAGAATCAAGCCGAGATGAAGGGTCCTACCGAGGGTCAATCTATAGGGAAATTTAGTACCCCAAGAGTTCAGGTTGGGGCGAGAAACAAGCGGTT
TACGAGAACTACAAAGAAGATAGGAAGGCCAAAGAGAGCAGAGCCTAGTAATCCAGAAAAGGCATGCGGAATTAAACAGCTAAGGAAGTTAGGGGCCACAGTGTTTGAGG
GTTCCACAAATCCAACTGATGTGAAGGAGTGGTTGAATATGCTTGAAAAATGTTTTGACGTGATGAATTATTTTGAGGAACGGAAAAGGGCACCAGCAATGCAAAGCACA
AGGGAGTTCTTGGATGACCTAGGAAACAGGAAAGTTCTACGTTATGACTCAATAAGAAGTTTTAGTGCAAGGTATAGGTATGCTAGTGGATTTGCTACCACTAGAGTTAT
AGATGCTAGATTGGTTGAAGTGCAGAGAGAAAAGCGGAAGCCAGAAGATGTTCCTATGGTGAAAGAGTTTCTTGATGTACTTCCAAACGATCTGACAAATTTGCCACCTA
ATAGAGAGATCGAGTTCACTGTTAAATTATTATCAGGGACAGCACCTACTTCACAGGCACCGTACAAAATGGCTCGAAGTGAGCTTAAAGAGCTGAAGGTGCAGTTACAA
AAATTGGTTGACAAGGGATACATTAGGCATAATGTACTGCGTCGGAGAGCGCCCCATTTCATTTCTTTCCTCCATCATTTTCCATCAAAACTTTCAGAGTACTTTGAGGA
GAGAGTGAAGGAAGAAGAAGAAGAAGAATTTCGTGGTTTGAGATTGAAGAAGATAGGAAAAAGTTCATGCAACTGTCAACCTCTGGTTGTTTTGTTCGGTTTAAGCTATC
TAGAAGGAACTAAGAGGCCAAGAGGAAATAGGCCGAGTCTACAAACCGGGAAACTAGATAAGACGGTTATATTTCCGTTGTTGACGTTGAGTGTACTCCGTGACAACGAT
GCTGTCGTGAGTGATGGGCGGGCCCCACTACGACAAAGACGATGGGAGTGTCGGGTAGAGATTGAGCTCCTGGTGTCTGATACACTACCAATGTCTGCTGAAAGTTTCGA
ATCAAACTCCAGTACGTGGTTGGAGTTGTATTTTGAGTCTGTTCATGTTGAAATGTTTTGTAAGGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTACACCTTGCAGTGCATCTTATAAGAGAAATTGAGTTTTGTGGACCTGATCATTTAAGGTGGATGTATCCTTTTGAACGATATATAAAAGTCCTAAAAGCTTATGC
ACGAAATAGAAATCGACCAGAAGAATGTATGGTAGAAAATTATATTGTTGAAGAGGTTATAAAGTTCCGTTATGAATTTATTGTTGGAGTTAGTTCTATTGGACTTAACT
CATCTATAATTACAAGGGATTCAAACGTGGATAGAACATTATCGGCTTCTTCTTTTATCAGACTGAGTAAAGAGCAGTTAGATCAAGCTCATCTTTACGTCATTCAAAAT
GAGATATGGGAGATCGATTACCATCAATTATTATTTATTTTATTTAAGTGTGATTGGGTTGACAATAGAAGTGGAGTCAAAATGGACAAGCTTGGGTTTACCATCATGGA
TCTCAAACGTATTGGGCATAAATCAAAGTCATTTATTTTAGTCACTCAAGCAAAACAAGTATTCTATGTCCAAGATTCTTCAAATCCTGAATGTTATGGAACCATTCAAA
GGATGCCCGATGTCAATACACCTAATGAGACTGATGATACAACTTCAACTTATATTAGACATGATTGTGAGGGTAGATGGGTAGAAAAGGGAGTCATGCCACCACGTACT
AGTAGACGACTCAGACAGAATCAAGCCGAGATGAAGGGTCCTACCGAGGGTCAATCTATAGGGAAATTTAGTACCCCAAGAGTTCAGGTTGGGGCGAGAAACAAGCGGTT
TACGAGAACTACAAAGAAGATAGGAAGGCCAAAGAGAGCAGAGCCTAGTAATCCAGAAAAGGCATGCGGAATTAAACAGCTAAGGAAGTTAGGGGCCACAGTGTTTGAGG
GTTCCACAAATCCAACTGATGTGAAGGAGTGGTTGAATATGCTTGAAAAATGTTTTGACGTGATGAATTATTTTGAGGAACGGAAAAGGGCACCAGCAATGCAAAGCACA
AGGGAGTTCTTGGATGACCTAGGAAACAGGAAAGTTCTACGTTATGACTCAATAAGAAGTTTTAGTGCAAGGTATAGGTATGCTAGTGGATTTGCTACCACTAGAGTTAT
AGATGCTAGATTGGTTGAAGTGCAGAGAGAAAAGCGGAAGCCAGAAGATGTTCCTATGGTGAAAGAGTTTCTTGATGTACTTCCAAACGATCTGACAAATTTGCCACCTA
ATAGAGAGATCGAGTTCACTGTTAAATTATTATCAGGGACAGCACCTACTTCACAGGCACCGTACAAAATGGCTCGAAGTGAGCTTAAAGAGCTGAAGGTGCAGTTACAA
AAATTGGTTGACAAGGGATACATTAGGCATAATGTACTGCGTCGGAGAGCGCCCCATTTCATTTCTTTCCTCCATCATTTTCCATCAAAACTTTCAGAGTACTTTGAGGA
GAGAGTGAAGGAAGAAGAAGAAGAAGAATTTCGTGGTTTGAGATTGAAGAAGATAGGAAAAAGTTCATGCAACTGTCAACCTCTGGTTGTTTTGTTCGGTTTAAGCTATC
TAGAAGGAACTAAGAGGCCAAGAGGAAATAGGCCGAGTCTACAAACCGGGAAACTAGATAAGACGGTTATATTTCCGTTGTTGACGTTGAGTGTACTCCGTGACAACGAT
GCTGTCGTGAGTGATGGGCGGGCCCCACTACGACAAAGACGATGGGAGTGTCGGGTAGAGATTGAGCTCCTGGTGTCTGATACACTACCAATGTCTGCTGAAAGTTTCGA
ATCAAACTCCAGTACGTGGTTGGAGTTGTATTTTGAGTCTGTTCATGTTGAAATGTTTTGTAAGGTTTAA
Protein sequenceShow/hide protein sequence
MVHLAVHLIREIEFCGPDHLRWMYPFERYIKVLKAYARNRNRPEECMVENYIVEEVIKFRYEFIVGVSSIGLNSSIITRDSNVDRTLSASSFIRLSKEQLDQAHLYVIQN
EIWEIDYHQLLFILFKCDWVDNRSGVKMDKLGFTIMDLKRIGHKSKSFILVTQAKQVFYVQDSSNPECYGTIQRMPDVNTPNETDDTTSTYIRHDCEGRWVEKGVMPPRT
SRRLRQNQAEMKGPTEGQSIGKFSTPRVQVGARNKRFTRTTKKIGRPKRAEPSNPEKACGIKQLRKLGATVFEGSTNPTDVKEWLNMLEKCFDVMNYFEERKRAPAMQST
REFLDDLGNRKVLRYDSIRSFSARYRYASGFATTRVIDARLVEVQREKRKPEDVPMVKEFLDVLPNDLTNLPPNREIEFTVKLLSGTAPTSQAPYKMARSELKELKVQLQ
KLVDKGYIRHNVLRRRAPHFISFLHHFPSKLSEYFEERVKEEEEEEFRGLRLKKIGKSSCNCQPLVVLFGLSYLEGTKRPRGNRPSLQTGKLDKTVIFPLLTLSVLRDND
AVVSDGRAPLRQRRWECRVEIELLVSDTLPMSAESFESNSSTWLELYFESVHVEMFCKV