; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0224971 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0224971
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:15479652..15480356
RNA-Seq ExpressionCmc08g0224971
SyntenyCmc08g0224971
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032794.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.7e-8683.17Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----
        EFIVMSFGL NA TVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM+SFLGHVVSKAGVSVDP KIEAVTSWPRPSTVSEVRSFLGL    
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----

Query:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA
                        LTRK APFVWSKACEDSFQNLK+KLVTAPV TVPDGSG+F+IY DASKKGL CVLMQQGKVV YASRQLKSH QNYPTHD+ELA
Subjt:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA

Query:  AVVFALKI
        AVVFALKI
Subjt:  AVVFALKI

KAA0035816.1 pol protein [Cucumis melo var. makuwa]1.1e-8582.69Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----
        EFIVMSFGL NA  VFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEH+EHLR++SFLGHVVSKAGVSVDP KIEAVT W RPSTVSEVRSFLGL    
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----

Query:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA
                        LTRK APFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG+FVIYSDASKKGL CVLMQQGKVV YASRQLKSH QNYPTHD+ELA
Subjt:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA

Query:  AVVFALKI
        AVVFALKI
Subjt:  AVVFALKI

KAA0036553.1 pol protein [Cucumis melo var. makuwa]1.4e-8582.69Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----
        EFIVMSFGL NA  VFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR++SFLGHVVSKAGVSVDP KIEAVT W RPSTVSE RSFLGL    
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----

Query:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA
                        LTRK APFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG+FVIYSDASKKGL CVLMQQGKVV YASRQLKSH QNYPTHD+ELA
Subjt:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA

Query:  AVVFALKI
        AVVFALKI
Subjt:  AVVFALKI

KAA0051368.1 pol protein [Cucumis melo var. makuwa]3.7e-8683.17Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----
        EFIVMSFGL NA  VFM+LMNRVFR+FLDTFVIVFIDDILIYSKTEAEHEEHLRM+SFLGHVVSKAGVSVDP KIEAVT W RPSTVSEVRSFLGL    
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----

Query:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA
                        LTRK APFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG+FVIYSDASKKGL CVLMQQGKVV YASRQLKSH QNYPTHD+ELA
Subjt:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA

Query:  AVVFALKI
        AVVFALKI
Subjt:  AVVFALKI

KAA0062719.1 pol protein [Cucumis melo var. makuwa]1.7e-8375.76Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM-----------------------ISFLGHVVSKAGVSVDPTKIEA
        EFIVMSFGL NAL VFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM                       +SFLGHVVSKAGVSVDP KIEA
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM-----------------------ISFLGHVVSKAGVSVDPTKIEA

Query:  VTSWPRPSTVSEVRSFLGL--------------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKV
        VTSW RPSTVSEVRSFLGL                    LTRK APFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG+FVIYSDASKKGL CVL+QQGKV
Subjt:  VTSWPRPSTVSEVRSFLGL--------------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKV

Query:  VTYASRQLKSHVQNYPTHDVELAAVVFALKI
        V YASRQLKSH QNYPTHD+ELAAVVFALKI
Subjt:  VTYASRQLKSHVQNYPTHDVELAAVVFALKI

TrEMBL top hitse value%identityAlignment
A0A5A7T0S7 Reverse transcriptase5.2e-8682.69Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----
        EFIVMSFGL NA  VFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEH+EHLR++SFLGHVVSKAGVSVDP KIEAVT W RPSTVSEVRSFLGL    
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----

Query:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA
                        LTRK APFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG+FVIYSDASKKGL CVLMQQGKVV YASRQLKSH QNYPTHD+ELA
Subjt:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA

Query:  AVVFALKI
        AVVFALKI
Subjt:  AVVFALKI

A0A5A7T0Y9 Reverse transcriptase6.8e-8682.69Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----
        EFIVMSFGL NA  VFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR++SFLGHVVSKAGVSVDP KIEAVT W RPSTVSE RSFLGL    
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----

Query:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA
                        LTRK APFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG+FVIYSDASKKGL CVLMQQGKVV YASRQLKSH QNYPTHD+ELA
Subjt:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA

Query:  AVVFALKI
        AVVFALKI
Subjt:  AVVFALKI

A0A5A7U7V9 Reverse transcriptase1.8e-8683.17Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----
        EFIVMSFGL NA  VFM+LMNRVFR+FLDTFVIVFIDDILIYSKTEAEHEEHLRM+SFLGHVVSKAGVSVDP KIEAVT W RPSTVSEVRSFLGL    
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----

Query:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA
                        LTRK APFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG+FVIYSDASKKGL CVLMQQGKVV YASRQLKSH QNYPTHD+ELA
Subjt:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA

Query:  AVVFALKI
        AVVFALKI
Subjt:  AVVFALKI

A0A5A7VAL8 Pol protein8.3e-8475.76Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM-----------------------ISFLGHVVSKAGVSVDPTKIEA
        EFIVMSFGL NAL VFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM                       +SFLGHVVSKAGVSVDP KIEA
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM-----------------------ISFLGHVVSKAGVSVDPTKIEA

Query:  VTSWPRPSTVSEVRSFLGL--------------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKV
        VTSW RPSTVSEVRSFLGL                    LTRK APFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG+FVIYSDASKKGL CVL+QQGKV
Subjt:  VTSWPRPSTVSEVRSFLGL--------------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKV

Query:  VTYASRQLKSHVQNYPTHDVELAAVVFALKI
        V YASRQLKSH QNYPTHD+ELAAVVFALKI
Subjt:  VTYASRQLKSHVQNYPTHDVELAAVVFALKI

A0A5D3E456 Reverse transcriptase8.0e-8783.17Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----
        EFIVMSFGL NA TVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM+SFLGHVVSKAGVSVDP KIEAVTSWPRPSTVSEVRSFLGL    
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL----

Query:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA
                        LTRK APFVWSKACEDSFQNLK+KLVTAPV TVPDGSG+F+IY DASKKGL CVLMQQGKVV YASRQLKSH QNYPTHD+ELA
Subjt:  ----------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELA

Query:  AVVFALKI
        AVVFALKI
Subjt:  AVVFALKI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.6e-2330.3Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMI-----------------------SFLGHVVSKAGVSVDPTKIEA
        E++ M FGL NA   F   MN + R  L+   +V++DDI+++S +  EH + L ++                       +FLGHV++  G+  +P KIEA
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMI-----------------------SFLGHVVSKAGVSVDPTKIEA

Query:  VTSWPRPSTVSEVRSFLGL---------------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGK
        +  +P P+   E+++FLGL                     L +       +   + +F+ LK  +   P+L VPD +  F + +DAS   L  VL Q G 
Subjt:  VTSWPRPSTVSEVRSFLGL---------------------LTRKEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGK

Query:  VVTYASRQLKSHVQNYPTHDVELAAVVFALK
         ++Y SR L  H  NY T + EL A+V+A K
Subjt:  VVTYASRQLKSHVQNYPTHDVELAAVVFALK

P10401 Retrovirus-related Pol polyprotein from transposon gypsy3.2e-2430.29Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHL-----------------------RMISFLGHVVSKAGVSVDPTKIEA
        EF  + FGL NA ++F   ++ V RE +     V++DD++I+S+ E++H  H+                         + +LG +VSK G   DP K++A
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHL-----------------------RMISFLGHVVSKAGVSVDPTKIEA

Query:  VTSWPRPSTVSEVRSFLGLLT-------------------------------RKEAPFVWSKACEDSFQNLKQKLVTAPV-LTVPDGSGNFVIYSDASKK
        +  +P P  V +VRSFLGL +                                K+ P  +++   ++FQ L+  L +  V L  PD    F + +DAS  
Subjt:  VTSWPRPSTVSEVRSFLGLLT-------------------------------RKEAPFVWSKACEDSFQNLKQKLVTAPV-LTVPDGSGNFVIYSDASKK

Query:  GLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELAAVVFAL
        G+  VL Q+G+ +T  SR LK   QNY T++ EL A+V+AL
Subjt:  GLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELAAVVFAL

P20825 Retrovirus-related Pol polyprotein from transposon 2975.4e-2429.87Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMI-----------------------SFLGHVVSKAGVSVDPTKIEA
        E++ M FGL NA   F   MN + R  L+   +V++DDI+I+S +  EH   ++++                       +FLGH+V+  G+  +P K++A
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMI-----------------------SFLGHVVSKAGVSVDPTKIEA

Query:  VTSWPRPSTVSEVRSFLGLL-------------------TRKEAPFVWSKACE--DSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGK
        + S+P P+   E+R+FLGL                      K+   + ++  E  ++F+ LK  ++  P+L +PD    FV+ +DAS   L  VL Q G 
Subjt:  VTSWPRPSTVSEVRSFLGLL-------------------TRKEAPFVWSKACE--DSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGK

Query:  VVTYASRQLKSHVQNYPTHDVELAAVVFALK
         +++ SR L  H  NY   + EL A+V+A K
Subjt:  VVTYASRQLKSHVQNYPTHDVELAAVVFALK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.9e-1729.18Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMI-----------------------SFLGHVVSKAGVSVDPTKIEA
        E+ VM FGL+NA + F   M   FR+    FV V++DDILI+S++  EH +HL  +                        FLG+ +    ++    K  A
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMI-----------------------SFLGHVVSKAGVSVDPTKIEA

Query:  VTSWPRPSTVSEVRSFLGLLT--RKEAP-----------FV-----WSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGK---
        +  +P P TV + + FLG++   R+  P           F+     W++  + + + LK  L  +PVL   +   N+ + +DASK G+  VL +      
Subjt:  VTSWPRPSTVSEVRSFLGLLT--RKEAP-----------FV-----WSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGK---

Query:  ---VVTYASRQLKSHVQNYPTHDVELAAVVFAL
           VV Y S+ L+S  +NYP  ++EL  ++ AL
Subjt:  ---VVTYASRQLKSHVQNYPTHDVELAAVVFAL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.4e-1825.82Show/hide
Query:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM-----------------------ISFLGHVVSKAGVSVDPTKIEA
        EF+ + FGL NA  +F  +++ + RE +     V+IDDI+++S+    H ++LR+                       + FLG++V+  G+  DP K+ A
Subjt:  EFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM-----------------------ISFLGHVVSKAGVSVDPTKIEA

Query:  VTSWPRPSTVSEVRSFLGL--------------------LTR-----------KEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKG
        ++  P P++V E++ FLG+                    LTR            + P    +    SF +LK  L ++ +L  P  +  F + +DAS   
Subjt:  VTSWPRPSTVSEVRSFLGL--------------------LTR-----------KEAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKG

Query:  LCCVLMQ----QGKVVTYASRQLKSHVQNYPTHDVELAAVVFAL
        +  VL Q    + + + Y SR L    +NY T + E+ A++++L
Subjt:  LCCVLMQ----QGKVVTYASRQLKSHVQNYPTHDVELAAVVFAL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.0e-0936.08Show/hide
Query:  ISFLG--HVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL-----------------LTR--KEAPFVWSKACEDSFQNLKQKLVTAPVLTVPD
        I++LG  H++S  GVS DP K+EA+  WP P   +E+R FLGL                 LT   K+    W++    +F+ LK  + T PVL +PD
Subjt:  ISFLG--HVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGL-----------------LTR--KEAPFVWSKACEDSFQNLKQKLVTAPVLTVPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACACTGAGTTTATTGTGATGTCTTTTGGTTTGATGAATGCTCTAACAGTGTTTATGGAGTTAATGAACAGGGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGT
GTTTATTGATGATATTTTGATATATTCCAAAACAGAGGCAGAGCATGAGGAGCATTTACGCATGATATCCTTTCTAGGCCATGTGGTTTCTAAAGCTGGTGTTTCTGTAG
ATCCAACTAAGATAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTATTGACCAGAAAGGAAGCTCCTTTTGTTTGGAGC
AAGGCCTGTGAGGACAGTTTTCAGAACCTTAAACAGAAACTCGTCACTGCACCGGTTCTTACTGTACCTGATGGTTCAGGGAATTTTGTGATTTACAGTGATGCTTCTAA
GAAAGGTTTGTGTTGCGTTTTGATGCAGCAAGGTAAGGTAGTCACTTATGCTTCTCGTCAGTTGAAGAGTCATGTGCAGAATTACCCGACCCACGATGTAGAGTTGGCAG
CAGTAGTTTTTGCATTGAAGATATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACACTGAGTTTATTGTGATGTCTTTTGGTTTGATGAATGCTCTAACAGTGTTTATGGAGTTAATGAACAGGGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGT
GTTTATTGATGATATTTTGATATATTCCAAAACAGAGGCAGAGCATGAGGAGCATTTACGCATGATATCCTTTCTAGGCCATGTGGTTTCTAAAGCTGGTGTTTCTGTAG
ATCCAACTAAGATAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTATTGACCAGAAAGGAAGCTCCTTTTGTTTGGAGC
AAGGCCTGTGAGGACAGTTTTCAGAACCTTAAACAGAAACTCGTCACTGCACCGGTTCTTACTGTACCTGATGGTTCAGGGAATTTTGTGATTTACAGTGATGCTTCTAA
GAAAGGTTTGTGTTGCGTTTTGATGCAGCAAGGTAAGGTAGTCACTTATGCTTCTCGTCAGTTGAAGAGTCATGTGCAGAATTACCCGACCCACGATGTAGAGTTGGCAG
CAGTAGTTTTTGCATTGAAGATATGA
Protein sequenceShow/hide protein sequence
MDTEFIVMSFGLMNALTVFMELMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMISFLGHVVSKAGVSVDPTKIEAVTSWPRPSTVSEVRSFLGLLTRKEAPFVWS
KACEDSFQNLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLCCVLMQQGKVVTYASRQLKSHVQNYPTHDVELAAVVFALKI