CuGenDBv2

Cmc05g0125501 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc05g0125501
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr05:2574021..2575361
RNA-Seq Expression: Cmc05g0125501
Synteny: Cmc05g0125501
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
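
The genome location in this record has the form `seqid:start..end`. As a minimal sketch, the string can be split into its parts as follows. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its parts (hypothetical helper)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("CMiso1.1chr05:2574021..2575361")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene works out to 1341 bp.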


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037220.1 reverse transcriptase [Cucumis melo var. makuwa]; e value: 4.8e-246; % identity: 98.37
Query:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
        MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
Subjt:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL

Query:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
        PKSLPPRRMIDHEIELVPGAK PAKNAYRMA PELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
Subjt:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD

Query:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
        RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCT+MNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
Subjt:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK

Query:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
        ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFL LANYYRRF+EGFSKRASPLTELLKKDVHWNWDPECQ AFDGLK
Subjt:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK

Query:  QALMEGPLLGIVDVTKPFEVETDASDYALG
        QALMEGPLLGI DVTKPFEVETDASDYALG
Subjt:  QALMEGPLLGIVDVTKPFEVETDASDYALG

KAA0052270.1 uncharacterized protein E6C27_scaffold207G00960 [Cucumis melo var. makuwa]; e value: 6.7e-248; % identity: 95.29
Query:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
        MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPL SSENS ETVPKEI+RVLEKYRDVMPDSL
Subjt:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL

Query:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
        PKSLPP+RMIDHEIELVPGAK PAKNAYRMA PELAELRKQLDELLNAGFIRPAKAPYGAPVLFQ+KKDGSLRLCI YRALNKLTVRNKYPLPIIT+LFD
Subjt:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD

Query:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
        RLH AKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCT+MNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
Subjt:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK

Query:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
        ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAI DWAMPKSVSELRSFL LANYYRRF+EGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
Subjt:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK

Query:  QALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRSHTKVEN
        QA++EGPLLGI DVT+PFEVETDASDYALGVCSYRM TR HTKV+N
Subjt:  QALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRSHTKVEN

KAA0057285.1 uncharacterized protein E6C27_scaffold280G001260 [Cucumis melo var. makuwa]; e value: 7.6e-260; % identity: 100
Query:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
        MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
Subjt:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL

Query:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
        PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
Subjt:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD

Query:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
        RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
Subjt:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK

Query:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
        ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
Subjt:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK

Query:  QALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRSHTKVEN
        QALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRSHTKVEN
Subjt:  QALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRSHTKVEN

KAA0063412.1 reverse transcriptase [Cucumis melo var. makuwa]; e value: 4.8e-246; % identity: 98.37
Query:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
        MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
Subjt:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL

Query:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
        PKSLPPRRMIDHEIELVPGAK PAKNAYRMA PELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
Subjt:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD

Query:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
        RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCT+MNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
Subjt:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK

Query:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
        ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFL LANYYRRF+EGFSKRASPLTELLKKDVHWNWDPECQ AFDGLK
Subjt:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK

Query:  QALMEGPLLGIVDVTKPFEVETDASDYALG
        QALMEGPLLGI DVTKPFEVETDASDYALG
Subjt:  QALMEGPLLGIVDVTKPFEVETDASDYALG

KAA0067557.1 reverse transcriptase [Cucumis melo var. makuwa]; e value: 4.8e-246; % identity: 98.37
Query:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
        MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
Subjt:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL

Query:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
        PKSLPPRRMIDHEIELVPGAK PAKNAYRMA PELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
Subjt:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD

Query:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
        RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCT+MNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
Subjt:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK

Query:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
        ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFL LANYYRRF+EGFSKRASPLTELLKKDVHWNWDPECQ AFDGLK
Subjt:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK

Query:  QALMEGPLLGIVDVTKPFEVETDASDYALG
        QALMEGPLLGI DVTKPFEVETDASDYALG
Subjt:  QALMEGPLLGIVDVTKPFEVETDASDYALG
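
In the pairwise alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows how a percent identity could be recomputed from the displayed blocks. The function names are hypothetical. It assumes each midline is aligned column for column with its Query line, and it may not reproduce how the database itself derives the reported value.

```python
from typing import Iterable, NamedTuple, Tuple


class MidlineStats(NamedTuple):
    identities: int  # columns where query and subject residues are identical
    positives: int   # identities plus conservative substitutions ('+')
    columns: int     # alignment columns in this block


def midline_stats(query_line: str, midline: str) -> MidlineStats:
    """Count identities and positives for one alignment block."""
    # Pad the midline in case trailing spaces were stripped during extraction.
    midline = midline.ljust(len(query_line))[: len(query_line)]
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return MidlineStats(identities, identities + conservative, len(query_line))


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Percent identity over several (query_line, midline) blocks."""
    stats = [midline_stats(q, m) for q, m in blocks]
    return 100.0 * sum(s.identities for s in stats) / sum(s.columns for s in stats)


# Final block of the KAA0067557.1 alignment shown above.
last_block = ("QALMEGPLLGIVDVTKPFEVETDASDYALG", "QALMEGPLLGI DVTKPFEVETDASDYALG")
print(midline_stats(*last_block))
print(round(percent_identity([last_block]), 2))
```

To cover a whole hit, pass every Query/midline pair of that hit to `percent_identity` rather than the single block used here.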

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UAP7 Reverse transcriptase domain-containing protein; e value: 3.2e-248; % identity: 95.29
Query:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
        MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPL SSENS ETVPKEI+RVLEKYRDVMPDSL
Subjt:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL

Query:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
        PKSLPP+RMIDHEIELVPGAK PAKNAYRMA PELAELRKQLDELLNAGFIRPAKAPYGAPVLFQ+KKDGSLRLCI YRALNKLTVRNKYPLPIIT+LFD
Subjt:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD

Query:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
        RLH AKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCT+MNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
Subjt:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK

Query:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
        ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAI DWAMPKSVSELRSFL LANYYRRF+EGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
Subjt:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK

Query:  QALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRSHTKVEN
        QA++EGPLLGI DVT+PFEVETDASDYALGVCSYRM TR HTKV+N
Subjt:  QALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRSHTKVEN

A0A5A7UUR0 Reverse transcriptase domain-containing protein; e value: 3.7e-260; % identity: 100
Query:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
        MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
Subjt:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL

Query:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
        PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
Subjt:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD

Query:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
        RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
Subjt:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK

Query:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
        ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
Subjt:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK

Query:  QALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRSHTKVEN
        QALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRSHTKVEN
Subjt:  QALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRSHTKVEN

A0A5A7UXR6 Reverse transcriptase; e value: 2.3e-246; % identity: 98.37
Query:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
        MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
Subjt:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL

Query:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
        PKSLPPRRMIDHEIELVPGAK PAKNAYRMA PELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
Subjt:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD

Query:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
        RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCT+MNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
Subjt:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK

Query:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
        ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFL LANYYRRF+EGFSKRASPLTELLKKDVHWNWDPECQ AFDGLK
Subjt:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK

Query:  QALMEGPLLGIVDVTKPFEVETDASDYALG
        QALMEGPLLGI DVTKPFEVETDASDYALG
Subjt:  QALMEGPLLGIVDVTKPFEVETDASDYALG

A0A5D3BRZ6 Reverse transcriptase; e value: 2.3e-246; % identity: 98.37
Query:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
        MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
Subjt:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL

Query:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
        PKSLPPRRMIDHEIELVPGAK PAKNAYRMA PELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
Subjt:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD

Query:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
        RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCT+MNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
Subjt:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK

Query:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
        ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFL LANYYRRF+EGFSKRASPLTELLKKDVHWNWDPECQ AFDGLK
Subjt:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK

Query:  QALMEGPLLGIVDVTKPFEVETDASDYALG
        QALMEGPLLGI DVTKPFEVETDASDYALG
Subjt:  QALMEGPLLGIVDVTKPFEVETDASDYALG

A0A5D3C4R1 Reverse transcriptase; e value: 2.3e-246; % identity: 98.37
Query:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
        MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL
Subjt:  MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSL

Query:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
        PKSLPPRRMIDHEIELVPGAK PAKNAYRMA PELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD
Subjt:  PKSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFD

Query:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
        RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCT+MNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK
Subjt:  RLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLK

Query:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK
        ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFL LANYYRRF+EGFSKRASPLTELLKKDVHWNWDPECQ AFDGLK
Subjt:  ENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLK

Query:  QALMEGPLLGIVDVTKPFEVETDASDYALG
        QALMEGPLLGI DVTKPFEVETDASDYALG
Subjt:  QALMEGPLLGIVDVTKPFEVETDASDYALG

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein; e value: 4.6e-66; % identity: 37.71
Query:  EIMRVLEKYRDVMPDSLPKSLP-PRRMIDHEIELV-PGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRAL
        E+  + ++++D+  ++  + LP P + ++ E+EL     + P +N Y +   ++  +  ++++ L +G IR +KA    PV+F  KK+G+LR+ +DY+ L
Subjt:  EIMRVLEKYRDVMPDSLPKSLP-PRRMIDHEIELV-PGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRAL

Query:  NKLTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  F+KLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N +  E  +  VV Y+DDI+++
Subjt:  NKLTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVY

Query:  STTMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLG-HVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELL
        S +  EH  H++ V QKLK   L + + KC F Q ++ F+G H+ E G    +E  I  +  W  PK+  ELR FL   NY R+FI   S+   PL  LL
Subjt:  STTMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLG-HVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELL

Query:  KKDVHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALG
        KKDV W W P    A + +KQ L+  P+L   D +K   +ETDASD A+G
Subjt:  KKDVHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALG

P0CT35 Transposon Tf2-2 polyprotein; e value: 4.6e-66; % identity: 37.71
Query:  EIMRVLEKYRDVMPDSLPKSLP-PRRMIDHEIELV-PGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRAL
        E+  + ++++D+  ++  + LP P + ++ E+EL     + P +N Y +   ++  +  ++++ L +G IR +KA    PV+F  KK+G+LR+ +DY+ L
Subjt:  EIMRVLEKYRDVMPDSLPKSLP-PRRMIDHEIELV-PGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRAL

Query:  NKLTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  F+KLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N +  E  +  VV Y+DDI+++
Subjt:  NKLTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVY

Query:  STTMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLG-HVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELL
        S +  EH  H++ V QKLK   L + + KC F Q ++ F+G H+ E G    +E  I  +  W  PK+  ELR FL   NY R+FI   S+   PL  LL
Subjt:  STTMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLG-HVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELL

Query:  KKDVHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALG
        KKDV W W P    A + +KQ L+  P+L   D +K   +ETDASD A+G
Subjt:  KKDVHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALG

P0CT41 Transposon Tf2-12 polyprotein; e value: 4.6e-66; % identity: 37.71
Query:  EIMRVLEKYRDVMPDSLPKSLP-PRRMIDHEIELV-PGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRAL
        E+  + ++++D+  ++  + LP P + ++ E+EL     + P +N Y +   ++  +  ++++ L +G IR +KA    PV+F  KK+G+LR+ +DY+ L
Subjt:  EIMRVLEKYRDVMPDSLPKSLP-PRRMIDHEIELV-PGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRAL

Query:  NKLTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  F+KLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N +  E  +  VV Y+DDI+++
Subjt:  NKLTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVY

Query:  STTMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLG-HVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELL
        S +  EH  H++ V QKLK   L + + KC F Q ++ F+G H+ E G    +E  I  +  W  PK+  ELR FL   NY R+FI   S+   PL  LL
Subjt:  STTMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLG-HVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELL

Query:  KKDVHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALG
        KKDV W W P    A + +KQ L+  P+L   D +K   +ETDASD A+G
Subjt:  KKDVHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALG

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein; e value: 2.4e-67; % identity: 40.63
Query:  EKYRDVMPDSLP------KSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNK
        +KYR+++ + LP       ++P    + H+IE+ PGA+ P    Y +      E+ K + +LL+  FI P+K+P  +PV+   KKDG+ RLC+DYR LNK
Subjt:  EKYRDVMPDSLP------KSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNK

Query:  LTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYST
         T+ + +PLP I +L  R+  A+ F+ LDL SGY+Q+ +   D  KT  VT  G +E+ VMPFGL NAP+TF   M   F +   +FV VYLDDI+++S 
Subjt:  LTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYST

Query:  TMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKD
        + EEH  HL  V ++LK   L VK++KC FA E   FLG+ I   +I   + K AAIRD+  PK+V + + FL + NYYRRFI   SK A P+ +L   D
Subjt:  TMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKD

Query:  VHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALG
            W  +   A + LK AL   P+L   +    + + TDAS   +G
Subjt:  VHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALG

Q99315 Transposon Ty3-G Gag-Pol polyprotein; e value: 6.4e-68; % identity: 40.92
Query:  EKYRDVMPDSLP------KSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNK
        +KYR+++ + LP       ++P    + H+IE+ PGA+ P    Y +      E+ K + +LL+  FI P+K+P  +PV+   KKDG+ RLC+DYR LNK
Subjt:  EKYRDVMPDSLP------KSLPPRRMIDHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNK

Query:  LTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYST
         T+ + +PLP I +L  R+  A+ F+ LDL SGY+Q+ +   D  KT  VT  G +E+ VMPFGL NAP+TF   M   F +   +FV VYLDDI+++S 
Subjt:  LTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYST

Query:  TMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKD
        + EEH  HL  V ++LK   L VK++KC FA E   FLG+ I   +I   + K AAIRD+  PK+V + + FL + NYYRRFI   SK A P+ +L   D
Subjt:  TMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKD

Query:  VHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALG
            W  +   A D LK AL   P+L   +    + + TDAS   +G
Subjt:  VHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALG

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein; e value: 4.6e-21; % identity: 39.69
Query:  DHLQKVFQKLKENQLYVKREKCSFAQERINFLG--HVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWN
        +HL  V Q  +++Q Y  R+KC+F Q +I +LG  H+I    +  +  K+ A+  W  PK+ +ELR FL L  YYRRF++ + K   PLTELLKK+    
Subjt:  DHLQKVFQKLKENQLYVKREKCSFAQERINFLG--HVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWN

Query:  WDPECQAAFDGLKQALMEGPLLGIVDVTKPF
        W      AF  LK A+   P+L + D+  PF
Subjt:  WDPECQAAFDGLKQALMEGPLLGIVDVTKPF


Sequences
CDS sequence
ATGGACGACTTTGATGTGGTACTGGGAATGGAGTTCCTACTTGAACATCAGGTAATCCCAATGCCTTTGGCCAAATGCTTGGTGATCACTGGACCTACACCCTCGGTTGT
ACAGACTGACCTACGTCAACCAGATGGATTGAAAATGATCTCGGCCATGCAATTAAAGAAGGGTCTCTCTCGAGACGAACCAACGTTTATGGCCATCCCACTCAAATCGT
CAGAGAACTCAGGGGAGACAGTCCCTAAGGAGATCATGCGCGTGCTAGAGAAATACCGTGATGTGATGCCCGATAGTTTGCCCAAGTCTTTGCCACCTCGGAGAATGATT
GATCATGAGATCGAGTTAGTGCCAGGGGCAAAATCGCCTGCGAAGAATGCTTATCGTATGGCGCTTCCGGAGTTAGCTGAACTTCGAAAACAGTTAGATGAACTACTGAA
TGCAGGGTTTATCAGGCCTGCAAAAGCTCCGTATGGGGCCCCAGTTCTTTTCCAAAGGAAGAAAGATGGGAGTTTACGACTGTGCATTGATTATCGCGCCCTAAATAAGC
TCACAGTCCGTAACAAGTATCCACTTCCCATAATTACTGACTTGTTCGACCGTTTACATGGGGCAAAGTATTTTTCAAAGTTAGACTTGCGGTCGGGATACTACCAAGTG
AGAATTGCAGAGGGAGATGAACCGAAGACAACCTGTGTCACCCGATATGGTGCGTTCGAATTCCTTGTAATGCCATTTGGTCTCACCAATGCCCCTGCCACCTTCTGCAC
GATGATGAACCAGGTCTTCCACGAATATCTCGATAAATTCGTAGTAGTCTACCTGGATGATATAGTGGTCTATAGTACGACCATGGAGGAACATAGGGACCACCTACAAA
AGGTTTTTCAGAAATTGAAGGAGAATCAACTGTACGTCAAAAGAGAAAAATGCTCTTTTGCACAAGAGCGGATAAACTTCTTGGGCCATGTGATAGAGTGTGGCCGAATT
GGAATGGAAGAAGGGAAGATTGCTGCGATACGCGACTGGGCAATGCCGAAATCAGTCTCAGAGTTACGCTCCTTCCTCCGGTTGGCAAATTACTATCGTCGATTTATCGA
GGGATTCTCGAAACGAGCAAGCCCGCTGACTGAGCTACTGAAAAAAGACGTTCACTGGAATTGGGACCCCGAGTGCCAAGCCGCCTTCGACGGCCTAAAGCAAGCCTTGA
TGGAGGGGCCACTTCTAGGGATTGTGGATGTGACCAAACCTTTCGAAGTCGAGACAGATGCGTCTGATTATGCGTTGGGGGTGTGCTCCTACAGAATGGGCACCCGATCG
CATACGAAAGTCGAAAATTGA
mRNA sequence
ATGGACGACTTTGATGTGGTACTGGGAATGGAGTTCCTACTTGAACATCAGGTAATCCCAATGCCTTTGGCCAAATGCTTGGTGATCACTGGACCTACACCCTCGGTTGT
ACAGACTGACCTACGTCAACCAGATGGATTGAAAATGATCTCGGCCATGCAATTAAAGAAGGGTCTCTCTCGAGACGAACCAACGTTTATGGCCATCCCACTCAAATCGT
CAGAGAACTCAGGGGAGACAGTCCCTAAGGAGATCATGCGCGTGCTAGAGAAATACCGTGATGTGATGCCCGATAGTTTGCCCAAGTCTTTGCCACCTCGGAGAATGATT
GATCATGAGATCGAGTTAGTGCCAGGGGCAAAATCGCCTGCGAAGAATGCTTATCGTATGGCGCTTCCGGAGTTAGCTGAACTTCGAAAACAGTTAGATGAACTACTGAA
TGCAGGGTTTATCAGGCCTGCAAAAGCTCCGTATGGGGCCCCAGTTCTTTTCCAAAGGAAGAAAGATGGGAGTTTACGACTGTGCATTGATTATCGCGCCCTAAATAAGC
TCACAGTCCGTAACAAGTATCCACTTCCCATAATTACTGACTTGTTCGACCGTTTACATGGGGCAAAGTATTTTTCAAAGTTAGACTTGCGGTCGGGATACTACCAAGTG
AGAATTGCAGAGGGAGATGAACCGAAGACAACCTGTGTCACCCGATATGGTGCGTTCGAATTCCTTGTAATGCCATTTGGTCTCACCAATGCCCCTGCCACCTTCTGCAC
GATGATGAACCAGGTCTTCCACGAATATCTCGATAAATTCGTAGTAGTCTACCTGGATGATATAGTGGTCTATAGTACGACCATGGAGGAACATAGGGACCACCTACAAA
AGGTTTTTCAGAAATTGAAGGAGAATCAACTGTACGTCAAAAGAGAAAAATGCTCTTTTGCACAAGAGCGGATAAACTTCTTGGGCCATGTGATAGAGTGTGGCCGAATT
GGAATGGAAGAAGGGAAGATTGCTGCGATACGCGACTGGGCAATGCCGAAATCAGTCTCAGAGTTACGCTCCTTCCTCCGGTTGGCAAATTACTATCGTCGATTTATCGA
GGGATTCTCGAAACGAGCAAGCCCGCTGACTGAGCTACTGAAAAAAGACGTTCACTGGAATTGGGACCCCGAGTGCCAAGCCGCCTTCGACGGCCTAAAGCAAGCCTTGA
TGGAGGGGCCACTTCTAGGGATTGTGGATGTGACCAAACCTTTCGAAGTCGAGACAGATGCGTCTGATTATGCGTTGGGGGTGTGCTCCTACAGAATGGGCACCCGATCG
CATACGAAAGTCGAAAATTGA
Protein sequence
MDDFDVVLGMEFLLEHQVIPMPLAKCLVITGPTPSVVQTDLRQPDGLKMISAMQLKKGLSRDEPTFMAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSLPKSLPPRRMI
DHEIELVPGAKSPAKNAYRMALPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQV
RIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTMMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLKENQLYVKREKCSFAQERINFLGHVIECGRI
GMEEGKIAAIRDWAMPKSVSELRSFLRLANYYRRFIEGFSKRASPLTELLKKDVHWNWDPECQAAFDGLKQALMEGPLLGIVDVTKPFEVETDASDYALGVCSYRMGTRS
HTKVEN
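
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed for this nuclear gene.
# Codons are enumerated with bases in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 nucleotides of the CDS listed above.
print(translate("ATGGACGACTTTGATGTGGTACTGGGAATG"))  # MDDFDVVLGM
print(translate("CATACGAAAGTCGAAAATTGA"))           # HTKVEN
```

The two fragments reproduce the start (`MDDFDVVLGM`) and end (`HTKVEN`) of the listed protein. Passing the full CDS, with its lines joined, should give the complete sequence under the same assumption.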