; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0185451 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0185451
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr07:3513417..3514724
RNA-Seq ExpressionCmc07g0185451
SyntenyCmc07g0185451
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]7.5e-19079.54Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MPFGLTNAPAVFMD MNRIF +YLDQFVIVFIDDILVYS+D+++HEEHLRIVLQTLR+KQLYAKFSKCEFWL QVVFLGHVVSA GVSVDPQKV+AVVNW
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT
        ERP SAT                                    EWSDKCEQSFQELKKRLVTAPIL L VT K+YVIYCDASR GLGCVLMQ+  VIAY 
Subjt:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT

Query:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG
        SRQLK+HECNYPTHDLELAAVVLALKIWRHYLF EKCHIFT HKSLKYIFDQKELNLRQR+ LELIKDYDCTIEYHPGKANVV DALSRKSRLPKSAL G
Subjt:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG

Query:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM
        IR +LL+ELRG KAV+T E SGSLLAQF VRSSLV EIV RQ EDSNLQK   K+K+G + EFELRTD AIVKQGRLCVPNISELK AILEEAH+SAYAM
Subjt:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM

Query:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
        HPGSTKMYRTLKKTYWW GMK+EI EYVDRCLICQ
Subjt:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

KAA0050493.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]7.5e-19079.54Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MPFGLTNAPAVFMD MNRIF +YLDQFVIVFIDDILVYS+D+++HEEHLRIVLQTLR+KQLYAKFSKCEFWL QVVFLGHVVSA GVSVDPQKV+AVVNW
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT
        ERP SAT                                    EWSDKCEQSFQELKKRLVTAPIL L VT K+YVIYCDASR GLGCVLMQ+  VIAY 
Subjt:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT

Query:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG
        SRQLK+HECNYPTHDLELAAVVLALKIWRHYLF EKCHIFT HKSLKYIFDQKELNLRQR+ LELIKDYDCTIEYHPGKANVV DALSRKSRLPKSAL G
Subjt:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG

Query:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM
        IR +LL+ELRG KAV+T E SGSLLAQF VRSSLV EIV RQ EDSNLQK   K+K+G + EFELRTD AIVKQGRLCVPNISELK AILEEAH+SAYAM
Subjt:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM

Query:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
        HPGSTKMYRTLKKTYWW GMK+EI EYVDRCLICQ
Subjt:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

KAA0050527.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]7.5e-19079.54Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MPFGLTNAPAVFMD MNRIF +YLDQFVIVFIDDILVYS+D+++HEEHLRIVLQTLR+KQLYAKFSKCEFWL QVVFLGHVVSA GVSVDPQKV+AVVNW
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT
        ERP SAT                                    EWSDKCEQSFQELKKRLVTAPIL L VT K+YVIYCDASR GLGCVLMQ+  VIAY 
Subjt:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT

Query:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG
        SRQLK+HECNYPTHDLELAAVVLALKIWRHYLF EKCHIFT HKSLKYIFDQKELNLRQR+ LELIKDYDCTIEYHPGKANVV DALSRKSRLPKSAL G
Subjt:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG

Query:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM
        IR +LL+ELRG KAV+T E SGSLLAQF VRSSLV EIV RQ EDSNLQK   K+K+G + EFELRTD AIVKQGRLCVPNISELK AILEEAH+SAYAM
Subjt:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM

Query:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
        HPGSTKMYRTLKKTYWW GMK+EI EYVDRCLICQ
Subjt:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

KAA0053322.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]2.7e-19284.9Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MPFGLTNAPAVFMD MNRIF +YLDQFVIVFIDDILVYS+D+++HEEHLRIVLQTLR+KQLYAKFSKCEFWL QVVFLGHVVSA GVSVDP KV+AVVNW
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASAT-----EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHY
        ERP S T     EWSDKCEQSFQELKKRLVTAPIL L VT K+YVIYCDASR GLGCVLMQ+  VIAY SRQLK+HECNYPTHDLELAAVVLALKIWRHY
Subjt:  ERPASAT-----EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHY

Query:  LFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVR
        LF EKCHIFT HKSLKYIFDQKELNLRQR+ LELIKDYDCTIEYHPGKANVV DALSRKSRLPKSAL GIR +LL+ELRG KAV+T E SGSLLAQF VR
Subjt:  LFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVR

Query:  SSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRC
        SSLV EIV RQ EDSNLQK   K+K+G + EFELRTD AIVKQGRLCVPNISELK +ILEEAH+SAYAMHPGSTKMYRTLKKTYWW GMK+EI EYVDRC
Subjt:  SSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRC

Query:  LICQ
        LICQ
Subjt:  LICQ

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]7.5e-19079.54Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MPFGLTNAPAVFMD MNRIF +YLDQFVIVFIDDILVYS+D+++HEEHLRIVLQTLR+KQLYAKFSKCEFWL QVVFLGHVVSA GVSVDPQKV+AVVNW
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT
        ERP SAT                                    EWSDKCEQSFQELKKRLVTAPIL L VT K+YVIYCDASR GLGCVLMQ+  VIAY 
Subjt:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT

Query:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG
        SRQLK+HECNYPTHDLELAAVVLALKIWRHYLF EKCHIFT HKSLKYIFDQKELNLRQR+ LELIKDYDCTIEYHPGKANVV DALSRKSRLPKSAL G
Subjt:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG

Query:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM
        IR +LL+ELRG KAV+T E SGSLLAQF VRSSLV EIV RQ EDSNLQK   K+K+G + EFELRTD AIVKQGRLCVPNISELK AILEEAH+SAYAM
Subjt:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM

Query:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
        HPGSTKMYRTLKKTYWW GMK+EI EYVDRCLICQ
Subjt:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

TrEMBL top hitse value%identityAlignment
A0A5A7T1Y5 Reverse transcriptase3.6e-19079.54Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MPFGLTNAPAVFMD MNRIF +YLDQFVIVFIDDILVYS+D+++HEEHLRIVLQTLR+KQLYAKFSKCEFWL QVVFLGHVVSA GVSVDPQKV+AVVNW
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT
        ERP SAT                                    EWSDKCEQSFQELKKRLVTAPIL L VT K+YVIYCDASR GLGCVLMQ+  VIAY 
Subjt:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT

Query:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG
        SRQLK+HECNYPTHDLELAAVVLALKIWRHYLF EKCHIFT HKSLKYIFDQKELNLRQR+ LELIKDYDCTIEYHPGKANVV DALSRKSRLPKSAL G
Subjt:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG

Query:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM
        IR +LL+ELRG KAV+T E SGSLLAQF VRSSLV EIV RQ EDSNLQK   K+K+G + EFELRTD AIVKQGRLCVPNISELK AILEEAH+SAYAM
Subjt:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM

Query:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
        HPGSTKMYRTLKKTYWW GMK+EI EYVDRCLICQ
Subjt:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

A0A5A7U2V7 Reverse transcriptase3.6e-19079.54Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MPFGLTNAPAVFMD MNRIF +YLDQFVIVFIDDILVYS+D+++HEEHLRIVLQTLR+KQLYAKFSKCEFWL QVVFLGHVVSA GVSVDPQKV+AVVNW
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT
        ERP SAT                                    EWSDKCEQSFQELKKRLVTAPIL L VT K+YVIYCDASR GLGCVLMQ+  VIAY 
Subjt:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT

Query:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG
        SRQLK+HECNYPTHDLELAAVVLALKIWRHYLF EKCHIFT HKSLKYIFDQKELNLRQR+ LELIKDYDCTIEYHPGKANVV DALSRKSRLPKSAL G
Subjt:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG

Query:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM
        IR +LL+ELRG KAV+T E SGSLLAQF VRSSLV EIV RQ EDSNLQK   K+K+G + EFELRTD AIVKQGRLCVPNISELK AILEEAH+SAYAM
Subjt:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM

Query:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
        HPGSTKMYRTLKKTYWW GMK+EI EYVDRCLICQ
Subjt:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

A0A5A7UC49 Reverse transcriptase1.3e-19284.9Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MPFGLTNAPAVFMD MNRIF +YLDQFVIVFIDDILVYS+D+++HEEHLRIVLQTLR+KQLYAKFSKCEFWL QVVFLGHVVSA GVSVDP KV+AVVNW
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASAT-----EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHY
        ERP S T     EWSDKCEQSFQELKKRLVTAPIL L VT K+YVIYCDASR GLGCVLMQ+  VIAY SRQLK+HECNYPTHDLELAAVVLALKIWRHY
Subjt:  ERPASAT-----EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHY

Query:  LFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVR
        LF EKCHIFT HKSLKYIFDQKELNLRQR+ LELIKDYDCTIEYHPGKANVV DALSRKSRLPKSAL GIR +LL+ELRG KAV+T E SGSLLAQF VR
Subjt:  LFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVR

Query:  SSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRC
        SSLV EIV RQ EDSNLQK   K+K+G + EFELRTD AIVKQGRLCVPNISELK +ILEEAH+SAYAMHPGSTKMYRTLKKTYWW GMK+EI EYVDRC
Subjt:  SSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRC

Query:  LICQ
        LICQ
Subjt:  LICQ

A0A5A7UUL6 Reverse transcriptase3.6e-19079.54Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MPFGLTNAPAVFMD MNRIF +YLDQFVIVFIDDILVYS+D+++HEEHLRIVLQTLR+KQLYAKFSKCEFWL QVVFLGHVVSA GVSVDPQKV+AVVNW
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT
        ERP SAT                                    EWSDKCEQSFQELKKRLVTAPIL L VT K+YVIYCDASR GLGCVLMQ+  VIAY 
Subjt:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT

Query:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG
        SRQLK+HECNYPTHDLELAAVVLALKIWRHYLF EKCHIFT HKSLKYIFDQKELNLRQR+ LELIKDYDCTIEYHPGKANVV DALSRKSRLPKSAL G
Subjt:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG

Query:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM
        IR +LL+ELRG KAV+T E SGSLLAQF VRSSLV EIV RQ EDSNLQK   K+K+G + EFELRTD AIVKQGRLCVPNISELK AILEEAH+SAYAM
Subjt:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM

Query:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
        HPGSTKMYRTLKKTYWW GMK+EI EYVDRCLICQ
Subjt:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

A0A5D3BHI1 Reverse transcriptase3.6e-19079.54Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MPFGLTNAPAVFMD MNRIF +YLDQFVIVFIDDILVYS+D+++HEEHLRIVLQTLR+KQLYAKFSKCEFWL QVVFLGHVVSA GVSVDPQKV+AVVNW
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT
        ERP SAT                                    EWSDKCEQSFQELKKRLVTAPIL L VT K+YVIYCDASR GLGCVLMQ+  VIAY 
Subjt:  ERPASAT------------------------------------EWSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYT

Query:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG
        SRQLK+HECNYPTHDLELAAVVLALKIWRHYLF EKCHIFT HKSLKYIFDQKELNLRQR+ LELIKDYDCTIEYHPGKANVV DALSRKSRLPKSAL G
Subjt:  SRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYG

Query:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM
        IR +LL+ELRG KAV+T E SGSLLAQF VRSSLV EIV RQ EDSNLQK   K+K+G + EFELRTD AIVKQGRLCVPNISELK AILEEAH+SAYAM
Subjt:  IRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKGAILEEAHNSAYAM

Query:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
        HPGSTKMYRTLKKTYWW GMK+EI EYVDRCLICQ
Subjt:  HPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein7.8e-4125.22Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MP+G++ APA F   +N I  +  +  V+ ++DDIL++S  +  H +H++ VLQ L++  L    +KCEF  +QV F+G+ +S  G +   + +  V+ W
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASATE------------------------------------WSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQ-----EVK
        ++P +  E                                    W+    Q+ + +K+ LV+ P+L      K+ ++  DAS   +G VL Q     +  
Subjt:  ERPASATE------------------------------------WSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQ-----EVK

Query:  VIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFS--EKCHIFTYHKSL--KYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSR--
         + Y S ++ K + NY   D E+ A++ +LK WRHYL S  E   I T H++L  +   + +  N R  +    ++D++  I Y PG AN + DALSR  
Subjt:  VIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFS--EKCHIFTYHKSL--KYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSR--

Query:  --KSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKG
             +PK                     + ++S + + Q  +      ++V     D+ L  +L    +  +   +L+    I  + ++ +PN ++L  
Subjt:  --KSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKG

Query:  AILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
         I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ
Subjt:  AILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

P0CT35 Transposon Tf2-2 polyprotein7.8e-4125.22Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MP+G++ APA F   +N I  +  +  V+ ++DDIL++S  +  H +H++ VLQ L++  L    +KCEF  +QV F+G+ +S  G +   + +  V+ W
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASATE------------------------------------WSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQ-----EVK
        ++P +  E                                    W+    Q+ + +K+ LV+ P+L      K+ ++  DAS   +G VL Q     +  
Subjt:  ERPASATE------------------------------------WSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQ-----EVK

Query:  VIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFS--EKCHIFTYHKSL--KYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSR--
         + Y S ++ K + NY   D E+ A++ +LK WRHYL S  E   I T H++L  +   + +  N R  +    ++D++  I Y PG AN + DALSR  
Subjt:  VIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFS--EKCHIFTYHKSL--KYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSR--

Query:  --KSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKG
             +PK                     + ++S + + Q  +      ++V     D+ L  +L    +  +   +L+    I  + ++ +PN ++L  
Subjt:  --KSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKG

Query:  AILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
         I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ
Subjt:  AILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

P0CT36 Transposon Tf2-3 polyprotein7.8e-4125.22Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MP+G++ APA F   +N I  +  +  V+ ++DDIL++S  +  H +H++ VLQ L++  L    +KCEF  +QV F+G+ +S  G +   + +  V+ W
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASATE------------------------------------WSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQ-----EVK
        ++P +  E                                    W+    Q+ + +K+ LV+ P+L      K+ ++  DAS   +G VL Q     +  
Subjt:  ERPASATE------------------------------------WSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQ-----EVK

Query:  VIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFS--EKCHIFTYHKSL--KYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSR--
         + Y S ++ K + NY   D E+ A++ +LK WRHYL S  E   I T H++L  +   + +  N R  +    ++D++  I Y PG AN + DALSR  
Subjt:  VIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFS--EKCHIFTYHKSL--KYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSR--

Query:  --KSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKG
             +PK                     + ++S + + Q  +      ++V     D+ L  +L    +  +   +L+    I  + ++ +PN ++L  
Subjt:  --KSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKG

Query:  AILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
         I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ
Subjt:  AILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

P0CT37 Transposon Tf2-4 polyprotein7.8e-4125.22Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MP+G++ APA F   +N I  +  +  V+ ++DDIL++S  +  H +H++ VLQ L++  L    +KCEF  +QV F+G+ +S  G +   + +  V+ W
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASATE------------------------------------WSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQ-----EVK
        ++P +  E                                    W+    Q+ + +K+ LV+ P+L      K+ ++  DAS   +G VL Q     +  
Subjt:  ERPASATE------------------------------------WSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQ-----EVK

Query:  VIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFS--EKCHIFTYHKSL--KYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSR--
         + Y S ++ K + NY   D E+ A++ +LK WRHYL S  E   I T H++L  +   + +  N R  +    ++D++  I Y PG AN + DALSR  
Subjt:  VIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFS--EKCHIFTYHKSL--KYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSR--

Query:  --KSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKG
             +PK                     + ++S + + Q  +      ++V     D+ L  +L    +  +   +L+    I  + ++ +PN ++L  
Subjt:  --KSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKG

Query:  AILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
         I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ
Subjt:  AILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

P0CT41 Transposon Tf2-12 polyprotein7.8e-4125.22Show/hide
Query:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW
        MP+G++ APA F   +N I  +  +  V+ ++DDIL++S  +  H +H++ VLQ L++  L    +KCEF  +QV F+G+ +S  G +   + +  V+ W
Subjt:  MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNW

Query:  ERPASATE------------------------------------WSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQ-----EVK
        ++P +  E                                    W+    Q+ + +K+ LV+ P+L      K+ ++  DAS   +G VL Q     +  
Subjt:  ERPASATE------------------------------------WSDKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQ-----EVK

Query:  VIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFS--EKCHIFTYHKSL--KYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSR--
         + Y S ++ K + NY   D E+ A++ +LK WRHYL S  E   I T H++L  +   + +  N R  +    ++D++  I Y PG AN + DALSR  
Subjt:  VIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFS--EKCHIFTYHKSL--KYIFDQKELNLRQRQCLELIKDYDCTIEYHPGKANVVEDALSR--

Query:  --KSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKG
             +PK                     + ++S + + Q  +      ++V     D+ L  +L    +  +   +L+    I  + ++ +PN ++L  
Subjt:  --KSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELRTDRAIVKQGRLCVPNISELKG

Query:  AILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ
         I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ
Subjt:  AILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.8e-0844.44Show/hide
Query:  HLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLG--HVVSAGGVSVDPQKVKAVVNWERPASATE
        HL +VLQ     Q YA   KC F   Q+ +LG  H++S  GVS DP K++A+V W  P + TE
Subjt:  HLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLG--HVVSAGGVSVDPQKVKAVVNWERPASATE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTCGGTTTAACGAATGCGCCAGCAGTTTTCATGGACCCCATGAACAGGATCTTCCAACAGTATTTAGATCAATTTGTGATAGTGTTCATCGATGACATACTAGT
TTACTCAATGGACAAAAAAGCCCATGAGGAACATCTGAGGATTGTTCTACAAACACTGCGTGATAAACAATTATACGCCAAGTTCAGCAAATGTGAGTTCTGGTTGAATC
AAGTGGTGTTCTTAGGGCATGTGGTTTCAGCGGGCGGAGTTAGTGTTGATCCGCAGAAAGTGAAAGCTGTTGTCAATTGGGAGAGACCAGCCAGTGCAACAGAGTGGTCG
GATAAGTGTGAACAAAGTTTTCAGGAGCTGAAGAAGAGATTGGTGACAGCACCTATTCTGCCACTTCTTGTAACAGAAAAGGAGTATGTGATCTATTGTGATGCTTCGAG
ACAAGGATTGGGTTGTGTGCTCATGCAGGAAGTGAAAGTAATAGCTTATACTTCAAGGCAGTTGAAGAAGCATGAGTGTAATTACCCTACCCATGATCTTGAGCTAGCAG
CAGTTGTTCTAGCACTGAAGATTTGGAGACATTATTTATTTAGCGAGAAGTGTCACATTTTCACATATCATAAAAGTCTGAAGTACATCTTTGATCAGAAAGAGCTAAAT
CTAAGACAGAGGCAATGTCTAGAACTAATCAAAGACTATGATTGTACCATAGAATATCATCCTGGTAAGGCTAACGTGGTAGAAGATGCATTAAGTAGGAAGTCGAGACT
TCCAAAGAGTGCCTTGTATGGTATTCGAGCAAGCTTGCTAAGTGAGTTAAGAGGCTTTAAAGCAGTTATGACTGCAGAAAGCTCAGGGAGTCTTTTAGCTCAATTTCATG
TTAGGTCTTCCTTAGTAGCAGAGATTGTAGGAAGACAGCTAGAGGATAGTAATTTGCAGAAGATGCTTGCAAAGGCCAAGCAAGGCCCAAAGGCAGAATTTGAGTTGAGA
ACGGACAGAGCCATAGTTAAGCAGGGAAGACTATGTGTTCCGAATATTAGTGAGCTTAAGGGTGCTATACTAGAAGAAGCTCACAATTCAGCTTATGCTATGCATCCAGG
AAGCACCAAGATGTATAGAACTCTGAAGAAGACTTATTGGTGGCCTGGTATGAAGCGAGAGATAGTTGAATATGTCGATAGATGTTTGATCTGTCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCATTCGGTTTAACGAATGCGCCAGCAGTTTTCATGGACCCCATGAACAGGATCTTCCAACAGTATTTAGATCAATTTGTGATAGTGTTCATCGATGACATACTAGT
TTACTCAATGGACAAAAAAGCCCATGAGGAACATCTGAGGATTGTTCTACAAACACTGCGTGATAAACAATTATACGCCAAGTTCAGCAAATGTGAGTTCTGGTTGAATC
AAGTGGTGTTCTTAGGGCATGTGGTTTCAGCGGGCGGAGTTAGTGTTGATCCGCAGAAAGTGAAAGCTGTTGTCAATTGGGAGAGACCAGCCAGTGCAACAGAGTGGTCG
GATAAGTGTGAACAAAGTTTTCAGGAGCTGAAGAAGAGATTGGTGACAGCACCTATTCTGCCACTTCTTGTAACAGAAAAGGAGTATGTGATCTATTGTGATGCTTCGAG
ACAAGGATTGGGTTGTGTGCTCATGCAGGAAGTGAAAGTAATAGCTTATACTTCAAGGCAGTTGAAGAAGCATGAGTGTAATTACCCTACCCATGATCTTGAGCTAGCAG
CAGTTGTTCTAGCACTGAAGATTTGGAGACATTATTTATTTAGCGAGAAGTGTCACATTTTCACATATCATAAAAGTCTGAAGTACATCTTTGATCAGAAAGAGCTAAAT
CTAAGACAGAGGCAATGTCTAGAACTAATCAAAGACTATGATTGTACCATAGAATATCATCCTGGTAAGGCTAACGTGGTAGAAGATGCATTAAGTAGGAAGTCGAGACT
TCCAAAGAGTGCCTTGTATGGTATTCGAGCAAGCTTGCTAAGTGAGTTAAGAGGCTTTAAAGCAGTTATGACTGCAGAAAGCTCAGGGAGTCTTTTAGCTCAATTTCATG
TTAGGTCTTCCTTAGTAGCAGAGATTGTAGGAAGACAGCTAGAGGATAGTAATTTGCAGAAGATGCTTGCAAAGGCCAAGCAAGGCCCAAAGGCAGAATTTGAGTTGAGA
ACGGACAGAGCCATAGTTAAGCAGGGAAGACTATGTGTTCCGAATATTAGTGAGCTTAAGGGTGCTATACTAGAAGAAGCTCACAATTCAGCTTATGCTATGCATCCAGG
AAGCACCAAGATGTATAGAACTCTGAAGAAGACTTATTGGTGGCCTGGTATGAAGCGAGAGATAGTTGAATATGTCGATAGATGTTTGATCTGTCAATAG
Protein sequenceShow/hide protein sequence
MPFGLTNAPAVFMDPMNRIFQQYLDQFVIVFIDDILVYSMDKKAHEEHLRIVLQTLRDKQLYAKFSKCEFWLNQVVFLGHVVSAGGVSVDPQKVKAVVNWERPASATEWS
DKCEQSFQELKKRLVTAPILPLLVTEKEYVIYCDASRQGLGCVLMQEVKVIAYTSRQLKKHECNYPTHDLELAAVVLALKIWRHYLFSEKCHIFTYHKSLKYIFDQKELN
LRQRQCLELIKDYDCTIEYHPGKANVVEDALSRKSRLPKSALYGIRASLLSELRGFKAVMTAESSGSLLAQFHVRSSLVAEIVGRQLEDSNLQKMLAKAKQGPKAEFELR
TDRAIVKQGRLCVPNISELKGAILEEAHNSAYAMHPGSTKMYRTLKKTYWWPGMKREIVEYVDRCLICQ