; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0064111 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0064111
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:4669066..4670196
RNA-Seq ExpressionCmc03g0064111
SyntenyCmc03g0064111
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042141.1 hypothetical protein E6C27_scaffold67G006300 [Cucumis melo var. makuwa]2.1e-14379.17Show/hide
Query:  EIVFKRP-GKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDIPMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAP
        E++   P GK+EI FR DRKIL T VI ALKA+KLLRKGC  YLA V+DTQ SKLKLEDIP+VREFPDVFL+ELSGLP D+EIEFSIDLV G APISQ+P
Subjt:  EIVFKRP-GKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDIPMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAP

Query:  YRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYI
        YRMAPME+RELK QLQELVD GFIRPSASPWGA +LFVKKKD TLRLCIDYRQLNKVTI NKYPLPRIDDLFD+ HGA VFSKIDLRSGYHQLKVKES I
Subjt:  YRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYI

Query:  LKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSA
        LKTAFR RYGHYEFLVIPF LTNAP AFMDLMNRVFH YLD                KEKH EHLRIVL+ L DRELY KFSKCDFWLD+VVFLGHVVS 
Subjt:  LKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSA

Query:  EGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        EGICVD QK E VDKW++PTS+TEI+SFLGLAGYYR
Subjt:  EGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

KAA0050527.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]2.2e-14066.76Show/hide
Query:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI
        V ++G +L VDL+ L + + D+ILGM+FL  HYAS+ CH+KE+VF++PG  E++FRG RK +S  +I  LKA KLLRKGC  +LA+++  Q  KLK ED+
Subjt:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI

Query:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR
        P+V+EF DVF  +LSGLP D+EIEF+I+L+ G APISQAPYRMAP EL+ELK+QLQELVD G+IRPS SPWGA +LFVKKKD TLRLCIDYRQLNKVTIR
Subjt:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR

Query:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK
        NKYPLPRIDDLFD+L GA +FSKIDLRSGYHQLKV+ES I KTAFR RYGHYEF V+PF LTNAP  FMDLMNR+FH YLD                +E 
Subjt:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK

Query:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        H EHLRIVL+ L +++LY KFSKC+FWL+QVVFLGHVVSA+G+ VD QK+EAV  WE+P S TE++SFLGLAGYYR
Subjt:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

KAA0050832.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.7e-14066.49Show/hide
Query:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI
        V ++G +L VDL+ L + + D+ILGM+FL  HYAS+ CH+KE+VF++PG  E++FRG RK++S  +I  LKA KLLRKGC  +LA+++  Q  KLK ED+
Subjt:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI

Query:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR
        P+V+EF DVF  +LSGLP D+EIEF+I+L+ G APISQAPYRMAP EL+ELK+QLQELV+ G+IRPS SPWGA +LFVKKKD TL+LCIDYRQLNKVTIR
Subjt:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR

Query:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK
        NKYPLPRIDDLFD+L GA +FSKIDLRSGYHQLKV+ES I KTAFR RYGHYEF V+PF LTNAPV FMDLMNR+FH YLD                +E 
Subjt:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK

Query:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        H EHLRIVL+ L +++LY KFSKC+FWL+QVVFLGHVVS +G+ VD QK+EAV  WE+PTS TE++SFLGLAGYYR
Subjt:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]2.2e-14066.76Show/hide
Query:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI
        V ++G +L VDL+ L + + D+ILGM+FL  HYAS+ CH+KE+VF++PG  E++FRG RK +S  +I  LKA KLLRKGC  +LA+++  Q  KLK ED+
Subjt:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI

Query:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR
        P+V+EF DVF  +LSGLP D+EIEF+I+L+ G APISQAPYRMAP EL+ELK+QLQELVD G+IRPS SPWGA +LFVKKKD TLRLCIDYRQLNKVTIR
Subjt:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR

Query:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK
        NKYPLPRIDDLFD+L GA +FSKIDLRSGYHQLKV+ES I KTAFR RYGHYEF V+PF LTNAP  FMDLMNR+FH YLD                +E 
Subjt:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK

Query:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        H EHLRIVL+ L +++LY KFSKC+FWL+QVVFLGHVVSA+G+ VD QK+EAV  WE+P S TE++SFLGLAGYYR
Subjt:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

TYK18080.1 hypothetical protein E5676_scaffold306G004160 [Cucumis melo var. makuwa]2.1e-14379.17Show/hide
Query:  EIVFKRP-GKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDIPMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAP
        E++   P GK+EI FR DRKIL T VI ALKA+KLLRKGC  YLA V+DTQ SKLKLEDIP+VREFPDVFL+ELSGLP D+EIEFSIDLV G APISQ+P
Subjt:  EIVFKRP-GKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDIPMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAP

Query:  YRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYI
        YRMAPME+RELK QLQELVD GFIRPSASPWGA +LFVKKKD TLRLCIDYRQLNKVTI NKYPLPRIDDLFD+ HGA VFSKIDLRSGYHQLKVKES I
Subjt:  YRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYI

Query:  LKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSA
        LKTAFR RYGHYEFLVIPF LTNAP AFMDLMNRVFH YLD                KEKH EHLRIVL+ L DRELY KFSKCDFWLD+VVFLGHVVS 
Subjt:  LKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSA

Query:  EGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        EGICVD QK E VDKW++PTS+TEI+SFLGLAGYYR
Subjt:  EGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

TrEMBL top hitse value%identityAlignment
A0A5A7TKQ0 Uncharacterized protein1.0e-14379.17Show/hide
Query:  EIVFKRP-GKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDIPMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAP
        E++   P GK+EI FR DRKIL T VI ALKA+KLLRKGC  YLA V+DTQ SKLKLEDIP+VREFPDVFL+ELSGLP D+EIEFSIDLV G APISQ+P
Subjt:  EIVFKRP-GKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDIPMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAP

Query:  YRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYI
        YRMAPME+RELK QLQELVD GFIRPSASPWGA +LFVKKKD TLRLCIDYRQLNKVTI NKYPLPRIDDLFD+ HGA VFSKIDLRSGYHQLKVKES I
Subjt:  YRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYI

Query:  LKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSA
        LKTAFR RYGHYEFLVIPF LTNAP AFMDLMNRVFH YLD                KEKH EHLRIVL+ L DRELY KFSKCDFWLD+VVFLGHVVS 
Subjt:  LKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSA

Query:  EGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        EGICVD QK E VDKW++PTS+TEI+SFLGLAGYYR
Subjt:  EGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

A0A5A7U2V7 Reverse transcriptase1.1e-14066.76Show/hide
Query:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI
        V ++G +L VDL+ L + + D+ILGM+FL  HYAS+ CH+KE+VF++PG  E++FRG RK +S  +I  LKA KLLRKGC  +LA+++  Q  KLK ED+
Subjt:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI

Query:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR
        P+V+EF DVF  +LSGLP D+EIEF+I+L+ G APISQAPYRMAP EL+ELK+QLQELVD G+IRPS SPWGA +LFVKKKD TLRLCIDYRQLNKVTIR
Subjt:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR

Query:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK
        NKYPLPRIDDLFD+L GA +FSKIDLRSGYHQLKV+ES I KTAFR RYGHYEF V+PF LTNAP  FMDLMNR+FH YLD                +E 
Subjt:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK

Query:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        H EHLRIVL+ L +++LY KFSKC+FWL+QVVFLGHVVSA+G+ VD QK+EAV  WE+P S TE++SFLGLAGYYR
Subjt:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

A0A5A7U4R7 Reverse transcriptase8.2e-14166.49Show/hide
Query:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI
        V ++G +L VDL+ L + + D+ILGM+FL  HYAS+ CH+KE+VF++PG  E++FRG RK++S  +I  LKA KLLRKGC  +LA+++  Q  KLK ED+
Subjt:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI

Query:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR
        P+V+EF DVF  +LSGLP D+EIEF+I+L+ G APISQAPYRMAP EL+ELK+QLQELV+ G+IRPS SPWGA +LFVKKKD TL+LCIDYRQLNKVTIR
Subjt:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR

Query:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK
        NKYPLPRIDDLFD+L GA +FSKIDLRSGYHQLKV+ES I KTAFR RYGHYEF V+PF LTNAPV FMDLMNR+FH YLD                +E 
Subjt:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK

Query:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        H EHLRIVL+ L +++LY KFSKC+FWL+QVVFLGHVVS +G+ VD QK+EAV  WE+PTS TE++SFLGLAGYYR
Subjt:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

A0A5D3BS67 Reverse transcriptase1.1e-14066.76Show/hide
Query:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI
        V ++G +L VDL+ L + + D+ILGM+FL  HYAS+ CH+KE+VF++PG  E++FRG RK +S  +I  LKA KLLRKGC  +LA+++  Q  KLK ED+
Subjt:  VHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDI

Query:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR
        P+V+EF DVF  +LSGLP D+EIEF+I+L+ G APISQAPYRMAP EL+ELK+QLQELVD G+IRPS SPWGA +LFVKKKD TLRLCIDYRQLNKVTIR
Subjt:  PMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIR

Query:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK
        NKYPLPRIDDLFD+L GA +FSKIDLRSGYHQLKV+ES I KTAFR RYGHYEF V+PF LTNAP  FMDLMNR+FH YLD                +E 
Subjt:  NKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEK

Query:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        H EHLRIVL+ L +++LY KFSKC+FWL+QVVFLGHVVSA+G+ VD QK+EAV  WE+P S TE++SFLGLAGYYR
Subjt:  HVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

A0A5D3D2Y2 Uncharacterized protein1.0e-14379.17Show/hide
Query:  EIVFKRP-GKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDIPMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAP
        E++   P GK+EI FR DRKIL T VI ALKA+KLLRKGC  YLA V+DTQ SKLKLEDIP+VREFPDVFL+ELSGLP D+EIEFSIDLV G APISQ+P
Subjt:  EIVFKRP-GKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDIPMVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAP

Query:  YRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYI
        YRMAPME+RELK QLQELVD GFIRPSASPWGA +LFVKKKD TLRLCIDYRQLNKVTI NKYPLPRIDDLFD+ HGA VFSKIDLRSGYHQLKVKES I
Subjt:  YRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYI

Query:  LKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSA
        LKTAFR RYGHYEFLVIPF LTNAP AFMDLMNRVFH YLD                KEKH EHLRIVL+ L DRELY KFSKCDFWLD+VVFLGHVVS 
Subjt:  LKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDY---------------KEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSA

Query:  EGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        EGICVD QK E VDKW++PTS+TEI+SFLGLAGYYR
Subjt:  EGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.4e-4134.29Show/hide
Query:  MVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFV-KKKDAT----LRLCIDYRQLNK
        +++++ D+   E   L F  + + +I+    +   S+  Y  A  +  E++ Q+Q++++ G IR S SP+ + +  V KK+DA+     R+ IDYR+LN+
Subjt:  MVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFV-KKKDAT----LRLCIDYRQLNK

Query:  VTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHP--------YLD-------
        +T+ +++P+P +D++  KL     F+ IDL  G+HQ+++    + KTAF  ++GHYE+L +PF L NAP  F   MN +  P        YLD       
Subjt:  VTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHP--------YLD-------

Query:  YKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
          ++H++ L +V + L    L ++  KC+F   +  FLGHV++ +GI  + +KIEA+ K+  PT   EI++FLGL GYYR
Subjt:  YKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

P20825 Retrovirus-related Pol polyprotein from transposon 2977.1e-4135.71Show/hide
Query:  MVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFV-KKKDAT----LRLCIDYRQLNK
        ++ +F ++  KE   L F   I+  ++     +PI    Y +A     E++ Q+QE+++ G IR S SP+ +    V KK DA+     R+ IDYR+LN+
Subjt:  MVREFPDVFLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVD-GFIRPSASPWGASMLFV-KKKDAT----LRLCIDYRQLNK

Query:  VTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHP--------YLD-------
        +TI ++YP+P +D++  KL     F+ IDL  G+HQ+++ E  I KTAF  + GHYE+L +PF L NAP  F   MN +  P        YLD       
Subjt:  VTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHP--------YLD-------

Query:  YKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
           +H+  +++V   L D  L ++  KC+F   +  FLGH+V+ +GI  +  K++A+  +  PT   EI++FLGL GYYR
Subjt:  YKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein5.1e-3938.72Show/hide
Query:  PYRMAPMELRELKLQLQELVDG-FIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESY
        PY +     +E+   +Q+L+D  FI PS SP  + ++ V KKD T RLC+DYR LNK TI + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++   
Subjt:  PYRMAPMELRELKLQLQELVDG-FIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESY

Query:  ILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVF------HPYLD-------YKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAE
          KTAF    G YE+ V+PF L NAP  F   M   F      + YLD         E+H +HL  VL+ L +  L VK  KC F  ++  FLG+ +  +
Subjt:  ILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVF------HPYLD-------YKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAE

Query:  GICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
         I     K  A+  +  P ++ + Q FLG+  YYR
Subjt:  GICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.7e-3935.54Show/hide
Query:  MVREFPDVFLKELSGLPFDKEIEFSI-----DLVFGIAPISQAPYRMAPMELR-ELKLQLQELV-DGFIRPSASPWGASMLFVKKK-----DATLRLCID
        ++ EFP +F   LSG+  +  ++  I     D ++        PY   P+ +R E++ Q+ EL+ DG IRPS SP+ + +  V KK     +   R+ +D
Subjt:  MVREFPDVFLKELSGLPFDKEIEFSI-----DLVFGIAPISQAPYRMAPMELR-ELKLQLQELV-DGFIRPSASPWGASMLFVKKK-----DATLRLCID

Query:  YRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYL----------
        +++LN VTI + YP+P I+     L  A  F+ +DL SG+HQ+ +KES I KTAF    G YEFL +PF L NAP  F  +++ +   ++          
Subjt:  YRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYL----------

Query:  ------DYKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
              DY + H ++LR+VL  L    L V   K  F   QV FLG++V+A+GI  D +K+ A+ +   PTS+ E++ FLG+  YYR
Subjt:  ------DYKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.1e-3938.72Show/hide
Query:  PYRMAPMELRELKLQLQELVDG-FIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESY
        PY +     +E+   +Q+L+D  FI PS SP  + ++ V KKD T RLC+DYR LNK TI + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++   
Subjt:  PYRMAPMELRELKLQLQELVDG-FIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFVFSKIDLRSGYHQLKVKESY

Query:  ILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVF------HPYLD-------YKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAE
          KTAF    G YE+ V+PF L NAP  F   M   F      + YLD         E+H +HL  VL+ L +  L VK  KC F  ++  FLG+ +  +
Subjt:  ILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVF------HPYLD-------YKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAE

Query:  GICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
         I     K  A+  +  P ++ + Q FLG+  YYR
Subjt:  GICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.9e-1241.56Show/hide
Query:  VEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLG--HVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR
        + HL +VL+I    + Y    KC F   Q+ +LG  H++S EG+  D  K+EA+  W +P + TE++ FLGL GYYR
Subjt:  VEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLG--HVVSAEGICVDSQKIEAVDKWEKPTSITEIQSFLGLAGYYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCATATTGATGGTGAAACTTTAGAAGTGGATTTGATTTCGTTAAACATTCATAAGTTTGATATTATCTTGGGAATGAATTTTTTATCTAATCACTATGCCTCCCT
AAAATGTCATAAAAAAGAGATTGTTTTTAAACGTCCAGGGAAGAACGAAATCATTTTCCGCGGTGATAGAAAAATACTTTCCACCTATGTGATCTTTGCTTTAAAGGCCA
CTAAATTGTTAAGGAAGGGTTGTATGACTTACCTAGCATATGTAATGGATACACAGGCTAGTAAGCTGAAGCTTGAAGATATACCAATGGTAAGAGAATTCCCAGATGTT
TTTCTGAAAGAGTTATCGGGACTACCATTTGATAAAGAGATTGAATTCTCAATTGATTTAGTGTTTGGGATTGCACCCATTTCACAAGCTCCCTATAGAATGGCACCGAT
GGAACTAAGAGAACTAAAATTACAGTTGCAGGAATTGGTGGATGGCTTTATCCGACCTAGTGCATCTCCATGGGGTGCATCAATGTTGTTTGTTAAAAAGAAGGATGCTA
CATTGAGGTTATGTATTGACTATCGGCAATTAAATAAGGTAACAATACGTAACAAATATCCTTTGCCCCGAATAGATGATTTGTTTGATAAACTTCATGGTGCCTTTGTA
TTCTCTAAGATTGACTTGAGATCAGGTTACCATCAGTTGAAAGTTAAAGAGTCATATATCCTGAAAACAGCTTTTCGAATGAGATATGGGCATTATGAATTCTTGGTGAT
CCCTTTTAGATTGACTAATGCCCCTGTAGCCTTCATGGACTTGATGAATAGGGTCTTTCATCCGTACTTAGATTATAAAGAAAAGCATGTTGAACACCTCAGAATAGTGT
TAAAGATACTGTGGGATCGAGAGTTGTATGTTAAATTCAGTAAGTGCGATTTTTGGCTAGATCAGGTAGTATTTTTGGGTCATGTGGTTTCAGCGGAAGGCATTTGTGTC
GATTCTCAAAAAATTGAAGCAGTAGATAAGTGGGAAAAACCTACCTCTATCACTGAGATACAAAGTTTTCTTGGTTTAGCGGGGTATTATCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCATATTGATGGTGAAACTTTAGAAGTGGATTTGATTTCGTTAAACATTCATAAGTTTGATATTATCTTGGGAATGAATTTTTTATCTAATCACTATGCCTCCCT
AAAATGTCATAAAAAAGAGATTGTTTTTAAACGTCCAGGGAAGAACGAAATCATTTTCCGCGGTGATAGAAAAATACTTTCCACCTATGTGATCTTTGCTTTAAAGGCCA
CTAAATTGTTAAGGAAGGGTTGTATGACTTACCTAGCATATGTAATGGATACACAGGCTAGTAAGCTGAAGCTTGAAGATATACCAATGGTAAGAGAATTCCCAGATGTT
TTTCTGAAAGAGTTATCGGGACTACCATTTGATAAAGAGATTGAATTCTCAATTGATTTAGTGTTTGGGATTGCACCCATTTCACAAGCTCCCTATAGAATGGCACCGAT
GGAACTAAGAGAACTAAAATTACAGTTGCAGGAATTGGTGGATGGCTTTATCCGACCTAGTGCATCTCCATGGGGTGCATCAATGTTGTTTGTTAAAAAGAAGGATGCTA
CATTGAGGTTATGTATTGACTATCGGCAATTAAATAAGGTAACAATACGTAACAAATATCCTTTGCCCCGAATAGATGATTTGTTTGATAAACTTCATGGTGCCTTTGTA
TTCTCTAAGATTGACTTGAGATCAGGTTACCATCAGTTGAAAGTTAAAGAGTCATATATCCTGAAAACAGCTTTTCGAATGAGATATGGGCATTATGAATTCTTGGTGAT
CCCTTTTAGATTGACTAATGCCCCTGTAGCCTTCATGGACTTGATGAATAGGGTCTTTCATCCGTACTTAGATTATAAAGAAAAGCATGTTGAACACCTCAGAATAGTGT
TAAAGATACTGTGGGATCGAGAGTTGTATGTTAAATTCAGTAAGTGCGATTTTTGGCTAGATCAGGTAGTATTTTTGGGTCATGTGGTTTCAGCGGAAGGCATTTGTGTC
GATTCTCAAAAAATTGAAGCAGTAGATAAGTGGGAAAAACCTACCTCTATCACTGAGATACAAAGTTTTCTTGGTTTAGCGGGGTATTATCGATGA
Protein sequenceShow/hide protein sequence
MVHIDGETLEVDLISLNIHKFDIILGMNFLSNHYASLKCHKKEIVFKRPGKNEIIFRGDRKILSTYVIFALKATKLLRKGCMTYLAYVMDTQASKLKLEDIPMVREFPDV
FLKELSGLPFDKEIEFSIDLVFGIAPISQAPYRMAPMELRELKLQLQELVDGFIRPSASPWGASMLFVKKKDATLRLCIDYRQLNKVTIRNKYPLPRIDDLFDKLHGAFV
FSKIDLRSGYHQLKVKESYILKTAFRMRYGHYEFLVIPFRLTNAPVAFMDLMNRVFHPYLDYKEKHVEHLRIVLKILWDRELYVKFSKCDFWLDQVVFLGHVVSAEGICV
DSQKIEAVDKWEKPTSITEIQSFLGLAGYYR