; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0064171 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0064171
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr03:4725208..4726269
RNA-Seq ExpressionCmc03g0064171
SyntenyCmc03g0064171
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035157.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.1e-12366.96Show/hide
Query:  ETVNTTCHI-----HNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGTVMETINVVVND
        +++  TC+      H RVT R+GTTV LY+LWK RKPNVKYFH+FGST YILAD+EY +KWD +S+QGIFLGYSQN+ AY+ +N +S +VMETINVV+ND
Subjt:  ETVNTTCHI-----HNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGTVMETINVVVND

Query:  FESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKVDYSKMIVDLCYV
         +S++ Q N E+DET  +S V  T   E  K D+  DS   +L+K  +E +N ++ L+PS H+KKNHP SSIIGDPSAG+ TRRK+K+DY KM+ DLCY+
Subjt:  FESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKVDYSKMIVDLCYV

Query:  SAIEPTSIE-NALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLL
        S IE ++++ +ALKDEY++NAMQEELLQF+ NNVW LV KP GVNVIGT WIFKNKTDE G VTKNKARLVAQ Y QVEGVDFDETFA VARLEAIRLLL
Subjt:  SAIEPTSIE-NALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLL

Query:  NISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP
         IS  +KFKLYQMDVKS FLNGYLNEEVYVAQPKGFVD E P
Subjt:  NISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP

KAA0035996.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.3e-13168.21Show/hide
Query:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG
        MAR+IIH K+LPL+FWAE +N  CHIH R+TTRSG+ V LY+LWKGRKPNVKYFHIF  T YILAD+EYHRKWD KS+QG+FLG SQNSRAYK FN ++ 
Subjt:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG

Query:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV
        TVMETIN+VVND E    + + E+DET  ++    +   ++ K D +  ++    +  + E + + T+ +PS+H+ KNHP SSIIGDPSAGITTR+K+K+
Subjt:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV

Query:  DYSKMIVDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAH
        DYSKMI+DLCY SAIEPTS+E ALKDEY+INAMQEEL+QFK NNVWTLV KP+GVN+IGT W+FKNKTDESG VT+NKA LVAQ YAQ+EGVDFDE F  
Subjt:  DYSKMIVDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAH

Query:  VARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKG
        V RLEAIRLLL IS   KFKLYQMDVKS FLNGYLNEEVYVAQPKG
Subjt:  VARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKG

KAA0051798.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.1e-15071.18Show/hide
Query:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG
        MARV+IH KNLPLNFWAE VNT CHIHNRVTTRSG TV LY+L KGRKPNVKYFHIFGST YILAD+EYHRKWD KS QGIFLGYSQNSRAY+ FNIKSG
Subjt:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG

Query:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV
        TVMETINVVVNDFESNVNQFNIEDDET V   VT+TPL EMPK +SQ  S KT+   ITDE +N+ET+LVPS H+KKNHP SSII DPSAGITTRRKE V
Subjt:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV

Query:  -----------------------DYSK-------MIV----------------DLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDG
                                YS+       +++                DLCYVSAIEPTS+EN+LKDEY+I  MQEE LQFK NNVWTLV KPDG
Subjt:  -----------------------DYSK-------MIV----------------DLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDG

Query:  VNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP
         N+IGT WIFKNKTDESG++ +NKARLVAQ Y QVEGVD DETFA VARLEAIRLLL+IS F+KFKL+QMDVKSAFLNGYLNEEV VA+PKGF+DSEFP
Subjt:  VNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP

KAA0053320.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.5e-12269.32Show/hide
Query:  ARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGT
        AR +IH K LPLNFWAE VNT CHIH                                T YILAD+EYHRKWDVKSD+GIFLGYSQNSRAY+ FNIK GT
Subjt:  ARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGT

Query:  VMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKVD
        VMETINVVVNDFESNVNQFNIEDDETSV+ +V AT L++MPKDDSQPDSTKTNLEKITDE +NDETVLVPS H+KKNHP S IIGDPSAGITTR KEKVD
Subjt:  VMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKVD

Query:  YSKMIVDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHV
        Y+KMI DLCYVSAIEPTS+E AL+DEY+IN                                  NKTDESGNVTKNKARLVAQ YAQVEGVDFDE FA V
Subjt:  YSKMIVDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHV

Query:  ARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP
        A LEAI LLL+I  FRKFKLYQMDVKSAFLN YLNEEVYVAQPK FVDSEFP
Subjt:  ARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP

TYK21443.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.5e-15375.81Show/hide
Query:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG
        MARV+IH KNLPLNFWAE VNT CHIHNRVTTRSG TV LY+L KGRKPNVKYFHIFGST YILAD+EYHRKWD KS QGIFLGYSQNSRAY+ FNIKSG
Subjt:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG

Query:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV
        TVMETINVVVNDFESNVNQFNIEDDET V   VT+TPL EMPK +SQ  S KT+   ITDE +N+ET+LVPS H+KKNHP SSII DPSAGITTRRKE +
Subjt:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV

Query:  DYSKMI-------------------VDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARL
        + +  I                    DLCYVSAIEPTS+EN+LKDEY+I  MQEE LQFK NNVWTLV KPDG N+IGT WIFKNKTDESG++ +NKARL
Subjt:  DYSKMI-------------------VDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARL

Query:  VAQSYAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP
        VAQ Y QVEGVD DETFA VARLEAIRLLL+IS F+KFKL+QMDVKSAFLNGYLNEEV VA+PKGF+DSEFP
Subjt:  VAQSYAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP

TrEMBL top hitse value%identityAlignment
A0A5A7T0Q0 Gag-pol polyprotein1.5e-12366.96Show/hide
Query:  ETVNTTCHI-----HNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGTVMETINVVVND
        +++  TC+      H RVT R+GTTV LY+LWK RKPNVKYFH+FGST YILAD+EY +KWD +S+QGIFLGYSQN+ AY+ +N +S +VMETINVV+ND
Subjt:  ETVNTTCHI-----HNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGTVMETINVVVND

Query:  FESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKVDYSKMIVDLCYV
         +S++ Q N E+DET  +S V  T   E  K D+  DS   +L+K  +E +N ++ L+PS H+KKNHP SSIIGDPSAG+ TRRK+K+DY KM+ DLCY+
Subjt:  FESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKVDYSKMIVDLCYV

Query:  SAIEPTSIE-NALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLL
        S IE ++++ +ALKDEY++NAMQEELLQF+ NNVW LV KP GVNVIGT WIFKNKTDE G VTKNKARLVAQ Y QVEGVDFDETFA VARLEAIRLLL
Subjt:  SAIEPTSIE-NALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLL

Query:  NISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP
         IS  +KFKLYQMDVKS FLNGYLNEEVYVAQPKGFVD E P
Subjt:  NISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP

A0A5A7T197 Gag-pol polyprotein2.6e-13168.21Show/hide
Query:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG
        MAR+IIH K+LPL+FWAE +N  CHIH R+TTRSG+ V LY+LWKGRKPNVKYFHIF  T YILAD+EYHRKWD KS+QG+FLG SQNSRAYK FN ++ 
Subjt:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG

Query:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV
        TVMETIN+VVND E    + + E+DET  ++    +   ++ K D +  ++    +  + E + + T+ +PS+H+ KNHP SSIIGDPSAGITTR+K+K+
Subjt:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV

Query:  DYSKMIVDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAH
        DYSKMI+DLCY SAIEPTS+E ALKDEY+INAMQEEL+QFK NNVWTLV KP+GVN+IGT W+FKNKTDESG VT+NKA LVAQ YAQ+EGVDFDE F  
Subjt:  DYSKMIVDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAH

Query:  VARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKG
        V RLEAIRLLL IS   KFKLYQMDVKS FLNGYLNEEVYVAQPKG
Subjt:  VARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKG

A0A5A7U931 Gag-pol polyprotein2.5e-15071.18Show/hide
Query:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG
        MARV+IH KNLPLNFWAE VNT CHIHNRVTTRSG TV LY+L KGRKPNVKYFHIFGST YILAD+EYHRKWD KS QGIFLGYSQNSRAY+ FNIKSG
Subjt:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG

Query:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV
        TVMETINVVVNDFESNVNQFNIEDDET V   VT+TPL EMPK +SQ  S KT+   ITDE +N+ET+LVPS H+KKNHP SSII DPSAGITTRRKE V
Subjt:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV

Query:  -----------------------DYSK-------MIV----------------DLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDG
                                YS+       +++                DLCYVSAIEPTS+EN+LKDEY+I  MQEE LQFK NNVWTLV KPDG
Subjt:  -----------------------DYSK-------MIV----------------DLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDG

Query:  VNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP
         N+IGT WIFKNKTDESG++ +NKARLVAQ Y QVEGVD DETFA VARLEAIRLLL+IS F+KFKL+QMDVKSAFLNGYLNEEV VA+PKGF+DSEFP
Subjt:  VNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP

A0A5A7UDQ0 Gag-pol polyprotein1.7e-12269.32Show/hide
Query:  ARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGT
        AR +IH K LPLNFWAE VNT CHIH                                T YILAD+EYHRKWDVKSD+GIFLGYSQNSRAY+ FNIK GT
Subjt:  ARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGT

Query:  VMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKVD
        VMETINVVVNDFESNVNQFNIEDDETSV+ +V AT L++MPKDDSQPDSTKTNLEKITDE +NDETVLVPS H+KKNHP S IIGDPSAGITTR KEKVD
Subjt:  VMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKVD

Query:  YSKMIVDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHV
        Y+KMI DLCYVSAIEPTS+E AL+DEY+IN                                  NKTDESGNVTKNKARLVAQ YAQVEGVDFDE FA V
Subjt:  YSKMIVDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHV

Query:  ARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP
        A LEAI LLL+I  FRKFKLYQMDVKSAFLN YLNEEVYVAQPK FVDSEFP
Subjt:  ARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP

A0A5D3DCZ8 Gag-pol polyprotein3.1e-15375.81Show/hide
Query:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG
        MARV+IH KNLPLNFWAE VNT CHIHNRVTTRSG TV LY+L KGRKPNVKYFHIFGST YILAD+EYHRKWD KS QGIFLGYSQNSRAY+ FNIKSG
Subjt:  MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSG

Query:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV
        TVMETINVVVNDFESNVNQFNIEDDET V   VT+TPL EMPK +SQ  S KT+   ITDE +N+ET+LVPS H+KKNHP SSII DPSAGITTRRKE +
Subjt:  TVMETINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKV

Query:  DYSKMI-------------------VDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARL
        + +  I                    DLCYVSAIEPTS+EN+LKDEY+I  MQEE LQFK NNVWTLV KPDG N+IGT WIFKNKTDESG++ +NKARL
Subjt:  DYSKMI-------------------VDLCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARL

Query:  VAQSYAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP
        VAQ Y QVEGVD DETFA VARLEAIRLLL+IS F+KFKL+QMDVKSAFLNGYLNEEV VA+PKGF+DSEFP
Subjt:  VAQSYAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-2243.48Show/hide
Query:  AMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFL
        A+  EL   K NN WT+  +P+  N++ + W+F  K +E GN  + KARLVA+ + Q   +D++ETFA VAR+ + R +L++      K++QMDVK+AFL
Subjt:  AMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFL

Query:  NGYLNEEVYVAQPKG
        NG L EE+Y+  P+G
Subjt:  NGYLNEEVYVAQPKG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.5e-3027.35Show/hide
Query:  RVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGTV
        R ++ +  LP +FW E V T C++ NR  +          +W  ++ +  +  +FG   +    KE   K D KS   IF+GY      Y+ ++     V
Subjt:  RVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGTV

Query:  METINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITD------------ETLND--ETVLVPSTHMKKNHPPSSIIGDP
        + + +VV  + E        E  +  +I N        +P   + P S ++  +++++            E L++  E V  P T  ++ H P      P
Subjt:  METINVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITD------------ETLND--ETVLVPSTHMKKNHPPSSIIGDP

Query:  SAGITTRRKEKVDYSKMIVDLCYVSAIEPTSIENAL---KDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQS
           + +RR    +Y  +  D       EP S++  L   +    + AMQEE+   + N  + LV  P G   +   W+FK K D    + + KARLV + 
Subjt:  SAGITTRRKEKVDYSKMIVDLCYVSAIEPTSIENAL---KDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQS

Query:  YAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGF
        + Q +G+DFDE F+ V ++ +IR +L+++     ++ Q+DVK+AFL+G L EE+Y+ QP+GF
Subjt:  YAQVEGVDFDETFAHVARLEAIRLLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGF

P92520 Uncharacterized mitochondrial protein AtMg008202.2e-1543.43Show/hide
Query:  EPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNIS
        EP S+  ALKD  +  AMQEEL     N  W LV  P   N++G  W+FK K    G + + KARLVA+ + Q EG+ F ET++ V R   IR +LN++
Subjt:  EPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-2230.58Show/hide
Query:  NVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLV---PSTHMKKNHPPSSIIGDPSAGITTRRKEKVDYSKMIVDLCYV
        N +Q N  ++  S ++   +TP +      S P  T +     T  T    ++L+   P      N+   + +   S G   +        K  + +   
Subjt:  NVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLV---PSTHMKKNHPPSSIIGDPSAGITTRRKEKVDYSKMIVDLCYV

Query:  SAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDG-VNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLL
        +  EP +   ALKDE + NAM  E+     N+ W LV  P   V ++G  WIF  K +  G++ + KARLVA+ Y Q  G+D+ ETF+ V +  +IR++L
Subjt:  SAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDG-VNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLL

Query:  NISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP
         ++  R + + Q+DV +AFL G L ++VY++QP GF+D + P
Subjt:  NISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.5e-2340.29Show/hide
Query:  EPTSIENALKDEYYINAMQEELLQFKCNNVWTLV-SKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNIS
        EP +   A+KD+ +  AM  E+     N+ W LV   P  V ++G  WIF  K +  G++ + KARLVA+ Y Q  G+D+ ETF+ V +  +IR++L ++
Subjt:  EPTSIENALKDEYYINAMQEELLQFKCNNVWTLV-SKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNIS

Query:  YFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP
          R + + Q+DV +AFL G L +EVY++QP GFVD + P
Subjt:  YFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.7e-2437.41Show/hide
Query:  LCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIR
        +C   A EP++   A +   +  AM +E+   +  + W + + P     IG  W++K K +  G + + KARLVA+ Y Q EG+DF ETF+ V +L +++
Subjt:  LCYVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIR

Query:  LLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGF
        L+L IS    F L+Q+D+ +AFLNG L+EE+Y+  P G+
Subjt:  LLLNISYFRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.6e-1643.43Show/hide
Query:  EPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNIS
        EP S+  ALKD  +  AMQEEL     N  W LV  P   N++G  W+FK K    G + + KARLVA+ + Q EG+ F ET++ V R   IR +LN++
Subjt:  EPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCGAGTTATAATACATGTCAAAAACTTACCTTTGAATTTTTGGGCTGAAACTGTAAACACAACATGTCATATTCACAACAGGGTCACTACTCGATCTGGT
ACGACAGTCGCATTATATGACCTTTGGAAAGGAAGGAAGCCAAATGTTAAGTATTTTCATATTTTTGGAAGTACCTATTACATTCTAGCCGATAAAGAGTATCAT
CGGAAGTGGGATGTGAAATCTGATCAAGGAATTTTTCTTGGATACTCTCAGAATAGTCGAGCGTACAAAGCCTTTAATATTAAATCAGGAACAGTTATGGAAACA
ATCAATGTTGTGGTAAATGATTTTGAATCTAATGTCAACCAGTTTAATATTGAGGATGATGAGACTTCTGTTATATCTAATGTTACTGCTACCCCTCTTAAGGAA
ATGCCTAAAGATGATTCTCAGCCAGACAGTACGAAGACAAATTTAGAAAAAATAACTGATGAGACCCTGAATGATGAAACTGTACTTGTTCCCTCTACACATATG
AAAAAGAATCATCCCCCAAGTTCCATAATAGGTGATCCATCAGCTGGAATCACTACCAGAAGGAAAGAGAAAGTTGATTACTCGAAAATGATTGTTGATTTATGT
TATGTGTCAGCAATAGAACCCACATCTATTGAAAATGCTCTCAAGGATGAATACTATATAAACGCTATGCAAGAAGAATTACTACAATTTAAGTGTAACAATGTA
TGGACTTTGGTTTCCAAGCCTGACGGGGTGAATGTTATAGGAACTACGTGGATTTTTAAAAATAAAACTGATGAATCAGGCAATGTAACAAAGAACAAAGCTCGT
TTGGTGGCTCAAAGTTATGCTCAGGTAGAAGGTGTTGATTTTGATGAAACCTTTGCACATGTGGCTAGACTTGAAGCTATTCGCCTTCTGCTCAATATATCCTAT
TTTCGAAAATTTAAATTGTATCAAATGGACGTTAAAAGTGCTTTCTTGAATGGATACTTGAATGAAGAAGTTTATGTAGCACAACCTAAAGGGTTTGTTGATTCA
GAATTTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTCGAGTTATAATACATGTCAAAAACTTACCTTTGAATTTTTGGGCTGAAACTGTAAACACAACATGTCATATTCACAACAGGGTCACTACTCGATCTGGT
ACGACAGTCGCATTATATGACCTTTGGAAAGGAAGGAAGCCAAATGTTAAGTATTTTCATATTTTTGGAAGTACCTATTACATTCTAGCCGATAAAGAGTATCAT
CGGAAGTGGGATGTGAAATCTGATCAAGGAATTTTTCTTGGATACTCTCAGAATAGTCGAGCGTACAAAGCCTTTAATATTAAATCAGGAACAGTTATGGAAACA
ATCAATGTTGTGGTAAATGATTTTGAATCTAATGTCAACCAGTTTAATATTGAGGATGATGAGACTTCTGTTATATCTAATGTTACTGCTACCCCTCTTAAGGAA
ATGCCTAAAGATGATTCTCAGCCAGACAGTACGAAGACAAATTTAGAAAAAATAACTGATGAGACCCTGAATGATGAAACTGTACTTGTTCCCTCTACACATATG
AAAAAGAATCATCCCCCAAGTTCCATAATAGGTGATCCATCAGCTGGAATCACTACCAGAAGGAAAGAGAAAGTTGATTACTCGAAAATGATTGTTGATTTATGT
TATGTGTCAGCAATAGAACCCACATCTATTGAAAATGCTCTCAAGGATGAATACTATATAAACGCTATGCAAGAAGAATTACTACAATTTAAGTGTAACAATGTA
TGGACTTTGGTTTCCAAGCCTGACGGGGTGAATGTTATAGGAACTACGTGGATTTTTAAAAATAAAACTGATGAATCAGGCAATGTAACAAAGAACAAAGCTCGT
TTGGTGGCTCAAAGTTATGCTCAGGTAGAAGGTGTTGATTTTGATGAAACCTTTGCACATGTGGCTAGACTTGAAGCTATTCGCCTTCTGCTCAATATATCCTAT
TTTCGAAAATTTAAATTGTATCAAATGGACGTTAAAAGTGCTTTCTTGAATGGATACTTGAATGAAGAAGTTTATGTAGCACAACCTAAAGGGTTTGTTGATTCA
GAATTTCCTTAG
Protein sequenceShow/hide protein sequence
MARVIIHVKNLPLNFWAETVNTTCHIHNRVTTRSGTTVALYDLWKGRKPNVKYFHIFGSTYYILADKEYHRKWDVKSDQGIFLGYSQNSRAYKAFNIKSGTVMET
INVVVNDFESNVNQFNIEDDETSVISNVTATPLKEMPKDDSQPDSTKTNLEKITDETLNDETVLVPSTHMKKNHPPSSIIGDPSAGITTRRKEKVDYSKMIVDLC
YVSAIEPTSIENALKDEYYINAMQEELLQFKCNNVWTLVSKPDGVNVIGTTWIFKNKTDESGNVTKNKARLVAQSYAQVEGVDFDETFAHVARLEAIRLLLNISY
FRKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEFP