CuGenDBv2

Cmc12g0322451 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc12g0322451
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Ty3/gypsy retrotransposon protein
Genome location: CMiso1.1chr12:10283859..10284932
RNA-Seq Expression: Cmc12g0322451
Synteny: Cmc12g0322451
Gene Ontology terms:
GO:0005975 - carbohydrate metabolic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004601 - peroxidase activity (molecular function)
GO:0030246 - carbohydrate binding (molecular function)
GO:0047938 - glucose-6-phosphate 1-epimerase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
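
The genome location field packs a sequence identifier and a coordinate range into one string. The following is a minimal sketch of how such a string could be split and the span length computed. The function name `parse_location` and the regular expression are illustrative assumptions, not part of CuGenDB, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

# Assumed layout of the "Genome location" field: <sequence id>:<start>..<end>
LOCATION_PATTERN = re.compile(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)")


def parse_location(location):
    """Split a location string into (seqid, start, end, span length in bp)."""
    match = LOCATION_PATTERN.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    # Assumption: both coordinates are 1-based and inclusive.
    return match.group("seqid"), start, end, end - start + 1


seqid, start, end, span = parse_location("CMiso1.1chr12:10283859..10284932")
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption the span is 1,074 bp, which equals 3 x (357 + 1), the length expected for an uninterrupted coding sequence of a 357-residue protein plus a stop codon.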


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032313.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa] (e value: 1.5e-165; % identity: 86.14)
Query:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE
        E  PP+R+IEHHI L+E TDPINVRPYRYGFHQKAEMEKLVEEMLTSG+IRP KSPYSSPVLLVKKKDGSWRFCVDYRAVNNAT+PDKFPI VVEELFDE
Subjt:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE

Query:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE
        LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTH G+Y+FLVMPFGLTNAPATFQ LMNNVF+PYLRKFVLVFFDDIL+YSKNEEEH +HLG VLS LRE
Subjt:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE

Query:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA
        ++LYANKKKCSFAQ +IEYLGHIISG G+EVDPEKI SIADWPIP  V+E+ GFLGLT YYRRFVQHYGSIAAPLTQLLKKGGF WNKEAEEAF +LK A
Subjt:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA

Query:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT
        MISL VLALPNF++ FE++TDAS +GVGAVLT
Subjt:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT

KAA0040958.1 peroxidase 64 [Cucumis melo var. makuwa] (e value: 1.1e-205; % identity: 99.16)
Query:  MEISFVETNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCV
        MEISFVETNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCV
Subjt:  MEISFVETNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCV

Query:  DYRAVNNATVPDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFD
        DYRAVNNATVPDKFPISVVEELFDELNGASLFSK DLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFD
Subjt:  DYRAVNNATVPDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFD

Query:  DILVYSKNEEEHMNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPL
        DILVYSKNEEEHMNHLGK+LSTLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLT YYRRFVQHYGSIAAPL
Subjt:  DILVYSKNEEEHMNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPL

Query:  TQLLKKGGFMWNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIKTDASSVGVGAVLT
        TQLLKKGGFMWNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIKTDASSVGVGAVLT
Subjt:  TQLLKKGGFMWNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIKTDASSVGVGAVLT

KAA0061922.1 Transposon Tf2-9 polyprotein [Cucumis melo var. makuwa] (e value: 6.3e-164; % identity: 85.54)
Query:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE
        E QPPRR+IEH I+L+E TDPINVRPYRYGFHQKAEMEKLVEEML SGVIRP KSP+SSP+LLVKKKD SWRFCVDYRAVNN T+PDKFPI +VEELFDE
Subjt:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE

Query:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE
        LNGASLFSKIDLK GYHQIRM EKDIPKTAFRTHEGHY+FLVMPFGLTNAPATFQ LMN VF+PYLRKFVLVFFDDILVYSKNEE+H+ HLGKVLS+LRE
Subjt:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE

Query:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA
        H+LYANKKK SFAQ K+EYL HII GGGVEVDP+KIRSIADWP PTNVRE  GFLGLT YYRRFV+HYGSIAAPLTQLLKKGGF+WNKEA+EAFE+LK+A
Subjt:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA

Query:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT
        MISL VLALPNF++ FEI+TDAS VGVGA+LT
Subjt:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT

TYK24006.1 Transposon Tf2-9 polyprotein [Cucumis melo var. makuwa] (e value: 6.3e-164; % identity: 85.54)
Query:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE
        E QPPRR+IEH I+L+E TDPINVRPYRYGFHQKAEMEKLVEEML SGVIRP KSP+SSP+LLVKKKD SWRFCVDYRAVNN T+PDKFPI +VEELFDE
Subjt:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE

Query:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE
        LNGASLFSKIDLK GYHQIRM EKDIPKTAFRTHEGHY+FLVMPFGLTNAPATFQ LMN VF+PYLRKFVLVFFDDILVYSKNEE+H+ HLGKVLS+LRE
Subjt:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE

Query:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA
        H+LYANKKK SFAQ K+EYL HII GGGVEVDP+KIRSIADWP PTNVRE  GFLGLT YYRRFV+HYGSIAAPLTQLLKKGGF+WNKEA+EAFE+LK+A
Subjt:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA

Query:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT
        MISL VLALPNF++ FEI+TDAS VGVGA+LT
Subjt:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT

TYK27013.1 Transposon Tf2-9 polyprotein [Cucumis melo var. makuwa] (e value: 1.5e-165; % identity: 82)
Query:  TNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNN
        TN +Q VLK + DVF W E  PPRR+IEH I+L++ TDPINVR YRY FHQK EMEKLV+EML  GVIRP KSP+SSPVLLVKKKDGSWRFCVDYRA+NN
Subjt:  TNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNN

Query:  ATVPDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSK
         T+PDKFPI +VEELFDELNGASLFSKIDLKS YHQIRM EKDIPKTAFRTHEGHY+FLVMPFGLTNAPATFQALMN VF+PYLRKFVLVFFDDILVYSK
Subjt:  ATVPDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSK

Query:  NEEEHMNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKG
        NE++H+ HLGKVLS+LREH+L+ANKKKCSFAQ K+EYLGHIISG GVEVDP+KIRSIADWP  TNVRE  GFLGL  YYRRFV+HYGSI APLTQLLKKG
Subjt:  NEEEHMNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKG

Query:  GFMWNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIKTDASSVGVGAVLT
        GF WNKEAEEAFE+LK AMISL VLALPNF+Q FEI+TDAS V +GAVLT
Subjt:  GFMWNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIKTDASSVGVGAVLT

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SPK3 Transposon Ty3-I Gag-Pol polyprotein (e value: 7.3e-166; % identity: 86.14)
Query:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE
        E  PP+R+IEHHI L+E TDPINVRPYRYGFHQKAEMEKLVEEMLTSG+IRP KSPYSSPVLLVKKKDGSWRFCVDYRAVNNAT+PDKFPI VVEELFDE
Subjt:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE

Query:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE
        LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTH G+Y+FLVMPFGLTNAPATFQ LMNNVF+PYLRKFVLVFFDDIL+YSKNEEEH +HLG VLS LRE
Subjt:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE

Query:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA
        ++LYANKKKCSFAQ +IEYLGHIISG G+EVDPEKI SIADWPIP  V+E+ GFLGLT YYRRFVQHYGSIAAPLTQLLKKGGF WNKEAEEAF +LK A
Subjt:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA

Query:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT
        MISL VLALPNF++ FE++TDAS +GVGAVLT
Subjt:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT

A0A5A7V139 Transposon Tf2-9 polyprotein (e value: 3.0e-164; % identity: 85.54)
Query:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE
        E QPPRR+IEH I+L+E TDPINVRPYRYGFHQKAEMEKLVEEML SGVIRP KSP+SSP+LLVKKKD SWRFCVDYRAVNN T+PDKFPI +VEELFDE
Subjt:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE

Query:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE
        LNGASLFSKIDLK GYHQIRM EKDIPKTAFRTHEGHY+FLVMPFGLTNAPATFQ LMN VF+PYLRKFVLVFFDDILVYSKNEE+H+ HLGKVLS+LRE
Subjt:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE

Query:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA
        H+LYANKKK SFAQ K+EYL HII GGGVEVDP+KIRSIADWP PTNVRE  GFLGLT YYRRFV+HYGSIAAPLTQLLKKGGF+WNKEA+EAFE+LK+A
Subjt:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA

Query:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT
        MISL VLALPNF++ FEI+TDAS VGVGA+LT
Subjt:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT

A0A5D3D9X2 Peroxidase 64 (e value: 5.5e-206; % identity: 99.16)
Query:  MEISFVETNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCV
        MEISFVETNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCV
Subjt:  MEISFVETNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCV

Query:  DYRAVNNATVPDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFD
        DYRAVNNATVPDKFPISVVEELFDELNGASLFSK DLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFD
Subjt:  DYRAVNNATVPDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFD

Query:  DILVYSKNEEEHMNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPL
        DILVYSKNEEEHMNHLGK+LSTLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLT YYRRFVQHYGSIAAPL
Subjt:  DILVYSKNEEEHMNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPL

Query:  TQLLKKGGFMWNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIKTDASSVGVGAVLT
        TQLLKKGGFMWNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIKTDASSVGVGAVLT
Subjt:  TQLLKKGGFMWNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIKTDASSVGVGAVLT

A0A5D3DK67 Transposon Tf2-9 polyprotein (e value: 3.0e-164; % identity: 85.54)
Query:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE
        E QPPRR+IEH I+L+E TDPINVRPYRYGFHQKAEMEKLVEEML SGVIRP KSP+SSP+LLVKKKD SWRFCVDYRAVNN T+PDKFPI +VEELFDE
Subjt:  EMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDE

Query:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE
        LNGASLFSKIDLK GYHQIRM EKDIPKTAFRTHEGHY+FLVMPFGLTNAPATFQ LMN VF+PYLRKFVLVFFDDILVYSKNEE+H+ HLGKVLS+LRE
Subjt:  LNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLRE

Query:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA
        H+LYANKKK SFAQ K+EYL HII GGGVEVDP+KIRSIADWP PTNVRE  GFLGLT YYRRFV+HYGSIAAPLTQLLKKGGF+WNKEA+EAFE+LK+A
Subjt:  HTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNA

Query:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT
        MISL VLALPNF++ FEI+TDAS VGVGA+LT
Subjt:  MISLLVLALPNFDQLFEIKTDASSVGVGAVLT

A0A5D3DTQ1 Transposon Tf2-9 polyprotein (e value: 7.3e-166; % identity: 82)
Query:  TNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNN
        TN +Q VLK + DVF W E  PPRR+IEH I+L++ TDPINVR YRY FHQK EMEKLV+EML  GVIRP KSP+SSPVLLVKKKDGSWRFCVDYRA+NN
Subjt:  TNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNN

Query:  ATVPDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSK
         T+PDKFPI +VEELFDELNGASLFSKIDLKS YHQIRM EKDIPKTAFRTHEGHY+FLVMPFGLTNAPATFQALMN VF+PYLRKFVLVFFDDILVYSK
Subjt:  ATVPDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSK

Query:  NEEEHMNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKG
        NE++H+ HLGKVLS+LREH+L+ANKKKCSFAQ K+EYLGHIISG GVEVDP+KIRSIADWP  TNVRE  GFLGL  YYRRFV+HYGSI APLTQLLKKG
Subjt:  NEEEHMNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKG

Query:  GFMWNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIKTDASSVGVGAVLT
        GF WNKEAEEAFE+LK AMISL VLALPNF+Q FEI+TDAS V +GAVLT
Subjt:  GFMWNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIKTDASSVGVGAVLT

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 2.2e-71; % identity: 43.13)
Query:  YRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLV-KKKDGS----WRFCVDYRAVNNATVPDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRM
        Y Y    + E+E  +++ML  G+IR   SPY+SP+ +V KK+D S    +R  +DYR +N  TV D+ PI  ++E+  +L   + F+ IDL  G+HQI M
Subjt:  YRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLV-KKKDGS----WRFCVDYRAVNNATVPDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRM

Query:  EEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLG
        + + + KTAF T  GHY++L MPFGL NAPATFQ  MN++ RP L K  LV+ DDI+V+S + +EH+  LG V   L +  L     KC F + +  +LG
Subjt:  EEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLG

Query:  HIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFM--WNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIK
        H+++  G++ +PEKI +I  +PIPT  +EI  FLGLT YYR+F+ ++  IA P+T+ LKK   +   N E + AF+KLK  +    +L +P+F + F + 
Subjt:  HIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFM--WNKEAEEAFEKLKNAMISLLVLALPNFDQLFEIK

Query:  TDASSVGVGAVLT
        TDAS V +GAVL+
Subjt:  TDASSVGVGAVLT

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 7.4e-75; % identity: 42.68)
Query:  HIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKD-----GSWRFCVDYRAVNNATVPDKFPISVVEELFDELNGASL
        H+       PI  + Y      + E+E  V+EML  G+IR   SPY+SP  +V KK        +R  +DYR +N  T+PD++PI  ++E+  +L     
Subjt:  HIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKD-----GSWRFCVDYRAVNNATVPDKFPISVVEELFDELNGASL

Query:  FSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLREHTLYAN
        F+ IDL  G+HQI M+E+ I KTAF T  GHY++L MPFGL NAPATFQ  MNN+ RP L K  LV+ DDI+++S +  EH+N +  V + L +  L   
Subjt:  FSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLREHTLYAN

Query:  KKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAE--EAFEKLKNAMISL
          KC F + +  +LGHI++  G++ +P K+++I  +PIPT  +EI  FLGLT YYR+F+ +Y  IA P+T  LKK   +  ++ E  EAFEKLK  +I  
Subjt:  KKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAE--EAFEKLKNAMISL

Query:  LVLALPNFDQLFEIKTDASSVGVGAVLT
         +L LP+F++ F + TDAS++ +GAVL+
Subjt:  LVLALPNFDQLFEIKTDASSVGVGAVLT

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 1.8e-68; % identity: 42.11)
Query:  IEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDELNGASLFS
        ++H I ++       ++PY      + E+ K+V+++L +  I P KSP SSPV+LV KKDG++R CVDYR +N AT+ D FP+  ++ L   +  A +F+
Subjt:  IEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDELNGASLFS

Query:  KIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLREHTLYANKK
         +DL SGYHQI ME KD  KTAF T  G Y++ VMPFGL NAP+TF   M + FR    +FV V+ DDIL++S++ EEH  HL  VL  L+   L   KK
Subjt:  KIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLREHTLYANKK

Query:  KCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNAMISLLVLA
        KC FA  + E+LG+ I    +     K  +I D+P P  V++   FLG+  YYRRF+ +   IA P+ QL       W ++ ++A EKLK A+ +  VL 
Subjt:  KCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNAMISLLVLA

Query:  LPNFDQLFEIKTDASSVGVGAVL
          N    + + TDAS  G+GAVL
Subjt:  LPNFDQLFEIKTDASSVGVGAVL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 3.4e-72; % identity: 39.39)
Query:  DPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKK-----DGSWRFCVDYRAVNNATVPDKFPISVVEELFDELNGASLFSKIDLKS
        DPI  + Y Y  + + E+E+ ++E+L  G+IRP  SPY+SP+ +V KK     +  +R  VD++ +N  T+PD +PI  +      L  A  F+ +DL S
Subjt:  DPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKK-----DGSWRFCVDYRAVNNATVPDKFPISVVEELFDELNGASLFSKIDLKS

Query:  GYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLREHTLYANKKKCSFAQ
        G+HQI M+E DIPKTAF T  G Y+FL +PFGL NAPA FQ +++++ R ++ K   V+ DDI+V+S++ + H  +L  VL++L +  L  N +K  F  
Subjt:  GYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLREHTLYANKKKCSFAQ

Query:  SKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLK------------KGGFMWNKEAEEAFEKLKNAMI
        +++E+LG+I++  G++ DP+K+R+I++ P PT+V+E+  FLG+T YYR+F+Q Y  +A PLT L +            K     ++ A ++F  LK+ + 
Subjt:  SKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLK------------KGGFMWNKEAEEAFEKLKNAMI

Query:  SLLVLALPNFDQLFEIKTDASSVGVGAVLT
        S  +LA P F + F + TDAS+  +GAVL+
Subjt:  SLLVLALPNFDQLFEIKTDASSVGVGAVLT

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 1.8e-68; % identity: 41.8)
Query:  IEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDELNGASLFS
        ++H I ++       ++PY      + E+ K+V+++L +  I P KSP SSPV+LV KKDG++R CVDYR +N AT+ D FP+  ++ L   +  A +F+
Subjt:  IEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATVPDKFPISVVEELFDELNGASLFS

Query:  KIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLREHTLYANKK
         +DL SGYHQI ME KD  KTAF T  G Y++ VMPFGL NAP+TF   M + FR    +FV V+ DDIL++S++ EEH  HL  VL  L+   L   KK
Subjt:  KIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVLSTLREHTLYANKK

Query:  KCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNAMISLLVLA
        KC FA  + E+LG+ I    +     K  +I D+P P  V++   FLG+  YYRRF+ +   IA P+ QL       W ++ ++A +KLK+A+ +  VL 
Subjt:  KCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNAMISLLVLA

Query:  LPNFDQLFEIKTDASSVGVGAVL
          N    + + TDAS  G+GAVL
Subjt:  LPNFDQLFEIKTDASSVGVGAVL

Arabidopsis top hits (e value, % identity, alignment)
ATMG00850.1 DNA/RNA polymerases superfamily protein (e value: 2.2e-05; % identity: 50)
Query:  GFH--QKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSW
        G H  ++  ++  + EML + +I+P  SPYSSPVLLV+KKDG W
Subjt:  GFH--QKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSW

ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 5.9e-35; % identity: 55.56)
Query:  MNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLG--HIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFM
        MNHLG VL    +H  YAN+KKC+F Q +I YLG  HIISG GV  DP K+ ++  WP P N  E+ GFLGLT YYRRFV++YG I  PLT+LLKK    
Subjt:  MNHLGKVLSTLREHTLYANKKKCSFAQSKIEYLG--HIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFM

Query:  WNKEAEEAFEKLKNAMISLLVLALPN
        W + A  AF+ LK A+ +L VLALP+
Subjt:  WNKEAEEAFEKLKNAMISLLVLALPN
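
The % identity values in the hit lists can be cross-checked against the alignment blocks. In BLAST-style pairwise output the unlabeled middle line shows a letter where query and subject residues are identical, `+` for a conservative substitution, and a space otherwise. The sketch below counts the letters on that line and divides by the alignment length, using the ATMG00850.1 block above as input. The function name `percent_identity` is hypothetical, and the sketch assumes that gap columns count toward the alignment length.

```python
def percent_identity(query_line, midline):
    """Percent identity from one aligned query line and its BLAST-style midline."""
    aligned_length = len(query_line)
    # Trailing spaces of the midline are often lost when a page is copied.
    midline = midline.ljust(aligned_length)
    # Letters mark identical residues; '+' and ' ' mark non-identical columns.
    identities = sum(1 for symbol in midline[:aligned_length] if symbol.isalpha())
    return 100.0 * identities / aligned_length


# Query and midline copied from the ATMG00850.1 alignment.
query = "GFH--QKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSW"
midline = "G H  ++  ++  + EML + +I+P  SPYSSPVLLV+KKDG W"

identity = percent_identity(query, midline)
print(f"{identity:.2f}")
```

For alignments that span several blocks, the same count would have to be accumulated over all blocks before dividing.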


Sequences
CDS sequence
ATGGAAATTTCCTTCGTGGAAACAAATCCAGTGCAGAAGGTACTGAAACAGTATAAGGATGTGTTTGATTGGCTAGAAATGCAGCCGCCGAGAAGGAAAATTGAGCACCA
CATTTACCTTAGAGAGAGGACTGACCCAATTAATGTAAGACCATATAGATATGGCTTCCATCAGAAGGCTGAAATGGAGAAATTGGTGGAAGAAATGCTAACCTCGGGGG
TGATTCGGCCGAGAAAAAGTCCTTATTCCAGCCCAGTGTTACTTGTCAAAAAGAAGGATGGAAGTTGGAGGTTTTGTGTAGACTACCGGGCAGTTAATAATGCAACTGTT
CCGGATAAGTTCCCAATTTCGGTGGTGGAGGAACTTTTCGATGAGTTGAATGGAGCCAGCTTGTTCTCGAAAATTGATCTCAAATCTGGGTATCATCAAATAAGGATGGA
GGAAAAGGATATCCCGAAAACAGCTTTTAGGACGCATGAAGGACATTACAAGTTTCTTGTCATGCCGTTTGGATTGACTAATGCACCGGCTACTTTCCAGGCTTTGATGA
ATAATGTCTTCAGACCTTACTTGAGGAAGTTTGTCCTAGTATTTTTTGATGACATACTAGTTTATAGCAAGAATGAGGAAGAGCATATGAATCATTTGGGAAAAGTTCTT
TCAACCTTGAGGGAACATACACTATATGCTAATAAAAAGAAGTGCAGTTTTGCCCAGTCAAAGATTGAATATTTGGGGCATATAATATCAGGAGGAGGAGTAGAAGTGGA
TCCGGAGAAAATTCGTTCCATCGCTGATTGGCCAATACCGACAAATGTTAGAGAAATCTGGGGGTTCCTTGGCCTAACCGAGTATTACCGACGGTTTGTGCAACATTATG
GTTCTATAGCAGCACCACTAACCCAACTACTTAAAAAAGGGGGGTTTATGTGGAACAAAGAAGCTGAAGAAGCTTTTGAGAAGTTGAAGAACGCTATGATATCATTGCTG
GTATTAGCTCTCCCCAACTTTGATCAGCTCTTTGAGATCAAAACAGATGCTTCGAGTGTTGGAGTGGGGGCCGTATTAACATAG
mRNA sequence
ATGGAAATTTCCTTCGTGGAAACAAATCCAGTGCAGAAGGTACTGAAACAGTATAAGGATGTGTTTGATTGGCTAGAAATGCAGCCGCCGAGAAGGAAAATTGAGCACCA
CATTTACCTTAGAGAGAGGACTGACCCAATTAATGTAAGACCATATAGATATGGCTTCCATCAGAAGGCTGAAATGGAGAAATTGGTGGAAGAAATGCTAACCTCGGGGG
TGATTCGGCCGAGAAAAAGTCCTTATTCCAGCCCAGTGTTACTTGTCAAAAAGAAGGATGGAAGTTGGAGGTTTTGTGTAGACTACCGGGCAGTTAATAATGCAACTGTT
CCGGATAAGTTCCCAATTTCGGTGGTGGAGGAACTTTTCGATGAGTTGAATGGAGCCAGCTTGTTCTCGAAAATTGATCTCAAATCTGGGTATCATCAAATAAGGATGGA
GGAAAAGGATATCCCGAAAACAGCTTTTAGGACGCATGAAGGACATTACAAGTTTCTTGTCATGCCGTTTGGATTGACTAATGCACCGGCTACTTTCCAGGCTTTGATGA
ATAATGTCTTCAGACCTTACTTGAGGAAGTTTGTCCTAGTATTTTTTGATGACATACTAGTTTATAGCAAGAATGAGGAAGAGCATATGAATCATTTGGGAAAAGTTCTT
TCAACCTTGAGGGAACATACACTATATGCTAATAAAAAGAAGTGCAGTTTTGCCCAGTCAAAGATTGAATATTTGGGGCATATAATATCAGGAGGAGGAGTAGAAGTGGA
TCCGGAGAAAATTCGTTCCATCGCTGATTGGCCAATACCGACAAATGTTAGAGAAATCTGGGGGTTCCTTGGCCTAACCGAGTATTACCGACGGTTTGTGCAACATTATG
GTTCTATAGCAGCACCACTAACCCAACTACTTAAAAAAGGGGGGTTTATGTGGAACAAAGAAGCTGAAGAAGCTTTTGAGAAGTTGAAGAACGCTATGATATCATTGCTG
GTATTAGCTCTCCCCAACTTTGATCAGCTCTTTGAGATCAAAACAGATGCTTCGAGTGTTGGAGTGGGGGCCGTATTAACATAG
Protein sequence
MEISFVETNPVQKVLKQYKDVFDWLEMQPPRRKIEHHIYLRERTDPINVRPYRYGFHQKAEMEKLVEEMLTSGVIRPRKSPYSSPVLLVKKKDGSWRFCVDYRAVNNATV
PDKFPISVVEELFDELNGASLFSKIDLKSGYHQIRMEEKDIPKTAFRTHEGHYKFLVMPFGLTNAPATFQALMNNVFRPYLRKFVLVFFDDILVYSKNEEEHMNHLGKVL
STLREHTLYANKKKCSFAQSKIEYLGHIISGGGVEVDPEKIRSIADWPIPTNVREIWGFLGLTEYYRRFVQHYGSIAAPLTQLLKKGGFMWNKEAEEAFEKLKNAMISLL
VLALPNFDQLFEIKTDASSVGVGAVLT
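
The protein sequence can be checked against the CDS by translating it. The sketch below is a minimal illustration and assumes the standard nuclear genetic code. The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGGAAATTTCCTTCGTGGAAACAAATCCA"
peptide = translate(cds_start)
print(peptide)
```

The printed peptide, MEISFVETNP, matches the first ten residues of the protein sequence. Passing the full CDS (with line breaks removed) should, under the same assumption, reproduce the whole protein.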