CuGenDBv2

Cmc08g0222231 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0222231
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr08:10726692..10727829
RNA-Seq Expression: Cmc08g0222231
Synteny: Cmc08g0222231
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0039192.1 reverse transcriptase [Cucumis melo var. makuwa] | 1.8e-176 | 91.33
Query:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD RE EVSLSSEPVVREYPDVFPDEL GLPPPRE+DFAIELE GT PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PWGA+VLFVKKKDGSM LCIDYRELNKVTVKN  PLPRIDDLFDQLQGAT+FSKIDLRSGYHQLRIRDS+I KTAFRSRY HYEFIVMSFGLTNAPAVFM
Subjt:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DFDEQSVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVW
        D   + VLET RANKLYAKFSKCEFWLKKVTFL HVVSS  VSVDPAKIEAVT+WPRPSTVSEIR FLGL GYYRRFVEDFSRIA+PLTQLTRKGTPFVW
Subjt:  DFDEQSVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVW

Query:  SPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM
        SPACESSFQELK+KLV+APVLTVPDGSGSFVIYSDASKKGL CVLM
Subjt:  SPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM

KAA0040409.1 pol protein [Cucumis melo var. makuwa] | 2.6e-175 | 88.42
Query:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD REPEVSLSSEPVVREYPDVFPD+L GLPPPRE+DFAIELE GT PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PWGA VLFVKKKDGSM LCIDYRELNKVTVKNR PLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD +IPKTAFRSRY HYEF+VMSFGLTNAPAVFM
Subjt:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DFDEQS--------VLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLT
        D   ++        VLET RANKLYAKFSKCEFWL+KVTFL HVVSS  VSVDPAKIEAVT+WPRPSTVSEIRSFLGL GYYR FVEDFSRIA+PLTQLT
Subjt:  DFDEQS--------VLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLT

Query:  RKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM
        RKGTPFVWS ACE SFQELK+KLV+A VLTVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  RKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM

KAA0058812.1 pol protein [Cucumis melo var. makuwa] | 2.6e-175 | 83.6
Query:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD REPEVSLSSEPVVREYPDVFPDEL GLPPPRE+DFAIELE GT PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PWGA VLFVKKKDGSM LCIDYRELNKVT+KNR PLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD +IPKTAF SRY HYEF+VMSFGLTNAPAVFM
Subjt:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL
        D   +                                 VLET RANKLYAKFSKCEFWL+KVTFL HVVSS  VSVDPAKIEAVT+WPRPSTVSEIRSFL
Subjt:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL

Query:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM
        GL GYYRRFVEDFSRIA+PLTQLTRKGTPFVWSPACESSFQELK+KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM

KAA0060745.1 pol protein [Cucumis melo var. makuwa] | 2.6e-175 | 83.86
Query:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD REPEVSLSSEPVVREYPDVFPDEL GLPPPRE+DFAIELE GT PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PWGA VLFVKKKDGSM LCIDYRELNKVTVKNR PLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIRD +IPKTAFRSRY HYEF+VMSFGLTNAPAVFM
Subjt:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL
        D   +                                 VLET RANKLYAKFSKCEFWL+KVTFL HVVSS  VSVDPAKIEAVT+WPRPSTVSEIRSFL
Subjt:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL

Query:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM
        GL GYYRRFVEDFSRIA+PLTQLTRKGTPFVWSPACESSFQELK+KLV APVLTVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM

TYK05193.1 pol protein [Cucumis melo var. makuwa] | 1.5e-175 | 83.86
Query:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD REPEVSLSSEPVVREYPDVFPDEL GLPPPRE+DFAIELE GT PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PWGA VLFVKKKDGSM LCIDYRELNKVTVKNR PLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD +IPKTAFRSRY HYEF+VMSFGLTNAPAVFM
Subjt:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL
        D   +                                 VLET RANKLYAKFSKCEFWL+KVTFL HVVSS  VSVDP KIEAVT+WPRPSTVSEIRSFL
Subjt:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL

Query:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM
        GL GYYRRFVEDFSRIA+PLTQLTRKGTPFVWSPACESSFQELK+KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM

TrEMBL top hits | e value | % identity | Alignment
A0A5A7T6S9 Reverse transcriptase | 8.8e-177 | 91.33
Query:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD RE EVSLSSEPVVREYPDVFPDEL GLPPPRE+DFAIELE GT PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PWGA+VLFVKKKDGSM LCIDYRELNKVTVKN  PLPRIDDLFDQLQGAT+FSKIDLRSGYHQLRIRDS+I KTAFRSRY HYEFIVMSFGLTNAPAVFM
Subjt:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DFDEQSVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVW
        D   + VLET RANKLYAKFSKCEFWLKKVTFL HVVSS  VSVDPAKIEAVT+WPRPSTVSEIR FLGL GYYRRFVEDFSRIA+PLTQLTRKGTPFVW
Subjt:  DFDEQSVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVW

Query:  SPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM
        SPACESSFQELK+KLV+APVLTVPDGSGSFVIYSDASKKGL CVLM
Subjt:  SPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM

A0A5A7TGL7 Pol protein | 1.3e-175 | 88.42
Query:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD REPEVSLSSEPVVREYPDVFPD+L GLPPPRE+DFAIELE GT PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PWGA VLFVKKKDGSM LCIDYRELNKVTVKNR PLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD +IPKTAFRSRY HYEF+VMSFGLTNAPAVFM
Subjt:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DFDEQS--------VLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLT
        D   ++        VLET RANKLYAKFSKCEFWL+KVTFL HVVSS  VSVDPAKIEAVT+WPRPSTVSEIRSFLGL GYYR FVEDFSRIA+PLTQLT
Subjt:  DFDEQS--------VLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLT

Query:  RKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM
        RKGTPFVWS ACE SFQELK+KLV+A VLTVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  RKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM

A0A5A7USG7 Reverse transcriptase | 1.3e-175 | 83.6
Query:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD REPEVSLSSEPVVREYPDVFPDEL GLPPPRE+DFAIELE GT PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PWGA VLFVKKKDGSM LCIDYRELNKVT+KNR PLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD +IPKTAF SRY HYEF+VMSFGLTNAPAVFM
Subjt:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL
        D   +                                 VLET RANKLYAKFSKCEFWL+KVTFL HVVSS  VSVDPAKIEAVT+WPRPSTVSEIRSFL
Subjt:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL

Query:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM
        GL GYYRRFVEDFSRIA+PLTQLTRKGTPFVWSPACESSFQELK+KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM

A0A5A7V4E4 Reverse transcriptase | 1.3e-175 | 83.86
Query:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD REPEVSLSSEPVVREYPDVFPDEL GLPPPRE+DFAIELE GT PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PWGA VLFVKKKDGSM LCIDYRELNKVTVKNR PLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIRD +IPKTAFRSRY HYEF+VMSFGLTNAPAVFM
Subjt:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL
        D   +                                 VLET RANKLYAKFSKCEFWL+KVTFL HVVSS  VSVDPAKIEAVT+WPRPSTVSEIRSFL
Subjt:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL

Query:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM
        GL GYYRRFVEDFSRIA+PLTQLTRKGTPFVWSPACESSFQELK+KLV APVLTVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM

A0A5D3BZN1 Reverse transcriptase | 7.5e-176 | 83.86
Query:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD REPEVSLSSEPVVREYPDVFPDEL GLPPPRE+DFAIELE GT PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PWGA VLFVKKKDGSM LCIDYRELNKVTVKNR PLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD +IPKTAFRSRY HYEF+VMSFGLTNAPAVFM
Subjt:  PWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL
        D   +                                 VLET RANKLYAKFSKCEFWL+KVTFL HVVSS  VSVDP KIEAVT+WPRPSTVSEIRSFL
Subjt:  DFDEQ--------------------------------SVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFL

Query:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM
        GL GYYRRFVEDFSRIA+PLTQLTRKGTPFVWSPACESSFQELK+KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  GLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLM

SwissProt top hits | e value | % identity | Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 1.8e-49 | 35.26
Query:  YRMAPAELKELKVQLQELLDKGFIRPSVSPWGASVLFV-KKKDGS----MSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRI
        Y    A  +E++ Q+Q++L++G IR S SP+ + +  V KK+D S      + IDYR+LN++TV +R P+P +D++  +L     F+ IDL  G+HQ+ +
Subjt:  YRMAPAELKELKVQLQELLDKGFIRPSVSPWGASVLFV-KKKDGS----MSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRI

Query:  RDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVF-----------------------------MDFDEQS---VLETFRANKLYAKFSKCEFWLKKVTFLN
           ++ KTAF +++ HYE++ M FGL NAPA F                             +D   QS   V E      L  +  KCEF  ++ TFL 
Subjt:  RDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVF-----------------------------MDFDEQS---VLETFRANKLYAKFSKCEFWLKKVTFLN

Query:  HVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPF-VWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIY
        HV++   +  +P KIEA+  +P P+   EI++FLGL GYYR+F+ +F+ IA P+T+  +K       +P  +S+F++LK  +   P+L VPD +  F + 
Subjt:  HVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPF-VWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIY

Query:  SDASKKGLGCVL
        +DAS   LG VL
Subjt:  SDASKKGLGCVL

P20825 Retrovirus-related Pol polyprotein from transposon 297 | 9.8e-48 | 33.54
Query:  TPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGASVLFVKKKDGSMS-----LCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRS
        +PI    Y +A     E++ Q+QE+L++G IR S SP+ +    V KK  +       + IDYR+LN++T+ +R P+P +D++  +L     F+ IDL  
Subjt:  TPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGASVLFVKKKDGSMS-----LCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRS

Query:  GYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVF-----------------MDFDE---------------QSVLETFRANKLYAKFSKCEFWL
        G+HQ+ + + +I KTAF ++  HYE++ M FGL NAPA F                 +  D+               Q V        L  +  KCEF  
Subjt:  GYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVF-----------------MDFDE---------------QSVLETFRANKLYAKFSKCEFWL

Query:  KKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPF-VWSPACESSFQELKRKLVSAPVLTVPDG
        K+  FL H+V+   +  +P K++A+ S+P P+   EIR+FLGL GYYR+F+ +++ IA P+T   +K T           +F++LK  ++  P+L +PD 
Subjt:  KKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPF-VWSPACESSFQELKRKLVSAPVLTVPDG

Query:  SGSFVIYSDASKKGLGCVL
           FV+ +DAS   LG VL
Subjt:  SGSFVIYSDASKKGLGCVL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 8.8e-49 | 31.03
Query:  KASKLLSQGTWGILASVVDTREPEVSLSSEP---------VVREYPDVFPDELRGLPPPREIDF-------AIELELGTTPISRAPYRMAPAELKELKVQ
        +AS L   G +  + S + + EP  +  S           + ++Y ++  ++L    PPR  D         IE++ G       PY +     +E+   
Subjt:  KASKLLSQGTWGILASVVDTREPEVSLSSEP---------VVREYPDVFPDELRGLPPPREIDF-------AIELELGTTPISRAPYRMAPAELKELKVQ

Query:  LQELLDKGFIRPSVSPWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEF
        +Q+LLD  FI PS SP  + V+ V KKDG+  LC+DYR LNK T+ +  PLPRID+L  ++  A +F+ +DL SGYHQ+ +   +  KTAF +    YE+
Subjt:  LQELLDKGFIRPSVSPWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEF

Query:  IVMSFGLTNAPAVF---------------------MDFDE---------QSVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSW
         VM FGL NAP+ F                     + F E          +VLE  +   L  K  KC+F  ++  FL + +   +++    K  A+  +
Subjt:  IVMSFGLTNAPAVF---------------------MDFDE---------QSVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVL
        P P TV + + FLG++ YYRRF+ + S+IA P+       +   W+   + + ++LK  L ++PVL   +   ++ + +DASK G+G VL
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | 4.1e-46 | 30.75
Query:  EYPDVFPDELRGLPPPREIDFAIELELGTT---PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGASVLFVKKK-----DGSMSLCIDYRELNK
        E+P +F   L G+     ++ A++ E+ T    PI    Y        E++ Q+ ELL  G IRPS SP+ + +  V KK     +    + +D++ LN 
Subjt:  EYPDVFPDELRGLPPPREIDFAIELELGTT---PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGASVLFVKKK-----DGSMSLCIDYRELNK

Query:  VTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM---------------------------
        VT+ +  P+P I+     L  A  F+ +DL SG+HQ+ +++S+IPKTAF +    YEF+ + FGL NAPA+F                            
Subjt:  VTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFM---------------------------

Query:  DFDE-----QSVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTR--
        D+D      + VL +     L     K  F   +V FL ++V++  +  DP K+ A++  P P++V E++ FLG+  YYR+F++D++++A PLT LTR  
Subjt:  DFDE-----QSVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTR--

Query:  ---------KGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVL
                    P         SF +LK  L S+ +L  P  +  F + +DAS   +G VL
Subjt:  ---------KGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVL

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 2.0e-48 | 31.03
Query:  KASKLLSQGTWGILASVVDTREPEVSLSSEP---------VVREYPDVFPDELRGLPPPREIDF-------AIELELGTTPISRAPYRMAPAELKELKVQ
        +AS L   G +  + S + + EP  +  S           + ++Y ++  ++L    PPR  D         IE++ G       PY +     +E+   
Subjt:  KASKLLSQGTWGILASVVDTREPEVSLSSEP---------VVREYPDVFPDELRGLPPPREIDF-------AIELELGTTPISRAPYRMAPAELKELKVQ

Query:  LQELLDKGFIRPSVSPWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEF
        +Q+LLD  FI PS SP  + V+ V KKDG+  LC+DYR LNK T+ +  PLPRID+L  ++  A +F+ +DL SGYHQ+ +   +  KTAF +    YE+
Subjt:  LQELLDKGFIRPSVSPWGASVLFVKKKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEF

Query:  IVMSFGLTNAPAVF---------------------MDFDE---------QSVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSW
         VM FGL NAP+ F                     + F E          +VLE  +   L  K  KC+F  ++  FL + +   +++    K  A+  +
Subjt:  IVMSFGLTNAPAVF---------------------MDFDE---------QSVLETFRANKLYAKFSKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVL
        P P TV + + FLG++ YYRRF+ + S+IA P+       +   W+   + +  +LK  L ++PVL   +   ++ + +DASK G+G VL
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVL

Arabidopsis top hits | e value | % identity | Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein | 1.0e-23 | 40.16
Query:  VLETFRANKLYAKFSKCEFWLKKVTFL--NHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPAC
        VL+ +  ++ YA   KC F   ++ +L   H++S   VS DPAK+EA+  WP P   +E+R FLGL GYYRRFV+++ +I  PLT+L +K +   W+   
Subjt:  VLETFRANKLYAKFSKCEFWLKKVTFL--NHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPAC

Query:  ESSFQELKRKLVSAPVLTVPDGSGSFV
          +F+ LK  + + PVL +PD    FV
Subjt:  ESSFQELKRKLVSAPVLTVPDGSGSFV
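As a rough illustration of how a % identity figure can be read off alignments like those above, the sketch below counts midline positions that carry a letter (an identical residue) and divides by the alignment length. The function name `percent_identity` and the toy midline are hypothetical. How the database computed the values it reports is not stated in this record, so this is an assumption and the numbers need not match exactly.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue.

    Assumes a BLAST-style midline: a letter marks an identity, '+' marks a
    conservative substitution, and a space marks a mismatch or gap column.
    The midline must be padded to the full alignment length.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Toy midline (not taken from this record): 3 identities over 5 columns.
print(percent_identity("MK+ S"))  # 60.0
```

When applying this to a real block, concatenate the midline segments of every Query/Subjt pair for one hit and restore any trailing spaces that were trimmed, otherwise the denominator will be too small.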


Sequences
CDS sequence
ATGAAGGCTAGTAAACTACTCAGCCAGGGTACTTGGGGCATCTTAGCAAGCGTAGTAGATACTAGAGAGCCAGAAGTTTCCCTGTCCTCCGAACCAGTGGTAAGGGAGTA
CCCTGATGTTTTCCCCGACGAGCTTCGAGGACTCCCGCCTCCCAGGGAGATAGACTTCGCCATCGAGTTAGAGCTGGGCACTACTCCTATCTCGAGAGCCCCTTATAGAA
TGGCTCCAGCCGAGCTAAAGGAGCTGAAGGTACAGCTGCAGGAATTGCTGGATAAGGGTTTCATCCGACCCAGTGTGTCACCTTGGGGAGCCTCAGTGTTGTTTGTGAAG
AAGAAGGATGGGTCGATGAGCCTTTGCATAGACTACAGAGAGCTGAACAAGGTGACAGTTAAGAATCGTTGTCCCTTGCCCAGGATTGATGACTTATTCGATCAGTTGCA
GGGAGCCACCGTCTTTTCTAAGATCGATCTGCGATCAGGCTATCACCAGTTGAGGATCAGGGACAGTAACATCCCTAAGACGGCCTTCCGTTCTAGATACAGACATTACG
AGTTCATTGTGATGTCTTTTGGTCTGACTAATGCTCCTGCGGTATTCATGGACTTTGATGAACAGAGTGTTTTGGAGACTTTTCGAGCCAACAAGCTGTATGCCAAGTTC
TCCAAGTGTGAGTTCTGGTTAAAGAAGGTGACGTTCCTCAACCACGTGGTTTCTAGTGGGAGAGTTTCTGTAGACCCAGCAAAGATCGAAGCAGTTACTAGTTGGCCCCG
ACCTTCGACAGTTAGCGAGATTCGTAGTTTCTTGGGTTTGGTAGGTTACTATAGGAGGTTCGTGGAAGACTTCTCTCGTATAGCCAATCCCTTGACTCAGTTGACCAGGA
AGGGGACCCCTTTTGTTTGGAGCCCAGCTTGCGAGAGTAGCTTCCAGGAGCTTAAGCGGAAGCTGGTGTCTGCACCAGTCCTGACAGTGCCCGATGGGTCGGGAAGCTTT
GTGATCTATAGTGATGCCTCCAAAAAGGGACTGGGTTGTGTCCTGATGTAG
mRNA sequence
ATGAAGGCTAGTAAACTACTCAGCCAGGGTACTTGGGGCATCTTAGCAAGCGTAGTAGATACTAGAGAGCCAGAAGTTTCCCTGTCCTCCGAACCAGTGGTAAGGGAGTA
CCCTGATGTTTTCCCCGACGAGCTTCGAGGACTCCCGCCTCCCAGGGAGATAGACTTCGCCATCGAGTTAGAGCTGGGCACTACTCCTATCTCGAGAGCCCCTTATAGAA
TGGCTCCAGCCGAGCTAAAGGAGCTGAAGGTACAGCTGCAGGAATTGCTGGATAAGGGTTTCATCCGACCCAGTGTGTCACCTTGGGGAGCCTCAGTGTTGTTTGTGAAG
AAGAAGGATGGGTCGATGAGCCTTTGCATAGACTACAGAGAGCTGAACAAGGTGACAGTTAAGAATCGTTGTCCCTTGCCCAGGATTGATGACTTATTCGATCAGTTGCA
GGGAGCCACCGTCTTTTCTAAGATCGATCTGCGATCAGGCTATCACCAGTTGAGGATCAGGGACAGTAACATCCCTAAGACGGCCTTCCGTTCTAGATACAGACATTACG
AGTTCATTGTGATGTCTTTTGGTCTGACTAATGCTCCTGCGGTATTCATGGACTTTGATGAACAGAGTGTTTTGGAGACTTTTCGAGCCAACAAGCTGTATGCCAAGTTC
TCCAAGTGTGAGTTCTGGTTAAAGAAGGTGACGTTCCTCAACCACGTGGTTTCTAGTGGGAGAGTTTCTGTAGACCCAGCAAAGATCGAAGCAGTTACTAGTTGGCCCCG
ACCTTCGACAGTTAGCGAGATTCGTAGTTTCTTGGGTTTGGTAGGTTACTATAGGAGGTTCGTGGAAGACTTCTCTCGTATAGCCAATCCCTTGACTCAGTTGACCAGGA
AGGGGACCCCTTTTGTTTGGAGCCCAGCTTGCGAGAGTAGCTTCCAGGAGCTTAAGCGGAAGCTGGTGTCTGCACCAGTCCTGACAGTGCCCGATGGGTCGGGAAGCTTT
GTGATCTATAGTGATGCCTCCAAAAAGGGACTGGGTTGTGTCCTGATGTAG
Protein sequence
MKASKLLSQGTWGILASVVDTREPEVSLSSEPVVREYPDVFPDELRGLPPPREIDFAIELELGTTPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGASVLFVK
KKDGSMSLCIDYRELNKVTVKNRCPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSNIPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDFDEQSVLETFRANKLYAKF
SKCEFWLKKVTFLNHVVSSGRVSVDPAKIEAVTSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIANPLTQLTRKGTPFVWSPACESSFQELKRKLVSAPVLTVPDGSGSF
VIYSDASKKGLGCVLM
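The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch under that assumption; the names `CODON_TABLE` and `translate` are illustrative and are not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGAAGGCTAGTAAACTACTC"))  # MKASKLL
```

Passing the full CDS (all lines joined) to `translate` and comparing the result with the protein sequence is a quick consistency check on a downloaded record.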