; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0164661 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0164661
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr06:15448821..15449894
RNA-Seq ExpressionCmc06g0164661
SyntenyCmc06g0164661
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039594.1 reverse transcriptase [Cucumis melo var. makuwa]8.7e-19093.56Show/hide
Query:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK
        MA AELKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDGS+RLCIDY++LNKVTVKNRY LPRIDDLFDQLQ +TVFSKIDLRSGYHQLRIRDSDIPK
Subjt:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG
        T FRSRYGHYEFIVMSFGLTNAPTVFMDL NRVFKDFLDS VIVFI DILIYSKTEA+HEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFL HVVSSEG
Subjt:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
        VSV+PAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIAS LT LTRKGT FVWSPACESSFQELKQKLVT PVL VPDGSGSFVIY+DASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        GCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

KAA0054634.1 pol protein [Cucumis melo var. makuwa]8.7e-19093.28Show/hide
Query:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK
        MA AELKELKVQLQELLDKGFIRPSVSPWGA VLFVK+KDGSM LCIDY+ELNKVTVKNRY LPRIDDLFDQLQ ATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG
        TTFRSRYGHYEF+VMSFGLTNAP VFMDL NRVFKDFLDS VIVFI DILIYSKTEA+HEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEG
Subjt:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLT LTRKGT FVWSPACESSFQELKQKLVT PVL VPDGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        GCVLMQQGKVVAY SRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

KAA0063793.1 pol protein [Cucumis melo var. makuwa]5.1e-19093.28Show/hide
Query:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK
        MA AELKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDGSMRLCIDY+ELNKVTVKNRY LPRIDDLFDQLQ ATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG
        T FRSRYGHYEF+VMSFGLTNAP VFMDL NRVFKDFLDS VIVFI DILIYSKTEA+HEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEG
Subjt:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLT LTRKGT FVWSPACE SFQELKQKLVT PVL VPDGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        GCVLMQQGKVVAY SRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

KAA0063946.1 pol protein [Cucumis melo var. makuwa]5.1e-19093.84Show/hide
Query:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK
        MA AELKELKVQLQELLDKGFIR SVSPWGA+VLFVKKKDGSMRLCIDY+ELNKVTVKNRY LPRIDDLFDQLQ ATVFSKIDLRSGYHQLRIRDSDIPK
Subjt:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG
        T FRSRYGHYEF+VMSFGLTNAP VFMDL NRVFKDFLDS VIVFI DILIYSKTEA+HEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEG
Subjt:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSR ASPLT LTRKGT FVWSPACESSFQELKQKLVT PVL VPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        GCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAVVFALKIWR YLYGEKIQIFTDH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

TYK05193.1 pol protein [Cucumis melo var. makuwa]1.8e-19093.56Show/hide
Query:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK
        MA AELKELKVQLQELLDKGFIRPSVSPWGA+VLFVKKKDGSMRLCIDY+ELNKVTVKNRY LPRIDDLFDQLQ ATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG
        T FRSRYGHYEF+VMSFGLTNAP VFMDL NRVFKDFLDS VIVFI DILIYSKTEA+HEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEG
Subjt:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLT LTRKGT FVWSPACESSFQELKQKLVT PVL VPDGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        GCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAVVFALKIWR YLYGEKIQIFTDH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

TrEMBL top hitse value%identityAlignment
A0A5A7T804 Reverse transcriptase4.2e-19093.56Show/hide
Query:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK
        MA AELKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDGS+RLCIDY++LNKVTVKNRY LPRIDDLFDQLQ +TVFSKIDLRSGYHQLRIRDSDIPK
Subjt:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG
        T FRSRYGHYEFIVMSFGLTNAPTVFMDL NRVFKDFLDS VIVFI DILIYSKTEA+HEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFL HVVSSEG
Subjt:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
        VSV+PAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIAS LT LTRKGT FVWSPACESSFQELKQKLVT PVL VPDGSGSFVIY+DASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        GCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

A0A5A7UHL7 Reverse transcriptase4.2e-19093.28Show/hide
Query:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK
        MA AELKELKVQLQELLDKGFIRPSVSPWGA VLFVK+KDGSM LCIDY+ELNKVTVKNRY LPRIDDLFDQLQ ATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG
        TTFRSRYGHYEF+VMSFGLTNAP VFMDL NRVFKDFLDS VIVFI DILIYSKTEA+HEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEG
Subjt:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLT LTRKGT FVWSPACESSFQELKQKLVT PVL VPDGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        GCVLMQQGKVVAY SRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

A0A5A7V6R2 Reverse transcriptase2.5e-19093.28Show/hide
Query:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK
        MA AELKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDGSMRLCIDY+ELNKVTVKNRY LPRIDDLFDQLQ ATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG
        T FRSRYGHYEF+VMSFGLTNAP VFMDL NRVFKDFLDS VIVFI DILIYSKTEA+HEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEG
Subjt:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLT LTRKGT FVWSPACE SFQELKQKLVT PVL VPDGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        GCVLMQQGKVVAY SRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

A0A5A7VBY3 Reverse transcriptase2.5e-19093.84Show/hide
Query:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK
        MA AELKELKVQLQELLDKGFIR SVSPWGA+VLFVKKKDGSMRLCIDY+ELNKVTVKNRY LPRIDDLFDQLQ ATVFSKIDLRSGYHQLRIRDSDIPK
Subjt:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG
        T FRSRYGHYEF+VMSFGLTNAP VFMDL NRVFKDFLDS VIVFI DILIYSKTEA+HEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEG
Subjt:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSR ASPLT LTRKGT FVWSPACESSFQELKQKLVT PVL VPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        GCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAVVFALKIWR YLYGEKIQIFTDH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

A0A5D3BZN1 Reverse transcriptase8.5e-19193.56Show/hide
Query:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK
        MA AELKELKVQLQELLDKGFIRPSVSPWGA+VLFVKKKDGSMRLCIDY+ELNKVTVKNRY LPRIDDLFDQLQ ATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG
        T FRSRYGHYEF+VMSFGLTNAP VFMDL NRVFKDFLDS VIVFI DILIYSKTEA+HEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFL HVVSSEG
Subjt:  TTFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLT LTRKGT FVWSPACESSFQELKQKLVT PVL VPDGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        GCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAVVFALKIWR YLYGEKIQIFTDH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.7e-7438.94Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGAQVLFV-KKKDGS----MRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKT
        +E++ Q+Q++L++G IR S SP+ + +  V KK+D S     R+ IDY++LN++TV +R+ +P +D++  +L R   F+ IDL  G+HQ+ +    + KT
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAQVLFV-KKKDGS----MRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKT

Query:  TFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGV
         F +++GHYE++ M FGL NAP  F    N + +  L+   +V++ DI+++S +  +H + L  V E L    L  +  KCEF  ++ TFL HV++ +G+
Subjt:  TFRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGV

Query:  SVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHF-VWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL
          +P KIEA+  +P P+   EI++FLGL GYYR+F+ +F+ IA P+T   +K       +P  +S+F++LK  +   P+L VPD +  F + +DAS   L
Subjt:  SVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHF-VWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
        G VL Q G  ++Y SR L  HE NY T + EL A+V+A K +RHYL G   +I +DH
Subjt:  GCVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

P20825 Retrovirus-related Pol polyprotein from transposon 2978.5e-7137.08Show/hide
Query:  ELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKD-----GSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKTT
        E++ Q+QE+L++G IR S SP+ +    V KK         R+ IDY++LN++T+ +RY +P +D++  +L +   F+ IDL  G+HQ+ + +  I KT 
Subjt:  ELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKD-----GSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKTT

Query:  FRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGVS
        F ++ GHYE++ M FGL NAP  F    N + +  L+   +V++ DI+I+S +  +H   +  V   L    L  +  KCEF  K+  FL H+V+ +G+ 
Subjt:  FRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGVS

Query:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHF-VWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGLG
         +P K++A+ ++P P+   EIR+FLGL GYYR+F+ +++ IA P+T   +K T           +F++LK  ++  P+L +PD    FV+ +DAS   LG
Subjt:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHF-VWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGLG

Query:  CVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
         VL Q G  +++ SR L  HE NY   + EL A+V+A K +RHYL G +  I +DH
Subjt:  CVLMQQGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.8e-6637.54Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKTTFRSR
        +E+   +Q+LLD  FI PS SP  + V+ V KKDG+ RLC+DY+ LNK T+ + + LPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KT F + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKTTFRSR

Query:  YGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGVSVDPA
         G YE+ VM FGL NAP+ F       F+D     V V++ DILI+S++  +H +HL  VLE L+   L  K  KC+F  ++  FL + +  + ++    
Subjt:  YGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGVSVDPA

Query:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGLGCVLMQ
        K  A+ ++P P TV + + FLG+  YYRRF+ + S+IA P+ L     +   W+   + + ++LK  L   PVL+  +   ++ + +DASK G+G VL +
Subjt:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGLGCVLMQ

Query:  QGK------VVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
                 VV Y S+ L+S ++NYP  +LEL  ++ AL  +R+ L+G+   + TDH
Subjt:  QGK------VVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus5.9e-6434.77Show/hide
Query:  ELKVQLQELLDKGFIRPSVSPWGAQVLFVKKK-----DGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKTT
        E++ Q+ ELL  G IRPS SP+ + +  V KK     +   R+ +D+K LN VT+ + Y +P I+     L  A  F+ +DL SG+HQ+ +++SDIPKT 
Subjt:  ELKVQLQELLDKGFIRPSVSPWGAQVLFVKKK-----DGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKTT

Query:  FRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGVS
        F +  G YEF+ + FGL NAP +F  + + + ++ +  +  V+I DI+++S+    H ++L  VL +L    L     K  F   +V FL ++V+++G+ 
Subjt:  FRSRYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGVS

Query:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTR-----------KGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVI
         DP K+ A++  P P++V E++ FLG+  YYR+F++D++++A PLT LTR                        SF +LK  L +  +L  P  +  F +
Subjt:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTR-----------KGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVI

Query:  YSDASKKGLGCVLMQ----QGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE-KIQIFTDH
         +DAS   +G VL Q    + + +AY SR L   E+NY T + E+ A++++L   R YLYG   I+++TDH
Subjt:  YSDASKKGLGCVLMQ----QGKVVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE-KIQIFTDH

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.7e-6637.54Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKTTFRSR
        +E+   +Q+LLD  FI PS SP  + V+ V KKDG+ RLC+DY+ LNK T+ + + LPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KT F + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKTTFRSR

Query:  YGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGVSVDPA
         G YE+ VM FGL NAP+ F       F+D     V V++ DILI+S++  +H +HL  VLE L+   L  K  KC+F  ++  FL + +  + ++    
Subjt:  YGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGVSVDPA

Query:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGLGCVLMQ
        K  A+ ++P P TV + + FLG+  YYRRF+ + S+IA P+ L     +   W+   + +  +LK  L   PVL+  +   ++ + +DASK G+G VL +
Subjt:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGLGCVLMQ

Query:  QGK------VVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH
                 VV Y S+ L+S ++NYP  +LEL  ++ AL  +R+ L+G+   + TDH
Subjt:  QGK------VVAYTSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein8.5e-2642.75Show/hide
Query:  HLHQVLETLRANKLYAKFSKCEFWLKKVTFLS--HVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVW
        HL  VL+    ++ YA   KC F   ++ +L   H++S EGVS DPAK+EA+  WP P   +E+R FLGL GYYRRFV+++ +I  PLT L +K +   W
Subjt:  HLHQVLETLRANKLYAKFSKCEFWLKKVTFLS--HVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVW

Query:  SPACESSFQELKQKLVTVPVLIVPDGSGSFV
        +     +F+ LK  + T+PVL +PD    FV
Subjt:  SPACESSFQELKQKLVTVPVLIVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAGCCGAGCTGAAGGAGCTGAAGGTACAGCTGCAGGAATTGCTGGACAAGGGTTTCATCCGACCCAGTGTGTCACCTTGGGGAGCCCAAGTGTTGTTT
GTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTGACTACAAAGAGCTGAACAAGGTGACAGTTAAGAACCGCTACCTCTTGCCCAGGATTGATGACTTGTTC
GATCAGTTGCAGAGAGCCACTGTCTTTTCTAAGATCGACCTGCGATCAGGCTATCACCAGTTGAGGATCAGGGACAGTGATATTCCTAAGACGACCTTCCGTTCA
AGATACGGACATTACGAGTTCATTGTGATGTCTTTTGGGTTAACTAATGCTCCTACGGTATTCATGGACTTGAGGAACAGGGTGTTTAAGGATTTCTTAGACTCG
TTAGTCATAGTTTTCATTAAAGACATTTTGATCTACTCCAAGACTGAGGCTAAGCATGAGGAGCATTTGCACCAGGTTTTGGAGACTCTTCGAGCCAATAAGTTG
TATGCTAAGTTCTCCAAGTGTGAGTTCTGGCTGAAGAAGGTGACTTTCCTCAGCCACGTGGTTTCCAGTGAGGGAGTTTCTGTGGACCCAGCAAAGATCGAAGCG
GTTACCAATTGGCCTCGACCGTCTACGGTTAGCGAGATTCGTAGTTTCCTGGGTTTGGCAGGTTACTACAGAAGGTTCGTGGAAGACTTTTCACGTATAGCAAGT
CCCTTGACTCTGTTGACCAGGAAGGGGACTCATTTTGTTTGGAGCCCAGCTTGCGAGAGTAGCTTCCAAGAGCTTAAGCAGAAGCTTGTGACTGTACCAGTCCTG
ATAGTGCCAGACGGATCGGGGAGTTTTGTGATCTACAGTGATGCCTCCAAAAAGGGACTGGGTTGTGTTCTTATGCAGCAAGGTAAGGTAGTTGCTTATACCTCC
CGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACCCATGACCTAGAGTTGGCAGCAGTGGTTTTTGCATTGAAGATATGGAGGCACTACCTGTACGGTGAGAAG
ATACAGATTTTCACTGACCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAGCCGAGCTGAAGGAGCTGAAGGTACAGCTGCAGGAATTGCTGGACAAGGGTTTCATCCGACCCAGTGTGTCACCTTGGGGAGCCCAAGTGTTGTTT
GTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTGACTACAAAGAGCTGAACAAGGTGACAGTTAAGAACCGCTACCTCTTGCCCAGGATTGATGACTTGTTC
GATCAGTTGCAGAGAGCCACTGTCTTTTCTAAGATCGACCTGCGATCAGGCTATCACCAGTTGAGGATCAGGGACAGTGATATTCCTAAGACGACCTTCCGTTCA
AGATACGGACATTACGAGTTCATTGTGATGTCTTTTGGGTTAACTAATGCTCCTACGGTATTCATGGACTTGAGGAACAGGGTGTTTAAGGATTTCTTAGACTCG
TTAGTCATAGTTTTCATTAAAGACATTTTGATCTACTCCAAGACTGAGGCTAAGCATGAGGAGCATTTGCACCAGGTTTTGGAGACTCTTCGAGCCAATAAGTTG
TATGCTAAGTTCTCCAAGTGTGAGTTCTGGCTGAAGAAGGTGACTTTCCTCAGCCACGTGGTTTCCAGTGAGGGAGTTTCTGTGGACCCAGCAAAGATCGAAGCG
GTTACCAATTGGCCTCGACCGTCTACGGTTAGCGAGATTCGTAGTTTCCTGGGTTTGGCAGGTTACTACAGAAGGTTCGTGGAAGACTTTTCACGTATAGCAAGT
CCCTTGACTCTGTTGACCAGGAAGGGGACTCATTTTGTTTGGAGCCCAGCTTGCGAGAGTAGCTTCCAAGAGCTTAAGCAGAAGCTTGTGACTGTACCAGTCCTG
ATAGTGCCAGACGGATCGGGGAGTTTTGTGATCTACAGTGATGCCTCCAAAAAGGGACTGGGTTGTGTTCTTATGCAGCAAGGTAAGGTAGTTGCTTATACCTCC
CGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACCCATGACCTAGAGTTGGCAGCAGTGGTTTTTGCATTGAAGATATGGAGGCACTACCTGTACGGTGAGAAG
ATACAGATTTTCACTGACCATTAG
Protein sequenceShow/hide protein sequence
MASAELKELKVQLQELLDKGFIRPSVSPWGAQVLFVKKKDGSMRLCIDYKELNKVTVKNRYLLPRIDDLFDQLQRATVFSKIDLRSGYHQLRIRDSDIPKTTFRS
RYGHYEFIVMSFGLTNAPTVFMDLRNRVFKDFLDSLVIVFIKDILIYSKTEAKHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLSHVVSSEGVSVDPAKIEA
VTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTLLTRKGTHFVWSPACESSFQELKQKLVTVPVLIVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYTS
RQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDH