; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G011060 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G011060
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr20:10517255..10520175
RNA-Seq ExpressionCmoCh20G011060
SyntenyCmoCh20G011060
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO45752.1 pol protein [Cucumis melo subsp. melo]2.9e-11676.03Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        RELNK TVKN+YPL RI+DLFDQL+ ATVFSKI LRSGYHQ++ K+ED+PKTAFR+RYGHY+F+VMSFGLTNAPAVFM+LMNRVF+E LDTFVIVFIDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        L+YSKT+ EH+ HLR  L  L +NKLY KFSKCEFWL++V+FLGHV+S+ GVSVDPAK+EAVT W RP  V +VRSFLGLA YYRRF+++FS+IA PL Q
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        LTRK APFVWS+ C+ SFQ LK KLVT PVLT+P+G+GN++IYSDASKKGLGCVLMQQGKV+AYASR
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

KAA0048687.1 pol protein [Cucumis melo var. makuwa]3.7e-11676.03Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        RELNK TVKN+YPL RI+DLFDQL+ ATVFSKI LRSGYHQ++ K+ED+PKTAFR+RYGHYEF+VMSFGLTNAPAVFM+LMNRVF+E LDTFVIVFIDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        L+YSKT+ EH+ HLR  L  L +NKLY KFSKCEFWL++V+FLGHV+S+ GVSVDPAK+EAVT W RP  V +VRSFLGLA YYRRF+++FS+IA PL Q
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        LTRK APFVWS+ C+ SFQ LK KLVT PVLT+P+G+G+++IYSDASKKGLGCVLMQQGKV+AYASR
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

KAA0065873.1 pol protein [Cucumis melo var. makuwa]5.8e-11777.15Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        RELNK TVKN+YPL RI+DLFDQL+ ATVFSKI LRSGYHQ++ K+EDIPKTAFR+RYGHYEF+VMSFGLTNAPAVFM+LMNRVF+E LDTFVIVFIDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        L+YSKT+ EH+ HLR  L  L +NKLY KFSKCEFWL++V+FLGHVIS+ GVSVDPAK+EAVT W RP  V +VRSFLGLA YYRRF+++FS+IA PL Q
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        LTRK  PFVWS+ C+ SFQ LK KLVT PVLT+PNG+GN++IYSDASKKGLGCVLMQQGKV+AYASR
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

XP_022951914.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111454654 [Cucurbita moschata]9.5e-12078.65Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        RELNKRTVKNKYPL RIEDLFDQLR ATVFSKI LR GYHQIK KNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLD+FVIVFIDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        L+YSKTD EHQ HLR+ALTIL ENKLY KF++CEFWLR+V+FLGHV+S+ G+ VDP K+EAVTKW RP  V ++RSFLGLA YYRRF++DF++IA PL Q
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        LT+K  PFVW + C+ SFQELK +LV+ PVLT+P  +  Y+IYSDASKKGLGCVLMQ GKV+AYASR
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

XP_023520277.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111783585 [Cucurbita pepo subsp. pepo]5.5e-13691.01Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        RELNKRTVKNKYPL RIEDLFDQL+EATVFSKI LRSGYHQIK KNEDIPKTAFRTRYGHYEFVVMSFGLTNAP VFMELMN VFKECLDTFVIVFIDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        LVYSKTDHEHQLHLR+ALTIL ENKLY KFSKCEFWL+E  FLGHVISEKGVSVDP KVEAVTKWNRPI V +VRSFLGLA YYRRFIKDFSKIAAPL Q
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        LTRKT PFVWSEEC+TSFQELKDKLVT PVLTMP+GT NY+IYSDASKKGLGCVLMQQGKVIAYASR
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

TrEMBL top hitse value%identityAlignment
A0A5A7U330 Reverse transcriptase1.8e-11676.03Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        RELNK TVKN+YPL RI+DLFDQL+ ATVFSKI LRSGYHQ++ K+ED+PKTAFR+RYGHYEF+VMSFGLTNAPAVFM+LMNRVF+E LDTFVIVFIDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        L+YSKT+ EH+ HLR  L  L +NKLY KFSKCEFWL++V+FLGHV+S+ GVSVDPAK+EAVT W RP  V +VRSFLGLA YYRRF+++FS+IA PL Q
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        LTRK APFVWS+ C+ SFQ LK KLVT PVLT+P+G+G+++IYSDASKKGLGCVLMQQGKV+AYASR
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

A0A5A7VCI9 Pol protein2.8e-11777.15Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        RELNK TVKN+YPL RI+DLFDQL+ ATVFSKI LRSGYHQ++ K+EDIPKTAFR+RYGHYEF+VMSFGLTNAPAVFM+LMNRVF+E LDTFVIVFIDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        L+YSKT+ EH+ HLR  L  L +NKLY KFSKCEFWL++V+FLGHVIS+ GVSVDPAK+EAVT W RP  V +VRSFLGLA YYRRF+++FS+IA PL Q
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        LTRK  PFVWS+ C+ SFQ LK KLVT PVLT+PNG+GN++IYSDASKKGLGCVLMQQGKV+AYASR
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

A0A5D3BPI1 Reverse transcriptase2.4e-11676.03Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        RELNK TVKN+YPL RI+DLFDQL+ ATVFSKI LRSGYHQ++ K+ED+PKTAFR+RYGHYEF+VMSFGLTNAPAVFM+LMNRVF+E LDTFVIVFIDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        L+YSKT+ EH+ HLR  L  L +NKLY KFSKCEFWL++V+FLGHV+S+ GVSVDPAK+EAVT W RP  V +VRSFLGLA YYRRF+++FS+IA PL Q
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        LTRK APFVWS+ C+ SFQ LK KLVT PVLT+P+G+G+++IYSDASKKGLGCVLMQQGKV+AYASR
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

A0A6J1GK52 Reverse transcriptase4.6e-12078.65Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        RELNKRTVKNKYPL RIEDLFDQLR ATVFSKI LR GYHQIK KNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLD+FVIVFIDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        L+YSKTD EHQ HLR+ALTIL ENKLY KF++CEFWLR+V+FLGHV+S+ G+ VDP K+EAVTKW RP  V ++RSFLGLA YYRRF++DF++IA PL Q
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        LT+K  PFVW + C+ SFQELK +LV+ PVLT+P  +  Y+IYSDASKKGLGCVLMQ GKV+AYASR
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

Q84KB0 Pol protein1.4e-11676.03Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        RELNK TVKN+YPL RI+DLFDQL+ ATVFSKI LRSGYHQ++ K+ED+PKTAFR+RYGHY+F+VMSFGLTNAPAVFM+LMNRVF+E LDTFVIVFIDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        L+YSKT+ EH+ HLR  L  L +NKLY KFSKCEFWL++V+FLGHV+S+ GVSVDPAK+EAVT W RP  V +VRSFLGLA YYRRF+++FS+IA PL Q
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        LTRK APFVWS+ C+ SFQ LK KLVT PVLT+P+G+GN++IYSDASKKGLGCVLMQQGKV+AYASR
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.8e-4936.57Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        R+LN+ TV +++P+  ++++  +L     F+ I L  G+HQI+   E + KTAF T++GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        +V+S +  EH   L      L +  L ++  KCEF  +E  FLGHV++  G+  +P K+EA+ K+  P    ++++FLGL  YYR+FI +F+ IA P+ +
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPF-VWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
          +K       + E  ++F++LK  +   P+L +P+ T  + + +DAS   LG VL Q G  ++Y SR
Subjt:  LTRKTAPF-VWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

P10401 Retrovirus-related Pol polyprotein from transposon gypsy3.6e-4535.13Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        R+LN++T+ ++YP+  I  +   L +A  F+ + L+SGYHQI     D  KT+F    G YEF  + FGL NA ++F   ++ V +E +     V++DD+
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        +++S+ + +H  H+   L  L +  + V   K  F+   V +LG ++S+ G   DP KV+A+ ++  P  V +VRSFLGLASYYR FIKDF+ IA P+  
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTR-----------KTAPFVWSEECKTSFQELKDKLVTTPV-LTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
        + +           K  P  ++E  + +FQ L++ L +  V L  P+    + + +DAS  G+G VL Q+G+ I   SR
Subjt:  LTR-----------KTAPFVWSEECKTSFQELKDKLVTTPV-LTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

P20825 Retrovirus-related Pol polyprotein from transposon 2971.4e-4935.82Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        R+LN+ T+ ++YP+  ++++  +L +   F+ I L  G+HQI+   E I KTAF T+ GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        +++S +  EH   ++   T L +  L ++  KCEF  +E  FLGH+++  G+  +P KV+A+  +  P    ++R+FLGL  YYR+FI +++ IA P+  
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  -LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR
         L ++T       E   +F++LK  ++  P+L +P+    +++ +DAS   LG VL Q G  I++ SR
Subjt:  -LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQQGKVIAYASR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.9e-4636.88Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        + LN  T+ + YP+  I      L  A  F+ + L SG+HQI  K  DIPKTAF T  G YEF+ + FGL NAPA+F  +++ + +E +     V+IDDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        +V+S+    H  +LR  L  L +  L V   K  F   +V FLG++++  G+  DP KV A+++   P  V +++ FLG+ SYYR+FI+D++K+A PL  
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTR-----------KTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQ----QGKVIAYASR
        LTR              P    E    SF +LK  L ++ +L  P  T  + + +DAS   +G VL Q    + + IAY SR
Subjt:  LTR-----------KTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVLMQ----QGKVIAYASR

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.7e-4540.39Show/hide
Query:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI
        R LNK T+ + +PL RI++L  ++  A +F+ + L SGYHQI  + +D  KTAF T  G YE+ VM FGL NAP+ F   M   F++    FV V++DDI
Subjt:  RELNKRTVKNKYPLLRIEDLFDQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDI

Query:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ
        L++S++  EH  HL   L  L    L VK  KC+F   E  FLG+ I  + ++    K  A+  +  P  V Q + FLG+ +YYRRFI + SKIA P+  
Subjt:  LVYSKTDHEHQLHLRRALTILGENKLYVKFSKCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQ

Query:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVL
             +   W+E+   +  +LKD L  +PVL   N   NY + +DASK G+G VL
Subjt:  LTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYIIYSDASKKGLGCVL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein6.7e-2341.6Show/hide
Query:  HLRRALTILGENKLYVKFSKCEFWLREVAFLG--HVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQLTRKTAPFVW
        HL   L I  +++ Y    KC F   ++A+LG  H+IS +GVS DPAK+EA+  W  P    ++R FLGL  YYRRF+K++ KI  PL +L +K +   W
Subjt:  HLRRALTILGENKLYVKFSKCEFWLREVAFLG--HVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQLTRKTAPFVW

Query:  SEECKTSFQELKDKLVTTPVLTMPN
        +E    +F+ LK  + T PVL +P+
Subjt:  SEECKTSFQELKDKLVTTPVLTMPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTACTGCTCAGAGTAGACAGAAAAGCTATGCGGACTTAAGACGAAGGAATCTAGAGTTTGAAGTGGGCGACAAGGTATTCCTGAAAGTGGCACCAATAAAAAGTGT
TCTAAGGTTTGGACACAAGCGTAAATTGAGCTCACACTTTATAGGACCTTTTGAGATTTTGGAGCGGATTGGACCCGTCGCATATAGACGTATTCCTAGAGGACTTGCCG
AGAATTCCTTCAACACGAGAGGTGGATTTTGCAATAGAGCTGGAACCAGAGAGCTAAATAAGAGAACGGTGAAGAACAAGTATCCCCTTCTACGTATCGAGGACTTATTT
GATCAGCTAAGGGAGGCAACAGTGTTTTCGAAGATCCATCTTCGATCTGGTTACCATCAGATCAAGACTAAAAATGAGGACATACCAAAGACAGCTTTCAGGACGAGGTA
TGGTCATTATGAGTTTGTGGTTATGTCATTTGGACTTACTAATGCCCCTGCAGTGTTTATGGAACTGATGAATCGCGTGTTCAAGGAATGTTTGGATACTTTTGTCATAG
TGTTCATAGACGATATCTTGGTATACTCAAAGACGGATCACGAACATCAGTTACATCTAAGAAGGGCTCTGACAATACTAGGAGAAAACAAACTATATGTCAAGTTCTCG
AAGTGTGAATTCTGGTTACGAGAAGTCGCTTTCCTAGGGCACGTGATATCTGAGAAGGGTGTGTCAGTAGATCCTGCCAAGGTGGAAGCTGTCACTAAATGGAATCGTCC
TATCATTGTTATTCAGGTACGAAGTTTTCTAGGCTTAGCAAGTTATTACCGACGCTTCATAAAGGATTTTTCCAAGATAGCTGCACCTCTTATGCAATTAACCCGAAAGA
CTGCACCATTTGTATGGTCTGAGGAGTGCAAAACAAGTTTCCAAGAGCTGAAGGATAAGCTGGTGACCACGCCAGTGCTTACAATGCCCAACGGTACAGGAAACTATATT
ATTTACAGTGATGCTTCTAAGAAGGGTTTGGGATGTGTGTTGATGCAACAAGGGAAGGTTATTGCTTATGCATCTAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTACTGCTCAGAGTAGACAGAAAAGCTATGCGGACTTAAGACGAAGGAATCTAGAGTTTGAAGTGGGCGACAAGGTATTCCTGAAAGTGGCACCAATAAAAAGTGT
TCTAAGGTTTGGACACAAGCGTAAATTGAGCTCACACTTTATAGGACCTTTTGAGATTTTGGAGCGGATTGGACCCGTCGCATATAGACGTATTCCTAGAGGACTTGCCG
AGAATTCCTTCAACACGAGAGGTGGATTTTGCAATAGAGCTGGAACCAGAGAGCTAAATAAGAGAACGGTGAAGAACAAGTATCCCCTTCTACGTATCGAGGACTTATTT
GATCAGCTAAGGGAGGCAACAGTGTTTTCGAAGATCCATCTTCGATCTGGTTACCATCAGATCAAGACTAAAAATGAGGACATACCAAAGACAGCTTTCAGGACGAGGTA
TGGTCATTATGAGTTTGTGGTTATGTCATTTGGACTTACTAATGCCCCTGCAGTGTTTATGGAACTGATGAATCGCGTGTTCAAGGAATGTTTGGATACTTTTGTCATAG
TGTTCATAGACGATATCTTGGTATACTCAAAGACGGATCACGAACATCAGTTACATCTAAGAAGGGCTCTGACAATACTAGGAGAAAACAAACTATATGTCAAGTTCTCG
AAGTGTGAATTCTGGTTACGAGAAGTCGCTTTCCTAGGGCACGTGATATCTGAGAAGGGTGTGTCAGTAGATCCTGCCAAGGTGGAAGCTGTCACTAAATGGAATCGTCC
TATCATTGTTATTCAGGTACGAAGTTTTCTAGGCTTAGCAAGTTATTACCGACGCTTCATAAAGGATTTTTCCAAGATAGCTGCACCTCTTATGCAATTAACCCGAAAGA
CTGCACCATTTGTATGGTCTGAGGAGTGCAAAACAAGTTTCCAAGAGCTGAAGGATAAGCTGGTGACCACGCCAGTGCTTACAATGCCCAACGGTACAGGAAACTATATT
ATTTACAGTGATGCTTCTAAGAAGGGTTTGGGATGTGTGTTGATGCAACAAGGGAAGGTTATTGCTTATGCATCTAGATAG
Protein sequenceShow/hide protein sequence
MRTAQSRQKSYADLRRRNLEFEVGDKVFLKVAPIKSVLRFGHKRKLSSHFIGPFEILERIGPVAYRRIPRGLAENSFNTRGGFCNRAGTRELNKRTVKNKYPLLRIEDLF
DQLREATVFSKIHLRSGYHQIKTKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDTFVIVFIDDILVYSKTDHEHQLHLRRALTILGENKLYVKFS
KCEFWLREVAFLGHVISEKGVSVDPAKVEAVTKWNRPIIVIQVRSFLGLASYYRRFIKDFSKIAAPLMQLTRKTAPFVWSEECKTSFQELKDKLVTTPVLTMPNGTGNYI
IYSDASKKGLGCVLMQQGKVIAYASR