; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh12G007400 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh12G007400
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr12:5818083..5819210
RNA-Seq ExpressionCmoCh12G007400
SyntenyCmoCh12G007400
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK02449.1 F15O4.13 [Cucumis melo var. makuwa]4.4e-18184.72Show/hide
Query:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV
        MAAYRTNPT+TKEIQRQVEELMDKGY+RESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDEL+GANLFSKIDLKSGYHQIRM+V
Subjt:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV

Query:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL
        GDEWK AFKTKFGLYEWLV+PFGLTNAPSTFMRLMNHVL++YIGKFVVVYFDDILVYSK LNDHILHVK IL  LREEKLYA  KKCSFCL+Q++FLGF+
Subjt:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL

Query:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA
        VGK GV+VDEEK+    + P                  RRFIKDFSSIASPL ELVKKHVKFEWKEKQEN+FNELK+KL  APCLALPNFDKSFEIECDA
Subjt:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA

Query:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        SGIGIGAVLMQEK+PIMFFSEKLNGAQLNY TYDKEL+ALVRAL+VWQHYLWPKEFVIHTDHESLKHLK   K
Subjt:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

TYK04936.1 F15O4.13 [Cucumis melo var. makuwa]4.4e-18184.72Show/hide
Query:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV
        MAAYRTNPT+TKEIQRQVEELMDKGY+RESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDEL+GANLFSKIDLKSGYHQIRM+V
Subjt:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV

Query:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL
        GDEWK AFKTKFGLYEWLV+PFGLTNAPSTFMRLMNHVL++YIGKFVVVYFDDILVYSK LNDHILHVK IL  LREEKLYA  KKCSFCL+Q++FLGF+
Subjt:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL

Query:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA
        VGK GV+VDEEK+    + P                  RRFIKDFSSIASPL ELVKKHVKFEWKEKQEN+FNELK+KL  APCLALPNFDKSFEIECDA
Subjt:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA

Query:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        SGIGIGAVLMQEK+PIMFFSEKLNGAQLNY TYDKEL+ALVRAL+VWQHYLWPKEFVIHTDHESLKHLK   K
Subjt:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

TYK11702.1 F15O4.13 [Cucumis melo var. makuwa]4.4e-18184.72Show/hide
Query:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV
        MAAYRTNPT+TKEIQRQVEELMDKGY+RESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDEL+GANLFSKIDLKSGYHQIRM+V
Subjt:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV

Query:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL
        GDEWK AFKTKFGLYEWLV+PFGLTNAPSTFMRLMNHVL++YIGKFVVVYFDDILVYSK LNDHILHVK IL  LREEKLYA  KKCSFCL+Q++FLGF+
Subjt:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL

Query:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA
        VGK GV+VDEEK+    + P                  RRFIKDFSSIASPL ELVKKHVKFEWKEKQEN+FNELK+KL  APCLALPNFDKSFEIECDA
Subjt:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA

Query:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        SGIGIGAVLMQEK+PIMFFSEKLNGAQLNY TYDKEL+ALVRAL+VWQHYLWPKEFVIHTDHESLKHLK   K
Subjt:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

TYK22420.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]4.4e-18184.72Show/hide
Query:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV
        MAAYRTNPT+TKEIQRQVEELMDKGY+RESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDEL+GANLFSKIDLKSGYHQIRM+V
Subjt:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV

Query:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL
        GDEWK AFKTKFGLYEWLV+PFGLTNAPSTFMRLMNHVL++YIGKFVVVYFDDILVYSK LNDHILHVK IL  LREEKLYA  KKCSFCL+Q++FLGF+
Subjt:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL

Query:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA
        VGK GV+VDEEK+    + P                  RRFIKDFSSIASPL ELVKKHVKFEWKEKQEN+FNELK+KL  APCLALPNFDKSFEIECDA
Subjt:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA

Query:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        SGIGIGAVLMQEK+PIMFFSEKLNGAQLNY TYDKEL+ALVRAL+VWQHYLWPKEFVIHTDHESLKHLK   K
Subjt:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

TYK26105.1 F15O4.13 [Cucumis melo var. makuwa]4.4e-18184.72Show/hide
Query:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV
        MAAYRTNPT+TKEIQRQVEELMDKGY+RESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDEL+GANLFSKIDLKSGYHQIRM+V
Subjt:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV

Query:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL
        GDEWK AFKTKFGLYEWLV+PFGLTNAPSTFMRLMNHVL++YIGKFVVVYFDDILVYSK LNDHILHVK IL  LREEKLYA  KKCSFCL+Q++FLGF+
Subjt:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL

Query:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA
        VGK GV+VDEEK+    + P                  RRFIKDFSSIASPL ELVKKHVKFEWKEKQEN+FNELK+KL  APCLALPNFDKSFEIECDA
Subjt:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA

Query:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        SGIGIGAVLMQEK+PIMFFSEKLNGAQLNY TYDKEL+ALVRAL+VWQHYLWPKEFVIHTDHESLKHLK   K
Subjt:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

TrEMBL top hitse value%identityAlignment
A0A5D3BWE8 F15O4.132.1e-18184.72Show/hide
Query:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV
        MAAYRTNPT+TKEIQRQVEELMDKGY+RESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDEL+GANLFSKIDLKSGYHQIRM+V
Subjt:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV

Query:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL
        GDEWK AFKTKFGLYEWLV+PFGLTNAPSTFMRLMNHVL++YIGKFVVVYFDDILVYSK LNDHILHVK IL  LREEKLYA  KKCSFCL+Q++FLGF+
Subjt:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL

Query:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA
        VGK GV+VDEEK+    + P                  RRFIKDFSSIASPL ELVKKHVKFEWKEKQEN+FNELK+KL  APCLALPNFDKSFEIECDA
Subjt:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA

Query:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        SGIGIGAVLMQEK+PIMFFSEKLNGAQLNY TYDKEL+ALVRAL+VWQHYLWPKEFVIHTDHESLKHLK   K
Subjt:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

A0A5D3C3D3 F15O4.132.1e-18184.72Show/hide
Query:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV
        MAAYRTNPT+TKEIQRQVEELMDKGY+RESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDEL+GANLFSKIDLKSGYHQIRM+V
Subjt:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV

Query:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL
        GDEWK AFKTKFGLYEWLV+PFGLTNAPSTFMRLMNHVL++YIGKFVVVYFDDILVYSK LNDHILHVK IL  LREEKLYA  KKCSFCL+Q++FLGF+
Subjt:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL

Query:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA
        VGK GV+VDEEK+    + P                  RRFIKDFSSIASPL ELVKKHVKFEWKEKQEN+FNELK+KL  APCLALPNFDKSFEIECDA
Subjt:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA

Query:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        SGIGIGAVLMQEK+PIMFFSEKLNGAQLNY TYDKEL+ALVRAL+VWQHYLWPKEFVIHTDHESLKHLK   K
Subjt:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

A0A5D3CK70 F15O4.132.1e-18184.72Show/hide
Query:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV
        MAAYRTNPT+TKEIQRQVEELMDKGY+RESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDEL+GANLFSKIDLKSGYHQIRM+V
Subjt:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV

Query:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL
        GDEWK AFKTKFGLYEWLV+PFGLTNAPSTFMRLMNHVL++YIGKFVVVYFDDILVYSK LNDHILHVK IL  LREEKLYA  KKCSFCL+Q++FLGF+
Subjt:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL

Query:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA
        VGK GV+VDEEK+    + P                  RRFIKDFSSIASPL ELVKKHVKFEWKEKQEN+FNELK+KL  APCLALPNFDKSFEIECDA
Subjt:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA

Query:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        SGIGIGAVLMQEK+PIMFFSEKLNGAQLNY TYDKEL+ALVRAL+VWQHYLWPKEFVIHTDHESLKHLK   K
Subjt:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

A0A5D3DGA7 Transposon Ty3-I Gag-Pol polyprotein2.1e-18184.72Show/hide
Query:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV
        MAAYRTNPT+TKEIQRQVEELMDKGY+RESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDEL+GANLFSKIDLKSGYHQIRM+V
Subjt:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV

Query:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL
        GDEWK AFKTKFGLYEWLV+PFGLTNAPSTFMRLMNHVL++YIGKFVVVYFDDILVYSK LNDHILHVK IL  LREEKLYA  KKCSFCL+Q++FLGF+
Subjt:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL

Query:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA
        VGK GV+VDEEK+    + P                  RRFIKDFSSIASPL ELVKKHVKFEWKEKQEN+FNELK+KL  APCLALPNFDKSFEIECDA
Subjt:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA

Query:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        SGIGIGAVLMQEK+PIMFFSEKLNGAQLNY TYDKEL+ALVRAL+VWQHYLWPKEFVIHTDHESLKHLK   K
Subjt:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

A0A5D3DRJ1 F15O4.132.1e-18184.72Show/hide
Query:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV
        MAAYRTNPT+TKEIQRQVEELMDKGY+RESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDEL+GANLFSKIDLKSGYHQIRM+V
Subjt:  MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNV

Query:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL
        GDEWK AFKTKFGLYEWLV+PFGLTNAPSTFMRLMNHVL++YIGKFVVVYFDDILVYSK LNDHILHVK IL  LREEKLYA  KKCSFCL+Q++FLGF+
Subjt:  GDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFL

Query:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA
        VGK GV+VDEEK+    + P                  RRFIKDFSSIASPL ELVKKHVKFEWKEKQEN+FNELK+KL  APCLALPNFDKSFEIECDA
Subjt:  VGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDA

Query:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        SGIGIGAVLMQEK+PIMFFSEKLNGAQLNY TYDKEL+ALVRAL+VWQHYLWPKEFVIHTDHESLKHLK   K
Subjt:  SGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.9e-6636.49Show/hide
Query:  KEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGT-----WRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNVGDEWKM
        +E++ Q+++++++G +R S SP + P+ +VPKK        +R+ +D R +N+ITV  RHPIP +D++L +L   N F+ IDL  G+HQI M+     K 
Subjt:  KEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGT-----WRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNVGDEWKM

Query:  AFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFLVGKKGV
        AF TK G YE+L +PFGL NAP+TF R MN +LR  + K  +VY DDI+V+S SL++H+  +  +   L +  L     KC F   +  FLG ++   G+
Subjt:  AFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFLVGKKGV

Query:  QVDEEKLLDNGQHP-QMQP-------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQ-ENSFNELKDKLTNAPCLALPNFDKSFEIECDASGIGI
        + + EK+    ++P   +P             R+FI +F+ IA P+ + +KK++K +    + +++F +LK  ++  P L +P+F K F +  DAS + +
Subjt:  QVDEEKLLDNGQHP-QMQP-------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQ-ENSFNELKDKLTNAPCLALPNFDKSFEIECDASGIGI

Query:  GAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLK---HLKKPN
        GAVL Q+  P+ + S  LN  ++NY T +KEL A+V A + ++HYL  + F I +DH+ L     +K PN
Subjt:  GAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLK---HLKKPN

P0CT41 Transposon Tf2-12 polyprotein7.8e-6436.04Show/hide
Query:  YRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNVGDE
        Y   P K + +  ++ + +  G +RES +  + PV+ VPKK+GT RM VD + +NK      +P+P ++ +L ++ G+ +F+K+DLKS YH IR+  GDE
Subjt:  YRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNVGDE

Query:  WKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFLVGK
         K+AF+   G++E+LV+P+G++ AP+ F   +N +L +     VV Y DDIL++SKS ++H+ HVK +L  L+   L     KC F   QV F+G+ + +
Subjt:  WKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFLVGK

Query:  KG---VQVDEEKLLDNGQHPQMQP-----------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDASGI
        KG    Q + +K+L   Q    +            R+FI   S +  PLN L+KK V+++W   Q  +   +K  L + P L   +F K   +E DAS +
Subjt:  KG---VQVDEEKLLDNGQHPQMQP-----------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDASGI

Query:  GIGAVLMQEK-----RPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWP--KEFVIHTDHESL
         +GAVL Q+       P+ ++S K++ AQLNY   DKE+ A++++L+ W+HYL    + F I TDH +L
Subjt:  GIGAVLMQEK-----RPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWP--KEFVIHTDHESL

P20825 Retrovirus-related Pol polyprotein from transposon 2978.3e-6636.6Show/hide
Query:  YRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKD-----GTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRM
        Y    T   E++ QV+E++++G +RES SP + P  +VPKK        +R+ +D R +N+IT+  R+PIP +D++L +L     F+ IDL  G+HQI M
Subjt:  YRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKD-----GTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRM

Query:  NVGDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLG
        +     K AF TK G YE+L +PFGL NAP+TF R MN++LR  + K  +VY DDI+++S SL +H+  ++ +   L +  L     KC F   + NFLG
Subjt:  NVGDEWKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLG

Query:  FLVGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQE-NSFNELKDKLTNAPCLALPNFDKSFEIE
         +V   G++ +  K+     +P                  R+FI +++ IA P+   +KK  K + ++ +   +F +LK  +   P L LP+F+K F + 
Subjt:  FLVGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQE-NSFNELKDKLTNAPCLALPNFDKSFEIE

Query:  CDASGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLK---HLKKP
         DAS + +GAVL Q   PI F S  LN  +LNY   +KEL A+V A + ++HYL  ++F+I +DH+ L+   +LK+P
Subjt:  CDASGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLK---HLKKP

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.6e-6738.83Show/hide
Query:  YRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNVGDE
        Y       +EI + V++L+D  ++  S SPCS PV+LVPKKDGT+R+CVD R +NK T+    P+PR+D++L  +  A +F+ +DL SGYHQI M   D 
Subjt:  YRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNVGDE

Query:  WKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFLVGK
        +K AF T  G YE+ V+PFGL NAPSTF R M    RD   +FV VY DDIL++S+S  +H  H+  +L  L+ E L    KKC F  ++  FLG+ +G 
Subjt:  WKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFLVGK

Query:  KGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDASGI
        + +   + K       P  +               RRFI + S IA P+   +    K +W EKQ+ +  +LK  L N+P L   N   ++ +  DAS  
Subjt:  KGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDASGI

Query:  GIGAVL--MQEKRPIM----FFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        GIGAVL  +  K  ++    +FS+ L  AQ NYP  + EL  +++AL  +++ L  K F + TDH SL  L+  N+
Subjt:  GIGAVL--MQEKRPIM----FFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.3e-6839.1Show/hide
Query:  YRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNVGDE
        Y       +EI + V++L+D  ++  S SPCS PV+LVPKKDGT+R+CVD R +NK T+    P+PR+D++L  +  A +F+ +DL SGYHQI M   D 
Subjt:  YRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNVGDE

Query:  WKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFLVGK
        +K AF T  G YE+ V+PFGL NAPSTF R M    RD   +FV VY DDIL++S+S  +H  H+  +L  L+ E L    KKC F  ++  FLG+ +G 
Subjt:  WKMAFKTKFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFLVGK

Query:  KGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDASGI
        + +   + K       P  +               RRFI + S IA P+   +    K +W EKQ+ + ++LKD L N+P L   N   ++ +  DAS  
Subjt:  KGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDASGI

Query:  GIGAVL--MQEKRPIM----FFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK
        GIGAVL  +  K  ++    +FS+ L  AQ NYP  + EL  +++AL  +++ L  K F + TDH SL  L+  N+
Subjt:  GIGAVL--MQEKRPIM----FFSEKLNGAQLNYPTYDKELYALVRALQVWQHYLWPKEFVIHTDHESLKHLKKPNK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.3e-0931.54Show/hide
Query:  HVKKILFILREEKLYAYCKKCSFCLDQVNFLG--FLVGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEW
        H+  +L I  + + YA  KKC+F   Q+ +LG   ++  +GV  D  KL      P+ +               RRF+K++  I  PL EL+KK+   +W
Subjt:  HVKKILFILREEKLYAYCKKCSFCLDQVNFLG--FLVGKKGVQVDEEKLLDNGQHPQMQP--------------RRFIKDFSSIASPLNELVKKHVKFEW

Query:  KEKQENSFNELKDKLTNAPCLALPNFDKSF
         E    +F  LK  +T  P LALP+    F
Subjt:  KEKQENSFNELKDKLTNAPCLALPNFDKSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCTATAGGACCAATCCCACCAAGACTAAGGAGATTCAGAGACAAGTGGAAGAACTCATGGATAAAGGATATGTTAGAGAAAGCATGAGTCCTTGTTCAGTTCC
TGTGATTTTGGTACCCAAGAAAGATGGTACGTGGAGGATGTGTGTTGATTGTCGAGCCATCAACAAGATCACGGTAAAGTATCGACACCCTATTCCTAGATTGGATGACA
TGTTGGATGAATTATATGGTGCTAATCTGTTTTCAAAAATAGATCTTAAAAGTGGTTATCATCAAATTCGAATGAATGTGGGAGATGAGTGGAAAATGGCTTTCAAAACA
AAGTTTGGGCTTTATGAGTGGCTCGTTATACCATTTGGCCTTACAAATGCACCAAGTACTTTTATGAGGCTAATGAATCATGTTTTGAGAGACTATATTGGAAAATTTGT
TGTTGTATACTTTGATGATATCCTTGTTTACTCAAAAAGCTTGAATGATCATATTTTGCACGTTAAGAAAATTTTGTTTATACTTAGAGAAGAGAAGCTTTATGCCTACT
GCAAAAAGTGCAGTTTTTGTCTAGACCAAGTCAACTTCCTAGGGTTCTTAGTTGGAAAGAAGGGAGTACAAGTTGATGAAGAAAAGCTATTAGATAATGGCCAACACCCA
CAAATGCAACCGAGGAGATTCATTAAGGATTTCAGTAGTATAGCATCACCATTGAATGAACTTGTTAAAAAGCATGTGAAATTTGAATGGAAGGAAAAACAAGAGAATTC
ATTCAATGAACTGAAAGATAAATTAACCAATGCACCTTGTCTTGCTTTACCTAATTTTGATAAATCTTTTGAAATTGAATGTGATGCAAGTGGGATAGGCATAGGGGCTG
TTTTAATGCAGGAAAAAAGGCCAATCATGTTCTTTAGTGAAAAGCTTAATGGAGCACAACTCAACTATCCGACTTATGACAAAGAGTTATATGCACTTGTGAGGGCTTTG
CAAGTTTGGCAACATTATTTGTGGCCAAAAGAGTTTGTTATTCATACGGACCATGAAAGTTTGAAGCACCTCAAAAAGCCAAACAAAGCTAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCCTATAGGACCAATCCCACCAAGACTAAGGAGATTCAGAGACAAGTGGAAGAACTCATGGATAAAGGATATGTTAGAGAAAGCATGAGTCCTTGTTCAGTTCC
TGTGATTTTGGTACCCAAGAAAGATGGTACGTGGAGGATGTGTGTTGATTGTCGAGCCATCAACAAGATCACGGTAAAGTATCGACACCCTATTCCTAGATTGGATGACA
TGTTGGATGAATTATATGGTGCTAATCTGTTTTCAAAAATAGATCTTAAAAGTGGTTATCATCAAATTCGAATGAATGTGGGAGATGAGTGGAAAATGGCTTTCAAAACA
AAGTTTGGGCTTTATGAGTGGCTCGTTATACCATTTGGCCTTACAAATGCACCAAGTACTTTTATGAGGCTAATGAATCATGTTTTGAGAGACTATATTGGAAAATTTGT
TGTTGTATACTTTGATGATATCCTTGTTTACTCAAAAAGCTTGAATGATCATATTTTGCACGTTAAGAAAATTTTGTTTATACTTAGAGAAGAGAAGCTTTATGCCTACT
GCAAAAAGTGCAGTTTTTGTCTAGACCAAGTCAACTTCCTAGGGTTCTTAGTTGGAAAGAAGGGAGTACAAGTTGATGAAGAAAAGCTATTAGATAATGGCCAACACCCA
CAAATGCAACCGAGGAGATTCATTAAGGATTTCAGTAGTATAGCATCACCATTGAATGAACTTGTTAAAAAGCATGTGAAATTTGAATGGAAGGAAAAACAAGAGAATTC
ATTCAATGAACTGAAAGATAAATTAACCAATGCACCTTGTCTTGCTTTACCTAATTTTGATAAATCTTTTGAAATTGAATGTGATGCAAGTGGGATAGGCATAGGGGCTG
TTTTAATGCAGGAAAAAAGGCCAATCATGTTCTTTAGTGAAAAGCTTAATGGAGCACAACTCAACTATCCGACTTATGACAAAGAGTTATATGCACTTGTGAGGGCTTTG
CAAGTTTGGCAACATTATTTGTGGCCAAAAGAGTTTGTTATTCATACGGACCATGAAAGTTTGAAGCACCTCAAAAAGCCAAACAAAGCTAAATAG
Protein sequenceShow/hide protein sequence
MAAYRTNPTKTKEIQRQVEELMDKGYVRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELYGANLFSKIDLKSGYHQIRMNVGDEWKMAFKT
KFGLYEWLVIPFGLTNAPSTFMRLMNHVLRDYIGKFVVVYFDDILVYSKSLNDHILHVKKILFILREEKLYAYCKKCSFCLDQVNFLGFLVGKKGVQVDEEKLLDNGQHP
QMQPRRFIKDFSSIASPLNELVKKHVKFEWKEKQENSFNELKDKLTNAPCLALPNFDKSFEIECDASGIGIGAVLMQEKRPIMFFSEKLNGAQLNYPTYDKELYALVRAL
QVWQHYLWPKEFVIHTDHESLKHLKKPNKAK