CuGenDBv2

Cmc08g0228211 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0228211
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr08:21955419..21957415
RNA-Seq Expression: Cmc08g0228211
Synteny: Cmc08g0228211
Gene Ontology terms:
  GO:0006259 - DNA metabolic process (biological process)
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR041588 - Integrase zinc-binding domain
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily
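
The genome location string in this record can be split into its parts programmatically. The following is a minimal Python sketch and not part of the database record. It assumes the `seqid:start..end` layout seen in this entry and 1-based inclusive coordinates (an assumption, since the page does not state the convention). The function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Hypothetical helper; the layout is inferred from this record only.
    """
    m = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m["seqid"], int(m["start"]), int(m["end"])

seqid, start, end = parse_location("CMiso1.1chr08:21955419..21957415")

# Span length, assuming 1-based inclusive coordinates (not stated on the page).
span = end - start + 1
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption the gene spans 1997 bp on `CMiso1.1chr08`. If the coordinates were half-open, the span would be one base shorter.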


Homology
GenBank top hits | e value | % identity
KAA0036204.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | 2.4e-211 | 66.72
Query:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF
        E EFAIELEP TVPIS+APYRMAP ELKELKVQLQELLDK FI+PSVSPWGAPVLFVKKKDGSM LC DY+ELNKVTV+N+Y LPRID LF QL GA +F
Subjt:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF

Query:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF
        SKIDLR GYHQ                                          VFREFL TFVIVFI+DILIYSKTE EHE+HL +VL+TLR NKLYAKF
Subjt:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF

Query:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVT----------------------------SSSDLP-QSLIRKGAPFVWSKACEDSFQNLKQKLVTSSV
        SKC+FWLK +SFLGHVV K GVSVDPAKIEAVT                            S    P   L RKGAPFVWSKACEDSFQNLKQKLVT+ V
Subjt:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVT----------------------------SSSDLP-QSLIRKGAPFVWSKACEDSFQNLKQKLVTSSV

Query:  LTVPDGSGSFVIYSDASKKGLGCVLMQQG----------------------------------------------------------------KLAQLSV
        LTVPDGSGSFVIYSDASKKGLGCVLMQQG                                                                KLAQL+V
Subjt:  LTVPDGSGSFVIYSDASKKGLGCVLMQQG----------------------------------------------------------------KLAQLSV

Query:  QPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGK
        QPTLRQ+II  Q+NDPYLVEKR LAEAGQAVEF +SSDGGL F+RRLCVP++ AVKTELLTEAHSSP SMH GSTKMYQDLKRVYWWR+MKREVAEFV +
Subjt:  QPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGK

Query:  CLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACF
        CLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSM+FITGLPRTLRGFTVIWVVVDRLTKSAHF+LGKSTYTA KWAQLY++EIVRLHGVPVSIVSDRDA F
Subjt:  CLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACF

Query:  TSKF
        TSKF
Subjt:  TSKF

KAA0042464.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | 1.2e-210 | 68.28
Query:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF
        E EFAIELEP TV +S+APYRMAP ELKELKVQLQELLDK FI+PSVSPWGAPVLF KKK+GSMRLC DY+ELNKVTV+N+Y LPRID LF QL GA +F
Subjt:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF

Query:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF
         KIDLR GYHQ                                          VFREFL TFVIVFIDDILIYSKTE EHEEHL +VL+TLR NKLYAKF
Subjt:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF

Query:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVT--SSSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVLMQ
        SKC+FWLK++SFLGHVV K GVSVDPAKIEAVT  +       L RKGAPFVWSKACEDSFQNLKQKLVT+ VL+VPDGSGSFVIYSDASKKGLGCVLMQ
Subjt:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVT--SSSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVLMQ

Query:  QG----------------------------------------------------------------KLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEA
        QG                                                                KLAQL+VQPTLRQ+II  Q+NDPYLVEKR LAEA
Subjt:  QG----------------------------------------------------------------KLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEA

Query:  GQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWK
        GQAVEF +SSDGGL F+RRLCVP++ AVKTELL+EAHSSP SMH GST MYQDLKRVYWWR+MKREVAEFV +CLVCQQVKAPRQKPAGLLQPLS+ EWK
Subjt:  GQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWK

Query:  WENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF
        WENVSM+FITGLPRTL+GFTVIWVVVDRLTKSAHF+ GKSTY ASKWAQLY++EIVRLHGVPVSIVSDRDA FTSKF
Subjt:  WENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF

KAA0050673.1 pol protein [Cucumis melo var. makuwa] | 8.7e-209 | 66.78
Query:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF
        E EFAIELEP TVPIS+APYRMAP ELKELKVQLQ+LLDK FI+PSVSPWGA VLFVKKKDGSMRLC DY+ELNKVTV+N Y LPRID LF QL GA +F
Subjt:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF

Query:  SKIDLRLGYHQ----------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSKCKFWLKKISFLGHVVPKVGVSVDP
        SKIDLR GYHQ                VFREFL TFVIVFIDDILIYSKTE EHEEHL +VL+TLR NKLYAKFSKC+FWLK++SFLGHVV K GVSVDP
Subjt:  SKIDLRLGYHQ----------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSKCKFWLKKISFLGHVVPKVGVSVDP

Query:  AKIEAVTS------------------------------SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVL
        AKIEAVT                               ++ L Q L RKGAPFVWSKACEDSFQNLKQKLVT+ VLTVPDGSGSFVIYSDASKKGLGCVL
Subjt:  AKIEAVTS------------------------------SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVL

Query:  MQQGK------------------------------------------------------------------------------------LAQLSVQPTLR
        MQQGK                                                                                    LAQL+VQPTLR
Subjt:  MQQGK------------------------------------------------------------------------------------LAQLSVQPTLR

Query:  QKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQ
        Q+II  Q+NDPYLVEKR LAEAGQAV F ISSDGGL F+RRLCVP++ AVKTELL+EAHS P SMH GSTKMYQDLKRVYWWR+MKREVAEFV +CLVCQ
Subjt:  QKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQ

Query:  QVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF
        QVKAPRQKPAGLLQPLS+ EWKWENVSM+FITGLPRTLRGFTVIWVVVDR TKSAHF+ GKSTYT SKWAQLY++EIVRLHGVPVSIVSDRDA FTSKF
Subjt:  QVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF

KAA0051051.1 reverse transcriptase [Cucumis melo var. makuwa] | 5.1e-209 | 63.21
Query:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF
        E EFAIELEP TVPIS+APYRMAP ELKELKVQLQELLDK FI+PSVSPWGAPVLFVKKKDGSMRLC DY+ELNKVTV+N+Y LPRID LF QL GA +F
Subjt:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF

Query:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF
        SKIDLR GYHQ                                          VFREFL TFVIVFIDDILIYSKTE EHEEHL +VL+TLR NKLYAKF
Subjt:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF

Query:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTS------------------------------SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSS
        SKC+FWLK++SFLGHVV K GVSVDPAKIEAVT                               ++ L Q L RKGAPFVWSKACEDSFQNLKQKLVT+ 
Subjt:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTS------------------------------SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSS

Query:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQG----------------------------------------------------------------------
        VLTVPDGSGSFVIYSDASKKGLGCVLMQQG                                                                      
Subjt:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQG----------------------------------------------------------------------

Query:  -------------------------KLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPL
                                 KLAQL+VQPTL Q+II  Q+NDPYLV+KR LAEAGQAVEF +SSDGGL F+RRLCVP++ AVKTELL+EAHSSP 
Subjt:  -------------------------KLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPL

Query:  SMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKST
        SMH GSTKMYQDLKRVYWWR+MKREVAEFV +CLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSM+FITGLPRTLRGFTVIWVVVDRLTKSAHF+ GKST
Subjt:  SMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKST

Query:  YTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF
        YTASKWA+LY++EIVRLHGVP+SIVSDRDACFTSKF
Subjt:  YTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF

KAA0062245.1 pol protein [Cucumis melo var. makuwa] | 8.7e-209 | 63.87
Query:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF
        E EFAIELEP TVPIS+APYRMAP ELKELKVQLQELLDK FI+PSVSPWGAPVLFVKKKDGSMRLC DY+ELNKVTV+N+Y LPRID LF QL GA +F
Subjt:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF

Query:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF
        SKIDLR GYHQ                                          VFREFL TFVIVFIDDILIYSKTE EHEEHL MVL+TLR NKLYAKF
Subjt:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF

Query:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTS------------------------------SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSS
        SKC+FWLK++SFLGHVV K GVSVDPAKIEAVT                               ++ L Q L RKGAPFVWSKACEDSFQNLKQKLVT+ 
Subjt:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTS------------------------------SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSS

Query:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGK---------------------------------------------------------------------
        VLTVPDGSGSFVIYSDA KKGLGCVLMQQGK                                                                     
Subjt:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGK---------------------------------------------------------------------

Query:  ---------------------LAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSG
                             LAQL+VQPTLRQ+II  Q+NDPYLVEKR LAEAGQA EF +SSDGGL F+RRLCVP++ AVKTELL+EAHSSP SMH G
Subjt:  ---------------------LAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSG

Query:  STKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASK
        STKMYQDLKRVYWWR+MKREVAEFV KCLVCQQVKAP QKPAGLLQPLS+ EWKWENVSM+FITGLPRTLRGF+VIWVVVDRLTKSAHF+ GKSTYTASK
Subjt:  STKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASK

Query:  WAQLYLTEIVRLHGVPVSIVSDRDACFTSKF
        WAQLY++EIVRLHGVPVSIVSDRDA FTSKF
Subjt:  WAQLYLTEIVRLHGVPVSIVSDRDACFTSKF

TrEMBL top hits | e value | % identity
A0A5A7SY46 Ty3-gypsy retrotransposon protein | 1.2e-211 | 66.72
Query:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF
        E EFAIELEP TVPIS+APYRMAP ELKELKVQLQELLDK FI+PSVSPWGAPVLFVKKKDGSM LC DY+ELNKVTV+N+Y LPRID LF QL GA +F
Subjt:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF

Query:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF
        SKIDLR GYHQ                                          VFREFL TFVIVFI+DILIYSKTE EHE+HL +VL+TLR NKLYAKF
Subjt:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF

Query:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVT----------------------------SSSDLP-QSLIRKGAPFVWSKACEDSFQNLKQKLVTSSV
        SKC+FWLK +SFLGHVV K GVSVDPAKIEAVT                            S    P   L RKGAPFVWSKACEDSFQNLKQKLVT+ V
Subjt:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVT----------------------------SSSDLP-QSLIRKGAPFVWSKACEDSFQNLKQKLVTSSV

Query:  LTVPDGSGSFVIYSDASKKGLGCVLMQQG----------------------------------------------------------------KLAQLSV
        LTVPDGSGSFVIYSDASKKGLGCVLMQQG                                                                KLAQL+V
Subjt:  LTVPDGSGSFVIYSDASKKGLGCVLMQQG----------------------------------------------------------------KLAQLSV

Query:  QPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGK
        QPTLRQ+II  Q+NDPYLVEKR LAEAGQAVEF +SSDGGL F+RRLCVP++ AVKTELLTEAHSSP SMH GSTKMYQDLKRVYWWR+MKREVAEFV +
Subjt:  QPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGK

Query:  CLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACF
        CLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSM+FITGLPRTLRGFTVIWVVVDRLTKSAHF+LGKSTYTA KWAQLY++EIVRLHGVPVSIVSDRDA F
Subjt:  CLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACF

Query:  TSKF
        TSKF
Subjt:  TSKF

A0A5A7T7E7 Ty3-gypsy retrotransposon protein | 2.8e-205 | 66.67
Query:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF
        E EFAIELEP TVPIS+APYRM P ELKELKVQLQELL K FI+PSVSPWGAPVLFVKKKDGSMRLC DY+ELNKVTV+N+Y LPRID LF QL GA +F
Subjt:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF

Query:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF
        SKIDLR GYHQ                                          VFREFL TFVIVFIDDILIY KTEVEHEEHL MVL+TLR NKLYAKF
Subjt:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF

Query:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTS------------------------------SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSS
        SKC+FWLK++SFLGHVV K  V VDPAKI+AVT                               ++ L Q L RKGAPFVWSKA EDSFQNLKQKLVT+ 
Subjt:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTS------------------------------SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSS

Query:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQG---------------------------------------------------------KLAQLSVQPTLRQ
        VLTVPDGSGSFVIYSDASKKGLGCVLMQQG                                                         KLAQL+VQPTLRQ
Subjt:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQG---------------------------------------------------------KLAQLSVQPTLRQ

Query:  KIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQ
        +II  Q+NDPYLVEKR LAEAGQAVEF ISSDGGL F+RRLCVP++ AVKTELL EAHSSP SMH GSTKMYQDLKRVYWWR+MKREVAEFV KCLVCQQ
Subjt:  KIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQ

Query:  VKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRD
        VKAPRQKPAGLLQPLS+ EWKWENVSM+FIT LPRTLRGFTVIWVVVDRLTKSAHF+ GKSTYTA KWAQLY++EIVRLHGVPVSIVSDRD
Subjt:  VKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRD

A0A5A7TGC8 Ty3-gypsy retrotransposon protein | 5.9e-211 | 68.28
Query:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF
        E EFAIELEP TV +S+APYRMAP ELKELKVQLQELLDK FI+PSVSPWGAPVLF KKK+GSMRLC DY+ELNKVTV+N+Y LPRID LF QL GA +F
Subjt:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF

Query:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF
         KIDLR GYHQ                                          VFREFL TFVIVFIDDILIYSKTE EHEEHL +VL+TLR NKLYAKF
Subjt:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF

Query:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVT--SSSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVLMQ
        SKC+FWLK++SFLGHVV K GVSVDPAKIEAVT  +       L RKGAPFVWSKACEDSFQNLKQKLVT+ VL+VPDGSGSFVIYSDASKKGLGCVLMQ
Subjt:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVT--SSSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVLMQ

Query:  QG----------------------------------------------------------------KLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEA
        QG                                                                KLAQL+VQPTLRQ+II  Q+NDPYLVEKR LAEA
Subjt:  QG----------------------------------------------------------------KLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEA

Query:  GQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWK
        GQAVEF +SSDGGL F+RRLCVP++ AVKTELL+EAHSSP SMH GST MYQDLKRVYWWR+MKREVAEFV +CLVCQQVKAPRQKPAGLLQPLS+ EWK
Subjt:  GQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWK

Query:  WENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF
        WENVSM+FITGLPRTL+GFTVIWVVVDRLTKSAHF+ GKSTY ASKWAQLY++EIVRLHGVPVSIVSDRDA FTSKF
Subjt:  WENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF

A0A5A7TTY2 Ty3-gypsy retrotransposon protein | 1.6e-208 | 67.94
Query:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF
        E EFAIELE  TVPIS+APYRMA  ELKELKVQLQE+LDK FI+PSV PWGAPVLFVKKKDGSMRL  DY+ELNKVTV+N+Y LPRI+ LF QL GA MF
Subjt:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF

Query:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF
        SKIDLR GYHQ                                          VFR+FL TFVIVFIDDILIYSKTE EHEEHLHMVL+TLR NKLYAKF
Subjt:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF

Query:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTS--SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVLMQ
        SKC+FWLK++SFLGHVV K GVSVDPAKIEA+TS         L RKGAPFVWSKACEDSFQNLKQKLVT+ VL VPDGSGSFVIYSDASKKGLG VL+Q
Subjt:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTS--SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVLMQ

Query:  QG----------------------------------------------------------------KLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEA
        QG                                                                KLAQL+VQPTLRQKII  Q+NDPYLVEKR LAEA
Subjt:  QG----------------------------------------------------------------KLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEA

Query:  GQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWK
        GQAVEF ISSDGGL F+RRLCVP++ AVKTELL+EAHSSP S+H GSTKMYQDLK+VYWWR+MKRE+AEFV KCLVCQQVKAP++KPAGLLQPLSV EWK
Subjt:  GQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWK

Query:  WENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF
        WENVSM+FITGLPRTLRGFTVIWVVVDRLTKS HF+ G STYTASKWAQLY+++IVRLHGVPVSIVSDRDA FTSKF
Subjt:  WENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF

A0A5D3DZH4 Ty3-gypsy retrotransposon protein | 1.6e-208 | 67.94
Query:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF
        E EFAIELE  TVPIS+APYRMA  ELKELKVQLQE+LDK FI+PSV PWGAPVLFVKKKDGSMRL  DY+ELNKVTV+N+Y LPRI+ LF QL GA MF
Subjt:  ESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMF

Query:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF
        SKIDLR GYHQ                                          VFR+FL TFVIVFIDDILIYSKTE EHEEHLHMVL+TLR NKLYAKF
Subjt:  SKIDLRLGYHQ------------------------------------------VFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKF

Query:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTS--SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVLMQ
        SKC+FWLK++SFLGHVV K GVSVDPAKIEA+TS         L RKGAPFVWSKACEDSFQNLKQKLVT+ VL VPDGSGSFVIYSDASKKGLG VL+Q
Subjt:  SKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTS--SSDLPQSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVLMQ

Query:  QG----------------------------------------------------------------KLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEA
        QG                                                                KLAQL+VQPTLRQKII  Q+NDPYLVEKR LAEA
Subjt:  QG----------------------------------------------------------------KLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEA

Query:  GQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWK
        GQAVEF ISSDGGL F+RRLCVP++ AVKTELL+EAHSSP S+H GSTKMYQDLK+VYWWR+MKRE+AEFV KCLVCQQVKAP++KPAGLLQPLSV EWK
Subjt:  GQAVEFFISSDGGLFFKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWK

Query:  WENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF
        WENVSM+FITGLPRTLRGFTVIWVVVDRLTKS HF+ G STYTASKWAQLY+++IVRLHGVPVSIVSDRDA FTSKF
Subjt:  WENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF

SwissProt top hits | e value | % identity
P0CT34 Transposon Tf2-1 polyprotein | 3.6e-40 | 22.09
Query:  EFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMFSK
        EF +EL  +   +    Y + P +++ +  ++ + L    I+ S +    PV+FV KK+G++R+  DYK LNK    N Y LP I+ L  ++ G+ +F+K
Subjt:  EFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMFSK

Query:  IDLRLGYHQV----------------------------------FREFLGTF--------VIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSK
        +DL+  YH +                                  F+ F+ T         V+ ++DDILI+SK+E EH +H+  VL+ L+   L    +K
Subjt:  IDLRLGYHQV----------------------------------FREFLGTF--------VIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSK

Query:  CKFWLKKISFLGHVVPKVGVSVDPAKIEAV----------------------------TSSSDLP-QSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLT
        C+F   ++ F+G+ + + G +     I+ V                            TS    P  +L++K   + W+     + +N+KQ LV+  VL 
Subjt:  CKFWLKKISFLGHVVPKVGVSVDPAKIEAV----------------------------TSSSDLP-QSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLT

Query:  VPDGSGSFVIYSDASKKGLGCVLMQQ-------------------------------------------------------------GKLA---------
          D S   ++ +DAS   +G VL Q+                                                             G++          
Subjt:  VPDGSGSFVIYSDASKKGLGCVLMQQ-------------------------------------------------------------GKLA---------

Query:  ----------------------------------------------------QLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFF-
                                                            Q+S+    + +++    ND  L+    L    + VE  I    GL   
Subjt:  ----------------------------------------------------QLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFF-

Query:  -KRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRT
         K ++ +P +  +   ++ + H     +H G   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+  SE  WE++SM+FIT LP +
Subjt:  -KRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRT

Query:  LRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSK
          G+  ++VVVDR +K A  +    + TA + A+++   ++   G P  I++D D  FTS+
Subjt:  LRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSK

P0CT35 Transposon Tf2-2 polyprotein | 3.6e-40 | 22.09
Query:  EFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMFSK
        EF +EL  +   +    Y + P +++ +  ++ + L    I+ S +    PV+FV KK+G++R+  DYK LNK    N Y LP I+ L  ++ G+ +F+K
Subjt:  EFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMFSK

Query:  IDLRLGYHQV----------------------------------FREFLGTF--------VIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSK
        +DL+  YH +                                  F+ F+ T         V+ ++DDILI+SK+E EH +H+  VL+ L+   L    +K
Subjt:  IDLRLGYHQV----------------------------------FREFLGTF--------VIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSK

Query:  CKFWLKKISFLGHVVPKVGVSVDPAKIEAV----------------------------TSSSDLP-QSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLT
        C+F   ++ F+G+ + + G +     I+ V                            TS    P  +L++K   + W+     + +N+KQ LV+  VL 
Subjt:  CKFWLKKISFLGHVVPKVGVSVDPAKIEAV----------------------------TSSSDLP-QSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLT

Query:  VPDGSGSFVIYSDASKKGLGCVLMQQ-------------------------------------------------------------GKLA---------
          D S   ++ +DAS   +G VL Q+                                                             G++          
Subjt:  VPDGSGSFVIYSDASKKGLGCVLMQQ-------------------------------------------------------------GKLA---------

Query:  ----------------------------------------------------QLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFF-
                                                            Q+S+    + +++    ND  L+    L    + VE  I    GL   
Subjt:  ----------------------------------------------------QLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFF-

Query:  -KRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRT
         K ++ +P +  +   ++ + H     +H G   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+  SE  WE++SM+FIT LP +
Subjt:  -KRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRT

Query:  LRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSK
          G+  ++VVVDR +K A  +    + TA + A+++   ++   G P  I++D D  FTS+
Subjt:  LRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSK

P0CT41 Transposon Tf2-12 polyprotein | 3.6e-40 | 22.09
Query:  EFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMFSK
        EF +EL  +   +    Y + P +++ +  ++ + L    I+ S +    PV+FV KK+G++R+  DYK LNK    N Y LP I+ L  ++ G+ +F+K
Subjt:  EFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMFSK

Query:  IDLRLGYHQV----------------------------------FREFLGTF--------VIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSK
        +DL+  YH +                                  F+ F+ T         V+ ++DDILI+SK+E EH +H+  VL+ L+   L    +K
Subjt:  IDLRLGYHQV----------------------------------FREFLGTF--------VIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSK

Query:  CKFWLKKISFLGHVVPKVGVSVDPAKIEAV----------------------------TSSSDLP-QSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLT
        C+F   ++ F+G+ + + G +     I+ V                            TS    P  +L++K   + W+     + +N+KQ LV+  VL 
Subjt:  CKFWLKKISFLGHVVPKVGVSVDPAKIEAV----------------------------TSSSDLP-QSLIRKGAPFVWSKACEDSFQNLKQKLVTSSVLT

Query:  VPDGSGSFVIYSDASKKGLGCVLMQQ-------------------------------------------------------------GKLA---------
          D S   ++ +DAS   +G VL Q+                                                             G++          
Subjt:  VPDGSGSFVIYSDASKKGLGCVLMQQ-------------------------------------------------------------GKLA---------

Query:  ----------------------------------------------------QLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFF-
                                                            Q+S+    + +++    ND  L+    L    + VE  I    GL   
Subjt:  ----------------------------------------------------QLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFF-

Query:  -KRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRT
         K ++ +P +  +   ++ + H     +H G   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+  SE  WE++SM+FIT LP +
Subjt:  -KRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRT

Query:  LRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSK
          G+  ++VVVDR +K A  +    + TA + A+++   ++   G P  I++D D  FTS+
Subjt:  LRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 1.2e-46 | 24.7
Query:  IELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMFSKIDL
        IE++P        PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC DY+ LNK T+ + + LPRID L  ++  A +F+ +DL
Subjt:  IELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMFSKIDL

Query:  RLGYHQV----------------------------------FREFLG------TFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSKCKFWL
          GYHQ+                                  F  ++        FV V++DDILI+S++  EH +HL  VLE L+   L  K  KCKF  
Subjt:  RLGYHQV----------------------------------FREFLG------TFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSKCKFWL

Query:  KKISFLGHVVPKVGVSVDPAKIEAVTSSSDLP-QSLIRKGAPFV-----------------------------WSKACEDSFQNLKQKLVTSSVLTVPDG
        ++  FLG+    +G+           +  D P    +++   F+                             W++  + + + LK  L  S VL   + 
Subjt:  KKISFLGHVVPKVGVSVDPAKIEAVTSSSDLP-QSLIRKGAPFV-----------------------------WSKACEDSFQNLKQKLVTSSVLTVPDG

Query:  SGSFVIYSDASKKGLGCVLMQ--------------------------QGKLAQLSVQPTLRQ-----------------KIIVTQN-NDP----------
          ++ + +DASK G+G VL +                           G+L  L +   L                    ++  QN N+P          
Subjt:  SGSFVIYSDASKKGLGCVLMQ--------------------------QGKLAQLSVQPTLRQ-----------------KIIVTQN-NDP----------

Query:  ---------YLV-EKRCLAEAGQAVEFFIS----------------------------------------------------------------SDGGLF
                 YL   K  +A+A     + I+                                                                 D  ++
Subjt:  ---------YLV-EKRCLAEAGQAVEFFIS----------------------------------------------------------------SDGGLF

Query:  FKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRT
        ++ RL VP         L   H+     H G T     +  +Y+W  ++  + +++  C+ CQ +K+ R +  GLLQPL ++E +W ++SM+F+TGLP T
Subjt:  FKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRT

Query:  LRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTS
             +I VVVDR +K AHFI  + T  A++   L    I   HG P +I SDRD   T+
Subjt:  LRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTS

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 1.2e-46 | 24.7
Query:  IELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMFSKIDL
        IE++P        PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC DY+ LNK T+ + + LPRID L  ++  A +F+ +DL
Subjt:  IELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGAIMFSKIDL

Query:  RLGYHQV----------------------------------FREFLG------TFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSKCKFWL
          GYHQ+                                  F  ++        FV V++DDILI+S++  EH +HL  VLE L+   L  K  KCKF  
Subjt:  RLGYHQV----------------------------------FREFLG------TFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSKCKFWL

Query:  KKISFLGHVVPKVGVSVDPAKIEAVTSSSDLP-QSLIRKGAPFV-----------------------------WSKACEDSFQNLKQKLVTSSVLTVPDG
        ++  FLG+    +G+           +  D P    +++   F+                             W++  + +   LK  L  S VL   + 
Subjt:  KKISFLGHVVPKVGVSVDPAKIEAVTSSSDLP-QSLIRKGAPFV-----------------------------WSKACEDSFQNLKQKLVTSSVLTVPDG

Query:  SGSFVIYSDASKKGLGCVLMQ--------------------------QGKLAQLSVQPTLRQ-----------------KIIVTQN-NDP----------
          ++ + +DASK G+G VL +                           G+L  L +   L                    ++  QN N+P          
Subjt:  SGSFVIYSDASKKGLGCVLMQ--------------------------QGKLAQLSVQPTLRQ-----------------KIIVTQN-NDP----------

Query:  ---------YLV-EKRCLAEAGQAVEFFIS----------------------------------------------------------------SDGGLF
                 YL   K  +A+A     + I+                                                                 D  ++
Subjt:  ---------YLV-EKRCLAEAGQAVEFFIS----------------------------------------------------------------SDGGLF

Query:  FKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRT
        ++ RL VP         L   H+     H G T     +  +Y+W  ++  + +++  C+ CQ +K+ R +  GLLQPL ++E +W ++SM+F+TGLP T
Subjt:  FKRRLCVPANGAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRT

Query:  LRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTS
             +I VVVDR +K AHFI  + T  A++   L    I   HG P +I SDRD   T+
Subjt:  LRGFTVIWVVVDRLTKSAHFILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTS

Arabidopsis top hits    e value    %identity    Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein    2.3e-05    28.46
Query:  HLHMVLETLRANKLYAKFSKCKFWLKKISFLG--HVVPKVGVSVDPAKIEAVT------SSSDL----------------------PQSLIRKGAPFVWS
        HL MVL+    ++ YA   KC F   +I++LG  H++   GVS DPAK+EA+       ++++L                      P + + K     W+
Subjt:  HLHMVLETLRANKLYAKFSKCKFWLKKISFLG--HVVPKVGVSVDPAKIEAVT------SSSDL----------------------PQSLIRKGAPFVWS

Query:  KACEDSFQNLKQKLVTSSVLTVPDGSGSFV
        +    +F+ LK  + T  VL +PD    FV
Subjt:  KACEDSFQNLKQKLVTSSVLTVPDGSGSFV


Sequences
CDS sequence
ATGTTTTTCCTGAAGAACTTCCTAGATTACCTCCTCATAGAGAGTGAATTTGCCATTGAGTTGGAACCTGATACTGTTCCTATATCTAAAGCCCCATATAGAATGGCCCC
AATAGAGTTGAAAGAGTTGAAAGTGCAGTTACAAGAATTACTTGACAAAGACTTCATTCAACCGAGTGTGTCACCTTGGGGTGCACCAGTTTTATTTGTTAAAAAGAAGG
ATGGATCGATGCGCTTATGTACTGACTACAAAGAGTTGAATAAGGTAACTGTTAGGAATAAATACCTCTTGCCCAGGATCGATGGTCTGTTTTACCAGTTACATGGAGCT
ATAATGTTCTCTAAGATCGACCTTCGATTAGGATATCATCAAGTGTTTAGGGAGTTCCTAGGCACTTTTGTGATTGTGTTTATTGACGACATTTTGATATATTCCAAGAC
AGAGGTAGAGCATGAGGAGCATTTACACATGGTTCTAGAAACCCTTCGAGCTAATAAATTGTATGCAAAGTTCTCAAAATGTAAGTTTTGGTTAAAGAAAATATCTTTTC
TAGGCCATGTGGTTCCTAAAGTTGGTGTTTCTGTTGATCCAGCTAAGATAGAGGCAGTCACTAGTAGCTCCGACCTTCCACAGTCATTGATCAGGAAGGGAGCTCCTTTT
GTTTGGAGCAAGGCCTGTGAGGACAGTTTTCAGAACCTTAAACAGAAACTCGTTACTTCATCAGTTCTTACTGTACCTGATGGTTCAGGAAGTTTTGTGATTTACAGTGA
TGCTTCTAAGAAAGGTTTGGGTTGTGTTCTGATGCAACAAGGTAAGTTAGCTCAATTATCGGTGCAACCGACTTTGAGGCAGAAGATTATTGTTACTCAGAATAACGATC
CTTATTTGGTTGAGAAGCGTTGCCTAGCGGAAGCAGGGCAAGCTGTTGAGTTCTTCATATCCTCTGATGGTGGGCTTTTTTTTAAGAGGCGCCTTTGTGTGCCAGCAAAT
GGTGCGGTTAAAACAGAATTATTAACTGAGGCTCATAGTTCCCCACTTTCCATGCACTCAGGTAGTACAAAGATGTATCAAGACCTGAAACGAGTTTACTGGTGGCGTGA
TATGAAGAGAGAGGTGGCAGAATTTGTCGGTAAATGCTTGGTGTGTCAGCAGGTTAAAGCACCAAGGCAGAAACCAGCAGGTTTATTACAACCCTTAAGTGTGTCGGAAT
GGAAGTGGGAAAATGTGTCCATGAATTTCATTACAGGACTGCCTAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGTTGTTGACAGGCTTACCAAGTCAGCACATTTC
ATTCTGGGGAAATCTACTTATACTGCCAGTAAGTGGGCACAGTTGTACTTGACTGAGATAGTGAGACTGCATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCTG
TTTCACTTCTAAATTCTAG
mRNA sequence
ATGTTTTTCCTGAAGAACTTCCTAGATTACCTCCTCATAGAGAGTGAATTTGCCATTGAGTTGGAACCTGATACTGTTCCTATATCTAAAGCCCCATATAGAATGGCCCC
AATAGAGTTGAAAGAGTTGAAAGTGCAGTTACAAGAATTACTTGACAAAGACTTCATTCAACCGAGTGTGTCACCTTGGGGTGCACCAGTTTTATTTGTTAAAAAGAAGG
ATGGATCGATGCGCTTATGTACTGACTACAAAGAGTTGAATAAGGTAACTGTTAGGAATAAATACCTCTTGCCCAGGATCGATGGTCTGTTTTACCAGTTACATGGAGCT
ATAATGTTCTCTAAGATCGACCTTCGATTAGGATATCATCAAGTGTTTAGGGAGTTCCTAGGCACTTTTGTGATTGTGTTTATTGACGACATTTTGATATATTCCAAGAC
AGAGGTAGAGCATGAGGAGCATTTACACATGGTTCTAGAAACCCTTCGAGCTAATAAATTGTATGCAAAGTTCTCAAAATGTAAGTTTTGGTTAAAGAAAATATCTTTTC
TAGGCCATGTGGTTCCTAAAGTTGGTGTTTCTGTTGATCCAGCTAAGATAGAGGCAGTCACTAGTAGCTCCGACCTTCCACAGTCATTGATCAGGAAGGGAGCTCCTTTT
GTTTGGAGCAAGGCCTGTGAGGACAGTTTTCAGAACCTTAAACAGAAACTCGTTACTTCATCAGTTCTTACTGTACCTGATGGTTCAGGAAGTTTTGTGATTTACAGTGA
TGCTTCTAAGAAAGGTTTGGGTTGTGTTCTGATGCAACAAGGTAAGTTAGCTCAATTATCGGTGCAACCGACTTTGAGGCAGAAGATTATTGTTACTCAGAATAACGATC
CTTATTTGGTTGAGAAGCGTTGCCTAGCGGAAGCAGGGCAAGCTGTTGAGTTCTTCATATCCTCTGATGGTGGGCTTTTTTTTAAGAGGCGCCTTTGTGTGCCAGCAAAT
GGTGCGGTTAAAACAGAATTATTAACTGAGGCTCATAGTTCCCCACTTTCCATGCACTCAGGTAGTACAAAGATGTATCAAGACCTGAAACGAGTTTACTGGTGGCGTGA
TATGAAGAGAGAGGTGGCAGAATTTGTCGGTAAATGCTTGGTGTGTCAGCAGGTTAAAGCACCAAGGCAGAAACCAGCAGGTTTATTACAACCCTTAAGTGTGTCGGAAT
GGAAGTGGGAAAATGTGTCCATGAATTTCATTACAGGACTGCCTAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGTTGTTGACAGGCTTACCAAGTCAGCACATTTC
ATTCTGGGGAAATCTACTTATACTGCCAGTAAGTGGGCACAGTTGTACTTGACTGAGATAGTGAGACTGCATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCTG
TTTCACTTCTAAATTCTAG
Protein sequence
MFFLKNFLDYLLIESEFAIELEPDTVPISKAPYRMAPIELKELKVQLQELLDKDFIQPSVSPWGAPVLFVKKKDGSMRLCTDYKELNKVTVRNKYLLPRIDGLFYQLHGA
IMFSKIDLRLGYHQVFREFLGTFVIVFIDDILIYSKTEVEHEEHLHMVLETLRANKLYAKFSKCKFWLKKISFLGHVVPKVGVSVDPAKIEAVTSSSDLPQSLIRKGAPF
VWSKACEDSFQNLKQKLVTSSVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKLAQLSVQPTLRQKIIVTQNNDPYLVEKRCLAEAGQAVEFFISSDGGLFFKRRLCVPAN
GAVKTELLTEAHSSPLSMHSGSTKMYQDLKRVYWWRDMKREVAEFVGKCLVCQQVKAPRQKPAGLLQPLSVSEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHF
ILGKSTYTASKWAQLYLTEIVRLHGVPVSIVSDRDACFTSKF