CuGenDBv2

CSPI07G02040 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G02040
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: Chr7:1730625..1731795
RNA-Seq Expression: CSPI07G02040
Synteny: CSPI07G02040
Gene Ontology terms: GO:0009987 - cellular process (biological process)
InterPro domains:
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR041588 - Integrase zinc-binding domain
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (E-value, % identity, alignment)
KAA0049630.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | E-value: 8.7e-76 | Identity: 44.88%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET
        G Y S+KG++VDP+K R +KEWP P NVRE                              +GAY+WT+ET+ AFEKLK AMMTLPVLA+P+FNLPFEIE+
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET

Query:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL
        +ASG G+G VL+Q     ++F +       A+P+YE ELMAV   VQRWRP                                      Y+ EV+YKPGL
Subjt:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL

Query:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR
        ENK ADALSR+  + HLNQLTAPALL++ +I++E+ K P L EI+  +++    +  ++  Q                                  GF R
Subjt:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR

Query:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
        TYKR+ GELYW+GMK DV+KYCE+C++CQ  KS  LSPAGLL+PLEIPDA+ S I MDFI+GLPK+ G+ VILV VDRLSK
Subjt:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK

KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | E-value: 8.7e-76 | Identity: 44.88%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET
        G Y S+KG++VDP+K R +KEWP P NVRE                              +GAY+WT+ET+ AFEKLK AMMTLPVLA+P+FNLPFEIE+
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET

Query:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL
        +ASG G+G VL+Q     ++F +       A+P+YE ELMAV   VQRWRP                                      Y+ EV+YKPGL
Subjt:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL

Query:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR
        ENK ADALSR+  + HLNQLTAPALL++ +I++E+ K P L EI+  +++    +  ++  Q                                  GF R
Subjt:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR

Query:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
        TYKR+ GELYW+GMK DV+KYCE+C++CQ  KS  LSPAGLL+PLEIPDA+ S I MDFI+GLPK+ G+ VILV VDRLSK
Subjt:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK

KAA0051400.1 putative retroelement pol polyprotein [Cucumis melo var. makuwa] | E-value: 5.1e-76 | Identity: 49%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVREAGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIETNASGNGIGVVLIQ-----SHFVR----EGS
        G Y S KG++VDP+K R ++EWP P+N   AGAY+WT+ET+ AFEKLK AMMTLPVLA+P+FNLPFEIE++ASG G+G VL+Q     ++F +       
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVREAGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIETNASGNGIGVVLIQ-----SHFVR----EGS

Query:  AKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGLENKVADALSRM-TSMHLNQLTAPALLELII
        A+P+YE EL+AV   VQRWRP                                      Y+ EV+YKPG+ENK  DALSRM  + HLNQLTAPALL++ +
Subjt:  AKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGLENKVADALSRM-TSMHLNQLTAPALLELII

Query:  IKEELEKGPWLSEI---IEE--------------LKKNEECVAD-----------------FSLQQGFFRTYKRLKGELYWEGMKGDVKKYCEKCVVCQH
        I+EE+ K P L EI   IEE              LK  E  V                   F    GF +TYKR+ GELYW+GMK DV+KYCE+C++CQ 
Subjt:  IKEELEKGPWLSEI---IEE--------------LKKNEECVAD-----------------FSLQQGFFRTYKRLKGELYWEGMKGDVKKYCEKCVVCQH

Query:  PKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
         KS  LSP GLL+PLEIPDA+ S I MDFI+GLPK+ G+ VILV VDRLSK
Subjt:  PKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK

TYK15990.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | E-value: 8.7e-76 | Identity: 44.88%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET
        G Y S+KG++VDP+K R +KEWP P NVRE                              +GAY+WT+ET+ AFEKLK AMMTLPVLA+P+FNLPFEIE+
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET

Query:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL
        +ASG G+G VL+Q     ++F +       A+P+YE ELMAV   VQRWRP                                      Y+ EV+YKPGL
Subjt:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL

Query:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR
        ENK ADALSR+  + HLNQLTAPALL++ +I++E+ K P L EI+  +++    +  ++  Q                                  GF R
Subjt:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR

Query:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
        TYKR+ GELYW+GMK DV+KYCE+C++CQ  KS  LSPAGLL+PLEIPDA+ S I MDFI+GLPK+ G+ VILV VDRLSK
Subjt:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK

TYK23090.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | E-value: 8.7e-76 | Identity: 44.88%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET
        G Y S+KG++VDP+K R +KEWP P NVRE                              +GAY+WT+ET+ AFEKLK AMMTLPVLA+P+FNLPFEIE+
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET

Query:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL
        +ASG G+G VL+Q     ++F +       A+P+YE ELMAV   VQRWRP                                      Y+ EV+YKPGL
Subjt:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL

Query:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR
        ENK ADALSR+  + HLNQLTAPALL++ +I++E+ K P L EI+  +++    +  ++  Q                                  GF R
Subjt:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR

Query:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
        TYKR+ GELYW+GMK DV+KYCE+C++CQ  KS  LSPAGLL+PLEIPDA+ S I MDFI+GLPK+ G+ VILV VDRLSK
Subjt:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK

TrEMBL top hits (E-value, % identity, alignment)
A0A5A7U2S1 Ty3/gypsy retrotransposon protein | E-value: 4.2e-76 | Identity: 44.88%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET
        G Y S+KG++VDP+K R +KEWP P NVRE                              +GAY+WT+ET+ AFEKLK AMMTLPVLA+P+FNLPFEIE+
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET

Query:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL
        +ASG G+G VL+Q     ++F +       A+P+YE ELMAV   VQRWRP                                      Y+ EV+YKPGL
Subjt:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL

Query:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR
        ENK ADALSR+  + HLNQLTAPALL++ +I++E+ K P L EI+  +++    +  ++  Q                                  GF R
Subjt:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR

Query:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
        TYKR+ GELYW+GMK DV+KYCE+C++CQ  KS  LSPAGLL+PLEIPDA+ S I MDFI+GLPK+ G+ VILV VDRLSK
Subjt:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK

A0A5A7U6J3 Ty3/gypsy retrotransposon protein | E-value: 4.2e-76 | Identity: 44.88%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET
        G Y S+KG++VDP+K R +KEWP P NVRE                              +GAY+WT+ET+ AFEKLK AMMTLPVLA+P+FNLPFEIE+
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET

Query:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL
        +ASG G+G VL+Q     ++F +       A+P+YE ELMAV   VQRWRP                                      Y+ EV+YKPGL
Subjt:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL

Query:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR
        ENK ADALSR+  + HLNQLTAPALL++ +I++E+ K P L EI+  +++    +  ++  Q                                  GF R
Subjt:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR

Query:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
        TYKR+ GELYW+GMK DV+KYCE+C++CQ  KS  LSPAGLL+PLEIPDA+ S I MDFI+GLPK+ G+ VILV VDRLSK
Subjt:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK

A0A5A7U8A5 Putative retroelement pol polyprotein | E-value: 2.5e-76 | Identity: 49%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVREAGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIETNASGNGIGVVLIQ-----SHFVR----EGS
        G Y S KG++VDP+K R ++EWP P+N   AGAY+WT+ET+ AFEKLK AMMTLPVLA+P+FNLPFEIE++ASG G+G VL+Q     ++F +       
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVREAGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIETNASGNGIGVVLIQ-----SHFVR----EGS

Query:  AKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGLENKVADALSRM-TSMHLNQLTAPALLELII
        A+P+YE EL+AV   VQRWRP                                      Y+ EV+YKPG+ENK  DALSRM  + HLNQLTAPALL++ +
Subjt:  AKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGLENKVADALSRM-TSMHLNQLTAPALLELII

Query:  IKEELEKGPWLSEI---IEE--------------LKKNEECVAD-----------------FSLQQGFFRTYKRLKGELYWEGMKGDVKKYCEKCVVCQH
        I+EE+ K P L EI   IEE              LK  E  V                   F    GF +TYKR+ GELYW+GMK DV+KYCE+C++CQ 
Subjt:  IKEELEKGPWLSEI---IEE--------------LKKNEECVAD-----------------FSLQQGFFRTYKRLKGELYWEGMKGDVKKYCEKCVVCQH

Query:  PKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
         KS  LSP GLL+PLEIPDA+ S I MDFI+GLPK+ G+ VILV VDRLSK
Subjt:  PKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK

A0A5D3CXB1 Ty3/gypsy retrotransposon protein | E-value: 4.2e-76 | Identity: 44.88%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET
        G Y S+KG++VDP+K R +KEWP P NVRE                              +GAY+WT+ET+ AFEKLK AMMTLPVLA+P+FNLPFEIE+
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET

Query:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL
        +ASG G+G VL+Q     ++F +       A+P+YE ELMAV   VQRWRP                                      Y+ EV+YKPGL
Subjt:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL

Query:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR
        ENK ADALSR+  + HLNQLTAPALL++ +I++E+ K P L EI+  +++    +  ++  Q                                  GF R
Subjt:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR

Query:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
        TYKR+ GELYW+GMK DV+KYCE+C++CQ  KS  LSPAGLL+PLEIPDA+ S I MDFI+GLPK+ G+ VILV VDRLSK
Subjt:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK

A0A5D3DI73 Ty3/gypsy retrotransposon protein | E-value: 4.2e-76 | Identity: 44.88%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET
        G Y S+KG++VDP+K R +KEWP P NVRE                              +GAY+WT+ET+ AFEKLK AMMTLPVLA+P+FNLPFEIE+
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE------------------------------AGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIET

Query:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL
        +ASG G+G VL+Q     ++F +       A+P+YE ELMAV   VQRWRP                                      Y+ EV+YKPGL
Subjt:  NASGNGIGVVLIQ-----SHFVR----EGSAKPIYEWELMAVELVVQRWRPKC------------------------------------YTSEVIYKPGL

Query:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR
        ENK ADALSR+  + HLNQLTAPALL++ +I++E+ K P L EI+  +++    +  ++  Q                                  GF R
Subjt:  ENKVADALSRM-TSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQ----------------------------------GFFR

Query:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
        TYKR+ GELYW+GMK DV+KYCE+C++CQ  KS  LSPAGLL+PLEIPDA+ S I MDFI+GLPK+ G+ VILV VDRLSK
Subjt:  TYKRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK

SwissProt top hits (E-value, % identity, alignment)
P10394 Retrovirus-related Pol polyprotein from transposon 412 | E-value: 9.6e-09 | Identity: 25.33%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVREAG-------------------------------AYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIE
        G  C+ KG+  D KK   I+ +P+P +   A                                 +EWT E Q+AF  LK+ ++   +L  P+F+  F I 
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVREAG-------------------------------AYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIE

Query:  TNASGNGIGVVLIQSH-------------FVREGSAKPIYEWELMAVELVVQRWRPKCYTSE------------------------------------VI
        T+AS    G VL Q+H             F +  S K   E EL A+   +  +RP  Y                                       V 
Subjt:  TNASGNGIGVVLIQSH-------------FVREGSAKPIYEWELMAVELVVQRWRPKCYTSE------------------------------------VI

Query:  YKPGLENKVADALSRMTSMHLNQLTAPAL
        Y  G +N VADALSR+T   L  +T   L
Subjt:  YKPGLENKVADALSRMTSMHLNQLTAPAL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy | E-value: 6.4e-05 | Identity: 23.6%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVREAGAY------------------------------------------EWTKETQEAFEKLKNAMMTLPV-L
        G   S+ G K DP+K + I+E+P P  V +  ++                                          E+ +  + AF++L+N + +  V L
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVREAGAY------------------------------------------EWTKETQEAFEKLKNAMMTLPV-L

Query:  ALPNFNLPFEIETNASGNGIGVVLIQ--------SHFVREGSAK-PIYEWELMAV-----------------------------------ELVVQRWRPK
          P+F  PF++ T+AS +GIG VL Q        S  +++        E EL+A+                                      ++RW+  
Subjt:  ALPNFNLPFEIETNASGNGIGVVLIQ--------SHFVREGSAK-PIYEWELMAV-----------------------------------ELVVQRWRPK

Query:  C--YTSEVIYKPGLENKVADALSRMTSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEEC
           + ++V YKPG EN VADALSR    +LN L      +   I  EL     L+  +E   K   C
Subjt:  C--YTSEVIYKPGLENKVADALSRMTSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEEC

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | E-value: 9.6e-09 | Identity: 25.07%
Query:  EWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIETNASGNGIGVVL-----------IQSHFVR--EGSAK--PIYEWELMAVELV-------------
        +WT++  +A EKLK A+   PVL   N    + + T+AS +GIG VL           +  +F +  E + K  P  E EL+ +                
Subjt:  EWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIETNASGNGIGVVL-----------IQSHFVR--EGSAK--PIYEWELMAVELV-------------

Query:  ---------------------VQRWRP--KCYTSEVIYKPGLENKVADALSRMTSMHLNQLTAPALLE---------------LIIIKEELEKGPWLS--
                             VQRW      Y   + Y  G +N VADA+SR       + + P   E               LI +KE  +        
Subjt:  ---------------------VQRWRP--KCYTSEVIYKPGLENKVADALSRMTSMHLNQLTAPALLE---------------LIIIKEELEKGPWLS--

Query:  ----------EIIEELKKN-----------EECVADFSLQQGFFRTY----------------KRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPA
                  E+ E  +KN           +  V     Q    R Y                 ++    YW  ++  + +Y   CV CQ  KS      
Subjt:  ----------EIIEELKKN-----------EECVADFSLQQGFFRTY----------------KRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPA

Query:  GLLMPLEIPDAVLSHIFMDFIDGLPKAA-GFNVILVAVDRLSK
        GLL PL I +     I MDF+ GLP  +   N+ILV VDR SK
Subjt:  GLLMPLEIPDAVLSHIFMDFIDGLPKAA-GFNVILVAVDRLSK

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | E-value: 1.2e-08 | Identity: 26.25%
Query:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE-----------------------------AGAYEWTKETQ-------------EAFEKLKNAMMTLPVLA
        G   +  G+K DPKK R I E P PT+V+E                              G Y   K +Q             ++F  LK+ + +  +LA
Subjt:  GAYCSRKGVKVDPKKTRTIKEWPIPTNVRE-----------------------------AGAYEWTKETQ-------------EAFEKLKNAMMTLPVLA

Query:  LPNFNLPFEIETNASGNGIGVVLIQSHFVREGSAKPI----------------YEWELMAV-----------------------------------ELVV
         P F  PF + T+AS   IG VL Q     +G  +PI                 E E++A+                                      +
Subjt:  LPNFNLPFEIETNASGNGIGVVLIQSHFVREGSAKPI----------------YEWELMAV-----------------------------------ELVV

Query:  QRWRPKC--YTSEVIYKPGLENKVADALSRMTSMHLNQLT
        +RW+ +   Y  E+IYKPG  N VADALSR+    LNQL+
Subjt:  QRWRPKC--YTSEVIYKPGLENKVADALSRMTSMHLNQLT

Q99315 Transposon Ty3-G Gag-Pol polyprotein | E-value: 7.3e-09 | Identity: 24.78%
Query:  EWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIETNASGNGIGVVL-----------IQSHFVR--EGSAK--PIYEWELMAVELV-------------
        +WT++  +A +KLK+A+   PVL   N    + + T+AS +GIG VL           +  +F +  E + K  P  E EL+ +                
Subjt:  EWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIETNASGNGIGVVL-----------IQSHFVR--EGSAK--PIYEWELMAVELV-------------

Query:  ---------------------VQRWRP--KCYTSEVIYKPGLENKVADALSRMTSMHLNQLTAPALLE---------------LIIIKEELEKGPWLS--
                             VQRW      Y   + Y  G +N VADA+SR       + + P   E               LI +KE  +        
Subjt:  ---------------------VQRWRP--KCYTSEVIYKPGLENKVADALSRMTSMHLNQLTAPALLE---------------LIIIKEELEKGPWLS--

Query:  ----------EIIEELKKN-----------EECVADFSLQQGFFRTY----------------KRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPA
                  E+ E  +KN           +  V     Q    R Y                 ++    YW  ++  + +Y   CV CQ  KS      
Subjt:  ----------EIIEELKKN-----------EECVADFSLQQGFFRTY----------------KRLKGELYWEGMKGDVKKYCEKCVVCQHPKSLPLSPA

Query:  GLLMPLEIPDAVLSHIFMDFIDGLPKAA-GFNVILVAVDRLSK
        GLL PL I +     I MDF+ GLP  +   N+ILV VDR SK
Subjt:  GLLMPLEIPDAVLSHIFMDFIDGLPKAA-GFNVILVAVDRLSK

Arabidopsis top hits (E-value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein | E-value: 5.0e-05 | Identity: 33.7%
Query:  SRKGVKVDPKKTRTIKEWPIPTNVRE-------AGAY-----------------------EWTKETQEAFEKLKNAMMTLPVLALPNFNLPF
        S +GV  DP K   +  WP P N  E        G Y                       +WT+    AF+ LK A+ TLPVLALP+  LPF
Subjt:  SRKGVKVDPKKTRTIKEWPIPTNVRE-------AGAY-----------------------EWTKETQEAFEKLKNAMMTLPVLALPNFNLPF


Sequences
CDS sequence
ATGCAACTTTGCCCGGGCAAGAGTATAATATTTGGGGCATATTGTTCGAGAAAAGGAGTGAAAGTTGACCCGAAAAAAACTAGAACAATAAAGGAATGGCCAATACCGAC
AAATGTGCGGGAAGCAGGAGCTTACGAGTGGACGAAAGAGACTCAAGAAGCTTTTGAGAAATTAAAGAATGCTATGATGACATTGCCAGTGCTAGCATTACCTAATTTCA
ATTTGCCCTTCGAGATTGAAACTAATGCATCTGGCAATGGGATTGGAGTAGTGTTGATCCAATCACACTTTGTTCGTGAGGGATCGGCTAAACCGATTTATGAATGGGAG
TTAATGGCAGTGGAGTTAGTTGTGCAACGGTGGAGACCGAAATGTTATACGTCTGAGGTAATCTATAAACCGGGGTTGGAGAATAAGGTTGCTGATGCACTCTCTAGAAT
GACTTCAATGCATTTAAATCAATTAACGGCTCCTGCTTTGCTGGAATTGATCATAATTAAAGAAGAGCTGGAGAAGGGTCCATGGCTGAGTGAAATAATAGAAGAATTGA
AGAAGAATGAGGAGTGTGTGGCTGACTTTTCTTTACAGCAGGGATTTTTTAGGACTTATAAACGGCTGAAGGGAGAGCTGTATTGGGAAGGTATGAAGGGTGATGTGAAG
AAATATTGTGAGAAATGTGTCGTATGTCAGCATCCTAAGTCCTTGCCGTTGTCTCCAGCAGGATTATTGATGCCTTTAGAGATTCCGGATGCTGTATTGAGCCACATCTT
TATGGATTTCATTGATGGGTTACCAAAGGCGGCTGGTTTCAATGTGATATTAGTAGCGGTGGACAGATTGAGTAAGTAG
mRNA sequence
ATGCAACTTTGCCCGGGCAAGAGTATAATATTTGGGGCATATTGTTCGAGAAAAGGAGTGAAAGTTGACCCGAAAAAAACTAGAACAATAAAGGAATGGCCAATACCGAC
AAATGTGCGGGAAGCAGGAGCTTACGAGTGGACGAAAGAGACTCAAGAAGCTTTTGAGAAATTAAAGAATGCTATGATGACATTGCCAGTGCTAGCATTACCTAATTTCA
ATTTGCCCTTCGAGATTGAAACTAATGCATCTGGCAATGGGATTGGAGTAGTGTTGATCCAATCACACTTTGTTCGTGAGGGATCGGCTAAACCGATTTATGAATGGGAG
TTAATGGCAGTGGAGTTAGTTGTGCAACGGTGGAGACCGAAATGTTATACGTCTGAGGTAATCTATAAACCGGGGTTGGAGAATAAGGTTGCTGATGCACTCTCTAGAAT
GACTTCAATGCATTTAAATCAATTAACGGCTCCTGCTTTGCTGGAATTGATCATAATTAAAGAAGAGCTGGAGAAGGGTCCATGGCTGAGTGAAATAATAGAAGAATTGA
AGAAGAATGAGGAGTGTGTGGCTGACTTTTCTTTACAGCAGGGATTTTTTAGGACTTATAAACGGCTGAAGGGAGAGCTGTATTGGGAAGGTATGAAGGGTGATGTGAAG
AAATATTGTGAGAAATGTGTCGTATGTCAGCATCCTAAGTCCTTGCCGTTGTCTCCAGCAGGATTATTGATGCCTTTAGAGATTCCGGATGCTGTATTGAGCCACATCTT
TATGGATTTCATTGATGGGTTACCAAAGGCGGCTGGTTTCAATGTGATATTAGTAGCGGTGGACAGATTGAGTAAGTAGGCACATT
Protein sequence
MQLCPGKSIIFGAYCSRKGVKVDPKKTRTIKEWPIPTNVREAGAYEWTKETQEAFEKLKNAMMTLPVLALPNFNLPFEIETNASGNGIGVVLIQSHFVREGSAKPIYEWE
LMAVELVVQRWRPKCYTSEVIYKPGLENKVADALSRMTSMHLNQLTAPALLELIIIKEELEKGPWLSEIIEELKKNEECVADFSLQQGFFRTYKRLKGELYWEGMKGDVK
KYCEKCVVCQHPKSLPLSPAGLLMPLEIPDAVLSHIFMDFIDGLPKAAGFNVILVAVDRLSK
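
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the CuGenDB record: the names `CODON_TABLE` and `translate` are illustrative, and it assumes the standard (nuclear) codon table applies and that the CDS is in frame from its first base.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the CDS listed above.
cds_start = "ATGCAACTTTGCCCGGGCAAGAGTATAATATTTGGGGCATATTGTTCGAGAAAAGGAGTG"
print(translate(cds_start))
```

Passing the full CDS to `translate` in the same way allows a comparison against the complete protein sequence listed above; the last codon of the CDS, TAG, is a stop codon and is not emitted.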