CuGenDBv2

CSPI04G18610 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G18610
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Transposon Ty3-I Gag-Pol polyprotein
Genome location: Chr4:16133827..16135555
RNA-Seq Expression: CSPI04G18610
Synteny: CSPI04G18610
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
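
The genome location string in this record follows a `chromosome:start..end` pattern. A minimal sketch of parsing it is given below; the names `Locus` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the page does not state explicitly.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a string such as 'Chr4:16133827..16135555' into a Locus."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(chrom, start, end)


locus = parse_location("Chr4:16133827..16135555")
print(locus.chrom, locus.start, locus.end, locus.span)
```

Under the inclusive-coordinate assumption, the gene spans 1729 bp on Chr4.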


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0039232.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa] (e-value: 3.9e-93; % identity: 53.06)
Query:  GEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQG---------------------------AEITTFNHLPTLYETDEDFGEIWSH
        G+KQQH+FDSLKRK ASQ VLKLPEF++PFEVA+DASGV IG VLSQG                           AEIT FNHLPTLYETDEDF      
Subjt:  GEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQG---------------------------AEITTFNHLPTLYETDEDFGEIWSH

Query:  CTHFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNV
                          GDQLCI HTSLREALIKEAHSG L GHFGQ KTFQI+IKR YWPQARRD      +  IF + +                  
Subjt:  CTHFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNV

Query:  WEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT-----------------------------------TLWKKFDTTLKFSTTAHP-QT
         +DL       L  TQRGF+S MVVVDRFSKMSHF PCKK  D                                    TLWKKFDTTL  +   HP Q 
Subjt:  WEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT-----------------------------------TLWKKFDTTLKFSTTAHP-QT

Query:  DGQTKVTNRSLENLIRYLSGNHPRQ--WDMAPRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLK
        D        +  N++   +G  P +  +  APRLTFDLTSLPKEVEIQEE EQLA+RIQKLH EVID ITKTT SYKEEKNKK++EVHFQVGDLVMAHLK
Subjt:  DGQTKVTNRSLENLIRYLSGNHPRQ--WDMAPRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLK

Query:  KKRFPIGTY
        KKRFP+GTY
Subjt:  KKRFPIGTY

KAE8652794.1 hypothetical protein Csa_022828 [Cucumis sativus] (e-value: 1.7e-88; % identity: 42.82)
Query:  MGLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQ---------------------
        +GLASFYR+FIKNFS+I A + +CLKKG+F WG+ ++ +F  LK   AS  VLKLP F+ PFEV +DASG+ IGVVLSQ                     
Subjt:  MGLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQ---------------------

Query:  ------------------------------------------------------------------------------------GAEITTFNHLPTLYET
                                                                                             ++I  F+HL TLY T
Subjt:  ------------------------------------------------------------------------------------GAEITTFNHLPTLYET

Query:  DEDFGEIWSHCT-HFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVG
        D DF  IW +C+ H   +DYH+V  FLFKGD LC+ HTSLREA+IKE HS  LAGHFG+ KT   II +F+WPQ  R++ NF+KRC I Q AKG+S N G
Subjt:  DEDFGEIWSHCT-HFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVG

Query:  LYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT-----------------------------------TLWKKFDTTL
        LYTPL I   +WEDLS+DFV+GLP+TQRG +S  VVV+RFSKM+HF PCKK  D                                    +L KKFDT L
Subjt:  LYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT-----------------------------------TLWKKFDTTL

Query:  KFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMA
         FST +HPQTDGQT+VTNR+L NLIR LSG+ P+QWD+A
Subjt:  KFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMA

KAG7588782.1 Zinc finger CCHC-type [Arabidopsis suecica] (e-value: 6.9e-90; % identity: 38.58)
Query:  GLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQ----------------------
        GLA+FYR+FI++FS+I + + +CLKKG F+WG +Q+ +FD +K K  + LVL LP+F+  F+V  DASGV IG VLSQ                      
Subjt:  GLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQ----------------------

Query:  -----------------------------GAEITTFNHLPTLYETDEDFGEIWSHCTHFH-DRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAG
                                       EI  F  +  LYE D +F E+W+ C   H   D+H+ EG+LFKGD+LCI  +SLRE LI++ H   L+G
Subjt:  -----------------------------GAEITTFNHLPTLYETDEDFGEIWSHCTHFH-DRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAG

Query:  HFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT
        H G+ KT   + +R++WP  RRD    V+RC I Q +KG S N GLY PL IP ++W+DL++DFV+GLP+TQRG +S  VVVDRFSKM+HF  CKK  D 
Subjt:  HFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT

Query:  -----------------------------------TLWKKFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMAPRLTFDLTSLPKEVEI
                                           TLW+ F T L  S+TAHPQTDGQT+VTNR+L N++R  S      +   P+   DL  LPK   +
Subjt:  -----------------------------------TLWKKFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMAPRLTFDLTSLPKEVEI

Query:  QEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK----------------------HINISPIFNV
            E +A+ I  +   V   +  T    K   +K+RR   F+ GD VM  L+K+RFP+GTY K+K +                       +NIS  FNV
Subjt:  QEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK----------------------HINISPIFNV

Query:  ADLRSYNA
        AD+  Y+A
Subjt:  ADLRSYNA

TXG62763.1 hypothetical protein EZV62_009757 [Acer yangbiense] (e-value: 1.5e-89; % identity: 34.69)
Query:  GLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQ----------------------
        GLA+FYR+F+++FSSI+A + +CLKKG F W E     F  +K K  +  VL LP F   FEV  DASGV IG VLSQ                      
Subjt:  GLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQ----------------------

Query:  -----------------------------------------------------------------------------------GAEITTFNHLPTLYETD
                                                                                             EI  F  L  LY  D
Subjt:  -----------------------------------------------------------------------------------GAEITTFNHLPTLYETD

Query:  EDFGEIWSHCT-HFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGL
        EDFGE+W  C     D ++H+ EG+LF G+QLCI  +SLRE LI+E H G L GH G+ KT   + +R+YWPQ +RD+ NFV++C ++Q +KG + N GL
Subjt:  EDFGEIWSHCT-HFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGL

Query:  YTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIID-----------------------------------TTLWKKFDTTLK
        Y PL +P  +WEDL++DFV+GLP+TQRG +S  VVVDRFSKM HF PC+K  D                                    TLW++ DT LK
Subjt:  YTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIID-----------------------------------TTLWKKFDTTLK

Query:  FSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMA----------------------------PRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEV
        FS+TAHPQTDGQT+  NR+L NLIR + G+ P+QWD+A                            P+   DL  LPK   +    E +A++++ +  EV
Subjt:  FSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMA----------------------------PRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEV

Query:  IDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK----------------------HINISPIFNVADLRSY
           + +    YK   + KRRE  F  GD VM  L+K+RFP+G+Y K+K +                      ++NIS  FNVADL  +
Subjt:  IDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK----------------------HINISPIFNVADLRSY

TYK23243.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa] (e-value: 3.3e-140; % identity: 60.81)
Query:  MGLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLS----------------------
        +GL SFYRKFIKNFSSI A M DCLKKG F+W EKQQH+F+S+KRK ASQ VLKL EF++PFEVA+DA  +E  +VLS                      
Subjt:  MGLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLS----------------------

Query:  ----QGAEITTFNHLPTLYETDEDFGEIWSHCT-HFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDI
               EIT FNHLPT   TDEDFG+IWSHCT H HDRDYHLVEGFLFKG+QLCI HTSLREALIKEAHSG LA HFG                     
Subjt:  ----QGAEITTFNHLPTLYETDEDFGEIWSHCT-HFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDI

Query:  NNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT------------------------
                     KGSSSNV LYTPLSI KN+WEDLSIDFV+GLPKTQRGF+S MVVVDRFSKMSHF PCKK  D                         
Subjt:  NNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT------------------------

Query:  -----------TLWKKFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDM----------------------------APRLTFDLTSLPK
                   TLWKK  TTLKFSTTAHPQ DGQT+VTN SL NLI  LSGNHPRQWDM                            APRLTFDLTSLPK
Subjt:  -----------TLWKKFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDM----------------------------APRLTFDLTSLPK

Query:  EVEIQEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK
        EVEI+EE EQLA+RIQKLHTEVIDHITKTT+SYKEEKNKKRREVHFQVGDL+MAHLKKKRF IGTYGK+KDK
Subjt:  EVEIQEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9F1X7 Uncharacterized protein (e-value: 4.6e-92; % identity: 38.24)
Query:  GLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQGA--------------------
        GLA+FYR+FI++FS+I+A + +C+KKG F WGE+ + +F  +K K  + LVL LP F   FEV  DASGV IG VLSQ                      
Subjt:  GLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQGA--------------------

Query:  ----------------------------------------------------------EITTFNHLPTLYETDEDFGEIWSHCTHFHD-RDYHLVEGFLF
                                                                  EI  F+ L  LYE D+DF EIW  C       D++  EG+LF
Subjt:  ----------------------------------------------------------EITTFNHLPTLYETDEDFGEIWSHCTHFHD-RDYHLVEGFLF

Query:  KGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNVWEDLSIDFVMGLPKTQR
        +G+ LC+  TSLRE LI++ H G L GH G+ KT   + +R+YWPQ +RD+ N V++C   Q +KG S N GLY PL IP ++WEDLS+DFV+GLP+TQR
Subjt:  KGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNVWEDLSIDFVMGLPKTQR

Query:  GFNSAMVVVDRFSKMSHFFPCKKIIDT-----------------------------------TLWKKFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYL
        G +S  VVVDR+SKM HF PC+K  D                                    TLWK FDT+L  STTAHPQTDGQT+  NR+L NLIR +
Subjt:  GFNSAMVVVDRFSKMSHFFPCKKIIDT-----------------------------------TLWKKFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYL

Query:  SGNHPRQWDMA----------------------------PRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVG
         G+ P+QWD A                            P+ T DL  LPK   +    E +A+++Q +  EV   + +TT  YK   +K RR   F+ G
Subjt:  SGNHPRQWDMA----------------------------PRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVG

Query:  DLVMAHLKKKRFPIGTYGKIKDK
        D VM  L+K+RFP+GTY K+K K
Subjt:  DLVMAHLKKKRFPIGTYGKIKDK

A0A5A7T6X3 Transposon Ty3-G Gag-Pol polyprotein (e-value: 1.9e-93; % identity: 53.06)
Query:  GEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQG---------------------------AEITTFNHLPTLYETDEDFGEIWSH
        G+KQQH+FDSLKRK ASQ VLKLPEF++PFEVA+DASGV IG VLSQG                           AEIT FNHLPTLYETDEDF      
Subjt:  GEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQG---------------------------AEITTFNHLPTLYETDEDFGEIWSH

Query:  CTHFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNV
                          GDQLCI HTSLREALIKEAHSG L GHFGQ KTFQI+IKR YWPQARRD      +  IF + +                  
Subjt:  CTHFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNV

Query:  WEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT-----------------------------------TLWKKFDTTLKFSTTAHP-QT
         +DL       L  TQRGF+S MVVVDRFSKMSHF PCKK  D                                    TLWKKFDTTL  +   HP Q 
Subjt:  WEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT-----------------------------------TLWKKFDTTLKFSTTAHP-QT

Query:  DGQTKVTNRSLENLIRYLSGNHPRQ--WDMAPRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLK
        D        +  N++   +G  P +  +  APRLTFDLTSLPKEVEIQEE EQLA+RIQKLH EVID ITKTT SYKEEKNKK++EVHFQVGDLVMAHLK
Subjt:  DGQTKVTNRSLENLIRYLSGNHPRQ--WDMAPRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLK

Query:  KKRFPIGTY
        KKRFP+GTY
Subjt:  KKRFPIGTY

A0A5B7BER3 Uncharacterized protein (e-value: 2.7e-92; % identity: 36.18)
Query:  GLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQ----------------------
        GLA+FYR+FI+NFSSI+A + DC+KKG F W + Q+ +F  +K K ++  VL LP F   F+V  DAS   IG VLSQ                      
Subjt:  GLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQ----------------------

Query:  -----------------------------------------------------------------------------------GAEITTFNHLPTLYETD
                                                                                            +EIT+F  L  LY+ D
Subjt:  -----------------------------------------------------------------------------------GAEITTFNHLPTLYETD

Query:  EDFGEIWSHC-THFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGL
        EDF + W+ C       ++H+ +G+LFKG+QLCI  TSLRE ++++ HSG L GH G+ KT  ++ +R+YWPQ +RD+  FV++CPI Q AKG + N GL
Subjt:  EDFGEIWSHC-THFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGL

Query:  YTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT-----------------------------------TLWKKFDTTLK
        YTPL +P+++WEDL++DF++GLP+TQRG +S  VVVDRFSKM+HF PCKK  D                                    TLW+KFDT+L+
Subjt:  YTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT-----------------------------------TLWKKFDTTLK

Query:  FSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMA----------------------------PRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEV
        +S+TAHPQTDGQT+VTNR+L NLIR  SG+ P+QWD+                             P+   DL  LPK        E  A R   +  EV
Subjt:  FSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMA----------------------------PRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEV

Query:  IDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK
          ++ K  + YK   +K RR   F  GDLVM  L+K RFP+GTY K+K++
Subjt:  IDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK

A0A5C7HZV4 Reverse transcriptase (e-value: 7.4e-90; % identity: 34.69)
Query:  GLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQ----------------------
        GLA+FYR+F+++FSSI+A + +CLKKG F W E     F  +K K  +  VL LP F   FEV  DASGV IG VLSQ                      
Subjt:  GLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQ----------------------

Query:  -----------------------------------------------------------------------------------GAEITTFNHLPTLYETD
                                                                                             EI  F  L  LY  D
Subjt:  -----------------------------------------------------------------------------------GAEITTFNHLPTLYETD

Query:  EDFGEIWSHCT-HFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGL
        EDFGE+W  C     D ++H+ EG+LF G+QLCI  +SLRE LI+E H G L GH G+ KT   + +R+YWPQ +RD+ NFV++C ++Q +KG + N GL
Subjt:  EDFGEIWSHCT-HFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGL

Query:  YTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIID-----------------------------------TTLWKKFDTTLK
        Y PL +P  +WEDL++DFV+GLP+TQRG +S  VVVDRFSKM HF PC+K  D                                    TLW++ DT LK
Subjt:  YTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIID-----------------------------------TTLWKKFDTTLK

Query:  FSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMA----------------------------PRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEV
        FS+TAHPQTDGQT+  NR+L NLIR + G+ P+QWD+A                            P+   DL  LPK   +    E +A++++ +  EV
Subjt:  FSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMA----------------------------PRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHTEV

Query:  IDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK----------------------HINISPIFNVADLRSY
           + +    YK   + KRRE  F  GD VM  L+K+RFP+G+Y K+K +                      ++NIS  FNVADL  +
Subjt:  IDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK----------------------HINISPIFNVADLRSY

A0A5D3DI34 Transposon Ty3-I Gag-Pol polyprotein (e-value: 1.6e-140; % identity: 60.81)
Query:  MGLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLS----------------------
        +GL SFYRKFIKNFSSI A M DCLKKG F+W EKQQH+F+S+KRK ASQ VLKL EF++PFEVA+DA  +E  +VLS                      
Subjt:  MGLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLS----------------------

Query:  ----QGAEITTFNHLPTLYETDEDFGEIWSHCT-HFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDI
               EIT FNHLPT   TDEDFG+IWSHCT H HDRDYHLVEGFLFKG+QLCI HTSLREALIKEAHSG LA HFG                     
Subjt:  ----QGAEITTFNHLPTLYETDEDFGEIWSHCT-HFHDRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDI

Query:  NNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT------------------------
                     KGSSSNV LYTPLSI KN+WEDLSIDFV+GLPKTQRGF+S MVVVDRFSKMSHF PCKK  D                         
Subjt:  NNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDT------------------------

Query:  -----------TLWKKFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDM----------------------------APRLTFDLTSLPK
                   TLWKK  TTLKFSTTAHPQ DGQT+VTN SL NLI  LSGNHPRQWDM                            APRLTFDLTSLPK
Subjt:  -----------TLWKKFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDM----------------------------APRLTFDLTSLPK

Query:  EVEIQEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK
        EVEI+EE EQLA+RIQKLHTEVIDHITKTT+SYKEEKNKKRREVHFQVGDL+MAHLKKKRF IGTYGK+KDK
Subjt:  EVEIQEEVEQLAKRIQKLHTEVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDK

SwissProt top hits (e-value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e-value: 8.3e-22; % identity: 26.22)
Query:  NHLPTLYETDEDFGEIWSHCTHFHDRDYHLVEGFLFKG-DQLCI-SHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQ
        N + T Y  D     + ++     + +  L +G L    DQ+ + + T L   +IK+ H      H G      II++RF W   R+ I  +V+ C   Q
Subjt:  NHLPTLYETDEDFGEIWSHCTHFHDRDYHLVEGFLFKG-DQLCI-SHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQ

Query:  RAKGSSSN-VGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKK-------------------------------IIDTTLW
          K  +    G   P+   +  WE LS+DF+  LP++  G+N+  VVVDRFSKM+   PC K                               I  +  W
Subjt:  RAKGSSSN-VGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKK-------------------------------IIDTTLW

Query:  K----KFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQW---------------DMAPRLT-------FDLTSLPKEV-EIQEEVEQLAKRI
        K    K++  +KFS    PQTDGQT+ TN+++E L+R +   HP  W                 A ++T       +     P E+    ++ ++ ++  
Subjt:  K----KFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQW---------------DMAPRLT-------FDLTSLPKEV-EIQEEVEQLAKRI

Query:  QKLHTEVIDHITKTTDSYKEEKNKKRREV-HFQVGDLVMAHLKKKRF
         ++   V +H+       K+  + K +E+  FQ GDLVM    K  F
Subjt:  QKLHTEVIDHITKTTDSYKEEKNKKRREV-HFQVGDLVMAHLKKKRF

P0CT41 Transposon Tf2-12 polyprotein (e-value: 8.3e-22; % identity: 26.22)
Query:  NHLPTLYETDEDFGEIWSHCTHFHDRDYHLVEGFLFKG-DQLCI-SHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQ
        N + T Y  D     + ++     + +  L +G L    DQ+ + + T L   +IK+ H      H G      II++RF W   R+ I  +V+ C   Q
Subjt:  NHLPTLYETDEDFGEIWSHCTHFHDRDYHLVEGFLFKG-DQLCI-SHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQ

Query:  RAKGSSSN-VGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKK-------------------------------IIDTTLW
          K  +    G   P+   +  WE LS+DF+  LP++  G+N+  VVVDRFSKM+   PC K                               I  +  W
Subjt:  RAKGSSSN-VGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKK-------------------------------IIDTTLW

Query:  K----KFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQW---------------DMAPRLT-------FDLTSLPKEV-EIQEEVEQLAKRI
        K    K++  +KFS    PQTDGQT+ TN+++E L+R +   HP  W                 A ++T       +     P E+    ++ ++ ++  
Subjt:  K----KFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQW---------------DMAPRLT-------FDLTSLPKEV-EIQEEVEQLAKRI

Query:  QKLHTEVIDHITKTTDSYKEEKNKKRREV-HFQVGDLVMAHLKKKRF
         ++   V +H+       K+  + K +E+  FQ GDLVM    K  F
Subjt:  QKLHTEVIDHITKTTDSYKEEKNKKRREV-HFQVGDLVMAHLKKKRF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e-value: 2.0e-23; % identity: 25.52)
Query:  RDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSL-AGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNV-GLYTPLSIPKNVWEDL
        ++Y L +  ++  D+L +     + A+++  H  +L  GHFG   T   I   +YWP+ +  I  +++ C   Q  K     + GL  PL I +  W D+
Subjt:  RDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSL-AGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNV-GLYTPLSIPKNVWEDL

Query:  SIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDTT-----------------------------------LWKKFDTTLKFSTTAHPQTDGQTK
        S+DFV GLP T    N  +VVVDRFSK +HF   +K +D T                                   L K+       S+  HPQTDGQ++
Subjt:  SIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDTT-----------------------------------LWKKFDTTLKFSTTAHPQTDGQTK

Query:  VTNRSLENLIRYLSGNHPRQWDM-APRLTF-----------------DLTSLPKEVEIQEEVE---------QLAKRIQKLHTEVIDHITKTTDSYKEEK
         T ++L  L+R     + + W +  P++ F                 DL  LP    I+ + E         +LAK ++ L  +  + +       +   
Subjt:  VTNRSLENLIRYLSGNHPRQWDM-APRLTF-----------------DLTSLPKEVEIQEEVE---------QLAKRIQKLHTEVIDHITKTTDSYKEEK

Query:  NKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDKHI
        N++R+ +   +GD V+ H +   F  G Y K++  ++
Subjt:  NKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDKHI

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e-value: 8.9e-24; % identity: 25.52)
Query:  RDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSL-AGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNV-GLYTPLSIPKNVWEDL
        ++Y L +  ++  D+L +     + A+++  H  +L  GHFG   T   I   +YWP+ +  I  +++ C   Q  K     + GL  PL I +  W D+
Subjt:  RDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSL-AGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNV-GLYTPLSIPKNVWEDL

Query:  SIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDTT-----------------------------------LWKKFDTTLKFSTTAHPQTDGQTK
        S+DFV GLP T    N  +VVVDRFSK +HF   +K +D T                                   L K+       S+  HPQTDGQ++
Subjt:  SIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKKIIDTT-----------------------------------LWKKFDTTLKFSTTAHPQTDGQTK

Query:  VTNRSLENLIRYLSGNHPRQWDM-APRLTF-----------------DLTSLPKEVEIQEEVE---------QLAKRIQKLHTEVIDHITKTTDSYKEEK
         T ++L  L+R  +  + + W +  P++ F                 DL  LP    I+ + E         +LAK ++ L  +  + +       +   
Subjt:  VTNRSLENLIRYLSGNHPRQWDM-APRLTF-----------------DLTSLPKEVEIQEEVE---------QLAKRIQKLHTEVIDHITKTTDSYKEEK

Query:  NKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDKHI
        N++R+ +   +GD V+ H +   F  G Y K++  ++
Subjt:  NKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDKHI

Q9UR07 Transposon Tf2-11 polyprotein (e-value: 8.3e-22; % identity: 26.22)
Query:  NHLPTLYETDEDFGEIWSHCTHFHDRDYHLVEGFLFKG-DQLCI-SHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQ
        N + T Y  D     + ++     + +  L +G L    DQ+ + + T L   +IK+ H      H G      II++RF W   R+ I  +V+ C   Q
Subjt:  NHLPTLYETDEDFGEIWSHCTHFHDRDYHLVEGFLFKG-DQLCI-SHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQ

Query:  RAKGSSSN-VGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKK-------------------------------IIDTTLW
          K  +    G   P+   +  WE LS+DF+  LP++  G+N+  VVVDRFSKM+   PC K                               I  +  W
Subjt:  RAKGSSSN-VGLYTPLSIPKNVWEDLSIDFVMGLPKTQRGFNSAMVVVDRFSKMSHFFPCKK-------------------------------IIDTTLW

Query:  K----KFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQW---------------DMAPRLT-------FDLTSLPKEV-EIQEEVEQLAKRI
        K    K++  +KFS    PQTDGQT+ TN+++E L+R +   HP  W                 A ++T       +     P E+    ++ ++ ++  
Subjt:  K----KFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQW---------------DMAPRLT-------FDLTSLPKEV-EIQEEVEQLAKRI

Query:  QKLHTEVIDHITKTTDSYKEEKNKKRREV-HFQVGDLVMAHLKKKRF
         ++   V +H+       K+  + K +E+  FQ GDLVM    K  F
Subjt:  QKLHTEVIDHITKTTDSYKEEKNKKRREV-HFQVGDLVMAHLKKKRF

Arabidopsis top hits (e-value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e-value: 1.1e-05; % identity: 35.48)
Query:  MGLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPF
        +GL  +YR+F+KN+  I+  + + LKK +  W E     F +LK    +  VL LP+   PF
Subjt:  MGLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPF
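
For this short, ungapped Arabidopsis alignment, the reported % identity can be reproduced from the middle (match) line: letters mark identical residues, while `+` marks conservative substitutions. The sketch below is an illustration under that reading of the midline; the function name `percent_identity` is hypothetical, and the same simple formula is not guaranteed to reproduce the values reported for the longer, gapped alignments on this page.

```python
def percent_identity(query: str, midline: str) -> tuple[int, float]:
    """Return (identical positions, percent identity) for one alignment block.

    Assumption: letters in the midline are identities, '+' and spaces are not,
    and the alignment length equals the length of the aligned query string.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, 100.0 * identities / len(query)


# Query and midline copied from the ATMG00860.1 alignment above.
query = "MGLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPF"
midline = "+GL  +YR+F+KN+  I+  + + LKK +  W E     F +LK    +  VL LP+   PF"

identities, pct = percent_identity(query, midline)
print(f"{identities} identities over {len(query)} positions = {pct:.2f}%")
```

With these inputs the sketch gives 22 identities over 62 aligned positions, which matches the 35.48 listed for this hit.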


Sequences
CDS sequence
ATGGGGTTAGCATCTTTTTACAGGAAGTTTATTAAGAACTTTAGTTCTATAATAGCTCTTATGATTGATTGTTTGAAGAAGGGAGCCTTTTATTGGGGAGAAAAACAGCA
GCACAATTTTGACTCCCTTAAAAGAAAGTTTGCCAGCCAACTAGTCCTCAAATTACCAGAGTTTAACAACCCTTTCGAAGTGGCAATAGACGCCAGTGGTGTGGAAATTG
GTGTTGTCCTTTCCCAAGGAGCTGAAATTACAACATTTAACCATCTTCCAACACTATATGAGACTGATGAAGACTTTGGTGAGATTTGGAGTCATTGTACTCATTTCCAT
GATCGAGATTATCATTTGGTGGAAGGTTTTCTCTTTAAAGGAGACCAACTATGCATTTCACATACTTCCTTAAGGGAAGCCCTAATAAAAGAAGCCCATTCGGGAAGCCT
AGCTGGACACTTTGGCCAAGGAAAGACTTTTCAAATTATCATCAAGAGGTTTTATTGGCCTCAGGCTAGAAGAGACATTAATAACTTTGTGAAAAGGTGTCCTATTTTTC
AAAGAGCAAAAGGATCTTCATCTAATGTTGGTCTCTACACTCCTCTATCGATTCCTAAAAACGTATGGGAGGATTTGTCAATTGATTTCGTAATGGGTCTACCTAAGACT
CAAAGGGGATTTAACTCAGCTATGGTTGTGGTGGACAGATTCAGCAAGATGTCTCACTTCTTCCCTTGTAAAAAAATCATAGACACAACATTGTGGAAGAAGTTTGACAC
CACCTTGAAGTTTAGTACCACTGCTCATCCACAAACGGATGGACAAACTAAGGTAACTAACCGGTCCTTGGAAAATTTAATTCGCTACCTTAGTGGAAACCACCCTAGAC
AATGGGACATGGCACCAAGGTTGACATTCGACCTAACTAGCCTTCCTAAGGAAGTGGAAATCCAAGAGGAAGTTGAACAGTTAGCTAAAAGAATACAGAAACTTCACACA
GAAGTCATTGACCATATTACTAAAACTACTGACTCTTACAAAGAAGAGAAGAATAAGAAGCGAAGGGAAGTACATTTCCAAGTTGGAGATCTTGTAATGGCACATTTGAA
GAAGAAGAGGTTCCCCATTGGAACCTATGGAAAGATAAAGGACAAGCATATTAACATCAGCCCAATCTTCAATGTAGCAGACCTTAGAAGCTACAATGCACCGGATGAAT
TTCAGCTTTCATAG
mRNA sequence
ATGGGGTTAGCATCTTTTTACAGGAAGTTTATTAAGAACTTTAGTTCTATAATAGCTCTTATGATTGATTGTTTGAAGAAGGGAGCCTTTTATTGGGGAGAAAAACAGCA
GCACAATTTTGACTCCCTTAAAAGAAAGTTTGCCAGCCAACTAGTCCTCAAATTACCAGAGTTTAACAACCCTTTCGAAGTGGCAATAGACGCCAGTGGTGTGGAAATTG
GTGTTGTCCTTTCCCAAGGAGCTGAAATTACAACATTTAACCATCTTCCAACACTATATGAGACTGATGAAGACTTTGGTGAGATTTGGAGTCATTGTACTCATTTCCAT
GATCGAGATTATCATTTGGTGGAAGGTTTTCTCTTTAAAGGAGACCAACTATGCATTTCACATACTTCCTTAAGGGAAGCCCTAATAAAAGAAGCCCATTCGGGAAGCCT
AGCTGGACACTTTGGCCAAGGAAAGACTTTTCAAATTATCATCAAGAGGTTTTATTGGCCTCAGGCTAGAAGAGACATTAATAACTTTGTGAAAAGGTGTCCTATTTTTC
AAAGAGCAAAAGGATCTTCATCTAATGTTGGTCTCTACACTCCTCTATCGATTCCTAAAAACGTATGGGAGGATTTGTCAATTGATTTCGTAATGGGTCTACCTAAGACT
CAAAGGGGATTTAACTCAGCTATGGTTGTGGTGGACAGATTCAGCAAGATGTCTCACTTCTTCCCTTGTAAAAAAATCATAGACACAACATTGTGGAAGAAGTTTGACAC
CACCTTGAAGTTTAGTACCACTGCTCATCCACAAACGGATGGACAAACTAAGGTAACTAACCGGTCCTTGGAAAATTTAATTCGCTACCTTAGTGGAAACCACCCTAGAC
AATGGGACATGGCACCAAGGTTGACATTCGACCTAACTAGCCTTCCTAAGGAAGTGGAAATCCAAGAGGAAGTTGAACAGTTAGCTAAAAGAATACAGAAACTTCACACA
GAAGTCATTGACCATATTACTAAAACTACTGACTCTTACAAAGAAGAGAAGAATAAGAAGCGAAGGGAAGTACATTTCCAAGTTGGAGATCTTGTAATGGCACATTTGAA
GAAGAAGAGGTTCCCCATTGGAACCTATGGAAAGATAAAGGACAAGCATATTAACATCAGCCCAATCTTCAATGTAGCAGACCTTAGAAGCTACAATGCACCGGATGAAT
TTCAGCTTTCATAG
Protein sequence
MGLASFYRKFIKNFSSIIALMIDCLKKGAFYWGEKQQHNFDSLKRKFASQLVLKLPEFNNPFEVAIDASGVEIGVVLSQGAEITTFNHLPTLYETDEDFGEIWSHCTHFH
DRDYHLVEGFLFKGDQLCISHTSLREALIKEAHSGSLAGHFGQGKTFQIIIKRFYWPQARRDINNFVKRCPIFQRAKGSSSNVGLYTPLSIPKNVWEDLSIDFVMGLPKT
QRGFNSAMVVVDRFSKMSHFFPCKKIIDTTLWKKFDTTLKFSTTAHPQTDGQTKVTNRSLENLIRYLSGNHPRQWDMAPRLTFDLTSLPKEVEIQEEVEQLAKRIQKLHT
EVIDHITKTTDSYKEEKNKKRREVHFQVGDLVMAHLKKKRFPIGTYGKIKDKHINISPIFNVADLRSYNAPDEFQLS
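
The protein sequence corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the first and last codons of the CDS can be translated as follows; the helper name `translate` is hypothetical, and only short fragments copied from the CDS above are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1.

    Trailing bases that do not fill a complete codon are ignored.
    With to_stop=True, translation ends before the first stop codon.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nt and last 18 nt of the CDS shown above.
print(translate("ATGGGGTTAGCATCTTTTTACAGGAAG"))
print(translate("GAATTTCAGCTTTCATAG"))
```

The two fragments translate to `MGLASFYRK` and `EFQLS`, which agree with the start and end of the protein sequence listed above; the final `TAG` is the stop codon.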