; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G01990 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G01990
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase Ty1/copia-type domain-containing protein
Genome locationChr7:1691571..1692767
RNA-Seq ExpressionCSPI07G01990
SyntenyCSPI07G01990
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6472693.1 hypothetical protein ZIOFF_070170 [Zingiber officinale]1.4e-14575.71Show/hide
Query:  SLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR-----------------------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCN
        SLK+LGF+KC QE A+YTR ++E  +LVGVYVDDLIVTGR                             IEVEQQK RI+L+Q  YAK+ILSQF MADCN
Subjt:  SLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR-----------------------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCN

Query:  ATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLA
        ATK PMEPK QLHKD E  PIDAT+YR + GCLRYLL+TRPDLSY VGM SRYME+PT+MH+KVVKQILRYL+GTI+FGL YTKGP+E NIFGYSDSDLA
Subjt:  ATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLA

Query:  GDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFI
        GDLD RKSTSGMTFY NESLVSWNSQKQKTVALSSCEAEF+AATTAACQALWLR LVSE+ G EP+PVTLFVDNKSAIALMKNPVFHGR+KHIDT FHFI
Subjt:  GDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFI

Query:  RECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN
        RECV+NGQI+VE +NTGEQRADVLTKAL GVKLA MRQLLGVR LE CQN
Subjt:  RECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN

KAG6479166.1 hypothetical protein ZIOFF_062627 [Zingiber officinale]1.5e-14475.14Show/hide
Query:  SLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR-----------------------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCN
        SLK+LGF+KC QE A+YTR ++E  +LVGVYVDDLIVTGR                             IEVEQQK RI+L+Q  YAK+ILSQF MADCN
Subjt:  SLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR-----------------------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCN

Query:  ATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLA
        ATK PMEPK QLHKD E  PIDAT+YR + GCLRYLL+TRPDLSY VGM SRYME+PT+MH+KVVKQILRYL+GTI+FGL YTKGP+E NIFGYSDSDLA
Subjt:  ATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLA

Query:  GDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFI
        GDLD RKSTSGM FY NESLVSWNSQKQKTV LSSCEAEF+AATTAACQALWLR LVSE+ G EP+PVTLFVDNKSAIALMKNPVFHGR+KHIDT FHFI
Subjt:  GDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFI

Query:  RECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN
        RECV+NGQI+VE +NTGEQRADVLTKAL GVKLA MRQLLGVR LE CQN
Subjt:  RECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN

KAG6483537.1 hypothetical protein ZIOFF_060185 [Zingiber officinale]8.3e-14374.79Show/hide
Query:  LKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR-----------------------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNA
        L +LGF+KC QE A+YTR E E  +LVGVYVDDLIVTG                              IEVEQQK RI+L+Q TYAK+ILSQF MADCNA
Subjt:  LKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR-----------------------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNA

Query:  TKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAG
        TK+PMEPK QLHKD E  P+DAT+YR + GCLRYLL+TRPDLSY VGMASRYMERPTTMH+KVVKQILRYL+GTIHFGLTY KGP+E +IFGYSDSDLAG
Subjt:  TKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAG

Query:  DLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIR
        DLD RKSTSGMTFY NESLVSWNSQKQKTVALSSCEAEF+AATTAAC ALWLR L SE+ G +P+PVTLFVDNKS+IALMKNPVFHGR+KHIDT FHFIR
Subjt:  DLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIR

Query:  ECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN
        ECV+NGQI+VE +NTGEQRADVLTKAL GVKLA MRQLLGV  LE CQN
Subjt:  ECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN

KAG6503176.1 hypothetical protein ZIOFF_035487 [Zingiber officinale]4.9e-14374.57Show/hide
Query:  SLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR-----------------------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCN
        SLK+LGF+KC QE A+YTR ++E  +LVGVYVDDLIVTGR                             IEVEQQK RI+++Q  YAK+ILSQF MADCN
Subjt:  SLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR-----------------------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCN

Query:  ATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLA
        ATK PMEPK QLHKD E  PIDAT+YR + GCLRYLL+TRPDLSY VGMASRYME+PT+MH+KVVKQILRYL+GTI+FGLTY KG +E +IFGYSDSDLA
Subjt:  ATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLA

Query:  GDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFI
        GDLD RKSTSGMTFY NESLVSWNSQKQKTVALSSCEAEF+AATTAACQALWLR LVSE+ G EP+PVTLFVDNKSA+ALMKNPVFHGR+KHIDT FHFI
Subjt:  GDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFI

Query:  RECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN
        RECV+NGQI+VE +NTGEQRADVLTKAL GVKLA MRQLLGV  LE CQN
Subjt:  RECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN

XP_031741734.1 uncharacterized protein LOC116403928 [Cucumis sativus]1.7e-17279.65Show/hide
Query:  MENWKRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR------------
        MENWKRK  + +        +  +          LRQAP+AWNIRLDRSLKDLGFRKCTQEQ +YTRRE+EE VLVGVYVDDLIV G             
Subjt:  MENWKRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR------------

Query:  -----------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASR
                         IEVEQQ GRI+LKQPTYAK ILSQFGMADCNATKYPMEPKAQLHK+T+ APIDAT+YRSI GCLRYLLNTRPDLSY VGMASR
Subjt:  -----------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASR

Query:  YMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALW
        Y+ERPTTMHYKVVKQIL YLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLD RKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALW
Subjt:  YMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALW

Query:  LRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN
        LRCLVSEIV MEPRP+TLFVDNKSAIALMKNPVFHGRNKHIDT FHFIRECV+NGQIIVE VNTGEQRADVLTKALTGVKLAAMRQLLGVR LESCQN
Subjt:  LRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN

TrEMBL top hitse value%identityAlignment
A0A0P0XB91 Os08g0125300 protein2.3e-13061.17Show/hide
Query:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR----------------
        + ++YV  PEGF    E+H V RLS ALYGLRQAP+AWN RLD+ LK+LGF +CTQEQA+YTR + +  V+VGVYVDDLIVTG                 
Subjt:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR----------------

Query:  -------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER
                     IEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +  PIDAT+YR + GCLRYLL+TRPDLSY VG+ASR+MER
Subjt:  -------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER

Query:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL
        PTTMH K VK ILRYL+GT+  GL +  G    +I G++DSDLAGD+D R+ST GM FY+N SLVSW SQKQKTVALSSCEAEF+AAT AAC ALWLR L
Subjt:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL

Query:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN
        +SE++G E + V LFVDNKSAIALMKNPVFHGR+KHIDT +HFIRECV++GQI++E V + EQRAD +TK L   KLA  R LLGVR L   Q+
Subjt:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN

A0A5K0VEQ6 Reverse transcriptase Ty1/copia-type domain-containing protein (Fragment)3.9e-13063.98Show/hide
Query:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR----------------
        + ++YVT PEGF V N++H V +LS ALYGLRQAP+ WN +LDRSLK LGF KC  EQA+YTR +    ++V VYVDDLIVTG                 
Subjt:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR----------------

Query:  -------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER
                     IEVEQ +  I +KQ TYAK++L QFGM DCN TK PMEP++QL+KD E  P+DAT+YR   GCLRYL++TRPDL Y VG+ASR+MER
Subjt:  -------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER

Query:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL
        PT MH+K VKQILRYL+GTI+FGL YT+G  E  I G++DSDLAGD+D RKS  GM FY+NESLVSWNSQKQKTVALSSCEAEF+AAT AACQALWLR L
Subjt:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL

Query:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKAL
        + E++G EP+ V L+VDNKSAIALMKNPVFHGR+KHIDT FHFIRECV+ G IIV+ V T EQRAD++TKAL
Subjt:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKAL

B8BDZ6 Uncharacterized protein1.6e-13161.42Show/hide
Query:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR----------------
        + ++YV  PEGF    E+H V RLS ALYGLRQAP+AWN RLD+ LK+LGF +CTQEQA+YTR + +  V+VGVYVDDLIVTG                 
Subjt:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR----------------

Query:  -------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER
                     IEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +  PIDAT+YR + GCLRYLL+TRPDLSY VG+ASR+MER
Subjt:  -------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER

Query:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL
        PTTMH K VK ILRYL+GT+  GL +  G    +I G++DSDLAGD+D R+ST GM FY+N SLVSW SQKQKTVALSSCEAEF+AAT AAC ALWLR L
Subjt:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL

Query:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN
        +SE++G E +PV LFVDNKSAIALMKNPVFHGR+KHIDT +HFIRECV++GQI++E V++ EQRAD +TK L   KLA  R LLGVR L   Q+
Subjt:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN

Q0J8A6 Os08g0125300 protein2.3e-13061.17Show/hide
Query:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR----------------
        + ++YV  PEGF    E+H V RLS ALYGLRQAP+AWN RLD+ LK+LGF +CTQEQA+YTR + +  V+VGVYVDDLIVTG                 
Subjt:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR----------------

Query:  -------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER
                     IEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +  PIDAT+YR + GCLRYLL+TRPDLSY VG+ASR+MER
Subjt:  -------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER

Query:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL
        PTTMH K VK ILRYL+GT+  GL +  G    +I G++DSDLAGD+D R+ST GM FY+N SLVSW SQKQKTVALSSCEAEF+AAT AAC ALWLR L
Subjt:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL

Query:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN
        +SE++G E + V LFVDNKSAIALMKNPVFHGR+KHIDT +HFIRECV++GQI++E V + EQRAD +TK L   KLA  R LLGVR L   Q+
Subjt:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN

Q10RM4 Retrotransposon protein, putative, unclassified2.4e-13260.91Show/hide
Query:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR----------------
        + ++YV  PEGF    ++H V +L  ALYGLRQAP+AWNIRLDRSL++LGF +CTQEQA+YTR    + ++VGVYVDDLIVTG                 
Subjt:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR----------------

Query:  -------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER
                     IEV+Q +    LKQ  YAK++LSQFGM +CN+   P++P++QL KD E  P+DAT+YR I G LRYLL+TRPDLSY VG+ASR+MER
Subjt:  -------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER

Query:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL
        PT MH+K VKQILRY++GT+ +GL Y  G     I GY+DSDLAGDLD R+ST GM FY+N+SLV+W+SQKQKTVALSSCEAEF+AATTAACQALWLR L
Subjt:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL

Query:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN
        ++E+ G+E + V LFVDN+SAIALMKNPVFHGR+KHIDT +HFIRECV  GQI+VE V T EQRAD LTK L   KL   R LLGVR L S QN
Subjt:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-5434.78Show/hide
Query:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIY--TRREKEEYVLVGVYVDDLIV-TG--------------
        K +IY+  P+G    ++   V +L+ A+YGL+QA + W    +++LK+  F   + ++ IY   +    E + V +YVDD+++ TG              
Subjt:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIY--TRREKEEYVLVGVYVDDLIV-TG--------------

Query:  --------------RIEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRY-LLNTRPDLSYVVGMASRY
                       I +E Q+ +I L Q  Y K+ILS+F M +CNA   P+  K        +   + T  RS+ GCL Y +L TRPDL+  V + SRY
Subjt:  --------------RIEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRY-LLNTRPDLSYVVGMASRY

Query:  MERPTTMHYKVVKQILRYLRGTIHFGLTYTKG-PREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNE-SLVSWNSQKQKTVALSSCEAEFIAATTAACQAL
          +  +  ++ +K++LRYL+GTI   L + K    E  I GY DSD AG    RKST+G  F + + +L+ WN+++Q +VA SS EAE++A   A  +AL
Subjt:  MERPTTMHYKVVKQILRYLRGTIHFGLTYTKG-PREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNE-SLVSWNSQKQKTVALSSCEAEFIAATTAACQAL

Query:  WLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGV
        WL+ L++ I      P+ ++ DN+  I++  NP  H R KHID  +HF RE V+N  I +E + T  Q AD+ TK L   +   +R  LG+
Subjt:  WLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-6033.84Show/hide
Query:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKE-EYVLVGVYVDDLIVTG----------------
        + +IY+  PEGFEV  +KH V +L+ +LYGL+QAP+ W ++ D  +K   + K   +  +Y +R  E  ++++ +YVDD+++ G                
Subjt:  KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKE-EYVLVGVYVDDLIVTG----------------

Query:  ---------------RIEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKD------TEEAPIDATKYRSIFGCLRY-LLNTRPDLSYV
                       +I  E+   ++ L Q  Y +R+L +F M +      P+    +L K        E+  +    Y S  G L Y ++ TRPD+++ 
Subjt:  ---------------RIEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKD------TEEAPIDATKYRSIFGCLRY-LLNTRPDLSYV

Query:  VGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTA
        VG+ SR++E P   H++ VK ILRYLRGT    L +  G  +  + GY+D+D+AGD+D RKS++G  F  +   +SW S+ QK VALS+ EAE+IAAT  
Subjt:  VGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTA

Query:  ACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGV
          + +WL+  + E+ G+  +   ++ D++SAI L KN ++H R KHID  +H+IRE V +  + V  ++T E  AD+LTK +   K    ++L+G+
Subjt:  ACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGV

P92519 Uncharacterized mitochondrial protein AtMg008101.6e-2736.07Show/hide
Query:  IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQIL
        I+++     + L Q  YA++IL+  GM DC     P+  K      T + P D + +RSI G L+YL  TRPD+SY V +  + M  PT   + ++K++L
Subjt:  IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQIL

Query:  RYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALW
        RY++GTI  GL Y     + N+  + DSD AG    R+ST+G   +L  +++SW++++Q TV+ SS E E+ A    A +  W
Subjt:  RYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-5634.2Show/hide
Query:  IYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR-------------------
        +Y++ P GF   +  + V +L  ALYGL+QAP+AW + L   L  +GF     + +++  +  +  V + VYVDD+++TG                    
Subjt:  IYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR-------------------

Query:  ----------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTT
                  IE ++    + L Q  Y   +L++  M        PM P  +L   +     D T+YR I G L+YL  TRPD+SY V   S++M  PT 
Subjt:  ----------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTT

Query:  MHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSE
         H + +K+ILRYL GT + G+   KG    ++  YSD+D AGD D   ST+G   YL    +SW+S+KQK V  SS EAE+ +    + +  W+  L++E
Subjt:  MHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSE

Query:  IVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKL
        +     RP  ++ DN  A  L  NPVFH R KHI   +HFIR  V++G + V  V+T +Q AD LTK L+          +GV ++
Subjt:  IVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.5e-5734.96Show/hide
Query:  KIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR------------------
        ++Y++ P GF   +    V RL  A+YGL+QAP+AW + L   L  +GF     + +++  +     + + VYVDD+++TG                   
Subjt:  KIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR------------------

Query:  -----------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPM--EPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER
                   IE ++    + L Q  Y   +L++  M        PM   PK  LH  T+    D T+YR I G L+YL  TRPDLSY V   S+YM  
Subjt:  -----------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPM--EPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER

Query:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL
        PT  H+  +K++LRYL GT   G+   KG    ++  YSD+D AGD D   ST+G   YL    +SW+S+KQK V  SS EAE+ +    + +  W+  L
Subjt:  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL

Query:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKL
        ++E+      P  ++ DN  A  L  NPVFH R KHI   +HFIR  V++G + V  V+T +Q AD LTK L+ V      + +GV K+
Subjt:  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.8e-4229.89Show/hide
Query:  KIYVTHPEGFEVPN----EKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR--------------
        +IY+  P G+          + V  L  ++YGL+QA + W ++   +L   GF +   +   + +     ++ V VYVDD+I+                 
Subjt:  KIYVTHPEGFEVPN----EKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGR--------------

Query:  ---------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYM
                       +E+ +    I + Q  YA  +L + G+  C  +  PM+P       +    +DA  YR + G L YL  TR D+S+ V   S++ 
Subjt:  ---------------IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYM

Query:  ERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLR
        E P   H + V +IL Y++GT+  GL Y+    E  +  +SD+      D R+ST+G   +L  SL+SW S+KQ+ V+ SS EAE+ A + A  + +WL 
Subjt:  ERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLR

Query:  CLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRE
            E+     +P  LF DN +AI +  N VFH R KHI++  H +RE
Subjt:  CLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.5e-0633.77Show/hide
Query:  YLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSG
        YL  TRPDL++ V   S++     T   + V ++L Y++GT+  GL Y+    +  +  ++DSD A   D R+S +G
Subjt:  YLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSG

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-2836.07Show/hide
Query:  IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQIL
        I+++     + L Q  YA++IL+  GM DC     P+  K      T + P D + +RSI G L+YL  TRPD+SY V +  + M  PT   + ++K++L
Subjt:  IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQIL

Query:  RYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALW
        RY++GTI  GL Y     + N+  + DSD AG    R+ST+G   +L  +++SW++++Q TV+ SS E E+ A    A +  W
Subjt:  RYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATTGGAAGAGGAAGATATATGTTACTCATCCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAAGGTGTATAGATTGTCAAATGCTCTTTACGGATTGAGGCA
AGCTCCACAAGCTTGGAACATTCGACTTGATAGGAGTCTCAAAGATCTTGGTTTTAGAAAATGCACTCAAGAGCAAGCAATCTACACAAGAAGAGAAAAAGAGGAATATG
TTCTTGTTGGAGTGTATGTTGACGATCTCATTGTAACAGGAAGAATTGAAGTTGAACAACAGAAGGGTCGAATCATGCTCAAACAACCAACTTATGCCAAAAGAATTTTG
TCCCAATTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCATAAAGACACAGAAGAAGCACCAATTGATGCTACGAAGTATAGAAG
CATCTTTGGTTGTCTTAGATACTTACTGAACACAAGGCCAGATCTTTCATATGTTGTTGGGATGGCGAGTAGGTATATGGAAAGGCCTACAACCATGCATTACAAGGTGG
TCAAGCAAATACTTAGGTATTTGAGAGGGACGATTCATTTTGGGCTCACTTATACGAAAGGTCCCAGAGAATTCAATATATTCGGTTACTCAGACAGTGATTTAGCTGGT
GATCTCGACAGGAGGAAAAGCACAAGTGGAATGACATTCTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACGGTGGCACTCTCATCTTGCGAAGC
CGAGTTCATTGCAGCTACTACCGCAGCTTGCCAAGCATTGTGGTTAAGATGCCTTGTTAGCGAGATAGTCGGAATGGAGCCAAGGCCGGTAACATTATTTGTGGACAACA
AATCCGCGATAGCTCTCATGAAGAATCCCGTATTTCATGGTCGCAACAAGCACATAGATACATGTTTTCATTTCATTCGAGAGTGTGTCAAGAATGGACAAATTATCGTT
GAAGTTGTCAACACTGGAGAACAACGAGCCGATGTCCTGACTAAAGCATTGACGGGAGTAAAGTTAGCTGCTATGCGTCAACTACTTGGTGTTCGTAAGTTAGAATCATG
CCAGAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAATTGGAAGAGGAAGATATATGTTACTCATCCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAAGGTGTATAGATTGTCAAATGCTCTTTACGGATTGAGGCA
AGCTCCACAAGCTTGGAACATTCGACTTGATAGGAGTCTCAAAGATCTTGGTTTTAGAAAATGCACTCAAGAGCAAGCAATCTACACAAGAAGAGAAAAAGAGGAATATG
TTCTTGTTGGAGTGTATGTTGACGATCTCATTGTAACAGGAAGAATTGAAGTTGAACAACAGAAGGGTCGAATCATGCTCAAACAACCAACTTATGCCAAAAGAATTTTG
TCCCAATTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCATAAAGACACAGAAGAAGCACCAATTGATGCTACGAAGTATAGAAG
CATCTTTGGTTGTCTTAGATACTTACTGAACACAAGGCCAGATCTTTCATATGTTGTTGGGATGGCGAGTAGGTATATGGAAAGGCCTACAACCATGCATTACAAGGTGG
TCAAGCAAATACTTAGGTATTTGAGAGGGACGATTCATTTTGGGCTCACTTATACGAAAGGTCCCAGAGAATTCAATATATTCGGTTACTCAGACAGTGATTTAGCTGGT
GATCTCGACAGGAGGAAAAGCACAAGTGGAATGACATTCTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACGGTGGCACTCTCATCTTGCGAAGC
CGAGTTCATTGCAGCTACTACCGCAGCTTGCCAAGCATTGTGGTTAAGATGCCTTGTTAGCGAGATAGTCGGAATGGAGCCAAGGCCGGTAACATTATTTGTGGACAACA
AATCCGCGATAGCTCTCATGAAGAATCCCGTATTTCATGGTCGCAACAAGCACATAGATACATGTTTTCATTTCATTCGAGAGTGTGTCAAGAATGGACAAATTATCGTT
GAAGTTGTCAACACTGGAGAACAACGAGCCGATGTCCTGACTAAAGCATTGACGGGAGTAAAGTTAGCTGCTATGCGTCAACTACTTGGTGTTCGTAAGTTAGAATCATG
CCAGAATTAG
Protein sequenceShow/hide protein sequence
MENWKRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTRREKEEYVLVGVYVDDLIVTGRIEVEQQKGRIMLKQPTYAKRIL
SQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAG
DLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIV
EVVNTGEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN