; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G21300 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G21300
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr6:19288370..19292500
RNA-Seq ExpressionCSPI06G21300
SyntenyCSPI06G21300
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001951 - Histone H4
IPR009072 - Histone-fold
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.0e-9060.42Show/hide
Query:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY
        PI+ +    LC  F E EIH  L++F+NN++PGPD FT+EF K  W +LK++I  +F DF    IIN  VN T IALIAKKEKC++  DYRPISL T++Y
Subjt:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY

Query:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI
        KLIAK + ERLK TLP+T++++QMAFVK RQI DAIL+ NEAIDYW+VKK +GFVIKLDI KAFDK+NW FID+MLMKK Y  +WR WI +CISSV YSI
Subjt:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI

Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQS----IKVHHISHKCRQLQN
        +ING+P GKI+P+RGIR+GDP+SPFIFVLAMDY SRL+  V +  KIKGV      NLTHLL +    + V    H  + L+N
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQS----IKVHHISHKCRQLQN

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.6e-9363.74Show/hide
Query:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY
        PI+   + +LC PF ESEI ST+ SF+N + PGPD +T+ F KKHW  LK D+  VF DF + GI+NN VN T+IALI+KKEKCSK  DYRPISL T+LY
Subjt:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY

Query:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI
        K++AK +  RLK  LP TI+++QMAF+K RQI DAILI NEAID WK +K +GFV+KLDI KAFDKI+WSFIDYML KK++  +WR+WI +CIS+V YSI
Subjt:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI

Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLL
        L+NG P G+IK  RGIR+GDPLSPFIFVLAMDY SRL+ H++ +G IKGV FNN  N++HLL
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLL

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-9263.36Show/hide
Query:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY
        PI+   + +LC PF ESEI ST+ SF+N + PGPD +T+ F KKHW  LK D+  VF DF + GI+NN VN T+IALI+KKEKCSK  DYRPISL T+LY
Subjt:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY

Query:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI
        K++AK +  RLK  LP TI+++QMAF+K RQI DAILI NE ID WK +K +GFV+KLDI KAFDKI+WSFIDYML KK++  +WR+WI +CIS+V YSI
Subjt:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI

Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLL
        L+NG P G+IK  RGIR+GDPLSPFIFVLAMDY SRL+ H++ +G IKGV FNN  N++HLL
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLL

TYK21642.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.3e-9060.07Show/hide
Query:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY
        PI+ +    LC  F E EIH+ L++F+NN++PGPD FT+EF K  W +LK++I  +F DF    IIN  VN T IALIAKKEKC++  DYRPISL T++Y
Subjt:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY

Query:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI
        KLIAK + ERLK TLP T++++QMAFVK RQI DAIL+ NEAIDYW+VKK +GFVIKLDI KAFDK+NW FID+MLMKK Y  +WR+WI +CISSV YSI
Subjt:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI

Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQS----IKVHHISHKCRQLQN
        +ING+P GKI+P+RGIR+GDP+SPFIFVLAMDY SRL+  V +  KIKG+      NLTHLL +    + V    H  + L+N
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQS----IKVHHISHKCRQLQN

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]1.0e-9060.42Show/hide
Query:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY
        PI+ +    LC  F E EIH  L++F+NN++PGPD FT+EF K  W +LK++I  +F DF    IIN  VN T IALIAKKEKC++  DYRPISL T++Y
Subjt:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY

Query:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI
        KLIAK + ERLK TLP+T++++QMAFVK RQI DAIL+ NEAIDYW+VKK +GFVIKLDI KAFDK+NW FID+MLMKK Y  +WR WI +CISSV YSI
Subjt:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI

Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQS----IKVHHISHKCRQLQN
        +ING+P GKI+P+RGIR+GDP+SPFIFVLAMDY SRL+  V +  KIKGV      NLTHLL +    + V    H  + L+N
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQS----IKVHHISHKCRQLQN

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein5.1e-9160.42Show/hide
Query:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY
        PI+ +    LC  F E EIH  L++F+NN++PGPD FT+EF K  W +LK++I  +F DF    IIN  VN T IALIAKKEKC++  DYRPISL T++Y
Subjt:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY

Query:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI
        KLIAK + ERLK TLP+T++++QMAFVK RQI DAIL+ NEAIDYW+VKK +GFVIKLDI KAFDK+NW FID+MLMKK Y  +WR WI +CISSV YSI
Subjt:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI

Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQS----IKVHHISHKCRQLQN
        +ING+P GKI+P+RGIR+GDP+SPFIFVLAMDY SRL+  V +  KIKGV      NLTHLL +    + V    H  + L+N
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQS----IKVHHISHKCRQLQN

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein1.1e-9064.12Show/hide
Query:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY
        PI++   + L  PF E+EI  TL SFA N+ PGPD + ++FL+K W  +KQ+I  +F DF    IIN +VNET I LIAKKE C  A D+RPISL TA+Y
Subjt:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY

Query:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI
        KLIAKT+ +RLK TLP TIS+ QMAFVK RQIT+AILI NEA+D+W+ KK RGFVIKLDI KAFDK+NW FID++LMKKNYS +WR+ I SCISSV YSI
Subjt:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI

Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLL
        LING+P G+IKP+RGIR+GDPLSPFIFVLAMDY SRL+ ++  + KI GV F+   NLTH+L
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLL

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein3.2e-9363.74Show/hide
Query:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY
        PI+   + +LC PF ESEI ST+ SF+N + PGPD +T+ F KKHW  LK D+  VF DF + GI+NN VN T+IALI+KKEKCSK  DYRPISL T+LY
Subjt:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY

Query:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI
        K++AK +  RLK  LP TI+++QMAF+K RQI DAILI NEAID WK +K +GFV+KLDI KAFDKI+WSFIDYML KK++  +WR+WI +CIS+V YSI
Subjt:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI

Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLL
        L+NG P G+IK  RGIR+GDPLSPFIFVLAMDY SRL+ H++ +G IKGV FNN  N++HLL
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLL

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein5.4e-9363.36Show/hide
Query:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY
        PI+   + +LC PF ESEI ST+ SF+N + PGPD +T+ F KKHW  LK D+  VF DF + GI+NN VN T+IALI+KKEKCSK  DYRPISL T+LY
Subjt:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY

Query:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI
        K++AK +  RLK  LP TI+++QMAF+K RQI DAILI NE ID WK +K +GFV+KLDI KAFDKI+WSFIDYML KK++  +WR+WI +CIS+V YSI
Subjt:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI

Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLL
        L+NG P G+IK  RGIR+GDPLSPFIFVLAMDY SRL+ H++ +G IKGV FNN  N++HLL
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLL

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein5.1e-9160.42Show/hide
Query:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY
        PI+ +    LC  F E EIH  L++F+NN++PGPD FT+EF K  W +LK++I  +F DF    IIN  VN T IALIAKKEKC++  DYRPISL T++Y
Subjt:  PITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALY

Query:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI
        KLIAK + ERLK TLP+T++++QMAFVK RQI DAIL+ NEAIDYW+VKK +GFVIKLDI KAFDK+NW FID+MLMKK Y  +WR WI +CISSV YSI
Subjt:  KLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSI

Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQS----IKVHHISHKCRQLQN
        +ING+P GKI+P+RGIR+GDP+SPFIFVLAMDY SRL+  V +  KIKGV      NLTHLL +    + V    H  + L+N
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQS----IKVHHISHKCRQLQN

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.2e-2027.97Show/hide
Query:  PPITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEK-CSKADDYRPISLRTA
        P +     + L  P   SEI + ++S    ++PGPD FT EF +++ + L   +  +F    ++GI+ N   E  I LI K  +  +K +++RPISL   
Subjt:  PPITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEK-CSKADDYRPISLRTA

Query:  LYKLIAKTMTERLKVTLPHTISDHQMAFVKERQ----ITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCIS
          K++ K +  R++  +   I   Q+ F+   Q    I  +I ++       + K     +I +D  KAFDKI   F+   L K    G + + I +   
Subjt:  LYKLIAKTMTERLKVTLPHTISDHQMAFVKERQ----ITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCIS

Query:  SVHYSILINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNE
            +I++NG+         G R+G PLSP +F + ++  +R IR   Q+ +IKG+    E
Subjt:  SVHYSILINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNE

P08548 LINE-1 reverse transcriptase homolog4.1e-2128.68Show/hide
Query:  PPITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQ----KGIINNIVNETYIALIAKKEK-CSKADDYRPIS
        P ++    + L  P   SEI ST+ +    ++PGPD FT EF    ++  K+++  + ++ FQ    +GI+ N   E  I LI K  K  ++ ++YRPIS
Subjt:  PPITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQ----KGIINNIVNETYIALIAKKEK-CSKADDYRPIS

Query:  LRTALYKLIAKTMTERLKVTLPHTISDHQMAFVKERQ----ITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIH
        L     K++ K +T R++  +   I   Q+ F+   Q    I  +I ++       K+K     ++ +D  KAFD I   F+   L K    G + + I 
Subjt:  LRTALYKLIAKTMTERLKVTLPHTISDHQMAFVKERQ----ITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIH

Query:  SCISSVHYSILINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNE
        +  S    +I++NG          G R+G PLSP +F + M+  +  IR   ++  IKG+   +E
Subjt:  SCISSVHYSILINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNE

P11369 LINE-1 retrotransposable element ORF2 protein9.7e-2329.76Show/hide
Query:  PPITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQK----GIINNIVNETYIALIAKKEK-CSKADDYRPIS
        P +     D L  P    EI + ++S    ++PGPD F+ EF    ++  K+D+  +    F K    G + N   E  I LI K +K  +K +++RPIS
Subjt:  PPITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQK----GIINNIVNETYIALIAKKEK-CSKADDYRPIS

Query:  LRTALYKLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYW-KVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCI
        L     K++ K +  R++  +   I   Q+ F+   Q    I      I Y  K+K     +I LD  KAFDKI   F+  +L +    G +   I +  
Subjt:  LRTALYKLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYW-KVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCI

Query:  SSVHYSILINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQSIKVHHIS---HKCRQLQN
        S    +I +NG+    I    G R+G PLSP++F + ++  +R IR   QQ +IKG+    E     LL    + +IS   +  R+L N
Subjt:  SSVHYSILINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFNNEFNLTHLLQSIKVHHIS---HKCRQLQN

P14381 Transposon TX1 uncharacterized 149 kDa protein3.7e-2227.27Show/hide
Query:  GEPPITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRT
        G P +++  K++L  P    E+   L    +N++PG D  TIEF +  W  L  D   V  + F+KG +        ++L+ KK       ++RP+SL +
Subjt:  GEPPITDSLKDQLCLPFRESEIHSTLSSFANNETPGPDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRT

Query:  ALYKLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVH
          YK++AK ++ RLK  L   I   Q   V  R I D + ++ + + + +        + LD  KAFD+++  ++   L   ++  Q+  ++ +  +S  
Subjt:  ALYKLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVH

Query:  YSILINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIR
          + IN   +  +   RG+R+G PLS  ++ LA++ F  L+R
Subjt:  YSILINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIR

P92555 Uncharacterized mitochondrial protein AtMg012501.9e-1046.03Show/hide
Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGV-CFNNEFNLTHLL
        +ING P G + P+RG+R+GDPLSP++F+L  +  S L R  Q+QG++ G+   NN   + HLL
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGV-CFNNEFNLTHLL

Arabidopsis top hitse value%identityAlignment
AT1G07660.1 Histone superfamily protein3.6e-0458.14Show/hide
Query:  ERNSNVLKIFLKNFIRNIVTYTEHTLQKTDTTMDVAAKQYSLK
        E    VLKIFL+N IR+ VTYTEH  +KT T MDV    Y+LK
Subjt:  ERNSNVLKIFLKNFIRNIVTYTEHTLQKTDTTMDVAAKQYSLK

AT1G07820.1 Histone superfamily protein3.6e-0458.14Show/hide
Query:  ERNSNVLKIFLKNFIRNIVTYTEHTLQKTDTTMDVAAKQYSLK
        E    VLKIFL+N IR+ VTYTEH  +KT T MDV    Y+LK
Subjt:  ERNSNVLKIFLKNFIRNIVTYTEHTLQKTDTTMDVAAKQYSLK

AT1G07820.2 Histone superfamily protein3.6e-0458.14Show/hide
Query:  ERNSNVLKIFLKNFIRNIVTYTEHTLQKTDTTMDVAAKQYSLK
        E    VLKIFL+N IR+ VTYTEH  +KT T MDV    Y+LK
Subjt:  ERNSNVLKIFLKNFIRNIVTYTEHTLQKTDTTMDVAAKQYSLK

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.7e-0634.57Show/hide
Query:  MTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKK-TRGF-VIKLDIAKAFDKINWSFIDYMLMKKNYSGQW
        M ERLK  + + I   Q +F+  R  TD I+ V EA+   + KK  +G+ ++KLD+ KA+D+I W +++  L+   +   W
Subjt:  MTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAIDYWKVKK-TRGF-VIKLDIAKAFDKINWSFIDYMLMKKNYSGQW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.3e-1146.03Show/hide
Query:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGV-CFNNEFNLTHLL
        +ING P G + P+RG+R+GDPLSP++F+L  +  S L R  Q+QG++ G+   NN   + HLL
Subjt:  LINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGV-CFNNEFNLTHLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTTGTTTAGGTTGTTGATTCTGCACCTACTGGGTCGGTTGAAAATGCATTTGTGTTTCAACAATGAAGTAAATACACAACAACCTACACACAATGTATCACAGAT
TATTGATGTTGATTCCTTTGATTCACCAAATGATGCAGTACCTATTGATGTCGAAGTAAAGTTCAGAACTAAGTCAATACTGAAGAAAGCGATTTATATGTTGGCTGTGA
ACAATAGTTTTGAATTTGTTACAGTTAGGTCCAACCGCACATCATTTGACATTCGATGCAAAGATATGTTGTGTTCAAGGCAAGCAACATCCTGGAATATTGTTTTTGAG
TGTACGAAGTCATTTTTTAAAATGAATGACAAGGCTCCATGCCGCCCTTCTGATGTTATTAATTACATGAAAATTAATCACGGGGTAAATTTAAGTTATGATAAGGCCTG
GAGAGAACGTGAGATTGCATTGAATTCCATTAGGGGTGCCCCGGAGGAATTGTATGCAGTGTTGGGGGCATTCACAGATGCATTGATTAGGAATAATCCATGTGAATGTA
TACGGCTGAAGAATCAGACGATGAATGTCGATTCAATTGATGGTGCAGCGTTGAAGACAAAATATCTTGATACGCTCATTTCTGTTTGTACTATTAATGGAAATTCTCAA
ATTGTGCCACTAGCTATTCAGAGAATAACTTGTCATGGCACTACTACAAAAAGAGGATTACTTGACGCCTGCAATTTAGAATTACTTGACAGTTTTAATAAAAACTGTCA
ACAACAATGTCCAACAGAGGGAAAGGAGGAAAAGGTCTTGAAAAGGGAAAGAAACTCGAATGTTCTCAAGATCTTCCTAAAAAATTTCATTCGCAACATCGTCACTTACA
CTGAGCACACTCTCCAGAAGACTGACACCACCATGGATGTTGCTGCAAAACAGTATTCCCTAAAATCTTCTCTTCTTTGGATTGAGAACTTAGAGTTTTTAGATGAACAT
TGTGAGAGTAGTTGGATTAGGATTTTCTCACTCACTCCCAATTCTAATTCCATTTCCAAAATATCATTCTCTTTGCAATCTCCGCCGTCTGCAACCATCCCCCATCTCTC
TCGTCGTCGCCCTCCTTCTCTTCCTTCCTTCATTCAGCTCAACAAGAAGCGGCGGGGAGTTCTTCGTAAGGCAGAGGCTACGGCGGAGAAGATGGGAGTGCCAAGATCAG
GAGGGGAGCCACCCATTACAGATAGTCTCAAGGATCAACTTTGCCTCCCTTTTCGAGAGTCAGAAATTCATTCAACTCTTAGCTCTTTTGCAAACAACGAAACCCCGGGT
CCAGATAGGTTTACTATTGAATTCCTAAAAAAACATTGGAAAATTCTGAAACAAGACATCAAGATTGTTTTTGTTGATTTCTTTCAAAAGGGGATCATAAATAATATTGT
AAATGAGACTTACATTGCCCTTATTGCTAAGAAAGAAAAATGCTCCAAAGCTGATGACTACAGACCTATTAGCCTTAGAACAGCCTTATACAAACTAATTGCAAAAACCA
TGACAGAAAGACTCAAAGTCACCCTCCCTCACACCATATCAGACCATCAGATGGCCTTTGTTAAAGAGAGGCAGATTACTGATGCCATTCTCATTGTCAATGAAGCCATT
GATTACTGGAAAGTCAAGAAAACAAGGGGATTTGTGATCAAGCTGGATATCGCAAAGGCTTTCGACAAAATCAATTGGAGCTTTATAGATTATATGTTAATGAAAAAAAA
CTACTCTGGACAGTGGAGGAGGTGGATTCATTCGTGTATTAGCAGTGTACATTACTCAATCCTCATCAATGGAAAACCCAGCGGTAAAATCAAACCTACTAGAGGCATAC
GACGAGGTGATCCACTTTCTCCTTTCATATTCGTACTTGCCATGGATTATTTCAGCAGGTTGATTCGACATGTTCAACAACAAGGTAAAATTAAAGGAGTCTGCTTCAAT
AATGAATTCAACCTCACACATCTGCTTCAATCTATCAAAGTCCACCATATCTCCCATAAATGTCGACAACTGCAGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTTTGTTTAGGTTGTTGATTCTGCACCTACTGGGTCGGTTGAAAATGCATTTGTGTTTCAACAATGAAGTAAATACACAACAACCTACACACAATGTATCACAGAT
TATTGATGTTGATTCCTTTGATTCACCAAATGATGCAGTACCTATTGATGTCGAAGTAAAGTTCAGAACTAAGTCAATACTGAAGAAAGCGATTTATATGTTGGCTGTGA
ACAATAGTTTTGAATTTGTTACAGTTAGGTCCAACCGCACATCATTTGACATTCGATGCAAAGATATGTTGTGTTCAAGGCAAGCAACATCCTGGAATATTGTTTTTGAG
TGTACGAAGTCATTTTTTAAAATGAATGACAAGGCTCCATGCCGCCCTTCTGATGTTATTAATTACATGAAAATTAATCACGGGGTAAATTTAAGTTATGATAAGGCCTG
GAGAGAACGTGAGATTGCATTGAATTCCATTAGGGGTGCCCCGGAGGAATTGTATGCAGTGTTGGGGGCATTCACAGATGCATTGATTAGGAATAATCCATGTGAATGTA
TACGGCTGAAGAATCAGACGATGAATGTCGATTCAATTGATGGTGCAGCGTTGAAGACAAAATATCTTGATACGCTCATTTCTGTTTGTACTATTAATGGAAATTCTCAA
ATTGTGCCACTAGCTATTCAGAGAATAACTTGTCATGGCACTACTACAAAAAGAGGATTACTTGACGCCTGCAATTTAGAATTACTTGACAGTTTTAATAAAAACTGTCA
ACAACAATGTCCAACAGAGGGAAAGGAGGAAAAGGTCTTGAAAAGGGAAAGAAACTCGAATGTTCTCAAGATCTTCCTAAAAAATTTCATTCGCAACATCGTCACTTACA
CTGAGCACACTCTCCAGAAGACTGACACCACCATGGATGTTGCTGCAAAACAGTATTCCCTAAAATCTTCTCTTCTTTGGATTGAGAACTTAGAGTTTTTAGATGAACAT
TGTGAGAGTAGTTGGATTAGGATTTTCTCACTCACTCCCAATTCTAATTCCATTTCCAAAATATCATTCTCTTTGCAATCTCCGCCGTCTGCAACCATCCCCCATCTCTC
TCGTCGTCGCCCTCCTTCTCTTCCTTCCTTCATTCAGCTCAACAAGAAGCGGCGGGGAGTTCTTCGTAAGGCAGAGGCTACGGCGGAGAAGATGGGAGTGCCAAGATCAG
GAGGGGAGCCACCCATTACAGATAGTCTCAAGGATCAACTTTGCCTCCCTTTTCGAGAGTCAGAAATTCATTCAACTCTTAGCTCTTTTGCAAACAACGAAACCCCGGGT
CCAGATAGGTTTACTATTGAATTCCTAAAAAAACATTGGAAAATTCTGAAACAAGACATCAAGATTGTTTTTGTTGATTTCTTTCAAAAGGGGATCATAAATAATATTGT
AAATGAGACTTACATTGCCCTTATTGCTAAGAAAGAAAAATGCTCCAAAGCTGATGACTACAGACCTATTAGCCTTAGAACAGCCTTATACAAACTAATTGCAAAAACCA
TGACAGAAAGACTCAAAGTCACCCTCCCTCACACCATATCAGACCATCAGATGGCCTTTGTTAAAGAGAGGCAGATTACTGATGCCATTCTCATTGTCAATGAAGCCATT
GATTACTGGAAAGTCAAGAAAACAAGGGGATTTGTGATCAAGCTGGATATCGCAAAGGCTTTCGACAAAATCAATTGGAGCTTTATAGATTATATGTTAATGAAAAAAAA
CTACTCTGGACAGTGGAGGAGGTGGATTCATTCGTGTATTAGCAGTGTACATTACTCAATCCTCATCAATGGAAAACCCAGCGGTAAAATCAAACCTACTAGAGGCATAC
GACGAGGTGATCCACTTTCTCCTTTCATATTCGTACTTGCCATGGATTATTTCAGCAGGTTGATTCGACATGTTCAACAACAAGGTAAAATTAAAGGAGTCTGCTTCAAT
AATGAATTCAACCTCACACATCTGCTTCAATCTATCAAAGTCCACCATATCTCCCATAAATGTCGACAACTGCAGAACTGA
Protein sequenceShow/hide protein sequence
MILFRLLILHLLGRLKMHLCFNNEVNTQQPTHNVSQIIDVDSFDSPNDAVPIDVEVKFRTKSILKKAIYMLAVNNSFEFVTVRSNRTSFDIRCKDMLCSRQATSWNIVFE
CTKSFFKMNDKAPCRPSDVINYMKINHGVNLSYDKAWREREIALNSIRGAPEELYAVLGAFTDALIRNNPCECIRLKNQTMNVDSIDGAALKTKYLDTLISVCTINGNSQ
IVPLAIQRITCHGTTTKRGLLDACNLELLDSFNKNCQQQCPTEGKEEKVLKRERNSNVLKIFLKNFIRNIVTYTEHTLQKTDTTMDVAAKQYSLKSSLLWIENLEFLDEH
CESSWIRIFSLTPNSNSISKISFSLQSPPSATIPHLSRRRPPSLPSFIQLNKKRRGVLRKAEATAEKMGVPRSGGEPPITDSLKDQLCLPFRESEIHSTLSSFANNETPG
PDRFTIEFLKKHWKILKQDIKIVFVDFFQKGIINNIVNETYIALIAKKEKCSKADDYRPISLRTALYKLIAKTMTERLKVTLPHTISDHQMAFVKERQITDAILIVNEAI
DYWKVKKTRGFVIKLDIAKAFDKINWSFIDYMLMKKNYSGQWRRWIHSCISSVHYSILINGKPSGKIKPTRGIRRGDPLSPFIFVLAMDYFSRLIRHVQQQGKIKGVCFN
NEFNLTHLLQSIKVHHISHKCRQLQN