; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G01060 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G01060
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRibonuclease H-like domain, Reverse transcriptase, RNA-dependent DNA polymerase
Genome locationChr5:1381513..1382867
RNA-Seq ExpressionCSPI05G01060
SyntenyCSPI05G01060
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ABF94034.1 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]1.9e-10846.44Show/hide
Query:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQFYSDEFENL
        GRKP L HL+VFGC A+ KNT PHLKKLDDRS+P VY GV+E  KAHRL+DP RG++ +S DV+F+EN+ W W      G+E T+F + +    +  E L
Subjt:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQFYSDEFENL

Query:  EDAETRVENVLPH----------------ATEIPA-----------IGETSPSPPSMNTTILL------------------RSLSDIYANTEEV-VGGDE
            T    V P+                A E+P+            G  +P  PS N+  ++                  RSL+D+      V +  DE
Subjt:  EDAETRVENVLPH----------------ATEIPA-----------IGETSPSPPSMNTTILL------------------RSLSDIYANTEEV-VGGDE

Query:  QENEVMMVVSEEPTCFQEAVTEGRC------------------LTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARL
         + E ++  SEEP+ ++EA  +                     L  LP GH+ IGLKWV+KLKK+  GE++KHK RLVAKGYVQ+QG++FEEVFAPVARL
Subjt:  QENEVMMVVSEEPTCFQEAVTEGRC------------------LTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARL

Query:  DTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEEC
        DT+RV+LA+AA++ W                 EEVYV QPEGF    ++H V +L KALYGLRQAPRAWNIRLDRSL++LGF +C QEQAVY R    + 
Subjt:  DTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEEC

Query:  VLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN
        ++VGVYVDDLIVTG +  ++  FK+QMM +FEMSDLG L+YYLGIEV+Q +    LKQ  YAK++LSQFGM +CN+ +
Subjt:  VLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN

EEC84282.1 hypothetical protein OsI_30754 [Oryza sativa Indica Group]8.5e-10948.83Show/hide
Query:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQ---------
        GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GV+E  KAHRL+DP  G++ +S DVIF+EN+ W W+ VV+  +  TEF V +          
Subjt:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQ---------

Query:  --FYSDEFENLEDA--ETRVENVLPHATEIPAIGETSPS--PPSMNTT---------------------ILLRSLSDIYANTEEV-VGGDEQENEVMMVV
          +    +     A  ++ + + +  +  +P   ++SPS  PPS   T                     +  RSL DI      V +  DE + + ++  
Subjt:  --FYSDEFENLEDA--ETRVENVLPHATEIPAIGETSPS--PPSMNTT---------------------ILLRSLSDIYANTEEV-VGGDEQENEVMMVV

Query:  SEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILAL
         EEP+ ++EA  +                     LT LP GHK IGLKWV+KLKK+  GEV+KHK RLVAKGYVQRQG++FEEVFAPVARLDT+RVILA+
Subjt:  SEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILAL

Query:  AANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDD
        AA++ W                 EEVYV QPEGF    E+H V RLSKALYGLRQAPRAWN RLD+ LK+LGF +C QEQAVY R + +  V+VGVYVDD
Subjt:  AANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDD

Query:  LIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN
        LIVTG + +++  FKQQMM +FEMSDLG LSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T+
Subjt:  LIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN

KAB8107251.1 hypothetical protein EE612_041900 [Oryza sativa]1.1e-10849.04Show/hide
Query:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQ---------
        GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GV+E  KAHRL+DP  G++ +S DVIF+EN+ W W+ VV+  +  TEF V +          
Subjt:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQ---------

Query:  --FYSDEFENLEDAETRVENVLP--HATEIPAIGETSPS--PPSMNTT---------------------ILLRSLSDIYANTEEV-VGGDEQENEVMMVV
          +    +     A  +     P   ++ +P   ++SPS  PPS   T                     +  RSL DI      V +  DE + + ++  
Subjt:  --FYSDEFENLEDAETRVENVLP--HATEIPAIGETSPS--PPSMNTT---------------------ILLRSLSDIYANTEEV-VGGDEQENEVMMVV

Query:  SEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILAL
         EEP+ ++EA  +                     LT LP GHK IGLKWV+KLKK+  GEV+KHK RLVAKGYVQRQG++FEEVFAPVARLDT+RVILA+
Subjt:  SEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILAL

Query:  AANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDD
        AA++ W                 EEVYV QPEGF    E+H V RLSKALYGLRQAPRAWN RLD+ LK+LGF +C QEQAVY R + +  V+VGVYVDD
Subjt:  AANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDD

Query:  LIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN
        LIVTG +  ++  FKQQMM +FEMSDLG LSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T+
Subjt:  LIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN

KAF5794275.1 putative RNA-directed DNA polymerase [Helianthus annuus]5.5e-10043.37Show/hide
Query:  RKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQIS--SDVIFQENLEWAWNEVVSDGKEITEFQVIDQFYSDEFEN
        ++P L  ++VFGC+ +V     H+ KLDDRS PMVY G++  C  HR YDP + +L ++   DV+F+E ++W W +     K ++  Q I         N
Subjt:  RKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQIS--SDVIFQENLEWAWNEVVSDGKEITEFQVIDQFYSDEFEN

Query:  LEDAETRVENVLPH---ATEIPAIGETSP------------------------------SPPSMNTTIL-----------LRSLSDIYANTEEVVGGDEQ
        LED+ T   N  P      E+   G+ SP                               P  M++               R+++D+Y NTEE+V     
Subjt:  LEDAETRVENVLPH---ATEIPAIGETSP------------------------------SPPSMNTTIL-----------LRSLSDIYANTEEVVGGDEQ

Query:  ENEVMMVVSEEPTCFQ----------------EAVTEGR--CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLD
        E +V+M  SEEPTC++                EA+      CLTELP G K+IGLKW+FK+KKDP G++ K+K RLVAKGYVQ++GI+ EEVFAPVARL+
Subjt:  ENEVMMVVSEEPTCFQ----------------EAVTEGR--CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLD

Query:  TIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECV
        T+R+ILALA  + W                 EEVYV QP GFEV  ++ KVY+L++ALYGL+QAPRAWN RLD++LK +GF KC QE A+Y ++  +  +
Subjt:  TIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECV

Query:  LVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNA
        +VGVYVDDL+VTGS+ ++V  FK QMM ++EMSDLG LSYYLGIEVEQ +G + + Q +YAKRI+   GM DCNA
Subjt:  LVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNA

XP_042756658.1 uncharacterized protein LOC111883520 [Lactuca sativa]1.9e-10044.47Show/hide
Query:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGK------EITEFQVIDQFYS
        GRKPH+ HLRVFGCVA++K    HLKKL+DRS  +VY G ++  KAHRL DP  G + +S DVIF+EN  W W + +           I  F   ++ YS
Subjt:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGK------EITEFQVIDQFYS

Query:  DEFENLEDA-------------------------ETRVENVLPHATEIPAIGET---SPSPPSMNTTIL-------------------LRSLSDIYANTE
        DE + + D                           T+  N  P+ + IP    T   +P PPS   +                      R LSD+Y NT 
Subjt:  DEFENLEDA-------------------------ETRVENVLPHATEIPAIGET---SPSPPSMNTTIL-------------------LRSLSDIYANTE

Query:  EVVGGDEQENEVMMVVS--EEPTCFQEAVTEGRC------------------LTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFE
        E+    E   E +M+VS  EEP  + EA  +                     L +LP G + IGLKWVFK+K+DP G ++KHK R+VAKGY+Q+QGI++E
Subjt:  EVVGGDEQENEVMMVVS--EEPTCFQEAVTEGRC------------------LTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFE

Query:  EVFAPVARLDTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAV
        EVFAPVAR++T+RVILALA +  W                 EEVYV+QPEGF   NE  KVY+LSKALYGL+QAPRAWN  LD+ LK LGFR+C QE +V
Subjt:  EVFAPVARLDTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAV

Query:  YIRREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATNT
        Y R +    +++G+YVDDL+VTGS+ E + +FK++M A+FEMSDLG LSYYLGIEV QQ   I LKQ  YAK IL +  M DCN T T
Subjt:  YIRREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATNT

TrEMBL top hitse value%identityAlignment
A0A0P0XB91 Os08g0125300 protein5.4e-10949.04Show/hide
Query:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQ---------
        GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GV+E  KAHRL+DP  G++ +S DVIF+EN+ W W+ VV+  +  TEF V +          
Subjt:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQ---------

Query:  --FYSDEFENLEDAETRVENVLP--HATEIPAIGETSPS--PPSMNTT---------------------ILLRSLSDIYANTEEV-VGGDEQENEVMMVV
          +    +     A  +     P   ++ +P   ++SPS  PPS   T                     +  RSL DI      V +  DE + + ++  
Subjt:  --FYSDEFENLEDAETRVENVLP--HATEIPAIGETSPS--PPSMNTT---------------------ILLRSLSDIYANTEEV-VGGDEQENEVMMVV

Query:  SEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILAL
         EEP+ ++EA  +                     LT LP GHK IGLKWV+KLKK+  GEV+KHK RLVAKGYVQRQG++FEEVFAPVARLDT+RVILA+
Subjt:  SEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILAL

Query:  AANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDD
        AA++ W                 EEVYV QPEGF    E+H V RLSKALYGLRQAPRAWN RLD+ LK+LGF +C QEQAVY R + +  V+VGVYVDD
Subjt:  AANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDD

Query:  LIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN
        LIVTG +  ++  FKQQMM +FEMSDLG LSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T+
Subjt:  LIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN

A0A251UCI2 Putative ribonuclease H-like domain, Reverse transcriptase, RNA-dependent DNA polymerase1.1e-9844.73Show/hide
Query:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWN---EVVSDGK-EITEFQVIDQFYSDE
        GRKP+L HL+VFGC AY K      KKLDDRS+PMVY G++E  KA+RLYDP + K+ +S DV F E   W W+   E V  G  E T F + +     E
Subjt:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWN---EVVSDGK-EITEFQVIDQFYSDE

Query:  FE----------------------NLEDAETRVENVLPHATEI---------------PAIGETSPSPPSMNTTILLRSLSDIYANTEEVVGGDEQENEV
         E                       ++  E +   V P A                  P+  E++    S NT + LRSL ++Y  TEE+    + +   
Subjt:  FE----------------------NLEDAETRVENVLPHATEI---------------PAIGETSPSPPSMNTTILLRSLSDIYANTEEVVGGDEQENEV

Query:  MMVVSEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRV
        +++  EEP  ++EA ++ +                   LTELP  HK IGLKWVFK KKD NG +V+HK RLVAKGYVQ+ GI+F+EVFAP+AR++T+R+
Subjt:  MMVVSEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRV

Query:  ILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGV
        +LALAA Q W                 EEVYV+QPEGF  P ++ KVYRLSKALYGLRQAPRAWN +LD++LK L F+KC  E A+Y R  E   ++VGV
Subjt:  ILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGV

Query:  YVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATNT
        YVDDLIVTG+S ++++ FK QM  KF+MSDLG L+YYLGIEV Q  G I +KQ  Y  +IL    M+ CN T T
Subjt:  YVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATNT

B8BDZ6 Uncharacterized protein4.1e-10948.83Show/hide
Query:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQ---------
        GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GV+E  KAHRL+DP  G++ +S DVIF+EN+ W W+ VV+  +  TEF V +          
Subjt:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQ---------

Query:  --FYSDEFENLEDA--ETRVENVLPHATEIPAIGETSPS--PPSMNTT---------------------ILLRSLSDIYANTEEV-VGGDEQENEVMMVV
          +    +     A  ++ + + +  +  +P   ++SPS  PPS   T                     +  RSL DI      V +  DE + + ++  
Subjt:  --FYSDEFENLEDA--ETRVENVLPHATEIPAIGETSPS--PPSMNTT---------------------ILLRSLSDIYANTEEV-VGGDEQENEVMMVV

Query:  SEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILAL
         EEP+ ++EA  +                     LT LP GHK IGLKWV+KLKK+  GEV+KHK RLVAKGYVQRQG++FEEVFAPVARLDT+RVILA+
Subjt:  SEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILAL

Query:  AANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDD
        AA++ W                 EEVYV QPEGF    E+H V RLSKALYGLRQAPRAWN RLD+ LK+LGF +C QEQAVY R + +  V+VGVYVDD
Subjt:  AANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDD

Query:  LIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN
        LIVTG + +++  FKQQMM +FEMSDLG LSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T+
Subjt:  LIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN

Q0J8A6 Os08g0125300 protein5.4e-10949.04Show/hide
Query:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQ---------
        GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GV+E  KAHRL+DP  G++ +S DVIF+EN+ W W+ VV+  +  TEF V +          
Subjt:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQ---------

Query:  --FYSDEFENLEDAETRVENVLP--HATEIPAIGETSPS--PPSMNTT---------------------ILLRSLSDIYANTEEV-VGGDEQENEVMMVV
          +    +     A  +     P   ++ +P   ++SPS  PPS   T                     +  RSL DI      V +  DE + + ++  
Subjt:  --FYSDEFENLEDAETRVENVLP--HATEIPAIGETSPS--PPSMNTT---------------------ILLRSLSDIYANTEEV-VGGDEQENEVMMVV

Query:  SEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILAL
         EEP+ ++EA  +                     LT LP GHK IGLKWV+KLKK+  GEV+KHK RLVAKGYVQRQG++FEEVFAPVARLDT+RVILA+
Subjt:  SEEPTCFQEAVTEGR------------------CLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILAL

Query:  AANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDD
        AA++ W                 EEVYV QPEGF    E+H V RLSKALYGLRQAPRAWN RLD+ LK+LGF +C QEQAVY R + +  V+VGVYVDD
Subjt:  AANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDD

Query:  LIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN
        LIVTG +  ++  FKQQMM +FEMSDLG LSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T+
Subjt:  LIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN

Q10RM4 Retrotransposon protein, putative, unclassified9.2e-10946.44Show/hide
Query:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQFYSDEFENL
        GRKP L HL+VFGC A+ KNT PHLKKLDDRS+P VY GV+E  KAHRL+DP RG++ +S DV+F+EN+ W W      G+E T+F + +    +  E L
Subjt:  GRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQFYSDEFENL

Query:  EDAETRVENVLPH----------------ATEIPA-----------IGETSPSPPSMNTTILL------------------RSLSDIYANTEEV-VGGDE
            T    V P+                A E+P+            G  +P  PS N+  ++                  RSL+D+      V +  DE
Subjt:  EDAETRVENVLPH----------------ATEIPA-----------IGETSPSPPSMNTTILL------------------RSLSDIYANTEEV-VGGDE

Query:  QENEVMMVVSEEPTCFQEAVTEGRC------------------LTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARL
         + E ++  SEEP+ ++EA  +                     L  LP GH+ IGLKWV+KLKK+  GE++KHK RLVAKGYVQ+QG++FEEVFAPVARL
Subjt:  QENEVMMVVSEEPTCFQEAVTEGRC------------------LTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARL

Query:  DTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEEC
        DT+RV+LA+AA++ W                 EEVYV QPEGF    ++H V +L KALYGLRQAPRAWNIRLDRSL++LGF +C QEQAVY R    + 
Subjt:  DTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEEC

Query:  VLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN
        ++VGVYVDDLIVTG +  ++  FK+QMM +FEMSDLG L+YYLGIEV+Q +    LKQ  YAK++LSQFGM +CN+ +
Subjt:  VLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.8e-3824.1Show/hide
Query:  RKPHLAHLRVFGCVAYVKNTTPHLK----KLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQE------------------------------
        +KP+L HLRVFG   YV     H+K    K DD+S   ++ G +      +L+D    K  ++ DV+  E                              
Subjt:  RKPHLAHLRVFGCVAYVKNTTPHLK----KLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQE------------------------------

Query:  NLEWAWNEVVSDGKEITEFQVIDQFYSDEFENL-EDAETRVENVLP---------------------------------HATEIPAIGETSPS-------
        + +    E  ++ KE    Q +      E +N   D+   ++   P                                 H  E    G  + S       
Subjt:  NLEWAWNEVVSDGKEITEFQVIDQFYSDEFENL-EDAETRVENVLP---------------------------------HATEIPAIGETSPS-------

Query:  --------PPSMNTTILLRSLSDIYANTEEVVGGDEQENEVMMVVSEEPTCFQEA----------------------------VTEGRCLTELPLGHKLI
                 P+ N  I + +       T+  +  +E++N +  VV    T F +                             +     +T+ P    ++
Subjt:  --------PPSMNTTILLRSLSDIYANTEEVVGGDEQENEVMMVVSEEPTCFQEA----------------------------VTEGRCLTELPLGHKLI

Query:  GLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYR
          +WVF +K +  G  +++K RLVA+G+ Q+  I++EE FAPVAR+ + R IL+L    +                  EE+Y+  P+G    ++   V +
Subjt:  GLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKVYR

Query:  LSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYI--RREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKG
        L+KA+YGL+QA R W    +++LK+  F     ++ +YI  +    E + V +YVDD+++      ++N FK+ +M KF M+DL  + +++GI +E Q+ 
Subjt:  LSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYI--RREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKG

Query:  RILLKQPTYAKRILSQFGMADCNATNT
        +I L Q  Y K+ILS+F M +CNA +T
Subjt:  RILLKQPTYAKRILSQFGMADCNATNT

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-5432.36Show/hide
Query:  AHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQEN--------LEWAWNEVVSD------------GKEITEF
        +HL+VFGC A+         KLDD+S P ++ G  +    +RL+DP + K+  S DV+F+E+         E   N ++ +              E T  
Subjt:  AHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQEN--------LEWAWNEVVSD------------GKEITEF

Query:  QVIDQFYSDEFENLEDAETRVENVLPHATEIPAIGETSPSPPSMNTTILLRSLSDIYANTEEVVGGDEQE-------------NEVMMVVSEEPTCFQEA
        +V +Q      E +E  E   E V     E P  GE    P  +  +   R  S  Y +TE V+  D++E             N++M  + EE    Q+ 
Subjt:  QVIDQFYSDEFENLEDAETRVENVLPHATEIPAIGETSPSPPSMNTTILLRSLSDIYANTEEVVGGDEQE-------------NEVMMVVSEEPTCFQEA

Query:  VTEGRCLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALAANQSW-----------------EEVYV
         T    L ELP G + +  KWVFKLKKD + ++V++K RLV KG+ Q++GI+F+E+F+PV ++ +IR IL+LAA+                    EE+Y+
Subjt:  VTEGRCLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALAANQSW-----------------EEVYV

Query:  TQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRR-EEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDL
         QPEGFEV  +KH V +L+K+LYGL+QAPR W ++ D  +K   + K   +  VY +R  E   +++ +YVDD+++ G     + K K  +   F+M DL
Subjt:  TQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRR-EEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDL

Query:  GFLSYYLGIEV--EQQKGRILLKQPTYAKRILSQFGMADCNATNT
        G     LG+++  E+   ++ L Q  Y +R+L +F M +    +T
Subjt:  GFLSYYLGIEV--EQQKGRILLKQPTYAKRILSQFGMADCNATNT

P25600 Putative transposon Ty5-1 protein YCL074W1.4e-1632.43Show/hide
Query:  EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFE
        E +YV QP GF        V+ L   +YGL+QAP  WN  ++ +LK +GF +   E  +Y R   +  + + VYVDDL+V   S +  ++ KQ++   + 
Subjt:  EEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFE

Query:  MSDLGFLSYYLGIEVEQ-QKGRILLKQPTYAKRILSQFGMADCNATNT
        M DLG +  +LG+ + Q   G I L    Y  +  S+  +     T T
Subjt:  MSDLGFLSYYLGIEVEQ-QKGRILLKQPTYAKRILSQFGMADCNATNT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.6e-3331.72Show/hide
Query:  LIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKV
        ++G +W+F  K + +G + ++K RLVAKGY QR G+++ E F+PV +  +IR++L +A ++SW                 ++VY++QP GF   +  + V
Subjt:  LIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKV

Query:  YRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKG
         +L KALYGL+QAPRAW + L   L  +GF   + + ++++ +  +  V + VYVDD+++TG+    ++     +  +F + D   L Y+LGIE ++   
Subjt:  YRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKG

Query:  RILLKQPTYAKRILSQFGMADCNATNT
         + L Q  Y   +L++  M       T
Subjt:  RILLKQPTYAKRILSQFGMADCNATNT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-3232.88Show/hide
Query:  LIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKV
        ++G +W+F  K + +G + ++K RLVAKGY QR G+++ E F+PV +  +IR++L +A ++SW                 +EVY++QP GF   +    V
Subjt:  LIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALAANQSW-----------------EEVYVTQPEGFEVPNEKHKV

Query:  YRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKG
         RL KA+YGL+QAPRAW + L   L  +GF   I + ++++ +     + + VYVDD+++TG+ T  +      +  +F + +   L Y+LGIE ++   
Subjt:  YRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKG

Query:  RILLKQPTYAKRILSQFGM
         + L Q  Y   +L++  M
Subjt:  RILLKQPTYAKRILSQFGM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-3332.2Show/hide
Query:  LPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALAANQSW-----------------EEVYVTQPEGFEVP
        LP   K IG KWV+K+K + +G + ++K RLVAKGY Q++GI+F E F+PV +L ++++ILA++A  ++                 EE+Y+  P G+   
Subjt:  LPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALAANQSW-----------------EEVYVTQPEGFEVP

Query:  N----EKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYY
               + V  L K++YGL+QA R W ++   +L   GF +   +   +++      + V VYVDD+I+  ++   V++ K Q+ + F++ DLG L Y+
Subjt:  N----EKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYY

Query:  LGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN
        LG+E+ +    I + Q  YA  +L + G+  C  ++
Subjt:  LGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATN

ATMG00810.1 DNA/RNA polymerases superfamily protein6.7e-1140Show/hide
Query:  VYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATNT
        +YVDD+++TGSS   +N    Q+ + F M DLG + Y+LGI+++     + L Q  YA++IL+  GM DC   +T
Subjt:  VYVDDLIVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATNT

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.3e-1047.62Show/hide
Query:  PLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALA
        P+   ++G KWVFK K   +G + + K RLVAKG+ Q +GI F E ++PV R  TIR IL +A
Subjt:  PLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYVQRQGINFEEVFAPVARLDTIRVILALA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGTAAAGAACACAACCCCTCACCTCAAGAAACTCGACGATCGAAGCTCACCAATGGT
ATATTTTGGTGTCAAAGAAAGATGCAAAGCCCATCGCTTATATGACCCAGGTCGTGGAAAACTACAAATTAGTAGTGATGTTATTTTTCAAGAGAATCTTGAATGGGCTT
GGAACGAAGTTGTTAGTGACGGTAAGGAGATTACAGAGTTTCAAGTGATTGACCAATTTTATTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTAGGGTTGAAAAT
GTCTTACCACATGCAACTGAGATACCTGCGATTGGAGAGACCAGTCCATCTCCTCCGTCGATGAACACAACGATCCTTCTAAGATCTCTCAGTGACATCTACGCCAACAC
AGAGGAAGTTGTAGGTGGTGATGAACAAGAGAATGAGGTGATGATGGTAGTGTCCGAAGAACCAACTTGTTTCCAAGAAGCTGTTACAGAGGGCCGCTGTCTGACCGAGC
TTCCACTAGGACATAAACTCATTGGTCTAAAATGGGTGTTCAAATTGAAGAAAGACCCTAATGGAGAAGTTGTCAAGCACAAAACAAGATTGGTTGCTAAAGGCTATGTA
CAAAGACAAGGCATTAACTTTGAAGAAGTTTTTGCACCGGTTGCAAGACTTGACACCATTCGAGTCATTCTTGCACTCGCTGCAAACCAAAGTTGGGAGGAAGTATATGT
TACTCAACCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAAGGTGTATAGATTGTCAAAAGCTCTGTACGGATTGAGGCAAGCTCCACGAGCTTGGAACATTCGACTTG
ATAGGAGTCTCAAAGATCTTGGTTTTAGAAAATGCATTCAAGAGCAAGCAGTCTACATAAGAAGAGAAGAAGAGGAATGTGTTCTTGTTGGTGTGTATGTTGACGATCTC
ATTGTAACAGGAAGTAGCACTGAAAAGGTCAATAAGTTCAAGCAACAAATGATGGCAAAATTTGAAATGAGCGACTTAGGCTTTCTCTCTTACTACTTAGGAATTGAAGT
TGAACAACAGAAGGGTCGAATCCTGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCCCAGTTTGGAATGGCTGATTGCAATGCCACAAACACCAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGTAAAGAACACAACCCCTCACCTCAAGAAACTCGACGATCGAAGCTCACCAATGGT
ATATTTTGGTGTCAAAGAAAGATGCAAAGCCCATCGCTTATATGACCCAGGTCGTGGAAAACTACAAATTAGTAGTGATGTTATTTTTCAAGAGAATCTTGAATGGGCTT
GGAACGAAGTTGTTAGTGACGGTAAGGAGATTACAGAGTTTCAAGTGATTGACCAATTTTATTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTAGGGTTGAAAAT
GTCTTACCACATGCAACTGAGATACCTGCGATTGGAGAGACCAGTCCATCTCCTCCGTCGATGAACACAACGATCCTTCTAAGATCTCTCAGTGACATCTACGCCAACAC
AGAGGAAGTTGTAGGTGGTGATGAACAAGAGAATGAGGTGATGATGGTAGTGTCCGAAGAACCAACTTGTTTCCAAGAAGCTGTTACAGAGGGCCGCTGTCTGACCGAGC
TTCCACTAGGACATAAACTCATTGGTCTAAAATGGGTGTTCAAATTGAAGAAAGACCCTAATGGAGAAGTTGTCAAGCACAAAACAAGATTGGTTGCTAAAGGCTATGTA
CAAAGACAAGGCATTAACTTTGAAGAAGTTTTTGCACCGGTTGCAAGACTTGACACCATTCGAGTCATTCTTGCACTCGCTGCAAACCAAAGTTGGGAGGAAGTATATGT
TACTCAACCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAAGGTGTATAGATTGTCAAAAGCTCTGTACGGATTGAGGCAAGCTCCACGAGCTTGGAACATTCGACTTG
ATAGGAGTCTCAAAGATCTTGGTTTTAGAAAATGCATTCAAGAGCAAGCAGTCTACATAAGAAGAGAAGAAGAGGAATGTGTTCTTGTTGGTGTGTATGTTGACGATCTC
ATTGTAACAGGAAGTAGCACTGAAAAGGTCAATAAGTTCAAGCAACAAATGATGGCAAAATTTGAAATGAGCGACTTAGGCTTTCTCTCTTACTACTTAGGAATTGAAGT
TGAACAACAGAAGGGTCGAATCCTGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCCCAGTTTGGAATGGCTGATTGCAATGCCACAAACACCAATTGA
Protein sequenceShow/hide protein sequence
MGRKPHLAHLRVFGCVAYVKNTTPHLKKLDDRSSPMVYFGVKERCKAHRLYDPGRGKLQISSDVIFQENLEWAWNEVVSDGKEITEFQVIDQFYSDEFENLEDAETRVEN
VLPHATEIPAIGETSPSPPSMNTTILLRSLSDIYANTEEVVGGDEQENEVMMVVSEEPTCFQEAVTEGRCLTELPLGHKLIGLKWVFKLKKDPNGEVVKHKTRLVAKGYV
QRQGINFEEVFAPVARLDTIRVILALAANQSWEEVYVTQPEGFEVPNEKHKVYRLSKALYGLRQAPRAWNIRLDRSLKDLGFRKCIQEQAVYIRREEEECVLVGVYVDDL
IVTGSSTEKVNKFKQQMMAKFEMSDLGFLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATNTN