; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G21360 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G21360
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr3:17605966..17608195
RNA-Seq ExpressionCSPI03G21360
SyntenyCSPI03G21360
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN69274.1 TatD related DNase [Prunus dulcis]6.5e-6536.99Show/hide
Query:  KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKN
        K + + V DF+PISL T  YKV++KVLA RL++V+ + +S +Q AF++ RQILD+VL+A+E VE+   + ++G + K+  EKA+D  EW F+  VM  K 
Subjt:  KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKN

Query:  STPKGQFGWWAVGKIEVFPYSSMGN-QGTGLWL----LEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDE
           K  +  W  G +E   +S M N +  G +     L   +  +PF F + S   VL  II +        G V+G +++ +  LQ+ DDT+ F    E
Subjt:  STPKGQFGWWAVGKIEVFPYSSMGN-QGTGLWL----LEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDE

Query:  DLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCK
        +  L L + ++LF   SG KIN+ KS + G+N   E L+ MA   GC+ G  P +YLGLPLGG PR   FW PV+D+V K L R KR  +S+GGR T+ +
Subjt:  DLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCK

Query:  SVLANLPTYYSSLFAIPDNL-----------------------------------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTVGKSSNSLKSH
        +VL+++P+YY SLF +P  +                                   +GGLG+G ++ +N AL AKW WRF +E    WH + KS   L S+
Subjt:  SVLANLPTYYSSLFAIPDNL-----------------------------------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTVGKSSNSLKSH

Query:  -W-------VSISRAWRKV
         W       VS    WR++
Subjt:  -W-------VSISRAWRKV

RVX11949.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.0e-6236.93Show/hide
Query:  LGRNKALGSDGFTA-------NFLKEDVIH---VKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRG
        L R+KA G DGFT        + +KED+     + DF+PISL T  YK++AKVL+ RL+ V+   +  +Q AF++GRQILD+VLIA+E V++     + G
Subjt:  LGRNKALGSDGFTA-------NFLKEDVIH---VKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRG

Query:  WILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQFGWWAVGKIEVFPYSSMGNQGTGLWL-----LEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEG
         + K+  EKA+D  +W FL  V+  K  + K +   W  G +    Y+ + N     W+     L  S+  +PF F I +   VL  ++ K       EG
Subjt:  WILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQFGWWAVGKIEVFPYSSMGNQGTGLWL-----LEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEG

Query:  FVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQP
        F  G+ +  +  LQ+ DDT+LF    E+ L  LK  + +F   SG K+N +KS L G+N+D   LS++A  L CKA   P LYLGLPLGG P    FW P
Subjt:  FVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQP

Query:  VIDRVHKELDRLKRFNISRGGRQTVCKSVLANLPTYYSSLFAIP-----------------------------------DNLDGGLGLGGIKIQNTALLA
        VI+R+ + LD  ++  +S GGR T+  S L+++P+Y+ SLF IP                                     + GGLG+G I ++N ALL 
Subjt:  VIDRVHKELDRLKRFNISRGGRQTVCKSVLANLPTYYSSLFAIP-----------------------------------DNLDGGLGLGGIKIQNTALLA

Query:  KWGWRFLVEEPFDWHTV
        KW WRF  E    WH V
Subjt:  KWGWRFLVEEPFDWHTV

TYK06397.1 hypothetical protein E5676_scaffold163G00940 [Cucumis melo var. makuwa]2.9e-7358.69Show/hide
Query:  AKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQFGW--WAVGKIEVFPYS
        AKVLA+RLK VMDSI SP Q+ FIEGRQILD + IA+EAVEDY AKKK+GWILKL LEKAFDR +WGFL+KV+HCK  + K    W  W +G I      
Subjt:  AKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQFGW--WAVGKIEVFPYS

Query:  SMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHY-EGFVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNK
                        K T F+ FI+   +  G I+    + G + +GF+ GK++ HIPILQY DDT+LFCK DE +L+KLKE I  FEW SGQK+N  K
Subjt:  SMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHY-EGFVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNK

Query:  SALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDR
        SAL  +N+ D++LS MA KLGCK  KLPFLYLGLPLGGYPRQKLFWQPV+DRV+K+LDR
Subjt:  SALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDR

VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis]3.8e-6534.69Show/hide
Query:  YLCNKNLQAAKDLGRNKALGSDGFTANFL--------------------------------------KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQ
        Y  NKN +A  D G++K+ G DGF+ +F                                       K + + V DF+PISL T  YKV++KVLA RL++
Subjt:  YLCNKNLQAAKDLGRNKALGSDGFTANFL--------------------------------------KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQ

Query:  VMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQFGWWAVGKIEVFPYSSMGN-QGTGLWL
        V+ + +S SQ AF++ RQILD+VL+A+E VE+   + ++G + K+  EKA+D  EW F+  V+  K    K  +  W  G +E   +S M N +  G + 
Subjt:  VMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQFGWWAVGKIEVFPYSSMGN-QGTGLWL

Query:  ----LEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNV
            L   +  +PF F + S   VL  II +        G V+G +++ +  LQ+ DDT+ F    E+  L L + ++LF   SG KIN+ KS + G+N 
Subjt:  ----LEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNV

Query:  DDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCKSVLANLPTYYSSLFAIPDNL-------------
          E L+ MA   GC+ G  P +YLGLPLGG PR   FW PV+D+V K L + KR  +S+GGR T+ ++VL+++P+YY SLF +P  +             
Subjt:  DDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCKSVLANLPTYYSSLFAIPDNL-------------

Query:  ----------------------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTVGKSSNSLKSH
                              +GGLG+G ++ +N AL AKW WRF +E    WH + KS   L S+
Subjt:  ----------------------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTVGKSSNSLKSH

XP_038880332.1 uncharacterized protein LOC120071973 [Benincasa hispida]8.8e-7040.27Show/hide
Query:  GFLQKVMHCKNSTPKGQFGW--WAVG-----KIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQ
        G L+KV+  KN  P+    W  W  G     K  +F       +      +   +  +PF F + S  +VL  +I +L+  G YEGF+ GK+K+HI I+Q
Subjt:  GFLQKVMHCKNSTPKGQFGW--WAVG-----KIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQ

Query:  YTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKR
        +  DTLLFCKY ++++  L+  I +FEW S QK+N  KSA+ G+N+++  +  +A +L CK   LP +YLGLPLGGYP+   FWQPVID++  +LD+ +R
Subjt:  YTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKR

Query:  FNISRGGRQTVCKSVLANLPTYYSSLFAIPDNL-----------------------------------DGGLGLGGIKIQNTALLAKWGWRFLVEE----
        FN+SRGG+ T+CKSV +NLPTYY SLF +P+ +                                   DGGLGLGG++ +N A LAKWGWR L  E    
Subjt:  FNISRGGRQTVCKSVLANLPTYYSSLFAIPDNL-----------------------------------DGGLGLGGIKIQNTALLAKWGWRFLVEE----

Query:  -----------PFDWHTVGKSSNSLKSHWVSISRAWRKVEALALLSLVMEEKLLFGKTYGSVTSP
                    FDWHT GK S +L+S W+SISR+W KVEALA+  L    ++ FG    S  +P
Subjt:  -----------PFDWHTVGKSSNSLKSHWVSISRAWRKVEALALLSLVMEEKLLFGKTYGSVTSP

TrEMBL top hitse value%identityAlignment
A0A5D3C3J5 Reverse transcriptase domain-containing protein1.4e-7358.69Show/hide
Query:  AKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQFGW--WAVGKIEVFPYS
        AKVLA+RLK VMDSI SP Q+ FIEGRQILD + IA+EAVEDY AKKK+GWILKL LEKAFDR +WGFL+KV+HCK  + K    W  W +G I      
Subjt:  AKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQFGW--WAVGKIEVFPYS

Query:  SMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHY-EGFVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNK
                        K T F+ FI+   +  G I+    + G + +GF+ GK++ HIPILQY DDT+LFCK DE +L+KLKE I  FEW SGQK+N  K
Subjt:  SMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHY-EGFVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNK

Query:  SALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDR
        SAL  +N+ D++LS MA KLGCK  KLPFLYLGLPLGGYPRQKLFWQPV+DRV+K+LDR
Subjt:  SALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDR

A0A5E4F090 Reverse transcriptase domain-containing protein (Fragment)1.8e-6534.69Show/hide
Query:  YLCNKNLQAAKDLGRNKALGSDGFTANFL--------------------------------------KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQ
        Y  NKN +A  D G++K+ G DGF+ +F                                       K + + V DF+PISL T  YKV++KVLA RL++
Subjt:  YLCNKNLQAAKDLGRNKALGSDGFTANFL--------------------------------------KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQ

Query:  VMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQFGWWAVGKIEVFPYSSMGN-QGTGLWL
        V+ + +S SQ AF++ RQILD+VL+A+E VE+   + ++G + K+  EKA+D  EW F+  V+  K    K  +  W  G +E   +S M N +  G + 
Subjt:  VMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQFGWWAVGKIEVFPYSSMGN-QGTGLWL

Query:  ----LEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNV
            L   +  +PF F + S   VL  II +        G V+G +++ +  LQ+ DDT+ F    E+  L L + ++LF   SG KIN+ KS + G+N 
Subjt:  ----LEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNV

Query:  DDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCKSVLANLPTYYSSLFAIPDNL-------------
          E L+ MA   GC+ G  P +YLGLPLGG PR   FW PV+D+V K L + KR  +S+GGR T+ ++VL+++P+YY SLF +P  +             
Subjt:  DDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCKSVLANLPTYYSSLFAIPDNL-------------

Query:  ----------------------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTVGKSSNSLKSH
                              +GGLG+G ++ +N AL AKW WRF +E    WH + KS   L S+
Subjt:  ----------------------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTVGKSSNSLKSH

A0A5H2XQW2 TatD related DNase3.1e-6536.99Show/hide
Query:  KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKN
        K + + V DF+PISL T  YKV++KVLA RL++V+ + +S +Q AF++ RQILD+VL+A+E VE+   + ++G + K+  EKA+D  EW F+  VM  K 
Subjt:  KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKN

Query:  STPKGQFGWWAVGKIEVFPYSSMGN-QGTGLWL----LEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDE
           K  +  W  G +E   +S M N +  G +     L   +  +PF F + S   VL  II +        G V+G +++ +  LQ+ DDT+ F    E
Subjt:  STPKGQFGWWAVGKIEVFPYSSMGN-QGTGLWL----LEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDE

Query:  DLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCK
        +  L L + ++LF   SG KIN+ KS + G+N   E L+ MA   GC+ G  P +YLGLPLGG PR   FW PV+D+V K L R KR  +S+GGR T+ +
Subjt:  DLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCK

Query:  SVLANLPTYYSSLFAIPDNL-----------------------------------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTVGKSSNSLKSH
        +VL+++P+YY SLF +P  +                                   +GGLG+G ++ +N AL AKW WRF +E    WH + KS   L S+
Subjt:  SVLANLPTYYSSLFAIPDNL-----------------------------------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTVGKSSNSLKSH

Query:  -W-------VSISRAWRKV
         W       VS    WR++
Subjt:  -W-------VSISRAWRKV

A0A803P465 Uncharacterized protein1.7e-6637.86Show/hide
Query:  VKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQ
        V+D++PISL T  YK++AK+L+ RL+ V+   +  +QSAF+EGRQILDSVLIA+E VEDY ++ + G + K+  EKA+DR EW F+  V+  K       
Subjt:  VKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQ

Query:  FG--W--WAVGKIEVFPYSSMGNQG-----TGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDEDL
        FG  W  W  G I    +S   N+      +G   L   +  +PF F + +   VLG + NK    G+  GF+ GKE++ +  LQ+ DDT+ F + +E  
Subjt:  FG--W--WAVGKIEVFPYSSMGNQG-----TGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDEDL

Query:  LLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCKSV
        L KL   +  F   SG KIN +KS L G+ +D+E +S++AR++GC+ G  P  YLG+PLGG PR+  FW+PV+D+  K LD  K   +S+GGR T+ +SV
Subjt:  LLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCKSV

Query:  LANLPTYYSSLFAIPDNL-----------------------------------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTV-----GKSSN--
        L++LP Y+ SLF  P ++                                   +GGLG+G ++++N +LL KW WRF +E+   WH V     G++ N  
Subjt:  LANLPTYYSSLFAIPDNL-----------------------------------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTV-----GKSSN--

Query:  -SLKSHWVSISRAWRKVEAL
         S K   +S    WR +  L
Subjt:  -SLKSHWVSISRAWRKVEAL

A0A803QI00 Uncharacterized protein4.4e-6740.1Show/hide
Query:  VKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQ
        VKDF+PISL T  YK+VAK LA RL+ V+   +S +QSAF+EGRQILDSVLIA+E VED+ ++ K+G++ K+ LEKA+DR +W FL  V+  K       
Subjt:  VKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVMHCKNSTPKGQ

Query:  FG--W--WAVGKIEVFPYSSMGN-----QGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDEDL
        FG  W  W  G +    +S + N     +  G   L   +  +PF F +     VLG +++K   +  + GF  GK+ I I  LQ+ DDTL F K DE  
Subjt:  FG--W--WAVGKIEVFPYSSMGN-----QGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDTLLFCKYDEDL

Query:  LLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCKSV
        L KL E +  F   SG K+N NKS L G+++++E ++Q A  +GC+ G  P  YLG+PLGG PR+  FW+PV+D+  K LD  K   +SRGGR  + +SV
Subjt:  LLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFNISRGGRQTVCKSV

Query:  LANLPTYYSSLFAIP------------------------DNL-----------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTV-----GKSSNSL
        L++LP YY SLF  P                        D+L           +GGL +G ++++N  LL KW WR+ +E    WH V     GK+ N  
Subjt:  LANLPTYYSSLFAIP------------------------DNL-----------DGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTV-----GKSSNSL

Query:  KSHW
         + W
Subjt:  KSHW

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.9e-1124.18Show/hide
Query:  DVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYH-AKKKRGWILKL-LEKAFDRAEWGFLQKVMH----
        D    ++F+PISL  +  K++ K+LA R++Q +  ++   Q  FI G Q   ++  +   ++  + AK K   I+ +  EKAFD+ +  F+ K ++    
Subjt:  DVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYH-AKKKRGWILKL-LEKAFDRAEWGFLQKVMH----

Query:  ---------------CKNSTPKGQFGWWAVGKIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQ
                         N    GQ       K+E FP  +   QG  L         +P  F I     VL  +   +      +G   GKE++ + +  
Subjt:  ---------------CKNSTPKGQFGWWAVGKIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQ

Query:  YTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQ--KLFWQPVIDRVHKELDRL
        + DD +++ +        L + I  F   SG KIN  KS     N + +  SQ+  +L          YLG+ L    +   K  ++P++  + ++ ++ 
Subjt:  YTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQ--KLFWQPVIDRVHKELDRL

Query:  KRFNISRGGRQTVCKSVLANLPTYYSSLFAIPDNL
        K    S  GR  + K  +A LP       AIP  L
Subjt:  KRFNISRGGRQTVCKSVLANLPTYYSSLFAIPDNL

P08548 LINE-1 reverse transcriptase homolog3.8e-0726.07Show/hide
Query:  EDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYH-AKKKRGWILKL-LEKAFDRAEWGFL--------
        +D    ++++PISL  +  K++ K+L  R++Q +  I+   Q  FI G Q   ++  +   ++  +  K K   IL +  EKAFD  +  F+        
Subjt:  EDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYH-AKKKRGWILKL-LEKAFDRAEWGFL--------

Query:  -----QKVMHCKNSTPKGQFGWWAVGKIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDT
              K++    S P        V K++ FP  S   QG  L         +P  F I    +VL   I +       +G   G E+I + +  + DD 
Subjt:  -----QKVMHCKNSTPKGQFGWWAVGKIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYTDDT

Query:  LLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKS
        +++ +   D   KL E I+ +   SG KIN +KS
Subjt:  LLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKS

P0C2F6 Putative ribonuclease H protein At1g657503.9e-0423.87Show/hide
Query:  VIDRVHKELDRLKRFNISRGGRQTVCKSVLANLPTYYSSLFAIPDNL-----------------------------------DGGLGLGGIKIQNTALLA
        +++RV   +   +   +S  GR T+ K+VL+++P +  S   +P ++                                   +GGLG+   K  N AL++
Subjt:  VIDRVHKELDRLKRFNISRGGRQTVCKSVLANLPTYYSSLFAIPDNL-----------------------------------DGGLGLGGIKIQNTALLA

Query:  KWGWRFLVEEPFDWHTVGKSSNSL----KSHWV----SISRAWRKVEALALLSLV
        K GWR L E+   W  V +    +     S W+    S S  WR + A+ L  +V
Subjt:  KWGWRFLVEEPFDWHTVGKSSNSL----KSHWV----SISRAWRKVEALALLSLV

P11369 LINE-1 retrotransposable element ORF2 protein1.8e-0927.85Show/hide
Query:  KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQ----ILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVM
        ++D   +++F+PISL  +  K++ K+LA R+++ + +I+ P Q  FI G Q    I  S+ + H   +    K K   I+ L  EKAFD+ +  F+ KV+
Subjt:  KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQ----ILDSVLIAHEAVEDYHAKKKRGWILKL-LEKAFDRAEWGFLQKVM

Query:  H-----------CKNSTPKGQFGWWAVG-KIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYT
                     K    K        G K+E  P  S   QG  L         +P+ F I     VL  +   +      +G   GKE++ I +L   
Subjt:  H-----------CKNSTPKGQFGWWAVG-KIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGHYEGFVTGKEKIHIPILQYT

Query:  DDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKS
        DD +++    ++   +L   I  F    G KIN NKS
Subjt:  DDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKS

P14381 Transposon TX1 uncharacterized 149 kDa protein1.4e-0925.16Show/hide
Query:  KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKLL----EKAFDRAEWGFLQKVMH
        K D+  +K+++P+SL +  YK+VAK ++ RLK V+  ++ P QS  + GR I D+V +  + +   H  ++ G  L  L    EKAFDR +  +L   + 
Subjt:  KEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILDSVLIAHEAVEDYHAKKKRGWILKLL----EKAFDRAEWGFLQKVMH

Query:  CKNSTPKGQFGWWAVGKIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQ------QVLGAIINKL--YVNGHYEGFVTGKEKIHIPILQYTDDTLL
          +      FG   VG ++   Y+S        W L A     P  F     Q      Q+    I      +     G V  +  + + +  Y DD +L
Subjt:  CKNSTPKGQFGWWAVGKIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQ------QVLGAIINKL--YVNGHYEGFVTGKEKIHIPILQYTDDTLL

Query:  FCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSA-LRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFN--IS
          +   D L + +E   ++  +S  +IN +KS+ L   ++  + L    R +  ++  + +L + L    YP  + F + + + V   L + K F   +S
Subjt:  FCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSA-LRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHKELDRLKRFN--IS

Query:  RGGRQTVCKSVLAN
          GR  V   ++A+
Subjt:  RGGRQTVCKSVLAN

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAATCAGCAACTTGTTAAAGAAGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAACAAGGATAAGCTATTTGTGCAACAAAAATCTTCAGGCAGC
AAAGGATCTTGGAAGGAACAAAGCTCTGGGTTCGGATGGCTTTACCGCGAATTTCCTTAAAGAAGATGTGATTCATGTTAAAGATTTCAAACCTATCAGCCTCACTACCC
TAACCTACAAGGTGGTGGCCAAAGTTTTAGCGGAACGTTTGAAACAGGTTATGGATTCAATTGTAAGCCCATCCCAAAGTGCCTTTATCGAGGGAAGGCAGATTTTAGAT
TCAGTTTTAATTGCTCATGAGGCAGTTGAAGATTATCATGCTAAAAAGAAAAGGGGATGGATTCTGAAACTTCTTGAGAAGGCCTTTGATAGAGCGGAATGGGGATTCCT
TCAAAAGGTAATGCACTGCAAAAATTCAACTCCTAAAGGTCAGTTTGGATGGTGGGCTGTTGGAAAAATCGAAGTTTTTCCATATTCATCAATGGGAAACCAAGGGACAG
GATTGTGGCTTCTAGAGGCATCCAACAAGGAAACCCCGTTCACATTTTTTATTTCTTCTTGTCAGCAGGTTCTAGGAGCTATCATCAATAAGCTGTACGTTAATGGGCAT
TATGAAGGTTTCGTGACTGGAAAGGAGAAGATCCACATCCCCATCCTCCAATACACTGATGATACACTCCTATTTTGCAAGTATGATGAGGATCTGCTTCTCAAGTTGAA
GGAGGCTATTAGATTGTTTGAATGGAGTTCAGGGCAGAAAATTAATCGGAATAAATCAGCTCTCAGGGGAGTTAATGTGGATGATGAAGATCTGTCTCAAATGGCCAGAA
AACTAGGGTGTAAGGCGGGAAAGCTTCCATTTTTGTACTTAGGACTTCCCTTGGGAGGTTATCCGAGGCAAAAGTTATTCTGGCAACCAGTGATTGACCGGGTTCATAAA
GAACTGGATAGATTGAAAAGGTTTAATATTTCAAGAGGAGGAAGACAAACTGTATGTAAGTCAGTTTTGGCCAACCTTCCCACTTATTATTCGTCTCTCTTTGCCATCCC
TGACAATCTTGATGGGGGCCTTGGGTTAGGAGGCATAAAAATTCAAAACACAGCTCTACTTGCTAAATGGGGGTGGAGATTCTTAGTGGAAGAGCCCTTTGATTGGCACA
CGGTGGGTAAATCCAGTAATAGTTTGAAGAGTCATTGGGTTAGTATTTCAAGAGCTTGGAGGAAGGTGGAAGCTTTGGCTTTGTTAAGCTTGGTAATGGAAGAAAAATTG
CTTTTTGGAAAGACTTATGGATCGGTGACATCCCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAATCAGCAACTTGTTAAAGAAGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAACAAGGATAAGCTATTTGTGCAACAAAAATCTTCAGGCAGC
AAAGGATCTTGGAAGGAACAAAGCTCTGGGTTCGGATGGCTTTACCGCGAATTTCCTTAAAGAAGATGTGATTCATGTTAAAGATTTCAAACCTATCAGCCTCACTACCC
TAACCTACAAGGTGGTGGCCAAAGTTTTAGCGGAACGTTTGAAACAGGTTATGGATTCAATTGTAAGCCCATCCCAAAGTGCCTTTATCGAGGGAAGGCAGATTTTAGAT
TCAGTTTTAATTGCTCATGAGGCAGTTGAAGATTATCATGCTAAAAAGAAAAGGGGATGGATTCTGAAACTTCTTGAGAAGGCCTTTGATAGAGCGGAATGGGGATTCCT
TCAAAAGGTAATGCACTGCAAAAATTCAACTCCTAAAGGTCAGTTTGGATGGTGGGCTGTTGGAAAAATCGAAGTTTTTCCATATTCATCAATGGGAAACCAAGGGACAG
GATTGTGGCTTCTAGAGGCATCCAACAAGGAAACCCCGTTCACATTTTTTATTTCTTCTTGTCAGCAGGTTCTAGGAGCTATCATCAATAAGCTGTACGTTAATGGGCAT
TATGAAGGTTTCGTGACTGGAAAGGAGAAGATCCACATCCCCATCCTCCAATACACTGATGATACACTCCTATTTTGCAAGTATGATGAGGATCTGCTTCTCAAGTTGAA
GGAGGCTATTAGATTGTTTGAATGGAGTTCAGGGCAGAAAATTAATCGGAATAAATCAGCTCTCAGGGGAGTTAATGTGGATGATGAAGATCTGTCTCAAATGGCCAGAA
AACTAGGGTGTAAGGCGGGAAAGCTTCCATTTTTGTACTTAGGACTTCCCTTGGGAGGTTATCCGAGGCAAAAGTTATTCTGGCAACCAGTGATTGACCGGGTTCATAAA
GAACTGGATAGATTGAAAAGGTTTAATATTTCAAGAGGAGGAAGACAAACTGTATGTAAGTCAGTTTTGGCCAACCTTCCCACTTATTATTCGTCTCTCTTTGCCATCCC
TGACAATCTTGATGGGGGCCTTGGGTTAGGAGGCATAAAAATTCAAAACACAGCTCTACTTGCTAAATGGGGGTGGAGATTCTTAGTGGAAGAGCCCTTTGATTGGCACA
CGGTGGGTAAATCCAGTAATAGTTTGAAGAGTCATTGGGTTAGTATTTCAAGAGCTTGGAGGAAGGTGGAAGCTTTGGCTTTGTTAAGCTTGGTAATGGAAGAAAAATTG
CTTTTTGGAAAGACTTATGGATCGGTGACATCCCCCTAA
Protein sequenceShow/hide protein sequence
MPNQQLVKEGEQLGDILTKALNGTRISYLCNKNLQAAKDLGRNKALGSDGFTANFLKEDVIHVKDFKPISLTTLTYKVVAKVLAERLKQVMDSIVSPSQSAFIEGRQILD
SVLIAHEAVEDYHAKKKRGWILKLLEKAFDRAEWGFLQKVMHCKNSTPKGQFGWWAVGKIEVFPYSSMGNQGTGLWLLEASNKETPFTFFISSCQQVLGAIINKLYVNGH
YEGFVTGKEKIHIPILQYTDDTLLFCKYDEDLLLKLKEAIRLFEWSSGQKINRNKSALRGVNVDDEDLSQMARKLGCKAGKLPFLYLGLPLGGYPRQKLFWQPVIDRVHK
ELDRLKRFNISRGGRQTVCKSVLANLPTYYSSLFAIPDNLDGGLGLGGIKIQNTALLAKWGWRFLVEEPFDWHTVGKSSNSLKSHWVSISRAWRKVEALALLSLVMEEKL
LFGKTYGSVTSP