; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G18980 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G18980
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationChr5:20139970..20141136
RNA-Seq ExpressionCSPI05G18980
SyntenyCSPI05G18980
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]3.2e-6853Show/hide
Query:  MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLS--NPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFV
        + +GE+D+LF+Y + SPTPSL+     S   RP I +VYS R  P    S P   L S  +P  SD+LPIAL KGK  CTYP+ S +S+ QLS  T +F+
Subjt:  MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLS--NPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFV

Query:  KSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS-----SHLK
         SL   SIP  VHEALSH GW +AMI+E+ ALDDN          GKK IGCK VFA+K+N DG++ARLKA LVAKG+AQ YG DY DTFS     + ++
Subjt:  KSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS-----SHLK

Query:  IGLYINLTLKMS----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV
        + L +  T K S          L G  QEEVY          G++DKV RL KSLYGLKQSPRA FGKF  AL  FGMKKS SDHSVFY+RS  GI+LLV
Subjt:  IGLYINLTLKMS----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]3.2e-6853Show/hide
Query:  MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLS--NPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFV
        + +GE+D+LF+Y + SPTPSL+     S   RP I +VYS R  P    S P   L S  +P  SD+LPIAL KGK  CTYP+ S +S+ QLS  T +F+
Subjt:  MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLS--NPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFV

Query:  KSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS-----SHLK
         SL   SIP  VHEALSH GW +AMI+E+ ALDDN          GKK IGCK VFA+K+N DG++ARLKA LVAKG+AQ YG DY DTFS     + ++
Subjt:  KSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS-----SHLK

Query:  IGLYINLTLKMS----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV
        + L +  T K S          L G  QEEVY          G++DKV RL KSLYGLKQSPRA FGKF  AL  FGMKKS SDHSVFY+RS  GI+LLV
Subjt:  IGLYINLTLKMS----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]3.2e-6853Show/hide
Query:  MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLS--NPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFV
        + +GE+D+LF+Y + SPTPSL+     S   RP I +VYS R  P    S P   L S  +P  SD+LPIAL KGK  CTYP+ S +S+ QLS  T +F+
Subjt:  MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLS--NPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFV

Query:  KSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS-----SHLK
         SL   SIP  VHEALSH GW +AMI+E+ ALDDN          GKK IGCK VFA+K+N DG++ARLKA LVAKG+AQ YG DY DTFS     + ++
Subjt:  KSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS-----SHLK

Query:  IGLYINLTLKMS----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV
        + L +  T K S          L G  QEEVY          G++DKV RL KSLYGLKQSPRA FGKF  AL  FGMKKS SDHSVFY+RS  GI+LLV
Subjt:  IGLYINLTLKMS----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]3.2e-6853Show/hide
Query:  MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLS--NPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFV
        + +GE+D+LF+Y + SPTPSL+     S   RP I +VYS R  P    S P   L S  +P  SD+LPIAL KGK  CTYP+ S +S+ QLS  T +F+
Subjt:  MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLS--NPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFV

Query:  KSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS-----SHLK
         SL   SIP  VHEALSH GW +AMI+E+ ALDDN          GKK IGCK VFA+K+N DG++ARLKA LVAKG+AQ YG DY DTFS     + ++
Subjt:  KSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS-----SHLK

Query:  IGLYINLTLKMS----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV
        + L +  T K S          L G  QEEVY          G++DKV RL KSLYGLKQSPRA FGKF  AL  FGMKKS SDHSVFY+RS  GI+LLV
Subjt:  IGLYINLTLKMS----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]3.2e-6853Show/hide
Query:  MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLS--NPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFV
        + +GE+D+LF+Y + SPTPSL+     S   RP I +VYS R  P    S P   L S  +P  SD+LPIAL KGK  CTYP+ S +S+ QLS  T +F+
Subjt:  MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLS--NPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFV

Query:  KSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS-----SHLK
         SL   SIP  VHEALSH GW +AMI+E+ ALDDN          GKK IGCK VFA+K+N DG++ARLKA LVAKG+AQ YG DY DTFS     + ++
Subjt:  KSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS-----SHLK

Query:  IGLYINLTLKMS----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV
        + L +  T K S          L G  QEEVY          G++DKV RL KSLYGLKQSPRA FGKF  AL  FGMKKS SDHSVFY+RS  GI+LLV
Subjt:  IGLYINLTLKMS----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV

TrEMBL top hitse value%identityAlignment
A0A061EWC9 Integrase catalytic domain-containing protein1.5e-5548.2Show/hide
Query:  SRGEEDDLFLYTLKSP--TPSLALSPPASALNRPPIPKVYSCRQQ-----PLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPT
        S+GEEDDL +YT+     T  +  + PA A  +PPI  VYS R +     PL   S P D + ++   S  LPIAL KGK  CTYPI S VS+D LS  +
Subjt:  SRGEEDDLFLYTLKSP--TPSLALSPPASALNRPPIPKVYSCRQQ-----PLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPT

Query:  CSFVKSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLKI
         SFV SL  +SIPK VHEALSH GW +AM++E+ ALD N          GKK IGCK V A+KV+ +GS+    A LVAKG+AQTY +DYF TFS   K+
Subjt:  CSFVKSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLKI

Query:  GLYINLTLKM----------------SLCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISG
          ++ L + M                 L G  Q+EVY      +   G+  KV  L K LYGLKQSPRA FGKF   +++FGMKKSK DHSVFY++S +G
Subjt:  GLYINLTLKM----------------SLCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISG

Query:  IILLV
        IILLV
Subjt:  IILLV

A0A438G5Y3 Retrovirus-related Pol polyprotein from transposon TNT 1-949.8e-5545.75Show/hide
Query:  SRGEEDDLFLYTLKSPTPSLALSPP-------ASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSC--TYPIVSCVSHDQLSS
        S GE ++  +Y   +P+ S   S          SA  +PPI + YS  Q+       P  S LS+P S  +LPI L KGK  C   Y I + VS+DQLS 
Subjt:  SRGEEDDLFLYTLKSPTPSLALSPP-------ASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSC--TYPIVSCVSHDQLSS

Query:  PTCSFVKSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHL
         + +FV SL  +SIPK + EAL+H GW +AM++E+ AL+ N          GK  +GCK VFAIKVN +GS+ARLK  LVAKG+AQTYGVDY DTFS   
Subjt:  PTCSFVKSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHL

Query:  KIG---LYINLTLKMS------------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSIS
        ++    L I++                 L G  QEEVY          G+  KV  L KSLYGLKQSPRA FGKF   +++FGM KSK DHSVFY++S +
Subjt:  KIG---LYINLTLKMS------------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSIS

Query:  GIILLV
        GIILLV
Subjt:  GIILLV

A0A5A7TVM7 Ty3-gypsy retrotransposon protein2.3e-6774.21Show/hide
Query:  EEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKSLAFM
        E DDLFLYTL SPTPS AL PPAS  NRPPIPK+YS RQQP GE SIPK SLLS+PG SDE PIAL KGKHSCTYPI S VS+D LSSPTCSFVKSL F+
Subjt:  EEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKSLAFM

Query:  SIPKIVHEALSHFGWCSAMIKEVNALDDNG----------KKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLKI
        SIPK VHEALSH GW +A IKEVNAL+DNG          KKTIGCK VFAIKVN DGSIARLKA L+AKG+AQT+G+DYFDT S   K+
Subjt:  SIPKIVHEALSHFGWCSAMIKEVNALDDNG----------KKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLKI

A0A5D3DJ35 Cysteine-rich RLK (RECEPTOR-like protein kinase) 83.8e-5945.12Show/hide
Query:  RGEEDDLFLYTLKSPT--PSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKS
        +GE+D+L +Y + SPT  P   L P      RPP     SC        S+P  S   +PG SD+L I L KGK  CTYP+ S V + QLSSPT +F+ S
Subjt:  RGEEDDLFLYTLKSPT--PSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKS

Query:  LAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLKIGLYINL
        L   SI   VHEALSH GW +AMI+E+ ALDDN          GKK I CK VF++KVNLDG++ +LKAHLVAKG+AQTYG++Y DTFS   K+   I L
Subjt:  LAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLKIGLYINL

Query:  TLKMS----------------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLVS
         L M+                L G  QEEVY          G++DKV RL KSLYGLKQ P A FGKF +AL +FG               IS +   + 
Subjt:  TLKMS----------------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLVS

Query:  RELGARLCNTLMILDLQLLKDGEPFKHP
         +LGA+   TLM+L+ QL+K+G+  K P
Subjt:  RELGARLCNTLMILDLQLLKDGEPFKHP

Q6L3Q0 Polyprotein, putative1.3e-5448.2Show/hide
Query:  ALNRPPIPKVYSCRQQPLGEYSIP----KDSLLSNPGSSD--ELPIALHKGKHSC--TYPIVSCVSHDQLSSPTCSFVKSLAFMSIPKIVHEALSHFGWC
        A  +PPI +VYS RQ        P     D L  NP  ++  ++PIAL KGK  C   Y I + +S+D LS  +CS + SL  + +PK V EAL+H GW 
Subjt:  ALNRPPIPKVYSCRQQPLGEYSIP----KDSLLSNPGSSD--ELPIALHKGKHSC--TYPIVSCVSHDQLSSPTCSFVKSLAFMSIPKIVHEALSHFGWC

Query:  SAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLK---IGLYINLTLKMS------------
         AM+ E++ALDDN          GKK +GCK VF IKVN DGS+ARLKA LVAKG+AQTYGVDY DTFS   K   + L+I+L    +            
Subjt:  SAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLK---IGLYINLTLKMS------------

Query:  LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV
        L G  QEEVY          G+N KV  L K LYGLKQSPRA FGKF   +++FG+ KS  DHSVFY++S  GIILLV
Subjt:  LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLV

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-1227.43Show/hide
Query:  KDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKSLAFMSIPKIVHEAL---SHFGWCSAMIKEVNALDDN----------GKKTIG
        K+  + NP  +D + I   + +   T P +S    D   +       ++ F  +P    E         W  A+  E+NA   N           K  + 
Subjt:  KDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKSLAFMSIPKIVHEAL---SHFGWCSAMIKEVNALDDN----------GKKTIG

Query:  CKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLKI-------GLYINLTLKMS--------LCGGFQEEVYYGATTWVCCLGDNDKVFRL
         + VF++K N  G+  R KA LVA+G  Q Y +DY +TF+   +I        L I   LK+         L G  +EE+Y      + C  DN  V +L
Subjt:  CKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLKI-------GLYINLTLKMS--------LCGGFQEEVYYGATTWVCCLGDNDKVFRL

Query:  HKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVF
        +K++YGLKQ+ R  F  F  AL++     S  D  ++
Subjt:  HKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-1431.63Show/hide
Query:  PKIVHEALSH---FGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLK-------IGL
        P+ + E LSH        AM +E+ +L  N          GK+ + CK VF +K + D  + R KA LV KG  Q  G+D+ + FS  +K       + L
Subjt:  PKIVHEALSH---FGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLK-------IGL

Query:  YINLTLKMS--------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQR-SISGIILLV
          +L L++         L G  +EE+Y          G    V +L+KSLYGLKQ+PR  + KF + ++     K+ SD  V+++R S +  I+L+
Subjt:  YINLTLKMS--------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQR-SISGIILLV

P92520 Uncharacterized mitochondrial protein AtMg008202.7e-0933.08Show/hide
Query:  SPTCSFVKSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS--
        +P  S   +      PK V  AL   GWC AM +E++AL  N           +  +GCK VF  K++ DG++ RLKA LVAKG  Q  G+ + +T+S  
Subjt:  SPTCSFVKSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS--

Query:  -------------SHLKIGLYINLTLKMSLCGG
                       L++G  IN   KM    G
Subjt:  -------------SHLKIGLYINLTLKMSLCGG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-1629.69Show/hide
Query:  TLKSPTP-SLALSPPASALNRPPIPKV----YSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKSLAFMSIP
        T +SP+  + +LS PA + +  P P       S    P      P   L     ++++ P+  H         I+         +P  S   SLA  S P
Subjt:  TLKSPTP-SLALSPPASALNRPPIPKV----YSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKSLAFMSIP

Query:  KIVHEALSHFGWCSAMIKEVNALDDNGK-----------KTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLK---IGLYINLTLK
        +   +AL    W +AM  E+NA   N               +GC+ +F  K N DGS+ R KA LVAKG+ Q  G+DY +TFS  +K   I + + + + 
Subjt:  KIVHEALSHFGWCSAMIKEVNALDDNGK-----------KTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLK---IGLYINLTLK

Query:  MS------------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVF-YQRSISGIILLV
         S            L G   ++VY             + V +L K+LYGLKQ+PRA + +  N L   G   S SD S+F  QR  S + +LV
Subjt:  MS------------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVF-YQRSISGIILLV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.1e-1830Show/hide
Query:  PTPSLAL----SPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPT--CSFVKSLAFMSIPKIV
        PTPS ++    SP +S+ + PP+P V            +P   ++     + + P+  H          ++  + D +  P    S+  SLA  S P+  
Subjt:  PTPSLAL----SPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPT--CSFVKSLAFMSIPKIV

Query:  HEALSHFGWCSAMIKEVNALDDN-----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLK---IGLYINLTLKMS-
         +A+    W  AM  E+NA   N               +GC+ +F  K N DGS+ R KA LVAKG+ Q  G+DY +TFS  +K   I + + + +  S 
Subjt:  HEALSHFGWCSAMIKEVNALDDN-----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLK---IGLYINLTLKMS-

Query:  -----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVF-YQRSISGIILLV
                   L G   +EVY             D V RL K++YGLKQ+PRA + +    L   G   S SD S+F  QR  S I +LV
Subjt:  -----------LCGGFQEEVYYGATTWVCCLGDNDKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVF-YQRSISGIILLV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.3e-2231.73Show/hide
Query:  SLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKSLAFMSIPKIVHEALSHFGW
        S+ + P A+  N  P P V++  ++        K + L +        + +H      +Y  VS + H        SF+  +A    P   +EA     W
Subjt:  SLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKSLAFMSIPKIVHEALSHFGW

Query:  CSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLK---------IGLYINLTL------KM
        C AM  E+ A++             KK IGCK V+ IK N DG+I R KA LVAKG+ Q  G+D+ +TFS   K         I    N TL        
Subjt:  CSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLK---------IGLYINLTL------KM

Query:  SLCGGFQEEVYYGATT-WVCCLGDN---DKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVF
         L G   EE+Y      +    GD+   + V  L KS+YGLKQ+ R  F KF   L  FG  +S SDH+ F
Subjt:  SLCGGFQEEVYYGATT-WVCCLGDN---DKVFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.9e-1033.08Show/hide
Query:  SPTCSFVKSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS--
        +P  S   +      PK V  AL   GWC AM +E++AL  N           +  +GCK VF  K++ DG++ RLKA LVAKG  Q  G+ + +T+S  
Subjt:  SPTCSFVKSLAFMSIPKIVHEALSHFGWCSAMIKEVNALDDN----------GKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFS--

Query:  -------------SHLKIGLYINLTLKMSLCGG
                       L++G  IN   KM    G
Subjt:  -------------SHLKIGLYINLTLKMSLCGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCGGGGGGAGGAGGATGATCTTTTTCTTTATACTCTAAAGTCTCCAACTCCCTCCCTTGCGTTGTCTCCTCCGGCATCTGCCCTTAATCGTCCACCAATACCTAA
GGTCTACTCTTGTCGGCAGCAACCTCTAGGTGAATATTCTATCCCGAAAGATTCTTTGTTATCAAATCCAGGATCAAGTGATGAGCTTCCCATTGCTCTTCATAAAGGTA
AACATTCTTGCACTTATCCTATTGTGTCTTGTGTATCTCATGACCAATTATCATCTCCTACATGTTCCTTTGTTAAATCTCTTGCTTTTATGTCTATACCTAAGATCGTA
CATGAAGCTTTGTCTCATTTTGGGTGGTGCAGTGCAATGATTAAAGAGGTGAATGCCTTAGATGATAATGGAAAGAAGACTATTGGGTGCAAGTCGGTGTTTGCAATTAA
AGTTAATCTTGATGGATCAATAGCTCGTTTAAAAGCACACCTTGTGGCTAAAGGTCATGCTCAGACGTATGGGGTTGACTATTTTGATACATTCTCTTCTCATCTCAAGA
TTGGCCTTTACATCAACTTGACATTAAAAATGTCTCTCTGCGGTGGTTTTCAAGAGGAAGTCTATTATGGAGCAACCACCTGGGTTTGTTGCTTAGGAGATAATGATAAA
GTTTTTCGCCTTCACAAATCCTTGTATGGGTTGAAGCAGAGTCCACGTGCATTGTTTGGAAAATTCTGTAACGCACTTGAGCAGTTTGGAATGAAGAAAAGCAAATCAGA
TCACTCAGTTTTCTACCAAAGATCTATTAGTGGAATTATTCTACTTGTTTCGAGGGAATTGGGAGCCAGACTATGCAATACTCTGATGATACTTGATCTACAACTATTAA
AAGACGGAGAACCATTTAAGCATCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTCGGGGGGAGGAGGATGATCTTTTTCTTTATACTCTAAAGTCTCCAACTCCCTCCCTTGCGTTGTCTCCTCCGGCATCTGCCCTTAATCGTCCACCAATACCTAA
GGTCTACTCTTGTCGGCAGCAACCTCTAGGTGAATATTCTATCCCGAAAGATTCTTTGTTATCAAATCCAGGATCAAGTGATGAGCTTCCCATTGCTCTTCATAAAGGTA
AACATTCTTGCACTTATCCTATTGTGTCTTGTGTATCTCATGACCAATTATCATCTCCTACATGTTCCTTTGTTAAATCTCTTGCTTTTATGTCTATACCTAAGATCGTA
CATGAAGCTTTGTCTCATTTTGGGTGGTGCAGTGCAATGATTAAAGAGGTGAATGCCTTAGATGATAATGGAAAGAAGACTATTGGGTGCAAGTCGGTGTTTGCAATTAA
AGTTAATCTTGATGGATCAATAGCTCGTTTAAAAGCACACCTTGTGGCTAAAGGTCATGCTCAGACGTATGGGGTTGACTATTTTGATACATTCTCTTCTCATCTCAAGA
TTGGCCTTTACATCAACTTGACATTAAAAATGTCTCTCTGCGGTGGTTTTCAAGAGGAAGTCTATTATGGAGCAACCACCTGGGTTTGTTGCTTAGGAGATAATGATAAA
GTTTTTCGCCTTCACAAATCCTTGTATGGGTTGAAGCAGAGTCCACGTGCATTGTTTGGAAAATTCTGTAACGCACTTGAGCAGTTTGGAATGAAGAAAAGCAAATCAGA
TCACTCAGTTTTCTACCAAAGATCTATTAGTGGAATTATTCTACTTGTTTCGAGGGAATTGGGAGCCAGACTATGCAATACTCTGATGATACTTGATCTACAACTATTAA
AAGACGGAGAACCATTTAAGCATCCTTAG
Protein sequenceShow/hide protein sequence
MSRGEEDDLFLYTLKSPTPSLALSPPASALNRPPIPKVYSCRQQPLGEYSIPKDSLLSNPGSSDELPIALHKGKHSCTYPIVSCVSHDQLSSPTCSFVKSLAFMSIPKIV
HEALSHFGWCSAMIKEVNALDDNGKKTIGCKSVFAIKVNLDGSIARLKAHLVAKGHAQTYGVDYFDTFSSHLKIGLYINLTLKMSLCGGFQEEVYYGATTWVCCLGDNDK
VFRLHKSLYGLKQSPRALFGKFCNALEQFGMKKSKSDHSVFYQRSISGIILLVSRELGARLCNTLMILDLQLLKDGEPFKHP