; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020304 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020304
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionWRKY protein
Genome locationscaffold665:889287..891070
RNA-Seq ExpressionMS020304
SyntenyMS020304
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0005516 - calmodulin binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR003657 - WRKY domain
IPR018872 - Zn-cluster domain
IPR036576 - WRKY domain superfamily
IPR044810 - WRKY transcription factor, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
QIS68546.1 WRKY20 protein [Cucumis metulifer]6.6e-13175.72Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPA---DQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSG
        MEEVEEAN++A+ESCHGVLNLL QP     Q   +NLM+ET EAVFKF+KV+SLLNS   H R R  NKI LPLPQN+LLD P + L   N+NL     G
Subjt:  MEEVEEANRAALESCHGVLNLLAQPA---DQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSG

Query:  LNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH--QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSV
         N KVS FLG PDLEL  NDKN L IP+Q+PS    FP H  QQQQ +L  QKQ+KQQ+EMMFLRNNN GMNLNFDTSNCTLTMSSARSFISSLSMDGSV
Subjt:  LNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH--QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSV

Query:  -ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPR
          D SSFHLIGPS     TS D+KRKFS R G+EGSLKCGST KCHCSKK       RKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPR
Subjt:  -ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPR

Query:  GYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
        GYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
Subjt:  GYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH

XP_004154104.1 probable WRKY transcription factor 21 isoform X1 [Cucumis sativus]1.2e-12772.47Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPA-----DQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQ
        MEEVEEAN++A+ESCHGVLNLL QP       Q   +NLM+ET EAVFKF+KV+SLLNS   H R R  NKI LPLPQN+LLD P + L   N+NL  S 
Subjt:  MEEVEEANRAALESCHGVLNLLAQPA-----DQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQ

Query:  SGLNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH----------QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSF
         G N KVS  LG PDLEL  NDKN L IP+Q+PS    FP H          QQQQ +L  QKQ+K Q+EMMFLRNNN GMNLNFDTSNCT+TMSSARSF
Subjt:  SGLNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH----------QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSF

Query:  ISSLSMDGSV-ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQ
        ISSLSMDGSV  D SSFHLIGPS     TS ++KRKFS R G+EGSLKCGST KCHCSKK       RKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQ
Subjt:  ISSLSMDGSV-ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQ

Query:  KPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
        KPIKGSPHPRGYYKCSS+RGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
Subjt:  KPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH

XP_008449343.1 PREDICTED: probable WRKY transcription factor 21 [Cucumis melo]1.3e-12673.22Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQ----PADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQS
        MEEVEEA ++A+ESCHGVLNLL Q    P  Q   +NLMVET EAVFKF+KV+SLLNS   H R R  NKI LPLPQN+LLD P + L   N+NL     
Subjt:  MEEVEEANRAALESCHGVLNLLAQ----PADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQS

Query:  GLNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH------QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLS
        G N KVS FLG PDLEL  NDKN L IP+Q+PS    FP H      QQQQ +L  QKQ+KQQ+EM FLRNNN GMNLNFDTSNCTLTMSSARSFISSLS
Subjt:  GLNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH------QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLS

Query:  MDGSV-ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKG
        MDGSV  D SSFHLIGPS     TS ++KRKFS R G+EGSLKCGST KCHCSKK       RKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKG
Subjt:  MDGSV-ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKG

Query:  SPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
        SPHPRGYYKCSS+RGCPARKHVERCLEDPSMLIVTYEGEH+HPKM TQSAH
Subjt:  SPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH

XP_022146052.1 probable WRKY transcription factor 21 [Momordica charantia]1.2e-18097.6Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSGLNG
        MEEVEEANR ALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSGLNG
Subjt:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSGLNG

Query:  KVSTFLGIPDLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVADGSSFHL
        KVSTFLGIPDLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVADGSSFHL
Subjt:  KVSTFLGIPDLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVADGSSFHL

Query:  IGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPAR
        IGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKK       RKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPAR
Subjt:  IGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPAR

Query:  KHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHP
        KHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHP
Subjt:  KHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHP

XP_038886893.1 probable WRKY transcription factor 21 isoform X1 [Benincasa hispida]3.8e-13980.52Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPA-DQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSGLN
        MEEVEEAN+AA+ESCHGVLNLL QP+ DQ+  RNLMVET EAVFKF+KV+SLLNSG GHARVR  NKI LPLP+ +LLD   +     N+NL+P   GLN
Subjt:  MEEVEEANRAALESCHGVLNLLAQPA-DQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSGLN

Query:  GKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FP--QHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSV--
        GKVS FLG PDLEL PNDKN LQIP+Q PS    FP  Q QQQQRIL  QKQ+KQQ+EMMFLRNNN GMNLNFDTSN TLTMSSARSFISSLSMDGSV  
Subjt:  GKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FP--QHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSV--

Query:  ADGSSFHLIGPS----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGY
        AD SSFHLIGPS    TSADNKRKFS R GDEGSLKCGSTGKCHCSKK       RKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGY
Subjt:  ADGSSFHLIGPS----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGY

Query:  YKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
        YKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
Subjt:  YKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH

TrEMBL top hitse value%identityAlignment
A0A1S3BLU2 probable WRKY transcription factor 216.2e-12773.22Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQ----PADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQS
        MEEVEEA ++A+ESCHGVLNLL Q    P  Q   +NLMVET EAVFKF+KV+SLLNS   H R R  NKI LPLPQN+LLD P + L   N+NL     
Subjt:  MEEVEEANRAALESCHGVLNLLAQ----PADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQS

Query:  GLNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH------QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLS
        G N KVS FLG PDLEL  NDKN L IP+Q+PS    FP H      QQQQ +L  QKQ+KQQ+EM FLRNNN GMNLNFDTSNCTLTMSSARSFISSLS
Subjt:  GLNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH------QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLS

Query:  MDGSV-ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKG
        MDGSV  D SSFHLIGPS     TS ++KRKFS R G+EGSLKCGST KCHCSKK       RKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKG
Subjt:  MDGSV-ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKG

Query:  SPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
        SPHPRGYYKCSS+RGCPARKHVERCLEDPSMLIVTYEGEH+HPKM TQSAH
Subjt:  SPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH

A0A455PAJ4 WRKY213.1e-12675.07Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPADQIQ---LRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSG
        MEE+EEANRAA++SCHGVLNLLAQP+ Q Q    +NLMVETGEAVFK +KVVSLLNSG G+A+VR+   I LPLPQ  LLD              P  +G
Subjt:  MEEVEEANRAALESCHGVLNLLAQPADQIQ---LRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSG

Query:  LNGKVSTFLGIPDLELGPNDKNFLQIPRQ-APSF---FPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVA
        LNGKVS FLG PDLE+  N KN L+I +Q APS    FPQ QQQQR         +Q+EMMFLR +NSGMN+NFD S CTLTMSSARSFISSLSMDGSVA
Subjt:  LNGKVSTFLGIPDLELGPNDKNFLQIPRQ-APSF---FPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVA

Query:  DGSSFHLIGPST--SADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKC
        DGSSFHLIGPS+  S DNKR+FSGR GDEGSLKCGSTGKCHCSKK       RKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKC
Subjt:  DGSSFHLIGPST--SADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKC

Query:  SSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
        SSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
Subjt:  SSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH

A0A6H0CEQ4 WRKY20 protein3.2e-13175.72Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPA---DQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSG
        MEEVEEAN++A+ESCHGVLNLL QP     Q   +NLM+ET EAVFKF+KV+SLLNS   H R R  NKI LPLPQN+LLD P + L   N+NL     G
Subjt:  MEEVEEANRAALESCHGVLNLLAQPA---DQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSG

Query:  LNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH--QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSV
         N KVS FLG PDLEL  NDKN L IP+Q+PS    FP H  QQQQ +L  QKQ+KQQ+EMMFLRNNN GMNLNFDTSNCTLTMSSARSFISSLSMDGSV
Subjt:  LNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH--QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSV

Query:  -ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPR
          D SSFHLIGPS     TS D+KRKFS R G+EGSLKCGST KCHCSKK       RKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPR
Subjt:  -ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPR

Query:  GYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
        GYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
Subjt:  GYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH

A0A6J1CYE8 probable WRKY transcription factor 215.7e-18197.6Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSGLNG
        MEEVEEANR ALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSGLNG
Subjt:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSGLNG

Query:  KVSTFLGIPDLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVADGSSFHL
        KVSTFLGIPDLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVADGSSFHL
Subjt:  KVSTFLGIPDLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVADGSSFHL

Query:  IGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPAR
        IGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKK       RKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPAR
Subjt:  IGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPAR

Query:  KHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHP
        KHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHP
Subjt:  KHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHP

E7CEW1 WRKY protein5.6e-12872.47Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPA-----DQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQ
        MEEVEEAN++A+ESCHGVLNLL QP       Q   +NLM+ET EAVFKF+KV+SLLNS   H R R  NKI LPLPQN+LLD P + L   N+NL  S 
Subjt:  MEEVEEANRAALESCHGVLNLLAQPA-----DQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQ

Query:  SGLNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH----------QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSF
         G N KVS  LG PDLEL  NDKN L IP+Q+PS    FP H          QQQQ +L  QKQ+K Q+EMMFLRNNN GMNLNFDTSNCT+TMSSARSF
Subjt:  SGLNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSF---FPQH----------QQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSF

Query:  ISSLSMDGSV-ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQ
        ISSLSMDGSV  D SSFHLIGPS     TS ++KRKFS R G+EGSLKCGST KCHCSKK       RKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQ
Subjt:  ISSLSMDGSV-ADGSSFHLIGPS-----TSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQ

Query:  KPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
        KPIKGSPHPRGYYKCSS+RGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH
Subjt:  KPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAH

SwissProt top hitse value%identityAlignment
O04336 Probable WRKY transcription factor 215.9e-8249.87Show/hide
Query:  MEEVEEANRAALESCHGVLNLL--AQPADQIQL-RNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQ-----NQNLH
        MEE+E  NRAA+ESCH VLNLL  +Q  D +   +NL+ ET EAV +F++V SLL+S +GHAR R+  K+   + Q+ LLD P     T+     +Q   
Subjt:  MEEVEEANRAALESCHGVLNLL--AQPADQIQL-RNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQ-----NQNLH

Query:  PSQSGL------NGKVSTFLGIPDLELGPNDK-NFLQI------PRQAPSFFPQHQQQQRILPQQKQIKQQSEM--------------------------
          +SG           S  LG     L  N K   LQ+      P   P+ FP  QQQQ+   QQ+Q +QQ +                           
Subjt:  PSQSGL------NGKVSTFLGIPDLELGPNDK-NFLQI------PRQAPSFFPQHQQQQRILPQQKQIKQQSEM--------------------------

Query:  -MFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVAD---GSSFHLIGPSTSADN----KRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRK
         + LR  N G++L+FD S+CT TMSS RSF+SSLS+DGSVA+    +SFH   PS++  N    KRK   +G + GSLKCGS+ +CHC+KK       RK
Subjt:  -MFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVAD---GSSFHLIGPSTSADN----KRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRK

Query:  HRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQS
        HRV+RSI+VPAISNK+ADIP DDYSWRKYGQKPIKGSP+PRGYYKCSSMRGCPARKHVERCLEDP+MLIVTYE EHNHPK+ +Q+
Subjt:  HRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQS

Q0JEE2 WRKY transcription factor WRKY517.2e-4051.27Show/hide
Query:  GMNLNFDTSNCTLTMSSARSFISSLSMDGSVADG-----SSFHLIGP-----------------STSADNKRKFSGRGGDEGSL--KCGST-GKCHCSKK
        G +  F  S  +   +++ SF+SS++ DGSV++G     SS  L  P                 S  A +KRK       E     K GST G+CHCSK+
Subjt:  GMNLNFDTSNCTLTMSSARSFISSLSMDGSVADG-----SSFHLIGP-----------------STSADNKRKFSGRGGDEGSL--KCGST-GKCHCSKK

Query:  SELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHP
               RKHRVKR+I+VPAIS+K+ADIP+DD+SWRKYGQKPIKGSP PRGYYKCS++RGCPARKHVER   DPSMLIVTYEGEH H   +    HP
Subjt:  SELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHP

Q32SG4 Protein WRKY12.7e-6343.28Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPADQ-IQLRNLMVETGEAVFKFRKVVSLLNS----GL--GHARVRKLNKISLPLPQNTLLD---------------
        ME +EEANR A+ESCH VL LL+ P  Q +  + L+  TGEAV KF  + + L++    GL  GHARVRK+ K       N  L+               
Subjt:  MEEVEEANRAALESCHGVLNLLAQPADQ-IQLRNLMVETGEAVFKFRKVVSLLNS----GL--GHARVRKLNKISLPLPQNTLLD---------------

Query:  ---------YPIHH------------LPTQNQN---LHPSQSGLNGKVSTFLGI-----PDLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQ
                 +P +H            +PTQ      L    +G+ G  S    I     P     P       +P     F  Q Q  QR    Q Q+K 
Subjt:  ---------YPIHH------------LPTQNQN---LHPSQSGLNGKVSTFLGI-----PDLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQ

Query:  QSEMMFLRN--------------NNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVA--DGS----SFHLIGPSTSAD-------NKRKFSGRGGDEGS
        QSEMM   N                 G+NL FD+SNC  T SS+RSF+SSLSM+GS+A  DGS     F L+  S +A         +R+ +GR  ++G+
Subjt:  QSEMMFLRN--------------NNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVA--DGS----SFHLIGPSTSAD-------NKRKFSGRGGDEGS

Query:  LKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHN
         +C +  +CHCSKK       RK R++RSIKVPAISNK+ADIP+D++SWRKYGQKPIKGSPHPRGYYKCSS+RGCPARKHVERC++DPSMLIVTYEG+HN
Subjt:  LKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHN

Query:  HPKMSTQSA
        H ++  Q A
Subjt:  HPKMSTQSA

Q93WU6 Probable WRKY transcription factor 743.4e-7450.28Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNK-----ISLPLPQNTLLDYPIHHLPTQNQNLHPSQ
        MEEVE AN+AA+ESCHGVLNLL+Q  +    +++MVET EAV KF++V SLL+ GLG  +++KLN       S  LPQ+  L+ P+            S 
Subjt:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNK-----ISLPLPQNTLLDYPIHHLPTQNQNLHPSQ

Query:  SGLNGKVSTF---------LGIPDLELGPN----DKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTS---NC--TLTMS
        + ++G +             G P L L       DK+FL++  + PS        + + P+  Q     +      + SG+NL FD S   +C      +
Subjt:  SGLNGKVSTF---------LGIPDLELGPN----DKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTS---NC--TLTMS

Query:  SARSFISSLSMDGSVA--DGSSFHLIGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYG
         +RSF+SSLSMDGSV   D +SFHLIG    +D+  + S R    GSLKCGS  KCHCSKK       RK RVKRSIKVPAISNK+ADIP D+YSWRKYG
Subjt:  SARSFISSLSMDGSVA--DGSSFHLIGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYG

Query:  QKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAH
        QKPIKGSPHPRGYYKCSS+RGCPARKHVERC+E+ SMLIVTYEGEHNH + +S+QSAH
Subjt:  QKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAH

Q9SR07 Probable WRKY transcription factor 395.3e-7550.85Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPI-----------HHLPTQNQ
        MEEVE ANR+A+ESCHGVLNLL+Q       ++L VETGE V KF++V SLL  GLGH + R  NK     PQ+  L+ PI             L  +  
Subjt:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPI-----------HHLPTQNQ

Query:  NLHPSQSGLNG-KVSTFLGIPDLELGPN---DKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTL----TMSSARS
         + P+ +  N  +    LG P L L      DK+FL++  + P F   +Q    ++   +QI           +NSG+NL FD S  +       + +RS
Subjt:  NLHPSQSGLNG-KVSTFLGIPDLELGPN---DKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTL----TMSSARS

Query:  FISSLSMDGSVA--DGSSFHLIGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPI
        F+SSLSMD SV   D +SFHL G S  +D +     R    GSLKCGS  KCHCSKK       RK RVKRSIKVPAISNK+ADIP D+YSWRKYGQKPI
Subjt:  FISSLSMDGSVA--DGSSFHLIGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPI

Query:  KGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAH
        KGSPHPRGYYKCSS+RGCPARKHVERC+++ SMLIVTYEGEHNH + +S+QSAH
Subjt:  KGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAH

Arabidopsis top hitse value%identityAlignment
AT2G24570.1 WRKY DNA-binding protein 171.5e-4037.93Show/hide
Query:  QMEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVE--TGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSG
        +ME+      AA +    + +L+   +++ + RN+     T   V KF+KV+SLLN   GHAR R+                P+H          P  S 
Subjt:  QMEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVE--TGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSG

Query:  LNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISS-LSMDGSVADGS
        +   V      P            QI   AP  F Q  QQ   L   +        +F     S   + F  +  + ++SS  SF+SS ++ DGSV+ GS
Subjt:  LNGKVSTFLGIPDLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISS-LSMDGSVADGS

Query:  SFHLIG----PSTSAD---------NKRKF--SGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKG
        S  L      P TS+           KR F      G  G +     GKCHC K        RK+R+KR+++VPA+S K+ADIP D+YSWRKYGQKPIKG
Subjt:  SFHLIG----PSTSAD---------NKRKF--SGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKG

Query:  SPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQ
        SPHPRGYYKCS+ RGCPARKHVER L+D +MLIVTYEGEH H + + Q
Subjt:  SPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQ

AT2G30590.1 WRKY DNA-binding protein 214.2e-8349.87Show/hide
Query:  MEEVEEANRAALESCHGVLNLL--AQPADQIQL-RNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQ-----NQNLH
        MEE+E  NRAA+ESCH VLNLL  +Q  D +   +NL+ ET EAV +F++V SLL+S +GHAR R+  K+   + Q+ LLD P     T+     +Q   
Subjt:  MEEVEEANRAALESCHGVLNLL--AQPADQIQL-RNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQ-----NQNLH

Query:  PSQSGL------NGKVSTFLGIPDLELGPNDK-NFLQI------PRQAPSFFPQHQQQQRILPQQKQIKQQSEM--------------------------
          +SG           S  LG     L  N K   LQ+      P   P+ FP  QQQQ+   QQ+Q +QQ +                           
Subjt:  PSQSGL------NGKVSTFLGIPDLELGPNDK-NFLQI------PRQAPSFFPQHQQQQRILPQQKQIKQQSEM--------------------------

Query:  -MFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVAD---GSSFHLIGPSTSADN----KRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRK
         + LR  N G++L+FD S+CT TMSS RSF+SSLS+DGSVA+    +SFH   PS++  N    KRK   +G + GSLKCGS+ +CHC+KK       RK
Subjt:  -MFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVAD---GSSFHLIGPSTSADN----KRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRK

Query:  HRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQS
        HRV+RSI+VPAISNK+ADIP DDYSWRKYGQKPIKGSP+PRGYYKCSSMRGCPARKHVERCLEDP+MLIVTYE EHNHPK+ +Q+
Subjt:  HRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQS

AT3G04670.1 WRKY DNA-binding protein 393.8e-7650.85Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPI-----------HHLPTQNQ
        MEEVE ANR+A+ESCHGVLNLL+Q       ++L VETGE V KF++V SLL  GLGH + R  NK     PQ+  L+ PI             L  +  
Subjt:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPI-----------HHLPTQNQ

Query:  NLHPSQSGLNG-KVSTFLGIPDLELGPN---DKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTL----TMSSARS
         + P+ +  N  +    LG P L L      DK+FL++  + P F   +Q    ++   +QI           +NSG+NL FD S  +       + +RS
Subjt:  NLHPSQSGLNG-KVSTFLGIPDLELGPN---DKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTL----TMSSARS

Query:  FISSLSMDGSVA--DGSSFHLIGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPI
        F+SSLSMD SV   D +SFHL G S  +D +     R    GSLKCGS  KCHCSKK       RK RVKRSIKVPAISNK+ADIP D+YSWRKYGQKPI
Subjt:  FISSLSMDGSVA--DGSSFHLIGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPI

Query:  KGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAH
        KGSPHPRGYYKCSS+RGCPARKHVERC+++ SMLIVTYEGEHNH + +S+QSAH
Subjt:  KGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAH

AT3G04670.2 WRKY DNA-binding protein 397.7e-5346.62Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPI-----------HHLPTQNQ
        MEEVE ANR+A+ESCHGVLNLL+Q       ++L VETGE V KF++V SLL  GLGH + R  NK     PQ+  L+ PI             L  +  
Subjt:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPI-----------HHLPTQNQ

Query:  NLHPSQSGLNG-KVSTFLGIPDLELGPN---DKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTL----TMSSARS
         + P+ +  N  +    LG P L L      DK+FL++  + P F   +Q    ++   +QI           +NSG+NL FD S  +       + +RS
Subjt:  NLHPSQSGLNG-KVSTFLGIPDLELGPN---DKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTL----TMSSARS

Query:  FISSLSMDGSVA--DGSSFHLIGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPI
        F+SSLSMD SV   D +SFHL G S  +D +     R    GSLKCGS  KCHCSKK       RK RVKRSIKVPAISNK+ADIP D+YSWRKYGQKPI
Subjt:  FISSLSMDGSVA--DGSSFHLIGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPI

Query:  KGSPHPRGYYK
        KGSPHPR  YK
Subjt:  KGSPHPRGYYK

AT5G28650.1 WRKY DNA-binding protein 742.4e-7550.28Show/hide
Query:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNK-----ISLPLPQNTLLDYPIHHLPTQNQNLHPSQ
        MEEVE AN+AA+ESCHGVLNLL+Q  +    +++MVET EAV KF++V SLL+ GLG  +++KLN       S  LPQ+  L+ P+            S 
Subjt:  MEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNK-----ISLPLPQNTLLDYPIHHLPTQNQNLHPSQ

Query:  SGLNGKVSTF---------LGIPDLELGPN----DKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTS---NC--TLTMS
        + ++G +             G P L L       DK+FL++  + PS        + + P+  Q     +      + SG+NL FD S   +C      +
Subjt:  SGLNGKVSTF---------LGIPDLELGPN----DKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTS---NC--TLTMS

Query:  SARSFISSLSMDGSVA--DGSSFHLIGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYG
         +RSF+SSLSMDGSV   D +SFHLIG    +D+  + S R    GSLKCGS  KCHCSKK       RK RVKRSIKVPAISNK+ADIP D+YSWRKYG
Subjt:  SARSFISSLSMDGSVA--DGSSFHLIGPSTSADNKRKFSGRGGDEGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYG

Query:  QKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAH
        QKPIKGSPHPRGYYKCSS+RGCPARKHVERC+E+ SMLIVTYEGEHNH + +S+QSAH
Subjt:  QKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CAAATGGAAGAGGTGGAGGAAGCTAACAGAGCAGCTTTAGAGAGCTGCCATGGAGTTCTGAATCTCTTGGCTCAACCTGCAGACCAAATTCAGTTGAGGAATTTAATGGT
GGAAACTGGGGAAGCTGTTTTCAAGTTCAGGAAAGTGGTTTCTCTTTTGAATTCTGGTTTGGGTCATGCTAGAGTCAGAAAACTCAACAAGATTTCTCTCCCTTTGCCCC
AAAACACCCTCTTGGATTACCCAATTCACCATCTTCCAACCCAAAATCAAAATCTCCACCCTTCTCAGTCTGGTTTGAATGGTAAAGTTTCAACTTTTCTGGGAATCCCA
GATTTAGAATTGGGTCCAAATGACAAGAACTTCCTCCAAATTCCCAGACAAGCTCCATCTTTCTTCCCTCAACACCAACAACAACAGAGGATTTTGCCTCAGCAGAAACA
GATTAAACAGCAATCAGAAATGATGTTTCTTAGGAACAACAACAGCGGCATGAATCTGAATTTTGACACATCTAACTGCACATTGACAATGTCATCAGCTAGATCTTTCA
TTTCTTCGTTGAGTATGGACGGGAGCGTGGCGGACGGAAGCTCATTCCACTTGATCGGACCGTCGACGTCGGCCGATAACAAGAGGAAGTTTTCAGGAAGGGGAGGAGAT
GAGGGGAGCTTGAAATGTGGAAGCACTGGCAAATGCCACTGCTCAAAAAAGAGTGAATTAAGTTCTTTTTGTAGGAAACATAGGGTGAAAAGATCAATCAAAGTGCCTGC
TATAAGTAACAAGCTTGCAGATATCCCTTCTGATGATTATTCATGGAGGAAATATGGGCAGAAGCCAATTAAGGGCTCTCCTCATCCCCGGGGTTACTACAAATGCAGCA
GCATGAGAGGGTGTCCAGCGAGAAAGCACGTCGAACGGTGCTTAGAAGACCCGTCGATGCTTATCGTAACATACGAAGGAGAACACAATCACCCGAAAATGTCGACTCAA
TCTGCACACCCT
mRNA sequenceShow/hide mRNA sequence
CAAATGGAAGAGGTGGAGGAAGCTAACAGAGCAGCTTTAGAGAGCTGCCATGGAGTTCTGAATCTCTTGGCTCAACCTGCAGACCAAATTCAGTTGAGGAATTTAATGGT
GGAAACTGGGGAAGCTGTTTTCAAGTTCAGGAAAGTGGTTTCTCTTTTGAATTCTGGTTTGGGTCATGCTAGAGTCAGAAAACTCAACAAGATTTCTCTCCCTTTGCCCC
AAAACACCCTCTTGGATTACCCAATTCACCATCTTCCAACCCAAAATCAAAATCTCCACCCTTCTCAGTCTGGTTTGAATGGTAAAGTTTCAACTTTTCTGGGAATCCCA
GATTTAGAATTGGGTCCAAATGACAAGAACTTCCTCCAAATTCCCAGACAAGCTCCATCTTTCTTCCCTCAACACCAACAACAACAGAGGATTTTGCCTCAGCAGAAACA
GATTAAACAGCAATCAGAAATGATGTTTCTTAGGAACAACAACAGCGGCATGAATCTGAATTTTGACACATCTAACTGCACATTGACAATGTCATCAGCTAGATCTTTCA
TTTCTTCGTTGAGTATGGACGGGAGCGTGGCGGACGGAAGCTCATTCCACTTGATCGGACCGTCGACGTCGGCCGATAACAAGAGGAAGTTTTCAGGAAGGGGAGGAGAT
GAGGGGAGCTTGAAATGTGGAAGCACTGGCAAATGCCACTGCTCAAAAAAGAGTGAATTAAGTTCTTTTTGTAGGAAACATAGGGTGAAAAGATCAATCAAAGTGCCTGC
TATAAGTAACAAGCTTGCAGATATCCCTTCTGATGATTATTCATGGAGGAAATATGGGCAGAAGCCAATTAAGGGCTCTCCTCATCCCCGGGGTTACTACAAATGCAGCA
GCATGAGAGGGTGTCCAGCGAGAAAGCACGTCGAACGGTGCTTAGAAGACCCGTCGATGCTTATCGTAACATACGAAGGAGAACACAATCACCCGAAAATGTCGACTCAA
TCTGCACACCCT
Protein sequenceShow/hide protein sequence
QMEEVEEANRAALESCHGVLNLLAQPADQIQLRNLMVETGEAVFKFRKVVSLLNSGLGHARVRKLNKISLPLPQNTLLDYPIHHLPTQNQNLHPSQSGLNGKVSTFLGIP
DLELGPNDKNFLQIPRQAPSFFPQHQQQQRILPQQKQIKQQSEMMFLRNNNSGMNLNFDTSNCTLTMSSARSFISSLSMDGSVADGSSFHLIGPSTSADNKRKFSGRGGD
EGSLKCGSTGKCHCSKKSELSSFCRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQ
SAHP