; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002288 (gene) of Snake gourd v1 genome

Gene IDTan0002288
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionWRKY protein
Genome locationLG01:100673442..100676953
RNA-Seq ExpressionTan0002288
SyntenyTan0002288
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR003657 - WRKY domain
IPR036576 - WRKY domain superfamily
IPR044810 - WRKY transcription factor, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008452856.1 PREDICTED: probable WRKY transcription factor 40 isoform X1 [Cucumis melo]1.8e-15790.55Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        MDIFLDLNVDP SSYANS MD+A  SSQKR+   GE+Y  +EKLSLSL+NKG SDS+PTLE+ELDRKI+ENGKLSQMLR MYEKY+NLQKQVMYLLS QK
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  QNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
        QN+E+E V SRKRKAEG +E+YENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
Subjt:  QNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP

Query:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS
        VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESA   EAAVWQEFLVQQMAS
Subjt:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS

Query:  SLKKDPEFAGIVAGAISGKVLGNQTNRE
        SLKKDPEFAGIVAGAISGKVLGNQTNRE
Subjt:  SLKKDPEFAGIVAGAISGKVLGNQTNRE

XP_008452857.1 PREDICTED: probable WRKY transcription factor 40 isoform X2 [Cucumis melo]8.9e-15790.24Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        MDIFLDLNVDP SSYANS MD+A  SS++ +FD GE+Y  +EKLSLSL+NKG SDS+PTLE+ELDRKI+ENGKLSQMLR MYEKY+NLQKQVMYLLS QK
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  QNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
        QN+E+E V SRKRKAEG +E+YENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
Subjt:  QNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP

Query:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS
        VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESA   EAAVWQEFLVQQMAS
Subjt:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS

Query:  SLKKDPEFAGIVAGAISGKVLGNQTNRE
        SLKKDPEFAGIVAGAISGKVLGNQTNRE
Subjt:  SLKKDPEFAGIVAGAISGKVLGNQTNRE

XP_011654231.1 probable WRKY transcription factor 40 isoform X1 [Cucumis sativus]3.4e-15689.97Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQ-ALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQ
        MDIFLDLNVDP SSYANS MD+ A  SSQKR+   GE+Y DKEKL+LSL+NKG S+S+PTLE+ELDRKI+ENGKLSQMLR MYEKY+NLQKQVMYLLS Q
Subjt:  MDIFLDLNVDPTSSYANSAMDQ-ALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQ

Query:  KQNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNC
        KQ++E+E V SRKRKAEG +E+YENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNC
Subjt:  KQNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNC

Query:  PVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMA
        PVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESA   EAAVWQEFLVQQMA
Subjt:  PVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMA

Query:  SSLKKDPEFAGIVAGAISGKVLGNQTNRE
        SSLKKDPEFAGIVAGAISGKVLGNQTNRE
Subjt:  SSLKKDPEFAGIVAGAISGKVLGNQTNRE

XP_022940049.1 probable WRKY transcription factor 40 isoform X1 [Cucurbita moschata]3.1e-15789.94Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        MDIFLDLN++P+SSY NSAMD+AL SSQKRE +A E+  D+EKL+LSLANKG SD   TLEE+LDRKIKENG+LSQMLR+MYEKYMNLQKQVMYL++QQK
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  -QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
         Q+SEIE VSRKR+AEG EEEYENLEGICS+RDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
Subjt:  -QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP

Query:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS
        VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGK SAVPVLATIKPSCA+VTLDLIHEDGLFKSPKDYA+SESA STEAAVWQEFLVQQMAS
Subjt:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS

Query:  SLKKDPEFAGIVAGAISGKVLGNQTNRE
        SLKKDPEFAGIVAGAISGKVLGNQTNRE
Subjt:  SLKKDPEFAGIVAGAISGKVLGNQTNRE

XP_022982149.1 probable WRKY transcription factor 40 isoform X1 [Cucurbita maxima]2.3e-15790.24Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        MDIFLDLN++P+SSY NSAMD+AL SSQKRE +A E   D+EKL+LSLANKG SD   TLEE+LDRKIKENG+LSQMLR+MYEKYMNLQKQVMYL++QQK
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  -QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
         Q+SEIEAVSRKR+AEG EEEYENLEGICS+RDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
Subjt:  -QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP

Query:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS
        VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGK SAVPVLATIKPSCA+VTLDLIHEDGLFKSPKDYA+SESA STEAAVWQEFLVQQMAS
Subjt:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS

Query:  SLKKDPEFAGIVAGAISGKVLGNQTNRE
        SLKKDPEFAGIVAGAISGKVLGNQTNRE
Subjt:  SLKKDPEFAGIVAGAISGKVLGNQTNRE

TrEMBL top hitse value%identityAlignment
A0A1S3BUA4 probable WRKY transcription factor 40 isoform X24.3e-15790.24Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        MDIFLDLNVDP SSYANS MD+A  SS++ +FD GE+Y  +EKLSLSL+NKG SDS+PTLE+ELDRKI+ENGKLSQMLR MYEKY+NLQKQVMYLLS QK
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  QNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
        QN+E+E V SRKRKAEG +E+YENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
Subjt:  QNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP

Query:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS
        VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESA   EAAVWQEFLVQQMAS
Subjt:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS

Query:  SLKKDPEFAGIVAGAISGKVLGNQTNRE
        SLKKDPEFAGIVAGAISGKVLGNQTNRE
Subjt:  SLKKDPEFAGIVAGAISGKVLGNQTNRE

A0A1S3BW19 probable WRKY transcription factor 40 isoform X18.7e-15890.55Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        MDIFLDLNVDP SSYANS MD+A  SSQKR+   GE+Y  +EKLSLSL+NKG SDS+PTLE+ELDRKI+ENGKLSQMLR MYEKY+NLQKQVMYLLS QK
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  QNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
        QN+E+E V SRKRKAEG +E+YENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
Subjt:  QNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP

Query:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS
        VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESA   EAAVWQEFLVQQMAS
Subjt:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS

Query:  SLKKDPEFAGIVAGAISGKVLGNQTNRE
        SLKKDPEFAGIVAGAISGKVLGNQTNRE
Subjt:  SLKKDPEFAGIVAGAISGKVLGNQTNRE

A0A5A7VC91 Putative WRKY transcription factor 40 isoform X18.7e-15890.55Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        MDIFLDLNVDP SSYANS MD+A  SSQKR+   GE+Y  +EKLSLSL+NKG SDS+PTLE+ELDRKI+ENGKLSQMLR MYEKY+NLQKQVMYLLS QK
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  QNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
        QN+E+E V SRKRKAEG +E+YENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
Subjt:  QNSEIEAV-SRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP

Query:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS
        VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESA   EAAVWQEFLVQQMAS
Subjt:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS

Query:  SLKKDPEFAGIVAGAISGKVLGNQTNRE
        SLKKDPEFAGIVAGAISGKVLGNQTNRE
Subjt:  SLKKDPEFAGIVAGAISGKVLGNQTNRE

A0A6J1FIY1 probable WRKY transcription factor 40 isoform X11.5e-15789.94Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        MDIFLDLN++P+SSY NSAMD+AL SSQKRE +A E+  D+EKL+LSLANKG SD   TLEE+LDRKIKENG+LSQMLR+MYEKYMNLQKQVMYL++QQK
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  -QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
         Q+SEIE VSRKR+AEG EEEYENLEGICS+RDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
Subjt:  -QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP

Query:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS
        VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGK SAVPVLATIKPSCA+VTLDLIHEDGLFKSPKDYA+SESA STEAAVWQEFLVQQMAS
Subjt:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS

Query:  SLKKDPEFAGIVAGAISGKVLGNQTNRE
        SLKKDPEFAGIVAGAISGKVLGNQTNRE
Subjt:  SLKKDPEFAGIVAGAISGKVLGNQTNRE

A0A6J1J3S7 probable WRKY transcription factor 40 isoform X11.1e-15790.24Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        MDIFLDLN++P+SSY NSAMD+AL SSQKRE +A E   D+EKL+LSLANKG SD   TLEE+LDRKIKENG+LSQMLR+MYEKYMNLQKQVMYL++QQK
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  -QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
         Q+SEIEAVSRKR+AEG EEEYENLEGICS+RDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP
Subjt:  -QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCP

Query:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS
        VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGK SAVPVLATIKPSCA+VTLDLIHEDGLFKSPKDYA+SESA STEAAVWQEFLVQQMAS
Subjt:  VKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMAS

Query:  SLKKDPEFAGIVAGAISGKVLGNQTNRE
        SLKKDPEFAGIVAGAISGKVLGNQTNRE
Subjt:  SLKKDPEFAGIVAGAISGKVLGNQTNRE

SwissProt top hitse value%identityAlignment
Q0DAJ3 WRKY transcription factor WRKY283.6e-3639.12Show/hide
Query:  LEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLS-----------QQKQNSEIEAVSRKRK----------------------AEGSEEEYENLEG
        LE EL R  +EN KL++MLR++  KY  LQ QV  ++S            Q   SE  +VS  RK                      A  +   Y + + 
Subjt:  LEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLS-----------QQKQNSEIEAVSRKRK----------------------AEGSEEEYENLEG

Query:  ICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHAS
        +  T         +  R +   KVSK FV  D SD SLVVKDGYQWRKYGQKVT+DNP PRAYF+CS AP CPVKKKVQRS +D T+LVATYEGEH+HA 
Subjt:  ICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHAS

Query:  HFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVL
            +   ++    K S         PS A   +    E      P          STE A  ++ L +QMA++L +DP F   +  A+SG++L
Subjt:  HFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVL

Q6EPZ2 WRKY transcription factor WRKY762.7e-3950.85Show/hide
Query:  KVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVL
        KVS+V  + D SD SLVVKDGYQWRKYGQKVTRDNPSPRAYF+C+ AP+CPVKKKVQRS ED ++LVATYEGEH+H         L +  GG G ++P  
Subjt:  KVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVL

Query:  ATIKPSCATVTLDLIHEDGLFK------SPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKV
         +I  S  T+TLDL    G  +       P      E      +  ++  LV+QMAS+L  DP+F G +A AI  K+
Subjt:  ATIKPSCATVTLDLIHEDGLFK------SPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKV

Q6IEK5 WRKY transcription factor WRKY763.5e-3950.85Show/hide
Query:  KVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVL
        KVS+V  + D SD SLVVKDGYQWRKYGQKVTRDNPSPRAYF+C+ AP+CPVKKKVQRS ED ++LVATYEGEH+H         L +  GG G ++P  
Subjt:  KVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVL

Query:  ATIKPSCATVTLDLIHEDGLFK------SPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKV
         +I  S  T+TLDL    G  +       P      E      +  ++  LV+QMAS+L  DP+F G +A AI  K+
Subjt:  ATIKPSCATVTLDLIHEDGLFK------SPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKV

Q9C5T4 WRKY transcription factor 182.2e-4138.19Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        +DI LDLN +P S+         L S+  +        W ++            +S   L EEL+R   EN KL++ML  + E Y  L   +  L S+Q 
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDF----------------NRWL--KRPRLN--GNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVT
           E   +  K++ +  +E      G+ S + E+                 N+ L  KRP  +    +KVS V+V  + SD SL VKDG+QWRKYGQKVT
Subjt:  QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDF----------------NRWL--KRPRLN--GNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVT

Query:  RDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLI---HEDGLFKSPKDYAS
        RDNPSPRAYF+CS AP+CPVKKKVQRS EDP++LVATYEG H+H             N  +G      AT +   +TVTLDL+   H   L K+ +D   
Subjt:  RDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLI---HEDGLFKSPKDYAS

Query:  SESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVL
                    QE L+QQMASSL KD +F   +A AISG+++
Subjt:  SESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVL

Q9SAH7 Probable WRKY transcription factor 401.6e-4441.72Show/hide
Query:  LSLSLANKGISDSNPT--LEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQKQNSEIEAVS--RKRKAEGSEEEYE--NLEGICSTRDEDFNR
        L++ +    + +  PT  L EEL+R   EN KLS+ML  M + Y  L+KQ+M  ++ +   +E + +S  +KRK+   E+ +    + G+  +   D + 
Subjt:  LSLSLANKGISDSNPT--LEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQKQNSEIEAVS--RKRKAEGSEEEYE--NLEGICSTRDEDFNR

Query:  WL---KRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELS-
        +L   +R       KVS+V+ + +ASD +LVVKDGYQWRKYGQKVTRDNPSPRAYFKC+ AP+C VKKKVQRS+ED ++LVATYEGEH+H    Q + + 
Subjt:  WL---KRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELS-

Query:  --LRSINGGKGSAVPVLATIKPSCA--TVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKV
           R I+ G  ++ PV A  + S      T+D+I         K   S  S +       Q+ LV+QMASSL KDP F   +A A++GK+
Subjt:  --LRSINGGKGSAVPVLATIKPSCA--TVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKV

Arabidopsis top hitse value%identityAlignment
AT1G80840.1 WRKY DNA-binding protein 401.2e-4541.72Show/hide
Query:  LSLSLANKGISDSNPT--LEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQKQNSEIEAVS--RKRKAEGSEEEYE--NLEGICSTRDEDFNR
        L++ +    + +  PT  L EEL+R   EN KLS+ML  M + Y  L+KQ+M  ++ +   +E + +S  +KRK+   E+ +    + G+  +   D + 
Subjt:  LSLSLANKGISDSNPT--LEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQKQNSEIEAVS--RKRKAEGSEEEYE--NLEGICSTRDEDFNR

Query:  WL---KRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELS-
        +L   +R       KVS+V+ + +ASD +LVVKDGYQWRKYGQKVTRDNPSPRAYFKC+ AP+C VKKKVQRS+ED ++LVATYEGEH+H    Q + + 
Subjt:  WL---KRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELS-

Query:  --LRSINGGKGSAVPVLATIKPSCA--TVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKV
           R I+ G  ++ PV A  + S      T+D+I         K   S  S +       Q+ LV+QMASSL KDP F   +A A++GK+
Subjt:  --LRSINGGKGSAVPVLATIKPSCA--TVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKV

AT2G25000.1 WRKY DNA-binding protein 604.4e-3740.91Show/hide
Query:  LEEELDRKIKENGKLSQMLRSMYEKYM---NLQKQVMYLLSQQKQNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQ
        L++E++R   EN KL++ML  + EKY    NL +++    S +  N + + ++ KRK E  +E   +  G+     E+          N  + VS  +  
Subjt:  LEEELDRKIKENGKLSQMLRSMYEKYM---NLQKQVMYLLSQQKQNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQ

Query:  KDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCA
         + SD SL VKDGYQWRKYGQK+TRDNPSPRAYF+CS +P+C VKKKVQRS EDP+ LVATYEG H+H                 G    V  T+K    
Subjt:  KDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCA

Query:  TVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVL
           LDL+ + GL   P +       +       QE LVQQMASSL KDP+F   +A AISG+++
Subjt:  TVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVL

AT4G01720.1 WRKY family transcription factor8.4e-2034.36Show/hide
Query:  LEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQ------KQNSEIEAVSRKRKAEGSEEEYENLEGICST---------RDEDFNRWL-KRPR
        L+ EL+R  +EN KL  +L  + E Y +LQ++V+     Q      KQ+ ++      +  E    +  N E   +T            D +R   K PR
Subjt:  LEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQ------KQNSEIEAVSRKRKAEGSEEEYENLEGICST---------RDEDFNRWL-KRPR

Query:  LNGNSKVS--------------KVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSH
        ++ N   +              K  V   A   +  V DG QWRKYGQK+ + NP PRAY++C+ A  CPV+K+VQR  ED TIL  TYEG H+H
Subjt:  LNGNSKVS--------------KVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSH

AT4G31800.1 WRKY DNA-binding protein 181.6e-4238.19Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        +DI LDLN +P S+         L S+  +        W ++            +S   L EEL+R   EN KL++ML  + E Y  L   +  L S+Q 
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDF----------------NRWL--KRPRLN--GNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVT
           E   +  K++ +  +E      G+ S + E+                 N+ L  KRP  +    +KVS V+V  + SD SL VKDG+QWRKYGQKVT
Subjt:  QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDF----------------NRWL--KRPRLN--GNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVT

Query:  RDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLI---HEDGLFKSPKDYAS
        RDNPSPRAYF+CS AP+CPVKKKVQRS EDP++LVATYEG H+H             N  +G      AT +   +TVTLDL+   H   L K+ +D   
Subjt:  RDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLI---HEDGLFKSPKDYAS

Query:  SESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVL
                    QE L+QQMASSL KD +F   +A AISG+++
Subjt:  SESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVL

AT4G31800.2 WRKY DNA-binding protein 185.4e-4338.19Show/hide
Query:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK
        +DI LDLN +P S+     +     +  KR+       W ++            +S   L EEL+R   EN KL++ML  + E Y  L   +  L S+Q 
Subjt:  MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQK

Query:  QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDF----------------NRWL--KRPRLN--GNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVT
           E   +  K++ +  +E      G+ S + E+                 N+ L  KRP  +    +KVS V+V  + SD SL VKDG+QWRKYGQKVT
Subjt:  QNSEIEAVSRKRKAEGSEEEYENLEGICSTRDEDF----------------NRWL--KRPRLN--GNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVT

Query:  RDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLI---HEDGLFKSPKDYAS
        RDNPSPRAYF+CS AP+CPVKKKVQRS EDP++LVATYEG H+H             N  +G      AT +   +TVTLDL+   H   L K+ +D   
Subjt:  RDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEGEHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLI---HEDGLFKSPKDYAS

Query:  SESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVL
                    QE L+QQMASSL KD +F   +A AISG+++
Subjt:  SESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATTTTCTTGGACCTTAATGTGGATCCTACTTCTTCTTATGCTAATTCAGCCATGGATCAGGCTCTCGATTCTTCACAGAAAAGAGAGTTTGATGCTGGAGAACT
CTATTGGGATAAGGAAAAACTCTCACTAAGCTTGGCAAATAAGGGAATTAGTGATTCGAACCCGACATTAGAAGAAGAGTTGGATAGAAAAATCAAAGAGAATGGGAAGC
TAAGTCAGATGCTAAGATCAATGTATGAGAAATACATGAATCTTCAGAAGCAAGTGATGTATTTGCTTAGCCAGCAAAAGCAGAACTCAGAAATAGAAGCAGTTTCAAGG
AAGAGAAAGGCAGAAGGATCAGAAGAAGAATATGAGAATTTGGAAGGAATTTGCAGCACAAGGGATGAAGATTTCAACAGGTGGCTTAAAAGGCCAAGACTAAATGGAAA
TTCAAAGGTTTCTAAGGTTTTTGTGCAGAAAGATGCATCAGATCCAAGCTTGGTGGTGAAAGATGGGTATCAATGGAGGAAGTATGGGCAAAAGGTCACAAGAGACAATC
CTTCTCCAAGAGCTTACTTCAAGTGCTCCTCTGCACCAAATTGTCCTGTCAAAAAGAAGGTGCAAAGAAGTTTGGAAGATCCAACAATTTTGGTGGCCACTTACGAAGGA
GAACACAGCCACGCCAGCCATTTTCAAACTGAGCTATCTTTAAGGTCCATCAATGGCGGCAAAGGCAGTGCAGTCCCCGTCTTAGCCACCATCAAGCCGTCGTGCGCCAC
CGTGACCCTCGATTTGATTCACGAAGATGGGCTGTTTAAGAGTCCGAAAGATTACGCATCGTCGGAGTCGGCAGTGTCGACGGAGGCAGCGGTTTGGCAGGAGTTTTTAG
TACAACAAATGGCCTCCTCTTTGAAGAAGGATCCTGAATTTGCTGGCATTGTTGCAGGCGCCATTTCAGGCAAGGTTTTGGGAAACCAAACAAACCGAGAATAA
mRNA sequenceShow/hide mRNA sequence
AATCACTCAATTTTCTGCAAAATCCTTCTTCTTCTTCTTCTTCAAAACCCCATTTTCTTGTTCGTTTTCATGCAATTATCCAAACTTTTAATTAATTAATTGGTCTTTCT
GATCTGCAAAAATGGATATTTTCTTGGACCTTAATGTGGATCCTACTTCTTCTTATGCTAATTCAGCCATGGATCAGGCTCTCGATTCTTCACAGAAAAGAGAGTTTGAT
GCTGGAGAACTCTATTGGGATAAGGAAAAACTCTCACTAAGCTTGGCAAATAAGGGAATTAGTGATTCGAACCCGACATTAGAAGAAGAGTTGGATAGAAAAATCAAAGA
GAATGGGAAGCTAAGTCAGATGCTAAGATCAATGTATGAGAAATACATGAATCTTCAGAAGCAAGTGATGTATTTGCTTAGCCAGCAAAAGCAGAACTCAGAAATAGAAG
CAGTTTCAAGGAAGAGAAAGGCAGAAGGATCAGAAGAAGAATATGAGAATTTGGAAGGAATTTGCAGCACAAGGGATGAAGATTTCAACAGGTGGCTTAAAAGGCCAAGA
CTAAATGGAAATTCAAAGGTTTCTAAGGTTTTTGTGCAGAAAGATGCATCAGATCCAAGCTTGGTGGTGAAAGATGGGTATCAATGGAGGAAGTATGGGCAAAAGGTCAC
AAGAGACAATCCTTCTCCAAGAGCTTACTTCAAGTGCTCCTCTGCACCAAATTGTCCTGTCAAAAAGAAGGTGCAAAGAAGTTTGGAAGATCCAACAATTTTGGTGGCCA
CTTACGAAGGAGAACACAGCCACGCCAGCCATTTTCAAACTGAGCTATCTTTAAGGTCCATCAATGGCGGCAAAGGCAGTGCAGTCCCCGTCTTAGCCACCATCAAGCCG
TCGTGCGCCACCGTGACCCTCGATTTGATTCACGAAGATGGGCTGTTTAAGAGTCCGAAAGATTACGCATCGTCGGAGTCGGCAGTGTCGACGGAGGCAGCGGTTTGGCA
GGAGTTTTTAGTACAACAAATGGCCTCCTCTTTGAAGAAGGATCCTGAATTTGCTGGCATTGTTGCAGGCGCCATTTCAGGCAAGGTTTTGGGAAACCAAACAAACCGAG
AATAAACCCTAAACCCTAAATCCTAAAGGCTTTATGCTCTATATTTGCTTTCGTTTAGCTACATTCTTAGCCTTTTGTGAATATGTGTTTGTTTGGATGAGACAACTTAC
TTAGCTGGAGTTACAATCATGTAAGTTATTTCGGTATTGGTTTTATCTCGAAATGATGGTTCAATTGGCAAAAAATTTAGAGACACTTCAATTAATAAAAATCAATTAGG
TTAAAA
Protein sequenceShow/hide protein sequence
MDIFLDLNVDPTSSYANSAMDQALDSSQKREFDAGELYWDKEKLSLSLANKGISDSNPTLEEELDRKIKENGKLSQMLRSMYEKYMNLQKQVMYLLSQQKQNSEIEAVSR
KRKAEGSEEEYENLEGICSTRDEDFNRWLKRPRLNGNSKVSKVFVQKDASDPSLVVKDGYQWRKYGQKVTRDNPSPRAYFKCSSAPNCPVKKKVQRSLEDPTILVATYEG
EHSHASHFQTELSLRSINGGKGSAVPVLATIKPSCATVTLDLIHEDGLFKSPKDYASSESAVSTEAAVWQEFLVQQMASSLKKDPEFAGIVAGAISGKVLGNQTNRE