CuGenDBv2

CmoCh15G005980 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh15G005980
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: G-box-binding factor 4-like
Genome location: Cmo_Chr15:2909179..2913201
RNA-Seq Expression: CmoCh15G005980
Synteny: CmoCh15G005980
Gene Ontology terms:
    GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
    GO:0005634 - nucleus (cellular component)
    GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
    IPR004827 - Basic-leucine zipper domain
    IPR043452 - Plant bZIP transcription factors
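
The genome location listed in this record follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be parsed and the span length computed as shown here. The helper name `parse_location` is hypothetical and is not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Cmo_Chr15:2909179..2913201")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 4023 bp on Cmo_Chr15; with 0-based or half-open coordinates the length would differ by one.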


Homology
GenBank top hits (columns: hit | e value | % identity; the alignment follows each hit)
KAG6578883.1 G-box-binding factor 4, partial [Cucurbita argyrosperma subsp. sororia] | 1.6e-155 | 98.12
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
        MASYKLLASSNSRNSDL RGSSSSSFSSASLLESQF+SNHLRNNDIPTRN+SHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTT 
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT

Query:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV
         TTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPI ASTFQALDKV
Subjt:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
        EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP

Query:  VVEKRRPPRVIRRVNSMKW
        VVEKRRPPRVIRRVNSMKW
Subjt:  VVEKRRPPRVIRRVNSMKW

XP_022134119.1 G-box-binding factor 4-like isoform X1 [Momordica charantia] | 8.2e-123 | 83.7
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
        MAS KLLASSNSRNSDL RGSSSSS SS+SLL+ QFLSN  RNND  T     S SSMTVDGLL NGYDSNPTE SILLDAQITLVDS NPSS PMN   
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT

Query:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV
              T TNSSAVID+NH T+S AAPKTVDDVWREIVSGERKELKEEV DE ITLED+L++TG +PVEDVKLPQTERLSGGI+SFDPI  + FQALDKV
Subjt:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
        EGSIIGF +GVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREKAERTKERF+QLMEKVIP
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP

Query:  VVEKRRPPRVIRRVNSMKW
        VVEKRRPPRVIRRV+SM W
Subjt:  VVEKRRPPRVIRRVNSMKW

XP_022939348.1 G-box-binding factor 4-like [Cucurbita moschata] | 4.9e-160 | 100
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
        MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT

Query:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV
        ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV
Subjt:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
        EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP

Query:  VVEKRRPPRVIRRVNSMKW
        VVEKRRPPRVIRRVNSMKW
Subjt:  VVEKRRPPRVIRRVNSMKW

XP_022992992.1 G-box-binding factor 4-like [Cucurbita maxima] | 1.4e-151 | 97.18
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
        MASYKLLASSNSRNSDL RGSSSSS SSASLL SQFLSNHLRNNDIPTRN+SHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT

Query:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV
         TTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERK+LKEEV DEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPI ASTFQALDKV
Subjt:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
        EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP

Query:  VVEKRRPPRVIRRVNSMKW
        VVEKRRPP+VIRRVNSMKW
Subjt:  VVEKRRPPRVIRRVNSMKW

XP_023550818.1 G-box-binding factor 4-like [Cucurbita pepo subsp. pepo] | 3.4e-153 | 98.12
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
        MASYKLLASSNSRNSDL RGSSSSS SSASLLESQFLSNHLRNNDIPTRN+SHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT

Query:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV
         TTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEV DEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPI ASTFQALDKV
Subjt:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
        EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP

Query:  VVEKRRPPRVIRRVNSMKW
        VVEKRRPPRVIRRVNSMKW
Subjt:  VVEKRRPPRVIRRVNSMKW

TrEMBL top hits (columns: hit | e value | % identity; the alignment follows each hit)
A0A0A0KBB4 BZIP domain-containing protein | 1.3e-121 | 82.24
Query:  MASYKLLASSNSRNSDLLRG-SSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTT
        MAS KL  SSNSRNSDL RG SSSSS SSASLL+ QFLSNH RN D P+RN      +MTVDGLL N +DSNPTESSILLDAQITLVDS NPSSL ++  
Subjt:  MASYKLLASSNSRNSDLLRG-SSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTT

Query:  TATTTTTTTTNSSAVIDSNHNTSSGA-APKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALD
           TTT TTTNSSAVIDSNHN+SS A  PKTVDDVWREIVSGERKELKEEV +E ITLED+L+++G +PVEDVK PQTERLSGGIFSFDPI ++TFQALD
Subjt:  TATTTTTTTTNSSAVIDSNHNTSSGA-APKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALD

Query:  KVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKV
        K+EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE E+LLREKAERTKER +QLM+KV
Subjt:  KVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKV

Query:  IPVVEKRRPPRVIRRVNSMKW
        IPVVEKRRP RVIRRVNSM+W
Subjt:  IPVVEKRRPPRVIRRVNSMKW

A0A6J1BXX0 G-box-binding factor 4-like isoform X1 | 4.0e-123 | 83.7
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
        MAS KLLASSNSRNSDL RGSSSSS SS+SLL+ QFLSN  RNND  T     S SSMTVDGLL NGYDSNPTE SILLDAQITLVDS NPSS PMN   
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT

Query:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV
              T TNSSAVID+NH T+S AAPKTVDDVWREIVSGERKELKEEV DE ITLED+L++TG +PVEDVKLPQTERLSGGI+SFDPI  + FQALDKV
Subjt:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
        EGSIIGF +GVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREKAERTKERF+QLMEKVIP
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP

Query:  VVEKRRPPRVIRRVNSMKW
        VVEKRRPPRVIRRV+SM W
Subjt:  VVEKRRPPRVIRRVNSMKW

A0A6J1FLE3 G-box-binding factor 4-like | 2.4e-160 | 100
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
        MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT

Query:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV
        ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV
Subjt:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
        EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP

Query:  VVEKRRPPRVIRRVNSMKW
        VVEKRRPPRVIRRVNSMKW
Subjt:  VVEKRRPPRVIRRVNSMKW

A0A6J1HH29 G-box-binding factor 4-like isoform X1 | 7.3e-117 | 80.62
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNND-IPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTT
        MAS KL ASS SRNSDL RGSSSSS SSASLL  QFLSN  RNND   TRN+SHS SSM +DGL+ +GYDSNPTE SILLDAQITLVDS  PS+ PMN  
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNND-IPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTT

Query:  TATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDK
                TTNSSAVIDS HNTSSG APKTVDDVWREIVSGERKELKEEV DE ITLED+L++TG  PVEDVKLPQTERLSGGIFSFD I  S+FQA++K
Subjt:  TATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDK

Query:  VEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVI
        VEGSI+GF +GVDL+GSGGS GRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESL  +LEEE ERLLREKAER+KER +QLMEKVI
Subjt:  VEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVI

Query:  PVVEKRRPPRVIRRVNSMKW
        PVVEKRRPPR IRRVNSMKW
Subjt:  PVVEKRRPPRVIRRVNSMKW

A0A6J1JRH8 G-box-binding factor 4-like | 7.0e-152 | 97.18
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
        MASYKLLASSNSRNSDL RGSSSSS SSASLL SQFLSNHLRNNDIPTRN+SHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTT

Query:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV
         TTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERK+LKEEV DEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPI ASTFQALDKV
Subjt:  ATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
        EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIP

Query:  VVEKRRPPRVIRRVNSMKW
        VVEKRRPP+VIRRVNSMKW
Subjt:  VVEKRRPPRVIRRVNSMKW

SwissProt top hits (columns: hit | e value | % identity; the alignment follows each hit)
P42777 G-box-binding factor 4 | 9.7e-42 | 44.88
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGN---GYDSNPTESSILLDAQITLVDSHNPSSLPMN
        MAS+KL++SS   NSDL R +SSS+ SS S+      S+HLR    P  +  HSR S    G + +     DS P E +I +D                 
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGN---GYDSNPTESSILLDAQITLVDSHNPSSLPMN

Query:  TTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKE--LKEEVPDEFITLEDYLLRT----GVMPVEDVKLPQTERLSG-GIFSFD-PI
                         I   ++ ++G   K+VDDVW+EIVSGE+K   +KEE P++ +TLED+L +     G     DVK+P TERL+  G ++FD P+
Subjt:  TTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKE--LKEEVPDEFITLEDYLLRT----GVMPVEDVKLPQTERLSG-GIFSFD-PI

Query:  -RASTFQALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTK
         R S+FQ    VEGS+            GG   RGKRGR  +E +DKAA QRQ+RMIKNRESAARSRERKQAYQVELE+LA +LEEE E+LL+E  E TK
Subjt:  -RASTFQALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTK

Query:  ERFEQLMEKVIPVVEKRRPP-RVIRRVNSMKW
        ER+++LME +IPV EK RPP R + R +S++W
Subjt:  ERFEQLMEKVIPVVEKRRPP-RVIRRVNSMKW

Q0JHF1 bZIP transcription factor 12 | 4.5e-23 | 36.74
Query:  AAPKTVDDVWREIV-SGERKELKEEVPDEF-------------------ITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKVEGSI
        AAP+T ++VW+EI  +G        VP                      +TLED+L R G +  ++  +       G +                    +
Subjt:  AAPKTVDDVWREIV-SGERKELKEEVPDEF-------------------ITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKVEGSI

Query:  IGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEK
        +GF NG ++ G G +GGR ++ R  ++P+D+AA QRQ+RMIKNRESAARSRERKQAY  ELESL  +LEEE  ++ +E+ E+ ++R ++L E V+PV+ +
Subjt:  IGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEK

Query:  RRPPRVIRRVNSMKW
        +   R +RR NSM+W
Subjt:  RRPPRVIRRVNSMKW

Q6Z312 bZIP transcription factor 23 | 1.5e-05 | 28.48
Query:  GNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAP---------------KTVDDVWREIV------SGERK
        G G D        LL +  T  +SH           A TTTT TT S A  +   + + GA P               KTVD+VWR+++      +    
Subjt:  GNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAP---------------KTVDDVWREIV------SGERK

Query:  ELKEEVPDEF-------ITLEDYLLRTGVMPVEDVKLP---------------------QTERLSGGIFSFDPIRASTFQALDKVEGSI----IGFANGV
           E  P          ITLE++L+R GV+  ED+ +P                     QT  L G    F P+          V G++     G A+ V
Subjt:  ELKEEVPDEF-------ITLEDYLLRTGVMPVEDVKLP---------------------QTERLSGGIFSFDPIRASTFQALDKVEGSI----IGFANGV

Query:  DLIGSGGSGGRGK----------------------RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKE
          +    S G GK                      RGR A   ++K  E+RQRRMIKNRESAARSR+RKQAY +ELE+   +L+E  + L +++ E  ++
Subjt:  DLIGSGGSGGRGK----------------------RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKE

Query:  RFEQLMEKVIPVVEKRRPPRVIRRVNSMKW
        +  +++E++   V        +RR  +  W
Subjt:  RFEQLMEKVIPVVEKRRPPRVIRRVNSMKW

Q9C5Q2 ABSCISIC ACID-INSENSITIVE 5-like protein 3 | 4.5e-07 | 33.72
Query:  NPTESSILLDAQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIV-----SGERKELKEEVPD-EFITLEDYLLRTG
        N   S  L + Q  L  S  P    MN      T         V   +       + KTVD+VWR+I      +G       + P    ITLED LLR G
Subjt:  NPTESSILLDAQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIV-----SGERKELKEEVPD-EFITLEDYLLRTG

Query:  VMPVEDVKLPQTERL---SGG---IFSFDPIRASTFQALDKVEGSIIGFANGVDLIGSGGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSR
        V  V +  +PQ   +   S G    +   P +   F      E          D++  GG     +   R R A E ++K  E+RQ+RMIKNRESAARSR
Subjt:  VMPVEDVKLPQTERL---SGG---IFSFDPIRASTFQALDKVEGSIIGFANGVDLIGSGGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSR

Query:  ERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIRRVNS
         RKQAY  ELE    RLEEE E+L R K           +EK++P      P   +RR NS
Subjt:  ERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIRRVNS

Q9LES3 ABSCISIC ACID-INSENSITIVE 5-like protein 2 | 8.2e-09 | 30.26
Query:  NQSHSRSSMTVDGLLGNGYDSNPTESSILLD---AQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIV----SGER
        N+  S  S+T+D +  +   S     S+ LD     +  V+++ PSS+ +N   A     +   S  +           + KTVD+VW++I      G  
Subjt:  NQSHSRSSMTVDGLLGNGYDSNPTESSILLD---AQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIV----SGER

Query:  KELKEEVPD-EFITLEDYLLRTGVM-----------PVEDVKLPQTERLSGGIF------------------SFDPIRASTFQALDKVEGSIIGFANGVD
         E +++ P    +TLED LL+ GV+           PV          L   I                   +F P   S  QA+   + S++G      
Subjt:  KELKEEVPD-EFITLEDYLLRTGVM-----------PVEDVKLPQTERLSGGIF------------------SFDPIRASTFQALDKVEGSIIGFANGVD

Query:  LIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIR
          G   +   G++  A+ E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEE ERL ++K           +EK++P V    P R +R
Subjt:  LIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIR

Query:  RVNS
        R +S
Subjt:  RVNS

Arabidopsis top hits (columns: hit | e value | % identity; the alignment follows each hit)
AT1G03970.1 G-box binding factor 4 | 6.9e-43 | 44.88
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGN---GYDSNPTESSILLDAQITLVDSHNPSSLPMN
        MAS+KL++SS   NSDL R +SSS+ SS S+      S+HLR    P  +  HSR S    G + +     DS P E +I +D                 
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGN---GYDSNPTESSILLDAQITLVDSHNPSSLPMN

Query:  TTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKE--LKEEVPDEFITLEDYLLRT----GVMPVEDVKLPQTERLSG-GIFSFD-PI
                         I   ++ ++G   K+VDDVW+EIVSGE+K   +KEE P++ +TLED+L +     G     DVK+P TERL+  G ++FD P+
Subjt:  TTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKE--LKEEVPDEFITLEDYLLRT----GVMPVEDVKLPQTERLSG-GIFSFD-PI

Query:  -RASTFQALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTK
         R S+FQ    VEGS+            GG   RGKRGR  +E +DKAA QRQ+RMIKNRESAARSRERKQAYQVELE+LA +LEEE E+LL+E  E TK
Subjt:  -RASTFQALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTK

Query:  ERFEQLMEKVIPVVEKRRPP-RVIRRVNSMKW
        ER+++LME +IPV EK RPP R + R +S++W
Subjt:  ERFEQLMEKVIPVVEKRRPP-RVIRRVNSMKW

AT2G41070.1 Basic-leucine zipper (bZIP) transcription factor family protein | 3.2e-08 | 33.72
Query:  NPTESSILLDAQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIV-----SGERKELKEEVPD-EFITLEDYLLRTG
        N   S  L + Q  L  S  P    MN      T         V   +       + KTVD+VWR+I      +G       + P    ITLED LLR G
Subjt:  NPTESSILLDAQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIV-----SGERKELKEEVPD-EFITLEDYLLRTG

Query:  VMPVEDVKLPQTERL---SGG---IFSFDPIRASTFQALDKVEGSIIGFANGVDLIGSGGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSR
        V  V +  +PQ   +   S G    +   P +   F      E          D++  GG     +   R R A E ++K  E+RQ+RMIKNRESAARSR
Subjt:  VMPVEDVKLPQTERL---SGG---IFSFDPIRASTFQALDKVEGSIIGFANGVDLIGSGGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSR

Query:  ERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIRRVNS
         RKQAY  ELE    RLEEE E+L R K           +EK++P      P   +RR NS
Subjt:  ERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIRRVNS

AT2G41070.3 Basic-leucine zipper (bZIP) transcription factor family protein | 3.2e-08 | 33.72
Query:  NPTESSILLDAQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIV-----SGERKELKEEVPD-EFITLEDYLLRTG
        N   S  L + Q  L  S  P    MN      T         V   +       + KTVD+VWR+I      +G       + P    ITLED LLR G
Subjt:  NPTESSILLDAQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIV-----SGERKELKEEVPD-EFITLEDYLLRTG

Query:  VMPVEDVKLPQTERL---SGG---IFSFDPIRASTFQALDKVEGSIIGFANGVDLIGSGGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSR
        V  V +  +PQ   +   S G    +   P +   F      E          D++  GG     +   R R A E ++K  E+RQ+RMIKNRESAARSR
Subjt:  VMPVEDVKLPQTERL---SGG---IFSFDPIRASTFQALDKVEGSIIGFANGVDLIGSGGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSR

Query:  ERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIRRVNS
         RKQAY  ELE    RLEEE E+L R K           +EK++P      P   +RR NS
Subjt:  ERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIRRVNS

AT3G56850.1 ABA-responsive element binding protein 3 | 5.9e-10 | 30.26
Query:  NQSHSRSSMTVDGLLGNGYDSNPTESSILLD---AQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIV----SGER
        N+  S  S+T+D +  +   S     S+ LD     +  V+++ PSS+ +N   A     +   S  +           + KTVD+VW++I      G  
Subjt:  NQSHSRSSMTVDGLLGNGYDSNPTESSILLD---AQITLVDSHNPSSLPMNTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIV----SGER

Query:  KELKEEVPD-EFITLEDYLLRTGVM-----------PVEDVKLPQTERLSGGIF------------------SFDPIRASTFQALDKVEGSIIGFANGVD
         E +++ P    +TLED LL+ GV+           PV          L   I                   +F P   S  QA+   + S++G      
Subjt:  KELKEEVPD-EFITLEDYLLRTGVM-----------PVEDVKLPQTERLSGGIF------------------SFDPIRASTFQALDKVEGSIIGFANGVD

Query:  LIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIR
          G   +   G++  A+ E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEE ERL ++K           +EK++P V    P R +R
Subjt:  LIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIR

Query:  RVNS
        R +S
Subjt:  RVNS

AT5G44080.1 Basic-leucine zipper (bZIP) transcription factor family protein | 4.0e-59 | 48.81
Query:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQS-HSRSSMTVDGLLGNGYDSN---PTESSILLDAQITLVDSHNPSSLPM
        M S++++ SSNSRNSDL R  SS+S SS+S+   Q     L +     RN   +S +SMTV+G+L + + S+   PTESS LLDA I L+D+   S  PM
Subjt:  MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQS-HSRSSMTVDGLLGNGYDSN---PTESSILLDAQITLVDSHNPSSLPM

Query:  NTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVE---------DVKLPQTERLSGGIFSFD
                  TTT +S V+D    T +    K+VD++WRE+VSGE K +KEE  +E +TLED+L +  V             DVK+P T       + FD
Subjt:  NTTTATTTTTTTTNSSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVE---------DVKLPQTERLSGGIFSFD

Query:  PIRA--STFQALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAE
              + FQ +DKVEGSI+ F NG+D+    G G RGKR R  +EPLDKAA QRQRRMIKNRESAARSRERKQAYQVELE+LA +LEEE E L +E  +
Subjt:  PIRA--STFQALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAE

Query:  RTKERFEQLMEKVIPVVE--KRRPPRVIRRVNSMKW
        + KER+++LME VIPVVE  K++PPR +RR+ S++W
Subjt:  RTKERFEQLMEKVIPVVE--KRRPPRVIRRVNSMKW
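
The % identity values in the hit tables can be related to the unlabelled middle line of each alignment block. As a rough sketch, assuming the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap), identity is the number of letters divided by the number of aligned columns. The function name `percent_identity` is hypothetical, and the value reported for a hit covers all of its blocks together, not a single block.

```python
def percent_identity(midline, aligned_length=None):
    """Percent identity from a BLAST-style alignment midline.

    Letters count as identical positions; '+' and spaces do not.
    Pass aligned_length when trailing spaces may have been stripped.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length

# Final alignment block of the XP_022992992.1 hit: 19 columns, one '+'.
block = "VVEKRRPP+VIRRVNSMKW"
print(round(percent_identity(block), 2))
```

For this single block the sketch gives 18 identical positions out of 19. Midlines whose trailing spaces were lost in extraction need the explicit `aligned_length` argument to avoid overestimating identity.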


Sequences
CDS sequence
ATGGCATCATACAAGCTCCTGGCATCTTCCAATTCGCGAAATTCGGATCTATTGCGAGGTTCTTCTTCTTCTTCTTTCTCTTCTGCTTCTCTTTTAGAATCCCAATTTCT
TTCGAATCACCTGAGGAATAATGACATCCCAACTCGCAATCAAAGCCATAGTCGTAGCTCAATGACCGTCGATGGCTTACTTGGCAATGGGTACGATTCCAATCCCACGG
AGTCGTCCATTCTCTTAGATGCCCAGATTACTCTTGTTGATTCCCATAACCCTTCTTCTCTTCCCATGAATACGACTACAGCAACCACCACCACCACCACCACCACTAAT
TCCTCTGCCGTCATTGATAGCAATCACAATACTTCCTCCGGTGCGGCGCCCAAAACTGTGGATGATGTGTGGAGGGAGATTGTTTCTGGTGAGAGGAAGGAGTTGAAGGA
GGAGGTTCCTGACGAGTTCATAACCCTCGAGGATTATCTCTTGAGAACTGGGGTCATGCCTGTTGAGGATGTCAAATTGCCGCAGACGGAGAGGCTGAGTGGAGGGATTT
TTTCGTTTGATCCAATTCGAGCCAGCACATTTCAGGCCTTGGATAAGGTCGAAGGATCCATTATTGGATTTGCTAATGGGGTCGATTTGATCGGTAGTGGAGGAAGTGGG
GGGAGAGGCAAAAGAGGGCGAGCTGCTTTGGAACCTTTAGATAAGGCTGCAGAGCAAAGACAGAGGAGGATGATCAAGAATAGGGAGTCTGCAGCAAGATCAAGGGAACG
GAAGCAGGCTTATCAAGTGGAATTAGAATCATTAGCTGTAAGACTAGAGGAGGAGAAGGAGCGGCTTTTGAGGGAAAAGGCTGAGAGGACTAAAGAGAGGTTCGAGCAGC
TAATGGAGAAGGTGATTCCTGTGGTTGAAAAGCGACGACCCCCACGAGTAATTCGGCGAGTTAATTCCATGAAATGGTGA
mRNA sequence
TCTTCCTGTCTGCCCTCGCCCAGACCAAAACCAACCACCATAGGCCAAAAAAACAGGAAGCGATCAAAACAAAAACCTTGACGCGCACAATCCAAATCGATAGAGGAAAC
CATAAATTCTTCCCAAATTTTGTTTTTGTCTCTAGAAAGATTCCCACTCGAGTTTCGAGAGAATCTCTGCGAATTCCTTGCCTATTTTGACGAACATAATGGCATCATAC
AAGCTCCTGGCATCTTCCAATTCGCGAAATTCGGATCTATTGCGAGGTTCTTCTTCTTCTTCTTTCTCTTCTGCTTCTCTTTTAGAATCCCAATTTCTTTCGAATCACCT
GAGGAATAATGACATCCCAACTCGCAATCAAAGCCATAGTCGTAGCTCAATGACCGTCGATGGCTTACTTGGCAATGGGTACGATTCCAATCCCACGGAGTCGTCCATTC
TCTTAGATGCCCAGATTACTCTTGTTGATTCCCATAACCCTTCTTCTCTTCCCATGAATACGACTACAGCAACCACCACCACCACCACCACCACTAATTCCTCTGCCGTC
ATTGATAGCAATCACAATACTTCCTCCGGTGCGGCGCCCAAAACTGTGGATGATGTGTGGAGGGAGATTGTTTCTGGTGAGAGGAAGGAGTTGAAGGAGGAGGTTCCTGA
CGAGTTCATAACCCTCGAGGATTATCTCTTGAGAACTGGGGTCATGCCTGTTGAGGATGTCAAATTGCCGCAGACGGAGAGGCTGAGTGGAGGGATTTTTTCGTTTGATC
CAATTCGAGCCAGCACATTTCAGGCCTTGGATAAGGTCGAAGGATCCATTATTGGATTTGCTAATGGGGTCGATTTGATCGGTAGTGGAGGAAGTGGGGGGAGAGGCAAA
AGAGGGCGAGCTGCTTTGGAACCTTTAGATAAGGCTGCAGAGCAAAGACAGAGGAGGATGATCAAGAATAGGGAGTCTGCAGCAAGATCAAGGGAACGGAAGCAGGCTTA
TCAAGTGGAATTAGAATCATTAGCTGTAAGACTAGAGGAGGAGAAGGAGCGGCTTTTGAGGGAAAAGGCTGAGAGGACTAAAGAGAGGTTCGAGCAGCTAATGGAGAAGG
TGATTCCTGTGGTTGAAAAGCGACGACCCCCACGAGTAATTCGGCGAGTTAATTCCATGAAATGGTGAACTATTGATTTTCCACAAGACAGGTTCAACAGACAAGGAAAC
CTGTAGGAGAAAGAAGAAGAAGAAGCAGAAGAGATAGACGGCATAGTATTAGGAGTTATTGAAAACACATTCTTTGCAGCTGATGATGGCTGAGGAGCATCAGAGCATCA
GAGCATCAGAGGTGTCTGCTGCAAGCAAAAATGCCGGCGATCACAATCCTTTTACCAAACAACAGATTAACAGAGACAAAACTAACACAGGATACATGAGAGAAGATGAT
GATTTATAGGCTACAAAACTTTGAGGTACTCAGCTCTCTGTCTCATTTTCTGTTTTTTGTTCAATGCATGAGAGAGTCAAAGAGGAAGAAGTTGAGTTGAGTAGCAATTT
AAAGCATTTTTTCAATGGTTTTGTTGGATAGAGGAAATTTGTTATAATAGTTGGTAGTGTAGTTTGGTGATATTAGAGGTATATCATTTTTTTTCCCTCTTAATGTTATG
TTTTTTAGTTAAGAAAAGACTATTGGTGACTGTACTTTCATGTAATTAAGAATATGG
Protein sequence
MASYKLLASSNSRNSDLLRGSSSSSFSSASLLESQFLSNHLRNNDIPTRNQSHSRSSMTVDGLLGNGYDSNPTESSILLDAQITLVDSHNPSSLPMNTTTATTTTTTTTN
SSAVIDSNHNTSSGAAPKTVDDVWREIVSGERKELKEEVPDEFITLEDYLLRTGVMPVEDVKLPQTERLSGGIFSFDPIRASTFQALDKVEGSIIGFANGVDLIGSGGSG
GRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEEKERLLREKAERTKERFEQLMEKVIPVVEKRRPPRVIRRVNSMKW
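
The CDS and protein sequences listed for this gene can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code; it is not the method used by CuGenDB, and `translate` is a hypothetical helper. Only the first 30 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, with codons enumerated in TCAG order.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Start and end of the CDS for CmoCh15G005980.
print(translate("ATGGCATCATACAAGCTCCTGGCATCTTCC"))
print(translate("TCCATGAAATGGTGA"))
```

The two fragments translate to the first ten and last four residues of the listed protein (`MASYKLLASS` and `SMKW`), with the final `TGA` read as the stop codon.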