; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G06800 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G06800
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionG-box-binding factor 4-like
Genome locationClcChr08:18292293..18297223
RNA-Seq ExpressionClc08G06800
SyntenyClc08G06800
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR043452 - Plant bZIP transcription factors


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578883.1 G-box-binding factor 4, partial [Cucurbita argyrosperma subsp. sororia]9.4e-12374.25Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMN---
        MAS KL  SSNSRNSDLSRGSSSSS SSASLL+ QF SN  RN D P+RNR H+ +SMTVDGLL NGY+SNPTESSILLDAQITLVDS NPSSLPMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMN---

Query:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVE
        TTTTTTTNSSAVIDSNHN+SSGA  PKTVDDVWREIVSGERKELKEEV DE ITLED+LL++G +PVEDVKLPQTERLSGG+FSFDPIP++TFQALDKVE
Subjt:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVE

Query:  GSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSSY
        GSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREK  +E  K+ F        
Subjt:  GSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSSY

Query:  HLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW
                                                  +LMEKVIPVVEKRRPPRV+RRVNSMKW
Subjt:  HLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW

XP_004140964.1 G-box-binding factor 4 [Cucumis sativus]6.3e-12775.82Show/hide
Query:  MASSKLFPSSNSRNSDLSRG-SSSSSSSSASLLKPQFFSNRARNYDNPSRN-RPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMNT
        MASSKLFPSSNSRNSDLSRG SSSSSSSSASLLKPQF SN +RNYD PSRN  PHT   MTVDGLL N ++SNPTESSILLDAQITLVDSPNPSSL ++T
Subjt:  MASSKLFPSSNSRNSDLSRG-SSSSSSSSASLLKPQFFSNRARNYDNPSRN-RPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMNT

Query:  TTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEG
        TT TTTNSSAVIDSNHNSSS APPPKTVDDVWREIVSGERKELKEEVA+E+ITLEDFL+KSGAVPVEDVK PQTERLSGG+FSFDPIPSTTFQALDK+EG
Subjt:  TTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEG

Query:  SIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSSYH
        SIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENE+LLREK                   
Subjt:  SIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSSYH

Query:  LMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW
                                       +R   R+   +LM+KVIPVVEKRRP RV+RRVNSM+W
Subjt:  LMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW

XP_022992992.1 G-box-binding factor 4-like [Cucurbita maxima]4.0e-12173.78Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMN---
        MAS KL  SSNSRNSDLSRG SSSSSSSASLL  QF SN  RN D P+RNR H+ +SMTVDGLL NGY+SNPTESSILLDAQITLVDS NPSSLPMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMN---

Query:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKV
         TTTTTTTNSSAVIDSNHN+SSGA  PKTVDDVWREIVSGERK+LKEEVADE ITLED+LL++G +PVEDVKLPQTERLSGG+FSFDPIP++TFQALDKV
Subjt:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSS
        EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREK  +E  K+ F       
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSS

Query:  YHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW
                                                   +LMEKVIPVVEKRRPP+V+RRVNSMKW
Subjt:  YHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW

XP_023550818.1 G-box-binding factor 4-like [Cucurbita pepo subsp. pepo]2.7e-12274.32Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMN---
        MAS KL  SSNSRNSDLSRG SSSSSSSASLL+ QF SN  RN D P+RNR H+ +SMTVDGLL NGY+SNPTESSILLDAQITLVDS NPSSLPMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMN---

Query:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKV
         TTTTTTTNSSAVIDSNHN+SSGA  PKTVDDVWREIVSGERKELKEEVADE ITLED+LL++G +PVEDVKLPQTERLSGG+FSFDPIP++TFQALDKV
Subjt:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSS
        EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREK  +E  K+ F       
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSS

Query:  YHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW
                                                   +LMEKVIPVVEKRRPPRV+RRVNSMKW
Subjt:  YHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW

XP_038885445.1 G-box-binding factor 4 [Benincasa hispida]5.2e-12976Show/hide
Query:  MASSKLFPSSNSRNSDLSRG----SSSSSSSSASLLKPQFFSN-RARNYDN-PSRNRPHTLTSMTVDGLLRNGYNS-NPTESSILLDAQITLVDSPNPSS
        MASSKLFPSS SR SDLSRG    SSSSSSSSASLLKPQF SN R+RNYDN PSR+RPHTLTSMTVDGLLRNGY+S NPTESSILLDAQITLVDSPNP+S
Subjt:  MASSKLFPSSNSRNSDLSRG----SSSSSSSSASLLKPQFFSN-RARNYDN-PSRNRPHTLTSMTVDGLLRNGYNS-NPTESSILLDAQITLVDSPNPSS

Query:  LPMNTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQAL
        LPMN+TT TTTNSSAVIDSNH+SSSGAPP KTVDDVWREIVSGERKELKEE+ D ++TLE+FL KSGAVPVEDVKLPQTERLSGG+FSFDPIPSTTFQAL
Subjt:  LPMNTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQAL

Query:  DKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK--KSSEEYKKFFFA
        DKVEGSIIGFANGVDLIGSGGSGGR KRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK  ++ E +K     
Subjt:  DKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK--KSSEEYKKFFFA

Query:  TRQSSYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW
                                                        ++MEKVIPVVEKRRPPRV+RRVNSMKW
Subjt:  TRQSSYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW

TrEMBL top hitse value%identityAlignment
A0A0A0KBB4 BZIP domain-containing protein3.1e-12775.82Show/hide
Query:  MASSKLFPSSNSRNSDLSRG-SSSSSSSSASLLKPQFFSNRARNYDNPSRN-RPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMNT
        MASSKLFPSSNSRNSDLSRG SSSSSSSSASLLKPQF SN +RNYD PSRN  PHT   MTVDGLL N ++SNPTESSILLDAQITLVDSPNPSSL ++T
Subjt:  MASSKLFPSSNSRNSDLSRG-SSSSSSSSASLLKPQFFSNRARNYDNPSRN-RPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMNT

Query:  TTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEG
        TT TTTNSSAVIDSNHNSSS APPPKTVDDVWREIVSGERKELKEEVA+E+ITLEDFL+KSGAVPVEDVK PQTERLSGG+FSFDPIPSTTFQALDK+EG
Subjt:  TTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEG

Query:  SIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSSYH
        SIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENE+LLREK                   
Subjt:  SIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSSYH

Query:  LMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW
                                       +R   R+   +LM+KVIPVVEKRRP RV+RRVNSM+W
Subjt:  LMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW

A0A6J1BXX0 G-box-binding factor 4-like isoform X11.9e-11672.01Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMNTTT
        MASSKL  SSNSRNSDLSRGSSSSSSSS+SLLK QF SNRARN D  +     +L+SMTVDGLLRNGY+SNPTE SILLDAQITLVDSPNPSS PMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMNTTT

Query:  TTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEGSI
         T TNSSAVID+NH +SS A  PKTVDDVWREIVSGERKELKEEVADE+ITLEDFL+K+GAVPVEDVKLPQTERLSGG++SFDPIP T FQALDKVEGSI
Subjt:  TTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEGSI

Query:  IGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK--KSSEEYKKFFFATRQSSYH
        IGF +GVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK  ++ E +K            
Subjt:  IGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK--KSSEEYKKFFFATRQSSYH

Query:  LMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW
                                                 +LMEKVIPVVEKRRPPRV+RRV+SM W
Subjt:  LMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW

A0A6J1FLE3 G-box-binding factor 4-like1.2e-12073.05Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMN---
        MAS KL  SSNSRNSDL RGSSSSS SSASLL+ QF SN  RN D P+RN+ H+ +SMTVDGLL NGY+SNPTESSILLDAQITLVDS NPSSLPMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMN---

Query:  --TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDK
          TTTTTTTNSSAVIDSNHN+SSGA  PKTVDDVWREIVSGERKELKEEV DE ITLED+LL++G +PVEDVKLPQTERLSGG+FSFDPI ++TFQALDK
Subjt:  --TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDK

Query:  VEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQS
        VEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREK  +E  K+ F      
Subjt:  VEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQS

Query:  SYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW
                                                    +LMEKVIPVVEKRRPPRV+RRVNSMKW
Subjt:  SYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW

A0A6J1HH29 G-box-binding factor 4-like isoform X15.2e-11168.83Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYD-NPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMNTT
        MASSKL+ SS SRNSDLSRG SSSSSSSASLL PQF SN +RN D N +RNR H+L+SM +DGL+R+GY+SNPTE SILLDAQITLVDSP PS+ PMN  
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYD-NPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMNTT

Query:  TTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEGS
           TTNSSAVIDS HN+SSGA  PKTVDDVWREIVSGERKELKEEV DE+ITLEDFL+K+GA PVEDVKLPQTERLSGG+FSFD IP ++FQA++KVEGS
Subjt:  TTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEGS

Query:  IIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK--KSSEEYKKFFFATRQSSY
        I+GF +GVDL+GSGGS GRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESL  +LEEENERLLREK  +S E +K           
Subjt:  IIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK--KSSEEYKKFFFATRQSSY

Query:  HLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW
                                                  +LMEKVIPVVEKRRPPR +RRVNSMKW
Subjt:  HLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW

A0A6J1JRH8 G-box-binding factor 4-like1.9e-12173.78Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMN---
        MAS KL  SSNSRNSDLSRG SSSSSSSASLL  QF SN  RN D P+RNR H+ +SMTVDGLL NGY+SNPTESSILLDAQITLVDS NPSSLPMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMN---

Query:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKV
         TTTTTTTNSSAVIDSNHN+SSGA  PKTVDDVWREIVSGERK+LKEEVADE ITLED+LL++G +PVEDVKLPQTERLSGG+FSFDPIP++TFQALDKV
Subjt:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKV

Query:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSS
        EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREK  +E  K+ F       
Subjt:  EGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSS

Query:  YHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW
                                                   +LMEKVIPVVEKRRPP+V+RRVNSMKW
Subjt:  YHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW

SwissProt top hitse value%identityAlignment
P42777 G-box-binding factor 41.2e-3237.99Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRN---GYNSNPTESSILLDAQITLVDSPNPSSLPMN
        MAS KL  SS   NSDLSR +SSS+SSS S+        R+ ++  P+ +  H+  S    G + +     +S P E +I +D  I              
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRN---GYNSNPTESSILLDAQITLVDSPNPSSLPMN

Query:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKE--LKEEVADEMITLEDFLLKS----GAVPVEDVKLPQTERLSG-GVFSFDPIPSTTF
                       + NS +     K+VDDVW+EIVSGE+K   +KEE  ++++TLEDFL K+    GA    DVK+P TERL+  G ++FD  P    
Subjt:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKE--LKEEVADEMITLEDFLLKS----GAVPVEDVKLPQTERLSG-GVFSFDPIPSTTF

Query:  QALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLR--EKKSSEEYKKF
         +   VEGS+            GG   RGKRGR  +E +DKAA QRQ+RMIKNRESAARSRERKQAYQVELE+LA +LEEENE+LL+  E+ + E YK  
Subjt:  QALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLR--EKKSSEEYKKF

Query:  FFATRQSSYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPP-RVMRRVNSMKW
                                                           KLME +IPV EK RPP R + R +S++W
Subjt:  FFATRQSSYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPP-RVMRRVNSMKW

Q0JHF1 bZIP transcription factor 122.7e-1633.62Show/hide
Query:  LTSMTVDGLLRNGYNSNPTESSILL--DAQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITL
        L SM V+ +LR  Y   PT +  L+  D  ++ + +P+ ++ P          + A + +   ++ G  PP         +V+G       E     +TL
Subjt:  LTSMTVDGLLRNGYNSNPTESSILL--DAQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITL

Query:  EDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRE
        EDFL + GAV  ++  +       G V                    ++GF NG ++ G G +GGR ++ R  ++P+D+AA QRQ+RMIKNRESAARSRE
Subjt:  EDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRE

Query:  RKQAYQVELESLAVRLEEENERLLREKKSSEE
        RKQAY  ELESL  +LEEEN ++ +E++   +
Subjt:  RKQAYQVELESLAVRLEEENERLLREKKSSEE

Q6Z312 bZIP transcription factor 232.1e-0830.65Show/hide
Query:  TTTTNSSAVIDSNHNSSSGAPP--------------PKTVDDVWREIV--SGERKELKEEVADE-----------MITLEDFLLKSGAVPVEDVKLP---
        TTTT ++A + +  +++ GAPP               KTVD+VWR+++   G         A+             ITLE+FL+++G V  ED+ +P   
Subjt:  TTTTNSSAVIDSNHNSSSGAPP--------------PKTVDDVWREIV--SGERKELKEEVADE-----------MITLEDFLLKSGAVPVEDVKLP---

Query:  ------------------QTERLSGGVFSFDPIPSTTFQALDKVEGSI----IGFANGVDLIGSGGSGGRGK----------------------RGRAAL
                          QT  L G    F P+          V G++     G A+ V  +    S G GK                      RGR A 
Subjt:  ------------------QTERLSGGVFSFDPIPSTTFQALDKVEGSI----IGFANGVDLIGSGGSGGRGK----------------------RGRAAL

Query:  EPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKK
          ++K  E+RQRRMIKNRESAARSR+RKQAY +ELE+   +L+E N+ L +++    E +K
Subjt:  EPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKK

Q9C5Q2 ABSCISIC ACID-INSENSITIVE 5-like protein 32.9e-1038.01Show/hide
Query:  KTVDDVWREIVSGER-------KELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGG----VFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGS
        KTVD+VWR+I   +           K+    E ITLED LL++G V    V       ++       +   P     F      E          D++  
Subjt:  KTVDDVWREIVSGER-------KELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGG----VFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGS

Query:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKK
        GG     +   R R A E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEENE+L R K+
Subjt:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKK

Q9LES3 ABSCISIC ACID-INSENSITIVE 5-like protein 21.0e-1031.5Show/hide
Query:  SRNRPHTLTSMTVDGLLRNGYNSNPTESSILLD---AQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAP---PPKTVDDVWREIVSGERKEL
        S NR  +L S+T+D +  +  +S     S+ LD     +  V++  PSS+ +N        ++A    +   S   P     KTVD+VW++I   +    
Subjt:  SRNRPHTLTSMTVDGLLRNGYNSNPTESSILLD---AQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAP---PPKTVDDVWREIVSGERKEL

Query:  KEEVADEM-----ITLEDFLLKSGAV-----------PVEDVKLPQTERLSGGVF------------------SFDPIPSTTFQALDKVEGSIIGFANGV
          E  D+      +TLED LLK+G V           PV          L   +                   +F P P +  QA+   + S++G     
Subjt:  KEEVADEM-----ITLEDFLLKSGAV-----------PVEDVKLPQTERLSGGVF------------------SFDPIPSTTFQALDKVEGSIIGFANGV

Query:  DLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKK
           G   +   G++  A+ E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEENERL ++K+
Subjt:  DLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKK

Arabidopsis top hitse value%identityAlignment
AT1G03970.1 G-box binding factor 48.7e-3437.99Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRN---GYNSNPTESSILLDAQITLVDSPNPSSLPMN
        MAS KL  SS   NSDLSR +SSS+SSS S+        R+ ++  P+ +  H+  S    G + +     +S P E +I +D  I              
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRN---GYNSNPTESSILLDAQITLVDSPNPSSLPMN

Query:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKE--LKEEVADEMITLEDFLLKS----GAVPVEDVKLPQTERLSG-GVFSFDPIPSTTF
                       + NS +     K+VDDVW+EIVSGE+K   +KEE  ++++TLEDFL K+    GA    DVK+P TERL+  G ++FD  P    
Subjt:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKE--LKEEVADEMITLEDFLLKS----GAVPVEDVKLPQTERLSG-GVFSFDPIPSTTF

Query:  QALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLR--EKKSSEEYKKF
         +   VEGS+            GG   RGKRGR  +E +DKAA QRQ+RMIKNRESAARSRERKQAYQVELE+LA +LEEENE+LL+  E+ + E YK  
Subjt:  QALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLR--EKKSSEEYKKF

Query:  FFATRQSSYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPP-RVMRRVNSMKW
                                                           KLME +IPV EK RPP R + R +S++W
Subjt:  FFATRQSSYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVEKRRPP-RVMRRVNSMKW

AT2G41070.1 Basic-leucine zipper (bZIP) transcription factor family protein2.1e-1138.01Show/hide
Query:  KTVDDVWREIVSGER-------KELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGG----VFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGS
        KTVD+VWR+I   +           K+    E ITLED LL++G V    V       ++       +   P     F      E          D++  
Subjt:  KTVDDVWREIVSGER-------KELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGG----VFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGS

Query:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKK
        GG     +   R R A E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEENE+L R K+
Subjt:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKK

AT2G41070.3 Basic-leucine zipper (bZIP) transcription factor family protein2.1e-1138.01Show/hide
Query:  KTVDDVWREIVSGER-------KELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGG----VFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGS
        KTVD+VWR+I   +           K+    E ITLED LL++G V    V       ++       +   P     F      E          D++  
Subjt:  KTVDDVWREIVSGER-------KELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGG----VFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGS

Query:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKK
        GG     +   R R A E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEENE+L R K+
Subjt:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKK

AT3G56850.1 ABA-responsive element binding protein 37.2e-1231.5Show/hide
Query:  SRNRPHTLTSMTVDGLLRNGYNSNPTESSILLD---AQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAP---PPKTVDDVWREIVSGERKEL
        S NR  +L S+T+D +  +  +S     S+ LD     +  V++  PSS+ +N        ++A    +   S   P     KTVD+VW++I   +    
Subjt:  SRNRPHTLTSMTVDGLLRNGYNSNPTESSILLD---AQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAP---PPKTVDDVWREIVSGERKEL

Query:  KEEVADEM-----ITLEDFLLKSGAV-----------PVEDVKLPQTERLSGGVF------------------SFDPIPSTTFQALDKVEGSIIGFANGV
          E  D+      +TLED LLK+G V           PV          L   +                   +F P P +  QA+   + S++G     
Subjt:  KEEVADEM-----ITLEDFLLKSGAV-----------PVEDVKLPQTERLSGGVF------------------SFDPIPSTTFQALDKVEGSIIGFANGV

Query:  DLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKK
           G   +   G++  A+ E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEENERL ++K+
Subjt:  DLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKK

AT5G44080.1 Basic-leucine zipper (bZIP) transcription factor family protein1.2e-5144.13Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRP-HTLTSMTVDGLLRNGYNSN---PTESSILLDAQITLVDSPNPSSLPM
        M S ++  SSNSRNSDLSR  SS+S+SS+S+   Q F     +     RN   ++  SMTV+G+L + + S+   PTESS LLDA I L+D+   S  PM
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRP-HTLTSMTVDGLLRNGYNSN---PTESSILLDAQITLVDSPNPSSLPM

Query:  NTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSG-----AVPVE----DVKLPQTERLSGGVFSFDPIPS
          TTTT   +S V+D    + +     K+VD++WRE+VSGE K +KEE ++E++TLEDFL K+      AV       DVK+P T       F     P 
Subjt:  NTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSG-----AVPVE----DVKLPQTERLSGGVFSFDPIPS

Query:  TTFQALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLR--EKKSSEEY
          FQ +DKVEGSI+ F NG+D+    G G RGKR R  +EPLDKAA QRQRRMIKNRESAARSRERKQAYQVELE+LA +LEEENE L +  E K  E Y
Subjt:  TTFQALDKVEGSIIGFANGVDLIGSGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLR--EKKSSEEY

Query:  KKFFFATRQSSYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVE--KRRPPRVMRRVNSMKW
        +                                                     KLME VIPVVE  K++PPR +RR+ S++W
Subjt:  KKFFFATRQSSYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLKRGSSRVTKLKLMEKVIPVVE--KRRPPRVMRRVNSMKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCATCCAAGCTCTTCCCATCTTCCAATTCGCGAAATTCGGATCTATCTCGAGGTTCTTCTTCTTCTTCTTCCTCTTCTGCTTCTCTTTTAAAACCCCAATTTTT
CTCCAATCGCGCCAGAAATTATGACAACCCATCTCGCAATCGACCCCATACTCTTACATCAATGACCGTCGACGGCTTACTTCGTAATGGCTACAATTCCAATCCCACGG
AATCGTCTATTCTCTTAGATGCACAGATTACTCTTGTTGATTCCCCTAACCCCTCTTCGCTTCCCATGAATACCACCACCACTACCACCACTAATTCCTCTGCTGTCATT
GATAGTAACCATAATTCTTCCTCTGGTGCGCCGCCGCCCAAAACTGTGGATGATGTGTGGAGGGAGATTGTTTCTGGTGAGAGGAAGGAATTGAAGGAGGAGGTTGCTGA
CGAGATGATAACCCTCGAGGATTTTCTTTTGAAATCTGGAGCTGTGCCTGTTGAGGATGTTAAATTGCCGCAGACAGAGAGGCTGAGTGGAGGGGTTTTTTCGTTTGATC
CAATTCCTTCCACTACATTTCAGGCATTGGATAAGGTTGAAGGATCCATTATTGGGTTTGCTAATGGGGTTGATTTGATTGGAAGTGGTGGAAGTGGGGGCAGAGGTAAA
AGAGGGCGAGCTGCTTTGGAACCTTTAGATAAGGCTGCAGAGCAAAGACAGAGGAGGATGATCAAGAACAGGGAGTCTGCAGCAAGGTCTAGGGAACGGAAGCAGGCTTA
TCAAGTGGAATTAGAGTCGTTAGCTGTAAGATTAGAGGAAGAGAATGAGCGGCTTTTGAGGGAAAAGAAGAGTTCTGAAGAATACAAAAAGTTCTTTTTTGCTACTAGAC
AAAGTTCTTATCATCTAATGGAAGCGAAACTGGAGGATATTTTGGAAATTAACAAAATATTCTTATATGGGATTTTCCATTTATATAGTGTAGGGCTGAGAGGACTAAAG
AGAGGTTCAAGCAGGGTGACTAAGCTTAAATTAATGGAAAAGGTGATTCCTGTTGTTGAAAAGCGACGACCCCCACGAGTCATGCGGCGAGTTAATTCAATGAAATGGTG
A
mRNA sequenceShow/hide mRNA sequence
TTCACCGGCAACCGGCTAACCCTGGCTCCATCTTCCTCCCTCACTCTCTAAATCTTCTCACGCGCTCTCACTCGCCTTCTTTCTTTCCCTCTGCCCCTAACAATACTCAA
CTCCCAGTTCTCACCCACCAACGGAAGAAAACCACAAAACCTCGCCGCTCTCATACAAATTCTTTTGTATCACTATAAAGATTCCCACTCCAGTTTCGAGAGAATCTCTG
CGAATTCCTTGCCTATTTTGTCGATCAATATGGCATCATCCAAGCTCTTCCCATCTTCCAATTCGCGAAATTCGGATCTATCTCGAGGTTCTTCTTCTTCTTCTTCCTCT
TCTGCTTCTCTTTTAAAACCCCAATTTTTCTCCAATCGCGCCAGAAATTATGACAACCCATCTCGCAATCGACCCCATACTCTTACATCAATGACCGTCGACGGCTTACT
TCGTAATGGCTACAATTCCAATCCCACGGAATCGTCTATTCTCTTAGATGCACAGATTACTCTTGTTGATTCCCCTAACCCCTCTTCGCTTCCCATGAATACCACCACCA
CTACCACCACTAATTCCTCTGCTGTCATTGATAGTAACCATAATTCTTCCTCTGGTGCGCCGCCGCCCAAAACTGTGGATGATGTGTGGAGGGAGATTGTTTCTGGTGAG
AGGAAGGAATTGAAGGAGGAGGTTGCTGACGAGATGATAACCCTCGAGGATTTTCTTTTGAAATCTGGAGCTGTGCCTGTTGAGGATGTTAAATTGCCGCAGACAGAGAG
GCTGAGTGGAGGGGTTTTTTCGTTTGATCCAATTCCTTCCACTACATTTCAGGCATTGGATAAGGTTGAAGGATCCATTATTGGGTTTGCTAATGGGGTTGATTTGATTG
GAAGTGGTGGAAGTGGGGGCAGAGGTAAAAGAGGGCGAGCTGCTTTGGAACCTTTAGATAAGGCTGCAGAGCAAAGACAGAGGAGGATGATCAAGAACAGGGAGTCTGCA
GCAAGGTCTAGGGAACGGAAGCAGGCTTATCAAGTGGAATTAGAGTCGTTAGCTGTAAGATTAGAGGAAGAGAATGAGCGGCTTTTGAGGGAAAAGAAGAGTTCTGAAGA
ATACAAAAAGTTCTTTTTTGCTACTAGACAAAGTTCTTATCATCTAATGGAAGCGAAACTGGAGGATATTTTGGAAATTAACAAAATATTCTTATATGGGATTTTCCATT
TATATAGTGTAGGGCTGAGAGGACTAAAGAGAGGTTCAAGCAGGGTGACTAAGCTTAAATTAATGGAAAAGGTGATTCCTGTTGTTGAAAAGCGACGACCCCCACGAGTC
ATGCGGCGAGTTAATTCAATGAAATGGTGAAGTACTCATCTTCCATGAGACGGGTAGTGGTAGTGGCAAGAAGGAACAGAAGAGAGAGATAGGCAATAGTATCTTTACAG
TTACTTAAAAACATATTCTTTGCTACTGATGGTTGAGGAGTCATCAGAGGGGTCTGCAATGAAAAGTGCTGACAATCACAATCCTTTAATCAAAGAACTGATTGGCAGAG
ACAAAACAACAGCAACAGAATACATGAAAGCAGATGGTTTATAGGTGTTATTAAAACAAAGTGGATCAGAGCAAATAAAAGGTAGAAACTTGAGGTGCTCAACTAGTCTC
TGTCTCACATTTTTTCTTTTTTTCTAAATGCCTGAGAGAGTAAATAGAGGAAGAAGTTGAGCTGAGTTTAGCATTTTTATGGATAGTGGAAATATAACAACATATGGCAA
AACTATTCCTTGTTTTGTTATAATACATACTTGCTAGAATAGTGTGGTGGTAATACAGGTATATAATTTATTATTTCTATGTTTCCTTGAAGTTCTTACGTTTTCTAGGT
GCTACTAGGGATGTACCGAAGTATTTTTAAGTTGGAAATTATTTTTGGTAATTTTTATGTA
Protein sequenceShow/hide protein sequence
MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRARNYDNPSRNRPHTLTSMTVDGLLRNGYNSNPTESSILLDAQITLVDSPNPSSLPMNTTTTTTTNSSAVI
DSNHNSSSGAPPPKTVDDVWREIVSGERKELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTERLSGGVFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGSGGSGGRGK
RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKKSSEEYKKFFFATRQSSYHLMEAKLEDILEINKIFLYGIFHLYSVGLRGLK
RGSSRVTKLKLMEKVIPVVEKRRPPRVMRRVNSMKW