; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G005590 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G005590
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionG-box-binding factor 4-like
Genome locationchr08:13775194..13779985
RNA-Seq ExpressionLsi08G005590
SyntenyLsi08G005590
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR043452 - Plant bZIP transcription factors


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578883.1 G-box-binding factor 4, partial [Cucurbita argyrosperma subsp. sororia]3.7e-12680.18Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMN---
        MAS KL  SSNSRNSDLSRGSSSSS SSASLL+ QF SN  RN D P+RNR H+ +SMTVDGLL N YDSNPTESSILLDAQITLVDS NPSSLPMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMN---

Query:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVE
        TTTTTTTNSSAVIDSNHN+SSGA  PKTVDDVWREIVSGER+ELKEEV DE ITLED+LL++G +PVEDVKLPQT+RLSGGIFSFDPIP++TFQALDKVE
Subjt:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVE

Query:  GSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSS
        GSIIGFANGVDLIG GGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREK +   ++           
Subjt:  GSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSS

Query:  SFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
                  E LMEKVIPVVEKRRPPRVIRRVNSMKW
Subjt:  SFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

XP_004140964.1 G-box-binding factor 4 [Cucumis sativus]3.0e-13183.38Show/hide
Query:  MASSKLFPSSNSRNSDLSRG-SSSSSSSSASLLKPQFFSNRSRNYDNPSRN-RPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMNT
        MASSKLFPSSNSRNSDLSRG SSSSSSSSASLLKPQF SN SRNYD PSRN  PHT   MTVDGLL NA+DSNPTESSILLDAQITLVDSPNPSSL ++T
Subjt:  MASSKLFPSSNSRNSDLSRG-SSSSSSSSASLLKPQFFSNRSRNYDNPSRN-RPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMNT

Query:  TTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEG
        TT TTTNSSAVIDSNHNSSS APPPKTVDDVWREIVSGER+ELKEEVA+E+ITLEDFL+KSGAVPVEDVK PQT+RLSGGIFSFDPIPSTTFQALDK+EG
Subjt:  TTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEG

Query:  SIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSS
        SIIGFANGVDLIG GGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENE+LLREK +   ++     LK+    
Subjt:  SIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSS

Query:  FLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
                   LM+KVIPVVEKRRP RVIRRVNSM+W
Subjt:  FLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

XP_022992992.1 G-box-binding factor 4-like [Cucurbita maxima]2.1e-12479.65Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMN---
        MAS KL  SSNSRNSDLSRG SSSSSSSASLL  QF SN  RN D P+RNR H+ +SMTVDGLL N YDSNPTESSILLDAQITLVDS NPSSLPMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMN---

Query:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKV
         TTTTTTTNSSAVIDSNHN+SSGA  PKTVDDVWREIVSGER++LKEEVADE ITLED+LL++G +PVEDVKLPQT+RLSGGIFSFDPIP++TFQALDKV
Subjt:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKV

Query:  EGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGS
        EGSIIGFANGVDLIG GGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREK +   ++          
Subjt:  EGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGS

Query:  SSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
                   E LMEKVIPVVEKRRPP+VIRRVNSMKW
Subjt:  SSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

XP_023550818.1 G-box-binding factor 4-like [Cucurbita pepo subsp. pepo]1.4e-12580.24Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMN---
        MAS KL  SSNSRNSDLSRG SSSSSSSASLL+ QF SN  RN D P+RNR H+ +SMTVDGLL N YDSNPTESSILLDAQITLVDS NPSSLPMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMN---

Query:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKV
         TTTTTTTNSSAVIDSNHN+SSGA  PKTVDDVWREIVSGER+ELKEEVADE ITLED+LL++G +PVEDVKLPQT+RLSGGIFSFDPIP++TFQALDKV
Subjt:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKV

Query:  EGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGS
        EGSIIGFANGVDLIG GGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREK +   ++          
Subjt:  EGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGS

Query:  SSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
                   E LMEKVIPVVEKRRPPRVIRRVNSMKW
Subjt:  SSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

XP_038885445.1 G-box-binding factor 4 [Benincasa hispida]3.5e-13282.75Show/hide
Query:  MASSKLFPSSNSRNSDLSRG----SSSSSSSSASLLKPQFFSN-RSRNYDN-PSRNRPHTLTSMTVDGLLRNAYDS-NPTESSILLDAQITLVDSPNPSS
        MASSKLFPSS SR SDLSRG    SSSSSSSSASLLKPQF SN RSRNYDN PSR+RPHTLTSMTVDGLLRN YDS NPTESSILLDAQITLVDSPNP+S
Subjt:  MASSKLFPSSNSRNSDLSRG----SSSSSSSSASLLKPQFFSN-RSRNYDN-PSRNRPHTLTSMTVDGLLRNAYDS-NPTESSILLDAQITLVDSPNPSS

Query:  LPMNTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQAL
        LPMN+TT TTTNSSAVIDSNH+SSSGAPP KTVDDVWREIVSGER+ELKEE+ D ++TLE+FL KSGAVPVEDVKLPQT+RLSGGIFSFDPIPSTTFQAL
Subjt:  LPMNTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQAL

Query:  DKVEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLK
        DKVEGSIIGFANGVDLIG GGSGGR KRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK +   ++ +     
Subjt:  DKVEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLK

Query:  RGSSSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
                        +MEKVIPVVEKRRPPRVIRRVNSMKW
Subjt:  RGSSSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

TrEMBL top hitse value%identityAlignment
A0A0A0KBB4 BZIP domain-containing protein1.4e-13183.38Show/hide
Query:  MASSKLFPSSNSRNSDLSRG-SSSSSSSSASLLKPQFFSNRSRNYDNPSRN-RPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMNT
        MASSKLFPSSNSRNSDLSRG SSSSSSSSASLLKPQF SN SRNYD PSRN  PHT   MTVDGLL NA+DSNPTESSILLDAQITLVDSPNPSSL ++T
Subjt:  MASSKLFPSSNSRNSDLSRG-SSSSSSSSASLLKPQFFSNRSRNYDNPSRN-RPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMNT

Query:  TTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEG
        TT TTTNSSAVIDSNHNSSS APPPKTVDDVWREIVSGER+ELKEEVA+E+ITLEDFL+KSGAVPVEDVK PQT+RLSGGIFSFDPIPSTTFQALDK+EG
Subjt:  TTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEG

Query:  SIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSS
        SIIGFANGVDLIG GGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENE+LLREK +   ++     LK+    
Subjt:  SIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSS

Query:  FLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
                   LM+KVIPVVEKRRP RVIRRVNSM+W
Subjt:  FLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

A0A6J1BXX0 G-box-binding factor 4-like isoform X16.2e-11977.91Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMNTTT
        MASSKL  SSNSRNSDLSRGSSSSSSSS+SLLK QF SNR+RN D  +     +L+SMTVDGLLRN YDSNPTE SILLDAQITLVDSPNPSS PMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMNTTT

Query:  TTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEGSI
         T TNSSAVID+NH +SS A  PKTVDDVWREIVSGER+ELKEEVADE+ITLEDFL+K+GAVPVEDVKLPQT+RLSGGI+SFDPIP T FQALDKVEGSI
Subjt:  TTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEGSI

Query:  IGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSFL
        IGF +GVDLIG GGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK +   ++ +            
Subjt:  IGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSFL

Query:  HSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
                 LMEKVIPVVEKRRPPRVIRRV+SM W
Subjt:  HSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

A0A6J1FLE3 G-box-binding factor 4-like4.9e-12478.82Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMN---
        MAS KL  SSNSRNSDL RGSSSSS SSASLL+ QF SN  RN D P+RN+ H+ +SMTVDGLL N YDSNPTESSILLDAQITLVDS NPSSLPMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMN---

Query:  --TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDK
          TTTTTTTNSSAVIDSNHN+SSGA  PKTVDDVWREIVSGER+ELKEEV DE ITLED+LL++G +PVEDVKLPQT+RLSGGIFSFDPI ++TFQALDK
Subjt:  --TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDK

Query:  VEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRG
        VEGSIIGFANGVDLIG GGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREK +   ++         
Subjt:  VEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRG

Query:  SSSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
                    E LMEKVIPVVEKRRPPRVIRRVNSMKW
Subjt:  SSSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

A0A6J1HH29 G-box-binding factor 4-like isoform X16.0e-11474.7Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYD-NPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMNTT
        MASSKL+ SS SRNSDLSRG SSSSSSSASLL PQF SN SRN D N +RNR H+L+SM +DGL+R+ YDSNPTE SILLDAQITLVDSP PS+ PMN  
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYD-NPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMNTT

Query:  TTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEGS
           TTNSSAVIDS HN+SSGA  PKTVDDVWREIVSGER+ELKEEV DE+ITLEDFL+K+GA PVEDVKLPQT+RLSGGIFSFD IP ++FQA++KVEGS
Subjt:  TTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEGS

Query:  IIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSF
        I+GF +GVDL+G GGS GRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESL  +LEEENERLLREK + + ++             
Subjt:  IIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSF

Query:  LHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
                + LMEKVIPVVEKRRPPR IRRVNSMKW
Subjt:  LHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

A0A6J1JRH8 G-box-binding factor 4-like9.9e-12579.65Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMN---
        MAS KL  SSNSRNSDLSRG SSSSSSSASLL  QF SN  RN D P+RNR H+ +SMTVDGLL N YDSNPTESSILLDAQITLVDS NPSSLPMN   
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMN---

Query:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKV
         TTTTTTTNSSAVIDSNHN+SSGA  PKTVDDVWREIVSGER++LKEEVADE ITLED+LL++G +PVEDVKLPQT+RLSGGIFSFDPIP++TFQALDKV
Subjt:  -TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKV

Query:  EGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGS
        EGSIIGFANGVDLIG GGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEE ERLLREK +   ++          
Subjt:  EGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGS

Query:  SSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
                   E LMEKVIPVVEKRRPP+VIRRVNSMKW
Subjt:  SSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

SwissProt top hitse value%identityAlignment
P42777 G-box-binding factor 44.4e-3740.29Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRN---AYDSNPTESSILLDAQITLVDSPNPSSLPMN
        MAS KL  SS   NSDLSR +SSS+SSS S+        RS ++  P+ +  H+  S    G + +   A DS P E +I +D  I              
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRN---AYDSNPTESSILLDAQITLVDSPNPSSLPMN

Query:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRE--LKEEVADEMITLEDFLLKS----GAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQ
                       + NS +     K+VDDVW+EIVSGE++   +KEE  ++++TLEDFL K+    GA    DVK+P  +  + G ++FD  P     
Subjt:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRE--LKEEVADEMITLEDFLLKS----GAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQ

Query:  ALDKVEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRG
        +   VEGS            +GG   RGKRGR  +E +DKAA QRQ+RMIKNRESAARSRERKQAYQVELE+LA +LEEENE+LL+E  +S  ++ +   
Subjt:  ALDKVEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRG

Query:  LKRGSSSFLHSFLLIKEILMEKVIPVVEKRRPP-RVIRRVNSMKW
                          LME +IPV EK RPP R + R +S++W
Subjt:  LKRGSSSFLHSFLLIKEILMEKVIPVVEKRRPP-RVIRRVNSMKW

Q0JHF1 bZIP transcription factor 121.7e-2032.51Show/hide
Query:  LTSMTVDGLLRNAYDSNPTESSILL--DAQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITL
        L SM V+ +LR  Y   PT +  L+  D  ++ + +P+ ++ P          + A + +   ++ G  PP         +V+G       E     +TL
Subjt:  LTSMTVDGLLRNAYDSNPTESSILL--DAQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITL

Query:  EDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRE
        EDFL + GAV  ++  +       G +                    ++GF NG ++ G G +GGR ++ R  ++P+D+AA QRQ+RMIKNRESAARSRE
Subjt:  EDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRE

Query:  RKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW
        RKQAY  ELESL  +LEEEN ++ +E+ +    + RL+ LK                  E V+PV+ ++   R +RR NSM+W
Subjt:  RKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRVNSMKW

Q6Z312 bZIP transcription factor 231.1e-0830.37Show/hide
Query:  TTTTNSSAVIDSNHNSSSGAPP--------------PKTVDDVWREIV--SGERRELKEEVADE-----------MITLEDFLLKSGAVPVEDVKLP---
        TTTT ++A + +  +++ GAPP               KTVD+VWR+++   G         A+             ITLE+FL+++G V  ED+ +P   
Subjt:  TTTTNSSAVIDSNHNSSSGAPP--------------PKTVDDVWREIV--SGERRELKEEVADE-----------MITLEDFLLKSGAVPVEDVKLP---

Query:  ------------------QTKRLSGGIFSFDPIPSTTFQALDKVEGSI---------------------IGFANGVDLIGIGGSG-----GRGKRGRAAL
                          QT  L G    F P+          V G++                      G   G DL  +  S        G RGR A 
Subjt:  ------------------QTKRLSGGIFSFDPIPSTTFQALDKVEGSI---------------------IGFANGVDLIGIGGSG-----GRGKRGRAAL

Query:  EPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGS
          ++K  E+RQRRMIKNRESAARSR+RKQAY +ELE+   +L+E N+  L++K    L++ +   L+R S
Subjt:  EPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGS

Q9C5Q2 ABSCISIC ACID-INSENSITIVE 5-like protein 37.1e-1138.24Show/hide
Query:  KTVDDVWREIVSGER-------RELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGG----IFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGI
        KTVD+VWR+I   +           K+    E ITLED LL++G V    V       ++       +   P     F      E          D++ +
Subjt:  KTVDDVWREIVSGER-------RELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGG----IFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGI

Query:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK
        GG     +   R R A E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEENE+L R K
Subjt:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK

Q9LES3 ABSCISIC ACID-INSENSITIVE 5-like protein 22.9e-1230.37Show/hide
Query:  SRNRPHTLTSMTVDGLLRNAYDSNPTESSILLD---AQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAP---PPKTVDDVWREIV------S
        S NR  +L S+T+D +  +   S     S+ LD     +  V++  PSS+ +N        ++A    +   S   P     KTVD+VW++I       S
Subjt:  SRNRPHTLTSMTVDGLLRNAYDSNPTESSILLD---AQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAP---PPKTVDDVWREIV------S

Query:  GERRELKEEVADEMITLEDFLLKSGAV-----------PVEDVKLPQTKRLSGGIF------------------SFDPIPSTTFQALDKVEGSIIGFANG
           R  K+    EM TLED LLK+G V           PV          L   I                   +F P P +  QA+   + S++G    
Subjt:  GERRELKEEVADEMITLEDFLLKSGAV-----------PVEDVKLPQTKRLSGGIF------------------SFDPIPSTTFQALDKVEGSIIGFANG

Query:  VDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSFLHSFLLI
            G+  +   G++  A+ E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEENERL ++K                           
Subjt:  VDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSFLHSFLLI

Query:  KEILMEKVIPVVEKRRPPRVIRRVNS
            +EK++P V    P R +RR +S
Subjt:  KEILMEKVIPVVEKRRPPRVIRRVNS

Arabidopsis top hitse value%identityAlignment
AT1G03970.1 G-box binding factor 43.1e-3840.29Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRN---AYDSNPTESSILLDAQITLVDSPNPSSLPMN
        MAS KL  SS   NSDLSR +SSS+SSS S+        RS ++  P+ +  H+  S    G + +   A DS P E +I +D  I              
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRN---AYDSNPTESSILLDAQITLVDSPNPSSLPMN

Query:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRE--LKEEVADEMITLEDFLLKS----GAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQ
                       + NS +     K+VDDVW+EIVSGE++   +KEE  ++++TLEDFL K+    GA    DVK+P  +  + G ++FD  P     
Subjt:  TTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRE--LKEEVADEMITLEDFLLKS----GAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQ

Query:  ALDKVEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRG
        +   VEGS            +GG   RGKRGR  +E +DKAA QRQ+RMIKNRESAARSRERKQAYQVELE+LA +LEEENE+LL+E  +S  ++ +   
Subjt:  ALDKVEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRG

Query:  LKRGSSSFLHSFLLIKEILMEKVIPVVEKRRPP-RVIRRVNSMKW
                          LME +IPV EK RPP R + R +S++W
Subjt:  LKRGSSSFLHSFLLIKEILMEKVIPVVEKRRPP-RVIRRVNSMKW

AT2G41070.1 Basic-leucine zipper (bZIP) transcription factor family protein5.0e-1238.24Show/hide
Query:  KTVDDVWREIVSGER-------RELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGG----IFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGI
        KTVD+VWR+I   +           K+    E ITLED LL++G V    V       ++       +   P     F      E          D++ +
Subjt:  KTVDDVWREIVSGER-------RELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGG----IFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGI

Query:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK
        GG     +   R R A E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEENE+L R K
Subjt:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK

AT2G41070.3 Basic-leucine zipper (bZIP) transcription factor family protein5.0e-1238.24Show/hide
Query:  KTVDDVWREIVSGER-------RELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGG----IFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGI
        KTVD+VWR+I   +           K+    E ITLED LL++G V    V       ++       +   P     F      E          D++ +
Subjt:  KTVDDVWREIVSGER-------RELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGG----IFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGI

Query:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK
        GG     +   R R A E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEENE+L R K
Subjt:  GGSGGRGK---RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREK

AT3G56850.1 ABA-responsive element binding protein 32.0e-1330.37Show/hide
Query:  SRNRPHTLTSMTVDGLLRNAYDSNPTESSILLD---AQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAP---PPKTVDDVWREIV------S
        S NR  +L S+T+D +  +   S     S+ LD     +  V++  PSS+ +N        ++A    +   S   P     KTVD+VW++I       S
Subjt:  SRNRPHTLTSMTVDGLLRNAYDSNPTESSILLD---AQITLVDSPNPSSLPMNTTTTTTTNSSAVIDSNHNSSSGAP---PPKTVDDVWREIV------S

Query:  GERRELKEEVADEMITLEDFLLKSGAV-----------PVEDVKLPQTKRLSGGIF------------------SFDPIPSTTFQALDKVEGSIIGFANG
           R  K+    EM TLED LLK+G V           PV          L   I                   +F P P +  QA+   + S++G    
Subjt:  GERRELKEEVADEMITLEDFLLKSGAV-----------PVEDVKLPQTKRLSGGIF------------------SFDPIPSTTFQALDKVEGSIIGFANG

Query:  VDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSFLHSFLLI
            G+  +   G++  A+ E ++K  E+RQ+RMIKNRESAARSR RKQAY  ELE    RLEEENERL ++K                           
Subjt:  VDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSFLHSFLLI

Query:  KEILMEKVIPVVEKRRPPRVIRRVNS
            +EK++P V    P R +RR +S
Subjt:  KEILMEKVIPVVEKRRPPRVIRRVNS

AT5G44080.1 Basic-leucine zipper (bZIP) transcription factor family protein1.8e-5448.01Show/hide
Query:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRP-HTLTSMTVDGLLRNAYDSN---PTESSILLDAQITLVDSPNPSSLPM
        M S ++  SSNSRNSDLSR  SS+S+SS+S+   Q F     +     RN   ++  SMTV+G+L + + S+   PTESS LLDA I L+D+   S  PM
Subjt:  MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRP-HTLTSMTVDGLLRNAYDSN---PTESSILLDAQITLVDSPNPSSLPM

Query:  NTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSG-----AVPVE----DVKLPQTKRLSGGIFSFD--PI
          TTTT   +S V+D    + +     K+VD++WRE+VSGE + +KEE ++E++TLEDFL K+      AV       DVK+P T       + FD    
Subjt:  NTTTTTTTNSSAVIDSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSG-----AVPVE----DVKLPQTKRLSGGIFSFD--PI

Query:  PSTTFQALDKVEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALD
        P   FQ +DKVEGSI+ F NG+D   + G G RGKR R  +EPLDKAA QRQRRMIKNRESAARSRERKQAYQVELE+LA +LEEENE L +E      D
Subjt:  PSTTFQALDKVEGSIIGFANGVDLIGIGGSGGRGKRGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALD

Query:  KCRLRGLKRGSSSFLHSFLLIKEILMEKVIPVVE--KRRPPRVIRRVNSMKW
        K + R  K                LME VIPVVE  K++PPR +RR+ S++W
Subjt:  KCRLRGLKRGSSSFLHSFLLIKEILMEKVIPVVE--KRRPPRVIRRVNSMKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCATCCAAGCTCTTCCCATCTTCCAATTCGCGTAATTCGGATCTATCTCGAGGTTCTTCTTCTTCTTCTTCCTCTTCTGCTTCTCTTTTAAAACCCCAATTTTT
CTCCAATCGCTCGAGGAATTATGACAACCCATCTCGCAATCGACCCCATACTCTTACCTCAATGACCGTCGATGGCTTACTTCGTAATGCCTACGATTCCAATCCCACGG
AATCGTCTATTCTTTTAGATGCACAGATTACTCTTGTTGATTCCCCTAACCCCTCTTCTCTTCCCATGAATACCACCACCACTACCACCACTAATTCCTCTGCTGTCATT
GATAGTAATCACAATTCTTCCTCCGGTGCGCCACCCCCCAAAACTGTGGATGATGTGTGGAGGGAGATTGTTTCTGGCGAGAGGAGGGAATTGAAGGAGGAGGTTGCTGA
CGAGATGATAACCCTCGAGGATTTTCTTTTGAAATCTGGAGCTGTGCCTGTTGAGGATGTTAAATTGCCGCAGACAAAGAGGCTGAGTGGAGGGATTTTTTCGTTTGATC
CAATTCCTTCCACCACATTTCAGGCATTGGACAAGGTTGAAGGATCCATTATTGGGTTTGCTAATGGGGTTGATTTGATTGGAATTGGTGGAAGTGGGGGGAGAGGCAAA
AGAGGGCGAGCTGCTTTGGAACCTTTAGATAAGGCTGCAGAGCAAAGACAGAGGAGGATGATCAAGAACAGGGAGTCTGCAGCAAGGTCTAGGGAACGGAAGCAGGCTTA
TCAAGTGGAATTAGAGTCGTTAGCTGTAAGATTAGAGGAAGAGAATGAGCGGCTTTTGAGGGAAAAGTATAAATCTGCTTTAGACAAATGTAGGCTGAGAGGACTAAAGA
GAGGTTCAAGCAGCTTCTTACATTCATTTTTGTTGATTAAAGAAATCCTAATGGAAAAGGTGATTCCTGTGGTTGAAAAGCGACGACCCCCACGAGTCATTCGACGAGTT
AATTCCATGAAATGGTGA
mRNA sequenceShow/hide mRNA sequence
AGCCGTAGTTTCCCCACCCTAAGAGATAGAGACCTTCTAGAAACTTAAAAAAAAAAAACCCGAATATATTCACTGGCAACCGGCTAATCCCGGTTCCATCTTCCTCCCTC
ACACTCTGCAACTTCTCACGCGCTTTCACTCGCCGTCTTTCTTTCTCTCTGCCCCAATACTCAGCTCCCAGTTCTCACCCACCAACAGAAAAACCACAAAACCTCGCCGT
TCTCAAATCCCAACTCATACAACTTCTTTTTGCATCACTAGAAAGATTCACACTCGAGTTTCGAGAGAATCTCTGCGAATTCCTTGCCTACTTTGTCGATCAATATGGCA
TCATCCAAGCTCTTCCCATCTTCCAATTCGCGTAATTCGGATCTATCTCGAGGTTCTTCTTCTTCTTCTTCCTCTTCTGCTTCTCTTTTAAAACCCCAATTTTTCTCCAA
TCGCTCGAGGAATTATGACAACCCATCTCGCAATCGACCCCATACTCTTACCTCAATGACCGTCGATGGCTTACTTCGTAATGCCTACGATTCCAATCCCACGGAATCGT
CTATTCTTTTAGATGCACAGATTACTCTTGTTGATTCCCCTAACCCCTCTTCTCTTCCCATGAATACCACCACCACTACCACCACTAATTCCTCTGCTGTCATTGATAGT
AATCACAATTCTTCCTCCGGTGCGCCACCCCCCAAAACTGTGGATGATGTGTGGAGGGAGATTGTTTCTGGCGAGAGGAGGGAATTGAAGGAGGAGGTTGCTGACGAGAT
GATAACCCTCGAGGATTTTCTTTTGAAATCTGGAGCTGTGCCTGTTGAGGATGTTAAATTGCCGCAGACAAAGAGGCTGAGTGGAGGGATTTTTTCGTTTGATCCAATTC
CTTCCACCACATTTCAGGCATTGGACAAGGTTGAAGGATCCATTATTGGGTTTGCTAATGGGGTTGATTTGATTGGAATTGGTGGAAGTGGGGGGAGAGGCAAAAGAGGG
CGAGCTGCTTTGGAACCTTTAGATAAGGCTGCAGAGCAAAGACAGAGGAGGATGATCAAGAACAGGGAGTCTGCAGCAAGGTCTAGGGAACGGAAGCAGGCTTATCAAGT
GGAATTAGAGTCGTTAGCTGTAAGATTAGAGGAAGAGAATGAGCGGCTTTTGAGGGAAAAGTATAAATCTGCTTTAGACAAATGTAGGCTGAGAGGACTAAAGAGAGGTT
CAAGCAGCTTCTTACATTCATTTTTGTTGATTAAAGAAATCCTAATGGAAAAGGTGATTCCTGTGGTTGAAAAGCGACGACCCCCACGAGTCATTCGACGAGTTAATTCC
ATGAAATGGTGAACTATTCATCTACCACGAGACGGGTAAAACTGATAACGAAAGCGGTTGGGGCAAGAAGAAACAGAAGAGAGAGAGAGAGGCAATAGTATTTAACCGTT
ACTTAAAAACATATTCTTTGCTACTGATGAGTTTGAGGAGCATCAGAGGGATCTGCAAGGAAAAGTGCTGGCAATCACAAATCTTTTTATCAAAGAACTGATTGGCAGAG
ACAAAACAACAGCAACAAGATACATGAAAGCAGCTGAATTAATAGGTGTTATTAAAACAAAGTAGATCTCAGAGCAAATAAAAGGTAGAAACTTGAGATGCTCAACTTGT
CTCTGTCTCACATTTTTCCTTTTTTCTAAATGCATGAGAGAGTAAATAGAGGAAGTTGAGTTGAGTTTTTTTT
Protein sequenceShow/hide protein sequence
MASSKLFPSSNSRNSDLSRGSSSSSSSSASLLKPQFFSNRSRNYDNPSRNRPHTLTSMTVDGLLRNAYDSNPTESSILLDAQITLVDSPNPSSLPMNTTTTTTTNSSAVI
DSNHNSSSGAPPPKTVDDVWREIVSGERRELKEEVADEMITLEDFLLKSGAVPVEDVKLPQTKRLSGGIFSFDPIPSTTFQALDKVEGSIIGFANGVDLIGIGGSGGRGK
RGRAALEPLDKAAEQRQRRMIKNRESAARSRERKQAYQVELESLAVRLEEENERLLREKYKSALDKCRLRGLKRGSSSFLHSFLLIKEILMEKVIPVVEKRRPPRVIRRV
NSMKW