; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039826 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039826
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationchr13:244174..247329
RNA-Seq ExpressionLag0039826
SyntenyLag0039826
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain
IPR044797 - Uncharacterized protein At4g06598-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032375.1 hypothetical protein SDJN02_06420 [Cucurbita argyrosperma subsp. argyrosperma]3.7e-17187.13Show/hide
Query:  VMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSD
        VMENSK+LSNMRN++YSGKHALLPPKSPF  GSSSY ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSD
Subjt:  VMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSD

Query:  SFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS
        SFAYLDA N+ N+NYTQDDSQCKNM LPSW+SQDFD RKDPHQASF MKAS IKQKNR RELPPTT TT   + PSAKSSILLESSR +S PQE NGFSS
Subjt:  SFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS

Query:  -TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETV
         TTEKQDSAET + D KSSE++D PH+KPAPA+TDNKRAKQQFAQRSRVRKLQYIAELERNV+ALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE +
Subjt:  -TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETV

Query:  SQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI
        SQEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKR+KSRDLETQFAKLSLRQKDG SGPES+AGPVQI
Subjt:  SQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI

XP_022956708.1 uncharacterized protein At4g06598-like [Cucurbita moschata]4.1e-17086.83Show/hide
Query:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSK+LSNMRN++YSGKHALLPPKSPF  GSS Y ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS-
        FAYLDA N+ N+NYTQDDSQCKNM LPSW+SQDFD RKDPHQASF MKAS IKQKNR RELPPTT TT   + PSAKSSILLESSR +S PQE NGFSS 
Subjt:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS-

Query:  TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVS
        TTEKQDSAET + D KSSE++D PH+KPAPA+TDNKRAKQQFAQRSRVRKLQYIAELERNV+ALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE +S
Subjt:  TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI
        QEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKR+KSRDLETQFAKLSLRQKDG SGPES+AGPVQI
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI

XP_022998117.1 uncharacterized protein At4g06598-like [Cucurbita maxima]4.4e-17287.37Show/hide
Query:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSK+LSNMRN++YSGKHALLPPKSPF  GSSSY ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS-
        FAYLDA N+ N+NYTQDDSQCKNM LPSW+SQDFD RKDPHQASF MKAS IKQKNR RELPPTT TTN  +RPSAKSSILLESSR +S PQE NGFSS 
Subjt:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS-

Query:  TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVS
        TTEKQDSAET + D KSSE++D PH+KPAPA+TDNKRAKQQFAQRSRVRKLQYIAELERNV+ALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE +S
Subjt:  TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI
        QEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKR+KSRDLETQFAKLSLRQKDG SGPES+ GPVQI
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI

XP_038877887.1 uncharacterized protein At4g06598 isoform X1 [Benincasa hispida]3.5e-17781.84Show/hide
Query:  SLLELSVVQWRRIESDNSSLLIVGLYPYIIKFCANETCAVKVMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNV
        SL  + VV+W R ES+NS      LY  II F A+ETCAV  MENSK+LSNMRN++YSGKHALLPPKSPF  GSSSY ADYFPNPIIGSRA+QNPREGNV
Subjt:  SLLELSVVQWRRIESDNSSLLIVGLYPYIIKFCANETCAVKVMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNV

Query:  QHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRT
        QHHRTSSES LMEEQPSWLDDLL+EPETP+QRGGHRRSSSDSFAYLDA N+SN+N+TQDDSQCKNM LPSW+SQ FDS    HQ S YMKA+WIKQKNRT
Subjt:  QHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRT

Query:  RELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELER
        RELPPTT T N  ARPSAKSSILLE+SR +S PQE N F+S++EKQDSAETGL D K SER+DS HVKP PA+TDNKRAKQQFAQRSRVRKLQYIAELER
Subjt:  RELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELER

Query:  NVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDG
        NV+ALQA GSEVSAELEFLSQQNLILGMENKALKQRLE++SQEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPS+LKR+KSRDLETQFAKLSLRQKD 
Subjt:  NVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDG

Query:  HSGPESVAGPVQI
        H+G ESVAGPVQI
Subjt:  HSGPESVAGPVQI

XP_038877888.1 uncharacterized protein At4g06598 isoform X2 [Benincasa hispida]3.7e-17185.19Show/hide
Query:  ETCAVKVMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGH
        ETCAV  MENSK+LSNMRN++YSGKHALLPPKSPF  GSSSY ADYFPNPIIGSRA+QNPREGNVQHHRTSSES LMEEQPSWLDDLL+EPETP+QRGGH
Subjt:  ETCAVKVMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGH

Query:  RRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQE
        RRSSSDSFAYLDA N+SN+N+TQDDSQCKNM LPSW+SQ FDS    HQ S YMKA+WIKQKNRTRELPPTT T N  ARPSAKSSILLE+SR +S PQE
Subjt:  RRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQE

Query:  TNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQ
         N F+S++EKQDSAETGL D K SER+DS HVKP PA+TDNKRAKQQFAQRSRVRKLQYIAELERNV+ALQA GSEVSAELEFLSQQNLILGMENKALKQ
Subjt:  TNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQ

Query:  RLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI
        RLE++SQEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPS+LKR+KSRDLETQFAKLSLRQKD H+G ESVAGPVQI
Subjt:  RLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI

TrEMBL top hitse value%identityAlignment
A0A0A0KPT7 BZIP domain-containing protein2.4e-16081.79Show/hide
Query:  ETCAVKVMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGH
        ETC V  MENSK+LSNMRNM+ SGKHALLPPKSPF  GSS+Y +DY PNPIIGSRAVQNPR GNV HHRTSSES LMEEQPSWLDDLL+EPETPVQRGGH
Subjt:  ETCAVKVMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGH

Query:  RRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQE
        RRSSSDSFAYLDA N+SN+NYTQDDSQCKNM LPSW+SQDFDS    HQA  YMK SW KQKNRTRELP TT TTN    PSAK+S+LLES R +S P E
Subjt:  RRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQE

Query:  TNGFS-STTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALK
         N FS +TTEK DSAET + D K SER+DS HVKP P +TDNKRAKQQFAQRSRVRKLQYIAELERNV+ALQA GSEVSAELEFLSQQNLILGMENKALK
Subjt:  TNGFS-STTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALK

Query:  QRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI
        QRLE++SQEQLIKY EHEVLE+EIGRLRMLYQQQQQPQPPPS+LKR+KSRDLETQFAKLSLRQKD  S  ESVAGPVQI
Subjt:  QRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI

A0A1S3BEP8 uncharacterized protein At4g06598 isoform X14.4e-16282.32Show/hide
Query:  ETCAVKVMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGH
        ETC V  MENSK+LSNMRNM+ SGKHALLPPKSP   GSS+Y ++Y PNPI+GSRAVQNPR GNV HHRTSSES LMEEQPSWLDDLL+EPETPVQRGGH
Subjt:  ETCAVKVMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGH

Query:  RRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQE
        RRSSSDSFAYLDA N+SN+NYTQDDSQCKNM LPSW+SQDFDS    HQAS YMK SW KQKNRTRELPPTT TTN   RPSAK+SILLES R +S  QE
Subjt:  RRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQE

Query:  TNGFSS-TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALK
         N FSS TTEK DSAET L D K SER+DS HVKP P +TDNKRAKQQFAQRSRVRKLQYIAELERNV+ALQA GSEVSAELEFLS+QNLILGMENKALK
Subjt:  TNGFSS-TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALK

Query:  QRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI
        QRLE++SQEQLIKY EHEVLE+EIGRLRMLYQQQQQPQPPPS+LKR+KSRDLETQF+KLSLRQKD  SG ESVAGPVQI
Subjt:  QRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI

A0A6J1DEI2 uncharacterized protein At4g065984.4e-16283.33Show/hide
Query:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSK L NMRN+MYSGKHALLPPKSPF  GSSS   DYFPNPIIGSRAVQNPREGNV HHRTSS S LMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS-
        FAYLDAVN+SN  YTQDDS+CKN+SLPSW+S DF+ RKD HQ SF+M+ASWIKQKNRTRELPPTT   NQ   PSAKSSILLESSRP+S P E NG SS 
Subjt:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS-

Query:  TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVS
        TTEKQDSAETGLQD K ++RI+SP+VKP   +TDNKRAKQQFAQRSRVRKLQYIAELERNV+ALQAEGSEVSAELEFLSQQNLIL MENKALKQRLE++S
Subjt:  TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI
        QEQLIKY EHEVLEREIGRLR+LYQ+QQQPQ PPSSLKRSKSRDLETQF  LSLRQ DGH+G ESV G V I
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI

A0A6J1GXW7 uncharacterized protein At4g06598-like2.0e-17086.83Show/hide
Query:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSK+LSNMRN++YSGKHALLPPKSPF  GSS Y ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS-
        FAYLDA N+ N+NYTQDDSQCKNM LPSW+SQDFD RKDPHQASF MKAS IKQKNR RELPPTT TT   + PSAKSSILLESSR +S PQE NGFSS 
Subjt:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS-

Query:  TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVS
        TTEKQDSAET + D KSSE++D PH+KPAPA+TDNKRAKQQFAQRSRVRKLQYIAELERNV+ALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE +S
Subjt:  TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI
        QEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKR+KSRDLETQFAKLSLRQKDG SGPES+AGPVQI
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI

A0A6J1K701 uncharacterized protein At4g06598-like2.1e-17287.37Show/hide
Query:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSK+LSNMRN++YSGKHALLPPKSPF  GSSSY ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS-
        FAYLDA N+ N+NYTQDDSQCKNM LPSW+SQDFD RKDPHQASF MKAS IKQKNR RELPPTT TTN  +RPSAKSSILLESSR +S PQE NGFSS 
Subjt:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSS-

Query:  TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVS
        TTEKQDSAET + D KSSE++D PH+KPAPA+TDNKRAKQQFAQRSRVRKLQYIAELERNV+ALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE +S
Subjt:  TTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI
        QEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKR+KSRDLETQFAKLSLRQKDG SGPES+ GPVQI
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI

SwissProt top hitse value%identityAlignment
F4IN23 Basic leucine zipper 346.6e-2234.59Show/hide
Query:  EQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNISNKNY---TQDDSQCKNM--------SLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRE
        + PSW+D+ LD   +  +RG HRRS SDS A+L+A  +S +++     DD Q  +M        S PS  +   ++   P  +S           +  +E
Subjt:  EQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNISNKNY---TQDDSQCKNM--------SLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRE

Query:  LPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNV
        LPP+    N     +    +  +        +  +G +S     DS+   + D K  +RI                A +Q AQRSRVRKLQYI+ELER+V
Subjt:  LPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNV

Query:  KALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQ
         +LQAE S +S  + FL  Q L+L ++N ALKQR+  +SQ++L K    E L+REI RLR +Y QQ
Subjt:  KALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQ

Q5QNI5 Basic leucine zipper 24.9e-1733.13Show/hide
Query:  ALLPPKSP--------FHGGSSSYAA---DYFPNPIIGSRAV----------QNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSS
        A LPPK P        F GG   +AA    +  +P +G+               P  G  Q  +        + QPSW+D+ LD   T  +RG HRRS S
Subjt:  ALLPPKSP--------FHGGSSSYAA---DYFPNPIIGSRAV----------QNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSS

Query:  DSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFS
        DS A+LD V+  N                   + DFD   D    S +         +  +  PP      Q A P+A +          S P + N  S
Subjt:  DSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFS

Query:  STTEKQDSAETGLQDSKSSERIDSPHVKPA-PAETDNKRAK-----QQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALK
           EKQD  ET   D   SE   +   +PA PA  D KR K     +Q AQRSRVRKLQYI+ELER+V +LQ E S +S  + FL  Q  +L + N  LK
Subjt:  STTEKQDSAETGLQDSKSSERIDSPHVKPA-PAETDNKRAK-----QQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALK

Query:  QRLETVSQEQLIKYFEHEVLERE
        QR+  ++Q+++ K    E  +RE
Subjt:  QRLETVSQEQLIKYFEHEVLERE

Q6K3R9 Basic leucine zipper 191.5e-1350.55Show/hide
Query:  AKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQ
        A +Q AQRSRVRKLQYI+ELER+V  LQ E S +S  + FL  Q  +L + N  LKQR+  ++Q+++ K    E L++EI RLR +Y QQQ
Subjt:  AKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQ

Q8W3M7 Uncharacterized protein At4g065981.8e-4850Show/hide
Query:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        M +SK   N RN+  +GK ALLPPKSPF GG  +++AD+ P+ +IGS+AVQ   EGN  HHRTSSESFL+EEQPSWLDDLL+EPETPV++GGHRRSSSDS
Subjt:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQAS----FYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNG
        FAY+D     + +YT  D    N +   +S+     ++  +  S    FY  A   KQK R     P     +  ARP++ S  L  SS   S    ++G
Subjt:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQAS----FYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNG

Query:  FSSTTEKQDSAETGLQD-----SKSS-ERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQ
            TEK  SA    +D     +KSS E+ D+P  K A +E D KRA+QQFAQRSRVRK+QYIAELERNV+ LQ
Subjt:  FSSTTEKQDSAETGLQD-----SKSS-ERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQ

Q9M2K4 Basic leucine zipper 614.0e-1931.46Show/hide
Query:  EEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLD--AVNISNKNYTQ-DDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFT
        ++ PSW+D+ LD   T  +RG HRRS SDS A+L+  +  + N ++ + DD Q  +M      + D  +    H     +  +    ++ +    P+   
Subjt:  EEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLD--AVNISNKNYTQ-DDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFT

Query:  T-----NQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERI---DSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERN
        +     N    P +     ++++         N ++ + E Q   +T  QD  S+ +     S +    P       A +Q AQRSRVRKLQYI+ELER+
Subjt:  T-----NQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERI---DSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERN

Query:  VKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQ
        V +LQ E S +S  + FL  Q L+L ++N A+KQR+  ++Q+++ K    E L+REI RLR +Y QQ
Subjt:  VKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQ

Arabidopsis top hitse value%identityAlignment
AT1G35490.1 bZIP family transcription factor2.0e-2933.22Show/hide
Query:  NVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKN
        N+ HH  S +    E+QP+WLD+LL EP +P    GHRRS+SD+ AYL++  + +K          +++  SW  Q++D            +++  +Q N
Subjt:  NVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKN

Query:  RTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAEL
        +      T   TN   + +     L  SS+PI             EK  S         +S + D P  K     TD+KR K Q A R+R+R+L+YI++L
Subjt:  RTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAEL

Query:  ERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSR----------DLET
        ER ++ LQ EG E+S+ + +L QQ L+L MEN+ALKQR++++++ Q +K+ E ++LEREIG L+   + QQQPQ     ++  ++R          + + 
Subjt:  ERNVKALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSR----------DLET

Query:  QFAKLSL
        QFA L++
Subjt:  QFAKLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein1.6e-7951.87Show/hide
Query:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDE-PETPVQRGGHRRSSSD
        M +SK   ++RN+MY GKHALLPPK PF   S+SY ++Y P  +IGSR  Q        H RTSSES L+EE P WLDDLL+E PE+P ++ GHRRSSSD
Subjt:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDE-PETPVQRGGHRRSSSD

Query:  SFAYLDAVNISNKNYT-QDDSQCKNMSLPSWSS-QDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCAR--PSAKSSILLESSRPISIPQETN
        S+AYLD  N +N + T Q+D   +N  L +    Q+ D  K+   A+FY  AS++KQK+R R+   +   T  C    P A+ +   ++   + + Q+  
Subjt:  SFAYLDAVNISNKNYT-QDDSQCKNMSLPSWSS-QDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCAR--PSAKSSILLESSRPISIPQETN

Query:  GFSSTTEKQDSAETGLQDSK--SSERIDSPHVKPAPAETDN-KRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALK
          SS  E+++ AE    D K  SSE  +S +  P   E DN KRAKQQFAQRSRVRKLQYI+ELERNV+ LQAEGS+VSAEL+FL+Q+NLIL MENKALK
Subjt:  GFSSTTEKQDSAETGLQDSK--SSERIDSPHVKPAPAETDN-KRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALK

Query:  QRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVA
        +RLE+++QE+LIK  E EVLE+EIGRLR LYQQQQQ Q P +S  R+ S+DL++QF+ LSL  KD +   +SV+
Subjt:  QRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVA

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein1.6e-7951.87Show/hide
Query:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDE-PETPVQRGGHRRSSSD
        M +SK   ++RN+MY GKHALLPPK PF   S+SY ++Y P  +IGSR  Q        H RTSSES L+EE P WLDDLL+E PE+P ++ GHRRSSSD
Subjt:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDE-PETPVQRGGHRRSSSD

Query:  SFAYLDAVNISNKNYT-QDDSQCKNMSLPSWSS-QDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCAR--PSAKSSILLESSRPISIPQETN
        S+AYLD  N +N + T Q+D   +N  L +    Q+ D  K+   A+FY  AS++KQK+R R+   +   T  C    P A+ +   ++   + + Q+  
Subjt:  SFAYLDAVNISNKNYT-QDDSQCKNMSLPSWSS-QDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFTTNQCAR--PSAKSSILLESSRPISIPQETN

Query:  GFSSTTEKQDSAETGLQDSK--SSERIDSPHVKPAPAETDN-KRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALK
          SS  E+++ AE    D K  SSE  +S +  P   E DN KRAKQQFAQRSRVRKLQYI+ELERNV+ LQAEGS+VSAEL+FL+Q+NLIL MENKALK
Subjt:  GFSSTTEKQDSAETGLQDSK--SSERIDSPHVKPAPAETDN-KRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKALK

Query:  QRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVA
        +RLE+++QE+LIK  E EVLE+EIGRLR LYQQQQQ Q P +S  R+ S+DL++QF+ LSL  KD +   +SV+
Subjt:  QRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVA

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein4.7e-2334.59Show/hide
Query:  EQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNISNKNY---TQDDSQCKNM--------SLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRE
        + PSW+D+ LD   +  +RG HRRS SDS A+L+A  +S +++     DD Q  +M        S PS  +   ++   P  +S           +  +E
Subjt:  EQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNISNKNY---TQDDSQCKNM--------SLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRE

Query:  LPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNV
        LPP+    N     +    +  +        +  +G +S     DS+   + D K  +RI                A +Q AQRSRVRKLQYI+ELER+V
Subjt:  LPPTTFTTNQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNV

Query:  KALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQ
         +LQAE S +S  + FL  Q L+L ++N ALKQR+  +SQ++L K    E L+REI RLR +Y QQ
Subjt:  KALQAEGSEVSAELEFLSQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQ

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2)1.7e-6849.6Show/hide
Query:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        M +SK   N RN+  +GK ALLPPKSPF GG  +++AD+ P+ +IGS+AVQ   EGN  HHRTSSESFL+EEQPSWLDDLL+EPETPV++GGHRRSSSDS
Subjt:  MENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGNVQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQAS----FYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNG
        FAY+D     + +YT  D    N +   +S+     ++  +  S    FY  A   KQK R     P     +  ARP++ S  L  SS   S    ++G
Subjt:  FAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQAS----FYMKASWIKQKNRTRELPPTTFTTNQCARPSAKSSILLESSRPISIPQETNG

Query:  FSSTTEKQDSAETGLQD-----SKSS-ERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKA
            TEK  SA    +D     +KSS E+ D+P  K A +E D KRA+QQFAQRSRVRK+QYIAELERNV+ L                       ENK+
Subjt:  FSSTTEKQDSAETGLQD-----SKSS-ERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFLSQQNLILGMENKA

Query:  LKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRSKSRDLETQFAKLSLR
        LK RLE+++QEQLIKY EH+VLE+EI RLR LYQ QQQ +P           SS +RSKSRDLETQF  LSLR
Subjt:  LKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRSKSRDLETQFAKLSLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTACATTTTGCTTCCTCATGTCAATTTTTCTCTGCTTGAACTGTCTGTCGTGCAATGGCGCAGAATCGAATCGGATAATTCTAGCTTGCTAATTGTGGGGTTGTA
TCCATACATCATCAAATTCTGTGCGAACGAAACTTGCGCAGTTAAGGTCATGGAGAATTCAAAAATGCTGTCAAACATGAGAAATATGATGTACTCTGGAAAGCATGCTC
TACTTCCTCCTAAAAGTCCATTTCATGGTGGTTCCTCTTCATATGCTGCTGATTATTTCCCCAATCCCATTATTGGGTCAAGAGCAGTGCAGAATCCCAGAGAGGGAAAT
GTGCAACATCATCGAACATCATCTGAAAGTTTTCTGATGGAGGAGCAACCTTCTTGGCTTGATGATCTCCTTGATGAACCTGAAACACCTGTTCAAAGAGGTGGTCATCG
ACGTTCATCAAGTGATTCCTTTGCTTACTTAGATGCGGTAAATATTTCAAATAAAAATTATACACAAGATGACTCCCAATGTAAAAATATGTCTTTACCTTCCTGGTCAT
CACAAGACTTTGATTCCCGCAAAGATCCACATCAAGCTTCATTTTATATGAAAGCAAGCTGGATCAAACAGAAGAACAGGACACGGGAGTTGCCTCCAACTACATTTACA
ACTAACCAATGTGCCCGCCCTTCTGCCAAAAGTAGCATTCTTCTTGAAAGCTCAAGGCCGATAAGTATACCACAGGAAACAAATGGATTTTCATCAACTACTGAAAAGCA
GGATTCAGCAGAAACTGGTCTGCAGGATTCAAAGTCATCTGAGAGAATTGATAGTCCCCATGTTAAGCCAGCTCCGGCTGAAACAGATAATAAAAGAGCTAAACAACAAT
TTGCTCAACGTTCTCGTGTACGGAAACTTCAGTACATTGCCGAGCTAGAAAGGAACGTAAAAGCTTTACAAGCAGAAGGTTCTGAAGTTTCTGCCGAGCTTGAATTTCTC
AGTCAGCAAAACTTAATTCTTGGCATGGAGAACAAAGCACTCAAGCAGCGACTAGAAACTGTATCTCAGGAGCAGCTTATAAAATATTTTGAGCATGAAGTACTGGAGAG
GGAGATTGGAAGACTACGAATGTTGTATCAACAGCAGCAGCAGCCGCAGCCACCACCTTCCAGCCTTAAACGCAGCAAGAGCAGAGACCTTGAGACGCAGTTTGCGAAGC
TGTCATTGAGACAGAAGGATGGGCATTCAGGTCCTGAGTCTGTGGCCGGTCCAGTCCAAATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCTACATTTTGCTTCCTCATGTCAATTTTTCTCTGCTTGAACTGTCTGTCGTGCAATGGCGCAGAATCGAATCGGATAATTCTAGCTTGCTAATTGTGGGGTTGTA
TCCATACATCATCAAATTCTGTGCGAACGAAACTTGCGCAGTTAAGGTCATGGAGAATTCAAAAATGCTGTCAAACATGAGAAATATGATGTACTCTGGAAAGCATGCTC
TACTTCCTCCTAAAAGTCCATTTCATGGTGGTTCCTCTTCATATGCTGCTGATTATTTCCCCAATCCCATTATTGGGTCAAGAGCAGTGCAGAATCCCAGAGAGGGAAAT
GTGCAACATCATCGAACATCATCTGAAAGTTTTCTGATGGAGGAGCAACCTTCTTGGCTTGATGATCTCCTTGATGAACCTGAAACACCTGTTCAAAGAGGTGGTCATCG
ACGTTCATCAAGTGATTCCTTTGCTTACTTAGATGCGGTAAATATTTCAAATAAAAATTATACACAAGATGACTCCCAATGTAAAAATATGTCTTTACCTTCCTGGTCAT
CACAAGACTTTGATTCCCGCAAAGATCCACATCAAGCTTCATTTTATATGAAAGCAAGCTGGATCAAACAGAAGAACAGGACACGGGAGTTGCCTCCAACTACATTTACA
ACTAACCAATGTGCCCGCCCTTCTGCCAAAAGTAGCATTCTTCTTGAAAGCTCAAGGCCGATAAGTATACCACAGGAAACAAATGGATTTTCATCAACTACTGAAAAGCA
GGATTCAGCAGAAACTGGTCTGCAGGATTCAAAGTCATCTGAGAGAATTGATAGTCCCCATGTTAAGCCAGCTCCGGCTGAAACAGATAATAAAAGAGCTAAACAACAAT
TTGCTCAACGTTCTCGTGTACGGAAACTTCAGTACATTGCCGAGCTAGAAAGGAACGTAAAAGCTTTACAAGCAGAAGGTTCTGAAGTTTCTGCCGAGCTTGAATTTCTC
AGTCAGCAAAACTTAATTCTTGGCATGGAGAACAAAGCACTCAAGCAGCGACTAGAAACTGTATCTCAGGAGCAGCTTATAAAATATTTTGAGCATGAAGTACTGGAGAG
GGAGATTGGAAGACTACGAATGTTGTATCAACAGCAGCAGCAGCCGCAGCCACCACCTTCCAGCCTTAAACGCAGCAAGAGCAGAGACCTTGAGACGCAGTTTGCGAAGC
TGTCATTGAGACAGAAGGATGGGCATTCAGGTCCTGAGTCTGTGGCCGGTCCAGTCCAAATCTAG
Protein sequenceShow/hide protein sequence
MGYILLPHVNFSLLELSVVQWRRIESDNSSLLIVGLYPYIIKFCANETCAVKVMENSKMLSNMRNMMYSGKHALLPPKSPFHGGSSSYAADYFPNPIIGSRAVQNPREGN
VQHHRTSSESFLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNISNKNYTQDDSQCKNMSLPSWSSQDFDSRKDPHQASFYMKASWIKQKNRTRELPPTTFT
TNQCARPSAKSSILLESSRPISIPQETNGFSSTTEKQDSAETGLQDSKSSERIDSPHVKPAPAETDNKRAKQQFAQRSRVRKLQYIAELERNVKALQAEGSEVSAELEFL
SQQNLILGMENKALKQRLETVSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRSKSRDLETQFAKLSLRQKDGHSGPESVAGPVQI