; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC11G217080 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC11G217080
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationCmU531Chr11:25471571..25474840
RNA-Seq ExpressionCmUC11G217080
SyntenyCmUC11G217080
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain
IPR044797 - Uncharacterized protein At4g06598-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135341.1 uncharacterized protein At4g06598 isoform X2 [Cucumis sativus]4.0e-17690.13Show/hide
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET  VNAMENSKVLSNMRNMI SGKHALLPPKSP PSGSS+Y+DY PNPIIGSRAVQNPR GNV+HHRTSSESL+MEEQPSWLDDLLNEPETPVQRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQA  +MK SW KQKNRTRELP TTLTTNPG   SAK+S+LLES R+LS TP EAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  S-STTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
        S +TTEK DSAET +PDRK SERMDSSHVKPG  DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
Subjt:  S-STTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE

Query:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        SLSQEQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKDARS SESVAGPVQI
Subjt:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

XP_008445999.1 PREDICTED: uncharacterized protein At4g06598 isoform X1 [Cucumis melo]1.9e-17890.67Show/hide
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET  VNAMENSKVLSNMRNMI SGKHALLPPKSP+PSGSS+Y++Y PNPI+GSRAVQNPR GNV+HHRTSSESL+MEEQPSWLDDLLNEPETPVQRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQAS +MK SW KQKNRTRELPPTTLTTNPG R SAK+SILLES R+LST  QEAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SS-TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
        SS TTEK DSAET LPDRK SERMDSSHVKPG  DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLS+QNLILGMENKALKQRLE
Subjt:  SS-TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE

Query:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        SLSQEQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD+RSGSESVAGPVQI
Subjt:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

XP_038877887.1 uncharacterized protein At4g06598 isoform X1 [Benincasa hispida]2.1e-18593.05Show/hide
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET AVNAMENSKVLSNMRN+IYSGKHALLPPKSP PSGSSSYADYFPNPIIGSRA+QNPREGNV HHRTSSESL+MEEQPSWLDDLLNEPETP+QRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNEN+TQDDSQCKNMYLPSWASQ FDSHQ S +MKA+WIKQKNRTRELPPTTLT NPGAR SAKSSILLE+SRSLS TPQEAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES
        +S++EKQDSAETGLPDRK SERMDSSHVKPG ADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES
Subjt:  SSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES

Query:  LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKDA +GSESVAGPVQI
Subjt:  LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

XP_038877888.1 uncharacterized protein At4g06598 isoform X2 [Benincasa hispida]2.1e-18593.05Show/hide
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET AVNAMENSKVLSNMRN+IYSGKHALLPPKSP PSGSSSYADYFPNPIIGSRA+QNPREGNV HHRTSSESL+MEEQPSWLDDLLNEPETP+QRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNEN+TQDDSQCKNMYLPSWASQ FDSHQ S +MKA+WIKQKNRTRELPPTTLT NPGAR SAKSSILLE+SRSLS TPQEAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES
        +S++EKQDSAETGLPDRK SERMDSSHVKPG ADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES
Subjt:  SSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES

Query:  LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKDA +GSESVAGPVQI
Subjt:  LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

XP_038877890.1 uncharacterized protein At4g06598 isoform X3 [Benincasa hispida]2.8e-18293.19Show/hide
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+IYSGKHALLPPKSP PSGSSSYADYFPNPIIGSRA+QNPREGNV HHRTSSESL+MEEQPSWLDDLLNEPETP+QRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSSTTEKQ
        AYLDAGNVSNEN+TQDDSQCKNMYLPSWASQ FDSHQ S +MKA+WIKQKNRTRELPPTTLT NPGAR SAKSSILLE+SRSLS TPQEAN F+S++EKQ
Subjt:  AYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSSTTEKQ

Query:  DSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLI
        DSAETGLPDRK SERMDSSHVKPG ADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLI
Subjt:  DSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLI

Query:  KYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        KYLEHEVLEREIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKDA +GSESVAGPVQI
Subjt:  KYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

TrEMBL top hitse value%identityAlignment
A0A0A0KPT7 BZIP domain-containing protein1.9e-17690.13Show/hide
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET  VNAMENSKVLSNMRNMI SGKHALLPPKSP PSGSS+Y+DY PNPIIGSRAVQNPR GNV+HHRTSSESL+MEEQPSWLDDLLNEPETPVQRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQA  +MK SW KQKNRTRELP TTLTTNPG   SAK+S+LLES R+LS TP EAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  S-STTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
        S +TTEK DSAET +PDRK SERMDSSHVKPG  DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
Subjt:  S-STTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE

Query:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        SLSQEQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKDARS SESVAGPVQI
Subjt:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

A0A1S3BEP8 uncharacterized protein At4g06598 isoform X19.2e-17990.67Show/hide
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET  VNAMENSKVLSNMRNMI SGKHALLPPKSP+PSGSS+Y++Y PNPI+GSRAVQNPR GNV+HHRTSSESL+MEEQPSWLDDLLNEPETPVQRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQAS +MK SW KQKNRTRELPPTTLTTNPG R SAK+SILLES R+LST  QEAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SS-TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
        SS TTEK DSAET LPDRK SERMDSSHVKPG  DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLS+QNLILGMENKALKQRLE
Subjt:  SS-TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE

Query:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        SLSQEQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD+RSGSESVAGPVQI
Subjt:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

A0A1S3BEV4 uncharacterized protein At4g06598 isoform X23.3e-17691.03Show/hide
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRNMI SGKHALLPPKSP+PSGSS+Y++Y PNPI+GSRAVQNPR GNV+HHRTSSESL+MEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-TTEK
        AYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQAS +MK SW KQKNRTRELPPTTLTTNPG R SAK+SILLES R+LST  QEAN FSS TTEK
Subjt:  AYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-TTEK

Query:  QDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQL
         DSAET LPDRK SERMDSSHVKPG  DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLS+QNLILGMENKALKQRLESLSQEQL
Subjt:  QDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQL

Query:  IKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        IKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD+RSGSESVAGPVQI
Subjt:  IKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

A0A6J1GXW7 uncharacterized protein At4g06598-like9.9e-17389.52Show/hide
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+IYSGKHALLPPKSP PSGSS YADYFP+PIIGSRAVQNPREGNVHHHRTSSESL+ME+QPSWL+DLL+EPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYTQDDSQCKNMYLPSWASQDF----DSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-
        AYLDAGNV NENYTQDDSQCKNMYLPSWASQDF    D HQASF+MKAS IKQKNR RELPPTTLTT  G+  SAKSSILLESSR LS TPQEANGFSS 
Subjt:  AYLDAGNVSNENYTQDDSQCKNMYLPSWASQDF----DSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-

Query:  TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        TTEKQDSAET +PDRK SE+MD  H+KP  ADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLSQQNLILGMENKALKQRLE+LS
Subjt:  TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKD RSG ES+AGPVQI
Subjt:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

A0A6J1K701 uncharacterized protein At4g06598-like3.6e-17590.32Show/hide
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+IYSGKHALLPPKSP PSGSSSYADYFP+PIIGSRAVQNPREGNVHHHRTSSESL+ME+QPSWL+DLL+EPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYTQDDSQCKNMYLPSWASQDF----DSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-
        AYLDAGNV NENYTQDDSQCKNMYLPSWASQDF    D HQASF+MKAS IKQKNR RELPPTTLTTN G+R SAKSSILLESSRSLS TPQEANGFSS 
Subjt:  AYLDAGNVSNENYTQDDSQCKNMYLPSWASQDF----DSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-

Query:  TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        TTEKQDSAET +PDRK SE+MD  H+KP  ADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLSQQNLILGMENKALKQRLE+LS
Subjt:  TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKD RSG ES+ GPVQI
Subjt:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

SwissProt top hitse value%identityAlignment
F4IN23 Basic leucine zipper 346.9e-2234.87Show/hide
Query:  EQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENY---TQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGA
        + PSW+D+ L+   +  +RG HRRS SDS A+L+A  VS E++     DD Q  +M+     + D + H    H+        N+   + PT  ++N   
Subjt:  EQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENY---TQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGA

Query:  RSSAKSSILLESSRSLSTTPQEA-NGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQA
         +S  S+   + ++ L  +     N  ++    +  ++  +     +   ++S    G    D KR K     +Q AQRSRVRKLQYI+ELER+V +LQA
Subjt:  RSSAKSSILLESSRSLSTTPQEA-NGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQA

Query:  NGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ
          S +S  + FL  Q L+L ++N ALKQR+ +LSQ++L K    E L+REI RLR +Y QQ
Subjt:  NGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ

Q5QNI5 Basic leucine zipper 24.0e-1434.02Show/hide
Query:  EEQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLD------AGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTT
        + QPSW+D+ L+   T  +RG HRRS SDS A+LD      AG  +++    DD Q  +M+                            + +L P     
Subjt:  EEQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLD------AGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTT

Query:  NPGARSSAKSSILLESSRSLSTTPQEANGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGL-ADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQ
         P A +++ SS            P + N  S   EKQD  ET   D   SE   ++  +P   A  D KR K     +Q AQRSRVRKLQYI+ELER+V 
Subjt:  NPGARSSAKSSILLESSRSLSTTPQEANGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGL-ADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQ

Query:  ALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIK
        +LQ   S +S  + FL  Q  +L + N  LKQR+ +L+Q+++ K
Subjt:  ALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIK

Q6K3R9 Basic leucine zipper 194.5e-1343.44Show/hide
Query:  GLADTDNKR---AKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQ
        G+AD    +   A +Q AQRSRVRKLQYI+ELER+V  LQ   S +S  + FL  Q  +L + N  LKQR+ +L+Q+++ K    E L++EI RLR +Y 
Subjt:  GLADTDNKR---AKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQ

Query:  QQQQPQPPPSSLKRTKSRDLET
        QQQ        +K T   D+ T
Subjt:  QQQQPQPPPSSLKRTKSRDLET

Q8W3M7 Uncharacterized protein At4g065981.8e-4646.52Show/hide
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        M +SK   N RN+  +GK ALLPPKSP   G +  AD+ P+ +IGS+AVQ   EGN +HHRTSSES ++EEQPSWLDDLLNEPETPV++GGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYT-------QDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        AY+D     + +YT        +++   N       S    S    F+  A   KQK R     P     + GAR ++ S  L     S S T   ++G 
Subjt:  AYLDAGNVSNENYT-------QDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPD------RKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQ
           TEK  SA     D      +   E+ D+   K   ++ D KRA+QQFAQRSRVRK+QYIAELERNVQ LQ
Subjt:  SSTTEKQDSAETGLPD------RKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQ

Q9M2K4 Basic leucine zipper 615.5e-1933.46Show/hide
Query:  EEQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLD--AGNVSNENYTQ-DDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPG
        ++ PSW+D+ L+   T  +RG HRRS SDS A+L+  +  V N ++ + DD Q  +M+     + D  ++  + H   S       TR    T+  ++  
Subjt:  EEQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLD--AGNVSNENYTQ-DDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPG

Query:  ARSSAKSSILLESS--------RSLSTTPQEANGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELE
        + S   ++     S           +      N ++ + E Q   +T  P   PS   +S     G    D KR K     +Q AQRSRVRKLQYI+ELE
Subjt:  ARSSAKSSILLESS--------RSLSTTPQEANGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELE

Query:  RNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ
        R+V +LQ   S +S  + FL  Q L+L ++N A+KQR+ +L+Q+++ K    E L+REI RLR +Y QQ
Subjt:  RNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ

Arabidopsis top hitse value%identityAlignment
AT1G35490.1 bZIP family transcription factor2.2e-3132.23Show/hide
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        MEN + LSN  N  + G+     P+  + +  S      PN              N+HHH  S + L  E+QP+WLD+LL+EP +P    GHRRS+SD+ 
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNV-SNENYTQDDSQCKNMYLPSWASQDFDSHQASF---HMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSST
        AYL++  + S EN+             SW  Q++D  Q++    H K  W               +T  G       S     + ++S+ P E +     
Subjt:  AYLDAGNV-SNENYTQDDSQCKNMYLPSWASQDFDSHQASF---HMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSST

Query:  TEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQ
          K     +  PD   S+             TD+KR K Q A R+R+R+L+YI++LER +Q LQ  G E+S+ + +L QQ L+L MEN+ALKQR++SL++
Subjt:  TEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSR----------DLETQFAKLSL
         Q +K++E ++LEREIG L+   + QQQPQ     ++  ++R          + + QFA L++
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSR----------DLETQFAKLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein2.4e-7850.67Show/hide
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNE-PETPVQRGGHRRSSSDS
        M +SK   ++RN++Y GKHALLPPK P PS S+SY++Y P  +IGSR  Q       HH RTSSES ++EE P WLDDLLNE PE+P ++ GHRRSSSDS
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNE-PETPVQRGGHRRSSSDS

Query:  FAYLDAGNVSNENYT-QDDSQCKNMYLPSWAS-QDFD----SHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        +AYLD  N +N + T Q+D   +N  L +    Q+ D    +  A+F+  AS++KQK+R R+    +L       S    +      ++L       +  
Subjt:  FAYLDAGNVSNENYT-QDDSQCKNMYLPSWAS-QDFD----SHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPDRKP-SERMDSSHVKPGLADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRL
          ++E+++ AE    D K  S   ++S+  P   + DN KRAKQQFAQRSRVRKLQYI+ELERNVQ LQA GS+VSAEL+FL+Q+NLIL MENKALK+RL
Subjt:  SSTTEKQDSAETGLPDRKP-SERMDSSHVKPGLADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRL

Query:  ESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVA
        ES++QE+LIK LE EVLE+EIGRLR LYQQQQQ Q P +S  R  S+DL++QF+ LSL  KD+    +SV+
Subjt:  ESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVA

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein2.4e-7850.67Show/hide
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNE-PETPVQRGGHRRSSSDS
        M +SK   ++RN++Y GKHALLPPK P PS S+SY++Y P  +IGSR  Q       HH RTSSES ++EE P WLDDLLNE PE+P ++ GHRRSSSDS
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNE-PETPVQRGGHRRSSSDS

Query:  FAYLDAGNVSNENYT-QDDSQCKNMYLPSWAS-QDFD----SHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        +AYLD  N +N + T Q+D   +N  L +    Q+ D    +  A+F+  AS++KQK+R R+    +L       S    +      ++L       +  
Subjt:  FAYLDAGNVSNENYT-QDDSQCKNMYLPSWAS-QDFD----SHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPDRKP-SERMDSSHVKPGLADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRL
          ++E+++ AE    D K  S   ++S+  P   + DN KRAKQQFAQRSRVRKLQYI+ELERNVQ LQA GS+VSAEL+FL+Q+NLIL MENKALK+RL
Subjt:  SSTTEKQDSAETGLPDRKP-SERMDSSHVKPGLADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRL

Query:  ESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVA
        ES++QE+LIK LE EVLE+EIGRLR LYQQQQQ Q P +S  R  S+DL++QF+ LSL  KD+    +SV+
Subjt:  ESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVA

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein4.9e-2334.87Show/hide
Query:  EQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENY---TQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGA
        + PSW+D+ L+   +  +RG HRRS SDS A+L+A  VS E++     DD Q  +M+     + D + H    H+        N+   + PT  ++N   
Subjt:  EQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENY---TQDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGA

Query:  RSSAKSSILLESSRSLSTTPQEA-NGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQA
         +S  S+   + ++ L  +     N  ++    +  ++  +     +   ++S    G    D KR K     +Q AQRSRVRKLQYI+ELER+V +LQA
Subjt:  RSSAKSSILLESSRSLSTTPQEA-NGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQA

Query:  NGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ
          S +S  + FL  Q L+L ++N ALKQR+ +LSQ++L K    E L+REI RLR +Y QQ
Subjt:  NGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2)7.2e-6747.58Show/hide
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        M +SK   N RN+  +GK ALLPPKSP   G +  AD+ P+ +IGS+AVQ   EGN +HHRTSSES ++EEQPSWLDDLLNEPETPV++GGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYT-------QDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        AY+D     + +YT        +++   N       S    S    F+  A   KQK R     P     + GAR ++ S  L     S S T   ++G 
Subjt:  AYLDAGNVSNENYT-------QDDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPD------RKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKAL
           TEK  SA     D      +   E+ D+   K   ++ D KRA+QQFAQRSRVRK+QYIAELERNVQ L                       ENK+L
Subjt:  SSTTEKQDSAETGLPD------RKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKAL

Query:  KQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRTKSRDLETQFAKLSLR
        K RLESL+QEQLIKYLEH+VLE+EI RLR LYQ QQQ +P           SS +R+KSRDLETQF  LSLR
Subjt:  KQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRTKSRDLETQFAKLSLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCGTCTCGGGAATGGGCTACATTTTGCTTGGTTTTCATCATCTCTCTCCGGTATTGTAGTCTGTCTCTCTCTGTCTGATTGCAGATGCAATGCGGGAGGAGAAAT
TCTAGCGTTGATGTGTGCAACAGTTAAATTTAACTTAAGTGTTGAGGTTTTGCTTCTTCCTTCACTTTGTCTGTTTGTATGTTTTTTAGCAGAAACTTGGGCAGTCAATG
CCATGGAGAATTCCAAGGTGTTGTCAAACATGAGAAATATGATTTACTCTGGAAAGCATGCTCTACTTCCTCCTAAGAGTCCACTTCCTAGTGGGTCATCCTCATATGCT
GATTATTTCCCTAATCCCATTATTGGGTCAAGAGCAGTGCAGAATCCCAGAGAGGGAAATGTGCACCATCATAGAACATCATCTGAAAGTCTTGTGATGGAGGAACAACC
TTCTTGGCTTGATGATCTCCTCAATGAACCCGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCAAGTGACTCCTTTGCTTACTTAGATGCAGGAAATGTTTCAA
ATGAAAATTATACACAAGATGACTCCCAATGTAAAAATATGTATTTACCTTCCTGGGCATCTCAAGATTTTGATTCCCATCAAGCTTCATTTCATATGAAAGCAAGCTGG
ATCAAACAGAAAAACAGGACACGGGAATTGCCTCCAACTACATTGACAACTAACCCAGGTGCCCGCTCTTCTGCGAAAAGTAGCATTCTTCTTGAAAGCTCAAGGTCGTT
GAGTACTACACCACAGGAAGCAAATGGGTTTTCCTCAACTACTGAAAAGCAGGATTCAGCAGAAACTGGTCTGCCTGATAGGAAGCCATCTGAAAGAATGGATAGTTCTC
ATGTTAAGCCAGGTCTGGCTGATACAGATAATAAAAGAGCTAAACAGCAATTTGCTCAACGTTCACGTGTACGGAAACTTCAGTACATTGCAGAGCTGGAAAGGAACGTA
CAAGCGTTACAAGCAAATGGTTCTGAAGTTTCTGCCGAACTTGAATTTCTCAGTCAGCAAAACTTAATTCTTGGCATGGAGAATAAAGCACTCAAGCAACGATTAGAAAG
TTTATCACAGGAGCAGCTTATAAAATACCTGGAACATGAAGTACTGGAGAGGGAGATTGGAAGACTAAGAATGTTGTACCAACAGCAGCAACAGCCACAGCCACCACCTT
CCAGCCTTAAACGCACCAAAAGCCGAGACCTTGAGACGCAATTTGCTAAGCTCTCTTTGAGACAGAAGGATGCACGTTCAGGTTCCGAGTCTGTGGCCGGTCCAGTCCAA
ATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCCGTCTCGGGAATGGGCTACATTTTGCTTGGTTTTCATCATCTCTCTCCGGTATTGTAGTCTGTCTCTCTCTGTCTGATTGCAGATGCAATGCGGGAGGAGAAAT
TCTAGCGTTGATGTGTGCAACAGTTAAATTTAACTTAAGTGTTGAGGTTTTGCTTCTTCCTTCACTTTGTCTGTTTGTATGTTTTTTAGCAGAAACTTGGGCAGTCAATG
CCATGGAGAATTCCAAGGTGTTGTCAAACATGAGAAATATGATTTACTCTGGAAAGCATGCTCTACTTCCTCCTAAGAGTCCACTTCCTAGTGGGTCATCCTCATATGCT
GATTATTTCCCTAATCCCATTATTGGGTCAAGAGCAGTGCAGAATCCCAGAGAGGGAAATGTGCACCATCATAGAACATCATCTGAAAGTCTTGTGATGGAGGAACAACC
TTCTTGGCTTGATGATCTCCTCAATGAACCCGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCAAGTGACTCCTTTGCTTACTTAGATGCAGGAAATGTTTCAA
ATGAAAATTATACACAAGATGACTCCCAATGTAAAAATATGTATTTACCTTCCTGGGCATCTCAAGATTTTGATTCCCATCAAGCTTCATTTCATATGAAAGCAAGCTGG
ATCAAACAGAAAAACAGGACACGGGAATTGCCTCCAACTACATTGACAACTAACCCAGGTGCCCGCTCTTCTGCGAAAAGTAGCATTCTTCTTGAAAGCTCAAGGTCGTT
GAGTACTACACCACAGGAAGCAAATGGGTTTTCCTCAACTACTGAAAAGCAGGATTCAGCAGAAACTGGTCTGCCTGATAGGAAGCCATCTGAAAGAATGGATAGTTCTC
ATGTTAAGCCAGGTCTGGCTGATACAGATAATAAAAGAGCTAAACAGCAATTTGCTCAACGTTCACGTGTACGGAAACTTCAGTACATTGCAGAGCTGGAAAGGAACGTA
CAAGCGTTACAAGCAAATGGTTCTGAAGTTTCTGCCGAACTTGAATTTCTCAGTCAGCAAAACTTAATTCTTGGCATGGAGAATAAAGCACTCAAGCAACGATTAGAAAG
TTTATCACAGGAGCAGCTTATAAAATACCTGGAACATGAAGTACTGGAGAGGGAGATTGGAAGACTAAGAATGTTGTACCAACAGCAGCAACAGCCACAGCCACCACCTT
CCAGCCTTAAACGCACCAAAAGCCGAGACCTTGAGACGCAATTTGCTAAGCTCTCTTTGAGACAGAAGGATGCACGTTCAGGTTCCGAGTCTGTGGCCGGTCCAGTCCAA
ATCTAGATTAGTAAATCGAGTTACGAAGCAAAGCCACCTAAGGAAATGCGGTTTGTCGATTGCTTGTGCAGGATATAACCAAGGCTCATCAAATCAAAAGTAAGTCTGTG
GTACTCAGCATCTGTTTGTTGATCTTGGTAGATGTGGAATGAAAATGTTCCAATGCGCCTGGCTTGTCGTTGCTGGAAAATGTCCTCTTTCGGGCCGGTGACTTGCCGGA
GGTGATATCTAATTTTCTACACATGGTCTTATATGGTTTATTATCAAGATGTATCGAAAAGTTCTATAGTAGAGGTGTTTTTAAGTCCCCTATTACTCTTGGGGGGCATT
CTTTTTTACTTCATCATATGCAAATATGGATGGTACTACAAATTACTATTGAACAATTTCAATGATCATACATGATAATATTATACAAGTTCAAAGCTTCATTATA
Protein sequenceShow/hide protein sequence
MGRLGNGLHFAWFSSSLSGIVVCLSLSDCRCNAGGEILALMCATVKFNLSVEVLLLPSLCLFVCFLAETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYA
DYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENYTQDDSQCKNMYLPSWASQDFDSHQASFHMKASW
IKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNV
QALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQ
I