CuGenDBv2

Clc11G14640 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G14640
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: BZIP domain-containing protein
Genome location: ClcChr11:25517899..25521169
RNA-Seq Expression: Clc11G14640
Synteny: Clc11G14640
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain
IPR044797 - Uncharacterized protein At4g06598-like
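
The genome location in this record follows a `chromosome:start..end` pattern. A minimal sketch of how such a string could be split and the gene span computed is given here. It assumes the coordinates are 1-based and inclusive, which the record does not state, and the function name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("ClcChr11:25517899..25521169")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3271 bp on ClcChr11.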


Homology
GenBank top hits | e value | % identity
XP_004135341.1 uncharacterized protein At4g06598 isoform X2 [Cucumis sativus] | 5.7e-175 | 89.6
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET  VNAMENSKVLSNMRNMI SGKHALLPPKSP PSGSS+Y+DY PNPIIGSRAVQNPR GNV+HHRTSSESL+MEEQPSWLDDLLNEPETPVQRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNENYT +DSQCKNMYLPSWASQDFDSHQA  +MK SW KQKNRTRELP TTLTTNPG   SAK+S+LLES R+LS TP EAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  S-STTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
        S +TTEK DSAET +PDRK SERMDSSHVKPG  DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
Subjt:  S-STTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE

Query:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        SLSQEQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKDARS SESVAGPVQI
Subjt:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

XP_008445999.1 PREDICTED: uncharacterized protein At4g06598 isoform X1 [Cucumis melo] | 2.1e-177 | 90.13
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET  VNAMENSKVLSNMRNMI SGKHALLPPKSP+PSGSS+Y++Y PNPI+GSRAVQNPR GNV+HHRTSSESL+MEEQPSWLDDLLNEPETPVQRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNENYT +DSQCKNMYLPSWASQDFDSHQAS +MK SW KQKNRTRELPPTTLTTNPG R SAK+SILLES R+LST  QEAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SS-TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
        SS TTEK DSAET LPDRK SERMDSSHVKPG  DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLS+QNLILGMENKALKQRLE
Subjt:  SS-TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE

Query:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        SLSQEQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD+RSGSESVAGPVQI
Subjt:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

XP_038877887.1 uncharacterized protein At4g06598 isoform X1 [Benincasa hispida] | 3.0e-184 | 92.51
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET AVNAMENSKVLSNMRN+IYSGKHALLPPKSP PSGSSSYADYFPNPIIGSRA+QNPREGNV HHRTSSESL+MEEQPSWLDDLLNEPETP+QRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNEN+T +DSQCKNMYLPSWASQ FDSHQ S +MKA+WIKQKNRTRELPPTTLT NPGAR SAKSSILLE+SRSLS TPQEAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES
        +S++EKQDSAETGLPDRK SERMDSSHVKPG ADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES
Subjt:  SSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES

Query:  LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKDA +GSESVAGPVQI
Subjt:  LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

XP_038877888.1 uncharacterized protein At4g06598 isoform X2 [Benincasa hispida] | 3.0e-184 | 92.51
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET AVNAMENSKVLSNMRN+IYSGKHALLPPKSP PSGSSSYADYFPNPIIGSRA+QNPREGNV HHRTSSESL+MEEQPSWLDDLLNEPETP+QRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNEN+T +DSQCKNMYLPSWASQ FDSHQ S +MKA+WIKQKNRTRELPPTTLT NPGAR SAKSSILLE+SRSLS TPQEAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES
        +S++EKQDSAETGLPDRK SERMDSSHVKPG ADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES
Subjt:  SSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLES

Query:  LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKDA +GSESVAGPVQI
Subjt:  LSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

XP_038877890.1 uncharacterized protein At4g06598 isoform X3 [Benincasa hispida] | 4.1e-181 | 92.64
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+IYSGKHALLPPKSP PSGSSSYADYFPNPIIGSRA+QNPREGNV HHRTSSESL+MEEQPSWLDDLLNEPETP+QRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSSTTEKQ
        AYLDAGNVSNEN+T +DSQCKNMYLPSWASQ FDSHQ S +MKA+WIKQKNRTRELPPTTLT NPGAR SAKSSILLE+SRSLS TPQEAN F+S++EKQ
Subjt:  AYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSSTTEKQ

Query:  DSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLI
        DSAETGLPDRK SERMDSSHVKPG ADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLI
Subjt:  DSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLI

Query:  KYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        KYLEHEVLEREIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKDA +GSESVAGPVQI
Subjt:  KYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

TrEMBL top hits | e value | % identity
A0A0A0KPT7 BZIP domain-containing protein | 2.8e-175 | 89.6
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET  VNAMENSKVLSNMRNMI SGKHALLPPKSP PSGSS+Y+DY PNPIIGSRAVQNPR GNV+HHRTSSESL+MEEQPSWLDDLLNEPETPVQRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNENYT +DSQCKNMYLPSWASQDFDSHQA  +MK SW KQKNRTRELP TTLTTNPG   SAK+S+LLES R+LS TP EAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  S-STTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
        S +TTEK DSAET +PDRK SERMDSSHVKPG  DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
Subjt:  S-STTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE

Query:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        SLSQEQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKDARS SESVAGPVQI
Subjt:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

A0A1S3BEP8 uncharacterized protein At4g06598 isoform X1 | 1.0e-177 | 90.13
Query:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR
        ET  VNAMENSKVLSNMRNMI SGKHALLPPKSP+PSGSS+Y++Y PNPI+GSRAVQNPR GNV+HHRTSSESL+MEEQPSWLDDLLNEPETPVQRGGHR
Subjt:  ETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHR

Query:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        RSSSDSFAYLDAGNVSNENYT +DSQCKNMYLPSWASQDFDSHQAS +MK SW KQKNRTRELPPTTLTTNPG R SAK+SILLES R+LST  QEAN F
Subjt:  RSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SS-TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE
        SS TTEK DSAET LPDRK SERMDSSHVKPG  DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLS+QNLILGMENKALKQRLE
Subjt:  SS-TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLE

Query:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        SLSQEQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD+RSGSESVAGPVQI
Subjt:  SLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

A0A1S3BEV4 uncharacterized protein At4g06598 isoform X2 | 3.6e-175 | 90.49
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRNMI SGKHALLPPKSP+PSGSS+Y++Y PNPI+GSRAVQNPR GNV+HHRTSSESL+MEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-TTEK
        AYLDAGNVSNENYT +DSQCKNMYLPSWASQDFDSHQAS +MK SW KQKNRTRELPPTTLTTNPG R SAK+SILLES R+LST  QEAN FSS TTEK
Subjt:  AYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-TTEK

Query:  QDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQL
         DSAET LPDRK SERMDSSHVKPG  DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLS+QNLILGMENKALKQRLESLSQEQL
Subjt:  QDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQL

Query:  IKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        IKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD+RSGSESVAGPVQI
Subjt:  IKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

A0A6J1GXW7 uncharacterized protein At4g06598-like | 1.4e-171 | 88.98
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+IYSGKHALLPPKSP PSGSS YADYFP+PIIGSRAVQNPREGNVHHHRTSSESL+ME+QPSWL+DLL+EPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYTHNDSQCKNMYLPSWASQDF----DSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-
        AYLDAGNV NENYT +DSQCKNMYLPSWASQDF    D HQASF+MKAS IKQKNR RELPPTTLTT  G+  SAKSSILLESSR LS TPQEANGFSS 
Subjt:  AYLDAGNVSNENYTHNDSQCKNMYLPSWASQDF----DSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-

Query:  TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        TTEKQDSAET +PDRK SE+MD  H+KP  ADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLSQQNLILGMENKALKQRLE+LS
Subjt:  TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKD RSG ES+AGPVQI
Subjt:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

A0A6J1K701 uncharacterized protein At4g06598-like | 5.2e-174 | 89.78
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+IYSGKHALLPPKSP PSGSSSYADYFP+PIIGSRAVQNPREGNVHHHRTSSESL+ME+QPSWL+DLL+EPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYTHNDSQCKNMYLPSWASQDF----DSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-
        AYLDAGNV NENYT +DSQCKNMYLPSWASQDF    D HQASF+MKAS IKQKNR RELPPTTLTTN G+R SAKSSILLESSRSLS TPQEANGFSS 
Subjt:  AYLDAGNVSNENYTHNDSQCKNMYLPSWASQDF----DSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSS-

Query:  TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        TTEKQDSAET +PDRK SE+MD  H+KP  ADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLSQQNLILGMENKALKQRLE+LS
Subjt:  TTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI
        QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKD RSG ES+ GPVQI
Subjt:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQI

SwissProt top hits | e value | % identity
F4IN23 Basic leucine zipper 34 | 1.5e-21 | 34.48
Query:  EQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENY---THNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGA
        + PSW+D+ L+   +  +RG HRRS SDS A+L+A  VS E++     +D Q  +M+     + D + H    H+        N+   + PT  ++N   
Subjt:  EQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENY---THNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGA

Query:  RSSAKSSILLESSRSLSTTPQEA-NGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQA
         +S  S+   + ++ L  +     N  ++    +  ++  +     +   ++S    G    D KR K     +Q AQRSRVRKLQYI+ELER+V +LQA
Subjt:  RSSAKSSILLESSRSLSTTPQEA-NGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQA

Query:  NGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ
          S +S  + FL  Q L+L ++N ALKQR+ +LSQ++L K    E L+REI RLR +Y QQ
Subjt:  NGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ

Q5QNI5 Basic leucine zipper 2 | 2.4e-14 | 35.27
Query:  EEQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENY---THNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPG
        + QPSW+D+ L+   T  +RG HRRS SDS A+LD   VS++N     H+  +  +  L S  S D                       L P      P 
Subjt:  EEQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENY---THNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPG

Query:  ARSSAKSSILLESSRSLSTTPQEANGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGL-ADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQ
        A +++ SS            P + N  S   EKQD  ET   D   SE   ++  +P   A  D KR K     +Q AQRSRVRKLQYI+ELER+V +LQ
Subjt:  ARSSAKSSILLESSRSLSTTPQEANGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGL-ADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQ

Query:  ANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIK
           S +S  + FL  Q  +L + N  LKQR+ +L+Q+++ K
Subjt:  ANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIK

Q6K3R9 Basic leucine zipper 19 | 4.5e-13 | 43.44
Query:  GLADTDNKR---AKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQ
        G+AD    +   A +Q AQRSRVRKLQYI+ELER+V  LQ   S +S  + FL  Q  +L + N  LKQR+ +L+Q+++ K    E L++EI RLR +Y 
Subjt:  GLADTDNKR---AKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQ

Query:  QQQQPQPPPSSLKRTKSRDLET
        QQQ        +K T   D+ T
Subjt:  QQQQPQPPPSSLKRTKSRDLET

Q8W3M7 Uncharacterized protein At4g06598 | 3.6e-47 | 46.89
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        M +SK   N RN+  +GK ALLPPKSP   G +  AD+ P+ +IGS+AVQ   EGN +HHRTSSES ++EEQPSWLDDLLNEPETPV++GGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYT-------HNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        AY+D     + +YT       +N++   N       S    S    F+  A   KQK R     P     + GAR ++ S  L     S S T   ++G 
Subjt:  AYLDAGNVSNENYT-------HNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPD------RKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQ
           TEK  SA     D      +   E+ D+   K   ++ D KRA+QQFAQRSRVRK+QYIAELERNVQ LQ
Subjt:  SSTTEKQDSAETGLPD------RKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQ

Q9M2K4 Basic leucine zipper 61 | 2.7e-18 | 33.09
Query:  EEQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLD--AGNVSNENYTH-NDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPG
        ++ PSW+D+ L+   T  +RG HRRS SDS A+L+  +  V N ++   +D Q  +M+     + D  ++  + H   S       TR    T+  ++  
Subjt:  EEQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLD--AGNVSNENYTH-NDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPG

Query:  ARSSAKSSILLESS--------RSLSTTPQEANGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELE
        + S   ++     S           +      N ++ + E Q   +T  P   PS   +S     G    D KR K     +Q AQRSRVRKLQYI+ELE
Subjt:  ARSSAKSSILLESS--------RSLSTTPQEANGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELE

Query:  RNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ
        R+V +LQ   S +S  + FL  Q L+L ++N A+KQR+ +L+Q+++ K    E L+REI RLR +Y QQ
Subjt:  RNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ

Arabidopsis top hits | e value | % identity
AT1G35490.1 bZIP family transcription factor | 2.2e-31 | 32.23
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        MEN + LSN  N  + G+     P+  + +  S      PN              N+HHH  S + L  E+QP+WLD+LL+EP +P    GHRRS+SD+ 
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNV-SNENYTHNDSQCKNMYLPSWASQDFDSHQASF---HMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSST
        AYL++  + S EN+             SW  Q++D  Q++    H K  W               +T  G       S     + ++S+ P E +     
Subjt:  AYLDAGNV-SNENYTHNDSQCKNMYLPSWASQDFDSHQASF---HMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSST

Query:  TEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQ
          K     +  PD   S+             TD+KR K Q A R+R+R+L+YI++LER +Q LQ  G E+S+ + +L QQ L+L MEN+ALKQR++SL++
Subjt:  TEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSR----------DLETQFAKLSL
         Q +K++E ++LEREIG L+   + QQQPQ     ++  ++R          + + QFA L++
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSR----------DLETQFAKLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein | 2.4e-78 | 50.67
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNE-PETPVQRGGHRRSSSDS
        M +SK   ++RN++Y GKHALLPPK P PS S+SY++Y P  +IGSR  Q       HH RTSSES ++EE P WLDDLLNE PE+P ++ GHRRSSSDS
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNE-PETPVQRGGHRRSSSDS

Query:  FAYLDAGNVSNENYT-HNDSQCKNMYLPSWAS-QDFD----SHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        +AYLD  N +N + T  ND   +N  L +    Q+ D    +  A+F+  AS++KQK+R R+    +L       S    +      ++L       +  
Subjt:  FAYLDAGNVSNENYT-HNDSQCKNMYLPSWAS-QDFD----SHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPDRKP-SERMDSSHVKPGLADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRL
          ++E+++ AE    D K  S   ++S+  P   + DN KRAKQQFAQRSRVRKLQYI+ELERNVQ LQA GS+VSAEL+FL+Q+NLIL MENKALK+RL
Subjt:  SSTTEKQDSAETGLPDRKP-SERMDSSHVKPGLADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRL

Query:  ESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVA
        ES++QE+LIK LE EVLE+EIGRLR LYQQQQQ Q P +S  R  S+DL++QF+ LSL  KD+    +SV+
Subjt:  ESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVA

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein | 2.4e-78 | 50.67
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNE-PETPVQRGGHRRSSSDS
        M +SK   ++RN++Y GKHALLPPK P PS S+SY++Y P  +IGSR  Q       HH RTSSES ++EE P WLDDLLNE PE+P ++ GHRRSSSDS
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNE-PETPVQRGGHRRSSSDS

Query:  FAYLDAGNVSNENYT-HNDSQCKNMYLPSWAS-QDFD----SHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        +AYLD  N +N + T  ND   +N  L +    Q+ D    +  A+F+  AS++KQK+R R+    +L       S    +      ++L       +  
Subjt:  FAYLDAGNVSNENYT-HNDSQCKNMYLPSWAS-QDFD----SHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPDRKP-SERMDSSHVKPGLADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRL
          ++E+++ AE    D K  S   ++S+  P   + DN KRAKQQFAQRSRVRKLQYI+ELERNVQ LQA GS+VSAEL+FL+Q+NLIL MENKALK+RL
Subjt:  SSTTEKQDSAETGLPDRKP-SERMDSSHVKPGLADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKALKQRL

Query:  ESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVA
        ES++QE+LIK LE EVLE+EIGRLR LYQQQQQ Q P +S  R  S+DL++QF+ LSL  KD+    +SV+
Subjt:  ESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVA

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein | 1.1e-22 | 34.48
Query:  EQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENY---THNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGA
        + PSW+D+ L+   +  +RG HRRS SDS A+L+A  VS E++     +D Q  +M+     + D + H    H+        N+   + PT  ++N   
Subjt:  EQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENY---THNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGA

Query:  RSSAKSSILLESSRSLSTTPQEA-NGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQA
         +S  S+   + ++ L  +     N  ++    +  ++  +     +   ++S    G    D KR K     +Q AQRSRVRKLQYI+ELER+V +LQA
Subjt:  RSSAKSSILLESSRSLSTTPQEA-NGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQA

Query:  NGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ
          S +S  + FL  Q L+L ++N ALKQR+ +LSQ++L K    E L+REI RLR +Y QQ
Subjt:  NGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQ

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2) | 1.5e-67 | 47.85
Query:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF
        M +SK   N RN+  +GK ALLPPKSP   G +  AD+ P+ +IGS+AVQ   EGN +HHRTSSES ++EEQPSWLDDLLNEPETPV++GGHRRSSSDSF
Subjt:  MENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYADYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVSNENYT-------HNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF
        AY+D     + +YT       +N++   N       S    S    F+  A   KQK R     P     + GAR ++ S  L     S S T   ++G 
Subjt:  AYLDAGNVSNENYT-------HNDSQCKNMYLPSWASQDFDSHQASFHMKASWIKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGF

Query:  SSTTEKQDSAETGLPD------RKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKAL
           TEK  SA     D      +   E+ D+   K   ++ D KRA+QQFAQRSRVRK+QYIAELERNVQ L                       ENK+L
Subjt:  SSTTEKQDSAETGLPD------RKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQANGSEVSAELEFLSQQNLILGMENKAL

Query:  KQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRTKSRDLETQFAKLSLR
        K RLESL+QEQLIKYLEH+VLE+EI RLR LYQ QQQ +P           SS +R+KSRDLETQF  LSLR
Subjt:  KQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRTKSRDLETQFAKLSLR
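
The unlabeled middle line of each alignment block summarizes the match at each position. As a rough sketch, assuming the usual BLAST convention (a letter marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap), identities and positives can be counted per block as shown here. The helper name `midline_stats` is hypothetical, and the example uses the final block of the Q6K3R9 alignment.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment block.

    Assumes the BLAST midline convention: letters are identical
    residues, '+' is a conservative substitution, space is a
    mismatch or gap.
    """
    # Pad in case trailing spaces were trimmed from the midline.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives, len(query)

# Final block of the Q6K3R9 alignment.
query = "QQQQPQPPPSSLKRTKSRDLET"
midline = "QQQ" + " " * 8 + "+K T   D+ T"
identical, positives, length = midline_stats(query, midline)
print(identical, positives, length)
print(round(100 * identical / length, 1))
```

The % identity reported for a hit covers the whole alignment, so a single block will generally not reproduce that figure. Summing the counts over all blocks of a hit would be needed for a comparable value.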


Sequences
CDS sequence
ATGGGCCGTCTCGGGAATGGGCTACATTTTGCTTGGTTTTCATCATCTCTCTCCGGTATTGTAGTCTGTCTCTCTCTGTCTGATTGCAGATGCAATGCGGGAGGAGAAAT
TCTAGCGTTGATGTGTGCAACAGTTAAATTTAACTTAAGTGTTGAGGTTTTGCTTCTTCCTTCACTTTGTCTGTTTGTATGTTTTTTAGCAGAAACTTGGGCAGTCAATG
CCATGGAGAATTCCAAGGTGTTGTCAAACATGAGAAATATGATTTACTCTGGAAAGCATGCTCTACTTCCTCCTAAGAGTCCACTTCCTAGTGGGTCATCCTCATATGCT
GATTATTTCCCTAATCCCATTATTGGGTCAAGAGCAGTGCAGAATCCCAGAGAGGGAAATGTGCACCATCATAGAACATCATCTGAAAGTCTTGTGATGGAGGAACAACC
TTCTTGGCTTGATGATCTCCTCAATGAACCCGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCAAGTGACTCCTTTGCTTACTTAGATGCAGGAAATGTTTCAA
ATGAAAATTATACACATAATGACTCCCAATGTAAAAATATGTATTTACCTTCCTGGGCATCTCAAGATTTTGATTCCCATCAAGCTTCATTTCATATGAAAGCAAGCTGG
ATCAAACAGAAAAACAGGACACGGGAATTGCCTCCAACTACATTGACAACTAACCCAGGTGCCCGCTCTTCTGCGAAAAGTAGCATTCTTCTTGAAAGCTCAAGGTCGTT
GAGTACTACACCACAGGAAGCAAATGGGTTTTCCTCAACTACTGAAAAGCAGGATTCAGCAGAAACTGGTCTGCCTGATAGGAAGCCATCTGAAAGAATGGATAGTTCTC
ATGTTAAGCCAGGTCTGGCTGATACAGATAATAAAAGAGCTAAACAGCAATTTGCTCAACGTTCACGTGTACGGAAACTTCAGTACATTGCAGAGCTGGAAAGGAACGTA
CAAGCGTTACAAGCAAATGGTTCTGAAGTTTCTGCCGAACTTGAATTTCTCAGTCAGCAAAACTTAATTCTTGGCATGGAGAATAAAGCACTCAAGCAACGATTAGAAAG
TTTATCACAGGAGCAGCTTATAAAATACCTGGAACATGAAGTACTGGAGAGGGAGATTGGAAGACTAAGAATGTTGTACCAACAGCAGCAACAGCCACAGCCACCACCTT
CCAGCCTTAAACGCACCAAAAGCCGAGACCTTGAGACGCAATTTGCTAAGCTCTCTTTGAGACAGAAGGATGCACGTTCAGGTTCCGAGTCTGTGGCCGGTCCAGTCCAA
ATCTAG
mRNA sequence
ATGGGCCGTCTCGGGAATGGGCTACATTTTGCTTGGTTTTCATCATCTCTCTCCGGTATTGTAGTCTGTCTCTCTCTGTCTGATTGCAGATGCAATGCGGGAGGAGAAAT
TCTAGCGTTGATGTGTGCAACAGTTAAATTTAACTTAAGTGTTGAGGTTTTGCTTCTTCCTTCACTTTGTCTGTTTGTATGTTTTTTAGCAGAAACTTGGGCAGTCAATG
CCATGGAGAATTCCAAGGTGTTGTCAAACATGAGAAATATGATTTACTCTGGAAAGCATGCTCTACTTCCTCCTAAGAGTCCACTTCCTAGTGGGTCATCCTCATATGCT
GATTATTTCCCTAATCCCATTATTGGGTCAAGAGCAGTGCAGAATCCCAGAGAGGGAAATGTGCACCATCATAGAACATCATCTGAAAGTCTTGTGATGGAGGAACAACC
TTCTTGGCTTGATGATCTCCTCAATGAACCCGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCAAGTGACTCCTTTGCTTACTTAGATGCAGGAAATGTTTCAA
ATGAAAATTATACACATAATGACTCCCAATGTAAAAATATGTATTTACCTTCCTGGGCATCTCAAGATTTTGATTCCCATCAAGCTTCATTTCATATGAAAGCAAGCTGG
ATCAAACAGAAAAACAGGACACGGGAATTGCCTCCAACTACATTGACAACTAACCCAGGTGCCCGCTCTTCTGCGAAAAGTAGCATTCTTCTTGAAAGCTCAAGGTCGTT
GAGTACTACACCACAGGAAGCAAATGGGTTTTCCTCAACTACTGAAAAGCAGGATTCAGCAGAAACTGGTCTGCCTGATAGGAAGCCATCTGAAAGAATGGATAGTTCTC
ATGTTAAGCCAGGTCTGGCTGATACAGATAATAAAAGAGCTAAACAGCAATTTGCTCAACGTTCACGTGTACGGAAACTTCAGTACATTGCAGAGCTGGAAAGGAACGTA
CAAGCGTTACAAGCAAATGGTTCTGAAGTTTCTGCCGAACTTGAATTTCTCAGTCAGCAAAACTTAATTCTTGGCATGGAGAATAAAGCACTCAAGCAACGATTAGAAAG
TTTATCACAGGAGCAGCTTATAAAATACCTGGAACATGAAGTACTGGAGAGGGAGATTGGAAGACTAAGAATGTTGTACCAACAGCAGCAACAGCCACAGCCACCACCTT
CCAGCCTTAAACGCACCAAAAGCCGAGACCTTGAGACGCAATTTGCTAAGCTCTCTTTGAGACAGAAGGATGCACGTTCAGGTTCCGAGTCTGTGGCCGGTCCAGTCCAA
ATCTAGATTAGTAAATCGAGTTACGAAGCAAAGCCACCTAAGGAAATGCGGTTTGTCGATTGCTTGTGCAGGATATAACCAAGGCTCATCAAATCAAAAGTAAGTCTGTG
GTACTCAGCATCTGTTTGTTGATCTTGGTAGATGTGGAATGAAAATGTTCCAATGCGCCTGGCTTGTCGTTGCTGGAAAATGTCCTCTTTCGGGCCGGTGACTTGCCGGA
GGTGATATCTAATTTTCTACACATGGTCTTATATGGTTTATTATCAAGATGTATCGAAAAGTTCTATAGTAGAGGTGTTTTTAAGTCCCCTATTACTCTTGGGGGGCATT
CTTTTTTACTTCATCATATGCAAATATGGATGGTACTACAAATTACTATTGAACAATTTCAATGATCATACATGATAATATTATACAAGTTCAAAGCTTCATTATA
Protein sequence
MGRLGNGLHFAWFSSSLSGIVVCLSLSDCRCNAGGEILALMCATVKFNLSVEVLLLPSLCLFVCFLAETWAVNAMENSKVLSNMRNMIYSGKHALLPPKSPLPSGSSSYA
DYFPNPIIGSRAVQNPREGNVHHHRTSSESLVMEEQPSWLDDLLNEPETPVQRGGHRRSSSDSFAYLDAGNVSNENYTHNDSQCKNMYLPSWASQDFDSHQASFHMKASW
IKQKNRTRELPPTTLTTNPGARSSAKSSILLESSRSLSTTPQEANGFSSTTEKQDSAETGLPDRKPSERMDSSHVKPGLADTDNKRAKQQFAQRSRVRKLQYIAELERNV
QALQANGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDARSGSESVAGPVQ
I
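
The protein sequence corresponds to the CDS read in frame from its first base. As a minimal sketch, assuming the standard nuclear genetic code, the snippet here translates the first ten and last five codons of the CDS so they can be compared with the start and end of the protein. The names `CODON_TABLE` and `translate` are introduced for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGCCGTCTCGGGAATGGGCTACATTTT"  # first 30 nt of the CDS
cds_end = "CCAGTCCAAATCTAG"                    # last 15 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MGRLGNGLHF` and `PVQI`, which agree with the first ten and last four residues of the protein sequence.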