; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G018540 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G018540
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationCma_Chr04:9401035..9405342
RNA-Seq ExpressionCmaCh04G018540
SyntenyCmaCh04G018540
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain
IPR044797 - Uncharacterized protein At4g06598-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601617.1 hypothetical protein SDJN03_06850, partial [Cucurbita argyrosperma subsp. sororia]1.3e-20399.21Show/hide
Query:  QHSRKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQ
        +HSRKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQ
Subjt:  QHSRKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQ

Query:  RGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLS
        RGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTT LGSRPSAKSSILLESSRSLS
Subjt:  RGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLS

Query:  TPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMEN
        TPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMEN
Subjt:  TPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMEN

Query:  KALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVG
        KALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM G
Subjt:  KALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVG

KAG7032375.1 hypothetical protein SDJN02_06420 [Cucurbita argyrosperma subsp. argyrosperma]6.6e-20398.95Show/hide
Query:  RKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGG
        RKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGG
Subjt:  RKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGG

Query:  HRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQ
        HRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTT LGS PSAKSSILLESSR LSTPQ
Subjt:  HRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQ

Query:  EANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKAL
        EANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKAL
Subjt:  EANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKAL

Query:  KQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
        KQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM GPVQI
Subjt:  KQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI

XP_022956708.1 uncharacterized protein At4g06598-like [Cucurbita moschata]1.2e-19698.65Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSS YADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT
        AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTT LGS PSAKSSILLESSR LSTPQEANGFSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
        EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM GPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI

XP_022998117.1 uncharacterized protein At4g06598-like [Cucurbita maxima]5.3e-200100Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT
        AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
        EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI

XP_023524171.1 uncharacterized protein At4g06598-like [Cucurbita pepo subsp. pepo]4.3e-19497.58Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSN RNVIYSGKHALLPPKSPFPSGSS YADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT
        AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRA ELPP TLTTNLGSRPSAKSSILLESSRSLSTPQE NGFSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT

Query:  TEKQDSAET-SMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLS
        TEKQDSAET SMPDRKSSE++DGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLS
Subjt:  TEKQDSAET-SMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLS

Query:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
        QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM GPVQI
Subjt:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI

TrEMBL top hitse value%identityAlignment
A0A0A0KPT7 BZIP domain-containing protein1.2e-16585.44Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+I SGKHALLPPKSPFPSGSS+Y+DY P+PIIGSRAVQNPR GNV+HHRTSSESLLME+QPSWL+DLL+EPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT
        AYLDAGNV NENYTQDDSQCKNMYLPSWASQDF    D HQA   MK S  KQKNR RELP TTLTTN G  PSAK+S+LLES R+LSTP EAN FS TT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEK DSAET +PDRK SE+MD  H+KP P DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLSQQNLILGMENKALKQRLE+LSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
        EQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKD RS  ES+ GPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI

A0A1S3BEP8 uncharacterized protein At4g06598 isoform X11.7e-16785.71Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+I SGKHALLPPKSP PSGSS+Y++Y P+PI+GSRAVQNPR GNV+HHRTSSESLLME+QPSWL+DLL+EPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT
        AYLDAGNV NENYTQDDSQCKNMYLPSWASQDF    D HQAS  MK S  KQKNR RELPPTTLTTN G RPSAK+SILLES R+LST QEAN FSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEK DSAET++PDRK SE+MD  H+KP P DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLS+QNLILGMENKALKQRLE+LSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
        EQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD RSG ES+ GPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI

A0A1S3BEV4 uncharacterized protein At4g06598 isoform X21.7e-16785.71Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+I SGKHALLPPKSP PSGSS+Y++Y P+PI+GSRAVQNPR GNV+HHRTSSESLLME+QPSWL+DLL+EPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT
        AYLDAGNV NENYTQDDSQCKNMYLPSWASQDF    D HQAS  MK S  KQKNR RELPPTTLTTN G RPSAK+SILLES R+LST QEAN FSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEK DSAET++PDRK SE+MD  H+KP P DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLS+QNLILGMENKALKQRLE+LSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
        EQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD RSG ES+ GPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI

A0A6J1GXW7 uncharacterized protein At4g06598-like5.9e-19798.65Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSS YADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT
        AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTT LGS PSAKSSILLESSR LSTPQEANGFSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
        EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM GPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI

A0A6J1K701 uncharacterized protein At4g06598-like2.5e-200100Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT
        AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
        EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMVGPVQI

SwissProt top hitse value%identityAlignment
F4IN23 Basic leucine zipper 342.3e-2033.13Show/hide
Query:  ALLPPKSP-----FPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENY-
        A LPPK P     +P  SS     F +P   + A       N                PSW+++ LD   +  +RG HRRS SDS A+L+A  V  E++ 
Subjt:  ALLPPKSP-----FPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENY-

Query:  --TQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSAETSM
            DD Q  +M+     + D +   +P         S I  KN    + PT  ++N     S  S+   + ++ L  P + N  ++      D  ++  
Subjt:  --TQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSAETSM

Query:  PDRKSSEKMDGPHIKPAPADTDNKR-----------AKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
              E  DG        D+   R           A +Q AQRSRVRKLQYI+ELER+V +LQAE S +S  + FL  Q L+L ++N ALKQR+  LSQ
Subjt:  PDRKSSEKMDGPHIKPAPADTDNKR-----------AKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQ
        ++L K    E L+REI RLR +Y QQ
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQ

Q5QNI5 Basic leucine zipper 21.4e-1435.8Show/hide
Query:  EDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNL
        + QPSW+++ LD   T  +RG HRRS SDS A+LD           DD+           + DFD R D  Q   +M +  ++        PP       
Subjt:  EDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNL

Query:  GSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDG--PHIKPAPADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQA
          +P+A ++       S S+P + N   S   EKQD  ET     ++  + DG  P    +PA  D KR K     +Q AQRSRVRKLQYI+ELER+V +
Subjt:  GSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDG--PHIKPAPADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQA

Query:  LQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIK
        LQ E S +S  + FL  Q  +L + N  LKQR+  L+Q+++ K
Subjt:  LQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIK

Q6K3R9 Basic leucine zipper 193.5e-1346.36Show/hide
Query:  AKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSL
        A +Q AQRSRVRKLQYI+ELER+V  LQ E S +S  + FL  Q  +L + N  LKQR+  L+Q+++ K    E L++EI RLR +Y QQQ        +
Subjt:  AKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSL

Query:  KRTKSRDLET
        K T   D+ T
Subjt:  KRTKSRDLET

Q8W3M7 Uncharacterized protein At4g065987.5e-4848.19Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        M +SK   N RN+  +GK ALLPPKSPF  G +  AD+ PS +IGS+AVQ   EGN +HHRTSSES L+E+QPSWL+DLL+EPETPV++GGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKN------MYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEAN
        AY+D     + +YT  D    N       ++      D+  R  P    F   A   KQK R  +  P +     G+RP++ S  L     S S  +  +
Subjt:  AYLDAGNVLNENYTQDDSQCKN------MYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEAN

Query:  GFSSTTTEKQDSAETSMPD------RKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQ
          S   TEK  SA  S  D      + S EK D P  K A ++ D KRA+QQFAQRSRVRK+QYIAELERNVQ LQ
Subjt:  GFSSTTTEKQDSAETSMPD------RKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQ

Q9M2K4 Basic leucine zipper 611.1e-1932.73Show/hide
Query:  EDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLD--AGNVLNENYTQ-DDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLT
        +  PSW+++ LD   T  +RG HRRS SDS A+L+  +  V N ++ + DD Q  +M           F  D H  + N         N    + PT  +
Subjt:  EDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLD--AGNVLNENYTQ-DDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLT

Query:  TNLGSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSA-------ETSMPDRKSSEKMDGPHIKPAPADTDNKR-----------AKQQFAQRSRVR
        +N  S PS  +S+  + +   + P + +         Q++A          +  +  +E  DGP        +   R           A +Q AQRSRVR
Subjt:  TNLGSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSA-------ETSMPDRKSSEKMDGPHIKPAPADTDNKR-----------AKQQFAQRSRVR

Query:  KLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQ
        KLQYI+ELER+V +LQ E S +S  + FL  Q L+L ++N A+KQR+  L+Q+++ K    E L+REI RLR +Y QQ
Subjt:  KLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQ

Arabidopsis top hitse value%identityAlignment
AT1G35490.1 bZIP family transcription factor2.0e-3235.6Show/hide
Query:  NVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNV-LNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQK
        N+HHH  S + L  EDQP+WL++LL EP +P    GHRRS+SD+ AYL++  +   EN+             SW  Q++D            +++S +Q 
Subjt:  NVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNV-LNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQK

Query:  NRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIA
        N+      T   TN+               R++S    A   SS   EK      S     +S K DGP  K     TD+KR K Q A R+R+R+L+YI+
Subjt:  NRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIA

Query:  ELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSR----------DL
        +LER +Q LQ EG E+S+ + +L QQ L+L MEN+ALKQR+++L++ Q +K++E ++LEREIG L+   + QQQPQ     ++  ++R          + 
Subjt:  ELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSR----------DL

Query:  ETQFAKLSL
        + QFA L++
Subjt:  ETQFAKLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein4.9e-7951.6Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE-PETPVQRGGHRRSSSDS
        M +SK   ++RN++Y GKHALLPPK PFPS S+SY++Y P+ +IGSR  Q       HH RTSSES L+E+ P WL+DLL+E PE+P ++ GHRRSSSDS
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE-PETPVQRGGHRRSSSDS

Query:  FAYLDAGNVLNENYT-QDDSQCKNMYLPSWAS-QDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPS----AKSSILLESSRSLSTPQEA
        +AYLD  N  N + T Q+D   +N  L +    Q+ D  K+   A+F   AS +KQK+R R+    T     G+ PS    A+ +   ++  +L   Q+A
Subjt:  FAYLDAGNVLNENYT-QDDSQCKNMYLPSWAS-QDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPS----AKSSILLESSRSLSTPQEA

Query:  NGFSSTTTEKQDSAETSMPDRKS-SEKMDGPHIKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKAL
           SS   E+++ AE    D K  S + +  +  P   + DN KRAKQQFAQRSRVRKLQYI+ELERNVQ LQAEGS+VSAEL+FL+Q+NLIL MENKAL
Subjt:  NGFSSTTTEKQDSAETSMPDRKS-SEKMDGPHIKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKAL

Query:  KQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM
        K+RLE+++QE+LIK LE EVLE+EIGRLR LYQQQQQ Q P +S  R  S+DL++QF+ LSL  KD     +S+
Subjt:  KQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein4.9e-7951.6Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE-PETPVQRGGHRRSSSDS
        M +SK   ++RN++Y GKHALLPPK PFPS S+SY++Y P+ +IGSR  Q       HH RTSSES L+E+ P WL+DLL+E PE+P ++ GHRRSSSDS
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE-PETPVQRGGHRRSSSDS

Query:  FAYLDAGNVLNENYT-QDDSQCKNMYLPSWAS-QDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPS----AKSSILLESSRSLSTPQEA
        +AYLD  N  N + T Q+D   +N  L +    Q+ D  K+   A+F   AS +KQK+R R+    T     G+ PS    A+ +   ++  +L   Q+A
Subjt:  FAYLDAGNVLNENYT-QDDSQCKNMYLPSWAS-QDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPS----AKSSILLESSRSLSTPQEA

Query:  NGFSSTTTEKQDSAETSMPDRKS-SEKMDGPHIKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKAL
           SS   E+++ AE    D K  S + +  +  P   + DN KRAKQQFAQRSRVRKLQYI+ELERNVQ LQAEGS+VSAEL+FL+Q+NLIL MENKAL
Subjt:  NGFSSTTTEKQDSAETSMPDRKS-SEKMDGPHIKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKAL

Query:  KQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM
        K+RLE+++QE+LIK LE EVLE+EIGRLR LYQQQQQ Q P +S  R  S+DL++QF+ LSL  KD     +S+
Subjt:  KQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein1.6e-2133.13Show/hide
Query:  ALLPPKSP-----FPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENY-
        A LPPK P     +P  SS     F +P   + A       N                PSW+++ LD   +  +RG HRRS SDS A+L+A  V  E++ 
Subjt:  ALLPPKSP-----FPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENY-

Query:  --TQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSAETSM
            DD Q  +M+     + D +   +P         S I  KN    + PT  ++N     S  S+   + ++ L  P + N  ++      D  ++  
Subjt:  --TQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSAETSM

Query:  PDRKSSEKMDGPHIKPAPADTDNKR-----------AKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
              E  DG        D+   R           A +Q AQRSRVRKLQYI+ELER+V +LQAE S +S  + FL  Q L+L ++N ALKQR+  LSQ
Subjt:  PDRKSSEKMDGPHIKPAPADTDNKR-----------AKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQ
        ++L K    E L+REI RLR +Y QQ
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQ

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2)3.9e-6848.53Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        M +SK   N RN+  +GK ALLPPKSPF  G +  AD+ PS +IGS+AVQ   EGN +HHRTSSES L+E+QPSWL+DLL+EPETPV++GGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKN------MYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEAN
        AY+D     + +YT  D    N       ++      D+  R  P    F   A   KQK R  +  P +     G+RP++ S  L     S S  +  +
Subjt:  AYLDAGNVLNENYTQDDSQCKN------MYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEAN

Query:  GFSSTTTEKQDSAETSMPD------RKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMEN
          S   TEK  SA  S  D      + S EK D P  K A ++ D KRA+QQFAQRSRVRK+QYIAELERNVQ L                       EN
Subjt:  GFSSTTTEKQDSAETSMPD------RKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMEN

Query:  KALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRTKSRDLETQFAKLSLR
        K+LK RLE+L+QEQLIKYLEH+VLE+EI RLR LYQ QQQ +P           SS +R+KSRDLETQF  LSLR
Subjt:  KALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRTKSRDLETQFAKLSLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGACTATTCTGCCGGGAACAAAAGGATCGGATACCCATATTCCCGGAGTCTTTTCTATATCTGGGCTGTACAAGTAGAACTGATCTGTGTTCTTGCCCGCATCGG
GGAAGAGAGAGAGAGAGAGGACTCGAGGACTTACTTTGCGTCAACTCCTTTCCCCTTCCCATTGCATTTTTCAAGATCTCGGTCTACCGATCAGCATTCTCGAAAACTGC
GCAGTTATTGGGCAGTCATGGAGAATTCCAAGGTGTTGTCAAACATGAGAAATGTGATTTACTCTGGAAAGCATGCTCTACTTCCTCCCAAGAGTCCATTTCCTAGTGGT
TCCTCCTCATATGCTGATTATTTCCCCAGTCCCATTATTGGGTCAAGAGCTGTGCAGAATCCCAGAGAGGGAAATGTGCATCATCATAGAACATCATCTGAAAGTCTTCT
AATGGAGGATCAACCTTCTTGGCTCAATGATCTCCTCGATGAACCTGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCGAGTGACTCCTTTGCATACTTGGATG
CAGGAAATGTTTTGAACGAAAATTATACGCAAGATGACTCCCAATGTAAAAATATGTATTTACCTTCCTGGGCATCACAAGATTTTGATTTCCGCAAAGATCCCCATCAA
GCTTCTTTCAATATGAAGGCAAGCTCGATCAAACAGAAAAACAGGGCACGGGAATTGCCTCCAACTACATTGACAACTAACCTGGGTTCCCGGCCTTCTGCCAAAAGTAG
CATTCTTCTTGAGAGCTCGAGGTCGCTAAGTACACCACAGGAAGCAAATGGGTTCTCATCAACAACTACTGAAAAGCAGGATTCAGCAGAAACCAGTATGCCTGATCGAA
AGTCATCTGAGAAAATGGATGGTCCCCATATCAAGCCAGCTCCGGCTGATACGGATAATAAAAGAGCTAAACAGCAATTTGCTCAACGTTCACGTGTACGTAAACTTCAG
TACATTGCAGAGCTAGAACGGAACGTACAAGCTTTACAAGCAGAAGGTTCTGAAGTTTCTGCCGAGCTTGAATTTCTCAGTCAGCAAAACTTAATTCTTGGCATGGAGAA
TAAAGCACTCAAGCAGCGATTAGAAAATTTATCTCAGGAGCAGCTTATAAAATACTTGGAGCATGAAGTGCTGGAGAGGGAGATAGGAAGACTACGAATGTTGTACCAAC
AGCAACAACAGCCTCAGCCACCACCTTCCAGCCTTAAACGTACCAAGAGCCGAGACCTTGAGACGCAGTTTGCCAAGCTCTCTTTGAGACAGAAGGATGGGCGTTCGGGT
CCCGAGTCTATGGTCGGTCCAGTTCAAATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGACTATTCTGCCGGGAACAAAAGGATCGGATACCCATATTCCCGGAGTCTTTTCTATATCTGGGCTGTACAAGTAGAACTGATCTGTGTTCTTGCCCGCATCGG
GGAAGAGAGAGAGAGAGAGGACTCGAGGACTTACTTTGCGTCAACTCCTTTCCCCTTCCCATTGCATTTTTCAAGATCTCGGTCTACCGATCAGCATTCTCGAAAACTGC
GCAGTTATTGGGCAGTCATGGAGAATTCCAAGGTGTTGTCAAACATGAGAAATGTGATTTACTCTGGAAAGCATGCTCTACTTCCTCCCAAGAGTCCATTTCCTAGTGGT
TCCTCCTCATATGCTGATTATTTCCCCAGTCCCATTATTGGGTCAAGAGCTGTGCAGAATCCCAGAGAGGGAAATGTGCATCATCATAGAACATCATCTGAAAGTCTTCT
AATGGAGGATCAACCTTCTTGGCTCAATGATCTCCTCGATGAACCTGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCGAGTGACTCCTTTGCATACTTGGATG
CAGGAAATGTTTTGAACGAAAATTATACGCAAGATGACTCCCAATGTAAAAATATGTATTTACCTTCCTGGGCATCACAAGATTTTGATTTCCGCAAAGATCCCCATCAA
GCTTCTTTCAATATGAAGGCAAGCTCGATCAAACAGAAAAACAGGGCACGGGAATTGCCTCCAACTACATTGACAACTAACCTGGGTTCCCGGCCTTCTGCCAAAAGTAG
CATTCTTCTTGAGAGCTCGAGGTCGCTAAGTACACCACAGGAAGCAAATGGGTTCTCATCAACAACTACTGAAAAGCAGGATTCAGCAGAAACCAGTATGCCTGATCGAA
AGTCATCTGAGAAAATGGATGGTCCCCATATCAAGCCAGCTCCGGCTGATACGGATAATAAAAGAGCTAAACAGCAATTTGCTCAACGTTCACGTGTACGTAAACTTCAG
TACATTGCAGAGCTAGAACGGAACGTACAAGCTTTACAAGCAGAAGGTTCTGAAGTTTCTGCCGAGCTTGAATTTCTCAGTCAGCAAAACTTAATTCTTGGCATGGAGAA
TAAAGCACTCAAGCAGCGATTAGAAAATTTATCTCAGGAGCAGCTTATAAAATACTTGGAGCATGAAGTGCTGGAGAGGGAGATAGGAAGACTACGAATGTTGTACCAAC
AGCAACAACAGCCTCAGCCACCACCTTCCAGCCTTAAACGTACCAAGAGCCGAGACCTTGAGACGCAGTTTGCCAAGCTCTCTTTGAGACAGAAGGATGGGCGTTCGGGT
CCCGAGTCTATGGTCGGTCCAGTTCAAATCTAGATTTGTAAATCAGTTGGGAGTTTTTGTGCATGGCCTAACGATTTCTCCGATGTACGAAACCATGTTTGTCGATTACT
TGTGCAGGATACCCAGGGCTCATCAAAAGTCAGTCTGTACTCAGCATATGACTATGGTGGCTCTTGGTAGATGTGAATGAGAATCTGCCAATGCACCTGGCTGTCATGGC
TGGAAAATGTCCTCTTTTGGGCTGGTGACTTGCTGGTGGTGATATCTATTTTCTAAGCATGGTCTCCTTTGGTTTTTTTATCAGGATGTAATGAAAATTCTAGTAGGTGT
TTTTCAAGTCCCCCATTCCTCTTGGGGGGCTCTCTTTTTTTTCCCCCTAAATCAAATGCCAATGCATCCAGAGTATTAGGTGTTGCGGGCATGAATGGTACTAACAATTA
CTATTGAACAATATCAATGATCATGCTTGATATCCTCCTGCTTTGTGGGTGGGCCGAAGTGTTAAACATTGAGCCTTTGCCCAATAATTTCATTCTGATGACCCATTATA
TAGGGGTGTCATTTGAC
Protein sequenceShow/hide protein sequence
MRDYSAGNKRIGYPYSRSLFYIWAVQVELICVLARIGEEREREDSRTYFASTPFPFPLHFSRSRSTDQHSRKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSG
SSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQ
ASFNMKASSIKQKNRARELPPTTLTTNLGSRPSAKSSILLESSRSLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQ
YIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSG
PESMVGPVQI