; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg24411 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg24411
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationCarg_Chr04:10767552..10771540
RNA-Seq ExpressionCarg24411
SyntenyCarg24411
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain
IPR044797 - Uncharacterized protein At4g06598-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601617.1 hypothetical protein SDJN03_06850, partial [Cucurbita argyrosperma subsp. sororia]2.8e-20299.47Show/hide
Query:  RKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGG
        RKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGG
Subjt:  RKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGG

Query:  HRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQ
        HRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGS PSAKSSILLESSR LSTPQ
Subjt:  HRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQ

Query:  EANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKAL
        EANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKAL
Subjt:  EANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKAL

Query:  KQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAG
        KQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAG
Subjt:  KQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAG

KAG7032375.1 hypothetical protein SDJN02_06420 [Cucurbita argyrosperma subsp. argyrosperma]2.3e-212100Show/hide
Query:  MILFCMLFGRKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE
        MILFCMLFGRKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE
Subjt:  MILFCMLFGRKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE

Query:  PETPVQRGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLE
        PETPVQRGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLE
Subjt:  PETPVQRGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLE

Query:  SSRLLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNL
        SSRLLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNL
Subjt:  SSRLLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNL

Query:  ILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
        ILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
Subjt:  ILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI

XP_022956708.1 uncharacterized protein At4g06598-like [Cucurbita moschata]7.0e-20199.73Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSS YADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT
        AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
        EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI

XP_022998117.1 uncharacterized protein At4g06598-like [Cucurbita maxima]4.3e-19898.92Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT
        AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTT LGS PSAKSSILLESSR LSTPQEANGFSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
        EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM GPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI

XP_023524171.1 uncharacterized protein At4g06598-like [Cucurbita pepo subsp. pepo]4.1e-19397.04Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSN RNVIYSGKHALLPPKSPFPSGSS YADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT
        AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRA ELPP TLTT LGS PSAKSSILLESSR LSTPQE NGFSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT

Query:  TEKQDSAET-SMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLS
        TEKQDSAET SMPDRKSSE++DGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLS
Subjt:  TEKQDSAET-SMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLS

Query:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
        QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
Subjt:  QEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI

TrEMBL top hitse value%identityAlignment
A0A0A0KPT7 BZIP domain-containing protein4.6e-16685.44Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+I SGKHALLPPKSPFPSGSS+Y+DY P+PIIGSRAVQNPR GNV+HHRTSSESLLME+QPSWL+DLL+EPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT
        AYLDAGNV NENYTQDDSQCKNMYLPSWASQDF    D HQA   MK S  KQKNR RELP TTLTT  G  PSAK+S+LLES R LSTP EAN FS TT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEK DSAET +PDRK SE+MD  H+KP P DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLSQQNLILGMENKALKQRLE+LSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
        EQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQFAKLSLRQKD RS  ES+AGPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI

A0A1S3BEP8 uncharacterized protein At4g06598 isoform X15.5e-16785.44Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+I SGKHALLPPKSP PSGSS+Y++Y P+PI+GSRAVQNPR GNV+HHRTSSESLLME+QPSWL+DLL+EPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT
        AYLDAGNV NENYTQDDSQCKNMYLPSWASQDF    D HQAS  MK S  KQKNR RELPPTTLTT  G  PSAK+SILLES R LST QEAN FSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEK DSAET++PDRK SE+MD  H+KP P DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLS+QNLILGMENKALKQRLE+LSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
        EQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD RSG ES+AGPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI

A0A1S3BEV4 uncharacterized protein At4g06598 isoform X25.5e-16785.44Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRN+I SGKHALLPPKSP PSGSS+Y++Y P+PI+GSRAVQNPR GNV+HHRTSSESLLME+QPSWL+DLL+EPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT
        AYLDAGNV NENYTQDDSQCKNMYLPSWASQDF    D HQAS  MK S  KQKNR RELPPTTLTT  G  PSAK+SILLES R LST QEAN FSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEK DSAET++PDRK SE+MD  H+KP P DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLS+QNLILGMENKALKQRLE+LSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
        EQLIKYLEHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD RSG ES+AGPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI

A0A6J1GXW7 uncharacterized protein At4g06598-like3.4e-20199.73Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSS YADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT
        AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
        EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI

A0A6J1K701 uncharacterized protein At4g06598-like2.1e-19898.92Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT
        AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTT LGS PSAKSSILLESSR LSTPQEANGFSSTT
Subjt:  AYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTT

Query:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
        TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ
Subjt:  TEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQ

Query:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI
        EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESM GPVQI
Subjt:  EQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI

SwissProt top hitse value%identityAlignment
F4IN23 Basic leucine zipper 341.3e-1933.12Show/hide
Query:  ALLPPKSP-----FPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENY-
        A LPPK P     +P  SS     F +P   + A       N                PSW+++ LD   +  +RG HRRS SDS A+L+A  V  E++ 
Subjt:  ALLPPKSP-----FPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENY-

Query:  --TQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTTTEKQDSAETSM
            DD Q  +M+     + D +   +P         S I  KN    + PT  ++   S PS  +S   ++  L  +    N   +     +  ++  M
Subjt:  --TQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTTTEKQDSAETSM

Query:  PDRKSSEKMDGPHIKPAPADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKY
             +   +           D KR K     +Q AQRSRVRKLQYI+ELER+V +LQAE S +S  + FL  Q L+L ++N ALKQR+  LSQ++L K 
Subjt:  PDRKSSEKMDGPHIKPAPADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKY

Query:  LEHEVLEREIGRLRMLYQQQ
           E L+REI RLR +Y QQ
Subjt:  LEHEVLEREIGRLRMLYQQQ

Q5QNI5 Basic leucine zipper 24.7e-1435.39Show/hide
Query:  EDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTL
        + QPSW+++ LD   T  +RG HRRS SDS A+LD           DD+           + DFD R D  Q   +M +  ++        PP       
Subjt:  EDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTL

Query:  GSWPSAKSSILLESSRLLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDG--PHIKPAPADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQA
           P+A +          S+P + N   S   EKQD  ET     ++  + DG  P    +PA  D KR K     +Q AQRSRVRKLQYI+ELER+V +
Subjt:  GSWPSAKSSILLESSRLLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDG--PHIKPAPADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQA

Query:  LQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIK
        LQ E S +S  + FL  Q  +L + N  LKQR+  L+Q+++ K
Subjt:  LQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIK

Q6K3R9 Basic leucine zipper 193.0e-1346.36Show/hide
Query:  AKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSL
        A +Q AQRSRVRKLQYI+ELER+V  LQ E S +S  + FL  Q  +L + N  LKQR+  L+Q+++ K    E L++EI RLR +Y QQQ        +
Subjt:  AKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSL

Query:  KRTKSRDLET
        K T   D+ T
Subjt:  KRTKSRDLET

Q8W3M7 Uncharacterized protein At4g065981.1e-4748.91Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        M +SK   N RN+  +GK ALLPPKSPF  G +  AD+ PS +IGS+AVQ   EGN +HHRTSSES L+E+QPSWL+DLL+EPETPV++GGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKN------MYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEAN
        AY+D     + +YT  D    N       ++      D+  R  P    F   A   KQK R  +  P +     G+ P++ SS  LESS   S  +  +
Subjt:  AYLDAGNVLNENYTQDDSQCKN------MYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEAN

Query:  GFSSTTTEKQDSAETSMPD------RKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQ
          S   TEK  SA  S  D      + S EK D P  K A ++ D KRA+QQFAQRSRVRK+QYIAELERNVQ LQ
Subjt:  GFSSTTTEKQDSAETSMPD------RKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQ

Q9M2K4 Basic leucine zipper 612.8e-1932.37Show/hide
Query:  EDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLD--AGNVLNENYTQ-DDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLT
        +  PSW+++ LD   T  +RG HRRS SDS A+L+  +  V N ++ + DD Q  +M           F  D H  + N         N    + PT  +
Subjt:  EDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLD--AGNVLNENYTQ-DDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLT

Query:  TTLGSWPSAKSSILLESSRLLSTPQEANGFSSTTTEKQDSA-------ETSMPDRKSSEKMDGPHIKPAPADTDNKR-----------AKQQFAQRSRVR
        +   S PS  +S+  + +   + P + +         Q++A          +  +  +E  DGP        +   R           A +Q AQRSRVR
Subjt:  TTLGSWPSAKSSILLESSRLLSTPQEANGFSSTTTEKQDSA-------ETSMPDRKSSEKMDGPHIKPAPADTDNKR-----------AKQQFAQRSRVR

Query:  KLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQ
        KLQYI+ELER+V +LQ E S +S  + FL  Q L+L ++N A+KQR+  L+Q+++ K    E L+REI RLR +Y QQ
Subjt:  KLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQ

Arabidopsis top hitse value%identityAlignment
AT1G35490.1 bZIP family transcription factor5.1e-3234.95Show/hide
Query:  NVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNV-LNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQK
        N+HHH  S + L  EDQP+WL++LL EP +P    GHRRS+SD+ AYL++  +   EN+             SW  Q++D            +++S +Q 
Subjt:  NVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNV-LNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQK

Query:  NRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIA
        N+            LG   S  +   ++ +        A   SS   EK      S     +S K DGP  K     TD+KR K Q A R+R+R+L+YI+
Subjt:  NRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTTTEKQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIA

Query:  ELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSR----------DL
        +LER +Q LQ EG E+S+ + +L QQ L+L MEN+ALKQR+++L++ Q +K++E ++LEREIG L+   + QQQPQ     ++  ++R          + 
Subjt:  ELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSR----------DL

Query:  ETQFAKLSL
        + QFA L++
Subjt:  ETQFAKLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein7.7e-8151.88Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE-PETPVQRGGHRRSSSDS
        M +SK   ++RN++Y GKHALLPPK PFPS S+SY++Y P+ +IGSR  Q       HH RTSSES L+E+ P WL+DLL+E PE+P ++ GHRRSSSDS
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE-PETPVQRGGHRRSSSDS

Query:  FAYLDAGNVLNENYT-QDDSQCKNMYLPSWAS-QDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSW-PSAKSSILLESSRLLSTPQEANGF
        +AYLD  N  N + T Q+D   +N  L +    Q+ D  K+   A+F   AS +KQK+R R+      T    SW P A+ +   ++   L   Q+A   
Subjt:  FAYLDAGNVLNENYT-QDDSQCKNMYLPSWAS-QDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSW-PSAKSSILLESSRLLSTPQEANGF

Query:  SSTTTEKQDSAETSMPDRKS-SEKMDGPHIKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQR
        SS   E+++ AE    D K  S + +  +  P   + DN KRAKQQFAQRSRVRKLQYI+ELERNVQ LQAEGS+VSAEL+FL+Q+NLIL MENKALK+R
Subjt:  SSTTTEKQDSAETSMPDRKS-SEKMDGPHIKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQR

Query:  LENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMA
        LE+++QE+LIK LE EVLE+EIGRLR LYQQQQQ Q P +S  R  S+DL++QF+ LSL  KD     +S++
Subjt:  LENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMA

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein7.7e-8151.88Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE-PETPVQRGGHRRSSSDS
        M +SK   ++RN++Y GKHALLPPK PFPS S+SY++Y P+ +IGSR  Q       HH RTSSES L+E+ P WL+DLL+E PE+P ++ GHRRSSSDS
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDE-PETPVQRGGHRRSSSDS

Query:  FAYLDAGNVLNENYT-QDDSQCKNMYLPSWAS-QDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSW-PSAKSSILLESSRLLSTPQEANGF
        +AYLD  N  N + T Q+D   +N  L +    Q+ D  K+   A+F   AS +KQK+R R+      T    SW P A+ +   ++   L   Q+A   
Subjt:  FAYLDAGNVLNENYT-QDDSQCKNMYLPSWAS-QDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSW-PSAKSSILLESSRLLSTPQEANGF

Query:  SSTTTEKQDSAETSMPDRKS-SEKMDGPHIKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQR
        SS   E+++ AE    D K  S + +  +  P   + DN KRAKQQFAQRSRVRKLQYI+ELERNVQ LQAEGS+VSAEL+FL+Q+NLIL MENKALK+R
Subjt:  SSTTTEKQDSAETSMPDRKS-SEKMDGPHIKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQR

Query:  LENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMA
        LE+++QE+LIK LE EVLE+EIGRLR LYQQQQQ Q P +S  R  S+DL++QF+ LSL  KD     +S++
Subjt:  LENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMA

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein9.0e-2133.12Show/hide
Query:  ALLPPKSP-----FPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENY-
        A LPPK P     +P  SS     F +P   + A       N                PSW+++ LD   +  +RG HRRS SDS A+L+A  V  E++ 
Subjt:  ALLPPKSP-----FPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSFAYLDAGNVLNENY-

Query:  --TQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTTTEKQDSAETSM
            DD Q  +M+     + D +   +P         S I  KN    + PT  ++   S PS  +S   ++  L  +    N   +     +  ++  M
Subjt:  --TQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTTTEKQDSAETSM

Query:  PDRKSSEKMDGPHIKPAPADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKY
             +   +           D KR K     +Q AQRSRVRKLQYI+ELER+V +LQAE S +S  + FL  Q L+L ++N ALKQR+  LSQ++L K 
Subjt:  PDRKSSEKMDGPHIKPAPADTDNKRAK-----QQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKY

Query:  LEHEVLEREIGRLRMLYQQQ
           E L+REI RLR +Y QQ
Subjt:  LEHEVLEREIGRLRMLYQQQ

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2)7.5e-6849.07Show/hide
Query:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF
        M +SK   N RN+  +GK ALLPPKSPF  G +  AD+ PS +IGS+AVQ   EGN +HHRTSSES L+E+QPSWL+DLL+EPETPV++GGHRRSSSDSF
Subjt:  MENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGHRRSSSDSF

Query:  AYLDAGNVLNENYTQDDSQCKN------MYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEAN
        AY+D     + +YT  D    N       ++      D+  R  P    F   A   KQK R  +  P +     G+ P++ SS  LESS   S  +  +
Subjt:  AYLDAGNVLNENYTQDDSQCKN------MYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEAN

Query:  GFSSTTTEKQDSAETSMPD------RKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMEN
          S   TEK  SA  S  D      + S EK D P  K A ++ D KRA+QQFAQRSRVRK+QYIAELERNVQ L                       EN
Subjt:  GFSSTTTEKQDSAETSMPD------RKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMEN

Query:  KALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRTKSRDLETQFAKLSLR
        K+LK RLE+L+QEQLIKYLEH+VLE+EI RLR LYQ QQQ +P           SS +R+KSRDLETQF  LSLR
Subjt:  KALKQRLENLSQEQLIKYLEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRTKSRDLETQFAKLSLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATTGTTTTGTATGCTTTTTGGCAGAAAACTGCGCAGTTATTGGGCAGTCATGGAGAATTCCAAGGTGTTGTCAAACATGAGAAATGTGATTTACTCTGGAAAGCA
TGCTCTACTTCCTCCCAAGAGTCCATTTCCTAGTGGTTCCTCCTCATATGCTGATTATTTCCCCAGTCCCATTATTGGGTCAAGAGCTGTGCAGAATCCCAGAGAGGGAA
ATGTGCACCATCATAGAACATCATCTGAAAGTCTTCTAATGGAGGATCAACCTTCTTGGCTCAATGATCTTCTCGATGAACCTGAAACACCTGTTCAAAGAGGTGGTCAT
CGACGTTCATCGAGTGACTCCTTTGCATACTTGGATGCAGGAAATGTTTTGAACGAAAATTATACGCAAGATGACTCCCAATGTAAAAATATGTATTTACCTTCCTGGGC
ATCACAAGATTTTGATTTCCGCAAAGATCCCCATCAAGCTTCTTTCAATATGAAGGCAAGCTCGATCAAACAGAAGAACAGGGCACGGGAATTGCCTCCAACTACATTGA
CAACTACCCTGGGTTCCTGGCCTTCTGCCAAAAGTAGCATTCTTCTTGAGAGCTCGAGGTTGTTAAGTACACCACAGGAAGCAAATGGGTTCTCATCAACAACTACTGAA
AAGCAGGATTCAGCAGAAACCAGTATGCCTGATCGAAAGTCATCTGAGAAAATGGATGGTCCCCATATCAAGCCAGCTCCGGCTGATACAGATAATAAAAGAGCTAAACA
GCAATTTGCTCAACGTTCACGTGTACGTAAACTTCAGTACATTGCAGAGCTAGAACGGAACGTACAAGCTTTACAAGCAGAAGGTTCTGAAGTTTCTGCCGAGCTTGAAT
TTCTCAGTCAGCAAAACTTAATTCTTGGCATGGAGAATAAAGCACTCAAGCAGCGATTAGAAAATTTATCTCAGGAGCAGCTTATAAAATACTTGGAGCATGAAGTGCTG
GAGAGGGAGATAGGAAGACTACGAATGTTGTACCAACAGCAACAACAGCCTCAGCCACCACCTTCCAGCCTTAAACGTACCAAGAGCCGAGACCTTGAGACGCAGTTTGC
CAAGCTCTCTTTGAGACAGAAGGATGGGCGTTCGGGTCCCGAGTCTATGGCCGGTCCAGTTCAAATCTAG
mRNA sequenceShow/hide mRNA sequence
AGAACAAAAGGATCAAACTCTTCGACACCCGACGCAGGATCTACAGCTTGTATCCTTTCAGTTAATCCATAAGGATCGGATACCCATATTCCCGGACTCTTTTCTATATC
TGGGCTGCACAAGTAGAACTGATCTGTGTTCTTGTAGCTGACTACTACTGACTATCATCAAATTTAGGGACATAAAATGTATTCTACAGGTTCAAAAGTTGGAGAAATCC
CAAGAATTATTTGAACTTTTTGTTTGTTTGTTTGTTGTTTGTCCACATGGATTTTGGCGATTGTCTAAAAAAAGTAAGAGAGAGAGAAAGAGATAGAAAAAGTCGTATGC
CATATCATATATGAATGGTGTTGAGCAAATGACAGCAAAGGCAGCAACTCAGAAACTGGAATTATTGATAATTGAAGATTATTTAATGTTGCCTTTCCTTCATCCTTAAT
CAATCCCTTCCTCCACGCACCTACTACCCAATCCATTAGAGAGATTGGGAGAGAATGTAAAATCATGCCCTAGGCCCGCTTCGGGGAAGAGAGAGAGAGAGAGAGAAGGG
GACTCGAGGATTTACTTTGCGTCAACTCCTTTCCCCTTCCCATTTCATTTTACAAGATCTCGGTCTACCGATCAGCATTCTCGGTATGCTCATTCTTCCTCTTTTCTTTA
ATCTTTCATGAATTTTTGCATTTCCGGGAGCTCTTCAATCGTTTTCATTGTAATTTGTGAGAAACTTCGATGTTTTTGCGTTTGCTATTTACGCATTCGCAATCTTGGAT
CGACTCGGGAATGAGCTGCATTTTGCTTTCTCAGATCATTTGTCTCTGATATTCTAATTTCAGCTATTCTCTTCGTCTCTCTGTCTCTCTGTCTTCTCTGATTGCGGATG
CAATGCGGGAGGAGAGATTCCAGCGTTGGTATTTTTCTTTTTGAGTTGTTAATTTCTTGAATTTGTGCATTCACAAAGTTTCGTTCTGTTTTCTAAGAAATTTTGTTAAG
GTTCTAACGGACTGCATGTTTGTATTCGACGATCTCATCTCTCTCCTCTCATTTCGGATTTCCTTTTTCGTAGTCAGCCTGAACTTTCCGTCACAAAATGATGCAGAATC
GAGTCGGATCATTCTAGCTTGCTAATTGTCGGGTTGCATGTTTGTCTATCATCAATTTCTGTGCGCACGGTATTTGCGTCACTGCAAACTTGCATCCATGGAATAAACTG
TTAGATCGGGATGAGAGGAGGATGAATATCAACTCCTCTTATAGATCTGCCAAACCCTGCGAAACTTTTTCATTAGTTTTCGTGTTCGTTCCGGAGGAAGGAGTGTCTCC
TTCCACGCTTTGGTTCAAGAGCGATTCCTGCATGTGACTTCCAAGATTTCAAATTATTCTCTGCTTTGATGTTGACTTCTTTGATTTTGATAGATATTGTCGCTTTATTT
TCTAACATTATCATAAACTTTGTACGCATGGCTGTTCTTGAAAACAATGCACATGCTGCCAGGATTTCACATTAACCCCCTTCAAGTAACCTGTTAAGCTGTTAACAGTA
GTTTCCATCCACTTCTTCGCCCAAATGGCCTATTCCTACATCGATTTATTTTGTTAAAACAGTAGCTGGTGCTGCAAGAATCTTACTCATGATATTGTTTTGTATGCTTT
TTGGCAGAAAACTGCGCAGTTATTGGGCAGTCATGGAGAATTCCAAGGTGTTGTCAAACATGAGAAATGTGATTTACTCTGGAAAGCATGCTCTACTTCCTCCCAAGAGT
CCATTTCCTAGTGGTTCCTCCTCATATGCTGATTATTTCCCCAGTCCCATTATTGGGTCAAGAGCTGTGCAGAATCCCAGAGAGGGAAATGTGCACCATCATAGAACATC
ATCTGAAAGTCTTCTAATGGAGGATCAACCTTCTTGGCTCAATGATCTTCTCGATGAACCTGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCGAGTGACTCCT
TTGCATACTTGGATGCAGGAAATGTTTTGAACGAAAATTATACGCAAGATGACTCCCAATGTAAAAATATGTATTTACCTTCCTGGGCATCACAAGATTTTGATTTCCGC
AAAGATCCCCATCAAGCTTCTTTCAATATGAAGGCAAGCTCGATCAAACAGAAGAACAGGGCACGGGAATTGCCTCCAACTACATTGACAACTACCCTGGGTTCCTGGCC
TTCTGCCAAAAGTAGCATTCTTCTTGAGAGCTCGAGGTTGTTAAGTACACCACAGGAAGCAAATGGGTTCTCATCAACAACTACTGAAAAGCAGGATTCAGCAGAAACCA
GTATGCCTGATCGAAAGTCATCTGAGAAAATGGATGGTCCCCATATCAAGCCAGCTCCGGCTGATACAGATAATAAAAGAGCTAAACAGCAATTTGCTCAACGTTCACGT
GTACGTAAACTTCAGTACATTGCAGAGCTAGAACGGAACGTACAAGCTTTACAAGCAGAAGGTTCTGAAGTTTCTGCCGAGCTTGAATTTCTCAGTCAGCAAAACTTAAT
TCTTGGCATGGAGAATAAAGCACTCAAGCAGCGATTAGAAAATTTATCTCAGGAGCAGCTTATAAAATACTTGGAGCATGAAGTGCTGGAGAGGGAGATAGGAAGACTAC
GAATGTTGTACCAACAGCAACAACAGCCTCAGCCACCACCTTCCAGCCTTAAACGTACCAAGAGCCGAGACCTTGAGACGCAGTTTGCCAAGCTCTCTTTGAGACAGAAG
GATGGGCGTTCGGGTCCCGAGTCTATGGCCGGTCCAGTTCAAATCTAGATTTGTAAATCAGTTGGGAGTTGTTGTGCATGGCCTAACGATTTCTCGAATGTACGAAGCCA
TGTTTGTCGATTGCTTGTGCAGGATACCCAGGGCTCATCAAGTCAGTCTGTACTCAGCATTTGACTGTTGTGGCTCTTGGTAGATGTGAATGAGAATCTGCCAATGCACC
TGGCTGTCATGGCTGGAAAATGTCCTCTTTTG
Protein sequenceShow/hide protein sequence
MILFCMLFGRKLRSYWAVMENSKVLSNMRNVIYSGKHALLPPKSPFPSGSSSYADYFPSPIIGSRAVQNPREGNVHHHRTSSESLLMEDQPSWLNDLLDEPETPVQRGGH
RRSSSDSFAYLDAGNVLNENYTQDDSQCKNMYLPSWASQDFDFRKDPHQASFNMKASSIKQKNRARELPPTTLTTTLGSWPSAKSSILLESSRLLSTPQEANGFSSTTTE
KQDSAETSMPDRKSSEKMDGPHIKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLENLSQEQLIKYLEHEVL
EREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPESMAGPVQI