; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G15584 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G15584
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationctg2009:2288090..2292777
RNA-Seq ExpressionCucsat.G15584
SyntenyCucsat.G15584
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain
IPR044797 - Uncharacterized protein At4g06598-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576990.1 hypothetical protein SDJN03_24564, partial [Cucurbita argyrosperma subsp. sororia]2.70e-24092.27Show/hide
Query:  KEKEFFTFLTMANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRV
        +EKEF  FLTM NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRV
Subjt:  KEKEFFTFLTMANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRV

Query:  GHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTP
        GHRRSSSDSFAYTDAANVNFDSI QE+F+YANA+PGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWESSLSTMNNPIALHSPRE I IHTSGPLS P
Subjt:  GHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTP

Query:  QEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENK
        QEADGLPSTASEKQDPVESGSHDPKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQ+LILSMEN 
Subjt:  QEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENK

Query:  ALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ-HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        ALKQRLENLAQEQLIKYLEQEVLEREIGRLR +HQQ HQQQQQPQ LRPSSSHRR+SSKDLDNQFANLSLK+KDSGSSRD VTGPVRS
Subjt:  ALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ-HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

KAG7015010.1 hypothetical protein SDJN02_22641 [Cucurbita argyrosperma subsp. argyrosperma]2.11e-23692.17Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QE+F+YANA+PGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWESSLSTMNNPIALHSPRE I IHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQ------HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLR + QQ      HQQQQQPQQLRPSSSHRR+SSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQ------HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

XP_004146254.1 uncharacterized protein At4g06598 isoform X1 [Cucumis sativus]1.92e-256100Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

XP_008456043.1 PREDICTED: uncharacterized protein At4g06598 [Cucumis melo]4.59e-25098.14Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDS MQEEFRYANA+PGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI LHSPRETIGIHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKVASDRKDTSHGKS VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

XP_038896639.1 uncharacterized protein At4g06598 [Benincasa hispida]1.66e-24195.24Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSN+RSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGN YHQRTSSES LIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQEEFRYANA+PGHSWL QEFDHQRDARHAS YTEPNVTKQKNRVWESSLSTMNNPIALHSPRE IGIHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDP ESGSHDPKV+SDRKD S GKS+VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQ-HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLR VHQQ H QQQQPQQLRPSSSHRRT SKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQ-HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

TrEMBL top hitse value%identityAlignment
A0A0A0L9F3 BZIP domain-containing protein3.59e-29699.77Show/hide
Query:  LRHTEEGRETRERRKKKKKKERRKKKKKREKELSILILASWGSPNTNPKQKEKEFFTFLTMANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVP
        +RHTEEGRETRERRKKKKKKERRKKKKKREKELSILILASWGSPNTNPKQKEKEFFTFLTMANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVP
Subjt:  LRHTEEGRETRERRKKKKKKERRKKKKKREKELSILILASWGSPNTNPKQKEKEFFTFLTMANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVP

Query:  NTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDA
        NTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDA
Subjt:  NTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDA

Query:  RHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQ
        RHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQ
Subjt:  RHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQ

Query:  QFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSS
        QFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSS
Subjt:  QFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSS

Query:  SHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        SHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
Subjt:  SHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

A0A1S3C2E7 uncharacterized protein At4g065982.22e-25098.14Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDS MQEEFRYANA+PGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI LHSPRETIGIHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKVASDRKDTSHGKS VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

A0A6J1C5A1 uncharacterized protein At4g06598-like1.33e-23391.01Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFM+SGKHALLPPKSPFPSVSPSYTEYVPNT IGAKA+QRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRR+GHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQEEF+Y N VPGHSWLSQEFDHQRDARHASFY E N T+QKNRVWESSLSTM+NP ALHSPRE + IHTSGPLSTPQEADGLPS+A
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDP ESGSHDPKV+S+RKD +HGKS+VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRL+NLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQ-HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLR++HQQ H QQQQPQ LRPSS+HRRTSSKDLD+QFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQ-HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

A0A6J1E439 uncharacterized protein At4g06598-like isoform X12.92e-23691.91Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QE+F+YANA+PGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWESSLSTMNNPIALHSPRE I IHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQ------HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLR +HQQ      HQQQQQPQ LRPSSSHRR+SSKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQ------HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

A0A6J1J9S7 uncharacterized protein At4g06598-like isoform X11.55e-23391.36Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QE+F+YANA+PGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWESSLSTMNNPIALHSPRE I IHTS PLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQD VESGSH+PKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQ-----HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLR +HQQ     HQQQQQPQ LRPSSSHRR+SSKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQ-----HQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

SwissProt top hitse value%identityAlignment
F4IN23 Basic leucine zipper 341.5e-1731.88Show/hide
Query:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI
        + PSW+D+ L+   +  RR  HRRS SDS A+ +A  V+ +                     +FD   D +  S +T+ +             +  +NP 
Subjt:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI

Query:  ALHSPRETIG-IHTSGPLSTPQEA-----DGLPST------------ASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRK
         +++    +G   +S   STP  +       LP +              E Q   +    D   +++    S G   +     KR  A +Q AQRSRVRK
Subjt:  ALHSPRETIG-IHTSGPLSTPQEA-----DGLPST------------ASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRK

Query:  LQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ
        LQYI+ELER V +LQAE + +S  + FL+ Q L+L+++N ALKQR+  L+Q++L K   QE L+REI RLR V+ Q
Subjt:  LQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ

Q5JMK6 Basic leucine zipper 62.2e-1355.06Show/hide
Query:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ
        A +Q AQRSRVRKLQYI+ELER V  LQ E + +S  + FL+QQ  IL++ N  LKQR+  LAQ+++ K   QE L +EI RLR V+QQ
Subjt:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ

Q6K3R9 Basic leucine zipper 194.5e-1132.23Show/hide
Query:  RRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPL
        RR  HRRS+SDS A+   A V  D ++         V G      EFD   D +  S +++                       + +P  + G    GP 
Subjt:  RRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPL

Query:  STPQEAD-GLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILS
              D G          P  +G+     A+   D   G +     +   A +Q AQRSRVRKLQYI+ELER V  LQ E + +S  + FL+ Q  +L+
Subjt:  STPQEAD-GLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILS

Query:  MENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQ
        + N  LKQR+  LAQ+++ K   QE L++EI RLR V+ Q Q
Subjt:  MENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQ

Q8W3M7 Uncharacterized protein At4g065984.9e-5047.39Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N R+   +GK ALLPPKSPF        ++VP++VIG+KAVQ+  +GN  H RTSSES LIEEQPSWLDDLLNEPETPVR+ GHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDA-ANVNFDSIMQEEFRY--ANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLP
        AY D     + D  + +  RY   N    H    +E D+ R ++   FY   +++KQK R W+S   +   P +     E+  I  SG   + ++ +   
Subjt:  AYTDA-ANVNFDSIMQEEFRY--ANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLP

Query:  STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ
        S A  K+D + + +   K + +++D    KS  S+ + KRA+QQFAQRSRVRK+QYIAELER VQ LQ
Subjt:  STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ

Q9M2K4 Basic leucine zipper 617.2e-1731.74Show/hide
Query:  EEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTD-----AANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLS
        ++ PSW+D+ L+   T  RR  HRRS SDS A+ +       N +FD    E+F         S  + +  +     H       NV   ++    S+ S
Subjt:  EEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTD-----AANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLS

Query:  TMNNPIAL-------HSPRETIGIHTSGPLSTPQEADGLP-STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRKLQYI
        T ++  +L        +P      H    ++    A G   + + E Q   ++   D   A+     S G         KR  A +Q AQRSRVRKLQYI
Subjt:  TMNNPIAL-------HSPRETIGIHTSGPLSTPQEADGLP-STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRKLQYI

Query:  AELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQ--------QQQQPQQLRPS
        +ELER V +LQ E + +S  + FL+ Q L+L+++N A+KQR+  LAQ+++ K   QE L+REI RLR V+ Q            Q P  ++PS
Subjt:  AELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQ--------QQQQPQQLRPS

Arabidopsis top hitse value%identityAlignment
AT1G35490.1 bZIP family transcription factor3.4e-3033.44Show/hide
Query:  NIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQK
        N++H   S + +  E+QP+WLD+LL+EP +P    GHRRS+SD+ AY ++A      +M  +    N V G SW  Q +D                    
Subjt:  NIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQK

Query:  NRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIA
          +W+S+    +N          +G   S    T  + +      +    P+E   H  K+         G    S T++KR K Q A R+R+R+L+YI+
Subjt:  NRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIA

Query:  ELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRP----SSSHRRTSSKDLD
        +LER +Q LQ EG E+S+ + +L+QQ L+LSMEN+ALKQR+++LA+ Q +K++EQ++LEREIG L+    Q Q QQ  +Q++      + ++   +++ D
Subjt:  ELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRP----SSSHRRTSSKDLD

Query:  NQFANLSL
         QFA L++
Subjt:  NQFANLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein1.1e-9255.47Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VR+ M  GKHALLPPK PFPSVS SY+EY+P  +IG++  Q+  +   +HQRTSSES L+EE P WLDDLLNE PE+P R+ GHRRSSSDS
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDAAN-VNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPS
        +AY D AN  N    +Q +F Y N V       QE D  ++A+ A+FY+  +  KQK+R  +S ++T   P  L   RE  G    G L   Q+A  +  
Subjt:  FAYTDAAN-VNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPS

Query:  TASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE
         +SE+++  E  SHDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEG++VSAEL+FLNQ+NLILSMENKALK+RLE
Subjt:  TASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVT
        ++AQE+LIK LEQEVLE+EIGRLR ++QQ QQ Q     +PS+S  R +SKDLD+QF++LSL  KDS   RDSV+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVT

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein1.1e-9255.47Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VR+ M  GKHALLPPK PFPSVS SY+EY+P  +IG++  Q+  +   +HQRTSSES L+EE P WLDDLLNE PE+P R+ GHRRSSSDS
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDAAN-VNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPS
        +AY D AN  N    +Q +F Y N V       QE D  ++A+ A+FY+  +  KQK+R  +S ++T   P  L   RE  G    G L   Q+A  +  
Subjt:  FAYTDAAN-VNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPS

Query:  TASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE
         +SE+++  E  SHDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEG++VSAEL+FLNQ+NLILSMENKALK+RLE
Subjt:  TASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVT
        ++AQE+LIK LEQEVLE+EIGRLR ++QQ QQ Q     +PS+S  R +SKDLD+QF++LSL  KDS   RDSV+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVT

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein1.0e-1831.88Show/hide
Query:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI
        + PSW+D+ L+   +  RR  HRRS SDS A+ +A  V+ +                     +FD   D +  S +T+ +             +  +NP 
Subjt:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI

Query:  ALHSPRETIG-IHTSGPLSTPQEA-----DGLPST------------ASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRK
         +++    +G   +S   STP  +       LP +              E Q   +    D   +++    S G   +     KR  A +Q AQRSRVRK
Subjt:  ALHSPRETIG-IHTSGPLSTPQEA-----DGLPST------------ASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRK

Query:  LQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ
        LQYI+ELER V +LQAE + +S  + FL+ Q L+L+++N ALKQR+  L+Q++L K   QE L+REI RLR V+ Q
Subjt:  LQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2)1.9e-6847.55Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N R+   +GK ALLPPKSPF        ++VP++VIG+KAVQ+  +GN  H RTSSES LIEEQPSWLDDLLNEPETPVR+ GHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDA-ANVNFDSIMQEEFRY--ANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLP
        AY D     + D  + +  RY   N    H    +E D+ R ++   FY   +++KQK R W+S   +   P +     E+  I  SG   + ++ +   
Subjt:  AYTDA-ANVNFDSIMQEEFRY--ANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLP

Query:  STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE
        S A  K+D + + +   K + +++D    KS  S+ + KRA+QQFAQRSRVRK+QYIAELER VQ L                       ENK+LK RLE
Subjt:  STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQ-----LRPSSSHRRTSSKDLDNQFANLSLK
        +LAQEQLIKYLE +VLE+EI RLR ++Q  QQQQ+PQQ      + SSSH+R+ S+DL+ QF NLSL+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQ-----LRPSSSHRRTSSKDLDNQFANLSLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AACAAAATTACAAATTTTATAAAACAATGTTACTTTTTAAATATAAAACAAAAAAAGGTGTTATTTATCAAAAATTTCCTCCTAAACCTCCTTCACCTGCACCCACTTAA
TTTCACTCTCAGACATACGGAAGAGGGGAGAGAGACGAGAGAGAGGAGGAAGAAGAAGAAGAAAAAAGAAAGAAGGAAGAAGAAGAAGAAGAGAGAGAAAGAGCTGAGCA
TCTTGATTCTTGCGAGCTGGGGGTCCCCAAATACAAACCCTAAACAGAAGGAAAAGGAATTTTTCACATTTCTTACTATGGCAAATTCTAAAGGGTCATCCAACGTTAGA
AGCTTTATGAGTTCTGGGAAACATGCACTACTTCCTCCTAAAAGTCCATTTCCTAGTGTTTCTCCATCATATACTGAATATGTTCCTAATACTGTAATTGGAGCAAAAGC
TGTTCAGAGACCAAGAGATGGTAACATCTATCATCAAAGAACTTCTTCCGAAAGTATTTTAATAGAGGAGCAGCCTTCTTGGCTTGATGATCTTCTCAATGAGCCAGAGA
CCCCTGTTCGCAGAGTTGGTCATCGACGTTCATCAAGTGATTCCTTTGCATATACGGATGCTGCTAATGTAAATTTTGATAGTATTATGCAAGAAGAATTTAGATATGCA
AATGCAGTTCCTGGACACTCTTGGTTATCACAAGAATTTGATCATCAGAGAGATGCAAGGCATGCTTCATTCTATACTGAACCGAATGTAACAAAACAGAAGAATAGGGT
GTGGGAGTCTTCTTTATCTACCATGAATAATCCCATTGCCCTTCACTCTCCTAGGGAGACCATAGGTATTCATACCTCAGGACCATTAAGCACTCCGCAGGAAGCAGATG
GTTTGCCTTCTACAGCAAGTGAGAAACAGGATCCAGTTGAGTCTGGTTCACACGATCCAAAAGTCGCTTCTGACAGGAAGGATACTTCTCATGGAAAATCAACTGTGTCT
GATACCGAAAATAAACGTGCCAAACAGCAATTTGCTCAGCGTTCAAGGGTTCGGAAACTTCAATATATAGCTGAGCTTGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGG
CACTGAGGTCTCAGCTGAGCTTGAGTTTCTCAACCAGCAAAATCTTATTCTTAGCATGGAAAATAAAGCCCTCAAGCAGCGGTTGGAGAATTTAGCTCAAGAGCAGCTAA
TTAAATACTTGGAGCAGGAAGTACTGGAGAGGGAAATTGGAAGGTTAAGAACTGTGCACCAGCAGCATCAGCAGCAGCAACAACCGCAACAACTACGACCTTCTTCGAGT
CATCGACGTACTTCGAGCAAAGACCTTGACAATCAATTTGCTAACCTTTCATTGAAGCAAAAGGATTCTGGTTCAAGTCGTGACTCGGTAACAGGTCCAGTGCGCAGTTA
G
mRNA sequenceShow/hide mRNA sequence
AACAAAATTACAAATTTTATAAAACAATGTTACTTTTTAAATATAAAACAAAAAAAGGTGTTATTTATCAAAAATTTCCTCCTAAACCTCCTTCACCTGCACCCACTTAA
TTTCACTCTCAGACATACGGAAGAGGGGAGAGAGACGAGAGAGAGGAGGAAGAAGAAGAAGAAAAAAGAAAGAAGGAAGAAGAAGAAGAAGAGAGAGAAAGAGCTGAGCA
TCTTGATTCTTGCGAGCTGGGGGTCCCCAAATACAAACCCTAAACAGAAGGAAAAGGAATTTTTCACATTTCTTACTATGGCAAATTCTAAAGGGTCATCCAACGTTAGA
AGCTTTATGAGTTCTGGGAAACATGCACTACTTCCTCCTAAAAGTCCATTTCCTAGTGTTTCTCCATCATATACTGAATATGTTCCTAATACTGTAATTGGAGCAAAAGC
TGTTCAGAGACCAAGAGATGGTAACATCTATCATCAAAGAACTTCTTCCGAAAGTATTTTAATAGAGGAGCAGCCTTCTTGGCTTGATGATCTTCTCAATGAGCCAGAGA
CCCCTGTTCGCAGAGTTGGTCATCGACGTTCATCAAGTGATTCCTTTGCATATACGGATGCTGCTAATGTAAATTTTGATAGTATTATGCAAGAAGAATTTAGATATGCA
AATGCAGTTCCTGGACACTCTTGGTTATCACAAGAATTTGATCATCAGAGAGATGCAAGGCATGCTTCATTCTATACTGAACCGAATGTAACAAAACAGAAGAATAGGGT
GTGGGAGTCTTCTTTATCTACCATGAATAATCCCATTGCCCTTCACTCTCCTAGGGAGACCATAGGTATTCATACCTCAGGACCATTAAGCACTCCGCAGGAAGCAGATG
GTTTGCCTTCTACAGCAAGTGAGAAACAGGATCCAGTTGAGTCTGGTTCACACGATCCAAAAGTCGCTTCTGACAGGAAGGATACTTCTCATGGAAAATCAACTGTGTCT
GATACCGAAAATAAACGTGCCAAACAGCAATTTGCTCAGCGTTCAAGGGTTCGGAAACTTCAATATATAGCTGAGCTTGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGG
CACTGAGGTCTCAGCTGAGCTTGAGTTTCTCAACCAGCAAAATCTTATTCTTAGCATGGAAAATAAAGCCCTCAAGCAGCGGTTGGAGAATTTAGCTCAAGAGCAGCTAA
TTAAATACTTGGAGCAGGAAGTACTGGAGAGGGAAATTGGAAGGTTAAGAACTGTGCACCAGCAGCATCAGCAGCAGCAACAACCGCAACAACTACGACCTTCTTCGAGT
CATCGACGTACTTCGAGCAAAGACCTTGACAATCAATTTGCTAACCTTTCATTGAAGCAAAAGGATTCTGGTTCAAGTCGTGACTCGGTAACAGGTCCAGTGCGCAGTTA
G
Protein sequenceShow/hide protein sequence
NKITNFIKQCYFLNIKQKKVLFIKNFLLNLLHLHPLNFTLRHTEEGRETRERRKKKKKKERRKKKKKREKELSILILASWGSPNTNPKQKEKEFFTFLTMANSKGSSNVR
SFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYA
NAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVS
DTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSS
HRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS