; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G19150 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G19150
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationChr3:14911945..14920572
RNA-Seq ExpressionCSPI03G19150
SyntenyCSPI03G19150
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain
IPR044797 - Uncharacterized protein At4g06598-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576990.1 hypothetical protein SDJN03_24564, partial [Cucurbita argyrosperma subsp. sororia]6.8e-19291.84Show/hide
Query:  SCELGEKEFFTFLTMANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETP
        S ++ EKEF  FLTM NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETP
Subjt:  SCELGEKEFFTFLTMANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETP

Query:  VRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGP
        VRRVGHRRSSSDSFAYTDAANVNFDSI QE+F+YANA+PGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWESSLSTMNNPIALHSPRE I IHTSGP
Subjt:  VRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGP

Query:  LSAPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILS
        LSAPQEADGLPSTASEKQDPVESGSHDPKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQ+LILS
Subjt:  LSAPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILS

Query:  MENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVH-QQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        MEN ALKQRLENLAQEQLIKYLEQEVLEREIGRLR +H QQHQQQQQPQ LRPSSSHRR+SSKDLDNQFANLSLK+KDSGSSRD VTGPVRS
Subjt:  MENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVH-QQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

KAG7015010.1 hypothetical protein SDJN02_22641 [Cucurbita argyrosperma subsp. argyrosperma]1.3e-18792.43Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA
        AYTDAANVNFDSI QE+F+YANA+PGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWESSLSTMNNPIALHSPRE I IHTSGPLSAPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTV------HQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLR +       QQHQQQQQPQQLRPSSSHRR+SSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTV------HQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

XP_004146254.1 uncharacterized protein At4g06598 isoform X1 [Cucumis sativus]9.4e-20299.73Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA
        AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

XP_008456043.1 PREDICTED: uncharacterized protein At4g06598 [Cucumis melo]7.0e-19797.88Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA
        AYTDAANVNFDS MQEEFRYANA+PGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI LHSPRETIGIHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKVASDRKDTSHGKS VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

XP_038896639.1 uncharacterized protein At4g06598 [Benincasa hispida]2.2e-19094.97Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSN+RSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGN YHQRTSSES LIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA
        AYTDAANVNFDSIMQEEFRYANA+PGHSWL QEFDHQRDARHAS YTEPNVTKQKNRVWESSLSTMNNPIALHSPRE IGIHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDP ESGSHDPKV+SDRKD S GKS+VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVH-QQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLR VH QQH QQQQPQQLRPSSSHRRT SKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVH-QQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

TrEMBL top hitse value%identityAlignment
A0A0A0L9F3 BZIP domain-containing protein9.8e-20591.88Show/hide
Query:  GRGERDEREEEEEEEKRKKEEEEEERERAEHLDSC-------ELGEKEFFTFLTMANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGA
        GR  R+ R++++++E+RKK+++ E+      L S        +  EKEFFTFLTMANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGA
Subjt:  GRGERDEREEEEEEEKRKKEEEEEERERAEHLDSC-------ELGEKEFFTFLTMANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGA

Query:  KAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFY
        KAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFY
Subjt:  KAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFY

Query:  TEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRS
        TEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLS PQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRS
Subjt:  TEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRS

Query:  RVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTS
        RVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTS
Subjt:  RVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTS

Query:  SKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        SKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
Subjt:  SKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

A0A1S3C2E7 uncharacterized protein At4g065983.4e-19797.88Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA
        AYTDAANVNFDS MQEEFRYANA+PGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI LHSPRETIGIHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKVASDRKDTSHGKS VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

A0A6J1C5A1 uncharacterized protein At4g06598-like1.9e-18490.74Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFM+SGKHALLPPKSPFPSVSPSYTEYVPNT IGAKA+QRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRR+GHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA
        AYTDAANVNFDSIMQEEF+Y N VPGHSWLSQEFDHQRDARHASFY E N T+QKNRVWESSLSTM+NP ALHSPRE + IHTSGPLS PQEADGLPS+A
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDP ESGSHDPKV+S+RKD +HGKS+VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRL+NLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVH-QQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLR++H QQH QQQQPQ LRPSS+HRRTSSKDLD+QFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVH-QQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

A0A6J1E439 uncharacterized protein At4g06598-like isoform X11.4e-18792.17Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA
        AYTDAANVNFDSI QE+F+YANA+PGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWESSLSTMNNPIALHSPRE I IHTSGPLSAPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVH------QQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLR +H      QQHQQQQQPQ LRPSSSHRR+SSKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVH------QQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

A0A6J1J9S7 uncharacterized protein At4g06598-like isoform X11.7e-18591.62Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA
        AYTDAANVNFDSI QE+F+YANA+PGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWESSLSTMNNPIALHSPRE I IHTS PLSAPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQD VESGSH+PKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVH-----QQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS
        QEQLIKYLEQEVLEREIGRLR +H     QQHQQQQQPQ LRPSSSHRR+SSKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVH-----QQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVTGPVRS

SwissProt top hitse value%identityAlignment
F4IN23 Basic leucine zipper 343.9e-1731.52Show/hide
Query:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI
        + PSW+D+ L+   +  RR  HRRS SDS A+ +A  V+ +                     +FD   D +  S +T+ +             +  +NP 
Subjt:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI

Query:  ALHSPRETIG-----IHTSGPLSA---------PQEADGLPSTASEKQDPVES----GSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRK
         +++    +G      +TS P ++         P + +   +  +   D V+S       D   +++    S G   +     KR  A +Q AQRSRVRK
Subjt:  ALHSPRETIG-----IHTSGPLSA---------PQEADGLPSTASEKQDPVES----GSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRK

Query:  LQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ
        LQYI+ELER V +LQAE + +S  + FL+ Q L+L+++N ALKQR+  L+Q++L K   QE L+REI RLR V+ Q
Subjt:  LQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ

Q5JMK6 Basic leucine zipper 62.6e-1355.06Show/hide
Query:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ
        A +Q AQRSRVRKLQYI+ELER V  LQ E + +S  + FL+QQ  IL++ N  LKQR+  LAQ+++ K   QE L +EI RLR V+QQ
Subjt:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ

Q6K3R9 Basic leucine zipper 194.2e-1132.23Show/hide
Query:  RRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPL
        RR  HRRS+SDS A+   A V  D ++         V G      EFD   D +  S +++                       + +P  + G    GP 
Subjt:  RRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPL

Query:  SAPQEAD-GLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILS
              D G          P  +G+     A+   D   G +     +   A +Q AQRSRVRKLQYI+ELER V  LQ E + +S  + FL+ Q  +L+
Subjt:  SAPQEAD-GLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILS

Query:  MENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQ
        + N  LKQR+  LAQ+++ K   QE L++EI RLR V+ Q Q
Subjt:  MENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQ

Q8W3M7 Uncharacterized protein At4g065984.6e-5047.39Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N R+   +GK ALLPPKSPF        ++VP++VIG+KAVQ+  +GN  H RTSSES LIEEQPSWLDDLLNEPETPVR+ GHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDA-ANVNFDSIMQEEFRY--ANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLP
        AY D     + D  + +  RY   N    H    +E D+ R ++   FY   +++KQK R W+S   +   P +     E+  I  SG   + ++ +   
Subjt:  AYTDA-ANVNFDSIMQEEFRY--ANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLP

Query:  STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ
        S A  K+D + + +   K + +++D    KS  S+ + KRA+QQFAQRSRVRK+QYIAELER VQ LQ
Subjt:  STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ

Q9M2K4 Basic leucine zipper 611.5e-1631.74Show/hide
Query:  EEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTD-----AANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLS
        ++ PSW+D+ L+   T  RR  HRRS SDS A+ +       N +FD    E+F         S  + +  +     H       NV   ++    S+ S
Subjt:  EEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTD-----AANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLS

Query:  TMNNPIAL-------HSPRETIGIHTSGPLSAPQEADGLP-STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRKLQYI
        T ++  +L        +P      H    ++    A G   + + E Q   ++   D   A+     S G         KR  A +Q AQRSRVRKLQYI
Subjt:  TMNNPIAL-------HSPRETIGIHTSGPLSAPQEADGLP-STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRKLQYI

Query:  AELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQ--------QQQQPQQLRPS
        +ELER V +LQ E + +S  + FL+ Q L+L+++N A+KQR+  LAQ+++ K   QE L+REI RLR V+ Q            Q P  ++PS
Subjt:  AELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQ--------QQQQPQQLRPS

Arabidopsis top hitse value%identityAlignment
AT1G35490.1 bZIP family transcription factor4.2e-3034.3Show/hide
Query:  NIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQK
        N++H   S + +  E+QP+WLD+LL+EP +P    GHRRS+SD+ AY ++A      +M  +    N V G SW  Q +D            + N  +Q 
Subjt:  NIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQK

Query:  NRV-WESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYI
        N++ W+   ST N            G +    +S            +    P+E   H  K+         G    S T++KR K Q A R+R+R+L+YI
Subjt:  NRV-WESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYI

Query:  AELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRP----SSSHRRTSSKDL
        ++LER +Q LQ EG E+S+ + +L+QQ L+LSMEN+ALKQR+++LA+ Q +K++EQ++LEREIG L+    Q Q QQ  +Q++      + ++   +++ 
Subjt:  AELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRP----SSSHRRTSSKDL

Query:  DNQFANLSL
        D QFA L++
Subjt:  DNQFANLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein7.7e-9355.47Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VR+ M  GKHALLPPK PFPSVS SY+EY+P  +IG++  Q+  +   +HQRTSSES L+EE P WLDDLLNE PE+P R+ GHRRSSSDS
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDAAN-VNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPS
        +AY D AN  N    +Q +F Y N V       QE D  ++A+ A+FY+  +  KQK+R  +S ++T   P  L   RE  G    G L   Q+A  +  
Subjt:  FAYTDAAN-VNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPS

Query:  TASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE
         +SE+++  E  SHDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEG++VSAEL+FLNQ+NLILSMENKALK+RLE
Subjt:  TASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVT
        ++AQE+LIK LEQEVLE+EIGRLR ++QQ QQ Q     +PS+S  R +SKDLD+QF++LSL  KDS   RDSV+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVT

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein7.7e-9355.47Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VR+ M  GKHALLPPK PFPSVS SY+EY+P  +IG++  Q+  +   +HQRTSSES L+EE P WLDDLLNE PE+P R+ GHRRSSSDS
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDAAN-VNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPS
        +AY D AN  N    +Q +F Y N V       QE D  ++A+ A+FY+  +  KQK+R  +S ++T   P  L   RE  G    G L   Q+A  +  
Subjt:  FAYTDAAN-VNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPS

Query:  TASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE
         +SE+++  E  SHDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEG++VSAEL+FLNQ+NLILSMENKALK+RLE
Subjt:  TASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVT
        ++AQE+LIK LEQEVLE+EIGRLR ++QQ QQ Q     +PS+S  R +SKDLD+QF++LSL  KDS   RDSV+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDSVT

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein2.8e-1831.52Show/hide
Query:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI
        + PSW+D+ L+   +  RR  HRRS SDS A+ +A  V+ +                     +FD   D +  S +T+ +             +  +NP 
Subjt:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPI

Query:  ALHSPRETIG-----IHTSGPLSA---------PQEADGLPSTASEKQDPVES----GSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRK
         +++    +G      +TS P ++         P + +   +  +   D V+S       D   +++    S G   +     KR  A +Q AQRSRVRK
Subjt:  ALHSPRETIG-----IHTSGPLSA---------PQEADGLPSTASEKQDPVES----GSHDPKVASDRKDTSHGKSTVSDTENKR--AKQQFAQRSRVRK

Query:  LQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ
        LQYI+ELER V +LQAE + +S  + FL+ Q L+L+++N ALKQR+  L+Q++L K   QE L+REI RLR V+ Q
Subjt:  LQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQ

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2)1.7e-6847.55Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N R+   +GK ALLPPKSPF        ++VP++VIG+KAVQ+  +GN  H RTSSES LIEEQPSWLDDLLNEPETPVR+ GHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDA-ANVNFDSIMQEEFRY--ANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLP
        AY D     + D  + +  RY   N    H    +E D+ R ++   FY   +++KQK R W+S   +   P +     E+  I  SG   + ++ +   
Subjt:  AYTDA-ANVNFDSIMQEEFRY--ANAVPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLP

Query:  STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE
        S A  K+D + + +   K + +++D    KS  S+ + KRA+QQFAQRSRVRK+QYIAELER VQ L                       ENK+LK RLE
Subjt:  STASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQ-----LRPSSSHRRTSSKDLDNQFANLSLK
        +LAQEQLIKYLE +VLE+EI RLR ++Q  QQQQ+PQQ      + SSSH+R+ S+DL+ QF NLSL+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQ-----LRPSSSHRRTSSKDLDNQFANLSLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGTTACTGCTTCTTTAATAGGATAAGGCAATTGACTCCCAACCATGCTACGACGGTGGTAAAGACTTTTGTTGGCGATGCATTTGTGGATGTTCCTGCTCTGGA
CATTGATGACCATCTTGCTGACGAGGAGAAGGGGACATCCATCAGTGGCGACCTCGAGGATGGTATCATACCCATTTACATACCATCAGGGGGCGAAGATGTGCCAGTAC
TGCTTCCTCAACCCATCCTAAAGTCGAAGGATGAGAAGTTACAGGTTAATTCGATCCCTCACACGTCAAGAAATGATTCACTACAGCTGGCCATGCCATCACGGCGTAAC
TCAGGTCTATACGAGGAGATTAAAAGGGACACTCGCCTGGTTGACGTTCAGTACACTTGGGTTAAAGACGCTATCCACGTTGATGTGTTCACAGTGAAGCCGAATTTACC
TTCGACATCATGGAAGACATACGGAAGAGGGGAGAGAGACGAGAGAGAGGAGGAAGAAGAAGAAGAAAAAAGAAAGAAGGAAGAAGAAGAAGAAGAGAGAGAAAGAGCTG
AGCATCTTGATTCTTGCGAGCTGGGGGAAAAGGAATTTTTCACATTTCTTACTATGGCAAATTCTAAAGGGTCATCCAACGTTAGAAGCTTTATGAGTTCTGGGAAACAT
GCACTACTTCCTCCTAAAAGTCCATTTCCTAGTGTTTCTCCATCATATACTGAATATGTTCCTAATACTGTAATTGGAGCAAAAGCTGTTCAGAGACCAAGAGATGGTAA
CATCTATCATCAAAGAACTTCTTCCGAAAGTATTTTAATAGAGGAGCAGCCTTCTTGGCTTGATGATCTTCTCAATGAGCCAGAGACCCCTGTTCGCAGAGTTGGTCATC
GACGTTCATCAAGTGATTCCTTTGCATATACGGATGCTGCTAATGTAAATTTTGATAGTATTATGCAAGAAGAATTTAGATATGCAAATGCAGTTCCTGGACACTCTTGG
TTATCACAAGAATTTGATCATCAGAGAGATGCAAGGCATGCTTCATTCTATACTGAACCGAATGTAACAAAGCAAAAGAATAGGGTGTGGGAGTCTTCTTTATCTACCAT
GAATAATCCCATTGCCCTTCACTCTCCTAGGGAGACCATAGGTATTCATACCTCAGGACCATTAAGCGCTCCGCAGGAAGCAGATGGTTTGCCTTCTACAGCAAGTGAGA
AACAGGATCCCGTTGAGTCTGGTTCACACGATCCAAAAGTCGCTTCTGACAGGAAGGATACTTCTCATGGAAAATCAACTGTGTCTGATACCGAAAATAAACGTGCCAAA
CAGCAATTTGCTCAGCGTTCAAGGGTTCGGAAACTTCAATATATAGCTGAGCTTGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGGCACTGAGGTCTCAGCTGAGCTTGA
GTTTCTCAACCAGCAAAATCTTATTCTTAGCATGGAAAATAAAGCCCTCAAGCAGCGGTTGGAGAATTTAGCTCAAGAGCAGCTAATTAAATACTTGGAGCAGGAAGTAC
TGGAGAGGGAAATTGGAAGGTTAAGAACTGTGCACCAGCAGCATCAGCAGCAGCAACAACCGCAACAACTACGACCTTCTTCGAGTCATCGACGTACTTCGAGCAAAGAC
CTTGACAATCAATTTGCTAACCTTTCATTGAAGCAAAAGGATTCTGGTTCAAGTCGTGACTCGGTAACAGGTCCAGTGCGCAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGGTTACTGCTTCTTTAATAGGATAAGGCAATTGACTCCCAACCATGCTACGACGGTGGTAAAGACTTTTGTTGGCGATGCATTTGTGGATGTTCCTGCTCTGGA
CATTGATGACCATCTTGCTGACGAGGAGAAGGGGACATCCATCAGTGGCGACCTCGAGGATGGTATCATACCCATTTACATACCATCAGGGGGCGAAGATGTGCCAGTAC
TGCTTCCTCAACCCATCCTAAAGTCGAAGGATGAGAAGTTACAGGTTAATTCGATCCCTCACACGTCAAGAAATGATTCACTACAGCTGGCCATGCCATCACGGCGTAAC
TCAGGTCTATACGAGGAGATTAAAAGGGACACTCGCCTGGTTGACGTTCAGTACACTTGGGTTAAAGACGCTATCCACGTTGATGTGTTCACAGTGAAGCCGAATTTACC
TTCGACATCATGGAAGACATACGGAAGAGGGGAGAGAGACGAGAGAGAGGAGGAAGAAGAAGAAGAAAAAAGAAAGAAGGAAGAAGAAGAAGAAGAGAGAGAAAGAGCTG
AGCATCTTGATTCTTGCGAGCTGGGGGAAAAGGAATTTTTCACATTTCTTACTATGGCAAATTCTAAAGGGTCATCCAACGTTAGAAGCTTTATGAGTTCTGGGAAACAT
GCACTACTTCCTCCTAAAAGTCCATTTCCTAGTGTTTCTCCATCATATACTGAATATGTTCCTAATACTGTAATTGGAGCAAAAGCTGTTCAGAGACCAAGAGATGGTAA
CATCTATCATCAAAGAACTTCTTCCGAAAGTATTTTAATAGAGGAGCAGCCTTCTTGGCTTGATGATCTTCTCAATGAGCCAGAGACCCCTGTTCGCAGAGTTGGTCATC
GACGTTCATCAAGTGATTCCTTTGCATATACGGATGCTGCTAATGTAAATTTTGATAGTATTATGCAAGAAGAATTTAGATATGCAAATGCAGTTCCTGGACACTCTTGG
TTATCACAAGAATTTGATCATCAGAGAGATGCAAGGCATGCTTCATTCTATACTGAACCGAATGTAACAAAGCAAAAGAATAGGGTGTGGGAGTCTTCTTTATCTACCAT
GAATAATCCCATTGCCCTTCACTCTCCTAGGGAGACCATAGGTATTCATACCTCAGGACCATTAAGCGCTCCGCAGGAAGCAGATGGTTTGCCTTCTACAGCAAGTGAGA
AACAGGATCCCGTTGAGTCTGGTTCACACGATCCAAAAGTCGCTTCTGACAGGAAGGATACTTCTCATGGAAAATCAACTGTGTCTGATACCGAAAATAAACGTGCCAAA
CAGCAATTTGCTCAGCGTTCAAGGGTTCGGAAACTTCAATATATAGCTGAGCTTGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGGCACTGAGGTCTCAGCTGAGCTTGA
GTTTCTCAACCAGCAAAATCTTATTCTTAGCATGGAAAATAAAGCCCTCAAGCAGCGGTTGGAGAATTTAGCTCAAGAGCAGCTAATTAAATACTTGGAGCAGGAAGTAC
TGGAGAGGGAAATTGGAAGGTTAAGAACTGTGCACCAGCAGCATCAGCAGCAGCAACAACCGCAACAACTACGACCTTCTTCGAGTCATCGACGTACTTCGAGCAAAGAC
CTTGACAATCAATTTGCTAACCTTTCATTGAAGCAAAAGGATTCTGGTTCAAGTCGTGACTCGGTAACAGGTCCAGTGCGCAGTTAG
Protein sequenceShow/hide protein sequence
MQGYCFFNRIRQLTPNHATTVVKTFVGDAFVDVPALDIDDHLADEEKGTSISGDLEDGIIPIYIPSGGEDVPVLLPQPILKSKDEKLQVNSIPHTSRNDSLQLAMPSRRN
SGLYEEIKRDTRLVDVQYTWVKDAIHVDVFTVKPNLPSTSWKTYGRGERDEREEEEEEEKRKKEEEEEERERAEHLDSCELGEKEFFTFLTMANSKGSSNVRSFMSSGKH
ALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNIYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFRYANAVPGHSW
LSQEFDHQRDARHASFYTEPNVTKQKNRVWESSLSTMNNPIALHSPRETIGIHTSGPLSAPQEADGLPSTASEKQDPVESGSHDPKVASDRKDTSHGKSTVSDTENKRAK
QQFAQRSRVRKLQYIAELERKVQALQAEGTEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQHQQQQQPQQLRPSSSHRRTSSKD
LDNQFANLSLKQKDSGSSRDSVTGPVRS