; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0021892 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0021892
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationchr06:9051753..9058283
RNA-Seq ExpressionPI0021892
SyntenyPI0021892
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain
IPR044797 - Uncharacterized protein At4g06598-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576990.1 hypothetical protein SDJN03_24564, partial [Cucurbita argyrosperma subsp. sororia]9.2e-19093.92Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QEDF+YANAIPGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWES LSTMNNPIALHSPRE I IHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR +HQQQH QQQQPQ LRPSSSHRR+SSKDLDNQFANLSLK+KDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS

XP_004146254.1 uncharacterized protein At4g06598 isoform X1 [Cucumis sativus]3.9e-19697.35Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQE+FRYANA+PGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWES LSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKVASDRKD SHGKS VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRTVH QQH QQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS

XP_008456043.1 PREDICTED: uncharacterized protein At4g06598 [Cucumis melo]3.0e-19698.15Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDS MQE+FRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWES LSTMNNPI LHSPRETIGIHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKVASDRKD SHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRTVH QQH QQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS

XP_022922717.1 uncharacterized protein At4g06598-like isoform X1 [Cucurbita moschata]2.3e-18892.95Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QEDF+YANAIPGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWES LSTMNNPIALHSPRE I IHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVH-----QQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR +H     QQQH QQQQPQ LRPSSSHRR+SSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVH-----QQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS

XP_038896639.1 uncharacterized protein At4g06598 [Benincasa hispida]5.6e-19596.56Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSN+RSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSES LIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQE+FRYANAIPGHSWL QEFDHQRDARHAS YTEPNVTKQKNRVWES LSTMNNPIALHSPRE IGIHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDP ESGSHDPKV+SDRKDAS GKS+VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR VHQQQHHQQQQPQQLRPSSSHRRT SKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS

TrEMBL top hitse value%identityAlignment
A0A0A0L9F3 BZIP domain-containing protein1.9e-19697.35Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQE+FRYANA+PGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWES LSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKVASDRKD SHGKS VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRTVH QQH QQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS

A0A1S3C2E7 uncharacterized protein At4g065981.4e-19698.15Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDS MQE+FRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWES LSTMNNPI LHSPRETIGIHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKVASDRKD SHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRTVH QQH QQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS

A0A6J1C5A1 uncharacterized protein At4g06598-like7.1e-18891.27Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVRSFM+SGKHALLPPKSPFPSVSPSYTEYVPNT IGAKA+QRPRDGN+YHQRTSSESILIEEQPSWLDDLLNEPETPVRR+GHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQE+F+Y N +PGHSWLSQEFDHQRDARHASFY E N T+QKNRVWES LSTM+NP ALHSPRE + IHTSGPLSTPQEADGLPS+A
Subjt:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDP ESGSHDPKV+S+RKD +HGKS+VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRL+NLA
Subjt:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR++HQQQHHQQQQPQ LRPSS+HRRTSSKDLD+QFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS

A0A6J1E439 uncharacterized protein At4g06598-like isoform X11.1e-18892.95Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QEDF+YANAIPGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWES LSTMNNPIALHSPRE I IHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQDPVESGSHDPKV+S+RKD SHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTVH-----QQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR +H     QQQH QQQQPQ LRPSSSHRR+SSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTVH-----QQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS

A0A6J1J9S7 uncharacterized protein At4g06598-like isoform X14.6e-18792.67Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTE+VPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QEDF+YANAIPGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWES LSTMNNPIALHSPRE I IHTS PLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTA

Query:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEKQD VESGSH+PKV+S+RKDASHGKS+ SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRTV----HQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR +    HQQQH QQQQPQ LRPSSSHRR+SSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRTV----HQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS

SwissProt top hitse value%identityAlignment
F4IN23 Basic leucine zipper 343.0e-1832.49Show/hide
Query:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPI
        + PSW+D+ L+   +  RR  HRRS SDS A+ +A  V+ +                     +FD   D +  S +T+ +             +  +NP 
Subjt:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPI

Query:  ALHSPRETIG-IHTSGPLSTPQEA-----DGLPST------------ASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKR--AKQQFAQRSRVRK
         +++    +G   +S   STP  +       LP +              E Q   +    D   +++    S G   +     KR  A +Q AQRSRVRK
Subjt:  ALHSPRETIG-IHTSGPLSTPQEA-----DGLPST------------ASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKR--AKQQFAQRSRVRK

Query:  LQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQ
        LQYI+ELER V +LQAE S +S  + FL+ Q L+L+++N ALKQR+  L+Q++L K   QE L+REI RLR V+ QQ
Subjt:  LQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQ

Q5JMK6 Basic leucine zipper 64.5e-1456.04Show/hide
Query:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQH
        A +Q AQRSRVRKLQYI+ELER V  LQ E S +S  + FL+QQ  IL++ N  LKQR+  LAQ+++ K   QE L +EI RLR V+QQQ+
Subjt:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQH

Q6K3R9 Basic leucine zipper 191.2e-1132.37Show/hide
Query:  RRVGHRRSSSDSFAYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPL
        RR  HRRS+SDS A+   A V  D ++         + G      EFD   D +  S +++                       + +P  + G    GP 
Subjt:  RRVGHRRSSSDSFAYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPL

Query:  STPQEAD-GLPSTASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILS
              D G          P  +G+     A+    A+ G +     +   A +Q AQRSRVRKLQYI+ELER V  LQ E S +S  + FL+ Q  +L+
Subjt:  STPQEAD-GLPSTASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILS

Query:  MENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQ
        + N  LKQR+  LAQ+++ K   QE L++EI RLR V+ QQ
Subjt:  MENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQ

Q8W3M7 Uncharacterized protein At4g065981.8e-5047.76Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N R+   +GK ALLPPKSPF        ++VP++VIG+KAVQ+  +GN+ H RTSSES LIEEQPSWLDDLLNEPETPVR+ GHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDA-ANVNFDSIMQEDFRY--ANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLP
        AY D     + D  + +  RY   N    H    +E D+ R ++   FY   +++KQK R W+S   +   P +     E+  I  SG   + ++ +   
Subjt:  AYTDA-ANVNFDSIMQEDFRY--ANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLP

Query:  STASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ
        S A  K+D + + +   K + +++D    KSA S+ + KRA+QQFAQRSRVRK+QYIAELER VQ LQ
Subjt:  STASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ

Q9M2K4 Basic leucine zipper 615.7e-1730.72Show/hide
Query:  ALLPPKSPFPSVSPSYTEYVPNTV--IGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTD-----AANVNFDS
        A LPPK P    +P++ ++    +  I A A      G              ++ PSW+D+ L+   T  RR  HRRS SDS A+ +       N +FD 
Subjt:  ALLPPKSPFPSVSPSYTEYVPNTV--IGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTD-----AANVNFDS

Query:  IMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIAL-------HSPRETIGIHTSGPLSTPQEADGLP-STASEKQ
           E F         S  + +  +     H       NV   ++    S  ST ++  +L        +P      H    ++    A G   + + E Q
Subjt:  IMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIAL-------HSPRETIGIHTSGPLSTPQEADGLP-STASEKQ

Query:  DPVESGSHDPKVASDRKDASHGKSAVSDTENKR--AKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQE
           ++   D   A+     S G         KR  A +Q AQRSRVRKLQYI+ELER V +LQ E S +S  + FL+ Q L+L+++N A+KQR+  LAQ+
Subjt:  DPVESGSHDPKVASDRKDASHGKSAVSDTENKR--AKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQE

Query:  QLIKYLEQEVLEREIGRLRTVHQQQ-------HHQQQQPQQLRPS
        ++ K   QE L+REI RLR V+ QQ       +   Q P  ++PS
Subjt:  QLIKYLEQEVLEREIGRLRTVHQQQ-------HHQQQQPQQLRPS

Arabidopsis top hitse value%identityAlignment
AT1G35490.1 bZIP family transcription factor2.3e-2934.3Show/hide
Query:  NSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQK
        N +H   S + +  E+QP+WLD+LL+EP +P    GHRRS+SD+ AY ++A      +M       N + G SW  Q +D            + N  +Q 
Subjt:  NSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQK

Query:  NRV-WESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYI
        N++ W+   ST N            G +    +S            +    P+E   H  K+         G    S T++KR K Q A R+R+R+L+YI
Subjt:  NRV-WESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYI

Query:  AELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTS---SKDL
        ++LER +Q LQ EG E+S+ + +L+QQ L+LSMEN+ALKQR+++LA+ Q +K++EQ++LEREIG L+    QQ  QQ Q Q     + + +     +++ 
Subjt:  AELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTS---SKDL

Query:  DNQFANLSL
        D QFA L++
Subjt:  DNQFANLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein5.6e-9255.05Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VR+ M  GKHALLPPK PFPSVS SY+EY+P  +IG++  Q+  +  ++HQRTSSES L+EE P WLDDLLNE PE+P R+ GHRRSSSDS
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDAAN-VNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPS
        +AY D AN  N    +Q DF Y N +       QE D  ++A+ A+FY+  +  KQK+R  +S ++T   P  L   RE  G    G L   Q+A  +  
Subjt:  FAYTDAAN-VNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPS

Query:  TASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE
         +SE+++  E  SHDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEGS+VSAEL+FLNQ+NLILSMENKALK+RLE
Subjt:  TASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVT
        ++AQE+LIK LEQEVLE+EIGRLR ++QQQ   Q      +PS+S  R +SKDLD+QF++LSL  KDS   RD V+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVT

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein5.6e-9255.05Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VR+ M  GKHALLPPK PFPSVS SY+EY+P  +IG++  Q+  +  ++HQRTSSES L+EE P WLDDLLNE PE+P R+ GHRRSSSDS
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDAAN-VNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPS
        +AY D AN  N    +Q DF Y N +       QE D  ++A+ A+FY+  +  KQK+R  +S ++T   P  L   RE  G    G L   Q+A  +  
Subjt:  FAYTDAAN-VNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPS

Query:  TASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE
         +SE+++  E  SHDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEGS+VSAEL+FLNQ+NLILSMENKALK+RLE
Subjt:  TASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVT
        ++AQE+LIK LEQEVLE+EIGRLR ++QQQ   Q      +PS+S  R +SKDLD+QF++LSL  KDS   RD V+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRTVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVT

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein2.2e-1932.49Show/hide
Query:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPI
        + PSW+D+ L+   +  RR  HRRS SDS A+ +A  V+ +                     +FD   D +  S +T+ +             +  +NP 
Subjt:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPI

Query:  ALHSPRETIG-IHTSGPLSTPQEA-----DGLPST------------ASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKR--AKQQFAQRSRVRK
         +++    +G   +S   STP  +       LP +              E Q   +    D   +++    S G   +     KR  A +Q AQRSRVRK
Subjt:  ALHSPRETIG-IHTSGPLSTPQEA-----DGLPST------------ASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKR--AKQQFAQRSRVRK

Query:  LQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQ
        LQYI+ELER V +LQAE S +S  + FL+ Q L+L+++N ALKQR+  L+Q++L K   QE L+REI RLR V+ QQ
Subjt:  LQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQ

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2)1.5e-6847.41Show/hide
Query:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N R+   +GK ALLPPKSPF        ++VP++VIG+KAVQ+  +GN+ H RTSSES LIEEQPSWLDDLLNEPETPVR+ GHRRSSSDSF
Subjt:  MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDA-ANVNFDSIMQEDFRY--ANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLP
        AY D     + D  + +  RY   N    H    +E D+ R ++   FY   +++KQK R W+S   +   P +     E+  I  SG   + ++ +   
Subjt:  AYTDA-ANVNFDSIMQEDFRY--ANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLP

Query:  STASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE
        S A  K+D + + +   K + +++D    KSA S+ + KRA+QQFAQRSRVRK+QYIAELER VQ L                       ENK+LK RLE
Subjt:  STASEKQDPVESGSHDPKVASDRKDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRTVH---QQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLK
        +LAQEQLIKYLE +VLE+EI RLR ++   QQQ  QQQ   + + SSSH+R+ S+DL+ QF NLSL+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRTVH---QQQHHQQQQPQQLRPSSSHRRTSSKDLDNQFANLSLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAATTCTAAAGGGTCATCCAACGTTAGAAGCTTTATGAGTTCTGGGAAACATGCACTACTTCCTCCTAAAAGTCCATTTCCTAGTGTTTCTCCATCATATACTGA
ATATGTTCCTAATACTGTAATTGGAGCAAAAGCTGTTCAGAGACCAAGAGATGGTAACAGCTATCATCAAAGAACTTCTTCCGAAAGTATTTTAATAGAGGAGCAGCCTT
CTTGGCTTGATGATCTTCTCAATGAGCCGGAGACCCCTGTTCGCAGAGTTGGTCATCGACGTTCATCGAGTGACTCCTTTGCATATACGGATGCTGCTAATGTAAATTTT
GATAGTATCATGCAAGAAGATTTTAGATATGCAAATGCAATTCCTGGACACTCTTGGTTATCTCAAGAATTTGATCATCAGAGAGATGCGAGGCATGCTTCATTCTATAC
TGAACCAAATGTAACAAAACAGAAGAATAGGGTGTGGGAGTCTCCTTTATCTACCATGAATAATCCCATTGCCCTTCATTCTCCTAGGGAGACCATAGGTATTCATACCT
CAGGGCCATTGAGCACTCCGCAGGAGGCAGATGGTTTGCCTTCTACAGCAAGTGAGAAACAGGATCCAGTTGAGTCTGGTTCACACGATCCAAAAGTCGCTTCTGACAGG
AAGGATGCTTCTCATGGAAAATCAGCTGTGTCTGATACAGAAAATAAACGGGCCAAACAGCAATTTGCTCAGCGTTCAAGGGTTCGGAAACTTCAATATATAGCTGAGCT
TGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGGCTCTGAGGTCTCAGCTGAGCTTGAGTTTCTCAACCAGCAAAATCTTATTCTTAGCATGGAAAATAAAGCCCTCAAGC
AGCGGTTGGAGAATTTAGCTCAAGAGCAGCTAATTAAATACTTGGAGCAGGAAGTACTGGAGAGGGAGATTGGAAGGTTAAGAACTGTGCACCAGCAGCAACATCATCAG
CAGCAACAACCGCAACAACTACGACCTTCTTCGAGTCATCGGCGTACTTCGAGCAAAGACCTTGACAATCAATTTGCTAACCTTTCATTGAAGCAAAAGGATTCTGGTTC
AAGTCGTGACCCGGTAACAGGTCCAGTGCGCAGTTAG
mRNA sequenceShow/hide mRNA sequence
AAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAGAGAAAGAGCTGAGCATCTTGATTCTTCAGAGCTGGGGGTCCCCAAATACAAACCCTAAAC
AGAAGGAAAAGGAATTATTCACATTTCTTACTATGGCAAATTCTAAAGGGTCATCCAACGTTAGAAGCTTTATGAGTTCTGGGAAACATGCACTACTTCCTCCTAAAAGT
CCATTTCCTAGTGTTTCTCCATCATATACTGAATATGTTCCTAATACTGTAATTGGAGCAAAAGCTGTTCAGAGACCAAGAGATGGTAACAGCTATCATCAAAGAACTTC
TTCCGAAAGTATTTTAATAGAGGAGCAGCCTTCTTGGCTTGATGATCTTCTCAATGAGCCGGAGACCCCTGTTCGCAGAGTTGGTCATCGACGTTCATCGAGTGACTCCT
TTGCATATACGGATGCTGCTAATGTAAATTTTGATAGTATCATGCAAGAAGATTTTAGATATGCAAATGCAATTCCTGGACACTCTTGGTTATCTCAAGAATTTGATCAT
CAGAGAGATGCGAGGCATGCTTCATTCTATACTGAACCAAATGTAACAAAACAGAAGAATAGGGTGTGGGAGTCTCCTTTATCTACCATGAATAATCCCATTGCCCTTCA
TTCTCCTAGGGAGACCATAGGTATTCATACCTCAGGGCCATTGAGCACTCCGCAGGAGGCAGATGGTTTGCCTTCTACAGCAAGTGAGAAACAGGATCCAGTTGAGTCTG
GTTCACACGATCCAAAAGTCGCTTCTGACAGGAAGGATGCTTCTCATGGAAAATCAGCTGTGTCTGATACAGAAAATAAACGGGCCAAACAGCAATTTGCTCAGCGTTCA
AGGGTTCGGAAACTTCAATATATAGCTGAGCTTGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGGCTCTGAGGTCTCAGCTGAGCTTGAGTTTCTCAACCAGCAAAATCT
TATTCTTAGCATGGAAAATAAAGCCCTCAAGCAGCGGTTGGAGAATTTAGCTCAAGAGCAGCTAATTAAATACTTGGAGCAGGAAGTACTGGAGAGGGAGATTGGAAGGT
TAAGAACTGTGCACCAGCAGCAACATCATCAGCAGCAACAACCGCAACAACTACGACCTTCTTCGAGTCATCGGCGTACTTCGAGCAAAGACCTTGACAATCAATTTGCT
AACCTTTCATTGAAGCAAAAGGATTCTGGTTCAAGTCGTGACCCGGTAACAGGTCCAGTGCGCAGTTAGGTGTCTGGTTTGGCTTCAAATGTGTTTGTGCCTGGGTGATT
TTTGCCAAATTGGTGAAACGAAAAATGATTGTCCCTTCGCGCAAGCCACTGCTGATCTTTTCCAGTCTTGGTACCGTATACCTGTCTGTCTCTTTCTCTCCTCTCTACCC
TTTTAATTCTTTGCCGCTGTGTTCCTGCCTTTTCTGGAGTTAACATCTTCATATGAATTGAATGCACCTGGTGGTCCATTGCCGTAGTATTTGTTTGTTCTACATGGGTT
GTGTGGCCTACCGGAGGTGGTATTTGTATGTACTGTATGTCTAATCAATTATAATTTACATCAGTTGTAAGCACCATTTTTTTTAGTTTCATCCCAAAAAAAAAAAAAAA
GGAAAAAGGTTCCGCTCCCACAGTGTGAACATCTTCTGTGTGCTGTTTGGAACAGACATTCAGTACCTGTAATGCATCCATGGTGAGAAATAACTTATTGTGATTTCCAA
TGAAATCCCTGATTTACTGA
Protein sequenceShow/hide protein sequence
MANSKGSSNVRSFMSSGKHALLPPKSPFPSVSPSYTEYVPNTVIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNF
DSIMQEDFRYANAIPGHSWLSQEFDHQRDARHASFYTEPNVTKQKNRVWESPLSTMNNPIALHSPRETIGIHTSGPLSTPQEADGLPSTASEKQDPVESGSHDPKVASDR
KDASHGKSAVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRTVHQQQHHQ
QQQPQQLRPSSSHRRTSSKDLDNQFANLSLKQKDSGSSRDPVTGPVRS