; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G005760 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G005760
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationCmo_Chr04:2876910..2880917
RNA-Seq ExpressionCmoCh04G005760
SyntenyCmoCh04G005760
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain
IPR044797 - Uncharacterized protein At4g06598-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031031.1 hypothetical protein SDJN02_05070 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-20699.74Show/hide
Query:  MLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVR
        MLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVR
Subjt:  MLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVR

Query:  RVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLI
        RVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLI
Subjt:  RVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLI

Query:  PPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSME
        PPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSME
Subjt:  PPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSME

Query:  NKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        NK+LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  NKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

XP_022942253.1 uncharacterized protein At4g06598-like isoform X1 [Cucurbita moschata]2.7e-205100Show/hide
Query:  AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG
        AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG
Subjt:  AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG

Query:  HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ
        HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ
Subjt:  HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ

Query:  EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKS
        EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKS
Subjt:  EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKS

Query:  LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

XP_022942254.1 uncharacterized protein At4g06598-like isoform X2 [Cucurbita moschata]7.7e-200100Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
        AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
Subjt:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE

Query:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLA
        SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLA
Subjt:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

XP_022981031.1 uncharacterized protein At4g06598-like isoform X1 [Cucurbita maxima]1.2e-20097.14Show/hide
Query:  LQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRR
        +  KEFLEF+TMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRD NSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRR
Subjt:  LQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRR

Query:  VGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIP
        VGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIP
Subjt:  VGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIP

Query:  PQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMEN
        PQEADGLLST SEKQDPTESGPHDPK+SSERKD+SHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMEN
Subjt:  PQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMEN

Query:  KSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        K+LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQ QQLRPSSSHRRTSSKDLDSQF NLSLKQKDSGSSCDPVTGPVRS
Subjt:  KSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

XP_023533631.1 uncharacterized protein At4g06598-like isoform X1 [Cucurbita pepo subsp. pepo]8.2e-20298.43Show/hide
Query:  KEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGH
        KEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGH
Subjt:  KEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGH

Query:  RRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQE
        RRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWL QEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQE
Subjt:  RRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQE

Query:  ADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSL
        ADGLLST SEKQDPTESGPHDPK+SSERKD+SHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENK+L
Subjt:  ADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSL

Query:  KQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        KQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQ QQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  KQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

TrEMBL top hitse value%identityAlignment
A0A6J1E439 uncharacterized protein At4g06598-like isoform X16.8e-17887.73Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        M  SKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTE+VPNT IGAKAVQRPRDGNSYHQRTSSES LIEEQPSWL+DLLNEPETPVRRVGHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
        AYTD+AN NFDSI QEDFKY N+IPGHSWLSQEFD QRDARHASFYTEANVTKQKNRVWESSLS MNNP+AL SPRENIV+HTSGPL  PQEADGL ST 
Subjt:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE

Query:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLA
        SEKQDP ESG HDPK+SSERKD+SHGKSS SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGS+VSAELEFLNQQ++ILSMEN +LKQRLE+LA
Subjt:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQ----------LRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRA+HQQQ  QQ          LRPSSSHRR+SSKDLD+QFANLSLKQKDSGSS DPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQ----------LRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

A0A6J1FPQ6 uncharacterized protein At4g06598-like isoform X23.7e-200100Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
        AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
Subjt:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE

Query:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLA
        SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLA
Subjt:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

A0A6J1FUC3 uncharacterized protein At4g06598-like isoform X11.3e-205100Show/hide
Query:  AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG
        AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG
Subjt:  AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG

Query:  HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ
        HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ
Subjt:  HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ

Query:  EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKS
        EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKS
Subjt:  EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKS

Query:  LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

A0A6J1IVC2 uncharacterized protein At4g06598-like isoform X15.7e-20197.14Show/hide
Query:  LQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRR
        +  KEFLEF+TMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRD NSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRR
Subjt:  LQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRR

Query:  VGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIP
        VGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIP
Subjt:  VGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIP

Query:  PQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMEN
        PQEADGLLST SEKQDPTESGPHDPK+SSERKD+SHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMEN
Subjt:  PQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMEN

Query:  KSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        K+LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQ QQLRPSSSHRRTSSKDLDSQF NLSLKQKDSGSSCDPVTGPVRS
Subjt:  KSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

A0A6J1J119 uncharacterized protein At4g06598-like isoform X21.9e-19698.12Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRD NSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
        AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLST 
Subjt:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE

Query:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLA
        SEKQDPTESGPHDPK+SSERKD+SHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENK+LKQRLESLA
Subjt:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRAVHQQQQ QQLRPSSSHRRTSSKDLDSQF NLSLKQKDSGSSCDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

SwissProt top hitse value%identityAlignment
F4IN23 Basic leucine zipper 342.4e-1832.97Show/hide
Query:  EQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTE--------ANVTKQKNRVWESS
        + PSW+++ L+   +  RR  HRRS SDS A+ ++               T SI  H     +FDR  D +  S +T+        +++  + N V  + 
Subjt:  EQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTE--------ANVTKQKNRVWESS

Query:  LSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTES----GPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAE
         S+  +     +P  +   +     +PP + +   +  +   D  +S     P D   S+     S G         KR  A +Q AQRSRVRKLQYI+E
Subjt:  LSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTES----GPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAE

Query:  LERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQL
        LER V +LQAE S +S  + FL+ Q ++L+++N +LKQR+ +L+Q++L K   QE L+REI RLR V+ QQ L
Subjt:  LERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQL

Q5JMK6 Basic leucine zipper 67.1e-1556.67Show/hide
Query:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQ
        A +Q AQRSRVRKLQYI+ELER V  LQ E S +S  + FL+QQ  IL++ N  LKQR+ +LAQ+++ K   QE L +EI RLR V+QQQ
Subjt:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQ

Q6K3R9 Basic leucine zipper 195.1e-1332.51Show/hide
Query:  RRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPL
        RR  HRRS+SDS A+   A    D ++         + G      EFDR  D +  S +++                A+++    R P     +   G  
Subjt:  RRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPL

Query:  IPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSM
              DG+ +T      P  +G      ++   D   G +     +   A +Q AQRSRVRKLQYI+ELER V  LQ E S +S  + FL+ Q  +L++
Subjt:  IPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSM

Query:  ENKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQ
         N  LKQR+ +LAQ+++ K   QE L++EI RLR V+ QQQ++
Subjt:  ENKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQ

Q8W3M7 Uncharacterized protein At4g065981.3e-4847.39Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N RN   +GK ALLPPKSPF        ++VP++ IG+KAVQ+  +GN+ H RTSSESFLIEEQPSWL+DLLNEPETPVR+ GHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTD-SANANFDSIVQEDFKYTNS--IPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLL
        AY D     + D  + +  +Y N+     H    +E D  R ++   FY  A+++KQK R W+S   +   P +     E+  +  SG     ++ +   
Subjt:  AYTD-SANANFDSIVQEDFKYTNS--IPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLL

Query:  STESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ
        S    K+D      +  K S E++D    KS+ S+ + KRA+QQFAQRSRVRK+QYIAELER VQ LQ
Subjt:  STESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ

Q9M2K4 Basic leucine zipper 615.3e-1830.72Show/hide
Query:  ALLPPKSPFPSVSPSYTEYVPN--TAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDS-----ANANFDS
        A LPPK P    +P++ ++      +I A A      G              ++ PSW+++ L+   T  RR  HRRS SDS A+ +       N +FD 
Subjt:  ALLPPKSPFPSVSPSYTEYVPN--TAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDS-----ANANFDS

Query:  IVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMN----NPMALRSPRENIVVHTSGPLIPPQEADGLLSTES-EKQDPT
           E F         S  + +        H       NV   ++    S+ S  N    +     +P  +   H    +     A G    ES E Q   
Subjt:  IVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMN----NPMALRSPRENIVVHTSGPLIPPQEADGLLSTES-EKQDPT

Query:  ESGPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLI
        ++ P D   +++    S G         KR  A +Q AQRSRVRKLQYI+ELER V +LQ E S +S  + FL+ Q ++L+++N ++KQR+ +LAQ+++ 
Subjt:  ESGPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLI

Query:  KYLEQEVLEREIGRLRAVHQQQQLQQLRPSSS
        K   QE L+REI RLR V+ QQ L+++  + S
Subjt:  KYLEQEVLEREIGRLRAVHQQQQLQQLRPSSS

Arabidopsis top hitse value%identityAlignment
AT1G35490.1 bZIP family transcription factor3.6e-3033.76Show/hide
Query:  NSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQK
        N +H   S +    E+QP+WL++LL+EP +P    GHRRS+SD+ AY +SA               N + G SW  Q +D            ++N  +Q 
Subjt:  NSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQK

Query:  NRV-WESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSG--SDTENKRAKQQFAQRSRVRKLQ
        N++ W+ S +                   +G  I    + G L+  S+   P E      K  S+ K+ +  K  G  S T++KR K Q A R+R+R+L+
Subjt:  NRV-WESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSG--SDTENKRAKQQFAQRSRVRKLQ

Query:  YIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPS--------SSHRRTSSK
        YI++LER +Q LQ EG ++S+ + +L+QQ ++LSMEN++LKQR++SLA+ Q +K++EQ++LEREIG L+    QQQ QQ +          + ++   ++
Subjt:  YIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPS--------SSHRRTSSK

Query:  DLDSQFANLSL
        + D+QFA L++
Subjt:  DLDSQFANLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein1.9e-9256.33Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VRN M  GKHALLPPK PFPSVS SY+EY+P   IG++  Q+  +  ++HQRTSSES L+EE P WL+DLLNE PE+P R+ GHRRSSSDS
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDSANA-NFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLS
        +AY D ANA N    +Q DF Y N++       QE DR ++A+ A+FY+ A+  KQK+R  +S ++    P  L   REN      G L   Q+A  +  
Subjt:  FAYTDSANA-NFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLS

Query:  TESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLE
          SE+++  E   HDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEGS VSAEL+FLNQ+N+ILSMENK+LK+RLE
Subjt:  TESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLE

Query:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVT
        S+AQE+LIK LEQEVLE+EIGRLRA++QQQQ Q  +PS+S  R +SKDLDSQF++LSL  KDS    D V+
Subjt:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVT

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein1.9e-9256.33Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VRN M  GKHALLPPK PFPSVS SY+EY+P   IG++  Q+  +  ++HQRTSSES L+EE P WL+DLLNE PE+P R+ GHRRSSSDS
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDSANA-NFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLS
        +AY D ANA N    +Q DF Y N++       QE DR ++A+ A+FY+ A+  KQK+R  +S ++    P  L   REN      G L   Q+A  +  
Subjt:  FAYTDSANA-NFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLS

Query:  TESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLE
          SE+++  E   HDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEGS VSAEL+FLNQ+N+ILSMENK+LK+RLE
Subjt:  TESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLE

Query:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVT
        S+AQE+LIK LEQEVLE+EIGRLRA++QQQQ Q  +PS+S  R +SKDLDSQF++LSL  KDS    D V+
Subjt:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVT

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein1.7e-1932.97Show/hide
Query:  EQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTE--------ANVTKQKNRVWESS
        + PSW+++ L+   +  RR  HRRS SDS A+ ++               T SI  H     +FDR  D +  S +T+        +++  + N V  + 
Subjt:  EQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTE--------ANVTKQKNRVWESS

Query:  LSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTES----GPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAE
         S+  +     +P  +   +     +PP + +   +  +   D  +S     P D   S+     S G         KR  A +Q AQRSRVRKLQYI+E
Subjt:  LSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTES----GPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAE

Query:  LERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQL
        LER V +LQAE S +S  + FL+ Q ++L+++N +LKQR+ +L+Q++L K   QE L+REI RLR V+ QQ L
Subjt:  LERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQL

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2)2.2e-6747.41Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N RN   +GK ALLPPKSPF        ++VP++ IG+KAVQ+  +GN+ H RTSSESFLIEEQPSWL+DLLNEPETPVR+ GHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTD-SANANFDSIVQEDFKYTNS--IPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLL
        AY D     + D  + +  +Y N+     H    +E D  R ++   FY  A+++KQK R W+S   +   P +     E+  +  SG     ++ +   
Subjt:  AYTD-SANANFDSIVQEDFKYTNS--IPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLL

Query:  STESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLE
        S    K+D      +  K S E++D    KS+ S+ + KRA+QQFAQRSRVRK+QYIAELER VQ L                       ENKSLK RLE
Subjt:  STESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLE

Query:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQ--------LRPSSSHRRTSSKDLDSQFANLSLK
        SLAQEQLIKYLE +VLE+EI RLRA++Q QQ Q+         + SSSH+R+ S+DL++QF NLSL+
Subjt:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQ--------LRPSSSHRRTSSKDLDSQFANLSLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACAGGCAAAGGAATTTCTCGAATTTTATACCATGGCAACTTCAAAAGGGTCATCCAACGTCAGAAATTTTATGAGTTCTGGAAAACATGCACTACTCCCTCCTAA
AAGTCCTTTCCCTAGTGTTTCTCCATCATATACTGAATATGTTCCCAATACTGCAATTGGAGCAAAAGCTGTTCAGAGACCAAGAGATGGTAACAGCTACCATCAAAGAA
CTTCTTCTGAAAGCTTTCTAATAGAGGAGCAGCCTTCTTGGCTTGAAGATCTTCTTAATGAGCCGGAGACCCCTGTTCGCAGAGTTGGTCATCGACGTTCATCAAGTGAC
TCCTTTGCATATACAGATTCTGCTAATGCTAATTTTGATAGTATCGTGCAAGAAGACTTCAAATATACAAATTCTATTCCTGGACACTCTTGGTTATCTCAAGAATTTGA
TCGTCAGAGAGATGCAAGGCATGCTTCATTTTATACTGAAGCGAATGTGACAAAGCAGAAGAATAGGGTGTGGGAATCATCTTTGTCTGCCATGAATAATCCCATGGCTC
TTCGTTCTCCCAGGGAGAACATTGTTGTTCATACCTCGGGACCATTGATCCCTCCGCAGGAGGCAGATGGTTTGCTTTCTACAGAAAGTGAAAAACAGGATCCAACTGAG
TCTGGTCCACACGATCCAAAAATCTCTTCTGAAAGGAAAGACATGTCTCATGGAAAATCATCCGGGTCGGATACCGAAAATAAACGTGCCAAACAGCAATTTGCTCAGCG
TTCAAGGGTTCGGAAACTTCAATATATAGCTGAGCTTGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGGCTCTGACGTCTCAGCTGAGCTTGAATTTCTCAACCAGCAAA
ATATTATTCTTAGCATGGAGAACAAATCCCTCAAGCAGCGGTTAGAGAGTTTAGCTCAAGAGCAGCTAATTAAATACTTGGAGCAGGAAGTACTGGAGAGGGAGATTGGA
AGGTTAAGAGCTGTGCATCAGCAGCAACAACTGCAACAACTACGGCCTTCTTCTAGTCATCGGCGTACTTCAAGCAAAGACCTTGACAGTCAATTTGCAAATCTTTCTTT
GAAGCAAAAGGATTCTGGTTCAAGTTGTGACCCAGTAACAGGTCCAGTGCGCAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTACAGGCAAAGGAATTTCTCGAATTTTATACCATGGCAACTTCAAAAGGGTCATCCAACGTCAGAAATTTTATGAGTTCTGGAAAACATGCACTACTCCCTCCTAA
AAGTCCTTTCCCTAGTGTTTCTCCATCATATACTGAATATGTTCCCAATACTGCAATTGGAGCAAAAGCTGTTCAGAGACCAAGAGATGGTAACAGCTACCATCAAAGAA
CTTCTTCTGAAAGCTTTCTAATAGAGGAGCAGCCTTCTTGGCTTGAAGATCTTCTTAATGAGCCGGAGACCCCTGTTCGCAGAGTTGGTCATCGACGTTCATCAAGTGAC
TCCTTTGCATATACAGATTCTGCTAATGCTAATTTTGATAGTATCGTGCAAGAAGACTTCAAATATACAAATTCTATTCCTGGACACTCTTGGTTATCTCAAGAATTTGA
TCGTCAGAGAGATGCAAGGCATGCTTCATTTTATACTGAAGCGAATGTGACAAAGCAGAAGAATAGGGTGTGGGAATCATCTTTGTCTGCCATGAATAATCCCATGGCTC
TTCGTTCTCCCAGGGAGAACATTGTTGTTCATACCTCGGGACCATTGATCCCTCCGCAGGAGGCAGATGGTTTGCTTTCTACAGAAAGTGAAAAACAGGATCCAACTGAG
TCTGGTCCACACGATCCAAAAATCTCTTCTGAAAGGAAAGACATGTCTCATGGAAAATCATCCGGGTCGGATACCGAAAATAAACGTGCCAAACAGCAATTTGCTCAGCG
TTCAAGGGTTCGGAAACTTCAATATATAGCTGAGCTTGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGGCTCTGACGTCTCAGCTGAGCTTGAATTTCTCAACCAGCAAA
ATATTATTCTTAGCATGGAGAACAAATCCCTCAAGCAGCGGTTAGAGAGTTTAGCTCAAGAGCAGCTAATTAAATACTTGGAGCAGGAAGTACTGGAGAGGGAGATTGGA
AGGTTAAGAGCTGTGCATCAGCAGCAACAACTGCAACAACTACGGCCTTCTTCTAGTCATCGGCGTACTTCAAGCAAAGACCTTGACAGTCAATTTGCAAATCTTTCTTT
GAAGCAAAAGGATTCTGGTTCAAGTTGTGACCCAGTAACAGGTCCAGTGCGCAGTTAGGTTTCATGGTTGACTTCACGTGTTGTGCCTGGGAACGTTCGCCAAATTGGTG
AACGAAAATGATCGTCACTTCATGCAAGACAAGCTGATTGCTTCCAGTCTTGGTACCTCTCTGTTTCTCTCAGGTCTCCTTTTAATTCTTTGCCGCTGTTTCCTACCTTT
TCCTGGAGTTAACATCTTCATAATGCATGCACCTGGTGGTCATTGCCGTTGTCTTGTTTGTTCAACATGAGATCATTGACTTGACTTTAGGCTTGTGCTCCCAACTGTAC
CAGGTTTTTTCAGCACTCTTTTTAACGTTTATAACAAAATTTTCTCTCATGTCTCATATTGTTTGTTTTAGACCGTTACGTGTAGTTGTCAACCTCACAATTTTAAAATG
TGTCTTTTAGGGAAAGGTTTCCACCCTTGGGGGCCAGCGTTCTTGTTGGCACACCACCCATTGCCTGGCTTGATACATTTGTAAACAGTCCAAAATCACCGTTAGCTCAC
CGCTAGCAGATATTGTCCGCTTTAGTCCGTTACATACTGTTGTCAACCTCACAGTTAAGAAACGCATTTTAAAACCATGTAGTAGCAATATGTAATGGGTCAAAGTAGAC
AATATTTGCTAACGGTGAGTTTAGGCTGTTTACTAATGGTATCAAAGACTTAATGGTAATACATAACGGACCAACACAGACAATTAGATTGACATCTTGTAAGATTAATT
GGAACACATATCTCATTCATGGTGAAAGCTTGATAATATAGTAAGCCGACTCTAATGCATTATAATGTACACATATGGCCACAATTTGTACGGGACTGATTTTGGTCACC
TTTAAAAAA
Protein sequenceShow/hide protein sequence
MLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSD
SFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTE
SGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKSLKQRLESLAQEQLIKYLEQEVLEREIG
RLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS