; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg13199 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg13199
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationCarg_Chr04:2883752..2887002
RNA-Seq ExpressionCarg13199
SyntenyCarg13199
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain
IPR044797 - Uncharacterized protein At4g06598-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031031.1 hypothetical protein SDJN02_05070 [Cucurbita argyrosperma subsp. argyrosperma]6.9e-223100Show/hide
Query:  MLMFFGLLFYCFLTLICCPCLSVSEIFLIMLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYH
        MLMFFGLLFYCFLTLICCPCLSVSEIFLIMLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYH
Subjt:  MLMFFGLLFYCFLTLICCPCLSVSEIFLIMLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYH

Query:  QRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVW
        QRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVW
Subjt:  QRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVW

Query:  ESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELER
        ESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELER
Subjt:  ESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELER

Query:  KVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKD
        KVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKD
Subjt:  KVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKD

Query:  SGSSCDPVTGPVRS
        SGSSCDPVTGPVRS
Subjt:  SGSSCDPVTGPVRS

XP_022942253.1 uncharacterized protein At4g06598-like isoform X1 [Cucurbita moschata]3.8e-20599.74Show/hide
Query:  AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG
        AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG
Subjt:  AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG

Query:  HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ
        HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ
Subjt:  HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ

Query:  EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKA
        EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENK+
Subjt:  EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKA

Query:  LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

XP_022942254.1 uncharacterized protein At4g06598-like isoform X2 [Cucurbita moschata]1.1e-19999.73Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
        AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
Subjt:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE

Query:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLA
        SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENK+LKQRLESLA
Subjt:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

XP_022981031.1 uncharacterized protein At4g06598-like isoform X1 [Cucurbita maxima]9.7e-20193.84Show/hide
Query:  FYCFLTLICCPCLSVSEIFLIMLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESF
        FY FL+ I   C  + E     +  KEFLEF+TMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRD NSYHQRTSSESF
Subjt:  FYCFLTLICCPCLSVSEIFLIMLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESF

Query:  LIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMN
        LIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMN
Subjt:  LIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMN

Query:  NPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAE
        NPMALRSPRENIVVHTSGPLIPPQEADGLLST SEKQDPTESGPHDPK+SSERKD+SHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAE
Subjt:  NPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAE

Query:  GSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPV
        GSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQ QQLRPSSSHRRTSSKDLDSQF NLSLKQKDSGSSCDPV
Subjt:  GSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPV

Query:  TGPVRS
        TGPVRS
Subjt:  TGPVRS

XP_023533631.1 uncharacterized protein At4g06598-like isoform X1 [Cucurbita pepo subsp. pepo]2.3e-20298.69Show/hide
Query:  KEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGH
        KEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGH
Subjt:  KEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGH

Query:  RRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQE
        RRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWL QEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQE
Subjt:  RRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQE

Query:  ADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKAL
        ADGLLST SEKQDPTESGPHDPK+SSERKD+SHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKAL
Subjt:  ADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKAL

Query:  KQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        KQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQ QQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  KQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

TrEMBL top hitse value%identityAlignment
A0A6J1E439 uncharacterized protein At4g06598-like isoform X11.9e-17887.99Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        M  SKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTE+VPNT IGAKAVQRPRDGNSYHQRTSSES LIEEQPSWL+DLLNEPETPVRRVGHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
        AYTD+AN NFDSI QEDFKY N+IPGHSWLSQEFD QRDARHASFYTEANVTKQKNRVWESSLS MNNP+AL SPRENIV+HTSGPL  PQEADGL ST 
Subjt:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE

Query:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLA
        SEKQDP ESG HDPK+SSERKD+SHGKSS SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGS+VSAELEFLNQQ++ILSMEN ALKQRLE+LA
Subjt:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQ----------LRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRA+HQQQ  QQ          LRPSSSHRR+SSKDLD+QFANLSLKQKDSGSS DPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQ----------LRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

A0A6J1FPQ6 uncharacterized protein At4g06598-like isoform X25.2e-20099.73Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
        AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
Subjt:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE

Query:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLA
        SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENK+LKQRLESLA
Subjt:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

A0A6J1FUC3 uncharacterized protein At4g06598-like isoform X11.9e-20599.74Show/hide
Query:  AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG
        AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG
Subjt:  AKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVG

Query:  HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ
        HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ
Subjt:  HRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQ

Query:  EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKA
        EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENK+
Subjt:  EADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKA

Query:  LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
Subjt:  LKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

A0A6J1IVC2 uncharacterized protein At4g06598-like isoform X14.7e-20193.84Show/hide
Query:  FYCFLTLICCPCLSVSEIFLIMLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESF
        FY FL+ I   C  + E     +  KEFLEF+TMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRD NSYHQRTSSESF
Subjt:  FYCFLTLICCPCLSVSEIFLIMLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESF

Query:  LIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMN
        LIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMN
Subjt:  LIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMN

Query:  NPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAE
        NPMALRSPRENIVVHTSGPLIPPQEADGLLST SEKQDPTESGPHDPK+SSERKD+SHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAE
Subjt:  NPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAE

Query:  GSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPV
        GSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQ QQLRPSSSHRRTSSKDLDSQF NLSLKQKDSGSSCDPV
Subjt:  GSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPV

Query:  TGPVRS
        TGPVRS
Subjt:  TGPVRS

A0A6J1J119 uncharacterized protein At4g06598-like isoform X25.4e-19798.39Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRD NSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE
        AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLST 
Subjt:  AYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTE

Query:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLA
        SEKQDPTESGPHDPK+SSERKD+SHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLA
Subjt:  SEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRAVHQQQQ QQLRPSSSHRRTSSKDLDSQF NLSLKQKDSGSSCDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS

SwissProt top hitse value%identityAlignment
F4IN23 Basic leucine zipper 348.7e-1933.33Show/hide
Query:  EQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTE--------ANVTKQKNRVWESS
        + PSW+++ L+   +  RR  HRRS SDS A+ ++               T SI  H     +FDR  D +  S +T+        +++  + N V  + 
Subjt:  EQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTE--------ANVTKQKNRVWESS

Query:  LSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTES----GPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAE
         S+  +     +P  +   +     +PP + +   +  +   D  +S     P D   S+     S G         KR  A +Q AQRSRVRKLQYI+E
Subjt:  LSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTES----GPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAE

Query:  LERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQL
        LER V +LQAE S +S  + FL+ Q ++L+++N ALKQR+ +L+Q++L K   QE L+REI RLR V+ QQ L
Subjt:  LERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQL

Q5JMK6 Basic leucine zipper 61.0e-1456.67Show/hide
Query:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQ
        A +Q AQRSRVRKLQYI+ELER V  LQ E S +S  + FL+QQ  IL++ N  LKQR+ +LAQ+++ K   QE L +EI RLR V+QQQ
Subjt:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQ

Q6K3R9 Basic leucine zipper 195.5e-1332.51Show/hide
Query:  RRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPL
        RR  HRRS+SDS A+   A    D ++         + G      EFDR  D +  S +++                A+++    R P     +   G  
Subjt:  RRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPL

Query:  IPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSM
              DG+ +T      P  +G      ++   D   G +     +   A +Q AQRSRVRKLQYI+ELER V  LQ E S +S  + FL+ Q  +L++
Subjt:  IPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSM

Query:  ENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQ
         N  LKQR+ +LAQ+++ K   QE L++EI RLR V+ QQQ++
Subjt:  ENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQ

Q8W3M7 Uncharacterized protein At4g065981.1e-4847.39Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N RN   +GK ALLPPKSPF        ++VP++ IG+KAVQ+  +GN+ H RTSSESFLIEEQPSWL+DLLNEPETPVR+ GHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTD-SANANFDSIVQEDFKYTNS--IPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLL
        AY D     + D  + +  +Y N+     H    +E D  R ++   FY  A+++KQK R W+S   +   P +     E+  +  SG     ++ +   
Subjt:  AYTD-SANANFDSIVQEDFKYTNS--IPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLL

Query:  STESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ
        S    K+D      +  K S E++D    KS+ S+ + KRA+QQFAQRSRVRK+QYIAELER VQ LQ
Subjt:  STESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ

Q9M2K4 Basic leucine zipper 612.5e-1831.02Show/hide
Query:  ALLPPKSPFPSVSPSYTEYVPN--TAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDS-----ANANFDS
        A LPPK P    +P++ ++      +I A A      G              ++ PSW+++ L+   T  RR  HRRS SDS A+ +       N +FD 
Subjt:  ALLPPKSPFPSVSPSYTEYVPN--TAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDS-----ANANFDS

Query:  IVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMN----NPMALRSPRENIVVHTSGPLIPPQEADGLLSTES-EKQDPT
           E F         S  + +        H       NV   ++    S+ S  N    +     +P  +   H    +     A G    ES E Q   
Subjt:  IVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMN----NPMALRSPRENIVVHTSGPLIPPQEADGLLSTES-EKQDPT

Query:  ESGPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLI
        ++ P D   +++    S G         KR  A +Q AQRSRVRKLQYI+ELER V +LQ E S +S  + FL+ Q ++L+++N A+KQR+ +LAQ+++ 
Subjt:  ESGPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLI

Query:  KYLEQEVLEREIGRLRAVHQQQQLQQLRPSSS
        K   QE L+REI RLR V+ QQ L+++  + S
Subjt:  KYLEQEVLEREIGRLRAVHQQQQLQQLRPSSS

Arabidopsis top hitse value%identityAlignment
AT1G35490.1 bZIP family transcription factor1.3e-3034.08Show/hide
Query:  NSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQK
        N +H   S +    E+QP+WL++LL+EP +P    GHRRS+SD+ AY +SA               N + G SW  Q +D            ++N  +Q 
Subjt:  NSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQK

Query:  NRV-WESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSG--SDTENKRAKQQFAQRSRVRKLQ
        N++ W+ S +                   +G  I    + G L+  S+   P E      K  S+ K+ +  K  G  S T++KR K Q A R+R+R+L+
Subjt:  NRV-WESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSG--SDTENKRAKQQFAQRSRVRKLQ

Query:  YIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPS--------SSHRRTSSK
        YI++LER +Q LQ EG ++S+ + +L+QQ ++LSMEN+ALKQR++SLA+ Q +K++EQ++LEREIG L+    QQQ QQ +          + ++   ++
Subjt:  YIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPS--------SSHRRTSSK

Query:  DLDSQFANLSL
        + D+QFA L++
Subjt:  DLDSQFANLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein7.2e-9356.6Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VRN M  GKHALLPPK PFPSVS SY+EY+P   IG++  Q+  +  ++HQRTSSES L+EE P WL+DLLNE PE+P R+ GHRRSSSDS
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDSANA-NFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLS
        +AY D ANA N    +Q DF Y N++       QE DR ++A+ A+FY+ A+  KQK+R  +S ++    P  L   REN      G L   Q+A  +  
Subjt:  FAYTDSANA-NFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLS

Query:  TESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLE
          SE+++  E   HDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEGS VSAEL+FLNQ+N+ILSMENKALK+RLE
Subjt:  TESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLE

Query:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVT
        S+AQE+LIK LEQEVLE+EIGRLRA++QQQQ Q  +PS+S  R +SKDLDSQF++LSL  KDS    D V+
Subjt:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVT

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein7.2e-9356.6Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VRN M  GKHALLPPK PFPSVS SY+EY+P   IG++  Q+  +  ++HQRTSSES L+EE P WL+DLLNE PE+P R+ GHRRSSSDS
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDSANA-NFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLS
        +AY D ANA N    +Q DF Y N++       QE DR ++A+ A+FY+ A+  KQK+R  +S ++    P  L   REN      G L   Q+A  +  
Subjt:  FAYTDSANA-NFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLS

Query:  TESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLE
          SE+++  E   HDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEGS VSAEL+FLNQ+N+ILSMENKALK+RLE
Subjt:  TESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLE

Query:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVT
        S+AQE+LIK LEQEVLE+EIGRLRA++QQQQ Q  +PS+S  R +SKDLDSQF++LSL  KDS    D V+
Subjt:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVT

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein6.2e-2033.33Show/hide
Query:  EQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTE--------ANVTKQKNRVWESS
        + PSW+++ L+   +  RR  HRRS SDS A+ ++               T SI  H     +FDR  D +  S +T+        +++  + N V  + 
Subjt:  EQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTE--------ANVTKQKNRVWESS

Query:  LSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTES----GPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAE
         S+  +     +P  +   +     +PP + +   +  +   D  +S     P D   S+     S G         KR  A +Q AQRSRVRKLQYI+E
Subjt:  LSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLLSTESEKQDPTES----GPHDPKISSERKDMSHGKSSGSDTENKR--AKQQFAQRSRVRKLQYIAE

Query:  LERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQL
        LER V +LQAE S +S  + FL+ Q ++L+++N ALKQR+ +L+Q++L K   QE L+REI RLR V+ QQ L
Subjt:  LERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQL

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2)3.0e-6747.14Show/hide
Query:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N RN   +GK ALLPPKSPF        ++VP++ IG+KAVQ+  +GN+ H RTSSESFLIEEQPSWL+DLLNEPETPVR+ GHRRSSSDSF
Subjt:  MATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLIEEQPSWLEDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTD-SANANFDSIVQEDFKYTNS--IPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLL
        AY D     + D  + +  +Y N+     H    +E D  R ++   FY  A+++KQK R W+S   +   P +     E+  +  SG     ++ +   
Subjt:  AYTD-SANANFDSIVQEDFKYTNS--IPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENIVVHTSGPLIPPQEADGLL

Query:  STESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLE
        S    K+D      +  K S E++D    KS+ S+ + KRA+QQFAQRSRVRK+QYIAELER VQ L                       ENK+LK RLE
Subjt:  STESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMENKALKQRLE

Query:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQ--------LRPSSSHRRTSSKDLDSQFANLSLK
        SLAQEQLIKYLE +VLE+EI RLRA++Q QQ Q+         + SSSH+R+ S+DL++QF NLSL+
Subjt:  SLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQ--------LRPSSSHRRTSSKDLDSQFANLSLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGATGTTTTTTGGACTGCTCTTTTATTGCTTCTTAACCTTAATCTGTTGTCCATGTTTATCTGTTTCTGAGATATTTCTCATCATGTTACAGGCAAAGGAATTTCT
CGAATTTTATACCATGGCAACTTCAAAAGGGTCATCCAACGTCAGAAATTTTATGAGTTCTGGAAAACATGCACTACTCCCTCCTAAAAGTCCTTTCCCTAGTGTTTCTC
CATCATATACTGAATATGTTCCCAATACTGCAATTGGAGCAAAAGCTGTTCAGAGACCAAGAGATGGTAACAGCTACCATCAAAGAACTTCTTCCGAAAGCTTTCTAATA
GAGGAGCAGCCTTCTTGGCTTGAAGATCTTCTTAATGAGCCGGAGACCCCTGTTCGCAGAGTTGGTCATCGACGTTCATCAAGTGACTCCTTTGCATATACAGATTCTGC
TAATGCTAATTTTGATAGTATCGTGCAAGAAGACTTCAAATATACAAATTCTATTCCTGGACACTCTTGGTTATCTCAAGAATTTGATCGTCAGAGAGATGCAAGGCATG
CTTCATTTTATACTGAAGCGAATGTGACAAAGCAGAAGAATAGGGTGTGGGAATCATCTTTGTCTGCCATGAATAATCCCATGGCTCTTCGTTCTCCCAGGGAGAACATT
GTTGTTCATACCTCGGGACCATTGATCCCTCCGCAGGAGGCAGATGGTTTGCTTTCTACAGAAAGTGAAAAACAGGATCCAACTGAGTCTGGTCCACACGATCCAAAAAT
CTCTTCTGAAAGGAAAGACATGTCTCATGGAAAATCATCCGGGTCGGATACCGAAAATAAACGTGCCAAACAGCAATTTGCTCAGCGTTCAAGGGTTCGGAAACTTCAAT
ATATAGCTGAGCTTGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGGCTCTGACGTCTCAGCTGAGCTTGAATTTCTCAACCAGCAAAATATTATTCTTAGCATGGAGAAC
AAAGCCCTCAAGCAGCGGTTAGAGAGTTTAGCTCAAGAGCAGCTAATTAAATACTTGGAGCAGGAAGTACTGGAGAGGGAGATTGGAAGGTTAAGAGCTGTGCATCAGCA
GCAACAACTGCAACAACTACGACCTTCTTCTAGTCATCGGCGTACTTCAAGCAAAGACCTTGACAGTCAATTTGCAAATCTTTCTTTGAAGCAAAAGGATTCTGGTTCAA
GTTGTGACCCAGTAACAGGTCCAGTGCGCAGTTAG
mRNA sequenceShow/hide mRNA sequence
GTAACAAGTTATTCTTGCATATCAGCGAAGAATTTAATTATACATGCTGATGTTTTTTGGACTGCTCTTTTATTGCTTCTTAACCTTAATCTGTTGTCCATGTTTATCTG
TTTCTGAGATATTTCTCATCATGTTACAGGCAAAGGAATTTCTCGAATTTTATACCATGGCAACTTCAAAAGGGTCATCCAACGTCAGAAATTTTATGAGTTCTGGAAAA
CATGCACTACTCCCTCCTAAAAGTCCTTTCCCTAGTGTTTCTCCATCATATACTGAATATGTTCCCAATACTGCAATTGGAGCAAAAGCTGTTCAGAGACCAAGAGATGG
TAACAGCTACCATCAAAGAACTTCTTCCGAAAGCTTTCTAATAGAGGAGCAGCCTTCTTGGCTTGAAGATCTTCTTAATGAGCCGGAGACCCCTGTTCGCAGAGTTGGTC
ATCGACGTTCATCAAGTGACTCCTTTGCATATACAGATTCTGCTAATGCTAATTTTGATAGTATCGTGCAAGAAGACTTCAAATATACAAATTCTATTCCTGGACACTCT
TGGTTATCTCAAGAATTTGATCGTCAGAGAGATGCAAGGCATGCTTCATTTTATACTGAAGCGAATGTGACAAAGCAGAAGAATAGGGTGTGGGAATCATCTTTGTCTGC
CATGAATAATCCCATGGCTCTTCGTTCTCCCAGGGAGAACATTGTTGTTCATACCTCGGGACCATTGATCCCTCCGCAGGAGGCAGATGGTTTGCTTTCTACAGAAAGTG
AAAAACAGGATCCAACTGAGTCTGGTCCACACGATCCAAAAATCTCTTCTGAAAGGAAAGACATGTCTCATGGAAAATCATCCGGGTCGGATACCGAAAATAAACGTGCC
AAACAGCAATTTGCTCAGCGTTCAAGGGTTCGGAAACTTCAATATATAGCTGAGCTTGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGGCTCTGACGTCTCAGCTGAGCT
TGAATTTCTCAACCAGCAAAATATTATTCTTAGCATGGAGAACAAAGCCCTCAAGCAGCGGTTAGAGAGTTTAGCTCAAGAGCAGCTAATTAAATACTTGGAGCAGGAAG
TACTGGAGAGGGAGATTGGAAGGTTAAGAGCTGTGCATCAGCAGCAACAACTGCAACAACTACGACCTTCTTCTAGTCATCGGCGTACTTCAAGCAAAGACCTTGACAGT
CAATTTGCAAATCTTTCTTTGAAGCAAAAGGATTCTGGTTCAAGTTGTGACCCAGTAACAGGTCCAGTGCGCAGTTAGGTTTCATGGTTGACTTCACGTGTTGTGCCTGG
GAACGTTCGCCAAATTGGTGAACAAAAATGATCGTCACTTCATGCAAGACAAGCTGATTGCTTCCAGTCTTGGTACCTCTCTATTTCTCTCAGGTCTCCTTTTAATTCTT
TGCCGCTGTTTCCTACCTTTTCCTGGAGTTAACATCTTCATAATGCATGCACCTGGTGGTCATTGCCGTTGTCTTGTTTGTTCAACATGAGGTATGTGGCCTACCGGAGG
TGGTATCTGTCTGTACTGTATGTCTAATCAATTATAATTTACACCACTTGTAAGCACCATTTTTTAACTTCCCTCCAAAAAAGTTCCGCTCCCACAGTGTGAACATCTTC
TGTGCTGTTTGGACATTCAGTACCTGTAATGCATCACGGTGAGAAATAATTACATGATTCCATTGAAATCTCTGATTTACGGATGGAATGCTTGCCTAATCTCTCTCCCT
CCATGTACTTCAGACTTGTTTTAACCATTTTTATGGCTGAGTTGATCGAAATATTTTGATGTTCATCCCGTTGGGTTATCGAGCAGTGTGGTAG
Protein sequenceShow/hide protein sequence
MLMFFGLLFYCFLTLICCPCLSVSEIFLIMLQAKEFLEFYTMATSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESFLI
EEQPSWLEDLLNEPETPVRRVGHRRSSSDSFAYTDSANANFDSIVQEDFKYTNSIPGHSWLSQEFDRQRDARHASFYTEANVTKQKNRVWESSLSAMNNPMALRSPRENI
VVHTSGPLIPPQEADGLLSTESEKQDPTESGPHDPKISSERKDMSHGKSSGSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSDVSAELEFLNQQNIILSMEN
KALKQRLESLAQEQLIKYLEQEVLEREIGRLRAVHQQQQLQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSCDPVTGPVRS