CuGenDBv2

Tan0019016 (gene) of Snake gourd v1 genome

Gene ID: Tan0019016
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Basic-leucine zipper (bZIP) transcription factor family protein
Genome location: LG01:7574365..7583133
RNA-Seq Expression: Tan0019016
Synteny: Tan0019016
Gene Ontology terms:
    GO:0006355 - regulation of transcription, DNA-templated (biological process)
    GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
    IPR004827 - Basic-leucine zipper domain
    IPR044759 - RF2-like transcription factor, bZIP domain
    IPR044797 - Uncharacterized protein At4g06598-like
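The genome location field above has the shape `seqid:start..end`. As a minimal sketch, the snippet below splits such a string into its parts and derives the span length. The names `Location` and `parse_location` are hypothetical helpers introduced only for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'LG01:7574365..7583133'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG01:7574365..7583133")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 8769 bp on LG01.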


Homology
GenBank top hits | e value | % identity | Alignment
KAG6576990.1 hypothetical protein SDJN03_24564, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.1e-190 | % identity: 94.44
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTE+VPNT IGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QE+FKY NAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWES LS MNNP+ALHSPRENIVIHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA

Query:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEK+DP ESGSHDPKVSSERKDVSHGKSS SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRA+HQQQH QQQQPQ LRPSSSHRR+SSKDLD+QFANLSLK+KDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS

XP_022136764.1 uncharacterized protein At4g06598-like [Momordica charantia] | e value: 3.4e-192 | % identity: 93.92
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVR+FM+SGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKA+QRPRDGN+YHQRTSSESILIEEQPSWLDDLLNEPETPVRR+GHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQEEFKYTN +PGHSWLSQEFDHQRDARHASFY EAN T+QKNRVWES LS M+NP ALHSPREN+VIHTSGPLSTPQEADGLPS+A
Subjt:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA

Query:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEK+DPTESGSHDPKVSSERKDV+HGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRL+NLA
Subjt:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR++HQQQHHQQQQPQ LRPSS+HRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS

XP_022922717.1 uncharacterized protein At4g06598-like isoform X1 [Cucurbita moschata] | e value: 2.7e-189 | % identity: 93.47
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTE+VPNT IGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QE+FKY NAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWES LS MNNP+ALHSPRENIVIHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA

Query:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEK+DP ESGSHDPKVSSERKDVSHGKSS SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRAVH-----QQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRA+H     QQQH QQQQPQ LRPSSSHRR+SSKDLD+QFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVH-----QQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS

XP_023552775.1 uncharacterized protein At4g06598-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 2.3e-188 | % identity: 91.49
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSN+RNFMSSGKHALLPPKSPFPS+SPSYTE+VPNT IGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QE+FKY NAIPGHSWLSQEFDHQRDARHASFYTEAN+TKQKNRVWES LS MNNP+ALHSPR NIVIHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA

Query:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEK+DP ESGSHDPKVSSERKDVSHGKSS SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRAVH----------QQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRA+H          QQQH QQQQPQQLRPSSSHRR+SSKDLD+QFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVH----------QQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS

XP_038896639.1 uncharacterized protein At4g06598 [Benincasa hispida] | e value: 4.1e-190 | % identity: 94.44
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSN+R+FMSSGKHALLPPKSPFPSVSPSYTEYVPNT IGAKAVQRPRDGNSYHQRTSSES LIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQEEF+Y NAIPGHSWL QEFDHQRDARHAS YTE NVTKQKNRVWES LS MNNP+ALHSPRENI IHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA

Query:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEK+DP ESGSHDPKVSS+RKD S GKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR VHQQQHHQQQQPQQLRPSSSHRRT SKDLD+QFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L9F3 BZIP domain-containing protein | e value: 1.4e-188 | % identity: 93.65
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTEYVPNT IGAKAVQRPRDGN YHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQEEF+Y NA+PGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWES LS MNNP+ALHSPRE I IHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA

Query:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEK+DP ESGSHDPKV+S+RKD SHGKS+VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEG+EVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR VH QQH QQQQPQQLRPSSSHRRTSSKDLD+QFANLSLKQKDSGSSRD VTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS

A0A1S3C2E7 uncharacterized protein At4g06598 | e value: 2.5e-188 | % identity: 94.18
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVR+FMSSGKHALLPPKSPFPSVSPSYTEYVPNT IGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDS MQEEF+Y NAIPGHSWLSQEFDHQRDARHASFYTE NVTKQKNRVWES LS MNNP+ LHSPRE I IHTSGPLSTPQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA

Query:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEK+DP ESGSHDPKV+S+RKD SHGKS+VSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
Subjt:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR VH QQH QQQQPQQLRPSSSHRRTSSKDLD+QFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS

A0A6J1C5A1 uncharacterized protein At4g06598-like | e value: 1.6e-192 | % identity: 93.92
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MANSKGSSNVR+FM+SGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKA+QRPRDGN+YHQRTSSESILIEEQPSWLDDLLNEPETPVRR+GHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSIMQEEFKYTN +PGHSWLSQEFDHQRDARHASFY EAN T+QKNRVWES LS M+NP ALHSPREN+VIHTSGPLSTPQEADGLPS+A
Subjt:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA

Query:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEK+DPTESGSHDPKVSSERKDV+HGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRL+NLA
Subjt:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLR++HQQQHHQQQQPQ LRPSS+HRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS

A0A6J1E439 uncharacterized protein At4g06598-like isoform X1 | e value: 1.3e-189 | % identity: 93.47
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTE+VPNT IGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QE+FKY NAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWES LS MNNP+ALHSPRENIVIHTSGPLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA

Query:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEK+DP ESGSHDPKVSSERKDVSHGKSS SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRAVH-----QQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRA+H     QQQH QQQQPQ LRPSSSHRR+SSKDLD+QFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAVH-----QQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS

A0A6J1J9S7 uncharacterized protein At4g06598-like isoform X1 | e value: 4.6e-187 | % identity: 92.67
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        M NSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTE+VPNT IGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA
        AYTDAANVNFDSI QE+FKY NAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWES LS MNNP+ALHSPRENIVIHTS PLS PQEADGLPSTA
Subjt:  AYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTA

Query:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA
        SEK+D  ESGSH+PKVSSERKD SHGKSS SDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQ+LILSMEN ALKQRLENLA
Subjt:  SEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLA

Query:  QEQLIKYLEQEVLEREIGRLRAV----HQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
        QEQLIKYLEQEVLEREIGRLRA+    HQQQH QQQQPQ LRPSSSHRR+SSKDLD+QFANLSLKQKDSGSSRDPVTGPVRS
Subjt:  QEQLIKYLEQEVLEREIGRLRAV----HQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS

SwissProt top hits | e value | % identity | Alignment
F4IN23 Basic leucine zipper 34 | e value: 6.1e-19 | % identity: 32.14
Query:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTE--------ANVTKQKNRV----
        + PSW+D+ L+   +  RR  HRRS SDS A+ +A  V+ +                     +FD   D +  S +T+        +++  + N V    
Subjt:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTE--------ANVTKQKNRV----

Query:  ----WESPLSAMNN-----PMALHSPRENIVIHTSGPLSTPQEADGLPSTASEKRDPTESGSH--DPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSR
              +P ++ N+     P + H+   NI  + +  + +  + +    TAS       SG+   DPK                  +   A +Q AQRSR
Subjt:  ----WESPLSAMNN-----PMALHSPRENIVIHTSGPLSTPQEADGLPSTASEKRDPTESGSH--DPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSR

Query:  VRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRAVHQQQ
        VRKLQYI+ELER V +LQAE S +S  + FL+ Q L+L+++N ALKQR+  L+Q++L K   QE L+REI RLR V+ QQ
Subjt:  VRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRAVHQQQ

Q5JMK6 Basic leucine zipper 6 | e value: 3.5e-14 | % identity: 56.04
Query:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRAVHQQQH
        A +Q AQRSRVRKLQYI+ELER V  LQ E S +S  + FL+QQ  IL++ N  LKQR+  LAQ+++ K   QE L +EI RLR V+QQQ+
Subjt:  AKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRAVHQQQH

Q6K3R9 Basic leucine zipper 19 | e value: 7.2e-12 | % identity: 32.92
Query:  RRVGHRRSSSDSFAYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPL
        RR  HRRS+SDS A+   A V  D ++         + G      EFD   D +  S +++           E+P  A+++      P     +   G  
Subjt:  RRVGHRRSSSDSFAYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPL

Query:  STPQEADGLPSTASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSM
              DG+ +T+     P  +G+     ++   D   G +     +   A +Q AQRSRVRKLQYI+ELER V  LQ E S +S  + FL+ Q  +L++
Subjt:  STPQEADGLPSTASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSM

Query:  ENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRAVHQQQ
         N  LKQR+  LAQ+++ K   QE L++EI RLR V+ QQ
Subjt:  ENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRAVHQQQ

Q8W3M7 Uncharacterized protein At4g06598 | e value: 1.8e-50 | % identity: 48.13
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N RN   +GK ALLPPKSPF        ++VP++ IG+KAVQ+  +GN+ H RTSSES LIEEQPSWLDDLLNEPETPVR+ GHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDA-ANVNFDSIMQEEFKY--TNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLP
        AY D     + D  + +  +Y   N    H    +E D+ R ++   FY  A+++KQK R W+S   +   P +     E+  I  SG   + ++ +   
Subjt:  AYTDA-ANVNFDSIMQEEFKY--TNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLP

Query:  STASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ
        S A  K+D   + +   K S E++D    KS+ S+ + KRA+QQFAQRSRVRK+QYIAELER VQ LQ
Subjt:  STASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQ

Q9M2K4 Basic leucine zipper 61 | e value: 4.4e-17 | % identity: 29.94
Query:  ALLPPKSPFPSVSPSYTEYVPN--TAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTD-----AANVNFDS
        A LPPK P    +P++ ++      +I A A      G              ++ PSW+D+ L+   T  RR  HRRS SDS A+ +       N +FD 
Subjt:  ALLPPKSPFPSVSPSYTEYVPN--TAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTD-----AANVNFDS

Query:  IMQEEF--KYTNAIPG-------HSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTASEK
           E+F   + + +         H  ++      R + + S  ++ N     +   E+P S  +     H    N+    +   +   E+D + S    K
Subjt:  IMQEEF--KYTNAIPG-------HSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTASEK

Query:  RDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQ
         +P +  S +        +  H    V   +   A +Q AQRSRVRKLQYI+ELER V +LQ E S +S  + FL+ Q L+L+++N A+KQR+  LAQ++
Subjt:  RDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQ

Query:  LIKYLEQEVLEREIGRLRAVHQQQ-------HHQQQQPQQLRPS
        + K   QE L+REI RLR V+ QQ       +   Q P  ++PS
Subjt:  LIKYLEQEVLEREIGRLRAVHQQQ-------HHQQQQPQQLRPS

Arabidopsis top hits | e value | % identity | Alignment
AT1G35490.1 bZIP family transcription factor | e value: 4.6e-30 | % identity: 32.33
Query:  MANSKGSSNVRNFMSSGK-HALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDS
        M N +  SN  NF   G+  + +P K+   ++SP      PN              N +H   S + +  E+QP+WLD+LL+EP +P    GHRRS+SD+
Subjt:  MANSKGSSNVRNFMSSGK-HALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDS

Query:  FAYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRV-WESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPS
         AY ++A      +M  +    N + G SW  Q +D            ++N  +Q N++ W+   S  N      + + N+        S P E      
Subjt:  FAYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRV-WESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPS

Query:  TASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLEN
          S+ ++ T +    P+               S T++KR K Q A R+R+R+L+YI++LER +Q LQ EG E+S+ + +L+QQ L+LSMEN+ALKQR+++
Subjt:  TASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLEN

Query:  LAQEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTS---SKDLDSQFANLSL
        LA+ Q +K++EQ++LEREIG L+    QQ  QQ Q Q     + + +     +++ D+QFA L++
Subjt:  LAQEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTS---SKDLDSQFANLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein | e value: 5.6e-92 | % identity: 55.59
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VRN M  GKHALLPPK PFPSVS SY+EY+P   IG++  Q+  +  ++HQRTSSES L+EE P WLDDLLNE PE+P R+ GHRRSSSDS
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDAAN-VNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPS
        +AY D AN  N    +Q +F Y N +       QE D  ++A+ A+FY+ A+  KQK+R  +S ++    P  L   REN      G L   Q+A  +  
Subjt:  FAYTDAAN-VNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPS

Query:  TASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE
         +SE+++  E  SHDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEGS+VSAEL+FLNQ+NLILSMENKALK+RLE
Subjt:  TASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVT
        ++AQE+LIK LEQEVLE+EIGRLRA++QQQ   Q      +PS+S  R +SKDLDSQF++LSL  KDS   RD V+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVT

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein | e value: 5.6e-92 | % identity: 55.59
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS
        MA+SKGS +VRN M  GKHALLPPK PFPSVS SY+EY+P   IG++  Q+  +  ++HQRTSSES L+EE P WLDDLLNE PE+P R+ GHRRSSSDS
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNE-PETPVRRVGHRRSSSDS

Query:  FAYTDAAN-VNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPS
        +AY D AN  N    +Q +F Y N +       QE D  ++A+ A+FY+ A+  KQK+R  +S ++    P  L   REN      G L   Q+A  +  
Subjt:  FAYTDAAN-VNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPS

Query:  TASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE
         +SE+++  E  SHDPK+ S  ++ S+      + +N KRAKQQFAQRSRVRKLQYI+ELER VQ LQAEGS+VSAEL+FLNQ+NLILSMENKALK+RLE
Subjt:  TASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTEN-KRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVT
        ++AQE+LIK LEQEVLE+EIGRLRA++QQQ   Q      +PS+S  R +SKDLDSQF++LSL  KDS   RD V+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRAVHQQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVT

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein | e value: 4.3e-20 | % identity: 32.14
Query:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTE--------ANVTKQKNRV----
        + PSW+D+ L+   +  RR  HRRS SDS A+ +A  V+ +                     +FD   D +  S +T+        +++  + N V    
Subjt:  EQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNFDSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTE--------ANVTKQKNRV----

Query:  ----WESPLSAMNN-----PMALHSPRENIVIHTSGPLSTPQEADGLPSTASEKRDPTESGSH--DPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSR
              +P ++ N+     P + H+   NI  + +  + +  + +    TAS       SG+   DPK                  +   A +Q AQRSR
Subjt:  ----WESPLSAMNN-----PMALHSPRENIVIHTSGPLSTPQEADGLPSTASEKRDPTESGSH--DPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSR

Query:  VRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRAVHQQQ
        VRKLQYI+ELER V +LQAE S +S  + FL+ Q L+L+++N ALKQR+  L+Q++L K   QE L+REI RLR V+ QQ
Subjt:  VRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRAVHQQQ

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2) | e value: 5.1e-69 | % identity: 47.96
Query:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF
        MA+SKGS N RN   +GK ALLPPKSPF        ++VP++ IG+KAVQ+  +GN+ H RTSSES LIEEQPSWLDDLLNEPETPVR+ GHRRSSSDSF
Subjt:  MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSF

Query:  AYTDA-ANVNFDSIMQEEFKY--TNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLP
        AY D     + D  + +  +Y   N    H    +E D+ R ++   FY  A+++KQK R W+S   +   P +     E+  I  SG   + ++ +   
Subjt:  AYTDA-ANVNFDSIMQEEFKY--TNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLP

Query:  STASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE
        S A  K+D   + +   K S E++D    KS+ S+ + KRA+QQFAQRSRVRK+QYIAELER VQ L                       ENK+LK RLE
Subjt:  STASEKRDPTESGSHDPKVSSERKDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLE

Query:  NLAQEQLIKYLEQEVLEREIGRLRAVH---QQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLK
        +LAQEQLIKYLE +VLE+EI RLRA++   QQQ  QQQ   + + SSSH+R+ S+DL++QF NLSL+
Subjt:  NLAQEQLIKYLEQEVLEREIGRLRAVH---QQQHHQQQQPQQLRPSSSHRRTSSKDLDSQFANLSLK


Sequences
CDS sequence
ATGGCAAATTCAAAAGGGTCATCCAACGTCAGAAATTTTATGAGTTCTGGAAAACATGCACTACTCCCTCCTAAAAGTCCCTTTCCTAGTGTTTCCCCATCATATACTGA
ATATGTTCCAAATACTGCAATCGGAGCAAAAGCTGTTCAGAGACCAAGAGATGGTAACAGCTACCATCAAAGAACTTCATCTGAAAGCATTCTAATAGAGGAGCAGCCTT
CTTGGCTTGATGATCTTCTCAATGAGCCAGAGACCCCTGTTCGCAGAGTTGGTCATCGACGTTCCTCAAGTGACTCCTTTGCATATACAGATGCTGCTAATGTTAATTTT
GATAGTATCATGCAAGAAGAATTCAAATATACAAATGCGATTCCTGGACACTCTTGGTTATCTCAAGAATTTGATCATCAGAGAGATGCAAGGCATGCTTCATTTTATAC
TGAAGCAAATGTGACAAAACAGAAGAATAGGGTGTGGGAATCACCTTTATCCGCCATGAATAATCCCATGGCTCTTCATTCTCCGAGGGAGAACATTGTTATTCATACCT
CAGGGCCATTAAGCACTCCACAGGAAGCAGATGGTTTGCCTTCTACAGCAAGTGAAAAACGGGATCCAACTGAGTCTGGTTCACACGATCCAAAAGTCTCTTCTGAAAGG
AAAGATGTTTCTCATGGAAAATCATCTGTGTCTGATACAGAAAATAAACGTGCCAAACAGCAATTTGCTCAGCGTTCAAGGGTTCGGAAACTTCAATATATAGCTGAGCT
TGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGGCTCTGAAGTCTCAGCGGAGCTTGAATTTCTCAACCAGCAAAATCTTATTCTTAGCATGGAAAACAAAGCCCTCAAGC
AGCGGTTAGAGAATTTAGCTCAAGAGCAGCTAATTAAATACTTGGAGCAGGAAGTACTGGAGAGGGAGATCGGAAGGTTAAGAGCTGTGCATCAGCAGCAACATCATCAG
CAGCAACAACCGCAACAACTGCGACCTTCTTCTAGTCATCGGCGTACTTCAAGCAAAGACCTTGACAGTCAATTTGCTAACCTTTCTTTGAAGCAAAAGGATTCTGGTTC
AAGTCGTGACCCAGTAACCGGTCCAGTGCGCAGTTAG
mRNA sequence
AATTGTTTAATTTAATTTTTCAATCCAACCCTCCTTGCACCTGCACCCACTTAATTTCAGTCTCAGACTTGGGAAGAGAGGAGAGAGAGAGAAAGAGAGAAAGAGAGAGA
GAGCTCAGCATCTTGAGAGGTGGGGTGGAAATACAAACCCTAGAGAGAAGGTGGGATTTTGAAATCATTTTCGCTTTGATTTCATTCCCTTTTCGCGTTACGCTTCGAAG
ATCTTCGTTCTTCCATCGCCACAAAGGAAAGGGAATTTCTCGTATTTCATACCATGGCAAATTCAAAAGGGTCATCCAACGTCAGAAATTTTATGAGTTCTGGAAAACAT
GCACTACTCCCTCCTAAAAGTCCCTTTCCTAGTGTTTCCCCATCATATACTGAATATGTTCCAAATACTGCAATCGGAGCAAAAGCTGTTCAGAGACCAAGAGATGGTAA
CAGCTACCATCAAAGAACTTCATCTGAAAGCATTCTAATAGAGGAGCAGCCTTCTTGGCTTGATGATCTTCTCAATGAGCCAGAGACCCCTGTTCGCAGAGTTGGTCATC
GACGTTCCTCAAGTGACTCCTTTGCATATACAGATGCTGCTAATGTTAATTTTGATAGTATCATGCAAGAAGAATTCAAATATACAAATGCGATTCCTGGACACTCTTGG
TTATCTCAAGAATTTGATCATCAGAGAGATGCAAGGCATGCTTCATTTTATACTGAAGCAAATGTGACAAAACAGAAGAATAGGGTGTGGGAATCACCTTTATCCGCCAT
GAATAATCCCATGGCTCTTCATTCTCCGAGGGAGAACATTGTTATTCATACCTCAGGGCCATTAAGCACTCCACAGGAAGCAGATGGTTTGCCTTCTACAGCAAGTGAAA
AACGGGATCCAACTGAGTCTGGTTCACACGATCCAAAAGTCTCTTCTGAAAGGAAAGATGTTTCTCATGGAAAATCATCTGTGTCTGATACAGAAAATAAACGTGCCAAA
CAGCAATTTGCTCAGCGTTCAAGGGTTCGGAAACTTCAATATATAGCTGAGCTTGAAAGGAAAGTACAAGCTTTGCAGGCAGAGGGCTCTGAAGTCTCAGCGGAGCTTGA
ATTTCTCAACCAGCAAAATCTTATTCTTAGCATGGAAAACAAAGCCCTCAAGCAGCGGTTAGAGAATTTAGCTCAAGAGCAGCTAATTAAATACTTGGAGCAGGAAGTAC
TGGAGAGGGAGATCGGAAGGTTAAGAGCTGTGCATCAGCAGCAACATCATCAGCAGCAACAACCGCAACAACTGCGACCTTCTTCTAGTCATCGGCGTACTTCAAGCAAA
GACCTTGACAGTCAATTTGCTAACCTTTCTTTGAAGCAAAAGGATTCTGGTTCAAGTCGTGACCCAGTAACCGGTCCAGTGCGCAGTTAGTTTCTCGTTTGGCTTCAAAT
GTGTTGTGCCTGGGTGATTTTTGCCAAATTGGTGAACGAAAAGGATCGTCCCTTTGTGCAAGACAAGCTGATCTCTTCCAGTCTTGGTACCTGTCTGTCTCTCTCCTCCT
CTCTGCCTTTTAATTCTTTGCTGCTGTTTCCTGCCTTTCCTGGAGTTAACATCTTCATATGCATGCACCTGGTGGTCATTGCCATTGTATTGTTTGTTCTACATGGGGTG
TGTGGCCTACCGGAGGTGGTATCTGTATGTACTGTATGTCTAATCAATTATATTTACACCAGTTGTAAGCACCATTTTTTTTAGTTTCATCCAAAAAGTTCCGCTCCAAC
AGTGTGAACATCTTCTGTGCTGTTTGGACGTTCAGTACCTGTAATGCATCATGGTGAGAAATAACTTATATGATTTCCAATGAAATCCCTGATTTACTGATGGAATGCTT
GCCTTCCTTCTCTCTCCTCCCTCCATGTACTTACACTTGTTTAAACTGTTTTTATGGTTGAGTTGATCTGACATTTTTTGATTTTCTTCCATTTGAGT
Protein sequence
MANSKGSSNVRNFMSSGKHALLPPKSPFPSVSPSYTEYVPNTAIGAKAVQRPRDGNSYHQRTSSESILIEEQPSWLDDLLNEPETPVRRVGHRRSSSDSFAYTDAANVNF
DSIMQEEFKYTNAIPGHSWLSQEFDHQRDARHASFYTEANVTKQKNRVWESPLSAMNNPMALHSPRENIVIHTSGPLSTPQEADGLPSTASEKRDPTESGSHDPKVSSER
KDVSHGKSSVSDTENKRAKQQFAQRSRVRKLQYIAELERKVQALQAEGSEVSAELEFLNQQNLILSMENKALKQRLENLAQEQLIKYLEQEVLEREIGRLRAVHQQQHHQ
QQQPQQLRPSSSHRRTSSKDLDSQFANLSLKQKDSGSSRDPVTGPVRS
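
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The sketch below is illustrative only: `translate` is a hypothetical helper, it assumes the standard genetic code (NCBI translation table 1), and it is applied here to just the first 30 and last 18 nucleotides of the CDS rather than to the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 18 nt of the CDS sequence listed on this page.
cds_start = "ATGGCAAATTCAAAAGGGTCATCCAACGTC"
cds_end = "GGTCCAGTGCGCAGTTAG"

print(translate(cds_start))  # MANSKGSSNV
print(translate(cds_end))    # GPVRS*
```

The two fragments translate to the first ten residues and the last five residues of the protein sequence, followed by the stop codon.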