; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021879 (gene) of Chayote v1 genome

Gene IDSed0021879
OrganismSechium edule (Chayote v1)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationLG05:35361619..35366068
RNA-Seq ExpressionSed0021879
SyntenySed0021879
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588408.1 bZIP transcription factor 18, partial [Cucurbita argyrosperma subsp. sororia]6.2e-25184.83Show/hide
Query:  DGTEDHTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSR
        D  + HTD+ RN+QC     SS+ VNHH SMDQLKI     SQGR QHF+ NF+GDN+RRIGIPP PNSSQIPPISPYSQIP SRPMN QSY PV THSR
Subjt:  DGTEDHTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSR

Query:  SLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGGSG
        SLSQP+FFSLDSLPPLSPS FRDSPST+NSDQVSADT+MEDRDASSHSLLPPSPYMR NSSKMGDALPPRKAHRRSNSDIPFG SSMIQSS   PF GSG
Subjt:  SLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGGSG

Query:  GFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNET
        G ERST+ KENAG+FKPA+QFVKRE SLEK+ DN+LEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAG  DKN  ENREDLDSRGSGTKTGGDSSDNE 
Subjt:  GFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNET

Query:  ESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAE
        ESSVNESGDN QM GL SSAEK+EG KRTAG DIAPTTRHYRS+SMDSFM  +QFGDESPKMPPTPPGVR GQ+SSNN+ DGNS  FSLEFGNGEFSGAE
Subjt:  ESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAE

Query:  LKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA
        LKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA
Subjt:  LKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA

Query:  LTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ
        LTAEVQRLKLATT+IN+QSHPSN +MAQPS NHHGLQLQ QQQQ HH Q     QNGN T KPESNQ
Subjt:  LTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ

XP_008448029.1 PREDICTED: probable transcription factor PosF21 [Cucumis melo]7.3e-25284.89Show/hide
Query:  MDGTED-HTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTH
        M  TED  TD++RN+QC     SSSA+ +H SMDQLKI     SQGR QHFQ NF+GDN+RRIGIPPCPNS Q+PPISPYSQIP SRPMNQQSY+ V TH
Subjt:  MDGTED-HTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGG
        SRSLSQPSFFSLDSLPPLSP+PFRDSPST+NSDQVSADT+MEDRDASSHSLLPPSPY R NSSKMGDALPPRKAHRRSNSDIPFGLSSMIQS    PF G
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGG

Query:  SGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDN
        SGG ERSTS KENAGIFK A+QFVKRE SLEK+IDN LEGMGE+KSEGDTVDDLFSAYMNLDNIDLFNS+G  DKN  ENREDLDSRGSGTKTGG+SSDN
Subjt:  SGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDN

Query:  ETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSG
        E ESSVNESGDN+QM GLNSSAEK+EG KRTAGGDIAP  RHYRS+SMDSFMG +QFGDESPKMPPTPPG+R GQ+SSNN+ DGNS  FSLEFGNGEFSG
Subjt:  ETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ
        EALTAEVQRLKLATTDINAQSHPSNG+MAQ S N HGLQLQ Q     HQQQQHM QNG+AT KPESNQ
Subjt:  EALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ

XP_022970655.1 uncharacterized protein LOC111469572 isoform X1 [Cucurbita maxima]3.6e-25185.09Show/hide
Query:  DGTEDHTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSR
        D  + HTD+ RNIQC     SS+ VNHH SMDQLKI     SQGR QHF+ NF+GDN+RRIGIPP PNS QIPPISPYSQIP SRPMNQQSY PV THSR
Subjt:  DGTEDHTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSR

Query:  SLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF---GGSG
        SLSQP+FFSLDSLPPLSPSPFRDSPST+NSDQVSADT+MEDRDASSHSLLPPSPYMR NSSKMGDALPPRKAHRRSNSDIPFG SSMIQSSP     GSG
Subjt:  SLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF---GGSG

Query:  GFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNET
        G ERST+ KENAG+FKPANQFVKRE SLEK+ DN+LEGMGERKSEGDTVDDLFSAYMNLDNIDLFNS G  DKN  ENREDLDSRGSGTKTGGDSSDNE 
Subjt:  GFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNET

Query:  ESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAE
        ESSVNESGDN QM GL SSAEK+EG KRTAG DIAPTTRHYRS+SMDSFM  +QFGDESPKMPPTPPGV  GQ+SSNN+ DGNS  FSLEFGNGEFSGAE
Subjt:  ESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAE

Query:  LKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA
        LKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA
Subjt:  LKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA

Query:  LTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQ--QQQHHHQQQQHMH-QNGNATKKPESNQ
        LTAEVQRLKLATT+INAQSHPSN +MAQPS NHHGLQLQ Q  QQQHHHQQQQ M  QNG+   KPESNQ
Subjt:  LTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQ--QQQHHHQQQQHMH-QNGNATKKPESNQ

XP_023531256.1 uncharacterized protein LOC111793556 [Cucurbita pepo subsp. pepo]5.6e-25284.83Show/hide
Query:  DGTEDHTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSR
        D  + HTD+ RN+QC     SS+ VNHH SMDQLKI     SQGR QHF+ NF+GDN+RRIGIPP PNSSQIPPISPYSQIP SRPMNQQSY PV THSR
Subjt:  DGTEDHTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSR

Query:  SLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGGSG
        SLSQP+FFSLDSLPPLSPSPFRDSPST+NSDQVSADT+MEDRDASSHSLLPPSPYMR NSSKMGDALPPRKAHRRSNSDIPFG SSMIQSS   PF GSG
Subjt:  SLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGGSG

Query:  GFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNET
        G ERST+ KENAG+FKPA+QFVKRE SLEK+ DN+LEGMGERKSEGDTVDDLFSAYMNLDNIDLFNS G  DKN  ENREDLDSRGSGTKTGGDSSDNE 
Subjt:  GFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNET

Query:  ESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAE
        ESSVNESGDN QM GL SSAEK+EG KRTAG DIAPTTRHYRS+SMDSFM  +QF DESPKMPPTPPGVR GQ+SSNN+ DGNS  FSLEFGNGEFSGAE
Subjt:  ESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAE

Query:  LKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA
        LKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA
Subjt:  LKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA

Query:  LTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ
        LTAEVQRLKLATT+IN+QSHPSN +MAQPS N HGLQLQ QQQQ   QQQ H+ QNGN T KPESNQ
Subjt:  LTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ

XP_038887946.1 probable serine/threonine-protein kinase tsuA [Benincasa hispida]1.6e-25485.07Show/hide
Query:  MDGTED-HTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTH
        M  TED  TD++RN+QC     SSSA+ HH SMDQLKI     SQ R QHFQ NF+GDN+RRIGIPPCPNS QIPPISPYSQIP SRPMNQQSY+ V TH
Subjt:  MDGTED-HTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGG
        SRSLSQPSFFSLDSLPPLSPSPFRDSPST+NSDQVSADT+MEDRDASSHSLLPPSPY R NSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS   PF G
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGG

Query:  SGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDN
        S G ERSTS KENA IFKPA+QFVKREHSLEK+IDN+LEGMGE+KSEGDTVDDLF+AYMNLDNIDLFNS+G  DKN  ENREDLDSRGSGTKTGG+SSDN
Subjt:  SGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDN

Query:  ETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSG
        E ESSVNESGDN+QM GL+SSAEK+EG KRTAGGDIAP  RHYRS+SMDSFMG +QFG+ESPKMPPTPPG+R GQ+SSNN+VDGNS  FSLEFGNGEFSG
Subjt:  ETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQ-----HHH--QQQQHMHQNGNATKKPESNQ
        EALTAEVQRLKLATTDINAQSHPSNG+MAQ S NHHGLQLQLQQQQ     HHH  QQQQ M QNG+AT KPESNQ
Subjt:  EALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQ-----HHH--QQQQHMHQNGNATKKPESNQ

TrEMBL top hitse value%identityAlignment
A0A0A0K0G6 BZIP domain-containing protein6.7e-25184.53Show/hide
Query:  MDGTED-HTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTH
        M  TED  TD++RN+QC     SSSA+ HH SMDQLKI     SQGR QHFQ NF+GDN+RRIGIPPCPNS Q+PPISPYSQIP SRPMNQ SY+ V TH
Subjt:  MDGTED-HTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGG
        SRSLSQPSFFSLDSLPPLSPSPFRDSPST+NSDQVSADT+MEDRDASSHSLLPPSPY R NSSKM DALPPRKAHRRSNSDIPFGLSSMIQS    PF G
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGG

Query:  SGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDN
        SGG ERSTS KENAGIFK A+QFVKRE SLEK+IDN +EGMGE+KSEGDTVDDLFSAYMNLDNIDLFNS+   DKN  ENREDLDSRGSGTKTGG+SSDN
Subjt:  SGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDN

Query:  ETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSG
        E ESSVNESGDN+QM GLNSSAEK+EG KRTAGGDIAP  RHYRS+SMDSFMG +QFGDESPKMPPTPPG+R GQ+SSNN+VDGNS  FSLEFGNGEFSG
Subjt:  ETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ
        EALTAEVQRLKLATTDINAQSHPSNG+MAQ S NHHGLQLQ       HQQQQHM QNG+A  KPESNQ
Subjt:  EALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ

A0A1S3BIS8 probable transcription factor PosF213.5e-25284.89Show/hide
Query:  MDGTED-HTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTH
        M  TED  TD++RN+QC     SSSA+ +H SMDQLKI     SQGR QHFQ NF+GDN+RRIGIPPCPNS Q+PPISPYSQIP SRPMNQQSY+ V TH
Subjt:  MDGTED-HTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGG
        SRSLSQPSFFSLDSLPPLSP+PFRDSPST+NSDQVSADT+MEDRDASSHSLLPPSPY R NSSKMGDALPPRKAHRRSNSDIPFGLSSMIQS    PF G
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGG

Query:  SGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDN
        SGG ERSTS KENAGIFK A+QFVKRE SLEK+IDN LEGMGE+KSEGDTVDDLFSAYMNLDNIDLFNS+G  DKN  ENREDLDSRGSGTKTGG+SSDN
Subjt:  SGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDN

Query:  ETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSG
        E ESSVNESGDN+QM GLNSSAEK+EG KRTAGGDIAP  RHYRS+SMDSFMG +QFGDESPKMPPTPPG+R GQ+SSNN+ DGNS  FSLEFGNGEFSG
Subjt:  ETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ
        EALTAEVQRLKLATTDINAQSHPSNG+MAQ S N HGLQLQ Q     HQQQQHM QNG+AT KPESNQ
Subjt:  EALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ

A0A6J1C4L8 uncharacterized protein LOC111007873 isoform X12.0e-24784.76Show/hide
Query:  MDGTED-HTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTH
        M  TED  TD  RN+QC     SSSAV HH SMDQLK+     SQ R QHFQ NF+G+N+RRIGIPP PNS+QIPPISPYSQIP SRPMNQQS++PV TH
Subjt:  MDGTED-HTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGG
        SRSLSQPSFFSLDSLPPLSPSPFRDSPST+NSDQVSADT+MEDRDASSHSLLPPSPYMR NSSK+GDALPPRKAHRRSNSDIPFGLSSMIQSS   PF G
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGG

Query:  SGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKT-GGDSSD
        SGG ERSTS KENAG+ +PA+QFVKRE SLEK++DN+LEGMGERKSEG+TVDDLFSAYMNLDNIDLFNS+G  DKN  E+REDLDSRGSGTKT GGDSSD
Subjt:  SGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKT-GGDSSD

Query:  NETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFS
        NE ESSVNESGDN+Q+ GL SSAEK+EG KRTAGGDIAPTTRHYRS+SMDSFMG +QFGDESPKMPPTPPG+RSGQISSNN+VDGNS  FSLEFGNGEFS
Subjt:  NETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFS

Query:  GAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDAL
        GAELKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDAL
Subjt:  GAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDAL

Query:  NEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHM-HQNGNATKKPESNQ
        NEALTAEVQRLKLATTDINAQSHPSNG+M     NHHGLQLQLQQQ    QQQQHM  QNG+AT KPESNQ
Subjt:  NEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHM-HQNGNATKKPESNQ

A0A6J1EWE2 uncharacterized protein LOC1114385352.5e-25084.3Show/hide
Query:  DGTEDHTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSR
        D  + HTD+ RN+QC     SS+ VNHH SMDQLKI     SQGR QHF+ NF+GDN+RRIGIPP PNSSQIPPISPYSQIP SRPMNQQSY PV THSR
Subjt:  DGTEDHTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSR

Query:  SLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGGSG
        SLSQP+FFSLDSLPPLSPSPFRDSPST+NSDQVSADT+MEDRDASSHSLLPPSPYMR NSSKMGDALPPRKAHRRSNSDIPFG SSMIQSS   PF GSG
Subjt:  SLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSS---PFGGSG

Query:  GFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNET
        G ERST+ KENAG+FKPA+QFVKRE SLEK+ DN+LEGMGERKSEGDTVDDLFSAYMNLDNIDLFNS G  DKN  ENREDLDSRGSGTKTGGDSSDNE 
Subjt:  GFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNET

Query:  ESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAE
        ESSVNESGDN QM GL SSAEK+EG KRTAG DIAPTTRHYRS+SMDSFM  +QFGDESPKMPPTPPGVR GQ+SSNN+ DGNS  FSLEFGNGEFSGAE
Subjt:  ESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAE

Query:  LKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA
        LKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA
Subjt:  LKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA

Query:  LTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ
        LTAEVQRLKLATT+IN+QSHPSN +MAQPS NHHGLQL         QQQ H+ QNGN T KPESNQ
Subjt:  LTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESNQ

A0A6J1I3G1 uncharacterized protein LOC111469572 isoform X11.8e-25185.09Show/hide
Query:  DGTEDHTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSR
        D  + HTD+ RNIQC     SS+ VNHH SMDQLKI     SQGR QHF+ NF+GDN+RRIGIPP PNS QIPPISPYSQIP SRPMNQQSY PV THSR
Subjt:  DGTEDHTDSIRNIQC-----SSSAVNHHLSMDQLKI-----SQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSR

Query:  SLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF---GGSG
        SLSQP+FFSLDSLPPLSPSPFRDSPST+NSDQVSADT+MEDRDASSHSLLPPSPYMR NSSKMGDALPPRKAHRRSNSDIPFG SSMIQSSP     GSG
Subjt:  SLSQPSFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF---GGSG

Query:  GFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNET
        G ERST+ KENAG+FKPANQFVKRE SLEK+ DN+LEGMGERKSEGDTVDDLFSAYMNLDNIDLFNS G  DKN  ENREDLDSRGSGTKTGGDSSDNE 
Subjt:  GFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNET

Query:  ESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAE
        ESSVNESGDN QM GL SSAEK+EG KRTAG DIAPTTRHYRS+SMDSFM  +QFGDESPKMPPTPPGV  GQ+SSNN+ DGNS  FSLEFGNGEFSGAE
Subjt:  ESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAE

Query:  LKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA
        LKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA
Subjt:  LKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEA

Query:  LTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQ--QQQHHHQQQQHMH-QNGNATKKPESNQ
        LTAEVQRLKLATT+INAQSHPSN +MAQPS NHHGLQLQ Q  QQQHHHQQQQ M  QNG+   KPESNQ
Subjt:  LTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQ--QQQHHHQQQQHMH-QNGNATKKPESNQ

SwissProt top hitse value%identityAlignment
O22873 bZIP transcription factor 181.1e-3754.31Show/hide
Query:  GDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTE
        G  +P+    P    +G   + N    + +S S++ G+      E KK MA DKLAE+ ++DPKRAKRI+ANRQSAARSKERK RYI ELE KVQTLQTE
Subjt:  GDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTE

Query:  ATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQ--QHHHQQQ
        ATTLSAQL+L QRD+ GL+++N ELK RLQ MEQQA+LRDALNE L  EV+RLK AT +++     + G+       H   Q Q QQ   QHHHQQQ
Subjt:  ATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQ--QHHHQQQ

Q04088 Probable transcription factor PosF212.1e-3656.7Show/hide
Query:  VDGNSN-SFSLEFGNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNN
        +DG+ N +  L  GN + S  + KK M+  KLAE+ALIDPKRAKRI ANRQSAARSKERK RYI ELE KVQTLQTEATTLSAQLTLLQRD+ GLT +NN
Subjt:  VDGNSN-SFSLEFGNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNN

Query:  ELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDI------------NAQSHPSNG-----IMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQ
        ELK RLQ MEQQ  L+D LNEAL  E+Q LK+ T  +            N Q   SN      I+A        +  Q QQQQ   QQQQH  Q
Subjt:  ELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDI------------NAQSHPSNG-----IMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQ

Q69IL4 Transcription factor RF2a1.3e-3342.63Show/hide
Query:  EDLDSRGSGTKTG---GDSSDNETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTT--------RHYRSLSMDSFMGNMQFGDESPKMPPTPPG
        EDLD   +G   G    D +D E  S   +        G +S AE +  +   A    A           +H  SLSMD  M                  
Subjt:  EDLDSRGSGTKTG---GDSSDNETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTT--------RHYRSLSMDSFMGNMQFGDESPKMPPTPPG

Query:  VRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQR
             I +  +V  +        G    S AE KK ++  KLAE+AL+DPKRAKRI ANRQSAARSKERKMRYI+ELE KVQTLQTEATTLSAQL LLQR
Subjt:  VRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQR

Query:  DSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDI-------------------NAQSHPSNGIMAQPSTNHHGLQLQL----QQQQ-
        D+ GLT +N+ELK RLQ MEQQ  L+DALN+ L +EVQRLK+AT  +                   N Q   +N  M      H   QLQL    QQQQ 
Subjt:  DSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDI-------------------NAQSHPSNGIMAQPSTNHHGLQLQL----QQQQ-

Query:  --HHHQQQQHMH
            HQQQQ +H
Subjt:  --HHHQQQQHMH

Q8H1F0 bZIP transcription factor 296.7e-12354.06Show/hide
Query:  DGTEDHTDSIRNIQCSSSAVNHHL---SMDQLKISQGRTQHFQPNF---VGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSP-VTTHSRSLS
        D  + ++D I+ +  S    +  +    + QL ++    +   P F     D+ +RIG+PP  + + IPP SP+SQIP +R     +++P    HSRS+S
Subjt:  DGTEDHTDSIRNIQCSSSAVNHHL---SMDQLKISQGRTQHFQPNF---VGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSP-VTTHSRSLS

Query:  QP-SFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDA----SSHSLLPPSPYMRTNSS-----KMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF
        QP SFFS DSLPPLSPSPFRD            D +MEDRD+    S+HS LPPSP+ R NS+     ++G++LPPRK+HRRSNSDIP G +SM    P 
Subjt:  QP-SFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDA----SSHSLLPPSPYMRTNSS-----KMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF

Query:  GGSGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTKTGGD
              ERS S  E A  +  +N FVK+E S E+      EG+GER    + +DDLFSAYMNL+NID+ NS+ A D KN  ENR+D++ SR SGTKT G 
Subjt:  GGSGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTKTGGD

Query:  SSDNETESSVNESGDNTQMAGLNSSAEKKEG-NKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSN-SFSLEF
         ++ E+ SSVNES +N     +NSS EK+E   +R AGGDIAPTTRHYRS+S+DS FM  + FGDES K PP+ PG  S ++S  N VDGNS  +FS+EF
Subjt:  SSDNETESSVNESGDNTQMAGLNSSAEKKEG-NKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSN-SFSLEF

Query:  GNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQA
         NGEF+ AE+KKIMANDKLAE+A+ DPKR KRILANRQSAARSKERKMRYI ELEHKVQTLQTEATTLSAQLTLLQRD +GLTNQNNELKFRLQAMEQQA
Subjt:  GNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQA

Query:  QLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQL-QLQQQQHHHQQQQHM--HQNGNATKKPESNQ
        +LRDALNEAL  EVQRLKLA  + +      + + +  +     L + QL+QQ    QQQ H   HQNG    K ESN+
Subjt:  QLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQL-QLQQQQHHHQQQQHM--HQNGNATKKPESNQ

Q9SIG8 bZIP transcription factor 304.8e-9750.45Show/hide
Query:  SSSAVNHHLSMDQLKISQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSRSLSQP-SFFSLDSLPPLSPSPFRDS
        SSS   H+L ++   I      HF+  F        G PP P    IPPISPYSQIP +             HSRS+SQP SFFS DSLPPL+PS     
Subjt:  SSSAVNHHLSMDQLKISQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSRSLSQP-SFFSLDSLPPLSPSPFRDS

Query:  PSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKM-----GDALPPRKAHRRSNSDIPFGLSSMI-QSSPFGGSGGFERSTSCKENAGIFKPANQF
           A S  VS +   E   A     LPPSP+   +SS       G+ LPPRK+HRRSNSD+ FG SSM+ Q+         ERS S ++ +      +  
Subjt:  PSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKM-----GDALPPRKAHRRSNSDIPFGLSSMI-QSSPFGGSGGFERSTSCKENAGIFKPANQF

Query:  VKREHSLEKNIDNSLEGM--GERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTK--TGGDSSDNETESSVNESGDNTQMAG
        VK+E           EG   G +      +DD+F+AYMNLDNID+ NS G  D KN  EN E+++ SRGSGTK   GG SSD+E +SS   +  N ++A 
Subjt:  VKREHSLEKNIDNSLEGM--GERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTK--TGGDSSDNETESSVNESGDNTQMAG

Query:  LNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESP-KMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEI
         +SS+    G KR AGGDIAPT RHYRS+SMDS FMG + FGDES  K+PP+     S ++S  N  +GNS+++S+EFGN EF+ AE+KKI A++KLAEI
Subjt:  LNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESP-KMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEI

Query:  ALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATT
         + DPKR KRILANR SAARSKERK RY++ELEHKVQTLQTEATTLSAQLT LQRDS+GLTNQN+ELKFRLQAMEQQAQLRDAL+E L  EVQRLKL   
Subjt:  ALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATT

Query:  DINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESN
        + N +   S+   ++ S N    Q QL   Q  HQQ QH +Q      K  SN
Subjt:  DINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESN

Arabidopsis top hitse value%identityAlignment
AT2G21230.1 Basic-leucine zipper (bZIP) transcription factor family protein3.4e-9850.45Show/hide
Query:  SSSAVNHHLSMDQLKISQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSRSLSQP-SFFSLDSLPPLSPSPFRDS
        SSS   H+L ++   I      HF+  F        G PP P    IPPISPYSQIP +             HSRS+SQP SFFS DSLPPL+PS     
Subjt:  SSSAVNHHLSMDQLKISQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSRSLSQP-SFFSLDSLPPLSPSPFRDS

Query:  PSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKM-----GDALPPRKAHRRSNSDIPFGLSSMI-QSSPFGGSGGFERSTSCKENAGIFKPANQF
           A S  VS +   E   A     LPPSP+   +SS       G+ LPPRK+HRRSNSD+ FG SSM+ Q+         ERS S ++ +      +  
Subjt:  PSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKM-----GDALPPRKAHRRSNSDIPFGLSSMI-QSSPFGGSGGFERSTSCKENAGIFKPANQF

Query:  VKREHSLEKNIDNSLEGM--GERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTK--TGGDSSDNETESSVNESGDNTQMAG
        VK+E           EG   G +      +DD+F+AYMNLDNID+ NS G  D KN  EN E+++ SRGSGTK   GG SSD+E +SS   +  N ++A 
Subjt:  VKREHSLEKNIDNSLEGM--GERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTK--TGGDSSDNETESSVNESGDNTQMAG

Query:  LNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESP-KMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEI
         +SS+    G KR AGGDIAPT RHYRS+SMDS FMG + FGDES  K+PP+     S ++S  N  +GNS+++S+EFGN EF+ AE+KKI A++KLAEI
Subjt:  LNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESP-KMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEI

Query:  ALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATT
         + DPKR KRILANR SAARSKERK RY++ELEHKVQTLQTEATTLSAQLT LQRDS+GLTNQN+ELKFRLQAMEQQAQLRDAL+E L  EVQRLKL   
Subjt:  ALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATT

Query:  DINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESN
        + N +   S+   ++ S N    Q QL   Q  HQQ QH +Q      K  SN
Subjt:  DINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESN

AT2G21230.3 Basic-leucine zipper (bZIP) transcription factor family protein9.3e-9649.73Show/hide
Query:  SSSAVNHHLSMDQLKISQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSRSLSQP-SFFSLDSLPPLSPSPFRDS
        SSS   H+L ++   I      HF+  F        G PP P    IPPISPYSQIP +             HSRS+SQP SFFS DSLPPL+PS     
Subjt:  SSSAVNHHLSMDQLKISQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSRSLSQP-SFFSLDSLPPLSPSPFRDS

Query:  PSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKM-----GDALPPRKAHRRSNSDIPFGLSSMI-QSSPFGGSGGFERSTSCKENAGIFKPANQF
           A S  VS +   E   A     LPPSP+   +SS       G+ LPPRK+HRRSNSD+ FG SSM+ Q+         ERS S ++ +      +  
Subjt:  PSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKM-----GDALPPRKAHRRSNSDIPFGLSSMI-QSSPFGGSGGFERSTSCKENAGIFKPANQF

Query:  VKREHSLEKNIDNSLEGM--GERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTK--TGGDSSDNETESSVNESGDNTQMAG
        VK+E           EG   G +      +DD+F+AYMNLDNID+ NS G  D KN  EN E+++ SRGSGTK   GG SSD+E +SS   +  N ++A 
Subjt:  VKREHSLEKNIDNSLEGM--GERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTK--TGGDSSDNETESSVNESGDNTQMAG

Query:  LNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESP-KMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEI
         +SS+    G KR AGGDIAPT RHYRS+SMDS FMG + FGDES  K+PP+     S ++S  N  +GNS+++S+EFGN EF+ AE+KKI A++KLAEI
Subjt:  LNSSAEKKEGNKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESP-KMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEI

Query:  ALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRD------ALNEALTAEVQR
         + DPKR KRILANR SAARSKERK RY++ELEHKVQTLQTEATTLSAQLT LQRDS+GLTNQN+ELKFRLQAMEQQAQLRD       L+E L  EVQR
Subjt:  ALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRD------ALNEALTAEVQR

Query:  LKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESN
        LKL   + N +   S+   ++ S N    Q QL   Q  HQQ QH +Q      K  SN
Subjt:  LKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKKPESN

AT4G38900.1 Basic-leucine zipper (bZIP) transcription factor family protein4.4e-12253.5Show/hide
Query:  DGTEDHTDSIRNIQCSSSAVNHHL---SMDQLKISQGRTQHFQPNF---VGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSP-VTTHSRSLS
        D  + ++D I+ +  S    +  +    + QL ++    +   P F     D+ +RIG+PP  + + IPP SP+SQIP +R     +++P    HSRS+S
Subjt:  DGTEDHTDSIRNIQCSSSAVNHHL---SMDQLKISQGRTQHFQPNF---VGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSP-VTTHSRSLS

Query:  QP-SFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDA----SSHSLLPPSPYMRTNSS-----KMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF
        QP SFFS DSLPPLSPSPFRD            D +MEDRD+    S+HS LPPSP+ R NS+     ++G++LPPRK+HRRSNSDIP G +SM    P 
Subjt:  QP-SFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDA----SSHSLLPPSPYMRTNSS-----KMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF

Query:  GGSGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTKTGGD
              ERS S  E A  +  +N FVK+E S E+      EG+GER    + +DDLFSAYMNL+NID+ NS+ A D KN  ENR+D++ SR SGTKT G 
Subjt:  GGSGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTKTGGD

Query:  SSDNETESSVNESGDNTQMAGLNSSAEKKEG-NKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSN-SFSLEF
         ++ E+ SSVNES +N     +NSS EK+E   +R AGGDIAPTTRHYRS+S+DS FM  + FGDES K PP+ PG  S ++S  N VDGNS  +FS+EF
Subjt:  SSDNETESSVNESGDNTQMAGLNSSAEKKEG-NKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSN-SFSLEF

Query:  GNGEFSGAELKKIMANDKLAEIALIDPKRAK------RILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQ
         NGEF+ AE+KKIMANDKLAE+A+ DPKR K      RILANRQSAARSKERKMRYI ELEHKVQTLQTEATTLSAQLTLLQRD +GLTNQNNELKFRLQ
Subjt:  GNGEFSGAELKKIMANDKLAEIALIDPKRAK------RILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQ

Query:  AMEQQAQLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQL-QLQQQQHHHQQQQHM--HQNGNATKKPESNQ
        AMEQQA+LRDALNEAL  EVQRLKLA  + +      + + +  +     L + QL+QQ    QQQ H   HQNG    K ESN+
Subjt:  AMEQQAQLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQL-QLQQQQHHHQQQQHM--HQNGNATKKPESNQ

AT4G38900.2 Basic-leucine zipper (bZIP) transcription factor family protein4.7e-12454.06Show/hide
Query:  DGTEDHTDSIRNIQCSSSAVNHHL---SMDQLKISQGRTQHFQPNF---VGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSP-VTTHSRSLS
        D  + ++D I+ +  S    +  +    + QL ++    +   P F     D+ +RIG+PP  + + IPP SP+SQIP +R     +++P    HSRS+S
Subjt:  DGTEDHTDSIRNIQCSSSAVNHHL---SMDQLKISQGRTQHFQPNF---VGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSP-VTTHSRSLS

Query:  QP-SFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDA----SSHSLLPPSPYMRTNSS-----KMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF
        QP SFFS DSLPPLSPSPFRD            D +MEDRD+    S+HS LPPSP+ R NS+     ++G++LPPRK+HRRSNSDIP G +SM    P 
Subjt:  QP-SFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDA----SSHSLLPPSPYMRTNSS-----KMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF

Query:  GGSGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTKTGGD
              ERS S  E A  +  +N FVK+E S E+      EG+GER    + +DDLFSAYMNL+NID+ NS+ A D KN  ENR+D++ SR SGTKT G 
Subjt:  GGSGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTKTGGD

Query:  SSDNETESSVNESGDNTQMAGLNSSAEKKEG-NKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSN-SFSLEF
         ++ E+ SSVNES +N     +NSS EK+E   +R AGGDIAPTTRHYRS+S+DS FM  + FGDES K PP+ PG  S ++S  N VDGNS  +FS+EF
Subjt:  SSDNETESSVNESGDNTQMAGLNSSAEKKEG-NKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSN-SFSLEF

Query:  GNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQA
         NGEF+ AE+KKIMANDKLAE+A+ DPKR KRILANRQSAARSKERKMRYI ELEHKVQTLQTEATTLSAQLTLLQRD +GLTNQNNELKFRLQAMEQQA
Subjt:  GNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQA

Query:  QLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQL-QLQQQQHHHQQQQHM--HQNGNATKKPESNQ
        +LRDALNEAL  EVQRLKLA  + +      + + +  +     L + QL+QQ    QQQ H   HQNG    K ESN+
Subjt:  QLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQL-QLQQQQHHHQQQQHM--HQNGNATKKPESNQ

AT4G38900.3 Basic-leucine zipper (bZIP) transcription factor family protein4.7e-12454.06Show/hide
Query:  DGTEDHTDSIRNIQCSSSAVNHHL---SMDQLKISQGRTQHFQPNF---VGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSP-VTTHSRSLS
        D  + ++D I+ +  S    +  +    + QL ++    +   P F     D+ +RIG+PP  + + IPP SP+SQIP +R     +++P    HSRS+S
Subjt:  DGTEDHTDSIRNIQCSSSAVNHHL---SMDQLKISQGRTQHFQPNF---VGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSP-VTTHSRSLS

Query:  QP-SFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDA----SSHSLLPPSPYMRTNSS-----KMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF
        QP SFFS DSLPPLSPSPFRD            D +MEDRD+    S+HS LPPSP+ R NS+     ++G++LPPRK+HRRSNSDIP G +SM    P 
Subjt:  QP-SFFSLDSLPPLSPSPFRDSPSTANSDQVSADTAMEDRDA----SSHSLLPPSPYMRTNSS-----KMGDALPPRKAHRRSNSDIPFGLSSMIQSSPF

Query:  GGSGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTKTGGD
              ERS S  E A  +  +N FVK+E S E+      EG+GER    + +DDLFSAYMNL+NID+ NS+ A D KN  ENR+D++ SR SGTKT G 
Subjt:  GGSGGFERSTSCKENAGIFKPANQFVKREHSLEKNIDNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATD-KNDPENREDLD-SRGSGTKTGGD

Query:  SSDNETESSVNESGDNTQMAGLNSSAEKKEG-NKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSN-SFSLEF
         ++ E+ SSVNES +N     +NSS EK+E   +R AGGDIAPTTRHYRS+S+DS FM  + FGDES K PP+ PG  S ++S  N VDGNS  +FS+EF
Subjt:  SSDNETESSVNESGDNTQMAGLNSSAEKKEG-NKRTAGGDIAPTTRHYRSLSMDS-FMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSN-SFSLEF

Query:  GNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQA
         NGEF+ AE+KKIMANDKLAE+A+ DPKR KRILANRQSAARSKERKMRYI ELEHKVQTLQTEATTLSAQLTLLQRD +GLTNQNNELKFRLQAMEQQA
Subjt:  GNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQA

Query:  QLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQL-QLQQQQHHHQQQQHM--HQNGNATKKPESNQ
        +LRDALNEAL  EVQRLKLA  + +      + + +  +     L + QL+QQ    QQQ H   HQNG    K ESN+
Subjt:  QLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQL-QLQQQQHHHQQQQHM--HQNGNATKKPESNQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGAACTGAAGATCATACTGATAGCATTCGAAATATTCAATGTTCATCTTCGGCTGTAAATCATCACTTATCGATGGATCAGCTAAAAATTTCTCAAGGCCGTAC
ACAGCATTTTCAGCCAAACTTTGTCGGAGATAATAGTAGAAGAATTGGGATACCGCCTTGTCCCAACTCATCGCAGATCCCTCCAATCTCACCGTATTCTCAGATTCCTA
AGTCGCGTCCGATGAACCAGCAAAGTTATAGCCCAGTTACTACTCATTCTCGATCGTTATCGCAGCCTTCATTTTTCTCTCTTGATTCTTTGCCCCCTTTAAGCCCGTCT
CCATTTCGCGACTCCCCTTCTACAGCAAATTCAGATCAGGTTTCTGCAGATACAGCAATGGAGGATAGGGATGCCAGTTCACATTCTTTGTTGCCTCCTTCACCTTATAT
GAGAACCAATTCATCTAAGATGGGAGATGCCTTACCCCCTCGTAAAGCCCATAGGCGGTCTAACAGTGATATTCCATTTGGATTATCTTCGATGATTCAGTCATCTCCTT
TTGGTGGCTCGGGTGGATTTGAGCGATCAACAAGTTGTAAAGAGAATGCGGGGATATTTAAGCCGGCCAACCAGTTTGTTAAAAGAGAACACAGTTTGGAGAAAAACATT
GATAACAGTTTGGAAGGAATGGGTGAAAGGAAGTCCGAAGGGGATACTGTGGATGATTTGTTCTCTGCTTATATGAATTTGGATAATATTGATCTGTTCAACTCCGCAGG
GGCCACTGACAAGAATGATCCTGAGAATCGGGAGGATTTGGATAGTAGGGGTAGTGGAACAAAGACAGGGGGTGACAGCAGCGATAATGAAACCGAAAGCAGTGTAAATG
AAAGTGGGGATAACACTCAAATGGCTGGATTGAATTCCTCTGCTGAGAAGAAGGAAGGGAACAAACGGACTGCAGGTGGAGATATTGCTCCAACTACCAGACATTACCGG
AGTCTTTCCATGGATAGTTTCATGGGAAATATGCAGTTTGGTGATGAGTCGCCCAAAATGCCTCCTACACCACCTGGTGTTCGCTCAGGGCAAATTTCTTCAAACAACAT
AGTTGATGGTAATTCAAATTCATTCAGCTTGGAGTTTGGTAATGGTGAGTTCAGTGGGGCTGAACTGAAGAAAATTATGGCAAATGACAAACTTGCTGAGATTGCACTAA
TTGATCCAAAGCGTGCAAAAAGGATCTTGGCCAACCGCCAATCTGCTGCTCGATCGAAAGAACGAAAAATGCGGTATATATCTGAGTTGGAACACAAGGTTCAGACTCTT
CAGACAGAAGCCACCACACTGTCTGCCCAACTCACACTTCTGCAGCGAGACTCAGTTGGGCTTACAAACCAAAATAACGAACTGAAGTTCCGTCTCCAAGCTATGGAGCA
GCAAGCTCAACTACGGGATGCTCTAAATGAAGCCTTAACTGCGGAGGTTCAGCGACTGAAGCTCGCTACGACCGACATAAATGCACAATCTCATCCCTCTAATGGGATAA
TGGCTCAACCTTCTACGAATCACCATGGACTCCAGCTTCAGCTTCAGCAGCAGCAGCACCACCACCAACAACAGCAGCATATGCACCAGAATGGCAATGCAACCAAAAAA
CCAGAATCCAACCAATAG
mRNA sequenceShow/hide mRNA sequence
AATCAGTGCTCGCTCGTCTTCATGAGAGAGAAATTTTAGAGAGAGAAATCGCCGGCGTCTTCTCGGGATTCCCGCCGGAAATTTAGTCGCCGCTTGTTTTTGACCGGAAA
ATGCTTCTTCTACAAATGGCGGAGAATGAAGCCTTGCGGTTCGTTCAATCTTCGCCGCCATTCTTGAGTTTTTGCCTGCTATGAGATTGAGTGGGCAATTGTTTGTGTTT
ATGATTCAAATTCAATTCGAGTAGCTTTGAAGAAGTTGTGATTTAGGTCTTTTTTTTGCGCGTGCATTGTCTTTCGAAGTTACGTTAGTGTTTGAGGTGAAATGGATGGA
ACTGAAGATCATACTGATAGCATTCGAAATATTCAATGTTCATCTTCGGCTGTAAATCATCACTTATCGATGGATCAGCTAAAAATTTCTCAAGGCCGTACACAGCATTT
TCAGCCAAACTTTGTCGGAGATAATAGTAGAAGAATTGGGATACCGCCTTGTCCCAACTCATCGCAGATCCCTCCAATCTCACCGTATTCTCAGATTCCTAAGTCGCGTC
CGATGAACCAGCAAAGTTATAGCCCAGTTACTACTCATTCTCGATCGTTATCGCAGCCTTCATTTTTCTCTCTTGATTCTTTGCCCCCTTTAAGCCCGTCTCCATTTCGC
GACTCCCCTTCTACAGCAAATTCAGATCAGGTTTCTGCAGATACAGCAATGGAGGATAGGGATGCCAGTTCACATTCTTTGTTGCCTCCTTCACCTTATATGAGAACCAA
TTCATCTAAGATGGGAGATGCCTTACCCCCTCGTAAAGCCCATAGGCGGTCTAACAGTGATATTCCATTTGGATTATCTTCGATGATTCAGTCATCTCCTTTTGGTGGCT
CGGGTGGATTTGAGCGATCAACAAGTTGTAAAGAGAATGCGGGGATATTTAAGCCGGCCAACCAGTTTGTTAAAAGAGAACACAGTTTGGAGAAAAACATTGATAACAGT
TTGGAAGGAATGGGTGAAAGGAAGTCCGAAGGGGATACTGTGGATGATTTGTTCTCTGCTTATATGAATTTGGATAATATTGATCTGTTCAACTCCGCAGGGGCCACTGA
CAAGAATGATCCTGAGAATCGGGAGGATTTGGATAGTAGGGGTAGTGGAACAAAGACAGGGGGTGACAGCAGCGATAATGAAACCGAAAGCAGTGTAAATGAAAGTGGGG
ATAACACTCAAATGGCTGGATTGAATTCCTCTGCTGAGAAGAAGGAAGGGAACAAACGGACTGCAGGTGGAGATATTGCTCCAACTACCAGACATTACCGGAGTCTTTCC
ATGGATAGTTTCATGGGAAATATGCAGTTTGGTGATGAGTCGCCCAAAATGCCTCCTACACCACCTGGTGTTCGCTCAGGGCAAATTTCTTCAAACAACATAGTTGATGG
TAATTCAAATTCATTCAGCTTGGAGTTTGGTAATGGTGAGTTCAGTGGGGCTGAACTGAAGAAAATTATGGCAAATGACAAACTTGCTGAGATTGCACTAATTGATCCAA
AGCGTGCAAAAAGGATCTTGGCCAACCGCCAATCTGCTGCTCGATCGAAAGAACGAAAAATGCGGTATATATCTGAGTTGGAACACAAGGTTCAGACTCTTCAGACAGAA
GCCACCACACTGTCTGCCCAACTCACACTTCTGCAGCGAGACTCAGTTGGGCTTACAAACCAAAATAACGAACTGAAGTTCCGTCTCCAAGCTATGGAGCAGCAAGCTCA
ACTACGGGATGCTCTAAATGAAGCCTTAACTGCGGAGGTTCAGCGACTGAAGCTCGCTACGACCGACATAAATGCACAATCTCATCCCTCTAATGGGATAATGGCTCAAC
CTTCTACGAATCACCATGGACTCCAGCTTCAGCTTCAGCAGCAGCAGCACCACCACCAACAACAGCAGCATATGCACCAGAATGGCAATGCAACCAAAAAACCAGAATCC
AACCAATAG
Protein sequenceShow/hide protein sequence
MDGTEDHTDSIRNIQCSSSAVNHHLSMDQLKISQGRTQHFQPNFVGDNSRRIGIPPCPNSSQIPPISPYSQIPKSRPMNQQSYSPVTTHSRSLSQPSFFSLDSLPPLSPS
PFRDSPSTANSDQVSADTAMEDRDASSHSLLPPSPYMRTNSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPFGGSGGFERSTSCKENAGIFKPANQFVKREHSLEKNI
DNSLEGMGERKSEGDTVDDLFSAYMNLDNIDLFNSAGATDKNDPENREDLDSRGSGTKTGGDSSDNETESSVNESGDNTQMAGLNSSAEKKEGNKRTAGGDIAPTTRHYR
SLSMDSFMGNMQFGDESPKMPPTPPGVRSGQISSNNIVDGNSNSFSLEFGNGEFSGAELKKIMANDKLAEIALIDPKRAKRILANRQSAARSKERKMRYISELEHKVQTL
QTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDINAQSHPSNGIMAQPSTNHHGLQLQLQQQQHHHQQQQHMHQNGNATKK
PESNQ