; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10005714 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10005714
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionBasic-leucine zipper (bZIP) transcription factor family protein
Genome locationChr07:4997556..5002479
RNA-Seq ExpressionHG10005714
SyntenyHG10005714
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044759 - RF2-like transcription factor, bZIP domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448029.1 PREDICTED: probable transcription factor PosF21 [Cucumis melo]6.7e-27996.96Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH
        MGDTEDARTDNLRNLQCSFGTSSSS L++HFSMDQLKISQM CSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQ+PPISPYSQI VSRPMNQQSYNSVPTH
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
        SRSLSQPSFFSLDSLPPLSP+PFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQS P+LPFSG
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG

Query:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN
        SGGLERSTSSKEN GIFK ASQFVKREPSLEKSIDN+LEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDS+GSGTKTGG+SSDN
Subjt:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN

Query:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
        EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRS+SMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNL DGNSTPFSLEFGNGEFSG
Subjt:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSNG
        EALTAEVQRLKLATTDINAQSHPSNG
Subjt:  EALTAEVQRLKLATTDINAQSHPSNG

XP_022136096.1 uncharacterized protein LOC111007873 isoform X1 [Momordica charantia]6.5e-26693.55Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH
        MGDTEDARTD  RNLQCSFG SSSS ++HHFSMDQLK+SQMNCSQ RPQHFQSNFLG+NNRRIGIPP PNS QIPPISPYSQI +SRPMNQQS+N VPTH
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
        SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPY RANSSK+GDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG

Query:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKT-GGDSSD
        SGGLERSTS+KEN G+ +PASQFVKREPSLEKS+DNNLEGMGE+KSEG+TVDDLFSAYMNLDNIDLFNSSGTNDKNGHE+REDLDS+GSGTKT GGDSSD
Subjt:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKT-GGDSSD

Query:  NEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFS
        NEAESSVNESGDNSQ+PGL SSAEKREGIKRTAGGDIAP  RHYRSVSMDSFMGKLQFGDESPKMPPTPPGIR GQ+SSNNLVDGNSTPFSLEFGNGEFS
Subjt:  NEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFS

Query:  GAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDAL
        GAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDAL
Subjt:  GAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDAL

Query:  NEALTAEVQRLKLATTDINAQSHPSNG
        NEALTAEVQRLKLATTDINAQSHPSNG
Subjt:  NEALTAEVQRLKLATTDINAQSHPSNG

XP_022932214.1 uncharacterized protein LOC111438535 [Cucurbita moschata]1.9e-26592.95Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH
        M DT DA TDN RNLQCSFGTSSS+T+ HHFSMDQLKISQMNCSQGRPQHF+SNFLGDNNRRIGIPP PNS QIPPISPYSQI +SRPMNQQSY+ VPTH
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
        SRSLSQP+FFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPY RANSSKMGDALPPRKAHRRSNSDIPFG SSMIQSSPLLPFSG
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG

Query:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN
        SGGLERST+SKEN G+FKPASQFVKRE SLEKS DNNLEGMGE+KSEGDTVDDLFSAYMNLDNIDLFNS+GTNDKNGHENREDLDS+GSGTKTGGDSSDN
Subjt:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN

Query:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
        EAESSVNESGDN+QMPGL SSAEKREG+KRTAG DIAP  RHYRSVSMDSFM KLQFGDESPKMPPTPPG+RPGQLSSNNL DGNSTPFSLEFGNGEFSG
Subjt:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSN
        EALTAEVQRLKLATT+IN+QSHPSN
Subjt:  EALTAEVQRLKLATTDINAQSHPSN

XP_031744770.1 transcription factor VIP1 [Cucumis sativus]7.4e-27896.77Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH
        MGDTEDARTDNLRNLQCSFGTSSSS L+HHFSMDQLKISQM CSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQ+PPISPYSQI VSRPMNQ SYNSVPTH
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
        SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKM DALPPRKAHRRSNSDIPFGLSSMIQS P+LPFSG
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG

Query:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN
        SGGLERSTSSKEN GIFK ASQFVKREPSLEKSIDN++EGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSS TNDKNGHENREDLDS+GSGTKTGG+SSDN
Subjt:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN

Query:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
        EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRS+SMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
Subjt:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSNG
        EALTAEVQRLKLATTDINAQSHPSNG
Subjt:  EALTAEVQRLKLATTDINAQSHPSNG

XP_038887946.1 probable serine/threonine-protein kinase tsuA [Benincasa hispida]1.5e-27897.34Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH
        MGDTEDARTDNLRNLQCSFGTSSSS L+HHFSMDQLKISQMNCSQ RPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQI VSRPMNQQSYNSVPTH
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
        SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG

Query:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN
        S GLERSTSSKEN  IFKPASQFVKRE SLEKSIDNNLEGMGEKKSEGDTVDDLF+AYMNLDNIDLFNSSG NDKNGHENREDLDS+GSGTKTGG+SSDN
Subjt:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN

Query:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
        EAESSVNESGDNSQMPGL+SSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFG+ESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
Subjt:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSNG
        EALTAEVQRLKLATTDINAQSHPSNG
Subjt:  EALTAEVQRLKLATTDINAQSHPSNG

TrEMBL top hitse value%identityAlignment
A0A0A0K0G6 BZIP domain-containing protein3.6e-27896.77Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH
        MGDTEDARTDNLRNLQCSFGTSSSS L+HHFSMDQLKISQM CSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQ+PPISPYSQI VSRPMNQ SYNSVPTH
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
        SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKM DALPPRKAHRRSNSDIPFGLSSMIQS P+LPFSG
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG

Query:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN
        SGGLERSTSSKEN GIFK ASQFVKREPSLEKSIDN++EGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSS TNDKNGHENREDLDS+GSGTKTGG+SSDN
Subjt:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN

Query:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
        EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRS+SMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
Subjt:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSNG
        EALTAEVQRLKLATTDINAQSHPSNG
Subjt:  EALTAEVQRLKLATTDINAQSHPSNG

A0A1S3BIS8 probable transcription factor PosF213.2e-27996.96Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH
        MGDTEDARTDNLRNLQCSFGTSSSS L++HFSMDQLKISQM CSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQ+PPISPYSQI VSRPMNQQSYNSVPTH
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
        SRSLSQPSFFSLDSLPPLSP+PFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQS P+LPFSG
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG

Query:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN
        SGGLERSTSSKEN GIFK ASQFVKREPSLEKSIDN+LEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDS+GSGTKTGG+SSDN
Subjt:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN

Query:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
        EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRS+SMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNL DGNSTPFSLEFGNGEFSG
Subjt:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSNG
        EALTAEVQRLKLATTDINAQSHPSNG
Subjt:  EALTAEVQRLKLATTDINAQSHPSNG

A0A6J1C4L8 uncharacterized protein LOC111007873 isoform X13.1e-26693.55Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH
        MGDTEDARTD  RNLQCSFG SSSS ++HHFSMDQLK+SQMNCSQ RPQHFQSNFLG+NNRRIGIPP PNS QIPPISPYSQI +SRPMNQQS+N VPTH
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
        SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPY RANSSK+GDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG

Query:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKT-GGDSSD
        SGGLERSTS+KEN G+ +PASQFVKREPSLEKS+DNNLEGMGE+KSEG+TVDDLFSAYMNLDNIDLFNSSGTNDKNGHE+REDLDS+GSGTKT GGDSSD
Subjt:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKT-GGDSSD

Query:  NEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFS
        NEAESSVNESGDNSQ+PGL SSAEKREGIKRTAGGDIAP  RHYRSVSMDSFMGKLQFGDESPKMPPTPPGIR GQ+SSNNLVDGNSTPFSLEFGNGEFS
Subjt:  NEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFS

Query:  GAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDAL
        GAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDAL
Subjt:  GAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDAL

Query:  NEALTAEVQRLKLATTDINAQSHPSNG
        NEALTAEVQRLKLATTDINAQSHPSNG
Subjt:  NEALTAEVQRLKLATTDINAQSHPSNG

A0A6J1EWE2 uncharacterized protein LOC1114385359.1e-26692.95Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH
        M DT DA TDN RNLQCSFGTSSS+T+ HHFSMDQLKISQMNCSQGRPQHF+SNFLGDNNRRIGIPP PNS QIPPISPYSQI +SRPMNQQSY+ VPTH
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
        SRSLSQP+FFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPY RANSSKMGDALPPRKAHRRSNSDIPFG SSMIQSSPLLPFSG
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG

Query:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN
        SGGLERST+SKEN G+FKPASQFVKRE SLEKS DNNLEGMGE+KSEGDTVDDLFSAYMNLDNIDLFNS+GTNDKNGHENREDLDS+GSGTKTGGDSSDN
Subjt:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN

Query:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
        EAESSVNESGDN+QMPGL SSAEKREG+KRTAG DIAP  RHYRSVSMDSFM KLQFGDESPKMPPTPPG+RPGQLSSNNL DGNSTPFSLEFGNGEFSG
Subjt:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSN
        EALTAEVQRLKLATT+IN+QSHPSN
Subjt:  EALTAEVQRLKLATTDINAQSHPSN

A0A6J1I3G1 uncharacterized protein LOC111469572 isoform X11.9e-26392.38Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH
        M DT DA TDN RN+QCSFGTSSS+T+ HHFSMDQLKISQMNCSQGRPQHF+SNFLGDNNRRIGIPP PNS QIPPISPYSQI +SRPMNQQSY+ VPTH
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTH

Query:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG
        SRSLSQP+FFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPY RANSSKMGDALPPRKAHRRSNSDIPFG SSMIQSSPLLP SG
Subjt:  SRSLSQPSFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSG

Query:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN
        SGGLERST+SKEN G+FKPA+QFVKRE SLEKS DNNLEGMGE+KSEGDTVDDLFSAYMNLDNIDLFNS+GTNDKNGHENREDLDS+GSGTKTGGDSSDN
Subjt:  SGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDN

Query:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG
        EAESSVNESGDN+QMPGL SSAEKREG+KRTAG DIAP  RHYRSVSMDSFM KLQFGDESPKMPPTPPG+ PGQLSSNNL DGNSTPFSLEFGNGEFSG
Subjt:  EAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSG

Query:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
        AELKKIMANDKLAEIAL DPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN
Subjt:  AELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALN

Query:  EALTAEVQRLKLATTDINAQSHPSN
        EALTAEVQRLKLATT+INAQSHPSN
Subjt:  EALTAEVQRLKLATTDINAQSHPSN

SwissProt top hitse value%identityAlignment
O22873 bZIP transcription factor 184.1e-3760.76Show/hide
Query:  PPTPPGIRPGQLSSNNLVDGNSTP---FSLEF-GNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATT
        P  P    P    +     GNS P    SL   G+      E KK MA DKLAE+ + DPKRAKRI+ANRQSAARSKERK RYI ELE KVQTLQTEATT
Subjt:  PPTPPGIRPGQLSSNNLVDGNSTP---FSLEF-GNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATT

Query:  LSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDIN
        LSAQL+L QRD+ GL+++N ELK RLQ MEQQA+LRDALNE L  EV+RLK AT +++
Subjt:  LSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDIN

Q69IL4 Transcription factor RF2a1.2e-3345.49Show/hide
Query:  EDLDSKGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIA-----------PNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPG
        EDLD   +G   G   SD   E   +   D  ++     ++ + E    +AG   A              +H  S+SMD  M                  
Subjt:  EDLDSKGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIA-----------PNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPG

Query:  IRPGQLSSNNLVDGNSTPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQR
             + +  LV  +        G    S AE KK ++  KLAE+AL DPKRAKRI ANRQSAARSKERKMRYI+ELE KVQTLQTEATTLSAQL LLQR
Subjt:  IRPGQLSSNNLVDGNSTPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQR

Query:  DSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLAT
        D+ GLT +N+ELK RLQ MEQQ  L+DALN+ L +EVQRLK+AT
Subjt:  DSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLAT

Q6S4P4 Transcription factor RF2b3.5e-3675Show/hide
Query:  ELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNE
        E KK M  ++L+E+A  DPKRAKRILANRQSAARSKERK RYI+ELE KVQTLQTEATTLSAQLTL QRD+ GL+ +N ELK RLQAMEQQAQLRDALN+
Subjt:  ELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNE

Query:  ALTAEVQRLKLATTDI
        AL  E++RLKLAT ++
Subjt:  ALTAEVQRLKLATTDI

Q8H1F0 bZIP transcription factor 294.5e-12959.06Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPC-PNSPQIPPISPYSQIAVSRPMNQQSYN-SVP
        MGDTE   +D ++ L  SFGT+SSS  ++  S  QL ++        PQ   S    D+ +RIG+PP  PN   IPP SP+SQI  +R     ++N    
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPC-PNSPQIPPISPYSQIAVSRPMNQQSYN-SVP

Query:  THSRSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDA----SSHSLLPPSPYTRANSS-----KMGDALPPRKAHRRSNSDIPFGLSS
         HSRS+SQP SFFS DSLPPLSPSPFRD            D SMEDRD+    S+HS LPPSP+TR NS+     ++G++LPPRK+HRRSNSDIP G +S
Subjt:  THSRSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDA----SSHSLLPPSPYTRANSS-----KMGDALPPRKAHRRSNSDIPFGLSS

Query:  MIQSSPLLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-S
        M    PL+P      LERS S  E    +  ++ FVK+E S E+      EG+GE+    + +DDLFSAYMNL+NID+ NSS  +D KNG+ENR+D++ S
Subjt:  MIQSSPLLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-S

Query:  KGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIK-RTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDG
        + SGTKT G  ++ E+ SSVNES +N+    +NSS EKRE +K R AGGDIAP  RHYRSVS+DS FM KL FGDES K PP+ PG    ++S  N VDG
Subjt:  KGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIK-RTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDG

Query:  NS-TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELK
        NS   FS+EF NGEF+ AE+KKIMANDKLAE+A++DPKR KRILANRQSAARSKERKMRYI ELEHKVQTLQTEATTLSAQLTLLQRD +GLTNQNNELK
Subjt:  NS-TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELK

Query:  FRLQAMEQQAQLRDALNEALTAEVQRLKLA
        FRLQAMEQQA+LRDALNEAL  EVQRLKLA
Subjt:  FRLQAMEQQAQLRDALNEALTAEVQRLKLA

Q9SIG8 bZIP transcription factor 301.7e-10451.66Show/hide
Query:  GDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTHS
        GDT D  T+ ++ +  S GTSSSS  +H+  ++   I   +       HF+  F        G PP    P IPPISPYSQI    P   Q     P HS
Subjt:  GDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTHS

Query:  RSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHS-LLPPSPYTRANSSKM-----GDALPPRKAHRRSNSDIPFGLSSMIQSSP
        RS+SQP SFFS DSLPPL+PS    +PS S         S+E++  +  S  LPPSP+T  +SS       G+ LPPRK+HRRSNSD+ FG SSM+  + 
Subjt:  RSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHS-LLPPSPYTRANSSKM-----GDALPPRKAHRRSNSDIPFGLSSMIQSSP

Query:  LLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGM--GEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-SKGSG
          P   S  LERS S ++        S  VK+EP          EG   G K      +DD+F+AYMNLDNID+ NS G  D KNG+EN E+++ S+GSG
Subjt:  LLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGM--GEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-SKGSG

Query:  TK--TGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESP-KMPPTPPGIRPGQLSSNNLVDGNS
        TK   GG SSD+E +SS +     +    L+SS+    G+KR AGGDIAP  RHYRSVSMDS FMGKL FGDES  K+PP+       ++S  N  +GNS
Subjt:  TK--TGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESP-KMPPTPPGIRPGQLSSNNLVDGNS

Query:  TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRL
        + +S+EFGN EF+ AE+KKI A++KLAEI + DPKR KRILANR SAARSKERK RY++ELEHKVQTLQTEATTLSAQLT LQRDS+GLTNQN+ELKFRL
Subjt:  TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRL

Query:  QAMEQQAQLRDALNEALTAEVQRLKLATTDINAQSHPSNGTD
        QAMEQQAQLRDAL+E L  EVQRLKL   + N +   S+ ++
Subjt:  QAMEQQAQLRDALNEALTAEVQRLKLATTDINAQSHPSNGTD

Arabidopsis top hitse value%identityAlignment
AT2G21230.1 Basic-leucine zipper (bZIP) transcription factor family protein1.2e-10551.66Show/hide
Query:  GDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTHS
        GDT D  T+ ++ +  S GTSSSS  +H+  ++   I   +       HF+  F        G PP    P IPPISPYSQI    P   Q     P HS
Subjt:  GDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTHS

Query:  RSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHS-LLPPSPYTRANSSKM-----GDALPPRKAHRRSNSDIPFGLSSMIQSSP
        RS+SQP SFFS DSLPPL+PS    +PS S         S+E++  +  S  LPPSP+T  +SS       G+ LPPRK+HRRSNSD+ FG SSM+  + 
Subjt:  RSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHS-LLPPSPYTRANSSKM-----GDALPPRKAHRRSNSDIPFGLSSMIQSSP

Query:  LLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGM--GEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-SKGSG
          P   S  LERS S ++        S  VK+EP          EG   G K      +DD+F+AYMNLDNID+ NS G  D KNG+EN E+++ S+GSG
Subjt:  LLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGM--GEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-SKGSG

Query:  TK--TGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESP-KMPPTPPGIRPGQLSSNNLVDGNS
        TK   GG SSD+E +SS +     +    L+SS+    G+KR AGGDIAP  RHYRSVSMDS FMGKL FGDES  K+PP+       ++S  N  +GNS
Subjt:  TK--TGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESP-KMPPTPPGIRPGQLSSNNLVDGNS

Query:  TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRL
        + +S+EFGN EF+ AE+KKI A++KLAEI + DPKR KRILANR SAARSKERK RY++ELEHKVQTLQTEATTLSAQLT LQRDS+GLTNQN+ELKFRL
Subjt:  TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRL

Query:  QAMEQQAQLRDALNEALTAEVQRLKLATTDINAQSHPSNGTD
        QAMEQQAQLRDAL+E L  EVQRLKL   + N +   S+ ++
Subjt:  QAMEQQAQLRDALNEALTAEVQRLKLATTDINAQSHPSNGTD

AT2G21230.3 Basic-leucine zipper (bZIP) transcription factor family protein3.4e-10350.91Show/hide
Query:  GDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTHS
        GDT D  T+ ++ +  S GTSSSS  +H+  ++   I   +       HF+  F        G PP    P IPPISPYSQI    P   Q     P HS
Subjt:  GDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTHS

Query:  RSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHS-LLPPSPYTRANSSKM-----GDALPPRKAHRRSNSDIPFGLSSMIQSSP
        RS+SQP SFFS DSLPPL+PS    +PS S         S+E++  +  S  LPPSP+T  +SS       G+ LPPRK+HRRSNSD+ FG SSM+  + 
Subjt:  RSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHS-LLPPSPYTRANSSKM-----GDALPPRKAHRRSNSDIPFGLSSMIQSSP

Query:  LLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGM--GEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-SKGSG
          P   S  LERS S ++        S  VK+EP          EG   G K      +DD+F+AYMNLDNID+ NS G  D KNG+EN E+++ S+GSG
Subjt:  LLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGM--GEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-SKGSG

Query:  TK--TGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESP-KMPPTPPGIRPGQLSSNNLVDGNS
        TK   GG SSD+E +SS +     +    L+SS+    G+KR AGGDIAP  RHYRSVSMDS FMGKL FGDES  K+PP+       ++S  N  +GNS
Subjt:  TK--TGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESP-KMPPTPPGIRPGQLSSNNLVDGNS

Query:  TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRL
        + +S+EFGN EF+ AE+KKI A++KLAEI + DPKR KRILANR SAARSKERK RY++ELEHKVQTLQTEATTLSAQLT LQRDS+GLTNQN+ELKFRL
Subjt:  TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRL

Query:  QAMEQQAQLRD------ALNEALTAEVQRLKLATTDINAQSHPSNGTD
        QAMEQQAQLRD       L+E L  EVQRLKL   + N +   S+ ++
Subjt:  QAMEQQAQLRD------ALNEALTAEVQRLKLATTDINAQSHPSNGTD

AT4G38900.1 Basic-leucine zipper (bZIP) transcription factor family protein3.0e-12858.4Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPC-PNSPQIPPISPYSQIAVSRPMNQQSYN-SVP
        MGDTE   +D ++ L  SFGT+SSS  ++  S  QL ++        PQ   S    D+ +RIG+PP  PN   IPP SP+SQI  +R     ++N    
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPC-PNSPQIPPISPYSQIAVSRPMNQQSYN-SVP

Query:  THSRSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDA----SSHSLLPPSPYTRANSS-----KMGDALPPRKAHRRSNSDIPFGLSS
         HSRS+SQP SFFS DSLPPLSPSPFRD            D SMEDRD+    S+HS LPPSP+TR NS+     ++G++LPPRK+HRRSNSDIP G +S
Subjt:  THSRSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDA----SSHSLLPPSPYTRANSS-----KMGDALPPRKAHRRSNSDIPFGLSS

Query:  MIQSSPLLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-S
        M    PL+P      LERS S  E    +  ++ FVK+E S E+      EG+GE+    + +DDLFSAYMNL+NID+ NSS  +D KNG+ENR+D++ S
Subjt:  MIQSSPLLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-S

Query:  KGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIK-RTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDG
        + SGTKT G  ++ E+ SSVNES +N+    +NSS EKRE +K R AGGDIAP  RHYRSVS+DS FM KL FGDES K PP+ PG    ++S  N VDG
Subjt:  KGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIK-RTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDG

Query:  NS-TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAK------RILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTN
        NS   FS+EF NGEF+ AE+KKIMANDKLAE+A++DPKR K      RILANRQSAARSKERKMRYI ELEHKVQTLQTEATTLSAQLTLLQRD +GLTN
Subjt:  NS-TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAK------RILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTN

Query:  QNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLA
        QNNELKFRLQAMEQQA+LRDALNEAL  EVQRLKLA
Subjt:  QNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLA

AT4G38900.2 Basic-leucine zipper (bZIP) transcription factor family protein3.2e-13059.06Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPC-PNSPQIPPISPYSQIAVSRPMNQQSYN-SVP
        MGDTE   +D ++ L  SFGT+SSS  ++  S  QL ++        PQ   S    D+ +RIG+PP  PN   IPP SP+SQI  +R     ++N    
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPC-PNSPQIPPISPYSQIAVSRPMNQQSYN-SVP

Query:  THSRSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDA----SSHSLLPPSPYTRANSS-----KMGDALPPRKAHRRSNSDIPFGLSS
         HSRS+SQP SFFS DSLPPLSPSPFRD            D SMEDRD+    S+HS LPPSP+TR NS+     ++G++LPPRK+HRRSNSDIP G +S
Subjt:  THSRSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDA----SSHSLLPPSPYTRANSS-----KMGDALPPRKAHRRSNSDIPFGLSS

Query:  MIQSSPLLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-S
        M    PL+P      LERS S  E    +  ++ FVK+E S E+      EG+GE+    + +DDLFSAYMNL+NID+ NSS  +D KNG+ENR+D++ S
Subjt:  MIQSSPLLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-S

Query:  KGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIK-RTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDG
        + SGTKT G  ++ E+ SSVNES +N+    +NSS EKRE +K R AGGDIAP  RHYRSVS+DS FM KL FGDES K PP+ PG    ++S  N VDG
Subjt:  KGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIK-RTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDG

Query:  NS-TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELK
        NS   FS+EF NGEF+ AE+KKIMANDKLAE+A++DPKR KRILANRQSAARSKERKMRYI ELEHKVQTLQTEATTLSAQLTLLQRD +GLTNQNNELK
Subjt:  NS-TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELK

Query:  FRLQAMEQQAQLRDALNEALTAEVQRLKLA
        FRLQAMEQQA+LRDALNEAL  EVQRLKLA
Subjt:  FRLQAMEQQAQLRDALNEALTAEVQRLKLA

AT4G38900.3 Basic-leucine zipper (bZIP) transcription factor family protein3.2e-13059.06Show/hide
Query:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPC-PNSPQIPPISPYSQIAVSRPMNQQSYN-SVP
        MGDTE   +D ++ L  SFGT+SSS  ++  S  QL ++        PQ   S    D+ +RIG+PP  PN   IPP SP+SQI  +R     ++N    
Subjt:  MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPC-PNSPQIPPISPYSQIAVSRPMNQQSYN-SVP

Query:  THSRSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDA----SSHSLLPPSPYTRANSS-----KMGDALPPRKAHRRSNSDIPFGLSS
         HSRS+SQP SFFS DSLPPLSPSPFRD            D SMEDRD+    S+HS LPPSP+TR NS+     ++G++LPPRK+HRRSNSDIP G +S
Subjt:  THSRSLSQP-SFFSLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDA----SSHSLLPPSPYTRANSS-----KMGDALPPRKAHRRSNSDIPFGLSS

Query:  MIQSSPLLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-S
        M    PL+P      LERS S  E    +  ++ FVK+E S E+      EG+GE+    + +DDLFSAYMNL+NID+ NSS  +D KNG+ENR+D++ S
Subjt:  MIQSSPLLPFSGSGGLERSTSSKENGGIFKPASQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTND-KNGHENREDLD-S

Query:  KGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIK-RTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDG
        + SGTKT G  ++ E+ SSVNES +N+    +NSS EKRE +K R AGGDIAP  RHYRSVS+DS FM KL FGDES K PP+ PG    ++S  N VDG
Subjt:  KGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIK-RTAGGDIAPNNRHYRSVSMDS-FMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDG

Query:  NS-TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELK
        NS   FS+EF NGEF+ AE+KKIMANDKLAE+A++DPKR KRILANRQSAARSKERKMRYI ELEHKVQTLQTEATTLSAQLTLLQRD +GLTNQNNELK
Subjt:  NS-TPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELK

Query:  FRLQAMEQQAQLRDALNEALTAEVQRLKLA
        FRLQAMEQQA+LRDALNEAL  EVQRLKLA
Subjt:  FRLQAMEQQAQLRDALNEALTAEVQRLKLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGATACTGAAGATGCTCGTACTGATAACTTACGAAATCTTCAATGTTCGTTTGGAACATCTTCCTCTTCTACTTTAAGGCATCATTTCTCTATGGATCAACTTAA
AATTTCTCAAATGAACTGCTCACAAGGCCGTCCACAGCATTTTCAGTCGAATTTTCTTGGGGATAATAATAGAAGAATTGGGATACCTCCTTGTCCAAACTCACCGCAGA
TCCCGCCAATCTCACCGTATTCTCAGATTGCGGTATCGCGTCCGATGAACCAGCAGAGTTATAACTCAGTTCCTACTCATTCTCGATCGTTATCTCAGCCTTCTTTTTTC
TCTCTCGATTCTTTGCCCCCTTTAAGTCCGTCTCCATTTCGTGACTCCCCATCTACATCGAATTCAGATCAGGTTTCTGCTGATACATCAATGGAGGATAGGGATGCCAG
TTCACATTCTTTGTTGCCTCCCTCACCTTATACGAGGGCCAATTCTTCGAAGATGGGTGATGCTTTACCTCCTCGTAAAGCTCATAGGCGGTCTAACAGTGATATTCCAT
TTGGATTATCTTCGATGATTCAGTCATCTCCTCTTCTCCCTTTTAGTGGCTCTGGTGGATTGGAGCGATCAACAAGTAGTAAAGAGAATGGGGGCATATTTAAGCCGGCC
AGCCAGTTTGTTAAAAGAGAACCTAGTTTGGAGAAAAGCATTGATAACAACTTGGAAGGAATGGGTGAAAAGAAGTCCGAAGGGGACACTGTGGATGATTTATTCTCTGC
TTATATGAATTTGGATAATATTGATCTGTTCAACTCCTCAGGGACCAACGACAAGAATGGTCATGAGAATCGGGAGGATTTGGATAGTAAAGGCAGTGGAACAAAGACAG
GGGGTGACAGCAGTGATAATGAGGCAGAAAGCAGTGTGAACGAAAGTGGGGATAACTCTCAAATGCCTGGATTGAATTCGTCTGCGGAGAAGAGGGAGGGGATTAAACGG
ACTGCAGGGGGAGATATTGCTCCAAATAACAGACATTACCGGAGTGTCTCCATGGATAGTTTCATGGGCAAGTTGCAATTTGGTGATGAGTCACCCAAAATGCCACCTAC
ACCACCCGGCATTCGTCCAGGGCAACTTTCTTCAAACAACCTAGTTGACGGTAATTCAACTCCATTCAGCTTGGAGTTTGGTAATGGTGAGTTCAGTGGCGCTGAACTGA
AGAAAATTATGGCAAATGACAAACTTGCTGAAATTGCACTAACCGATCCCAAGCGTGCAAAGAGGATCTTGGCAAACCGTCAATCTGCTGCTCGATCAAAAGAACGAAAA
ATGCGCTATATATCTGAGTTGGAACACAAGGTTCAGACTCTTCAGACAGAAGCCACCACGCTGTCTGCCCAACTCACGCTTTTGCAGCGAGACTCAGTTGGACTTACAAA
CCAGAACAATGAGCTGAAGTTCCGTCTCCAAGCCATGGAACAGCAAGCACAACTACGGGATGCTCTAAATGAAGCGTTAACCGCGGAGGTTCAGAGATTGAAGCTCGCTA
CAACCGATATAAATGCGCAATCTCATCCCTCAAACGGGACAGACATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGATACTGAAGATGCTCGTACTGATAACTTACGAAATCTTCAATGTTCGTTTGGAACATCTTCCTCTTCTACTTTAAGGCATCATTTCTCTATGGATCAACTTAA
AATTTCTCAAATGAACTGCTCACAAGGCCGTCCACAGCATTTTCAGTCGAATTTTCTTGGGGATAATAATAGAAGAATTGGGATACCTCCTTGTCCAAACTCACCGCAGA
TCCCGCCAATCTCACCGTATTCTCAGATTGCGGTATCGCGTCCGATGAACCAGCAGAGTTATAACTCAGTTCCTACTCATTCTCGATCGTTATCTCAGCCTTCTTTTTTC
TCTCTCGATTCTTTGCCCCCTTTAAGTCCGTCTCCATTTCGTGACTCCCCATCTACATCGAATTCAGATCAGGTTTCTGCTGATACATCAATGGAGGATAGGGATGCCAG
TTCACATTCTTTGTTGCCTCCCTCACCTTATACGAGGGCCAATTCTTCGAAGATGGGTGATGCTTTACCTCCTCGTAAAGCTCATAGGCGGTCTAACAGTGATATTCCAT
TTGGATTATCTTCGATGATTCAGTCATCTCCTCTTCTCCCTTTTAGTGGCTCTGGTGGATTGGAGCGATCAACAAGTAGTAAAGAGAATGGGGGCATATTTAAGCCGGCC
AGCCAGTTTGTTAAAAGAGAACCTAGTTTGGAGAAAAGCATTGATAACAACTTGGAAGGAATGGGTGAAAAGAAGTCCGAAGGGGACACTGTGGATGATTTATTCTCTGC
TTATATGAATTTGGATAATATTGATCTGTTCAACTCCTCAGGGACCAACGACAAGAATGGTCATGAGAATCGGGAGGATTTGGATAGTAAAGGCAGTGGAACAAAGACAG
GGGGTGACAGCAGTGATAATGAGGCAGAAAGCAGTGTGAACGAAAGTGGGGATAACTCTCAAATGCCTGGATTGAATTCGTCTGCGGAGAAGAGGGAGGGGATTAAACGG
ACTGCAGGGGGAGATATTGCTCCAAATAACAGACATTACCGGAGTGTCTCCATGGATAGTTTCATGGGCAAGTTGCAATTTGGTGATGAGTCACCCAAAATGCCACCTAC
ACCACCCGGCATTCGTCCAGGGCAACTTTCTTCAAACAACCTAGTTGACGGTAATTCAACTCCATTCAGCTTGGAGTTTGGTAATGGTGAGTTCAGTGGCGCTGAACTGA
AGAAAATTATGGCAAATGACAAACTTGCTGAAATTGCACTAACCGATCCCAAGCGTGCAAAGAGGATCTTGGCAAACCGTCAATCTGCTGCTCGATCAAAAGAACGAAAA
ATGCGCTATATATCTGAGTTGGAACACAAGGTTCAGACTCTTCAGACAGAAGCCACCACGCTGTCTGCCCAACTCACGCTTTTGCAGCGAGACTCAGTTGGACTTACAAA
CCAGAACAATGAGCTGAAGTTCCGTCTCCAAGCCATGGAACAGCAAGCACAACTACGGGATGCTCTAAATGAAGCGTTAACCGCGGAGGTTCAGAGATTGAAGCTCGCTA
CAACCGATATAAATGCGCAATCTCATCCCTCAAACGGGACAGACATTTAA
Protein sequenceShow/hide protein sequence
MGDTEDARTDNLRNLQCSFGTSSSSTLRHHFSMDQLKISQMNCSQGRPQHFQSNFLGDNNRRIGIPPCPNSPQIPPISPYSQIAVSRPMNQQSYNSVPTHSRSLSQPSFF
SLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMGDALPPRKAHRRSNSDIPFGLSSMIQSSPLLPFSGSGGLERSTSSKENGGIFKPA
SQFVKREPSLEKSIDNNLEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSGTNDKNGHENREDLDSKGSGTKTGGDSSDNEAESSVNESGDNSQMPGLNSSAEKREGIKR
TAGGDIAPNNRHYRSVSMDSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERK
MRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDINAQSHPSNGTDI