; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0026834 (gene) of Chayote v1 genome

Gene IDSed0026834
OrganismSechium edule (Chayote v1)
DescriptionBZIP domain-containing protein
Genome locationLG04:29424554..29428653
RNA-Seq ExpressionSed0026834
SyntenySed0026834
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580686.1 hypothetical protein SDJN03_20688, partial [Cucurbita argyrosperma subsp. sororia]1.7e-17869.14Show/hide
Query:  SSSSNCSENTTCSGLSSSSS-MSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR
        +SSS CSE T+CSGLSSSS+  S+ ++M AD MVKVEIE AEALADLA LA R++G QPSE KWR K +KGKR+R EVKTE   SAF DSLP+R DL+LR
Subjt:  SSSSNCSENTTCSGLSSSSS-MSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR

Query:  I-QDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTR
        I QDRGV+SH PSEK+CA  SHPEW+TT++M+KA+K EAES ++      S+PLFG RRS RNLTEAEKEERRIRR+LANRESARQTIRRRQALCE+LT+
Subjt:  I-QDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTR

Query:  KAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVV
        KA+DLAWENENLKREKELALKEYQSLE TNKELKEQIAQA +PK +EIPGNN  S VQ PPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH+LHNV VV
Subjt:  KAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVV

Query:  PPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEE
        PPS+R P+NNTV V DSSHVQENF  V GLRTPFC++PCSWLLPHHDHRNQQS + SCP G  QE I+SNS N AYTSKV VR E +H SL S  EK   
Subjt:  PPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEE

Query:  PALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMA
           N+  +LNEA + K+HT NTVGV V+ F                    EP S V+QD  S+DD   SSRT DD C+  ++KH P+++   KKTIDAMA
Subjt:  PALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMA

Query:  ATEARRRRKELTKLKNLHTRQCRMN
        ATEARRRRKELTKLKNLHTR CRM+
Subjt:  ATEARRRRKELTKLKNLHTRQCRMN

KAG7017441.1 hypothetical protein SDJN02_19306 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-18069.47Show/hide
Query:  SSSSNCSENTTCSGLSSSSS-MSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR
        +SSS CSE T+CSGLSSSS+  S+ ++M AD MVKVEIE AEALADLA LA R++G QPSE KWR K +KGKR+R EVKTE   SAF DSLP+R DL+LR
Subjt:  SSSSNCSENTTCSGLSSSSS-MSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR

Query:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK
        IQDRGV+SH PSEK+CA  SHPEW+TT++M+KA+K EAES ++      S+PLFG RRS RNLTEAEKEERRIRR+LANRESARQTIRRRQALCE+LT+K
Subjt:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK

Query:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVP
        A+DLAWENENLKREKELALKEYQSLE TNKELKEQIAQA +PK +EIPGNN  S VQ PPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH+LHNV VVP
Subjt:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVP

Query:  PSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEP
        PS+R P+NNTV V DSSHVQENF  V GLRTPFC++PCSWLLPHHDHRNQQS + SCP G  QE I+SNS N AYTSKV VR E +H SL S EEK    
Subjt:  PSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEP

Query:  ALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAA
          N+  +LNEA + K+HT NTVGV V+ F                    EP S V+QD  S+DD   SSRT DD C+  ++KH P+++   KKTIDAMAA
Subjt:  ALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAA

Query:  TEARRRRKELTKLKNLHTRQCRMN
        TEARRRRKELTKLKNLHTR CRM+
Subjt:  TEARRRRKELTKLKNLHTRQCRMN

XP_022934487.1 uncharacterized protein LOC111441650 isoform X1 [Cucurbita moschata]1.3e-17869.33Show/hide
Query:  SSSSNCSENTTCSGLSSSSSMS-AFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR
        +SSS CSE T+CSGLSSSS+ S + ++M AD MVKVEIE AEALADLA LA R++G QPSE KWR K +KGKR+R EVKTE   SAF DSLP+R DL+LR
Subjt:  SSSSNCSENTTCSGLSSSSSMS-AFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR

Query:  I-QDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTR
        I QDRGV+SH PSEK+CA  SHPEW+TT++M+KA+K EAES ++      S+PLFG RRS RNLTEAEKEERRIRR+LANRESARQTIRRRQALCE+LT+
Subjt:  I-QDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTR

Query:  KAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVV
        KA+DLAWENENLKREKELALKEYQSLE TNKELKEQIA A +PK +EIPGNN  S VQ PPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH+LHNV VV
Subjt:  KAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVV

Query:  PPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEE
        PPS+R P+NNTV V DSSHVQENF  V GLRTPFC++PCSWLLPHHDHRNQQS + SCP G  QE I+SNS N AYTSKV VR E +H SL S EEK   
Subjt:  PPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEE

Query:  PALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMA
           N+  +LNEA + K+HT NTVGV V+ F                    EP S V+QD  S+DD   SSRT DD C+  ++KH P+I+   KKTIDAMA
Subjt:  PALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMA

Query:  ATEARRRRKELTKLKNLHTRQCRMN
        ATEARRRRKELTKLKNLHTR CRM+
Subjt:  ATEARRRRKELTKLKNLHTRQCRMN

XP_022934488.1 uncharacterized protein LOC111441650 isoform X2 [Cucurbita moschata]5.2e-18069.47Show/hide
Query:  SSSSNCSENTTCSGLSSSSSMS-AFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR
        +SSS CSE T+CSGLSSSS+ S + ++M AD MVKVEIE AEALADLA LA R++G QPSE KWR K +KGKR+R EVKTE   SAF DSLP+R DL+LR
Subjt:  SSSSNCSENTTCSGLSSSSSMS-AFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR

Query:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK
        IQDRGV+SH PSEK+CA  SHPEW+TT++M+KA+K EAES ++      S+PLFG RRS RNLTEAEKEERRIRR+LANRESARQTIRRRQALCE+LT+K
Subjt:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK

Query:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVP
        A+DLAWENENLKREKELALKEYQSLE TNKELKEQIA A +PK +EIPGNN  S VQ PPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH+LHNV VVP
Subjt:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVP

Query:  PSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEP
        PS+R P+NNTV V DSSHVQENF  V GLRTPFC++PCSWLLPHHDHRNQQS + SCP G  QE I+SNS N AYTSKV VR E +H SL S EEK    
Subjt:  PSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEP

Query:  ALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAA
          N+  +LNEA + K+HT NTVGV V+ F                    EP S V+QD  S+DD   SSRT DD C+  ++KH P+I+   KKTIDAMAA
Subjt:  ALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAA

Query:  TEARRRRKELTKLKNLHTRQCRMN
        TEARRRRKELTKLKNLHTR CRM+
Subjt:  TEARRRRKELTKLKNLHTRQCRMN

XP_023528186.1 uncharacterized protein LOC111791175 isoform X2 [Cucurbita pepo subsp. pepo]1.7e-17869.08Show/hide
Query:  SSSSNCSENTTCSGLSSSSS-MSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR
        +SSS CSE T+CSGLSSSS+  S+ ++M AD MVKVEIE AEALADLA  A R++G QPSE KWR K +KGKR+R EVKTE   SAF DSLP+R DL+LR
Subjt:  SSSSNCSENTTCSGLSSSSS-MSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR

Query:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK
        IQDR V+SH PSEK+CA  SHPEW+TT++M+KA+K EAES ++      S+PLFG RRS RNLTEAEKEERRIRR+LANRESARQTIRRRQALCE+LT+K
Subjt:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK

Query:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVP
        A+DLAWENENLKREKELALKEYQSLE TNKELKEQIAQA +PK +EIPGNN  S VQ PPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH+LH V VVP
Subjt:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVP

Query:  PSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEP
        PS+R P+NNTV V DSSH+QENF  V GLRTPFC++PCSWLLPHHDHRNQQS + SCP G  QE I+SNS N AYTSKV VR E +H SL S EEK    
Subjt:  PSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEP

Query:  ALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAA
          N+  +LNEA + KDHT NTVGV V+ F                    EP S V+QD  S+DD   SSRT DD C+  ++KH P+I+   KKTIDAMAA
Subjt:  ALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAA

Query:  TEARRRRKELTKLKNLHTRQCRMN
        TEARRRRKELTKLKNLHTR CRM+
Subjt:  TEARRRRKELTKLKNLHTRQCRMN

TrEMBL top hitse value%identityAlignment
A0A6J1CU60 uncharacterized protein LOC111014317 isoform X28.4e-16865.97Show/hide
Query:  SSSNCSENTTCSGL---SSSSSMSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTKRKGKRSRTEVKTESA---FADSLPTRLDLELR
        +S+ CS+ ++CS L   SSSS  S++TAM AD MVKVEIE AEALADLAALA RE+G QPS+ KWRTK KGKR+R +VK+ES    F DSLP+R DL+ R
Subjt:  SSSNCSENTTCSGL---SSSSSMSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTKRKGKRSRTEVKTESA---FADSLPTRLDLELR

Query:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK
        I+DRGVVS  PSEK+C  QS  + +TTRKMLK +K E E  +V+P CTTSYPLFG R+S RNLTEAEKEERR+RRILANRESARQTIRRRQALCEELT+K
Subjt:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK

Query:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPY-HELHNVVVV
        AADLAWENENLKREKELALKEY SLETTNKELKEQ+AQAVKPK +EIPGNN  S +Q+PPLPTNYPLFL+ RPP+ASYFW     PSSPY HEL NVVV+
Subjt:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPY-HELHNVVVV

Query:  PPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEE
        P SI LP N+ V+  DSSHV ENF   NG  TPFC+LPCSWLLPHHD RNQQ P+VSC TG NQEDI  NS N  +TSKV VR E +H SL S EEK E 
Subjt:  PPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEE

Query:  PALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMA
               +  EALN K+H  N VGVAV GF                    E  SAV+QDNRSKDDH  S+R   DFC F ++KH  + +   KKTIDAM 
Subjt:  PALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMA

Query:  ATEARRRRKELTKLKNLHTRQCRMNS
        A EARRRRKELTKLKNLH RQC M+S
Subjt:  ATEARRRRKELTKLKNLHTRQCRMNS

A0A6J1F2W5 uncharacterized protein LOC111441650 isoform X16.2e-17969.33Show/hide
Query:  SSSSNCSENTTCSGLSSSSSMS-AFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR
        +SSS CSE T+CSGLSSSS+ S + ++M AD MVKVEIE AEALADLA LA R++G QPSE KWR K +KGKR+R EVKTE   SAF DSLP+R DL+LR
Subjt:  SSSSNCSENTTCSGLSSSSSMS-AFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR

Query:  I-QDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTR
        I QDRGV+SH PSEK+CA  SHPEW+TT++M+KA+K EAES ++      S+PLFG RRS RNLTEAEKEERRIRR+LANRESARQTIRRRQALCE+LT+
Subjt:  I-QDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTR

Query:  KAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVV
        KA+DLAWENENLKREKELALKEYQSLE TNKELKEQIA A +PK +EIPGNN  S VQ PPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH+LHNV VV
Subjt:  KAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVV

Query:  PPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEE
        PPS+R P+NNTV V DSSHVQENF  V GLRTPFC++PCSWLLPHHDHRNQQS + SCP G  QE I+SNS N AYTSKV VR E +H SL S EEK   
Subjt:  PPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEE

Query:  PALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMA
           N+  +LNEA + K+HT NTVGV V+ F                    EP S V+QD  S+DD   SSRT DD C+  ++KH P+I+   KKTIDAMA
Subjt:  PALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMA

Query:  ATEARRRRKELTKLKNLHTRQCRMN
        ATEARRRRKELTKLKNLHTR CRM+
Subjt:  ATEARRRRKELTKLKNLHTRQCRMN

A0A6J1F7T1 uncharacterized protein LOC111441650 isoform X22.5e-18069.47Show/hide
Query:  SSSSNCSENTTCSGLSSSSSMS-AFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR
        +SSS CSE T+CSGLSSSS+ S + ++M AD MVKVEIE AEALADLA LA R++G QPSE KWR K +KGKR+R EVKTE   SAF DSLP+R DL+LR
Subjt:  SSSSNCSENTTCSGLSSSSSMS-AFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR

Query:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK
        IQDRGV+SH PSEK+CA  SHPEW+TT++M+KA+K EAES ++      S+PLFG RRS RNLTEAEKEERRIRR+LANRESARQTIRRRQALCE+LT+K
Subjt:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK

Query:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVP
        A+DLAWENENLKREKELALKEYQSLE TNKELKEQIA A +PK +EIPGNN  S VQ PPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH+LHNV VVP
Subjt:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVP

Query:  PSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEP
        PS+R P+NNTV V DSSHVQENF  V GLRTPFC++PCSWLLPHHDHRNQQS + SCP G  QE I+SNS N AYTSKV VR E +H SL S EEK    
Subjt:  PSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEP

Query:  ALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAA
          N+  +LNEA + K+HT NTVGV V+ F                    EP S V+QD  S+DD   SSRT DD C+  ++KH P+I+   KKTIDAMAA
Subjt:  ALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAA

Query:  TEARRRRKELTKLKNLHTRQCRMN
        TEARRRRKELTKLKNLHTR CRM+
Subjt:  TEARRRRKELTKLKNLHTRQCRMN

A0A6J1J476 uncharacterized protein LOC111481617 isoform X22.6e-17768.51Show/hide
Query:  SSSSNCSENTTCSGLSSSSS-MSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR
        +SSS CSE T+CSGLSSSS+  S+ ++M AD MVKVEIE AEAL DLA LA R++G +PSE KWR K +KGKR+R EVKTE   SAF DSLP+R DL+LR
Subjt:  SSSSNCSENTTCSGLSSSSS-MSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR

Query:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK
        IQDRGV+SH PSEK+CA  SHPEW+TT++M+KA+K E ES ++      S+PLFG RR  RNLTEAEKEERRIRR+LANRESARQTIRRRQ LCE+LT+K
Subjt:  IQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRK

Query:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVP
        A+DLAWENENLKREKELALKEYQSLE TNKELKEQIAQA +PK +EIPGNN  S VQ PPLPTNYPLF FSRPPYASYFWPSVVQPSSPYH+LHNV VVP
Subjt:  AADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVP

Query:  PSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEP
        PS+R P+NNTV V DSSHVQENF  V GLRTPFC++PCSWLLPHHDHRNQQS + SCP G  QE I+SNS N AYTSKV VR E +  SL S EEK    
Subjt:  PSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEP

Query:  ALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAA
          N+  +LNEA + KDHT NTVGV V+ F                    EP S V+QD  S+DD   SSRT DD C+  ++KH P+ L   KKTIDAMAA
Subjt:  ALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAA

Query:  TEARRRRKELTKLKNLHTRQCRMN
        TEARRRRKELTKLKNLHTR CRM+
Subjt:  TEARRRRKELTKLKNLHTRQCRMN

A0A6J1J5U4 uncharacterized protein LOC111481617 isoform X16.4e-17668.38Show/hide
Query:  SSSSNCSENTTCSGLSSSSS-MSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR
        +SSS CSE T+CSGLSSSS+  S+ ++M AD MVKVEIE AEAL DLA LA R++G +PSE KWR K +KGKR+R EVKTE   SAF DSLP+R DL+LR
Subjt:  SSSSNCSENTTCSGLSSSSS-MSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTK-RKGKRSRTEVKTE---SAFADSLPTRLDLELR

Query:  I-QDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTR
        I QDRGV+SH PSEK+CA  SHPEW+TT++M+KA+K E ES ++      S+PLFG RR  RNLTEAEKEERRIRR+LANRESARQTIRRRQ LCE+LT+
Subjt:  I-QDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTR

Query:  KAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVV
        KA+DLAWENENLKREKELALKEYQSLE TNKELKEQIAQA +PK +EIPGNN  S VQ PPLPTNYPLF FSRPPYASYFWPSVVQPSSPYH+LHNV VV
Subjt:  KAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVV

Query:  PPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEE
        PPS+R P+NNTV V DSSHVQENF  V GLRTPFC++PCSWLLPHHDHRNQQS + SCP G  QE I+SNS N AYTSKV VR E +  SL S EEK   
Subjt:  PPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEE

Query:  PALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMA
           N+  +LNEA + KDHT NTVGV V+ F                    EP S V+QD  S+DD   SSRT DD C+  ++KH P+ L   KKTIDAMA
Subjt:  PALNKVPNLNEALNSKDHTHNTVGVAVEGF--------------------EPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMA

Query:  ATEARRRRKELTKLKNLHTRQCRMN
        ATEARRRRKELTKLKNLHTR CRM+
Subjt:  ATEARRRRKELTKLKNLHTRQCRMN

SwissProt top hitse value%identityAlignment
Q8TFU8 Transcriptional activator hacA2.7e-0630.11Show/hide
Query:  VKTESAFADSLPTRLDLELRIQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANR
        VK E AFA+SLPT   LE+ +            K+   Q+ PE K   K  K+  +E       P   T+ P        R  TE EKE+RRI R+L NR
Subjt:  VKTESAFADSLPTRLDLELRIQDRGVVSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANR

Query:  ESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQ---AVKPKEQEIPGNNIPSQVQVPPLPT
         +A+ +  R++   E+L  +  D+  +N+ L       L+    +E  N  L +Q+AQ    V+      P ++ P+ V     PT
Subjt:  ESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQ---AVKPKEQEIPGNNIPSQVQVPPLPT

Arabidopsis top hitse value%identityAlignment
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein4.7e-5437.23Show/hide
Query:  SSSNCSENTTCSGLSSSSSMSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTKRKGKRSRTEVKTESAFADSLPTRLDLELRIQDRGV
        SSS CS +++ SG  ++++ +             E+E AEALADLA LA        S   W +  KGKR R  VKTES  +DSL       L+  D   
Subjt:  SSSNCSENTTCSGLSSSSSMSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTKRKGKRSRTEVKTESAFADSLPTRLDLELRIQDRGV

Query:  VSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLF-----------GYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCE
        +  P   ++  ++   E +    + K   +     E+N    T  P+            G  RS +NL+EAE+EERRIRRILANRESARQTIRRRQA+CE
Subjt:  VSHPPSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLF-----------GYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCE

Query:  ELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHN
        EL++KAADL +ENENL+REK+ ALKE+QSLET NK LKEQ+ ++VKP  +E   +  PSQV++    T  P + +++ PY  + WP V Q S+P   + +
Subjt:  ELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHN

Query:  VVVVPPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGI--NQEDIHSNS--LNRAYTSKVDV-RVERKHCSL
         +  P S    A  T+T  +     EN    NG +T F ++PC W LP  DH N        P G+   Q    SN   ++ +    +DV    R H   
Subjt:  VVVVPPSIRLPANNTVTVYDSSHVQENFMTVNGLRTPFCMLPCSWLLPHHDHRNQQSPEVSCPTGI--NQEDIHSNS--LNRAYTSKVDV-RVERKHCSL

Query:  LSVEEKIEEPALNKVPNLNEALNSKDHTHNTVGVAVEGFEPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAATEARRRRKEL
           EE    P    + +LNE+            V  EG +     +Q    K + V  +    +        H   I  P KK   ++AA EAR+RRKEL
Subjt:  LSVEEKIEEPALNKVPNLNEALNSKDHTHNTVGVAVEGFEPPSAVEQDNRSKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAATEARRRRKEL

Query:  TKLKNLHTRQCRM
        T+LKNLH RQCRM
Subjt:  TKLKNLHTRQCRM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTTCTTCTTCTTCTTCCTCCAACTGCTCGGAAAACACCACTTGCTCTGGTTTGAGTTCTTCTTCCTCTATGTCGGCTTTTACGGCTATGCCGGCGGATCCGAT
GGTCAAGGTTGAGATTGAGGTGGCGGAGGCTCTCGCCGATTTGGCTGCTTTGGCGCCCAGGGAGAACGGACCTCAGCCCTCGGAACTCAAATGGCGGACTAAAAGAAAGG
GGAAGCGGTCCAGGACGGAGGTTAAGACCGAGTCTGCCTTTGCCGACTCTTTACCTACTCGCCTGGATCTGGAACTCAGGATTCAGGATAGAGGAGTCGTAAGTCATCCA
CCATCCGAAAAGGATTGTGCAATTCAGTCCCACCCCGAGTGGAAAACAACCAGAAAGATGTTAAAAGCAGACAAGGAGGAGGCTGAATCACTTGAAGTGAATCCTACATG
CACTACAAGCTACCCATTATTTGGCTACAGGAGGTCGAGCCGCAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAATAAGAAGGATTTTAGCGAATAGAGAGTCAGCCA
GACAAACAATTCGGCGTAGGCAGGCTCTGTGCGAGGAGTTAACCAGAAAGGCTGCAGATTTAGCATGGGAAAATGAAAATTTAAAGAGGGAAAAGGAATTGGCCCTGAAA
GAGTACCAATCTCTGGAGACTACTAACAAGGAATTAAAGGAGCAGATAGCTCAAGCAGTAAAGCCCAAGGAGCAGGAAATCCCAGGAAACAATATACCATCTCAAGTTCA
GGTGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGCCCTCCGTACGCATCGTATTTCTGGCCATCTGTGGTCCAGCCTTCAAGTCCTTATCATGAACTAC
ATAATGTCGTCGTCGTCCCTCCGAGTATTCGTTTGCCTGCGAATAATACCGTTACTGTGTACGACTCTTCCCATGTACAAGAAAACTTTATGACGGTCAATGGCCTGAGA
ACACCCTTTTGTATGCTACCTTGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATCAACAGAGTCCTGAAGTCTCGTGTCCCACGGGAATTAATCAAGAGGATATACA
TTCTAATTCTCTAAATAGGGCTTATACTTCAAAGGTGGATGTGCGTGTAGAAAGAAAACATTGCTCATTGCTTTCCGTGGAAGAAAAAATCGAAGAGCCAGCCTTGAACA
AAGTTCCTAATTTAAACGAAGCTTTGAATTCAAAGGATCATACTCATAACACAGTTGGAGTAGCTGTGGAGGGATTCGAACCGCCATCAGCTGTCGAACAAGATAACCGA
AGCAAAGATGATCACGTTCGGTCATCAAGAACTCGTGATGACTTCTGTAATTTTACAGACAGAAAGCATGGACCAGACATTCTTTTCCCCTATAAGAAAACCATAGATGC
AATGGCTGCAACTGAGGCAAGGAGGAGGAGAAAAGAACTAACAAAGTTAAAGAATCTTCACACCCGTCAATGCCGTATGAATTCTTGA
mRNA sequenceShow/hide mRNA sequence
GTTTCTGTTTCTTATCTCTCTGGTTTTTTCTTTCTTTCTCTTTTGCTCTCTCATGGCCGCTTCTTCTTCTTCTTCCTCCAACTGCTCGGAAAACACCACTTGCTCTGGTT
TGAGTTCTTCTTCCTCTATGTCGGCTTTTACGGCTATGCCGGCGGATCCGATGGTCAAGGTTGAGATTGAGGTGGCGGAGGCTCTCGCCGATTTGGCTGCTTTGGCGCCC
AGGGAGAACGGACCTCAGCCCTCGGAACTCAAATGGCGGACTAAAAGAAAGGGGAAGCGGTCCAGGACGGAGGTTAAGACCGAGTCTGCCTTTGCCGACTCTTTACCTAC
TCGCCTGGATCTGGAACTCAGGATTCAGGATAGAGGAGTCGTAAGTCATCCACCATCCGAAAAGGATTGTGCAATTCAGTCCCACCCCGAGTGGAAAACAACCAGAAAGA
TGTTAAAAGCAGACAAGGAGGAGGCTGAATCACTTGAAGTGAATCCTACATGCACTACAAGCTACCCATTATTTGGCTACAGGAGGTCGAGCCGCAATCTAACTGAGGCT
GAAAAGGAAGAAAGGAGAATAAGAAGGATTTTAGCGAATAGAGAGTCAGCCAGACAAACAATTCGGCGTAGGCAGGCTCTGTGCGAGGAGTTAACCAGAAAGGCTGCAGA
TTTAGCATGGGAAAATGAAAATTTAAAGAGGGAAAAGGAATTGGCCCTGAAAGAGTACCAATCTCTGGAGACTACTAACAAGGAATTAAAGGAGCAGATAGCTCAAGCAG
TAAAGCCCAAGGAGCAGGAAATCCCAGGAAACAATATACCATCTCAAGTTCAGGTGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGCCCTCCGTACGCA
TCGTATTTCTGGCCATCTGTGGTCCAGCCTTCAAGTCCTTATCATGAACTACATAATGTCGTCGTCGTCCCTCCGAGTATTCGTTTGCCTGCGAATAATACCGTTACTGT
GTACGACTCTTCCCATGTACAAGAAAACTTTATGACGGTCAATGGCCTGAGAACACCCTTTTGTATGCTACCTTGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATC
AACAGAGTCCTGAAGTCTCGTGTCCCACGGGAATTAATCAAGAGGATATACATTCTAATTCTCTAAATAGGGCTTATACTTCAAAGGTGGATGTGCGTGTAGAAAGAAAA
CATTGCTCATTGCTTTCCGTGGAAGAAAAAATCGAAGAGCCAGCCTTGAACAAAGTTCCTAATTTAAACGAAGCTTTGAATTCAAAGGATCATACTCATAACACAGTTGG
AGTAGCTGTGGAGGGATTCGAACCGCCATCAGCTGTCGAACAAGATAACCGAAGCAAAGATGATCACGTTCGGTCATCAAGAACTCGTGATGACTTCTGTAATTTTACAG
ACAGAAAGCATGGACCAGACATTCTTTTCCCCTATAAGAAAACCATAGATGCAATGGCTGCAACTGAGGCAAGGAGGAGGAGAAAAGAACTAACAAAGTTAAAGAATCTT
CACACCCGTCAATGCCGTATGAATTCTTGATCTCTATATGGCTGAAGGGTTTGACACCTATAATTTTATTTGTCAAGTTTTGTATTTCACTGGCTTTTGTTGCCAGAGGC
AAGCACATGGCTTCCCTTTTGAGGCATTTCTTTGCTTTTGTTTTCGAGCTCAGTGCGCTGCGAGATCTTGCTGCTAGTCGGGGCGCTTTTTCGGGTCGGCTCTGATGAAT
TTACTAACAAATGAAATGAGATATATTTGGGACTGAGATGCTGATTACTTTGAAGAGCTTTGAGTTAAATTTGGAAGGGCTGCAATGAGATTTGGATAATGTAAGAATTT
TGTACTTAATATCTTGTATGTCTACTCAAAGACCCTTTATTGGGTTGGCTACAGTTCTTCAATAAAGTTAGTAAAATAACACATAGAAGTTGTGCAAAGATAGCTGAATT
TAACCAAGGAAATGCTTCATACAAATTAGAGTTTTGTTTCAACGACTTGTGCC
Protein sequenceShow/hide protein sequence
MAASSSSSSNCSENTTCSGLSSSSSMSAFTAMPADPMVKVEIEVAEALADLAALAPRENGPQPSELKWRTKRKGKRSRTEVKTESAFADSLPTRLDLELRIQDRGVVSHP
PSEKDCAIQSHPEWKTTRKMLKADKEEAESLEVNPTCTTSYPLFGYRRSSRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALK
EYQSLETTNKELKEQIAQAVKPKEQEIPGNNIPSQVQVPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHELHNVVVVPPSIRLPANNTVTVYDSSHVQENFMTVNGLR
TPFCMLPCSWLLPHHDHRNQQSPEVSCPTGINQEDIHSNSLNRAYTSKVDVRVERKHCSLLSVEEKIEEPALNKVPNLNEALNSKDHTHNTVGVAVEGFEPPSAVEQDNR
SKDDHVRSSRTRDDFCNFTDRKHGPDILFPYKKTIDAMAATEARRRRKELTKLKNLHTRQCRMNS