; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029068 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029068
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationchr8:34935961..34937313
RNA-Seq ExpressionLag0029068
SyntenyLag0029068
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054953.1 putative transcription factor [Cucumis melo var. makuwa]5.2e-23291.11Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++G GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE
        ENF PH DNRRSLGVLGGSVKR++R G D DD HACGNSLS LDCNKSSH + Q+ FAQADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLE
Subjt:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH
        LEKQKFKWER NKKKDRELEKMRMVNEKMKLENER+ALDLKQK+IGSGFH
Subjt:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH

XP_008441519.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis melo]2.6e-23190.89Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++G GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE
        ENF PH DNRRSLGVLGGSVKR++R G D DD HACGNSLS LDCNKSSH + Q+ FAQADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLE
Subjt:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH
        LEKQKFKWER NK KDRELEKMRMVNEKMKLENER+ALDLKQK+IGSGFH
Subjt:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH

XP_023551421.1 uncharacterized protein LOC111809238 [Cucurbita pepo subsp. pepo]2.5e-22690.22Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVH+QAQHSHALHQQHHPHTRQGS+ANPSIQEGF LSMG VQNCDH MSLVDYNKGERCKNSASDEEPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNETSKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD +G GRRK  IIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSC+VVENPALLDV++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD+NEH ETDE DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE
        ENFAPHGD+RRS GVLGGSVKR+RR   D DD HACG SL       SSHA+ Q+ FAQADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLE
Subjt:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH
        LEKQKFKWER NKKKDRELEKMRMVNE+MKLENERIALDLKQKEIGSGFH
Subjt:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH

XP_031743106.1 uncharacterized protein LOC105435760 [Cucumis sativus]2.2e-23090.67Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ Q SHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++GGGRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE
        ENF PH DNRRSLGVLGGSVKR++R G D DD HACGNSLS LDCNKSSH + Q+ F QADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLE
Subjt:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH
        LEKQKFKWER NKKKDRELEKMRMVNE+MKLENER+ALDLKQK+IGSGFH
Subjt:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH

XP_038885368.1 uncharacterized protein LOC120075776 [Benincasa hispida]6.2e-23391.11Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVHNQ QHSHALHQ HHPHTRQGSSANPSIQEGF LSMG V NCDHTM LV+YNKGERCKNSASDEEPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDI SDL+GGGR+K QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD+ EH ETDEHDDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE
        ENF PH DNRRSLGVLGGSVKR++R G D DD HACGNSLSSLDCNKSSH + Q+PFAQADTAHLETESMK STSQKQWMELRLLQLE+QKLQIQVEMLE
Subjt:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH
        LEKQKFKW+R NKKKDRELE MRMVNE+MKLEN+R+ALDLKQK+IGSGFH
Subjt:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH

TrEMBL top hitse value%identityAlignment
A0A0A0KBC2 Uncharacterized protein1.1e-23090.67Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ Q SHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++GGGRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE
        ENF PH DNRRSLGVLGGSVKR++R G D DD HACGNSLS LDCNKSSH + Q+ F QADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLE
Subjt:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH
        LEKQKFKWER NKKKDRELEKMRMVNE+MKLENER+ALDLKQK+IGSGFH
Subjt:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH

A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034856201.3e-23190.89Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++G GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE
        ENF PH DNRRSLGVLGGSVKR++R G D DD HACGNSLS LDCNKSSH + Q+ FAQADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLE
Subjt:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH
        LEKQKFKWER NK KDRELEKMRMVNEKMKLENER+ALDLKQK+IGSGFH
Subjt:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH

A0A5D3DGK7 Putative transcription factor2.5e-23291.11Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++G GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE
        ENF PH DNRRSLGVLGGSVKR++R G D DD HACGNSLS LDCNKSSH + Q+ FAQADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLE
Subjt:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH
        LEKQKFKWER NKKKDRELEKMRMVNEKMKLENER+ALDLKQK+IGSGFH
Subjt:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH

A0A6J1FFN2 uncharacterized protein LOC1114452941.8e-22590.22Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVH+QAQHSHALHQQHHPHTRQGS+ANPSIQEGF LSMG VQNCDH MSLVDYNKGERCKNSASDEEPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNETSKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD +G GRRK Q IQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSC+VVENPALLD+++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD+NEH ETDE DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE
        ENFAPHGDNRRS GVLGGSVKR+RR   D DD HACG SL       SSHA+ Q+ FAQADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLE
Subjt:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH
        LEKQKFKWER NKKKDRELEKMRMVNE+MKLENERIALDLKQKEIGSGFH
Subjt:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH

A0A6J1JU49 uncharacterized protein LOC1114897531.3e-22590Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQ  FKVH+QAQHSHALHQQHHPHTRQGS+ANPSIQEGF LSMG VQNCDH MSLVD+NKGERCKNSASDEEPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNETSKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD +G GRRK  IIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSC+VVENPALLDV++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD+NEH ETDE DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE
        ENFA HGDNRRS GVLGGSVKR+RR G D DD HACG SL       SSHA+ Q+ FAQADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLE
Subjt:  ENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH
        LEKQKFKWER NKKKDRELEKMRMVNE+MKLENERIALDLKQKEIGSGFH
Subjt:  LEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGSGFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.5e-13960.88Show/hide
Query:  MEGNLSQGGLIPGGA-SYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDH----TMSLVDYNKGERCKNSAS-DEEP
        M+GN  QGG++  GA SYGG DLQGS +VH    H  +++QQH    R   ++ P + EG   +M   Q CDH     MS+ +  K ER KNS S D+EP
Subjt:  MEGNLSQGGLIPGGA-SYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDH----TMSLVDYNKGERCKNSAS-DEEP

Query:  SFTEDGIDG-HNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKR
        SFTE+G DG HNE ++  KGS W RVKWTDKMVKLLITAVSYIGDD  S ++   RRKF ++QKKGKWK +SKVMAERGY VSPQQCEDKFNDLNKRYK+
Subjt:  SFTEDGIDG-HNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKR

Query:  LNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEH-AE
        LND++GRGTSCQVVENPALLD I YL DKEKDDVRKI++SK LFYEEMCSYHN NRLHLPHD ALQRSLQLA R+RDDHDND+ R+HQ +D DD +H  +
Subjt:  LNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEH-AE

Query:  TDEHDDFEENFAPHGDNR-RSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQK
         DEHD++EE    +GD R    G  GG +K++R     +D  H   + ++SL+CNK S    Q PF+QAD      ES +  + QKQWME R LQLE+QK
Subjt:  TDEHDDFEENFAPHGDNR-RSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQK

Query:  LQIQVEMLELEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIG
        LQIQVE+LELEKQ+F+W+R +KK+D+ELE+MRM NE+MKLEN+R+ L+LKQ+E+G
Subjt:  LQIQVEMLELEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)3.0e-9248.11Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGN SQG      +S    DL+ +    NQ        +QHHP++RQ S  N                  +TM    +N  +R K S S+++       
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DG N   K K+ S W RVKW DKMVKL+ITA+SYIG+D  SD      +KF ++QKKGKW+ +SKVM ERGY VSPQQCEDKFNDLNKRYK+LN+++GR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDF
        GTSC+VVENP+LLD IDYL +KEKD+VR+I++SK LFYEEMCSYHN NRLHLPHDPA+QRSL L    +RDDHDNDE  +HQN+D DD+        DD+
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDF

Query:  EENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAH-LETESMKPSTSQKQWMELRLLQLEDQKLQIQVEM
        EE+      +R         +KR+R+    +D  H   N    + C          P +QAD    +  +S K +  Q+Q +E + L+LE +KLQIQ EM
Subjt:  EENFAPHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAH-LETESMKPSTSQKQWMELRLLQLEDQKLQIQVEM

Query:  LELEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGS
        +ELE+Q+FKWE  +K+++++L KMRM NE+MKLENER++L+LK+ E+G+
Subjt:  LELEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEIGS

AT3G10040.1 sequence-specific DNA binding transcription factors3.7e-5033.94Show/hide
Query:  NQAQHSHALHQQH-HPHTRQG--------SSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKKGSMWHRVK
        N     +++  QH HP+T  G         S  P   +  ++S  +   CD      D ++G    +  + E+ +    G DG       +K S WHR+K
Subjt:  NQAQHSHALHQQH-HPHTRQG--------SSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKKGSMWHRVK

Query:  WTDKMVKLLITAVSYIGDDIA----------SDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENP
        WTD MV+LLI AV YIGD+            +   GGG     ++QKKGKWK +S+ M E+G+ VSPQQCEDKFNDLNKRYKR+NDI+G+G +C+VVEN 
Subjt:  WTDKMVKLLITAVSYIGDDIA----------SDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENP

Query:  ALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQRSLQLAFRARDD---HDNDEPRRHQ-NDDFDDNEHAETDEHDDFE-ENF
         LL+ +D+LT K KD+V+K+LNSK LF+ EMC+YHNS      HD  P  Q  + +   ++     H  +  +  +  +  +  E  E+D  +D E E  
Subjt:  ALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQRSLQLAFRARDD---HDNDEPRRHQ-NDDFDDNEHAETDEHDDFE-ENF

Query:  APHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLELEK
            +  R    +  +VKR+R                                    + A +  +  K    +K+W+  ++L++E++K+  + E +E+EK
Subjt:  APHGDNRRSLGVLGGSVKRVRRGGHDQDDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLELEK

Query:  QKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEI
        Q+ KW R   KK+RE+EK ++ N++ +LE ER+ L L++ EI
Subjt:  QKFKWERINKKKDRELEKMRMVNEKMKLENERIALDLKQKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCTCGATTTGCAAGGATCGTTTAAAGTTCATAATCAGGCACAACACTCTCACGC
CTTACATCAGCAACATCATCCTCATACTCGTCAGGGATCTTCAGCCAATCCTTCCATTCAGGAGGGATTTAAACTTTCCATGGGAGCTGTGCAAAATTGTGACCATACCA
TGTCTTTGGTAGATTATAACAAGGGAGAAAGGTGTAAAAACTCGGCTAGTGACGAAGAGCCGAGCTTTACTGAGGATGGTATTGATGGTCATAATGAGACTAGTAAGGGG
AAGAAGGGATCGATGTGGCATCGCGTAAAATGGACGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATCTTGAAGGGGG
TGGAAGAAGGAAATTTCAGATTATACAGAAGAAAGGTAAATGGAAATTGATATCAAAGGTCATGGCTGAAAGGGGTTATCAAGTTTCACCTCAGCAGTGTGAGGATAAAT
TTAATGACCTCAATAAGAGGTACAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGCCAGGTTGTTGAAAACCCTGCACTTCTTGATGTTATTGATTATTTAACT
GACAAAGAAAAGGATGATGTGAGAAAAATTTTAAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAATTCAAATCGACTCCATTTGCCCCATGATCCTGC
TTTGCAGCGTTCTTTGCAGTTGGCTTTTAGAGCAAGGGATGATCACGATAATGATGAGCCAAGGAGACACCAAAATGATGATTTTGATGATAACGAACATGCTGAAACTG
ATGAACATGATGATTTTGAGGAGAATTTTGCACCCCATGGGGACAATAGGCGATCACTTGGGGTATTAGGAGGCTCAGTGAAGAGGGTAAGACGAGGAGGCCACGACCAA
GATGATGTCCATGCCTGTGGCAATTCCTTGAGTTCTCTTGATTGCAACAAAAGTTCTCATGCTTACTTACAATCACCATTTGCTCAAGCCGATACAGCTCACTTAGAAAC
TGAAAGTATGAAACCTTCTACATCACAAAAGCAATGGATGGAGCTTCGCTTACTTCAGTTGGAAGACCAGAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAAC
AGAAATTCAAGTGGGAGAGAATTAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTAAATGAGAAGATGAAGCTTGAAAATGAGCGCATTGCACTTGACTTA
AAGCAAAAGGAAATTGGATCCGGATTTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCTCGATTTGCAAGGATCGTTTAAAGTTCATAATCAGGCACAACACTCTCACGC
CTTACATCAGCAACATCATCCTCATACTCGTCAGGGATCTTCAGCCAATCCTTCCATTCAGGAGGGATTTAAACTTTCCATGGGAGCTGTGCAAAATTGTGACCATACCA
TGTCTTTGGTAGATTATAACAAGGGAGAAAGGTGTAAAAACTCGGCTAGTGACGAAGAGCCGAGCTTTACTGAGGATGGTATTGATGGTCATAATGAGACTAGTAAGGGG
AAGAAGGGATCGATGTGGCATCGCGTAAAATGGACGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATCTTGAAGGGGG
TGGAAGAAGGAAATTTCAGATTATACAGAAGAAAGGTAAATGGAAATTGATATCAAAGGTCATGGCTGAAAGGGGTTATCAAGTTTCACCTCAGCAGTGTGAGGATAAAT
TTAATGACCTCAATAAGAGGTACAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGCCAGGTTGTTGAAAACCCTGCACTTCTTGATGTTATTGATTATTTAACT
GACAAAGAAAAGGATGATGTGAGAAAAATTTTAAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAATTCAAATCGACTCCATTTGCCCCATGATCCTGC
TTTGCAGCGTTCTTTGCAGTTGGCTTTTAGAGCAAGGGATGATCACGATAATGATGAGCCAAGGAGACACCAAAATGATGATTTTGATGATAACGAACATGCTGAAACTG
ATGAACATGATGATTTTGAGGAGAATTTTGCACCCCATGGGGACAATAGGCGATCACTTGGGGTATTAGGAGGCTCAGTGAAGAGGGTAAGACGAGGAGGCCACGACCAA
GATGATGTCCATGCCTGTGGCAATTCCTTGAGTTCTCTTGATTGCAACAAAAGTTCTCATGCTTACTTACAATCACCATTTGCTCAAGCCGATACAGCTCACTTAGAAAC
TGAAAGTATGAAACCTTCTACATCACAAAAGCAATGGATGGAGCTTCGCTTACTTCAGTTGGAAGACCAGAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAAC
AGAAATTCAAGTGGGAGAGAATTAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTAAATGAGAAGATGAAGCTTGAAAATGAGCGCATTGCACTTGACTTA
AAGCAAAAGGAAATTGGATCCGGATTTCATTAA
Protein sequenceShow/hide protein sequence
MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKG
KKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLT
DKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFEENFAPHGDNRRSLGVLGGSVKRVRRGGHDQ
DDVHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERINKKKDRELEKMRMVNEKMKLENERIALDL
KQKEIGSGFH