; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg008788 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg008788
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationscaffold10:36657207..36658556
RNA-Seq ExpressionSpg008788
SyntenySpg008788
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054953.1 putative transcription factor [Cucumis melo var. makuwa]1.6e-23391.09Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHN  QHSHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++G GRRK QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH DNRRSLGVLGGSVKRL+RG D DDAHACGNSLS LDCNKSSH + Q+ FAQADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNE+M+LENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH

XP_008441519.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis melo]8.0e-23390.87Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHN  QHSHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++G GRRK QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH DNRRSLGVLGGSVKRL+RG D DDAHACGNSLS LDCNKSSH + Q+ FAQADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH
        EKQKFKWERFNK KDRELEKMRMVNE+M+LENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH

XP_023551421.1 uncharacterized protein LOC111809238 [Cucurbita pepo subsp. pepo]4.6e-22890.42Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVH+ AQHSHALHQQHHPHTRQGS+ANPSIQEGF LSMG VQNCDH MSLVDYNKGERCKNSASDEEPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNETSKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD +G GRRK  IIQKKGKWKL+SKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSC+VVENPALLDV++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD+NEH ETDE DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFAPHGD+RRS GVLGGSVKRLRR  D DD HACG SL       SSHA+ Q+ FAQADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERM+LENERIALDLKQKEIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH

XP_031743106.1 uncharacterized protein LOC105435760 [Cucumis sativus]1.4e-23291.09Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHN  Q SHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++GGGRRK QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH DNRRSLGVLGGSVKRL+RG D DDAHACGNSLS LDCNKSSH + Q+ F QADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERM+LENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH

XP_038885368.1 uncharacterized protein LOC120075776 [Benincasa hispida]3.8e-23591.54Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVHN  QHSHALHQ HHPHTRQGSSANPSIQEGF LSMG V NCDHTM LV+YNKGERCKNSASDEEPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDI SDL+GGGR+K QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD+ EH ETDEHDDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH DNRRSLGVLGGSVKRL+RG D DDAHACGNSLSSLDCNKSSH + Q+PFAQADTAHLETESMK STSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH
        EKQKFKW+RFNKKKDRELE MRMVNERM+LEN+R+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH

TrEMBL top hitse value%identityAlignment
A0A0A0KBC2 Uncharacterized protein6.6e-23391.09Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHN  Q SHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++GGGRRK QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH DNRRSLGVLGGSVKRL+RG D DDAHACGNSLS LDCNKSSH + Q+ F QADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERM+LENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH

A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034856203.9e-23390.87Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHN  QHSHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++G GRRK QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH DNRRSLGVLGGSVKRL+RG D DDAHACGNSLS LDCNKSSH + Q+ FAQADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH
        EKQKFKWERFNK KDRELEKMRMVNE+M+LENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH

A0A5D3DGK7 Putative transcription factor7.8e-23491.09Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHN  QHSHALHQQHHPHTRQGSSANPSIQEGF LSMG VQNCDHTMSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD++G GRRK QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD++E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH DNRRSLGVLGGSVKRL+RG D DDAHACGNSLS LDCNKSSH + Q+ FAQADTAHLETESMK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNE+M+LENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH

A0A6J1FFN2 uncharacterized protein LOC1114452944.2e-22790.42Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVH+ AQHSHALHQQHHPHTRQGS+ANPSIQEGF LSMG VQNCDH MSLVDYNKGERCKNSASDEEPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNETSKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD +G GRRK Q IQKKGKWKL+SKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSC+VVENPALLD+++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD+NEH ETDE DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFAPHGDNRRS GVLGGSVKRLRR  D DD HACG SL       SSHA+ Q+ FAQADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERM+LENERIALDLKQKEIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH

A0A6J1JU49 uncharacterized protein LOC1114897532.4e-22790.2Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQ  FKVH+ AQHSHALHQQHHPHTRQGS+ANPSIQEGF LSMG VQNCDH MSLVD+NKGERCKNSASDEEPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDG

Query:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNETSKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD +G GRRK  IIQKKGKWKL+SKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE
        GTSC+VVENPALLDV++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFD+NEH ETDE DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFA HGDNRRS GVLGGSVKRLRRG D DD HACG SL       SSHA+ Q+ FAQADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERM+LENERIALDLKQKEIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIGSGFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.1e-13961.01Show/hide
Query:  MEGNLSQGGLIPGGA-SYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDH----TMSLVDYNKGERCKNSAS-DEEP
        M+GN  QGG++  GA SYGG DLQGS +VH    H  +++QQH    R   ++ P + EG   +M   Q CDH     MS+ +  K ER KNS S D+EP
Subjt:  MEGNLSQGGLIPGGA-SYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDH----TMSLVDYNKGERCKNSAS-DEEP

Query:  SFTEDGIDG-HNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKR
        SFTE+G DG HNE ++  KGS W RVKWTDKMVKLLITAVSYIGDD  S ++   RRKF ++QKKGKWK +SKVMAERGY VSPQQCEDKFNDLNKRYK+
Subjt:  SFTEDGIDG-HNETSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKR

Query:  LNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEH-AE
        LND++GRGTSCQVVENPALLD I YL DKEKDDVRKI++SK LFYEEMCSYHN NRLHLPHD ALQRSLQLA R+RDDHDND+ R+HQ +D DD +H  +
Subjt:  LNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEH-AE

Query:  TDEHDDFEENFAPHGDNR-RSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKL
         DEHD++EE    +GD R    G  GG +K++R     +D     + ++SL+CNK S    Q PF+QAD      ES +  + QKQWME R LQLE+QKL
Subjt:  TDEHDDFEENFAPHGDNR-RSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKL

Query:  QIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIG
        QIQVE+LELEKQ+F+W+RF+KK+D+ELE+MRM NERM+LEN+R+ L+LKQ+E+G
Subjt:  QIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEIG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)5.0e-9248.39Show/hide
Query:  GGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKK
        G  S G  D Q S          +   +QHHP++RQ S  N                  +TM    +N  +R K S S+++        DG N   K K+
Subjt:  GGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKK

Query:  GSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPAL
         S W RVKW DKMVKL+ITA+SYIG+D  SD      +KF ++QKKGKW+ +SKVM ERGY VSPQQCEDKFNDLNKRYK+LN+++GRGTSC+VVENP+L
Subjt:  GSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPAL

Query:  LDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFEENFAPHGDNRR
        LD IDYL +KEKD+VR+I++SK LFYEEMCSYHN NRLHLPHDPA+QRSL L    +RDDHDNDE  +HQN+D DD+        DD+EE+      +R 
Subjt:  LDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFEENFAPHGDNRR

Query:  SLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAH-LETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERF
                +KRLR+    +D     N    + C          P +QAD    +  +S K +  Q+Q +E + L+LE +KLQIQ EM+ELE+Q+FKWE F
Subjt:  SLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAH-LETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERF

Query:  NKKKDRELEKMRMVNERMRLENERIALDLKQKEIGS
        +K+++++L KMRM NERM+LENER++L+LK+ E+G+
Subjt:  NKKKDRELEKMRMVNERMRLENERIALDLKQKEIGS

AT3G10040.1 sequence-specific DNA binding transcription factors1.0e-5234.92Show/hide
Query:  NPAQHSHALHQQH-HPHTRQG--------SSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKKGSMWHRVK
        NP    +++  QH HP+T  G         S  P   +  ++S  +   CD      D ++G    +  + E+ +    G DG       +K S WHR+K
Subjt:  NPAQHSHALHQQH-HPHTRQG--------SSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKKGSMWHRVK

Query:  WTDKMVKLLITAVSYIGDDIA----------SDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENP
        WTD MV+LLI AV YIGD+            +   GGG     ++QKKGKWK +S+ M E+G+ VSPQQCEDKFNDLNKRYKR+NDI+G+G +C+VVEN 
Subjt:  WTDKMVKLLITAVSYIGDDIA----------SDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENP

Query:  ALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQRSLQLAFRARDD---HDNDEPRRHQ-NDDFDDNEHAETDEHDDFE-ENF
         LL+ +D+LT K KD+V+K+LNSK LF+ EMC+YHNS      HD  P  Q  + +   ++     H  +  +  +  +  +  E  E+D  +D E E  
Subjt:  ALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQRSLQLAFRARDD---HDNDEPRRHQ-NDDFDDNEHAETDEHDDFE-ENF

Query:  APHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQ
            +  R    +  +VKRLR                                   + A +  +  K    +K+W+  ++L++E++K+  + E +E+EKQ
Subjt:  APHGDNRRSLGVLGGSVKRLRRGHDQDDAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQ

Query:  KFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEI
        + KW R+  KK+RE+EK ++ N+R RLE ER+ L L++ EI
Subjt:  KFKWERFNKKKDRELEKMRMVNERMRLENERIALDLKQKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCTCGATTTGCAAGGATCGTTTAAAGTTCATAATCCGGCACAACACTCTCACGC
CTTACATCAGCAACATCATCCTCATACTCGTCAGGGATCTTCAGCCAATCCGTCCATTCAGGAGGGATTTAAACTTTCCATGGGAGCTGTGCAAAATTGTGACCATACCA
TGTCTTTGGTAGATTATAACAAGGGAGAAAGGTGTAAAAACTCAGCTAGTGACGAAGAGCCGAGCTTTACTGAGGATGGTATTGATGGTCATAATGAGACTAGTAAGGGG
AAGAAGGGATCGATGTGGCATCGCGTGAAATGGACGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATCTTGAAGGGGG
TGGAAGAAGGAAATTTCAGATTATACAGAAGAAAGGTAAATGGAAATTGCTATCAAAGGTCATGGCTGAAAGGGGTTATCAAGTTTCACCTCAGCAGTGTGAGGATAAAT
TTAATGACCTCAATAAGAGGTACAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGCCAGGTTGTTGAAAACCCTGCACTTCTTGATGTTATTGATTATTTAACT
GACAAAGAAAAGGATGATGTGAGAAAAATTTTAAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAATTCGAATCGACTCCATCTGCCCCATGATCCTGC
TTTGCAGCGTTCTTTGCAGTTGGCTTTTAGAGCAAGGGATGATCATGATAATGATGAGCCAAGGAGACACCAAAATGATGATTTTGATGATAACGAACATGCTGAAACTG
ATGAACATGATGATTTTGAAGAGAATTTTGCACCCCATGGGGACAATAGGCGATCACTTGGGGTATTAGGAGGCTCAGTGAAGAGGCTACGACGAGGCCACGACCAAGAT
GATGCCCATGCCTGTGGCAATTCCTTGAGTTCTCTTGATTGCAACAAAAGTTCCCATGCTTACTTACAATCACCATTTGCTCAAGCCGATACAGCTCACTTAGAAACTGA
AAGTATGAAACCTTCTACATCACAAAAGCAATGGATGGAGCTTCGCTTACTTCAGTTGGAAGACCAGAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAACAGA
AATTCAAGTGGGAGAGATTTAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTAAATGAGAGGATGAGGCTTGAAAATGAGCGCATTGCACTTGACTTAAAG
CAAAAGGAAATTGGATCCGGATTTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCTCGATTTGCAAGGATCGTTTAAAGTTCATAATCCGGCACAACACTCTCACGC
CTTACATCAGCAACATCATCCTCATACTCGTCAGGGATCTTCAGCCAATCCGTCCATTCAGGAGGGATTTAAACTTTCCATGGGAGCTGTGCAAAATTGTGACCATACCA
TGTCTTTGGTAGATTATAACAAGGGAGAAAGGTGTAAAAACTCAGCTAGTGACGAAGAGCCGAGCTTTACTGAGGATGGTATTGATGGTCATAATGAGACTAGTAAGGGG
AAGAAGGGATCGATGTGGCATCGCGTGAAATGGACGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATCTTGAAGGGGG
TGGAAGAAGGAAATTTCAGATTATACAGAAGAAAGGTAAATGGAAATTGCTATCAAAGGTCATGGCTGAAAGGGGTTATCAAGTTTCACCTCAGCAGTGTGAGGATAAAT
TTAATGACCTCAATAAGAGGTACAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGCCAGGTTGTTGAAAACCCTGCACTTCTTGATGTTATTGATTATTTAACT
GACAAAGAAAAGGATGATGTGAGAAAAATTTTAAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAATTCGAATCGACTCCATCTGCCCCATGATCCTGC
TTTGCAGCGTTCTTTGCAGTTGGCTTTTAGAGCAAGGGATGATCATGATAATGATGAGCCAAGGAGACACCAAAATGATGATTTTGATGATAACGAACATGCTGAAACTG
ATGAACATGATGATTTTGAAGAGAATTTTGCACCCCATGGGGACAATAGGCGATCACTTGGGGTATTAGGAGGCTCAGTGAAGAGGCTACGACGAGGCCACGACCAAGAT
GATGCCCATGCCTGTGGCAATTCCTTGAGTTCTCTTGATTGCAACAAAAGTTCCCATGCTTACTTACAATCACCATTTGCTCAAGCCGATACAGCTCACTTAGAAACTGA
AAGTATGAAACCTTCTACATCACAAAAGCAATGGATGGAGCTTCGCTTACTTCAGTTGGAAGACCAGAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAACAGA
AATTCAAGTGGGAGAGATTTAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTAAATGAGAGGATGAGGCTTGAAAATGAGCGCATTGCACTTGACTTAAAG
CAAAAGGAAATTGGATCCGGATTTCATTAA
Protein sequenceShow/hide protein sequence
MEGNLSQGGLIPGGASYGGLDLQGSFKVHNPAQHSHALHQQHHPHTRQGSSANPSIQEGFKLSMGAVQNCDHTMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKG
KKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLEGGGRRKFQIIQKKGKWKLLSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLT
DKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDDNEHAETDEHDDFEENFAPHGDNRRSLGVLGGSVKRLRRGHDQD
DAHACGNSLSSLDCNKSSHAYLQSPFAQADTAHLETESMKPSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMRLENERIALDLK
QKEIGSGFH