; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012467 (gene) of Snake gourd v1 genome

Gene IDTan0012467
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationLG11:9028826..9031165
RNA-Seq ExpressionTan0012467
SyntenyTan0012467
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054953.1 putative transcription factor [Cucumis melo var. makuwa]6.1e-23390.65Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHH HTRQGSSANPS+QEGFSLSMG VQNCDHTMSLV+YNKGERCKNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKW DKMVKLLITAVSYIGDDIASD++GSGRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRA+DDHDNDEPRRHQ+DDFDE+E GETDEHD++E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL
        ENF PH DNRRSLGVLGGSVKRL+RGQDHDDAHACGNSL+ LDCNKS H +SQ QFAQADTAH+ETESMKASTSQKQWMELR+LQ+EDQKLQIQVE LEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNE+MKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH

XP_008441519.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis melo]3.0e-23290.42Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHH HTRQGSSANPS+QEGFSLSMG VQNCDHTMSLV+YNKGERCKNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKW DKMVKLLITAVSYIGDDIASD++GSGRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRA+DDHDNDEPRRHQ+DDFDE+E GETDEHD++E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL
        ENF PH DNRRSLGVLGGSVKRL+RGQDHDDAHACGNSL+ LDCNKS H +SQ QFAQADTAH+ETESMKASTSQKQWMELR+LQ+EDQKLQIQVE LEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQKFKWERFNK KDRELEKMRMVNE+MKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH

XP_023551421.1 uncharacterized protein LOC111809238 [Cucurbita pepo subsp. pepo]7.3e-22689.09Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVH+QAQHSHALHQQHH HTRQGS+ANPS+QEGFSLSMG VQNCDH MSLVDYNKGERCKNS SD+EPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKW DKMVKLLITAVSYIGDDI SD +G+GRRK  IIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE
        GTSC+VVENPALLDV++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRA+DDHDNDEPRRHQ+DDFDENEH ETDE D+FE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL
        ENFAPHGD+RRS GVLGGSVKRLRR QDHDD HACG SL+S       HA++Q QFAQADTAH+ETE MK STSQKQWMELR+LQ+EDQKLQIQVE LEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH

XP_031743106.1 uncharacterized protein LOC105435760 [Cucumis sativus]2.9e-23089.98Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ Q SHALHQQHH HTRQGSSANPS+QEGFSLSMG VQNCDHTMSLV+YNKGERCKNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKW DKMVKLLITAVSYIGDDIASD++G GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRA+DDHDNDEPRRHQ+DDFDE+E  ETDEHD++E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL
        ENF PH DNRRSLGVLGGSVKRL+RGQDHDDAHACGNSL+ LDCNKS H +SQ QF QADTAH+ETESMKASTSQKQWMELR+LQ+EDQKLQIQVE LEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH

XP_038885368.1 uncharacterized protein LOC120075776 [Benincasa hispida]8.9e-23290.2Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVHNQ QHSHALHQ HH HTRQGSSANPS+QEGFSLSMG V NCDHTM LV+YNKGERCKNS SD+EPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNE SKGKKGSMWHRVKW DKMVKLLITAVSYIGDDI SDL+G GR+K QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE
        GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRA+DDHDNDEPRRHQ+DDFDE EHGETDEHD+FE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL
        ENF PH DNRRSLGVLGGSVKRL+RGQDHDDAHACGNSL+SLDCNKS H +SQ  FAQADTAH+ETESMKASTSQKQWMELR+LQ+E+QKLQIQVE LEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQKFKW+RFNKKKDRELE MRMVNERMKLEN+R+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH

TrEMBL top hitse value%identityAlignment
A0A0A0KBC2 Uncharacterized protein1.4e-23089.98Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ Q SHALHQQHH HTRQGSSANPS+QEGFSLSMG VQNCDHTMSLV+YNKGERCKNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKW DKMVKLLITAVSYIGDDIASD++G GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRA+DDHDNDEPRRHQ+DDFDE+E  ETDEHD++E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL
        ENF PH DNRRSLGVLGGSVKRL+RGQDHDDAHACGNSL+ LDCNKS H +SQ QF QADTAH+ETESMKASTSQKQWMELR+LQ+EDQKLQIQVE LEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH

A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034856201.5e-23290.42Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHH HTRQGSSANPS+QEGFSLSMG VQNCDHTMSLV+YNKGERCKNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKW DKMVKLLITAVSYIGDDIASD++GSGRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRA+DDHDNDEPRRHQ+DDFDE+E GETDEHD++E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL
        ENF PH DNRRSLGVLGGSVKRL+RGQDHDDAHACGNSL+ LDCNKS H +SQ QFAQADTAH+ETESMKASTSQKQWMELR+LQ+EDQKLQIQVE LEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQKFKWERFNK KDRELEKMRMVNE+MKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH

A0A5D3DGK7 Putative transcription factor3.0e-23390.65Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHH HTRQGSSANPS+QEGFSLSMG VQNCDHTMSLV+YNKGERCKNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGSMWHRVKW DKMVKLLITAVSYIGDDIASD++GSGRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRA+DDHDNDEPRRHQ+DDFDE+E GETDEHD++E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL
        ENF PH DNRRSLGVLGGSVKRL+RGQDHDDAHACGNSL+ LDCNKS H +SQ QFAQADTAH+ETESMKASTSQKQWMELR+LQ+EDQKLQIQVE LEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNE+MKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH

A0A6J1FFN2 uncharacterized protein LOC1114452941.7e-22589.31Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVH+QAQHSHALHQQHH HTRQGS+ANPS+QEGFSLSMG VQNCDH MSLVDYNKGERCKNS SD+EPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKW DKMVKLLITAVSYIGDDI SD +G+GRRK Q IQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE
        GTSC+VVENPALLD+++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRA+DDHDNDEPRRHQ+DDFDENEH ETDE D+FE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL
        ENFAPHGDNRRS GVLGGSVKRLRR QDHDD HACG SL+S       HA+SQ QFAQADTAH+ETE MK STSQKQWMELR+LQ+EDQKLQIQVE LEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH

A0A6J1JU49 uncharacterized protein LOC1114897533.9e-22588.86Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQ  FKVH+QAQHSHALHQQHH HTRQGS+ANPS+QEGFSLSMG VQNCDH MSLVD+NKGERCKNS SD+EPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKW DKMVKLLITAVSYIGDDI SD +G+GRRK  IIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE
        GTSC+VVENPALLDV++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRA+DDHDNDEPRRHQ+DDFDENEH ETDE D+FE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFE

Query:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL
        ENFA HGDNRRS GVLGGSVKRLRRGQDHDD HACG SL+S       HA++Q QFAQADTAH+ETE MK STSQKQWMELR+LQ+EDQKLQIQVE LEL
Subjt:  ENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors4.5e-14161.67Show/hide
Query:  MEGNLSQGGLIPGGA-SYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDH----TMSLVDYNKGERCKNSPS-DDEP
        M+GN  QGG++  GA SYGG DLQGS +VH    H  +++QQH    R   ++ P L EG   +M   Q CDH     MS+ +  K ER KNS S DDEP
Subjt:  MEGNLSQGGLIPGGA-SYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDH----TMSLVDYNKGERCKNSPS-DDEP

Query:  SFTEDGIDG-HNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKR
        SFTE+G DG HNE ++  KGS W RVKW DKMVKLLITAVSYIGDD  S ++ S RRKF ++QKKGKWK +SKVMAERGY VSPQQCEDKFNDLNKRYK+
Subjt:  SFTEDGIDG-HNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKR

Query:  LNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEH-GE
        LND++GRGTSCQVVENPALLD I YL DKEKDDVRKI++SK LFYEEMCSYHN NRLHLPHD ALQRSLQLA R++DDHDND+ R+HQ +D D+ +H G+
Subjt:  LNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEH-GE

Query:  TDEHDEFEENFAPHGDNR-RSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKL
         DEHDE+EE    +GD R    G  GG +K++R    H+D     + +NSL+CNK   +  Q  F+QAD      ES +A + QKQWME R LQ+E+QKL
Subjt:  TDEHDEFEENFAPHGDNR-RSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKL

Query:  QIQVETLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIG
        QIQVE LELEKQ+F+W+RF+KK+D+ELE+MRM NERMKLEN+R+ L+LKQ+E+G
Subjt:  QIQVETLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)1.1e-9147.88Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG
        MEGN SQG      +S    DL+ +    NQ        +QHH ++RQ S  N ++                      +N  +R K S S+D+       
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDG

Query:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DG N   K K+ S W RVKW DKMVKL+ITA+SYIG+D  SD      +KF ++QKKGKW+ +SKVM ERGY VSPQQCEDKFNDLNKRYK+LN+++GR
Subjt:  IDGHNEMSKGKKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEF
        GTSC+VVENP+LLD IDYL +KEKD+VR+I++SK LFYEEMCSYHN NRLHLPHDPA+QRSL L    ++DDHDNDE  +HQ++D D+++  E D HD  
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEF

Query:  EENFAPHGDNRRSLGVLGG-SVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAH-IETESMKASTSQKQWMELRILQMEDQKLQIQVET
                      G L    +KRLR+ Q H+D     N    + C            +QAD    I  +S KA+  Q+Q +E + L++E +KLQIQ E 
Subjt:  EENFAPHGDNRRSLGVLGG-SVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQADTAH-IETESMKASTSQKQWMELRILQMEDQKLQIQVET

Query:  LELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGS
        +ELE+Q+FKWE F+K+++++L KMRM NERMKLENER++L+LK+ E+G+
Subjt:  LELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGS

AT3G10040.1 sequence-specific DNA binding transcription factors1.3e-5235.94Show/hide
Query:  DDEPSFTEDGIDGHNEMSKGKKG----SMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKF----------QIIQKKGKWKLISKVMAERGYQVS
        DDE   +  G   + E S G  G    S WHR+KW D MV+LLI AV YIGD+   +     ++K            ++QKKGKWK +S+ M E+G+ VS
Subjt:  DDEPSFTEDGIDGHNEMSKGKKG----SMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKF----------QIIQKKGKWKLISKVMAERGYQVS

Query:  PQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNS--------------NRLHLPHDPALQRSL
        PQQCEDKFNDLNKRYKR+NDI+G+G +C+VVEN  LL+ +D+LT K KD+V+K+LNSK LF+ EMC+YHNS              N + +P     Q   
Subjt:  PQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNS--------------NRLHLPHDPALQRSL

Query:  QLAFRAKDDH--DNDEPRRHQHDDFDENEHGETDEHDEFEENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQAD
          A   K     +  E       D  E+   E +E +E         +  R    +  +VKRLR                                   +
Subjt:  QLAFRAKDDH--DNDEPRRHQHDDFDENEHGETDEHDEFEENFAPHGDNRRSLGVLGGSVKRLRRGQDHDDAHACGNSLNSLDCNKSLHAYSQTQFAQAD

Query:  TAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEI
         A +  +  K+   +K+W+  ++L++E++K+  + E +E+EKQ+ KW R+  KK+RE+EK ++ N+R +LE ER+ L L++ EI
Subjt:  TAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCTTGATTTGCAAGGATCGTTTAAGGTTCATAATCAGGCACAACACTCTCACGC
TTTACATCAGCAACATCATTCTCATACTCGTCAGGGATCTTCAGCTAATCCTTCCCTTCAGGAGGGATTTTCACTTTCCATGGGAGCTGTACAAAATTGTGACCATACCA
TGTCTTTGGTAGATTATAACAAGGGAGAAAGGTGTAAAAACTCACCTAGTGACGATGAGCCGAGTTTTACTGAGGATGGTATTGATGGTCATAATGAGATGAGTAAGGGG
AAGAAGGGATCGATGTGGCATCGCGTGAAATGGGCGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATTTAGAAGGGAG
TGGAAGAAGGAAATTTCAAATTATACAGAAGAAAGGTAAATGGAAATTGATATCAAAGGTCATGGCTGAAAGAGGTTATCAAGTCTCACCCCAGCAGTGTGAGGATAAAT
TTAATGACCTCAATAAGAGGTATAAGAGGCTTAATGATATAATTGGGAGAGGCACTTCTTGCCAGGTTGTTGAGAACCCTGCACTTCTTGATGTCATTGATTATTTAACA
GACAAAGAAAAGGATGATGTGAGAAAAATTTTAAACTCAAAGCAGCTGTTCTATGAGGAGATGTGTTCTTATCATAATTCGAATCGACTCCATCTGCCCCATGATCCTGC
TTTGCAGCGTTCTTTGCAGTTGGCTTTTAGAGCAAAGGATGATCATGATAACGATGAGCCAAGGAGACACCAACATGATGATTTTGATGAAAATGAACATGGTGAAACTG
ATGAACATGATGAGTTTGAGGAGAATTTTGCACCCCATGGGGACAACAGACGATCACTTGGGGTATTAGGAGGCTCAGTGAAGAGGCTAAGACGAGGCCAAGATCATGAT
GATGCTCATGCCTGTGGCAATTCCTTGAATTCTCTCGATTGCAACAAAAGTTTGCATGCTTACTCACAAACACAATTTGCTCAAGCTGATACAGCTCACATAGAAACTGA
AAGTATGAAAGCTTCTACGTCACAAAAGCAGTGGATGGAGCTTCGCATACTTCAGATGGAAGATCAGAAGCTTCAAATTCAAGTTGAAACGTTGGAATTGGAGAAACAGA
AGTTCAAATGGGAGAGATTTAACAAAAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTCAATGAGAGGATGAAGCTTGAAAATGAGCGCATTGCACTCGACTTAAAG
CAAAAGGAAATTGGATCGGGATTTCATTAA
mRNA sequenceShow/hide mRNA sequence
GACGAGATGCTTGCCTAAACCTAAACAGTAAACATAACATGGCATATGGTTAACGCGACCCAGTAGCAGCAGCACGCAAAAGGTTTTTTGTCAGTACTCGTGACGTCAAA
TTTAATGACCCACCAGATCCTCGCATTCATTGAACCTTTCATAAATCAAAACCAAAGCCTACGCATTGGCTTCTCTGATCGGTTCTTTCGCCCACTACTTTGACATTTTT
CCTCTTTTTCTTCTTACAAAATACAAAACCCCCCACATTCCTGCCGTTTCCCTTCCCTTCTCTTCTCGGTTGCCACGCACAGCCAATAAGCGTCAGGATGTCTTCTCTTC
GCCTCTTTGCGTTTCCTCTTTTCTGGGTTTTCACCTCTTGGCGGGTTTTGCTCAGATAAGTGTTTCTCCTTCTTTGTTGTTTGATTTCCTTTGAAGGGTCGAGGGTTGTT
TCTCTTATTTTCCCCTTTTTGGGGTTTTATCGCTCGCTCTTAATCGAAAGATTAGGTTCTTTTGGTGCTCATTTCTGTTGGTTCTTTTGGTTTTTGGGCAAATGGGGTTT
GTTTTAATGTAATTTCCAATTGTCTACTGGGGAAATTTCGATCTGGGTGTGCTTGATTGTTCCATTCTGTAACTTTAGGGACGATAGATTTAGTAGTTTGAGAATCATGT
GAAAGATAAGATCGTGGGGTATGTTCTTTTGACATATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCTTGATTTGCAAGGATCG
TTTAAGGTTCATAATCAGGCACAACACTCTCACGCTTTACATCAGCAACATCATTCTCATACTCGTCAGGGATCTTCAGCTAATCCTTCCCTTCAGGAGGGATTTTCACT
TTCCATGGGAGCTGTACAAAATTGTGACCATACCATGTCTTTGGTAGATTATAACAAGGGAGAAAGGTGTAAAAACTCACCTAGTGACGATGAGCCGAGTTTTACTGAGG
ATGGTATTGATGGTCATAATGAGATGAGTAAGGGGAAGAAGGGATCGATGTGGCATCGCGTGAAATGGGCGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTAT
ATAGGAGATGATATTGCTTCAGATTTAGAAGGGAGTGGAAGAAGGAAATTTCAAATTATACAGAAGAAAGGTAAATGGAAATTGATATCAAAGGTCATGGCTGAAAGAGG
TTATCAAGTCTCACCCCAGCAGTGTGAGGATAAATTTAATGACCTCAATAAGAGGTATAAGAGGCTTAATGATATAATTGGGAGAGGCACTTCTTGCCAGGTTGTTGAGA
ACCCTGCACTTCTTGATGTCATTGATTATTTAACAGACAAAGAAAAGGATGATGTGAGAAAAATTTTAAACTCAAAGCAGCTGTTCTATGAGGAGATGTGTTCTTATCAT
AATTCGAATCGACTCCATCTGCCCCATGATCCTGCTTTGCAGCGTTCTTTGCAGTTGGCTTTTAGAGCAAAGGATGATCATGATAACGATGAGCCAAGGAGACACCAACA
TGATGATTTTGATGAAAATGAACATGGTGAAACTGATGAACATGATGAGTTTGAGGAGAATTTTGCACCCCATGGGGACAACAGACGATCACTTGGGGTATTAGGAGGCT
CAGTGAAGAGGCTAAGACGAGGCCAAGATCATGATGATGCTCATGCCTGTGGCAATTCCTTGAATTCTCTCGATTGCAACAAAAGTTTGCATGCTTACTCACAAACACAA
TTTGCTCAAGCTGATACAGCTCACATAGAAACTGAAAGTATGAAAGCTTCTACGTCACAAAAGCAGTGGATGGAGCTTCGCATACTTCAGATGGAAGATCAGAAGCTTCA
AATTCAAGTTGAAACGTTGGAATTGGAGAAACAGAAGTTCAAATGGGAGAGATTTAACAAAAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTCAATGAGAGGATGA
AGCTTGAAAATGAGCGCATTGCACTCGACTTAAAGCAAAAGGAAATTGGATCGGGATTTCATTAATTGGCTGCTATAATCTGAACTAATTTTATCAGGTTTGGAAGATTA
TGAACCGTTAAGCTGCTGCTGTTTATATATTTCTGGTTATGATGAGTAATGAGGGAGAGTTGTATTTGGGTACTTTTACATTGCCTGCATCTAGCATGTAAAGTTAGAAG
GTGACAATACAATTGACTGCTTTTCATTGTTTTGCCTTTACTTATGTGGAATCTATTGTGGTTGAGGAACTTGAAAAATGGATGGATATTTTGTGTGTCAAATTGATGCC
ATTTCTCTGAACTAGTTCTACAGTTTTGGC
Protein sequenceShow/hide protein sequence
MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHSHTRQGSSANPSLQEGFSLSMGAVQNCDHTMSLVDYNKGERCKNSPSDDEPSFTEDGIDGHNEMSKG
KKGSMWHRVKWADKMVKLLITAVSYIGDDIASDLEGSGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLT
DKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRAKDDHDNDEPRRHQHDDFDENEHGETDEHDEFEENFAPHGDNRRSLGVLGGSVKRLRRGQDHD
DAHACGNSLNSLDCNKSLHAYSQTQFAQADTAHIETESMKASTSQKQWMELRILQMEDQKLQIQVETLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLK
QKEIGSGFH