; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1346 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1346
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationMC04:21518757..21521572
RNA-Seq ExpressionMC04g1346
SyntenyMC04g1346
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054953.1 putative transcription factor [Cucumis melo var. makuwa]1.28e-27486.86Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGGSSYGGLDLQG FKVHNQ     ALH   H HTRQGSSANPSIQEGFSLSM  +QNCDHTMS+V+YNKGER KNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG

Query:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DG GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDV+DYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E GETDEHDD+E
Subjt:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL
        ENF PH DNRR  G   GS KRL+RGQDHD+AHACGNSL+  DCNKSSH +SQ  F  ADTA LETESMKASTSQKQWMELRLLQ+EDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL

Query:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQ+FKWERFNKKKD ELEKMRMVNE+MKLENER+ALDLKQK+IGSGFH
Subjt:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH

XP_008441519.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis melo]1.05e-27386.64Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGGSSYGGLDLQG FKVHNQ     ALH   H HTRQGSSANPSIQEGFSLSM  +QNCDHTMS+V+YNKGER KNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG

Query:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DG GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDV+DYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E GETDEHDD+E
Subjt:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL
        ENF PH DNRR  G   GS KRL+RGQDHD+AHACGNSL+  DCNKSSH +SQ  F  ADTA LETESMKASTSQKQWMELRLLQ+EDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL

Query:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQ+FKWERFNK KD ELEKMRMVNE+MKLENER+ALDLKQK+IGSGFH
Subjt:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH

XP_022134251.1 uncharacterized protein LOC111006553 [Momordica charantia]0.0100Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDGHN
        MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDGHN
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDGHN

Query:  ETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQ
        ETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQ
Subjt:  ETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQ

Query:  VVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFEENFAP
        VVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFEENFAP
Subjt:  VVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFEENFAP

Query:  HGDNRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEKQRFKWER
        HGDNRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEKQRFKWER
Subjt:  HGDNRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEKQRFKWER

Query:  FNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        FNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
Subjt:  FNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH

XP_031743106.1 uncharacterized protein LOC105435760 [Cucumis sativus]1.10e-27587.31Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGGSSYGGLDLQG FKVHNQ     ALH   H HTRQGSSANPSIQEGFSLSM  +QNCDHTMS+V+YNKGER KNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG

Query:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DGGGRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDV+DYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL
        ENF PH DNRR  G   GS KRL+RGQDHD+AHACGNSL+  DCNKSSH +SQ  FT ADTA LETESMKASTSQKQWMELRLLQ+EDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL

Query:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQ+FKWERFNKKKD ELEKMRMVNERMKLENER+ALDLKQK+IGSGFH
Subjt:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH

XP_038885368.1 uncharacterized protein LOC120075776 [Benincasa hispida]7.74e-27687.08Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ     ALH H H HTRQGSSANPSIQEGFSLSM  + NCDHTM +V+YNKGER KNS SD+EPSFTEDG
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG

Query:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        VDGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DGGGR+K QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDV+DYLTDK+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE EHGETDEHDDFE
Subjt:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL
        ENF PH DNRR  G   GS KRL+RGQDHD+AHACGNSL+S DCNKSSH +SQ  F  ADTA LETESMKASTSQKQWMELRLLQ+E+QKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL

Query:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQ+FKW+RFNKKKD ELE MRMVNERMKLEN+R+ALDLKQK+IGSGFH
Subjt:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH

TrEMBL top hitse value%identityAlignment
A0A0A0KBC2 Uncharacterized protein5.32e-27687.31Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGGSSYGGLDLQG FKVHNQ     ALH   H HTRQGSSANPSIQEGFSLSM  +QNCDHTMS+V+YNKGER KNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG

Query:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DGGGRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDV+DYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL
        ENF PH DNRR  G   GS KRL+RGQDHD+AHACGNSL+  DCNKSSH +SQ  FT ADTA LETESMKASTSQKQWMELRLLQ+EDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL

Query:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQ+FKWERFNKKKD ELEKMRMVNERMKLENER+ALDLKQK+IGSGFH
Subjt:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH

A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034856205.08e-27486.64Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGGSSYGGLDLQG FKVHNQ     ALH   H HTRQGSSANPSIQEGFSLSM  +QNCDHTMS+V+YNKGER KNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG

Query:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DG GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDV+DYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E GETDEHDD+E
Subjt:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL
        ENF PH DNRR  G   GS KRL+RGQDHD+AHACGNSL+  DCNKSSH +SQ  F  ADTA LETESMKASTSQKQWMELRLLQ+EDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL

Query:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQ+FKWERFNK KD ELEKMRMVNE+MKLENER+ALDLKQK+IGSGFH
Subjt:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH

A0A5D3DGK7 Putative transcription factor6.20e-27586.86Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGGSSYGGLDLQG FKVHNQ     ALH   H HTRQGSSANPSIQEGFSLSM  +QNCDHTMS+V+YNKGER KNS SD++PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQ-----ALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG

Query:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DG GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDV+DYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E GETDEHDD+E
Subjt:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL
        ENF PH DNRR  G   GS KRL+RGQDHD+AHACGNSL+  DCNKSSH +SQ  F  ADTA LETESMKASTSQKQWMELRLLQ+EDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQ--FTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLEL

Query:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        EKQ+FKWERFNKKKD ELEKMRMVNE+MKLENER+ALDLKQK+IGSGFH
Subjt:  EKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH

A0A6J1BXD6 uncharacterized protein LOC1110065530.0100Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDGHN
        MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDGHN
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDGHN

Query:  ETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQ
        ETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQ
Subjt:  ETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQ

Query:  VVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFEENFAP
        VVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFEENFAP
Subjt:  VVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFEENFAP

Query:  HGDNRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEKQRFKWER
        HGDNRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEKQRFKWER
Subjt:  HGDNRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEKQRFKWER

Query:  FNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        FNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
Subjt:  FNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH

A0A6J1FFN2 uncharacterized protein LOC1114452942.42e-27086.35Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHP-----HSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVH+QA H H      H HTRQGS+ANPSIQEGFSLSM  +QNCDH MS+VDYNKGER KNS SD+EPSFTEDG
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHP-----HSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDG

Query:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNETSKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DG GRRK QI QKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  VDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSC+VVENPALLD+L+YLTDK+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEH ETDE DDFE
Subjt:  GTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEK
        ENFAPHGDNRR FG   GS KRLRR QDHD+ HACG SL+SH     +H+ +QF  ADTA LETE MK STSQKQWMELRLLQ+EDQKLQIQVEMLELEK
Subjt:  ENFAPHGDNRRLFGA--GSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEK

Query:  QRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH
        Q+FKWERFNKKKD ELEKMRMVNERMKLENERIALDLKQKEIGSGFH
Subjt:  QRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors5.4e-13961.3Show/hide
Query:  MEGNLSQGGLI-PGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDH----TMSMVDYNKGERFKNSPS-DDEPSFTED
        M+GN  QGG++  G SSYGG DLQGS +VH+Q   +  H H     ++ P + EG   +M   Q CDH     MSM +  K ER KNS S DDEPSFTE+
Subjt:  MEGNLSQGGLI-PGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDH----TMSMVDYNKGERFKNSPS-DDEPSFTED

Query:  GVDG-HNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDII
        G DG HNE ++  KGS W RVKWTDKMVKLLITAVSYIGDD  S  D   RRKF ++QKKGKWK +SKVMAERGY VSPQQCEDKFNDLNKRYK+LND++
Subjt:  GVDG-HNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDII

Query:  GRGTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEH-GETDEHD
        GRGTSCQVVENPALLD + YL DK+KDDVRKI++SK LFYEEMCSYHN NRLHLPHD ALQRSLQLA R+RDDHDND+ R+HQ +D D+ +H G+ DEHD
Subjt:  GRGTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEH-GETDEHD

Query:  DFEENFAPHGD---NRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEML
        ++EE    +GD   N    G G  K++R    H++     + + S +CNK S     F+ AD  Q   ES +A + QKQWME R LQ+E+QKLQIQVE+L
Subjt:  DFEENFAPHGD---NRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEML

Query:  ELEKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIG
        ELEKQRF+W+RF+KK+D ELE+MRM NERMKLEN+R+ L+LKQ+E+G
Subjt:  ELEKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)7.6e-9349.66Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDGHN
        MEGN SQG      SS    DL+ +    NQ   HHP+S  RQ S  N                  +TM    +N  +R K S S+D+        DG N
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDGHN

Query:  ETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQ
           K K+ S W RVKW DKMVKL+ITA+SYIG+D  SD      +KF ++QKKGKW+ +SKVM ERGY VSPQQCEDKFNDLNKRYK+LN+++GRGTSC+
Subjt:  ETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQ

Query:  VVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFEENFA
        VVENP+LLD +DYL +K+KD+VR+I++SK LFYEEMCSYHN NRLHLPHDPA+QRSL L    +RDDHDNDE  +HQN+D D++        DD+EE+  
Subjt:  VVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFEENFA

Query:  PHGDNRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHD--CNKSSHTYSQFTPADTAQ-LETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEKQRF
            +R L      KRLR+ Q H++    G+    +D  C   S        AD  + +  +S KA+  Q+Q +E + L++E +KLQIQ EM+ELE+Q+F
Subjt:  PHGDNRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHD--CNKSSHTYSQFTPADTAQ-LETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEKQRF

Query:  KWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGS
        KWE F+K+++ +L KMRM NERMKLENER++L+LK+ E+G+
Subjt:  KWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGS

AT3G10040.1 sequence-specific DNA binding transcription factors1.3e-5234.05Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDL-QGSFKVHNQALHHHPHSHTRQG-SSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDG
        ME N+   G  P   S   L++ Q      N     HPH +T  G     P I+  +  +    Q     MS +             DDE   +  G   
Subjt:  MEGNLSQGGLIPGGSSYGGLDL-QGSFKVHNQALHHHPHSHTRQG-SSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDG

Query:  HNETSKGKKG----SLWHRVKWTDKMVKLLITAVSYIGDDIA----------SDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNK
        + E S G  G    S WHR+KWTD MV+LLI AV YIGD+            +   GGG     ++QKKGKWK +S+ M E+G+ VSPQQCEDKFNDLNK
Subjt:  HNETSKGKKG----SLWHRVKWTDKMVKLLITAVSYIGDDIA----------SDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNK

Query:  RYKRLNDIIGRGTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQRSLQLAFRARDDH--------------D
        RYKR+NDI+G+G +C+VVEN  LL+ +D+LT K KD+V+K+LNSK LF+ EMC+YHNS      HD  P  Q  + +   ++  +              +
Subjt:  RYKRLNDIIGRGTSCQVVENPALLDVLDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQRSLQLAFRARDDH--------------D

Query:  NDEPRRHQNDDFDENEHGETDEHDDFEENFAPHGDNRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQ
          E       D  E+   E +E ++ E         RR+  + + KRLR                                 + A +  +  K+   +K+
Subjt:  NDEPRRHQNDDFDENEHGETDEHDDFEENFAPHGDNRRLFGAGSAKRLRRGQDHDEAHACGNSLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQ

Query:  WMELRLLQMEDQKLQIQVEMLELEKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEI
        W+  ++L++E++K+  + E +E+EKQR KW R+  KK+ E+EK ++ N+R +LE ER+ L L++ EI
Subjt:  WMELRLLQMEDQKLQIQVEMLELEKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGAATTTATCTCAAGGAGGGTTGATTCCAGGTGGGTCCTCTTATGGAGGTCTCGATTTGCAAGGATCGTTTAAGGTTCATAATCAGGCTTTACACCACCATCC
TCACTCTCATACTCGTCAGGGATCTTCAGCCAATCCCTCCATTCAGGAAGGATTTTCACTTTCCATGGCAGCTCTGCAGAATTGTGACCATACCATGTCCATGGTAGATT
ATAACAAGGGAGAAAGGTTTAAAAACTCGCCCAGCGATGATGAGCCGAGCTTTACCGAAGATGGCGTTGATGGTCATAATGAGACTAGTAAGGGGAAGAAGGGATCGCTG
TGGCATCGCGTGAAATGGACGGATAAAATGGTGAAGCTTCTCATTACTGCAGTGTCTTATATAGGAGATGATATTGCCTCAGATCATGATGGGGGTGGAAGAAGAAAATT
TCAGATTATACAGAAGAAAGGCAAATGGAAGTTGATATCTAAGGTGATGGCTGAAAGGGGTTATCAAGTTTCACCCCAGCAGTGTGAGGATAAATTCAATGACCTCAATA
AGAGGTATAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGCCAGGTTGTTGAGAACCCTGCACTTCTTGATGTCTTAGATTACTTAACCGACAAAGATAAGGAC
GATGTGCGAAAAATTTTGAACTCGAAGCAGTTGTTCTATGAAGAGATGTGTTCTTATCATAATTCGAATAGGCTCCATCTGCCTCATGATCCTGCTTTGCAGCGCTCTTT
GCAGCTGGCTTTTAGAGCAAGGGATGATCATGATAACGATGAGCCAAGGAGACACCAAAATGATGATTTTGATGAAAACGAACATGGTGAGACTGATGAACATGATGATT
TTGAGGAGAACTTTGCACCCCATGGGGACAATAGGCGATTATTTGGAGCAGGGTCGGCGAAGAGGCTCAGACGAGGACAAGACCATGATGAAGCTCATGCCTGTGGCAAT
TCCTTGACTTCTCATGATTGCAACAAAAGTTCTCATACTTACTCACAATTTACTCCAGCTGATACGGCACAGTTAGAAACCGAAAGTATGAAGGCTTCTACATCGCAAAA
ACAGTGGATGGAGCTCCGCTTACTTCAGATGGAAGATCAGAAGCTTCAGATTCAAGTTGAAATGTTGGAATTGGAGAAACAGAGGTTCAAGTGGGAGAGATTTAACAAGA
AAAAGGACTGCGAGTTGGAAAAAATGAGGATGGTAAACGAGAGGATGAAGCTTGAAAATGAGCGCATCGCACTCGACTTAAAGCAAAAGGAAATCGGATCGGGATTTCAT
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGAATTTATCTCAAGGAGGGTTGATTCCAGGTGGGTCCTCTTATGGAGGTCTCGATTTGCAAGGATCGTTTAAGGTTCATAATCAGGCTTTACACCACCATCC
TCACTCTCATACTCGTCAGGGATCTTCAGCCAATCCCTCCATTCAGGAAGGATTTTCACTTTCCATGGCAGCTCTGCAGAATTGTGACCATACCATGTCCATGGTAGATT
ATAACAAGGGAGAAAGGTTTAAAAACTCGCCCAGCGATGATGAGCCGAGCTTTACCGAAGATGGCGTTGATGGTCATAATGAGACTAGTAAGGGGAAGAAGGGATCGCTG
TGGCATCGCGTGAAATGGACGGATAAAATGGTGAAGCTTCTCATTACTGCAGTGTCTTATATAGGAGATGATATTGCCTCAGATCATGATGGGGGTGGAAGAAGAAAATT
TCAGATTATACAGAAGAAAGGCAAATGGAAGTTGATATCTAAGGTGATGGCTGAAAGGGGTTATCAAGTTTCACCCCAGCAGTGTGAGGATAAATTCAATGACCTCAATA
AGAGGTATAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGCCAGGTTGTTGAGAACCCTGCACTTCTTGATGTCTTAGATTACTTAACCGACAAAGATAAGGAC
GATGTGCGAAAAATTTTGAACTCGAAGCAGTTGTTCTATGAAGAGATGTGTTCTTATCATAATTCGAATAGGCTCCATCTGCCTCATGATCCTGCTTTGCAGCGCTCTTT
GCAGCTGGCTTTTAGAGCAAGGGATGATCATGATAACGATGAGCCAAGGAGACACCAAAATGATGATTTTGATGAAAACGAACATGGTGAGACTGATGAACATGATGATT
TTGAGGAGAACTTTGCACCCCATGGGGACAATAGGCGATTATTTGGAGCAGGGTCGGCGAAGAGGCTCAGACGAGGACAAGACCATGATGAAGCTCATGCCTGTGGCAAT
TCCTTGACTTCTCATGATTGCAACAAAAGTTCTCATACTTACTCACAATTTACTCCAGCTGATACGGCACAGTTAGAAACCGAAAGTATGAAGGCTTCTACATCGCAAAA
ACAGTGGATGGAGCTCCGCTTACTTCAGATGGAAGATCAGAAGCTTCAGATTCAAGTTGAAATGTTGGAATTGGAGAAACAGAGGTTCAAGTGGGAGAGATTTAACAAGA
AAAAGGACTGCGAGTTGGAAAAAATGAGGATGGTAAACGAGAGGATGAAGCTTGAAAATGAGCGCATCGCACTCGACTTAAAGCAAAAGGAAATCGGATCGGGATTTCAT
TAATTGGCTGCTGTTTTCTGAACTAATTTTATTAGGTCAGAAGATTTGCATTCAATTAAGCTACTACTTTGGATCGATAAGCTGCTGCTGTTTATATTCTTTCTGGTTAT
GCGAGTAATGAGGGAGAGCTGTATTTGGGTATTTTTACATTGCCTACATCTAGCATGTAAAGTTAGAAAAGTGACAATACAAATGGTTGCTTTTCACTGTTTTTGCCTTT
ACTTATGTGTGAACTTTTGTGGTTGAGGAACTTGAAGAATGGTTGGATGTTTTATCTGTCAAATTTCTGCCATTTCTCCCAACTAATTATTTTCTAGCATACAAATGATT
ATATACACTACTGGATTCTTGAGCTTATTGTATGGAGAATTGCATACTGTCACTTCTGAATGGTGTGGATTTCATGGTGCTTTCTTCTTTCAATTGGGCTTTTTAATTAT
TTTCCGATAATGGAACATCGGGGGCCTCCACGCCCAAAAGTTAGCTCCTAGAGATAACAAAGATTTTAAGTTATCAAGTTTTCTAGAAACTTGAAATGTATCTCCTAACA
TCTGAATTCCTTATCGCCAACATTCTGTGGAAGTTTATTGGAATGGATTCTGAATTTGTTATGAGATAAAACATAAACTGATTTGAGGAGACATTGGCTGGAAATATTCT
CCAATAAGCTGTGATGACTTTTACAGTCTCATTTTTTAGATTTTAGTGGAAATATTGATGAGAATCAGTGAGTTCTCTGAAGTGTTCTTATTCATCTTTTAGGCTTGTCA
AAAACTTCTCACAACTCCATCTAAGATTTATGGTCAAAATCAGGTGATGTTTCAGCCAATATATCAGATGGGAGAGGATCTTAGATTATGACTCTGAGGTTGTTACTGTC
ACTTCCAGCATCTATGCAGAATTCTCGACGATCTGCAAACCAGATAACTATAACGAACCTCCCCACTACTTGGAAGGGCTCGAGCGAGAACTGTTTTCGTTCCTACTTGT
GTAGATATCTCTATTCAGGCAAGTCTTTTGGTCTGTAAGTAAACGTACTAACAGGTGTGCAAAATAGACGTCAGGGGACTTGAATCTTGTATGGTACTTCAAACGAATTA
ACTCGTAACGTCTGAACTACTACTAATACACGTCGAAGTTTTACGTTGTAAAGCATGACAGGGTGTTCATATTGTGATTGAAGGAGATGGTGAACTTTTTTTTCCTTTTT
TTTTTCTTTTTTCTGGTGGGATCAACAGCCATAAGCATTTGAGGCAACAGTTTTGTGGTCATGAGGAACAGAAATTAAATGCTGTTTTTCTTTCTTTTTGTCTTATGTTC
AAATGAATTCAGTCATTTTAGCACTAAGTGGGCCAAGGTGGCCCAGTGGTTGGGAACATCATGATCCAGCCCAATTGAATTATGGCTGTCCTCTCTCAATTATATAAATA
TATTGTATATATCATGAAAATTTGTAATGGATGGTTTTTTTTTTTAATGAATTTTAAGTATGTGGC
Protein sequenceShow/hide protein sequence
MEGNLSQGGLIPGGSSYGGLDLQGSFKVHNQALHHHPHSHTRQGSSANPSIQEGFSLSMAALQNCDHTMSMVDYNKGERFKNSPSDDEPSFTEDGVDGHNETSKGKKGSL
WHRVKWTDKMVKLLITAVSYIGDDIASDHDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVLDYLTDKDKD
DVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFEENFAPHGDNRRLFGAGSAKRLRRGQDHDEAHACGN
SLTSHDCNKSSHTYSQFTPADTAQLETESMKASTSQKQWMELRLLQMEDQKLQIQVEMLELEKQRFKWERFNKKKDCELEKMRMVNERMKLENERIALDLKQKEIGSGFH