; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004030 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004030
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationChr08:12958454..12959803
RNA-Seq ExpressionHG10004030
SyntenyHG10004030
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054953.1 putative transcription factor [Cucumis melo var. makuwa]4.1e-24595.32Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGS+ NPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDE+PSFNEDS
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD+DG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQ SLQLAFRARDDHDNDEPRRHQNDDFDE+E GETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
        ENFVPH DNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSH H  AQF QADTAHLETESMKASTSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL

Query:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH
        EKQKFKW+RFNKKKDRELEKMRMVNE+MKLENER+ALDLKQKQIGSGFH
Subjt:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH

XP_008441519.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis melo]2.0e-24495.1Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGS+ NPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDE+PSFNEDS
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD+DG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQ SLQLAFRARDDHDNDEPRRHQNDDFDE+E GETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
        ENFVPH DNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSH H  AQF QADTAHLETESMKASTSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL

Query:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH
        EKQKFKW+RFNK KDRELEKMRMVNE+MKLENER+ALDLKQKQIGSGFH
Subjt:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH

XP_023551421.1 uncharacterized protein LOC111809238 [Cucurbita pepo subsp. pepo]4.9e-23090.65Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGNLSQGGLIPGGTSYGGLDLQGPFKVH+Q QHSHALHQQHHPHTRQGSA NPSIQEGFSLSMGVVQNCDH MSLV+YNKGERCKNSASDEEPSF ED 
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DG GRRKC IIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSC+VVENPALLDV++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQ SLQLAFRARDDHDNDEPRRHQNDDFDENEH ETDE DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
        ENF PHGD+RRS GVLGGSVKRL+R QDHDD HACG SL       SSHAH  AQF QADTAHLETE MK STSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL

Query:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH
        EKQKFKW+RFNKKKDRELEKMRMVNERMKLENERIALDLKQK+IGSGFH
Subjt:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH

XP_031743106.1 uncharacterized protein LOC105435760 [Cucumis sativus]1.2e-24495.55Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVHNQGQ SHALHQQHHPHTRQGS+ NPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDE+PSFNEDS
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD+DGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQ SLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
        ENFVPH DNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSH H  AQFTQADTAHLETESMKASTSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL

Query:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH
        EKQKFKW+RFNKKKDRELEKMRMVNERMKLENER+ALDLKQKQIGSGFH
Subjt:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH

XP_038885368.1 uncharacterized protein LOC120075776 [Benincasa hispida]3.8e-24394.88Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQ HHPHTRQGS+ NPSIQEGFSLSMGVV NCDHTM LVEYNKGERCKNSASDEEPSF ED 
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDI SDLDGGGR+KCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQ SLQLAFRARDDHDNDEPRRHQNDDFDE EHGETDEHDDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
        ENF+PH DNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLS LDCNKSSH H  A F QADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
Subjt:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL

Query:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH
        EKQKFKWDRFNKKKDRELE MRMVNERMKLEN+R+ALDLKQKQIGSGFH
Subjt:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH

TrEMBL top hitse value%identityAlignment
A0A0A0KBC2 Uncharacterized protein5.8e-24595.55Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVHNQGQ SHALHQQHHPHTRQGS+ NPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDE+PSFNEDS
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD+DGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQ SLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
        ENFVPH DNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSH H  AQFTQADTAHLETESMKASTSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL

Query:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH
        EKQKFKW+RFNKKKDRELEKMRMVNERMKLENER+ALDLKQKQIGSGFH
Subjt:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH

A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034856209.9e-24595.1Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGS+ NPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDE+PSFNEDS
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD+DG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQ SLQLAFRARDDHDNDEPRRHQNDDFDE+E GETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
        ENFVPH DNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSH H  AQF QADTAHLETESMKASTSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL

Query:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH
        EKQKFKW+RFNK KDRELEKMRMVNE+MKLENER+ALDLKQKQIGSGFH
Subjt:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH

A0A5D3DGK7 Putative transcription factor2.0e-24595.32Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGS+ NPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDE+PSFNEDS
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD+DG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQ SLQLAFRARDDHDNDEPRRHQNDDFDE+E GETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
        ENFVPH DNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSH H  AQF QADTAHLETESMKASTSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL

Query:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH
        EKQKFKW+RFNKKKDRELEKMRMVNE+MKLENER+ALDLKQKQIGSGFH
Subjt:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH

A0A6J1FFN2 uncharacterized protein LOC1114452942.6e-22990.65Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGNLSQGGLIPGGTSYGGLDLQGPFKVH+Q QHSHALHQQHHPHTRQGSA NPSIQEGFSLSMGVVQNCDH MSLV+YNKGERCKNSASDEEPSF ED 
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DG GRRKCQ IQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSC+VVENPALLD+++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQ SLQLAFRARDDHDNDEPRRHQNDDFDENEH ETDE DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
        ENF PHGDNRRS GVLGGSVKRL+R QDHDD HACG SL       SSHAH  AQF QADTAHLETE MK STSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL

Query:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH
        EKQKFKW+RFNKKKDRELEKMRMVNERMKLENERIALDLKQK+IGSGFH
Subjt:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH

A0A6J1JU49 uncharacterized protein LOC1114897532.6e-22990.42Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGNLSQGGLIPGGTSYGGLDLQ PFKVH+Q QHSHALHQQHHPHTRQGSA NPSIQEGFSLSMGVVQNCDH MSLV++NKGERCKNSASDEEPSF ED 
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DG GRRKC IIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE
        GTSC+VVENPALLDV++YLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQ SLQLAFRARDDHDNDEPRRHQNDDFDENEH ETDE DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL
        ENF  HGDNRRS GVLGGSVKRL+RGQDHDD HACG SL       SSHAH  AQF QADTAHLETE MK STSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLEL

Query:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH
        EKQKFKW+RFNKKKDRELEKMRMVNERMKLENERIALDLKQK+IGSGFH
Subjt:  EKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGSGFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors3.3e-13659.65Show/hide
Query:  MEGNLSQGGLI-PGGTSYGGLDLQGPFKVHNQGQHSHALHQQ--HHPHTRQGSAVNPSIQEGFSLSMGVVQNCDH----TMSLVEYNKGERCKNSAS-DE
        M+GN  QGG++  G +SYGG DLQG  +VH    H  +++QQ  H+P++R        + EG   +M   Q CDH     MS+ E  K ER KNS S D+
Subjt:  MEGNLSQGGLI-PGGTSYGGLDLQGPFKVHNQGQHSHALHQQ--HHPHTRQGSAVNPSIQEGFSLSMGVVQNCDH----TMSLVEYNKGERCKNSAS-DE

Query:  EPSFNEDSIDG-HNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRY
        EPSF E+  DG HNE ++  KGS W RVKWTDKMVKLLITAVSYIGDD  S +D   RRK  ++QKKGKWK +SKV+AERGY VSPQQCEDKFNDLNKRY
Subjt:  EPSFNEDSIDG-HNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRY

Query:  KRLNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEH-
        K+LND++GRGTSCQVVENPALLD I YL DKEKDDVRKI++SK LFYEEMCSYHN NRLHLPHD ALQ SLQLA R+RDDHDND+ R+HQ +D D+ +H 
Subjt:  KRLNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEH-

Query:  GETDEHDDFEENFVPHGDNR-RSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQ
        G+ DEHD++EE    +GD R    G  GG +K+++    H+D     + ++ L+CNK S   P   F+QAD      ES +A + QKQWME R LQLEEQ
Subjt:  GETDEHDDFEENFVPHGDNR-RSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQ

Query:  KLQIQVEMLELEKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIG
        KLQIQVE+LELEKQ+F+W RF+KK+D+ELE+MRM NERMKLEN+R+ L+LKQ+++G
Subjt:  KLQIQVEMLELEKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)1.7e-9248.21Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS
        MEGN SQG      +S   L    P  + NQ Q      +QHHP++RQ S  N                  +TM    +N  +R K S S+++      S
Subjt:  MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DG N   K K+ S W RVKW DKMVKL+ITA+SYIG+D  SD      +K  ++QKKGKW+ +SKV+ ERGY VSPQQCEDKFNDLNKRYK+LN+++GR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQL-AFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDF
        GTSC+VVENP+LLD IDYL +KEKD+VR+I++SK LFYEEMCSYHN NRLHLPHDPA+Q SL L    +RDDHDNDE  +HQN+D D++        DD+
Subjt:  GTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQL-AFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDF

Query:  EENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAH-LETESMKASTSQKQWMELRLLQLEEQKLQIQVEML
        EE+      +R         +KRL++ Q H+D    G+     D        P    +QAD    +  +S KA+  Q+Q +E + L+LE +KLQIQ EM+
Subjt:  EENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAH-LETESMKASTSQKQWMELRLLQLEEQKLQIQVEML

Query:  ELEKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGS
        ELE+Q+FKW+ F+K+++++L KMRM NERMKLENER++L+LK+ ++G+
Subjt:  ELEKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQIGS

AT3G10040.1 sequence-specific DNA binding transcription factors5.6e-5133.4Show/hide
Query:  MEGNLSQGGLIPGGTSYGGLDL-QGPFKVHNQGQHSHALHQQHHPHTRQG-SAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNE
        ME N+   G  P   S   L++ Q P    N  Q  H      HP+T  G     P I+  +  +    Q     MS +    G  C     D+E   + 
Subjt:  MEGNLSQGGLIPGGTSYGGLDL-QGPFKVHNQGQHSHALHQQHHPHTRQG-SAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNE

Query:  DSIDGHNENSKGKKG----SMWHRVKWTDKMVKLLITAVSYIGDD--IASDLD--------GGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKF
             + E+S G  G    S WHR+KWTD MV+LLI AV YIGD+  +   +D        GGG     ++QKKGKWK +S+ + E+G+ VSPQQCEDKF
Subjt:  DSIDGHNENSKGKKG----SMWHRVKWTDKMVKLLITAVSYIGDD--IASDLD--------GGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKF

Query:  NDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQHSLQLAFRARDDH----------
        NDLNKRYKR+NDI+G+G +C+VVEN  LL+ +D+LT K KD+V+K+LNSK LF+ EMC+YHNS      HD  P  Q+ + +   ++  +          
Subjt:  NDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQHSLQLAFRARDDH----------

Query:  ----DNDEPRRHQNDDFDENEHGETDEHDDFEENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETES
            +  E       D  E+   E +E ++         +  R    +  +VKRL+                                   + A +  + 
Subjt:  ----DNDEPRRHQNDDFDENEHGETDEHDDFEENFVPHGDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETES

Query:  MKASTSQKQWMELRLLQLEEQKLQIQVEMLELEKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQI
         K+   +K+W+  ++L++EE+K+  + E +E+EKQ+ KW R+  KK+RE+EK ++ N+R +LE ER+ L L++ +I
Subjt:  MKASTSQKQWMELRLLQLEEQKLQIQVEMLELEKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLKQKQI

AT5G47660.1 Homeodomain-like superfamily protein5.2e-0426.45Show/hide
Query:  LVEYNKGERCKNSASDEEPSFNEDSIDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQV
        L E  K E+C+++  + E  F   S  G    S G+        +W  + V+ LI++ S + +                I K   W  IS  + ERGY+ 
Subjt:  LVEYNKGERCKNSASDEEPSFNEDSIDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQV

Query:  SPQQCEDKFNDLNKRYKRLND
        S ++C++K+ ++NK Y+R+ +
Subjt:  SPQQCEDKFNDLNKRYKRLND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCGGGAGGGACTTCTTATGGAGGTCTTGATTTGCAAGGACCGTTTAAGGTTCATAATCAGGGACAGCATTCTCATGC
CTTACATCAGCAGCATCATCCTCATACTCGTCAGGGATCTGCAGTTAATCCTTCCATTCAGGAGGGATTTTCACTTTCCATGGGAGTTGTACAAAATTGTGACCATACCA
TGTCTTTGGTAGAGTATAACAAGGGAGAAAGGTGTAAAAACTCAGCTAGCGACGAAGAGCCAAGTTTTAATGAGGATAGTATTGATGGTCATAATGAGAATAGTAAGGGG
AAGAAGGGATCGATGTGGCATCGCGTGAAATGGACGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATCTTGATGGGGG
TGGAAGAAGGAAATGCCAAATTATACAGAAGAAAGGTAAATGGAAACTGATATCAAAGGTCATTGCTGAAAGGGGTTATCAAGTTTCACCCCAGCAGTGTGAGGATAAAT
TTAATGACCTCAATAAGAGGTACAAGAGGCTTAATGATATAATTGGGAGAGGTACTTCTTGTCAGGTTGTTGAGAACCCAGCACTTCTTGATGTCATTGATTATTTAACA
GACAAAGAGAAGGATGATGTGAGAAAAATTTTGAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAATTCAAATCGACTTCATCTGCCCCATGATCCTGC
TTTGCAGCATTCTTTGCAGCTGGCTTTTAGAGCAAGGGATGATCACGATAATGACGAGCCAAGGAGACATCAAAATGATGATTTTGATGAAAATGAACATGGTGAAACTG
ATGAACACGATGATTTTGAGGAGAATTTTGTACCCCATGGGGACAATAGGCGATCACTTGGGGTATTAGGAGGTTCAGTGAAGAGGCTAAAACGAGGCCAAGACCATGAT
GATGCTCATGCTTGTGGAAATTCCTTGAGTCCTCTTGATTGCAACAAAAGTTCTCATGCTCACCCACTAGCACAATTTACTCAAGCCGATACAGCTCACTTAGAAACTGA
AAGTATGAAAGCTTCTACATCGCAAAAGCAGTGGATGGAGCTTCGCTTACTTCAATTGGAAGAGCAGAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAACAGA
AGTTCAAGTGGGATAGATTTAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTAAACGAGAGGATGAAGCTTGAAAATGAGCGTATTGCACTTGACTTGAAG
CAAAAGCAAATTGGATCAGGATTTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCGGGAGGGACTTCTTATGGAGGTCTTGATTTGCAAGGACCGTTTAAGGTTCATAATCAGGGACAGCATTCTCATGC
CTTACATCAGCAGCATCATCCTCATACTCGTCAGGGATCTGCAGTTAATCCTTCCATTCAGGAGGGATTTTCACTTTCCATGGGAGTTGTACAAAATTGTGACCATACCA
TGTCTTTGGTAGAGTATAACAAGGGAGAAAGGTGTAAAAACTCAGCTAGCGACGAAGAGCCAAGTTTTAATGAGGATAGTATTGATGGTCATAATGAGAATAGTAAGGGG
AAGAAGGGATCGATGTGGCATCGCGTGAAATGGACGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATCTTGATGGGGG
TGGAAGAAGGAAATGCCAAATTATACAGAAGAAAGGTAAATGGAAACTGATATCAAAGGTCATTGCTGAAAGGGGTTATCAAGTTTCACCCCAGCAGTGTGAGGATAAAT
TTAATGACCTCAATAAGAGGTACAAGAGGCTTAATGATATAATTGGGAGAGGTACTTCTTGTCAGGTTGTTGAGAACCCAGCACTTCTTGATGTCATTGATTATTTAACA
GACAAAGAGAAGGATGATGTGAGAAAAATTTTGAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAATTCAAATCGACTTCATCTGCCCCATGATCCTGC
TTTGCAGCATTCTTTGCAGCTGGCTTTTAGAGCAAGGGATGATCACGATAATGACGAGCCAAGGAGACATCAAAATGATGATTTTGATGAAAATGAACATGGTGAAACTG
ATGAACACGATGATTTTGAGGAGAATTTTGTACCCCATGGGGACAATAGGCGATCACTTGGGGTATTAGGAGGTTCAGTGAAGAGGCTAAAACGAGGCCAAGACCATGAT
GATGCTCATGCTTGTGGAAATTCCTTGAGTCCTCTTGATTGCAACAAAAGTTCTCATGCTCACCCACTAGCACAATTTACTCAAGCCGATACAGCTCACTTAGAAACTGA
AAGTATGAAAGCTTCTACATCGCAAAAGCAGTGGATGGAGCTTCGCTTACTTCAATTGGAAGAGCAGAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAACAGA
AGTTCAAGTGGGATAGATTTAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTAAACGAGAGGATGAAGCTTGAAAATGAGCGTATTGCACTTGACTTGAAG
CAAAAGCAAATTGGATCAGGATTTCATTAA
Protein sequenceShow/hide protein sequence
MEGNLSQGGLIPGGTSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSAVNPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEEPSFNEDSIDGHNENSKG
KKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDLDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLT
DKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQHSLQLAFRARDDHDNDEPRRHQNDDFDENEHGETDEHDDFEENFVPHGDNRRSLGVLGGSVKRLKRGQDHD
DAHACGNSLSPLDCNKSSHAHPLAQFTQADTAHLETESMKASTSQKQWMELRLLQLEEQKLQIQVEMLELEKQKFKWDRFNKKKDRELEKMRMVNERMKLENERIALDLK
QKQIGSGFH