CuGenDBv2

Sed0011174 (gene) of Chayote v1 genome

Gene ID: Sed0011174
Organism: Sechium edule (Chayote v1)
Description: Sequence-specific DNA binding transcription factors
Genome location: LG11:29538849..29541563
RNA-Seq Expression: Sed0011174
Synteny: Sed0011174
Gene Ontology terms: NA
InterPro domains: IPR044822 - Myb/SANT-like DNA-binding domain 4
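The genome location in the record above follows a `sequence:start..end` pattern. As a minimal sketch, the coordinates can be split and the span computed as shown here. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes 1-based, inclusive coordinates, which the page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'sequence:start..end' string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seq_id, start, end = parse_location("LG11:29538849..29541563")
print(seq_id, span_length(start, end))  # LG11 2715
```

Under that assumption the gene spans 2715 bp on LG11.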


Homology
GenBank top hits | e value | % identity | Alignment
KAA0054953.1 putative transcription factor [Cucumis melo var. makuwa] | 4.9e-222 | 86.89
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN+SQGGLIPGG+SYGGLDLQGPFKVHNQ QHSHA+HQQ HHPHTR GSSAN SI EGF+LSMG + NCDH++SLV+YNKGERCKNS SD++PSF ED
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
          DGHNE SKG+KGSMWHRVKWTDKMVKLLITAVSYIGDDI SD +GSGRRK QIIQKKGKWKLISKV+AERG+QVSPQQCEDKFNDLNKRYKRLNDIIG
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF
        RGTSC+VVENPALLDVIDYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDD D++E  ETDEHDD+
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF

Query:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE
        EENF PH DN+RSLGVLGGSVKRL+RGQD DDAHACGNSL+ LDC KS H +SQ Q+AQADTAHLETESMKASTSQKQWME  LL++EDQKLQIQVEMLE
Subjt:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH
        LEKQKFKWERFNKKKDRELEKMRM NE+MKLENER+ALDLKQK++GSGFH
Subjt:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH

XP_008441519.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis melo] | 2.4e-221 | 86.67
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN+SQGGLIPGG+SYGGLDLQGPFKVHNQ QHSHA+HQQ HHPHTR GSSAN SI EGF+LSMG + NCDH++SLV+YNKGERCKNS SD++PSF ED
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
          DGHNE SKG+KGSMWHRVKWTDKMVKLLITAVSYIGDDI SD +GSGRRK QIIQKKGKWKLISKV+AERG+QVSPQQCEDKFNDLNKRYKRLNDIIG
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF
        RGTSC+VVENPALLDVIDYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDD D++E  ETDEHDD+
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF

Query:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE
        EENF PH DN+RSLGVLGGSVKRL+RGQD DDAHACGNSL+ LDC KS H +SQ Q+AQADTAHLETESMKASTSQKQWME  LL++EDQKLQIQVEMLE
Subjt:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH
        LEKQKFKWERFNK KDRELEKMRM NE+MKLENER+ALDLKQK++GSGFH
Subjt:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH

XP_023551421.1 uncharacterized protein LOC111809238 [Cucurbita pepo subsp. pepo] | 1.1e-218 | 86.22
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN+SQGGLIPGG SYGGLDLQGPFKVH+Q QHSHA+HQQ HHPHTR GS+AN SI EGF+LSMG + NCDH++SLVDYNKGERCKNS SD+EPSFTED
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
        G DGHNE SKG+KGS+WHRVKWTDKMVKLLITAVSYIGDDI SDF+G+GRRK  IIQKKGKWKLISKVMAERG+QVSPQQCEDKFNDLNKRYKRLNDIIG
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF
        RGTSC+VVENPALLDV++YLTDK+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDD D+NEHDETDE DDF
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF

Query:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE
        EENFAPHGD++RS GVLGGSVKRLRR QD DD HACG SL+S       HA++Q Q+AQADTAHLETE MK STSQKQWME  LL++EDQKLQIQVEMLE
Subjt:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH
        LEKQKFKWERFNKKKDRELEKMRM NERMKLENERIALDLKQKE+GSGFH
Subjt:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH

XP_031743106.1 uncharacterized protein LOC105435760 [Cucumis sativus] | 5.4e-221 | 86.67
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN+SQGGLIPGG+SYGGLDLQGPFKVHNQ Q SHA+HQQ HHPHTR GSSAN SI EGF+LSMG + NCDH++SLV+YNKGERCKNS SD++PSF ED
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
          DGHNE SKG+KGSMWHRVKWTDKMVKLLITAVSYIGDDI SD +G GRRK QIIQKKGKWKLISKV+AERG+QVSPQQCEDKFNDLNKRYKRLNDIIG
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF
        RGTSC+VVENPALLDVIDYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDD D++E DETDEHDD+
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF

Query:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE
        EENF PH DN+RSLGVLGGSVKRL+RGQD DDAHACGNSL+ LDC KS H +SQ Q+ QADTAHLETESMKASTSQKQWME  LL++EDQKLQIQVEMLE
Subjt:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH
        LEKQKFKWERFNKKKDRELEKMRM NERMKLENER+ALDLKQK++GSGFH
Subjt:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH

XP_038885368.1 uncharacterized protein LOC120075776 [Benincasa hispida] | 6.4e-222 | 86.67
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN+SQGGLIPGG SYGGLDLQGPFKVHNQ QHSHA+H Q+HHPHTR GSSAN SI EGF+LSMG + NCDH++ LV+YNKGERCKNS SD+EPSFTED
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
        G DGHNE SKG+KGSMWHRVKWTDKMVKLLITAVSYIGDDIGSD +G GR+K QIIQKKGKWKLISKV+AERG+QVSPQQCEDKFNDLNKRYKRLNDIIG
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF
        RGTSC+VVENPALLDVIDYLTDK+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDD D+ EH ETDEHDDF
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF

Query:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE
        EENF PH DN+RSLGVLGGSVKRL+RGQD DDAHACGNSL+SLDC KS H +SQ  +AQADTAHLETESMKASTSQKQWME  LL++E+QKLQIQVEMLE
Subjt:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH
        LEKQKFKW+RFNKKKDRELE MRM NERMKLEN+R+ALDLKQK++GSGFH
Subjt:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KBC2 Uncharacterized protein | 2.6e-221 | 86.67
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN+SQGGLIPGG+SYGGLDLQGPFKVHNQ Q SHA+HQQ HHPHTR GSSAN SI EGF+LSMG + NCDH++SLV+YNKGERCKNS SD++PSF ED
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
          DGHNE SKG+KGSMWHRVKWTDKMVKLLITAVSYIGDDI SD +G GRRK QIIQKKGKWKLISKV+AERG+QVSPQQCEDKFNDLNKRYKRLNDIIG
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF
        RGTSC+VVENPALLDVIDYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDD D++E DETDEHDD+
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF

Query:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE
        EENF PH DN+RSLGVLGGSVKRL+RGQD DDAHACGNSL+ LDC KS H +SQ Q+ QADTAHLETESMKASTSQKQWME  LL++EDQKLQIQVEMLE
Subjt:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH
        LEKQKFKWERFNKKKDRELEKMRM NERMKLENER+ALDLKQK++GSGFH
Subjt:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH

A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 | 1.2e-221 | 86.67
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN+SQGGLIPGG+SYGGLDLQGPFKVHNQ QHSHA+HQQ HHPHTR GSSAN SI EGF+LSMG + NCDH++SLV+YNKGERCKNS SD++PSF ED
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
          DGHNE SKG+KGSMWHRVKWTDKMVKLLITAVSYIGDDI SD +GSGRRK QIIQKKGKWKLISKV+AERG+QVSPQQCEDKFNDLNKRYKRLNDIIG
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF
        RGTSC+VVENPALLDVIDYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDD D++E  ETDEHDD+
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF

Query:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE
        EENF PH DN+RSLGVLGGSVKRL+RGQD DDAHACGNSL+ LDC KS H +SQ Q+AQADTAHLETESMKASTSQKQWME  LL++EDQKLQIQVEMLE
Subjt:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH
        LEKQKFKWERFNK KDRELEKMRM NE+MKLENER+ALDLKQK++GSGFH
Subjt:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH

A0A5D3DGK7 Putative transcription factor | 2.4e-222 | 86.89
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN+SQGGLIPGG+SYGGLDLQGPFKVHNQ QHSHA+HQQ HHPHTR GSSAN SI EGF+LSMG + NCDH++SLV+YNKGERCKNS SD++PSF ED
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
          DGHNE SKG+KGSMWHRVKWTDKMVKLLITAVSYIGDDI SD +GSGRRK QIIQKKGKWKLISKV+AERG+QVSPQQCEDKFNDLNKRYKRLNDIIG
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF
        RGTSC+VVENPALLDVIDYLT+KDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDD D++E  ETDEHDD+
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF

Query:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE
        EENF PH DN+RSLGVLGGSVKRL+RGQD DDAHACGNSL+ LDC KS H +SQ Q+AQADTAHLETESMKASTSQKQWME  LL++EDQKLQIQVEMLE
Subjt:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH
        LEKQKFKWERFNKKKDRELEKMRM NE+MKLENER+ALDLKQK++GSGFH
Subjt:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH

A0A6J1FFN2 uncharacterized protein LOC111445294 | 2.7e-218 | 86.44
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN+SQGGLIPGG SYGGLDLQGPFKVH+Q QHSHA+HQQ HHPHTR GS+AN SI EGF+LSMG + NCDH++SLVDYNKGERCKNS SD+EPSFTED
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
        G DGHNE SKG+KGS+WHRVKWTDKMVKLLITAVSYIGDDI SDF+G+GRRK Q IQKKGKWKLISKVMAERG+QVSPQQCEDKFNDLNKRYKRLNDIIG
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF
        RGTSC+VVENPALLD+++YLTDK+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDD D+NEHDETDE DDF
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF

Query:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE
        EENFAPHGDN+RS GVLGGSVKRLRR QD DD HACG SL+S       HA+SQ Q+AQADTAHLETE MK STSQKQWME  LL++EDQKLQIQVEMLE
Subjt:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH
        LEKQKFKWERFNKKKDRELEKMRM NERMKLENERIALDLKQKE+GSGFH
Subjt:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH

A0A6J1JU49 uncharacterized protein LOC111489753 | 6.0e-218 | 86
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN+SQGGLIPGG SYGGLDLQ PFKVH+Q QHSHA+HQQ HHPHTR GS+AN SI EGF+LSMG + NCDH++SLVD+NKGERCKNS SD+EPSFTED
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
        G DGHNE SKG+KGS+WHRVKWTDKMVKLLITAVSYIGDDI SDF+G+GRRK  IIQKKGKWKLISKVMAERG+QVSPQQCEDKFNDLNKRYKRLNDIIG
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF
        RGTSC+VVENPALLDV++YLTDK+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDD D+NEHDETDE DDF
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDF

Query:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE
        EENFA HGDN+RS GVLGGSVKRLRRGQD DD HACG SL+S       HA++Q Q+AQADTAHLETE MK STSQKQWME  LL++EDQKLQIQVEMLE
Subjt:  EENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH
        LEKQKFKWERFNKKKDRELEKMRM NERMKLENERIALDLKQKE+GSGFH
Subjt:  LEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGSGFH

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G21200.1 sequence-specific DNA binding transcription factors | 4.7e-138 | 59.87
Query:  MEGNISQGGLIPGGA-SYGGLDLQGPFKVHNQPQHSHAIHQQY-HHPHTRPGSSANSSIHEGFTLSMGALPNCDH----SLSLVDYNKGERCKNSPS-DD
        M+GN  QGG++  GA SYGG DLQG  +VH    H  +++QQ+ H+P++RP       +HEG   +M     CDH    ++S+ +  K ER KNS S DD
Subjt:  MEGNISQGGLIPGGA-SYGGLDLQGPFKVHNQPQHSHAIHQQY-HHPHTRPGSSANSSIHEGFTLSMGALPNCDH----SLSLVDYNKGERCKNSPS-DD

Query:  EPSFTEDGFDG-HNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRY
        EPSFTE+G DG HNE ++  KGS W RVKWTDKMVKLLITAVSYIGDD  S  + S RRKF ++QKKGKWK +SKVMAERG+ VSPQQCEDKFNDLNKRY
Subjt:  EPSFTEDGFDG-HNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRY

Query:  KRLNDIIGRGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHD
        K+LND++GRGTSC+VVENPALLD I YL DK+KDDVRKI++SK LFYEEMCSYHN NRLHLPHD ALQRSLQLA R+RDDHDND+ R+HQ +D+DD +HD
Subjt:  KRLNDIIGRGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHD

Query:  -ETDEHDDFEENFAPHGDNK-RSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQ
         + DEHD++EE    +GD +    G  GG +K++R     +D     + +NSL+C K   +  Q  ++QAD      ES +A + QKQWME   L++E+Q
Subjt:  -ETDEHDDFEENFAPHGDNK-RSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQ

Query:  KLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELG
        KLQIQVE+LELEKQ+F+W+RF+KK+D+ELE+MRMENERMKLEN+R+ L+LKQ+ELG
Subjt:  KLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1) | 1.7e-92 | 48.11
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        MEGN SQG      +S   L    P  ++         +Q+ HHP++R  S  N+++                      +N  +R K S S+D+      
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG
          DG N   K ++ S W RVKW DKMVKL+ITA+SYIG+D GSD      +KF ++QKKGKW+ +SKVM ERG+ VSPQQCEDKFNDLNKRYK+LN+++G
Subjt:  GFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIG

Query:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDD
        RGTSCEVVENP+LLD IDYL +K+KD+VR+I++SK LFYEEMCSYHN NRLHLPHDPA+QRSL L    +RDDHDNDE  +HQN+D+DD+        DD
Subjt:  RGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDD

Query:  FEENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAH-LETESMKASTSQKQWMEHCLLRMEDQKLQIQVEM
        +EE      D+  +L      +KRLR+ Q  +D     N    + C            +QAD    +  +S KA+  Q+Q +E   L +E +KLQIQ EM
Subjt:  FEENFAPHGDNKRSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAH-LETESMKASTSQKQWMEHCLLRMEDQKLQIQVEM

Query:  LELEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGS
        +ELE+Q+FKWE F+K+++++L KMRMENERMKLENER++L+LK+ ELG+
Subjt:  LELEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKELGS

AT3G10040.1 sequence-specific DNA binding transcription factors | 1.8e-52 | 34.55
Query:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED
        ME N+   G  P       L L+ P    N P   ++I  Q+ HP+T  G        +    S+    +    +S +    G  C     DDE   +  
Subjt:  MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTED

Query:  GFDGHNEMSKG----RKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKF----------QIIQKKGKWKLISKVMAERGFQVSPQQCEDKFN
        G   + E S G    RK S WHR+KWTD MV+LLI AV YIGD+ G +     ++K            ++QKKGKWK +S+ M E+GF VSPQQCEDKFN
Subjt:  GFDGHNEMSKG----RKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKF----------QIIQKKGKWKLISKVMAERGFQVSPQQCEDKFN

Query:  DLNKRYKRLNDIIGRGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQRSLQLAFRARDD---HDNDEPRRH
        DLNKRYKR+NDI+G+G +C VVEN  LL+ +D+LT K KD+V+K+LNSK LF+ EMC+YHNS      HD  P  Q  + +   ++     H  +  +  
Subjt:  DLNKRYKRLNDIIGRGTSCEVVENPALLDVIDYLTDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQRSLQLAFRARDD---HDNDEPRRH

Query:  Q-NDDVDDNEHDETDEHDDFEENFAPHGDNK-RSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQW
        +  + V+  E  E+D  +D E       + + R    +  +VKRLR                                   + A +  +  K+   +K+W
Subjt:  Q-NDDVDDNEHDETDEHDDFEENFAPHGDNK-RSLGVLGGSVKRLRRGQDLDDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQW

Query:  MEHCLLRMEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKEL
        +   +L +E++K+  + E +E+EKQ+ KW R+  KK+RE+EK +++N+R +LE ER+ L L++ E+
Subjt:  MEHCLLRMEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDLKQKEL

AT5G47660.1 Homeodomain-like superfamily protein | 3.0e-04 | 26.72
Query:  KGERCKNSPSDDEPSFTEDGFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQC
        K E+C+++  + E  F      G    S GR        +W  + V+ LI++ S + +  G             I K   W  IS  M ERG++ S ++C
Subjt:  KGERCKNSPSDDEPSFTEDGFDGHNEMSKGRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQC

Query:  EDKFNDLNKRYKRLND
        ++K+ ++NK Y+R+ +
Subjt:  EDKFNDLNKRYKRLND
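
The alignment blocks above use a BLAST-style midline between the Query and Subjt rows: a letter marks an identical residue, `+` marks a conservative (positive-scoring) substitution, and a space marks a mismatch or gap. As a minimal sketch, the counts for a single block can be recovered from the midline alone. The function name `midline_stats` is hypothetical, and the example string is the final midline block of the first GenBank hit (KAA0054953.1). A per-block value is not expected to equal the whole-alignment % identity listed in the tables.

```python
def midline_stats(midline: str, aligned_length: int) -> dict:
    """Count identities and positives in one BLAST-style midline block.

    aligned_length is the length of the matching Query row; the midline is
    right-padded because trailing spaces are often stripped in plain text.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    padded = midline.ljust(aligned_length)
    identities = sum(ch.isalpha() for ch in padded)  # letters = identical residues
    positives_only = padded.count("+")               # '+' = conservative substitution
    return {
        "identities": identities,
        "positives_only": positives_only,
        "mismatches_or_gaps": aligned_length - identities - positives_only,
        "percent_identity": round(100 * identities / aligned_length, 2),
    }


block = "LEKQKFKWERFNKKKDRELEKMRM NE+MKLENER+ALDLKQK++GSGFH"
stats = midline_stats(block, aligned_length=50)
print(stats["identities"], stats["percent_identity"])  # 45 90.0
```

Summing the identity counts over all blocks of a hit and dividing by the total aligned length would give the overall figure for that hit, assuming the midline text was preserved with its spacing intact.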


Sequences
CDS sequence
ATGGAAGGGAACATATCACAGGGAGGGTTGATTCCAGGAGGGGCTTCTTATGGAGGTCTTGATTTGCAAGGACCGTTTAAGGTTCATAATCAGCCACAACACTCTCACGC
TATACATCAGCAATATCATCATCCTCACACTCGTCCGGGATCTTCGGCAAATTCTTCAATTCACGAGGGATTTACACTTTCCATGGGAGCCCTACCAAATTGTGACCATT
CCCTGTCTTTGGTTGATTATAACAAGGGAGAAAGGTGTAAAAACTCACCTAGTGACGATGAGCCGAGCTTCACGGAGGATGGTTTTGATGGTCATAATGAGATGAGTAAG
GGGAGGAAGGGATCGATGTGGCATCGCGTGAAGTGGACGGATAAAATGGTGAAGCTTTTGATTACAGCAGTGTCTTATATAGGAGATGACATTGGTTCTGATTTTGAAGG
GAGTGGAAGAAGGAAATTTCAAATCATACAGAAGAAAGGTAAATGGAAATTGATATCAAAGGTCATGGCTGAAAGGGGTTTTCAAGTTTCACCTCAGCAATGTGAGGATA
AATTTAATGACCTCAATAAGAGGTATAAGAGACTCAATGATATAATTGGGAGGGGCACTTCTTGCGAGGTTGTTGAGAACCCTGCACTTCTTGATGTCATTGATTATTTA
ACAGATAAAGATAAGGATGATGTGAGAAAAATTTTAAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCGTATCATAACTCGAATCGACTCCATCTGCCCCATGATCC
TGCTTTGCAGCGTTCGTTGCAGTTGGCTTTCAGAGCTAGGGATGACCATGATAATGATGAGCCAAGGAGACACCAAAATGATGATGTTGATGATAATGAACATGATGAAA
CTGATGAGCATGATGATTTTGAGGAGAATTTTGCGCCCCATGGAGACAATAAGCGATCACTTGGGGTATTAGGAGGCTCGGTGAAGAGACTAAGGCGAGGCCAGGACCTC
GACGATGCTCATGCCTGTGGAAACTCCTTGAATTCTCTTGATTGCAAAAAGAGTTTGCATGCTTACTCACAAACACAATATGCTCAAGCCGATACAGCTCACTTAGAGAC
CGAAAGCATGAAGGCTTCTACATCGCAAAAACAGTGGATGGAGCATTGCTTACTTCGGATGGAAGATCAGAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAAC
AGAAGTTCAAGTGGGAGAGATTTAACAAGAAAAAGGACCGCGAGTTGGAAAAAATGAGGATGGAAAATGAAAGGATGAAGCTTGAAAATGAACGCATTGCACTCGACTTA
AAGCAAAAGGAACTCGGATCGGGATTTCATTAA
mRNA sequence
CTAGAACAAGGGAACCTTCTTTTTCACTGGAAAGGAATAACCTTTGATGCAAACTCATGAACTCGATGATTTCCTAAACTTGGCATACGATTTTAACCCAATTTGTCATT
TAACTCGTGACGCAAAAATTTAATGCCCACACCCGATGAACCTTCATAAATTCAACAAAAACCTACGCATTGCCTTCTCTGATCGGTTCTTTCCCCCAATAATTTCACAT
TTTCCCATTTCTTTTCTTCTTATTTTACTAAACCCCAAATCCCTCCCATTTCCCTTCCCTTCTCGATCGCCGCAATAGGCCCAATCACCGCCTGGATGTCTTCTCTTTTC
CCCTTTGCATTTCCCCTCTTCTGGCCTTTCAGCTCTTCCCGGATTTCGCTCAGATAAGTCACTTCCCATCCCTTTTCTTTCGTTTTCTTTTGTGGCTGATTTCTGTTGGG
TCTTTTGCTTTTTGCCCAAATGGGATTTGTTTGAATCTGATTTCCCCCTTTCTCTGTTGGTGAAATTTTGATCTGGGTGTGCTTGATTTTTGTTGGTGATTCTGGTTTTT
GGGCAACTGGGGTTTGTTTGAATGTAATTTCCCATTGTCTATTGGGGAAATTTCGATTTGGGTGTGCTTGATTTCTGTTGGTTCTTCTGGTTTTTGGGCAACTGGGTTTG
GGGTTTGTTCGAACGTAATTTCCCATTGTCTATTGGTGAAATTTCGATCTGGGTATTGCTTGATTTCTGTTGGGTTTTTTGGTTTTTGGGCAACTGGCGTTTGTTTGAAT
GTAATTTCCCATTGTATATTGGGGAAATTTCGATCTGGGTGTGCTTGATTTTTTCATTCTGTGACTATTTGTTTCTCTTATCTTACCCCTTTTTGGGGTTTGATCGCTTC
TTTTGGGTGTGATTTCTGTTGGGTATTTTGGTTTTTGGCCAAATGGGGTTTGTTTGAATGAAATTTTCCATTGTCTATTGGCGAAAATTCGATCTGGGTGTGCTTGATTT
TTCCATACTGTAACTTTAGGGATGATAGATTTAGTAGTCTGAGAATTGTGTGAAAGATTAGATTGTGGGGTATGTTATTTTGAGACATATGGAAGGGAACATATCACAGG
GAGGGTTGATTCCAGGAGGGGCTTCTTATGGAGGTCTTGATTTGCAAGGACCGTTTAAGGTTCATAATCAGCCACAACACTCTCACGCTATACATCAGCAATATCATCAT
CCTCACACTCGTCCGGGATCTTCGGCAAATTCTTCAATTCACGAGGGATTTACACTTTCCATGGGAGCCCTACCAAATTGTGACCATTCCCTGTCTTTGGTTGATTATAA
CAAGGGAGAAAGGTGTAAAAACTCACCTAGTGACGATGAGCCGAGCTTCACGGAGGATGGTTTTGATGGTCATAATGAGATGAGTAAGGGGAGGAAGGGATCGATGTGGC
ATCGCGTGAAGTGGACGGATAAAATGGTGAAGCTTTTGATTACAGCAGTGTCTTATATAGGAGATGACATTGGTTCTGATTTTGAAGGGAGTGGAAGAAGGAAATTTCAA
ATCATACAGAAGAAAGGTAAATGGAAATTGATATCAAAGGTCATGGCTGAAAGGGGTTTTCAAGTTTCACCTCAGCAATGTGAGGATAAATTTAATGACCTCAATAAGAG
GTATAAGAGACTCAATGATATAATTGGGAGGGGCACTTCTTGCGAGGTTGTTGAGAACCCTGCACTTCTTGATGTCATTGATTATTTAACAGATAAAGATAAGGATGATG
TGAGAAAAATTTTAAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCGTATCATAACTCGAATCGACTCCATCTGCCCCATGATCCTGCTTTGCAGCGTTCGTTGCAG
TTGGCTTTCAGAGCTAGGGATGACCATGATAATGATGAGCCAAGGAGACACCAAAATGATGATGTTGATGATAATGAACATGATGAAACTGATGAGCATGATGATTTTGA
GGAGAATTTTGCGCCCCATGGAGACAATAAGCGATCACTTGGGGTATTAGGAGGCTCGGTGAAGAGACTAAGGCGAGGCCAGGACCTCGACGATGCTCATGCCTGTGGAA
ACTCCTTGAATTCTCTTGATTGCAAAAAGAGTTTGCATGCTTACTCACAAACACAATATGCTCAAGCCGATACAGCTCACTTAGAGACCGAAAGCATGAAGGCTTCTACA
TCGCAAAAACAGTGGATGGAGCATTGCTTACTTCGGATGGAAGATCAGAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAACAGAAGTTCAAGTGGGAGAGATT
TAACAAGAAAAAGGACCGCGAGTTGGAAAAAATGAGGATGGAAAATGAAAGGATGAAGCTTGAAAATGAACGCATTGCACTCGACTTAAAGCAAAAGGAACTCGGATCGG
GATTTCATTAATTGGCTGCAATAATCCGAACTAATTTGATCAGGTTTGGAAGATTATGAATCGTTAAGCTGCTGTTGTTTATATATTTCTGGTTATGCAAACGCGAGCGA
TGAGGTAGAGTCGTATTTTGGTACTTTTACATTGCCTGCATTTAGCATGTAAAGTTAGAAAGATGACAATACAAATGACTGCTTTTCATTGTTTTGCCTTTGTTATGTAG
AATCTATCTTTGTTGAGGAACTTGAAGAATGGTTGGGTATTGTGTCAAATTGATGCTATTTCTCTGAACTAACTC
Protein sequence
MEGNISQGGLIPGGASYGGLDLQGPFKVHNQPQHSHAIHQQYHHPHTRPGSSANSSIHEGFTLSMGALPNCDHSLSLVDYNKGERCKNSPSDDEPSFTEDGFDGHNEMSK
GRKGSMWHRVKWTDKMVKLLITAVSYIGDDIGSDFEGSGRRKFQIIQKKGKWKLISKVMAERGFQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCEVVENPALLDVIDYL
TDKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDVDDNEHDETDEHDDFEENFAPHGDNKRSLGVLGGSVKRLRRGQDL
DDAHACGNSLNSLDCKKSLHAYSQTQYAQADTAHLETESMKASTSQKQWMEHCLLRMEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMENERMKLENERIALDL
KQKELGSGFH