; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019278 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019278
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionWD repeat containing protein
Genome locationChr04:19722495..19726883
RNA-Seq ExpressionHG10019278
SyntenyHG10019278
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily
IPR039328 - WD repeat-containing protein 89


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008454356.1 PREDICTED: WD repeat-containing protein 89 homolog isoform X1 [Cucumis melo]3.4e-21992.33Show/hide
Query:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVE+H NAD+++NSSSF+RFGLKNSIQTNFGDDYVFHIAPN DWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTI+SWD+RT QQVSSISAGPSQEIFSFAYGGSN SLLAAG  SQILFWDWRNRKQVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP
        DIDDDDHMDSVINVGTSVGKIGF+G+NYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSW MGHVDYLVDCHYS EG RLWVLGGTNDGTVGYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP

Query:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        I+   GKNAIESPD++LEGGH+GVVRSVLPTTN+LGGFSQSQGVFGWTGGEDGRLCCW SDDSHEMNRSWISSTLVIKSPG RRKNRH PY
Subjt:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

XP_022143440.1 WD repeat-containing protein 89 homolog [Momordica charantia]5.7e-21992.33Show/hide
Query:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        ME+IDMDVEEHANADTSTNSSSF+RFGLKNSIQTNFGDDYVFHIAP+ DWTSMAVSLSSNVVKLYSPVTGQYYGECRGH+GTINQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVR   ++SSISAGPSQEIFSFAYGGS+ +LLAAG  SQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP
        DIDDDDHMDSVINVGTSVGKIGFFG+NYRKLWCLTHIETLSLWDW+DGRNEADITDARTLASNSW M HVDYLVDCHYSNEG RLWVLGGTNDGTVGYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP

Query:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        +DH KGKNAIESP+++LEGGHVGVVRSVLP TNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHE  RSWISSTLVIKSP ARRK+RHHPY
Subjt:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

XP_023522094.1 WD repeat-containing protein 89 homolog [Cucurbita pepo subsp. pepo]1.7e-21590.79Show/hide
Query:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVEE  NAD+S +SSSF+RFGLKN+IQTNFGDDYVFHIAPN DWTSMAVSLSSNVVKLYSPVTGQYYGECRGH GTINQISF+VPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVR  QQVSSISAGPSQEIFSFAYGGS+M+LLAAG  SQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQ KLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP
        DIDDD+HMDSVINVGTSVGKIGFFG+NYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYS+EG RLWVLGGTNDGTVGYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP

Query:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        +DH KGK AIESPD++LEGGH+G+VRSVLP TN+LGGFS+SQGVFGWTGGEDGRLCCW SDDS E NRSWISSTLVIKSPG+RRK+RHHPY
Subjt:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

XP_031740194.1 WD repeat-containing protein GTS1 [Cucumis sativus]4.1e-21791.56Show/hide
Query:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVEEH NAD+++NS+SF+RFGLKNSIQTNFGDDYVFHI PN DWTSMAVSLSSNVVKLYSPVTGQYYGEC GHTGT+NQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTI+SWDVRT QQVSSISAG SQEIFSFAYGGSNMSLLAAG  SQILFWDWRNRKQVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP
        DIDDDDHMDSVINVGTSVGKIGF+G+NYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASN+W MGHVDYLVDCHYSNEG RLWVLGGTNDGTVGYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP

Query:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        I+ S GK AIESPD++LEGGH+GVVRSVLPTTN+LGGFSQSQ VFGWTGGEDGRLCCW SDDS+EMNRSWISSTLVIKSPG RRKNRHHPY
Subjt:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

XP_038906204.1 WD repeat-containing protein GTS1 [Benincasa hispida]1.1e-22293.61Show/hide
Query:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVEEHANADT+TNSSSF+RFGLKNSIQTNFGDDYVFHIAPN DWTSMAVSLSSNVVKLYSPVTGQY+GECRGHTGTIN+ISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVRT QQVSSISAGPSQEIFSFAYGGSNMSLLAAG  SQILFWDWRNRKQVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP
        DIDDDDHMDSVINVGTSVGKIGFFG+NYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASN WTMG VDYLVDCHYS+EG RLWVLGG+NDG +GYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP

Query:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        IDHSKGKNAIESPDI+LEGGH+G++RSVLPTTN LGGFSQSQGVFGWTGGEDGRLCCW SDDSHEMNRSWISS+LVIKSPG RRKNRHHPY
Subjt:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

TrEMBL top hitse value%identityAlignment
A0A0A0KWI9 WD_REPEATS_REGION domain-containing protein2.0e-21791.56Show/hide
Query:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVEEH NAD+++NS+SF+RFGLKNSIQTNFGDDYVFHI PN DWTSMAVSLSSNVVKLYSPVTGQYYGEC GHTGT+NQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTI+SWDVRT QQVSSISAG SQEIFSFAYGGSNMSLLAAG  SQILFWDWRNRKQVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP
        DIDDDDHMDSVINVGTSVGKIGF+G+NYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASN+W MGHVDYLVDCHYSNEG RLWVLGGTNDGTVGYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP

Query:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        I+ S GK AIESPD++LEGGH+GVVRSVLPTTN+LGGFSQSQ VFGWTGGEDGRLCCW SDDS+EMNRSWISSTLVIKSPG RRKNRHHPY
Subjt:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

A0A1S3BYE3 WD repeat-containing protein 89 homolog isoform X11.6e-21992.33Show/hide
Query:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVE+H NAD+++NSSSF+RFGLKNSIQTNFGDDYVFHIAPN DWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTI+SWD+RT QQVSSISAGPSQEIFSFAYGGSN SLLAAG  SQILFWDWRNRKQVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP
        DIDDDDHMDSVINVGTSVGKIGF+G+NYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSW MGHVDYLVDCHYS EG RLWVLGGTNDGTVGYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP

Query:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        I+   GKNAIESPD++LEGGH+GVVRSVLPTTN+LGGFSQSQGVFGWTGGEDGRLCCW SDDSHEMNRSWISSTLVIKSPG RRKNRH PY
Subjt:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

A0A5A7TS28 WD repeat-containing protein 89-like protein isoform X11.6e-21992.33Show/hide
Query:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVE+H NAD+++NSSSF+RFGLKNSIQTNFGDDYVFHIAPN DWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTI+SWD+RT QQVSSISAGPSQEIFSFAYGGSN SLLAAG  SQILFWDWRNRKQVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP
        DIDDDDHMDSVINVGTSVGKIGF+G+NYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSW MGHVDYLVDCHYS EG RLWVLGGTNDGTVGYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP

Query:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        I+   GKNAIESPD++LEGGH+GVVRSVLPTTN+LGGFSQSQGVFGWTGGEDGRLCCW SDDSHEMNRSWISSTLVIKSPG RRKNRH PY
Subjt:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

A0A6J1CNT7 WD repeat-containing protein 89 homolog2.8e-21992.33Show/hide
Query:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        ME+IDMDVEEHANADTSTNSSSF+RFGLKNSIQTNFGDDYVFHIAP+ DWTSMAVSLSSNVVKLYSPVTGQYYGECRGH+GTINQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVR   ++SSISAGPSQEIFSFAYGGS+ +LLAAG  SQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP
        DIDDDDHMDSVINVGTSVGKIGFFG+NYRKLWCLTHIETLSLWDW+DGRNEADITDARTLASNSW M HVDYLVDCHYSNEG RLWVLGGTNDGTVGYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP

Query:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        +DH KGKNAIESP+++LEGGHVGVVRSVLP TNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHE  RSWISSTLVIKSP ARRK+RHHPY
Subjt:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

A0A6J1GAT4 WD repeat-containing protein 89 homolog1.0e-21390.03Show/hide
Query:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVEE  NAD+S +SSSF+RFGLKN+IQTNFGDDYVFHIAPN DWT MAVSLSSNVVKLYSPVTGQYYGECRGH GTINQISF+VPS PHVLHSC
Subjt:  MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVR  QQVSSISAGPSQEIFSFAYGGS+M+LLAAG  SQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQ KLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP
        DIDDD+HMDSVINVGTSVGKIGFFG+NYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSW MG VDYLVDCHYS+EG RLWVLGGTNDGTVGYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFP

Query:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        +DH KGKNAIESPD++LEGGH+G+VRSVLP TN+LGGFS+SQGVFGWTGGEDGRLCCW SDDS E NRSWISSTLVIKSPG RRK+RHHPY
Subjt:  IDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

SwissProt top hitse value%identityAlignment
Q3ZBK1 WD repeat-containing protein 895.3e-2628.53Show/hide
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTC--QQVSSISAGPSQEIFSFAYGGSNMSLLAAG-----G
        +AV  S+  ++++         E RG+ G +N + F+  ++   ++S  +DGT++ WD R    + V      PS    SF    SN  ++ AG      
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTC--QQVSSISAGPSQEIFSFAYGGSNMSLLAAG-----G

Query:  ISQILFWDWR--------NRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHI
         + ++FWD R         ++ +    ++H +D+TQV F P +   + S S DGLV +FD + D ++DD + +  N  +SV  IG+ G +Y++++C+TH 
Subjt:  ISQILFWDWR--------NRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHI

Query:  ETLSLWDWTDGRNEADIT-----DARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFPIDHSKGKNAIESPDIILEGGHVGVVRSVLPTT
        E    WD      +  IT     D R + +     G +DYL+   Y  +  +L+V+GGTN G +       + G   + S    L+GGH   VRS     
Subjt:  ETLSLWDWTDGRNEADIT-----DARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFPIDHSKGKNAIESPDIILEGGHVGVVRSVLPTT

Query:  NVLGGFSQSQGVFGWTGGEDGRLCCW
               Q   +   TGGED +L  W
Subjt:  NVLGGFSQSQGVFGWTGGEDGRLCCW

Q54QU5 WD repeat-containing protein 89 homolog8.2e-3531.34Show/hide
Query:  NFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYG
        + GDD  + +  +     +A + S+ ++K+Y            GH   IN+  F      + L SCSSD T++ WD +T Q   +I+     EIFS    
Subjt:  NFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYG

Query:  GSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCL
        G    +LA G  S ++ ++   +K +   + SH EDVT+V F P  + KL S SVDGL+C++D     DDDD +  VIN   S+G IGFFG  Y+ L+ L
Subjt:  GSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCL

Query:  THIETLSLWDWTDGRNEADI-TDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFPIDHSKGKNAIESPDII-----LEGGHVGVVRSV
        +H E L+ WD T G        D R+  S+ +    ++Y + C Y N   +L + GG  +GT          G   + +PD +     LE  H  V+R  
Subjt:  THIETLSLWDWTDGRNEADI-TDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFPIDHSKGKNAIESPDII-----LEGGHVGVVRSV

Query:  LPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDS
            NV     +S+ +   T  ED ++  W ++ S
Subjt:  LPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDS

Q5FVP5 WD repeat-containing protein 894.4e-2829.01Show/hide
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQ--VSSISAGPSQEIFSFAYGGSNMSLLAAG----GI
        +AV  S+  +++Y   T     E  G  G +N + F+  ++   ++S S+DGT++ WD R   +         PS    SF     +  + A        
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQ--VSSISAGPSQEIFSFAYGGSNMSLLAAG----GI

Query:  SQILFWDWR-------NRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIET
        + ++FWD R        R  +    ++H +D+TQV F P +   + S S DGLV +FD + D +++D + +  N  +SV  IG+ G +Y++++C+TH E 
Subjt:  SQILFWDWR-------NRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIET

Query:  LSLWDWTDGRNEADIT-----DARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFPIDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNV
           WD      +  IT     D R +       GH+DYL+   Y     RL+V+GGTN G +       + G + + S    L+GGH   VRS   T   
Subjt:  LSLWDWTDGRNEADIT-----DARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFPIDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNV

Query:  LGGFSQSQGVFGWTGGEDGRLCCW
            S+   +   TGGED +L  W
Subjt:  LGGFSQSQGVFGWTGGEDGRLCCW

Q944S2 WD repeat-containing protein GTS14.4e-14563.36Show/hide
Query:  SIDMDVE-EHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPS--TPHVLHS
        S +M+VE ++     S+ + + ++FGLKNSIQTNFG DYVF I P  DWT++AVSLS+N VKLYSPVTGQYYGEC+GH+ T+NQI+FS  S  +PHVLHS
Subjt:  SIDMDVE-EHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPS--TPHVLHS

Query:  CSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTN
        CSSDGTIRSWD R+ QQVS I  G  QEIFSF+YGG+  +LLA G   Q+L WDWRN KQVACLE+SH++DVTQVHF+P    KL SASVDGL+C+F+T 
Subjt:  CSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTN

Query:  GDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYF
        GDI+DDDH++SVINVGTS+GKIGF GD Y+KLWCLTHIETLS+W+W DG  E ++  AR LAS+SWT  +VDY VDCH    G  LWV+GGT  GTVGYF
Subjt:  GDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYF

Query:  PIDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHEMNRSWISSTLVIKSPGARRKNRHHPY
        P+++ K   +I + + IL GGH+ VVRSVL      GG   + G+FGWTGGEDGRLCCW SD D+ E+NRSW SS LV+K P  R+KNRH PY
Subjt:  PIDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHEMNRSWISSTLVIKSPGARRKNRHHPY

Q9D0R9 WD repeat-containing protein 891.5e-2829.32Show/hide
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQ--VSSISAGPSQEIFSFAYGGSNMSLLAAG----GI
        +AV  S+  +++Y   T     E  G  G ++ +SF+  ++   ++S S+DGT++ WD R   +  V      PS    SF     +  + A        
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQ--VSSISAGPSQEIFSFAYGGSNMSLLAAG----GI

Query:  SQILFWDWR-------NRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIET
        + ++FWD R        R  +    ++H +D+TQV F P +   + S S DGLV +FD + D  ++D + +  N  +SV  IG+ G +Y++++C+TH E 
Subjt:  SQILFWDWR-------NRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIET

Query:  LSLWDWTDGRNEADIT-----DARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFPIDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNV
           WD      +  IT     D R +       GH+DYL+   Y  +  RL+V+GGTN G +       S G   + S    L GGH   VRS       
Subjt:  LSLWDWTDGRNEADIT-----DARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFPIDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNV

Query:  LGGFSQSQGVFGWTGGEDGRLCCW
           ++ S+     TGGED +L  W
Subjt:  LGGFSQSQGVFGWTGGEDGRLCCW

Arabidopsis top hitse value%identityAlignment
AT1G15440.1 periodic tryptophan protein 22.8e-0623Show/hide
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFW
        +A     N VK+++ ++G  +     HT  +  + F   +  H L S S DGT+R+WD +  +   + +    ++  S     S   ++ AG +     +
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFW

Query:  DWRNRK-QVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFD---TNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDG
         W  +  Q+  +   H   V  + F P  Q  LAS+S D  V ++D   + G ++   H   V+ V         F  + ++L   T    ++ WD  +G
Subjt:  DWRNRK-QVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFD---TNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDG

AT1G15440.2 periodic tryptophan protein 22.8e-0623Show/hide
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFW
        +A     N VK+++ ++G  +     HT  +  + F   +  H L S S DGT+R+WD +  +   + +    ++  S     S   ++ AG +     +
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFW

Query:  DWRNRK-QVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFD---TNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDG
         W  +  Q+  +   H   V  + F P  Q  LAS+S D  V ++D   + G ++   H   V+ V         F  + ++L   T    ++ WD  +G
Subjt:  DWRNRK-QVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFD---TNGDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDG

AT2G47790.1 Transducin/WD40 repeat-like superfamily protein3.1e-14663.36Show/hide
Query:  SIDMDVE-EHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPS--TPHVLHS
        S +M+VE ++     S+ + + ++FGLKNSIQTNFG DYVF I P  DWT++AVSLS+N VKLYSPVTGQYYGEC+GH+ T+NQI+FS  S  +PHVLHS
Subjt:  SIDMDVE-EHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPS--TPHVLHS

Query:  CSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTN
        CSSDGTIRSWD R+ QQVS I  G  QEIFSF+YGG+  +LLA G   Q+L WDWRN KQVACLE+SH++DVTQVHF+P    KL SASVDGL+C+F+T 
Subjt:  CSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTN

Query:  GDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYF
        GDI+DDDH++SVINVGTS+GKIGF GD Y+KLWCLTHIETLS+W+W DG  E ++  AR LAS+SWT  +VDY VDCH    G  LWV+GGT  GTVGYF
Subjt:  GDIDDDDHMDSVINVGTSVGKIGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYF

Query:  PIDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHEMNRSWISSTLVIKSPGARRKNRHHPY
        P+++ K   +I + + IL GGH+ VVRSVL      GG   + G+FGWTGGEDGRLCCW SD D+ E+NRSW SS LV+K P  R+KNRH PY
Subjt:  PIDHSKGKNAIESPDIILEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHEMNRSWISSTLVIKSPGARRKNRHHPY

AT4G07410.1 Transducin family protein / WD-40 repeat family protein1.4e-0530.91Show/hide
Query:  LHSCSSDGTIRSWDVRTCQQVSSISA-----GPSQEIFSFAYGGSNMSLLAAG-GISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVD
        + S SSDG IR WD  +C +V  I+A     G S EI  ++      S+L +G     + FWD  +   +     +H  DV  +   P H  ++ SA  D
Subjt:  LHSCSSDGTIRSWDVRTCQQVSSISA-----GPSQEIFSFAYGGSNMSLLAAG-GISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVD

Query:  GLVCIFDTNG
        G V ++  +G
Subjt:  GLVCIFDTNG

AT5G52820.1 WD-40 repeat family protein / notchless protein, putative7.4e-0727.27Show/hide
Query:  HIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLA
        + +P+  W + A    S  V+L++ +TGQ+    RGH G + Q+S+S  S   +L S S D T++ W++RT +++     G + E+F+  +      +++
Subjt:  HIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTCQQVSSISAGPSQEIFSFAYGGSNMSLLA

Query:  AGGISQILFW
         G    +  W
Subjt:  AGGISQILFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCCATTGATATGGATGTCGAGGAGCACGCCAATGCCGATACGAGTACAAATTCCAGCTCCTTCCAGCGCTTTGGACTCAAGAATTCCATTCAAACCAACTTCGG
TGATGATTACGTTTTTCACATCGCTCCCAATGCTGATTGGACGTCAATGGCGGTGTCATTATCTTCCAATGTTGTGAAGCTATACTCACCAGTGACTGGTCAGTACTATG
GAGAGTGCAGAGGTCACACTGGAACAATCAATCAAATTTCGTTCTCTGTTCCATCAACCCCACATGTATTGCATTCTTGTTCTTCTGATGGAACTATCAGATCTTGGGAT
GTGCGGACTTGTCAGCAGGTTTCATCCATTAGTGCTGGCCCTTCTCAAGAGATCTTCAGCTTTGCATATGGAGGATCAAACATGAGTCTTCTTGCTGCAGGTGGTATTTC
TCAGATTCTCTTCTGGGATTGGCGGAACAGAAAGCAAGTTGCATGCTTGGAGGACTCTCATGTGGAAGATGTGACTCAGGTTCACTTTATTCCGGGCCATCAAGGCAAGC
TTGCTTCTGCTTCCGTCGATGGGTTGGTTTGTATATTTGACACTAACGGGGATATTGATGATGACGATCATATGGATTCTGTGATTAATGTTGGAACTTCAGTTGGTAAG
ATTGGGTTTTTTGGAGACAATTATAGAAAGTTGTGGTGCTTGACTCACATTGAAACCTTGAGCTTATGGGACTGGACAGATGGGAGAAATGAAGCAGATATCACAGATGC
TCGCACACTAGCTTCCAACAGTTGGACAATGGGTCATGTTGATTATTTAGTTGATTGTCACTACTCAAATGAAGGCGGAAGATTGTGGGTTCTTGGAGGTACCAATGATG
GCACCGTTGGCTACTTCCCAATTGACCACAGTAAGGGGAAGAATGCCATCGAATCACCGGACATTATACTTGAGGGTGGTCACGTTGGCGTCGTTAGAAGTGTCTTGCCC
ACGACGAACGTATTGGGTGGATTTTCACAGAGCCAAGGTGTGTTTGGATGGACAGGTGGAGAAGATGGACGTTTATGTTGTTGGTGTTCGGACGATTCTCATGAAATGAA
TCGATCCTGGATTTCAAGCACTCTAGTTATTAAGTCACCCGGTGCTCGGAGGAAAAATAGGCACCATCCTTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAATCCATTGATATGGATGTCGAGGAGCACGCCAATGCCGATACGAGTACAAATTCCAGCTCCTTCCAGCGCTTTGGACTCAAGAATTCCATTCAAACCAACTTCGG
TGATGATTACGTTTTTCACATCGCTCCCAATGCTGATTGGACGTCAATGGCGGTGTCATTATCTTCCAATGTTGTGAAGCTATACTCACCAGTGACTGGTCAGTACTATG
GAGAGTGCAGAGGTCACACTGGAACAATCAATCAAATTTCGTTCTCTGTTCCATCAACCCCACATGTATTGCATTCTTGTTCTTCTGATGGAACTATCAGATCTTGGGAT
GTGCGGACTTGTCAGCAGGTTTCATCCATTAGTGCTGGCCCTTCTCAAGAGATCTTCAGCTTTGCATATGGAGGATCAAACATGAGTCTTCTTGCTGCAGGTGGTATTTC
TCAGATTCTCTTCTGGGATTGGCGGAACAGAAAGCAAGTTGCATGCTTGGAGGACTCTCATGTGGAAGATGTGACTCAGGTTCACTTTATTCCGGGCCATCAAGGCAAGC
TTGCTTCTGCTTCCGTCGATGGGTTGGTTTGTATATTTGACACTAACGGGGATATTGATGATGACGATCATATGGATTCTGTGATTAATGTTGGAACTTCAGTTGGTAAG
ATTGGGTTTTTTGGAGACAATTATAGAAAGTTGTGGTGCTTGACTCACATTGAAACCTTGAGCTTATGGGACTGGACAGATGGGAGAAATGAAGCAGATATCACAGATGC
TCGCACACTAGCTTCCAACAGTTGGACAATGGGTCATGTTGATTATTTAGTTGATTGTCACTACTCAAATGAAGGCGGAAGATTGTGGGTTCTTGGAGGTACCAATGATG
GCACCGTTGGCTACTTCCCAATTGACCACAGTAAGGGGAAGAATGCCATCGAATCACCGGACATTATACTTGAGGGTGGTCACGTTGGCGTCGTTAGAAGTGTCTTGCCC
ACGACGAACGTATTGGGTGGATTTTCACAGAGCCAAGGTGTGTTTGGATGGACAGGTGGAGAAGATGGACGTTTATGTTGTTGGTGTTCGGACGATTCTCATGAAATGAA
TCGATCCTGGATTTCAAGCACTCTAGTTATTAAGTCACCCGGTGCTCGGAGGAAAAATAGGCACCATCCTTACTAA
Protein sequenceShow/hide protein sequence
MESIDMDVEEHANADTSTNSSSFQRFGLKNSIQTNFGDDYVFHIAPNADWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWD
VRTCQQVSSISAGPSQEIFSFAYGGSNMSLLAAGGISQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGK
IGFFGDNYRKLWCLTHIETLSLWDWTDGRNEADITDARTLASNSWTMGHVDYLVDCHYSNEGGRLWVLGGTNDGTVGYFPIDHSKGKNAIESPDIILEGGHVGVVRSVLP
TTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY