; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G10090 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G10090
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionWD repeat-containing protein 89 homolog
Genome locationClcChr10:22729978..22737534
RNA-Seq ExpressionClc10G10090
SyntenyClc10G10090
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily
IPR039328 - WD repeat-containing protein 89


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008454356.1 PREDICTED: WD repeat-containing protein 89 homolog isoform X1 [Cucumis melo]1.3e-22092.84Show/hide
Query:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVE+H NAD+ +NSSSFKRFGLKNSIQTNFGDDYVFHIAPN DWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING
        SSDGTI+SWD+RTFQQVSSISAGPSQEIFSFAYGGSN SLLAAGCKSQILFWDWRNR+QVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFD NG
Subjt:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP
        DIDDDDHMDSVINVGTSVGKIGF+GENYRKLWCLTHIETLSLWDWTDGRNEADI DARTLASNSW +GHVDYLVDCHYS EG RLWVLGGTNDGT+GYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP

Query:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        I+   GKNAIESPD+VLEGGH+GVVRSVLPTTN+LGGFSQSQGVFGWTGGEDGRLCCW SDDSHEMNRSWISSTLVIKSPG RRKNRH PY
Subjt:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

XP_022143440.1 WD repeat-containing protein 89 homolog [Momordica charantia]2.5e-21991.82Show/hide
Query:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        ME+IDMDVEEHANADT TNSSSFKRFGLKNSIQTNFGDDYVFHIAP+ DWTSMAVSLSSNVVKLYSPVTGQYYGECRGH+GTINQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING
        SSDGTIRSWDVR   ++SSISAGPSQEIFSFAYGGS+ +LLAAGCKSQILFWDWRNR+QVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFD NG
Subjt:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP
        DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDW+DGRNEADI DARTLASNSW + HVDYLVDCHYSNEG RLWVLGGTNDGT+GYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP

Query:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        +DH KGKNAIESP++VLEGGHVGVVRSVLP TNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHE  RSWISSTLVIKSP ARRK+RHHPY
Subjt:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

XP_023522094.1 WD repeat-containing protein 89 homolog [Cucurbita pepo subsp. pepo]1.8e-21790.79Show/hide
Query:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVEE  NAD+  +SSSFKRFGLKN+IQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGH GTINQISF+VPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING
        SSDGTIRSWDVR FQQVSSISAGPSQEIFSFAYGGS+M+LLAAGCKSQILFWDWRNR+QVACLEDSHVEDVTQVHF+PGHQ KLASASVDGLVCIFD NG
Subjt:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP
        DIDDD+HMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADI DARTLASNSWT+GHVDYLVDCHYS+EG RLWVLGGTNDGT+GYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP

Query:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        +DH KGK AIESPD+VLEGGH+G+VRSVLP TN+LGGFS+SQGVFGWTGGEDGRLCCW SDDS E NRSWISSTLVIKSPG+RRK+RHHPY
Subjt:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

XP_031740194.1 WD repeat-containing protein GTS1 [Cucumis sativus]2.8e-21891.82Show/hide
Query:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVEEH NAD+ +NS+SFKRFGLKNSIQTNFGDDYVFHI PN DWTSMAVSLSSNVVKLYSPVTGQYYGEC GHTGT+NQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING
        SSDGTI+SWDVRTFQQVSSISAG SQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNR+QVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFD NG
Subjt:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP
        DIDDDDHMDSVINVGTSVGKIGF+GENYRKLWCLTHIETLSLWDWTDGRNEADI DARTLASN+W +GHVDYLVDCHYSNEG RLWVLGGTNDGT+GYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP

Query:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        I+   GK AIESPD+VLEGGH+GVVRSVLPTTN+LGGFSQSQ VFGWTGGEDGRLCCW SDDS+EMNRSWISSTLVIKSPG RRKNRHHPY
Subjt:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

XP_038906204.1 WD repeat-containing protein GTS1 [Benincasa hispida]4.0e-22594.63Show/hide
Query:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVEEHANADT TNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQY+GECRGHTGTIN+ISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING
        SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNR+QVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFD NG
Subjt:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP
        DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADI DARTLASN WT+G VDYLVDCHYS+EG RLWVLGG+NDG IGYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP

Query:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        IDH KGKNAIESPDIVLEGGH+G++RSVLPTTN LGGFSQSQGVFGWTGGEDGRLCCW SDDSHEMNRSWISS+LVIKSPG RRKNRHHPY
Subjt:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

TrEMBL top hitse value%identityAlignment
A0A0A0KWI9 WD_REPEATS_REGION domain-containing protein1.3e-21891.82Show/hide
Query:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVEEH NAD+ +NS+SFKRFGLKNSIQTNFGDDYVFHI PN DWTSMAVSLSSNVVKLYSPVTGQYYGEC GHTGT+NQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING
        SSDGTI+SWDVRTFQQVSSISAG SQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNR+QVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFD NG
Subjt:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP
        DIDDDDHMDSVINVGTSVGKIGF+GENYRKLWCLTHIETLSLWDWTDGRNEADI DARTLASN+W +GHVDYLVDCHYSNEG RLWVLGGTNDGT+GYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP

Query:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        I+   GK AIESPD+VLEGGH+GVVRSVLPTTN+LGGFSQSQ VFGWTGGEDGRLCCW SDDS+EMNRSWISSTLVIKSPG RRKNRHHPY
Subjt:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

A0A1S3BYE3 WD repeat-containing protein 89 homolog isoform X16.4e-22192.84Show/hide
Query:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVE+H NAD+ +NSSSFKRFGLKNSIQTNFGDDYVFHIAPN DWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING
        SSDGTI+SWD+RTFQQVSSISAGPSQEIFSFAYGGSN SLLAAGCKSQILFWDWRNR+QVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFD NG
Subjt:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP
        DIDDDDHMDSVINVGTSVGKIGF+GENYRKLWCLTHIETLSLWDWTDGRNEADI DARTLASNSW +GHVDYLVDCHYS EG RLWVLGGTNDGT+GYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP

Query:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        I+   GKNAIESPD+VLEGGH+GVVRSVLPTTN+LGGFSQSQGVFGWTGGEDGRLCCW SDDSHEMNRSWISSTLVIKSPG RRKNRH PY
Subjt:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

A0A5A7TS28 WD repeat-containing protein 89-like protein isoform X16.4e-22192.84Show/hide
Query:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVE+H NAD+ +NSSSFKRFGLKNSIQTNFGDDYVFHIAPN DWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING
        SSDGTI+SWD+RTFQQVSSISAGPSQEIFSFAYGGSN SLLAAGCKSQILFWDWRNR+QVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFD NG
Subjt:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP
        DIDDDDHMDSVINVGTSVGKIGF+GENYRKLWCLTHIETLSLWDWTDGRNEADI DARTLASNSW +GHVDYLVDCHYS EG RLWVLGGTNDGT+GYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP

Query:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        I+   GKNAIESPD+VLEGGH+GVVRSVLPTTN+LGGFSQSQGVFGWTGGEDGRLCCW SDDSHEMNRSWISSTLVIKSPG RRKNRH PY
Subjt:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

A0A6J1CNT7 WD repeat-containing protein 89 homolog1.2e-21991.82Show/hide
Query:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        ME+IDMDVEEHANADT TNSSSFKRFGLKNSIQTNFGDDYVFHIAP+ DWTSMAVSLSSNVVKLYSPVTGQYYGECRGH+GTINQISFSVPSTPHVLHSC
Subjt:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING
        SSDGTIRSWDVR   ++SSISAGPSQEIFSFAYGGS+ +LLAAGCKSQILFWDWRNR+QVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFD NG
Subjt:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP
        DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDW+DGRNEADI DARTLASNSW + HVDYLVDCHYSNEG RLWVLGGTNDGT+GYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP

Query:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        +DH KGKNAIESP++VLEGGHVGVVRSVLP TNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHE  RSWISSTLVIKSP ARRK+RHHPY
Subjt:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

A0A6J1GAT4 WD repeat-containing protein 89 homolog1.1e-21590.03Show/hide
Query:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC
        MESIDMDVEE  NAD+  +SSSFKRFGLKN+IQTNFGDDYVFHIAPNGDWT MAVSLSSNVVKLYSPVTGQYYGECRGH GTINQISF+VPS PHVLHSC
Subjt:  MESIDMDVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING
        SSDGTIRSWDVR FQQVSSISAGPSQEIFSFAYGGS+M+LLAAGCKSQILFWDWRNR+QVACLEDSHVEDVTQVHF+PGHQ KLASASVDGLVCIFD NG
Subjt:  SSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDING

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP
        DIDDD+HMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADI DARTLASNSW +G VDYLVDCHYS+EG RLWVLGGTNDGT+GYFP
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFP

Query:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY
        +DH KGKNAIESPD+VLEGGH+G+VRSVLP TN+LGGFS+SQGVFGWTGGEDGRLCCW SDDS E NRSWISSTLVIKSPG RRK+RHHPY
Subjt:  IDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY

SwissProt top hitse value%identityAlignment
Q54QU5 WD repeat-containing protein 89 homolog9.4e-3631.64Show/hide
Query:  NFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYG
        + GDD  + +  +     +A + S+ ++K+Y            GH   IN+  F      + L SCSSD T++ WD +T Q   +I+     EIFS    
Subjt:  NFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYG

Query:  GSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDINGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCL
        G    +LA G  S ++ ++   ++ +   + SH EDVT+V F P  + KL S SVDGL+C++D+    DDDD +  VIN   S+G IGFFG  Y+ L+ L
Subjt:  GSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDINGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCL

Query:  THIETLSLWDWTDGRNEADI-ADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFPIDHGKGKNAIESPDIV-----LEGGHVGVVRSV
        +H E L+ WD T G       AD R+  S+ +    ++Y + C Y N   +L + GG  +GT          G   + +PD V     LE  H  V+R  
Subjt:  THIETLSLWDWTDGRNEADI-ADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFPIDHGKGKNAIESPDIV-----LEGGHVGVVRSV

Query:  LPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDS
            NV     +S+ +   T  ED ++  W ++ S
Subjt:  LPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDS

Q5FVP5 WD repeat-containing protein 891.5e-2829.66Show/hide
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQ--VSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQ--
        +AV  S+  +++Y   T     E  G  G +N + F+  ++   ++S S+DGT++ WD R   +         PS    SF     +  + A   K +  
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQ--VSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQ--

Query:  --ILFWDWR-------NRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDINGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIET
          ++FWD R        R  +    ++H +D+TQV F P +   + S S DGLV +FD++ D +++D + +  N  +SV  IG+ G +Y++++C+TH E 
Subjt:  --ILFWDWR-------NRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDINGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIET

Query:  LSLWDW----TDG----RNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFPIDHGKGKNAIESPDIVLEGGHVGVVRSVLPT
           WD     TD      N  D+ D   +       GH+DYL+   Y     RL+V+GGTN G I         G + + S    L+GGH   VRS   T
Subjt:  LSLWDW----TDG----RNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFPIDHGKGKNAIESPDIVLEGGHVGVVRSVLPT

Query:  TNVLGGFSQSQGVFGWTGGEDGRLCCW
               S+   +   TGGED +L  W
Subjt:  TNVLGGFSQSQGVFGWTGGEDGRLCCW

Q944S2 WD repeat-containing protein GTS12.9e-14663.1Show/hide
Query:  SIDMDVE-EHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPS--TPHVLHS
        S +M+VE ++      + + + K+FGLKNSIQTNFG DYVF I P  DWT++AVSLS+N VKLYSPVTGQYYGEC+GH+ T+NQI+FS  S  +PHVLHS
Subjt:  SIDMDVE-EHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPS--TPHVLHS

Query:  CSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDIN
        CSSDGTIRSWD R+FQQVS I  G  QEIFSF+YGG+  +LLA GCK Q+L WDWRN +QVACLE+SH++DVTQVHFVP    KL SASVDGL+C+F+  
Subjt:  CSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDIN

Query:  GDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYF
        GDI+DDDH++SVINVGTS+GKIGF G+ Y+KLWCLTHIETLS+W+W DG  E ++  AR LAS+SWT  +VDY VDCH    G  LWV+GGT  GT+GYF
Subjt:  GDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYF

Query:  PIDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHEMNRSWISSTLVIKSPGARRKNRHHPY
        P+++ K   +I + + +L GGH+ VVRSVL      GG   + G+FGWTGGEDGRLCCW SD D+ E+NRSW SS LV+K P  R+KNRH PY
Subjt:  PIDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHEMNRSWISSTLVIKSPGARRKNRHHPY

Q96FK6 WD repeat-containing protein 898.0e-2730.06Show/hide
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQ--VSSISAGPSQEIFSFAYGGSNMSLLAAGCK----
        +AV  S+  +++Y         E  G+ G +N + F+  ++   ++S  +DGT++ WD R  ++  V      PS    SF     N  ++ AG +    
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQ--VSSISAGPSQEIFSFAYGGSNMSLLAAGCK----

Query:  -SQILFWDWR-NRRQVACLEDS-------HVEDVTQVHFVPGHQGKLASASVDGLVCIFDINGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHI
         + ++FWD R N + ++  +DS       H +DVTQV F P +   + S S DGLV +FDIN D +++D + +  N  +SV  IG+ G+ Y++++C+TH 
Subjt:  -SQILFWDWR-NRRQVACLEDS-------HVEDVTQVHFVPGHQGKLASASVDGLVCIFDINGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHI

Query:  ETLSLWDWT-----DGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFPIDHGKGKNAIESPDIVLEGGHVGVVRSVLPTT
        E    WD       +     +I D R + +       +DYL+   Y  +   L V+GGTN G I         G   + S    L+GGH   VRS     
Subjt:  ETLSLWDWT-----DGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFPIDHGKGKNAIESPDIVLEGGHVGVVRSVLPTT

Query:  NVLGGFSQSQGVFGWTGGEDGRLCCW
        NV     Q   +   TGGED +L  W
Subjt:  NVLGGFSQSQGVFGWTGGEDGRLCCW

Q9D0R9 WD repeat-containing protein 891.1e-2829.32Show/hide
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVR--TFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCK----
        +AV  S+  +++Y   T     E  G  G ++ +SF+  ++   ++S S+DGT++ WD R  + + V      PS    SF     +  + A   K    
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVR--TFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCK----

Query:  SQILFWDWR-------NRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDINGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIET
        + ++FWD R        R  +    ++H +D+TQV F P +   + S S DGLV +FD++ D  ++D + +  N  +SV  IG+ G++Y++++C+TH E 
Subjt:  SQILFWDWR-------NRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDINGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIET

Query:  LSLWDWTDGRNE-----ADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFPIDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNV
           WD      +      +I D R +       GH+DYL+   Y  +  RL+V+GGTN G I         G   + S    L GGH   VRS       
Subjt:  LSLWDWTDGRNE-----ADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFPIDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNV

Query:  LGGFSQSQGVFGWTGGEDGRLCCW
           ++ S+     TGGED +L  W
Subjt:  LGGFSQSQGVFGWTGGEDGRLCCW

Arabidopsis top hitse value%identityAlignment
AT1G15440.1 periodic tryptophan protein 23.2e-0722.94Show/hide
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFW
        +A     N VK+++ ++G  +     HT  +  + F   +  H L S S DGT+R+WD + ++   + +    ++  S     S   ++ AG       +
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFW

Query:  DWRNRR-QVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDI---NGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDG
         W  +  Q+  +   H   V  + F P  Q  LAS+S D  V ++D+    G ++   H   V+ V         F  + ++L   T    ++ WD  +G
Subjt:  DWRNRR-QVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDI---NGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDG

Query:  ------RNEADIADARTL
                  DIA  R +
Subjt:  ------RNEADIADARTL

AT1G15440.2 periodic tryptophan protein 23.2e-0722.94Show/hide
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFW
        +A     N VK+++ ++G  +     HT  +  + F   +  H L S S DGT+R+WD + ++   + +    ++  S     S   ++ AG       +
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFW

Query:  DWRNRR-QVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDI---NGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDG
         W  +  Q+  +   H   V  + F P  Q  LAS+S D  V ++D+    G ++   H   V+ V         F  + ++L   T    ++ WD  +G
Subjt:  DWRNRR-QVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDI---NGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDG

Query:  ------RNEADIADARTL
                  DIA  R +
Subjt:  ------RNEADIADARTL

AT2G47790.1 Transducin/WD40 repeat-like superfamily protein2.1e-14763.1Show/hide
Query:  SIDMDVE-EHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPS--TPHVLHS
        S +M+VE ++      + + + K+FGLKNSIQTNFG DYVF I P  DWT++AVSLS+N VKLYSPVTGQYYGEC+GH+ T+NQI+FS  S  +PHVLHS
Subjt:  SIDMDVE-EHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPS--TPHVLHS

Query:  CSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDIN
        CSSDGTIRSWD R+FQQVS I  G  QEIFSF+YGG+  +LLA GCK Q+L WDWRN +QVACLE+SH++DVTQVHFVP    KL SASVDGL+C+F+  
Subjt:  CSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDIN

Query:  GDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYF
        GDI+DDDH++SVINVGTS+GKIGF G+ Y+KLWCLTHIETLS+W+W DG  E ++  AR LAS+SWT  +VDY VDCH    G  LWV+GGT  GT+GYF
Subjt:  GDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYF

Query:  PIDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHEMNRSWISSTLVIKSPGARRKNRHHPY
        P+++ K   +I + + +L GGH+ VVRSVL      GG   + G+FGWTGGEDGRLCCW SD D+ E+NRSW SS LV+K P  R+KNRH PY
Subjt:  PIDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHEMNRSWISSTLVIKSPGARRKNRHHPY

AT5G49430.1 WD40/YVTN repeat-like-containing domain;Bromodomain5.2e-0539.74Show/hide
Query:  GDWTSMAVS-------LSSN--VVKLYSPVTGQYYGECRGHTGTINQISFSV-PSTPHVLHSCSSDGTIRSWDVRTFQ
        GD T +AVS        +SN  V++++    G      RGHTG +  I+FS  P +P+ L S S DGT R WD R  Q
Subjt:  GDWTSMAVS-------LSSN--VVKLYSPVTGQYYGECRGHTGTINQISFSV-PSTPHVLHSCSSDGTIRSWDVRTFQ

AT5G52820.1 WD-40 repeat family protein / notchless protein, putative8.5e-0828.18Show/hide
Query:  HIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLA
        + +P+G W + A    S  V+L++ +TGQ+    RGH G + Q+S+S  S   +L S S D T++ W++RT +++     G + E+F+  +      +++
Subjt:  HIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQVSSISAGPSQEIFSFAYGGSNMSLLA

Query:  AGCKSQILFW
         G    +  W
Subjt:  AGCKSQILFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGAATTTTCCCCCATTTTTAGGCGGGGACAGGGGCGAGGATTCCCCGTTGAGGAAATGGGTCCTCGTGGAGGCCCTATTTCCTGCGAGACTTTAGAGAGAGCGAT
ACTGGATACCCTCCTCGTCCCTAGATGGGTACCCAAGGAATGTCCCCGATTTTATCCCGTCCTTGGAAGGATACTCGGAGATTTTACCTCGTCCAGTCAAACCGGAGATT
TCCCAACTCGATTGATTAACCAATTGGAACGAACTTTGTTGGAAGTTTCAGTGGCGCGCCTCCAACGAGAGAAAGAAGAAAGGGAAAGAAAAATGGAATCCATTGACATG
GATGTTGAGGAACACGCCAATGCCGATACGAGGACAAATTCCAGCTCCTTCAAGCGCTTTGGACTCAAGAATTCCATTCAAACCAACTTCGGTGATGATTACGTTTTTCA
CATCGCCCCCAATGGGGATTGGACGTCAATGGCGGTGTCATTATCTTCCAATGTTGTGAAGCTATACTCGCCTGTGACTGGTCAGTACTATGGAGAGTGCAGAGGTCACA
CTGGAACAATCAATCAAATTTCCTTCTCTGTTCCTTCAACCCCACATGTATTGCATTCTTGTTCTTCTGATGGAACTATCAGATCTTGGGATGTGCGGACTTTTCAGCAG
GTTTCATCCATTAGTGCTGGCCCTTCTCAAGAAATCTTCAGCTTTGCCTATGGAGGATCAAATATGAGTCTTCTTGCTGCAGGTTGTAAATCTCAGATTCTCTTCTGGGA
TTGGAGGAACAGAAGGCAAGTGGCATGCTTGGAGGACTCTCATGTGGAAGATGTGACTCAGGTTCACTTTGTTCCGGGCCATCAAGGCAAGCTTGCTTCTGCTTCCGTGG
ATGGGTTGGTTTGTATATTTGACATTAACGGGGATATTGATGATGACGATCATATGGATTCTGTGATTAATGTGGGAACTTCAGTCGGTAAGATTGGGTTTTTTGGAGAG
AATTATAGAAAGTTGTGGTGCTTGACTCACATTGAAACATTGAGCTTATGGGATTGGACAGATGGGAGAAATGAAGCAGATATCGCAGATGCTCGCACACTAGCTTCCAA
CAGTTGGACAATAGGCCATGTCGATTATTTAGTTGATTGTCACTACTCAAATGAAGGCGGTAGATTGTGGGTTCTTGGAGGTACCAACGATGGCACCATAGGCTACTTCC
CGATCGACCACGGTAAGGGGAAGAATGCAATCGAATCACCAGACATTGTTCTTGAGGGTGGCCACGTTGGCGTCGTTAGAAGTGTTTTGCCCACGACGAATGTATTGGGT
GGATTCTCACAGAGCCAAGGTGTGTTTGGATGGACAGGTGGAGAAGATGGACGTTTATGTTGTTGGTGTTCGGACGATTCCCACGAAATGAATCGATCCTGGATTTCGAG
CACTCTAGTTATTAAGTCACCTGGCGCTCGGAGGAAAAATAGACACCATCCTTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGAATTTTCCCCCATTTTTAGGCGGGGACAGGGGCGAGGATTCCCCGTTGAGGAAATGGGTCCTCGTGGAGGCCCTATTTCCTGCGAGACTTTAGAGAGAGCGAT
ACTGGATACCCTCCTCGTCCCTAGATGGGTACCCAAGGAATGTCCCCGATTTTATCCCGTCCTTGGAAGGATACTCGGAGATTTTACCTCGTCCAGTCAAACCGGAGATT
TCCCAACTCGATTGATTAACCAATTGGAACGAACTTTGTTGGAAGTTTCAGTGGCGCGCCTCCAACGAGAGAAAGAAGAAAGGGAAAGAAAAATGGAATCCATTGACATG
GATGTTGAGGAACACGCCAATGCCGATACGAGGACAAATTCCAGCTCCTTCAAGCGCTTTGGACTCAAGAATTCCATTCAAACCAACTTCGGTGATGATTACGTTTTTCA
CATCGCCCCCAATGGGGATTGGACGTCAATGGCGGTGTCATTATCTTCCAATGTTGTGAAGCTATACTCGCCTGTGACTGGTCAGTACTATGGAGAGTGCAGAGGTCACA
CTGGAACAATCAATCAAATTTCCTTCTCTGTTCCTTCAACCCCACATGTATTGCATTCTTGTTCTTCTGATGGAACTATCAGATCTTGGGATGTGCGGACTTTTCAGCAG
GTTTCATCCATTAGTGCTGGCCCTTCTCAAGAAATCTTCAGCTTTGCCTATGGAGGATCAAATATGAGTCTTCTTGCTGCAGGTTGTAAATCTCAGATTCTCTTCTGGGA
TTGGAGGAACAGAAGGCAAGTGGCATGCTTGGAGGACTCTCATGTGGAAGATGTGACTCAGGTTCACTTTGTTCCGGGCCATCAAGGCAAGCTTGCTTCTGCTTCCGTGG
ATGGGTTGGTTTGTATATTTGACATTAACGGGGATATTGATGATGACGATCATATGGATTCTGTGATTAATGTGGGAACTTCAGTCGGTAAGATTGGGTTTTTTGGAGAG
AATTATAGAAAGTTGTGGTGCTTGACTCACATTGAAACATTGAGCTTATGGGATTGGACAGATGGGAGAAATGAAGCAGATATCGCAGATGCTCGCACACTAGCTTCCAA
CAGTTGGACAATAGGCCATGTCGATTATTTAGTTGATTGTCACTACTCAAATGAAGGCGGTAGATTGTGGGTTCTTGGAGGTACCAACGATGGCACCATAGGCTACTTCC
CGATCGACCACGGTAAGGGGAAGAATGCAATCGAATCACCAGACATTGTTCTTGAGGGTGGCCACGTTGGCGTCGTTAGAAGTGTTTTGCCCACGACGAATGTATTGGGT
GGATTCTCACAGAGCCAAGGTGTGTTTGGATGGACAGGTGGAGAAGATGGACGTTTATGTTGTTGGTGTTCGGACGATTCCCACGAAATGAATCGATCCTGGATTTCGAG
CACTCTAGTTATTAAGTCACCTGGCGCTCGGAGGAAAAATAGACACCATCCTTACTAAAAATTAACATATGTTTTTCTTGTGTCTTTGATCATACAATCTCTCTATGTTT
CTGTAGCTGTATTTATCATCATTTTGCCTCCCAACCCCTCAGTTTGATGTTGTTTTCACTTGCCTCTTTAGGGAGTATTATATA
Protein sequenceShow/hide protein sequence
MGEFSPIFRRGQGRGFPVEEMGPRGGPISCETLERAILDTLLVPRWVPKECPRFYPVLGRILGDFTSSSQTGDFPTRLINQLERTLLEVSVARLQREKEERERKMESIDM
DVEEHANADTRTNSSSFKRFGLKNSIQTNFGDDYVFHIAPNGDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHTGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRTFQQ
VSSISAGPSQEIFSFAYGGSNMSLLAAGCKSQILFWDWRNRRQVACLEDSHVEDVTQVHFVPGHQGKLASASVDGLVCIFDINGDIDDDDHMDSVINVGTSVGKIGFFGE
NYRKLWCLTHIETLSLWDWTDGRNEADIADARTLASNSWTIGHVDYLVDCHYSNEGGRLWVLGGTNDGTIGYFPIDHGKGKNAIESPDIVLEGGHVGVVRSVLPTTNVLG
GFSQSQGVFGWTGGEDGRLCCWCSDDSHEMNRSWISSTLVIKSPGARRKNRHHPY