CuGenDBv2

MS019051 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS019051
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: WD repeat-containing protein 89 homolog
Genome location: scaffold20:1001863..1006156
RNA-Seq Expression: MS019051
Synteny: MS019051
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily
IPR039328 - WD repeat-containing protein 89


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606965.1 WD repeat-containing protein GTS1, partial [Cucurbita argyrosperma subsp. sororia]; e value 2.4e-212; identity 89.37%
Query:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
        ME+IDMDVEE  NAD+S +SSSFKRFGLKN+IQTNFGDDYVFHIAP+ DWT MAVSLSSNVVKLYSPVTGQYYGECRGH GTINQISF+VPSTPHVLHSC
Subjt:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVR   ++SSISAGPSQEIFSFAYGGSS NLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQ KLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV
        DIDDD+HMDSVINVGTSVGKIGFFGE YRKLWCLTHIETLSLWDW+DGRNEADITDARTLASNSW M H    VDYLVDCHYS+EG+RLWVLGGTNDGTV
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV

Query:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
        GYFPVDH KGKNAIESP+VVLEGGH+G+VRSVLP+TN+LGGFS+SQGVFGWTGGEDGRLCCW SDDS ET RSWISSTLVIKSP  RRKHRHHPY
Subjt:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY

XP_008454356.1 PREDICTED: WD repeat-containing protein 89 homolog isoform X1 [Cucumis melo]; e value 1.5e-211; identity 88.61%
Query:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
        ME+IDMDVE+H NAD+++NSSSFKRFGLKNSIQTNFGDDYVFHIAP+ DWTSMAVSLSSNVVKLYSPVTGQYYGECRGH+GTINQISFSVPSTPHVLHSC
Subjt:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTI+SWD+R   ++SSISAGPSQEIFSFAYGGS+T+LLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV
        DIDDDDHMDSVINVGTSVGKIGF+GENYRKLWCLTHIETLSLWDW+DGRNEADITDARTLASNSW M H    VDYLVDCHYS EG RLWVLGGTNDGTV
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV

Query:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
        GYFP++   GKNAIESP+VVLEGGH+GVVRSVLP TN+LGGFSQSQGVFGWTGGEDGRLCCW SDDSHE  RSWISSTLVIKSP  RRK+RH PY
Subjt:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY

XP_022143440.1 WD repeat-containing protein 89 homolog [Momordica charantia]; e value 2.1e-232; identity 98.48%
Query:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
        MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
Subjt:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV
        DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDH    VDYLVDCHYSNEG+RLWVLGGTNDGTV
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV

Query:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
        GYFPVDH KGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
Subjt:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY

XP_023522094.1 WD repeat-containing protein 89 homolog [Cucurbita pepo subsp. pepo]; e value 6.2e-213; identity 89.62%
Query:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
        ME+IDMDVEE  NAD+S +SSSFKRFGLKN+IQTNFGDDYVFHIAP+ DWTSMAVSLSSNVVKLYSPVTGQYYGECRGH GTINQISF+VPSTPHVLHSC
Subjt:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVR   ++SSISAGPSQEIFSFAYGGSS NLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQ KLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV
        DIDDD+HMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDW+DGRNEADITDARTLASNSW M H    VDYLVDCHYS+EG+RLWVLGGTNDGTV
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV

Query:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
        GYFPVDH KGK AIESP+VVLEGGH+G+VRSVLP+TN+LGGFS+SQGVFGWTGGEDGRLCCW SDDS ET RSWISSTLVIKSP +RRKHRHHPY
Subjt:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY

XP_038906204.1 WD repeat-containing protein GTS1 [Benincasa hispida]; e value 1.4e-212; identity 88.35%
Query:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
        ME+IDMDVEEHANADT+TNSSSFKRFGLKNSIQTNFGDDYVFHIAP+ DWTSMAVSLSSNVVKLYSPVTGQY+GECRGH+GTIN+ISFSVPSTPHVLHSC
Subjt:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVR   ++SSISAGPSQEIFSFAYGGS+ +LLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV
        DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDW+DGRNEADITDARTLASN W M      VDYLVDCHYS+EG RLWVLGG+NDG +
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV

Query:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
        GYFP+DH KGKNAIESP++VLEGGH+G++RSVLP TN LGGFSQSQGVFGWTGGEDGRLCCW SDDSHE  RSWISS+LVIKSP  RRK+RHHPY
Subjt:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
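
The % identity figures listed with each hit appear to be the share of alignment columns whose middle-line character is a residue letter, as opposed to a space (mismatch or gap) or "+" (conservative substitution). The following is a minimal Python sketch under that assumption. The function name `percent_identity` is hypothetical, and the example uses only a 12-column excerpt from one of the alignment blocks above, not a full alignment.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of alignment columns marked identical in a BLAST-style midline.

    Assumption: letters in the midline mark identical residues, '+' marks a
    conservative substitution, and a space marks a mismatch or gap.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / aligned_length


# Excerpt only: 12 query columns and the matching midline characters.
query = "GYFPVDHRKGKN"
midline = "GYFPVDH KGKN"
print(round(percent_identity(midline, len(query)), 2))
```

To check a whole hit, concatenate the middle lines of all its blocks (keeping the spaces) and pass the total number of aligned columns as `aligned_length`.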

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BYE3 WD repeat-containing protein 89 homolog isoform X1; e value 7.4e-212; identity 88.61%
Query:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
        ME+IDMDVE+H NAD+++NSSSFKRFGLKNSIQTNFGDDYVFHIAP+ DWTSMAVSLSSNVVKLYSPVTGQYYGECRGH+GTINQISFSVPSTPHVLHSC
Subjt:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTI+SWD+R   ++SSISAGPSQEIFSFAYGGS+T+LLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV
        DIDDDDHMDSVINVGTSVGKIGF+GENYRKLWCLTHIETLSLWDW+DGRNEADITDARTLASNSW M H    VDYLVDCHYS EG RLWVLGGTNDGTV
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV

Query:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
        GYFP++   GKNAIESP+VVLEGGH+GVVRSVLP TN+LGGFSQSQGVFGWTGGEDGRLCCW SDDSHE  RSWISSTLVIKSP  RRK+RH PY
Subjt:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY

A0A5A7TS28 WD repeat-containing protein 89-like protein isoform X1; e value 7.4e-212; identity 88.61%
Query:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
        ME+IDMDVE+H NAD+++NSSSFKRFGLKNSIQTNFGDDYVFHIAP+ DWTSMAVSLSSNVVKLYSPVTGQYYGECRGH+GTINQISFSVPSTPHVLHSC
Subjt:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTI+SWD+R   ++SSISAGPSQEIFSFAYGGS+T+LLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHF+PGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV
        DIDDDDHMDSVINVGTSVGKIGF+GENYRKLWCLTHIETLSLWDW+DGRNEADITDARTLASNSW M H    VDYLVDCHYS EG RLWVLGGTNDGTV
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV

Query:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
        GYFP++   GKNAIESP+VVLEGGH+GVVRSVLP TN+LGGFSQSQGVFGWTGGEDGRLCCW SDDSHE  RSWISSTLVIKSP  RRK+RH PY
Subjt:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY

A0A6J1CNT7 WD repeat-containing protein 89 homolog; e value 9.9e-233; identity 98.48%
Query:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
        MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
Subjt:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV
        DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDH    VDYLVDCHYSNEG+RLWVLGGTNDGTV
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV

Query:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
        GYFPVDH KGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
Subjt:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY

A0A6J1GAT4 WD repeat-containing protein 89 homolog; e value 9.7e-212; identity 89.11%
Query:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC
        ME+IDMDVEE  NAD+S +SSSFKRFGLKN+IQTNFGDDYVFHIAP+ DWT MAVSLSSNVVKLYSPVTGQYYGECRGH GTINQISF+VPS PHVLHSC
Subjt:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSC

Query:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG
        SSDGTIRSWDVR   ++SSISAGPSQEIFSFAYGGSS NLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQ KLASASVDGLVCIFDTNG
Subjt:  SSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNG

Query:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV
        DIDDD+HMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDW+DGRNEADITDARTLASNSW M      VDYLVDCHYS+EG+RLWVLGGTNDGTV
Subjt:  DIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTV

Query:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
        GYFPVDH KGKNAIESP+VVLEGGH+G+VRSVLP+TN+LGGFS+SQGVFGWTGGEDGRLCCW SDDS ET RSWISSTLVIKSP  RRKHRHHPY
Subjt:  GYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY

A0A6J1KH06 WD repeat-containing protein 89 homolog; e value 6.3e-211; identity 89.14%
Query:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPS-TPHVLHS
        ME+IDMDVEE  NAD+S +SSSFKRFGLKN+IQTNFGDDYVFHIAP+ DWTSMAVSLSSNVVKLYSPVTGQYYGECRGH GTINQISFS+PS TPHVLHS
Subjt:  MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPS-TPHVLHS

Query:  CSSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTN
        CSSDGTIRSWDVR   ++SSISAGPSQEIFSFAYGGSS NLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQ KLASASVDGLVCIFDTN
Subjt:  CSSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTN

Query:  GDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGT
        GDIDDD+HMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDW+DGRNEADIT+ARTLASNSW M H    VDYLVDCHYS+EG+RLWVLGGTNDGT
Subjt:  GDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGT

Query:  VGYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
        VGYFPVDH KGKNAIESP+VVLEGGH+G+VRSVLP+TN  GGFS+SQGVFGWTGGEDGRLCCW SDDS ET RSWISSTLVIKSP  RRKHRHHPY
Subjt:  VGYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY

SwissProt top hits (e value, % identity, alignment)
Q54QU5 WD repeat-containing protein 89 homolog; e value 8.3e-35; identity 31.27%
Query:  NFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNEISSISAGPSQEIFSFAYG
        + GDD  + +  S     +A + S+ ++K+Y            GH   IN+  F      + L SCSSD T++ WD +      S +     EIFS    
Subjt:  NFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNEISSISAGPSQEIFSFAYG

Query:  GSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCL
        G   ++LA G  S ++ ++   +K +   + SH EDVT+V F P  + KL S SVDGL+C++D     DDDD +  VIN   S+G IGFFG  Y+ L+ L
Subjt:  GSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCL

Query:  THIETLSLWDWSDGRNEADI-TDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVGYFPVDHRKGKNAIESPNVV-----LEGGHVGV
        +H E L+ WD + G        D R+  S+ +K +     ++Y + C Y N  N+L + GG  +GT   F V          +P+ V     LE  H  V
Subjt:  THIETLSLWDWSDGRNEADI-TDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVGYFPVDHRKGKNAIESPNVV-----LEGGHVGV

Query:  VRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDS
        +R      NV     +S+ +   T  ED ++  W ++ S
Subjt:  VRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDS

Q5FVP5 WD repeat-containing protein 89; e value 6.8e-29; identity 28.22%
Query:  PSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNE-ISSISAGPSQEIFSFAYGGSSTNLLAAG
        P+E    +AV  S+  +++Y   T     E  G  G +N + F+  ++   ++S S+DGT++ WD R  +E  + +  G    IF         +++ AG
Subjt:  PSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNE-ISSISAGPSQEIFSFAYGGSSTNLLAAG

Query:  CK-----SQILFWDWR-------NRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLW
         +     + ++FWD R        R  +    ++H +D+TQV F P +   + S S DGLV +FD + D +++D + +  N  +SV  IG+ G +Y++++
Subjt:  CK-----SQILFWDWR-------NRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLW

Query:  CLTHIETLSLWDWSDGRNEADIT-----DARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVGYFPVDHRKGKNAIESPNVVLEGGHVG
        C+TH E    WD +    +  IT     D R +     K  H+DYL+  L    Y    +RL+V+GGTN G +       + G + + S    L+GGH  
Subjt:  CLTHIETLSLWDWSDGRNEADIT-----DARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVGYFPVDHRKGKNAIESPNVVLEGGHVG

Query:  VVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRH
         VRS          ++ S+     TGGED +L  W      +T+      +L I S   +R   H
Subjt:  VVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRH

Q944S2 WD repeat-containing protein GTS1; e value 1.3e-144; identity 62.53%
Query:  DMDVE-EHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPS--TPHVLHSCS
        +M+VE ++     S+ + + K+FGLKNSIQTNFG DYVF I P  DWT++AVSLS+N VKLYSPVTGQYYGEC+GHS T+NQI+FS  S  +PHVLHSCS
Subjt:  DMDVE-EHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPS--TPHVLHSCS

Query:  SDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGD
        SDGTIRSWD R+  ++S I  G  QEIFSF+YGG++ NLLA GCK Q+L WDWRN KQVACLE+SH++DVTQVHF+P    KL SASVDGL+C+F+T GD
Subjt:  SDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGD

Query:  IDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVG
        I+DDDH++SVINVGTS+GKIGF G+ Y+KLWCLTHIETLS+W+W DG  E ++  AR LAS+SW  D+    VDY VDCH    G  LWV+GGT  GTVG
Subjt:  IDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVG

Query:  YFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHETYRSWISSTLVIKSPSARRKHRHHPY
        YFPV++ K   +I +   +L GGH+ VVRSVL +    GG   + G+FGWTGGEDGRLCCW SD D+ E  RSW SS LV+K P  R+K+RH PY
Subjt:  YFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHETYRSWISSTLVIKSPSARRKHRHHPY

Q96FK6 WD repeat-containing protein 89; e value 2.9e-27; identity 27.82%
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNE--ISSISAGPSQEIFSFAYGGSSTNLLAAGCK----
        +AV  S+  +++Y         E  G+ G +N + F+  ++   ++S  +DGT++ WD R   E  +      PS    SF     + +++ AG +    
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNE--ISSISAGPSQEIFSFAYGGSSTNLLAAGCK----

Query:  -SQILFWDWRNRKQ--------VACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHI
         + ++FWD R   Q        +    ++H +DVTQV F P +   + S S DGLV +FD N D +++D + +  N  +SV  IG+ G+ Y++++C+TH 
Subjt:  -SQILFWDWRNRKQ--------VACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHI

Query:  ETLSLWDWSDGRNEADIT-----DARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVGYFPVDHRKGKNAIESPNVVLEGGHVGVVRSV
        E    WD +    +  +T     D R +   + K D +DYL+  L    Y  + + L V+GGTN G +         G   + S    L+GGH   VRS 
Subjt:  ETLSLWDWSDGRNEADIT-----DARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVGYFPVDHRKGKNAIESPNVVLEGGHVGVVRSV

Query:  LPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
            NV     Q   +   TGGED +L  W      +T+    S  +        R H +  Y
Subjt:  LPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY

Q9D0R9 WD repeat-containing protein 89; e value 1.5e-28; identity 28.41%
Query:  MAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNE--ISSISAGPSQEIFSFAYGGSSTNLLAAGCK----
        +AV  S+  +++Y   T     E  G  G ++ +SF+  ++   ++S S+DGT++ WD R  +E  +      PS    SF       +++ AG +    
Subjt:  MAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNE--ISSISAGPSQEIFSFAYGGSSTNLLAAGCK----

Query:  -SQILFWDWR-------NRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIE
         + ++FWD R        R  +    ++H +D+TQV F P +   + S S DGLV +FD + D  ++D + +  N  +SV  IG+ G++Y++++C+TH E
Subjt:  -SQILFWDWR-------NRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIE

Query:  TLSLWDWSDGRNEADIT-----DARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVGYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVL
            WD +    +  IT     D R +     K  H+DYL+  L    Y  + +RL+V+GGTN G +         G   + S    L GGH   VRS  
Subjt:  TLSLWDWSDGRNEADIT-----DARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVGYFPVDHRKGKNAIESPNVVLEGGHVGVVRSVL

Query:  PVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRH
                ++ S+     TGGED +L  W      +T+      +L I S   +R   H
Subjt:  PVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRH

Arabidopsis top hits (e value, % identity, alignment)
AT2G16780.1 Transducin family protein / WD-40 repeat family protein; e value 4.9e-06; identity 22.96%
Query:  GHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLA-AGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHF
        GH   I  +S+ +    ++  S   DG +  WD R  N++        +E+   ++   +  +LA A   S +  +D R       +  SH  +V QV +
Subjt:  GHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLA-AGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHF

Query:  IPGHQGKLASASVDGLVCIFDTNG--------DID-DDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHI---ETLSLWDWSDG--RNEADITDAR
         P H+  LAS+  D  + ++D N         ++D +D   + + + G    KI  F  N  + W +  +    +L +W  ++   R+E D  D +
Subjt:  IPGHQGKLASASVDGLVCIFDTNG--------DID-DDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHI---ETLSLWDWSDG--RNEADITDAR

AT2G47410.1 WD40/YVTN repeat-like-containing domain;Bromodomain; e value 1.6e-04; identity 24.49%
Query:  IAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSV-PSTPHVLHSCSSDGTIRSWDVR--------------ALNEISSISAGPSQEI
        +A S +   +A + +  V++++    G      RGH+G +  I+FS   ++ + L S S DGT R WD R                N  S+ +A  S +I
Subjt:  IAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSV-PSTPHVLHSCSSDGTIRSWDVR--------------ALNEISSISAGPSQEI

Query:  FSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVH
           AY  + T  +     S    W           + +H  DV + H
Subjt:  FSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVH

AT2G47410.2 WD40/YVTN repeat-like-containing domain;Bromodomain; e value 1.6e-04; identity 24.49%
Query:  IAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSV-PSTPHVLHSCSSDGTIRSWDVR--------------ALNEISSISAGPSQEI
        +A S +   +A + +  V++++    G      RGH+G +  I+FS   ++ + L S S DGT R WD R                N  S+ +A  S +I
Subjt:  IAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSV-PSTPHVLHSCSSDGTIRSWDVR--------------ALNEISSISAGPSQEI

Query:  FSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVH
           AY  + T  +     S    W           + +H  DV + H
Subjt:  FSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVH

AT2G47790.1 Transducin/WD40 repeat-like superfamily protein; e value 9.1e-146; identity 62.53%
Query:  DMDVE-EHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPS--TPHVLHSCS
        +M+VE ++     S+ + + K+FGLKNSIQTNFG DYVF I P  DWT++AVSLS+N VKLYSPVTGQYYGEC+GHS T+NQI+FS  S  +PHVLHSCS
Subjt:  DMDVE-EHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPS--TPHVLHSCS

Query:  SDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGD
        SDGTIRSWD R+  ++S I  G  QEIFSF+YGG++ NLLA GCK Q+L WDWRN KQVACLE+SH++DVTQVHF+P    KL SASVDGL+C+F+T GD
Subjt:  SDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGD

Query:  IDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVG
        I+DDDH++SVINVGTS+GKIGF G+ Y+KLWCLTHIETLS+W+W DG  E ++  AR LAS+SW  D+    VDY VDCH    G  LWV+GGT  GTVG
Subjt:  IDDDDHMDSVINVGTSVGKIGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVG

Query:  YFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHETYRSWISSTLVIKSPSARRKHRHHPY
        YFPV++ K   +I +   +L GGH+ VVRSVL +    GG   + G+FGWTGGEDGRLCCW SD D+ E  RSW SS LV+K P  R+K+RH PY
Subjt:  YFPVDHRKGKNAIESPNVVLEGGHVGVVRSVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSD-DSHETYRSWISSTLVIKSPSARRKHRHHPY

AT5G52820.1 WD-40 repeat family protein / notchless protein, putative; e value 2.9e-06; identity 26.79%
Query:  VFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNL
        V H+  S D   +A +     V+L++ +TGQ+    RGH G + Q+S+S  S   +L S S D T++ W++R   ++     G + E+F+  +      +
Subjt:  VFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWDVRALNEISSISAGPSQEIFSFAYGGSSTNL

Query:  LAAGCKSQILFW
        ++ G    +  W
Subjt:  LAAGCKSQILFW


Sequences
CDS sequence
ATGGAAGCCATTGACATGGATGTGGAGGAGCACGCTAATGCCGATACGAGTACAAATTCCAGCTCCTTCAAGCGCTTCGGACTCAAGAATTCCATTCAGACCAACTTCGG
CGATGACTACGTTTTTCACATCGCCCCTAGTGAGGATTGGACGTCAATGGCGGTGTCGTTATCTTCGAACGTAGTGAAGCTGTACTCGCCTGTGACTGGTCAGTACTATG
GAGAGTGCAGAGGCCACAGTGGAACCATCAATCAGATTTCCTTCTCTGTGCCGTCAACCCCACATGTATTGCATTCTTGTTCTTCTGATGGAACTATCAGATCTTGGGAC
GTGCGAGCTCTTAATGAGATTTCTTCCATCAGTGCTGGCCCTTCTCAAGAGATCTTCAGCTTCGCCTATGGAGGATCAAGTACGAATCTTCTTGCTGCTGGTTGTAAATC
TCAGATACTCTTCTGGGACTGGAGGAACAGAAAGCAAGTCGCATGCCTAGAGGACTCTCATGTGGAAGATGTTACTCAGGTTCACTTTATTCCTGGCCATCAAGGCAAGC
TTGCTTCTGCTTCCGTGGATGGGTTGGTTTGTATATTTGACACCAATGGGGATATTGATGATGACGACCATATGGATTCTGTGATTAACGTGGGAACTTCAGTCGGTAAG
ATTGGGTTTTTTGGAGAGAATTATAGAAAGTTGTGGTGTTTGACTCACATTGAAACCTTGAGCTTATGGGACTGGTCAGATGGGAGAAATGAAGCAGATATCACGGATGC
TCGCACACTAGCTTCCAACAGTTGGAAAATGGATCATGTTGATTATTTGGTTGATTATTTGGTTGATTGTCACTACTCGAATGAAGGCAATCGACTGTGGGTTCTTGGAG
GTACCAATGATGGCACTGTAGGCTACTTCCCAGTCGATCACCGTAAGGGGAAGAATGCAATCGAATCACCCAACGTTGTGCTTGAGGGTGGCCATGTTGGTGTTGTTAGA
AGTGTCTTGCCCGTGACAAATGTATTAGGTGGTTTTTCACAGAGTCAAGGTGTGTTTGGATGGACAGGTGGAGAAGATGGACGTTTATGTTGTTGGTGTTCGGATGATTC
TCACGAAACTTATCGATCCTGGATTTCGAGCACTCTTGTTATCAAGTCACCCAGTGCTCGAAGGAAACATAGACATCATCCTTAC
mRNA sequence
ATGGAAGCCATTGACATGGATGTGGAGGAGCACGCTAATGCCGATACGAGTACAAATTCCAGCTCCTTCAAGCGCTTCGGACTCAAGAATTCCATTCAGACCAACTTCGG
CGATGACTACGTTTTTCACATCGCCCCTAGTGAGGATTGGACGTCAATGGCGGTGTCGTTATCTTCGAACGTAGTGAAGCTGTACTCGCCTGTGACTGGTCAGTACTATG
GAGAGTGCAGAGGCCACAGTGGAACCATCAATCAGATTTCCTTCTCTGTGCCGTCAACCCCACATGTATTGCATTCTTGTTCTTCTGATGGAACTATCAGATCTTGGGAC
GTGCGAGCTCTTAATGAGATTTCTTCCATCAGTGCTGGCCCTTCTCAAGAGATCTTCAGCTTCGCCTATGGAGGATCAAGTACGAATCTTCTTGCTGCTGGTTGTAAATC
TCAGATACTCTTCTGGGACTGGAGGAACAGAAAGCAAGTCGCATGCCTAGAGGACTCTCATGTGGAAGATGTTACTCAGGTTCACTTTATTCCTGGCCATCAAGGCAAGC
TTGCTTCTGCTTCCGTGGATGGGTTGGTTTGTATATTTGACACCAATGGGGATATTGATGATGACGACCATATGGATTCTGTGATTAACGTGGGAACTTCAGTCGGTAAG
ATTGGGTTTTTTGGAGAGAATTATAGAAAGTTGTGGTGTTTGACTCACATTGAAACCTTGAGCTTATGGGACTGGTCAGATGGGAGAAATGAAGCAGATATCACGGATGC
TCGCACACTAGCTTCCAACAGTTGGAAAATGGATCATGTTGATTATTTGGTTGATTATTTGGTTGATTGTCACTACTCGAATGAAGGCAATCGACTGTGGGTTCTTGGAG
GTACCAATGATGGCACTGTAGGCTACTTCCCAGTCGATCACCGTAAGGGGAAGAATGCAATCGAATCACCCAACGTTGTGCTTGAGGGTGGCCATGTTGGTGTTGTTAGA
AGTGTCTTGCCCGTGACAAATGTATTAGGTGGTTTTTCACAGAGTCAAGGTGTGTTTGGATGGACAGGTGGAGAAGATGGACGTTTATGTTGTTGGTGTTCGGATGATTC
TCACGAAACTTATCGATCCTGGATTTCGAGCACTCTTGTTATCAAGTCACCCAGTGCTCGAAGGAAACATAGACATCATCCTTAC
Protein sequence
MEAIDMDVEEHANADTSTNSSSFKRFGLKNSIQTNFGDDYVFHIAPSEDWTSMAVSLSSNVVKLYSPVTGQYYGECRGHSGTINQISFSVPSTPHVLHSCSSDGTIRSWD
VRALNEISSISAGPSQEIFSFAYGGSSTNLLAAGCKSQILFWDWRNRKQVACLEDSHVEDVTQVHFIPGHQGKLASASVDGLVCIFDTNGDIDDDDHMDSVINVGTSVGK
IGFFGENYRKLWCLTHIETLSLWDWSDGRNEADITDARTLASNSWKMDHVDYLVDYLVDCHYSNEGNRLWVLGGTNDGTVGYFPVDHRKGKNAIESPNVVLEGGHVGVVR
SVLPVTNVLGGFSQSQGVFGWTGGEDGRLCCWCSDDSHETYRSWISSTLVIKSPSARRKHRHHPY
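
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first 33 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), written in the
# conventional T, C, A, G base order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, ignoring a trailing partial codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 33 nucleotides of the CDS and first 11 residues of the protein above.
cds_prefix = "ATGGAAGCCATTGACATGGATGTGGAGGAGCAC"
protein_prefix = "MEAIDMDVEEH"
print(translate(cds_prefix) == protein_prefix)
```

Passing the full CDS (with its line breaks) to `translate` should let the result be compared against the complete protein sequence in the same way.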