; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G10280 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G10280
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein O-glucosyltransferase 1-like
Genome locationClcChr05:8257512..8262250
RNA-Seq ExpressionClc05G10280
SyntenyClc05G10280
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR006598 - Glycosyl transferase CAP10 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033653.1 O-glucosyltransferase rumi-like protein [Cucumis melo var. makuwa]6.9e-26382.44Show/hide
Query:  AGSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH-RRRQV
        + SF  RFSHYAS +   F+  + S    SL  L  SL   +    S+    SS       T AYNL   TIKGSGKSQ YP +TSE+   PNH RRRQV
Subjt:  AGSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH-RRRQV

Query:  EFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWG
        EF L CTSFNNITGGACPANYPTNWT +E   RPS+ TCP+YFRWIHEDLRPWARTGI+RA +EAG+RTANFRLVILNGKAYVETYKKSFQTRDTFTVWG
Subjt:  EFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWG

Query:  ILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG
        ILQLLRRYPGKV DLDLMFDCVDWPVIL+SHFSGP+GPTPPPLFRYCGDD TLDIVFPDWSFWGWPEINIKPWE LL DLKEGNK+I WK RE YAYWKG
Subjt:  ILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG

Query:  NPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVK
        NPEVA+TRKDLLKCNVSDQQDWNARVFAQDW KESQ+GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVK
Subjt:  NPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVK

Query:  DGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSP
        D DKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKP +P  AIELCSEAMACPAEGL KKFM +SLV RPA+S+P
Subjt:  DGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSP

Query:  CTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP
        CTMPPPYDPASLHFVLRRKENSIKQVEKWET+FWNTQSKQP
Subjt:  CTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP

XP_004140839.1 protein O-glucosyltransferase 1 [Cucumis sativus]2.1e-26482.84Show/hide
Query:  GSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH--RRRQV
        GSF+ RFSHYA      F+  + S    SL  L  SL   +    S+    SS       T AYNL   TIKGSGKSQ YP NTS++   PNH  RR QV
Subjt:  GSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH--RRRQV

Query:  EFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAAT-CPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVW
        EFTLHC SFNNIT GACPA+YPTNWT +ED   PS+++ CPDYFRWIHEDLRPWARTGITRATLEAG+RTANFRL+ILNGKAYVETYKKSFQTRDTFTVW
Subjt:  EFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAAT-CPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVW

Query:  GILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWK
        GILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDAT DIVFPDWSFWGWPEINIKPWE LL D+KEGNK+IPWK RE YAYWK
Subjt:  GILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWK

Query:  GNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV
        GNPEVA+TRKDL+KCNVSDQQDWNARVFAQDW KESQ+GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV
Subjt:  GNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV

Query:  KDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSS
        KD DKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKP +P NAIELCSEAMACPAEGL KKFM +SLV RPA+S+
Subjt:  KDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSS

Query:  PCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP
        PCTMPPPYDPASLHFVL RKENSIKQVEKWET+FWNTQSKQP
Subjt:  PCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP

XP_008439228.1 PREDICTED: O-glucosyltransferase rumi homolog [Cucumis melo]4.9e-26182.1Show/hide
Query:  AGSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH--RRRQ
        + SF  RFSHYAS +   F+  + S    SL  L  SL   +    S+    SS       T AYNL   TIKGSGKSQ YP +TSE+   PNH  RRRQ
Subjt:  AGSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH--RRRQ

Query:  VEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVW
        VEF L CTSFNNITGGACPANYPTN T +E   RPS+ TCP+YFRWIHEDLRPWARTGI+RA +EAG+RTANFRLVILNGKAYVETYKKSFQTRDTFTVW
Subjt:  VEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVW

Query:  GILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWK
        GILQLLRRYPGKV DLDLMFDCVDWPVIL+SHFSGP+GPTPPPLFRYCGDD TLDIVFPDWSFWGWPEINIKPWE LL DLKEGNK+I WK RE YAYWK
Subjt:  GILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWK

Query:  GNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV
        GNPEVA+TRKDLLKCNVSDQQDWNARVFAQDW KESQ+GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV
Subjt:  GNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV

Query:  KDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSS
        KD DKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKP +P  AIELCSEAMACPAEGL KKFM +SLV RPA+S+
Subjt:  KDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSS

Query:  PCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP
        PCTMPPPYDPASLHFVLRRKENSIKQVEKWET+FWNT+SKQP
Subjt:  PCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP

XP_023552265.1 protein O-glucosyltransferase 1-like isoform X2 [Cucurbita pepo subsp. pepo]1.3e-25385.12Show/hide
Query:  AYNLPGNTIKGSGKSQSYPYNTSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLE
        A+NL GN IKGS K  SYP ++SEI ++P HR+RQV+F L CTSFNN+T GACPA YPT WTVEEDP  P ++TCP+YFRWIHEDLRPWA+TGITRA+LE
Subjt:  AYNLPGNTIKGSGKSQSYPYNTSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLE

Query:  AGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWG
        A K+TANFRLVI+NG AYVETY+KSFQTRDTFT+WGILQLLRRYPGKVPDL++MFDCVDWPVILT++FS PNGP PPPLFRYCG+DATLD+VFPDWSFWG
Subjt:  AGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWG

Query:  WPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVS
        W EINIKPWE LL DLKEGNK+ PWK REAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYK+SDLANQCLHRYKIYIEGSAWSVS
Subjt:  WPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVS

Query:  EKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIP
        EKYILACDSV LIVKPHYYDFFTRGLMP+HHYWPVKD DKCKSIKFAVDWGNSHKQKA+ IGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKP IP
Subjt:  EKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIP

Query:  RNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP
         NAIELCSEAMACPAEGL KKFMM+SLV  PADS PC MPPPYDPASLH VLRRKE+SIKQVE+WE  FW+ QS+QP
Subjt:  RNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP

XP_038898817.1 protein O-glucosyltransferase 1-like [Benincasa hispida]1.4e-27986.95Show/hide
Query:  MRDAGSFQQRFSHYASSNSCYF---RIIVCSSHFSSLQPLSLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNHRR
        MR+AGSFQQRFSHYASSNS YF   R++       S    SL     S    +    S R     T A+NL GNTIKGS K+Q YP NTS+I   PNH R
Subjt:  MRDAGSFQQRFSHYASSNSCYF---RIIVCSSHFSSLQPLSLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNHRR

Query:  RQVEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFT
        RQVEFTL CTSFNNIT G CP NYPT WTVEED +RPS+ATCPD+FRWIHEDL PWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFT
Subjt:  RQVEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFT

Query:  VWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAY
        VWGILQLLRRYPGKVPDL+LMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALL DLKEGNK+IPWKRREAYAY
Subjt:  VWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAY

Query:  WKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYW
        WKGNPEVAETRKDLLKCNVSDQQDWN RVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYW
Subjt:  WKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYW

Query:  PVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPAD
        PVKD DKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAI+LCSEAMACPAEGL KKFMMDSLV RPAD
Subjt:  PVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPAD

Query:  SSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP
        SSPC MPPPYDPASLHFVL RKENSIKQVEKWET+FWNTQSKQP
Subjt:  SSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP

TrEMBL top hitse value%identityAlignment
A0A0A0L5W3 CAP10 domain-containing protein1.0e-26482.84Show/hide
Query:  GSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH--RRRQV
        GSF+ RFSHYA      F+  + S    SL  L  SL   +    S+    SS       T AYNL   TIKGSGKSQ YP NTS++   PNH  RR QV
Subjt:  GSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH--RRRQV

Query:  EFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAAT-CPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVW
        EFTLHC SFNNIT GACPA+YPTNWT +ED   PS+++ CPDYFRWIHEDLRPWARTGITRATLEAG+RTANFRL+ILNGKAYVETYKKSFQTRDTFTVW
Subjt:  EFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAAT-CPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVW

Query:  GILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWK
        GILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDAT DIVFPDWSFWGWPEINIKPWE LL D+KEGNK+IPWK RE YAYWK
Subjt:  GILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWK

Query:  GNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV
        GNPEVA+TRKDL+KCNVSDQQDWNARVFAQDW KESQ+GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV
Subjt:  GNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV

Query:  KDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSS
        KD DKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKP +P NAIELCSEAMACPAEGL KKFM +SLV RPA+S+
Subjt:  KDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSS

Query:  PCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP
        PCTMPPPYDPASLHFVL RKENSIKQVEKWET+FWNTQSKQP
Subjt:  PCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP

A0A1S3AYX8 O-glucosyltransferase rumi homolog2.4e-26182.1Show/hide
Query:  AGSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH--RRRQ
        + SF  RFSHYAS +   F+  + S    SL  L  SL   +    S+    SS       T AYNL   TIKGSGKSQ YP +TSE+   PNH  RRRQ
Subjt:  AGSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH--RRRQ

Query:  VEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVW
        VEF L CTSFNNITGGACPANYPTN T +E   RPS+ TCP+YFRWIHEDLRPWARTGI+RA +EAG+RTANFRLVILNGKAYVETYKKSFQTRDTFTVW
Subjt:  VEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVW

Query:  GILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWK
        GILQLLRRYPGKV DLDLMFDCVDWPVIL+SHFSGP+GPTPPPLFRYCGDD TLDIVFPDWSFWGWPEINIKPWE LL DLKEGNK+I WK RE YAYWK
Subjt:  GILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWK

Query:  GNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV
        GNPEVA+TRKDLLKCNVSDQQDWNARVFAQDW KESQ+GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV
Subjt:  GNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPV

Query:  KDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSS
        KD DKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKP +P  AIELCSEAMACPAEGL KKFM +SLV RPA+S+
Subjt:  KDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSS

Query:  PCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP
        PCTMPPPYDPASLHFVLRRKENSIKQVEKWET+FWNT+SKQP
Subjt:  PCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP

A0A5D3DHB1 O-glucosyltransferase rumi-like protein3.3e-26382.44Show/hide
Query:  AGSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH-RRRQV
        + SF  RFSHYAS +   F+  + S    SL  L  SL   +    S+    SS       T AYNL   TIKGSGKSQ YP +TSE+   PNH RRRQV
Subjt:  AGSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPL--SLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNH-RRRQV

Query:  EFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWG
        EF L CTSFNNITGGACPANYPTNWT +E   RPS+ TCP+YFRWIHEDLRPWARTGI+RA +EAG+RTANFRLVILNGKAYVETYKKSFQTRDTFTVWG
Subjt:  EFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWG

Query:  ILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG
        ILQLLRRYPGKV DLDLMFDCVDWPVIL+SHFSGP+GPTPPPLFRYCGDD TLDIVFPDWSFWGWPEINIKPWE LL DLKEGNK+I WK RE YAYWKG
Subjt:  ILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG

Query:  NPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVK
        NPEVA+TRKDLLKCNVSDQQDWNARVFAQDW KESQ+GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVK
Subjt:  NPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVK

Query:  DGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSP
        D DKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKP +P  AIELCSEAMACPAEGL KKFM +SLV RPA+S+P
Subjt:  DGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSP

Query:  CTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP
        CTMPPPYDPASLHFVLRRKENSIKQVEKWET+FWNTQSKQP
Subjt:  CTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP

A0A6J1E476 protein O-glucosyltransferase 1-like isoform X21.1e-25385.32Show/hide
Query:  AYNLPGNTIKGSGKSQSYPYNTSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLE
        A+NL GN IKGS K  SYP +TSEI ++P HR+RQV+F L CTSFNN+T GACPA YPT WTVEEDP  P ++TCP+YFRWIHEDLRPWA+TGITRA+LE
Subjt:  AYNLPGNTIKGSGKSQSYPYNTSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLE

Query:  AGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWG
        A K+TANFRLVI+NG AYVETY+KSFQTRDTFT+WGILQLLRRYPGKVPDL+LMFDCVDWPVILT++FS PNGP+PPPLFRYCG+DATLD+VFPDWSFWG
Subjt:  AGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWG

Query:  WPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVS
        W EINIKPWE LL DLKEGNK+ PWK REAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYK+SDLANQCLHRYKIYIEGSAWSVS
Subjt:  WPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVS

Query:  EKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIP
        EKYILACDSV LIVKPHYYDFFTRGLMP+HHYWPVKD DKCKSIKFAVDWGNSHK KA+AIGKAASSFI EELKMDYVYDYMFHLLSEYSKLLTFKP IP
Subjt:  EKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIP

Query:  RNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP
         NAIELCSE MACPAEGL KKFMM+SLV  PADS PC MPPPYDPASLH VLRRKENSIKQVE+WE  FW+ QS+QP
Subjt:  RNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP

A0A6J1JDM1 protein O-glucosyltransferase 1-like isoform X22.6e-25285.53Show/hide
Query:  AYNLPGNTIKGSGKSQSYPYNTSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLE
        A+NL GN IKGS KS+SYP +TSEI ++P HR+RQV+F L CTSFNN T GACPA YPT WTVEEDP  P ++TCP+YFRWIHEDLRPWA+TGITRA+LE
Subjt:  AYNLPGNTIKGSGKSQSYPYNTSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLE

Query:  AGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWG
        A K+TANFRLVI+NG AYVETY+KSFQTRDTFT+WGILQLLRRYPGKVPDL+LMFDCVDWPVILT++FS PNGP  PPLFRYCG+DATLD+VFPDWSFWG
Subjt:  AGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWG

Query:  WPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVS
        W EINIKPWE LL DLKEGNK+ PWK REAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYK+SDLANQCLHRYKIYIEGSAWSVS
Subjt:  WPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVS

Query:  EKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIP
        EKYILACDSV LIVKPHYYDFFTRGLMP+HHYWPVKD DKCKSIKFAVDWGNSHKQKA+AIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKP IP
Subjt:  EKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIP

Query:  RNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP
          AI+LCSEAMACPAEGL KKFMM+SLV  PADS PCTMPPPYDPASLH VLRRKENSIKQVE+WE+  W+ QS+QP
Subjt:  RNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP

SwissProt top hitse value%identityAlignment
B0X1Q4 O-glucosyltransferase rumi homolog6.3e-2525.21Show/hide
Query:  AATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTF---TVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHF
        ++ C  +   +  DLRP+ R+GIT+  +E  +           G  Y     + F+ RD        G+   +R    K+PD++L+ +C DWP I + H+
Subjt:  AATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTF---TVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHF

Query:  SGPNGPTPPPLFRYCGDDATLDIVFPDWSFW-GWPEINIKP-----WEALLNDLKEGNKKIPWKRREAYAYWKGNPE---------VAETRKDLLKCNVS
        +    P P   F    D   LDI++P W FW G P I++ P     W+     +++  K  PW+++   A+++G+           ++  R +L+    +
Subjt:  SGPNGPTPPPLFRYCGDDATLDIVFPDWSFW-GWPEINIKP-----WEALLNDLKEGNKKIPWKRREAYAYWKGNPE---------VAETRKDLLKCNVS

Query:  DQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNS
          Q W +    +D +    +  ++  L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PV  G     ++  + +   
Subjt:  DQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNS

Query:  HKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIEL
        H Q AQ I       I   L+M+ V  Y   LL  Y KL+ ++ K     +E+
Subjt:  HKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIEL

G3V9D0 Protein O-glucosyltransferase 14.5e-2326.78Show/hide
Query:  EDPERPSAATCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVI
        E+ E  S+  C  Y   I EDL P+ R GI+R  + E  +R       I+  + + E     F +R +     IL+++RR    +PD++++ +  D+P +
Subjt:  EDPERPSAATCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVI

Query:  LTSHFSGPNGPTPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG-------NPEVAETRKD--
               P    P  P+F +       DI++P W+FW      WP     +  W+    DL     + PW+++ + AY++G       +P +  +RK+  
Subjt:  LTSHFSGPNGPTPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG-------NPEVAETRKD--

Query:  LLKCNVSDQQDWNARVFAQDWMKES--QQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSI
        L+    +  Q W +       MK++  +   K   L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PVK       +
Subjt:  LLKCNVSDQQDWNARVFAQDWMKES--QQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSI

Query:  KFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF
        +  + +  ++   AQ I K  S FI   L+MD +  Y  +LL+EYSK L++
Subjt:  KFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF

Q5E9Q1 Protein O-glucosyltransferase 11.8e-2427.07Show/hide
Query:  EDPERPSAATCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVI
        E+ E  S+  C  Y   I EDL P+ R GI+R  + E  +R       I+  + Y E+    F +R +     IL+++    G++PD++++ +  D+P +
Subjt:  EDPERPSAATCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVI

Query:  LTSHFSGPNGPTPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG-------NPEVAETRKD--
               P    P  P+F +       DI++P W+FW      WP   + +  W+    DL     + PWK++ + AY++G       +P +  +RK+  
Subjt:  LTSHFSGPNGPTPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG-------NPEVAETRKD--

Query:  LLKCNVSDQQDWNARVFAQDWMKES--QQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSI
        L+    +  Q W +       MK++  +   K   L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PVK      ++
Subjt:  LLKCNVSDQQDWNARVFAQDWMKES--QQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSI

Query:  KFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF
        +  + +  ++   AQ I +  S FI   LKMD +  Y  +LL+EYSK L++
Subjt:  KFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF

Q8BYB9 Protein O-glucosyltransferase 11.0e-2226.5Show/hide
Query:  EDPERPSAATCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVI
        E+ E  S+  C  Y   I EDL P+ R GI+R  + E  +R       I+  + + E     F +R +     IL+++ R    +PD++++ +  D+P +
Subjt:  EDPERPSAATCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVI

Query:  LTSHFSGPNGPTPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG-------NPEVAETRKD--
               P    P  P+F +       DI++P W+FW      WP     +  W+    DL     + PW+++ + AY++G       +P +  +RK+  
Subjt:  LTSHFSGPNGPTPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG-------NPEVAETRKD--

Query:  LLKCNVSDQQDWNARVFAQDWMKES--QQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSI
        L+    +  Q W +       MK++  +   K   L + C +RY     G A S   K++  C S+   V   + +FF   L P  HY PVK      ++
Subjt:  LLKCNVSDQQDWNARVFAQDWMKES--QQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSI

Query:  KFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF
        +  + +  ++   AQ I K  S FI   L+MD +  Y  +LL++YSK L++
Subjt:  KFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF

Q8NBL1 Protein O-glucosyltransferase 12.4e-2427.07Show/hide
Query:  EDPERPSAATCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVI
        E+ E  S+  C  Y   I EDL P+ R GI+R  + E  +R       I   + Y E     F +R +     IL+++    G++PD++++ +  D+P +
Subjt:  EDPERPSAATCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVI

Query:  LTSHFSGPNGPTPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG-------NPEVAETRKD--
               P    P  P+F +       DI++P W+FW      WP     +  W+    DL     + PWK++ + AY++G       +P +  +RK+  
Subjt:  LTSHFSGPNGPTPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEALLNDLKEGNKKIPWKRREAYAYWKG-------NPEVAETRKD--

Query:  LLKCNVSDQQDWNARVFAQDWMKES--QQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSI
        L+    +  Q W +       MK++  +   K   L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PVK      ++
Subjt:  LLKCNVSDQQDWNARVFAQDWMKES--QQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSI

Query:  KFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF
        +  + +  ++   AQ I +  S FI+  L+MD +  Y  +LLSEYSK L++
Subjt:  KFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF

Arabidopsis top hitse value%identityAlignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)1.4e-16560.18Show/hide
Query:  TLHCTSF-NNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGI
        ++ C+SF N    G+C     + +      +  S  +CPDYF+WIHEDL+PW  TGIT+  +E GK TA+FRLVILNGK +VE YKKS QTRD FT+WGI
Subjt:  TLHCTSF-NNITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGI

Query:  LQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGP---TPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYW
        LQLLR+YPGK+PD+DLMFDC D PVI +  ++  N      PPPLFRYCGD  T+DIVFPDWSFWGW EINI+ W  +L +++EG KK  +  R+AYAYW
Subjt:  LQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGP---TPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYW

Query:  KGNPEVAE-TRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYW
        KGNP VA  +R+DLL CN+S   DWNAR+F QDW+ E Q+G++ S++ANQC +RYKIYIEG AWSVSEKYILACDSVTL+VKP+YYDFF+R L P+ HYW
Subjt:  KGNPEVAE-TRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYW

Query:  PVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAE-----GLIKKFMMDSLV
        P++D DKC+SIKFAVDW N+H QKAQ IG+ AS F+Q +L M+ VYDYMFHLL+EYSKLL +KP++P+N++ELC+EA+ CP+E     G+ KKFM+ SLV
Subjt:  PVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAE-----GLIKKFMMDSLV

Query:  TRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFW
        +RP  S PC++PPP+D   L    R+K N I+QVEKWE ++W
Subjt:  TRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFW

AT2G45830.1 downstream target of AGL15 24.9e-15852.63Show/hide
Query:  SSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYN-----TSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPD
        ++S PA S  +A++ L  +  +    +   G      +      T+ I++ P   +R   F   C    N T    P N  +    +        +TCP 
Subjt:  SSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYN-----TSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPANYPTNWTVEEDPERPSAATCPD

Query:  YFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPP
        YFRWIHEDLRPW  TG+TR  LE  +RTA+FR+VIL+G+ YV+ Y+KS QTRD FT+WGI+QLLR YPG++PDL+LMFD  D P + +  F G   P PP
Subjt:  YFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPP

Query:  PLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQ
        PLFRYC DDA+LDIVFPDWSFWGW E+NIKPW+  L  ++EGNK   WK R AYAYW+GNP VA TR+DLL+CNVS Q+DWN R++ QDW +ES++G+K 
Subjt:  PLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQ

Query:  SDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDY
        S+L NQC HRYKIYIEG AWSVSEKYI+ACDS+TL V+P +YDF+ RG+MP+ HYWP++D  KC S+KFAV WGN+H  +A  IG+  S FI+EE+KM+Y
Subjt:  SDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDY

Query:  VYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWN
        VYDYMFHL++EY+KLL FKP+IP  A E+  + M C A G  + FM +S+V  P++ SPC MP P++P  L  +L RK N  +QVE WE  +++
Subjt:  VYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWN

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)3.7e-19865.04Show/hide
Query:  TIKGSGKSQSYPYNTSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPA-NYPTNW---TVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGK
        T + + +S  YP +T  I  KP       EFTL+C +F+    G CP  NYPT++     E + +R  +ATCPDYFRWIHEDLRPW +TGITR  LE   
Subjt:  TIKGSGKSQSYPYNTSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPA-NYPTNW---TVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGK

Query:  RTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPE
         TA FRL I+NG+ YVE ++++FQTRD FT+WG +QLLRRYPGK+PDL+LMFDCVDWPV+  + F+G + P PPPLFRYC +D TLDIVFPDWS+WGW E
Subjt:  RTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPE

Query:  INIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKY
        +NIKPWE+LL +L+EGN++  W  RE YAYWKGNP VAETR DL+KCN+S+  DW AR++ QDW+KES++GYKQSDLA+QC HRYKIYIEGSAWSVSEKY
Subjt:  INIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKY

Query:  ILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNA
        ILACDSVTL+VKPHYYDFFTRG+ P HHYWPVK+ DKC+SIKFAVDWGN H +KAQ IGK AS F+Q+ELKMDYVYDYMFHLL +YSKLL FKP+IP+N+
Subjt:  ILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNA

Query:  IELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSK
         ELCSEAMACP +G  +KFMM+SLV RPA++ PC MPPPYDPAS + VL+R++++  ++E+WE+ +W  Q+K
Subjt:  IELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSK

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821)1.9e-15760.34Show/hide
Query:  AATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGP
        ++TCP YFRWIHEDLRPW +TGITR  +E   RTA+FRLVI NGKAYV+ YKKS QTRD FT+WGILQLLR YPGK+PDL+LMFD  D PV+ +  F G 
Subjt:  AATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGP

Query:  NGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKES
            PPP+FRYC DDA+LDIVFPDWSFWGW E+N+KPW   L  +KEGN    WK R AYAYW+GNP V   R DLLKCN ++ ++WN R++ QDW KE+
Subjt:  NGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKES

Query:  QQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQE
        ++G+K S+L NQC HRYKIYIEG AWSVSEKYI+ACDS+TL VKP +YDF+ RG+MP+ HYWP++D  KC S+KFAV WGN+H+ KA+ IG+  S FI+E
Subjt:  QQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQE

Query:  ELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWN
        E+ M YVYDYMFHLL EY+ LL FKP+IP +A E+  ++M CPA    + F  +S++  P++ SPC M PPYDP +L  VL RK N  +QVE WE  ++ 
Subjt:  ELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWN

Query:  TQSKQP
          + +P
Subjt:  TQSKQP

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)2.1e-20167.61Show/hide
Query:  YPYNTSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPAN-YPTNWTVE-EDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNG
        YP  T+ I + P     + EFTLHC++  N T  +CP+N YPT  + E +D   P  ATCPDYFRWIHEDLRPW+RTGITR  LE  K+TA FRL I+ G
Subjt:  YPYNTSEIQRKPNHRRRQVEFTLHCTSFNNITGGACPAN-YPTNWTVE-EDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNG

Query:  KAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLND
        K YVE ++ +FQTRD FT+WG LQLLR+YPGK+PDL+LMFDCVDWPV+  + F+G N P+PPPLFRYCG++ TLDIVFPDWSFWGW E+NIKPWE+LL +
Subjt:  KAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLND

Query:  LKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVK
        L+EGN++  W  RE YAYWKGNP VAETR+DL+KCNVS++ +WNAR++AQDW+KES++GYKQSDLA+QC HRYKIYIEGSAWSVSEKYILACDSVTL+VK
Subjt:  LKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVK

Query:  PHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPA
        PHYYDFFTRGL+P HHYWPV++ DKC+SIKFAVDWGNSH QKAQ IGKAAS FIQ++LKMDYVYDYM+HLL+EYSKLL FKP+IPRNA+E+CSE MAC  
Subjt:  PHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPA

Query:  EGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSK
         G  +KFM +SLV +PADS PC MPPPYDPA+ + V++RK+++  ++ +WE  +W+ Q++
Subjt:  EGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGACGCCGGCAGTTTCCAGCAGAGGTTTTCACATTACGCCTCTTCTAACTCTTGCTATTTTCGGATCATCGTCTGTTCAAGCCATTTCTCAAGTCTCCAGCCACT
TTCTCTCTCTTCTTGTTCTTCTTCTCACTCTTCCTCCTCGCCGGCGTCTTCCTCGCCACGCGCCTCCATTCATCTGACTATGGCATATAATTTACCTGGAAACACAATAA
AAGGGAGTGGGAAATCCCAATCTTACCCCTACAACACTTCCGAAATCCAACGAAAGCCAAACCACCGACGACGACAAGTCGAATTCACACTCCATTGTACGTCTTTCAAT
AACATCACAGGAGGAGCCTGCCCTGCCAACTACCCGACCAATTGGACTGTGGAGGAAGATCCTGAGCGTCCATCGGCAGCTACGTGCCCCGATTACTTCCGTTGGATCCA
CGAGGACCTGAGGCCGTGGGCTCGGACGGGGATCACGAGGGCCACGCTGGAGGCTGGGAAACGGACGGCCAATTTCCGGCTGGTGATTCTGAATGGGAAGGCTTACGTGG
AGACTTATAAAAAGTCGTTTCAAACGAGAGATACTTTTACGGTATGGGGGATCCTACAGTTGTTACGGAGGTACCCCGGAAAAGTGCCTGATTTGGATCTGATGTTTGAT
TGCGTTGACTGGCCTGTCATTTTGACCAGCCATTTTAGTGGGCCTAATGGGCCGACCCCACCTCCTCTGTTTCGTTACTGTGGGGATGATGCCACGCTGGATATTGTTTT
TCCTGACTGGTCCTTCTGGGGATGGCCAGAGATCAATATAAAGCCATGGGAGGCATTGTTGAATGATCTAAAAGAAGGGAATAAAAAGATACCATGGAAGAGAAGAGAGG
CTTATGCATACTGGAAGGGAAATCCGGAGGTCGCCGAAACCCGAAAAGATCTACTCAAATGCAATGTCTCTGACCAACAAGACTGGAATGCTCGTGTATTCGCTCAGGAT
TGGATGAAAGAATCCCAGCAGGGATACAAGCAATCAGATCTTGCAAACCAATGTCTTCATAGATATAAAATCTATATAGAAGGATCGGCCTGGTCTGTTAGCGAAAAGTA
CATTCTTGCTTGTGACTCCGTTACCTTAATCGTAAAGCCCCATTACTACGACTTCTTCACGAGAGGTTTGATGCCAGTGCACCACTATTGGCCCGTAAAGGATGGCGACA
AGTGCAAGTCTATAAAATTTGCAGTTGATTGGGGCAACAGCCATAAGCAAAAGGCACAGGCCATTGGTAAAGCAGCTAGCAGTTTCATCCAAGAGGAGCTGAAGATGGAC
TATGTCTATGACTACATGTTTCATCTTCTAAGCGAATATTCCAAACTCCTTACTTTCAAGCCGAAGATACCGCGCAATGCGATCGAGCTTTGTTCTGAAGCCATGGCTTG
TCCAGCTGAAGGGCTCATCAAGAAATTCATGATGGATTCATTAGTGACGAGACCTGCAGATTCGAGCCCTTGCACGATGCCGCCCCCATATGATCCGGCATCGCTTCATT
TTGTTCTTCGTAGAAAAGAGAATTCAATCAAACAAGTAGAAAAATGGGAGACAAATTTCTGGAACACTCAAAGTAAGCAGCCATAG
mRNA sequenceShow/hide mRNA sequence
TTTTTCTCTAACAACAAAACTTCTATCTCTTTCCACACATATAAGTATAGCTAAAAAATATTTGTGTGGATTTTTGTTATTCTGTGTTGTGGTTTTTAGAGTGGTTATTA
GACACGTGTGCCAATCACGTCTGAGTAAGTGTCTAAAGTTGTAATTTTGAGGAAATTAAAGGGGGTATTTATTATGTTTACATAAAATTTCTTAATTCAATATATGATTC
CACACGATCCCATCTCAAAACCACACATTATATTCTCCATCTAAAAAATCAAAATTACACTCTCTGTCCTCGCCGGACGGCGGCGGCGGCGGGTCAGAAAAGGCAGAATG
TGGGAGTGGTGAAGGAAGAGTAAATAAATTTAAAAAAAAAAATATGAGAGACGCCGGCAGTTTCCAGCAGAGGTTTTCACATTACGCCTCTTCTAACTCTTGCTATTTTC
GGATCATCGTCTGTTCAAGCCATTTCTCAAGTCTCCAGCCACTTTCTCTCTCTTCTTGTTCTTCTTCTCACTCTTCCTCCTCGCCGGCGTCTTCCTCGCCACGCGCCTCC
ATTCATCTGACTATGGCATATAATTTACCTGGAAACACAATAAAAGGGAGTGGGAAATCCCAATCTTACCCCTACAACACTTCCGAAATCCAACGAAAGCCAAACCACCG
ACGACGACAAGTCGAATTCACACTCCATTGTACGTCTTTCAATAACATCACAGGAGGAGCCTGCCCTGCCAACTACCCGACCAATTGGACTGTGGAGGAAGATCCTGAGC
GTCCATCGGCAGCTACGTGCCCCGATTACTTCCGTTGGATCCACGAGGACCTGAGGCCGTGGGCTCGGACGGGGATCACGAGGGCCACGCTGGAGGCTGGGAAACGGACG
GCCAATTTCCGGCTGGTGATTCTGAATGGGAAGGCTTACGTGGAGACTTATAAAAAGTCGTTTCAAACGAGAGATACTTTTACGGTATGGGGGATCCTACAGTTGTTACG
GAGGTACCCCGGAAAAGTGCCTGATTTGGATCTGATGTTTGATTGCGTTGACTGGCCTGTCATTTTGACCAGCCATTTTAGTGGGCCTAATGGGCCGACCCCACCTCCTC
TGTTTCGTTACTGTGGGGATGATGCCACGCTGGATATTGTTTTTCCTGACTGGTCCTTCTGGGGATGGCCAGAGATCAATATAAAGCCATGGGAGGCATTGTTGAATGAT
CTAAAAGAAGGGAATAAAAAGATACCATGGAAGAGAAGAGAGGCTTATGCATACTGGAAGGGAAATCCGGAGGTCGCCGAAACCCGAAAAGATCTACTCAAATGCAATGT
CTCTGACCAACAAGACTGGAATGCTCGTGTATTCGCTCAGGATTGGATGAAAGAATCCCAGCAGGGATACAAGCAATCAGATCTTGCAAACCAATGTCTTCATAGATATA
AAATCTATATAGAAGGATCGGCCTGGTCTGTTAGCGAAAAGTACATTCTTGCTTGTGACTCCGTTACCTTAATCGTAAAGCCCCATTACTACGACTTCTTCACGAGAGGT
TTGATGCCAGTGCACCACTATTGGCCCGTAAAGGATGGCGACAAGTGCAAGTCTATAAAATTTGCAGTTGATTGGGGCAACAGCCATAAGCAAAAGGCACAGGCCATTGG
TAAAGCAGCTAGCAGTTTCATCCAAGAGGAGCTGAAGATGGACTATGTCTATGACTACATGTTTCATCTTCTAAGCGAATATTCCAAACTCCTTACTTTCAAGCCGAAGA
TACCGCGCAATGCGATCGAGCTTTGTTCTGAAGCCATGGCTTGTCCAGCTGAAGGGCTCATCAAGAAATTCATGATGGATTCATTAGTGACGAGACCTGCAGATTCGAGC
CCTTGCACGATGCCGCCCCCATATGATCCGGCATCGCTTCATTTTGTTCTTCGTAGAAAAGAGAATTCAATCAAACAAGTAGAAAAATGGGAGACAAATTTCTGGAACAC
TCAAAGTAAGCAGCCATAGAGAGAGACA
Protein sequenceShow/hide protein sequence
MRDAGSFQQRFSHYASSNSCYFRIIVCSSHFSSLQPLSLSSCSSSHSSSSPASSSPRASIHLTMAYNLPGNTIKGSGKSQSYPYNTSEIQRKPNHRRRQVEFTLHCTSFN
NITGGACPANYPTNWTVEEDPERPSAATCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFD
CVDWPVILTSHFSGPNGPTPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEALLNDLKEGNKKIPWKRREAYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQD
WMKESQQGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDGDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMD
YVYDYMFHLLSEYSKLLTFKPKIPRNAIELCSEAMACPAEGLIKKFMMDSLVTRPADSSPCTMPPPYDPASLHFVLRRKENSIKQVEKWETNFWNTQSKQP