; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024400 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024400
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionArabidopsis thaliana protein of unknown function (DUF821)
Genome locationtig00001291:2398149..2401804
RNA-Seq ExpressionSgr024400
SyntenySgr024400
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR006598 - Glycosyl transferase CAP10 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB1222531.1 O-glucosyltransferase rumi [Morella rubra]2.2e-21369.71Show/hide
Query:  LSRKPIWRPPLRKASVAVPAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQA
        ++ +  W+P L+K  VA P A+     L  + + ISS R  ++  S     N T   S K +   KKII+ +PLNC S GNQTQ   C  NYP  TTF  
Subjt:  LSRKPIWRPPLRKASVAVPAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQA

Query:  DDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVK
        DDLDPS  PVCP YFRWIH+DL PW+ATGITREMVE+AK TAHFRLV+V GKAY+E+YKK+IQTRDVFTIWGILQLLRRYPG IPDLELMFDC+D PV++
Subjt:  DDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVK

Query:  SSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQND
        S +YR  N       T GPPP+FRYCGD WT DIVFPDWSFWGWAEINIRPWE+LLK+LKEGN R KW++REP+AYWKGNP V +TR+DLLKCNLS   D
Subjt:  SSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQND

Query:  WNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQK
        WNARLYVQDWI ES+QGYK+S+LASQC HRYKIYIEGYAWSVSEKYILACDSVTL+V P FYDFFTR LQPVHHYWP+RDD KC SIKFAV WGN+HK+K
Subjt:  WNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQK

Query:  AQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNAN
        AQAIGKA+SDFIQ+ELKMD VYDYMFH+LN YAKLL+F P+IP GA E+CSETMAC   G EKKFM ES+VK PS+TSPC MPPPF+   L  LYRRN N
Subjt:  AQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNAN

Query:  LIRQVEKWEDQFWES
        +I+QV KWE+ +WES
Subjt:  LIRQVEKWEDQFWES

XP_004143920.1 O-glucosyltransferase rumi homolog isoform X1 [Cucumis sativus]7.8e-23576.22Show/hide
Query:  MQRF--STSIFRRPTALSRKPIWRPPLRK--ASVAVP-AAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNC--SST
        MQRF  STSIF      SR+PIW   LRK  A+VAVP AA+F LA     AVLISSTR Q TL    FLGNQTE       K+PKK I+ YPLNC  SST
Subjt:  MQRF--STSIFRRPTALSRKPIWRPPLRK--ASVAVP-AAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNC--SST

Query:  GNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRR
         NQTQ FTCRK+YP  T ++ + + PSGR VCP+YFRWIHEDL PW A GITREMVE+ K TAHFRL +V G  YVE YKK+IQTRD+FTIWGILQLLRR
Subjt:  GNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRR

Query:  YPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKG
        YPG IPDLELMFDCDDRPVVKS++YRN         T   PPVFRYCGD  T+DIVFPDWSFWGWAEINIRPWENLLKELK+GNE+ KW++RE FAYWKG
Subjt:  YPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKG

Query:  NPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVR
        NPYVADTRQDLLKCNLS QNDWNARLY+QDWIQESQQGYKQS LA+QC +RYKIYIEGY WSVSEKYILACDS+TLLVKPNFYDFF+RSL+P+HHYWP+ 
Subjt:  NPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVR

Query:  DDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSP
        DD KC SIKFAVHWGNSHKQKAQ IGK AS+FIQQEL+M+NVYDYMFHLLN YAKLLRF+PEIP GA EVCSETMACPRDG EKKFM ESMVKTPSLT P
Subjt:  DDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSP

Query:  CAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFW
        C+MPPPFD+PSLQRLYRRNANLI QVEKWE+ FW
Subjt:  CAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFW

XP_007209901.1 protein O-glucosyltransferase 1 isoform X2 [Prunus persica]3.1e-20767.31Show/hide
Query:  WRPPLRKASVAVPAA---VFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDL
        W P     S A+ AA   +  L      A ++ S   Q +   I    N+T   S K+Q+ P K+I+ +PLNCS   N  Q  TC  +YPT T    DDL
Subjt:  WRPPLRKASVAVPAA---VFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDL

Query:  DPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSE
        +PS  P+CPDYFR+IH+DL PW+ATGITR+MVERAK TAHFRLVIV GKAYVE+YKK+IQTRDVFTIWGILQLLRRYPG +PDLELMFDCDD+PV++S +
Subjt:  DPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSE

Query:  YRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNA
        +R  N  +        PP+FRYCGD WT DIVFPDWSFWGWAEINI+PWE LLK+LK+GN+R KW++REP+AYWKGNP+VA++R+DLLKCN+S   DWNA
Subjt:  YRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNA

Query:  RLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQA
        RL++QDWI ESQQG+KQS++ASQC HRYKIYIEGYAWSVSEKYILACDSVTL+VKP +YDFFTRSLQPVHHYWP+R D KC SIKFAV WGN+HKQKAQA
Subjt:  RLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQA

Query:  IGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIR
        IGKAASDFIQQELKMD VYDYMFHLLN+YAKLLRF P+IP GA  +CSE+MACP   SEKKFMTES+VK+PS+TSPC MPP F   +L  LYRRN NL +
Subjt:  IGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIR

Query:  QVEKWEDQFWESLNSKQ
        QV+KWED++WE+L+ ++
Subjt:  QVEKWEDQFWESLNSKQ

XP_034216879.1 protein O-glucosyltransferase 1-like [Prunus dulcis]1.2e-20667.25Show/hide
Query:  WRPPLRKASVAVPAA---VFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDL
        W P     S A+ AA   +  L      A ++ S   Q +   I    N+T   S K+Q+ P K+I+ +PLNCS   N  Q  TC  +YPT T    DDL
Subjt:  WRPPLRKASVAVPAA---VFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDL

Query:  DPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSE
        +PS  P+CPDYFR+IH+DL PW+ATGITR+MVERAK TAHFRLVIV GKAYVE+YKK+IQTRDVFTIWGILQLLRRYPG +PDLELMFDCDD+PV++S +
Subjt:  DPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSE

Query:  YRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNA
        +R  N  +        PP+FRYCGD WT DIVFPDWSFWGWAEINI+PWE LLK+LK+GN+R KW++REP+AYWKGNP+VA++R+DLLKCN+S   DWNA
Subjt:  YRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNA

Query:  RLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQA
        RL++QDWI ESQQG+KQS++ASQC HRYKIYIEGYAWSVSEKYILACDSVTL+VKP +YDFFTRSLQPVHHYWP+R D KC SIKFAV WGN+HKQKAQA
Subjt:  RLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQA

Query:  IGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIR
        IGKAASDFIQQELKMD VYDYMFHLLN+YAKLLRF P+IP GA  +CSE+MACP   SEKKFMTES+VK+PS+TSPC MPP +   +L  LYRRN NL +
Subjt:  IGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIR

Query:  QVEKWEDQFWESLNSK
        QV+KWED++WE+L+ +
Subjt:  QVEKWEDQFWESLNSK

XP_038875037.1 protein O-glucosyltransferase 1-like [Benincasa hispida]3.7e-24578.65Show/hide
Query:  MQRF--STSIFRRPTALSRKPIWRPPLRKASVAV---PAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNC--SST
        MQRF  ST IFR     SR+PIW+P L K SVAV    AAVFLLA L +VAV+ISS R QSTL S  FLGNQTE  S    K+PKK I+ YPL+C  SST
Subjt:  MQRF--STSIFRRPTALSRKPIWRPPLRKASVAV---PAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNC--SST

Query:  GNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRR
         NQTQ F CRKNYPT+  F+ +  D SGRP+CP+YFRWIHEDL PW A GITREMVE+ K TAHFRL +VGG+ YVE YKK+IQTRD+FTIWGILQLLR+
Subjt:  GNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRR

Query:  YPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKG
        YPG +PDLELMFDCDDRPVVKS++Y+          TAGPPP+FRYCGD  T+DIVFPDWSFWGW EINIRPWE LLKELK+GNE+ KW++RE FAYWKG
Subjt:  YPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKG

Query:  NPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVR
        NPYVADTRQDLLKCNLS QNDWNARLY+QDWI ESQQGYKQSNLASQC HRYKIYIEGYAWSVSEKYILACDSVTLL+KPNFYDFFTRSLQP+HHYWPV 
Subjt:  NPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVR

Query:  DDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSP
        DD KC SIKFAVHWGNSHKQKAQAIGKAASDFIQQ+LKM+NVYDYMFHLLNQYAKLLRFRP IP GA EVCSETMACPRDG EKKFM ESMVKTPSLT P
Subjt:  DDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSP

Query:  CAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFW
        C+MPPPFD+PSLQRLYRRNANLIRQVEKWED+FW
Subjt:  CAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFW

TrEMBL top hitse value%identityAlignment
A0A2N9J2X5 CAP10 domain-containing protein1.5e-21569.8Show/hide
Query:  WRPPLRKASVAVPAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDLDPS
        W+P L+K   A    +    TL  +   IS+ R  ++    Y   N T T   KHQ +P K I  +PLNC S GNQTQ  TC  NYP KT    +DLDPS
Subjt:  WRPPLRKASVAVPAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDLDPS

Query:  GRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRN
          PVCPDYFRWIHEDL PW+  GITR+MVE+AK++AHFRLVIV GKAY+E+YKK+IQTRD+FTIWGILQLLRRYPG +PD+ELMFDCDDRPV+KS++YR 
Subjt:  GRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRN

Query:  RNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLY
         N         GPPP+FRYCGD WTMDIVFPDWSFWGWAEINI+PWE+LLKELKEGN+RSKW++REP+AYWKGNP+VA+TR+DLLKCN+S + DWNARLY
Subjt:  RNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLY

Query:  VQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGK
        +QDWI ESQQGYK+S LASQC HRYKIYIEGYAWSVSEKYILACDSV+L+VKP +YDFFTRSL+PVHHYWP+RDD KC SIKFAV WGN+HKQKAQAIGK
Subjt:  VQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGK

Query:  AASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVE
        A+SDFIQ+ELKMD VYDYMFHLLN+YAKLL+F P+IP GA E+CSET AC  +G+EKKFM ES+VK PSLTSPC MPPP++   L  L RRN N+IRQVE
Subjt:  AASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVE

Query:  KWEDQFWESL
        KWE+++WE++
Subjt:  KWEDQFWESL

A0A5B7BT47 CAP10 domain-containing protein1.3e-20665.19Show/hide
Query:  MQRFSTSIFRRPTAL---------SRKPIWRPPLRKASVAVPAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCS
        MQRFS++I+RR T L         S   IWR PL+K        +FL   L     L+SS+   S+  S+  +  +  T  + +++  K+I    PLNCS
Subjt:  MQRFSTSIFRRPTAL---------SRKPIWRPPLRKASVAVPAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCS

Query:  STGNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLL
        + GN TQ  TC  NYP  T+F+ +D DPS    CPDYFRWIHEDL  + +TGITR+MVERAK TAHFR+VIV GK YVE+YKK+IQTRDVFT+WGILQLL
Subjt:  STGNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLL

Query:  RRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYW
        RRYPG +PDLELMFDC+DRPV++  +YR  N      ATA PPP+FRYCGD W +DIVFPDWSFWGWAEINI+PWE++LKE+KEGN R+KW++RE +AYW
Subjt:  RRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYW

Query:  KGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWP
        KGNP+VA+TR+DLLKCN+S + DWNARL+VQDWI ESQQGYKQS+LASQC +RYKIYIEGYAWSVSEKYILACDSVTLLVKP++YDFF RSLQPVHHYWP
Subjt:  KGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWP

Query:  VRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLT
        +RDD KC SIKFAV WGN+HKQKAQAIGKAA DFIQ+ELKMD VYDYMFHLLN+YAKLL+F P++P GA E CSETMACP DG EK+FM ES+VK PS+T
Subjt:  VRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLT

Query:  SPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFWESLNSKQ
        SPC +PPP+D  +L    +R  N I+QVE WE ++W+SLN +Q
Subjt:  SPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFWESLNSKQ

A0A5E4FRC2 PREDICTED: O-glucosyltransferase5.7e-20767.25Show/hide
Query:  WRPPLRKASVAVPAA---VFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDL
        W P     S A+ AA   +  L      A ++ S   Q +   I    N+T   S K+Q+ P K+I+ +PLNCS   N  Q  TC  +YPT T    DDL
Subjt:  WRPPLRKASVAVPAA---VFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDL

Query:  DPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSE
        +PS  P+CPDYFR+IH+DL PW+ATGITR+MVERAK TAHFRLVIV GKAYVE+YKK+IQTRDVFTIWGILQLLRRYPG +PDLELMFDCDD+PV++S +
Subjt:  DPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSE

Query:  YRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNA
        +R  N  +        PP+FRYCGD WT DIVFPDWSFWGWAEINI+PWE LLK+LK+GN+R KW++REP+AYWKGNP+VA++R+DLLKCN+S   DWNA
Subjt:  YRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNA

Query:  RLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQA
        RL++QDWI ESQQG+KQS++ASQC HRYKIYIEGYAWSVSEKYILACDSVTL+VKP +YDFFTRSLQPVHHYWP+R D KC SIKFAV WGN+HKQKAQA
Subjt:  RLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQA

Query:  IGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIR
        IGKAASDFIQQELKMD VYDYMFHLLN+YAKLLRF P+IP GA  +CSE+MACP   SEKKFMTES+VK+PS+TSPC MPP +   +L  LYRRN NL +
Subjt:  IGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIR

Query:  QVEKWEDQFWESLNSK
        QV+KWED++WE+L+ +
Subjt:  QVEKWEDQFWESLNSK

A0A6A1WB87 O-glucosyltransferase rumi1.1e-21369.71Show/hide
Query:  LSRKPIWRPPLRKASVAVPAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQA
        ++ +  W+P L+K  VA P A+     L  + + ISS R  ++  S     N T   S K +   KKII+ +PLNC S GNQTQ   C  NYP  TTF  
Subjt:  LSRKPIWRPPLRKASVAVPAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQA

Query:  DDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVK
        DDLDPS  PVCP YFRWIH+DL PW+ATGITREMVE+AK TAHFRLV+V GKAY+E+YKK+IQTRDVFTIWGILQLLRRYPG IPDLELMFDC+D PV++
Subjt:  DDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVK

Query:  SSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQND
        S +YR  N       T GPPP+FRYCGD WT DIVFPDWSFWGWAEINIRPWE+LLK+LKEGN R KW++REP+AYWKGNP V +TR+DLLKCNLS   D
Subjt:  SSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQND

Query:  WNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQK
        WNARLYVQDWI ES+QGYK+S+LASQC HRYKIYIEGYAWSVSEKYILACDSVTL+V P FYDFFTR LQPVHHYWP+RDD KC SIKFAV WGN+HK+K
Subjt:  WNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQK

Query:  AQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNAN
        AQAIGKA+SDFIQ+ELKMD VYDYMFH+LN YAKLL+F P+IP GA E+CSETMAC   G EKKFM ES+VK PS+TSPC MPPPF+   L  LYRRN N
Subjt:  AQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNAN

Query:  LIRQVEKWEDQFWES
        +I+QV KWE+ +WES
Subjt:  LIRQVEKWEDQFWES

M5WWD4 CAP10 domain-containing protein1.5e-20767.31Show/hide
Query:  WRPPLRKASVAVPAA---VFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDL
        W P     S A+ AA   +  L      A ++ S   Q +   I    N+T   S K+Q+ P K+I+ +PLNCS   N  Q  TC  +YPT T    DDL
Subjt:  WRPPLRKASVAVPAA---VFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDL

Query:  DPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSE
        +PS  P+CPDYFR+IH+DL PW+ATGITR+MVERAK TAHFRLVIV GKAYVE+YKK+IQTRDVFTIWGILQLLRRYPG +PDLELMFDCDD+PV++S +
Subjt:  DPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSE

Query:  YRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNA
        +R  N  +        PP+FRYCGD WT DIVFPDWSFWGWAEINI+PWE LLK+LK+GN+R KW++REP+AYWKGNP+VA++R+DLLKCN+S   DWNA
Subjt:  YRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNA

Query:  RLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQA
        RL++QDWI ESQQG+KQS++ASQC HRYKIYIEGYAWSVSEKYILACDSVTL+VKP +YDFFTRSLQPVHHYWP+R D KC SIKFAV WGN+HKQKAQA
Subjt:  RLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQA

Query:  IGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIR
        IGKAASDFIQQELKMD VYDYMFHLLN+YAKLLRF P+IP GA  +CSE+MACP   SEKKFMTES+VK+PS+TSPC MPP F   +L  LYRRN NL +
Subjt:  IGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIR

Query:  QVEKWEDQFWESLNSKQ
        QV+KWED++WE+L+ ++
Subjt:  QVEKWEDQFWESLNSKQ

SwissProt top hitse value%identityAlignment
A0NDG6 O-glucosyltransferase rumi homolog2.4e-2126.02Show/hide
Query:  DLGPWRATGITREMVERAKT-TAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGP
        DL P++A GIT+EM+ RAK    H++  ++G K Y  Q +     R      G+   +R     +PD++L+ +C D P +    +R+ ++ +        
Subjt:  DLGPWRATGITREMVERAKT-TAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGP

Query:  PPVFRYCGDMWTMDIVFPDWSFW-GWAEINIRP-----WENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQE
         PV  +      +DI++P W+FW G   I + P     W+   + + + +  + W  +EP A+++G+   +D R  L+  + +  +  +A+ Y ++   +
Subjt:  PPVFRYCGDMWTMDIVFPDWSFW-GWAEINIRP-----WENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQE

Query:  SQQG------YKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKA
        S Q        ++  L   C +R+     G A S   K++  C S+   V   + +FF  SL+P  HY PV        ++  + +   H Q A+AI + 
Subjt:  SQQG------YKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKA

Query:  ASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEV
          + I   L+M +V  Y   LL +Y KL+R+  E  +   EV
Subjt:  ASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEV

Q5E9Q1 Protein O-glucosyltransferase 13.2e-2124.65Show/hide
Query:  DDLDPSGRPVCPDYFRWIHEDLGPWRATGITREM---VERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRP
        ++ +P   P C  Y   I EDL P+R  GI+R+M   V R K   H++  I+  + Y E               G+   +    G +PD+E++ +  D P
Subjt:  DDLDPSGRPVCPDYFRWIHEDLGPWRATGITREM---VERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRP

Query:  VVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFW--GWAEINIRP-----WENLLKELKEGNERSKWLQREPFAYWKG-------NPYV
         V                     P+F +   +   DI++P W+FW  G A   I P     W+   ++L     +  W ++   AY++G       +P +
Subjt:  VVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFW--GWAEINIRP-----WENLLKELKEGNERSKWLQREPFAYWKG-------NPYV

Query:  ADTRQD--LLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDD
          +R++  L+    +    W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V   + +FF   L+P  HY PV+ D
Subjt:  ADTRQD--LLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDD

Query:  QKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRF
           ++++  + +  ++   AQ I +  S FI   LKMD++  Y  +LL +Y+K L +
Subjt:  QKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRF

Q6UW63 Protein O-glucosyltransferase 22.1e-2024.36Show/hide
Query:  CPDYFRWIHEDLGPWRATGITREMVE------RAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEY
        CP+    I  DL  + A    +  VE      + ++  H+ L     K Y++ + + +  R +F    +L L R+    +PD+EL  +  D P+ K    
Subjt:  CPDYFRWIHEDLGPWRATGITREMVE------RAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEY

Query:  RNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNAR
         N +            P+F +CG   + DIV P +     + +      +L     + N    W  +   A W+G     + R +L+K +  H    +A 
Subjt:  RNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNAR

Query:  LYVQDWIQESQQGY----KQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQK
             + +  +  Y    K  +      H+Y+I I+G   +    Y+L  DSV L     +Y+ F   LQP  HY PV+ +   + +   + W   H ++
Subjt:  LYVQDWIQESQQGY----KQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQK

Query:  AQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSET
        A+ I KA  +F +  L  D+++ Y F L  +YA L    P+I  G   V  +T
Subjt:  AQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSET

Q8BYB9 Protein O-glucosyltransferase 15.1e-1924.74Show/hide
Query:  LNCSSTGNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREM---VERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTI
        L C   G Q    +  K +  +     ++ +P     C  Y   I EDL P+R  GI+R+M   V R K   H++  I+  + + E       +R     
Subjt:  LNCSSTGNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREM---VERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTI

Query:  WGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFW--GWAEINIRP-----WENLLKELKEGN
          IL+++ R    +PD+E++ +  D P V           +    T    PVF +       DI++P W+FW  G A   + P     W+   ++L    
Subjt:  WGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFW--GWAEINIRP-----WENLLKELKEGN

Query:  ERSKWLQREPFAYWKG-------NPYVADTRQD--LLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVT
         +  W ++   AY++G       +P +  +R++  L+    +    W +   ++D +   +   K  +L   C +RY     G A S   K++  C S+ 
Subjt:  ERSKWLQREPFAYWKG-------NPYVADTRQD--LLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVT

Query:  LLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRF
          V   + +FF   L+P  HY PV+ D   ++++  + +  ++   AQ I K  S FI   L+MD++  Y  +LL  Y+K L +
Subjt:  LLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRF

Q8NBL1 Protein O-glucosyltransferase 16.0e-2023.88Show/hide
Query:  SSTGNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREM---VERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGI
        S+ G Q +  +  K +  +     ++ +P     C  Y   I EDL P+R  GI+R+M   V R K   H++  I   + Y E               G+
Subjt:  SSTGNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREM---VERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGI

Query:  LQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFW--GWAEINIRP-----WENLLKELKEGNERS
           +    G +PD+E++ +  D P V                     PVF +       DI++P W+FW  G A   I P     W+   ++L     + 
Subjt:  LQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFW--GWAEINIRP-----WENLLKELKEGNERS

Query:  KWLQREPFAYWKG-------NPYVADTRQD--LLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLV
         W ++   AY++G       +P +  +R++  L+    +    W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V
Subjt:  KWLQREPFAYWKG-------NPYVADTRQD--LLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLV

Query:  KPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRF
           + +FF   L+P  HY PV+ D   ++++  + +  ++   AQ I +  S FI+  L+MD++  Y  +LL++Y+K L +
Subjt:  KPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRF

Arabidopsis top hitse value%identityAlignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)3.9e-17660.13Show/hide
Query:  LNCSSTGNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGI
        ++CSS  NQ +  +C +   +       + + S    CPDYF+WIHEDL PWR TGIT+EMVER KTTAHFRLVI+ GK +VE YKK+IQTRD FT+WGI
Subjt:  LNCSSTGNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGI

Query:  LQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREP
        LQLLR+YPG +PD++LMFDCDDRPV++S  Y   NR    T    PPP+FRYCGD WT+DIVFPDWSFWGW EINIR W  +LKE++EG ++ K+++R+ 
Subjt:  LQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREP

Query:  FAYWKGNPYVAD-TRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPV
        +AYWKGNP+VA  +R+DLL CNLS  +DWNAR+++QDWI E Q+G++ SN+A+QC +RYKIYIEGYAWSVSEKYILACDSVTL+VKP +YDFF+R+LQP+
Subjt:  FAYWKGNPYVAD-TRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPV

Query:  HHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGS-----EKKFMT
         HYWP+RD  KC SIKFAV W N+H QKAQ IG+ AS+F+Q++L M+NVYDYMFHLLN+Y+KLL+++P++P  + E+C+E + CP +G      +KKFM 
Subjt:  HHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGS-----EKKFMT

Query:  ESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFWESL
         S+V  P  + PC++PPPFDS  L++ +R+  NLIRQVEKWED +W+ +
Subjt:  ESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFWESL

AT2G45830.1 downstream target of AGL15 21.1e-16253.33Show/hide
Query:  PAAVFLLATLTAVAVLISSTRFQSTLSSI---YFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDLDPSGR---PVCP
        PA     ATL  V  L  S      L       F G +  TTS +   +     Q +P  C    NQTQ+F      P   + + +D   S       CP
Subjt:  PAAVFLLATLTAVAVLISSTRFQSTLSSI---YFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDLDPSGR---PVCP

Query:  DYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRT
         YFRWIHEDL PW+ TG+TR M+E+A+ TAHFR+VI+ G+ YV++Y+K+IQTRDVFT+WGI+QLLR YPG +PDLELMFD DDRP V+S +++ +     
Subjt:  DYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRT

Query:  ATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQ
              PPP+FRYC D  ++DIVFPDWSFWGWAE+NI+PW+  L  ++EGN+ ++W  R  +AYW+GNP VA TR+DLL+CN+S Q DWN RLY+QDW +
Subjt:  ATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQ

Query:  ESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFI
        ES++G+K SNL +QC HRYKIYIEG+AWSVSEKYI+ACDS+TL V+P FYDF+ R + P+ HYWP+RD  KCTS+KFAVHWGN+H  +A  IG+  S FI
Subjt:  ESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFI

Query:  QQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQF
        ++E+KM+ VYDYMFHL+N+YAKLL+F+PEIP GA E+  + M C   G  + FM ESMV  PS  SPC MP PF+   L+ +  R  NL RQVE WEDQ+
Subjt:  QQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQF

Query:  WESL-NSKQP
        +  L N K+P
Subjt:  WESL-NSKQP

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)4.1e-18162.37Show/hide
Query:  QTETTSQKHQKVPKKIIQNYP----LNCSS-TGNQTQIFTCRK-NYPT--KTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHF
        + ETT         K+I   P    LNC++ +GN T   TC K NYPT  +++    + D S    CPDYFRWIHEDL PW  TGITRE +ERA  TA F
Subjt:  QTETTSQKHQKVPKKIIQNYP----LNCSS-TGNQTQIFTCRK-NYPT--KTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHF

Query:  RLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGW
        RL I+ G+ YVE++++A QTRDVFTIWG +QLLRRYPG IPDLELMFDC D PVVK++E+   ++         PPP+FRYC +  T+DIVFPDWS+WGW
Subjt:  RLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGW

Query:  AEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSE
        AE+NI+PWE+LLKEL+EGN+R+KW+ REP+AYWKGNP VA+TR DL+KCNLS   DW ARLY QDW++ES++GYKQS+LASQC+HRYKIYIEG AWSVSE
Subjt:  AEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSE

Query:  KYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPA
        KYILACDSVTL+VKP++YDFFTR + P HHYWPV++D KC SIKFAV WGN H +KAQ IGK AS+F+QQELKMD VYDYMFHLL QY+KLLRF+PEIP 
Subjt:  KYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPA

Query:  GAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFWESLN
         + E+CSE MACPRDG+E+KFM ES+VK P+ T PCAMPPP+D  S   + +R  +   ++E+WE ++W   N
Subjt:  GAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFWESLN

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821)2.3e-16055.58Show/hide
Query:  KVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAI
        K+  K  +  P  C    NQ+      +N  ++     +  + S    CP YFRWIHEDL PW+ TGITR M+E A  TAHFRLVI  GKAYV++YKK+I
Subjt:  KVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAI

Query:  QTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEG
        QTRD FT+WGILQLLR YPG +PDLELMFD DDRPVV+S ++  + +         PPPVFRYC D  ++DIVFPDWSFWGWAE+N++PW   L+ +KEG
Subjt:  QTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEG

Query:  NERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFY
        N  ++W  R  +AYW+GNPYV   R DLLKCN +   +WN RLY+QDW +E+++G+K SNL +QC HRYKIYIEG+AWSVSEKYI+ACDS+TL VKP FY
Subjt:  NERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFY

Query:  DFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSE
        DF+ R + P+ HYWP+RDD KCTS+KFAVHWGN+H+ KA+ IG+  S FI++E+ M  VYDYMFHLL +YA LL+F+PEIP  A E+  ++M CP     
Subjt:  DFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSE

Query:  KKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFWESLNSK
        + F  ESM+ +PS  SPC M PP+D  +L+ +  R ANL RQVE WE+Q++++L +K
Subjt:  KKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFWESLNSK

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)4.2e-18658.94Show/hide
Query:  IWRPPLRKASVAVPAAVFLLATLTAVAVL--ISSTR--------FQSTLSSIYFLGNQTETTSQKHQKVPKKIIQN----YPLNCSSTGNQTQIFTCRKN
        IW P ++      P   + L +L  + ++    STR         +   ++      QT+T + K+ +    I Q+    + L+CS+  N+T        
Subjt:  IWRPPLRKASVAVPAAVFLLATLTAVAVL--ISSTR--------FQSTLSSIYFLGNQTETTSQKHQKVPKKIIQN----YPLNCSSTGNQTQIFTCRKN

Query:  YPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMF
        YPT T+F+ DD +      CPDYFRWIHEDL PW  TGITRE +ERAK TA FRL IVGGK YVE+++ A QTRDVFTIWG LQLLR+YPG IPDLELMF
Subjt:  YPTKTTFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMF

Query:  DCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLL
        DC D PVV+++E+   N          PPP+FRYCG+  T+DIVFPDWSFWGWAE+NI+PWE+LLKEL+EGNER+KW+ REP+AYWKGNP VA+TRQDL+
Subjt:  DCDDRPVVKSSEYRNRNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLL

Query:  KCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAV
        KCN+S +++WNARLY QDWI+ES++GYKQS+LASQC+HRYKIYIEG AWSVSEKYILACDSVTLLVKP++YDFFTR L P HHYWPVR+  KC SIKFAV
Subjt:  KCNLSHQNDWNARLYVQDWIQESQQGYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAV

Query:  HWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSL
         WGNSH QKAQ IGKAASDFIQQ+LKMD VYDYM+HLL +Y+KLL+F+PEIP  A E+CSETMAC R G+E+KFMTES+VK P+ + PCAMPPP+D  + 
Subjt:  HWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMFHLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSL

Query:  QRLYRRNANLIRQVEKWEDQFWESLN
          + +R  +   ++ +WE ++W   N
Subjt:  QRLYRRNANLIRQVEKWEDQFWESLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGGTTTTCCACTTCCATTTTCCGGCGACCTACTGCTCTCTCCCGAAAGCCCATCTGGCGCCCGCCGCTCCGGAAAGCCTCCGTCGCAGTTCCAGCCGCCGTCTT
CCTCCTCGCTACCCTCACCGCCGTCGCCGTCCTCATTTCCTCCACTCGCTTTCAATCTACTCTGTCTTCGATTTACTTTCTCGGGAACCAAACAGAAACAACCTCCCAGA
AGCATCAGAAAGTCCCTAAAAAGATAATCCAAAATTACCCACTTAACTGTTCCTCCACCGGAAACCAAACCCAAATCTTCACCTGCCGGAAAAACTACCCAACCAAGACC
ACATTCCAAGCCGATGACCTCGACCCGTCGGGTCGACCCGTTTGCCCGGACTACTTCCGGTGGATCCACGAGGATCTCGGGCCATGGAGAGCCACCGGAATCACGAGGGA
GATGGTGGAGAGGGCGAAGACGACGGCGCATTTCCGGCTGGTGATCGTCGGCGGCAAGGCCTACGTGGAACAGTACAAGAAAGCCATTCAGACGAGGGACGTGTTTACGA
TTTGGGGGATTCTGCAGCTGTTGAGAAGATACCCCGGTGGAATACCGGATTTGGAGCTGATGTTCGACTGCGACGACCGGCCGGTGGTGAAATCGTCCGAGTACCGGAAT
CGGAATCGGAACCGGACGGCTACGGCGACTGCGGGGCCGCCGCCGGTGTTCCGGTACTGCGGCGACATGTGGACGATGGACATCGTGTTCCCCGATTGGTCCTTCTGGGG
ATGGGCAGAGATAAATATAAGGCCATGGGAGAATTTGTTGAAGGAATTGAAAGAAGGGAATGAGAGAAGCAAATGGTTGCAAAGGGAACCCTTTGCCTATTGGAAAGGGA
ACCCTTACGTGGCAGACACCAGACAAGATCTTCTCAAATGCAACCTCTCCCATCAAAATGACTGGAATGCCCGCCTCTACGTCCAGGATTGGATCCAAGAGTCTCAACAA
GGTTATAAGCAATCCAACTTGGCCAGCCAATGTAATCACAGGTACAAGATCTACATAGAAGGGTATGCATGGTCGGTGAGTGAAAAATATATATTGGCATGTGATTCAGT
GACCTTGCTTGTAAAGCCCAATTTCTATGATTTCTTCACCAGATCTTTACAGCCAGTTCACCATTATTGGCCTGTTAGAGATGACCAGAAATGCACATCCATCAAGTTTG
CTGTCCATTGGGGCAACTCCCACAAACAAAAGGCACAAGCAATAGGAAAAGCAGCAAGCGACTTCATCCAACAAGAGCTGAAGATGGACAATGTTTATGACTACATGTTT
CATCTCCTCAACCAATACGCCAAGCTCCTCCGGTTTCGGCCTGAAATCCCCGCAGGCGCAGCGGAAGTTTGCTCCGAGACGATGGCTTGCCCGAGAGACGGGTCGGAGAA
GAAGTTCATGACGGAGTCGATGGTGAAAACTCCCTCTCTCACCAGCCCCTGCGCCATGCCACCACCCTTTGACAGCCCTTCTCTCCAGAGGCTTTACAGAAGGAATGCCA
ACCTAATTAGGCAAGTGGAGAAGTGGGAAGATCAGTTTTGGGAGAGCCTCAATAGTAAGCAACCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAGGTTTTCCACTTCCATTTTCCGGCGACCTACTGCTCTCTCCCGAAAGCCCATCTGGCGCCCGCCGCTCCGGAAAGCCTCCGTCGCAGTTCCAGCCGCCGTCTT
CCTCCTCGCTACCCTCACCGCCGTCGCCGTCCTCATTTCCTCCACTCGCTTTCAATCTACTCTGTCTTCGATTTACTTTCTCGGGAACCAAACAGAAACAACCTCCCAGA
AGCATCAGAAAGTCCCTAAAAAGATAATCCAAAATTACCCACTTAACTGTTCCTCCACCGGAAACCAAACCCAAATCTTCACCTGCCGGAAAAACTACCCAACCAAGACC
ACATTCCAAGCCGATGACCTCGACCCGTCGGGTCGACCCGTTTGCCCGGACTACTTCCGGTGGATCCACGAGGATCTCGGGCCATGGAGAGCCACCGGAATCACGAGGGA
GATGGTGGAGAGGGCGAAGACGACGGCGCATTTCCGGCTGGTGATCGTCGGCGGCAAGGCCTACGTGGAACAGTACAAGAAAGCCATTCAGACGAGGGACGTGTTTACGA
TTTGGGGGATTCTGCAGCTGTTGAGAAGATACCCCGGTGGAATACCGGATTTGGAGCTGATGTTCGACTGCGACGACCGGCCGGTGGTGAAATCGTCCGAGTACCGGAAT
CGGAATCGGAACCGGACGGCTACGGCGACTGCGGGGCCGCCGCCGGTGTTCCGGTACTGCGGCGACATGTGGACGATGGACATCGTGTTCCCCGATTGGTCCTTCTGGGG
ATGGGCAGAGATAAATATAAGGCCATGGGAGAATTTGTTGAAGGAATTGAAAGAAGGGAATGAGAGAAGCAAATGGTTGCAAAGGGAACCCTTTGCCTATTGGAAAGGGA
ACCCTTACGTGGCAGACACCAGACAAGATCTTCTCAAATGCAACCTCTCCCATCAAAATGACTGGAATGCCCGCCTCTACGTCCAGGATTGGATCCAAGAGTCTCAACAA
GGTTATAAGCAATCCAACTTGGCCAGCCAATGTAATCACAGGTACAAGATCTACATAGAAGGGTATGCATGGTCGGTGAGTGAAAAATATATATTGGCATGTGATTCAGT
GACCTTGCTTGTAAAGCCCAATTTCTATGATTTCTTCACCAGATCTTTACAGCCAGTTCACCATTATTGGCCTGTTAGAGATGACCAGAAATGCACATCCATCAAGTTTG
CTGTCCATTGGGGCAACTCCCACAAACAAAAGGCACAAGCAATAGGAAAAGCAGCAAGCGACTTCATCCAACAAGAGCTGAAGATGGACAATGTTTATGACTACATGTTT
CATCTCCTCAACCAATACGCCAAGCTCCTCCGGTTTCGGCCTGAAATCCCCGCAGGCGCAGCGGAAGTTTGCTCCGAGACGATGGCTTGCCCGAGAGACGGGTCGGAGAA
GAAGTTCATGACGGAGTCGATGGTGAAAACTCCCTCTCTCACCAGCCCCTGCGCCATGCCACCACCCTTTGACAGCCCTTCTCTCCAGAGGCTTTACAGAAGGAATGCCA
ACCTAATTAGGCAAGTGGAGAAGTGGGAAGATCAGTTTTGGGAGAGCCTCAATAGTAAGCAACCTTAA
Protein sequenceShow/hide protein sequence
MQRFSTSIFRRPTALSRKPIWRPPLRKASVAVPAAVFLLATLTAVAVLISSTRFQSTLSSIYFLGNQTETTSQKHQKVPKKIIQNYPLNCSSTGNQTQIFTCRKNYPTKT
TFQADDLDPSGRPVCPDYFRWIHEDLGPWRATGITREMVERAKTTAHFRLVIVGGKAYVEQYKKAIQTRDVFTIWGILQLLRRYPGGIPDLELMFDCDDRPVVKSSEYRN
RNRNRTATATAGPPPVFRYCGDMWTMDIVFPDWSFWGWAEINIRPWENLLKELKEGNERSKWLQREPFAYWKGNPYVADTRQDLLKCNLSHQNDWNARLYVQDWIQESQQ
GYKQSNLASQCNHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPVRDDQKCTSIKFAVHWGNSHKQKAQAIGKAASDFIQQELKMDNVYDYMF
HLLNQYAKLLRFRPEIPAGAAEVCSETMACPRDGSEKKFMTESMVKTPSLTSPCAMPPPFDSPSLQRLYRRNANLIRQVEKWEDQFWESLNSKQP