CuGenDBv2

HG10009152 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10009152
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Arabidopsis thaliana protein of unknown function (DUF821)
Genome location: Chr06:3026399..3030818
RNA-Seq Expression: HG10009152
Synteny: HG10009152
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR006598 - Glycosyl transferase CAP10 domain


Homology
GenBank top hits (e value, % identity, alignment)
KAB1222531.1 O-glucosyltransferase rumi [Morella rubra] (e value: 4.8e-205; identity: 67.89%)
Query:  RPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES
        R  W+P L K    VA P A +F +  L   + ISS RL ++  S +   N T  +S K     K I  +PLNCS     NQTQ   C  NYPT F P+ 
Subjt:  RPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES

Query:  IDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSA
        +DPS  PVC  YFRWIH+DL PW A GITREMVEK K TAHFRL VV G+ Y+E YKKSIQTRD+FTIWGILQLLRRYPGRIPDLELMFDC+D PV++S 
Subjt:  IDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYI
        DYR     T GPPP+FRYCGD  T DIVFPDWSFWGWAEINIRPWE+LLK+LK+GN + KWMERE +AYWKGNP V ETR+DLLKCNLS   DWNARLY+
Subjt:  DYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYI

Query:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKA
        QDWI ES+QGYK+S+LASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+V P FYDFFTR LQPVHHYWP+ DD KC+SIKFAV WGN+HK+KAQAIGKA
Subjt:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKA

Query:  ASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEK
        +SDFIQ+ELKM+ VYDYMFH+LN YAKLL+F P+IP GAVE+CSET+AC   G+EKKFM ES+VK PS+T PC+MPPPF+   L  LYRRN N+I+QV K
Subjt:  ASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEK

Query:  WENEFW------NKNKP
        WEN +W       K+KP
Subjt:  WENEFW------NKNKP

XP_004143920.1 O-glucosyltransferase rumi homolog isoform X1 [Cucumis sativus] (e value: 3.0e-271; identity: 86.78%)
Query:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT
        MQRFPISTSI  P RRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKKSIKYYPLNCSSSSTTNQTQ+FT
Subjt:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT

Query:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL
        CRK+YPT +EPESI PS R VC EYFRWIHEDL+PWAAGGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWGILQLLRRYPG+IPDLEL
Subjt:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCN
        MFDCDDRPVVKSADYR A V+T   PPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEK KWM+RE FAYWKGNPYVA+TRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY WSVSEKYILACDS+TLLVKPNFYDFF+RSL+P+HHYWPLSDDHKCKSIKFAV WG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWG

Query:  NSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL
        NSHKQKAQ IGK AS+FIQQEL+MENVYDYMFHLLN YAKLLRF+PEIP GA+E+CSET+ACPRDG EKKFM+ESMVKTPSLTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL

Query:  YRRNANLIRQVEKWENEFWNKN
        YRRNANLI QVEKWEN FW +N
Subjt:  YRRNANLIRQVEKWENEFWNKN

XP_031740971.1 O-glucosyltransferase rumi homolog isoform X2 [Cucumis sativus] (e value: 1.3e-250; identity: 81.99%)
Query:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT
        MQRFPISTSI  P RRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKKSIKYYPLNCSSSSTTNQTQ+FT
Subjt:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT

Query:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL
        CRK+YPT +EPESI PS R VC EYFRWIHEDL+PWAAGGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWGILQLLRRYPG+IPDLEL
Subjt:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCN
        MFDCDDRPVVKSADYR A V+T   PPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEK KWM+RE FAYWKGNPYVA+TRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY                            RSL+P+HHYWPLSDDHKCKSIKFAV WG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWG

Query:  NSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL
        NSHKQKAQ IGK AS+FIQQEL+MENVYDYMFHLLN YAKLLRF+PEIP GA+E+CSET+ACPRDG EKKFM+ESMVKTPSLTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL

Query:  YRRNANLIRQVEKWENEFWNKN
        YRRNANLI QVEKWEN FW +N
Subjt:  YRRNANLIRQVEKWENEFWNKN

XP_031740972.1 O-glucosyltransferase rumi homolog isoform X3 [Cucumis sativus] (e value: 5.3e-244; identity: 80.27%)
Query:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT
        MQRFPISTSI  P RRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKKSIKYYPLNCSSSSTTNQTQ+FT
Subjt:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT

Query:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL
        CRK+YPT +EPESI PS R VC EYFRWIHEDL+PWAAGGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWGILQLLRRYPG+IPDLEL
Subjt:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCN
        MFDCDDRPVVKSADYR A V+T   PPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEK KWM+RE FAYWKGNPYVA+TRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT                                    ++RSL+P+HHYWPLSDDHKCKSIKFAV WG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWG

Query:  NSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL
        NSHKQKAQ IGK AS+FIQQEL+MENVYDYMFHLLN YAKLLRF+PEIP GA+E+CSET+ACPRDG EKKFM+ESMVKTPSLTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL

Query:  YRRNANLIRQVEKWENEFWNKN
        YRRNANLI QVEKWEN FW +N
Subjt:  YRRNANLIRQVEKWENEFWNKN

XP_038875037.1 protein O-glucosyltransferase 1-like [Benincasa hispida] (e value: 1.9e-294; identity: 91.79%)
Query:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT
        MQRFPIST I RP RRPIWQPSLPKRSV VAVPAAA+FLLA LS AV+ISS RLQSTLFS NFLGNQTE ISPKIPKKSIKYYPL+CSSSSTTNQTQNF 
Subjt:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT

Query:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL
        CRKNYPT+FEPES D S RP+C EYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLR+YPGR+PDLEL
Subjt:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCN
        MFDCDDRPVVKSADY+TA +ETAGPPP+FRYCGDE+TLDIVFPDWSFWGW EINIRPWE LLKELKKGNEK KWMEREGFAYWKGNPYVA+TRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWG
        LSQQNDWNARLYIQDWI ESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLL+KPNFYDFFTRSLQP+HHYWP+SDDHKCKSIKFAV WG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWG

Query:  NSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL
        NSHKQKAQAIGKAASDFIQQ+LKMENVYDYMFHLLNQYAKLLRFRP IP GAVE+CSET+ACPRDGMEKKFM+ESMVKTPSLTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL

Query:  YRRNANLIRQVEKWENEFWNKNKP
        YRRNANLIRQVEKWE+EFWN+NKP
Subjt:  YRRNANLIRQVEKWENEFWNKNKP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KQX4 CAP10 domain-containing protein (e value: 1.4e-271; identity: 86.78%)
Query:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT
        MQRFPISTSI  P RRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKKSIKYYPLNCSSSSTTNQTQ+FT
Subjt:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT

Query:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL
        CRK+YPT +EPESI PS R VC EYFRWIHEDL+PWAAGGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWGILQLLRRYPG+IPDLEL
Subjt:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCN
        MFDCDDRPVVKSADYR A V+T   PPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEK KWM+RE FAYWKGNPYVA+TRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY WSVSEKYILACDS+TLLVKPNFYDFF+RSL+P+HHYWPLSDDHKCKSIKFAV WG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWG

Query:  NSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL
        NSHKQKAQ IGK AS+FIQQEL+MENVYDYMFHLLN YAKLLRF+PEIP GA+E+CSET+ACPRDG EKKFM+ESMVKTPSLTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL

Query:  YRRNANLIRQVEKWENEFWNKN
        YRRNANLI QVEKWEN FW +N
Subjt:  YRRNANLIRQVEKWENEFWNKN

A0A2N9J2X5 CAP10 domain-containing protein (e value: 1.9e-210; identity: 69.84%)
Query:  WQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES-ID
        WQP L K        A  I    TL     IS+ RL ++ F      N T TI  K  IP K I  +PLNCS     NQTQ  TC  NYP  F P + +D
Subjt:  WQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES-ID

Query:  PSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADY
        PS  PVC +YFRWIHEDLRPW   GITR+MVEK K++AHFRL +V G+ Y+E YKKSIQTRD+FTIWGILQLLRRYPGR+PD+ELMFDCDDRPV+KSADY
Subjt:  PSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADY

Query:  RTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQD
        R A +   GPPP+FRYCGD  T+DIVFPDWSFWGWAEINI+PWE+LLKELK+GN++SKW+ERE +AYWKGNP+VAETR+DLLKCN+S + DWNARLYIQD
Subjt:  RTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQD

Query:  WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAAS
        WI ESQQGYK+S LASQCTHRYKIYIEGYAWSVSEKYILACDSV+L+VKP +YDFFTRSL+PVHHYWP+ DD KCKSIKFAV WGN+HKQKAQAIGKA+S
Subjt:  WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAAS

Query:  DFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWE
        DFIQ+ELKM+ VYDYMFHLLN+YAKLL+F P+IP GA+E+CSET AC  +G EKKFM ES+VK PSLT PC+MPPP++   L  L RRN N+IRQVEKWE
Subjt:  DFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWE

Query:  NEFW
        N++W
Subjt:  NEFW

A0A5B7BT47 CAP10 domain-containing protein (e value: 4.3e-199; identity: 65.54%)
Query:  IWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIK--YYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESID
        IW+  L K   T  +      LLAT     L+SS+ + S+ FS+  +  +  TI PK  ++S K    PLNCS+ + T      TC  NYPT FE E  D
Subjt:  IWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIK--YYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESID

Query:  PSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADY
        PS    C +YFRWIHEDLR +A+ GITR+MVE+ K TAHFR+ +V G+VYVE YKKSIQTRD+FT+WGILQLLRRYPGR+PDLELMFDC+DRPV++  DY
Subjt:  PSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADY

Query:  RTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQD
        R        PPP+FRYCGD   LDIVFPDWSFWGWAEINI+PWE++LKE+K+GN ++KWMERE +AYWKGNP+VAETR+DLLKCN+S + DWNARL++QD
Subjt:  RTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQD

Query:  WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAAS
        WI ESQQGYKQS+LASQCT+RYKIYIEGYAWSVSEKYILACDSVTLLVKP++YDFF RSLQPVHHYWP+ DD KC+SIKFAV WGN+HKQKAQAIGKAA 
Subjt:  WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAAS

Query:  DFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWE
        DFIQ+ELKM+ VYDYMFHLLN+YAKLL+F P++P GA E CSET+ACP DG+EK+FM ES+VK PS+T PC++PPP+D  +L    +R  N I+QVE WE
Subjt:  DFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWE

Query:  NEFWN
          +W+
Subjt:  NEFWN

A0A6A1WB87 O-glucosyltransferase rumi (e value: 2.3e-205; identity: 67.89%)
Query:  RPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES
        R  W+P L K    VA P A +F +  L   + ISS RL ++  S +   N T  +S K     K I  +PLNCS     NQTQ   C  NYPT F P+ 
Subjt:  RPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES

Query:  IDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSA
        +DPS  PVC  YFRWIH+DL PW A GITREMVEK K TAHFRL VV G+ Y+E YKKSIQTRD+FTIWGILQLLRRYPGRIPDLELMFDC+D PV++S 
Subjt:  IDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYI
        DYR     T GPPP+FRYCGD  T DIVFPDWSFWGWAEINIRPWE+LLK+LK+GN + KWMERE +AYWKGNP V ETR+DLLKCNLS   DWNARLY+
Subjt:  DYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYI

Query:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKA
        QDWI ES+QGYK+S+LASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+V P FYDFFTR LQPVHHYWP+ DD KC+SIKFAV WGN+HK+KAQAIGKA
Subjt:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKA

Query:  ASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEK
        +SDFIQ+ELKM+ VYDYMFH+LN YAKLL+F P+IP GAVE+CSET+AC   G+EKKFM ES+VK PS+T PC+MPPPF+   L  LYRRN N+I+QV K
Subjt:  ASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEK

Query:  WENEFW------NKNKP
        WEN +W       K+KP
Subjt:  WENEFW------NKNKP

M5WWD4 CAP10 domain-containing protein (e value: 7.3e-199; identity: 67%)
Query:  WQPSLPKRSVTVAVPAAAIFLLA--TLSAAVLISSTRLQSTLFSINFLGNQTETISPKI--PKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRF-EPES
        W PS   +S  +A     I  L    +SAA+L S   +Q + F I    N+T  IS K   P K I+ +PLNCS  S  NQTQ  TC  +YPT F   + 
Subjt:  WQPSLPKRSVTVAVPAAAIFLLA--TLSAAVLISSTRLQSTLFSINFLGNQTETISPKI--PKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRF-EPES

Query:  IDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSA
        ++PS  P+C +YFR+IH+DL PW A GITR+MVE+ K TAHFRL +V G+ YVE YKKSIQTRD+FTIWGILQLLRRYPGR+PDLELMFDCDD+PV++S 
Subjt:  IDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYI
        D+R     +   PP+FRYCGD  T DIVFPDWSFWGWAEINI+PWE LLK+LKKGN++ KWMERE +AYWKGNP+VAE+R+DLLKCN+S   DWNARL+I
Subjt:  DYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYI

Query:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKA
        QDWI ESQQG+KQS++ASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+VKP +YDFFTRSLQPVHHYWP+  D KCKSIKFAV WGN+HKQKAQAIGKA
Subjt:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKA

Query:  ASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEK
        ASDFIQQELKM+ VYDYMFHLLN+YAKLLRF P+IP GA  +CSE++ACP    EKKFM ES+VK+PS+T PC+MPP F   +L  LYRRN NL +QV+K
Subjt:  ASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEK

Query:  WENEFW
        WE+++W
Subjt:  WENEFW

SwissProt top hits (e value, % identity, alignment)
B0X1Q4 O-glucosyltransferase rumi homolog (e value: 1.4e-21; identity: 24.18%)
Query:  CSSSSTTNQTQNFTCRKN--------YPTRFEP--ESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTR
        C +  T  +TQ      N        Y T  E    +  P +   CS +   +  DLRP+ + GIT++++E  ++    +  ++G R++        + R
Subjt:  CSSSSTTNQTQNFTCRKN--------YPTRFEP--ESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTR

Query:  DLF---TIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKK
        D        G+   +R    ++PD+EL+ +C D P +  + +  A+ E   P PV  +    + LDI++P W FW G   I++ P     W+     ++K
Subjt:  DLF---TIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKK

Query:  GNEKSKWMEREGFAYWKG-------NPYV--AETRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDS
          +   W ++   A+++G       +P V  +  R +L+    ++   W +    +D +    +  ++  L   C ++Y     G A S   K++  C S
Subjt:  GNEKSKWMEREGFAYWKG-------NPYV--AETRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDS

Query:  VTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEI
        +   V   + +FF  SL+P  HY P+        ++  +Q+   H Q AQ I     + I   L+ME+V  Y   LL +Y KL+++  +     VEI
Subjt:  VTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEI

G3V9D0 Protein O-glucosyltransferase 1 (e value: 2.4e-21; identity: 25.57%)
Query:  ESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREM---VEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRP
        E+ +P     CS Y   I EDL P+  GGI+R+M   V + +   H++  ++  R++ E       +R       IL+++R    R+PD+E++ +  D P
Subjt:  ESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREM---VEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRP

Query:  VVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNEKSKWMEREGFAYWKG-------NPYVAETRQ
         V      T         PVF +    E  DI++P W+FW  G A   + P     W+   ++L +   +  W ++   AY++G       +P +  +R+
Subjt:  VVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNEKSKWMEREGFAYWKG-------NPYVAETRQ

Query:  D--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKS
        +  L+    ++   W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V   + +FF   L+P  HY P+  D     
Subjt:  D--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKS

Query:  IKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRF
        ++  +Q+  ++   AQ I K  S FI   L+M+++  Y  +LL +Y+K L +
Subjt:  IKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRF

Q5E9Q1 Protein O-glucosyltransferase 1 (e value: 5.1e-24; identity: 25.85%)
Query:  ESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREM---VEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRP
        E+ +P   P CS Y   I EDL P+  GGI+R+M   V + K   H++  ++  R+Y E               G+   +    GR+PD+E++ +  D P
Subjt:  ESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREM---VEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRP

Query:  VVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNEKSKWMEREGFAYWKG-------NPYVAETRQ
         V    +   A+      P+F +    E  DI++P W+FW  G A   I P     W+   ++L +   +  W ++   AY++G       +P +  +R+
Subjt:  VVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNEKSKWMEREGFAYWKG-------NPYVAETRQ

Query:  D--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKS
        +  L+    ++   W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V   + +FF   L+P  HY P+  D    +
Subjt:  D--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKS

Query:  IKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRF
        ++  +Q+  ++   AQ I +  S FI   LKM+++  Y  +LL +Y+K L +
Subjt:  IKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRF

Q8BYB9 Protein O-glucosyltransferase 1 (e value: 1.8e-21; identity: 25.85%)
Query:  ESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREM---VEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRP
        E+ +P     CS Y   I EDL P+  GGI+R+M   V + K   H++  ++  R++ E       +R       IL+++     R+PD+E++ +  D P
Subjt:  ESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREM---VEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRP

Query:  VVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNEKSKWMEREGFAYWKG-------NPYVAETRQ
         V      T         PVF +    E  DI++P W+FW  G A   + P     W+   ++L +   +  W ++   AY++G       +P +  +R+
Subjt:  VVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNEKSKWMEREGFAYWKG-------NPYVAETRQ

Query:  D--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKS
        +  L+    ++   W +   ++D +   +   K  +L   C +RY     G A S   K++  C S+   V   + +FF   L+P  HY P+  D    +
Subjt:  D--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKS

Query:  IKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRF
        ++  +Q+  ++   AQ I K  S FI   L+M+++  Y  +LL  Y+K L +
Subjt:  IKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRF

Q8NBL1 Protein O-glucosyltransferase 1 (e value: 1.9e-23; identity: 25.57%)
Query:  ESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREM---VEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRP
        E+ +P     CS Y   I EDL P+  GGI+R+M   V + K   H++  +   R+Y E+              G+   +    GR+PD+E++ +  D P
Subjt:  ESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREM---VEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRP

Query:  VVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNEKSKWMEREGFAYWKG-------NPYVAETRQ
         V    +   A+      PVF +    E  DI++P W+FW  G A   I P     W+   ++L +   +  W ++   AY++G       +P +  +R+
Subjt:  VVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNEKSKWMEREGFAYWKG-------NPYVAETRQ

Query:  D--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKS
        +  L+    ++   W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V   + +FF   L+P  HY P+  D    +
Subjt:  D--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKS

Query:  IKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRF
        ++  +Q+  ++   AQ I +  S FI+  L+M+++  Y  +LL++Y+K L +
Subjt:  IKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRF

Arabidopsis top hits (e value, % identity, alignment)
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821) (e value: 3.4e-172; identity: 59.82%)
Query:  KKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTR
        KKS +    +   SS  NQ ++ +C +   + +     + S+R  C +YF+WIHEDL+PW   GIT+EMVE+GK TAHFRL ++ G+V+VE+YKKSIQTR
Subjt:  KKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTR

Query:  DLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYR--TAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKW
        D FT+WGILQLLR+YPG++PD++LMFDCDDRPV++S  Y      VE A PPP+FRYCGD  T+DIVFPDWSFWGW EINIR W  +LKE+++G +K K+
Subjt:  DLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYR--TAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKW

Query:  MEREGFAYWKGNPYVAE-TRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTR
        MER+ +AYWKGNP+VA  +R+DLL CNLS  +DWNAR++IQDWI E Q+G++ SN+A+QCT+RYKIYIEGYAWSVSEKYILACDSVTL+VKP +YDFF+R
Subjt:  MEREGFAYWKGNPYVAE-TRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTR

Query:  SLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRD-----GME
        +LQP+ HYWP+ D  KC+SIKFAV W N+H QKAQ IG+ AS+F+Q++L MENVYDYMFHLLN+Y+KLL+++P++P  +VE+C+E + CP +     G++
Subjt:  SLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRD-----GME

Query:  KKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK
        KKFM  S+V  P  + PCS+PPPFD+  L++ +R+  NLIRQVEKWE+ +W K
Subjt:  KKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK

AT2G45830.1 downstream target of AGL15 2 (e value: 3.6e-158; identity: 53.66%)
Query:  AVPAAAIFLLATL--SAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRW
        ++  A +FL+ +L  SA +L     L        F G +  T S +    + + +P  C      NQTQ F    +     +P S   S    C  YFRW
Subjt:  AVPAAAIFLLATL--SAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRW

Query:  IHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPV
        IHEDLRPW   G+TR M+EK + TAHFR+ ++ GRVYV+ Y+KSIQTRD+FT+WGI+QLLR YPGR+PDLELMFD DDRP V+S D++    +   PPP+
Subjt:  IHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPV

Query:  FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSN
        FRYC D+ +LDIVFPDWSFWGWAE+NI+PW+  L  +++GN+ ++W +R  +AYW+GNP VA TR+DLL+CN+S Q DWN RLYIQDW RES++G+K SN
Subjt:  FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSN

Query:  LASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVY
        L +QCTHRYKIYIEG+AWSVSEKYI+ACDS+TL V+P FYDF+ R + P+ HYWP+ D  KC S+KFAV WGN+H  +A  IG+  S FI++E+KME VY
Subjt:  LASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVY

Query:  DYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWN
        DYMFHL+N+YAKLL+F+PEIP GA EI  + + C   G  + FM ESMV  PS   PC MP PF+   L+ +  R  NL RQVE WE+++++
Subjt:  DYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWN

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821) (e value: 1.7e-179; identity: 63.16%)
Query:  SPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRK-NYPTRF-----EPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVY
        S K+  +  K + LNC++ S  +     TC K NYPT F     E ES D S    C +YFRWIHEDLRPW   GITRE +E+  ATA FRLA++ GR+Y
Subjt:  SPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRK-NYPTRF-----EPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVY

Query:  VEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKEL
        VE ++++ QTRD+FTIWG +QLLRRYPG+IPDLELMFDC D PVVK+A++  A V+   PPP+FRYC ++ETLDIVFPDWS+WGWAE+NI+PWE+LLKEL
Subjt:  VEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKEL

Query:  KKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKP
        ++GN+++KW++RE +AYWKGNP VAETR DL+KCNLS+  DW ARLY QDW++ES++GYKQS+LASQC HRYKIYIEG AWSVSEKYILACDSVTL+VKP
Subjt:  KKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKP

Query:  NFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRD
        ++YDFFTR + P HHYWP+ +D KC+SIKFAV WGN H +KAQ IGK AS+F+QQELKM+ VYDYMFHLL QY+KLLRF+PEIP  + E+CSE +ACPRD
Subjt:  NFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRD

Query:  GMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK
        G E+KFM ES+VK P+ T PC+MPPP+D  S   + +R  +   ++E+WE+++W K
Subjt:  GMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821) (e value: 8.9e-157; identity: 52.81%)
Query:  TVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRW
        TV  P  +I + AT+   VL  S  +   L  ++F          K+  K+ +  P  C      NQ+      +N  +R  P +   S    C  YFRW
Subjt:  TVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRW

Query:  IHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPV
        IHEDLRPW   GITR M+E+   TAHFRL +  G+ YV+ YKKSIQTRD FT+WGILQLLR YPG++PDLELMFD DDRPVV+S D+     E   PPPV
Subjt:  IHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPV

Query:  FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSN
        FRYC D+ +LDIVFPDWSFWGWAE+N++PW   L+ +K+GN  ++W +R  +AYW+GNPYV   R DLLKCN ++  +WN RLYIQDW +E+++G+K SN
Subjt:  FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSN

Query:  LASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVY
        L +QCTHRYKIYIEG+AWSVSEKYI+ACDS+TL VKP FYDF+ R + P+ HYWP+ DD KC S+KFAV WGN+H+ KA+ IG+  S FI++E+ M+ VY
Subjt:  LASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVY

Query:  DYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK--NKP
        DYMFHLL +YA LL+F+PEIP+ A EI  +++ CP     + F  ESM+ +PS   PC M PP+D  +L+ +  R ANL RQVE WEN+++    NKP
Subjt:  DYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK--NKP

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821) (e value: 8.0e-182; identity: 57.85%)
Query:  IWQPSL-------PKRSVTVAVPAAAIFLLATLSAAVLISST-RLQSTLFSINFLGNQTETISPKIPKKSI-------KYYPLNCSSSSTTNQTQNFTCR
        IW P +       P RS  +      + + A +S  +L+ +T  L+    +      QT+TI+PK P+ +          + L+CS++ TT      +C 
Subjt:  IWQPSL-------PKRSVTVAVPAAAIFLLATLSAAVLISST-RLQSTLFSINFLGNQTETISPKIPKKSI-------KYYPLNCSSSSTTNQTQNFTCR

Query:  KN-YP--TRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLE
         N YP  T FE +  +      C +YFRWIHEDLRPW+  GITRE +E+ K TA FRLA+VGG++YVE ++ + QTRD+FTIWG LQLLR+YPG+IPDLE
Subjt:  KN-YP--TRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLE

Query:  LMFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKC
        LMFDC D PVV++ ++  A      PPP+FRYCG+EETLDIVFPDWSFWGWAE+NI+PWE+LLKEL++GNE++KW+ RE +AYWKGNP VAETRQDL+KC
Subjt:  LMFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKC

Query:  NLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQW
        N+S++++WNARLY QDWI+ES++GYKQS+LASQC HRYKIYIEG AWSVSEKYILACDSVTLLVKP++YDFFTR L P HHYWP+ +  KC+SIKFAV W
Subjt:  NLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQW

Query:  GNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQR
        GNSH QKAQ IGKAASDFIQQ+LKM+ VYDYM+HLL +Y+KLL+F+PEIP  AVEICSET+AC R G E+KFM ES+VK P+ + PC+MPPP+D  +   
Subjt:  GNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQR

Query:  LYRRNANLIRQVEKWENEFWNK
        + +R  +   ++ +WE ++W+K
Subjt:  LYRRNANLIRQVEKWENEFWNK


Sequences
CDS sequence
ATGCAGAGATTTCCAATTTCAACTTCAATTTTGCGACCATGCCGGAGACCCATTTGGCAACCGTCTCTTCCAAAACGCTCCGTCACCGTCGCAGTTCCGGCCGCCGCCAT
CTTCCTCCTCGCCACCCTCTCCGCCGCCGTCCTCATTTCCTCCACTCGCCTCCAATCCACTCTGTTTTCCATTAATTTTCTCGGAAACCAAACAGAAACAATCTCACCAA
AAATCCCCAAAAAATCAATCAAATATTACCCACTTAACTGTTCTTCATCTTCCACCACAAACCAAACCCAGAATTTCACCTGCCGGAAAAACTACCCGACCCGATTCGAA
CCCGAATCAATCGACCCATCGGATCGGCCCGTTTGCTCAGAGTATTTCCGATGGATCCACGAGGATCTGCGGCCGTGGGCGGCGGGCGGAATCACGAGGGAGATGGTGGA
GAAAGGGAAGGCGACGGCGCATTTCCGGCTGGCGGTGGTCGGCGGGAGGGTCTACGTGGAGCACTACAAGAAATCAATTCAAACGAGGGATTTGTTTACGATTTGGGGGA
TTTTGCAGCTTCTGAGAAGGTACCCAGGGCGAATCCCTGATTTGGAGCTGATGTTCGACTGTGATGACCGACCGGTTGTTAAATCGGCCGATTACCGGACTGCGGCCGTG
GAGACGGCGGGGCCGCCACCGGTGTTCCGGTACTGCGGCGATGAGGAGACATTGGATATTGTGTTCCCGGATTGGTCCTTCTGGGGATGGGCAGAGATAAATATAAGGCC
ATGGGAGAATTTGTTGAAGGAATTGAAAAAAGGGAATGAAAAAAGCAAATGGATGGAAAGGGAAGGTTTTGCTTATTGGAAAGGAAACCCTTATGTGGCTGAAACAAGAC
AAGATCTTCTCAAATGCAACCTCTCCCAACAAAATGATTGGAATGCTCGCCTCTACATTCAGGATTGGATTCGAGAGTCTCAACAAGGTTATAAGCAATCGAACTTGGCA
AGCCAATGCACCCATAGGTACAAGATCTACATAGAAGGATATGCATGGTCAGTGAGTGAAAAATACATATTGGCATGTGATTCAGTGACATTACTTGTAAAGCCCAATTT
TTATGATTTCTTCACTAGATCTTTACAGCCAGTTCATCATTATTGGCCTCTTAGTGATGACCATAAGTGCAAATCCATCAAGTTTGCTGTCCAATGGGGCAATTCCCACA
AACAAAAGGCACAAGCTATAGGGAAAGCAGCAAGTGACTTCATCCAACAAGAGTTAAAGATGGAAAATGTGTATGACTACATGTTTCATCTCCTCAACCAATACGCTAAG
CTCCTTCGTTTTCGACCTGAAATCCCGATCGGCGCAGTGGAAATCTGCTCCGAGACGGTGGCTTGCCCGAGAGATGGGATGGAGAAGAAGTTCATGAGAGAATCCATGGT
GAAAACTCCCTCTCTCACCATCCCTTGCTCCATGCCACCACCCTTTGACACCCCTTCTCTCCAAAGGCTTTATAGAAGAAATGCCAACCTAATAAGGCAAGTGGAGAAGT
GGGAAAATGAATTTTGGAATAAAAATAAACCTTAA
mRNA sequence
ATGCAGAGATTTCCAATTTCAACTTCAATTTTGCGACCATGCCGGAGACCCATTTGGCAACCGTCTCTTCCAAAACGCTCCGTCACCGTCGCAGTTCCGGCCGCCGCCAT
CTTCCTCCTCGCCACCCTCTCCGCCGCCGTCCTCATTTCCTCCACTCGCCTCCAATCCACTCTGTTTTCCATTAATTTTCTCGGAAACCAAACAGAAACAATCTCACCAA
AAATCCCCAAAAAATCAATCAAATATTACCCACTTAACTGTTCTTCATCTTCCACCACAAACCAAACCCAGAATTTCACCTGCCGGAAAAACTACCCGACCCGATTCGAA
CCCGAATCAATCGACCCATCGGATCGGCCCGTTTGCTCAGAGTATTTCCGATGGATCCACGAGGATCTGCGGCCGTGGGCGGCGGGCGGAATCACGAGGGAGATGGTGGA
GAAAGGGAAGGCGACGGCGCATTTCCGGCTGGCGGTGGTCGGCGGGAGGGTCTACGTGGAGCACTACAAGAAATCAATTCAAACGAGGGATTTGTTTACGATTTGGGGGA
TTTTGCAGCTTCTGAGAAGGTACCCAGGGCGAATCCCTGATTTGGAGCTGATGTTCGACTGTGATGACCGACCGGTTGTTAAATCGGCCGATTACCGGACTGCGGCCGTG
GAGACGGCGGGGCCGCCACCGGTGTTCCGGTACTGCGGCGATGAGGAGACATTGGATATTGTGTTCCCGGATTGGTCCTTCTGGGGATGGGCAGAGATAAATATAAGGCC
ATGGGAGAATTTGTTGAAGGAATTGAAAAAAGGGAATGAAAAAAGCAAATGGATGGAAAGGGAAGGTTTTGCTTATTGGAAAGGAAACCCTTATGTGGCTGAAACAAGAC
AAGATCTTCTCAAATGCAACCTCTCCCAACAAAATGATTGGAATGCTCGCCTCTACATTCAGGATTGGATTCGAGAGTCTCAACAAGGTTATAAGCAATCGAACTTGGCA
AGCCAATGCACCCATAGGTACAAGATCTACATAGAAGGATATGCATGGTCAGTGAGTGAAAAATACATATTGGCATGTGATTCAGTGACATTACTTGTAAAGCCCAATTT
TTATGATTTCTTCACTAGATCTTTACAGCCAGTTCATCATTATTGGCCTCTTAGTGATGACCATAAGTGCAAATCCATCAAGTTTGCTGTCCAATGGGGCAATTCCCACA
AACAAAAGGCACAAGCTATAGGGAAAGCAGCAAGTGACTTCATCCAACAAGAGTTAAAGATGGAAAATGTGTATGACTACATGTTTCATCTCCTCAACCAATACGCTAAG
CTCCTTCGTTTTCGACCTGAAATCCCGATCGGCGCAGTGGAAATCTGCTCCGAGACGGTGGCTTGCCCGAGAGATGGGATGGAGAAGAAGTTCATGAGAGAATCCATGGT
GAAAACTCCCTCTCTCACCATCCCTTGCTCCATGCCACCACCCTTTGACACCCCTTCTCTCCAAAGGCTTTATAGAAGAAATGCCAACCTAATAAGGCAAGTGGAGAAGT
GGGAAAATGAATTTTGGAATAAAAATAAACCTTAA
Protein sequence
MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFE
PESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAV
ETAGPPPVFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKSKWMEREGFAYWKGNPYVAETRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLA
SQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPNFYDFFTRSLQPVHHYWPLSDDHKCKSIKFAVQWGNSHKQKAQAIGKAASDFIQQELKMENVYDYMFHLLNQYAK
LLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNKNKP