CuGenDBv2

ClCG01G002880 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG01G002880
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Arabidopsis thaliana protein of unknown function (DUF821)
Genome location: CG_Chr01:2922190..2926819
RNA-Seq Expression: ClCG01G002880
Synteny: ClCG01G002880
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR006598 - Glycosyl transferase CAP10 domain


Homology
GenBank top hits (e-value, % identity, alignment)
KAB1222531.1 O-glucosyltransferase rumi [Morella rubra] (e-value: 7.7e-203, identity: 66.73%)
Query:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES
        R  W+P L K    VA P A +F +  L   + ISS RL ++  S +   N T   S K     K I  +PLNCS  + T       C   YPT F P+ 
Subjt:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES

Query:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA
        +DPS  P+CP YFRWIH+DL PW   GITREMVEK K TAHFRL VV G+ Y+E YKKSIQTRD+FTIWG LQL RRYPGRIPDLELMFDC+D PV++S 
Subjt:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI
        DYR P  +T GPPPLFRYCGD  T DIVFPDWSFWGWAEINIRPWE+LLK+LK+GN R KWM+REP+AYWKGNP V +TR+DLLKCNLS   DWNARLY+
Subjt:  DYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI

Query:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA
        QDWI ES+QGYK+S+LASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+V P+FYDFFTR LQPV HYWPI DD KC+SIKFAV WGN+HK+K QAIGKA
Subjt:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA

Query:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK
        +SDFIQ++LKM+ VYDYMFH+LN YAKLL+F+P+IP GAVE+CSET+AC   GLEKKFM ESLVK P +T PC+MPPPF+   L  LYRRN NII+QV K
Subjt:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK

Query:  WEDEFW------NKNKP
        WE+ +W       K+KP
Subjt:  WEDEFW------NKNKP

XP_004143920.1 O-glucosyltransferase rumi homolog isoform X1 [Cucumis sativus] (e-value: 7.8e-264, identity: 84.1%)
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR  YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT   PP+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY WSVSEKYILACDS+TLLVKP FYDFF+RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRFQPEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

XP_031740971.1 O-glucosyltransferase rumi homolog isoform X2 [Cucumis sativus] (e-value: 5.3e-244, identity: 79.5%)
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR  YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT   PP+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY                            RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRFQPEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

XP_031740972.1 O-glucosyltransferase rumi homolog isoform X3 [Cucumis sativus] (e-value: 2.2e-237, identity: 77.78%)
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR  YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT   PP+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT                                    ++RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRFQPEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

XP_038875037.1 protein O-glucosyltransferase 1-like [Benincasa hispida] (e-value: 1.1e-286, identity: 89.31%)
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS  I RPSRRPIWQPSLPKRSV VAVPAAA+FLLAALS AV+ISS RLQSTLFS NFLGNQTE  SPKIPKK IKYYPL+CSSSS+TNQT NF 
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR  YPT+FEPES D S RPICPEYFRWIHEDLRPWA GGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWG LQL R+YPGR+PDLEL
Subjt:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADY+T  ++TAGPPPLFRYCGDE+TLDIVFPDWSFWGW EINIRPWE LLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LSQQNDWNARLYIQDWI ESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLL+KP FYDFFTRSLQP+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK QAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF+P IP GAVEVCSET+ACPR G+EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKNKP
        YRRNAN+IRQVEKWEDEFWN+NKP
Subjt:  YRRNANIIRQVEKWEDEFWNKNKP

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KQX4 CAP10 domain-containing protein (e-value: 3.8e-264, identity: 84.1%)
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR  YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT   PP+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY WSVSEKYILACDS+TLLVKP FYDFF+RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRFQPEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

A0A2N9J2X5 CAP10 domain-containing protein (e-value: 2.1e-206, identity: 67.86%)
Query:  WQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES-ID
        WQP L K        A  I     L     IS+ RL ++ F      N T T   K  IP K I  +PLNCS  + T      TC   YP  F P + +D
Subjt:  WQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES-ID

Query:  PSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADY
        PS  P+CP+YFRWIHEDLRPW   GITR+MVEK K++AHFRL +V G+ Y+E YKKSIQTRD+FTIWG LQL RRYPGR+PD+ELMFDCDDRPV+KSADY
Subjt:  PSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADY

Query:  RTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQD
        R   +   GPPPLFRYCGD  T+DIVFPDWSFWGWAEINI+PWE+LLKELK+GN+RSKW++REP+AYWKGNP+VA+TR+DLLKCN+S + DWNARLYIQD
Subjt:  RTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQD

Query:  WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAAS
        WI ESQQGYK+S LASQCTHRYKIYIEGYAWSVSEKYILACDSV+L+VKP +YDFFTRSL+PV HYWPI DD KCKSIKFAV WGN+HKQK QAIGKA+S
Subjt:  WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAAS

Query:  DFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWE
        DFIQ++LKM+ VYDYMFHLLN+YAKLL+F+P+IP GA+E+CSET AC   G EKKFM ESLVK P LT PC+MPPP++   L  L RRN NIIRQVEKWE
Subjt:  DFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWE

Query:  DEFW
        +++W
Subjt:  DEFW

A0A5E4FRC2 PREDICTED: O-glucosyltransferase (e-value: 8.1e-198, identity: 65.74%)
Query:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNKYPTRF-EPESI
        W PS   +S  +A     I  L    +SAA+L S   +Q + F I    N+T   S K  + + +  +PLNCS  S+ NQT   TC   YPT F   + +
Subjt:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNKYPTRF-EPESI

Query:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD
        +PS  PICP+YFR+IH+DL PW   GITR+MVE+ K TAHFRL +V G+ YVE YKKSIQTRD+FTIWG LQL RRYPGR+PDLELMFDCDD+PV++S D
Subjt:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD

Query:  YRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ
        +R P  ++   PPLFRYCGD  T DIVFPDWSFWGWAEINI+PWE LLK+LKKGN+R KWM+REP+AYWKGNP+VA++R+DLLKCN+S   DWNARL+IQ
Subjt:  YRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ

Query:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA
        DWI ESQQG+KQS++ASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+VKP++YDFFTRSLQPV HYWPI  D KCKSIKFAV WGN+HKQK QAIGKAA
Subjt:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA

Query:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW
        SDFIQQ+LKM+ VYDYMFHLLN+YAKLLRF+P+IP GA  +CSE++ACP    EKKFM ESLVK+P +T PC+MPP +   +L  LYRRN N+ +QV+KW
Subjt:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW

Query:  EDEFW
        ED++W
Subjt:  EDEFW

A0A6A1WB87 O-glucosyltransferase rumi (e-value: 3.7e-203, identity: 66.73%)
Query:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES
        R  W+P L K    VA P A +F +  L   + ISS RL ++  S +   N T   S K     K I  +PLNCS  + T       C   YPT F P+ 
Subjt:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES

Query:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA
        +DPS  P+CP YFRWIH+DL PW   GITREMVEK K TAHFRL VV G+ Y+E YKKSIQTRD+FTIWG LQL RRYPGRIPDLELMFDC+D PV++S 
Subjt:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI
        DYR P  +T GPPPLFRYCGD  T DIVFPDWSFWGWAEINIRPWE+LLK+LK+GN R KWM+REP+AYWKGNP V +TR+DLLKCNLS   DWNARLY+
Subjt:  DYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI

Query:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA
        QDWI ES+QGYK+S+LASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+V P+FYDFFTR LQPV HYWPI DD KC+SIKFAV WGN+HK+K QAIGKA
Subjt:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA

Query:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK
        +SDFIQ++LKM+ VYDYMFH+LN YAKLL+F+P+IP GAVE+CSET+AC   GLEKKFM ESLVK P +T PC+MPPPF+   L  LYRRN NII+QV K
Subjt:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK

Query:  WEDEFW------NKNKP
        WE+ +W       K+KP
Subjt:  WEDEFW------NKNKP

M5WWD4 CAP10 domain-containing protein (e-value: 2.8e-198, identity: 65.94%)
Query:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNKYPTRF-EPESI
        W PS   +S  +A     I  L    +SAA+L S   +Q + F I    N+T   S K  + + +  +PLNCS  S+ NQT   TC   YPT F   + +
Subjt:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNKYPTRF-EPESI

Query:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD
        +PS  PICP+YFR+IH+DL PW   GITR+MVE+ K TAHFRL +V G+ YVE YKKSIQTRD+FTIWG LQL RRYPGR+PDLELMFDCDD+PV++S D
Subjt:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD

Query:  YRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ
        +R P  ++   PPLFRYCGD  T DIVFPDWSFWGWAEINI+PWE LLK+LKKGN+R KWM+REP+AYWKGNP+VA++R+DLLKCN+S   DWNARL+IQ
Subjt:  YRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ

Query:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA
        DWI ESQQG+KQS++ASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+VKP++YDFFTRSLQPV HYWPI  D KCKSIKFAV WGN+HKQK QAIGKAA
Subjt:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA

Query:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW
        SDFIQQ+LKM+ VYDYMFHLLN+YAKLLRF+P+IP GA  +CSE++ACP    EKKFM ESLVK+P +T PC+MPP F   +L  LYRRN N+ +QV+KW
Subjt:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW

Query:  EDEFW
        ED++W
Subjt:  EDEFW

SwissProt top hits (e-value, % identity, alignment)
A0NDG6 O-glucosyltransferase rumi homolog (e-value: 1.2e-20, identity: 25.22%)
Query:  DLRPWATGGITREMVEKGKA-TAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPLFR
        DL+P+   GIT+EM+ + K    H++  V+G ++Y     +  +        G     R     +PD++L+ +C D P +    +R  + +     P+  
Subjt:  DLRPWATGGITREMVEKGKA-TAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPLFR

Query:  YCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQG-
        +    E LDI++P W+FW G   I + P     W+   + + K +  + W  +EP A+++G+   +D R  L+  + +Q +  +A+ Y ++   +S Q  
Subjt:  YCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQG-

Query:  -----YKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFI
              ++  L   C +R+     G A S   K++  C S+   V  ++ +FF  SL+P  HY P+      + ++  + +   H Q  +AI +   + I
Subjt:  -----YKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFI

Query:  QQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEV
           L+M +V  Y   LL +Y KL+R+  E     +EV
Subjt:  QQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEV

B0X1Q4 O-glucosyltransferase rumi homolog (e-value: 3.4e-20, identity: 23.02%)
Query:  CSSSSSTNQTHNF--TCRNKYPTRFEP--ESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLF---
        C+ +     T+N      NKY T  E    +  P +   C  +   +  DLRP+ + GIT++++E  ++    +  ++G R++        + RD     
Subjt:  CSSSSSTNQTHNF--TCRNKYPTRFEP--ESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLF---

Query:  TIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKKGNERSK
           G     R    ++PD+EL+ +C D P + S  +      +  P P+  +    + LDI++P W FW G   I++ P     W+     ++K  +   
Subjt:  TIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKKGNERSK

Query:  WMQREPFAYWKG-------NPYV--ADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVK
        W ++   A+++G       +P V  +  R +L+    ++   W +    +D +    +  ++  L   C ++Y     G A S   K++  C S+   V 
Subjt:  WMQREPFAYWKG-------NPYV--ADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVK

Query:  PKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEV
         ++ +FF  SL+P  HY P+        ++  + +   H Q  Q I     + I   L+ME+V  Y   LL +Y KL++++ +     VE+
Subjt:  PKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEV

Q5E9Q1 Protein O-glucosyltransferase 1 (e-value: 2.8e-22, identity: 24.79%)
Query:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD
        E+ +P   P C  Y   I EDL P+  GGI+R+M   V + K   H++  ++  R+Y E    +       + F +           GR+PD+E++ +  
Subjt:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD

Query:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD
        D P V    +  P +      P+F +    E  DI++P W+FW  G A   I P     W+   ++L +   +  W ++   AY++G       +P +  
Subjt:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD

Query:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK
        +R++  L+    ++   W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V  ++ +FF   L+P  HY P+  D  
Subjt:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK

Query:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF
          +++  + +  ++    Q I +  S FI   LKM+++  Y  +LL +Y+K L +
Subjt:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF

Q8BYB9 Protein O-glucosyltransferase 1 (e-value: 3.4e-20, identity: 24.23%)
Query:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD
        E+ +P     C  Y   I EDL P+  GGI+R+M   V + K   H++  ++  R++ E    +       + F +            R+PD+E++ +  
Subjt:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD

Query:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD
        D P V    +  PT+      P+F +    E  DI++P W+FW  G A   + P     W+   ++L +   +  W ++   AY++G       +P +  
Subjt:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD

Query:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK
        +R++  L+    ++   W +   ++D +   +   K  +L   C +RY     G A S   K++  C S+   V  ++ +FF   L+P  HY P+  D  
Subjt:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK

Query:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF
          +++  + +  ++    Q I K  S FI   L+M+++  Y  +LL  Y+K L +
Subjt:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF

Q8NBL1 Protein O-glucosyltransferase 1 (e-value: 1.8e-21, identity: 24.23%)
Query:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD
        E+ +P     C  Y   I EDL P+  GGI+R+M   V + K   H++  +   R+Y E+   +       + F +           GR+PD+E++ +  
Subjt:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD

Query:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD
        D P V    +  P +      P+F +    E  DI++P W+FW  G A   I P     W+   ++L +   +  W ++   AY++G       +P +  
Subjt:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD

Query:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK
        +R++  L+    ++   W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V  ++ +FF   L+P  HY P+  D  
Subjt:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK

Query:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF
          +++  + +  ++    Q I +  S FI+  L+M+++  Y  +LL++Y+K L +
Subjt:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF

Arabidopsis top hits (e-value, % identity, alignment)
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821) (e-value: 5.8e-172, identity: 60.91%)
Query:  SSSTNQTHNFTCRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSR
        SS  NQ  + +C     + +     + S+R  CP+YF+WIHEDL+PW   GIT+EMVE+GK TAHFRL ++ G+V+VE+YKKSIQTRD FT+WG LQL R
Subjt:  SSSTNQTHNFTCRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSR

Query:  RYPGRIPDLELMFDCDDRPVVKSADYR--TPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNP
        +YPG++PD++LMFDCDDRPV++S  Y     TV+ A PPPLFRYCGD  T+DIVFPDWSFWGW EINIR W  +LKE+++G ++ K+M+R+ +AYWKGNP
Subjt:  RYPGRIPDLELMFDCDDRPVVKSADYR--TPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNP

Query:  YVAD-TRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISD
        +VA  +R+DLL CNLS  +DWNAR++IQDWI E Q+G++ SN+A+QCT+RYKIYIEGYAWSVSEKYILACDSVTL+VKP +YDFF+R+LQP++HYWPI D
Subjt:  YVAD-TRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISD

Query:  DHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPR-----GGLEKKFMKESLVKTPP
          KC+SIKFAV W N+H QK Q IG+ AS+F+Q+DL MENVYDYMFHLLN+Y+KLL+++P++P  +VE+C+E + CP       G++KKFM  SLV  P 
Subjt:  DHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPR-----GGLEKKFMKESLVKTPP

Query:  LTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK
         + PCS+PPPFD+  L++ +R+  N+IRQVEKWED +W K
Subjt:  LTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK

AT2G45830.1 downstream target of AGL15 2 (e-value: 8.9e-157, identity: 52.85%)
Query:  AVPAAAIFLLAAL--SAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPESIDPSDRPICPEYFRW
        ++  A +FL+ +L  SA +L     L        F G +  T+S +      + +P  C      NQT  F          +P S   S    CP YFRW
Subjt:  AVPAAAIFLLAAL--SAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPESIDPSDRPICPEYFRW

Query:  IHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPL
        IHEDLRPW   G+TR M+EK + TAHFR+ ++ GRVYV+ Y+KSIQTRD+FT+WG +QL R YPGR+PDLELMFD DDRP V+S D++        PPPL
Subjt:  IHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPL

Query:  FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSN
        FRYC D+ +LDIVFPDWSFWGWAE+NI+PW+  L  +++GN+ ++W  R  +AYW+GNP VA TR+DLL+CN+S Q DWN RLYIQDW RES++G+K SN
Subjt:  FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSN

Query:  LASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVY
        L +QCTHRYKIYIEG+AWSVSEKYI+ACDS+TL V+P FYDF+ R + P++HYWPI D  KC S+KFAVHWGN+H  +   IG+  S FI++++KME VY
Subjt:  LASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVY

Query:  DYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWN
        DYMFHL+N+YAKLL+F+PEIP GA E+  + + C   G  + FM+ES+V  P    PC MP PF+   L+ +  R  N+ RQVE WED++++
Subjt:  DYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWN

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821) (e-value: 2.7e-177, identity: 59.8%)
Query:  IFLL--AALSAAVLIS-STRLQSTLFSINFLGNQTETSSPKIPK--KYI----KYYPLNCSSSSSTNQTHNFTC-RNKYPTRF-----EPESIDPSDRPI
        +FLL  A LS  +L+  S  ++    S+     +  T SP+ P+  K I    K + LNC++ S  +     TC ++ YPT F     E ES D S    
Subjt:  IFLL--AALSAAVLIS-STRLQSTLFSINFLGNQTETSSPKIPK--KYI----KYYPLNCSSSSSTNQTHNFTC-RNKYPTRF-----EPESIDPSDRPI

Query:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD
        CP+YFRWIHEDLRPW   GITRE +E+  ATA FRLA++ GR+YVE ++++ QTRD+FTIWGF+QL RRYPG+IPDLELMFDC D PVVK+A++    VD
Subjt:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD

Query:  TAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ
           PPPLFRYC ++ETLDIVFPDWS+WGWAE+NI+PWE+LLKEL++GN+R+KW+ REP+AYWKGNP VA+TR DL+KCNLS+  DW ARLY QDW++ES+
Subjt:  TAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ

Query:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD
        +GYKQS+LASQC HRYKIYIEG AWSVSEKYILACDSVTL+VKP +YDFFTR + P  HYWP+ +D KC+SIKFAV WGN H +K Q IGK AS+F+QQ+
Subjt:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD

Query:  LKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK
        LKM+ VYDYMFHLL QY+KLLRF+PEIP  + E+CSE +ACPR G E+KFM ESLVK P  T PC+MPPP+D  S   + +R  +   ++E+WE ++W K
Subjt:  LKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821) (e-value: 1.2e-153, identity: 50.5%)
Query:  SLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPESIDPSDRPI
        + P++S+   V A    ++  +SAA+L     L       N       T+  K P  Y   +  N SS +  +Q           +R  P +   S    
Subjt:  SLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPESIDPSDRPI

Query:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD
        CP YFRWIHEDLRPW   GITR M+E+   TAHFRL +  G+ YV+ YKKSIQTRD FT+WG LQL R YPG++PDLELMFD DDRPVV+S D+     +
Subjt:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD

Query:  TAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ
           PPP+FRYC D+ +LDIVFPDWSFWGWAE+N++PW   L+ +K+GN  ++W  R  +AYW+GNPYV   R DLLKCN ++  +WN RLYIQDW +E++
Subjt:  TAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ

Query:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD
        +G+K SNL +QCTHRYKIYIEG+AWSVSEKYI+ACDS+TL VKP+FYDF+ R + P++HYWPI DD KC S+KFAVHWGN+H+ K + IG+  S FI+++
Subjt:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD

Query:  LKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK
        + M+ VYDYMFHLL +YA LL+F+PEIP+ A E+  +++ CP     + F  ES++ +P    PC M PP+D  +L+ +  R AN+ RQVE WE++++  
Subjt:  LKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK

Query:  --NKP
          NKP
Subjt:  --NKP

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821) (e-value: 1.8e-181, identity: 57.77%)
Query:  IWQPSL-------PKRSVTVAVPAAAIFLLAALSAAVLISST-RLQSTLFSINFLGNQTETSSPKIPKKYI-------KYYPLNCSSSSSTNQTHNFTCR
        IW P +       P RS  +      + + A +S  +L+ +T  L+    +      QT+T +PK P+            + L+CS++ +T    +    
Subjt:  IWQPSL-------PKRSVTVAVPAAAIFLLAALSAAVLISST-RLQSTLFSINFLGNQTETSSPKIPKKYI-------KYYPLNCSSSSSTNQTHNFTCR

Query:  NKYP--TRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        NKYP  T FE +  +      CP+YFRWIHEDLRPW+  GITRE +E+ K TA FRLA+VGG++YVE ++ + QTRD+FTIWGFLQL R+YPG+IPDLEL
Subjt:  NKYP--TRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDC D PVV++ ++     +   PPPLFRYCG+EETLDIVFPDWSFWGWAE+NI+PWE+LLKEL++GNER+KW+ REP+AYWKGNP VA+TRQDL+KCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        +S++++WNARLY QDWI+ES++GYKQS+LASQC HRYKIYIEG AWSVSEKYILACDSVTLLVKP +YDFFTR L P  HYWP+ +  KC+SIKFAV WG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSH QK Q IGKAASDFIQQDLKM+ VYDYM+HLL +Y+KLL+F+PEIP  AVE+CSET+AC R G E+KFM ESLVK P  + PC+MPPP+D  +   +
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNK
         +R  +   ++ +WE ++W+K
Subjt:  YRRNANIIRQVEKWEDEFWNK
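
The % identity values listed for each hit can be roughly cross-checked against the alignment blocks. The sketch below assumes the usual BLAST convention for the unlabeled middle line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical, and the database does not state how its own figure is computed, so a small difference from the listed value would not be surprising.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percentage of alignment columns whose midline shows an identical residue."""
    # Letters on the midline are identities; '+' and spaces are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Fragment taken from the AT3G61270.1 alignment above (27 columns).
query = "CPEYFRWIHEDLRPWATGGITREMVEK"
midline = "CP YFRWIHEDLRPW   GITR M+E+"

print(round(percent_identity(midline, len(query)), 2))
```

To score a whole hit, the midlines of all its blocks would be concatenated and divided by the total number of aligned columns, gap columns included.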


Sequences
CDS sequence
ATGCAGAGATTCCCAATTTCAATTTCAATTTTGCGACCTTCCCGGAGACCCATTTGGCAACCGTCTCTTCCAAAACGCTCCGTCACCGTCGCAGTTCCTGCCGCCGCCAT
CTTCCTCCTCGCCGCCCTCTCCGCCGCCGTCCTCATTTCCTCCACTCGCCTCCAATCCACTCTGTTTTCCATTAACTTTCTCGGAAACCAAACAGAAACAAGCTCACCAA
AAATCCCCAAAAAATACATAAAATATTACCCACTTAACTGTTCTTCATCTTCCTCCACAAACCAAACCCACAATTTCACCTGCCGGAATAAATACCCGACCCGATTCGAA
CCCGAATCCATCGACCCATCTGATCGGCCCATTTGCCCTGAGTACTTCCGATGGATACACGAGGATCTGCGGCCGTGGGCGACGGGCGGAATCACGAGGGAGATGGTGGA
GAAAGGGAAGGCGACGGCGCATTTCCGGCTGGCGGTGGTGGGCGGGAGGGTCTACGTGGAGCACTATAAGAAGTCGATTCAAACGAGGGATTTGTTTACGATTTGGGGGT
TTTTGCAGCTTTCGAGAAGGTACCCAGGGCGAATCCCTGATTTGGAGCTGATGTTCGACTGTGATGACCGGCCGGTTGTTAAATCGGCTGATTACCGGACTCCGACGGTG
GATACGGCGGGGCCGCCGCCGCTGTTCCGGTATTGCGGCGATGAGGAGACGTTGGACATTGTGTTTCCGGACTGGTCCTTCTGGGGATGGGCAGAGATAAATATAAGGCC
ATGGGAGAATTTGTTGAAGGAATTGAAAAAAGGGAATGAAAGAAGCAAATGGATGCAAAGGGAACCATTTGCTTATTGGAAAGGAAACCCTTATGTGGCTGACACAAGAC
AAGATCTTCTCAAATGCAACCTCTCCCAACAAAATGATTGGAATGCTCGCCTCTACATTCAGGATTGGATCCGAGAGTCTCAACAAGGTTATAAGCAATCTAACTTGGCA
AGCCAATGCACTCATAGGTACAAGATCTACATAGAAGGATATGCATGGTCAGTGAGTGAAAAATACATATTGGCATGTGATTCAGTGACATTGCTTGTAAAGCCCAAATT
TTATGATTTCTTCACTAGATCTTTACAGCCAGTTCGCCATTATTGGCCTATTAGTGATGACCATAAGTGCAAATCCATCAAATTTGCTGTCCATTGGGGCAATTCCCACA
AACAAAAGGTACAAGCTATAGGAAAAGCAGCGAGTGACTTCATCCAACAAGACTTAAAGATGGAAAATGTGTATGACTACATGTTTCATCTCCTCAACCAATACGCCAAG
CTCCTTCGATTTCAGCCCGAAATCCCGATGGGCGCAGTGGAAGTCTGCTCCGAGACGGTGGCTTGCCCGAGGGGCGGGCTGGAGAAGAAGTTCATGAAGGAATCCTTGGT
GAAAACTCCCCCTCTCACCATCCCTTGCTCCATGCCACCACCCTTTGATACCCCTTCTCTCCAAAGGCTTTATAGAAGAAATGCCAACATAATAAGGCAAGTGGAGAAAT
GGGAAGATGAGTTTTGGAATAAAAATAAACCTTAA
mRNA sequence
ATGCAGAGATTCCCAATTTCAATTTCAATTTTGCGACCTTCCCGGAGACCCATTTGGCAACCGTCTCTTCCAAAACGCTCCGTCACCGTCGCAGTTCCTGCCGCCGCCAT
CTTCCTCCTCGCCGCCCTCTCCGCCGCCGTCCTCATTTCCTCCACTCGCCTCCAATCCACTCTGTTTTCCATTAACTTTCTCGGAAACCAAACAGAAACAAGCTCACCAA
AAATCCCCAAAAAATACATAAAATATTACCCACTTAACTGTTCTTCATCTTCCTCCACAAACCAAACCCACAATTTCACCTGCCGGAATAAATACCCGACCCGATTCGAA
CCCGAATCCATCGACCCATCTGATCGGCCCATTTGCCCTGAGTACTTCCGATGGATACACGAGGATCTGCGGCCGTGGGCGACGGGCGGAATCACGAGGGAGATGGTGGA
GAAAGGGAAGGCGACGGCGCATTTCCGGCTGGCGGTGGTGGGCGGGAGGGTCTACGTGGAGCACTATAAGAAGTCGATTCAAACGAGGGATTTGTTTACGATTTGGGGGT
TTTTGCAGCTTTCGAGAAGGTACCCAGGGCGAATCCCTGATTTGGAGCTGATGTTCGACTGTGATGACCGGCCGGTTGTTAAATCGGCTGATTACCGGACTCCGACGGTG
GATACGGCGGGGCCGCCGCCGCTGTTCCGGTATTGCGGCGATGAGGAGACGTTGGACATTGTGTTTCCGGACTGGTCCTTCTGGGGATGGGCAGAGATAAATATAAGGCC
ATGGGAGAATTTGTTGAAGGAATTGAAAAAAGGGAATGAAAGAAGCAAATGGATGCAAAGGGAACCATTTGCTTATTGGAAAGGAAACCCTTATGTGGCTGACACAAGAC
AAGATCTTCTCAAATGCAACCTCTCCCAACAAAATGATTGGAATGCTCGCCTCTACATTCAGGATTGGATCCGAGAGTCTCAACAAGGTTATAAGCAATCTAACTTGGCA
AGCCAATGCACTCATAGGTACAAGATCTACATAGAAGGATATGCATGGTCAGTGAGTGAAAAATACATATTGGCATGTGATTCAGTGACATTGCTTGTAAAGCCCAAATT
TTATGATTTCTTCACTAGATCTTTACAGCCAGTTCGCCATTATTGGCCTATTAGTGATGACCATAAGTGCAAATCCATCAAATTTGCTGTCCATTGGGGCAATTCCCACA
AACAAAAGGTACAAGCTATAGGAAAAGCAGCGAGTGACTTCATCCAACAAGACTTAAAGATGGAAAATGTGTATGACTACATGTTTCATCTCCTCAACCAATACGCCAAG
CTCCTTCGATTTCAGCCCGAAATCCCGATGGGCGCAGTGGAAGTCTGCTCCGAGACGGTGGCTTGCCCGAGGGGCGGGCTGGAGAAGAAGTTCATGAAGGAATCCTTGGT
GAAAACTCCCCCTCTCACCATCCCTTGCTCCATGCCACCACCCTTTGATACCCCTTCTCTCCAAAGGCTTTATAGAAGAAATGCCAACATAATAAGGCAAGTGGAGAAAT
GGGAAGATGAGTTTTGGAATAAAAATAAACCTTAA
Protein sequence
MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFE
PESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTV
DTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLA
SQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAK
LLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNKNKP
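
The protein sequence should follow from the CDS by translation. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the snippet below translates the first and last 18 nucleotides of the CDS listed above. The names `CODON_TABLE` and `translate` are illustrative and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCAGAGATTCCCAATT"))  # first 18 nt of the CDS -> MQRFPI
print(translate("AATAAAAATAAACCTTAA"))  # last 18 nt of the CDS  -> NKNKP
```

Both fragments agree with the start (MQRFPI) and end (NKNKP) of the protein sequence above. Passing the full CDS, with its line breaks, to `translate` would allow the complete sequence to be compared in the same way.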