; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC01G002940 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC01G002940
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionArabidopsis thaliana protein of unknown function (DUF821)
Genome locationCiama_Chr01:3065859..3070486
RNA-Seq ExpressionCaUC01G002940
SyntenyCaUC01G002940
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006598 - Glycosyl transferase CAP10 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB1222531.1 O-glucosyltransferase rumi [Morella rubra]2.9e-20266.73Show/hide
Query:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNNYPTRFEPES
        R  W+P L K    VA P A +F +  L   + ISS RL ++  S +   N T   S K     K I  +PLNCS  + T       C  NYPT F P+ 
Subjt:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNNYPTRFEPES

Query:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA
        +DPS  P+CP YFRWIH+DL PW   GITREMVEK K TAHFRL VV G+ Y+E YKKSIQTRD+FTIWG LQL RRYPGRIPDLELMFDC+D PV++S 
Subjt:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI
        DYR P  +T GP PLFRYCGD  T DIVFPDWSFWGWAEINIRPWE+LLK+LK+GN R KWM+REP+AYWKGNP V +TR+DLLKCNLS   DWNARLY+
Subjt:  DYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI

Query:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA
        QDWI ES+QGYK+S+LASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+V P+FYDFFTR LQPV HYWPI DD KC+SIKFAV WGN+HK+K QAIGKA
Subjt:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA

Query:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK
        +SDFIQ++LKM+ VYDYMFH+LN YAKLL+F P+IP GAVE+CSET+AC   GLEKKFM ESLVK P +T PC+MPPPF+   L  LYRRN NII+QV K
Subjt:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK

Query:  WEDEFW------NKNKP
        WE+ +W       K+KP
Subjt:  WEDEFW------NKNKP

XP_004143920.1 O-glucosyltransferase rumi homolog isoform X1 [Cucumis sativus]1.1e-26283.72Show/hide
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR +YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT    P+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY WSVSEKYILACDS+TLLVKP FYDFF+RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRF+PEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

XP_031740971.1 O-glucosyltransferase rumi homolog isoform X2 [Cucumis sativus]7.6e-24379.12Show/hide
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR +YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT    P+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY                            RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRF+PEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

XP_031740972.1 O-glucosyltransferase rumi homolog isoform X3 [Cucumis sativus]3.1e-23677.39Show/hide
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR +YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT    P+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT                                    ++RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRF+PEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

XP_038875037.1 protein O-glucosyltransferase 1-like [Benincasa hispida]6.6e-28789.5Show/hide
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS  I RPSRRPIWQPSLPKRSV VAVPAAA+FLLAALS AV+ISS RLQSTLFS NFLGNQTE  SPKIPKK IKYYPL+CSSSS+TNQT NF 
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR NYPT+FEPES D S RPICPEYFRWIHEDLRPWA GGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWG LQL R+YPGR+PDLEL
Subjt:  CRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADY+T  ++TAGP PLFRYCGDE+TLDIVFPDWSFWGW EINIRPWE LLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LSQQNDWNARLYIQDWI ESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLL+KP FYDFFTRSLQP+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK QAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRP IP GAVEVCSET+ACPR G+EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKNKP
        YRRNAN+IRQVEKWEDEFWN+NKP
Subjt:  YRRNANIIRQVEKWEDEFWNKNKP

TrEMBL top hitse value%identityAlignment
A0A0A0KQX4 CAP10 domain-containing protein5.5e-26383.72Show/hide
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR +YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT    P+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY WSVSEKYILACDS+TLLVKP FYDFF+RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRF+PEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

A0A2N9J2X5 CAP10 domain-containing protein1.1e-20567.86Show/hide
Query:  WQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNNYPTRFEPES-ID
        WQP L K        A  I     L     IS+ RL ++ F      N T T   K  IP K I  +PLNCS  + T      TC  NYP  F P + +D
Subjt:  WQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNNYPTRFEPES-ID

Query:  PSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADY
        PS  P+CP+YFRWIHEDLRPW   GITR+MVEK K++AHFRL +V G+ Y+E YKKSIQTRD+FTIWG LQL RRYPGR+PD+ELMFDCDDRPV+KSADY
Subjt:  PSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADY

Query:  RTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQD
        R   +   GP PLFRYCGD  T+DIVFPDWSFWGWAEINI+PWE+LLKELK+GN+RSKW++REP+AYWKGNP+VA+TR+DLLKCN+S + DWNARLYIQD
Subjt:  RTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQD

Query:  WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAAS
        WI ESQQGYK+S LASQCTHRYKIYIEGYAWSVSEKYILACDSV+L+VKP +YDFFTRSL+PV HYWPI DD KCKSIKFAV WGN+HKQK QAIGKA+S
Subjt:  WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAAS

Query:  DFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWE
        DFIQ++LKM+ VYDYMFHLLN+YAKLL+F P+IP GA+E+CSET AC   G EKKFM ESLVK P LT PC+MPPP++   L  L RRN NIIRQVEKWE
Subjt:  DFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWE

Query:  DEFW
        +++W
Subjt:  DEFW

A0A5E4FRC2 PREDICTED: O-glucosyltransferase6.8e-19765.94Show/hide
Query:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNNYPTRF-EPESI
        W PS   +S  +A     I  L    +SAA+L S   +Q + F I    N+T   S K  + + +  +PLNCS  S+ NQT   TC  +YPT F   + +
Subjt:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNNYPTRF-EPESI

Query:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD
        +PS  PICP+YFR+IH+DL PW   GITR+MVE+ K TAHFRL +V G+ YVE YKKSIQTRD+FTIWG LQL RRYPGR+PDLELMFDCDD+PV++S D
Subjt:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD

Query:  YRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ
        +R P   T  P PLFRYCGD  T DIVFPDWSFWGWAEINI+PWE LLK+LKKGN+R KWM+REP+AYWKGNP+VA++R+DLLKCN+S   DWNARL+IQ
Subjt:  YRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ

Query:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA
        DWI ESQQG+KQS++ASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+VKP++YDFFTRSLQPV HYWPI  D KCKSIKFAV WGN+HKQK QAIGKAA
Subjt:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA

Query:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW
        SDFIQQ+LKM+ VYDYMFHLLN+YAKLLRF P+IP GA  +CSE++ACP    EKKFM ESLVK+P +T PC+MPP +   +L  LYRRN N+ +QV+KW
Subjt:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW

Query:  EDEFW
        ED++W
Subjt:  EDEFW

A0A6A1WB87 O-glucosyltransferase rumi1.4e-20266.73Show/hide
Query:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNNYPTRFEPES
        R  W+P L K    VA P A +F +  L   + ISS RL ++  S +   N T   S K     K I  +PLNCS  + T       C  NYPT F P+ 
Subjt:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNNYPTRFEPES

Query:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA
        +DPS  P+CP YFRWIH+DL PW   GITREMVEK K TAHFRL VV G+ Y+E YKKSIQTRD+FTIWG LQL RRYPGRIPDLELMFDC+D PV++S 
Subjt:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI
        DYR P  +T GP PLFRYCGD  T DIVFPDWSFWGWAEINIRPWE+LLK+LK+GN R KWM+REP+AYWKGNP V +TR+DLLKCNLS   DWNARLY+
Subjt:  DYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI

Query:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA
        QDWI ES+QGYK+S+LASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+V P+FYDFFTR LQPV HYWPI DD KC+SIKFAV WGN+HK+K QAIGKA
Subjt:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA

Query:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK
        +SDFIQ++LKM+ VYDYMFH+LN YAKLL+F P+IP GAVE+CSET+AC   GLEKKFM ESLVK P +T PC+MPPPF+   L  LYRRN NII+QV K
Subjt:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK

Query:  WEDEFW------NKNKP
        WE+ +W       K+KP
Subjt:  WEDEFW------NKNKP

M5WWD4 CAP10 domain-containing protein2.3e-19766.14Show/hide
Query:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNNYPTRF-EPESI
        W PS   +S  +A     I  L    +SAA+L S   +Q + F I    N+T   S K  + + +  +PLNCS  S+ NQT   TC  +YPT F   + +
Subjt:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNNYPTRF-EPESI

Query:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD
        +PS  PICP+YFR+IH+DL PW   GITR+MVE+ K TAHFRL +V G+ YVE YKKSIQTRD+FTIWG LQL RRYPGR+PDLELMFDCDD+PV++S D
Subjt:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD

Query:  YRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ
        +R P   T  P PLFRYCGD  T DIVFPDWSFWGWAEINI+PWE LLK+LKKGN+R KWM+REP+AYWKGNP+VA++R+DLLKCN+S   DWNARL+IQ
Subjt:  YRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ

Query:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA
        DWI ESQQG+KQS++ASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+VKP++YDFFTRSLQPV HYWPI  D KCKSIKFAV WGN+HKQK QAIGKAA
Subjt:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA

Query:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW
        SDFIQQ+LKM+ VYDYMFHLLN+YAKLLRF P+IP GA  +CSE++ACP    EKKFM ESLVK+P +T PC+MPP F   +L  LYRRN N+ +QV+KW
Subjt:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW

Query:  EDEFW
        ED++W
Subjt:  EDEFW

SwissProt top hitse value%identityAlignment
A0NDG6 O-glucosyltransferase rumi homolog5.3e-2125.22Show/hide
Query:  DLRPWATGGITREMVEKGKA-TAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPLPLFR
        DL+P+   GIT+EM+ + K    H++  V+G ++Y     +  +        G     R     +PD++L+ +C D P +    +R  + +    +P+  
Subjt:  DLRPWATGGITREMVEKGKA-TAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPLPLFR

Query:  YCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQG-
        +    E LDI++P W+FW G   I + P     W+   + + K +  + W  +EP A+++G+   +D R  L+  + +Q +  +A+ Y ++   +S Q  
Subjt:  YCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQG-

Query:  -----YKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFI
              ++  L   C +R+     G A S   K++  C S+   V  ++ +FF  SL+P  HY P+      + ++  + +   H Q  +AI +   + I
Subjt:  -----YKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFI

Query:  QQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEV
           L+M +V  Y   LL +Y KL+R+  E     +EV
Subjt:  QQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEV

G3V9D0 Protein O-glucosyltransferase 13.4e-2024.43Show/hide
Query:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRP
        E+ +P     C  Y   I EDL P+  GGI+R+M   V + +   H++  ++  R++ E       +R        L++ R    R+PD+E++ +  D P
Subjt:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRP

Query:  VVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVADTRQ
         V    +  PT+      P+F +    E  DI++P W+FW  G A   + P     W+   ++L +   +  W ++   AY++G       +P +  +R+
Subjt:  VVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVADTRQ

Query:  D--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKS
        +  L+    ++   W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V  ++ +FF   L+P  HY P+  D     
Subjt:  D--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKS

Query:  IKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF
        ++  + +  ++    Q I K  S FI   L+M+++  Y  +LL +Y+K L +
Subjt:  IKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF

Q5E9Q1 Protein O-glucosyltransferase 12.2e-2224.79Show/hide
Query:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD
        E+ +P   P C  Y   I EDL P+  GGI+R+M   V + K   H++  ++  R+Y E    +       + F +           GR+PD+E++ +  
Subjt:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD

Query:  DRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD
        D P V    +  P +      P+F +    E  DI++P W+FW  G A   I P     W+   ++L +   +  W ++   AY++G       +P +  
Subjt:  DRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD

Query:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK
        +R++  L+    ++   W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V  ++ +FF   L+P  HY P+  D  
Subjt:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK

Query:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF
          +++  + +  ++    Q I +  S FI   LKM+++  Y  +LL +Y+K L +
Subjt:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF

Q8BYB9 Protein O-glucosyltransferase 12.6e-2024.23Show/hide
Query:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD
        E+ +P     C  Y   I EDL P+  GGI+R+M   V + K   H++  ++  R++ E    +       + F +            R+PD+E++ +  
Subjt:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD

Query:  DRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD
        D P V    +  PT+      P+F +    E  DI++P W+FW  G A   + P     W+   ++L +   +  W ++   AY++G       +P +  
Subjt:  DRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD

Query:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK
        +R++  L+    ++   W +   ++D +   +   K  +L   C +RY     G A S   K++  C S+   V  ++ +FF   L+P  HY P+  D  
Subjt:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK

Query:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF
          +++  + +  ++    Q I K  S FI   L+M+++  Y  +LL  Y+K L +
Subjt:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF

Q8NBL1 Protein O-glucosyltransferase 11.4e-2124.23Show/hide
Query:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD
        E+ +P     C  Y   I EDL P+  GGI+R+M   V + K   H++  +   R+Y E+   +       + F +           GR+PD+E++ +  
Subjt:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD

Query:  DRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD
        D P V    +  P +      P+F +    E  DI++P W+FW  G A   I P     W+   ++L +   +  W ++   AY++G       +P +  
Subjt:  DRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD

Query:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK
        +R++  L+    ++   W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V  ++ +FF   L+P  HY P+  D  
Subjt:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK

Query:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF
          +++  + +  ++    Q I +  S FI+  L+M+++  Y  +LL++Y+K L +
Subjt:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF

Arabidopsis top hitse value%identityAlignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)3.7e-17160.68Show/hide
Query:  SSSTNQTHNFTCRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSR
        SS  NQ  + +C     + +     + S+R  CP+YF+WIHEDL+PW   GIT+EMVE+GK TAHFRL ++ G+V+VE+YKKSIQTRD FT+WG LQL R
Subjt:  SSSTNQTHNFTCRNNYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSR

Query:  RYPGRIPDLELMFDCDDRPVVKSADYR--TPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNP
        +YPG++PD++LMFDCDDRPV++S  Y     TV+ A P PLFRYCGD  T+DIVFPDWSFWGW EINIR W  +LKE+++G ++ K+M+R+ +AYWKGNP
Subjt:  RYPGRIPDLELMFDCDDRPVVKSADYR--TPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNP

Query:  YVAD-TRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISD
        +VA  +R+DLL CNLS  +DWNAR++IQDWI E Q+G++ SN+A+QCT+RYKIYIEGYAWSVSEKYILACDSVTL+VKP +YDFF+R+LQP++HYWPI D
Subjt:  YVAD-TRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISD

Query:  DHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPR-----GGLEKKFMKESLVKTPP
          KC+SIKFAV W N+H QK Q IG+ AS+F+Q+DL MENVYDYMFHLLN+Y+KLL+++P++P  +VE+C+E + CP       G++KKFM  SLV  P 
Subjt:  DHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPR-----GGLEKKFMKESLVKTPP

Query:  LTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK
         + PCS+PPPFD+  L++ +R+  N+IRQVEKWED +W K
Subjt:  LTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK

AT2G45830.1 downstream target of AGL15 24.4e-15652.43Show/hide
Query:  AVPAAAIFLLAAL--SAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQ--THNFTCRNNYPTRFEPESIDPSDRPICPEYF
        ++  A +FL+ +L  SA +L     L        F G +  T+S +      + +P  C    +  Q    N + RNN   R     I       CP YF
Subjt:  AVPAAAIFLLAAL--SAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQ--THNFTCRNNYPTRFEPESIDPSDRPICPEYF

Query:  RWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPL
        RWIHEDLRPW   G+TR M+EK + TAHFR+ ++ GRVYV+ Y+KSIQTRD+FT+WG +QL R YPGR+PDLELMFD DDRP V+S D++        P 
Subjt:  RWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPL

Query:  PLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQ
        PLFRYC D+ +LDIVFPDWSFWGWAE+NI+PW+  L  +++GN+ ++W  R  +AYW+GNP VA TR+DLL+CN+S Q DWN RLYIQDW RES++G+K 
Subjt:  PLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQ

Query:  SNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMEN
        SNL +QCTHRYKIYIEG+AWSVSEKYI+ACDS+TL V+P FYDF+ R + P++HYWPI D  KC S+KFAVHWGN+H  +   IG+  S FI++++KME 
Subjt:  SNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMEN

Query:  VYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWN
        VYDYMFHL+N+YAKLL+F+PEIP GA E+  + + C   G  + FM+ES+V  P    PC MP PF+   L+ +  R  N+ RQVE WED++++
Subjt:  VYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWN

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)1.0e-17659.8Show/hide
Query:  IFLL--AALSAAVLIS-STRLQSTLFSINFLGNQTETSSPKIPK--KYI----KYYPLNCSSSSSTNQTHNFTC-RNNYPTRF-----EPESIDPSDRPI
        +FLL  A LS  +L+  S  ++    S+     +  T SP+ P+  K I    K + LNC++ S  +     TC ++NYPT F     E ES D S    
Subjt:  IFLL--AALSAAVLIS-STRLQSTLFSINFLGNQTETSSPKIPK--KYI----KYYPLNCSSSSSTNQTHNFTC-RNNYPTRF-----EPESIDPSDRPI

Query:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD
        CP+YFRWIHEDLRPW   GITRE +E+  ATA FRLA++ GR+YVE ++++ QTRD+FTIWGF+QL RRYPG+IPDLELMFDC D PVVK+A++    VD
Subjt:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD

Query:  TAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ
           P PLFRYC ++ETLDIVFPDWS+WGWAE+NI+PWE+LLKEL++GN+R+KW+ REP+AYWKGNP VA+TR DL+KCNLS+  DW ARLY QDW++ES+
Subjt:  TAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ

Query:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD
        +GYKQS+LASQC HRYKIYIEG AWSVSEKYILACDSVTL+VKP +YDFFTR + P  HYWP+ +D KC+SIKFAV WGN H +K Q IGK AS+F+QQ+
Subjt:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD

Query:  LKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK
        LKM+ VYDYMFHLL QY+KLLRF+PEIP  + E+CSE +ACPR G E+KFM ESLVK P  T PC+MPPP+D  S   + +R  +   ++E+WE ++W K
Subjt:  LKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821)4.6e-15350.5Show/hide
Query:  SLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNNYPTRFEPESIDPSDRPI
        + P++S+   V A    ++  +SAA+L     L    F+   L   T+T  P          P  C      NQ+       N  +R  P +   S    
Subjt:  SLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNNYPTRFEPESIDPSDRPI

Query:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD
        CP YFRWIHEDLRPW   GITR M+E+   TAHFRL +  G+ YV+ YKKSIQTRD FT+WG LQL R YPG++PDLELMFD DDRPVV+S D+     +
Subjt:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD

Query:  TAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ
           P P+FRYC D+ +LDIVFPDWSFWGWAE+N++PW   L+ +K+GN  ++W  R  +AYW+GNPYV   R DLLKCN ++  +WN RLYIQDW +E++
Subjt:  TAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ

Query:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD
        +G+K SNL +QCTHRYKIYIEG+AWSVSEKYI+ACDS+TL VKP+FYDF+ R + P++HYWPI DD KC S+KFAVHWGN+H+ K + IG+  S FI+++
Subjt:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD

Query:  LKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK
        + M+ VYDYMFHLL +YA LL+F+PEIP+ A E+  +++ CP     + F  ES++ +P    PC M PP+D  +L+ +  R AN+ RQVE WE++++  
Subjt:  LKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK

Query:  --NKP
          NKP
Subjt:  --NKP

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)7.5e-18057.39Show/hide
Query:  IWQPSL-------PKRSVTVAVPAAAIFLLAALSAAVLISST-RLQSTLFSINFLGNQTETSSPKIPKKYI-------KYYPLNCSSSSSTNQTHNFTCR
        IW P +       P RS  +      + + A +S  +L+ +T  L+    +      QT+T +PK P+            + L+CS++ +T    +    
Subjt:  IWQPSL-------PKRSVTVAVPAAAIFLLAALSAAVLISST-RLQSTLFSINFLGNQTETSSPKIPKKYI-------KYYPLNCSSSSSTNQTHNFTCR

Query:  NNYP--TRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        N YP  T FE +  +      CP+YFRWIHEDLRPW+  GITRE +E+ K TA FRLA+VGG++YVE ++ + QTRD+FTIWGFLQL R+YPG+IPDLEL
Subjt:  NNYP--TRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDC D PVV++ ++     +   P PLFRYCG+EETLDIVFPDWSFWGWAE+NI+PWE+LLKEL++GNER+KW+ REP+AYWKGNP VA+TRQDL+KCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        +S++++WNARLY QDWI+ES++GYKQS+LASQC HRYKIYIEG AWSVSEKYILACDSVTLLVKP +YDFFTR L P  HYWP+ +  KC+SIKFAV WG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSH QK Q IGKAASDFIQQDLKM+ VYDYM+HLL +Y+KLL+F+PEIP  AVE+CSET+AC R G E+KFM ESLVK P  + PC+MPPP+D  +   +
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNK
         +R  +   ++ +WE ++W+K
Subjt:  YRRNANIIRQVEKWEDEFWNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGATTCCCAATTTCAATTTCAATTTTGCGACCTTCCCGGAGACCCATTTGGCAACCGTCTCTTCCAAAACGCTCCGTCACCGTCGCAGTTCCGGCCGCCGCCAT
CTTCCTCCTCGCCGCCCTCTCCGCCGCCGTCCTCATTTCCTCCACTCGCCTCCAATCCACTCTGTTTTCCATTAACTTTCTCGGAAACCAAACAGAAACAAGCTCACCAA
AAATCCCCAAAAAATACATAAAATATTACCCACTTAACTGTTCTTCATCTTCCTCCACAAACCAAACCCACAATTTCACCTGCCGGAATAACTACCCGACCCGATTCGAA
CCCGAATCCATCGACCCATCTGATCGGCCCATTTGCCCTGAGTACTTCCGATGGATACACGAAGATCTGCGGCCGTGGGCGACGGGCGGAATCACGAGGGAGATGGTGGA
GAAAGGGAAGGCGACGGCGCATTTCCGGCTGGCGGTGGTCGGCGGGAGGGTCTACGTGGAGCACTATAAGAAGTCGATTCAAACGAGGGATTTGTTTACGATTTGGGGGT
TTTTGCAGCTTTCGAGAAGGTACCCAGGGCGAATCCCTGATTTGGAGCTGATGTTCGACTGTGATGACCGGCCGGTTGTTAAATCGGCTGATTACCGGACTCCGACAGTG
GATACGGCGGGGCCGCTGCCGCTGTTCCGGTATTGCGGCGATGAGGAGACGTTGGATATTGTGTTTCCGGACTGGTCCTTCTGGGGATGGGCAGAGATAAATATAAGGCC
ATGGGAGAATTTGTTGAAGGAATTGAAAAAGGGGAATGAAAGAAGCAAATGGATGCAAAGGGAACCATTTGCTTATTGGAAAGGAAACCCTTATGTGGCTGACACAAGAC
AAGATCTTCTCAAATGCAACCTCTCCCAACAAAATGATTGGAATGCTCGCCTCTACATTCAGGATTGGATCCGAGAGTCTCAACAAGGTTATAAGCAATCGAACTTGGCA
AGCCAATGCACTCATAGGTACAAGATCTACATAGAAGGATATGCATGGTCAGTGAGTGAAAAATACATATTGGCATGTGATTCAGTGACATTGCTTGTAAAGCCCAAATT
TTATGATTTCTTCACTAGATCTTTACAGCCGGTTCGCCATTATTGGCCTATTAGTGATGACCATAAGTGCAAATCCATCAAATTTGCTGTCCATTGGGGCAATTCCCACA
AACAAAAGGTACAAGCTATAGGAAAAGCAGCGAGTGACTTCATCCAACAAGACTTAAAGATGGAAAATGTGTATGACTACATGTTTCATCTCCTCAACCAATACGCCAAG
CTCCTTCGATTTCGGCCCGAAATCCCGATAGGCGCAGTGGAAGTCTGCTCCGAGACGGTGGCTTGCCCGAGGGGCGGGCTGGAGAAGAAGTTCATGAAGGAATCCTTGGT
GAAAACTCCCCCTCTCACCATCCCTTGCTCCATGCCACCACCCTTTGATACCCCTTCTCTCCAAAGGCTTTATAGAAGAAATGCCAACATAATAAGGCAAGTGGAGAAAT
GGGAAGATGAGTTTTGGAATAAAAATAAACCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAGATTCCCAATTTCAATTTCAATTTTGCGACCTTCCCGGAGACCCATTTGGCAACCGTCTCTTCCAAAACGCTCCGTCACCGTCGCAGTTCCGGCCGCCGCCAT
CTTCCTCCTCGCCGCCCTCTCCGCCGCCGTCCTCATTTCCTCCACTCGCCTCCAATCCACTCTGTTTTCCATTAACTTTCTCGGAAACCAAACAGAAACAAGCTCACCAA
AAATCCCCAAAAAATACATAAAATATTACCCACTTAACTGTTCTTCATCTTCCTCCACAAACCAAACCCACAATTTCACCTGCCGGAATAACTACCCGACCCGATTCGAA
CCCGAATCCATCGACCCATCTGATCGGCCCATTTGCCCTGAGTACTTCCGATGGATACACGAAGATCTGCGGCCGTGGGCGACGGGCGGAATCACGAGGGAGATGGTGGA
GAAAGGGAAGGCGACGGCGCATTTCCGGCTGGCGGTGGTCGGCGGGAGGGTCTACGTGGAGCACTATAAGAAGTCGATTCAAACGAGGGATTTGTTTACGATTTGGGGGT
TTTTGCAGCTTTCGAGAAGGTACCCAGGGCGAATCCCTGATTTGGAGCTGATGTTCGACTGTGATGACCGGCCGGTTGTTAAATCGGCTGATTACCGGACTCCGACAGTG
GATACGGCGGGGCCGCTGCCGCTGTTCCGGTATTGCGGCGATGAGGAGACGTTGGATATTGTGTTTCCGGACTGGTCCTTCTGGGGATGGGCAGAGATAAATATAAGGCC
ATGGGAGAATTTGTTGAAGGAATTGAAAAAGGGGAATGAAAGAAGCAAATGGATGCAAAGGGAACCATTTGCTTATTGGAAAGGAAACCCTTATGTGGCTGACACAAGAC
AAGATCTTCTCAAATGCAACCTCTCCCAACAAAATGATTGGAATGCTCGCCTCTACATTCAGGATTGGATCCGAGAGTCTCAACAAGGTTATAAGCAATCGAACTTGGCA
AGCCAATGCACTCATAGGTACAAGATCTACATAGAAGGATATGCATGGTCAGTGAGTGAAAAATACATATTGGCATGTGATTCAGTGACATTGCTTGTAAAGCCCAAATT
TTATGATTTCTTCACTAGATCTTTACAGCCGGTTCGCCATTATTGGCCTATTAGTGATGACCATAAGTGCAAATCCATCAAATTTGCTGTCCATTGGGGCAATTCCCACA
AACAAAAGGTACAAGCTATAGGAAAAGCAGCGAGTGACTTCATCCAACAAGACTTAAAGATGGAAAATGTGTATGACTACATGTTTCATCTCCTCAACCAATACGCCAAG
CTCCTTCGATTTCGGCCCGAAATCCCGATAGGCGCAGTGGAAGTCTGCTCCGAGACGGTGGCTTGCCCGAGGGGCGGGCTGGAGAAGAAGTTCATGAAGGAATCCTTGGT
GAAAACTCCCCCTCTCACCATCCCTTGCTCCATGCCACCACCCTTTGATACCCCTTCTCTCCAAAGGCTTTATAGAAGAAATGCCAACATAATAAGGCAAGTGGAGAAAT
GGGAAGATGAGTTTTGGAATAAAAATAAACCTTAA
Protein sequenceShow/hide protein sequence
MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNNYPTRFE
PESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTV
DTAGPLPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLA
SQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAK
LLRFRPEIPIGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNKNKP