; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G03020 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G03020
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionArabidopsis thaliana protein of unknown function (DUF821)
Genome locationClcChr01:2768551..2773432
RNA-Seq ExpressionClc01G03020
SyntenyClc01G03020
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR006598 - Glycosyl transferase CAP10 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB1222531.1 O-glucosyltransferase rumi [Morella rubra]7.7e-20366.73Show/hide
Query:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES
        R  W+P L K    VA P A +F +  L   + ISS RL ++  S +   N T   S K     K I  +PLNCS  + T       C   YPT F P+ 
Subjt:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES

Query:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA
        +DPS  P+CP YFRWIH+DL PW   GITREMVEK K TAHFRL VV G+ Y+E YKKSIQTRD+FTIWG LQL RRYPGRIPDLELMFDC+D PV++S 
Subjt:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI
        DYR P  +T GPPPLFRYCGD  T DIVFPDWSFWGWAEINIRPWE+LLK+LK+GN R KWM+REP+AYWKGNP V +TR+DLLKCNLS   DWNARLY+
Subjt:  DYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI

Query:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA
        QDWI ES+QGYK+S+LASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+V P+FYDFFTR LQPV HYWPI DD KC+SIKFAV WGN+HK+K QAIGKA
Subjt:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA

Query:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK
        +SDFIQ++LKM+ VYDYMFH+LN YAKLL+F+P+IP GAVE+CSET+AC   GLEKKFM ESLVK P +T PC+MPPPF+   L  LYRRN NII+QV K
Subjt:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK

Query:  WEDEFW------NKNKP
        WE+ +W       K+KP
Subjt:  WEDEFW------NKNKP

XP_004143920.1 O-glucosyltransferase rumi homolog isoform X1 [Cucumis sativus]7.8e-26484.1Show/hide
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR  YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT   PP+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY WSVSEKYILACDS+TLLVKP FYDFF+RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRFQPEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

XP_031740971.1 O-glucosyltransferase rumi homolog isoform X2 [Cucumis sativus]5.3e-24479.5Show/hide
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR  YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT   PP+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY                            RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRFQPEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

XP_031740972.1 O-glucosyltransferase rumi homolog isoform X3 [Cucumis sativus]2.2e-23777.78Show/hide
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR  YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT   PP+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT                                    ++RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRFQPEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

XP_038875037.1 protein O-glucosyltransferase 1-like [Benincasa hispida]1.1e-28689.31Show/hide
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS  I RPSRRPIWQPSLPKRSV VAVPAAA+FLLAALS AV+ISS RLQSTLFS NFLGNQTE  SPKIPKK IKYYPL+CSSSS+TNQT NF 
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR  YPT+FEPES D S RPICPEYFRWIHEDLRPWA GGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWG LQL R+YPGR+PDLEL
Subjt:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADY+T  ++TAGPPPLFRYCGDE+TLDIVFPDWSFWGW EINIRPWE LLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LSQQNDWNARLYIQDWI ESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLL+KP FYDFFTRSLQP+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK QAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF+P IP GAVEVCSET+ACPR G+EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKNKP
        YRRNAN+IRQVEKWEDEFWN+NKP
Subjt:  YRRNANIIRQVEKWEDEFWNKNKP

TrEMBL top hitse value%identityAlignment
A0A0A0KQX4 CAP10 domain-containing protein3.8e-26484.1Show/hide
Query:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT
        MQRFPIS SI  PSRRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKK IKYYPLNCSSSS+TNQT +FT
Subjt:  MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFT

Query:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        CR  YPT +EPESI PS R +CPEYFRWIHEDL+PWA GGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWG LQL RRYPG+IPDLEL
Subjt:  CRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDCDDRPVVKSADYR   VDT   PP+FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNE+ KWM+RE FAYWKGNPYVADTRQDLLKCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        LS QNDWNARLYIQDWI+ESQQGYKQS LA+QCT+RYKIYIEGY WSVSEKYILACDS+TLLVKP FYDFF+RSL+P+ HYWP+SDDHKCKSIKFAVHWG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSHKQK Q IGK AS+FIQQ+L+MENVYDYMFHLLN YAKLLRFQPEIP GA+EVCSET+ACPR G EKKFMKES+VKTP LTIPCSMPPPFDTPSLQRL
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNKN
        YRRNAN+I QVEKWE+ FW +N
Subjt:  YRRNANIIRQVEKWEDEFWNKN

A0A2N9J2X5 CAP10 domain-containing protein2.1e-20667.86Show/hide
Query:  WQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES-ID
        WQP L K        A  I     L     IS+ RL ++ F      N T T   K  IP K I  +PLNCS  + T      TC   YP  F P + +D
Subjt:  WQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES-ID

Query:  PSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADY
        PS  P+CP+YFRWIHEDLRPW   GITR+MVEK K++AHFRL +V G+ Y+E YKKSIQTRD+FTIWG LQL RRYPGR+PD+ELMFDCDDRPV+KSADY
Subjt:  PSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADY

Query:  RTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQD
        R   +   GPPPLFRYCGD  T+DIVFPDWSFWGWAEINI+PWE+LLKELK+GN+RSKW++REP+AYWKGNP+VA+TR+DLLKCN+S + DWNARLYIQD
Subjt:  RTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQD

Query:  WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAAS
        WI ESQQGYK+S LASQCTHRYKIYIEGYAWSVSEKYILACDSV+L+VKP +YDFFTRSL+PV HYWPI DD KCKSIKFAV WGN+HKQK QAIGKA+S
Subjt:  WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAAS

Query:  DFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWE
        DFIQ++LKM+ VYDYMFHLLN+YAKLL+F+P+IP GA+E+CSET AC   G EKKFM ESLVK P LT PC+MPPP++   L  L RRN NIIRQVEKWE
Subjt:  DFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWE

Query:  DEFW
        +++W
Subjt:  DEFW

A0A5E4FRC2 PREDICTED: O-glucosyltransferase8.1e-19865.74Show/hide
Query:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNKYPTRF-EPESI
        W PS   +S  +A     I  L    +SAA+L S   +Q + F I    N+T   S K  + + +  +PLNCS  S+ NQT   TC   YPT F   + +
Subjt:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNKYPTRF-EPESI

Query:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD
        +PS  PICP+YFR+IH+DL PW   GITR+MVE+ K TAHFRL +V G+ YVE YKKSIQTRD+FTIWG LQL RRYPGR+PDLELMFDCDD+PV++S D
Subjt:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD

Query:  YRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ
        +R P  ++   PPLFRYCGD  T DIVFPDWSFWGWAEINI+PWE LLK+LKKGN+R KWM+REP+AYWKGNP+VA++R+DLLKCN+S   DWNARL+IQ
Subjt:  YRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ

Query:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA
        DWI ESQQG+KQS++ASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+VKP++YDFFTRSLQPV HYWPI  D KCKSIKFAV WGN+HKQK QAIGKAA
Subjt:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA

Query:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW
        SDFIQQ+LKM+ VYDYMFHLLN+YAKLLRF+P+IP GA  +CSE++ACP    EKKFM ESLVK+P +T PC+MPP +   +L  LYRRN N+ +QV+KW
Subjt:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW

Query:  EDEFW
        ED++W
Subjt:  EDEFW

A0A6A1WB87 O-glucosyltransferase rumi3.7e-20366.73Show/hide
Query:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES
        R  W+P L K    VA P A +F +  L   + ISS RL ++  S +   N T   S K     K I  +PLNCS  + T       C   YPT F P+ 
Subjt:  RPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPK--IPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPES

Query:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA
        +DPS  P+CP YFRWIH+DL PW   GITREMVEK K TAHFRL VV G+ Y+E YKKSIQTRD+FTIWG LQL RRYPGRIPDLELMFDC+D PV++S 
Subjt:  IDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI
        DYR P  +T GPPPLFRYCGD  T DIVFPDWSFWGWAEINIRPWE+LLK+LK+GN R KWM+REP+AYWKGNP V +TR+DLLKCNLS   DWNARLY+
Subjt:  DYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYI

Query:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA
        QDWI ES+QGYK+S+LASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+V P+FYDFFTR LQPV HYWPI DD KC+SIKFAV WGN+HK+K QAIGKA
Subjt:  QDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKA

Query:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK
        +SDFIQ++LKM+ VYDYMFH+LN YAKLL+F+P+IP GAVE+CSET+AC   GLEKKFM ESLVK P +T PC+MPPPF+   L  LYRRN NII+QV K
Subjt:  ASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEK

Query:  WEDEFW------NKNKP
        WE+ +W       K+KP
Subjt:  WEDEFW------NKNKP

M5WWD4 CAP10 domain-containing protein2.8e-19865.94Show/hide
Query:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNKYPTRF-EPESI
        W PS   +S  +A     I  L    +SAA+L S   +Q + F I    N+T   S K  + + +  +PLNCS  S+ NQT   TC   YPT F   + +
Subjt:  WQPSLPKRSVTVAVPAAAIFLLA--ALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKY-IKYYPLNCSSSSSTNQTHNFTCRNKYPTRF-EPESI

Query:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD
        +PS  PICP+YFR+IH+DL PW   GITR+MVE+ K TAHFRL +V G+ YVE YKKSIQTRD+FTIWG LQL RRYPGR+PDLELMFDCDD+PV++S D
Subjt:  DPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSAD

Query:  YRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ
        +R P  ++   PPLFRYCGD  T DIVFPDWSFWGWAEINI+PWE LLK+LKKGN+R KWM+REP+AYWKGNP+VA++R+DLLKCN+S   DWNARL+IQ
Subjt:  YRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQ

Query:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA
        DWI ESQQG+KQS++ASQCTHRYKIYIEGYAWSVSEKYILACDSVTL+VKP++YDFFTRSLQPV HYWPI  D KCKSIKFAV WGN+HKQK QAIGKAA
Subjt:  DWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAA

Query:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW
        SDFIQQ+LKM+ VYDYMFHLLN+YAKLLRF+P+IP GA  +CSE++ACP    EKKFM ESLVK+P +T PC+MPP F   +L  LYRRN N+ +QV+KW
Subjt:  SDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKW

Query:  EDEFW
        ED++W
Subjt:  EDEFW

SwissProt top hitse value%identityAlignment
A0NDG6 O-glucosyltransferase rumi homolog1.2e-2025.22Show/hide
Query:  DLRPWATGGITREMVEKGKA-TAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPLFR
        DL+P+   GIT+EM+ + K    H++  V+G ++Y     +  +        G     R     +PD++L+ +C D P +    +R  + +     P+  
Subjt:  DLRPWATGGITREMVEKGKA-TAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPLFR

Query:  YCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQG-
        +    E LDI++P W+FW G   I + P     W+   + + K +  + W  +EP A+++G+   +D R  L+  + +Q +  +A+ Y ++   +S Q  
Subjt:  YCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQG-

Query:  -----YKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFI
              ++  L   C +R+     G A S   K++  C S+   V  ++ +FF  SL+P  HY P+      + ++  + +   H Q  +AI +   + I
Subjt:  -----YKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFI

Query:  QQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEV
           L+M +V  Y   LL +Y KL+R+  E     +EV
Subjt:  QQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEV

B0X1Q4 O-glucosyltransferase rumi homolog3.4e-2023.02Show/hide
Query:  CSSSSSTNQTHNF--TCRNKYPTRFEP--ESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLF---
        C+ +     T+N      NKY T  E    +  P +   C  +   +  DLRP+ + GIT++++E  ++    +  ++G R++        + RD     
Subjt:  CSSSSSTNQTHNF--TCRNKYPTRFEP--ESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLF---

Query:  TIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKKGNERSK
           G     R    ++PD+EL+ +C D P + S  +      +  P P+  +    + LDI++P W FW G   I++ P     W+     ++K  +   
Subjt:  TIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW-GWAEINIRP-----WENLLKELKKGNERSK

Query:  WMQREPFAYWKG-------NPYV--ADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVK
        W ++   A+++G       +P V  +  R +L+    ++   W +    +D +    +  ++  L   C ++Y     G A S   K++  C S+   V 
Subjt:  WMQREPFAYWKG-------NPYV--ADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVK

Query:  PKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEV
         ++ +FF  SL+P  HY P+        ++  + +   H Q  Q I     + I   L+ME+V  Y   LL +Y KL++++ +     VE+
Subjt:  PKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEV

Q5E9Q1 Protein O-glucosyltransferase 12.8e-2224.79Show/hide
Query:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD
        E+ +P   P C  Y   I EDL P+  GGI+R+M   V + K   H++  ++  R+Y E    +       + F +           GR+PD+E++ +  
Subjt:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD

Query:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD
        D P V    +  P +      P+F +    E  DI++P W+FW  G A   I P     W+   ++L +   +  W ++   AY++G       +P +  
Subjt:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD

Query:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK
        +R++  L+    ++   W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V  ++ +FF   L+P  HY P+  D  
Subjt:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK

Query:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF
          +++  + +  ++    Q I +  S FI   LKM+++  Y  +LL +Y+K L +
Subjt:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF

Q8BYB9 Protein O-glucosyltransferase 13.4e-2024.23Show/hide
Query:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD
        E+ +P     C  Y   I EDL P+  GGI+R+M   V + K   H++  ++  R++ E    +       + F +            R+PD+E++ +  
Subjt:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD

Query:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD
        D P V    +  PT+      P+F +    E  DI++P W+FW  G A   + P     W+   ++L +   +  W ++   AY++G       +P +  
Subjt:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD

Query:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK
        +R++  L+    ++   W +   ++D +   +   K  +L   C +RY     G A S   K++  C S+   V  ++ +FF   L+P  HY P+  D  
Subjt:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK

Query:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF
          +++  + +  ++    Q I K  S FI   L+M+++  Y  +LL  Y+K L +
Subjt:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF

Q8NBL1 Protein O-glucosyltransferase 11.8e-2124.23Show/hide
Query:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD
        E+ +P     C  Y   I EDL P+  GGI+R+M   V + K   H++  +   R+Y E+   +       + F +           GR+PD+E++ +  
Subjt:  ESIDPSDRPICPEYFRWIHEDLRPWATGGITREM---VEKGKATAHFRLAVVGGRVYVEH---YKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCD

Query:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD
        D P V    +  P +      P+F +    E  DI++P W+FW  G A   I P     W+   ++L +   +  W ++   AY++G       +P +  
Subjt:  DRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFW--GWAEINIRP-----WENLLKELKKGNERSKWMQREPFAYWKG-------NPYVAD

Query:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK
        +R++  L+    ++   W +   ++D +   +   K  +L   C ++Y     G A S   K++  C S+   V  ++ +FF   L+P  HY P+  D  
Subjt:  TRQD--LLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHK

Query:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF
          +++  + +  ++    Q I +  S FI+  L+M+++  Y  +LL++Y+K L +
Subjt:  CKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRF

Arabidopsis top hitse value%identityAlignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)5.8e-17260.91Show/hide
Query:  SSSTNQTHNFTCRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSR
        SS  NQ  + +C     + +     + S+R  CP+YF+WIHEDL+PW   GIT+EMVE+GK TAHFRL ++ G+V+VE+YKKSIQTRD FT+WG LQL R
Subjt:  SSSTNQTHNFTCRNKYPTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSR

Query:  RYPGRIPDLELMFDCDDRPVVKSADYR--TPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNP
        +YPG++PD++LMFDCDDRPV++S  Y     TV+ A PPPLFRYCGD  T+DIVFPDWSFWGW EINIR W  +LKE+++G ++ K+M+R+ +AYWKGNP
Subjt:  RYPGRIPDLELMFDCDDRPVVKSADYR--TPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNP

Query:  YVAD-TRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISD
        +VA  +R+DLL CNLS  +DWNAR++IQDWI E Q+G++ SN+A+QCT+RYKIYIEGYAWSVSEKYILACDSVTL+VKP +YDFF+R+LQP++HYWPI D
Subjt:  YVAD-TRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISD

Query:  DHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPR-----GGLEKKFMKESLVKTPP
          KC+SIKFAV W N+H QK Q IG+ AS+F+Q+DL MENVYDYMFHLLN+Y+KLL+++P++P  +VE+C+E + CP       G++KKFM  SLV  P 
Subjt:  DHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPR-----GGLEKKFMKESLVKTPP

Query:  LTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK
         + PCS+PPPFD+  L++ +R+  N+IRQVEKWED +W K
Subjt:  LTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK

AT2G45830.1 downstream target of AGL15 28.9e-15752.85Show/hide
Query:  AVPAAAIFLLAAL--SAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPESIDPSDRPICPEYFRW
        ++  A +FL+ +L  SA +L     L        F G +  T+S +      + +P  C      NQT  F          +P S   S    CP YFRW
Subjt:  AVPAAAIFLLAAL--SAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPESIDPSDRPICPEYFRW

Query:  IHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPL
        IHEDLRPW   G+TR M+EK + TAHFR+ ++ GRVYV+ Y+KSIQTRD+FT+WG +QL R YPGR+PDLELMFD DDRP V+S D++        PPPL
Subjt:  IHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVDTAGPPPL

Query:  FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSN
        FRYC D+ +LDIVFPDWSFWGWAE+NI+PW+  L  +++GN+ ++W  R  +AYW+GNP VA TR+DLL+CN+S Q DWN RLYIQDW RES++G+K SN
Subjt:  FRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQQGYKQSN

Query:  LASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVY
        L +QCTHRYKIYIEG+AWSVSEKYI+ACDS+TL V+P FYDF+ R + P++HYWPI D  KC S+KFAVHWGN+H  +   IG+  S FI++++KME VY
Subjt:  LASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQDLKMENVY

Query:  DYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWN
        DYMFHL+N+YAKLL+F+PEIP GA E+  + + C   G  + FM+ES+V  P    PC MP PF+   L+ +  R  N+ RQVE WED++++
Subjt:  DYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWN

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)2.7e-17759.8Show/hide
Query:  IFLL--AALSAAVLIS-STRLQSTLFSINFLGNQTETSSPKIPK--KYI----KYYPLNCSSSSSTNQTHNFTC-RNKYPTRF-----EPESIDPSDRPI
        +FLL  A LS  +L+  S  ++    S+     +  T SP+ P+  K I    K + LNC++ S  +     TC ++ YPT F     E ES D S    
Subjt:  IFLL--AALSAAVLIS-STRLQSTLFSINFLGNQTETSSPKIPK--KYI----KYYPLNCSSSSSTNQTHNFTC-RNKYPTRF-----EPESIDPSDRPI

Query:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD
        CP+YFRWIHEDLRPW   GITRE +E+  ATA FRLA++ GR+YVE ++++ QTRD+FTIWGF+QL RRYPG+IPDLELMFDC D PVVK+A++    VD
Subjt:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD

Query:  TAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ
           PPPLFRYC ++ETLDIVFPDWS+WGWAE+NI+PWE+LLKEL++GN+R+KW+ REP+AYWKGNP VA+TR DL+KCNLS+  DW ARLY QDW++ES+
Subjt:  TAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ

Query:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD
        +GYKQS+LASQC HRYKIYIEG AWSVSEKYILACDSVTL+VKP +YDFFTR + P  HYWP+ +D KC+SIKFAV WGN H +K Q IGK AS+F+QQ+
Subjt:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD

Query:  LKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK
        LKM+ VYDYMFHLL QY+KLLRF+PEIP  + E+CSE +ACPR G E+KFM ESLVK P  T PC+MPPP+D  S   + +R  +   ++E+WE ++W K
Subjt:  LKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821)1.2e-15350.5Show/hide
Query:  SLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPESIDPSDRPI
        + P++S+   V A    ++  +SAA+L     L       N       T+  K P  Y   +  N SS +  +Q           +R  P +   S    
Subjt:  SLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNKYPTRFEPESIDPSDRPI

Query:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD
        CP YFRWIHEDLRPW   GITR M+E+   TAHFRL +  G+ YV+ YKKSIQTRD FT+WG LQL R YPG++PDLELMFD DDRPVV+S D+     +
Subjt:  CPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVVKSADYRTPTVD

Query:  TAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ
           PPP+FRYC D+ +LDIVFPDWSFWGWAE+N++PW   L+ +K+GN  ++W  R  +AYW+GNPYV   R DLLKCN ++  +WN RLYIQDW +E++
Subjt:  TAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQDWIRESQ

Query:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD
        +G+K SNL +QCTHRYKIYIEG+AWSVSEKYI+ACDS+TL VKP+FYDF+ R + P++HYWPI DD KC S+KFAVHWGN+H+ K + IG+  S FI+++
Subjt:  QGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQD

Query:  LKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK
        + M+ VYDYMFHLL +YA LL+F+PEIP+ A E+  +++ CP     + F  ES++ +P    PC M PP+D  +L+ +  R AN+ RQVE WE++++  
Subjt:  LKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNK

Query:  --NKP
          NKP
Subjt:  --NKP

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)1.8e-18157.77Show/hide
Query:  IWQPSL-------PKRSVTVAVPAAAIFLLAALSAAVLISST-RLQSTLFSINFLGNQTETSSPKIPKKYI-------KYYPLNCSSSSSTNQTHNFTCR
        IW P +       P RS  +      + + A +S  +L+ +T  L+    +      QT+T +PK P+            + L+CS++ +T    +    
Subjt:  IWQPSL-------PKRSVTVAVPAAAIFLLAALSAAVLISST-RLQSTLFSINFLGNQTETSSPKIPKKYI-------KYYPLNCSSSSSTNQTHNFTCR

Query:  NKYP--TRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL
        NKYP  T FE +  +      CP+YFRWIHEDLRPW+  GITRE +E+ K TA FRLA+VGG++YVE ++ + QTRD+FTIWGFLQL R+YPG+IPDLEL
Subjt:  NKYP--TRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN
        MFDC D PVV++ ++     +   PPPLFRYCG+EETLDIVFPDWSFWGWAE+NI+PWE+LLKEL++GNER+KW+ REP+AYWKGNP VA+TRQDL+KCN
Subjt:  MFDCDDRPVVKSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCN

Query:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG
        +S++++WNARLY QDWI+ES++GYKQS+LASQC HRYKIYIEG AWSVSEKYILACDSVTLLVKP +YDFFTR L P  HYWP+ +  KC+SIKFAV WG
Subjt:  LSQQNDWNARLYIQDWIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWG

Query:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL
        NSH QK Q IGKAASDFIQQDLKM+ VYDYM+HLL +Y+KLL+F+PEIP  AVE+CSET+AC R G E+KFM ESLVK P  + PC+MPPP+D  +   +
Subjt:  NSHKQKVQAIGKAASDFIQQDLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRL

Query:  YRRNANIIRQVEKWEDEFWNK
         +R  +   ++ +WE ++W+K
Subjt:  YRRNANIIRQVEKWEDEFWNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGATTCCCAATTTCAATTTCAATTTTGCGACCTTCCCGGAGACCCATTTGGCAACCGTCTCTTCCAAAACGCTCCGTCACCGTCGCAGTTCCTGCCGCC
GCCATCTTCCTCCTCGCCGCCCTCTCCGCCGCCGTCCTCATTTCCTCCACTCGCCTCCAATCCACTCTGTTTTCCATTAACTTTCTCGGAAACCAAACAGAAACA
AGCTCACCAAAAATCCCCAAAAAATACATAAAATATTACCCACTTAACTGTTCTTCATCTTCCTCCACAAACCAAACCCACAATTTCACCTGCCGGAATAAATAC
CCGACCCGATTCGAACCCGAATCCATCGACCCATCTGATCGGCCCATTTGCCCTGAGTACTTCCGATGGATACACGAGGATCTGCGGCCGTGGGCGACGGGCGGA
ATCACGAGGGAGATGGTGGAGAAAGGGAAGGCGACGGCGCATTTCCGGCTGGCGGTGGTGGGCGGGAGGGTCTACGTGGAGCACTATAAGAAGTCGATTCAAACG
AGGGATTTGTTTACGATTTGGGGGTTTTTGCAGCTTTCGAGAAGGTACCCAGGGCGAATCCCTGATTTGGAGCTGATGTTCGACTGTGATGACCGGCCGGTTGTT
AAATCGGCTGATTACCGGACTCCGACGGTGGATACGGCGGGGCCGCCGCCGCTGTTCCGGTATTGCGGCGATGAGGAGACGTTGGACATTGTGTTTCCGGACTGG
TCCTTCTGGGGATGGGCAGAGATAAATATAAGGCCATGGGAGAATTTGTTGAAGGAATTGAAAAAAGGGAATGAAAGAAGCAAATGGATGCAAAGGGAACCATTT
GCTTATTGGAAAGGAAACCCTTATGTGGCTGACACAAGACAAGATCTTCTCAAATGCAACCTCTCCCAACAAAATGATTGGAATGCTCGCCTCTACATTCAGGAT
TGGATCCGAGAGTCTCAACAAGGTTATAAGCAATCTAACTTGGCAAGCCAATGCACTCATAGGTACAAGATCTACATAGAAGGATATGCATGGTCAGTGAGTGAA
AAATACATATTGGCATGTGATTCAGTGACATTGCTTGTAAAGCCCAAATTTTATGATTTCTTCACTAGATCTTTACAGCCAGTTCGCCATTATTGGCCTATTAGT
GATGACCATAAGTGCAAATCCATCAAATTTGCTGTCCATTGGGGCAATTCCCACAAACAAAAGGTACAAGCTATAGGAAAAGCAGCGAGTGACTTCATCCAACAA
GACTTAAAGATGGAAAATGTGTATGACTACATGTTTCATCTCCTCAACCAATACGCCAAGCTCCTTCGATTTCAGCCCGAAATCCCGATGGGCGCAGTGGAAGTC
TGCTCCGAGACGGTGGCTTGCCCGAGGGGCGGGCTGGAGAAGAAGTTCATGAAGGAATCCTTGGTGAAAACTCCCCCTCTCACCATCCCTTGCTCCATGCCACCA
CCCTTTGATACCCCTTCTCTCCAAAGGCTTTATAGAAGAAATGCCAACATAATAAGGCAAGTGGAGAAATGGGAAGATGAGTTTTGGAATAAAAATAAACCTTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAGTCAACGGGCATTTACCCACACAAAGCACAAAAAGTCTCCCTCGATAATTACTTTCAAAATTCTCTTTCGGATTCGTAAAATCCCACCAGATGCAA
TGCAGAGATTCCCAATTTCAATTTCAATTTTGCGACCTTCCCGGAGACCCATTTGGCAACCGTCTCTTCCAAAACGCTCCGTCACCGTCGCAGTTCCTGCCGCCG
CCATCTTCCTCCTCGCCGCCCTCTCCGCCGCCGTCCTCATTTCCTCCACTCGCCTCCAATCCACTCTGTTTTCCATTAACTTTCTCGGAAACCAAACAGAAACAA
GCTCACCAAAAATCCCCAAAAAATACATAAAATATTACCCACTTAACTGTTCTTCATCTTCCTCCACAAACCAAACCCACAATTTCACCTGCCGGAATAAATACC
CGACCCGATTCGAACCCGAATCCATCGACCCATCTGATCGGCCCATTTGCCCTGAGTACTTCCGATGGATACACGAGGATCTGCGGCCGTGGGCGACGGGCGGAA
TCACGAGGGAGATGGTGGAGAAAGGGAAGGCGACGGCGCATTTCCGGCTGGCGGTGGTGGGCGGGAGGGTCTACGTGGAGCACTATAAGAAGTCGATTCAAACGA
GGGATTTGTTTACGATTTGGGGGTTTTTGCAGCTTTCGAGAAGGTACCCAGGGCGAATCCCTGATTTGGAGCTGATGTTCGACTGTGATGACCGGCCGGTTGTTA
AATCGGCTGATTACCGGACTCCGACGGTGGATACGGCGGGGCCGCCGCCGCTGTTCCGGTATTGCGGCGATGAGGAGACGTTGGACATTGTGTTTCCGGACTGGT
CCTTCTGGGGATGGGCAGAGATAAATATAAGGCCATGGGAGAATTTGTTGAAGGAATTGAAAAAAGGGAATGAAAGAAGCAAATGGATGCAAAGGGAACCATTTG
CTTATTGGAAAGGAAACCCTTATGTGGCTGACACAAGACAAGATCTTCTCAAATGCAACCTCTCCCAACAAAATGATTGGAATGCTCGCCTCTACATTCAGGATT
GGATCCGAGAGTCTCAACAAGGTTATAAGCAATCTAACTTGGCAAGCCAATGCACTCATAGGTACAAGATCTACATAGAAGGATATGCATGGTCAGTGAGTGAAA
AATACATATTGGCATGTGATTCAGTGACATTGCTTGTAAAGCCCAAATTTTATGATTTCTTCACTAGATCTTTACAGCCAGTTCGCCATTATTGGCCTATTAGTG
ATGACCATAAGTGCAAATCCATCAAATTTGCTGTCCATTGGGGCAATTCCCACAAACAAAAGGTACAAGCTATAGGAAAAGCAGCGAGTGACTTCATCCAACAAG
ACTTAAAGATGGAAAATGTGTATGACTACATGTTTCATCTCCTCAACCAATACGCCAAGCTCCTTCGATTTCAGCCCGAAATCCCGATGGGCGCAGTGGAAGTCT
GCTCCGAGACGGTGGCTTGCCCGAGGGGCGGGCTGGAGAAGAAGTTCATGAAGGAATCCTTGGTGAAAACTCCCCCTCTCACCATCCCTTGCTCCATGCCACCAC
CCTTTGATACCCCTTCTCTCCAAAGGCTTTATAGAAGAAATGCCAACATAATAAGGCAAGTGGAGAAATGGGAAGATGAGTTTTGGAATAAAAATAAACCTTAAT
ACATTTTTTTTTTCCTTTTTCCCTCTATTTTTTAAATTTTGAAATAGAAACAGCAAAAAAAATATTTATGGTTTGTCTATTTTGAAATAATATATCCTTTAATTA
CCATCAGTTTGATTATTTCATGGTATTTTGAACTTAA
Protein sequenceShow/hide protein sequence
MQRFPISISILRPSRRPIWQPSLPKRSVTVAVPAAAIFLLAALSAAVLISSTRLQSTLFSINFLGNQTETSSPKIPKKYIKYYPLNCSSSSSTNQTHNFTCRNKY
PTRFEPESIDPSDRPICPEYFRWIHEDLRPWATGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGFLQLSRRYPGRIPDLELMFDCDDRPVV
KSADYRTPTVDTAGPPPLFRYCGDEETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNERSKWMQREPFAYWKGNPYVADTRQDLLKCNLSQQNDWNARLYIQD
WIRESQQGYKQSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPKFYDFFTRSLQPVRHYWPISDDHKCKSIKFAVHWGNSHKQKVQAIGKAASDFIQQ
DLKMENVYDYMFHLLNQYAKLLRFQPEIPMGAVEVCSETVACPRGGLEKKFMKESLVKTPPLTIPCSMPPPFDTPSLQRLYRRNANIIRQVEKWEDEFWNKNKP