; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020015 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020015
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein O-glucosyltransferase 1-like
Genome locationchr5:47391474..47395340
RNA-Seq ExpressionLag0020015
SyntenyLag0020015
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR006598 - Glycosyl transferase CAP10 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033653.1 O-glucosyltransferase rumi-like protein [Cucumis melo var. makuwa]1.4e-26882.62Show/hide
Query:  MRE---ASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFS-LFFFFSLLLLGGAFISTRLLNFTTTADNL--KG-----------SEIPKNPH--RRRHV
        MRE   +SF  RFS Y S     F+DH+ KP +KSPA FS LF FFSL LL G F+STRLL+ +T A NL  KG           SE+P+NP+  RRR V
Subjt:  MRE---ASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFS-LFFFFSLLLLGGAFISTRLLNFTTTADNL--KG-----------SEIPKNPH--RRRHV

Query:  EIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWG
        E  LDCTSFNN+TGGACPANYPTNWT +E  NRPS  TCP+YFRWIHEDLRPWARTGI+RA +EAG+RTANFRLVILNGKAYVET+ KSFQ+RDTFTVWG
Subjt:  EIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWG

Query:  ILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG
        ILQLLR YPGKV DL+LMFDCVDWPVIL++HFSGP+GP PPPLFRYCGDD TLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRI WKSREPYAYWKG
Subjt:  ILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG

Query:  NPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVK
        NPEVA+TRKDLLKCNVSDQQDWNARVFAQDW KESQ GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTL+VKPHYYDFFTRGLMPVHHYWPVK
Subjt:  NPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVK

Query:  SDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSP
         DDKC+SIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKP++P  AIELCSEAMACPAEGLT+KFMTESLVK PA+S+P
Subjt:  SDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSP

Query:  CTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP
        CTMPPPYDPASLHFVL RKENSIKQVEKWET+FW+TQSKQP
Subjt:  CTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP

KAG6574177.1 Protein O-glucosyltransferase 1, partial [Cucurbita argyrosperma subsp. sororia]1.0e-27183.84Show/hide
Query:  MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKG----SEIPKNPHRRRHVEIPLDCTSFNNVTGG
        MR+A F QRFS+Y SWVSRHF+DHL KP LKSPARFSL  FFSL LL GAF+STRLL+      N +G    S+IPK P RRR VE PLDCTSFNNV GG
Subjt:  MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKG----SEIPKNPHRRRHVEIPLDCTSFNNVTGG

Query:  ACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDL
         CPA+YPTNWT EEDPN P P TCPDYFRWIHEDLRPW RTGITRA +EAG+RTANFRL I+NGKAYV+T+ KSFQ+RDTFTVWGILQLLR YPGKVPDL
Subjt:  ACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDL

Query:  ELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCN
        ELMFDCVDWPVILT+HFSGPNGP PPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDL EGNK+IPWKSREPYAYWKGNPEVAETRKDLLKCN
Subjt:  ELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCN

Query:  VSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWG
        VSDQQDWNARVFAQDWMKESQ+GYKQSDLA QC+H+YKIYIEGSAWSVSEKYILACDSVTLLVKP YYDFFTRGLMP+HHYWPVK+DDKCRSIKFAVDWG
Subjt:  VSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWG

Query:  NSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFV
        NSH+QKAQAIGKAA+SFIQEELKM+YVYDYMFHLL++YSKLLTFKP+IP  AIELCSEAMACPA+GLT++FMTESLVKSPA++SPCT+PPPYDPASL FV
Subjt:  NSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFV

Query:  LSRKENSIKQVEKWETAFWDTQSKQP
         S K++SIKQVE+WET    T+SKQP
Subjt:  LSRKENSIKQVEKWETAFWDTQSKQP

XP_022944810.1 protein O-glucosyltransferase 1 isoform X1 [Cucurbita moschata]1.7e-27183.46Show/hide
Query:  MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKG----SEIPKNPHRRRHVEIPLDCTSFNNVTGG
        MR+A F QRFS+Y SWVSRHF+DHL KP LKSPARFSL  FFSL LL GAF+STRLL+      N +G    S+IPK P RRR VE PLDCT FNNV GG
Subjt:  MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKG----SEIPKNPHRRRHVEIPLDCTSFNNVTGG

Query:  ACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDL
         CPA+YPTNWT EEDPN P P TCPDYFRWIHEDLRPW RTGITRAT+EAG+RTANFRL I+NGKAYV+T+ KSFQ+RDTFTVWGILQLLR YPGKVPDL
Subjt:  ACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDL

Query:  ELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCN
        ELMFDCVDWPVILT+HF+GPNGPAPPPLFRYCGDD+TLDIVFPDWSFWGWPEINIKPWEPLLKDL EGNK+IPWKSREPYAYWKGNPEVAETRKDLLKCN
Subjt:  ELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCN

Query:  VSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWG
        VSDQ+DWNARVFAQDWMKESQ+GYKQSDLA QC+H+YKIYIEGSAWSVSEKYILACDS+TLLVKP YYDFFTRGLMP+HHYWPVK+DDKCRSIKFAVDWG
Subjt:  VSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWG

Query:  NSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFV
        NSH+QKAQAIGKAA+SFIQEELKM+YVYDYMFHLL++YSKLLTFKPSIP  AIELCSEAMACPA+GLT++FMTESLVKSPA++SPCT+PPPYDPASL FV
Subjt:  NSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFV

Query:  LSRKENSIKQVEKWETAFWDTQSKQP
         S K++SIKQVE+WET    T+SKQP
Subjt:  LSRKENSIKQVEKWETAFWDTQSKQP

XP_022968464.1 protein O-glucosyltransferase 1 isoform X1 [Cucurbita maxima]9.5e-27083.08Show/hide
Query:  MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKG----SEIPKNPHRRRHVEIPLDCTSFNNVTGG
        MR+A F QRFS+Y +WVSRHF+DHL KP LKSPARFSL  FFSL LL GAF+STRLL+  T   N +G    S+IPK P RRR VE PLDCTSFN+V  G
Subjt:  MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKG----SEIPKNPHRRRHVEIPLDCTSFNNVTGG

Query:  ACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDL
         CPA+YPTNWT EEDPN P P TCPDYFRWIHEDLRPWARTGITRAT+EAG+RTANFRL I+NGKAYV+T+ KSFQ+RDTFTVWGILQLLR YPGKVPDL
Subjt:  ACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDL

Query:  ELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCN
        ELMFDCVDWPVILT+HFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDL EGNK+IPWKSREPYAYWKGNPEVAETRKDLLKCN
Subjt:  ELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCN

Query:  VSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWG
        VSDQQDWNARVF QDWMKESQ+GYKQSDLA QC+H+YKIYIEGSAWSVSEKYILACDSVTLLVKP YYDFFTRGL+P+HHYWPVK+DDKCRSIKFAVDWG
Subjt:  VSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWG

Query:  NSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFV
        NSH+Q+AQAIGKAA+SFIQEELKM+YVYDYMFHLL++YSKLLTFKP+IP  AIELCSEAMACPA+GLT++ MTESLV+SPA++SPCT+PPPYDPASL FV
Subjt:  NSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFV

Query:  LSRKENSIKQVEKWETAFWDTQSKQP
         S K++SIKQVE+WET    T+SKQP
Subjt:  LSRKENSIKQVEKWETAFWDTQSKQP

XP_038898817.1 protein O-glucosyltransferase 1-like [Benincasa hispida]2.3e-27685.21Show/hide
Query:  MREA-SFQQRFSDYTSWVSRHFADH-LTKPLLKSPARFSLFF-FFSLLLLGGAFISTRLLNFTTTADNLKG----------------SEIPKNPHRRRHV
        MREA SFQQRFS Y S  SR+F+DH L KP LKSPA FSLFF FFSL LL G F STRLL+ +TTA NL G                S+IP+NP+ RR V
Subjt:  MREA-SFQQRFSDYTSWVSRHFADH-LTKPLLKSPARFSLFF-FFSLLLLGGAFISTRLLNFTTTADNLKG----------------SEIPKNPHRRRHV

Query:  EIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWG
        E  LDCTSFNN+T G CP NYPT WT EED +RPS  TCPD+FRWIHEDL PWARTGITRATLEAGKRTANFRLVILNGKAYVET+ KSFQ+RDTFTVWG
Subjt:  EIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWG

Query:  ILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG
        ILQLLR YPGKVPDLELMFDCVDWPVILT+HFSGPNGP PPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWE LLKDLKEGNKRIPWK RE YAYWKG
Subjt:  ILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG

Query:  NPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVK
        NPEVAETRKDLLKCNVSDQQDWN RVFAQDWMKESQ+GYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTL+VKPHYYDFFTRGLMPVHHYWPVK
Subjt:  NPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVK

Query:  SDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSP
         DDKC+SIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKP IPR AI+LCSEAMACPAEGLT+KFM +SLVK PADSSP
Subjt:  SDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSP

Query:  CTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP
        C MPPPYDPASLHFVLSRKENSIKQVEKWET+FW+TQSKQP
Subjt:  CTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP

TrEMBL top hitse value%identityAlignment
A0A0A0L5W3 CAP10 domain-containing protein5.6e-26882.32Show/hide
Query:  MRE---ASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFS-LFFFFSLLLLGGAFISTRLLNFTTTADNL--KG-----------SEIPKNPH---RRRH
        MRE    SF+ RFS Y       F DH+ KP +KSPA FS LF FFSL LL G F+STRLL+ +TTA NL  KG           S++P NP+   RR  
Subjt:  MRE---ASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFS-LFFFFSLLLLGGAFISTRLLNFTTTADNL--KG-----------SEIPKNPH---RRRH

Query:  VEIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPT-CPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTV
        VE  L C SFNN+T GACPA+YPTNWT +ED N PS  + CPDYFRWIHEDLRPWARTGITRATLEAG+RTANFRL+ILNGKAYVET+ KSFQ+RDTFTV
Subjt:  VEIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPT-CPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTV

Query:  WGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYW
        WGILQLLR YPGKVPDL+LMFDCVDWPVILT+HFSGPNGP PPPLFRYCGDDAT DIVFPDWSFWGWPEINIKPWEPLLKD+KEGNKRIPWKSREPYAYW
Subjt:  WGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYW

Query:  KGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWP
        KGNPEVA+TRKDL+KCNVSDQQDWNARVFAQDW KESQ GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTL+VKPHYYDFFTRGLMPVHHYWP
Subjt:  KGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWP

Query:  VKSDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADS
        VK DDKC+SIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKP++P  AIELCSEAMACPAEGLT+KFMTESLVK PA+S
Subjt:  VKSDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADS

Query:  SPCTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP
        +PCTMPPPYDPASLHFVLSRKENSIKQVEKWET+FW+TQSKQP
Subjt:  SPCTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP

A0A1S3AYX8 O-glucosyltransferase rumi homolog4.8e-26782.29Show/hide
Query:  MRE---ASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFS-LFFFFSLLLLGGAFISTRLLNFTTTADNL--KG-----------SEIPKNPH---RRRH
        MRE   +SF  RFS Y S     F+DH+ KP +KSPA FS LF FFSL LL G F+STRLL+ +T A NL  KG           SE+P+NP+   RRR 
Subjt:  MRE---ASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFS-LFFFFSLLLLGGAFISTRLLNFTTTADNL--KG-----------SEIPKNPH---RRRH

Query:  VEIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVW
        VE  LDCTSFNN+TGGACPANYPTN T +E  NRPS  TCP+YFRWIHEDLRPWARTGI+RA +EAG+RTANFRLVILNGKAYVET+ KSFQ+RDTFTVW
Subjt:  VEIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVW

Query:  GILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWK
        GILQLLR YPGKV DL+LMFDCVDWPVIL++HFSGP+GP PPPLFRYCGDD TLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRI WKSREPYAYWK
Subjt:  GILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWK

Query:  GNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPV
        GNPEVA+TRKDLLKCNVSDQQDWNARVFAQDW KESQ GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTL+VKPHYYDFFTRGLMPVHHYWPV
Subjt:  GNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPV

Query:  KSDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSS
        K DDKC+SIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKP++P  AIELCSEAMACPAEGLT+KFMTESLVK PA+S+
Subjt:  KSDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSS

Query:  PCTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP
        PCTMPPPYDPASLHFVL RKENSIKQVEKWET+FW+T+SKQP
Subjt:  PCTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP

A0A5D3DHB1 O-glucosyltransferase rumi-like protein6.7e-26982.62Show/hide
Query:  MRE---ASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFS-LFFFFSLLLLGGAFISTRLLNFTTTADNL--KG-----------SEIPKNPH--RRRHV
        MRE   +SF  RFS Y S     F+DH+ KP +KSPA FS LF FFSL LL G F+STRLL+ +T A NL  KG           SE+P+NP+  RRR V
Subjt:  MRE---ASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFS-LFFFFSLLLLGGAFISTRLLNFTTTADNL--KG-----------SEIPKNPH--RRRHV

Query:  EIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWG
        E  LDCTSFNN+TGGACPANYPTNWT +E  NRPS  TCP+YFRWIHEDLRPWARTGI+RA +EAG+RTANFRLVILNGKAYVET+ KSFQ+RDTFTVWG
Subjt:  EIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWG

Query:  ILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG
        ILQLLR YPGKV DL+LMFDCVDWPVIL++HFSGP+GP PPPLFRYCGDD TLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRI WKSREPYAYWKG
Subjt:  ILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG

Query:  NPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVK
        NPEVA+TRKDLLKCNVSDQQDWNARVFAQDW KESQ GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTL+VKPHYYDFFTRGLMPVHHYWPVK
Subjt:  NPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVK

Query:  SDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSP
         DDKC+SIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKP++P  AIELCSEAMACPAEGLT+KFMTESLVK PA+S+P
Subjt:  SDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSP

Query:  CTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP
        CTMPPPYDPASLHFVL RKENSIKQVEKWET+FW+TQSKQP
Subjt:  CTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP

A0A6J1FZ46 protein O-glucosyltransferase 1 isoform X18.4e-27283.46Show/hide
Query:  MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKG----SEIPKNPHRRRHVEIPLDCTSFNNVTGG
        MR+A F QRFS+Y SWVSRHF+DHL KP LKSPARFSL  FFSL LL GAF+STRLL+      N +G    S+IPK P RRR VE PLDCT FNNV GG
Subjt:  MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKG----SEIPKNPHRRRHVEIPLDCTSFNNVTGG

Query:  ACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDL
         CPA+YPTNWT EEDPN P P TCPDYFRWIHEDLRPW RTGITRAT+EAG+RTANFRL I+NGKAYV+T+ KSFQ+RDTFTVWGILQLLR YPGKVPDL
Subjt:  ACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDL

Query:  ELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCN
        ELMFDCVDWPVILT+HF+GPNGPAPPPLFRYCGDD+TLDIVFPDWSFWGWPEINIKPWEPLLKDL EGNK+IPWKSREPYAYWKGNPEVAETRKDLLKCN
Subjt:  ELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCN

Query:  VSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWG
        VSDQ+DWNARVFAQDWMKESQ+GYKQSDLA QC+H+YKIYIEGSAWSVSEKYILACDS+TLLVKP YYDFFTRGLMP+HHYWPVK+DDKCRSIKFAVDWG
Subjt:  VSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWG

Query:  NSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFV
        NSH+QKAQAIGKAA+SFIQEELKM+YVYDYMFHLL++YSKLLTFKPSIP  AIELCSEAMACPA+GLT++FMTESLVKSPA++SPCT+PPPYDPASL FV
Subjt:  NSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFV

Query:  LSRKENSIKQVEKWETAFWDTQSKQP
         S K++SIKQVE+WET    T+SKQP
Subjt:  LSRKENSIKQVEKWETAFWDTQSKQP

A0A6J1HZQ4 protein O-glucosyltransferase 1 isoform X14.6e-27083.08Show/hide
Query:  MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKG----SEIPKNPHRRRHVEIPLDCTSFNNVTGG
        MR+A F QRFS+Y +WVSRHF+DHL KP LKSPARFSL  FFSL LL GAF+STRLL+  T   N +G    S+IPK P RRR VE PLDCTSFN+V  G
Subjt:  MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKG----SEIPKNPHRRRHVEIPLDCTSFNNVTGG

Query:  ACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDL
         CPA+YPTNWT EEDPN P P TCPDYFRWIHEDLRPWARTGITRAT+EAG+RTANFRL I+NGKAYV+T+ KSFQ+RDTFTVWGILQLLR YPGKVPDL
Subjt:  ACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDL

Query:  ELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCN
        ELMFDCVDWPVILT+HFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDL EGNK+IPWKSREPYAYWKGNPEVAETRKDLLKCN
Subjt:  ELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCN

Query:  VSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWG
        VSDQQDWNARVF QDWMKESQ+GYKQSDLA QC+H+YKIYIEGSAWSVSEKYILACDSVTLLVKP YYDFFTRGL+P+HHYWPVK+DDKCRSIKFAVDWG
Subjt:  VSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWG

Query:  NSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFV
        NSH+Q+AQAIGKAA+SFIQEELKM+YVYDYMFHLL++YSKLLTFKP+IP  AIELCSEAMACPA+GLT++ MTESLV+SPA++SPCT+PPPYDPASL FV
Subjt:  NSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFV

Query:  LSRKENSIKQVEKWETAFWDTQSKQP
         S K++SIKQVE+WET    T+SKQP
Subjt:  LSRKENSIKQVEKWETAFWDTQSKQP

SwissProt top hitse value%identityAlignment
B0X1Q4 O-glucosyltransferase rumi homolog1.1e-2325.14Show/hide
Query:  CPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTF---TVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGP
        C  +   +  DLRP+ R+GIT+  +E  +           G  Y     + F+ RD        G+   +R    K+PD+EL+ +C DWP I + H++  
Subjt:  CPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTF---TVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGP

Query:  NGPAPPPLFRYCGDDATLDIVFPDWSFW-GWPEINIKP-----WEPLLKDLKEGNKRIPWKSREPYAYWKGNPE---------VAETRKDLLKCNVSDQQ
          P P   F    D   LDI++P W FW G P I++ P     W+     +++  K  PW+ +   A+++G+           ++  R +L+    +  Q
Subjt:  NGPAPPPLFRYCGDDATLDIVFPDWSFW-GWPEINIKP-----WEPLLKDLKEGNKRIPWKSREPYAYWKGNPE---------VAETRKDLLKCNVSDQQ

Query:  DWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWGNSHKQ
         W +    +D +       ++  L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PV        ++  + +   H Q
Subjt:  DWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWGNSHKQ

Query:  KAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIEL
         AQ I       I   L+M+ V  Y   LL  Y KL+ ++     + +E+
Subjt:  KAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIEL

G3V9D0 Protein O-glucosyltransferase 13.3e-2327.25Show/hide
Query:  SPPTCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFS
        S   C  Y   I EDL P+ R GI+R  + E  +R       I+  + + E     F SR +     IL+++R    ++PD+E++ +  D+P +      
Subjt:  SPPTCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFS

Query:  GPNGPAPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG-------NPEVAETRKD--LLKCNV
         P    P  P+F +       DI++P W+FW      WP     +  W+   +DL     + PW+ +   AY++G       +P +  +RK+  L+    
Subjt:  GPNGPAPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG-------NPEVAETRKD--LLKCNV

Query:  SDQQDWNARVFAQDWMKES--QRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDW
        +  Q W +       MK++  +   K   L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PVK+D     ++  + +
Subjt:  SDQQDWNARVFAQDWMKES--QRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDW

Query:  GNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF
          ++   AQ I K  S FI   L+MD +  Y  +LL+EYSK L++
Subjt:  GNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF

Q5E9Q1 Protein O-glucosyltransferase 17.1e-2628.12Show/hide
Query:  SPPTCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFS
        S P C  Y   I EDL P+ R GI+R  + E  +R       I+  + Y E+    F SR +     IL+++    G++PD+E++ +  D+P +      
Subjt:  SPPTCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFS

Query:  GPNGPAPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG-------NPEVAETRKD--LLKCNV
         P    P  P+F +       DI++P W+FW      WP   + +  W+   +DL     + PWK +   AY++G       +P +  +RK+  L+    
Subjt:  GPNGPAPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG-------NPEVAETRKD--LLKCNV

Query:  SDQQDWNARVFAQDWMKES--QRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDW
        +  Q W +       MK++  +   K   L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PVK+D    +++  + +
Subjt:  SDQQDWNARVFAQDWMKES--QRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDW

Query:  GNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF
          ++   AQ I +  S FI   LKMD +  Y  +LL+EYSK L++
Subjt:  GNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF

Q8BYB9 Protein O-glucosyltransferase 17.4e-2326.96Show/hide
Query:  SPPTCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFS
        S   C  Y   I EDL P+ R GI+R  + E  +R       I+  + + E     F SR +     IL+++     ++PD+E++ +  D+P +      
Subjt:  SPPTCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFS

Query:  GPNGPAPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG-------NPEVAETRKD--LLKCNV
         P    P  P+F +       DI++P W+FW      WP     +  W+   +DL     + PW+ +   AY++G       +P +  +RK+  L+    
Subjt:  GPNGPAPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG-------NPEVAETRKD--LLKCNV

Query:  SDQQDWNARVFAQDWMKES--QRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDW
        +  Q W +       MK++  +   K   L + C +RY     G A S   K++  C S+   V   + +FF   L P  HY PVK+D    +++  + +
Subjt:  SDQQDWNARVFAQDWMKES--QRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDW

Query:  GNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF
          ++   AQ I K  S FI   L+MD +  Y  +LL++YSK L++
Subjt:  GNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF

Q8NBL1 Protein O-glucosyltransferase 17.9e-2527.83Show/hide
Query:  SPPTCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFS
        S   C  Y   I EDL P+ R GI+R  + E  +R       I   + Y E     F SR +     IL+++    G++PD+E++ +  D+P +      
Subjt:  SPPTCPDYFRWIHEDLRPWARTGITRATL-EAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFS

Query:  GPNGPAPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG-------NPEVAETRKD--LLKCNV
         P    P  P+F +       DI++P W+FW      WP     +  W+   +DL     + PWK +   AY++G       +P +  +RK+  L+    
Subjt:  GPNGPAPP-PLFRYCGDDATLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKG-------NPEVAETRKD--LLKCNV

Query:  SDQQDWNARVFAQDWMKES--QRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDW
        +  Q W +       MK++  +   K   L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PVK+D    +++  + +
Subjt:  SDQQDWNARVFAQDWMKES--QRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDW

Query:  GNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF
          ++   AQ I +  S FI+  L+MD +  Y  +LLSEYSK L++
Subjt:  GNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF

Arabidopsis top hitse value%identityAlignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)3.5e-16158.96Show/hide
Query:  LDCTSF-NNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGIL
        +DC+SF N    G+C     + +   +  +  S   CPDYF+WIHEDL+PW  TGIT+  +E GK TA+FRLVILNGK +VE + KS Q+RD FT+WGIL
Subjt:  LDCTSF-NNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGIL

Query:  QLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGP---APPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWK
        QLLR YPGK+PD++LMFDC D PVI +  ++  N     APPPLFRYCGD  T+DIVFPDWSFWGW EINI+ W  +LK+++EG K+  +  R+ YAYWK
Subjt:  QLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGP---APPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWK

Query:  GNPEVAE-TRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWP
        GNP VA  +R+DLL CN+S   DWNAR+F QDW+ E QRG++ S++ANQC +RYKIYIEG AWSVSEKYILACDSVTL+VKP+YYDFF+R L P+ HYWP
Subjt:  GNPEVAE-TRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWP

Query:  VKSDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAE-----GLTRKFMTESLVK
        ++  DKCRSIKFAVDW N+H QKAQ IG+ AS F+Q +L M+ VYDYMFHLL+EYSKLL +KP +P+ ++ELC+EA+ CP+E     G+ +KFM  SLV 
Subjt:  VKSDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAE-----GLTRKFMTESLVK

Query:  SPADSSPCTMPPPYDPASLHFVLSRKENSIKQVEKWETAFW
         P  S PC++PPP+D   L     +K N I+QVEKWE ++W
Subjt:  SPADSSPCTMPPPYDPASLHFVLSRKENSIKQVEKWETAFW

AT2G45830.1 downstream target of AGL15 21.3e-16054.53Show/hide
Query:  KSPARFSLFFFFSLLLLGGAFISTRLLNFTT-TADNLKGSEIPKNPHRRRHVEIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHE
        KS A+ +LF   SL +  G        +FTT T      + I K+P   +    P  C    N T    P N  +    +   +     TCP YFRWIHE
Subjt:  KSPARFSLFFFFSLLLLGGAFISTRLLNFTT-TADNLKGSEIPKNPHRRRHVEIPLDCTSFNNVTGGACPANYPTNWTAEEDPNRPSPPTCPDYFRWIHE

Query:  DLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCG
        DLRPW  TG+TR  LE  +RTA+FR+VIL+G+ YV+ + KS Q+RD FT+WGI+QLLRWYPG++PDLELMFD  D P + +  F G   PAPPPLFRYC 
Subjt:  DLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCG

Query:  DDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQC
        DDA+LDIVFPDWSFWGW E+NIKPW+  L  ++EGNK   WK R  YAYW+GNP VA TR+DLL+CNVS Q+DWN R++ QDW +ES+ G+K S+L NQC
Subjt:  DDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQC

Query:  LHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFH
         HRYKIYIEG AWSVSEKYI+ACDS+TL V+P +YDF+ RG+MP+ HYWP++   KC S+KFAV WGN+H  +A  IG+  S FI+EE+KM+YVYDYMFH
Subjt:  LHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFH

Query:  LLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFVLSRKENSIKQVEKWETAFW
        L++EY+KLL FKP IP  A E+  + M C A G  R FM ES+V  P++ SPC MP P++P  L  +L RK N  +QVE WE  ++
Subjt:  LLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFVLSRKENSIKQVEKWETAFW

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)2.7e-19861.25Show/hide
Query:  YTSWVSRHFADHLTKPLLKSPA----RFSLFFFFSLLLLGGAFISTRLL--------NFTTTADNLKGSEIPKNPHRRRHV-----EIPLDCTSFNNVTG
        YTS       D +  PL+K+      R   FF   L LL GAF+STRLL            +    + ++ P+ P   + +     E  L+C +F+    
Subjt:  YTSWVSRHFADHLTKPLLKSPA----RFSLFFFFSLLLLGGAFISTRLL--------NFTTTADNLKGSEIPKNPHRRRHV-----EIPLDCTSFNNVTG

Query:  GACPA-NYPTNW---TAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPG
        G CP  NYPT++     E + +R    TCPDYFRWIHEDLRPW +TGITR  LE    TA FRL I+NG+ YVE F ++FQ+RD FT+WG +QLLR YPG
Subjt:  GACPA-NYPTNW---TAEEDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPG

Query:  KVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKD
        K+PDLELMFDCVDWPV+    F+G + P PPPLFRYC +D TLDIVFPDWS+WGW E+NIKPWE LLK+L+EGN+R  W  REPYAYWKGNP VAETR D
Subjt:  KVPDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKD

Query:  LLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKF
        L+KCN+S+  DW AR++ QDW+KES+ GYKQSDLA+QC HRYKIYIEGSAWSVSEKYILACDSVTL+VKPHYYDFFTRG+ P HHYWPVK DDKCRSIKF
Subjt:  LLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKF

Query:  AVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPA
        AVDWGN H +KAQ IGK AS F+Q+ELKMDYVYDYMFHLL +YSKLL FKP IP+ + ELCSEAMACP +G  RKFM ESLVK PA++ PC MPPPYDPA
Subjt:  AVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPA

Query:  SLHFVLSRKENSIKQVEKWETAFWDTQSK
        S + VL R++++  ++E+WE+ +W  Q+K
Subjt:  SLHFVLSRKENSIKQVEKWETAFWDTQSK

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821)2.8e-15860.29Show/hide
Query:  DPNRPS-PPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVIL
        +PN  S   TCP YFRWIHEDLRPW +TGITR  +E   RTA+FRLVI NGKAYV+ + KS Q+RD FT+WGILQLLRWYPGK+PDLELMFD  D PV+ 
Subjt:  DPNRPS-PPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVIL

Query:  TTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFA
        +  F G     PPP+FRYC DDA+LDIVFPDWSFWGW E+N+KPW   L+ +KEGN    WK R  YAYW+GNP V   R DLLKCN ++ ++WN R++ 
Subjt:  TTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFA

Query:  QDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWGNSHKQKAQAIGKA
        QDW KE++ G+K S+L NQC HRYKIYIEG AWSVSEKYI+ACDS+TL VKP +YDF+ RG+MP+ HYWP++ D KC S+KFAV WGN+H+ KA+ IG+ 
Subjt:  QDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWGNSHKQKAQAIGKA

Query:  ASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFVLSRKENSIKQVEK
         S FI+EE+ M YVYDYMFHLL EY+ LL FKP IP  A E+  ++M CPA    R F  ES++ SP++ SPC M PPYDP +L  VL RK N  +QVE 
Subjt:  ASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFVLSRKENSIKQVEK

Query:  WETAFWDTQSKQP
        WE  ++   + +P
Subjt:  WETAFWDTQSKQP

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)2.0e-20464.14Show/hide
Query:  SRHFADHLTKPLLK-----SPARFSLFFFFSLLLLGGAFISTRLL-----------NFTTTADNLKGSEIPKNPHRRRHV------EIPLDCTSFNNVTG
        SR + D +  P +K     SP R        +LL+ GAFISTRLL             TTT      +  PK P     +      E  L C++  N T 
Subjt:  SRHFADHLTKPLLK-----SPARFSLFFFFSLLLLGGAFISTRLL-----------NFTTTADNLKGSEIPKNPHRRRHV------EIPLDCTSFNNVTG

Query:  GACPAN-YPTNWTAE-EDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKV
         +CP+N YPT  + E +D N P   TCPDYFRWIHEDLRPW+RTGITR  LE  K+TA FRL I+ GK YVE F  +FQ+RD FT+WG LQLLR YPGK+
Subjt:  GACPAN-YPTNWTAE-EDPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKV

Query:  PDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLL
        PDLELMFDCVDWPV+  T F+G N P+PPPLFRYCG++ TLDIVFPDWSFWGW E+NIKPWE LLK+L+EGN+R  W +REPYAYWKGNP VAETR+DL+
Subjt:  PDLELMFDCVDWPVILTTHFSGPNGPAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLL

Query:  KCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAV
        KCNVS++ +WNAR++AQDW+KES+ GYKQSDLA+QC HRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGL+P HHYWPV+  DKCRSIKFAV
Subjt:  KCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCLHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAV

Query:  DWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASL
        DWGNSH QKAQ IGKAAS FIQ++LKMDYVYDYM+HLL+EYSKLL FKP IPR A+E+CSE MAC   G  RKFMTESLVK PADS PC MPPPYDPA+ 
Subjt:  DWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASL

Query:  HFVLSRKENSIKQVEKWETAFWDTQSK
        + V+ RK+++  ++ +WE  +W  Q++
Subjt:  HFVLSRKENSIKQVEKWETAFWDTQSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGAAGCCAGTTTCCAGCAGAGGTTTTCAGATTACACCTCCTGGGTTTCTCGCCATTTCGCAGATCATCTCACGAAGCCACTTCTCAAGTCTCCGGCTAGATTCTC
TCTCTTCTTCTTCTTCTCTCTCTTACTCCTCGGCGGCGCGTTCATCTCCACGCGCCTCCTCAATTTCACTACGACAGCGGATAATTTAAAAGGGAGCGAAATTCCCAAAA
ACCCACATCGACGACGACACGTGGAAATCCCACTGGATTGCACGTCCTTCAATAACGTCACAGGAGGAGCCTGCCCTGCCAACTACCCAACCAATTGGACCGCCGAGGAA
GATCCGAACCGTCCATCGCCACCCACGTGTCCGGATTATTTCCGTTGGATCCACGAGGACCTGAGGCCGTGGGCCCGGACTGGGATCACGAGGGCCACGCTGGAGGCCGG
CAAACGGACGGCGAATTTCCGGCTGGTGATTCTGAACGGAAAAGCTTACGTGGAGACTTTCACAAAATCTTTTCAGTCGAGAGATACTTTTACGGTGTGGGGGATTCTAC
AGTTGTTACGGTGGTACCCTGGGAAAGTTCCTGATTTGGAGCTCATGTTTGATTGCGTTGACTGGCCTGTGATTTTGACCACCCATTTTAGTGGGCCCAATGGGCCGGCC
CCACCTCCTTTGTTTCGTTATTGTGGGGATGATGCCACGTTGGACATTGTTTTTCCTGATTGGTCCTTCTGGGGATGGCCAGAGATCAATATAAAGCCATGGGAGCCGTT
GTTGAAGGATCTAAAAGAAGGGAACAAAAGGATCCCATGGAAGAGTAGAGAGCCTTATGCTTATTGGAAGGGAAATCCAGAGGTCGCTGAAACCCGAAAAGATCTGCTTA
AATGCAATGTCTCCGACCAACAAGATTGGAATGCTCGTGTATTCGCTCAGGATTGGATGAAAGAATCCCAGCGAGGATACAAGCAATCAGATCTTGCAAACCAATGTCTC
CATAGGTACAAAATTTATATAGAAGGATCAGCTTGGTCTGTTAGTGAAAAGTACATTCTTGCTTGTGATTCGGTTACGTTACTCGTAAAGCCTCATTACTACGACTTCTT
CACGAGAGGTTTGATGCCGGTGCACCATTATTGGCCCGTAAAAAGCGACGACAAGTGCAGGTCTATAAAATTTGCAGTTGATTGGGGCAACAGCCACAAGCAAAAGGCGC
AGGCCATTGGCAAGGCAGCAAGCAGTTTCATCCAAGAGGAGCTGAAGATGGACTATGTGTATGACTACATGTTTCATCTTCTAAGTGAATATTCTAAACTCCTAACATTC
AAGCCGAGCATACCGCGCCAGGCGATCGAGCTTTGTTCGGAGGCCATGGCTTGTCCAGCCGAAGGGCTCACCCGAAAATTCATGACAGAATCATTGGTGAAGAGCCCTGC
AGATTCGAGTCCGTGCACGATGCCGCCGCCGTATGATCCGGCATCGCTTCATTTTGTTCTTAGTAGAAAAGAGAATTCAATCAAACAAGTAGAAAAATGGGAGACAGCTT
TCTGGGATACTCAAAGTAAGCAGCCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGGAAGCCAGTTTCCAGCAGAGGTTTTCAGATTACACCTCCTGGGTTTCTCGCCATTTCGCAGATCATCTCACGAAGCCACTTCTCAAGTCTCCGGCTAGATTCTC
TCTCTTCTTCTTCTTCTCTCTCTTACTCCTCGGCGGCGCGTTCATCTCCACGCGCCTCCTCAATTTCACTACGACAGCGGATAATTTAAAAGGGAGCGAAATTCCCAAAA
ACCCACATCGACGACGACACGTGGAAATCCCACTGGATTGCACGTCCTTCAATAACGTCACAGGAGGAGCCTGCCCTGCCAACTACCCAACCAATTGGACCGCCGAGGAA
GATCCGAACCGTCCATCGCCACCCACGTGTCCGGATTATTTCCGTTGGATCCACGAGGACCTGAGGCCGTGGGCCCGGACTGGGATCACGAGGGCCACGCTGGAGGCCGG
CAAACGGACGGCGAATTTCCGGCTGGTGATTCTGAACGGAAAAGCTTACGTGGAGACTTTCACAAAATCTTTTCAGTCGAGAGATACTTTTACGGTGTGGGGGATTCTAC
AGTTGTTACGGTGGTACCCTGGGAAAGTTCCTGATTTGGAGCTCATGTTTGATTGCGTTGACTGGCCTGTGATTTTGACCACCCATTTTAGTGGGCCCAATGGGCCGGCC
CCACCTCCTTTGTTTCGTTATTGTGGGGATGATGCCACGTTGGACATTGTTTTTCCTGATTGGTCCTTCTGGGGATGGCCAGAGATCAATATAAAGCCATGGGAGCCGTT
GTTGAAGGATCTAAAAGAAGGGAACAAAAGGATCCCATGGAAGAGTAGAGAGCCTTATGCTTATTGGAAGGGAAATCCAGAGGTCGCTGAAACCCGAAAAGATCTGCTTA
AATGCAATGTCTCCGACCAACAAGATTGGAATGCTCGTGTATTCGCTCAGGATTGGATGAAAGAATCCCAGCGAGGATACAAGCAATCAGATCTTGCAAACCAATGTCTC
CATAGGTACAAAATTTATATAGAAGGATCAGCTTGGTCTGTTAGTGAAAAGTACATTCTTGCTTGTGATTCGGTTACGTTACTCGTAAAGCCTCATTACTACGACTTCTT
CACGAGAGGTTTGATGCCGGTGCACCATTATTGGCCCGTAAAAAGCGACGACAAGTGCAGGTCTATAAAATTTGCAGTTGATTGGGGCAACAGCCACAAGCAAAAGGCGC
AGGCCATTGGCAAGGCAGCAAGCAGTTTCATCCAAGAGGAGCTGAAGATGGACTATGTGTATGACTACATGTTTCATCTTCTAAGTGAATATTCTAAACTCCTAACATTC
AAGCCGAGCATACCGCGCCAGGCGATCGAGCTTTGTTCGGAGGCCATGGCTTGTCCAGCCGAAGGGCTCACCCGAAAATTCATGACAGAATCATTGGTGAAGAGCCCTGC
AGATTCGAGTCCGTGCACGATGCCGCCGCCGTATGATCCGGCATCGCTTCATTTTGTTCTTAGTAGAAAAGAGAATTCAATCAAACAAGTAGAAAAATGGGAGACAGCTT
TCTGGGATACTCAAAGTAAGCAGCCATAG
Protein sequenceShow/hide protein sequence
MREASFQQRFSDYTSWVSRHFADHLTKPLLKSPARFSLFFFFSLLLLGGAFISTRLLNFTTTADNLKGSEIPKNPHRRRHVEIPLDCTSFNNVTGGACPANYPTNWTAEE
DPNRPSPPTCPDYFRWIHEDLRPWARTGITRATLEAGKRTANFRLVILNGKAYVETFTKSFQSRDTFTVWGILQLLRWYPGKVPDLELMFDCVDWPVILTTHFSGPNGPA
PPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRIPWKSREPYAYWKGNPEVAETRKDLLKCNVSDQQDWNARVFAQDWMKESQRGYKQSDLANQCL
HRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLMPVHHYWPVKSDDKCRSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTF
KPSIPRQAIELCSEAMACPAEGLTRKFMTESLVKSPADSSPCTMPPPYDPASLHFVLSRKENSIKQVEKWETAFWDTQSKQP