CuGenDBv2

IVF0008115 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0008115
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: protein O-glucosyltransferase 1-like
Genome location: chr06:64072..67621
RNA-Seq Expression: IVF0008115
Synteny: IVF0008115
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR006598 - Glycosyl transferase CAP10 domain
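
The genome location field above packs chromosome, start, and end into a single string. A minimal sketch of splitting it and deriving the span length follows; the function names are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Interval length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr06:64072..67621")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turned out to be 0-based or half-open, only `span_length` would need to change.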


Homology
GenBank top hits    e value    % identity    Alignment
KAA0033653.1 O-glucosyltransferase rumi-like protein [Cucumis melo var. makuwa]    0.0    95.71
Query:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSCS-SLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALD
        MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSL     SL   +    S R   S      LTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALD
Subjt:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSCS-SLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALD

Query:  CTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLL
        CTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLL
Subjt:  CTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLL

Query:  RRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVA
        RRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVA
Subjt:  RRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVA

Query:  DTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKC
        DTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKC
Subjt:  DTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKC

Query:  KSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPP
        KSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPP
Subjt:  KSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPP

Query:  PYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
        PYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
Subjt:  PYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP

XP_004140839.1 protein O-glucosyltransferase 1 [Cucumis sativus]    0.0    88.29
Query:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSCS-SLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRR-QVEFAL
        MREGS  SF NRFSHYA F DHIFKPFIKSPATFSL     SL   +    S R   S      LTIKGSGKSQYYP +TS+VP NPNH+ RR QVEF L
Subjt:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSCS-SLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRR-QVEFAL

Query:  DCTSFNNITGGACPANYPTNWTTDEHENRPSSTT-CPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQ
         C SFNNIT GACPA+YPTNWTTDE +N PSS++ CP+YFRWIHEDLRPWARTGI+RA +EAGQRTANFRL+ILNGKAYVETYKKSFQTRDTFTVWGILQ
Subjt:  DCTSFNNITGGACPANYPTNWTTDEHENRPSSTT-CPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQ

Query:  LLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPE
        LLRRYPGKV DLDLMFDCVDWPVIL+SHFSGP+GPTPPPLFRYCGDD T DIVFPDWSFWGWPEINIKPWEPLLKD+KEGNKRI WKSREPYAYWKGNPE
Subjt:  LLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPE

Query:  VADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDD
        VADTRKDL+KCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDD
Subjt:  VADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDD

Query:  KCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTM
        KCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKPT+PP AIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTM
Subjt:  KCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTM

Query:  PPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
        PPPYDPASLHFVL RKENSIKQVEKWETSFWNTQSKQP
Subjt:  PPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP

XP_008439228.1 PREDICTED: O-glucosyltransferase rumi homolog [Cucumis melo]    0.0    94.97
Query:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSCS-SLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRR-QVEFAL
        MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSL     SL   +    S R   S      LTIKGSGKSQYYPNDTSEVPENPNHRRRR QVEFAL
Subjt:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSCS-SLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRR-QVEFAL

Query:  DCTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQL
        DCTSFNNITGGACPANYPTN TTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQL
Subjt:  DCTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQL

Query:  LRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEV
        LRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEV
Subjt:  LRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEV

Query:  ADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDK
        ADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDK
Subjt:  ADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDK

Query:  CKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMP
        CKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMP
Subjt:  CKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMP

Query:  PPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
        PPYDPASLHFVLRRKENSIKQVEKWETSFWNT+SKQP
Subjt:  PPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP

XP_023552264.1 protein O-glucosyltransferase 1-like isoform X1 [Cucurbita pepo subsp. pepo]    6.34e-313    78.2
Query:  SFSDH-IFKPFIKSPATFSL------CSCSSLSSFSPASSSPRASFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALDCTSFNNITGGACP
        SFSDH +FKPF+KSPA FSL         S+L  +SP ++   +  I      IKGS K   YP+ +SE+P+ P+  R+RQV+F LDCTSFNN+T GACP
Subjt:  SFSDH-IFKPFIKSPATFSL------CSCSSLSSFSPASSSPRASFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALDCTSFNNITGGACP

Query:  ANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLM
        A YPT WT +E  N P S+TCPEYFRWIHEDLRPWA+TGI+RA++EA ++TANFRLVI+NG AYVETY+KSFQTRDTFT+WGILQLLRRYPGKV DL++M
Subjt:  ANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLM

Query:  FDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNVSD
        FDCVDWPVIL+++FS P+GP PPPLFRYCG+D TLD+VFPDWSFWGW EINIKPWE LLKDLKEGNKR  WK+RE YAYWKGNPEVA+TRKDLLKCNVSD
Subjt:  FDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNVSD

Query:  QQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSH
        QQDWNARVFAQDW KESQ+GYK+SDL+NQCLHRYKIYIEGSAWSVSEKYILACDSV LIVKPHYYDFFTRGLMP+HHYWPVKDDDKCKSIKFAVDWGNSH
Subjt:  QQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSH

Query:  KQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPPPYDPASLHFVLRR
        KQKA+ IGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKPT+P  AIELCSEAMACPAEGLTKKFM ESLVK PA+S PC MPPPYDPASLH VLRR
Subjt:  KQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPPPYDPASLHFVLRR

Query:  KENSIKQVEKWETSFWNTQSKQP
        KE+SIKQVE+WE +FW+ QS+QP
Subjt:  KENSIKQVEKWETSFWNTQSKQP

XP_038898817.1 protein O-glucosyltransferase 1-like [Benincasa hispida]    0.0    82.75
Query:  MREGSSSSFLNRFSHYAS-----FSDH-IFKPFIKSPATFSLC----SCSSLSS--FSPASSSPRASFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRR
        MRE    SF  RFSHYAS     FSDH + KPF+KSPATFSL     S   L+   FS        +   L   TIKGS K+Q+YPN+TS++PENPNHRR
Subjt:  MREGSSSSFLNRFSHYAS-----FSDH-IFKPFIKSPATFSLC----SCSSLSS--FSPASSSPRASFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRR

Query:  RRQVEFALDCTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTF
          QVEF LDCTSFNNIT G CP NYPT WT +E  +RPSS TCP++FRWIHEDL PWARTGI+RA +EAG+RTANFRLVILNGKAYVETYKKSFQTRDTF
Subjt:  RRQVEFALDCTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTF

Query:  TVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYA
        TVWGILQLLRRYPGKV DL+LMFDCVDWPVIL+SHFSGP+GPTPPPLFRYCGDD TLDIVFPDWSFWGWPEINIKPWE LLKDLKEGNKRI WK RE YA
Subjt:  TVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYA

Query:  YWKGNPEVADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHY
        YWKGNPEVA+TRKDLLKCNVSDQQDWN RVFAQDW KESQ+GYKQSDL+NQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHY
Subjt:  YWKGNPEVADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHY

Query:  WPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPA
        WPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKP +P  AI+LCSEAMACPAEGLTKKFM +SLVKRPA
Subjt:  WPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPA

Query:  ESNPCTMPPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
        +S+PC MPPPYDPASLHFVL RKENSIKQVEKWETSFWNTQSKQP
Subjt:  ESNPCTMPPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
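
The % identity column can be related to the alignment blocks by counting matching columns. The sketch below does this for one block, assuming the usual BLAST-style midline convention (a letter marks an identical residue, `+` a similar one, a blank a mismatch or gap); that convention and the function name are assumptions, not something this page documents. The example strings are the final Query and midline block of the XP_004140839.1 hit above.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical_columns, total_columns) for one alignment block.

    ASSUMPTION: the midline repeats the residue letter where query and
    subject are identical; '+' and blanks do not count as identities.
    """
    # Trailing blanks are often stripped from the midline, so pad it back.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if q.isalpha() and m == q
    )
    return identical, len(query)


query = "PPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP"
midline = "PPPYDPASLHFVL RKENSIKQVEKWETSFWNTQSKQP"

identical, columns = block_identity(query, midline)
print(f"{identical}/{columns} identical columns")
```

The reported percentage for a hit covers the whole alignment, so the per-block counts would need to be summed over every block before dividing; a single block will generally not reproduce the figure in the table.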

TrEMBL top hits    e value    % identity    Alignment
A0A0A0L5W3 CAP10 domain-containing protein    3.0e-285    88.29
Query:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSC-SSLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHR-RRRQVEFAL
        MREGS  SF NRFSHYA F DHIFKPFIKSPATFSL     SL   +    S R   S      LTIKGSGKSQYYP +TS+VP NPNH+ RR QVEF L
Subjt:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSC-SSLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHR-RRRQVEFAL

Query:  DCTSFNNITGGACPANYPTNWTTDEHENRPSSTT-CPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQ
         C SFNNIT GACPA+YPTNWTTDE +N PSS++ CP+YFRWIHEDLRPWARTGI+RA +EAGQRTANFRL+ILNGKAYVETYKKSFQTRDTFTVWGILQ
Subjt:  DCTSFNNITGGACPANYPTNWTTDEHENRPSSTT-CPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQ

Query:  LLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPE
        LLRRYPGKV DLDLMFDCVDWPVIL+SHFSGP+GPTPPPLFRYCGDD T DIVFPDWSFWGWPEINIKPWEPLLKD+KEGNKRI WKSREPYAYWKGNPE
Subjt:  LLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPE

Query:  VADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDD
        VADTRKDL+KCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDD
Subjt:  VADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDD

Query:  KCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTM
        KCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKPT+PP AIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTM
Subjt:  KCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTM

Query:  PPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
        PPPYDPASLHFVL RKENSIKQVEKWETSFWNTQSKQP
Subjt:  PPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP

A0A1S3AYX8 O-glucosyltransferase rumi homolog    5.9e-305    94.97
Query:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSC-SSLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNH-RRRRQVEFAL
        MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSL     SL   +    S R   S      LTIKGSGKSQYYPNDTSEVPENPNH RRRRQVEFAL
Subjt:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSC-SSLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNH-RRRRQVEFAL

Query:  DCTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQL
        DCTSFNNITGGACPANYPTN TTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQL
Subjt:  DCTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQL

Query:  LRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEV
        LRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEV
Subjt:  LRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEV

Query:  ADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDK
        ADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDK
Subjt:  ADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDK

Query:  CKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMP
        CKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMP
Subjt:  CKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMP

Query:  PPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
        PPYDPASLHFVLRRKENSIKQVEKWETSFWNT+SKQP
Subjt:  PPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP

A0A5D3DHB1 O-glucosyltransferase rumi-like protein    8.7e-309    95.71
Query:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSC-SSLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALD
        MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSL     SL   +    S R   S      LTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALD
Subjt:  MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSC-SSLSSFSPASSSPRA--SFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALD

Query:  CTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLL
        CTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLL
Subjt:  CTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLL

Query:  RRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVA
        RRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVA
Subjt:  RRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVA

Query:  DTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKC
        DTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKC
Subjt:  DTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKC

Query:  KSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPP
        KSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPP
Subjt:  KSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPP

Query:  PYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
        PYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
Subjt:  PYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP

A0A6J1E8T3 protein O-glucosyltransferase 1-like isoform X1    2.3e-248    76.95
Query:  SSFLNRF----SHYASFSDH-IFKPFIKSPATFSL------CSCSSLSSFSPASSSPRASFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFA
        SSF  R     S + SFSDH +FKPF+KSPA FSL         S+L  +SP ++   +  +      IKGS K   YP+ TSE+P+ P+  R+RQV+F 
Subjt:  SSFLNRF----SHYASFSDH-IFKPFIKSPATFSL------CSCSSLSSFSPASSSPRASFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFA

Query:  LDCTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQ
        LDCTSFNN+T GACPA YPT WT +E  N P S+TCPEYFRWIHEDLRPWA+TGI+RA++EA ++TANFRLVI+NG AYVETY+KSFQTRDTFT+WGILQ
Subjt:  LDCTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQ

Query:  LLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPE
        LLRRYPGKV DL+LMFDCVDWPVIL+++FS P+GP+PPPLFRYCG+D TLD+VFPDWSFWGW EINIKPWE LLKDLKEGNKR  WK+RE YAYWKGNPE
Subjt:  LLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPE

Query:  VADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDD
        VA+TRKDLLKCNVSDQQDWNARVFAQDW KESQ+GYK+SDL+NQCLHRYKIYIEGSAWSVSEKYILACDSV LIVKPHYYDFFTRGLMP+HHYWPVKDDD
Subjt:  VADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDD

Query:  KCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTM
        KCKSIKFAVDWGNSHK KA+AIGKAASSFI EELKMDYVYDYMFHLLS+YSKLLTFKPT+P  AIELCSE MACPAEGLTKKFM ESLVK PA+S PC M
Subjt:  KCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTM

Query:  PPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
        PPPYDPASLH VLRRKENSIKQVE+WE +FW+ QS+QP
Subjt:  PPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP

A0A6J1J4W8 protein O-glucosyltransferase 1-like isoform X1    1.9e-247    77.51
Query:  SSFLNRF----SHYASFSDH-IFKPFIKSPATFSL------CSCSSLSSFSPASSSPRASFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFA
        SSF  R     S + SFSDH +FKPF+KSPA FSL         S+L   SP ++   +  I      IKGS KS+ YP+ TSE+P+ P+  R+RQV+F 
Subjt:  SSFLNRF----SHYASFSDH-IFKPFIKSPATFSL------CSCSSLSSFSPASSSPRASFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFA

Query:  LDCTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQ
        LDCTSFNN T GACPA YPT WT +E  N P S+TCPEYFRWIHEDLRPWA+TGI+RA++EA ++TANFRLVI+NG AYVETY+KSFQTRDTFT+WGILQ
Subjt:  LDCTSFNNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQ

Query:  LLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPE
        LLRRYPGKV DL+LMFDCVDWPVIL+++FS P+GP  PPLFRYCG+D TLD+VFPDWSFWGW EINIKPWE LLKDLKEGNKR  WK+RE YAYWKGNPE
Subjt:  LLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPE

Query:  VADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDD
        VA+TRKDLLKCNVSDQQDWNARVFAQDW KESQ+GYK+SDL+NQCLHRYKIYIEGSAWSVSEKYILACDSV LIVKPHYYDFFTRGLMP+HHYWPVKDDD
Subjt:  VADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDD

Query:  KCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTM
        KCKSIKFAVDWGNSHKQKA+AIGKAASSFIQEELKMDYVYDYMFHLLS+YSKLLTFKPT+P  AI+LCSEAMACPAEGLTKKFM ESLVK PA+S PCTM
Subjt:  KCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTM

Query:  PPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP
        PPPYDPASLH VLRRKENSIKQVE+WE++ W+ QS+QP
Subjt:  PPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP

SwissProt top hits    e value    % identity    Alignment
G3V9D0 Protein O-glucosyltransferase 1    1.2e-20    26.59
Query:  SSTTCPEYFRWIHEDLRPWARTGISRAAV-EAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFS
        SS  C  Y   I EDL P+ R GISR  + E  +R       I+  + + E     F +R +     IL+++RR P    D++++ +  D+P +      
Subjt:  SSTTCPEYFRWIHEDLRPWARTGISRAAV-EAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFS

Query:  GPDGPTPP-PLFRYCGDDPTLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRILWKSREPYAYWKG-------NPEVADTRKD--LLKCNV
         P    P  P+F +       DI++P W+FW      WP     +  W+   +DL     +  W+ +   AY++G       +P +  +RK+  L+    
Subjt:  GPDGPTPP-PLFRYCGDDPTLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRILWKSREPYAYWKG-------NPEVADTRKD--LLKCNV

Query:  SDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGN
        +  Q W +           +   K   L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PVK D     ++  + +  
Subjt:  SDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGN

Query:  SHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPT
        ++   AQ I K  S FI   L+MD +  Y  +LL++YSK L++  T
Subjt:  SHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPT

Q29AU6 O-glucosyltransferase rumi    3.0e-19    24.63
Query:  SSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFSG
        +   C  +   I  DL P+  TG+SR  +E+  R    R  I   + Y E     F  R      GI   L      + D+DL+ +  D+P I  +  +G
Subjt:  SSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFSG

Query:  PDGPTPPPLFRYCGDDPTLDIVFPDWSFW-GWPEINIKP-----WEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNVSDQQDWNARVFA
          G    P+  +       DI++P W+FW G P   + P     W+ + + L++    I W  +    +++G+   +D R  L+  +  + +   A+   
Subjt:  PDGPTPPPLFRYCGDDPTLDIVFPDWSFW-GWPEINIKP-----WEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNVSDQQDWNARVFA

Query:  -QDW--TKESQEGYKQSDLS--NQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQ
         Q W   K++ +     ++S  + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY P+K+    +  +  + +   +   AQ
Subjt:  -QDW--TKESQEGYKQSDLS--NQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQ

Query:  AIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFK
         I +    FI + L+M  +  Y   LL  Y KLLT++
Subjt:  AIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFK

Q5E9Q1 Protein O-glucosyltransferase 1    2.4e-21    26.88
Query:  SSTTCPEYFRWIHEDLRPWARTGISRAAV-EAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFS
        SS  C  Y   I EDL P+ R GISR  + E  +R       I+  + Y E+    F +R +     IL+++ R P    D++++ +  D+P +      
Subjt:  SSTTCPEYFRWIHEDLRPWARTGISRAAV-EAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFS

Query:  GPDGPTPP-PLFRYCGDDPTLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRILWKSREPYAYWKG-------NPEVADTRKD--LLKCNV
         P    P  P+F +       DI++P W+FW      WP   + +  W+   +DL     +  WK +   AY++G       +P +  +RK+  L+    
Subjt:  GPDGPTPP-PLFRYCGDDPTLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRILWKSREPYAYWKG-------NPEVADTRKD--LLKCNV

Query:  SDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGN
        +  Q W +           +   K   L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PVK D    +++  + +  
Subjt:  SDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGN

Query:  SHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPT
        ++   AQ I +  S FI   LKMD +  Y  +LL++YSK L++  T
Subjt:  SHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPT

Q8BYB9 Protein O-glucosyltransferase 1    2.1e-20    26.59
Query:  SSTTCPEYFRWIHEDLRPWARTGISRAAV-EAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFS
        SS  C  Y   I EDL P+ R GISR  + E  +R       I+  + + E     F +R +     IL+++ R P    D++++ +  D+P +      
Subjt:  SSTTCPEYFRWIHEDLRPWARTGISRAAV-EAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFS

Query:  GPDGPTPP-PLFRYCGDDPTLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRILWKSREPYAYWKG-------NPEVADTRKD--LLKCNV
         P    P  P+F +       DI++P W+FW      WP     +  W+   +DL     +  W+ +   AY++G       +P +  +RK+  L+    
Subjt:  GPDGPTPP-PLFRYCGDDPTLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRILWKSREPYAYWKG-------NPEVADTRKD--LLKCNV

Query:  SDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGN
        +  Q W +           +   K   L + C +RY     G A S   K++  C S+   V   + +FF   L P  HY PVK D    +++  + +  
Subjt:  SDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGN

Query:  SHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPT
        ++   AQ I K  S FI   L+MD +  Y  +LL+ YSK L++  T
Subjt:  SHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPT

Q8NBL1 Protein O-glucosyltransferase 1    3.2e-21    26.88
Query:  SSTTCPEYFRWIHEDLRPWARTGISRAAV-EAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFS
        SS  C  Y   I EDL P+ R GISR  + E  +R       I   + Y E     F +R +     IL+++ R P    D++++ +  D+P +      
Subjt:  SSTTCPEYFRWIHEDLRPWARTGISRAAV-EAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFS

Query:  GPDGPTPP-PLFRYCGDDPTLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRILWKSREPYAYWKG-------NPEVADTRKD--LLKCNV
         P    P  P+F +       DI++P W+FW      WP     +  W+   +DL     +  WK +   AY++G       +P +  +RK+  L+    
Subjt:  GPDGPTPP-PLFRYCGDDPTLDIVFPDWSFWG-----WP--EINIKPWEPLLKDLKEGNKRILWKSREPYAYWKG-------NPEVADTRKD--LLKCNV

Query:  SDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGN
        +  Q W +           +   K   L + C ++Y     G A S   K++  C S+   V   + +FF   L P  HY PVK D    +++  + +  
Subjt:  SDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGN

Query:  SHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPT
        ++   AQ I +  S FI+  L+MD +  Y  +LLS+YSK L++  T
Subjt:  SHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPT

Arabidopsis top hits    e value    % identity    Alignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)    5.5e-162    59.5
Query:  ALDCTSF-NNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGI
        ++DC+SF N    G+C     + +  ++ E   S+ +CP+YF+WIHEDL+PW  TGI++  VE G+ TA+FRLVILNGK +VE YKKS QTRD FT+WGI
Subjt:  ALDCTSF-NNITGGACPANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGI

Query:  LQLLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGP---TPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYW
        LQLLR+YPGK+ D+DLMFDC D PVI S  ++  +      PPPLFRYCGD  T+DIVFPDWSFWGW EINI+ W  +LK+++EG K+  +  R+ YAYW
Subjt:  LQLLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGP---TPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYW

Query:  KGNPEVAD-TRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYW
        KGNP VA  +R+DLL CN+S   DWNAR+F QDW  E Q G++ S+++NQC +RYKIYIEG AWSVSEKYILACDSVTL+VKP+YYDFF+R L P+ HYW
Subjt:  KGNPEVAD-TRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYW

Query:  PVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAE-----GLTKKFMTESLV
        P++D DKC+SIKFAVDW N+H QKAQ IG+ AS F+Q +L M+ VYDYMFHLL++YSKLL +KP VP  ++ELC+EA+ CP+E     G+ KKFM  SLV
Subjt:  PVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAE-----GLTKKFMTESLV

Query:  KRPAESNPCTMPPPYDPASLHFVLRRKENSIKQVEKWETSFW
         RP  S PC++PPP+D   L    R+K N I+QVEKWE S+W
Subjt:  KRPAESNPCTMPPPYDPASLHFVLRRKENSIKQVEKWETSFW

AT2G45830.1 downstream target of AGL15 2    4.1e-157    58
Query:  YPTNWTTDEHENRPSS----TTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLD
        +P N ++  ++   SS    +TCP YFRWIHEDLRPW  TG++R  +E  +RTA+FR+VIL+G+ YV+ Y+KS QTRD FT+WGI+QLLR YPG++ DL+
Subjt:  YPTNWTTDEHENRPSS----TTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLD

Query:  LMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNV
        LMFD  D P + S  F G   P PPPLFRYC DD +LDIVFPDWSFWGW E+NIKPW+  L  ++EGNK   WK R  YAYW+GNP VA TR+DLL+CNV
Subjt:  LMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNV

Query:  SDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGN
        S Q+DWN R++ QDW +ES+EG+K S+L NQC HRYKIYIEG AWSVSEKYI+ACDS+TL V+P +YDF+ RG+MP+ HYWP++D  KC S+KFAV WGN
Subjt:  SDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGN

Query:  SHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPPPYDPASLHFVL
        +H  +A  IG+  S FI+EE+KM+YVYDYMFHL+++Y+KLL FKP +P  A E+  + M C A G  + FM ES+V  P+E +PC MP P++P  L  +L
Subjt:  SHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPPPYDPASLHFVL

Query:  RRKENSIKQVEKWETSFWN
         RK N  +QVE WE  +++
Subjt:  RRKENSIKQVEKWETSFWN

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)    6.2e-198    64.71
Query:  VQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALDCTSFNNITGGACPA-NYPTNWTTDEHE---NRPSSTTCPEYFRWIHEDLRPWARTGISRAAV
        V +T + + +S  YP  T  + E P        EF L+C +F+    G CP  NYPT++ +   E   +R  S TCP+YFRWIHEDLRPW +TGI+R A+
Subjt:  VQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALDCTSFNNITGGACPA-NYPTNWTTDEHE---NRPSSTTCPEYFRWIHEDLRPWARTGISRAAV

Query:  EAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFW
        E    TA FRL I+NG+ YVE ++++FQTRD FT+WG +QLLRRYPGK+ DL+LMFDCVDWPV+ ++ F+G D P PPPLFRYC +D TLDIVFPDWS+W
Subjt:  EAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFW

Query:  GWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSV
        GW E+NIKPWE LLK+L+EGN+R  W  REPYAYWKGNP VA+TR DL+KCN+S+  DW AR++ QDW KES+EGYKQSDL++QC HRYKIYIEGSAWSV
Subjt:  GWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSV

Query:  SEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTV
        SEKYILACDSVTL+VKPHYYDFFTRG+ P HHYWPVK+DDKC+SIKFAVDWGN H +KAQ IGK AS F+Q+ELKMDYVYDYMFHLL QYSKLL FKP +
Subjt:  SEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTV

Query:  PPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSK
        P  + ELCSEAMACP +G  +KFM ESLVKRPAE+ PC MPPPYDPAS + VL+R++++  ++E+WE+ +W  Q+K
Subjt:  PPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSK

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821)    1.5e-156    59.76
Query:  NRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSH
        N   S+TCP YFRWIHEDLRPW +TGI+R  +E   RTA+FRLVI NGKAYV+ YKKS QTRD FT+WGILQLLR YPGK+ DL+LMFD  D PV+ S  
Subjt:  NRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSH

Query:  FSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNVSDQQDWNARVFAQDW
        F G     PPP+FRYC DD +LDIVFPDWSFWGW E+N+KPW   L+ +KEGN    WK R  YAYW+GNP V   R DLLKCN ++ ++WN R++ QDW
Subjt:  FSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNVSDQQDWNARVFAQDW

Query:  TKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASS
         KE++EG+K S+L NQC HRYKIYIEG AWSVSEKYI+ACDS+TL VKP +YDF+ RG+MP+ HYWP++DD KC S+KFAV WGN+H+ KA+ IG+  S 
Subjt:  TKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASS

Query:  FIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPPPYDPASLHFVLRRKENSIKQVEKWET
        FI+EE+ M YVYDYMFHLL +Y+ LL FKP +P  A E+  ++M CPA    + F  ES++  P+E +PC M PPYDP +L  VL RK N  +QVE WE 
Subjt:  FIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPPPYDPASLHFVLRRKENSIKQVEKWET

Query:  SFWNTQSKQP
         ++   + +P
Subjt:  SFWNTQSKQP

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)    1.3e-195    59.93
Query:  HYASFSDHIFKPFIKSPATFSLCSCSSLSS-----FSPASSSPRASFIPLVQLTIKG----SGKSQ------YYPNDTSEVPENPNHRRRRQVEFALDCT
        H  +++D I+ PF+KS    S     +L S        A  S R      V L  K     + K+Q       YP  T+ + ++P      + EF L C+
Subjt:  HYASFSDHIFKPFIKSPATFSLCSCSSLSS-----FSPASSSPRASFIPLVQLTIKG----SGKSQ------YYPNDTSEVPENPNHRRRRQVEFALDCT

Query:  SFNNITGGACPAN-YPTNWT-TDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLL
        +  N T  +CP+N YPT  +  D+  N P + TCP+YFRWIHEDLRPW+RTGI+R A+E  ++TA FRL I+ GK YVE ++ +FQTRD FT+WG LQLL
Subjt:  SFNNITGGACPAN-YPTNWT-TDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLL

Query:  RRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVA
        R+YPGK+ DL+LMFDCVDWPV+ ++ F+G + P+PPPLFRYCG++ TLDIVFPDWSFWGW E+NIKPWE LLK+L+EGN+R  W +REPYAYWKGNP VA
Subjt:  RRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVA

Query:  DTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKC
        +TR+DL+KCNVS++ +WNAR++AQDW KES+EGYKQSDL++QC HRYKIYIEGSAWSVSEKYILACDSVTL+VKPHYYDFFTRGL+P HHYWPV++ DKC
Subjt:  DTRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKC

Query:  KSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPP
        +SIKFAVDWGNSH QKAQ IGKAAS FIQ++LKMDYVYDYM+HLL++YSKLL FKP +P  A+E+CSE MAC   G  +KFMTESLVK+PA+S PC MPP
Subjt:  KSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPP

Query:  PYDPASLHFVLRRKENSIKQVEKWETSFWNTQSK
        PYDPA+ + V++RK+++  ++ +WE  +W+ Q++
Subjt:  PYDPASLHFVLRRKENSIKQVEKWETSFWNTQSK


Sequences
CDS sequence
ATGAGAGAAGGCTCCAGCAGTAGTTTCCTCAACAGGTTTTCGCATTACGCCTCTTTTTCCGATCATATATTCAAGCCGTTTATAAAATCTCCGGCCACTTTCTCTCTCTG
TTCTTGTTCTTCTCTCTCTTCCTTCTCGCCGGCATCTTCCTCTCCACGCGCCTCCTTCATTCCTCTCGTACAATTAACAATAAAAGGGAGCGGGAAATCCCAATATTACC
CTAACGACACTTCCGAAGTCCCAGAAAACCCAAACCACCGACGACGACGTCAAGTCGAATTCGCACTCGATTGTACTTCCTTCAATAACATCACAGGAGGAGCCTGCCCT
GCCAACTACCCCACCAATTGGACTACTGACGAACATGAGAACCGTCCATCTTCAACCACGTGCCCCGAGTACTTCCGTTGGATTCACGAGGACCTTAGACCGTGGGCCCG
GACGGGGATATCGAGGGCCGCGGTGGAGGCTGGGCAACGGACGGCGAATTTCCGGCTAGTGATTCTGAATGGGAAGGCTTACGTGGAGACTTATAAGAAGTCGTTTCAAA
CGAGAGATACTTTTACGGTGTGGGGGATTCTACAGCTGTTACGGAGGTACCCCGGAAAAGTGGCTGATTTGGATCTTATGTTTGATTGCGTTGATTGGCCTGTGATTTTG
AGCAGCCATTTTAGTGGGCCTGATGGGCCTACCCCACCTCCTTTGTTTCGTTATTGTGGAGATGATCCCACGTTGGATATTGTTTTTCCTGATTGGTCCTTCTGGGGATG
GCCAGAGATCAATATAAAGCCATGGGAGCCGTTGTTGAAGGATCTAAAAGAAGGGAATAAAAGGATTTTATGGAAGAGTAGAGAGCCTTATGCATACTGGAAAGGAAATC
CGGAGGTCGCCGACACCCGAAAAGATCTACTCAAATGCAATGTCTCTGACCAACAAGACTGGAATGCTCGTGTATTTGCTCAGGATTGGACGAAAGAATCCCAGGAGGGA
TACAAGCAATCAGATCTTTCAAACCAATGCCTTCATAGATATAAAATCTATATAGAAGGATCAGCTTGGTCTGTTAGTGAAAAGTACATTCTTGCTTGTGATTCCGTTAC
CTTAATTGTAAAGCCCCATTACTACGACTTCTTCACGAGAGGTTTGATGCCAGTGCACCACTATTGGCCTGTAAAGGATGACGACAAGTGCAAGTCTATAAAATTTGCAG
TTGATTGGGGCAACAGCCATAAGCAAAAGGCACAAGCCATTGGTAAAGCAGCTAGCAGTTTCATCCAAGAGGAGCTGAAGATGGACTATGTCTATGACTACATGTTCCAT
CTTCTAAGTCAATATTCCAAACTCCTTACTTTCAAGCCAACGGTACCGCCCACTGCAATCGAGCTTTGTTCTGAAGCCATGGCTTGTCCAGCTGAAGGGCTCACCAAGAA
ATTCATGACAGAGTCATTAGTGAAGAGGCCTGCAGAGTCGAACCCATGCACAATGCCTCCTCCATATGATCCCGCATCGCTTCATTTTGTTCTTAGAAGAAAAGAGAATT
CAATCAAACAAGTGGAAAAATGGGAGACAAGTTTCTGGAATACTCAAAGTAAGCAGCCATAG
mRNA sequence
ATGAGAGAAGGCTCCAGCAGTAGTTTCCTCAACAGGTTTTCGCATTACGCCTCTTTTTCCGATCATATATTCAAGCCGTTTATAAAATCTCCGGCCACTTTCTCTCTCTG
TTCTTGTTCTTCTCTCTCTTCCTTCTCGCCGGCATCTTCCTCTCCACGCGCCTCCTTCATTCCTCTCGTACAATTAACAATAAAAGGGAGCGGGAAATCCCAATATTACC
CTAACGACACTTCCGAAGTCCCAGAAAACCCAAACCACCGACGACGACGTCAAGTCGAATTCGCACTCGATTGTACTTCCTTCAATAACATCACAGGAGGAGCCTGCCCT
GCCAACTACCCCACCAATTGGACTACTGACGAACATGAGAACCGTCCATCTTCAACCACGTGCCCCGAGTACTTCCGTTGGATTCACGAGGACCTTAGACCGTGGGCCCG
GACGGGGATATCGAGGGCCGCGGTGGAGGCTGGGCAACGGACGGCGAATTTCCGGCTAGTGATTCTGAATGGGAAGGCTTACGTGGAGACTTATAAGAAGTCGTTTCAAA
CGAGAGATACTTTTACGGTGTGGGGGATTCTACAGCTGTTACGGAGGTACCCCGGAAAAGTGGCTGATTTGGATCTTATGTTTGATTGCGTTGATTGGCCTGTGATTTTG
AGCAGCCATTTTAGTGGGCCTGATGGGCCTACCCCACCTCCTTTGTTTCGTTATTGTGGAGATGATCCCACGTTGGATATTGTTTTTCCTGATTGGTCCTTCTGGGGATG
GCCAGAGATCAATATAAAGCCATGGGAGCCGTTGTTGAAGGATCTAAAAGAAGGGAATAAAAGGATTTTATGGAAGAGTAGAGAGCCTTATGCATACTGGAAAGGAAATC
CGGAGGTCGCCGACACCCGAAAAGATCTACTCAAATGCAATGTCTCTGACCAACAAGACTGGAATGCTCGTGTATTTGCTCAGGATTGGACGAAAGAATCCCAGGAGGGA
TACAAGCAATCAGATCTTTCAAACCAATGCCTTCATAGATATAAAATCTATATAGAAGGATCAGCTTGGTCTGTTAGTGAAAAGTACATTCTTGCTTGTGATTCCGTTAC
CTTAATTGTAAAGCCCCATTACTACGACTTCTTCACGAGAGGTTTGATGCCAGTGCACCACTATTGGCCTGTAAAGGATGACGACAAGTGCAAGTCTATAAAATTTGCAG
TTGATTGGGGCAACAGCCATAAGCAAAAGGCACAAGCCATTGGTAAAGCAGCTAGCAGTTTCATCCAAGAGGAGCTGAAGATGGACTATGTCTATGACTACATGTTCCAT
CTTCTAAGTCAATATTCCAAACTCCTTACTTTCAAGCCAACGGTACCGCCCACTGCAATCGAGCTTTGTTCTGAAGCCATGGCTTGTCCAGCTGAAGGGCTCACCAAGAA
ATTCATGACAGAGTCATTAGTGAAGAGGCCTGCAGAGTCGAACCCATGCACAATGCCTCCTCCATATGATCCCGCATCGCTTCATTTTGTTCTTAGAAGAAAAGAGAATT
CAATCAAACAAGTGGAAAAATGGGAGACAAGTTTCTGGAATACTCAAAGTAAGCAGCCATAG
Protein sequence
MREGSSSSFLNRFSHYASFSDHIFKPFIKSPATFSLCSCSSLSSFSPASSSPRASFIPLVQLTIKGSGKSQYYPNDTSEVPENPNHRRRRQVEFALDCTSFNNITGGACP
ANYPTNWTTDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVIL
SSHFSGPDGPTPPPLFRYCGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVADTRKDLLKCNVSDQQDWNARVFAQDWTKESQEG
YKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFH
LLSQYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPPPYDPASLHFVLRRKENSIKQVEKWETSFWNTQSKQP