; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Csor.00g307660 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene IDCsor.00g307660
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionExostosin family protein
Genome locationCsor_Chr05:1725649..1729128
RNA-Seq ExpressionCsor.00g307660
SyntenyCsor.00g307660
Gene Ontology termsGO:0006486 - protein glycosylation (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0050508 - glucuronosyl-N-acetylglucosaminyl-proteoglycan 4-alpha-N-acetylglucosaminyltransferase activity (molecular function)
InterPro domainsIPR004263 - Exostosin-like
IPR040911 - Exostosin, GT47 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598563.1 putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. sororia]0.0100Show/hide
Query:  MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL
        MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL
Subjt:  MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL

Query:  PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY
        PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY
Subjt:  PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY

Query:  TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT
        TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT
Subjt:  TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT

Query:  SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG
        SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG
Subjt:  SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG

Query:  YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL
        YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL
Subjt:  YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL

Query:  NVRLA
        NVRLA
Subjt:  NVRLA

KAG7029502.1 putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. argyrosperma]7.10e-31592.31Show/hide
Query:  MEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARARA
        MEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARARA
Subjt:  MEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARARA

Query:  EIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWM
        EIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWM
Subjt:  EIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWM

Query:  VKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLL
        VKYLYEYGSFDQTPLRVFV                                   +GNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLL
Subjt:  VKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLL

Query:  SPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEG
        SPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEG
Subjt:  SPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEG

Query:  FSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA
        FSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA
Subjt:  FSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA

XP_022962166.1 probable glycosyltransferase At5g25310 [Cucurbita moschata]0.098.61Show/hide
Query:  MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL
        MPYDPESTQVVPTGFILTSTSLLRYLP PKKKKTETTME+FHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL
Subjt:  MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL

Query:  PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY
        PEISEE LAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY
Subjt:  PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY

Query:  TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT
        TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT
Subjt:  TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT

Query:  SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG
        SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHL FFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG
Subjt:  SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG

Query:  YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL
        YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSI VGVSEIPRLKEILMGVSEAEY+RLKEGLRIVR HFVLNRPAKRFDAFHMILHSIWLRRL
Subjt:  YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL

Query:  NVRLA
        NVRLA
Subjt:  NVRLA

XP_022996611.1 probable glycosyltransferase At5g25310 [Cucurbita maxima]0.096.04Show/hide
Query:  MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL
        MPYDPESTQVVPTGFI TSTSLLR+LP P+KKKTETTME+FHLPTT PLLTSIAAASILLFLL+SDNYSDRFATKSPPPLK THLHQFPPISDRF+AVHL
Subjt:  MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL

Query:  PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY
        PEISEELLAHRLNLRKA +TRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLP+THDGPCKNIY
Subjt:  PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY

Query:  TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT
        TVEGRFIHEMEHGANGFRTADPS AHVFFMPFSVAWMVKYLYE GSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT
Subjt:  TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT

Query:  SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG
        SIRV CNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQR HLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPK LDYYELMLKSRFCLCPSG
Subjt:  SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG

Query:  YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL
        YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSI VGVSEIPRLKEILMGVSEAEY RLKEGLRIVRKHFVLNRPAKR DAFHMILHSIWLRRL
Subjt:  YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL

Query:  NVRLA
        NVRLA
Subjt:  NVRLA

XP_023546550.1 probable glycosyltransferase At5g25310 [Cucurbita pepo subsp. pepo]0.096.58Show/hide
Query:  MEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARARA
        ME+FHLPT  PLLTSIAAASILLFLL+S NYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEE LAHRLNLRKA +TRLTRDEKLELGLARARA
Subjt:  MEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARARA

Query:  EIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWM
        EIRRAAKV+NLSTTVDYVPSFAVYHNPRAFFQSYVEMERR KVYVY EGDLP+THDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWM
Subjt:  EIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWM

Query:  VKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLL
        VKYLYE GSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLL
Subjt:  VKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLL

Query:  SPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEG
        SPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDS+IRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISE YVLPFSDVLRWEG
Subjt:  SPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEG

Query:  FSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA
        FSI VGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA
Subjt:  FSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA

TrEMBL top hitse value%identityAlignment
A0A0A0LM16 Exostosin domain-containing protein5.64e-26476.53Show/hide
Query:  MEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLH-QFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARAR
        ME FH P    + +SI+   +L+FLLLS NY+ +F T    PL STHLH +FPPISD+FRA+H P+ +      R+ LRK  +TRL+R+EKLELGLA+AR
Subjt:  MEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLH-QFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARAR

Query:  AEIRRAAKVSNLSTT-VDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVA
        A IR+AA  SNLST+ +DY+PS +VYHNPRAF+QSYVEME+RFKVYVYPEG+LP+TH GPCKNIYT+EGRFIHEME G NGFRT DPS AHV FMPFSVA
Subjt:  AEIRRAAKVSNLSTT-VDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVA

Query:  WMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPK
        WMVKYLY+ GS+DQTPLR+FVSDYV VVS+KYPFWNKT GADHFI++CHDWGPIATEGN FLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDG+ISPK
Subjt:  WMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPK

Query:  LLSPPDTQ--RPHLAFFAGGNHGPIRPIILKHWKDRD-SDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDV
        LLS  ++   RPHLAFFAGG HGPIRPI+L HWK+R  ++I VYEYLPK LDYY+ ML+SRFCLCPSGYEVASPRIVEAIYAECVPVIISE+YVLPFSDV
Subjt:  LLSPPDTQ--RPHLAFFAGGNHGPIRPIILKHWKDRD-SDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDV

Query:  LRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA
        LRWEGFSI V VSEIPRL+EILMGVSE  YE+L +GLR VRKHFVLNRPAKRFDAFHMILHS+WLRRLNV+LA
Subjt:  LRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA

A0A1S3BC66 probable glycosyltransferase At5g253101.21e-26077Show/hide
Query:  MEIFHLPTTPPLLTSIAAASILL-FLLLSDNYSDRFATKSPPPLKSTHLH-QFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARA
        ME FHLP T  + +SI+   +LL FLLLS NY+ +F T    PL STHLH QFPPISD+FRA+H P+ +      R+ LRK  +T L+R+EKLELGLA+A
Subjt:  MEIFHLPTTPPLLTSIAAASILL-FLLLSDNYSDRFATKSPPPLKSTHLH-QFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARA

Query:  RAEIRRAAKVSNLSTT-VDYVPSFA-VYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFS
        RA IR+AA  SNLS++ VDYVPS + VYHNPRAF+QSYVEME+RFKVYVYPEG+LP+THDGPCKNIYT+EGRFIHEME G NGFRT DP  AHV FMPFS
Subjt:  RAEIRRAAKVSNLSTT-VDYVPSFA-VYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFS

Query:  VAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDIS
        VAWMVKYLY+ GS+DQTPLR+FVSDYV VVS+KYPFWNKT GADHFI++CHDWGPIATEGN FLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDIS
Subjt:  VAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDIS

Query:  PKLLSPPDTQ-RPHLAFFAGGNHGPIRPIILKHWKDRD-SDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSD
        PKLLS      RPHLAFFAGG HGPIRPI+L HWK+R  ++I VYEYLP  LDYY+ ML+SRFCLCPSGYEVASPRIVEAIYAECVPVIISE+YVLPFSD
Subjt:  PKLLSPPDTQ-RPHLAFFAGGNHGPIRPIILKHWKDRD-SDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSD

Query:  VLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA
        VLRW+GFSI V  SEIPRLKEILMGVS+ +YE+LK+GLR VRKHFVLNRPAKRFDAFHMILHS+WLRRLNV+LA
Subjt:  VLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA

A0A5N6R880 Exostosin domain-containing protein3.22e-22066.74Show/hide
Query:  LLTSIAAASILLFLLLSDNYSDRFATKSPPP----LKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRR-AA
        +L+S+  ASIL+  +LS  +     + S P     +    L +     D+ RAV      +      +N +KA   +L+R+++LE GLA ARA IRR AA
Subjt:  LLTSIAAASILLFLLLSDNYSDRFATKSPPP----LKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRR-AA

Query:  KVSNLSTTV---DYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKY
           NLS TV   DYVPS  VY N R F+QSY+EMERRFKVYVYPEGDLP+THDGPCK+IY++EGRFIHEMEHG   FRT DP  AHV+FMPFSV WMVKY
Subjt:  KVSNLSTTV---DYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKY

Query:  LYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPP
        LY+  S + +PL  FVSDYVRVVS +YPFWN+T GADHF+++CHDWGPIA+ GNPFLYNTSIRVLCNANSSEGFNPQKD+SLPEIHLY G +SPK+ +PP
Subjt:  LYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPP

Query:  --DTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGF
          +  RPHLAFFAGG HGPIRPI+L+HWK+RD+D+RVYEYLPK LDYY LML+S+FCLCPSG+EVASPRIVEAIYAECVPVI+S+ YVLPFSDVLRWE F
Subjt:  --DTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGF

Query:  SINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL
        SI V VSEIPRLKE+L  V EA Y RLK+G+R+VR+HFVLN+P+KRFD FHMILHSIWLRRLN++L
Subjt:  SINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL

A0A6J1HCC2 probable glycosyltransferase At5g253100.098.61Show/hide
Query:  MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL
        MPYDPESTQVVPTGFILTSTSLLRYLP PKKKKTETTME+FHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL
Subjt:  MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL

Query:  PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY
        PEISEE LAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY
Subjt:  PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY

Query:  TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT
        TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT
Subjt:  TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT

Query:  SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG
        SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHL FFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG
Subjt:  SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG

Query:  YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL
        YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSI VGVSEIPRLKEILMGVSEAEY+RLKEGLRIVR HFVLNRPAKRFDAFHMILHSIWLRRL
Subjt:  YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL

Query:  NVRLA
        NVRLA
Subjt:  NVRLA

A0A6J1K598 probable glycosyltransferase At5g253100.096.04Show/hide
Query:  MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL
        MPYDPESTQVVPTGFI TSTSLLR+LP P+KKKTETTME+FHLPTT PLLTSIAAASILLFLL+SDNYSDRFATKSPPPLK THLHQFPPISDRF+AVHL
Subjt:  MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHL

Query:  PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY
        PEISEELLAHRLNLRKA +TRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLP+THDGPCKNIY
Subjt:  PEISEELLAHRLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIY

Query:  TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT
        TVEGRFIHEMEHGANGFRTADPS AHVFFMPFSVAWMVKYLYE GSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT
Subjt:  TVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNT

Query:  SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG
        SIRV CNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQR HLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPK LDYYELMLKSRFCLCPSG
Subjt:  SIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSG

Query:  YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL
        YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSI VGVSEIPRLKEILMGVSEAEY RLKEGLRIVRKHFVLNRPAKR DAFHMILHSIWLRRL
Subjt:  YEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRL

Query:  NVRLA
        NVRLA
Subjt:  NVRLA

SwissProt top hitse value%identityAlignment
Q3E7Q9 Probable glycosyltransferase At5g253104.3e-14555.81Show/hide
Query:  SIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEE---------LLAHRLNLRKAMETRLTRDEKL------ELGLARAR
        SI   SI L LL+S + S  F   S    K      FP  ++  R V+     EE         +    L +R    T  ++ EKL      E GLA+AR
Subjt:  SIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEE---------LLAHRLNLRKAMETRLTRDEKL------ELGLARAR

Query:  AEIRRAAKVSNLSTTV--DYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSV
        A I  A+  SN++TT+    +P+  +Y NP A ++SY+EME+RFKVYVY EG+ PL HDGPCK++Y VEGRFI EME     FRT DP+ A+V+F+PFSV
Subjt:  AEIRRAAKVSNLSTTV--DYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSV

Query:  AWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISP
         W+V+YLYE G+ D  PL+ FVSDY+R+VS  +PFWN+T GADHF+++CHDWGP+ ++ N  L+NTSIRV+CNANSSEGFNP KDV+LPEI LY G++  
Subjt:  AWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISP

Query:  KL---LSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSD
        KL    +   + RP+L FFAGG HGP+RPI+LKHWK RD D+ VYEYLPK L+YY+ M  S+FC CPSGYEVASPR++EAIY+EC+PVI+S  +VLPF+D
Subjt:  KL---LSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSD

Query:  VLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL
        VLRWE FS+ V VSEIPRLKEILM +S  +YE LK  LR VR+HF LN P +RFDAFH+ LHSIWLRRLN++L
Subjt:  VLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL

Q3E9A4 Probable glycosyltransferase At5g202604.4e-11347.69Show/hide
Query:  SILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNL---RKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVD
        ++LL L+L   Y       S P L S  L  F   +        P +S E      NL       E +  +   +E GLA++R+ IR A ++    +  +
Subjt:  SILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNL---RKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVD

Query:  --YVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYE-YGSFDQT
          +VP  AVY N  AF QS++EME++FKV+VY EG+ PL H GP  NIY++EG+F+ E+E G + F   +P  AH F +P SVA +V YLY    ++ + 
Subjt:  --YVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYE-YGSFDQT

Query:  PLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLS-PPDTQRPHLAF
         L     DYV VV+ KYP+WN++ GADHF VSCHDW P  +  NP L    IRVLCNAN+SEGF PQ+DVS+PEI++  G + P  LS      RP LAF
Subjt:  PLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLS-PPDTQRPHLAF

Query:  FAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPR
        FAGG+HG IR I+L+HWKD+D +++V+EYL K  DY++LM  +RFCLCPSGYEVASPR+V AI   CVPVIIS+ Y LPFSDVL W  F+I+V   +IP 
Subjt:  FAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPR

Query:  LKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL
        +K IL  +S   Y  L+  +  V++HFV+NRP++ FD   M+LHS+WLRRLN+RL
Subjt:  LKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL

Q9FFN2 Probable glycosyltransferase At5g037951.8e-12747.73Show/hide
Query:  VVPTGFILT------STSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTS--IAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLP
        VV +GF+        STSLL          T  +    HLP  PP L++    A S LL  +L    +   +TK    ++S          D  R + L 
Subjt:  VVPTGFILT------STSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTS--IAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLP

Query:  EISEELLAHRLNLRKAMETR----LTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCK
         I+    ++ ++   ++E +    L+  EK+E  L +ARA I +AA + +     DYVP   +Y N + F +SY+EME++FK+YVY EG+ PL HDGPCK
Subjt:  EISEELLAHRLNLRKAMETR----LTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCK

Query:  NIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFL
        +IY++EG FI+E+E     FRT +P  AHVF++PFSV  MV+Y+YE  S D +P+R  V DY+ +V +KYP+WN++ GADHFI+SCHDWGP A+  +P L
Subjt:  NIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFL

Query:  YNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLC
         + SIR LCNAN+SE F P+KDVS+PEI+L  G ++  +  P  + RP LAFFAGG HGP+RP++L+HW+++D+DIRV++YLP+G  Y ++M  S+FC+C
Subjt:  YNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLC

Query:  PSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWL
        PSGYEVASPRIVEA+Y+ CVPV+I+  YV PFSDVL W  FS+ V V +IP LK IL  +S  +Y R+   +  VR+HF +N PAKRFD FHMILHSIW+
Subjt:  PSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWL

Query:  RRLNVRL
        RRLNV++
Subjt:  RRLNVRL

Q9LFP3 Probable glycosyltransferase At5g111302.6e-11349.48Show/hide
Query:  EKLELGLARARAEIRRAAKVSNL--------STTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGF
        E++E GLA ARA IR+A +  NL        ++ V  V + +VY N   F QS+ EME+RFK++ Y EG+ PL H GP  NIY +EG+F+ E+E+G + F
Subjt:  EKLELGLARARAEIRRAAKVSNL--------STTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGF

Query:  RTADPSGAHVFFMPFSVAWMVKYLYE-YGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNP
        + A P  A VF++P  +  +++++Y  Y S+ +  L+  V DY+ ++S +YP+WN++ GADHF +SCHDW P  +  +P LY   IR LCNANSSEGF P
Subjt:  RTADPSGAHVFFMPFSVAWMVKYLYE-YGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNP

Query:  QKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAEC
         +DVSLPEI++    +       P   R  LAFFAGG+HG +R I+ +HWK++D D+ VYE LPK ++Y ++M K++FCLCPSG+EVASPRIVE++Y+ C
Subjt:  QKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAEC

Query:  VPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL
        VPVII++ YVLPFSDVL W+ FS+++ +S++P +K+IL  ++E EY  ++  +  VRKHFV+NRP+K +D  HMI+HSIWLRRLNVR+
Subjt:  VPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL

Q9SSE8 Probable glycosyltransferase At3g076201.3e-12055.04Show/hide
Query:  RDEKLELGLARARAEIRRAAKVSNLSTT------VDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGF
        RD K+E  LA AR  IR  A+++  STT       DYVP   +Y NP AF +SY+ ME+ FK+YVY EGD P+ H G CK+IY++EG F++ ME+    +
Subjt:  RDEKLELGLARARAEIRRAAKVSNLSTT------VDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGF

Query:  RTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQ
        RT DP  AHV+F+PFSV  ++ +L++    D+  L   ++DYV+++S+KYP+WN + G DHF++SCHDWG  AT     L+  SIRVLCNAN SE FNP+
Subjt:  RTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQ

Query:  KDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECV
        KD   PEI+L  GDI+          R  LAFFAG +HG IRP++L HWK++D DI VYE LP GLDY E+M KSRFC+CPSG+EVASPR+ EAIY+ CV
Subjt:  KDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECV

Query:  PVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL
        PV+ISE YVLPFSDVL WE FS++V V EIP LK ILM + E  Y RL EG++ V++H ++N P KR+D F+MI+HSIWLRRLNV+L
Subjt:  PVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL

Arabidopsis top hitse value%identityAlignment
AT3G07620.1 Exostosin family protein9.0e-12255.04Show/hide
Query:  RDEKLELGLARARAEIRRAAKVSNLSTT------VDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGF
        RD K+E  LA AR  IR  A+++  STT       DYVP   +Y NP AF +SY+ ME+ FK+YVY EGD P+ H G CK+IY++EG F++ ME+    +
Subjt:  RDEKLELGLARARAEIRRAAKVSNLSTT------VDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGF

Query:  RTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQ
        RT DP  AHV+F+PFSV  ++ +L++    D+  L   ++DYV+++S+KYP+WN + G DHF++SCHDWG  AT     L+  SIRVLCNAN SE FNP+
Subjt:  RTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQ

Query:  KDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECV
        KD   PEI+L  GDI+          R  LAFFAG +HG IRP++L HWK++D DI VYE LP GLDY E+M KSRFC+CPSG+EVASPR+ EAIY+ CV
Subjt:  KDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECV

Query:  PVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL
        PV+ISE YVLPFSDVL WE FS++V V EIP LK ILM + E  Y RL EG++ V++H ++N P KR+D F+MI+HSIWLRRLNV+L
Subjt:  PVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL

AT5G03795.1 Exostosin family protein1.3e-12847.73Show/hide
Query:  VVPTGFILT------STSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTS--IAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLP
        VV +GF+        STSLL          T  +    HLP  PP L++    A S LL  +L    +   +TK    ++S          D  R + L 
Subjt:  VVPTGFILT------STSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTS--IAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLP

Query:  EISEELLAHRLNLRKAMETR----LTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCK
         I+    ++ ++   ++E +    L+  EK+E  L +ARA I +AA + +     DYVP   +Y N + F +SY+EME++FK+YVY EG+ PL HDGPCK
Subjt:  EISEELLAHRLNLRKAMETR----LTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCK

Query:  NIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFL
        +IY++EG FI+E+E     FRT +P  AHVF++PFSV  MV+Y+YE  S D +P+R  V DY+ +V +KYP+WN++ GADHFI+SCHDWGP A+  +P L
Subjt:  NIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFL

Query:  YNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLC
         + SIR LCNAN+SE F P+KDVS+PEI+L  G ++  +  P  + RP LAFFAGG HGP+RP++L+HW+++D+DIRV++YLP+G  Y ++M  S+FC+C
Subjt:  YNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLC

Query:  PSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWL
        PSGYEVASPRIVEA+Y+ CVPV+I+  YV PFSDVL W  FS+ V V +IP LK IL  +S  +Y R+   +  VR+HF +N PAKRFD FHMILHSIW+
Subjt:  PSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWL

Query:  RRLNVRL
        RRLNV++
Subjt:  RRLNVRL

AT5G11130.1 Exostosin family protein1.8e-11449.48Show/hide
Query:  EKLELGLARARAEIRRAAKVSNL--------STTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGF
        E++E GLA ARA IR+A +  NL        ++ V  V + +VY N   F QS+ EME+RFK++ Y EG+ PL H GP  NIY +EG+F+ E+E+G + F
Subjt:  EKLELGLARARAEIRRAAKVSNL--------STTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGF

Query:  RTADPSGAHVFFMPFSVAWMVKYLYE-YGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNP
        + A P  A VF++P  +  +++++Y  Y S+ +  L+  V DY+ ++S +YP+WN++ GADHF +SCHDW P  +  +P LY   IR LCNANSSEGF P
Subjt:  RTADPSGAHVFFMPFSVAWMVKYLYE-YGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNP

Query:  QKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAEC
         +DVSLPEI++    +       P   R  LAFFAGG+HG +R I+ +HWK++D D+ VYE LPK ++Y ++M K++FCLCPSG+EVASPRIVE++Y+ C
Subjt:  QKDVSLPEIHLYDGDISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAEC

Query:  VPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL
        VPVII++ YVLPFSDVL W+ FS+++ +S++P +K+IL  ++E EY  ++  +  VRKHFV+NRP+K +D  HMI+HSIWLRRLNVR+
Subjt:  VPVIISEKYVLPFSDVLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL

AT5G20260.1 Exostosin family protein3.1e-11447.69Show/hide
Query:  SILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNL---RKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVD
        ++LL L+L   Y       S P L S  L  F   +        P +S E      NL       E +  +   +E GLA++R+ IR A ++    +  +
Subjt:  SILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEELLAHRLNL---RKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVD

Query:  --YVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYE-YGSFDQT
          +VP  AVY N  AF QS++EME++FKV+VY EG+ PL H GP  NIY++EG+F+ E+E G + F   +P  AH F +P SVA +V YLY    ++ + 
Subjt:  --YVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSVAWMVKYLYE-YGSFDQT

Query:  PLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLS-PPDTQRPHLAF
         L     DYV VV+ KYP+WN++ GADHF VSCHDW P  +  NP L    IRVLCNAN+SEGF PQ+DVS+PEI++  G + P  LS      RP LAF
Subjt:  PLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISPKLLS-PPDTQRPHLAF

Query:  FAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPR
        FAGG+HG IR I+L+HWKD+D +++V+EYL K  DY++LM  +RFCLCPSGYEVASPR+V AI   CVPVIIS+ Y LPFSDVL W  F+I+V   +IP 
Subjt:  FAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSINVGVSEIPR

Query:  LKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL
        +K IL  +S   Y  L+  +  V++HFV+NRP++ FD   M+LHS+WLRRLN+RL
Subjt:  LKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL

AT5G25310.1 Exostosin family protein3.1e-14655.81Show/hide
Query:  SIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEE---------LLAHRLNLRKAMETRLTRDEKL------ELGLARAR
        SI   SI L LL+S + S  F   S    K      FP  ++  R V+     EE         +    L +R    T  ++ EKL      E GLA+AR
Subjt:  SIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEE---------LLAHRLNLRKAMETRLTRDEKL------ELGLARAR

Query:  AEIRRAAKVSNLSTTV--DYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSV
        A I  A+  SN++TT+    +P+  +Y NP A ++SY+EME+RFKVYVY EG+ PL HDGPCK++Y VEGRFI EME     FRT DP+ A+V+F+PFSV
Subjt:  AEIRRAAKVSNLSTTV--DYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTADPSGAHVFFMPFSV

Query:  AWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISP
         W+V+YLYE G+ D  PL+ FVSDY+R+VS  +PFWN+T GADHF+++CHDWGP+ ++ N  L+NTSIRV+CNANSSEGFNP KDV+LPEI LY G++  
Subjt:  AWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGDISP

Query:  KL---LSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSD
        KL    +   + RP+L FFAGG HGP+RPI+LKHWK RD D+ VYEYLPK L+YY+ M  S+FC CPSGYEVASPR++EAIY+EC+PVI+S  +VLPF+D
Subjt:  KL---LSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSD

Query:  VLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL
        VLRWE FS+ V VSEIPRLKEILM +S  +YE LK  LR VR+HF LN P +RFDAFH+ LHSIWLRRLN++L
Subjt:  VLRWEGFSINVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATACGATCCGGAGTCGACTCAAGTCGTCCCGACTGGCTTCATATTAACTTCCACAAGCCTACTGCGATATCTTCCTCTGCCAAAGAAGAAAAAAACAGAGACAAC
AATGGAGATTTTTCACCTGCCGACTACACCACCACTCCTCACTTCGATCGCCGCCGCCTCGATTCTCCTTTTTCTTCTCCTCTCCGACAATTACTCCGACCGATTCGCAA
CCAAATCTCCGCCTCCACTAAAATCCACGCACCTCCACCAATTTCCTCCGATTTCCGATCGATTCCGAGCCGTTCACTTGCCGGAAATCTCCGAGGAGTTGCTCGCTCAT
CGCCTTAATCTAAGAAAAGCGATGGAAACGAGATTGACCAGAGACGAAAAGCTCGAACTAGGGCTCGCTCGGGCTAGGGCGGAAATTCGCCGAGCGGCGAAGGTCTCAAA
CTTATCGACCACCGTCGATTACGTTCCTTCTTTCGCAGTTTATCACAATCCCCGCGCTTTTTTTCAGAGCTACGTGGAGATGGAGAGAAGATTCAAAGTGTACGTGTACC
CGGAGGGGGATTTGCCTTTAACCCACGACGGGCCGTGTAAGAACATATACACAGTGGAAGGGAGGTTCATACATGAGATGGAGCATGGGGCGAATGGGTTCAGGACGGCG
GATCCGAGTGGGGCTCATGTTTTTTTTATGCCGTTCAGCGTGGCTTGGATGGTTAAGTACTTGTATGAATATGGAAGCTTTGATCAAACGCCGCTGCGGGTGTTTGTGAG
TGATTACGTGCGGGTGGTGTCTGAAAAGTACCCATTTTGGAATAAAACAACTGGGGCTGACCACTTTATTGTTTCATGCCATGATTGGGGTCCAATTGCAACAGAAGGAA
ACCCCTTCCTCTACAACACATCCATCCGCGTCCTCTGTAATGCCAACTCCTCCGAAGGATTCAACCCACAAAAAGACGTCAGCTTACCGGAGATCCACCTCTACGACGGC
GACATATCACCGAAGCTACTGTCGCCGCCAGACACCCAACGTCCCCACCTAGCATTCTTCGCAGGCGGTAACCACGGTCCAATAAGACCTATAATCCTAAAGCACTGGAA
AGACCGCGACAGCGACATCCGCGTGTACGAGTATCTCCCAAAGGGGCTGGACTATTACGAGCTAATGCTGAAGTCAAGATTTTGCCTGTGCCCAAGTGGGTATGAAGTGG
CAAGTCCCAGAATCGTGGAGGCAATTTATGCTGAATGTGTGCCGGTGATTATTTCGGAGAAGTACGTTTTGCCGTTCAGCGACGTGTTGAGATGGGAAGGGTTTTCGATT
AATGTGGGCGTGTCGGAGATTCCGAGGTTGAAGGAGATTTTGATGGGGGTGTCGGAGGCAGAGTACGAGAGGCTTAAGGAGGGCTTGAGGATTGTTCGAAAGCACTTTGT
GTTGAACCGTCCGGCTAAGAGGTTCGATGCTTTTCATATGATTTTGCATTCGATTTGGCTTAGGAGATTGAATGTAAGACTTGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCATACGATCCGGAGTCGACTCAAGTCGTCCCGACTGGCTTCATATTAACTTCCACAAGCCTACTGCGATATCTTCCTCTGCCAAAGAAGAAAAAAACAGAGACAAC
AATGGAGATTTTTCACCTGCCGACTACACCACCACTCCTCACTTCGATCGCCGCCGCCTCGATTCTCCTTTTTCTTCTCCTCTCCGACAATTACTCCGACCGATTCGCAA
CCAAATCTCCGCCTCCACTAAAATCCACGCACCTCCACCAATTTCCTCCGATTTCCGATCGATTCCGAGCCGTTCACTTGCCGGAAATCTCCGAGGAGTTGCTCGCTCAT
CGCCTTAATCTAAGAAAAGCGATGGAAACGAGATTGACCAGAGACGAAAAGCTCGAACTAGGGCTCGCTCGGGCTAGGGCGGAAATTCGCCGAGCGGCGAAGGTCTCAAA
CTTATCGACCACCGTCGATTACGTTCCTTCTTTCGCAGTTTATCACAATCCCCGCGCTTTTTTTCAGAGCTACGTGGAGATGGAGAGAAGATTCAAAGTGTACGTGTACC
CGGAGGGGGATTTGCCTTTAACCCACGACGGGCCGTGTAAGAACATATACACAGTGGAAGGGAGGTTCATACATGAGATGGAGCATGGGGCGAATGGGTTCAGGACGGCG
GATCCGAGTGGGGCTCATGTTTTTTTTATGCCGTTCAGCGTGGCTTGGATGGTTAAGTACTTGTATGAATATGGAAGCTTTGATCAAACGCCGCTGCGGGTGTTTGTGAG
TGATTACGTGCGGGTGGTGTCTGAAAAGTACCCATTTTGGAATAAAACAACTGGGGCTGACCACTTTATTGTTTCATGCCATGATTGGGGTCCAATTGCAACAGAAGGAA
ACCCCTTCCTCTACAACACATCCATCCGCGTCCTCTGTAATGCCAACTCCTCCGAAGGATTCAACCCACAAAAAGACGTCAGCTTACCGGAGATCCACCTCTACGACGGC
GACATATCACCGAAGCTACTGTCGCCGCCAGACACCCAACGTCCCCACCTAGCATTCTTCGCAGGCGGTAACCACGGTCCAATAAGACCTATAATCCTAAAGCACTGGAA
AGACCGCGACAGCGACATCCGCGTGTACGAGTATCTCCCAAAGGGGCTGGACTATTACGAGCTAATGCTGAAGTCAAGATTTTGCCTGTGCCCAAGTGGGTATGAAGTGG
CAAGTCCCAGAATCGTGGAGGCAATTTATGCTGAATGTGTGCCGGTGATTATTTCGGAGAAGTACGTTTTGCCGTTCAGCGACGTGTTGAGATGGGAAGGGTTTTCGATT
AATGTGGGCGTGTCGGAGATTCCGAGGTTGAAGGAGATTTTGATGGGGGTGTCGGAGGCAGAGTACGAGAGGCTTAAGGAGGGCTTGAGGATTGTTCGAAAGCACTTTGT
GTTGAACCGTCCGGCTAAGAGGTTCGATGCTTTTCATATGATTTTGCATTCGATTTGGCTTAGGAGATTGAATGTAAGACTTGCGTAG
Protein sequenceShow/hide protein sequence
MPYDPESTQVVPTGFILTSTSLLRYLPLPKKKKTETTMEIFHLPTTPPLLTSIAAASILLFLLLSDNYSDRFATKSPPPLKSTHLHQFPPISDRFRAVHLPEISEELLAH
RLNLRKAMETRLTRDEKLELGLARARAEIRRAAKVSNLSTTVDYVPSFAVYHNPRAFFQSYVEMERRFKVYVYPEGDLPLTHDGPCKNIYTVEGRFIHEMEHGANGFRTA
DPSGAHVFFMPFSVAWMVKYLYEYGSFDQTPLRVFVSDYVRVVSEKYPFWNKTTGADHFIVSCHDWGPIATEGNPFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDG
DISPKLLSPPDTQRPHLAFFAGGNHGPIRPIILKHWKDRDSDIRVYEYLPKGLDYYELMLKSRFCLCPSGYEVASPRIVEAIYAECVPVIISEKYVLPFSDVLRWEGFSI
NVGVSEIPRLKEILMGVSEAEYERLKEGLRIVRKHFVLNRPAKRFDAFHMILHSIWLRRLNVRLA