CuGenDBv2

CaUC01G008040 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC01G008040
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Nucleotide-diphospho-sugar transferase family protein
Genome location: Ciama_Chr01:9066876..9071239
RNA-Seq Expression: CaUC01G008040
Synteny: CaUC01G008040
Gene Ontology terms:
GO:0071555 - cell wall organization (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains:
IPR005069 - Nucleotide-diphospho-sugar transferase
IPR029044 - Nucleotide-diphospho-sugar transferases
IPR044821 - Putative nucleotide-diphospho-sugar transferase At1g28695/At4g15970-like


Homology
GenBank top hits | e value | % identity | Alignment
KAA0067582.1 nucleotide-diphospho-sugar transferase family protein [Cucumis melo var. makuwa] | 2.5e-182 | 85.41
Query:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD
        MF Y S   +LHILLLFTAISLSCLVILRELNSLRYFPLFSFST SG PPV PFF SL  DD+       ++LA               DEYGLDKVL D
Subjt:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD

Query:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL
        AAT+DKTVILTTLNEAWA+PN+VIDLFLQSFRIGN+THQLLDHLVIIALDKKAF+RCLDIHVHC AL TEGVDF SEA+FMSPDYLKMMWRRIDFLR VL
Subjt:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL

Query:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR
        EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI+EIGLKIR
Subjt:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR

Query:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNCRYGTISMS
        FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRI+LEDWKHYMSMPPY KTSS  SWRVPQNCRY TIS+S
Subjt:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNCRYGTISMS

KAE8648292.1 hypothetical protein Csa_017822 [Cucumis sativus] | 8.1e-181 | 84.08
Query:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD
        MF YSS   + HILLLFTAISLSCLVILRELNSLRYFPLFSFST SG PP+ PF  SL   D   P                     + DEYGLDKVL D
Subjt:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD

Query:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL
        AAT+DKTVILTTLNEAWASPN+VIDLFLQSFRIGNRTHQLLDHLVIIALDKKAF+RCLDIH+HC +L TEGVDF SEA+FMSPDYLKMMWRRIDFLR VL
Subjt:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL

Query:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR
        EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI+EIGLKIR
Subjt:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR

Query:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNCRYGTISMS
        FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRI+LEDWKHYMSMPPY KTSS  SWRVPQNCRYGTI ++
Subjt:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNCRYGTISMS

XP_004149471.1 uncharacterized protein At4g15970 isoform X1 [Cucumis sativus] | 1.4e-177 | 84.55
Query:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD
        MF YSS   + HILLLFTAISLSCLVILRELNSLRYFPLFSFST SG PP+ PF  SL   D   P                     + DEYGLDKVL D
Subjt:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD

Query:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL
        AAT+DKTVILTTLNEAWASPN+VIDLFLQSFRIGNRTHQLLDHLVIIALDKKAF+RCLDIH+HC +L TEGVDF SEA+FMSPDYLKMMWRRIDFLR VL
Subjt:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL

Query:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR
        EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI+EIGLKIR
Subjt:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR

Query:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC
        FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRI+LEDWKHYMSMPPY KTSS  SWRVPQNC
Subjt:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC

XP_008466761.1 PREDICTED: uncharacterized protein At4g15970 [Cucumis melo] | 7.6e-179 | 85.64
Query:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD
        MF Y S   +LHILLLFTAISLSCLVILRELNSLRYFPLFSFST SG PPV PFF SL  DD+       ++LA               DEYGLDKVL D
Subjt:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD

Query:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL
        AAT+DKTVILTTLNEAWA+PN+VIDLFLQSFRIGN+THQLLDHLVIIALDKKAF+RCLDIHVHC AL TEGVDF SEA+FMSPDYLKMMWRRIDFLR VL
Subjt:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL

Query:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR
        EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI+EIGLKIR
Subjt:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR

Query:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC
        FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRI+LEDWKHYMSMPPY KTSS  SWRVPQNC
Subjt:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC

XP_038906612.1 uncharacterized protein At4g15970-like [Benincasa hispida] | 1.2e-181 | 84.82
Query:  MFMISASNSYLSAMFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPI
        M MISASNSY+SAMF+YS    TLHILLLFTAISL CLVILRE NSLRYF LFS    SG PPVSPF PSL D+D+  P                     
Subjt:  MFMISASNSYLSAMFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPI

Query:  DVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLK
          DEYGLDKVL DAAT+DKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLL+HLVIIALDKKAF RCLDIHVHCFAL TEGVDFHSEAHFMSPDYLK
Subjt:  DVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLK

Query:  MMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIK
        MMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGF YVKSNNRSIEFYKYWYSARE+YPGYHDQDVLNRIK
Subjt:  MMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIK

Query:  YDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC
        YDFFIDEIGLKIRFLDTAYFGGFCEPSKD+NRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMS PPY K  S  SWRVPQNC
Subjt:  YDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KPV2 Glycosyltransferase | 6.9e-178 | 84.55
Query:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD
        MF YSS   + HILLLFTAISLSCLVILRELNSLRYFPLFSFST SG PP+ PF  SL   D   P                     + DEYGLDKVL D
Subjt:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD

Query:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL
        AAT+DKTVILTTLNEAWASPN+VIDLFLQSFRIGNRTHQLLDHLVIIALDKKAF+RCLDIH+HC +L TEGVDF SEA+FMSPDYLKMMWRRIDFLR VL
Subjt:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL

Query:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR
        EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI+EIGLKIR
Subjt:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR

Query:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC
        FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRI+LEDWKHYMSMPPY KTSS  SWRVPQNC
Subjt:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC

A0A1S3CS74 Glycosyltransferase | 3.7e-179 | 85.64
Query:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD
        MF Y S   +LHILLLFTAISLSCLVILRELNSLRYFPLFSFST SG PPV PFF SL  DD+       ++LA               DEYGLDKVL D
Subjt:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD

Query:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL
        AAT+DKTVILTTLNEAWA+PN+VIDLFLQSFRIGN+THQLLDHLVIIALDKKAF+RCLDIHVHC AL TEGVDF SEA+FMSPDYLKMMWRRIDFLR VL
Subjt:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL

Query:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR
        EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI+EIGLKIR
Subjt:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR

Query:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC
        FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRI+LEDWKHYMSMPPY KTSS  SWRVPQNC
Subjt:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC

A0A5A7VGT6 Glycosyltransferase | 1.2e-182 | 85.41
Query:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD
        MF Y S   +LHILLLFTAISLSCLVILRELNSLRYFPLFSFST SG PPV PFF SL  DD+       ++LA               DEYGLDKVL D
Subjt:  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMD

Query:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL
        AAT+DKTVILTTLNEAWA+PN+VIDLFLQSFRIGN+THQLLDHLVIIALDKKAF+RCLDIHVHC AL TEGVDF SEA+FMSPDYLKMMWRRIDFLR VL
Subjt:  AATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVL

Query:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR
        EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI+EIGLKIR
Subjt:  EMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR

Query:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNCRYGTISMS
        FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRI+LEDWKHYMSMPPY KTSS  SWRVPQNCRY TIS+S
Subjt:  FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNCRYGTISMS

A0A6J1FH13 Glycosyltransferase | 7.9e-174 | 80.31
Query:  MFMISASNSYLSAMFSYSSS---SRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVS-PFFPSLDDDDESFPFGLSINLASLIVFLASR
        MF+I+AS+SY+SAMFSY SS    RTL ILLLFTAISL+CLVI REL+S RYFPLFSFSTFS SPP + PFFPSL DDDE                    
Subjt:  MFMISASNSYLSAMFSYSSS---SRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVS-PFFPSLDDDDESFPFGLSINLASLIVFLASR

Query:  SFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSP
            D DEY L KVL DAAT+++TVILTTLNEAWA+PNSVIDLFL+SFRIGN+T QLL+HLVIIA DKKAF+RCL IHVHCF+L TEGVDFHSEA+FMSP
Subjt:  SFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSP

Query:  DYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVL
        DYLKMMWRRIDFLR VLEMGYNFVFTDADVMWFRDPFPFFD++ADFQIACD YLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYS+RETY GYHDQDVL
Subjt:  DYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVL

Query:  NRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC
        N+IKYDFFI EIGLKI FLDTAYFGGFCEPSKDLNRVLTMHANCCIGM++KLHDLRIMLEDWKHYMSMPPY K SS  SWRVPQNC
Subjt:  NRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC

A0A6J1JWH4 Glycosyltransferase | 2.5e-172 | 80.21
Query:  MISASNSYLSAMFSYSSS---SRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVS-PFFPSLDDDDESFPFGLSINLASLIVFLASRSF
        MI+ASNSY+SAMFSY SS    RTL ILLLFTAISLSCLVI REL+S RYFPLFSFSTFS SPP + P FPSL DDDE                      
Subjt:  MISASNSYLSAMFSYSSS---SRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVS-PFFPSLDDDDESFPFGLSINLASLIVFLASRSF

Query:  PIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDY
          D DEY L K L DAAT+++TVILTTLNEAWA+PNSVIDLFL+SFRIGN+T QLL+HLVIIA DKKAF+RCL IHVHCF+L TEGVDFHSEA+FMS DY
Subjt:  PIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDY

Query:  LKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNR
        LKMMWRRIDFLR VLEMGYNFVFTDADVMWFRDPFPFFD++ADFQIACD YLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYS+RETY GYHDQDVLN+
Subjt:  LKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNR

Query:  IKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC
        IKYDFFI EIGLKI FLDTAYFGGFCEPSKDLNRVLTMHANCCIGM++KLHDLRIMLEDWKHYMSMPPY K SS  SWRVPQNC
Subjt:  IKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC

SwissProt top hits | e value | % identity | Alignment
P0C042 Uncharacterized protein At4g15970 | 6.6e-85 | 53.41
Query:  LDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVH-CFALATEGVDFHSEAHFMSPDYLKMMWRR
        L K+L +AAT+DKTVI+TTLN+AW+ PNS  DLFL SF +G  T  LL HLV+  LD++A+ RC ++H H C+ + T G+DF  +  FM+PDYLKMMWRR
Subjt:  LDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVH-CFALATEGVDFHSEAHFMSPDYLKMMWRR

Query:  IDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI
        I+FL  +L++ YNF+FT         PFP      DFQIACD+Y G   D+ N  NGGF +VK+N R+I+FY YWY +R  YP  HDQDVL++IK   + 
Subjt:  IDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI

Query:  DEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSA--LSWRVPQNC
         +IGLK+RFLDT YFGGFCEPS+DL++V TMHANCC+G+++K+ DLR ++ DW++Y+S     KT+    ++WR P+NC
Subjt:  DEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSA--LSWRVPQNC

Q3E6Y3 Uncharacterized protein At1g28695 | 3.0e-53 | 39.36
Query:  NLASLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASP----NSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFAL
        N  + +  L +  +P+D  E  L      AA ++KTVI+T +N+A+       ++++DLFL+SF  G  T  LLDHL+++A+D+ A+ RC    +HC+ +
Subjt:  NLASLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASP----NSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFAL

Query:  ATE-GVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKY
         TE GVD   E  FMS D+++MMWRR   +  VL  GYN +FTD DVMW R P    +++ D QI+ D+ + +   L N    GF +V+SNN++I  ++ 
Subjt:  ATE-GVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKY

Query:  WYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHY
        WY  R    G  +QDVL  +    F +++GL + FL T  F GFC+ S  +  V T+HANCC+ + +K+ DL  +L DWK Y
Subjt:  WYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHY

Q54RP0 UDP-galactose:fucoside alpha-3-galactosyltransferase | 6.8e-05 | 39.71
Query:  VLEMGYNFVFTDADVMWFRDPFPFF--DINADFQIACDQYLGI-PDDLDNRPNGGFNYVKSNNRSIEF
        VL+ GYN ++TD D++W RDPF  F  DIN + Q   D  + +     D+    GF +++SN R+I+F
Subjt:  VLEMGYNFVFTDADVMWFRDPFPFF--DINADFQIACDQYLGI-PDDLDNRPNGGFNYVKSNNRSIEF

Q9FXA7 UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase 3 | 1.5e-04 | 22.44
Query:  FLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDIN
        FL ++ I     +  + +++IA D     +  +       L    +D  S   F S  +  +  RR   L  +LE+GYN ++ D D++W +DPF +   +
Subjt:  FLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDIN

Query:  ADFQIACDQY----LGIPDDLDNRPNGGFNYV-------KSNNRSIEFYKYWYSARETYPGY-------HDQDVLNRIKYDFFIDEIGLKIRFLDTA---
         D     D      L    DL      G  YV       +S +      K W    +  P         HDQ   NR  +        +K+  L  +   
Subjt:  ADFQIACDQY----LGIPDDLDNRPNGGFNYV-------KSNNRSIEFYKYWYSARETYPGY-------HDQDVLNRIKYDFFIDEIGLKIRFLDTA---

Query:  ----YFGGFCEPSKDLNRVLTMHANCCIGMDSKL---HDLRIMLEDWKHYMSMP
            YF      ++   + + +H N  IG D K+    D  + L D  H +  P
Subjt:  ----YFGGFCEPSKDLNRVLTMHANCCIGMDSKL---HDLRIMLEDWKHYMSMP

Arabidopsis top hits | e value | % identity | Alignment
AT1G14590.1 Nucleotide-diphospho-sugar transferase family protein | 2.9e-120 | 60.85
Query:  LLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLN
        L   AIS+SC V+ R  +SL +           SPP+               F LS  L              D +E  L+ VL  AAT D+TV+LTTLN
Subjt:  LLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLN

Query:  EAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVM
         AWA+P SVIDLF +SFRIG  T Q+LDHLVI+ALD KA+ RCL++H HCF+L TEGVDF  EA+FM+  YLKMMWRRID LR VLEMGYNFVFTDADVM
Subjt:  EAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVM

Query:  WFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPS
        WFR+PFP F + ADFQIACD YLG  +DL NRPNGGFN+V+SNNR+I FYKYWY++R  +PGYHDQDVLN +K + F+  IGLK+RFL+TAYFGG CEPS
Subjt:  WFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPS

Query:  KDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC
        +DLN V TMHANCC GM+SKLHDLRIML+DWK +MS+P + K SS  SW+VPQNC
Subjt:  KDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC

AT2G02061.1 Nucleotide-diphospho-sugar transferase family protein | 1.4e-111 | 58.21
Query:  LRYFPLFSFSTF----------SGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSV
        L +F  F FS +            S  +S  FPS++D   S     S++             P +++E  L++VL  AAT D TVILTTLNEAWA+P SV
Subjt:  LRYFPLFSFSTF----------SGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSV

Query:  IDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFH-SEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPF
        IDLF +SFRIG  T +LL HLVIIALD KA+ RC ++H HCF L TEGVDF   EA+FM+P YL MMWRRI FLR VLE GYNFVFTDADVMWFR+PF  
Subjt:  IDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFH-SEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPF

Query:  FDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLT
        F  + DFQIACD Y+G P+D  NRPNGGF +V++NNRSI FYK+WY +R  YP  HDQDVLN IK D F+ ++ ++IRFL+T YFGGFCEPSKDLN V T
Subjt:  FDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLT

Query:  MHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC
        MHANCC G+DSKLHDLRIML+DW+ + S+P +   SS  +W VPQNC
Subjt:  MHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC

AT4G15970.1 Nucleotide-diphospho-sugar transferase family protein | 2.8e-86 | 53.41
Query:  LDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVH-CFALATEGVDFHSEAHFMSPDYLKMMWRR
        L K+L +AAT+DKTVI+TTLN+AW+ PNS  DLFL SF +G  T  LL HLV+  LD++A+ RC ++H H C+ + T G+DF  +  FM+PDYLKMMWRR
Subjt:  LDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVH-CFALATEGVDFHSEAHFMSPDYLKMMWRR

Query:  IDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI
        I+FL  +L++ YNF+FT         PFP      DFQIACD+Y G   D+ N  NGGF +VK+N R+I+FY YWY +R  YP  HDQDVL++IK   + 
Subjt:  IDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI

Query:  DEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSA--LSWRVPQNC
         +IGLK+RFLDT YFGGFCEPS+DL++V TMHANCC+G+++K+ DLR ++ DW++Y+S     KT+    ++WR P+NC
Subjt:  DEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSA--LSWRVPQNC

AT4G19970.1 CONTAINS InterPro DOMAIN/s: Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR005069) | 1.1e-98 | 58.18
Query:  KVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDF
        +VL +A+T+++TVI+TTLN+AWA PNS+ DLFL+SFRIG  T +LL H+V++ LD KAF RC  +H +C+ L T G DF  E  F +PDYLKMMWRRI+ 
Subjt:  KVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDF

Query:  LRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEI
        L  VLEMGYNF+FTDAD+MW RDPFP    + DFQ+ACD++ G P D DN  NGGF YVKSN+RSIEFYK+WY++R  YP  HDQDV N+IK+   + EI
Subjt:  LRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEI

Query:  GLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSM-PPYRKTSSALSWRVPQNC
        G+++RF DT YFGGFC+ S+D+N V TMHANCC+G+  KLHDL ++L+DW++Y+S+  P + T    +W VP  C
Subjt:  GLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSM-PPYRKTSSALSWRVPQNC

AT5G44820.1 Nucleotide-diphospho-sugar transferase family protein | 1.8e-98 | 47.49
Query:  SASNSYLSAMFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDE
        S+ + ++ + F       T  IL+LF  ++ SCLV+ +    L+   + + ++   SP  SP  P+L+  + S                   + P    +
Subjt:  SASNSYLSAMFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDE

Query:  YGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWR
            ++L +A+T + TVI+TTLN+AWA PNS+ DLFL+SFRIG  T QLL H+V++ LD KAF RC  +H +C+ + T   DF  E  + +PDYLKMMW 
Subjt:  YGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWR

Query:  RIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFF
        RID L  VLEMG+NF+FTDAD+MW RDPFP    + DFQ+ACD++ G P D DN  NGGF YV+SNNRSIEFYK+W+ +R  YP  HDQDV NRIK++ F
Subjt:  RIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFF

Query:  IDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSM-PPYRKTSSALSWRVPQNC
        I EIG+++RF DT YFGGFC+ S+D+N V TMHANCCIG+D KLHDL ++L+DW+ Y+S+  P + T    +W VP  C
Subjt:  IDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSM-PPYRKTSSALSWRVPQNC
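
The % identity values listed for the hits above can be approximated from an alignment's middle line. The following is a minimal sketch, not the database's own method: it assumes the usual BLAST-style convention that a letter on the middle line marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap, and it divides by the length of the aligned query (gaps included). The function name `percent_identity` is hypothetical. The middle line is used rather than the Subjt rows because, as displayed on this page, the Subjt rows repeat the query text. The example input is the single-block Q54RP0 alignment.

```python
def percent_identity(query_aligned: str, midline: str) -> float:
    """Percent identity = identical columns / alignment length * 100."""
    # A letter on the middle line marks an identical residue;
    # '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    # The aligned query row (gap characters included) gives the alignment
    # length; the middle line itself may have lost trailing spaces.
    return round(100.0 * identical / len(query_aligned), 2)


# Query row and middle line of the Q54RP0 hit, copied from the page.
query = "VLEMGYNFVFTDADVMWFRDPFPFF--DINADFQIACDQYLGI-PDDLDNRPNGGFNYVKSNNRSIEF"
midline = "VL+ GYN ++TD D++W RDPF  F  DIN + Q   D  + +     D+    GF +++SN R+I+F"

print(percent_identity(query, midline))  # 39.71 (27 identical of 68 columns)
```

For hits whose alignment spans several blocks, the same count would have to be summed over all blocks before dividing, and the displayed blocks may not cover the full alignment used for the reported value.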


Sequences
CDS sequence
ATGTTTATGATCTCTGCTTCTAATTCTTACCTTTCCGCCATGTTTAGTTACTCCTCCTCCTCCCGTACCCTTCACATTCTTCTCCTCTTCACTGCCATTTCCCTC
TCTTGCCTTGTTATTCTCAGAGAACTCAACTCCCTTCGCTACTTCCCTCTTTTCTCCTTCTCTACTTTTTCCGGTTCTCCTCCTGTTTCCCCTTTCTTCCCCTCC
CTCGATGATGACGACGAGTCTTTTCCGTTTGGGTTGTCGATTAATCTAGCTTCCTTAATCGTCTTTCTTGCTTCTCGTTCGTTTCCAATTGATGTTGATGAGTAT
GGACTGGACAAGGTCTTAATGGATGCTGCAACAGATGACAAAACTGTTATTTTAACTACTTTAAATGAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTT
CTACAAAGCTTTAGAATTGGAAATCGAACTCACCAACTATTAGACCATTTGGTTATTATTGCATTGGACAAAAAGGCATTTGTTCGTTGCTTGGATATCCATGTC
CATTGCTTTGCTCTTGCTACTGAAGGAGTTGATTTTCATTCTGAGGCACATTTTATGTCACCTGACTACTTGAAGATGATGTGGAGAAGGATTGATTTTTTGCGA
ATTGTTCTTGAGATGGGGTACAATTTTGTATTCACGGATGCTGATGTTATGTGGTTCAGGGATCCGTTCCCTTTCTTTGATATCAATGCAGATTTCCAGATTGCT
TGTGATCAATACCTGGGCATCCCTGATGATTTAGATAACAGACCAAATGGAGGGTTTAACTATGTAAAATCCAATAATCGGTCAATTGAGTTCTACAAATATTGG
TACTCAGCTCGGGAAACTTATCCAGGATACCATGATCAGGACGTTCTAAATAGGATCAAATACGATTTTTTCATCGATGAAATTGGACTAAAGATTAGATTCTTG
GATACTGCTTACTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAATCGTGTACTAACCATGCATGCAAACTGCTGTATTGGAATGGACAGTAAACTTCATGAT
CTCAGAATTATGCTCGAGGATTGGAAACATTACATGTCGATGCCGCCATATCGTAAGACATCATCAGCGTTGTCTTGGAGGGTTCCTCAGAACTGCAGGTATGGA
ACCATTTCTATGTCTTAG
mRNA sequence
AGAACGCGTACGATGAACTGTTGATTGATGTTGCTGTTCTTGCTCCCTGCTTCGAATATGGTGGAAATTGGAATGCGATCTTCAACCCTAGACGTTTCTATATCG
GTTCGTGATGCGCCGCTGTAATGTTTATGATCTCTGCTTCTAATTCTTACCTTTCCGCCATGTTTAGTTACTCCTCCTCCTCCCGTACCCTTCACATTCTTCTCC
TCTTCACTGCCATTTCCCTCTCTTGCCTTGTTATTCTCAGAGAACTCAACTCCCTTCGCTACTTCCCTCTTTTCTCCTTCTCTACTTTTTCCGGTTCTCCTCCTG
TTTCCCCTTTCTTCCCCTCCCTCGATGATGACGACGAGTCTTTTCCGTTTGGGTTGTCGATTAATCTAGCTTCCTTAATCGTCTTTCTTGCTTCTCGTTCGTTTC
CAATTGATGTTGATGAGTATGGACTGGACAAGGTCTTAATGGATGCTGCAACAGATGACAAAACTGTTATTTTAACTACTTTAAATGAAGCATGGGCATCTCCAA
ATTCAGTCATTGATCTCTTTCTACAAAGCTTTAGAATTGGAAATCGAACTCACCAACTATTAGACCATTTGGTTATTATTGCATTGGACAAAAAGGCATTTGTTC
GTTGCTTGGATATCCATGTCCATTGCTTTGCTCTTGCTACTGAAGGAGTTGATTTTCATTCTGAGGCACATTTTATGTCACCTGACTACTTGAAGATGATGTGGA
GAAGGATTGATTTTTTGCGAATTGTTCTTGAGATGGGGTACAATTTTGTATTCACGGATGCTGATGTTATGTGGTTCAGGGATCCGTTCCCTTTCTTTGATATCA
ATGCAGATTTCCAGATTGCTTGTGATCAATACCTGGGCATCCCTGATGATTTAGATAACAGACCAAATGGAGGGTTTAACTATGTAAAATCCAATAATCGGTCAA
TTGAGTTCTACAAATATTGGTACTCAGCTCGGGAAACTTATCCAGGATACCATGATCAGGACGTTCTAAATAGGATCAAATACGATTTTTTCATCGATGAAATTG
GACTAAAGATTAGATTCTTGGATACTGCTTACTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAATCGTGTACTAACCATGCATGCAAACTGCTGTATTGGAA
TGGACAGTAAACTTCATGATCTCAGAATTATGCTCGAGGATTGGAAACATTACATGTCGATGCCGCCATATCGTAAGACATCATCAGCGTTGTCTTGGAGGGTTC
CTCAGAACTGCAGGTATGGAACCATTTCTATGTCTTAGACAATTTTGCGCTATAATTAGGACGTTTCTATTTCTTTATTCTTGAATTGCATTTTTTATATTCGAA
TCTTTTTCTCCTCCCGTTCCAGTGTCTGATCTTCTAATCTCCAAACAAAGAATTGATGAATCCTACCAAACTTGGAGTGATGCACTTTGAGCAATCATTTTTGCA
TTGCACATTTCGAGTTTTGTGGACCAAAGATATTTCTGCACTACAGCTTATACCAAAGTGAATGTTAGAAGAGTATTGTGGCCGTAGTCTTTATTTGCCAGTTTG
TGTAAATTTATACCCAAATAAATTTCGAGGTCCCGCAATTGTACATCGCCAAATTAGAAAGCCTAACCATCATTAGGTAGTTCGTTTGCTAGAAATTTTGGTAAA
ATGATCAAATTCAGGTTTAGGTATACCTTTAGTTTTCAGATATAAGGAAATTTTCTCTTGCATTTTAATACAACTATTAAGTCCACTTTTATGTACAACTCTATT
ACGCCGTAGCCACGATCCCCTATTATATAAAATGATCAAGCTTATAACCACATAAGATAGAATAAACTACACTTTCTCTCCTGGCCTATAAAAGGCTGATATTGT
TTATGGGTAAGATATGTGTTT
Protein sequence
MFMISASNSYLSAMFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFFPSLDDDDESFPFGLSINLASLIVFLASRSFPIDVDEY
GLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLR
IVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFL
DTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNCRYGTISMS
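
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and that the CDS starts in frame; `CODON_TABLE` and `translate` are illustrative names, not part of any CuGenDB tool. The two fragments used are the first 30 and last 18 bases of the CDS as shown.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTTATGATCTCTGCTTCTAATTCTTAC"))  # MFMISASNSY
print(translate("ACCATTTCTATGTCTTAG"))              # TISMS
```

Both outputs agree with the start (MFMISASNSY...) and end (...TISMS) of the protein sequence listed above; passing the full CDS, with its line breaks, to `translate` extends the same check to the whole sequence.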