; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS018821 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS018821
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionNucleotide-diphospho-sugar transferase family protein
Genome locationscaffold14:134889..137054
RNA-Seq ExpressionMS018821
SyntenyMS018821
Gene Ontology termsGO:0071555 - cell wall organization (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR005069 - Nucleotide-diphospho-sugar transferase
IPR044821 - Putative nucleotide-diphospho-sugar transferase At1g28695/At4g15970-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137392.1 uncharacterized protein At4g15970 [Cucumis sativus]5.8e-14673.75Show/hide
Query:  AAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHI
        + AP+T    RTVR+S VL+GV L + VLYNSAINPF+FLP SY  YR     S   DP+LEK++K A+ EDGT+ILTTLNDAWAEP SLLDLFL+SFHI
Subjt:  AAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHI

Query:  GNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIAC
        GNGT+RLLKHLVIVT+D+KAY+RCVA+HPHCY+LDTQG NFSSEAYFMT+DYL+MMWRRIEFL  VL MG SFVFTD+DIMWLQDPFNHF+ DADFQIA 
Subjt:  GNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIAC

Query:  DYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDN
        D +LGN E+LNN PNGGF YV++N +T+KFYKFWY+SRTIYPGQHDQDVLNKIK SPLI KIG+K+RFLDTANFGGFCQ  RD ++++TMHANCCVGL+N
Subjt:  DYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDN

Query:  KVHDLKILLHDWNTFFTQTPRD--KAASTPSWSVPQDCK
        KVHDL+ILL DWN+FF QT  D    +ST SW+VPQDCK
Subjt:  KVHDLKILLHDWNTFFTQTPRD--KAASTPSWSVPQDCK

XP_008438689.1 PREDICTED: uncharacterized protein At4g15970-like [Cucumis melo]2.1e-14871.55Show/hide
Query:  MKDSNSAADLQIA---AAPS--------TIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVIL
        MK++NSAAD + A   + PS        ++V  RTVR+S VL+GV L + VLYNSAINPF+FLPVSY TYR     S   DP+LEK++K A+ EDGT+I+
Subjt:  MKDSNSAADLQIA---AAPS--------TIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVIL

Query:  TTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTD
        TTLNDAWAEP SL DLFL+SFH+GNGT+RLLKHLVIVT+D+KAY+RCVALHPHCY+LDTQG NFSSEAYFMTSDYL+MMWRRIEFL  VL MG SFVFTD
Subjt:  TTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTD

Query:  SDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGF
        +DIMWLQDPFNHF+ +ADFQIA D +LGN EDLNN PNGGF YV++NPKT+KFYKFWYQSRTIYPGQHDQDVLNKIK SPLI KIG+K+RFLDTANFGGF
Subjt:  SDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGF

Query:  CQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQT--PRDKAASTPSWSVPQDCK
        CQ  RD ++++T+HANCCVGL+NKVHDL+ILL DWN FF +T       +STPSW+VPQDC+
Subjt:  CQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQT--PRDKAASTPSWSVPQDCK

XP_022137284.1 uncharacterized protein At4g15970-like [Momordica charantia]3.9e-206100Show/hide
Query:  MKDSNSAADLQIAAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPG
        MKDSNSAADLQIAAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPG
Subjt:  MKDSNSAADLQIAAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPG

Query:  SLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFN
        SLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFN
Subjt:  SLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFN

Query:  HFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVS
        HFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVS
Subjt:  HFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVS

Query:  TMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCKYVLL
        TMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCKYVLL
Subjt:  TMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCKYVLL

XP_023524447.1 uncharacterized protein At4g15970-like [Cucurbita pepo subsp. pepo]1.2e-14375.22Show/hide
Query:  APSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGN
        A + +V  +TVR+S    GV L +LVLYNSAINPF  LPVSY +YR   S S   +PLLEK L  AS ED TVILTTLN AWAEP SLLDLFL+SFH GN
Subjt:  APSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGN

Query:  GTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDY
        GT+RLLKHLVIV +D KAY RCVA HPHCY+LDT+G NFS EAYFMT+DYL+MMWRRI+FLTSVL MGFSFVFTDSDIMWLQDPFNHFHPDADFQIACD 
Subjt:  GTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDY

Query:  FLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKV
        F G+SEDLNNRPNGGF YVKSN KTI+FYKFWY+SRT++PG+HDQDVLNKIK SPLI +IGLKIRFLDTANFGGFCQ  RDF +V T+HANCCVGLDNKV
Subjt:  FLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKV

Query:  HDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCK
        HDL+ILL+DW+ F       KA+S PSWSVPQDC+
Subjt:  HDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCK

XP_038894961.1 uncharacterized protein At4g15970-like [Benincasa hispida]1.9e-15275.07Show/hide
Query:  SNSAADLQIA-----AAPS----TIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLND
        +NSAAD   A     +APS    T V  R  R S V +GV L +LVLYNS INPF+FLPVS  TYR     +   DPLLEK+LK A+ EDGT+ILTTLND
Subjt:  SNSAADLQIA-----AAPS----TIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLND

Query:  AWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMW
        AWAEP SLLDLFL+SFHIGNGT+RLLKHLVIVT+D+KAY+RCVALHPHCYEL+TQG NFSSEAYFMT DYL+MMWRRIEFLTSVL+MG+SFVFTDSDIMW
Subjt:  AWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMW

Query:  LQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSR
        LQDPFNHF+PDADFQIACD+F+GNSEDLNN PNGGF YVK+NPKT+KFYKFWY+SRTIYPG+HDQDVLNKIK SPLISKIGLK+RFLDTANFGGFCQ  R
Subjt:  LQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSR

Query:  DFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDK--AASTPSWSVPQDCK
        D N+++T+HANCCVGL+NKVHDL+ILL DW+ FF  T  D   A+STPSW+VPQDC+
Subjt:  DFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDK--AASTPSWSVPQDCK

TrEMBL top hitse value%identityAlignment
A0A0A0LT78 Glycosyltransferase2.8e-14673.75Show/hide
Query:  AAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHI
        + AP+T    RTVR+S VL+GV L + VLYNSAINPF+FLP SY  YR     S   DP+LEK++K A+ EDGT+ILTTLNDAWAEP SLLDLFL+SFHI
Subjt:  AAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHI

Query:  GNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIAC
        GNGT+RLLKHLVIVT+D+KAY+RCVA+HPHCY+LDTQG NFSSEAYFMT+DYL+MMWRRIEFL  VL MG SFVFTD+DIMWLQDPFNHF+ DADFQIA 
Subjt:  GNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIAC

Query:  DYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDN
        D +LGN E+LNN PNGGF YV++N +T+KFYKFWY+SRTIYPGQHDQDVLNKIK SPLI KIG+K+RFLDTANFGGFCQ  RD ++++TMHANCCVGL+N
Subjt:  DYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDN

Query:  KVHDLKILLHDWNTFFTQTPRD--KAASTPSWSVPQDCK
        KVHDL+ILL DWN+FF QT  D    +ST SW+VPQDCK
Subjt:  KVHDLKILLHDWNTFFTQTPRD--KAASTPSWSVPQDCK

A0A1S3AWN4 Glycosyltransferase1.0e-14871.55Show/hide
Query:  MKDSNSAADLQIA---AAPS--------TIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVIL
        MK++NSAAD + A   + PS        ++V  RTVR+S VL+GV L + VLYNSAINPF+FLPVSY TYR     S   DP+LEK++K A+ EDGT+I+
Subjt:  MKDSNSAADLQIA---AAPS--------TIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVIL

Query:  TTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTD
        TTLNDAWAEP SL DLFL+SFH+GNGT+RLLKHLVIVT+D+KAY+RCVALHPHCY+LDTQG NFSSEAYFMTSDYL+MMWRRIEFL  VL MG SFVFTD
Subjt:  TTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTD

Query:  SDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGF
        +DIMWLQDPFNHF+ +ADFQIA D +LGN EDLNN PNGGF YV++NPKT+KFYKFWYQSRTIYPGQHDQDVLNKIK SPLI KIG+K+RFLDTANFGGF
Subjt:  SDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGF

Query:  CQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQT--PRDKAASTPSWSVPQDCK
        CQ  RD ++++T+HANCCVGL+NKVHDL+ILL DWN FF +T       +STPSW+VPQDC+
Subjt:  CQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQT--PRDKAASTPSWSVPQDCK

A0A6J1C6T6 uncharacterized protein At4g15970-like1.9e-206100Show/hide
Query:  MKDSNSAADLQIAAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPG
        MKDSNSAADLQIAAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPG
Subjt:  MKDSNSAADLQIAAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPG

Query:  SLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFN
        SLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFN
Subjt:  SLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFN

Query:  HFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVS
        HFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVS
Subjt:  HFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVS

Query:  TMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCKYVLL
        TMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCKYVLL
Subjt:  TMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCKYVLL

A0A6J1GCE5 Glycosyltransferase6.5e-14374.63Show/hide
Query:  APSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGN
        A + +V  +TVR+S    GV L ++VLYNSAI PF  LPVSY +YR   S S   +PLLEK L  AS ED TVILTTLN AWAEP SLLDLFL+SFH GN
Subjt:  APSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGN

Query:  GTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDY
        GT+RLLKHLVIV +D KAY RCVA HPHCY+LDT+G NFS EAYFMT+DYL+MMWRRI+FLTSVL MGFSFVFTDSDIMWLQDPFNHFHPDADFQIACD 
Subjt:  GTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDY

Query:  FLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKV
        F G+SEDLNNRPNGGF YVKSN KTI+FYKFWY+SRT++PG+HDQDVLNKIK SPLI +IGLKIRFLDTANFGGFCQ  RDF +V T+HANCCVGLDNKV
Subjt:  FLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKV

Query:  HDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCK
        HDL+ILL+DW+ F       KA+S PSWSVPQDC+
Subjt:  HDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCK

A0A6J1KAZ2 Glycosyltransferase8.5e-14374.4Show/hide
Query:  AAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIG
        +A + +V  +TVR+S    GV L +LVLYNSAINPF  LPVSY +YR   S S   +PLLEK L  AS ED TVILTTLN AWAEP SLLDLFL+SFH G
Subjt:  AAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIG

Query:  NGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACD
        NGT+RLLKHLVIV +D KAY RC A HPHCY+LDT+G NFS EAYFMT+DYL+MMWRRI+FLTSVL MGFSFVFTDSDIMWLQDPFNHFHPDADFQIACD
Subjt:  NGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACD

Query:  YFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNK
         F G+SEDLNNRPNGGF YVKSN KTI+FYKFWY+SRT++PG+HDQDVLNKIK SPLI +IGLKIRFLDTANFGGFCQ  RDF +V T+HANCCVGL+NK
Subjt:  YFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNK

Query:  VHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCK
        VHDL+ILL+DW+ F       KA+S PSWSVPQDC+
Subjt:  VHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCK

SwissProt top hitse value%identityAlignment
P0C042 Uncharacterized protein At4g159703.9e-8451.99Show/hide
Query:  LEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPH-CYELDTQGINFSSEAYFMTSDYLQMMWRR
        L KIL  A+TED TVI+TTLN AW+EP S  DLFL SFH+G GT+ LL+HLV+  +D++AY+RC  +HPH CY + T GI+F+ +  FMT DYL+MMWRR
Subjt:  LEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPH-CYELDTQGINFSSEAYFMTSDYLQMMWRR

Query:  IEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLI
        IEFL ++L++ ++F+FT         PF     + DFQIACD + G+ +D++N  NGGF +VK+N +TI FY +WY SR  YP +HDQDVL++IK     
Subjt:  IEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLI

Query:  SKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC
        +KIGLK+RFLDT  FGGFC+PSRD ++V TMHANCCVGL+NK+ DL+ ++ DW   +    +       +W  P++C
Subjt:  SKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC

Q3E6Y3 Uncharacterized protein At1g286953.3e-5136.45Show/hide
Query:  VALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEP----GSLLDLFLESFHIGNGTERLLKHLVIVTMD
        + +A+ + +  A+  F +      T R +  P    D L   +   A+  + TVI+T +N A+ +      ++LDLFLESF  G GT  LL HL++V +D
Subjt:  VALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEP----GSLLDLFLESFHIGNGTERLLKHLVIVTMD

Query:  KKAYARCVALHPHCYELDTQ-GINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNG
        + AY RC     HCY+++T+ G++   E  FM+ D+++MMWRR   +  VLR G++ +FTD+D+MWL+ P +  +   D QI+ D      + +N     
Subjt:  KKAYARCVALHPHCYELDTQ-GINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNG

Query:  GFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTF
        GF +V+SN KTI  ++ WY  R    G  +QDVL  +  S   +++GL + FL T  F GFCQ S     V+T+HANCC+ +  KV DL  +L DW  +
Subjt:  GFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTF

Q54RP0 UDP-galactose:fucoside alpha-3-galactosyltransferase4.0e-0437.68Show/hide
Query:  VLRMGFSFVFTDSDIMWLQDPFNHFHPDAD----FQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKF
        VL+ G++ ++TD+DI+W +DPF HF+ D +    F    D  L   +D ++    GF +++SN +TIKF
Subjt:  VLRMGFSFVFTDSDIMWLQDPFNHFHPDAD----FQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKF

Q9FXA7 UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase 33.0e-0424.41Show/hide
Query:  FVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLT--TDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIV
        FV+LGV    L L  S++  F F   + ++  PS+S S++   D  L + +K  +  + TVI+  ++  +         FL ++ I    ++  + ++++
Subjt:  FVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLT--TDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIV

Query:  TMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDYF----LGNSEDL
          D     +     P    L    ++  S   F +  +  +  RR + L ++L +G++ ++ D D++WLQDPF++     D     D      L +S DL
Subjt:  TMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDYF----LGNSEDL

Query:  NNRPNGGFTYVKS
              G TYV S
Subjt:  NNRPNGGFTYVKS

Arabidopsis top hitse value%identityAlignment
AT1G14590.1 Nucleotide-diphospho-sugar transferase family protein3.1e-10553.89Show/hide
Query:  APSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGN
        +P   +P R  R +  L  ++++  VLY +A +   F P  +      +S     +P LE +L  A+T D TV+LTTLN AWA PGS++DLF ESF IG 
Subjt:  APSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGN

Query:  GTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDY
         T ++L HLVIV +D KAY+RC+ LH HC+ L T+G++FS EAYFMT  YL+MMWRRI+ L SVL MG++FVFTD+D+MW ++PF  F+  ADFQIACD+
Subjt:  GTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDY

Query:  FLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKV
        +LG S DL+NRPNGGF +V+SN +TI FYK+WY SR  +PG HDQDVLN +K  P + +IGLK+RFL+TA FGG C+PSRD N V TMHANCC G+++K+
Subjt:  FLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKV

Query:  HDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC
        HDL+I+L DW  F +     K +S  SW VPQ+C
Subjt:  HDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC

AT2G02061.1 Nucleotide-diphospho-sugar transferase family protein5.2e-10852.89Show/hide
Query:  SNSAADLQIAAAPSTIVP-------RRTVRISFVLLGVAL-----AILVLYNSAINPFRFLP-VSYTTYRPSASPSLT----TDPLLEKILKNASTEDGT
        S+SAA    AA  S ++P       R  + + F+    +L      + ++ +S+    R  P V+ ++  PS SPSL+     +P LE++L+ A+T+DGT
Subjt:  SNSAADLQIAAAPSTIVP-------RRTVRISFVLLGVAL-----AILVLYNSAINPFRFLP-VSYTTYRPSASPSLT----TDPLLEKILKNASTEDGT

Query:  VILTTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFS-SEAYFMTSDYLQMMWRRIEFLTSVLRMGFSF
        VILTTLN+AWA PGS++DLF ESF IG GT RLLKHLVI+ +D KAY+RC  LH HC+ L+T+G++FS  EAYFMT  YL MMWRRI FL SVL  G++F
Subjt:  VILTTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFS-SEAYFMTSDYLQMMWRRIEFLTSVLRMGFSF

Query:  VFTDSDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTAN
        VFTD+D+MW ++PF  F+ D DFQIACD+++G   D  NRPNGGFT+V++N ++I FYKFWY SRT YP  HDQDVLN IKT P + K+ ++IRFL+T  
Subjt:  VFTDSDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTAN

Query:  FGGFCQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC
        FGGFC+PS+D N V TMHANCC GLD+K+HDL+I+L DW  F +       +S  +WSVPQ+C
Subjt:  FGGFCQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC

AT4G15970.1 Nucleotide-diphospho-sugar transferase family protein7.3e-8652.35Show/hide
Query:  LEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPH-CYELDTQGINFSSEAYFMTSDYLQMMWRR
        L KIL  A+TED TVI+TTLN AW+EP S  DLFL SFH+G GT+ LL+HLV+  +D++AY+RC  +HPH CY + T GI+F+ +  FMT DYL+MMWRR
Subjt:  LEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIGNGTERLLKHLVIVTMDKKAYARCVALHPH-CYELDTQGINFSSEAYFMTSDYLQMMWRR

Query:  IEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLI
        IEFL ++L++ ++F+FT         PF     + DFQIACD + G+ +D++N  NGGFT+VK+N +TI FY +WY SR  YP +HDQDVL++IK     
Subjt:  IEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLI

Query:  SKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC
        +KIGLK+RFLDT  FGGFC+PSRD ++V TMHANCCVGL+NK+ DL+ ++ DW   +    +       +W  P++C
Subjt:  SKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC

AT4G19970.1 CONTAINS InterPro DOMAIN/s: Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR005069)3.8e-10353.69Show/hide
Query:  STIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPL-------LEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLES
        S I  +   +I  ++LG+A A L+LY +A    + L V+  + RP    + ++ PL         ++L+NASTE+ TVI+TTLN AWAEP SL DLFLES
Subjt:  STIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPL-------LEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLES

Query:  FHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQ
        F IG GT++LL+H+V+V +D KA+ARC  LHP+CY L T G +FS E  F T DYL+MMWRRIE LT VL MG++F+FTD+DIMWL+DPF   +PD DFQ
Subjt:  FHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQ

Query:  IACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVG
        +ACD F G+  D +N  NGGFTYVKSN ++I+FYKFWY SR  YP  HDQDV N+IK   L+S+IG+++RF DT  FGGFCQ SRD N V TMHANCCVG
Subjt:  IACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVG

Query:  LDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC
        L  K+HDL ++L DW  + + +   +     +WSVP  C
Subjt:  LDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC

AT5G44820.1 Nucleotide-diphospho-sugar transferase family protein1.2e-10152.65Show/hide
Query:  RRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSL-----------TTDPLL--EKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLE
        +   RI  + LG+  + LVLY +A    R    + T+ + S SP L           TT P L  ++IL+NAST++ TVI+TTLN AWAEP SL DLFLE
Subjt:  RRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSL-----------TTDPLL--EKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLE

Query:  SFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADF
        SF IG GT++LLKH+V+V +D KA+ RC  LH +CY ++T   +FS E  + T DYL+MMW RI+ LT VL MGF+F+FTD+DIMWL+DPF   +PD DF
Subjt:  SFHIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADF

Query:  QIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCV
        Q+ACD F GN  D +N  NGGFTYV+SN ++I+FYKFW++SR  YP  HDQDV N+IK  P IS+IG+++RF DT  FGGFCQ SRD N V TMHANCC+
Subjt:  QIACDYFLGNSEDLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCV

Query:  GLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC
        GLD K+HDL ++L DW  + + +   +     +WSVP  C
Subjt:  GLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGATTCCAATTCCGCCGCTGACCTTCAAATCGCCGCCGCTCCTTCCACCATTGTTCCCCGGAGGACGGTCAGGATCTCCTTCGTGCTGCTCGGCGTTGCTCTGGC
CATCCTCGTTCTCTACAACTCCGCCATTAATCCTTTCAGATTTCTCCCCGTTTCCTACACCACCTACCGCCCCTCTGCATCCCCTTCTCTCACCACAGACCCTCTTCTGG
AAAAAATTCTGAAGAATGCATCGACGGAAGATGGAACGGTAATATTGACAACACTGAACGACGCGTGGGCGGAGCCGGGTTCGCTACTGGATCTGTTTCTGGAAAGCTTC
CACATCGGAAACGGAACGGAGAGGCTACTGAAGCACTTGGTAATAGTGACGATGGACAAAAAGGCGTATGCCCGTTGCGTGGCGTTGCACCCGCATTGCTACGAACTGGA
CACACAAGGCATCAATTTCTCCAGCGAGGCCTACTTCATGACCTCTGATTACCTCCAGATGATGTGGCGGAGAATCGAATTTCTCACCTCTGTTCTCCGGATGGGTTTCA
GCTTCGTCTTCACCGATTCTGATATCATGTGGCTCCAAGACCCCTTCAACCACTTCCACCCAGATGCGGATTTTCAGATTGCGTGCGATTACTTTCTGGGGAATTCGGAG
GATCTAAACAACCGCCCCAACGGGGGGTTCACCTACGTCAAATCAAATCCCAAAACAATCAAATTCTACAAGTTTTGGTACCAATCCAGGACCATATATCCGGGCCAGCA
CGACCAAGACGTGCTCAACAAGATCAAGACCAGCCCATTGATCTCTAAAATAGGCCTCAAGATAAGGTTTCTGGACACTGCCAACTTCGGAGGCTTCTGTCAGCCCAGCC
GGGACTTCAACCGGGTCTCCACCATGCACGCCAATTGCTGCGTCGGCCTCGACAACAAAGTTCACGATCTCAAGATTCTGCTCCATGACTGGAATACCTTTTTTACGCAG
ACTCCCCGGGACAAAGCTGCCTCCACTCCTTCGTGGAGCGTTCCTCAAGATTGCAAGTACGTTTTATTA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGATTCCAATTCCGCCGCTGACCTTCAAATCGCCGCCGCTCCTTCCACCATTGTTCCCCGGAGGACGGTCAGGATCTCCTTCGTGCTGCTCGGCGTTGCTCTGGC
CATCCTCGTTCTCTACAACTCCGCCATTAATCCTTTCAGATTTCTCCCCGTTTCCTACACCACCTACCGCCCCTCTGCATCCCCTTCTCTCACCACAGACCCTCTTCTGG
AAAAAATTCTGAAGAATGCATCGACGGAAGATGGAACGGTAATATTGACAACACTGAACGACGCGTGGGCGGAGCCGGGTTCGCTACTGGATCTGTTTCTGGAAAGCTTC
CACATCGGAAACGGAACGGAGAGGCTACTGAAGCACTTGGTAATAGTGACGATGGACAAAAAGGCGTATGCCCGTTGCGTGGCGTTGCACCCGCATTGCTACGAACTGGA
CACACAAGGCATCAATTTCTCCAGCGAGGCCTACTTCATGACCTCTGATTACCTCCAGATGATGTGGCGGAGAATCGAATTTCTCACCTCTGTTCTCCGGATGGGTTTCA
GCTTCGTCTTCACCGATTCTGATATCATGTGGCTCCAAGACCCCTTCAACCACTTCCACCCAGATGCGGATTTTCAGATTGCGTGCGATTACTTTCTGGGGAATTCGGAG
GATCTAAACAACCGCCCCAACGGGGGGTTCACCTACGTCAAATCAAATCCCAAAACAATCAAATTCTACAAGTTTTGGTACCAATCCAGGACCATATATCCGGGCCAGCA
CGACCAAGACGTGCTCAACAAGATCAAGACCAGCCCATTGATCTCTAAAATAGGCCTCAAGATAAGGTTTCTGGACACTGCCAACTTCGGAGGCTTCTGTCAGCCCAGCC
GGGACTTCAACCGGGTCTCCACCATGCACGCCAATTGCTGCGTCGGCCTCGACAACAAAGTTCACGATCTCAAGATTCTGCTCCATGACTGGAATACCTTTTTTACGCAG
ACTCCCCGGGACAAAGCTGCCTCCACTCCTTCGTGGAGCGTTCCTCAAGATTGCAAGTACGTTTTATTA
Protein sequenceShow/hide protein sequence
MKDSNSAADLQIAAAPSTIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLPVSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESF
HIGNGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIEFLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDYFLGNSE
DLNNRPNGGFTYVKSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPSRDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQ
TPRDKAASTPSWSVPQDCKYVLL