CuGenDBv2

Clc04G13060 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G13060
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Nucleotide-diphospho-sugar transferase
Genome location: ClcChr04:26307473..26319032
RNA-Seq Expression: Clc04G13060
Synteny: Clc04G13060
Gene Ontology terms:
GO:0071555 - cell wall organization (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains:
IPR005069 - Nucleotide-diphospho-sugar transferase
IPR029044 - Nucleotide-diphospho-sugar transferases
IPR044821 - Putative nucleotide-diphospho-sugar transferase At1g28695/At4g15970-like


Homology
GenBank top hits (hit | e value | % identity; alignment follows each hit)
KAG6607414.1 hypothetical protein SDJN03_00756, partial [Cucurbita argyrosperma subsp. sororia] | 2.2e-176 | 66.6
Query:  SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEP
        ++ D +P+   +AP   HTAVV W+TVR+SV   G++LGL+VLYNSAINPF  LPVSY+YRAFRS S  ++ LLEK L +A+ ED T+ILTTLN AWA P
Subjt:  SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEP

Query:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPF
        DSLLDLFLKSFH GNGTQRLLKHLVIV LD KAY RCV+ HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL S+LEMG SFVFTDSDIMWLQDPF
Subjt:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPF

Query:  NHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT
        NHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT++FYKFWY+SRT++PG+HDQDVLNKIKHSPLIP+IGLK+RFLDTANFGGFCQMGRD +K 
Subjt:  NHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT

Query:  ATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPKKTGNRRVIANWAKKERRIPKHRYQYHRHLCGHQPTGKYTK
         T+HANCCVGL+NKVHDLRILL DW+ F     N+KAS  PSW+VPQDCRTSFQRGRQ K                                        
Subjt:  ATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPKKTGNRRVIANWAKKERRIPKHRYQYHRHLCGHQPTGKYTK

Query:  RRTFHLFRRPTMSDPYAGAKGGRLTFKGGALASRSKDIDKKKKKKKKEKSKTDDNPTDESEILTSADGVEGGDGAIYTIDAAKRM
                         GAKGGRLTFKGG LASRSK ID KKKKKKK KSK D+NPT E EIL SADG +GG G +YTIDAAKRM
Subjt:  RRTFHLFRRPTMSDPYAGAKGGRLTFKGGALASRSKDIDKKKKKKKKEKSKTDDNPTDESEILTSADGVEGGDGAIYTIDAAKRM

XP_004137392.1 uncharacterized protein At4g15970 [Cucumis sativus] | 1.1e-178 | 83.2
Query:  NNSAADC-EPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAW
        NNSA D  E A K SAPS   T    WRTVR+SVVLVG+ LGL VLYNSAINPFKFLP SY YRAFR SSPHKD +LEKV+KEAAMEDGTIILTTLNDAW
Subjt:  NNSAADC-EPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAW

Query:  AEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQ
        AEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCV++HPHCY+L+TQGTNFSSEAYFMT+DYLKMMWRRIEFLI +LEMGHSFVFTD+DIMWLQ
Subjt:  AEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQ

Query:  DPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDM
        DPFNHFY +ADFQIA DL+LGN E+LNN PNGGFVYV+AN +TV+FYKFWY+SRTIYPGQHDQDVLNKIKHSPLIPKIG+KLRFLDTANFGGFCQMGRDM
Subjt:  DPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDM

Query:  SKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASP--TPSWTVPQDCRTSFQRGRQ---PKKTGNRRV
        SK ATMHANCCVGLENKVHDLRILL+DWN+FFN T  +  SP  T SWTVPQDC+TSFQRGRQ    KK GNRR+
Subjt:  SKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASP--TPSWTVPQDCRTSFQRGRQ---PKKTGNRRV

XP_008438689.1 PREDICTED: uncharacterized protein At4g15970-like [Cucumis melo] | 3.7e-184 | 85.19
Query:  MELNNSAADCEPAAKSSAPSAV----HTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILT
        M+ NNSAAD E A K S PS V     T+VV WRTVR+SVVLVG+ LGL VLYNSAINPFKFLPVSYTYRAFR SSPHKD +LEKV+KEAAMEDGTII+T
Subjt:  MELNNSAADCEPAAKSSAPSAV----HTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILT

Query:  TLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDS
        TLNDAWAEPDSL DLFLKSFH+GNGTQRLLKHLVIVTLDQKAYSRCV+LHPHCY+L+TQGTNFSSEAYFMTSDYLKMMWRRIEFLI +LEMGHSFVFTD+
Subjt:  TLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDS

Query:  DIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFC
        DIMWLQDPFNHFY EADFQIA D +LGN EDLNN PNGGFVYV+ANPKTV+FYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIG+KLRFLDTANFGGFC
Subjt:  DIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFC

Query:  QMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPT--ANNKASPTPSWTVPQDCRTSFQRGRQ---PKKTGN
        QMGRDMSK AT+HANCCVGLENKVHDLRILL+DWNNFFN T   N   S TPSWTVPQDCRTSFQRGRQ    KKTGN
Subjt:  QMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPT--ANNKASPTPSWTVPQDCRTSFQRGRQ---PKKTGN

XP_023524447.1 uncharacterized protein At4g15970-like [Cucurbita pepo subsp. pepo] | 1.2e-158 | 75.28
Query:  SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEP
        ++ D +P+   +AP   HTAVV W+TVR+SV   G++LGLLVLYNSAINPF  LPVSY+YRAFRS S  ++ LLEK L +A+ ED T+ILTTLN AWAEP
Subjt:  SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEP

Query:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPF
        DSLLDLFLKSFH GNGTQRLLKHLVIV LD KAY RCV+ HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL S+LEMG SFVFTDSDIMWLQDPF
Subjt:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPF

Query:  NHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT
        NHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT++FYKFWY+SRT++PG+HDQDVLNKIKHSPLIP+IGLK+RFLDTANFGGFCQMGRD +K 
Subjt:  NHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT

Query:  ATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPK
         T+HANCCVGL+NKVHDLRILL DW+ F     N+KAS  PSW+VPQDCRTSFQRGRQ K
Subjt:  ATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPK

XP_038894961.1 uncharacterized protein At4g15970-like [Benincasa hispida] | 7.5e-185 | 85.94
Query:  NNSAADCEPA--AKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDA
        NNSAAD  PA   K SAPSAVHT  V WR  R SVV VG++LGLLVLYNS INPFKFLPVS TYRAFR S+PHKD LLEKVLKEAAMEDGTIILTTLNDA
Subjt:  NNSAADCEPA--AKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDA

Query:  WAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWL
        WAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLD+KAYSRCV+LHPHCYEL TQGTNFSSEAYFMT DYLKMMWRRIEFL S+L+MG+SFVFTDSDIMWL
Subjt:  WAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWL

Query:  QDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRD
        QDPFNHFYP+ADFQIACD F+GNSEDLNN+PNGGFVYVKANPKTV+FYKFWY+SRTIYPG+HDQDVLNKIKHSPLI KIGLKLRFLDTANFGGFCQMGRD
Subjt:  QDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRD

Query:  MSKTATMHANCCVGLENKVHDLRILLKDWNNFFNP-TANNK-ASPTPSWTVPQDCRTSFQRGRQ---PKKTGNRRVI
        M+K AT+HANCCVGLENKVHDLRILL+DW+NFFNP TA+NK AS TPSWTVPQDCRTSFQRGRQ    K TG+RR++
Subjt:  MSKTATMHANCCVGLENKVHDLRILLKDWNNFFNP-TANNK-ASPTPSWTVPQDCRTSFQRGRQ---PKKTGNRRVI

TrEMBL top hits (hit | e value | % identity; alignment follows each hit)
A0A0A0LT78 Glycosyltransferase | 5.1e-179 | 83.2
Query:  NNSAADC-EPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAW
        NNSA D  E A K SAPS   T    WRTVR+SVVLVG+ LGL VLYNSAINPFKFLP SY YRAFR SSPHKD +LEKV+KEAAMEDGTIILTTLNDAW
Subjt:  NNSAADC-EPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAW

Query:  AEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQ
        AEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCV++HPHCY+L+TQGTNFSSEAYFMT+DYLKMMWRRIEFLI +LEMGHSFVFTD+DIMWLQ
Subjt:  AEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQ

Query:  DPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDM
        DPFNHFY +ADFQIA DL+LGN E+LNN PNGGFVYV+AN +TV+FYKFWY+SRTIYPGQHDQDVLNKIKHSPLIPKIG+KLRFLDTANFGGFCQMGRDM
Subjt:  DPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDM

Query:  SKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASP--TPSWTVPQDCRTSFQRGRQ---PKKTGNRRV
        SK ATMHANCCVGLENKVHDLRILL+DWN+FFN T  +  SP  T SWTVPQDC+TSFQRGRQ    KK GNRR+
Subjt:  SKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASP--TPSWTVPQDCRTSFQRGRQ---PKKTGNRRV

A0A1S3AWN4 Glycosyltransferase | 1.8e-184 | 85.19
Query:  MELNNSAADCEPAAKSSAPSAV----HTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILT
        M+ NNSAAD E A K S PS V     T+VV WRTVR+SVVLVG+ LGL VLYNSAINPFKFLPVSYTYRAFR SSPHKD +LEKV+KEAAMEDGTII+T
Subjt:  MELNNSAADCEPAAKSSAPSAV----HTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILT

Query:  TLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDS
        TLNDAWAEPDSL DLFLKSFH+GNGTQRLLKHLVIVTLDQKAYSRCV+LHPHCY+L+TQGTNFSSEAYFMTSDYLKMMWRRIEFLI +LEMGHSFVFTD+
Subjt:  TLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDS

Query:  DIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFC
        DIMWLQDPFNHFY EADFQIA D +LGN EDLNN PNGGFVYV+ANPKTV+FYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIG+KLRFLDTANFGGFC
Subjt:  DIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFC

Query:  QMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPT--ANNKASPTPSWTVPQDCRTSFQRGRQ---PKKTGN
        QMGRDMSK AT+HANCCVGLENKVHDLRILL+DWNNFFN T   N   S TPSWTVPQDCRTSFQRGRQ    KKTGN
Subjt:  QMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPT--ANNKASPTPSWTVPQDCRTSFQRGRQ---PKKTGN

A0A6J1C6T6 uncharacterized protein At4g15970-like | 4.2e-157 | 76.97
Query:  MELNNSAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSY-TYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLN
        M+ +NSAAD + AA   APS     +VP RTVRIS VL+G+ L +LVLYNSAINPF+FLPVSY TYR   S S   D LLEK+LK A+ EDGT+ILTTLN
Subjt:  MELNNSAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSY-TYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLN

Query:  DAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIM
        DAWAEP SLLDLFL+SFHIGNGT+RLLKHLVIVT+D+KAY+RCV+LHPHCYEL+TQG NFSSEAYFMTSDYL+MMWRRIEFL S+L MG SFVFTDSDIM
Subjt:  DAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIM

Query:  WLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMG
        WLQDPFNHF+P+ADFQIACD FLGNSEDLNN PNGGF YVK+NPKT++FYKFWYQSRTIYPGQHDQDVLNKIK SPLI KIGLK+RFLDTANFGGFCQ  
Subjt:  WLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMG

Query:  RDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDCR
        RD ++ +TMHANCCVGL+NKVHDL+ILL DWN FF  T  +KA+ TPSW+VPQDC+
Subjt:  RDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDCR

A0A6J1GCE5 Glycosyltransferase | 1.1e-157 | 74.44
Query:  SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEP
        ++ D +P+   +AP   HTA+V W+TVR+SV   G++LGL+VLYNSAI PF  LPVSY+YRAFRS S  ++ LLEK L +A+ ED T+ILTTLN AWAEP
Subjt:  SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEP

Query:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPF
        DSLLDLFLKSFH GNGTQRLLKHLVIV LD KAY RCV+ HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL S+LEMG SFVFTDSDIMWLQDPF
Subjt:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPF

Query:  NHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT
        NHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT++FYKFWY+SRT++PG+HDQDVLNKIKHSPLIP+IGLK+RFLDTANFGGFCQMGRD +K 
Subjt:  NHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT

Query:  ATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPK
         T+HANCCVGL+NKVHDLRILL DW+ F     N+KAS  PSW+VPQDCRTSFQRGRQ K
Subjt:  ATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPK

A0A6J1KAZ2 Glycosyltransferase | 5.0e-158 | 75
Query:  SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEP
        S+ D +P    +AP + HTAVV W+TVR+SV   G++LGLLVLYNSAINPF  LPVSY+YRAFRS S  ++ LLEK L +A+ ED T+ILTTLN AWAEP
Subjt:  SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEP

Query:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPF
        +SLLDLFLKSFH GNGTQRLLKHLVIV LD KAY RC + HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL S+LEMG SFVFTDSDIMWLQDPF
Subjt:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPF

Query:  NHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT
        NHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT++FYKFWY+SRT++PG+HDQDVLNKIKHSPLIP+IGLK+RFLDTANFGGFCQMGRD +K 
Subjt:  NHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT

Query:  ATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPK
         T+HANCCVGL NKVHDLRILL DW+ F     N+KAS  PSW+VPQDCRTSFQRGRQ K
Subjt:  ATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPK

SwissProt top hits (hit | e value | % identity; alignment follows each hit)
P0C042 Uncharacterized protein At4g15970 | 1.4e-85 | 53.43
Query:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPH-CYELETQGTNFSSEAYFMTSDYLKMMWRR
        L K+L EAA ED T+I+TTLN AW+EP+S  DLFL SFH+G GT+ LL+HLV+  LD++AYSRC  +HPH CY ++T G +F+ +  FMT DYLKMMWRR
Subjt:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPH-CYELETQGTNFSSEAYFMTSDYLKMMWRR

Query:  IEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI
        IEFL ++L++ ++F+FT         PF     E DFQIACD + G+ +D++N+ NGGF +VKAN +T+ FY +WY SR  YP +HDQDVL++IK     
Subjt:  IEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI

Query:  PKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDC
         KIGLK+RFLDT  FGGFC+  RD+ K  TMHANCCVGLENK+ DLR ++ DW N+ +  A        +W  P++C
Subjt:  PKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDC

Q3E6Y3 Uncharacterized protein At1g28695 | 1.7e-51 | 38.26
Query:  AAMEDGTIILTTLNDAWAEP----DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQ-GTNFSSEAYFMTSDYLKMMWRRIEF
        AA  + T+I+T +N A+ +      ++LDLFL+SF  G GT  LL HL++V +DQ AY RC     HCY++ET+ G +   E  FM+ D+++MMWRR   
Subjt:  AAMEDGTIILTTLNDAWAEP----DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQ-GTNFSSEAYFMTSDYLKMMWRRIEF

Query:  LISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKI
        ++ +L  G++ +FTD+D+MWL+ P +      D QI+ D      + +N     GF +V++N KT+  ++ WY  R    G  +QDVL  +  S    ++
Subjt:  LISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKI

Query:  GLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASP
        GL + FL T  F GFCQ    M    T+HANCC+ +  KV DL  +L+DW  +     N+K SP
Subjt:  GLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASP

Q9FXA7 UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase 3 | 1.9e-05 | 22.78
Query:  FLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPE
        FL ++ I    Q+  + ++++  D     +     P    L     +  S   F +  +  +  RR + L++ILE+G++ ++ D D++WLQDPF++    
Subjt:  FLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPE

Query:  ADFQIACDLF----LGNSEDLNNSPNGGFVYV-------KANPKTVQFYKFWYQSRTIYPGQ-------HDQDVLNKIKH
         D     D+     L +S DL      G  YV       ++        K W +     P         HDQ   N+  H
Subjt:  ADFQIACDLF----LGNSEDLNNSPNGGFVYV-------KANPKTVQFYKFWYQSRTIYPGQ-------HDQDVLNKIKH

Q9M146 UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase MGP4 | 2.7e-04 | 22.99
Query:  SSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSD
        S S  +D  L + +K  A ++GT+I+  ++  +         FL ++ I    Q+    ++++  D     +     P    L     +  +   F +  
Subjt:  SSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSD

Query:  YLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLF----LGNSEDLNNSPNGGFVYV
        +     RR + L+ ILE+G++ ++ D D++WLQDPF +   + D     D+     L +S DL      G  Y+
Subjt:  YLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLF----LGNSEDLNNSPNGGFVYV

Q9ZSJ0 UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase | 4.7e-04 | 23.94
Query:  VVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSS-SPH-KDLLLEKVLKEAA---MEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLV
        +VL+ L L L V      +P    P   +  ++ SS SPH K       L +AA     +GT+I+  ++  +         FL ++ I    Q+  + ++
Subjt:  VVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSS-SPH-KDLLLEKVLKEAA---MEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLV

Query:  IVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDL----FLGNSE
        ++  D     +     P    L     +  +   F +  +     RR + L+ ILE+G++ ++ D D++WLQDPF +     D     D+     L +S 
Subjt:  IVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDL----FLGNSE

Query:  DLNNSPNGGFVYV
        DL      G  Y+
Subjt:  DLNNSPNGGFVYV

Arabidopsis top hits (hit | e value | % identity; alignment follows each hit)
AT1G14590.1 Nucleotide-diphospho-sugar transferase family protein | 1.9e-101 | 53.73
Query:  RISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIV
        R ++ L  + +   VLY +A +   F P  +   ++  +   K   LE VL +AA  D T++LTTLN AWA P S++DLF +SF IG  T ++L HLVIV
Subjt:  RISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIV

Query:  TLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSP
         LD KAYSRC+ LH HC+ L T+G +FS EAYFMT  YLKMMWRRI+ L S+LEMG++FVFTD+D+MW ++PF  FY  ADFQIACD +LG S DL+N P
Subjt:  TLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSP

Query:  NGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNN
        NGGF +V++N +T+ FYK+WY SR  +PG HDQDVLN +K  P + +IGLK+RFL+TA FGG C+  RD++   TMHANCC G+E+K+HDLRI+L+DW +
Subjt:  NGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNN

Query:  FFNPTANNKASPTPSWTVPQDC
        F +   + K S   SW VPQ+C
Subjt:  FFNPTANNKASPTPSWTVPQDC

AT2G02061.1 Nucleotide-diphospho-sugar transferase family protein | 7.2e-101 | 58.48
Query:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFS-SEAYFMTSDYLKMMWRR
        LE+VL+ AA +DGT+ILTTLN+AWA P S++DLF +SF IG GT+RLLKHLVI+ LD KAYSRC  LH HC+ LET+G +FS  EAYFMT  YL MMWRR
Subjt:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFS-SEAYFMTSDYLKMMWRR

Query:  IEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI
        I FL S+LE G++FVFTD+D+MW ++PF  FY + DFQIACD ++G   D  N PNGGF +V+AN +++ FYKFWY SRT YP  HDQDVLN IK  P +
Subjt:  IEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI

Query:  PKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDC
         K+ +++RFL+T  FGGFC+  +D++   TMHANCC GL++K+HDLRI+L+DW +F +   ++  S   +W+VPQ+C
Subjt:  PKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDC

AT4G15970.1 Nucleotide-diphospho-sugar transferase family protein | 1.0e-86 | 53.43
Query:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPH-CYELETQGTNFSSEAYFMTSDYLKMMWRR
        L K+L EAA ED T+I+TTLN AW+EP+S  DLFL SFH+G GT+ LL+HLV+  LD++AYSRC  +HPH CY ++T G +F+ +  FMT DYLKMMWRR
Subjt:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPH-CYELETQGTNFSSEAYFMTSDYLKMMWRR

Query:  IEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI
        IEFL ++L++ ++F+FT         PF     E DFQIACD + G+ +D++N+ NGGF +VKAN +T+ FY +WY SR  YP +HDQDVL++IK     
Subjt:  IEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI

Query:  PKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDC
         KIGLK+RFLDT  FGGFC+  RD+ K  TMHANCCVGLENK+ DLR ++ DW N+ +  A        +W  P++C
Subjt:  PKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDWNNFFNPTANNKASPTPSWTVPQDC

AT4G19970.1 CONTAINS InterPro DOMAIN/s: Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR005069) | 2.6e-98 | 51.65
Query:  RTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRA-----FRSSSP---HKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNG
        + V+  +VLV  +   L+LY +A    + L V+            SSSP    K +   +VL+ A+ E+ T+I+TTLN AWAEP+SL DLFL+SF IG G
Subjt:  RTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRA-----FRSSSP---HKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNG

Query:  TQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLF
        T++LL+H+V+V LD KA++RC  LHP+CY L+T GT+FS E  F T DYLKMMWRRIE L  +LEMG++F+FTD+DIMWL+DPF   YP+ DFQ+ACD F
Subjt:  TQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDLF

Query:  LGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVH
         G+  D +N  NGGF YVK+N ++++FYKFWY SR  YP  HDQDV N+IKH  L+ +IG+++RF DT  FGGFCQ  RD++   TMHANCCVGL  K+H
Subjt:  LGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVH

Query:  DLRILLKDWNNFFNPTANNKASPTPSWTVPQDC
        DL ++L DW N+ + +   K     +W+VP  C
Subjt:  DLRILLKDWNNFFNPTANNKASPTPSWTVPQDC

AT5G44820.1 Nucleotide-diphospho-sugar transferase family protein | 3.1e-96 | 49.41
Query:  RISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSP---------------HKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFH
        RI ++ +GL    LVLY +A  P + L VS       S SP                  L  +++L+ A+ ++ T+I+TTLN AWAEP+SL DLFL+SF 
Subjt:  RISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSP---------------HKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFH

Query:  IGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIA
        IG GTQ+LLKH+V+V LD KA+ RC  LH +CY +ET  T+FS E  + T DYLKMMW RI+ L  +LEMG +F+FTD+DIMWL+DPF   YP+ DFQ+A
Subjt:  IGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIA

Query:  CDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLE
        CD F GN  D +N  NGGF YV++N ++++FYKFW++SR  YP  HDQDV N+IKH P I +IG+++RF DT  FGGFCQ  RD++   TMHANCC+GL+
Subjt:  CDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLE

Query:  NKVHDLRILLKDWNNFFN---PTANNKASPTPSWTVPQDC
         K+HDL ++L DW  + +   P  N       +W+VP  C
Subjt:  NKVHDLRILLKDWNNFFN---PTANNKASPTPSWTVPQDC


Sequences
CDS sequence
ATGGAATTAAATAATTCCGCCGCCGACTGCGAACCTGCCGCCAAGTCCTCGGCTCCCTCGGCCGTTCATACGGCGGTGGTTCCATGGAGGACGGTGAGGATCTCGGTTGT
GTTGGTGGGCCTTATGTTGGGCCTCCTTGTTCTCTACAACTCAGCCATTAATCCTTTCAAATTTCTTCCTGTTTCCTACACCTACCGTGCTTTTCGATCCTCTTCTCCTC
ACAAAGATCTTCTTTTGGAAAAAGTTCTGAAAGAAGCAGCAATGGAAGATGGAACAATAATCTTGACGACGTTGAATGATGCATGGGCAGAGCCAGATTCACTCCTCGAT
CTGTTTCTTAAAAGCTTCCATATTGGAAACGGAACCCAAAGATTATTGAAGCACTTAGTCATAGTCACGTTGGACCAAAAGGCGTATTCTCGTTGCGTGTCCTTACACCC
TCATTGTTATGAATTGGAAACTCAAGGAACCAACTTCTCCAGCGAAGCCTACTTCATGACCTCCGATTACTTGAAAATGATGTGGCGAAGAATCGAATTCCTTATCTCCA
TACTCGAGATGGGTCACAGCTTCGTGTTCACCGATTCTGATATAATGTGGTTGCAAGACCCATTCAATCACTTCTACCCAGAGGCAGATTTTCAAATTGCTTGTGATTTG
TTTTTGGGGAACTCAGAAGATTTAAACAATAGTCCCAATGGAGGGTTTGTGTACGTGAAAGCGAATCCAAAAACGGTACAATTCTACAAGTTTTGGTACCAATCAAGGAC
AATATATCCAGGTCAGCACGACCAAGATGTGCTGAACAAGATCAAACACAGTCCATTGATCCCTAAAATTGGGCTGAAATTAAGGTTTCTGGACACCGCGAATTTCGGAG
GGTTCTGTCAGATGGGGAGGGACATGAGCAAGACGGCTACAATGCATGCCAATTGTTGCGTTGGACTAGAGAACAAAGTTCACGATCTCAGGATTTTGCTAAAAGATTGG
AATAACTTCTTTAATCCAACTGCGAATAACAAAGCTTCCCCTACCCCTTCATGGACTGTTCCTCAAGATTGCAGAACTTCATTTCAAAGAGGGAGGCAACCTAAGAAAAC
TGGGAACAGAAGGGTTATTGCAAATTGGGCCAAGAAAGAACGGAGGATACCCAAACATAGATATCAATATCATCGTCATCTCTGCGGGCATCAACCAACTGGAAAGTACA
CCAAGCGGCGAACATTTCATCTCTTCCGACGTCCTACGATGTCGGATCCATATGCAGGGGCGAAAGGAGGTAGGCTCACCTTCAAGGGAGGAGCCTTAGCCTCTCGTAGC
AAGGATATTGACAAGAAGAAGAAGAAGAAGAAGAAAGAAAAAAGTAAAACCGACGATAACCCTACGGACGAGTCCGAGATTTTGACGTCGGCTGATGGTGTAGAAGGTGG
AGACGGAGCCATCTATACTATTGATGCGGCCAAGCGTATGAAGTATGAGGAGCTATTCCCTGTGGAGACCAGGAAGTTTGGTTACGATCCTAACAACTCCAATACCAAGT
TCAAGTCTGTGGAGGATGCTCTCGATGACCGTGTCAAGAAGAAGGCGGATCGCATCGGTCATAGTAGAGATTCCAGGTATTTCTTGTAA
mRNA sequence
ATGGAATTAAATAATTCCGCCGCCGACTGCGAACCTGCCGCCAAGTCCTCGGCTCCCTCGGCCGTTCATACGGCGGTGGTTCCATGGAGGACGGTGAGGATCTCGGTTGT
GTTGGTGGGCCTTATGTTGGGCCTCCTTGTTCTCTACAACTCAGCCATTAATCCTTTCAAATTTCTTCCTGTTTCCTACACCTACCGTGCTTTTCGATCCTCTTCTCCTC
ACAAAGATCTTCTTTTGGAAAAAGTTCTGAAAGAAGCAGCAATGGAAGATGGAACAATAATCTTGACGACGTTGAATGATGCATGGGCAGAGCCAGATTCACTCCTCGAT
CTGTTTCTTAAAAGCTTCCATATTGGAAACGGAACCCAAAGATTATTGAAGCACTTAGTCATAGTCACGTTGGACCAAAAGGCGTATTCTCGTTGCGTGTCCTTACACCC
TCATTGTTATGAATTGGAAACTCAAGGAACCAACTTCTCCAGCGAAGCCTACTTCATGACCTCCGATTACTTGAAAATGATGTGGCGAAGAATCGAATTCCTTATCTCCA
TACTCGAGATGGGTCACAGCTTCGTGTTCACCGATTCTGATATAATGTGGTTGCAAGACCCATTCAATCACTTCTACCCAGAGGCAGATTTTCAAATTGCTTGTGATTTG
TTTTTGGGGAACTCAGAAGATTTAAACAATAGTCCCAATGGAGGGTTTGTGTACGTGAAAGCGAATCCAAAAACGGTACAATTCTACAAGTTTTGGTACCAATCAAGGAC
AATATATCCAGGTCAGCACGACCAAGATGTGCTGAACAAGATCAAACACAGTCCATTGATCCCTAAAATTGGGCTGAAATTAAGGTTTCTGGACACCGCGAATTTCGGAG
GGTTCTGTCAGATGGGGAGGGACATGAGCAAGACGGCTACAATGCATGCCAATTGTTGCGTTGGACTAGAGAACAAAGTTCACGATCTCAGGATTTTGCTAAAAGATTGG
AATAACTTCTTTAATCCAACTGCGAATAACAAAGCTTCCCCTACCCCTTCATGGACTGTTCCTCAAGATTGCAGAACTTCATTTCAAAGAGGGAGGCAACCTAAGAAAAC
TGGGAACAGAAGGGTTATTGCAAATTGGGCCAAGAAAGAACGGAGGATACCCAAACATAGATATCAATATCATCGTCATCTCTGCGGGCATCAACCAACTGGAAAGTACA
CCAAGCGGCGAACATTTCATCTCTTCCGACGTCCTACGATGTCGGATCCATATGCAGGGGCGAAAGGAGGTAGGCTCACCTTCAAGGGAGGAGCCTTAGCCTCTCGTAGC
AAGGATATTGACAAGAAGAAGAAGAAGAAGAAGAAAGAAAAAAGTAAAACCGACGATAACCCTACGGACGAGTCCGAGATTTTGACGTCGGCTGATGGTGTAGAAGGTGG
AGACGGAGCCATCTATACTATTGATGCGGCCAAGCGTATGAAGTATGAGGAGCTATTCCCTGTGGAGACCAGGAAGTTTGGTTACGATCCTAACAACTCCAATACCAAGT
TCAAGTCTGTGGAGGATGCTCTCGATGACCGTGTCAAGAAGAAGGCGGATCGCATCGGTCATAGTAGAGATTCCAGGTATTTCTTGTAAGGATGCTCATCATGCTCCACT
AAGCAACGGGTTTAGCATCACTATGCCTAAGATTAACGGGTTTCATGGTATTGTGATCTATATAGACTCATGGACCCATTACCATTGAAGAAATTTTAGTTTCGTGCTGA
TTGTAATTACTTCCATCCTCCCCTCTGATTGTTTTGCTATCCCACACCGTGTCTGAGGGGTCAAATTTGTACCGGCGAACATGAGAAGATGGCTTAATCTTTCGTTGTCA
GTATTTTGTATGGATATTTCTTACAATTCAGGAACCAATTAATATAGCATGGATTGAAATATAAATCTTCCTGACTTGCACAAAAGTCATTGTAATGGTCTTCTGTTGGT
TAAAGTTAAGAGTATGGAAAAGCAGGCTATACAGGAATACTTTTATTGTTACATATAAGTTTACATCAATG
Protein sequence
MELNNSAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLD
LFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISILEMGHSFVFTDSDIMWLQDPFNHFYPEADFQIACDL
FLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLKDW
NNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPKKTGNRRVIANWAKKERRIPKHRYQYHRHLCGHQPTGKYTKRRTFHLFRRPTMSDPYAGAKGGRLTFKGGALASRS
KDIDKKKKKKKKEKSKTDDNPTDESEILTSADGVEGGDGAIYTIDAAKRMKYEELFPVETRKFGYDPNNSNTKFKSVEDALDDRVKKKADRIGHSRDSRYFL