CuGenDBv2

HG10020536 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10020536
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Nucleotide-diphospho-sugar transferase family protein
Genome location: Chr05:384802..390906
RNA-Seq Expression: HG10020536
Synteny: HG10020536
Gene Ontology terms:
GO:0071555 - cell wall organization (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains:
IPR005069 - Nucleotide-diphospho-sugar transferase
IPR029044 - Nucleotide-diphospho-sugar transferases
IPR044821 - Putative nucleotide-diphospho-sugar transferase At1g28695/At4g15970-like
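
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the field can be split and the genomic span computed as follows. The function names `parse_location` and `span_length` are illustrative and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chrom:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr05:384802..390906")
print(chrom, span_length(start, end))  # prints: Chr05 6105
```

The span covers the whole gene model on the chromosome, so it is expected to be longer than the CDS listed under Sequences.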


Homology
GenBank top hits (e value, % identity, alignment)
KAG6607414.1 hypothetical protein SDJN03_00756, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.1e-178 | % identity: 72.26
Query:  SAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEP
        ++ D +P+   +AP   HTAVV W+TVR+SV F GV+LGL+VLYNSAINPF  LPVSY+YRAFRS S  R+ LLEK L +A+ ED T+ILTTLN AWA P
Subjt:  SAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEP

Query:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPF
        DSLLDLFLKSFH GNGTQRLLKHLVIV LD KAY RC+A HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL SVLEMG+SFVFTDSDIMWLQDPF
Subjt:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPF

Query:  NHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKM
        NHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT++FYKFWY+SRT++PG+HDQDVLNKIKHSPLI +IGLK+RFLDTANFGGFCQMGRD  K+
Subjt:  NHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKM

Query:  ATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCRVIANWAKKERKIPKLRYHRHLCKHQATGAKGGRLTFKGGVLASRSKDIDK
         TVHANCCVGL+NKVHDLRILL DW+ F N    +KASS PSW+VPQDCR             + R  +H       GAKGGRLTFKGGVLASRSK ID 
Subjt:  ATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCRVIANWAKKERKIPKLRYHRHLCKHQATGAKGGRLTFKGGVLASRSKDIDK

Query:  KKKKKKKKKEKSKTDENPTDEAEILTSADGVEGGDGAMYTIDAAKRM
          KKKKKKK KSK DENPT E EIL SADG +GG G +YTIDAAKRM
Subjt:  KKKKKKKKKEKSKTDENPTDEAEILTSADGVEGGDGAMYTIDAAKRM

XP_004137392.1 uncharacterized protein At4g15970 [Cucumis sativus] | e value: 3.8e-175 | % identity: 84.87
Query:  MKNNSAADG-EPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLND
        MKNNSA DG E A KLSAPS   T    WRTVR+SVV VGV LGL VLYNSAINPFKFLP SY YRAFR SSPH+D +LEKV+KEAAMEDGTIILTTLND
Subjt:  MKNNSAADG-EPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLND

Query:  AWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMW
        AWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRC+A+HPHCY+L+TQGTNFSSEAYFMT+DYLKMMWRRIEFLI VLEMG+SFVFTD+DIMW
Subjt:  AWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMW

Query:  LQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGR
        LQDPFNHFY +ADFQIA D++LGN E+LNN PNGGFVYV+AN +TVKFYKFWY+SRTIYPGQHDQDVLNKIKHSPLI KIG+KLRFLDTANFGGFCQMGR
Subjt:  LQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGR

Query:  DMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNR-TADNKA-SSTPSWTVPQDCR
        DM+KMAT+HANCCVGLENKVHDLRILLQDWN+FFN+ T DNK+ SST SWTVPQDC+
Subjt:  DMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNR-TADNKA-SSTPSWTVPQDCR

XP_008438689.1 PREDICTED: uncharacterized protein At4g15970-like [Cucumis melo] | e value: 3.5e-181 | % identity: 87.99
Query:  NNSAADGEPAAKLSAPSAV----HTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLN
        NNSAADGE A KLS PS V     T+VV WRTVR+SVV VGV LGL VLYNSAINPFKFLPVSYTYRAFR SSPH+D +LEKV+KEAAMEDGTII+TTLN
Subjt:  NNSAADGEPAAKLSAPSAV----HTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLN

Query:  DAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIM
        DAWAEPDSL DLFLKSFH+GNGTQRLLKHLVIVTLDQKAYSRC+ALHPHCY+L+TQGTNFSSEAYFMTSDYLKMMWRRIEFLI VLEMG+SFVFTD+DIM
Subjt:  DAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIM

Query:  WLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMG
        WLQDPFNHFY EADFQIA D +LGN EDLNN PNGGFVYV+ANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI KIG+KLRFLDTANFGGFCQMG
Subjt:  WLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMG

Query:  RDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRT-ADNKA-SSTPSWTVPQDCR
        RDM+KMATVHANCCVGLENKVHDLRILLQDWNNFFNRT A NK+ SSTPSWTVPQDCR
Subjt:  RDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRT-ADNKA-SSTPSWTVPQDCR

XP_022137284.1 uncharacterized protein At4g15970-like [Momordica charantia] | e value: 5.0e-159 | % identity: 78.03
Query:  NNSAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSY-TYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAW
        +NSAAD + AA   APS     +VP RTVRIS V +GV L +LVLYNSAINPF+FLPVSY TYR   S S   D LLEK+LK A+ EDGT+ILTTLNDAW
Subjt:  NNSAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSY-TYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAW

Query:  AEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQ
        AEP SLLDLFL+SFHIGNGT+RLLKHLVIVT+D+KAY+RC+ALHPHCYEL+TQG NFSSEAYFMTSDYL+MMWRRIEFL SVL MG+SFVFTDSDIMWLQ
Subjt:  AEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQ

Query:  DPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDM
        DPFNHF+P+ADFQIACD FLGNSEDLNN PNGGF YVK+NPKT+KFYKFWYQSRTIYPGQHDQDVLNKIK SPLISKIGLK+RFLDTANFGGFCQ  RD 
Subjt:  DPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDM

Query:  NKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCRVI
        N+++T+HANCCVGL+NKVHDL+ILL DWN FF +T  +KA+STPSW+VPQDC+ +
Subjt:  NKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCRVI

XP_038894961.1 uncharacterized protein At4g15970-like [Benincasa hispida] | e value: 9.6e-187 | % identity: 90.78
Query:  MKNNSAADGEPA--AKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLN
        MKNNSAADG PA   KLSAPSAVHT  V WR  R SVVFVGV+LGLLVLYNS INPFKFLPVS TYRAFR S+PH+D LLEKVLKEAAMEDGTIILTTLN
Subjt:  MKNNSAADGEPA--AKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLN

Query:  DAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIM
        DAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLD+KAYSRC+ALHPHCYEL TQGTNFSSEAYFMT DYLKMMWRRIEFL SVL+MGYSFVFTDSDIM
Subjt:  DAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIM

Query:  WLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMG
        WLQDPFNHFYP+ADFQIACD F+GNSEDLNN PNGGFVYVKANPKTVKFYKFWY+SRTIYPG+HDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMG
Subjt:  WLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMG

Query:  RDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFN-RTADNK-ASSTPSWTVPQDCR
        RDMNKMATVHANCCVGLENKVHDLRILLQDW+NFFN  TADNK ASSTPSWTVPQDCR
Subjt:  RDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFN-RTADNK-ASSTPSWTVPQDCR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LT78 Glycosyltransferase | e value: 1.8e-175 | % identity: 84.87
Query:  MKNNSAADG-EPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLND
        MKNNSA DG E A KLSAPS   T    WRTVR+SVV VGV LGL VLYNSAINPFKFLP SY YRAFR SSPH+D +LEKV+KEAAMEDGTIILTTLND
Subjt:  MKNNSAADG-EPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLND

Query:  AWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMW
        AWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRC+A+HPHCY+L+TQGTNFSSEAYFMT+DYLKMMWRRIEFLI VLEMG+SFVFTD+DIMW
Subjt:  AWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMW

Query:  LQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGR
        LQDPFNHFY +ADFQIA D++LGN E+LNN PNGGFVYV+AN +TVKFYKFWY+SRTIYPGQHDQDVLNKIKHSPLI KIG+KLRFLDTANFGGFCQMGR
Subjt:  LQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGR

Query:  DMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNR-TADNKA-SSTPSWTVPQDCR
        DM+KMAT+HANCCVGLENKVHDLRILLQDWN+FFN+ T DNK+ SST SWTVPQDC+
Subjt:  DMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNR-TADNKA-SSTPSWTVPQDCR

A0A1S3AWN4 Glycosyltransferase | e value: 1.7e-181 | % identity: 87.99
Query:  NNSAADGEPAAKLSAPSAV----HTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLN
        NNSAADGE A KLS PS V     T+VV WRTVR+SVV VGV LGL VLYNSAINPFKFLPVSYTYRAFR SSPH+D +LEKV+KEAAMEDGTII+TTLN
Subjt:  NNSAADGEPAAKLSAPSAV----HTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLN

Query:  DAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIM
        DAWAEPDSL DLFLKSFH+GNGTQRLLKHLVIVTLDQKAYSRC+ALHPHCY+L+TQGTNFSSEAYFMTSDYLKMMWRRIEFLI VLEMG+SFVFTD+DIM
Subjt:  DAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIM

Query:  WLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMG
        WLQDPFNHFY EADFQIA D +LGN EDLNN PNGGFVYV+ANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI KIG+KLRFLDTANFGGFCQMG
Subjt:  WLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMG

Query:  RDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRT-ADNKA-SSTPSWTVPQDCR
        RDM+KMATVHANCCVGLENKVHDLRILLQDWNNFFNRT A NK+ SSTPSWTVPQDCR
Subjt:  RDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRT-ADNKA-SSTPSWTVPQDCR

A0A6J1C6T6 uncharacterized protein At4g15970-like | e value: 2.4e-159 | % identity: 78.03
Query:  NNSAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSY-TYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAW
        +NSAAD + AA   APS     +VP RTVRIS V +GV L +LVLYNSAINPF+FLPVSY TYR   S S   D LLEK+LK A+ EDGT+ILTTLNDAW
Subjt:  NNSAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSY-TYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAW

Query:  AEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQ
        AEP SLLDLFL+SFHIGNGT+RLLKHLVIVT+D+KAY+RC+ALHPHCYEL+TQG NFSSEAYFMTSDYL+MMWRRIEFL SVL MG+SFVFTDSDIMWLQ
Subjt:  AEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQ

Query:  DPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDM
        DPFNHF+P+ADFQIACD FLGNSEDLNN PNGGF YVK+NPKT+KFYKFWYQSRTIYPGQHDQDVLNKIK SPLISKIGLK+RFLDTANFGGFCQ  RD 
Subjt:  DPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDM

Query:  NKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCRVI
        N+++T+HANCCVGL+NKVHDL+ILL DWN FF +T  +KA+STPSW+VPQDC+ +
Subjt:  NKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCRVI

A0A6J1GCE5 Glycosyltransferase | e value: 1.0e-154 | % identity: 75.43
Query:  SAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEP
        ++ D +P+   +AP   HTA+V W+TVR+SV F GV+LGL+VLYNSAI PF  LPVSY+YRAFRS S  R+ LLEK L +A+ ED T+ILTTLN AWAEP
Subjt:  SAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEP

Query:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPF
        DSLLDLFLKSFH GNGTQRLLKHLVIV LD KAY RC+A HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL SVLEMG+SFVFTDSDIMWLQDPF
Subjt:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPF

Query:  NHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKM
        NHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT++FYKFWY+SRT++PG+HDQDVLNKIKHSPLI +IGLK+RFLDTANFGGFCQMGRD  K+
Subjt:  NHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKM

Query:  ATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCR
         TVHANCCVGL+NKVHDLRILL DW+ F N    +KASS PSW+VPQDCR
Subjt:  ATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCR

A0A6J1KAZ2 Glycosyltransferase | e value: 2.7e-155 | % identity: 76.29
Query:  SAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEP
        S+ D +P    +AP + HTAVV W+TVR+SV F GV+LGLLVLYNSAINPF  LPVSY+YRAFRS S  R+ LLEK L +A+ ED T+ILTTLN AWAEP
Subjt:  SAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEP

Query:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPF
        +SLLDLFLKSFH GNGTQRLLKHLVIV LD KAY RC A HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL SVLEMG+SFVFTDSDIMWLQDPF
Subjt:  DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPF

Query:  NHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKM
        NHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT++FYKFWY+SRT++PG+HDQDVLNKIKHSPLI +IGLK+RFLDTANFGGFCQMGRD  K+
Subjt:  NHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKM

Query:  ATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCR
         TVHANCCVGL NKVHDLRILL DW+ F N    +KASS PSW+VPQDCR
Subjt:  ATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCR

SwissProt top hits (e value, % identity, alignment)
P0C042 Uncharacterized protein At4g15970 | e value: 1.2e-86 | % identity: 51.72
Query:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPH-CYELETQGTNFSSEAYFMTSDYLKMMWRR
        L K+L EAA ED T+I+TTLN AW+EP+S  DLFL SFH+G GT+ LL+HLV+  LD++AYSRC  +HPH CY ++T G +F+ +  FMT DYLKMMWRR
Subjt:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPH-CYELETQGTNFSSEAYFMTSDYLKMMWRR

Query:  IEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI
        IEFL ++L++ Y+F+FT         PF     E DFQIACD + G+ +D++N  NGGF +VKAN +T+ FY +WY SR  YP +HDQDVL++IK     
Subjt:  IEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI

Query:  SKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCRVIANWAKKERKI
        +KIGLK+RFLDT  FGGFC+  RD++K+ T+HANCCVGLENK+ DLR ++ DW N+ +  A        +W  P++C     W  K +++
Subjt:  SKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCRVIANWAKKERKI

Q3E6Y3 Uncharacterized protein At1g28695 | e value: 1.6e-51 | % identity: 39.53
Query:  AAMEDGTIILTTLNDAWAEP----DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQ-GTNFSSEAYFMTSDYLKMMWRRIEF
        AA  + T+I+T +N A+ +      ++LDLFL+SF  G GT  LL HL++V +DQ AY RC     HCY++ET+ G +   E  FM+ D+++MMWRR   
Subjt:  AAMEDGTIILTTLNDAWAEP----DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQ-GTNFSSEAYFMTSDYLKMMWRRIEF

Query:  LISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKI
        ++ VL  GY+ +FTD+D+MWL+ P +      D QI+ D      + +N     GF +V++N KT+  ++ WY  R    G  +QDVL  +  S   +++
Subjt:  LISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKI

Query:  GLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNF
        GL + FL T  F GFCQ    M  + TVHANCC+ +  KV DL  +L+DW  +
Subjt:  GLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNF

Q9FXA7 UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase 3 | e value: 4.1e-07 | % identity: 22.46
Query:  SSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSD
        S S  RD  L + +K  A  + T+I+  ++  +         FL ++ I    Q+  + ++++  D     +     P    L     +  S   F +  
Subjt:  SSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSD

Query:  YLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMF----LGNSEDLNNTPNGGFVYV-------KANPKTVKFYKFWYQSRTI
        +  +  RR + L+++LE+GY+ ++ D D++WLQDPF++     D     DM     L +S DL      G  YV       ++        K W +    
Subjt:  YLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMF----LGNSEDLNNTPNGGFVYV-------KANPKTVKFYKFWYQSRTI

Query:  YPGQ-------HDQDVLNKIKHSPLISKIGLKLRFLDTANF--GGF-----CQMGRDMNKMATVHANCCVGLENKV
         P         HDQ   N+  H    +   +K+  L  + F  GG        +     K   VH N  +G + K+
Subjt:  YPGQ-------HDQDVLNKIKHSPLISKIGLKLRFLDTANF--GGF-----CQMGRDMNKMATVHANCCVGLENKV

Q9M146 UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase MGP4 | e value: 1.2e-06 | % identity: 21.89
Query:  SSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSD
        S S  RD  L + +K  A ++GT+I+  ++  +         FL ++ I    Q+    ++++  D     +     P    L     +  +   F +  
Subjt:  SSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSD

Query:  YLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMF----LGNSEDLNNTPNGG-------FVYVKANPKTVKFYKFWYQSRTI
        +     RR + L+ +LE+GY+ ++ D D++WLQDPF +   + D     DM     L +S DL      G        ++++         K W +    
Subjt:  YLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMF----LGNSEDLNNTPNGG-------FVYVKANPKTVKFYKFWYQSRTI

Query:  YPGQHDQDVLNKIKHSPLISKIG--LKLRFLDTANF--GGF-----CQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTP
         P    +   ++   +  ++K    + +  L  A F  GG        +     K A +H N  VG E K+   R    D+N +     D+ AS +P
Subjt:  YPGQHDQDVLNKIKHSPLISKIG--LKLRFLDTANF--GGF-----CQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTP

Q9ZSJ0 UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase | e value: 5.0e-05 | % identity: 23.47
Query:  NPFKFLPVSYTYRAFRSS-SPH-----RDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHP
        +P    P   +  ++ SS SPH     R+  L +  K  A  +GT+I+  ++  +         FL ++ I    Q+  + ++++  D     +     P
Subjt:  NPFKFLPVSYTYRAFRSS-SPH-----RDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHP

Query:  HCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDM----FLGNSEDLNNTPNGGFVYV
            L     +  +   F +  +     RR + L+ +LE+GY+ ++ D D++WLQDPF +     D     DM     L +S DL      G  Y+
Subjt:  HCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDM----FLGNSEDLNNTPNGGFVYV

Arabidopsis top hits (e value, % identity, alignment)
AT1G14590.1 Nucleotide-diphospho-sugar transferase family protein | e value: 4.1e-103 | % identity: 54.66
Query:  RISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIV
        R ++    + +   VLY +A +   F P  +   ++  +   +   LE VL +AA  D T++LTTLN AWA P S++DLF +SF IG  T ++L HLVIV
Subjt:  RISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIV

Query:  TLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTP
         LD KAYSRCL LH HC+ L T+G +FS EAYFMT  YLKMMWRRI+ L SVLEMGY+FVFTD+D+MW ++PF  FY  ADFQIACD +LG S DL+N P
Subjt:  TLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTP

Query:  NGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNN
        NGGF +V++N +T+ FYK+WY SR  +PG HDQDVLN +K  P + +IGLK+RFL+TA FGG C+  RD+N + T+HANCC G+E+K+HDLRI+LQDW +
Subjt:  NGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNN

Query:  FFNRTADNKASSTPSWTVPQDC
        F +     K SS  SW VPQ+C
Subjt:  FFNRTADNKASSTPSWTVPQDC

AT2G02061.1 Nucleotide-diphospho-sugar transferase family protein | e value: 7.0e-103 | % identity: 59.93
Query:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFS-SEAYFMTSDYLKMMWRR
        LE+VL+ AA +DGT+ILTTLN+AWA P S++DLF +SF IG GT+RLLKHLVI+ LD KAYSRC  LH HC+ LET+G +FS  EAYFMT  YL MMWRR
Subjt:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFS-SEAYFMTSDYLKMMWRR

Query:  IEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI
        I FL SVLE GY+FVFTD+D+MW ++PF  FY + DFQIACD ++G   D  N PNGGF +V+AN +++ FYKFWY SRT YP  HDQDVLN IK  P +
Subjt:  IEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI

Query:  SKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDC
         K+ +++RFL+T  FGGFC+  +D+N + T+HANCC GL++K+HDLRI+LQDW +F +    +  SS  +W+VPQ+C
Subjt:  SKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDC

AT4G15970.1 Nucleotide-diphospho-sugar transferase family protein | e value: 8.3e-88 | % identity: 51.72
Query:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPH-CYELETQGTNFSSEAYFMTSDYLKMMWRR
        L K+L EAA ED T+I+TTLN AW+EP+S  DLFL SFH+G GT+ LL+HLV+  LD++AYSRC  +HPH CY ++T G +F+ +  FMT DYLKMMWRR
Subjt:  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPH-CYELETQGTNFSSEAYFMTSDYLKMMWRR

Query:  IEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI
        IEFL ++L++ Y+F+FT         PF     E DFQIACD + G+ +D++N  NGGF +VKAN +T+ FY +WY SR  YP +HDQDVL++IK     
Subjt:  IEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLI

Query:  SKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCRVIANWAKKERKI
        +KIGLK+RFLDT  FGGFC+  RD++K+ T+HANCCVGLENK+ DLR ++ DW N+ +  A        +W  P++C     W  K +++
Subjt:  SKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDCRVIANWAKKERKI

AT4G19970.1 CONTAINS InterPro DOMAIN/s: Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR005069) | e value: 8.6e-101 | % identity: 56.94
Query:  SSSP---HRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFM
        SSSP    + +   +VL+ A+ E+ T+I+TTLN AWAEP+SL DLFL+SF IG GT++LL+H+V+V LD KA++RC  LHP+CY L+T GT+FS E  F 
Subjt:  SSSP---HRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFM

Query:  TSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQD
        T DYLKMMWRRIE L  VLEMGY+F+FTD+DIMWL+DPF   YP+ DFQ+ACD F G+  D +N  NGGF YVK+N ++++FYKFWY SR  YP  HDQD
Subjt:  TSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQD

Query:  VLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDC
        V N+IKH  L+S+IG+++RF DT  FGGFCQ  RD+N + T+HANCCVGL  K+HDL ++L DW N+ + +   K     +W+VP  C
Subjt:  VLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDC

AT5G44820.1 Nucleotide-diphospho-sugar transferase family protein | e value: 3.0e-98 | % identity: 49.85
Query:  RISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSP---------------HRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFH
        RI ++F+G+    LVLY +A  P + L VS       S SP                  L  +++L+ A+ ++ T+I+TTLN AWAEP+SL DLFL+SF 
Subjt:  RISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSP---------------HRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFH

Query:  IGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIA
        IG GTQ+LLKH+V+V LD KA+ RC  LH +CY +ET  T+FS E  + T DYLKMMW RI+ L  VLEMG++F+FTD+DIMWL+DPF   YP+ DFQ+A
Subjt:  IGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIA

Query:  CDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLE
        CD F GN  D +N  NGGF YV++N ++++FYKFW++SR  YP  HDQDV N+IKH P IS+IG+++RF DT  FGGFCQ  RD+N + T+HANCC+GL+
Subjt:  CDMFLGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLE

Query:  NKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDC
         K+HDL ++L DW  + +    ++     +W+VP  C
Subjt:  NKVHDLRILLQDWNNFFNRTADNKASSTPSWTVPQDC
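
In the alignments above, the unlabeled middle line marks identical residues with the residue letter and conservative substitutions with `+`. As a rough sketch of how a percent identity can be tallied from such a line, assuming identity is counted as midline letters divided by the number of aligned columns (the exact formula behind the values in this record is not stated), one could write the following. The function name `identity_from_midline` is hypothetical.

```python
def identity_from_midline(midline: str, aligned_length: int) -> float:
    """Percent identity: midline letters / aligned columns (assumed formula)."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Letters mark identical residues; '+' and spaces are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# First 12 columns of the P0C042 (SwissProt) alignment above.
query = "LEKVLKEAAMED"
midline = "L K+L EAA ED"
print(round(identity_from_midline(midline, len(query)), 2))
```

A short window like this one will generally not reproduce the full-alignment figure reported for the hit; it only illustrates the counting.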


Sequences
CDS sequence
ATGAAGAATAATTCCGCCGCCGACGGCGAACCCGCCGCCAAGTTGTCGGCTCCCTCGGCCGTTCATACGGCGGTGGTTCCGTGGAGGACGGTGAGGATCTCCGTTGTGTT
CGTTGGCGTTGTGTTGGGCCTCCTTGTTCTGTACAACTCAGCCATTAATCCTTTCAAATTTCTTCCTGTTTCCTACACCTACCGCGCTTTTCGATCCTCTTCTCCTCACA
GAGATCTTCTTTTGGAAAAAGTTCTGAAAGAAGCAGCAATGGAAGATGGAACAATAATCTTGACGACGTTGAATGATGCATGGGCAGAGCCAGATTCCCTCCTCGATCTG
TTTCTTAAAAGCTTCCACATTGGAAACGGTACTCAAAGATTATTGAAGCACTTAGTCATAGTCACGTTGGACCAAAAAGCGTATTCTCGTTGCCTGGCATTGCACCCTCA
TTGCTATGAATTGGAAACTCAAGGAACCAACTTCTCCAGCGAAGCCTACTTCATGACCTCTGATTACTTGAAAATGATGTGGCGAAGAATTGAATTTCTCATCTCTGTTC
TCGAGATGGGTTACAGTTTCGTATTCACTGATTCTGATATAATGTGGCTGCAAGACCCATTCAATCACTTCTACCCAGAGGCAGATTTTCAAATTGCTTGCGATATGTTT
TTGGGGAACTCGGAAGATTTAAACAACACTCCCAATGGAGGGTTTGTGTACGTGAAAGCGAATCCAAAAACAGTAAAATTCTACAAGTTTTGGTACCAATCAAGGACAAT
ATATCCAGGACAGCACGACCAAGATGTGCTGAACAAGATCAAACACAGTCCATTGATCTCTAAAATTGGGCTGAAATTAAGGTTTCTGGACACTGCGAATTTCGGAGGGT
TCTGTCAAATGGGGAGGGACATGAACAAGATGGCTACAGTGCATGCCAATTGCTGCGTTGGACTAGAGAACAAAGTTCACGATCTCAGGATTTTGCTTCAAGATTGGAAT
AACTTTTTTAACCGAACTGCAGATAATAAAGCTTCCTCAACCCCTTCATGGACTGTTCCTCAAGATTGCAGGGTTATTGCTAATTGGGCCAAGAAAGAACGGAAGATTCC
CAAACTTAGATATCATCGTCATCTCTGCAAGCATCAAGCAACCGGGGCGAAGGGAGGTAGGCTCACCTTCAAGGGAGGAGTCTTAGCCTCTCGTAGCAAGGATATTGACA
AGAAGAAGAAGAAGAAGAAGAAGAAGAAAGAAAAAAGTAAAACCGACGAGAACCCCACGGACGAGGCCGAGATTTTGACGTCGGCTGATGGTGTAGAAGGTGGAGACGGA
GCAATGTATACTATTGACGCGGCCAAGCGTATGAAGTATGAGGAGCTCTTCCCTGTGGAGACCAGGAAGTTTGGTTACGATCCTAACAACTCCAATACCAACTTCAAGTC
TGTGGAGGATGCTCTCGATGACCGTGTCAAGAAGAAGGCGGATCGTTATTGTAAATAA
mRNA sequence
ATGAAGAATAATTCCGCCGCCGACGGCGAACCCGCCGCCAAGTTGTCGGCTCCCTCGGCCGTTCATACGGCGGTGGTTCCGTGGAGGACGGTGAGGATCTCCGTTGTGTT
CGTTGGCGTTGTGTTGGGCCTCCTTGTTCTGTACAACTCAGCCATTAATCCTTTCAAATTTCTTCCTGTTTCCTACACCTACCGCGCTTTTCGATCCTCTTCTCCTCACA
GAGATCTTCTTTTGGAAAAAGTTCTGAAAGAAGCAGCAATGGAAGATGGAACAATAATCTTGACGACGTTGAATGATGCATGGGCAGAGCCAGATTCCCTCCTCGATCTG
TTTCTTAAAAGCTTCCACATTGGAAACGGTACTCAAAGATTATTGAAGCACTTAGTCATAGTCACGTTGGACCAAAAAGCGTATTCTCGTTGCCTGGCATTGCACCCTCA
TTGCTATGAATTGGAAACTCAAGGAACCAACTTCTCCAGCGAAGCCTACTTCATGACCTCTGATTACTTGAAAATGATGTGGCGAAGAATTGAATTTCTCATCTCTGTTC
TCGAGATGGGTTACAGTTTCGTATTCACTGATTCTGATATAATGTGGCTGCAAGACCCATTCAATCACTTCTACCCAGAGGCAGATTTTCAAATTGCTTGCGATATGTTT
TTGGGGAACTCGGAAGATTTAAACAACACTCCCAATGGAGGGTTTGTGTACGTGAAAGCGAATCCAAAAACAGTAAAATTCTACAAGTTTTGGTACCAATCAAGGACAAT
ATATCCAGGACAGCACGACCAAGATGTGCTGAACAAGATCAAACACAGTCCATTGATCTCTAAAATTGGGCTGAAATTAAGGTTTCTGGACACTGCGAATTTCGGAGGGT
TCTGTCAAATGGGGAGGGACATGAACAAGATGGCTACAGTGCATGCCAATTGCTGCGTTGGACTAGAGAACAAAGTTCACGATCTCAGGATTTTGCTTCAAGATTGGAAT
AACTTTTTTAACCGAACTGCAGATAATAAAGCTTCCTCAACCCCTTCATGGACTGTTCCTCAAGATTGCAGGGTTATTGCTAATTGGGCCAAGAAAGAACGGAAGATTCC
CAAACTTAGATATCATCGTCATCTCTGCAAGCATCAAGCAACCGGGGCGAAGGGAGGTAGGCTCACCTTCAAGGGAGGAGTCTTAGCCTCTCGTAGCAAGGATATTGACA
AGAAGAAGAAGAAGAAGAAGAAGAAGAAAGAAAAAAGTAAAACCGACGAGAACCCCACGGACGAGGCCGAGATTTTGACGTCGGCTGATGGTGTAGAAGGTGGAGACGGA
GCAATGTATACTATTGACGCGGCCAAGCGTATGAAGTATGAGGAGCTCTTCCCTGTGGAGACCAGGAAGTTTGGTTACGATCCTAACAACTCCAATACCAACTTCAAGTC
TGTGGAGGATGCTCTCGATGACCGTGTCAAGAAGAAGGCGGATCGTTATTGTAAATAA
Protein sequence
MKNNSAADGEPAAKLSAPSAVHTAVVPWRTVRISVVFVGVVLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHRDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDL
FLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCLALHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDMF
LGNSEDLNNTPNGGFVYVKANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRDMNKMATVHANCCVGLENKVHDLRILLQDWN
NFFNRTADNKASSTPSWTVPQDCRVIANWAKKERKIPKLRYHRHLCKHQATGAKGGRLTFKGGVLASRSKDIDKKKKKKKKKKEKSKTDENPTDEAEILTSADGVEGGDG
AMYTIDAAKRMKYEELFPVETRKFGYDPNNSNTNFKSVEDALDDRVKKKADRYCK
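
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame, is given below. The helper `translate` is illustrative and is not taken from CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS listed above.
print(translate("ATGAAGAATAATTCCGCCGCCGACGGC"))  # prints: MKNNSAADG
print(translate("TATTGTAAATAA"))                 # prints: YCK
```

Joining the CDS lines into one string and passing it to `translate` should give a sequence that can be compared against the listed protein.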