; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0052281 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0052281
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionExostosin family protein
Genome locationCMiso1.1chr02:19510790..19518412
RNA-Seq ExpressionCmc02g0052281
SyntenyCmc02g0052281
Gene Ontology termsGO:0006486 - protein glycosylation (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR004263 - Exostosin-like
IPR040911 - Exostosin, GT47 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008465462.1 PREDICTED: probable glycosyltransferase At3g07620 isoform X2 [Cucumis melo]1.5e-22598.78Show/hide
Query:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT
        MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT
Subjt:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT

Query:  ARSETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK
        A+SETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK
Subjt:  ARSETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK

Query:  KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVY
        KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVY
Subjt:  KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVY

Query:  IYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVAC
        IYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVAC
Subjt:  IYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVAC

Query:  HDWVDILTQ
        HDW   LT+
Subjt:  HDWVDILTQ

XP_016903391.1 PREDICTED: probable glycosyltransferase At3g07620 isoform X1 [Cucumis melo]3.6e-22498.54Show/hide
Query:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT
        MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT
Subjt:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT

Query:  ARSETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK
        A+SETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK
Subjt:  ARSETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK

Query:  KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSK-RLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKV
        KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSK RLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKV
Subjt:  KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSK-RLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKV

Query:  YIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVA
        YIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVA
Subjt:  YIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVA

Query:  CHDWVDILTQ
        CHDW   LT+
Subjt:  CHDWVDILTQ

XP_022931779.1 probable glycosyltransferase At3g07620 isoform X2 [Cucurbita moschata]5.0e-15770.45Show/hide
Query:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNK--
        MEI RR+ II  MIL+ LF+FQYSVF+YTK L+    DKAST M+VQNVCH+NN GLCRF  +D+GIN LDTK+  DYD+NK VR EVVDLTSEFL K  
Subjt:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNK--

Query:  -----------DTARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNI
                   DT   ETNAELSY+P MKG VLE+SNMTADE KA SSPG++E+ NQ +VV  QS GTMNNSIKKVD TYS+IS  P+  + Q+E  ++ 
Subjt:  -----------DTARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNI

Query:  SEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMF
         EELEN+D + L KK   V  DR  GPD+STL+GPFISISQ+YSKLSRAHKS+C KR QC QTS+RDREL  AR EIEN+S LRSTP I+AS+FRNIS+F
Subjt:  SEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMF

Query:  TRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRF
        TRSYELMEKMLKVYIY+EG++PIFHQPILTGIYASEGWFMKLLE+NKKF VKDPEKAHLFYLPFSSQFLR AFGNKFRNKRDLQK L+ Y+D+IGKKY F
Subjt:  TRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRF

Query:  WNKNGGSDHFLVACHDWVDILTQ
        W +NGGSDHFLVACHDW   LT+
Subjt:  WNKNGGSDHFLVACHDWVDILTQ

XP_031736280.1 probable glycosyltransferase At3g07620 [Cucumis sativus]8.1e-20892.44Show/hide
Query:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT
        MEIRRRVFIITFMILLILFAFQY VFRYTKKLSLSFGDKAST MV QNVC LNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVV LTSEFLNKDT
Subjt:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT

Query:  ARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELV
         +SE NAELSYN RMKG VLENSNMTADEAKANSSPGMNEV+NQIMVVPNQ  GTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKN SEELE NDRIELV
Subjt:  ARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELV

Query:  KKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKV
        KKG FVF D+MVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTS+RD EL  ARLEIENAS LRSTP+IS+SVFRNISMFTRSYELMEKMLKV
Subjt:  KKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKV

Query:  YIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVA
        Y+YDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNY+DVIGKKYRFWNKNGGSDHFLVA
Subjt:  YIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVA

Query:  CHDWVDILTQ
        CHDW   LT+
Subjt:  CHDWVDILTQ

XP_038880547.1 probable glycosyltransferase At3g07620 isoform X1 [Benincasa hispida]4.2e-18078.5Show/hide
Query:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNK--
        MEI RRV IITFMILLILFAFQY VF+YTK LSLSFGD+ASTFMVVQNV HLNN+GL RFHPID+ IN LDTK+ F YD+NK VR+EVVDLTSEFLNK  
Subjt:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNK--

Query:  ----------------DTARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKE
                        DT   + NAELSYNP MKG VL++SNMTADEAKANS+PGM+E+RNQIM VPNQS+GTMNNSI+ VDQTYS++SV P+ SS QKE
Subjt:  ----------------DTARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKE

Query:  KIKNISEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFR
        ++KN  +ELENN+RIELVK G  V  DR + PDVSTL+GPFISISQIYSKLSRAHKS+C KR QCR TSQRDREL  AR EIENAS LRSTP+ISASVFR
Subjt:  KIKNISEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFR

Query:  NISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIG
        N+SMFTRSYELMEKMLKVYIY+EGEKP+FHQPILTGIYASEGWFMKLLE+NKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQK LKNYVDVIG
Subjt:  NISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIG

Query:  KKYRFWNKNGGSDHFLVACHDWVDILTQ
        KKYRFWN+NGGSDHFLVACHDW   LT+
Subjt:  KKYRFWNKNGGSDHFLVACHDWVDILTQ

TrEMBL top hitse value%identityAlignment
A0A1S3CNV8 probable glycosyltransferase At3g07620 isoform X27.1e-22698.78Show/hide
Query:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT
        MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT
Subjt:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT

Query:  ARSETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK
        A+SETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK
Subjt:  ARSETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK

Query:  KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVY
        KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVY
Subjt:  KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVY

Query:  IYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVAC
        IYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVAC
Subjt:  IYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVAC

Query:  HDWVDILTQ
        HDW   LT+
Subjt:  HDWVDILTQ

A0A1S4E582 probable glycosyltransferase At3g07620 isoform X11.8e-22498.54Show/hide
Query:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT
        MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT
Subjt:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDT

Query:  ARSETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK
        A+SETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK
Subjt:  ARSETNAELSYNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVK

Query:  KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSK-RLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKV
        KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSK RLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKV
Subjt:  KGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSK-RLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKV

Query:  YIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVA
        YIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVA
Subjt:  YIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVA

Query:  CHDWVDILTQ
        CHDW   LT+
Subjt:  CHDWVDILTQ

A0A6J1EV67 probable glycosyltransferase At3g07620 isoform X22.4e-15770.45Show/hide
Query:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNK--
        MEI RR+ II  MIL+ LF+FQYSVF+YTK L+    DKAST M+VQNVCH+NN GLCRF  +D+GIN LDTK+  DYD+NK VR EVVDLTSEFL K  
Subjt:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNK--

Query:  -----------DTARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNI
                   DT   ETNAELSY+P MKG VLE+SNMTADE KA SSPG++E+ NQ +VV  QS GTMNNSIKKVD TYS+IS  P+  + Q+E  ++ 
Subjt:  -----------DTARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNI

Query:  SEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMF
         EELEN+D + L KK   V  DR  GPD+STL+GPFISISQ+YSKLSRAHKS+C KR QC QTS+RDREL  AR EIEN+S LRSTP I+AS+FRNIS+F
Subjt:  SEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMF

Query:  TRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRF
        TRSYELMEKMLKVYIY+EG++PIFHQPILTGIYASEGWFMKLLE+NKKF VKDPEKAHLFYLPFSSQFLR AFGNKFRNKRDLQK L+ Y+D+IGKKY F
Subjt:  TRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRF

Query:  WNKNGGSDHFLVACHDWVDILTQ
        W +NGGSDHFLVACHDW   LT+
Subjt:  WNKNGGSDHFLVACHDWVDILTQ

A0A6J1EZP5 probable glycosyltransferase At3g07620 isoform X16.0e-15670.28Show/hide
Query:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNK--
        MEI RR+ II  MIL+ LF+FQYSVF+YTK L+    DKAST M+VQNVCH+NN GLCRF  +D+GIN LDTK+  DYD+NK VR EVVDLTSEFL K  
Subjt:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNK--

Query:  -----------DTARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNI
                   DT   ETNAELSY+P MKG VLE+SNMTADE KA SSPG++E+ NQ +VV  QS GTMNNSIKKVD TYS+IS  P+  + Q+E  ++ 
Subjt:  -----------DTARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNI

Query:  SEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSK-RLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISM
         EELEN+D + L KK   V  DR  GPD+STL+GPFISISQ+YSKLSRAHKS+C K R QC QTS+RDREL  AR EIEN+S LRSTP I+AS+FRNIS+
Subjt:  SEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSK-RLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISM

Query:  FTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYR
        FTRSYELMEKMLKVYIY+EG++PIFHQPILTGIYASEGWFMKLLE+NKKF VKDPEKAHLFYLPFSSQFLR AFGNKFRNKRDLQK L+ Y+D+IGKKY 
Subjt:  FTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYR

Query:  FWNKNGGSDHFLVACHDWVDILTQ
        FW +NGGSDHFLVACHDW   LT+
Subjt:  FWNKNGGSDHFLVACHDWVDILTQ

A0A6J1HMF2 probable glycosyltransferase At3g07620 isoform X27.8e-15670.45Show/hide
Query:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNK--
        MEI RR+ II  MIL+ LF+FQYSVF+YTK L+    DKAST M+VQNVCH+NN GLCRF  +D+GIN LDTK+  DYD+NK VR EV DLTSEFL K  
Subjt:  MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNK--

Query:  -----------DTARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNI
                   DT   ETNAELSY+P MKG VLE+SNMTADE KA SSPG++E+ NQ +VV  QS GTMNNSIKKVD TYS+IS  P+ S+ Q+E  ++ 
Subjt:  -----------DTARSETNAELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNI

Query:  SEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMF
         EELEN+D I   KK   V  DR  GPD+STL+GPFISISQ+YSKLSRAHKS+C KR QC QTS+RDREL  AR EIEN+S LRSTP I+ S+FRNIS+F
Subjt:  SEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMF

Query:  TRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRF
        TRSYELMEKMLKVYIY+EGEKPIFHQPILTGIYASEGWFMKLLE+NKKF VKDPEKAHLFYLPFSSQFLR A GNKFRNKRDLQK L+ Y+D+IGKKY F
Subjt:  TRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRF

Query:  WNKNGGSDHFLVACHDWVDILTQ
        W +NGGSDHFLVACHDW   LT+
Subjt:  WNKNGGSDHFLVACHDWVDILTQ

SwissProt top hitse value%identityAlignment
Q3E7Q9 Probable glycosyltransferase At5g253104.9e-2232.2Show/hide
Query:  LQCRQTSQRDRELLNARLEIENASALRSTPDISASVF----------RNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNK
        LQ +      R L+   L    AS L ++ +++ ++F          RN S   RSY  MEK  KVY+Y+EGE P+ H      +YA EG F+  +E  +
Subjt:  LQCRQTSQRDRELLNARLEIENASALRSTPDISASVF----------RNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNK

Query:  -KFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDWVDILTQ
         KF   DP +A++++LPFS  +L         + + L+  + +Y+ ++   + FWN+  G+DHF++ CHDW  + +Q
Subjt:  -KFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDWVDILTQ

Q3E9A4 Probable glycosyltransferase At5g202601.6e-2040.31Show/hide
Query:  SVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDN-KKFVVKDPEKAHLFYLPFS-SQFLRSAFGNKFRNKRD-LQKPLK
        +V+RN   F +S+  MEK  KV++Y EGE P+ H   +  IY+ EG FM  +E     F   +PE+AH F LP S +  +   +       R+ L K   
Subjt:  SVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDN-KKFVVKDPEKAHLFYLPFS-SQFLRSAFGNKFRNKRD-LQKPLK

Query:  NYVDVIGKKYRFWNKNGGSDHFLVACHDW
        +YVDV+  KY +WN++ G+DHF V+CHDW
Subjt:  NYVDVIGKKYRFWNKNGGSDHFLVACHDW

Q9FFN2 Probable glycosyltransferase At5g037957.3e-2632.81Show/hide
Query:  TLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDIS----ASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQ
        T+    I+++   + +S        KR       + + +L  AR  I+ AS      D        ++ N  +F RSY  MEK  K+Y+Y EGE P+FH 
Subjt:  TLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDIS----ASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQ

Query:  PILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFS-SQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDW
             IY+ EG F+  +E + +F   +P+KAH+FYLPFS  + +R  +    R+   ++  +K+Y++++G KY +WN++ G+DHF+++CHDW
Subjt:  PILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFS-SQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDW

Q9LFP3 Probable glycosyltransferase At5g111306.4e-2239.69Show/hide
Query:  SASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLED-NKKFVVKDPEKAHLFYLPFS-SQFLRSAFGNKFRNKRD-LQKP
        + SV+ N   F +S++ MEK  K++ Y EGE P+FH+  L  IYA EG FM  +E+ N +F    PE+A +FY+P      +R  +       RD LQ  
Subjt:  SASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLED-NKKFVVKDPEKAHLFYLPFS-SQFLRSAFGNKFRNKRD-LQKP

Query:  LKNYVDVIGKKYRFWNKNGGSDHFLVACHDW
        +K+Y+ +I  +Y +WN++ G+DHF ++CHDW
Subjt:  LKNYVDVIGKKYRFWNKNGGSDHFLVACHDW

Q9SSE8 Probable glycosyltransferase At3g076209.2e-2940.49Show/hide
Query:  DRELLNARLEIENASALRSTPDIS----------ASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLE-DNKKFVVKDPE
        + EL  AR+ I  A    S+   S            ++RN   F RSY LMEKM K+Y+Y+EG+ PIFH  +   IY+ EG F+  +E D  K+  +DP+
Subjt:  DRELLNARLEIENASALRSTPDIS----------ASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLE-DNKKFVVKDPE

Query:  KAHLFYLPFS-SQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDW
        KAH+++LPFS    L   F    R+K  L++ + +YV +I KKY +WN + G DHF+++CHDW
Subjt:  KAHLFYLPFS-SQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDW

Arabidopsis top hitse value%identityAlignment
AT4G16745.1 Exostosin family protein1.1e-4556.95Show/hide
Query:  RELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFS-S
        + L  A+LEI+ A  + +  D+ A +FRN+S+F RSYELME +LKVYIY +G+KPIFH+P L GIYASEGWFMKL+E NK+FV K+PE+AHLFY+P+S  
Subjt:  RELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFS-S

Query:  QFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDW
        Q  +S F     N + L   L++YV+++  KY FWN+  GSDHFLVACHDW
Subjt:  QFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDW

AT4G32790.1 Exostosin family protein6.7e-5144.03Show/hide
Query:  ISVIPNTSSDQKEKIKNISEE----LENNDRIELVKKGLFVFKDRM---VGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARL
        + ++P T S   E  + I E+     EN  ++E+++       D +   V   ++  N   +SI+++ + L ++  S  S  L+ +++S  D ELL AR 
Subjt:  ISVIPNTSSDQKEKIKNISEE----LENNDRIELVKKGLFVFKDRM---VGPDVSTLNGPFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARL

Query:  EIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFL-RSAFG
        +IEN   + + P +   ++ N+SMF RSYELMEK LKVY+Y EG++P+ H+P+L GIYASEGWFMK L+ ++ FV KDP KAHLFYLPFSS+ L  + + 
Subjt:  EIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPFSSQFL-RSAFG

Query:  NKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDW
            + ++L + LKNY+D+I  KY FWNK GGSDHFLVACHDW
Subjt:  NKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDW

AT5G19670.1 Exostosin family protein9.1e-4839.78Show/hide
Query:  NQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACS
        ++  V+  +S  T NN  +  + T   +    N  S       +I+     N  + + KK   V K + +  D+   +    +I ++   L+R  ++  S
Subjt:  NQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPFISISQIYSKLSRAHKSACS

Query:  KRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPE
        + ++ R +S+RD E+L AR EIENA   +   ++   +FRN+S+F RSYELME++LKVY+Y EG +PIFH PIL G+YASEGWFMKL+E NK++ VKDP 
Subjt:  KRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKDPE

Query:  KAHLFYLPFSSQFLR-SAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDWVDILTQ
        KAHL+Y+PFS++ L  + +     N+ +L++ LK Y + I  KY F+N+  G+DHFLVACHDW    T+
Subjt:  KAHLFYLPFSSQFLR-SAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDWVDILTQ

AT5G25820.1 Exostosin family protein3.5e-4752.46Show/hide
Query:  ISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASE
        +SIS++  +L +   S      + +  ++ D ELL A+ +IENA      P + A ++RN+SMF RSYELMEK+LKVY Y EG KPI H PIL GIYASE
Subjt:  ISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASE

Query:  GWFMKLLE-DNKKFVVKDPEKAHLFYLPFSSQFLR-SAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDW
        GWFM ++E +N KFV KDP KAHLFYLPFSS+ L  + +     + R+L K LK+Y+D I  KY FWN+  G+DHFL ACHDW
Subjt:  GWFMKLLE-DNKKFVVKDPEKAHLFYLPFSSQFLR-SAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDW

AT5G37000.1 Exostosin family protein6.7e-5144.73Show/hide
Query:  SEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVKKGLFVFKDRMVGPDVST---LNGPFISISQIYSKLSRAHKSACSKRLQCR
        SE  +  S +KV+   S I++     S +  K+  +S E E++    +++      KD   G  +S      G  ISISQ+ S L ++  S   K  + R
Subjt:  SEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVKKGLFVFKDRMVGPDVST---LNGPFISISQIYSKLSRAHKSACSKRLQCR

Query:  QTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFT--------------RSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNK
         +S RD E+L+AR EIE  S +     ++  V+RNIS F               RSY+LME+ LK+Y+Y EG KPIFH P+  GIYASEGWFMKL+E NK
Subjt:  QTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFT--------------RSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNK

Query:  KFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDWVDILT
        KFVVKDP KAHLFY+P S + LRS+ G  F+  + L   LK YVD+I  KY+FWN+ GG+DHFLVACHDW + LT
Subjt:  KFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDWVDILT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATCCGGAGGCGGGTGTTCATTATTACATTTATGATTCTTCTTATTCTATTTGCTTTTCAATATTCTGTGTTTCGTTATACAAAAAAATTATCCCTATCGTTTGG
GGATAAGGCCTCAACTTTTATGGTGGTTCAGAATGTCTGTCATTTGAACAATGCTGGACTTTGTAGATTTCATCCAATCGATTCGGGTATTAACAAATTGGATACAAAGA
AAAAGTTTGATTATGATAGTAACAAAGGAGTAAGAGATGAAGTAGTTGATTTGACATCAGAATTCTTGAATAAAGATACTGCTAGAAGTGAAACAAATGCAGAATTAAGT
TACAATCCCCGGATGAAGGGAGTTTTAGAGAATAGTAACATGACAGCTGATGAAGCTAAAGCCAATAGCAGTCCAGGGATGAATGAAGTTAGAAACCAAATTATGGTTGT
TCCAAATCAATCCGAAGGAACTATGAACAATAGCATTAAAAAGGTTGATCAAACATATTCTGACATTTCTGTGATTCCTAACACATCTTCTGACCAAAAAGAGAAGATAA
AAAACATTAGTGAAGAATTAGAAAACAATGATAGGATTGAGCTAGTGAAGAAAGGTTTATTTGTCTTCAAAGATAGAATGGTCGGGCCTGACGTATCAACATTGAATGGG
CCATTTATATCCATATCTCAAATATACTCAAAGTTATCAAGGGCTCACAAGTCTGCTTGTTCGAAGAGGTTGCAGTGTAGGCAGACATCCCAGCGCGACCGAGAACTACT
TAATGCAAGACTGGAGATTGAAAATGCTTCTGCGTTAAGGAGCACTCCAGACATTAGTGCTTCTGTTTTCAGAAACATTTCAATGTTTACAAGGAGTTATGAGTTGATGG
AAAAAATGCTTAAAGTCTATATATATGATGAAGGAGAAAAACCCATTTTCCATCAACCTATATTGACTGGAATCTACGCCTCAGAAGGATGGTTTATGAAATTGCTGGAA
GATAACAAAAAGTTTGTTGTGAAGGACCCTGAGAAAGCTCATTTATTTTATTTGCCTTTCAGTTCACAGTTTTTAAGGTCTGCATTTGGAAATAAATTCCGCAACAAGAG
GGATCTACAGAAACCTCTCAAGAACTACGTTGATGTAATTGGTAAGAAGTATCGTTTTTGGAACAAAAATGGAGGATCAGACCATTTTTTAGTTGCCTGTCATGACTGGG
TAGACATTCTAACGCAATAG
mRNA sequenceShow/hide mRNA sequence
AGTCAGGTGATAAACCTTTTGGGCAATTCATTTGTGTTTCGGATTGGATTGTTAGAAAAGAACAAATTCCTTTTTAATCAAAACTTCAAATCAACCTCAATGTCATCGAT
GATTCAAGATTTCCGACCGGAAATGATTGATTCTTGAACTAGTACTTGATCTAAGGCTCACCTCAGCCCCACCGTTTGCCCTGTAAAAAACCTTGCAATGATTGCTGCTA
CAATCTATTTGAAATGGCAGGCCAACCCATCCACTTTCCGCCATTAACGAACTTGTAAGATGCTCCTCACTCCGGTTCTTTTCCTCTCTTGTCATCATCTACACTATATT
CTCATCCTAAAATCATTGGACCTACCTTTTCCTTGGTCAAAAGCACTTTACACATTCATGTAATCCTTTTTTCCTGCTTGTTACCATGTAATGGACACACTGATTTCAGA
TTTCAGATATAATTATAAAAACCAAGTTCACAATCGATGCGATGGGGATTCATGACTAGAGTTTGCAATTTCTATCTGGGAATAGAGGAATTTGTGTCTTTTTGTGGCAA
TTTCATGAAGCTATGTTTCTTTCGATCATGAGATTTGGAATATCCATTTGGTAATTGATTAGGTTTTTGGCAATATGGAGATCCGGAGGCGGGTGTTCATTATTACATTT
ATGATTCTTCTTATTCTATTTGCTTTTCAATATTCTGTGTTTCGTTATACAAAAAAATTATCCCTATCGTTTGGGGATAAGGCCTCAACTTTTATGGTGGTTCAGAATGT
CTGTCATTTGAACAATGCTGGACTTTGTAGATTTCATCCAATCGATTCGGGTATTAACAAATTGGATACAAAGAAAAAGTTTGATTATGATAGTAACAAAGGAGTAAGAG
ATGAAGTAGTTGATTTGACATCAGAATTCTTGAATAAAGATACTGCTAGAAGTGAAACAAATGCAGAATTAAGTTACAATCCCCGGATGAAGGGAGTTTTAGAGAATAGT
AACATGACAGCTGATGAAGCTAAAGCCAATAGCAGTCCAGGGATGAATGAAGTTAGAAACCAAATTATGGTTGTTCCAAATCAATCCGAAGGAACTATGAACAATAGCAT
TAAAAAGGTTGATCAAACATATTCTGACATTTCTGTGATTCCTAACACATCTTCTGACCAAAAAGAGAAGATAAAAAACATTAGTGAAGAATTAGAAAACAATGATAGGA
TTGAGCTAGTGAAGAAAGGTTTATTTGTCTTCAAAGATAGAATGGTCGGGCCTGACGTATCAACATTGAATGGGCCATTTATATCCATATCTCAAATATACTCAAAGTTA
TCAAGGGCTCACAAGTCTGCTTGTTCGAAGAGGTTGCAGTGTAGGCAGACATCCCAGCGCGACCGAGAACTACTTAATGCAAGACTGGAGATTGAAAATGCTTCTGCGTT
AAGGAGCACTCCAGACATTAGTGCTTCTGTTTTCAGAAACATTTCAATGTTTACAAGGAGTTATGAGTTGATGGAAAAAATGCTTAAAGTCTATATATATGATGAAGGAG
AAAAACCCATTTTCCATCAACCTATATTGACTGGAATCTACGCCTCAGAAGGATGGTTTATGAAATTGCTGGAAGATAACAAAAAGTTTGTTGTGAAGGACCCTGAGAAA
GCTCATTTATTTTATTTGCCTTTCAGTTCACAGTTTTTAAGGTCTGCATTTGGAAATAAATTCCGCAACAAGAGGGATCTACAGAAACCTCTCAAGAACTACGTTGATGT
AATTGGTAAGAAGTATCGTTTTTGGAACAAAAATGGAGGATCAGACCATTTTTTAGTTGCCTGTCATGACTGGGTAGACATTCTAACGCAATAGCTTGATGTAGTTTGAA
GTTGTACTATTGACTACTCTTTCTGCCTTTTTTGGTTTT
Protein sequenceShow/hide protein sequence
MEIRRRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPIDSGINKLDTKKKFDYDSNKGVRDEVVDLTSEFLNKDTARSETNAELS
YNPRMKGVLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKVDQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVKKGLFVFKDRMVGPDVSTLNG
PFISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASVFRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLE
DNKKFVVKDPEKAHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLVACHDWVDILTQ