; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G03640 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G03640
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionExostosin domain-containing protein
Genome locationClcChr07:3656167..3657765
RNA-Seq ExpressionClc07G03640
SyntenyClc07G03640
Gene Ontology termsGO:0006486 - protein glycosylation (biological process)
GO:0009396 - folic acid-containing compound biosynthetic process (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0016881 - acid-amino acid ligase activity (molecular function)
InterPro domainsIPR004263 - Exostosin-like
IPR040911 - Exostosin, GT47 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050374.1 putative glycosyltransferase [Cucumis melo var. makuwa]2.0e-14984.01Show/hide
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MSA LRIFTYIPF  FSFSS AESLFY SLL SPY+THDPD+AHLFF+PFSPD+S RSL+RLIRTLRTDLPYWNRTLGADHFFLSSSG+GY  DRNVVEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGG
        KKNAIQVSSFPVP GKFIPHKDISLPPVS  V   V   TV ER+LGFVGYG V+ LSLVKELIEDPEFLMESEPP TPS YG+K+AKSDFCLFEY    
Subjt:  KKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGG

Query:  DGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQ
           GSGDVSGIGEALRFGCVPVVISDR IQDLPLMD VRWQEMAV VGGGGGI+GVKKVLR VD E L RMKRLGAAAAQHF+WNSPPQPLDAFNTVAYQ
Subjt:  DGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQ

Query:  LWVRRHAVRYAERREWAQN
        LW+RRHAVRYA+RREWAQN
Subjt:  LWVRRHAVRYAERREWAQN

XP_008461738.1 PREDICTED: probable glycosyltransferase At3g07620 [Cucumis melo]9.8e-16583.61Show/hide
Query:  MASSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSL
        M SSLI  +LLLSFSLL TPI  +PSPSPYLSPIFLKNYNSMSA LRIFTYIPF  FSFSS AESLFY SLL SPY+THDPD+AHLFF+PFSPD+S RSL
Subjt:  MASSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSL

Query:  ARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSL
        +RLIRTLRTDLPYWNRTLGADHFFLSSSG+GY  DRNVVELKKNAIQVSSFPVP GKFIPHKDISLPPVS  V   V   TV ER+LGFVGYG V+ LSL
Subjt:  ARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSL

Query:  VKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKV
        VKELIEDPEFLMESEPP TPS YG+K+AKSDFCLFEY       GSGDVSGIGEALRFGCVPVVISDR IQDLPLMD VRWQEMAV VGGGGGI+GVKKV
Subjt:  VKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKV

Query:  LRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN
        LR VD E L RMKRLGAAAAQHF+WNSPPQPLDAFNTVAYQLW+RRHAVRYA+RREWAQN
Subjt:  LRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN

XP_011656179.1 probable glycosyltransferase At5g03795 [Cucumis sativus]3.3e-16884.62Show/hide
Query:  MASSLIVLSLLLSFSLLSTPI----APSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLS
        MASSLI LSLLLSFSLL TPI    +PSPSPSPYLSPIFLKNYNSMSA LRIFTYIPF PFSFSS AESLFY SLL SPY THDPD+AHLFFIPFSP +S
Subjt:  MASSLIVLSLLLSFSLLSTPI----APSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLS

Query:  TRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVR
        TRSLARLIRTLRTDLPYWNRTLGADHFFLSSSG+GY SDRNVVELKKNAIQVSSFPV  GKFIPHKD+SLPPVS  V  PV  STV ER+LGFVGYG V+
Subjt:  TRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVR

Query:  SLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQG
         LSLVKELIEDPEFLMESEPP TPS YG+KLAKSDFCLFEY         GDVSGIGEALRFGCVPVVISDR IQDLPLMD+VRW+EMAV V GGGGI+G
Subjt:  SLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQG

Query:  VKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN
        VKKVLRRVD E L RMK+LGAAAAQHF+WNSPPQPLDAFNTVAYQLWVRRHAVRYA+RREWAQN
Subjt:  VKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN

XP_022995754.1 probable glycosyltransferase At5g03795 [Cucurbita maxima]7.8e-14673.44Show/hide
Query:  SSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLAR
        +SLI  SLLLS SLL+     + SPSPYLSPIF +NYN+MS TL+IFTYIPFKP SF SPAESLFY SLL SPY+TH+PD AH FFIPFSPD STRSLAR
Subjt:  SSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLAR

Query:  LIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPV-------SGWVPVPVDESTVRERVLGFVGYGSV
        LIRTLR++LPYWNRTLGADHFFLSS GV Y+SDRN+VELKKNAIQVS  PVP G FI HKDI+LPPV       S W+P P       ERVLGFVGYG V
Subjt:  LIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPV-------SGWVPVPVDESTVRERVLGFVGYGSV

Query:  RSLSLVKELIEDPEFLMESEPPLTPSS----YGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGG
        R   LVKELIEDPEF MESEPP  P S    YGE+L KSDFCLFEY      GG G V  IGE +R+GCVPVVISDRPIQDLPLMD++RWQ+MAV V GG
Subjt:  RSLSLVKELIEDPEFLMESEPPLTPSS----YGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGG

Query:  GGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN
         GI+GVK+VLRRVDEESL +MKRLGAAAAQHF+WNSPPQPLDAFNTVAYQLW+RRH +RYAER+EWAQ+
Subjt:  GGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN

XP_038889277.1 probable glycosyltransferase At5g03795 [Benincasa hispida]8.0e-17587.78Show/hide
Query:  MASSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSL
        MASSLI LSLLLSFSLLSTPI  SPS SPYLSPIFLKNYNSMSA LRIFTYIPF+PFSFSSPAESLFY SLL SPYATHDPD+AHLFFIPFSPDLSTRSL
Subjt:  MASSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSL

Query:  ARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSL
         RLIRTLRTDLPYWNRTLGADHFFLSS+GVGYSS+RNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGW  VPV+E T +ERVLGFVGYG V+SLSL
Subjt:  ARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSL

Query:  VKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKV
        V ELIEDPEF+MESEPPLT SSYGEKLAKSDFCLFEY       G GDVSGIGEALRFGC+PVVIS RPIQDLPLMD++RWQEMAV +GG  GIQGVKKV
Subjt:  VKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKV

Query:  LRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN
        LR VD+ESLARMKRLGAAAAQHF WNSPPQPLDAFNTVA+QLWVRRHAVRYAERREWAQ+
Subjt:  LRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN

TrEMBL top hitse value%identityAlignment
A0A0A0KS95 Exostosin domain-containing protein1.6e-16884.62Show/hide
Query:  MASSLIVLSLLLSFSLLSTPI----APSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLS
        MASSLI LSLLLSFSLL TPI    +PSPSPSPYLSPIFLKNYNSMSA LRIFTYIPF PFSFSS AESLFY SLL SPY THDPD+AHLFFIPFSP +S
Subjt:  MASSLIVLSLLLSFSLLSTPI----APSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLS

Query:  TRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVR
        TRSLARLIRTLRTDLPYWNRTLGADHFFLSSSG+GY SDRNVVELKKNAIQVSSFPV  GKFIPHKD+SLPPVS  V  PV  STV ER+LGFVGYG V+
Subjt:  TRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVR

Query:  SLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQG
         LSLVKELIEDPEFLMESEPP TPS YG+KLAKSDFCLFEY         GDVSGIGEALRFGCVPVVISDR IQDLPLMD+VRW+EMAV V GGGGI+G
Subjt:  SLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQG

Query:  VKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN
        VKKVLRRVD E L RMK+LGAAAAQHF+WNSPPQPLDAFNTVAYQLWVRRHAVRYA+RREWAQN
Subjt:  VKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN

A0A1S3CF96 probable glycosyltransferase At3g076204.7e-16583.61Show/hide
Query:  MASSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSL
        M SSLI  +LLLSFSLL TPI  +PSPSPYLSPIFLKNYNSMSA LRIFTYIPF  FSFSS AESLFY SLL SPY+THDPD+AHLFF+PFSPD+S RSL
Subjt:  MASSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSL

Query:  ARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSL
        +RLIRTLRTDLPYWNRTLGADHFFLSSSG+GY  DRNVVELKKNAIQVSSFPVP GKFIPHKDISLPPVS  V   V   TV ER+LGFVGYG V+ LSL
Subjt:  ARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSL

Query:  VKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKV
        VKELIEDPEFLMESEPP TPS YG+K+AKSDFCLFEY       GSGDVSGIGEALRFGCVPVVISDR IQDLPLMD VRWQEMAV VGGGGGI+GVKKV
Subjt:  VKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKV

Query:  LRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN
        LR VD E L RMKRLGAAAAQHF+WNSPPQPLDAFNTVAYQLW+RRHAVRYA+RREWAQN
Subjt:  LRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN

A0A5A7U559 Putative glycosyltransferase9.6e-15084.01Show/hide
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MSA LRIFTYIPF  FSFSS AESLFY SLL SPY+THDPD+AHLFF+PFSPD+S RSL+RLIRTLRTDLPYWNRTLGADHFFLSSSG+GY  DRNVVEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGG
        KKNAIQVSSFPVP GKFIPHKDISLPPVS  V   V   TV ER+LGFVGYG V+ LSLVKELIEDPEFLMESEPP TPS YG+K+AKSDFCLFEY    
Subjt:  KKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGG

Query:  DGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQ
           GSGDVSGIGEALRFGCVPVVISDR IQDLPLMD VRWQEMAV VGGGGGI+GVKKVLR VD E L RMKRLGAAAAQHF+WNSPPQPLDAFNTVAYQ
Subjt:  DGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQ

Query:  LWVRRHAVRYAERREWAQN
        LW+RRHAVRYA+RREWAQN
Subjt:  LWVRRHAVRYAERREWAQN

A0A6J1FML5 probable glycosyltransferase At5g037956.7e-13575.38Show/hide
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MS TL+IFTYIPFKP SF SPAESLFY SLL SPY+THDPD AH FFIPFSPD STRSLARLIRTLR+ LPYWNRTLGADHFFLSS GV Y+SDRN+VEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPAGKFIPHKDISLPPV-------SGWVPVPVDESTVRERVLGFVGYGSVRSLSLVKELIEDPEFLMESEPPLTPS---SYGEKLAKSD
        KKNAIQVS  PVP G FI HKDI+LPPV       S W+P P       ERVLGFVGYG VR   LVKELIEDPEF MESEPP  PS   +YGE+L KSD
Subjt:  KKNAIQVSSFPVPAGKFIPHKDISLPPV-------SGWVPVPVDESTVRERVLGFVGYGSVRSLSLVKELIEDPEFLMESEPPLTPS---SYGEKLAKSD

Query:  FCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQP
        FCLFEY      GG G V  IGE LR+GCVPVVISDRPIQDLPLMD++RWQ+MAV V GG GI+GVK+VLRRVD ESL +MKRLGAAAAQHF+WNSPPQP
Subjt:  FCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQP

Query:  LDAFNTVAYQLWVRRHAVRYAERREWAQN
        LDAFNTVAYQLW+RRH +RYAER+EWAQ+
Subjt:  LDAFNTVAYQLWVRRHAVRYAERREWAQN

A0A6J1K6U0 probable glycosyltransferase At5g037953.8e-14673.44Show/hide
Query:  SSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLAR
        +SLI  SLLLS SLL+     + SPSPYLSPIF +NYN+MS TL+IFTYIPFKP SF SPAESLFY SLL SPY+TH+PD AH FFIPFSPD STRSLAR
Subjt:  SSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLAR

Query:  LIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPV-------SGWVPVPVDESTVRERVLGFVGYGSV
        LIRTLR++LPYWNRTLGADHFFLSS GV Y+SDRN+VELKKNAIQVS  PVP G FI HKDI+LPPV       S W+P P       ERVLGFVGYG V
Subjt:  LIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPV-------SGWVPVPVDESTVRERVLGFVGYGSV

Query:  RSLSLVKELIEDPEFLMESEPPLTPSS----YGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGG
        R   LVKELIEDPEF MESEPP  P S    YGE+L KSDFCLFEY      GG G V  IGE +R+GCVPVVISDRPIQDLPLMD++RWQ+MAV V GG
Subjt:  RSLSLVKELIEDPEFLMESEPPLTPSS----YGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGG

Query:  GGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN
         GI+GVK+VLRRVDEESL +MKRLGAAAAQHF+WNSPPQPLDAFNTVAYQLW+RRH +RYAER+EWAQ+
Subjt:  GGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAERREWAQN

SwissProt top hitse value%identityAlignment
Q3E7Q9 Probable glycosyltransferase At5g253102.2e-1825.53Show/hide
Query:  SLLLSFSLLSTPIAPS--PSPSPYLSPIFL-KNYNSMSATLRIFTYIPFK-PFSFSSPAESLF--YNSLLT------SPYATHDPDRAHLFFIPFSPDLS
        S+L + S ++T +  S  P+   Y +P  L ++Y  M    +++ Y   + P     P +S++      +T      + + T+DP++A+++F+PFS    
Subjt:  SLLLSFSLLSTPIAPS--PSPSPYLSPIFL-KNYNSMSATLRIFTYIPFK-PFSFSSPAESLF--YNSLLT------SPYATHDPDRAHLFFIPFSPDLS

Query:  TRSL--------------ARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGW-------VPV
         R L              +  IR + T+ P+WNRT GADHF L+    G  + +   +L   +I+V      +  F P KD++LP +  +       + +
Subjt:  TRSL--------------ARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGW-------VPV

Query:  PVDESTVRERVLGFVG---YGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSG--IGEALRFGCVPVVISDRPI
            S      LGF     +G VR + L      D +  +    P    +Y + +  S FC    G         +V+   + EA+   C+PV++S   +
Subjt:  PVDESTVRERVLGFVG---YGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSG--IGEALRFGCVPVVISDRPI

Query:  QDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRR
          LP  D++RW+  +V+V     I  +K++L  +  E    +K       +HF  N PPQ  DAF+   + +W+RR
Subjt:  QDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRR

Q3E9A4 Probable glycosyltransferase At5g202601.8e-2027.91Show/hide
Query:  SPYATHDPDRAHLFFIPFSPDLSTRSLARLIRTLRTD----------------LPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGK
        SP+A ++P+ AH F +P S       L R + T   +                 PYWNR+LGADHF++S          +  EL KN I+V      +  
Subjt:  SPYATHDPDRAHLFFIPFSPDLSTRSLARLIRTLRTD----------------LPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGK

Query:  FIPHKDISLP----PVSGWVPVPVDESTVRER-VLGFVGYGS---VRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDV
        F+P +D+S+P    P     P  +  S+  +R +L F   GS   +R + L++   +  E +   E       Y + +A + FCL    SG +      V
Subjt:  FIPHKDISLP----PVSGWVPVPVDESTVRER-VLGFVGYGS---VRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDV

Query:  SGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAV
        +    A+  GCVPV+ISD     LP  D++ W +  + V     I  +K +L+ +       ++R      +HF+ N P QP D    + + +W+RR  +
Subjt:  SGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAV

Query:  R
        R
Subjt:  R

Q9FFN2 Probable glycosyltransferase At5g037951.4e-2527.42Show/hide
Query:  PSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSF-SSPAESLF-------YNSLLTSPYATHDPDRAHLFFIPFSP--------DLSTRSLARLIRTLRTD
        P  + + +F ++Y  M    +I+ Y   +P  F   P +S++       Y     + + T++PD+AH+F++PFS         + ++R  + +  T++  
Subjt:  PSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSF-SSPAESLF-------YNSLLTSPYATHDPDRAHLFFIPFSP--------DLSTRSLARLIRTLRTD

Query:  L-------PYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPV-------SGWVPVPVDESTVRERVLGFVG--YGS
        +       PYWNR++GADHF LS    G  +  +   L  N+I+       + +F P KD+S+P +       +G V  P   S  R  +  F G  +G 
Subjt:  L-------PYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPV-------SGWVPVPVDESTVRERVLGFVG--YGS

Query:  VRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSG--IGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGG
        VR + L     +D +  +    P   +SY + +  S FC+   G         +V+   I EAL  GCVPV+I+   +   P  D++ W+  +VIV    
Subjt:  VRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSG--IGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGG

Query:  GIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAE
         I  +K +L  +      RM R      +HF  NSP +  D F+ + + +WVRR  V+  E
Subjt:  GIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAE

Q9LFP3 Probable glycosyltransferase At5g111308.1e-2127.63Show/hide
Query:  SPYATHDPDRAHLFFIP----------------FSPDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVV--ELKKNAIQVSSFPVPA
        S +    P+ A +F+IP                ++ D     +   I  +    PYWNR+ GADHFFLS     ++ D + V  EL K+ I+       +
Subjt:  SPYATHDPDRAHLFFIP----------------FSPDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVV--ELKKNAIQVSSFPVPA

Query:  GKFIPHKDISLPPVSGWVP------VPVDESTVRERVLGFV---GYGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGS
          F P +D+SLP ++  +P      V   E     ++L F     +G VR +       +D + L+    P T  +Y + + K+ FCL         G  
Subjt:  GKFIPHKDISLPPVSGWVP------VPVDESTVRERVLGFV---GYGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGS

Query:  GDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRR
             I E+L  GCVPV+I+D  +  LP  D++ W+  +V +     +  +KK+L  + EE    M+R      +HF+ N P +P D  + + + +W+RR
Subjt:  GDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRR

Query:  HAVR
          VR
Subjt:  HAVR

Q9SSE8 Probable glycosyltransferase At3g076201.9e-2227.18Show/hide
Query:  SLLSTPIAPS---PSPSPYLSP-IFLKNYNSMSATLRIFTYIPFKPFSFS-------SPAESLFYNSLLTS--PYATHDPDRAHLFFIPFS---------
        S  S+P+      P    Y +P  F ++Y  M    +I+ Y    P  F           E LF N +      Y T DPD+AH++F+PFS         
Subjt:  SLLSTPIAPS---PSPSPYLSP-IFLKNYNSMSATLRIFTYIPFKPFSFS-------SPAESLFYNSLLTS--PYATHDPDRAHLFFIPFS---------

Query:  ------PDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVS-----------GWVPV
                +  R +A  ++ +    PYWN + G DHF LS    G+ +   V +L  N+I+V      +  F P KD   P ++           G  P+
Subjt:  ------PDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVS-----------GWVPV

Query:  PVDESTVRERVLGFVG--YGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSG--IGEALRFGCVPVVISDRPIQ
               R  +  F G  +G +R + L     +D + L+    P     Y E + KS FC+   G         +V+   + EA+  GCVPV+IS+  + 
Subjt:  PVDESTVRERVLGFVG--YGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSG--IGEALRFGCVPVVISDRPIQ

Query:  DLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVR
         LP  D++ W++ +V V     I  +K++L  + EE   R+        +H L N PP+  D FN + + +W+RR  V+
Subjt:  DLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVR

Arabidopsis top hitse value%identityAlignment
AT3G07620.1 Exostosin family protein1.4e-2327.18Show/hide
Query:  SLLSTPIAPS---PSPSPYLSP-IFLKNYNSMSATLRIFTYIPFKPFSFS-------SPAESLFYNSLLTS--PYATHDPDRAHLFFIPFS---------
        S  S+P+      P    Y +P  F ++Y  M    +I+ Y    P  F           E LF N +      Y T DPD+AH++F+PFS         
Subjt:  SLLSTPIAPS---PSPSPYLSP-IFLKNYNSMSATLRIFTYIPFKPFSFS-------SPAESLFYNSLLTS--PYATHDPDRAHLFFIPFS---------

Query:  ------PDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVS-----------GWVPV
                +  R +A  ++ +    PYWN + G DHF LS    G+ +   V +L  N+I+V      +  F P KD   P ++           G  P+
Subjt:  ------PDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVS-----------GWVPV

Query:  PVDESTVRERVLGFVG--YGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSG--IGEALRFGCVPVVISDRPIQ
               R  +  F G  +G +R + L     +D + L+    P     Y E + KS FC+   G         +V+   + EA+  GCVPV+IS+  + 
Subjt:  PVDESTVRERVLGFVG--YGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSG--IGEALRFGCVPVVISDRPIQ

Query:  DLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVR
         LP  D++ W++ +V V     I  +K++L  + EE   R+        +H L N PP+  D FN + + +W+RR  V+
Subjt:  DLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVR

AT4G16745.1 Exostosin family protein7.5e-2226.3Show/hide
Query:  IFLKNYNSMSATLRIFTYIPFKPFSFSSP------AESLFYNSLLTS--PYATHDPDRAHLFFIPFSPDLSTRS---------------LARLIRTLRTD
        +F ++Y  M   L+++ Y       F  P      A   ++  L+ S   + T +P+RAHLF++P+S     +S               L   +  L   
Subjt:  IFLKNYNSMSATLRIFTYIPFKPFSFSSP------AESLFYNSLLTS--PYATHDPDRAHLFFIPFSPDLSTRS---------------LARLIRTLRTD

Query:  LPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQ-VSSFPVPAGKFIPHKDISLPPVS---GWVPVP--VDESTVRER-VLGFVG---YGSVRSLSL
         P+WNRT G+DHF ++    G  +     ELK+NAI+ + +  +  G F+P KD+SLP  S      P+    + + V +R +L F     +G VR   L
Subjt:  LPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQ-VSSFPVPAGKFIPHKDISLPPVS---GWVPVP--VDESTVRER-VLGFVG---YGSVRSLSL

Query:  VKELIEDPEFLMESEPP---LTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGV
             +D +  +    P       +Y + +  S +CL   G         +   I EA+ + CVPVVI+D  +  LP  D++ W   +V+V     I  +
Subjt:  VKELIEDPEFLMESEPP---LTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGV

Query:  KKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLW
        K++L  +      +M+       +HFLW+  P+  D F+ + + +W
Subjt:  KKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLW

AT4G38040.1 Exostosin family protein1.3e-3730.17Show/hide
Query:  YLSP-IFLKNYNSMSATLRIFTYIPFKPFSF---------SSPAESLFYNSLLTSPYATHDPDRAHLFFIPFS------PDLSTRSLARLIRT----LRT
        Y SP  F  NY  M    +++ Y    P +F            +E  F+ ++  S + T DPD A LFFIP S         S  ++  +++     L  
Subjt:  YLSP-IFLKNYNSMSATLRIFTYIPFKPFSF---------SSPAESLFYNSLLTSPYATHDPDRAHLFFIPFS------PDLSTRSLARLIRT----LRT

Query:  DLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRER-VLGF-VGYGSVRSLSLVKELIED
          PYWNRTLGADHFF++   VG  +      L KN I+V   P     FIPHKD++LP V     +P   + V  R  LGF  G+ + +   ++  + E+
Subjt:  DLPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRER-VLGF-VGYGSVRSLSLVKELIED

Query:  PEFLMESEPPLTPSS----YGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRR
           L  S   +  ++    Y ++  ++ FC+        GG   + + I +++ +GC+PV++SD    DLP  DI+ W++ AV++     +  +K++L+ 
Subjt:  PEFLMESEPPLTPSS----YGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRR

Query:  VDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRY
        +       +        +HF WNSPP   DAF+ + Y+LW+R H V+Y
Subjt:  VDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRY

AT5G03795.1 Exostosin family protein1.0e-2627.42Show/hide
Query:  PSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSF-SSPAESLF-------YNSLLTSPYATHDPDRAHLFFIPFSP--------DLSTRSLARLIRTLRTD
        P  + + +F ++Y  M    +I+ Y   +P  F   P +S++       Y     + + T++PD+AH+F++PFS         + ++R  + +  T++  
Subjt:  PSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSF-SSPAESLF-------YNSLLTSPYATHDPDRAHLFFIPFSP--------DLSTRSLARLIRTLRTD

Query:  L-------PYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPV-------SGWVPVPVDESTVRERVLGFVG--YGS
        +       PYWNR++GADHF LS    G  +  +   L  N+I+       + +F P KD+S+P +       +G V  P   S  R  +  F G  +G 
Subjt:  L-------PYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPV-------SGWVPVPVDESTVRERVLGFVG--YGS

Query:  VRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSG--IGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGG
        VR + L     +D +  +    P   +SY + +  S FC+   G         +V+   I EAL  GCVPV+I+   +   P  D++ W+  +VIV    
Subjt:  VRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGSGDVSG--IGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGG

Query:  GIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAE
         I  +K +L  +      RM R      +HF  NSP +  D F+ + + +WVRR  V+  E
Subjt:  GIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRRHAVRYAE

AT5G11130.1 Exostosin family protein5.8e-2227.63Show/hide
Query:  SPYATHDPDRAHLFFIP----------------FSPDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVV--ELKKNAIQVSSFPVPA
        S +    P+ A +F+IP                ++ D     +   I  +    PYWNR+ GADHFFLS     ++ D + V  EL K+ I+       +
Subjt:  SPYATHDPDRAHLFFIP----------------FSPDLSTRSLARLIRTLRTDLPYWNRTLGADHFFLSSSGVGYSSDRNVV--ELKKNAIQVSSFPVPA

Query:  GKFIPHKDISLPPVSGWVP------VPVDESTVRERVLGFV---GYGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGS
          F P +D+SLP ++  +P      V   E     ++L F     +G VR +       +D + L+    P T  +Y + + K+ FCL         G  
Subjt:  GKFIPHKDISLPPVSGWVP------VPVDESTVRERVLGFV---GYGSVRSLSLVKELIEDPEFLMESEPPLTPSSYGEKLAKSDFCLFEYGSGGDGGGS

Query:  GDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRR
             I E+L  GCVPV+I+D  +  LP  D++ W+  +V +     +  +KK+L  + EE    M+R      +HF+ N P +P D  + + + +W+RR
Subjt:  GDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQPLDAFNTVAYQLWVRR

Query:  HAVR
          VR
Subjt:  HAVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTCTCATCGTTCTCTCTCTCCTCCTCTCTTTCTCTCTCCTCTCCACACCCATAGCTCCTTCTCCTTCTCCTTCTCCTTATCTCTCCCCTATTTTTCTCAA
AAATTACAATTCCATGTCTGCAACTTTAAGAATCTTCACTTACATCCCATTCAAGCCATTTTCATTTTCTTCTCCCGCCGAATCGCTTTTCTACAACTCGCTTCTCACAA
GCCCCTACGCTACTCATGATCCTGACCGCGCCCATTTGTTTTTCATTCCCTTTTCCCCTGATCTCTCCACGCGCTCTCTCGCACGTTTGATTCGCACGCTCCGTACGGAC
CTGCCGTACTGGAATCGGACTCTTGGCGCCGATCACTTCTTTTTGTCGTCGTCCGGTGTTGGTTACTCCTCTGATCGGAACGTTGTAGAGTTGAAGAAGAACGCCATTCA
GGTCTCGTCCTTCCCTGTTCCTGCCGGGAAGTTTATTCCTCATAAGGATATTTCCTTGCCGCCGGTTTCCGGTTGGGTTCCCGTTCCGGTGGATGAGTCGACGGTGAGAG
AGAGAGTGTTGGGTTTTGTTGGGTATGGGTCGGTCAGGAGTTTGTCTTTGGTGAAGGAATTGATTGAGGATCCTGAGTTTCTGATGGAGTCGGAGCCGCCGCTTACGCCG
TCGAGTTACGGGGAAAAACTGGCGAAGAGTGACTTTTGTTTGTTCGAATACGGCAGCGGCGGCGATGGTGGTGGTAGTGGGGATGTTTCGGGGATTGGGGAAGCTTTACG
GTTTGGGTGTGTTCCGGTGGTGATTTCGGACCGTCCGATTCAGGACTTGCCGTTGATGGACATTGTACGGTGGCAGGAGATGGCGGTGATCGTCGGCGGCGGCGGCGGAA
TTCAAGGTGTGAAGAAAGTATTGAGGCGCGTGGATGAGGAAAGTTTAGCGAGGATGAAGAGATTGGGTGCGGCGGCAGCCCAGCATTTTCTGTGGAACTCGCCGCCTCAG
CCGTTGGATGCTTTCAATACGGTGGCGTATCAGCTTTGGGTGAGAAGGCACGCCGTTAGATATGCGGAAAGGAGAGAGTGGGCCCAGAATTGA
mRNA sequenceShow/hide mRNA sequence
ACACATTTAACCTTATATGTAATATAATCGATATAAAGAAGGGCAGGGGAAAAGGCAGAGTAAGGAAGCCGGCGGCGGCGGCGGACGTTAGCTTTGCCCGTATTTGACGA
TTCCAAATTCCATCTCTATTACTTTATTCAATTTGTCACACTCATCCCTCCCATAAGAAAATTCAACTCCCATCTCAAAATCACTCTGCAATTCCTTCCACCCTCAACAA
TGGCTTCTTCTCTCATCGTTCTCTCTCTCCTCCTCTCTTTCTCTCTCCTCTCCACACCCATAGCTCCTTCTCCTTCTCCTTCTCCTTATCTCTCCCCTATTTTTCTCAAA
AATTACAATTCCATGTCTGCAACTTTAAGAATCTTCACTTACATCCCATTCAAGCCATTTTCATTTTCTTCTCCCGCCGAATCGCTTTTCTACAACTCGCTTCTCACAAG
CCCCTACGCTACTCATGATCCTGACCGCGCCCATTTGTTTTTCATTCCCTTTTCCCCTGATCTCTCCACGCGCTCTCTCGCACGTTTGATTCGCACGCTCCGTACGGACC
TGCCGTACTGGAATCGGACTCTTGGCGCCGATCACTTCTTTTTGTCGTCGTCCGGTGTTGGTTACTCCTCTGATCGGAACGTTGTAGAGTTGAAGAAGAACGCCATTCAG
GTCTCGTCCTTCCCTGTTCCTGCCGGGAAGTTTATTCCTCATAAGGATATTTCCTTGCCGCCGGTTTCCGGTTGGGTTCCCGTTCCGGTGGATGAGTCGACGGTGAGAGA
GAGAGTGTTGGGTTTTGTTGGGTATGGGTCGGTCAGGAGTTTGTCTTTGGTGAAGGAATTGATTGAGGATCCTGAGTTTCTGATGGAGTCGGAGCCGCCGCTTACGCCGT
CGAGTTACGGGGAAAAACTGGCGAAGAGTGACTTTTGTTTGTTCGAATACGGCAGCGGCGGCGATGGTGGTGGTAGTGGGGATGTTTCGGGGATTGGGGAAGCTTTACGG
TTTGGGTGTGTTCCGGTGGTGATTTCGGACCGTCCGATTCAGGACTTGCCGTTGATGGACATTGTACGGTGGCAGGAGATGGCGGTGATCGTCGGCGGCGGCGGCGGAAT
TCAAGGTGTGAAGAAAGTATTGAGGCGCGTGGATGAGGAAAGTTTAGCGAGGATGAAGAGATTGGGTGCGGCGGCAGCCCAGCATTTTCTGTGGAACTCGCCGCCTCAGC
CGTTGGATGCTTTCAATACGGTGGCGTATCAGCTTTGGGTGAGAAGGCACGCCGTTAGATATGCGGAAAGGAGAGAGTGGGCCCAGAATTGAAAATGGAAACTTCGTGGA
CGGATTCTGATTCGTCTGACGTGGCTTGGCCTGGACGGGCCCCACTGAGTAAAATATTTATGGTTCATTTGGACGGTCGAGATTGTAATTGCATGGGGTTGGTATGAAAT
ATGTGAAATATTTAGGGATCAGATTGAAGTTGCGTTTTCTTGGGGGAAATATTTTTAATGTGAAAATAAATATGTGAAGGGCTTAGCATTATCACTTGGTGCATTAATAA
TTAACTTTTTTCAATTTTTTTTAAAATTTTTTTATTTAATTCTTTTAATTGTTTATTAT
Protein sequenceShow/hide protein sequence
MASSLIVLSLLLSFSLLSTPIAPSPSPSPYLSPIFLKNYNSMSATLRIFTYIPFKPFSFSSPAESLFYNSLLTSPYATHDPDRAHLFFIPFSPDLSTRSLARLIRTLRTD
LPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPAGKFIPHKDISLPPVSGWVPVPVDESTVRERVLGFVGYGSVRSLSLVKELIEDPEFLMESEPPLTP
SSYGEKLAKSDFCLFEYGSGGDGGGSGDVSGIGEALRFGCVPVVISDRPIQDLPLMDIVRWQEMAVIVGGGGGIQGVKKVLRRVDEESLARMKRLGAAAAQHFLWNSPPQ
PLDAFNTVAYQLWVRRHAVRYAERREWAQN