CuGenDBv2

Tan0012614 (gene) of Snake gourd v1 genome

Gene ID: Tan0012614
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Exostosin domain-containing protein
Genome location: LG04:1997590..1999325
RNA-Seq Expression: Tan0012614
Synteny: Tan0012614
Gene Ontology terms:
GO:0006486 - protein glycosylation (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains:
IPR004263 - Exostosin-like
IPR040911 - Exostosin, GT47 domain
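
The genome location above uses a "sequence:start..end" notation. As a minimal sketch, it can be split into its parts as shown below. The names GenomeLocation and parse_location are hypothetical helpers made up for this illustration, and the span length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'LG04:1997590..1999325'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        # Normalise so that start <= end regardless of input order.
        start, end = end, start
    return GenomeLocation(seqid, start, end)


loc = parse_location("LG04:1997590..1999325")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 1736 positions.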


Homology
GenBank top hits
KAG6600032.1 putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 2.0e-162; identity: 82.96%)
Query:  MASLFTLSLLLLSLSLLTTATASPSSPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLAR
        MASL T S LL SLSLL  A ASP SPYLSPIF +NY++MST+ KIFTYIP KP+SF SPAESLFYKSLLDSPYSTH+P  AH FF+PFSP  STRSLAR
Subjt:  MASLFTLSLLLLSLSLLTTATASPSSPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLAR

Query:  LIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRAL
        LIRTLR+ELPYWNRTLGADHFFLSS+G+ YASDRN+VELKKNAIQVSG PVP G FI HKDITLPPVF S  FSSS IPAPATERVLGFVGYGWVRDR L
Subjt:  LIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRAL

Query:  VKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLR
        VKELIEDP F ME+EPPPP SE WNYGE++ KSDFCLFEYGG  G VLRIGE LR+GCVPVVISDRP+QDLPLMDVLRWQDMAVFVNG  G+EGVKR LR
Subjt:  VKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLR

Query:  RVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS
        RVDEESLVKMKRLG AAAQHFVWNSPP+PLDAFNTVAYQLW+RRH +RYAERKEWAQS
Subjt:  RVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS

XP_022941986.1 probable glycosyltransferase At5g03795 [Cucurbita moschata] (e-value: 7.7e-154; identity: 84.64%)
Query:  MSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVEL
        MST+ KIFTYIP KP+SF SPAESLFYKSLLDSPYSTH+P  AH FF+PFSP  STRSLARLIRTLR++LPYWNRTLGADHFFLSS G+ YASDRN+VEL
Subjt:  MSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVEL

Query:  KKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFE
        KKNAIQVSG PVP G FI HKDITLPPVF S  FSSS IPAPATERVLGFVGYGWVRDR LVKELIEDPEF ME+EPPPP SE WNYGE++ KSDFCLFE
Subjt:  KKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFE

Query:  YGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQ
        YGG  G VLRIGE LR+GCVPVVISDRP+QDLPLMDVLRWQDMAVFVNGG G+EGVKR LRRVD ESLVKMKRLG AAAQHFVWNSPP+PLDAFNTVAYQ
Subjt:  YGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQ

Query:  LWVRRHAVRYAERKEWAQS
        LW+RRH +RYAERKEWAQS
Subjt:  LWVRRHAVRYAERKEWAQS

XP_022995754.1 probable glycosyltransferase At5g03795 [Cucurbita maxima] (e-value: 3.5e-162; identity: 83.01%)
Query:  MASLFTLSLLLLSLSLLTTATASPSSPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLAR
        MASL T S LLLSLSLL  A ASP SPYLSPIF +NY++MST+ KIFTYIP KP+SF SPAESLFYKSLLDSPYSTH P  AH FF+PFSP  STRSLAR
Subjt:  MASLFTLSLLLLSLSLLTTATASPSSPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLAR

Query:  LIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRAL
        LIRTLR+ELPYWNRTLGADHFFLSS G+ YASDRN+VELKKNAIQVSG PVP G FI HKDITLPPVF S  FSSS IPAPATERVLGFVGYGWVRDR L
Subjt:  LIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRAL

Query:  VKELIEDPEFLMETE-PPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKL
        VKELIEDPEF ME+E PPPP SE  NYGE++ KSDFCLFEYGG  G VLRIGE +R+GCVPVVISDRP+QDLPLMDVLRWQDMAVFVNGG G+EGVKR L
Subjt:  VKELIEDPEFLMETE-PPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKL

Query:  RRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS
        RRVDEESLVKMKRLG AAAQHFVWNSPP+PLDAFNTVAYQLW+RRH +RYAERKEWAQS
Subjt:  RRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS

XP_023542057.1 probable glycosyltransferase At5g03795 [Cucurbita pepo subsp. pepo] (e-value: 1.5e-152; identity: 84.38%)
Query:  MSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVEL
        MST+ KIFTYIP KP+SF SPAESLFYKSLLDSPYSTH+P  AH FF+PFSP  STRSLARLIRTLR+ELPYWNRTLGADHFFLSS+G+ Y SDRN+VEL
Subjt:  MSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVEL

Query:  KKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRALVKELIEDPEFLMETEPPPPS-SEAWNYGEKMAKSDFCLF
        KKNAIQVSG PVP G FI HKDITLPPVF S  FSSS IPAPATERVLGFVGYGWVRDR LVKELIEDPEF ME+EPPP S SE  NYGE++ KSDFCLF
Subjt:  KKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRALVKELIEDPEFLMETEPPPPS-SEAWNYGEKMAKSDFCLF

Query:  EYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAY
        EYGG  G VLRIGE LR+GCVPVVISDRP+QDLPLMDVLRWQDMAVFVNGG G+EGVKR LRRVDEESLVKMKRLG AAAQHFVWNSPP+PLDAFNTVAY
Subjt:  EYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAY

Query:  QLWVRRHAVRYAERKEWAQS
        QLW+RRH +RYAERKEWAQS
Subjt:  QLWVRRHAVRYAERKEWAQS

XP_038889277.1 probable glycosyltransferase At5g03795 [Benincasa hispida] (e-value: 3.6e-143; identity: 73.74%)
Query:  ASLFTLSLLLLSLSLLTTATASPS-SPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLAR
        +SL TLSLLL    L T  T+SPS SPYLSPIF KNY+SMS + +IFTYIP +P SFSSPAESLFYKSLL+SPY+TH+P  AHLFF+PFSP LSTRSL R
Subjt:  ASLFTLSLLLLSLSLLTTATASPS-SPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLAR

Query:  LIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRAL
        LIRTLRT+LPYWNRTLGADHFFLSSAG+GY+S+RNVVELKKNAIQVS  PVP GKFIPHKDI+LPPV G +  +         ERVLGFVGYGWV+  +L
Subjt:  LIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRAL

Query:  VKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLR
        V ELIEDPEF+ME+EPP  +S   +YGEK+AKSDFCLFEYGG  G V  IGEALR GC+PVVIS RP+QDLPLMDV+RWQ+MAVF+ G  G++GVK+ LR
Subjt:  VKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLR

Query:  RVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS
         VD+ESL +MKRLG AAAQHF WNSPP+PLDAFNTVA+QLWVRRHAVRYAER+EWAQS
Subjt:  RVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS

TrEMBL top hits
A0A0A0KS95 Exostosin domain-containing protein (e-value: 1.1e-142; identity: 73.48%)
Query:  ASLFTLSLLLLSLSLLTTATASPS-----SPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTR
        +SL TLSLLL    L T  T SPS     SPYLSPIF KNY+SMS + +IFTYIP  P SFSS AESLFYKSLL+SPY+TH+P  AHLFF+PFSPH+STR
Subjt:  ASLFTLSLLLLSLSLLTTATASPS-----SPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTR

Query:  SLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVR
        SLARLIRTLRT+LPYWNRTLGADHFFLSS+G+GY SDRNVVELKKNAIQVS  PV PGKFIPHKD++LPPV  S   S+    +  +ER+LGFVGYGWV+
Subjt:  SLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVR

Query:  DRALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVK
          +LVKELIEDPEFLME+EPP   S    YG+K+AKSDFCLFEY G  G V  IGEALR GCVPVVISDR +QDLPLMDV+RW++MAVFV GGGG+EGVK
Subjt:  DRALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVK

Query:  RKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS
        + LRRVD E L +MK+LG AAAQHFVWNSPP+PLDAFNTVAYQLWVRRHAVRYA+R+EWAQ+
Subjt:  RKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS

A0A1S3CF96 probable glycosyltransferase At3g07620 (e-value: 1.1e-140; identity: 72.63%)
Query:  MASLFTLSLLLLSLSLLTTATASPSSPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLAR
        M S      LLLS SLL T      SPYLSPIF KNY+SMS + +IFTYIP    SFSS AESLFY+SLL+SPYSTH+P  AHLFFVPFSP +S RSL+R
Subjt:  MASLFTLSLLLLSLSLLTTATASPSSPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLAR

Query:  LIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRAL
        LIRTLRT+LPYWNRTLGADHFFLSS+G+GY  DRNVVELKKNAIQVS  PVPPGKFIPHKDI+LPPV   +    S +    +ER+LGFVGYGWV+  +L
Subjt:  LIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRAL

Query:  VKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLR
        VKELIEDPEFLME+EPPP  S    YG+KMAKSDFCLFEYG   G V  IGEALR GCVPVVISDR +QDLPLMD +RWQ+MAVFV GGGG+EGVK+ LR
Subjt:  VKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLR

Query:  RVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS
         VD E L +MKRLG AAAQHFVWNSPP+PLDAFNTVAYQLW+RRHAVRYA+R+EWAQ+
Subjt:  RVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS

A0A5A7U559 Putative glycosyltransferase (e-value: 1.8e-132; identity: 74.61%)
Query:  MSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVEL
        MS + +IFTYIP    SFSS AESLFY+SLL+SPYSTH+P  AHLFFVPFSP +S RSL+RLIRTLRT+LPYWNRTLGADHFFLSS+G+GY  DRNVVEL
Subjt:  MSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVEL

Query:  KKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFE
        KKNAIQVS  PVPPGKFIPHKDI+LPPV   +    S +    +ER+LGFVGYGWV+  +LVKELIEDPEFLME+EPPP  S    YG+KMAKSDFCLFE
Subjt:  KKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFE

Query:  YGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQ
        YG   G V  IGEALR GCVPVVISDR +QDLPLMD +RWQ+MAVFV GGGG+EGVK+ LR VD E L +MKRLG AAAQHFVWNSPP+PLDAFNTVAYQ
Subjt:  YGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQ

Query:  LWVRRHAVRYAERKEWAQS
        LW+RRHAVRYA+R+EWAQ+
Subjt:  LWVRRHAVRYAERKEWAQS

A0A6J1FML5 probable glycosyltransferase At5g03795 (e-value: 3.7e-154; identity: 84.64%)
Query:  MSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVEL
        MST+ KIFTYIP KP+SF SPAESLFYKSLLDSPYSTH+P  AH FF+PFSP  STRSLARLIRTLR++LPYWNRTLGADHFFLSS G+ YASDRN+VEL
Subjt:  MSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVEL

Query:  KKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFE
        KKNAIQVSG PVP G FI HKDITLPPVF S  FSSS IPAPATERVLGFVGYGWVRDR LVKELIEDPEF ME+EPPPP SE WNYGE++ KSDFCLFE
Subjt:  KKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFE

Query:  YGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQ
        YGG  G VLRIGE LR+GCVPVVISDRP+QDLPLMDVLRWQDMAVFVNGG G+EGVKR LRRVD ESLVKMKRLG AAAQHFVWNSPP+PLDAFNTVAYQ
Subjt:  YGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQ

Query:  LWVRRHAVRYAERKEWAQS
        LW+RRH +RYAERKEWAQS
Subjt:  LWVRRHAVRYAERKEWAQS

A0A6J1K6U0 probable glycosyltransferase At5g03795 (e-value: 1.7e-162; identity: 83.01%)
Query:  MASLFTLSLLLLSLSLLTTATASPSSPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLAR
        MASL T S LLLSLSLL  A ASP SPYLSPIF +NY++MST+ KIFTYIP KP+SF SPAESLFYKSLLDSPYSTH P  AH FF+PFSP  STRSLAR
Subjt:  MASLFTLSLLLLSLSLLTTATASPSSPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLAR

Query:  LIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRAL
        LIRTLR+ELPYWNRTLGADHFFLSS G+ YASDRN+VELKKNAIQVSG PVP G FI HKDITLPPVF S  FSSS IPAPATERVLGFVGYGWVRDR L
Subjt:  LIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRAL

Query:  VKELIEDPEFLMETE-PPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKL
        VKELIEDPEF ME+E PPPP SE  NYGE++ KSDFCLFEYGG  G VLRIGE +R+GCVPVVISDRP+QDLPLMDVLRWQDMAVFVNGG G+EGVKR L
Subjt:  VKELIEDPEFLMETE-PPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKL

Query:  RRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS
        RRVDEESLVKMKRLG AAAQHFVWNSPP+PLDAFNTVAYQLW+RRH +RYAERKEWAQS
Subjt:  RRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAERKEWAQS

SwissProt top hits
Q3E7Q9 Probable glycosyltransferase At5g25310 (e-value: 1.4e-20; identity: 27.49%)
Query:  KNYDSMSTSFKIFTYIPLK-PISFSSPAESLF--------YKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSL--------------ARLIRTLRTELPY
        ++Y  M   FK++ Y   + P+    P +S++              + + T++P+ A+++F+PFS     R L              +  IR + T  P+
Subjt:  KNYDSMSTSFKIFTYIPLK-PISFSSPAESLF--------YKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSL--------------ARLIRTLRTELPY

Query:  WNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPV--FGS-------LGFSSSRIPAPATERVLGFVGYGWVRDRALVK
        WNRT GADHF L+    G  + +   +L   +I+V         F P KD+TLP +  +G        L  + S  P P      G V +G VR   L  
Subjt:  WNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPV--FGS-------LGFSSSRIPAPATERVLGFVGYGWVRDRALVK

Query:  ELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRV
            D +  +    P    +  NY + M  S FC F   G +    R+ EA+   C+PV++S   +  LP  DVLRW+  +V V+    +  +K  L  +
Subjt:  ELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRV

Query:  DEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRR
          E    +K       +HF  N PP+  DAF+   + +W+RR
Subjt:  DEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRR

Q94AA9 Xylogalacturonan beta-1,3-xylosyltransferase (e-value: 8.1e-21; identity: 28.85%)
Query:  SSPYLSP-IFPKNYDSMSTSFKIFTY----IPL---KPISFSSPAESLFYKSL-LDSP-----YSTHNPHDAHLFFVPFS----------PHLSTR--SL
        SS Y +P  F +++  M   FK++TY    +PL    P++     E  F   + +D P     +    P +AH+FF+PFS          P  S    S 
Subjt:  SSPYLSP-IFPKNYDSMSTSFKIFTY----IPL---KPISFSSPAESLFYKSL-LDSP-----YSTHNPHDAHLFFVPFS----------PHLSTR--SL

Query:  ARLIRTLR-------TELPYWNRTLGADHFFLSSAGLG-YASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVF---GSLGFSSSRIPAPATERVL
        ARL R +        T+ PYWNR+ G DHF +S         D N    +K    +       G F P+ D+++P ++   G LG  S    +P    +L
Subjt:  ARLIRTLR-------TELPYWNRTLGADHFFLSSAGLG-YASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVF---GSLGFSSSRIPAPATERVL

Query:  GFV---GYGWVRDRALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAV
         F     +G +R + L +   E    +   +  PP  +   Y + M  S FCL    G +    R  EA+  GCVPV+ISD     LP  DVL W   ++
Subjt:  GFV---GYGWVRDRALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAV

Query:  FVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVR
         +     ++ +K  L+ V     +KM +      QHFV N P +P D  + + + +W+RR  +R
Subjt:  FVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVR

Q9FFN2 Probable glycosyltransferase At5g03795 (e-value: 6.8e-28; identity: 28.16%)
Query:  IFPKNYDSMSTSFKIFTYIPLKPISF-SSPAESLF-------YKSLLDSPYSTHNPHDAHLFFVPFSP--------HLSTRSLARLIRTLR-------TE
        +F ++Y  M   FKI+ Y   +P  F   P +S++       Y+   D+ + T+NP  AH+F++PFS           ++R  + +  T++        +
Subjt:  IFPKNYDSMSTSFKIFTYIPLKPISF-SSPAESLF-------YKSLLDSPYSTHNPHDAHLFFVPFSP--------HLSTRSLARLIRTLR-------TE

Query:  LPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRI--PAPATERVLGFVG---YGWVRDRALVKE
         PYWNR++GADHF LS    G  +  +   L  N+I+         +F P KD+++P +    G  +  +  P+P++  +L F     +G VR   L   
Subjt:  LPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRI--PAPATERVLGFVG---YGWVRDRALVKE

Query:  LIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVD
          +D +  +    P  +S    Y + M  S FC+    G +    RI EAL  GCVPV+I+   +   P  DVL W+  +V V+    +  +K  L  + 
Subjt:  LIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVD

Query:  EESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAE
            ++M R      +HF  NSP +  D F+ + + +WVRR  V+  E
Subjt:  EESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAE

Q9LFP3 Probable glycosyltransferase At5g11130 (e-value: 1.2e-24; identity: 29.81%)
Query:  SPSSPYLSPI-FPKNYDSMSTSFKIFTY----IPL---KPISFSSPAESLFYKSLL--DSPYSTHNPHDAHLFFVP----------FSPHLS------TR
        S  S YL+   F +++  M   FKI+TY     PL    P++     E  F   +   +S +   +P +A +F++P          + P+ S        
Subjt:  SPSSPYLSPI-FPKNYDSMSTSFKIFTY----IPL---KPISFSSPAESLFYKSLL--DSPYSTHNPHDAHLFFVP----------FSPHLS------TR

Query:  SLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVV--ELKKNAIQVSGCPVPPGKFIPHKDITLPPV---FGSLGFSSSRIPAPATERVLGFVG
         +   I  +    PYWNR+ GADHFFLS     +A D + V  EL K+ I+          F P +D++LP +      LGF  +  P P   ++L F  
Subjt:  SLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVV--ELKKNAIQVSGCPVPPGKFIPHKDITLPPV---FGSLGFSSSRIPAPATERVLGFVG

Query:  YGWVRD--RALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGG
         G   D  + L +   E  + ++  E  P   +  NY + M K+ FCL    G +    RI E+L  GCVPV+I+D  +  LP  DVL W+  +V +   
Subjt:  YGWVRD--RALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGG

Query:  GGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVR
          M  +K+ L  + EE  + M+R      +HFV N P +P D  + + + +W+RR  VR
Subjt:  GGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVR

Q9SSE8 Probable glycosyltransferase At3g07620 (e-value: 1.0e-23; identity: 28.41%)
Query:  FPKNYDSMSTSFKIFTYIPLKPISFS-------SPAESLFYKSLLDS--PYSTHNPHDAHLFFVPFS-----PHL----------STRSLARLIRTLRTE
        F ++Y  M   FKI+ Y    P  F           E LF   + +    Y T +P  AH++F+PFS      HL            R +A  ++ +  +
Subjt:  FPKNYDSMSTSFKIFTYIPLKPISFS-------SPAESLFYKSLLDS--PYSTHNPHDAHLFFVPFS-----PHL----------STRSLARLIRTLRTE

Query:  LPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPV---FGSLGFSSSRIPAPATERVLGFVG--YGWVRDRALVKE
         PYWN + G DHF LS    G+ +   V +L  N+I+V         F P KD   P +    G +   +  +   +   +  F G  +G +R   L   
Subjt:  LPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPV---FGSLGFSSSRIPAPATERVLGFVG--YGWVRDRALVKE

Query:  LIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVD
          +D + L+    P    +  +Y E M KS FC+    G +    R+ EA+  GCVPV+IS+  +  LP  DVL W+  +V V+     E +KR L  + 
Subjt:  LIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVD

Query:  EESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVR
        EE  +++    +   +H + N PP+  D FN + + +W+RR  V+
Subjt:  EESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVR

Arabidopsis top hits
AT3G07620.1 Exostosin family protein (e-value: 7.2e-25; identity: 28.41%)
Query:  FPKNYDSMSTSFKIFTYIPLKPISFS-------SPAESLFYKSLLDS--PYSTHNPHDAHLFFVPFS-----PHL----------STRSLARLIRTLRTE
        F ++Y  M   FKI+ Y    P  F           E LF   + +    Y T +P  AH++F+PFS      HL            R +A  ++ +  +
Subjt:  FPKNYDSMSTSFKIFTYIPLKPISFS-------SPAESLFYKSLLDS--PYSTHNPHDAHLFFVPFS-----PHL----------STRSLARLIRTLRTE

Query:  LPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPV---FGSLGFSSSRIPAPATERVLGFVG--YGWVRDRALVKE
         PYWN + G DHF LS    G+ +   V +L  N+I+V         F P KD   P +    G +   +  +   +   +  F G  +G +R   L   
Subjt:  LPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPV---FGSLGFSSSRIPAPATERVLGFVG--YGWVRDRALVKE

Query:  LIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVD
          +D + L+    P    +  +Y E M KS FC+    G +    R+ EA+  GCVPV+IS+  +  LP  DVL W+  +V V+     E +KR L  + 
Subjt:  LIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVD

Query:  EESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVR
        EE  +++    +   +H + N PP+  D FN + + +W+RR  V+
Subjt:  EESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVR

AT4G16745.1 Exostosin family protein (e-value: 1.9e-25; identity: 26.9%)
Query:  IFPKNYDSMSTSFKIFTYIPLKPISFSSP------AESLFYKSLLDS--PYSTHNPHDAHLFFVPFSPHLSTRS---------------LARLIRTLRTE
        +F ++Y+ M    K++ Y       F  P      A   ++  L++S   + T NP  AHLF++P+S     +S               L   +  L  +
Subjt:  IFPKNYDSMSTSFKIFTYIPLKPISFSSP------AESLFYKSLLDS--PYSTHNPHDAHLFFVPFSPHLSTRS---------------LARLIRTLRTE

Query:  LPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQ-VSGCPVPPGKFIPHKDITLPPV--------FGSLGFSSSRIPAPATERVLGFVGYGWVRDRA
         P+WNRT G+DHF ++    G  +     ELK+NAI+ +    +  G F+P KD++LP            ++G  +     P      G + +G VR + 
Subjt:  LPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQ-VSGCPVPPGKFIPHKDITLPPV--------FGSLGFSSSRIPAPATERVLGFVGYGWVRDRA

Query:  LVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKL
        L     +D +  +    P   +    Y + M  S +CL    G +    RI EA+ + CVPVVI+D  M  LP  DVL W   +V V     +  +K  L
Subjt:  LVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKL

Query:  RRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLW
          +     +KM+   +   +HF+W+  PR  D F+ + + +W
Subjt:  RRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLW

AT4G38040.1 Exostosin family protein (e-value: 4.8e-37; identity: 30.48%)
Query:  SSPYLSP-IFPKNYDSMSTSFKIFTYIPLKPISF---------SSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPH------LSTRSLARLIRT----
        S  Y SP  F  NY  M   FK++ Y    P +F            +E  F++++ +S + T +P +A LFF+P S H       S  ++  +++     
Subjt:  SSPYLSP-IFPKNYDSMSTSFKIFTYIPLKPISF---------SSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPH------LSTRSLARLIRT----

Query:  LRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATE----RVLGF-VGYGWVRDRA
        L  + PYWNRTLGADHFF++   +G  +      L KN I+V   P     FIPHKD+ LP V          +PA   +      LGF  G+   + R 
Subjt:  LRTELPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATE----RVLGF-VGYGWVRDRA

Query:  LVKELIE-DPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRK
        ++  + E D E  +       ++    Y ++  ++ FC+   GG      RI +++ +GC+PV++SD    DLP  D+L W+  AV +     +  +K+ 
Subjt:  LVKELIE-DPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRK

Query:  LRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRY
        L+ +     V +        +HF WNSPP   DAF+ + Y+LW+R H V+Y
Subjt:  LRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRY

AT5G03795.1 Exostosin family protein (e-value: 4.8e-29; identity: 28.16%)
Query:  IFPKNYDSMSTSFKIFTYIPLKPISF-SSPAESLF-------YKSLLDSPYSTHNPHDAHLFFVPFSP--------HLSTRSLARLIRTLR-------TE
        +F ++Y  M   FKI+ Y   +P  F   P +S++       Y+   D+ + T+NP  AH+F++PFS           ++R  + +  T++        +
Subjt:  IFPKNYDSMSTSFKIFTYIPLKPISF-SSPAESLF-------YKSLLDSPYSTHNPHDAHLFFVPFSP--------HLSTRSLARLIRTLR-------TE

Query:  LPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRI--PAPATERVLGFVG---YGWVRDRALVKE
         PYWNR++GADHF LS    G  +  +   L  N+I+         +F P KD+++P +    G  +  +  P+P++  +L F     +G VR   L   
Subjt:  LPYWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRI--PAPATERVLGFVG---YGWVRDRALVKE

Query:  LIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVD
          +D +  +    P  +S    Y + M  S FC+    G +    RI EAL  GCVPV+I+   +   P  DVL W+  +V V+    +  +K  L  + 
Subjt:  LIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVD

Query:  EESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAE
            ++M R      +HF  NSP +  D F+ + + +WVRR  V+  E
Subjt:  EESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVRYAE

AT5G11130.1 Exostosin family protein (e-value: 8.6e-26; identity: 29.81%)
Query:  SPSSPYLSPI-FPKNYDSMSTSFKIFTY----IPL---KPISFSSPAESLFYKSLL--DSPYSTHNPHDAHLFFVP----------FSPHLS------TR
        S  S YL+   F +++  M   FKI+TY     PL    P++     E  F   +   +S +   +P +A +F++P          + P+ S        
Subjt:  SPSSPYLSPI-FPKNYDSMSTSFKIFTY----IPL---KPISFSSPAESLFYKSLL--DSPYSTHNPHDAHLFFVP----------FSPHLS------TR

Query:  SLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVV--ELKKNAIQVSGCPVPPGKFIPHKDITLPPV---FGSLGFSSSRIPAPATERVLGFVG
         +   I  +    PYWNR+ GADHFFLS     +A D + V  EL K+ I+          F P +D++LP +      LGF  +  P P   ++L F  
Subjt:  SLARLIRTLRTELPYWNRTLGADHFFLSSAGLGYASDRNVV--ELKKNAIQVSGCPVPPGKFIPHKDITLPPV---FGSLGFSSSRIPAPATERVLGFVG

Query:  YGWVRD--RALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGG
         G   D  + L +   E  + ++  E  P   +  NY + M K+ FCL    G +    RI E+L  GCVPV+I+D  +  LP  DVL W+  +V +   
Subjt:  YGWVRD--RALVKELIEDPEFLMETEPPPPSSEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGG

Query:  GGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVR
          M  +K+ L  + EE  + M+R      +HFV N P +P D  + + + +W+RR  VR
Subjt:  GGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPLDAFNTVAYQLWVRRHAVR


Sequences
CDS sequence
ATGGCTTCCCTCTTCACTCTCTCTCTCCTCCTCCTCTCTCTCTCTCTCCTCACCACAGCGACCGCCTCCCCTTCTTCTCCATATCTTTCTCCCATTTTCCCCAAAAACTA
CGATTCCATGTCCACGTCCTTCAAAATCTTTACCTACATCCCACTCAAACCCATTTCCTTCTCTTCTCCCGCCGAATCACTTTTCTACAAATCGCTTCTCGACAGCCCCT
ACTCTACTCACAATCCCCACGACGCCCATTTGTTCTTCGTCCCTTTTTCTCCCCATCTCTCCACGCGCTCTCTCGCCCGTTTGATCCGCACACTCCGTACAGAGTTGCCG
TACTGGAATCGGACTCTTGGGGCTGATCATTTCTTTCTCTCCTCCGCCGGCCTGGGATATGCTTCTGATCGGAACGTCGTCGAGTTGAAGAAGAACGCCATTCAGGTCTC
GGGATGCCCGGTGCCGCCTGGGAAGTTTATTCCTCATAAGGACATTACGTTGCCGCCGGTTTTCGGTTCGTTGGGATTTTCTTCTTCCCGGATTCCGGCGCCGGCGACGG
AGAGGGTGCTGGGTTTCGTCGGGTATGGATGGGTGAGGGATCGGGCGTTGGTGAAGGAGTTGATTGAGGATCCTGAGTTTTTGATGGAGACAGAGCCGCCGCCGCCATCG
TCGGAGGCGTGGAATTACGGGGAGAAAATGGCGAAAAGTGACTTTTGTTTGTTCGAATACGGCGGCGTTGACGGCGGTGTTTTGAGGATTGGGGAGGCTTTGCGGCATGG
GTGTGTGCCGGTGGTGATTTCTGACCGTCCGATGCAGGACTTGCCGTTGATGGACGTGTTACGGTGGCAGGACATGGCGGTGTTCGTCAACGGCGGCGGAGGAATGGAAG
GAGTGAAGAGAAAATTGAGGCGCGTGGACGAGGAGAGTCTCGTAAAAATGAAGAGATTGGGTGAGGCGGCGGCACAGCATTTCGTGTGGAACTCGCCACCTCGGCCGTTG
GATGCATTCAATACGGTGGCGTATCAGCTTTGGGTGAGAAGGCACGCCGTCAGATATGCCGAGAGGAAAGAGTGGGCCCAGAGTTGA
mRNA sequence
ATTTGTGGCAGTCATCCTTTAGAATAATAATAACAATCTCTTCTCTCTTAGTTTTAATTTTAATTTCCGATACCCATTAATTAAATTAAACTCCCATATTCAACAATATT
ATCATCTTCTTCAACCCCAAAATCACTCTGCAACTTCCTCCTCCAACAATGGCTTCCCTCTTCACTCTCTCTCTCCTCCTCCTCTCTCTCTCTCTCCTCACCACAGCGAC
CGCCTCCCCTTCTTCTCCATATCTTTCTCCCATTTTCCCCAAAAACTACGATTCCATGTCCACGTCCTTCAAAATCTTTACCTACATCCCACTCAAACCCATTTCCTTCT
CTTCTCCCGCCGAATCACTTTTCTACAAATCGCTTCTCGACAGCCCCTACTCTACTCACAATCCCCACGACGCCCATTTGTTCTTCGTCCCTTTTTCTCCCCATCTCTCC
ACGCGCTCTCTCGCCCGTTTGATCCGCACACTCCGTACAGAGTTGCCGTACTGGAATCGGACTCTTGGGGCTGATCATTTCTTTCTCTCCTCCGCCGGCCTGGGATATGC
TTCTGATCGGAACGTCGTCGAGTTGAAGAAGAACGCCATTCAGGTCTCGGGATGCCCGGTGCCGCCTGGGAAGTTTATTCCTCATAAGGACATTACGTTGCCGCCGGTTT
TCGGTTCGTTGGGATTTTCTTCTTCCCGGATTCCGGCGCCGGCGACGGAGAGGGTGCTGGGTTTCGTCGGGTATGGATGGGTGAGGGATCGGGCGTTGGTGAAGGAGTTG
ATTGAGGATCCTGAGTTTTTGATGGAGACAGAGCCGCCGCCGCCATCGTCGGAGGCGTGGAATTACGGGGAGAAAATGGCGAAAAGTGACTTTTGTTTGTTCGAATACGG
CGGCGTTGACGGCGGTGTTTTGAGGATTGGGGAGGCTTTGCGGCATGGGTGTGTGCCGGTGGTGATTTCTGACCGTCCGATGCAGGACTTGCCGTTGATGGACGTGTTAC
GGTGGCAGGACATGGCGGTGTTCGTCAACGGCGGCGGAGGAATGGAAGGAGTGAAGAGAAAATTGAGGCGCGTGGACGAGGAGAGTCTCGTAAAAATGAAGAGATTGGGT
GAGGCGGCGGCACAGCATTTCGTGTGGAACTCGCCACCTCGGCCGTTGGATGCATTCAATACGGTGGCGTATCAGCTTTGGGTGAGAAGGCACGCCGTCAGATATGCCGA
GAGGAAAGAGTGGGCCCAGAGTTGAAAATTGGAAACTTCGTCTGACGTGGCTTTGGGCGGGCCCCACATAGCAAAATGTGATTAATTTGGATTTGAACGGTTGAGATTGC
GATTACGATTTGCATGGGGTTGGGACTTTTCTTAGTATATGAAAAAAGGTTAGGGGTCAGATTTGTAGTTGCATTTTTTTGGTGAAAATATTGAATGTCAAAATAAATAT
GTGATGTGAAGGGCTTAGAATTATCGCTTGGTGAAATAATTAATGAACTTTTTTTATTTTTATTTTGTCAGGTTATTGAAAGTTGAAACTATAAATTTATGTACAAAACT
TACTCGAAAGATTCGAGTAACCTAACATGCAATGTATGAATTAAGTTGGGTTTATAAATGCTTCCCAAAAAAAAAAAAAAATTGGATATTAAATTTTCTTGAATATTTGG
TTTACATTTAAATAAATTCAATGTATTTATAGATTGAGCTTGGGAGGACTATGAACTAGGATAGATCGCAAAAATGTGTTTGCATG
Protein sequence
MASLFTLSLLLLSLSLLTTATASPSSPYLSPIFPKNYDSMSTSFKIFTYIPLKPISFSSPAESLFYKSLLDSPYSTHNPHDAHLFFVPFSPHLSTRSLARLIRTLRTELP
YWNRTLGADHFFLSSAGLGYASDRNVVELKKNAIQVSGCPVPPGKFIPHKDITLPPVFGSLGFSSSRIPAPATERVLGFVGYGWVRDRALVKELIEDPEFLMETEPPPPS
SEAWNYGEKMAKSDFCLFEYGGVDGGVLRIGEALRHGCVPVVISDRPMQDLPLMDVLRWQDMAVFVNGGGGMEGVKRKLRRVDEESLVKMKRLGEAAAQHFVWNSPPRPL
DAFNTVAYQLWVRRHAVRYAERKEWAQS
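
The CDS and protein sequences listed above are related by translation. The sketch below shows one way to check that relationship on short fragments. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly. The translate function and the STANDARD_CODE table are illustrative names written for this example, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD_CODE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; a trailing partial codon
    is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = STANDARD_CODE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS sequence listed above.
print(translate("ATGGCTTCCCTCTTCACTCTCTCTCTCCTC"))  # MASLFTLSLL
print(translate("GCCCAGAGTTGA"))                    # AQS
```

The two fragments give MASLFTLSLL and AQS, which match the start and end of the protein sequence shown above. Passing the full CDS text to translate gives a complete check in the same way.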