CuGenDBv2

Carg18341 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg18341
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Exostosin domain-containing protein
Genome location: Carg_Chr04:972434..973469
RNA-Seq Expression: Carg18341
Synteny: Carg18341
Gene Ontology terms:
GO:0006486 - protein glycosylation (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains:
IPR004263 - Exostosin-like
IPR040911 - Exostosin, GT47 domain
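The genome location listed above follows a `chromosome:start..end` pattern. The following is a minimal sketch of how such a string could be split into its parts. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chromosome = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chromosome, start, end


chromosome, start, end = parse_location("Carg_Chr04:972434..973469")
# Assumption: both endpoints belong to the feature (1-based, inclusive).
span = end - start + 1
print(chromosome, start, end, span)
```

If the coordinates turn out to be 0-based or half-open, only the `span` line needs to change.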


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600032.1 putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. sororia]; e value 2.0e-180; identity 99.06%
Query:  MASLITFSLLFSLSFLAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRT
        MASLITFSLLFSLS LAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRT
Subjt:  MASLITFSLLFSLSFLAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRT

Query:  LRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL
        LRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL
Subjt:  LRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL

Query:  IEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEES
        IEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEES
Subjt:  IEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEES

Query:  LVKMKRLGAAAAQHFVWVGP
        LVKMKRLGAAAAQHFVW  P
Subjt:  LVKMKRLGAAAAQHFVWVGP

KAG7030699.1 putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 8.6e-195; identity 100%
Query:  MASLITFSLLFSLSFLAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRT
        MASLITFSLLFSLSFLAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRT
Subjt:  MASLITFSLLFSLSFLAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRT

Query:  LRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL
        LRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL
Subjt:  LRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL

Query:  IEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEES
        IEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEES
Subjt:  IEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEES

Query:  LVKMKRLGAAAAQHFVWVGPELKNGSYVCMGFGLLINLNI
        LVKMKRLGAAAAQHFVWVGPELKNGSYVCMGFGLLINLNI
Subjt:  LVKMKRLGAAAAQHFVWVGPELKNGSYVCMGFGLLINLNI

XP_022941986.1 probable glycosyltransferase At5g03795 [Cucurbita moschata]; e value 1.2e-159; identity 97.54%
Query:  MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVEL
        MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRS+LPYWNRTLGADHFFLSS GVRYASDRNIVEL
Subjt:  MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVEL

Query:  KKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEY
        KKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDP FFMESEPPPPSERWNYGERLGKSDFCLFEY
Subjt:  KKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEY

Query:  GGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRLGAAAAQHFVWVGP
        GGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNG KGIEGVKRVLRRVD ESLVKMKRLGAAAAQHFVW  P
Subjt:  GGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRLGAAAAQHFVWVGP

XP_022995754.1 probable glycosyltransferase At5g03795 [Cucurbita maxima]; e value 8.3e-174; identity 96.27%
Query:  MASLITFSLLFSLSFLAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRT
        MASLITFSLL SLS LAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTH+PDHAHFFFIPFSPDTSTRSLARLIRT
Subjt:  MASLITFSLLFSLSFLAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRT

Query:  LRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL
        LRSELPYWNRTLGADHFFLSS GVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL
Subjt:  LRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL

Query:  IEDPAFFMESE--PPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDE
        IEDP FFMESE  PPPPSER NYGERLGKSDFCLFEYGGGGVVLRIGEV+RYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNG KGIEGVKRVLRRVDE
Subjt:  IEDPAFFMESE--PPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDE

Query:  ESLVKMKRLGAAAAQHFVWVGP
        ESLVKMKRLGAAAAQHFVW  P
Subjt:  ESLVKMKRLGAAAAQHFVWVGP

XP_023542057.1 probable glycosyltransferase At5g03795 [Cucurbita pepo subsp. pepo]; e value 1.1e-157; identity 96.86%
Query:  MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVEL
        MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGVRY SDRNIVEL
Subjt:  MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVEL

Query:  KKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPAFFMESEPPP--PSERWNYGERLGKSDFCLF
        KKNAIQVSGGPVP+GNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDP FFMESEPPP  PSER NYGERLGKSDFCLF
Subjt:  KKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPAFFMESEPPP--PSERWNYGERLGKSDFCLF

Query:  EYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRLGAAAAQHFVWVGP
        EYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNG KGIEGVKRVLRRVDEESLVKMKRLGAAAAQHFVW  P
Subjt:  EYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRLGAAAAQHFVWVGP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KS95 Exostosin domain-containing protein; e value 7.0e-118; identity 69.91%
Query:  ASLITFSLLFSLSFL---------AAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTR
        +SLIT SLL S S L          + SPSPYLSPIF +NYN+MS  L+IFTYIPF P SF S AESLFYKSLL+SPY+THDPD AH FFIPFSP  STR
Subjt:  ASLITFSLLFSLSFL---------AAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTR

Query:  SLARLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVR
        SLARLIRTLR++LPYWNRTLGADHFFLSSSG+ Y SDRN+VELKKNAIQVS  PV  G FI HKD++LPPV  S+  S+    +  +ER+LGFVGYGWV+
Subjt:  SLARLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVR

Query:  DRVLVKELIEDPAFFMESEPP-PPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKR
           LVKELIEDP F MESEPP  PS    YG++L KSDFCLFEY GG  V  IGE LR+GCVPVVISDR IQDLPLMDV+RW++MAVFV G  GIEGVK+
Subjt:  DRVLVKELIEDPAFFMESEPP-PPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKR

Query:  VLRRVDEESLVKMKRLGAAAAQHFVWVGP
        VLRRVD E L +MK+LGAAAAQHFVW  P
Subjt:  VLRRVDEESLVKMKRLGAAAAQHFVWVGP

A0A1S3CF96 probable glycosyltransferase At3g07620; e value 7.0e-118; identity 69.88%
Query:  ASLITFSLLFSLSFL---AAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLI
        +SLIT +LL S S L      SPSPYLSPIF +NYN+MS  L+IFTYIPF   SF S AESLFY+SLL+SPYSTHDPD AH FF+PFSPD S RSL+RLI
Subjt:  ASLITFSLLFSLSFL---AAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLI

Query:  RTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVK
        RTLR++LPYWNRTLGADHFFLSSSG+ Y  DRN+VELKKNAIQVS  PVP G FI HKDI+LPPV  S+  S+       +ER+LGFVGYGWV+   LVK
Subjt:  RTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVK

Query:  ELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDE
        ELIEDP F MESEPPP      YG+++ KSDFCLFEYG G  V  IGE LR+GCVPVVISDR IQDLPLMD +RWQ+MAVFV G  GIEGVK+VLR VD 
Subjt:  ELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDE

Query:  ESLVKMKRLGAAAAQHFVWVGP
        E L +MKRLGAAAAQHFVW  P
Subjt:  ESLVKMKRLGAAAAQHFVWVGP

A0A5A7U559 Putative glycosyltransferase; e value 3.9e-108; identity 71.23%
Query:  MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVEL
        MS  L+IFTYIPF   SF S AESLFY+SLL+SPYSTHDPD AH FF+PFSPD S RSL+RLIRTLR++LPYWNRTLGADHFFLSSSG+ Y  DRN+VEL
Subjt:  MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVEL

Query:  KKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEY
        KKNAIQVS  PVP G FI HKDI+LPPV  S+  S+       +ER+LGFVGYGWV+   LVKELIEDP F MESEPPP      YG+++ KSDFCLFEY
Subjt:  KKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEY

Query:  GGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRLGAAAAQHFVWVGP
        G G  V  IGE LR+GCVPVVISDR IQDLPLMD +RWQ+MAVFV G  GIEGVK+VLR VD E L +MKRLGAAAAQHFVW  P
Subjt:  GGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRLGAAAAQHFVWVGP

A0A6J1FML5 probable glycosyltransferase At5g03795; e value 5.7e-160; identity 97.54%
Query:  MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVEL
        MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRS+LPYWNRTLGADHFFLSS GVRYASDRNIVEL
Subjt:  MSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIVEL

Query:  KKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEY
        KKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDP FFMESEPPPPSERWNYGERLGKSDFCLFEY
Subjt:  KKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEY

Query:  GGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRLGAAAAQHFVWVGP
        GGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNG KGIEGVKRVLRRVD ESLVKMKRLGAAAAQHFVW  P
Subjt:  GGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRLGAAAAQHFVWVGP

A0A6J1K6U0 probable glycosyltransferase At5g03795; e value 4.0e-174; identity 96.27%
Query:  MASLITFSLLFSLSFLAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRT
        MASLITFSLL SLS LAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTH+PDHAHFFFIPFSPDTSTRSLARLIRT
Subjt:  MASLITFSLLFSLSFLAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRT

Query:  LRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL
        LRSELPYWNRTLGADHFFLSS GVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL
Subjt:  LRSELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKEL

Query:  IEDPAFFMESE--PPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDE
        IEDP FFMESE  PPPPSER NYGERLGKSDFCLFEYGGGGVVLRIGEV+RYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNG KGIEGVKRVLRRVDE
Subjt:  IEDPAFFMESE--PPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDE

Query:  ESLVKMKRLGAAAAQHFVWVGP
        ESLVKMKRLGAAAAQHFVW  P
Subjt:  ESLVKMKRLGAAAAQHFVWVGP

SwissProt top hits (e value, % identity, alignment)
Q940Q8 Probable beta-1,4-xylosyltransferase IRX10L; e value 4.7e-18; identity 27.37%
Query:  AESLFYKSLLDSPYSTHDPDHAHFFFIPF------------SPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGV--------RYASDRNIVELK
        AE    + LL SP  T +P+ A +F++P              P  S R +   I+ + S  PYWNRT GADHFF+               A  R I+ L 
Subjt:  AESLFYKSLLDSPYSTHDPDHAHFFFIPF------------SPDTSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGV--------RYASDRNIVELK

Query:  KNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFV-------------GYGWVRDRVLVKELIED-PAFFMESEPPPPSERWNYG
        + A  V          +    IT+PP     +  S  IP      +  +              GY     R  V E  +D P F + +E P       Y 
Subjt:  KNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFV-------------GYGWVRDRVLVKELIED-PAFFMESEPPPPSERWNYG

Query:  ERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRL
        E + ++ FCL   G      R+ E + +GC+PV+I+D  +  LP  D + W+D+ VFV+  K +  +  +L  +  E +++ +RL
Subjt:  ERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRL

Q9FFN2 Probable glycosyltransferase At5g03795; e value 7.7e-21; identity 26.01%
Query:  PSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSF-PSPAESLF-------YKSLLDSPYSTHDPDHAHFFFIPFSP--------DTSTRSLARLIRTLR--
        P  + + +F R+Y  M    KI+ Y   +P  F   P +S++       Y+   D+ + T++PD AH F++PFS         + ++R  + +  T++  
Subjt:  PSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSF-PSPAESLF-------YKSLLDSPYSTHDPDHAHFFFIPFSP--------DTSTRSLARLIRTLR--

Query:  -----SELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWI--PAPATERVLGFVG---YGWVR
              + PYWNR++GADHF LS       +  +   L  N+I+          F   KD+++P +   +   +  +  P+P++  +L F     +G VR
Subjt:  -----SELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWI--PAPATERVLGFVG---YGWVR

Query:  DRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRV
          +L     +D    +    P  +   +Y + +  S FC+   G      RI E L  GCVPV+I+   +   P  DVL W+  +V V+ V+ I  +K +
Subjt:  DRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRV

Query:  LRRVDEESLVKMKRLGAAAAQHF
        L  +     ++M R      +HF
Subjt:  LRRVDEESLVKMKRLGAAAAQHF

Q9FZJ1 Probable beta-1,4-xylosyltransferase IRX10; e value 4.7e-18; identity 26.32%
Query:  AESLFYKSLLDSPYSTHDPDHAHFFFIPFSPD------------TSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGV--------RYASDRNIVELK
        AE   ++ LL SP  T +PD A +F+ P  P              S R +   I+ + S  PYWNRT GADHFF+               A +R I+ L 
Subjt:  AESLFYKSLLDSPYSTHDPDHAHFFFIPFSPD------------TSTRSLARLIRTLRSELPYWNRTLGADHFFLSSSGV--------RYASDRNIVELK

Query:  KNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFV-------------GYGWVRDRVLVKELIE-DPAFFMESEPPPPSERWNYG
        + A  V          +    IT+PP     +  + +IP      +  +              GY     R  V E  + +P F + ++ P       Y 
Subjt:  KNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFV-------------GYGWVRDRVLVKELIE-DPAFFMESEPPPPSERWNYG

Query:  ERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRL
        E + ++ FCL   G      R+ E + +GC+PV+I+D  +  LP  D + W+++ VFV   K +  +  +L  +  E +++ +RL
Subjt:  ERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRL

Q9LFP3 Probable glycosyltransferase At5g11130; e value 2.3e-17; identity 27.13%
Query:  SPYLSPI-FSRNYNAMSTTLKIFTYIPFK-PVSFPSPAESLF--YKSLLD------SPYSTHDPDHAHFFFIP----------------FSPDTSTRSLA
        S YL+   F +++  M    KI+TY   + P+    P  +++      +D      S +    P+ A  F+IP                ++ D     + 
Subjt:  SPYLSPI-FSRNYNAMSTTLKIFTYIPFK-PVSFPSPAESLF--YKSLLD------SPYSTHDPDHAHFFFIP----------------FSPDTSTRSLA

Query:  RLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIV--ELKKNAIQVSGGPVPVGNFISHKDITLPPV---FDSSEFSSSWIPAPATERVLGFVGYGW
          I  + +  PYWNR+ GADHFFLS     +A D + V  EL K+ I+          F   +D++LP +        F  +  P P   ++L F   G 
Subjt:  RLIRTLRSELPYWNRTLGADHFFLSSSGVRYASDRNIV--ELKKNAIQVSGGPVPVGNFISHKDITLPPV---FDSSEFSSSWIPAPATERVLGFVGYGW

Query:  VRD--RVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEG
          D  ++L +   E     +  E  P  +  NY + + K+ FCL   G      RI E L  GCVPV+I+D  +  LP  DVL W+  +V +  +  +  
Subjt:  VRD--RVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEG

Query:  VKRVLRRVDEESLVKMKRLGAAAAQHFV
        +K++L  + EE  + M+R      +HFV
Subjt:  VKRVLRRVDEESLVKMKRLGAAAAQHFV

Q9SSE8 Probable glycosyltransferase At3g07620; e value 1.4e-17; identity 28.43%
Query:  FSRNYNAMSTTLKIFTYIPFKPVSFP-------SPAESLFYKSLLDS--PYSTHDPDHAHFFFIPFS---------------PDTSTRSLARLIRTLRSE
        F R+Y  M    KI+ Y    P  F           E LF   + +    Y T DPD AH +F+PFS                    R +A  ++ +  +
Subjt:  FSRNYNAMSTTLKIFTYIPFKPVSFP-------SPAESLFYKSLLDS--PYSTHDPDHAHFFFIPFS---------------PDTSTRSLARLIRTLRSE

Query:  LPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPV----FDSSEFSSSWIPAPATERVLGFVG--YGWVRDRVLVK
         PYWN + G DHF LS     + +   + +L  N+I+V         F   KD   P +     D +  +    P   T     F G  +G +R  VL+ 
Subjt:  LPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPV----FDSSEFSSSWIPAPATERVLGFVG--YGWVRDRVLVK

Query:  ELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDE
           E     +  E  P  +  +Y E + KS FC+   G      R+ E +  GCVPV+IS+  +  LP  DVL W+  +V V+ VK I  +KR+L  + E
Subjt:  ELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDE

Query:  ESLVKM
        E  +++
Subjt:  ESLVKM

Arabidopsis top hits (e value, % identity, alignment)
AT4G32790.1 Exostosin family protein; e value 8.7e-20; identity 27.5%
Query:  IFSRNYNAMSTTLKIFTYIPFKPVSFPSP-------AESLFYKSLLDS-PYSTHDPDHAHFFFIPFSP-----------DTSTRSLARLIRT----LRSE
        +F R+Y  M   LK++ Y   K      P       +E  F K L  S  + T DP  AH F++PFS              S ++L + ++     + S+
Subjt:  IFSRNYNAMSTTLKIFTYIPFKPVSFPSP-------AESLFYKSLLDS-PYSTHDPDHAHFFFIPFSP-----------DTSTRSLARLIRT----LRSE

Query:  LPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLP--PVFDSSEFSSSWIPAPATER-VLGFVG---YGWVRDRVLVK
          +WN+T G+DHF ++     +A       + K    +    V  G F+  KD+ LP   +        +    P ++R +L F     +G++R  +L  
Subjt:  LPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLP--PVFDSSEFSSSWIPAPATER-VLGFVG---YGWVRDRVLVK

Query:  -ELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVD
             DP   + SE P    + +Y E +  S +C+   G      R+ E L Y CVPV+ISD  +   P  +VL W+  AVFV   K I  +K +L  + 
Subjt:  -ELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVD

Query:  EESLVKMKRLGAAAAQHFVW
        EE   +M+       +HF+W
Subjt:  EESLVKMKRLGAAAAQHFVW

AT4G38040.1 Exostosin family protein; e value 7.8e-29; identity 29.22%
Query:  YLSP-IFSRNYNAMSTTLKIFTYIPFKPVSF---------PSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFS------PDTSTRSLARLIRT----LRS
        Y SP  F  NY  M    K++ Y    P +F            +E  F++++ +S + T DPD A  FFIP S        TS  ++  +++     L +
Subjt:  YLSP-IFSRNYNAMSTTLKIFTYIPFKPVSF---------PSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFS------PDTSTRSLARLIRT----LRS

Query:  ELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATE----RVLGF-VGYGWVRDRVLVK
        + PYWNRTLGADHFF++   V   +      L KN I+V   P     FI HKD+ LP V          +PA   +      LGF  G+   + RV++ 
Subjt:  ELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATE----RVLGF-VGYGWVRDRVLVK

Query:  ELIEDPAFFMESEPPPPSERWN-------YGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKR
         + E+     ++E    + R N       Y +R  ++ FC+   G      RI + + YGC+PV++SD    DLP  D+L W+  AV +   + +  +K+
Subjt:  ELIEDPAFFMESEPPPPSERWN-------YGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKR

Query:  VLRRVDEESLVKMKRLGAAAAQHFVWVGPELK
        +L+ +     V +        +HF W  P +K
Subjt:  VLRRVDEESLVKMKRLGAAAAQHFVWVGPELK

AT5G03795.1 Exostosin family protein; e value 5.4e-22; identity 26.01%
Query:  PSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSF-PSPAESLF-------YKSLLDSPYSTHDPDHAHFFFIPFSP--------DTSTRSLARLIRTLR--
        P  + + +F R+Y  M    KI+ Y   +P  F   P +S++       Y+   D+ + T++PD AH F++PFS         + ++R  + +  T++  
Subjt:  PSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSF-PSPAESLF-------YKSLLDSPYSTHDPDHAHFFFIPFSP--------DTSTRSLARLIRTLR--

Query:  -----SELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWI--PAPATERVLGFVG---YGWVR
              + PYWNR++GADHF LS       +  +   L  N+I+          F   KD+++P +   +   +  +  P+P++  +L F     +G VR
Subjt:  -----SELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWI--PAPATERVLGFVG---YGWVR

Query:  DRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRV
          +L     +D    +    P  +   +Y + +  S FC+   G      RI E L  GCVPV+I+   +   P  DVL W+  +V V+ V+ I  +K +
Subjt:  DRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRV

Query:  LRRVDEESLVKMKRLGAAAAQHF
        L  +     ++M R      +HF
Subjt:  LRRVDEESLVKMKRLGAAAAQHF

AT5G11610.1 Exostosin family protein; e value 6.0e-21; identity 27.91%
Query:  IFSRNYNAMSTTLKIFTYIPFKPVSFPSP--------AESLFYKSLLDSP--YSTHDPDHAHFFFIPFSP----------DTSTRS-----LARLIRTLR
        IF R+Y  M  TLK++ Y       F  P        A   ++  L++S   + T DP  AH F+IPFS           D+ +R+     L   I  + 
Subjt:  IFSRNYNAMSTTLKIFTYIPFKPVSFPSP--------AESLFYKSLLDSP--YSTHDPDHAHFFFIPFSP----------DTSTRS-----LARLIRTLR

Query:  SELPYWNRTLGADHFFLSSSGVRYASDR----NIVELKKNAIQVSGGPVPVG-NFISHKDITLPPVFDSSEFSSSWI---PAPATERVLGFVG---YGWV
        S  P WNRT G+DHFF +         R    N +    NA         VG +F+  KD++LP    SS  + +       P+   +L F     +G+V
Subjt:  SELPYWNRTLGADHFFLSSSGVRYASDR----NIVELKKNAIQVSGGPVPVG-NFISHKDITLPPVFDSSEFSSSWI---PAPATERVLGFVG---YGWV

Query:  RDRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKR
        R  +L+ +    P    + +     +  +Y   + +S FC+   G      R+ E + YGCVPV+ISD  +   P +++L W+  AVFV   K I  +++
Subjt:  RDRVLVKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKR

Query:  VLRRVDEESLVKMKRLGAAAAQHFVW
        +L  +     V+M++      +HF+W
Subjt:  VLRRVDEESLVKMKRLGAAAAQHFVW

AT5G25820.1 Exostosin family protein; e value 1.3e-20; identity 26.4%
Query:  IFSRNYNAMSTTLKIFTY------IPFKPVSFPSPAESLFYKSLLDS---PYSTHDPDHAHFFFIPFS-----------PDTSTRSLARLIRT----LRS
        +F R+Y  M   LK++ Y      I   P+     A   ++ ++++S    + T DP  AH F++PFS              S R+L + ++     + +
Subjt:  IFSRNYNAMSTTLKIFTY------IPFKPVSFPSPAESLFYKSLLDS---PYSTHDPDHAHFFFIPFS-----------PDTSTRSLARLIRT----LRS

Query:  ELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVF--DSSEFSSSWIPAPATER-VLGFVG----YGWVRDRVL
        + P+WNRT GADHF  +     +A       + K+   +    V  G F+  KD +LP  F  D  +  S+     A +R +L F      +G++R  +L
Subjt:  ELPYWNRTLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVF--DSSEFSSSWIPAPATER-VLGFVG----YGWVRDRVL

Query:  -VKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRR
              +DP   +  + P      NY + +  S +C+   G      R+ E + Y CVPV+ISD  +   P  +VL W+  A+F+   K I  +K++L  
Subjt:  -VKELIEDPAFFMESEPPPPSERWNYGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRR

Query:  VDEESLVKMKRLGAAAAQHFVW
        + E     M+       +HF+W
Subjt:  VDEESLVKMKRLGAAAAQHFVW


Sequences
CDS sequence
ATGGCTTCCCTCATCACTTTCTCTCTCCTCTTTTCTCTCTCTTTTCTCGCCGCCGCCTCCCCTTCCCCGTATCTCTCCCCCATTTTCTCCAGAAACTACAACGCGATGTC
CACAACGTTAAAGATCTTCACCTACATCCCATTCAAACCTGTCTCCTTCCCTTCCCCTGCCGAATCGCTTTTCTACAAATCGCTTCTCGACAGCCCCTACTCTACTCACG
ATCCTGACCACGCGCACTTCTTTTTTATTCCTTTTTCTCCCGATACTTCCACGCGCTCTCTTGCGCGTTTGATTCGCACTCTCCGTTCTGAGTTGCCCTATTGGAATCGG
ACTCTTGGCGCTGATCACTTCTTTCTCTCGTCGTCTGGCGTTCGCTATGCTTCTGATCGGAACATTGTCGAATTGAAGAAGAATGCTATTCAGGTCTCTGGTGGGCCCGT
GCCGGTTGGGAATTTTATTTCTCATAAGGACATTACGTTGCCACCGGTTTTCGATTCGTCGGAGTTTTCTTCTTCTTGGATTCCGGCTCCGGCGACGGAGAGGGTGTTGG
GTTTCGTCGGGTATGGGTGGGTGAGAGATCGGGTTTTGGTGAAGGAGTTGATTGAGGATCCTGCGTTTTTCATGGAGTCTGAGCCGCCGCCGCCGTCGGAGAGGTGGAAC
TACGGGGAGAGATTGGGGAAAAGTGATTTTTGTTTGTTTGAATACGGCGGTGGGGGTGTTGTTTTGAGGATTGGGGAGGTGTTGCGATATGGGTGTGTGCCGGTGGTTAT
TTCTGACCGTCCGATTCAGGACTTGCCGTTGATGGACGTGTTACGGTGGCAGGACATGGCGGTGTTCGTCAATGGCGTCAAAGGAATTGAAGGAGTGAAGAGAGTATTGA
GGCGCGTGGATGAGGAGAGTCTCGTAAAAATGAAGAGACTGGGTGCGGCGGCGGCACAGCATTTTGTGTGGGTGGGCCCAGAGTTGAAAAATGGAAGCTACGTTTGCATG
GGGTTTGGACTTTTAATTAATTTGAATATATGA
mRNA sequence
ATGGCTTCCCTCATCACTTTCTCTCTCCTCTTTTCTCTCTCTTTTCTCGCCGCCGCCTCCCCTTCCCCGTATCTCTCCCCCATTTTCTCCAGAAACTACAACGCGATGTC
CACAACGTTAAAGATCTTCACCTACATCCCATTCAAACCTGTCTCCTTCCCTTCCCCTGCCGAATCGCTTTTCTACAAATCGCTTCTCGACAGCCCCTACTCTACTCACG
ATCCTGACCACGCGCACTTCTTTTTTATTCCTTTTTCTCCCGATACTTCCACGCGCTCTCTTGCGCGTTTGATTCGCACTCTCCGTTCTGAGTTGCCCTATTGGAATCGG
ACTCTTGGCGCTGATCACTTCTTTCTCTCGTCGTCTGGCGTTCGCTATGCTTCTGATCGGAACATTGTCGAATTGAAGAAGAATGCTATTCAGGTCTCTGGTGGGCCCGT
GCCGGTTGGGAATTTTATTTCTCATAAGGACATTACGTTGCCACCGGTTTTCGATTCGTCGGAGTTTTCTTCTTCTTGGATTCCGGCTCCGGCGACGGAGAGGGTGTTGG
GTTTCGTCGGGTATGGGTGGGTGAGAGATCGGGTTTTGGTGAAGGAGTTGATTGAGGATCCTGCGTTTTTCATGGAGTCTGAGCCGCCGCCGCCGTCGGAGAGGTGGAAC
TACGGGGAGAGATTGGGGAAAAGTGATTTTTGTTTGTTTGAATACGGCGGTGGGGGTGTTGTTTTGAGGATTGGGGAGGTGTTGCGATATGGGTGTGTGCCGGTGGTTAT
TTCTGACCGTCCGATTCAGGACTTGCCGTTGATGGACGTGTTACGGTGGCAGGACATGGCGGTGTTCGTCAATGGCGTCAAAGGAATTGAAGGAGTGAAGAGAGTATTGA
GGCGCGTGGATGAGGAGAGTCTCGTAAAAATGAAGAGACTGGGTGCGGCGGCGGCACAGCATTTTGTGTGGGTGGGCCCAGAGTTGAAAAATGGAAGCTACGTTTGCATG
GGGTTTGGACTTTTAATTAATTTGAATATATGA
Protein sequence
MASLITFSLLFSLSFLAAASPSPYLSPIFSRNYNAMSTTLKIFTYIPFKPVSFPSPAESLFYKSLLDSPYSTHDPDHAHFFFIPFSPDTSTRSLARLIRTLRSELPYWNR
TLGADHFFLSSSGVRYASDRNIVELKKNAIQVSGGPVPVGNFISHKDITLPPVFDSSEFSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPAFFMESEPPPPSERWN
YGERLGKSDFCLFEYGGGGVVLRIGEVLRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGVKGIEGVKRVLRRVDEESLVKMKRLGAAAAQHFVWVGPELKNGSYVCM
GFGLLINLNI
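
The protein sequence above corresponds to a translation of the CDS sequence. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1) and reading from the first base in frame. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove whitespace and line breaks so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGCTTCCCTCATCACT"))
```

Pasting the full CDS into `translate` allows a direct comparison with the listed protein sequence.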