CuGenDBv2

Spg009661 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg009661
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Exostosin domain-containing protein
Genome location: scaffold7:9220450..9221397
RNA-Seq Expression: Spg009661
Synteny: Spg009661
Gene Ontology terms:
GO:0006486 - protein glycosylation (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains:
IPR004263 - Exostosin-like
IPR040911 - Exostosin, GT47 domain
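
The genome location above uses a "seqid:start..end" notation. The following is a minimal sketch of how such a string could be split into its parts and the span length computed. The helper names parse_location and span_length are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

# Hypothetical helper pattern for "seqid:start..end" location strings.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold7:9220450..9221397")
print(seqid, start, end, span_length(start, end))
# prints: scaffold7 9220450 9221397 948
```

Under the inclusive-coordinate assumption the span is 948 bp.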


Homology
GenBank top hits | e value | % identity
KAG6600032.1 putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. sororia] | 4.9e-160 | 88.01
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH+PD AH FF+PF PD STRSL+RLIRTLR+ELPYWNRTLGADHFFLSS+GV +ASDRN+VEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEY
        KKNAIQVSG PVP G F  HKDITLPPV  S E SSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDP FFMESEPPP SERWNYGERL KSDFCLFEY
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEY

Query:  GGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLW
        GGG  V  IGE LRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNG  GIEGVKRVLRRVDEESL KMK+LGAAAAQHFVWNSPPQPLDAFNTVAYQLW
Subjt:  GGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLW

Query:  LRRHAVRYAERKEWAQS
        LRRH +RYAERKEWAQS
Subjt:  LRRHAVRYAERKEWAQS

XP_022941986.1 probable glycosyltransferase At5g03795 [Cucurbita moschata] | 2.2e-160 | 88.01
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH+PD AH FF+PF PD STRSL+RLIRTLR++LPYWNRTLGADHFFLSS GV +ASDRN+VEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEY
        KKNAIQVSG PVP G F  HKDITLPPV  S E SSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPP SERWNYGERL KSDFCLFEY
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEY

Query:  GGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLW
        GGG  V  IGE LRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGG GIEGVKRVLRRVD ESL KMK+LGAAAAQHFVWNSPPQPLDAFNTVAYQLW
Subjt:  GGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLW

Query:  LRRHAVRYAERKEWAQS
        LRRH +RYAERKEWAQS
Subjt:  LRRHAVRYAERKEWAQS

XP_022995754.1 probable glycosyltransferase At5g03795 [Cucurbita maxima] | 5.9e-158 | 87.46
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH PD AH FF+PF PD STRSL+RLIRTLR+ELPYWNRTLGADHFFLSS GV +ASDRN+VEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESE--PPPSSERWNYGERLAKSDFCLF
        KKNAIQVSG PVP G F  HKDITLPPV  S E SSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESE  PPP SER NYGERL KSDFCLF
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESE--PPPSSERWNYGERLAKSDFCLF

Query:  EYGGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQ
        EYGGG  V  IGE +RYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGG GIEGVKRVLRRVDEESL KMK+LGAAAAQHFVWNSPPQPLDAFNTVAYQ
Subjt:  EYGGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQ

Query:  LWLRRHAVRYAERKEWAQS
        LWLRRH +RYAERKEWAQS
Subjt:  LWLRRHAVRYAERKEWAQS

XP_023542057.1 probable glycosyltransferase At5g03795 [Cucurbita pepo subsp. pepo] | 1.2e-158 | 87.77
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH+PD AH FF+PF PD STRSL+RLIRTLR+ELPYWNRTLGADHFFLSS+GV + SDRN+VEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPS--SERWNYGERLAKSDFCLF
        KKNAIQVSG PVP G F  HKDITLPPV  S E SSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPS  SER NYGERL KSDFCLF
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPS--SERWNYGERLAKSDFCLF

Query:  EYGGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQ
        EYGGG  V  IGE LRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGG GIEGVKRVLRRVDEESL KMK+LGAAAAQHFVWNSPPQPLDAFNTVAYQ
Subjt:  EYGGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQ

Query:  LWLRRHAVRYAERKEWAQS
        LWLRRH +RYAERKEWAQS
Subjt:  LWLRRHAVRYAERKEWAQS

XP_038889277.1 probable glycosyltransferase At5g03795 [Benincasa hispida] | 1.9e-140 | 77.74
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MS  LRIFTYIP +PFSF SPAESLFYKSLL+SPY+TH+PD AHLFF+PF PDLSTRSL RLIRTLRT+LPYWNRTLGADHFFLSSAGVG++S+RNVVEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPA---PATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCL
        KKNAIQVS +PVP GKF PHKDI+LPPVSG       W+P       ERVLGFVGYGWV+   LV ELIEDPEF MESEPP ++   +YGE+LAKSDFCL
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPA---PATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCL

Query:  FEYGGG-VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQ
        FEYGGG VSGIGEALR+GC+PVVIS RPIQDLPLMDV+RWQ+MAVF+ G  GI+GVK+VLR VD+ESL +MK+LGAAAAQHF WNSPPQPLDAFNTVA+Q
Subjt:  FEYGGG-VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQ

Query:  LWLRRHAVRYAERKEWAQS
        LW+RRHAVRYAER+EWAQS
Subjt:  LWLRRHAVRYAERKEWAQS

TrEMBL top hits | e value | % identity
A0A0A0KS95 Exostosin domain-containing protein | 7.4e-138 | 77.67
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MS  LRIFTYIP  PFSF S AESLFYKSLL+SPY+TH+PD AHLFF+PF P +STRSL+RLIRTLRT+LPYWNRTLGADHFFLSS+G+G+ SDRNVVEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSG--SPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLF
        KKNAIQVS +PV PGKF PHKD++LPPVS   S  VS+S +    +ER+LGFVGYGWV+   LVKELIEDPEF MESEPP +     YG++LAKSDFCLF
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSG--SPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLF

Query:  EY-GGGVSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQL
        EY GG VSGIGEALR+GCVPVVISDR IQDLPLMDV+RW++MAVFV GGGGIEGVK+VLRRVD E L +MKKLGAAAAQHFVWNSPPQPLDAFNTVAYQL
Subjt:  EY-GGGVSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQL

Query:  WLRRHAVRYAERKEWAQS
        W+RRHAVRYA+R+EWAQ+
Subjt:  WLRRHAVRYAERKEWAQS

A0A1S3CF96 probable glycosyltransferase At3g07620 | 3.3e-138 | 77.53
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MS  LRIFTYIP   FSF S AESLFY+SLL+SPYSTH+PD AHLFF+PF PD+S RSLSRLIRTLRT+LPYWNRTLGADHFFLSS+G+G+  DRNVVEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEY
        KKNAIQVS +PVPPGKF PHKDI+LPPVS    VS+       +ER+LGFVGYGWV+   LVKELIEDPEF MESEPPP+     YG+++AKSDFCLFEY
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEY

Query:  GGG-VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWL
        G G VSGIGEALR+GCVPVVISDR IQDLPLMD +RWQ+MAVFV GGGGIEGVK+VLR VD E L +MK+LGAAAAQHFVWNSPPQPLDAFNTVAYQLWL
Subjt:  GGG-VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWL

Query:  RRHAVRYAERKEWAQS
        RRHAVRYA+R+EWAQ+
Subjt:  RRHAVRYAERKEWAQS

A0A5A7U559 Putative glycosyltransferase | 3.3e-138 | 77.53
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MS  LRIFTYIP   FSF S AESLFY+SLL+SPYSTH+PD AHLFF+PF PD+S RSLSRLIRTLRT+LPYWNRTLGADHFFLSS+G+G+  DRNVVEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEY
        KKNAIQVS +PVPPGKF PHKDI+LPPVS    VS+       +ER+LGFVGYGWV+   LVKELIEDPEF MESEPPP+     YG+++AKSDFCLFEY
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEY

Query:  GGG-VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWL
        G G VSGIGEALR+GCVPVVISDR IQDLPLMD +RWQ+MAVFV GGGGIEGVK+VLR VD E L +MK+LGAAAAQHFVWNSPPQPLDAFNTVAYQLWL
Subjt:  GGG-VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWL

Query:  RRHAVRYAERKEWAQS
        RRHAVRYA+R+EWAQ+
Subjt:  RRHAVRYAERKEWAQS

A0A6J1FML5 probable glycosyltransferase At5g03795 | 1.1e-160 | 88.01
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH+PD AH FF+PF PD STRSL+RLIRTLR++LPYWNRTLGADHFFLSS GV +ASDRN+VEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEY
        KKNAIQVSG PVP G F  HKDITLPPV  S E SSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPP SERWNYGERL KSDFCLFEY
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEY

Query:  GGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLW
        GGG  V  IGE LRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGG GIEGVKRVLRRVD ESL KMK+LGAAAAQHFVWNSPPQPLDAFNTVAYQLW
Subjt:  GGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLW

Query:  LRRHAVRYAERKEWAQS
        LRRH +RYAERKEWAQS
Subjt:  LRRHAVRYAERKEWAQS

A0A6J1K6U0 probable glycosyltransferase At5g03795 | 2.9e-158 | 87.46
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH PD AH FF+PF PD STRSL+RLIRTLR+ELPYWNRTLGADHFFLSS GV +ASDRN+VEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESE--PPPSSERWNYGERLAKSDFCLF
        KKNAIQVSG PVP G F  HKDITLPPV  S E SSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESE  PPP SER NYGERL KSDFCLF
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESE--PPPSSERWNYGERLAKSDFCLF

Query:  EYGGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQ
        EYGGG  V  IGE +RYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGG GIEGVKRVLRRVDEESL KMK+LGAAAAQHFVWNSPPQPLDAFNTVAYQ
Subjt:  EYGGG--VSGIGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQ

Query:  LWLRRHAVRYAERKEWAQS
        LWLRRH +RYAERKEWAQS
Subjt:  LWLRRHAVRYAERKEWAQS

SwissProt top hits | e value | % identity
Q3E7Q9 Probable glycosyltransferase At5g25310 | 2.2e-22 | 29.21
Query:  YSTHNPDDAHLFFLPFPPDLSTRSL--------------SRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPH
        + T++P+ A+++FLPF      R L              S  IR + T  P+WNRT GADHF L+    G  + +   +L   +I+V         F P 
Subjt:  YSTHNPDDAHLFFLPFPPDLSTRSL--------------SRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPH

Query:  KDITLPPV-----SGSPEVSSSWIPAPATERVLGFVG---YGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--IGEA
        KD+TLP +         ++  S   + +    LGF     +G VR  +L      D +  +    P   +  NY + +  S FC    G  V+   + EA
Subjt:  KDITLPPV-----SGSPEVSSSWIPAPATERVLGFVG---YGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--IGEA

Query:  LRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRR
        +   C+PV++S   +  LP  DVLRW+  +V V+    I  +K +L  +  E    +K       +HF  N PPQ  DAF+   + +WLRR
Subjt:  LRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRR

Q3E9A4 Probable glycosyltransferase At5g20260 | 2.0e-23 | 28.67
Query:  SPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTE----------------LPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGK
        SP++ +NP++AH F LP         L R + T   E                 PYWNR+LGADHF++S          +  EL KN I+V         
Subjt:  SPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTE----------------LPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGK

Query:  FTPHKDITLPPVS------GSPEVSSSWIPAPATERVLGFV---GYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG-
        F P +D+++P ++      G P +S S   +     +L F     +G++R R+L++   +  E     E    ++  +Y + +A + FCL   G  V+  
Subjt:  FTPHKDITLPPVS------GSPEVSSSWIPAPATERVLGFV---GYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG-

Query:  -IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVR
         +  A+  GCVPV+ISD     LP  DVL W    + V     I  +K +L+ +       +++      +HFV N P QP D    + + +WLRR  +R
Subjt:  -IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVR

Q9FFN2 Probable glycosyltransferase At5g03795 | 2.5e-26 | 27.22
Query:  MSTTLRIFTYIPLKPFSF-PSPAESLF-------YKSLLDSPYSTHNPDDAHLFFLPFPP--------DLSTRSLSRLIRTLR-------TELPYWNRTL
        M    +I+ Y   +P  F   P +S++       Y+   D+ + T+NPD AH+F+LPF          + ++R  S +  T++        + PYWNR++
Subjt:  MSTTLRIFTYIPLKPFSF-PSPAESLF-------YKSLLDSPYSTHNPDDAHLFFLPFPP--------DLSTRSLSRLIRTLR-------TELPYWNRTL

Query:  GADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWI--PAPATERVLGFVG---YGWVRDRVLVKELIEDPEFF
        GADHF LS    G  +  +   L  N+I+         +F P KD+++P ++      +  +  P+P++  +L F     +G VR  +L     +D +  
Subjt:  GADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWI--PAPATERVLGFVG---YGWVRDRVLVKELIEDPEFF

Query:  MESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKL
        +    P  +   +Y + +  S FC+   G  V+   I EAL  GCVPV+I+   +   P  DVL W+  +V V+    I  +K +L  +      +M + 
Subjt:  MESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKL

Query:  GAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVRYAE
             +HF  NSP +  D F+ + + +W+RR  V+  E
Subjt:  GAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVRYAE

Q9LFP3 Probable glycosyltransferase At5g11130 | 1.2e-23 | 28.76
Query:  DSPYSTHNPDDAHLFFLP----------------FPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVV--ELKKNAIQVSGWPVP
        +S +   +P++A +F++P                +  D     +   I  +    PYWNR+ GADHFFLS      A D + V  EL K+ I+       
Subjt:  DSPYSTHNPDDAHLFFLP----------------FPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVV--ELKKNAIQVSGWPVP

Query:  PGKFTPHKDITLPPVSGSPEVSSSWI---PAPATERVLGFVGYGWVRD--RVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--
           FTP +D++LP ++  P     ++     P   ++L F   G   D  ++L +   E  +  +  E  P +   NY + + K+ FCL   G  V+   
Subjt:  PGKFTPHKDITLPPVSGSPEVSSSWI---PAPATERVLGFVGYGWVRD--RVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--

Query:  IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVR
        I E+L  GCVPV+I+D  +  LP  DVL W+  +V +     +  +K++L  + EE    M++      +HFV N P +P D  + + + +WLRR  VR
Subjt:  IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVR

Q9SSE8 Probable glycosyltransferase At3g07620 | 1.4e-24 | 28.67
Query:  YSTHNPDDAHLFFLPF---------------PPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTP
        Y T +PD AH++FLPF                  +  R ++  ++ +  + PYWN + G DHF LS    GH +   V +L  N+I+V         F P
Subjt:  YSTHNPDDAHLFFLPF---------------PPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTP

Query:  HKDITLPPV---SGSPEVSSSWIPAPATERVLGFVG--YGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALR
         KD   P +   +G     +  +   +   +  F G  +G +R  +L     +D +  +    P   +  +Y E + KS FC+   G  V+   + EA+ 
Subjt:  HKDITLPPV---SGSPEVSSSWIPAPATERVLGFVG--YGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALR

Query:  YGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVR
         GCVPV+IS+  +  LP  DVL W+  +V V+    I  +KR+L  + EE   ++ +      +H + N PP+  D FN + + +WLRR  V+
Subjt:  YGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVR

Arabidopsis top hits | e value | % identity
AT3G07620.1 Exostosin family protein | 9.8e-26 | 28.67
Query:  YSTHNPDDAHLFFLPF---------------PPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTP
        Y T +PD AH++FLPF                  +  R ++  ++ +  + PYWN + G DHF LS    GH +   V +L  N+I+V         F P
Subjt:  YSTHNPDDAHLFFLPF---------------PPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTP

Query:  HKDITLPPV---SGSPEVSSSWIPAPATERVLGFVG--YGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALR
         KD   P +   +G     +  +   +   +  F G  +G +R  +L     +D +  +    P   +  +Y E + KS FC+   G  V+   + EA+ 
Subjt:  HKDITLPPV---SGSPEVSSSWIPAPATERVLGFVG--YGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALR

Query:  YGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVR
         GCVPV+IS+  +  LP  DVL W+  +V V+    I  +KR+L  + EE   ++ +      +H + N PP+  D FN + + +WLRR  V+
Subjt:  YGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVR

AT4G16745.1 Exostosin family protein | 1.1e-24 | 27.44
Query:  LKPFSFPSPAESLFYKSLLDSPYS----------------THNPDDAHLFFLPF-----------PPDLSTRSLSRLIR----TLRTELPYWNRTLGADH
        LK + +P   + +F++  L+  Y+                T NP+ AHLF++P+           P   + + LS  +R     L  + P+WNRT G+DH
Subjt:  LKPFSFPSPAESLFYKSLLDSPYS----------------THNPDDAHLFFLPF-----------PPDLSTRSLSRLIR----TLRTELPYWNRTLGADH

Query:  FFLSSAGVGHASDRNVVELKKNAIQ-VSGWPVPPGKFTPHKDITLPPVS----GSPEVSSSWIPAPATERVLGFVG---YGWVRDRVLVKELIEDPEFFM
        F ++    G  +     ELK+NAI+ +    +  G F P KD++LP  S    G P  +       +   +L F     +G VR ++L     +D +  +
Subjt:  FFLSSAGVGHASDRNVVELKKNAIQ-VSGWPVPPGKFTPHKDITLPPVS----GSPEVSSSWIPAPATERVLGFVG---YGWVRDRVLVKELIEDPEFFM

Query:  ESEPPPS-SERWNYGERLAKSDFCLFEYGGGVSG--IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKL
            P + + +  Y + +  S +CL   G  V+   I EA+ Y CVPVVI+D  +  LP  DVL W   +V V     I  +K +L  +      KM+  
Subjt:  ESEPPPS-SERWNYGERLAKSDFCLFEYGGGVSG--IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKL

Query:  GAAAAQHFVWNSPPQPLDAFNTVAYQLW
             +HF+W+  P+  D F+ + + +W
Subjt:  GAAAAQHFVWNSPPQPLDAFNTVAYQLW

AT4G38040.1 Exostosin family protein | 1.6e-36 | 31.39
Query:  AESLFYKSLLDSPYSTHNPDDAHLFFLPFP------PDLSTRSLSRLIRT----LRTELPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWP
        +E  F++++ +S + T +PD+A LFF+P           S  +++ +++     L  + PYWNRTLGADHFF++   VG  +      L KN I+V   P
Subjt:  AESLFYKSLLDSPYSTHNPDDAHLFFLPFP------PDLSTRSLSRLIRT----LRTELPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWP

Query:  VPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATE----RVLGF-VGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWN-------YGERLAKSDFCLFE
             F PHKD+ LP V     +    +PA   +      LGF  G+   + RV++  + E+     ++E   S+ R N       Y +R  ++ FC+  
Subjt:  VPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATE----RVLGF-VGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWN-------YGERLAKSDFCLFE

Query:  YGGGVSG--IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQL
         G  V+   I +++ YGC+PV++SD    DLP  D+L W+  AV +     +  +K++L+ +       +        +HF WNSPP   DAF+ + Y+L
Subjt:  YGGGVSG--IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQL

Query:  WLRRHAVRY
        WLR H V+Y
Subjt:  WLRRHAVRY

AT5G03795.1 Exostosin family protein | 1.8e-27 | 27.22
Query:  MSTTLRIFTYIPLKPFSF-PSPAESLF-------YKSLLDSPYSTHNPDDAHLFFLPFPP--------DLSTRSLSRLIRTLR-------TELPYWNRTL
        M    +I+ Y   +P  F   P +S++       Y+   D+ + T+NPD AH+F+LPF          + ++R  S +  T++        + PYWNR++
Subjt:  MSTTLRIFTYIPLKPFSF-PSPAESLF-------YKSLLDSPYSTHNPDDAHLFFLPFPP--------DLSTRSLSRLIRTLR-------TELPYWNRTL

Query:  GADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWI--PAPATERVLGFVG---YGWVRDRVLVKELIEDPEFF
        GADHF LS    G  +  +   L  N+I+         +F P KD+++P ++      +  +  P+P++  +L F     +G VR  +L     +D +  
Subjt:  GADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSGSPEVSSSWI--PAPATERVLGFVG---YGWVRDRVLVKELIEDPEFF

Query:  MESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKL
        +    P  +   +Y + +  S FC+   G  V+   I EAL  GCVPV+I+   +   P  DVL W+  +V V+    I  +K +L  +      +M + 
Subjt:  MESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKL

Query:  GAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVRYAE
             +HF  NSP +  D F+ + + +W+RR  V+  E
Subjt:  GAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVRYAE

AT5G11130.1 Exostosin family protein | 8.3e-25 | 28.76
Query:  DSPYSTHNPDDAHLFFLP----------------FPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVV--ELKKNAIQVSGWPVP
        +S +   +P++A +F++P                +  D     +   I  +    PYWNR+ GADHFFLS      A D + V  EL K+ I+       
Subjt:  DSPYSTHNPDDAHLFFLP----------------FPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVV--ELKKNAIQVSGWPVP

Query:  PGKFTPHKDITLPPVSGSPEVSSSWI---PAPATERVLGFVGYGWVRD--RVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--
           FTP +D++LP ++  P     ++     P   ++L F   G   D  ++L +   E  +  +  E  P +   NY + + K+ FCL   G  V+   
Subjt:  PGKFTPHKDITLPPVSGSPEVSSSWI---PAPATERVLGFVGYGWVRD--RVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSG--

Query:  IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVR
        I E+L  GCVPV+I+D  +  LP  DVL W+  +V +     +  +K++L  + EE    M++      +HFV N P +P D  + + + +WLRR  VR
Subjt:  IGEALRYGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVR
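
In the alignments above, the unlabeled row between each Query and Subjt pair marks identical positions with the residue letter, conservative substitutions with "+", and everything else with a space. The sketch below shows one way a percent-identity figure could be derived from that middle row. It is an illustration only: the function name percent_identity is hypothetical, the denominator is assumed to be the full alignment length including gap columns, and the identity values listed on this page may have been computed differently.

```python
def percent_identity(query_rows, midline_rows):
    """Identical columns / alignment columns * 100.

    query_rows   -- aligned query segments (gaps as '-'), labels removed
    midline_rows -- matching middle rows, same starting column as the query
    ASSUMPTION: gap columns count toward the alignment length.
    """
    aligned = 0
    identical = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing spaces are often lost in plain-text dumps, so pad them back.
        midline = midline.ljust(len(query))[:len(query)]
        aligned += len(query)
        # A letter in the middle row marks an identical residue.
        identical += sum(1 for ch in midline if ch.isalpha())
    if aligned == 0:
        raise ValueError("empty alignment")
    return 100.0 * identical / aligned


# Toy segments modelled on the start of the first GenBank hit.
print(percent_identity(["MSTTLRIF"], ["MSTTL+IF"]))  # 7 of 8 columns -> 87.5
```

Leading whitespace matters here: the middle row must be sliced at the same column where the sequence starts on the Query row, not stripped, or leading mismatches are lost.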


Sequences
CDS sequence
ATGTCCACAACCTTAAGAATCTTCACCTACATCCCACTCAAACCCTTCTCCTTCCCTTCTCCCGCCGAATCGCTTTTCTACAAATCGCTTCTCGACAGCCCCTACTCAAC
TCACAACCCCGACGATGCCCACTTGTTCTTCCTCCCTTTTCCTCCCGATCTCTCCACGCGCTCCCTCTCGCGTTTGATCCGCACGCTCCGTACGGAGCTGCCGTACTGGA
ATCGGACTCTCGGCGCCGACCACTTCTTTCTGTCGTCGGCCGGCGTTGGCCATGCGTCTGACCGGAACGTCGTCGAGCTGAAGAAGAACGCCATTCAGGTCTCTGGATGG
CCGGTGCCGCCCGGGAAGTTTACTCCTCATAAGGACATTACTCTGCCGCCGGTTTCCGGTTCGCCGGAAGTTTCTTCTTCCTGGATTCCGGCGCCGGCGACGGAGAGGGT
GCTGGGTTTCGTCGGGTATGGGTGGGTGAGGGATCGGGTTTTGGTGAAGGAGTTGATTGAGGATCCTGAGTTTTTTATGGAGTCGGAGCCGCCGCCGTCGTCGGAGCGGT
GGAATTACGGGGAGAGATTGGCGAAAAGCGACTTTTGTTTGTTCGAATACGGCGGCGGTGTTTCGGGGATTGGGGAGGCTTTGCGATATGGGTGTGTGCCGGTGGTGATT
TCTGACCGTCCGATTCAGGACTTGCCGTTGATGGACGTGCTCCGGTGGCAGGACATGGCGGTGTTCGTCAACGGCGGTGGCGGAATAGAAGGAGTGAAGAGGGTATTAAG
GCGCGTGGACGAGGAGAGTCTTACGAAAATGAAGAAATTAGGTGCGGCGGCGGCACAGCATTTTGTGTGGAACTCGCCGCCCCAGCCGTTGGATGCTTTCAATACGGTGG
CGTATCAGCTTTGGCTGAGAAGGCACGCCGTTAGATATGCCGAGAGGAAGGAGTGGGCCCAGAGTTGA
mRNA sequence
ATGTCCACAACCTTAAGAATCTTCACCTACATCCCACTCAAACCCTTCTCCTTCCCTTCTCCCGCCGAATCGCTTTTCTACAAATCGCTTCTCGACAGCCCCTACTCAAC
TCACAACCCCGACGATGCCCACTTGTTCTTCCTCCCTTTTCCTCCCGATCTCTCCACGCGCTCCCTCTCGCGTTTGATCCGCACGCTCCGTACGGAGCTGCCGTACTGGA
ATCGGACTCTCGGCGCCGACCACTTCTTTCTGTCGTCGGCCGGCGTTGGCCATGCGTCTGACCGGAACGTCGTCGAGCTGAAGAAGAACGCCATTCAGGTCTCTGGATGG
CCGGTGCCGCCCGGGAAGTTTACTCCTCATAAGGACATTACTCTGCCGCCGGTTTCCGGTTCGCCGGAAGTTTCTTCTTCCTGGATTCCGGCGCCGGCGACGGAGAGGGT
GCTGGGTTTCGTCGGGTATGGGTGGGTGAGGGATCGGGTTTTGGTGAAGGAGTTGATTGAGGATCCTGAGTTTTTTATGGAGTCGGAGCCGCCGCCGTCGTCGGAGCGGT
GGAATTACGGGGAGAGATTGGCGAAAAGCGACTTTTGTTTGTTCGAATACGGCGGCGGTGTTTCGGGGATTGGGGAGGCTTTGCGATATGGGTGTGTGCCGGTGGTGATT
TCTGACCGTCCGATTCAGGACTTGCCGTTGATGGACGTGCTCCGGTGGCAGGACATGGCGGTGTTCGTCAACGGCGGTGGCGGAATAGAAGGAGTGAAGAGGGTATTAAG
GCGCGTGGACGAGGAGAGTCTTACGAAAATGAAGAAATTAGGTGCGGCGGCGGCACAGCATTTTGTGTGGAACTCGCCGCCCCAGCCGTTGGATGCTTTCAATACGGTGG
CGTATCAGCTTTGGCTGAGAAGGCACGCCGTTAGATATGCCGAGAGGAAGGAGTGGGCCCAGAGTTGA
Protein sequence
MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPDLSTRSLSRLIRTLRTELPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGW
PVPPGKFTPHKDITLPPVSGSPEVSSSWIPAPATERVLGFVGYGWVRDRVLVKELIEDPEFFMESEPPPSSERWNYGERLAKSDFCLFEYGGGVSGIGEALRYGCVPVVI
SDRPIQDLPLMDVLRWQDMAVFVNGGGGIEGVKRVLRRVDEESLTKMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS
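
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record itself does not state which table applies. The names CODON_TABLE and translate are illustrative and not part of any CuGenDB tooling.

```python
# Standard genetic code (NCBI table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons and last six codons of the CDS listed above.
print(translate("ATGTCCACAACCTTAAGAATCTTCACCTAC"))  # MSTTLRIFTY
print(translate("GAGTGGGCCCAGAGTTGA"))              # EWAQS
```

Both fragments agree with the start (MSTTLRIFTY) and end (EWAQS) of the protein sequence listed above. Passing the full CDS, with its line breaks, to translate would allow the same comparison over the whole sequence.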