CuGenDBv2

Lsi04G009120 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G009120
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Exostosin domain-containing protein
Genome location: chr04:9197915..9198856
RNA-Seq Expression: Lsi04G009120
Synteny: Lsi04G009120
Gene Ontology terms:
GO:0006486 - protein glycosylation (biological process)
GO:0009396 - folic acid-containing compound biosynthetic process (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0016881 - acid-amino acid ligase activity (molecular function)
InterPro domains:
IPR004263 - Exostosin-like
IPR040911 - Exostosin, GT47 domain
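
The genome location in this record follows a chromosome:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state its convention), the span length can be computed as follows. The names parse_location and span_length are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumes 1-based, inclusive coordinates (not confirmed by the record).
    return abs(end - start) + 1


chrom, start, end = parse_location("chr04:9197915..9198856")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the span is 942 bp, which would be consistent with an intronless coding sequence of 313 codons plus a stop codon.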


Homology
GenBank top hits: e value, %identity, alignment
KAA0050374.1 putative glycosyltransferase [Cucumis melo var. makuwa] (e value: 8.8e-146; %identity: 82.75)
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MSA LRIFTYIPF  FSFSS AESLFY+SLL SPY+THDPDQAHLFF+PFSPD+S RSL+RLIRTLRT+LPYWNRTLGADHFFLSSSG+GY  DRNVVEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG
        KKNAIQVSSFPVP  KFIPHKDISLPPVS  V   V  +  V +R+LGFVGYGWV+GLSLVKELIED EFLMESEPP T S Y +K+AKSDFCLFEYG G
Subjt:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG

Query:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH
        DVSGIGEALRFGCVPVVISDRSIQ+LPLMD +RWQEMAVFVGGGGGIEGVKKVLR +D + L +MKRLGAAAAQHF+WNS PQPLDAFNTVAYQLW+RRH
Subjt:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH

Query:  TVRYAERREWAQN
         VRYA+RREWAQN
Subjt:  TVRYAERREWAQN
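
The %identity figures in these hit tables can be related to the alignment rows. The following is a rough sketch, assuming the usual BLAST pairwise convention in which the middle row shows a letter for an identical residue, '+' for a conservative substitution, and a space for a mismatch or gap column. The function name percent_identity is hypothetical. It reads only the middle row, because the Subjt rows as captured in this record repeat the query string.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percent identity from a BLAST-style middle row.

    Letters mark identical residues, '+' marks a conservative
    substitution, and a space marks a mismatch or gap column.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / alignment_length


# Final block of the first GenBank alignment (query row and middle row).
query = "TVRYAERREWAQN"
midline = " VRYA+RREWAQN"
print(round(percent_identity(midline, len(query)), 2))
```

For a whole hit, the middle rows of all blocks would be concatenated and the alignment length would include any gap columns.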

XP_008461738.1 PREDICTED: probable glycosyltransferase At3g07620 [Cucumis melo] (e value: 8.8e-146; %identity: 82.75)
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MSA LRIFTYIPF  FSFSS AESLFY+SLL SPY+THDPDQAHLFF+PFSPD+S RSL+RLIRTLRT+LPYWNRTLGADHFFLSSSG+GY  DRNVVEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG
        KKNAIQVSSFPVP  KFIPHKDISLPPVS  V   V  +  V +R+LGFVGYGWV+GLSLVKELIED EFLMESEPP T S Y +K+AKSDFCLFEYG G
Subjt:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG

Query:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH
        DVSGIGEALRFGCVPVVISDRSIQ+LPLMD +RWQEMAVFVGGGGGIEGVKKVLR +D + L +MKRLGAAAAQHF+WNS PQPLDAFNTVAYQLW+RRH
Subjt:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH

Query:  TVRYAERREWAQN
         VRYA+RREWAQN
Subjt:  TVRYAERREWAQN

XP_011656179.1 probable glycosyltransferase At5g03795 [Cucumis sativus] (e value: 1.0e-146; %identity: 84.03)
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MSA LRIFTYIPF PFSFSS AESLFYKSLL SPY THDPDQAHLFFIPFSP +STRSLARLIRTLRT+LPYWNRTLGADHFFLSSSG+GY SDRNVVEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG
        KKNAIQVSSFPV   KFIPHKD+SLPPVS  V  PV  S  V +R+LGFVGYGWV+GLSLVKELIED EFLMESEPP T S Y +KLAKSDFCLFEY GG
Subjt:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG

Query:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH
        DVSGIGEALRFGCVPVVISDR IQ+LPLMDV+RW+EMAVFV GGGGIEGVKKVLR +D + L +MK+LGAAAAQHF+WNS PQPLDAFNTVAYQLWVRRH
Subjt:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH

Query:  TVRYAERREWAQN
         VRYA+RREWAQN
Subjt:  TVRYAERREWAQN

XP_023542057.1 probable glycosyltransferase At5g03795 [Cucurbita pepo subsp. pepo] (e value: 2.0e-134; %identity: 76.62)
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MS TL+IFTYIPFKP SF SPAESLFYKSLL SPY+THDPD AH FFIPFSPD STRSLARLIRTLR+ELPYWNRTLGADHFFLSSSGV Y SDRN+VEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPADKFIPHKDISLPPV-------SGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSS----YVEKLAK
        KKNAIQVS  PVP   FI HKDI+LPPV       S W+P P  E      RVLGFVGYGWVR   LVKELIED EF MESEPP +S S    Y E+L K
Subjt:  KKNAIQVSSFPVPADKFIPHKDISLPPV-------SGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSS----YVEKLAK

Query:  SDFCLFEYGGGD-VSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAF
        SDFCLFEYGGG  V  IGE LR+GCVPVVISDR IQ+LPLMDVLRWQ+MAVFV GG GIEGVK+VLR +DE+ L KMKRLGAAAAQHF+WNS PQPLDAF
Subjt:  SDFCLFEYGGGD-VSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAF

Query:  NTVAYQLWVRRHTVRYAERREWAQN
        NTVAYQLW+RRHT+RYAER+EWAQ+
Subjt:  NTVAYQLWVRRHTVRYAERREWAQN

XP_038889277.1 probable glycosyltransferase At5g03795 [Benincasa hispida] (e value: 1.2e-153; %identity: 85.94)
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MSA LRIFTYIPF+PFSFSSPAESLFYKSLL SPYATHDPDQAHLFFIPFSPDLSTRSL RLIRTLRT+LPYWNRTLGADHFFLSS+GVGYSS+RNVVEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG
        KKNAIQVSSFPVPA KFIPHKDISLPPVSGWVPV        ++RVLGFVGYGWV+ LSLV ELIED EF+MESEPPLT+SSY EKLAKSDFCLFEYGGG
Subjt:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG

Query:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH
        DVSGIGEALRFGC+PVVIS R IQ+LPLMDV+RWQEMAVF+GG  GI+GVKKVLRG+D++ LA+MKRLGAAAAQHF WNS PQPLDAFNTVA+QLWVRRH
Subjt:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH

Query:  TVRYAERREWAQN
         VRYAERREWAQ+
Subjt:  TVRYAERREWAQN

TrEMBL top hits: e value, %identity, alignment
A0A0A0KS95 Exostosin domain-containing protein (e value: 5.1e-147; %identity: 84.03)
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MSA LRIFTYIPF PFSFSS AESLFYKSLL SPY THDPDQAHLFFIPFSP +STRSLARLIRTLRT+LPYWNRTLGADHFFLSSSG+GY SDRNVVEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG
        KKNAIQVSSFPV   KFIPHKD+SLPPVS  V  PV  S  V +R+LGFVGYGWV+GLSLVKELIED EFLMESEPP T S Y +KLAKSDFCLFEY GG
Subjt:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG

Query:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH
        DVSGIGEALRFGCVPVVISDR IQ+LPLMDV+RW+EMAVFV GGGGIEGVKKVLR +D + L +MK+LGAAAAQHF+WNS PQPLDAFNTVAYQLWVRRH
Subjt:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH

Query:  TVRYAERREWAQN
         VRYA+RREWAQN
Subjt:  TVRYAERREWAQN

A0A1S3CF96 probable glycosyltransferase At3g07620 (e value: 4.3e-146; %identity: 82.75)
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MSA LRIFTYIPF  FSFSS AESLFY+SLL SPY+THDPDQAHLFF+PFSPD+S RSL+RLIRTLRT+LPYWNRTLGADHFFLSSSG+GY  DRNVVEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG
        KKNAIQVSSFPVP  KFIPHKDISLPPVS  V   V  +  V +R+LGFVGYGWV+GLSLVKELIED EFLMESEPP T S Y +K+AKSDFCLFEYG G
Subjt:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG

Query:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH
        DVSGIGEALRFGCVPVVISDRSIQ+LPLMD +RWQEMAVFVGGGGGIEGVKKVLR +D + L +MKRLGAAAAQHF+WNS PQPLDAFNTVAYQLW+RRH
Subjt:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH

Query:  TVRYAERREWAQN
         VRYA+RREWAQN
Subjt:  TVRYAERREWAQN

A0A5A7U559 Putative glycosyltransferase (e value: 4.3e-146; %identity: 82.75)
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MSA LRIFTYIPF  FSFSS AESLFY+SLL SPY+THDPDQAHLFF+PFSPD+S RSL+RLIRTLRT+LPYWNRTLGADHFFLSSSG+GY  DRNVVEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG
        KKNAIQVSSFPVP  KFIPHKDISLPPVS  V   V  +  V +R+LGFVGYGWV+GLSLVKELIED EFLMESEPP T S Y +K+AKSDFCLFEYG G
Subjt:  KKNAIQVSSFPVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGG

Query:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH
        DVSGIGEALRFGCVPVVISDRSIQ+LPLMD +RWQEMAVFVGGGGGIEGVKKVLR +D + L +MKRLGAAAAQHF+WNS PQPLDAFNTVAYQLW+RRH
Subjt:  DVSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRH

Query:  TVRYAERREWAQN
         VRYA+RREWAQN
Subjt:  TVRYAERREWAQN

A0A6J1FML5 probable glycosyltransferase At5g03795 (e value: 3.2e-133; %identity: 75.85)
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MS TL+IFTYIPFKP SF SPAESLFYKSLL SPY+THDPD AH FFIPFSPD STRSLARLIRTLR++LPYWNRTLGADHFFLSS GV Y+SDRN+VEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPADKFIPHKDISLPPV-------SGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSS--SYVEKLAKSD
        KKNAIQVS  PVP   FI HKDI+LPPV       S W+P P  E      RVLGFVGYGWVR   LVKELIED EF MESEPP  S   +Y E+L KSD
Subjt:  KKNAIQVSSFPVPADKFIPHKDISLPPV-------SGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSS--SYVEKLAKSD

Query:  FCLFEYGGGD-VSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNT
        FCLFEYGGG  V  IGE LR+GCVPVVISDR IQ+LPLMDVLRWQ+MAVFV GG GIEGVK+VLR +D + L KMKRLGAAAAQHF+WNS PQPLDAFNT
Subjt:  FCLFEYGGGD-VSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNT

Query:  VAYQLWVRRHTVRYAERREWAQN
        VAYQLW+RRHT+RYAER+EWAQ+
Subjt:  VAYQLWVRRHTVRYAERREWAQN

A0A6J1K6U0 probable glycosyltransferase At5g03795 (e value: 5.4e-133; %identity: 75.38)
Query:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL
        MS TL+IFTYIPFKP SF SPAESLFYKSLL SPY+TH+PD AH FFIPFSPD STRSLARLIRTLR+ELPYWNRTLGADHFFLSS GV Y+SDRN+VEL
Subjt:  MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVEL

Query:  KKNAIQVSSFPVPADKFIPHKDISLPPV-------SGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSS----YVEKLAK
        KKNAIQVS  PVP   FI HKDI+LPPV       S W+P P  E      RVLGFVGYGWVR   LVKELIED EF MESEPP    S    Y E+L K
Subjt:  KKNAIQVSSFPVPADKFIPHKDISLPPV-------SGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSS----YVEKLAK

Query:  SDFCLFEYGGGD-VSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAF
        SDFCLFEYGGG  V  IGE +R+GCVPVVISDR IQ+LPLMDVLRWQ+MAVFV GG GIEGVK+VLR +DE+ L KMKRLGAAAAQHF+WNS PQPLDAF
Subjt:  SDFCLFEYGGGD-VSGIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAF

Query:  NTVAYQLWVRRHTVRYAERREWAQN
        NTVAYQLW+RRHT+RYAER+EWAQ+
Subjt:  NTVAYQLWVRRHTVRYAERREWAQN

SwissProt top hits: e value, %identity, alignment
Q3E7Q9 Probable glycosyltransferase At5g25310 (e value: 2.7e-20; %identity: 28.62)
Query:  YATHDPDQAHLFFIPFSPDLSTRSL--------------ARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADKFIPH
        + T+DP+QA+++F+PFS     R L              +  IR + T  P+WNRT GADHF L+    G  + +   +L   +I+V      ++ F P 
Subjt:  YATHDPDQAHLFFIPFSPDLSTRSL--------------ARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADKFIPH

Query:  KDISLPPVSGW-------VPVPVDESAAVRDRVLGFVG--YGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVSG--IGEAL
        KD++LP +  +       + +    SA+ R  +  F G  +G VR + L      D +  +    P    +Y + +  S FC F   G +V+   + EA+
Subjt:  KDISLPPVSGW-------VPVPVDESAAVRDRVLGFVG--YGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVSG--IGEAL

Query:  RFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRR
           C+PV++S   +  LP  DVLRW+  +V V     I  +K++L  +  +    +K       +HF  N  PQ  DAF+   + +W+RR
Subjt:  RFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRR

Q3E9A4 Probable glycosyltransferase At5g20260 (e value: 2.3e-24; %identity: 28.47)
Query:  SPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTE----------------LPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADK
        SP+A ++P++AH F +P S       L R + T   E                 PYWNR+LGADHF++S          +  EL KN I+V      ++ 
Subjt:  SPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTE----------------LPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADK

Query:  FIPHKDISLPPVS---GWVPVPVDESAAVRDR-VLGFV---GYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEA
        F+P +D+S+P ++   G +  P    ++  DR +L F     +G++R + L++   +  E +   E    +  Y + +A + FCL   G    S  +  A
Subjt:  FIPHKDISLPPVS---GWVPVPVDESAAVRDR-VLGFV---GYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEA

Query:  LRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR
        +  GCVPV+ISD     LP  DVL W +  + V     I  +K +L+ +  +    ++R      +HF+ N   QP D    + + +W+RR  +R
Subjt:  LRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR

Q9FFN2 Probable glycosyltransferase At5g03795 (e value: 1.1e-24; %identity: 27.89)
Query:  MSATLRIFTYIPFKPFSF-SSPAESLF-------YKSLLTSPYATHDPDQAHLFFIPFSP--------DLSTRSLARLIRTLR-------TELPYWNRTL
        M    +I+ Y   +P  F   P +S++       Y+    + + T++PD+AH+F++PFS         + ++R  + +  T++        + PYWNR++
Subjt:  MSATLRIFTYIPFKPFSF-SSPAESLF-------YKSLLTSPYATHDPDQAHLFFIPFSP--------DLSTRSLARLIRTLR-------TELPYWNRTL

Query:  GADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADKFIPHKDISLPPV-------SGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEF
        GADHF LS    G  +  +   L  N+I+       +++F P KD+S+P +       +G V  P   S  +     G V +G VR + L     +D++ 
Subjt:  GADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADKFIPHKDISLPPV-------SGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEF

Query:  LMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLG
         +    P   +SY + +  S FC+   G    S  I EAL  GCVPV+I+   +   P  DVL W+  +V V     I  +K +L  +  +   +M R  
Subjt:  LMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLG

Query:  AAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVRYAE
            +HF  NS  +  D F+ + + +WVRR  V+  E
Subjt:  AAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVRYAE

Q9LFP3 Probable glycosyltransferase At5g11130 (e value: 1.1e-21; %identity: 27.76)
Query:  FKPFSFSSPAESLFYKSLLTSPYATH-----------------DPDQAHLFFIP----------------FSPDLSTRSLARLIRTLRTELPYWNRTLGA
        FK +++      LF+K  L + YA                    P++A +F+IP                ++ D     +   I  +    PYWNR+ GA
Subjt:  FKPFSFSSPAESLFYKSLLTSPYATH-----------------DPDQAHLFFIP----------------FSPDLSTRSLARLIRTLRTELPYWNRTLGA

Query:  DHFFLSSSGVGYSSDRNVV--ELKKNAIQVSSFPVPADKFIPHKDISLPPVSGWVP------VPVDESAAVRDRVLGFVG--YGWVRGLSLVKELIEDSE
        DHFFLS     ++ D + V  EL K+ I+       ++ F P +D+SLP ++  +P      V   E    R  +  F G  +G VR +       +D +
Subjt:  DHFFLSSSGVGYSSDRNVV--ELKKNAIQVSSFPVPADKFIPHKDISLPPVSGWVP------VPVDESAAVRDRVLGFVG--YGWVRGLSLVKELIEDSE

Query:  FLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRL
         L+    P T  +Y + + K+ FCL   G    S  I E+L  GCVPV+I+D  +  LP  DVL W+  +V +     +  +KK+L  + E+    M+R 
Subjt:  FLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRL

Query:  GAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR
             +HF+ N   +P D  + + + +W+RR  VR
Subjt:  GAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR

Q9SSE8 Probable glycosyltransferase At3g07620 (e value: 2.0e-23; %identity: 28.52)
Query:  YATHDPDQAHLFFIPFS---------------PDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADKFIP
        Y T DPD+AH++F+PFS                 +  R +A  ++ +  + PYWN + G DHF LS    G+ +   V +L  N+I+V      ++ F P
Subjt:  YATHDPDQAHLFFIPFS---------------PDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADKFIP

Query:  HKDISLPPV---SGWVPVPVDESAAVRDRVLGFV---GYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFG
         KD   P +   +G +         +    L F     +G +R + L     +D + L+    P     Y E + KS FC+   G    S  + EA+  G
Subjt:  HKDISLPPV---SGWVPVPVDESAAVRDRVLGFV---GYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFG

Query:  CVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR
        CVPV+IS+  +  LP  DVL W++ +V V     I  +K++L  + E+   ++        +H L N  P+  D FN + + +W+RR  V+
Subjt:  CVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR

Arabidopsis top hits: e value, %identity, alignment
AT3G07620.1 Exostosin family protein (e value: 1.4e-24; %identity: 28.52)
Query:  YATHDPDQAHLFFIPFS---------------PDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADKFIP
        Y T DPD+AH++F+PFS                 +  R +A  ++ +  + PYWN + G DHF LS    G+ +   V +L  N+I+V      ++ F P
Subjt:  YATHDPDQAHLFFIPFS---------------PDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADKFIP

Query:  HKDISLPPV---SGWVPVPVDESAAVRDRVLGFV---GYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFG
         KD   P +   +G +         +    L F     +G +R + L     +D + L+    P     Y E + KS FC+   G    S  + EA+  G
Subjt:  HKDISLPPV---SGWVPVPVDESAAVRDRVLGFV---GYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFG

Query:  CVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR
        CVPV+IS+  +  LP  DVL W++ +V V     I  +K++L  + E+   ++        +H L N  P+  D FN + + +W+RR  V+
Subjt:  CVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR

AT4G38040.1 Exostosin family protein (e value: 8.8e-35; %identity: 31)
Query:  AESLFYKSLLTSPYATHDPDQAHLFFIPFS------PDLSTRSLARLIRT----LRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFP
        +E  F++++  S + T DPD+A LFFIP S         S  ++  +++     L  + PYWNRTLGADHFF++   VG  +      L KN I+V   P
Subjt:  AESLFYKSLLTSPYATHDPDQAHLFFIPFS------PDLSTRSLARLIRT----LRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFP

Query:  VPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGF-VGYGWVRGLSLVKELIEDSEFLMESEPPLTSSS----YVEKLAKSDFCLFEYGGGDVSG--
             FIPHKD++LP V     +P   +       LGF  G+   +   ++  + E+   L  S   +  ++    Y ++  ++ FC+   GG  V+   
Subjt:  VPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGF-VGYGWVRGLSLVKELIEDSEFLMESEPPLTSSS----YVEKLAKSDFCLFEYGGGDVSG--

Query:  IGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVRY
        I +++ +GC+PV++SD    +LP  D+L W++ AV V     +  +K++L+ +       +        +HF WNS P   DAF+ + Y+LW+R H V+Y
Subjt:  IGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVRY

AT5G03795.1 Exostosin family protein (e value: 7.5e-26; %identity: 27.89)
Query:  MSATLRIFTYIPFKPFSF-SSPAESLF-------YKSLLTSPYATHDPDQAHLFFIPFSP--------DLSTRSLARLIRTLR-------TELPYWNRTL
        M    +I+ Y   +P  F   P +S++       Y+    + + T++PD+AH+F++PFS         + ++R  + +  T++        + PYWNR++
Subjt:  MSATLRIFTYIPFKPFSF-SSPAESLF-------YKSLLTSPYATHDPDQAHLFFIPFSP--------DLSTRSLARLIRTLR-------TELPYWNRTL

Query:  GADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADKFIPHKDISLPPV-------SGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEF
        GADHF LS    G  +  +   L  N+I+       +++F P KD+S+P +       +G V  P   S  +     G V +G VR + L     +D++ 
Subjt:  GADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADKFIPHKDISLPPV-------SGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEF

Query:  LMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLG
         +    P   +SY + +  S FC+   G    S  I EAL  GCVPV+I+   +   P  DVL W+  +V V     I  +K +L  +  +   +M R  
Subjt:  LMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLG

Query:  AAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVRYAE
            +HF  NS  +  D F+ + + +WVRR  V+  E
Subjt:  AAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVRYAE

AT5G11130.1 Exostosin family protein (e value: 7.7e-23; %identity: 27.76)
Query:  FKPFSFSSPAESLFYKSLLTSPYATH-----------------DPDQAHLFFIP----------------FSPDLSTRSLARLIRTLRTELPYWNRTLGA
        FK +++      LF+K  L + YA                    P++A +F+IP                ++ D     +   I  +    PYWNR+ GA
Subjt:  FKPFSFSSPAESLFYKSLLTSPYATH-----------------DPDQAHLFFIP----------------FSPDLSTRSLARLIRTLRTELPYWNRTLGA

Query:  DHFFLSSSGVGYSSDRNVV--ELKKNAIQVSSFPVPADKFIPHKDISLPPVSGWVP------VPVDESAAVRDRVLGFVG--YGWVRGLSLVKELIEDSE
        DHFFLS     ++ D + V  EL K+ I+       ++ F P +D+SLP ++  +P      V   E    R  +  F G  +G VR +       +D +
Subjt:  DHFFLSSSGVGYSSDRNVV--ELKKNAIQVSSFPVPADKFIPHKDISLPPVSGWVP------VPVDESAAVRDRVLGFVG--YGWVRGLSLVKELIEDSE

Query:  FLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRL
         L+    P T  +Y + + K+ FCL   G    S  I E+L  GCVPV+I+D  +  LP  DVL W+  +V +     +  +KK+L  + E+    M+R 
Subjt:  FLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEALRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRL

Query:  GAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR
             +HF+ N   +P D  + + + +W+RR  VR
Subjt:  GAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR

AT5G20260.1 Exostosin family protein (e value: 1.7e-25; %identity: 28.47)
Query:  SPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTE----------------LPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADK
        SP+A ++P++AH F +P S       L R + T   E                 PYWNR+LGADHF++S          +  EL KN I+V      ++ 
Subjt:  SPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTE----------------LPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSFPVPADK

Query:  FIPHKDISLPPVS---GWVPVPVDESAAVRDR-VLGFV---GYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEA
        F+P +D+S+P ++   G +  P    ++  DR +L F     +G++R + L++   +  E +   E    +  Y + +A + FCL   G    S  +  A
Subjt:  FIPHKDISLPPVS---GWVPVPVDESAAVRDR-VLGFV---GYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVS-GIGEA

Query:  LRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR
        +  GCVPV+ISD     LP  DVL W +  + V     I  +K +L+ +  +    ++R      +HF+ N   QP D    + + +W+RR  +R
Subjt:  LRFGCVPVVISDRSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVR


Sequences
CDS sequence
ATGTCTGCAACTTTAAGAATCTTCACTTACATTCCGTTCAAGCCATTTTCCTTTTCTTCTCCTGCCGAATCGCTTTTCTACAAATCGCTTCTCACTAGCCCCTACGCTAC
TCACGATCCTGACCAGGCCCATTTGTTTTTCATTCCCTTTTCCCCTGATCTCTCCACGCGCTCCCTCGCGCGTTTGATTCGCACGCTTCGTACGGAACTGCCGTACTGGA
ATCGGACTCTCGGTGCCGATCACTTCTTTTTGTCGTCTTCCGGCGTTGGTTACTCCTCTGATCGGAACGTTGTCGAGTTGAAGAAGAACGCCATTCAGGTCTCGTCCTTC
CCTGTTCCTGCGGATAAGTTTATTCCTCATAAGGATATTTCCTTGCCGCCGGTTTCCGGTTGGGTTCCGGTTCCGGTGGATGAGTCGGCGGCGGTGAGAGACAGAGTGTT
GGGTTTTGTTGGGTATGGGTGGGTGAGGGGTTTATCTTTGGTGAAGGAGTTGATTGAGGATTCTGAGTTTCTGATGGAGTCGGAGCCGCCGCTTACGTCGTCGAGTTACG
TGGAGAAACTGGCGAAAAGTGACTTTTGTTTGTTTGAATACGGCGGCGGGGATGTTTCCGGGATTGGGGAGGCTTTACGGTTTGGTTGTGTTCCGGTGGTGATTTCGGAT
CGTTCGATTCAGAACTTACCGCTGATGGACGTTCTACGGTGGCAGGAAATGGCGGTGTTCGTCGGTGGCGGCGGCGGAATTGAAGGTGTGAAGAAGGTTCTAAGGGGCAT
GGATGAGAAGGGTCTTGCGAAGATGAAGAGATTGGGTGCGGCGGCAGCCCAGCATTTTCTGTGGAACTCTTTGCCTCAGCCGTTGGATGCTTTCAATACGGTGGCGTATC
AGCTTTGGGTGAGAAGGCACACCGTTAGATATGCGGAGAGGAGAGAGTGGGCCCAGAACTGA
mRNA sequence
ATGTCTGCAACTTTAAGAATCTTCACTTACATTCCGTTCAAGCCATTTTCCTTTTCTTCTCCTGCCGAATCGCTTTTCTACAAATCGCTTCTCACTAGCCCCTACGCTAC
TCACGATCCTGACCAGGCCCATTTGTTTTTCATTCCCTTTTCCCCTGATCTCTCCACGCGCTCCCTCGCGCGTTTGATTCGCACGCTTCGTACGGAACTGCCGTACTGGA
ATCGGACTCTCGGTGCCGATCACTTCTTTTTGTCGTCTTCCGGCGTTGGTTACTCCTCTGATCGGAACGTTGTCGAGTTGAAGAAGAACGCCATTCAGGTCTCGTCCTTC
CCTGTTCCTGCGGATAAGTTTATTCCTCATAAGGATATTTCCTTGCCGCCGGTTTCCGGTTGGGTTCCGGTTCCGGTGGATGAGTCGGCGGCGGTGAGAGACAGAGTGTT
GGGTTTTGTTGGGTATGGGTGGGTGAGGGGTTTATCTTTGGTGAAGGAGTTGATTGAGGATTCTGAGTTTCTGATGGAGTCGGAGCCGCCGCTTACGTCGTCGAGTTACG
TGGAGAAACTGGCGAAAAGTGACTTTTGTTTGTTTGAATACGGCGGCGGGGATGTTTCCGGGATTGGGGAGGCTTTACGGTTTGGTTGTGTTCCGGTGGTGATTTCGGAT
CGTTCGATTCAGAACTTACCGCTGATGGACGTTCTACGGTGGCAGGAAATGGCGGTGTTCGTCGGTGGCGGCGGCGGAATTGAAGGTGTGAAGAAGGTTCTAAGGGGCAT
GGATGAGAAGGGTCTTGCGAAGATGAAGAGATTGGGTGCGGCGGCAGCCCAGCATTTTCTGTGGAACTCTTTGCCTCAGCCGTTGGATGCTTTCAATACGGTGGCGTATC
AGCTTTGGGTGAGAAGGCACACCGTTAGATATGCGGAGAGGAGAGAGTGGGCCCAGAACTGA
Protein sequence
MSATLRIFTYIPFKPFSFSSPAESLFYKSLLTSPYATHDPDQAHLFFIPFSPDLSTRSLARLIRTLRTELPYWNRTLGADHFFLSSSGVGYSSDRNVVELKKNAIQVSSF
PVPADKFIPHKDISLPPVSGWVPVPVDESAAVRDRVLGFVGYGWVRGLSLVKELIEDSEFLMESEPPLTSSSYVEKLAKSDFCLFEYGGGDVSGIGEALRFGCVPVVISD
RSIQNLPLMDVLRWQEMAVFVGGGGGIEGVKKVLRGMDEKGLAKMKRLGAAAAQHFLWNSLPQPLDAFNTVAYQLWVRRHTVRYAERREWAQN
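
The CDS and protein sequences in this record can be cross-checked by translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The function name translate and the constants are hypothetical and written only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, dropping any trailing stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First ten codons of the CDS sequence listed in this record.
print(translate("ATGTCTGCAACTTTAAGAATCTTCACTTAC"))
```

Passing the full CDS with its line breaks left in place should work as well, since whitespace is stripped before translation; the result can then be compared with the protein sequence listed in the record.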