CuGenDBv2

Lag0040815 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0040815
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Exostosin domain-containing protein
Genome location: chr13:8612330..8613388
RNA-Seq Expression: Lag0040815
Synteny: Lag0040815
Gene Ontology terms:
GO:0006486 - protein glycosylation (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains:
IPR004263 - Exostosin-like
IPR040911 - Exostosin, GT47 domain
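
The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr13:8612330..8613388")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the span is 1059 bp, which is divisible by three and therefore consistent with an uninterrupted coding sequence of 352 codons plus a stop codon.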


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600032.1 putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. sororia]; e value: 5.6e-165; % identity: 84.46
Query:  MASLISLSLLLSLSLLTTASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRT
        MASLI+ SLL SLSLL  ASPSPYLSPIF ++Y +MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH+PD AH FF+PF P  STRSL+R IRT
Subjt:  MASLISLSLLLSLSLLTTASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRT

Query:  LRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKEL
        LR++LPYWNRTLGADHFFLSS+GV +ASDRN+VELKKNAIQVSG PVP G F  HKDITLPPV DS E SSSWIPAPA ERVLGFVGYGWVRD+VLVKEL
Subjt:  LRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKEL

Query:  IEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDE
        IEDP FFMESEPPP  SERWNYGERL KSDFCLFEYGGG  V  IGE LR GCVPVVISDRPIQDLPLMDVLRWQDMAVFVN G  GIEGVKRVLRRVDE
Subjt:  IEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDE

Query:  ESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS
        ESL KMKRLGAAAAQHF WNSPPQPLDAFNTVAYQLWLRRH +RYAERKEWAQS
Subjt:  ESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS

XP_022941986.1 probable glycosyltransferase At5g03795 [Cucurbita moschata]; e value: 1.3e-153; % identity: 86.21
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH+PD AH FF+PF P  STRSL+R IRTLR+ LPYWNRTLGADHFFLSS GV +ASDRN+VEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFE
        KKNAIQVSG PVP G F  HKDITLPPV DS E SSSWIPAPA ERVLGFVGYGWVRD+VLVKELIEDPEFFMESEPPP  SERWNYGERL KSDFCLFE
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFE

Query:  YGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQ
        YGGG  V  IGE LR GCVPVVISDRPIQDLPLMDVLRWQDMAVFVN GG GIEGVKRVLRRVD ESL KMKRLGAAAAQHF WNSPPQPLDAFNTVAYQ
Subjt:  YGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQ

Query:  LWLRRHAVRYAERKEWAQS
        LWLRRH +RYAERKEWAQS
Subjt:  LWLRRHAVRYAERKEWAQS

XP_022995754.1 probable glycosyltransferase At5g03795 [Cucurbita maxima]; e value: 9.6e-165; % identity: 84.51
Query:  MASLISLSLLLSLSLLTTASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRT
        MASLI+ SLLLSLSLL  ASPSPYLSPIF ++Y +MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH PD AH FF+PF P  STRSL+R IRT
Subjt:  MASLISLSLLLSLSLLTTASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRT

Query:  LRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKEL
        LR++LPYWNRTLGADHFFLSS GV +ASDRN+VELKKNAIQVSG PVP G F  HKDITLPPV DS E SSSWIPAPA ERVLGFVGYGWVRD+VLVKEL
Subjt:  LRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKEL

Query:  IEDPEFFMESE-PPPSSSERWNYGERLAKSDFCLFEYGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVD
        IEDPEFFMESE PPP  SER NYGERL KSDFCLFEYGGG  V  IGE +R GCVPVVISDRPIQDLPLMDVLRWQDMAVFVN GG GIEGVKRVLRRVD
Subjt:  IEDPEFFMESE-PPPSSSERWNYGERLAKSDFCLFEYGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVD

Query:  EESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS
        EESL KMKRLGAAAAQHF WNSPPQPLDAFNTVAYQLWLRRH +RYAERKEWAQS
Subjt:  EESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS

XP_023542057.1 probable glycosyltransferase At5g03795 [Cucurbita pepo subsp. pepo]; e value: 9.9e-154; % identity: 86.25
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH+PD AH FF+PF P  STRSL+R IRTLR++LPYWNRTLGADHFFLSS+GV + SDRN+VEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKELIEDPEFFMESEPPPSS-SERWNYGERLAKSDFCLF
        KKNAIQVSG PVP G F  HKDITLPPV DS E SSSWIPAPA ERVLGFVGYGWVRD+VLVKELIEDPEFFMESEPPPSS SER NYGERL KSDFCLF
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKELIEDPEFFMESEPPPSS-SERWNYGERLAKSDFCLF

Query:  EYGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAY
        EYGGG  V  IGE LR GCVPVVISDRPIQDLPLMDVLRWQDMAVFVN GG GIEGVKRVLRRVDEESL KMKRLGAAAAQHF WNSPPQPLDAFNTVAY
Subjt:  EYGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAY

Query:  QLWLRRHAVRYAERKEWAQS
        QLWLRRH +RYAERKEWAQS
Subjt:  QLWLRRHAVRYAERKEWAQS

XP_038889277.1 probable glycosyltransferase At5g03795 [Benincasa hispida]; e value: 1.6e-148; % identity: 76.94
Query:  ASLISLSLLLSLSLLT---TASP--SPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSR
        +SLI+LSLLLS SLL+   T+SP  SPYLSPIFLK+Y SMS  LRIFTYIP +PFSF SPAESLFYKSLL+SPY+TH+PD AHLFF+PF P LSTRSL R
Subjt:  ASLISLSLLLSLSLLT---TASP--SPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSR

Query:  FIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPA---PAAERVLGFVGYGWVRD
         IRTLRTDLPYWNRTLGADHFFLSSAGVG++S+RNVVELKKNAIQVS +PVP GKF PHKDI+LPPV       S W+P       ERVLGFVGYGWV+ 
Subjt:  FIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPA---PAAERVLGFVGYGWVRD

Query:  QVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGG-VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRV
          LV ELIEDPEF MESEPP ++S   +YGE+LAKSDFCLFEYGGG VSGIGEALR GC+PVVIS RPIQDLPLMDV+RWQ+MAVF+ GG  GI+GVK+V
Subjt:  QVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGG-VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRV

Query:  LRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS
        LR VD+ESLA+MKRLGAAAAQHF+WNSPPQPLDAFNTVA+QLW+RRHAVRYAER+EWAQS
Subjt:  LRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KS95 Exostosin domain-containing protein; e value: 8.2e-146; % identity: 75.48
Query:  ASLISLSLLLSLSLLTT---------ASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTR
        +SLI+LSLLLS SLL T          SPSPYLSPIFLK+Y SMS  LRIFTYIP  PFSF S AESLFYKSLL+SPY+TH+PD AHLFF+PF PH+STR
Subjt:  ASLISLSLLLSLSLLTT---------ASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTR

Query:  SLSRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSD--SPEVSSSWIPAPAAERVLGFVGYGW
        SL+R IRTLRTDLPYWNRTLGADHFFLSS+G+G+ SDRNVVELKKNAIQVS +PV PGKF PHKD++LPPVS   S  VS+S +    +ER+LGFVGYGW
Subjt:  SLSRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSD--SPEVSSSWIPAPAAERVLGFVGYGW

Query:  VRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEY-GGGVSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGV
        V+   LVKELIEDPEF MESEPP + S    YG++LAKSDFCLFEY GG VSGIGEALR GCVPVVISDR IQDLPLMDV+RW++MAVFV  GGGGIEGV
Subjt:  VRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEY-GGGVSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGV

Query:  KRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS
        K+VLRRVD E L +MK+LGAAAAQHF WNSPPQPLDAFNTVAYQLW+RRHAVRYA+R+EWAQ+
Subjt:  KRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS

A0A1S3CF96 probable glycosyltransferase At3g07620; e value: 1.2e-144; % identity: 76.34
Query:  ASLISLSLLLSLSLL---TTASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFI
        +SLI+ +LLLS SLL    T SPSPYLSPIFLK+Y SMS  LRIFTYIP   FSF S AESLFY+SLL+SPYSTH+PD AHLFF+PF P +S RSLSR I
Subjt:  ASLISLSLLLSLSLL---TTASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFI

Query:  RTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVK
        RTLRTDLPYWNRTLGADHFFLSS+G+G+  DRNVVELKKNAIQVS +PVPPGKF PHKDI+LPPV  S  VS+       +ER+LGFVGYGWV+   LVK
Subjt:  RTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVK

Query:  ELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGG-VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVD
        ELIEDPEF MESEPPP+ S    YG+++AKSDFCLFEYG G VSGIGEALR GCVPVVISDR IQDLPLMD +RWQ+MAVFV GGGGGIEGVK+VLR VD
Subjt:  ELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGG-VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVD

Query:  EESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS
         E L +MKRLGAAAAQHF WNSPPQPLDAFNTVAYQLWLRRHAVRYA+R+EWAQ+
Subjt:  EESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS

A0A5A7U559 Putative glycosyltransferase; e value: 8.5e-135; % identity: 77.36
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MS  LRIFTYIP   FSF S AESLFY+SLL+SPYSTH+PD AHLFF+PF P +S RSLSR IRTLRTDLPYWNRTLGADHFFLSS+G+G+  DRNVVEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFE
        KKNAIQVS +PVPPGKF PHKDI+LPPV  S  VS+       +ER+LGFVGYGWV+   LVKELIEDPEF MESEPPP+ S    YG+++AKSDFCLFE
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFE

Query:  YGGG-VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQL
        YG G VSGIGEALR GCVPVVISDR IQDLPLMD +RWQ+MAVFV GGGGGIEGVK+VLR VD E L +MKRLGAAAAQHF WNSPPQPLDAFNTVAYQL
Subjt:  YGGG-VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQL

Query:  WLRRHAVRYAERKEWAQS
        WLRRHAVRYA+R+EWAQ+
Subjt:  WLRRHAVRYAERKEWAQS

A0A6J1FML5 probable glycosyltransferase At5g03795; e value: 6.3e-154; % identity: 86.21
Query:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVEL
        MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH+PD AH FF+PF P  STRSL+R IRTLR+ LPYWNRTLGADHFFLSS GV +ASDRN+VEL
Subjt:  MSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVEL

Query:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFE
        KKNAIQVSG PVP G F  HKDITLPPV DS E SSSWIPAPA ERVLGFVGYGWVRD+VLVKELIEDPEFFMESEPPP  SERWNYGERL KSDFCLFE
Subjt:  KKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFE

Query:  YGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQ
        YGGG  V  IGE LR GCVPVVISDRPIQDLPLMDVLRWQDMAVFVN GG GIEGVKRVLRRVD ESL KMKRLGAAAAQHF WNSPPQPLDAFNTVAYQ
Subjt:  YGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQ

Query:  LWLRRHAVRYAERKEWAQS
        LWLRRH +RYAERKEWAQS
Subjt:  LWLRRHAVRYAERKEWAQS

A0A6J1K6U0 probable glycosyltransferase At5g03795; e value: 4.6e-165; % identity: 84.51
Query:  MASLISLSLLLSLSLLTTASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRT
        MASLI+ SLLLSLSLL  ASPSPYLSPIF ++Y +MSTTL+IFTYIP KP SFPSPAESLFYKSLLDSPYSTH PD AH FF+PF P  STRSL+R IRT
Subjt:  MASLISLSLLLSLSLLTTASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRT

Query:  LRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKEL
        LR++LPYWNRTLGADHFFLSS GV +ASDRN+VELKKNAIQVSG PVP G F  HKDITLPPV DS E SSSWIPAPA ERVLGFVGYGWVRD+VLVKEL
Subjt:  LRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKEL

Query:  IEDPEFFMESE-PPPSSSERWNYGERLAKSDFCLFEYGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVD
        IEDPEFFMESE PPP  SER NYGERL KSDFCLFEYGGG  V  IGE +R GCVPVVISDRPIQDLPLMDVLRWQDMAVFVN GG GIEGVKRVLRRVD
Subjt:  IEDPEFFMESE-PPPSSSERWNYGERLAKSDFCLFEYGGG--VSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVD

Query:  EESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS
        EESL KMKRLGAAAAQHF WNSPPQPLDAFNTVAYQLWLRRH +RYAERKEWAQS
Subjt:  EESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAERKEWAQS

SwissProt top hits (e value, % identity, alignment)
Q3E7Q9 Probable glycosyltransferase At5g25310; e value: 1.0e-20; % identity: 27.35
Query:  ASLISLSLLLSLSLLTTASPSP--YLSPIFL-KSYKSMSTTLRIFTYIPLK-PFSFPSPAESLF--------YKSLLDSPYSTHNPDDAHLFFLPFPPHL
        AS++  S  ++ +L  +  P+   Y +P  L +SY  M    +++ Y   + P     P +S++              + + T++P+ A+++FLPF    
Subjt:  ASLISLSLLLSLSLLTTASPSP--YLSPIFL-KSYKSMSTTLRIFTYIPLK-PFSFPSPAESLF--------YKSLLDSPYSTHNPDDAHLFFLPFPPHL

Query:  STRSL--------------SRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVS------DSPEV
          R L              S +IR + T+ P+WNRT GADHF L+    G  + +   +L   +I+V         F P KD+TLP +       D    
Subjt:  STRSL--------------SRFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVS------DSPEV

Query:  SSSWIPAPAAERVLGFVG--YGWVRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLP
         S  + A     +  F G  +G VR  +L      D +  +    P    +  NY + +  S FC    G  V+   + EA+   C+PV++S   +  LP
Subjt:  SSSWIPAPAAERVLGFVG--YGWVRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLP

Query:  LMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRR
          DVLRW+  +V V+     I  +K +L  +  E    +K       +HFE N PPQ  DAF+   + +WLRR
Subjt:  LMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRR

Q94AA9 Xylogalacturonan beta-1,3-xylosyltransferase; e value: 1.3e-18; % identity: 25.69
Query:  SPYLSP-IFLKSYKSMSTTLRIFTYIPLK-PFSFPSPAESLF-------YKSLLDSP-----YSTHNPDDAHLFFLPF----------PPHLSTRSLSR-
        S Y +P  F +S+  M    +++TY   + P     P   ++        +  +D P     +    P++AH+FF+PF           P  S    SR 
Subjt:  SPYLSP-IFLKSYKSMSTTLRIFTYIPLK-PFSFPSPAESLF-------YKSLLDSP-----YSTHNPDDAHLFFLPF----------PPHLSTRSLSR-

Query:  --------FIRTLRTDLPYWNRTLGADHFFLSSAG-VGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPV-SDSPEVSSSWI-PAPAAERVLGF
                ++  + T  PYWNR+ G DHF +S         D N    +K    +       G F P+ D+++P +     ++  S++  +P    +L F
Subjt:  --------FIRTLRTDLPYWNRTLGADHFFLSSAG-VGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPV-SDSPEVSSSWI-PAPAAERVLGF

Query:  V---GYGWVRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSGIG--EALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVN
             +G +R  +       D E  +    PP      +Y + +  S FCL   G  V+     EA+  GCVPV+ISD     LP  DVL W   ++ + 
Subjt:  V---GYGWVRDQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSGIG--EALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVN

Query:  GGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVR
             I+ +K +L+ V      KM +      QHF  N P +P D  + + + +WLRR  +R
Subjt:  GGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVR

Q9FFN2 Probable glycosyltransferase At5g03795; e value: 1.4e-28; % identity: 27.61
Query:  PSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSF-PSPAESLF-------YKSLLDSPYSTHNPDDAHLFFLPFPPHLSTR---------------SLSRF
        P  + + +F +SY  M    +I+ Y   +P  F   P +S++       Y+   D+ + T+NPD AH+F+LPF      R               ++  +
Subjt:  PSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSF-PSPAESLF-------YKSLLDSPYSTHNPDDAHLFFLPFPPHLSTR---------------SLSRF

Query:  IRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWI--PAPAAERVLGFVG---YGWVR
        I  +    PYWNR++GADHF LS    G  +  +   L  N+I+         +F P KD+++P ++      +  +  P+P++  +L F     +G VR
Subjt:  IRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWI--PAPAAERVLGFVG---YGWVR

Query:  DQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVK
          +L     +D +  +    P  +S    Y + +  S FC+   G  V+   I EAL  GCVPV+I+   +   P  DVL W+  +V V+     I  +K
Subjt:  DQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVK

Query:  RVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAE
         +L  +      +M R      +HFE NSP +  D F+ + + +W+RR  V+  E
Subjt:  RVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAE

Q9LFP3 Probable glycosyltransferase At5g11130; e value: 2.5e-22; % identity: 28.65
Query:  SPYLSPI-FLKSYKSMSTTLRIFTY----IPL---KPFSFPSPAESLFYKSLL--DSPYSTHNPDDAHLFFLP----------FPPHLS------TRSLS
        S YL+   F +S+K M    +I+TY     PL    P +     E  F   +   +S +   +P++A +F++P          + P+ S         + 
Subjt:  SPYLSPI-FLKSYKSMSTTLRIFTY----IPL---KPFSFPSPAESLFYKSLL--DSPYSTHNPDDAHLFFLP----------FPPHLS------TRSLS

Query:  RFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVV--ELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWI---PAPAAERVLGFVGYGW
         +I  +    PYWNR+ GADHFFLS      A D + V  EL K+ I+          FTP +D++LP + + P     ++     P   ++L F   G 
Subjt:  RFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVV--ELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWI---PAPAAERVLGFVGYGW

Query:  VRD--QVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGI
          D  ++L +   E  +  +  E  P +    NY + + K+ FCL   G  V+   I E+L  GCVPV+I+D  +  LP  DVL W+  +V +      +
Subjt:  VRD--QVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGI

Query:  EGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVR
          +K++L  + EE    M+R      +HF  N P +P D  + + + +WLRR  VR
Subjt:  EGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVR

Q9SSE8 Probable glycosyltransferase At3g07620; e value: 5.0e-23; % identity: 28.24
Query:  FLKSYKSMSTTLRIFTYIPLKPFSFP-------SPAESLFYKSLLDS--PYSTHNPDDAHLFFLPFP-----PHL----------STRSLSRFIRTLRTD
        F +SY  M    +I+ Y    P  F           E LF   + +    Y T +PD AH++FLPF       HL            R ++ +++ +   
Subjt:  FLKSYKSMSTTLRIFTYIPLKPFSFP-------SPAESLFYKSLLDS--PYSTHNPDDAHLFFLPFP-----PHL----------STRSLSRFIRTLRTD

Query:  LPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVS----DSPEVSSSWIPAPAAERVLGFV---GYGWVRDQVLV
         PYWN + G DHF LS    GH +   V +L  N+I+V         F P KD   P ++    D   ++      P +   L F     +G +R  +L 
Subjt:  LPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVS----DSPEVSSSWIPAPAAERVLGFV---GYGWVRDQVLV

Query:  KELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRR
            +D +  +    P    +  +Y E + KS FC+   G  V+   + EA+  GCVPV+IS+  +  LP  DVL W+  +V V+     I  +KR+L  
Subjt:  KELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRR

Query:  VDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVR
        + EE   ++        +H   N PP+  D FN + + +WLRR  V+
Subjt:  VDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVR

Arabidopsis top hits (e value, % identity, alignment)
AT3G07620.1 Exostosin family protein; e value: 3.5e-24; % identity: 28.24
Query:  FLKSYKSMSTTLRIFTYIPLKPFSFP-------SPAESLFYKSLLDS--PYSTHNPDDAHLFFLPFP-----PHL----------STRSLSRFIRTLRTD
        F +SY  M    +I+ Y    P  F           E LF   + +    Y T +PD AH++FLPF       HL            R ++ +++ +   
Subjt:  FLKSYKSMSTTLRIFTYIPLKPFSFP-------SPAESLFYKSLLDS--PYSTHNPDDAHLFFLPFP-----PHL----------STRSLSRFIRTLRTD

Query:  LPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVS----DSPEVSSSWIPAPAAERVLGFV---GYGWVRDQVLV
         PYWN + G DHF LS    GH +   V +L  N+I+V         F P KD   P ++    D   ++      P +   L F     +G +R  +L 
Subjt:  LPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVS----DSPEVSSSWIPAPAAERVLGFV---GYGWVRDQVLV

Query:  KELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRR
            +D +  +    P    +  +Y E + KS FC+   G  V+   + EA+  GCVPV+IS+  +  LP  DVL W+  +V V+     I  +KR+L  
Subjt:  KELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRR

Query:  VDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVR
        + EE   ++        +H   N PP+  D FN + + +WLRR  V+
Subjt:  VDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVR

AT4G16745.1 Exostosin family protein; e value: 4.6e-24; % identity: 27.57
Query:  IFLKSYKSMSTTLRIFTYIPLKPFSFPSP------AESLFYKSLLDS--PYSTHNPDDAHLFFLPF-----------PPHLSTRSLSRFIR----TLRTD
        +F +SY+ M   L+++ Y       F  P      A   ++  L++S   + T NP+ AHLF++P+           P   + + LS F+R     L   
Subjt:  IFLKSYKSMSTTLRIFTYIPLKPFSFPSP------AESLFYKSLLDS--PYSTHNPDDAHLFFLPF-----------PPHLSTRSLSRFIR----TLRTD

Query:  LPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQ-VSGWPVPPGKFTPHKDITLPPVS----DSPEVSSSWIPAPAAERVLGFVG---YGWVRDQVL
         P+WNRT G+DHF ++    G  +     ELK+NAI+ +    +  G F P KD++LP  S      P  +       +   +L F     +G VR ++L
Subjt:  LPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQ-VSGWPVPPGKFTPHKDITLPPVS----DSPEVSSSWIPAPAAERVLGFVG---YGWVRDQVL

Query:  VKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLR
             +D +  +    P + + +  Y + +  S +CL   G  V+   I EA+   CVPVVI+D  +  LP  DVL W   +V V      I  +K +L 
Subjt:  VKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLR

Query:  RVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLW
         +      KM+       +HF W+  P+  D F+ + + +W
Subjt:  RVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLW

AT4G38040.1 Exostosin family protein; e value: 1.5e-35; % identity: 29.6
Query:  YLSP-IFLKSYKSMSTTLRIFTYIPLKPFSF---------PSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHL----------STRSLSRFIRTLRT
        Y SP  F  +Y  M    +++ Y    P +F            +E  F++++ +S + T +PD+A LFF+P   H            T  +  ++  L  
Subjt:  YLSP-IFLKSYKSMSTTLRIFTYIPLKPFSF---------PSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHL----------STRSLSRFIRTLRT

Query:  DLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAE----RVLGF-VGYGWVRDQVLVK
          PYWNRTLGADHFF++   VG  +      L KN I+V   P     F PHKD+ LP V     +    +PA   +      LGF  G+   + +V++ 
Subjt:  DLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAE----RVLGF-VGYGWVRDQVLVK

Query:  ELIE-DPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRR
         + E D E  + +     ++    Y +R  ++ FC+   G  V+   I +++  GC+PV++SD    DLP  D+L W+  AV +      +  +K++L+ 
Subjt:  ELIE-DPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRR

Query:  VDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRY
        +       +        +HF+WNSPP   DAF+ + Y+LWLR H V+Y
Subjt:  VDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRY

AT5G03795.1 Exostosin family protein; e value: 9.6e-30; % identity: 27.61
Query:  PSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSF-PSPAESLF-------YKSLLDSPYSTHNPDDAHLFFLPFPPHLSTR---------------SLSRF
        P  + + +F +SY  M    +I+ Y   +P  F   P +S++       Y+   D+ + T+NPD AH+F+LPF      R               ++  +
Subjt:  PSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSF-PSPAESLF-------YKSLLDSPYSTHNPDDAHLFFLPFPPHLSTR---------------SLSRF

Query:  IRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWI--PAPAAERVLGFVG---YGWVR
        I  +    PYWNR++GADHF LS    G  +  +   L  N+I+         +F P KD+++P ++      +  +  P+P++  +L F     +G VR
Subjt:  IRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWI--PAPAAERVLGFVG---YGWVR

Query:  DQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVK
          +L     +D +  +    P  +S    Y + +  S FC+   G  V+   I EAL  GCVPV+I+   +   P  DVL W+  +V V+     I  +K
Subjt:  DQVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVK

Query:  RVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAE
         +L  +      +M R      +HFE NSP +  D F+ + + +W+RR  V+  E
Subjt:  RVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVRYAE

AT5G11130.1 Exostosin family protein; e value: 1.8e-23; % identity: 28.65
Query:  SPYLSPI-FLKSYKSMSTTLRIFTY----IPL---KPFSFPSPAESLFYKSLL--DSPYSTHNPDDAHLFFLP----------FPPHLS------TRSLS
        S YL+   F +S+K M    +I+TY     PL    P +     E  F   +   +S +   +P++A +F++P          + P+ S         + 
Subjt:  SPYLSPI-FLKSYKSMSTTLRIFTY----IPL---KPFSFPSPAESLFYKSLL--DSPYSTHNPDDAHLFFLP----------FPPHLS------TRSLS

Query:  RFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVV--ELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWI---PAPAAERVLGFVGYGW
         +I  +    PYWNR+ GADHFFLS      A D + V  EL K+ I+          FTP +D++LP + + P     ++     P   ++L F   G 
Subjt:  RFIRTLRTDLPYWNRTLGADHFFLSSAGVGHASDRNVV--ELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWI---PAPAAERVLGFVGYGW

Query:  VRD--QVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGI
          D  ++L +   E  +  +  E  P +    NY + + K+ FCL   G  V+   I E+L  GCVPV+I+D  +  LP  DVL W+  +V +      +
Subjt:  VRD--QVLVKELIEDPEFFMESEPPPSSSERWNYGERLAKSDFCLFEYGGGVSG--IGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGI

Query:  EGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVR
          +K++L  + EE    M+R      +HF  N P +P D  + + + +WLRR  VR
Subjt:  EGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTVAYQLWLRRHAVR
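
The unlabeled middle row of each alignment block above is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those symbols for one segment. It assumes the usual BLAST-style convention that percent identity is identities divided by alignment length; the page does not say which tool or formula produced its % identity column. The function name `midline_stats` is hypothetical, and the segment is the final match line of the first GenBank hit with its leading pad removed.

```python
def midline_stats(midline):
    """Return (identities, positives, alignment_length) for one match line.

    The string must cover exactly the aligned columns: strip the leading
    pad that lines it up under 'Query:', but keep interior spaces.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Final segment of the KAG6600032.1 alignment, copied from the match line.
segment = "ESL KMKRLGAAAAQHF WNSPPQPLDAFNTVAYQLWLRRH +RYAERKEWAQS"
ident, pos, length = midline_stats(segment)
print(ident, pos, length)  # 50 51 54
print(round(100 * ident / length, 2))
```

A whole-hit figure would need the counts summed over every segment of that hit before dividing, so a single segment is not expected to reproduce the value in the hit's header line.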


Sequences
CDS sequence
ATGGCTTCCCTCATCTCTCTCTCTCTCCTCCTCTCTCTCTCTCTCCTCACAACAGCTTCCCCTTCCCCTTATCTCTCTCCCATTTTCCTCAAAAGCTACAAATCCATGTC
CACAACCTTAAGAATCTTCACCTACATCCCACTCAAACCCTTCTCCTTCCCTTCTCCCGCCGAATCGCTTTTCTACAAATCGCTTCTCGACAGCCCCTACTCCACTCACA
ACCCCGACGACGCCCACTTGTTCTTCCTCCCCTTTCCTCCCCATCTCTCCACGCGCTCCCTCTCGCGTTTCATCCGCACGCTCCGTACGGACCTGCCGTACTGGAATCGG
ACTCTCGGCGCCGACCACTTCTTTCTCTCCTCGGCCGGCGTTGGCCATGCGTCTGATCGGAACGTCGTCGAGTTGAAGAAGAACGCCATTCAGGTCTCTGGATGGCCGGT
GCCGCCCGGGAAGTTTACTCCTCATAAGGACATTACTCTGCCGCCGGTTTCCGATTCGCCAGAAGTTTCTTCTTCCTGGATTCCGGCGCCGGCGGCGGAGAGGGTGCTTG
GTTTCGTCGGGTATGGGTGGGTGAGGGATCAGGTCTTGGTGAAGGAGTTGATTGAGGATCCTGAGTTTTTTATGGAGTCGGAGCCGCCGCCGTCGTCGTCGGAGCGGTGG
AATTACGGGGAGAGATTGGCGAAAAGCGACTTTTGCTTGTTCGAATACGGCGGCGGTGTTTCGGGGATTGGGGAGGCTTTGCGATGTGGGTGTGTGCCGGTGGTGATTTC
TGACCGTCCGATTCAGGACTTGCCGTTGATGGACGTGCTCCGGTGGCAGGACATGGCGGTGTTCGTCAACGGCGGCGGCGGCGGAATAGAAGGAGTGAAGAGGGTATTAA
GGCGCGTGGACGAGGAGAGTCTTGCGAAAATGAAGAGATTGGGTGCGGCGGCGGCACAGCATTTTGAGTGGAACTCGCCGCCCCAGCCGTTGGATGCTTTCAATACGGTG
GCGTATCAGCTTTGGCTGAGAAGGCACGCCGTTAGATATGCCGAGAGGAAGGAGTGGGCCCAGAGTTGA
mRNA sequence
ATGGCTTCCCTCATCTCTCTCTCTCTCCTCCTCTCTCTCTCTCTCCTCACAACAGCTTCCCCTTCCCCTTATCTCTCTCCCATTTTCCTCAAAAGCTACAAATCCATGTC
CACAACCTTAAGAATCTTCACCTACATCCCACTCAAACCCTTCTCCTTCCCTTCTCCCGCCGAATCGCTTTTCTACAAATCGCTTCTCGACAGCCCCTACTCCACTCACA
ACCCCGACGACGCCCACTTGTTCTTCCTCCCCTTTCCTCCCCATCTCTCCACGCGCTCCCTCTCGCGTTTCATCCGCACGCTCCGTACGGACCTGCCGTACTGGAATCGG
ACTCTCGGCGCCGACCACTTCTTTCTCTCCTCGGCCGGCGTTGGCCATGCGTCTGATCGGAACGTCGTCGAGTTGAAGAAGAACGCCATTCAGGTCTCTGGATGGCCGGT
GCCGCCCGGGAAGTTTACTCCTCATAAGGACATTACTCTGCCGCCGGTTTCCGATTCGCCAGAAGTTTCTTCTTCCTGGATTCCGGCGCCGGCGGCGGAGAGGGTGCTTG
GTTTCGTCGGGTATGGGTGGGTGAGGGATCAGGTCTTGGTGAAGGAGTTGATTGAGGATCCTGAGTTTTTTATGGAGTCGGAGCCGCCGCCGTCGTCGTCGGAGCGGTGG
AATTACGGGGAGAGATTGGCGAAAAGCGACTTTTGCTTGTTCGAATACGGCGGCGGTGTTTCGGGGATTGGGGAGGCTTTGCGATGTGGGTGTGTGCCGGTGGTGATTTC
TGACCGTCCGATTCAGGACTTGCCGTTGATGGACGTGCTCCGGTGGCAGGACATGGCGGTGTTCGTCAACGGCGGCGGCGGCGGAATAGAAGGAGTGAAGAGGGTATTAA
GGCGCGTGGACGAGGAGAGTCTTGCGAAAATGAAGAGATTGGGTGCGGCGGCGGCACAGCATTTTGAGTGGAACTCGCCGCCCCAGCCGTTGGATGCTTTCAATACGGTG
GCGTATCAGCTTTGGCTGAGAAGGCACGCCGTTAGATATGCCGAGAGGAAGGAGTGGGCCCAGAGTTGA
Protein sequence
MASLISLSLLLSLSLLTTASPSPYLSPIFLKSYKSMSTTLRIFTYIPLKPFSFPSPAESLFYKSLLDSPYSTHNPDDAHLFFLPFPPHLSTRSLSRFIRTLRTDLPYWNR
TLGADHFFLSSAGVGHASDRNVVELKKNAIQVSGWPVPPGKFTPHKDITLPPVSDSPEVSSSWIPAPAAERVLGFVGYGWVRDQVLVKELIEDPEFFMESEPPPSSSERW
NYGERLAKSDFCLFEYGGGVSGIGEALRCGCVPVVISDRPIQDLPLMDVLRWQDMAVFVNGGGGGIEGVKRVLRRVDEESLAKMKRLGAAAAQHFEWNSPPQPLDAFNTV
AYQLWLRRHAVRYAERKEWAQS
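
The relationship between the CDS and protein sequences above can be checked with a short translation routine. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative only, and only short excerpts from the start and end of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 nucleotides of the CDS sequence listed above.
print(translate("ATGGCTTCCCTCATCTCTCTCTCTCTCCTC"))  # MASLISLSLL
print(translate("GAGTGGGCCCAGAGTTGA"))              # EWAQS
```

Both excerpts agree with the corresponding ends of the protein sequence. Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole record.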