; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0013647 (gene) of Chayote v1 genome

Gene IDSed0013647
OrganismSechium edule (Chayote v1)
DescriptionHexosyltransferase
Genome locationLG05:2925965..2931404
RNA-Seq ExpressionSed0013647
SyntenySed0013647
Gene Ontology termsGO:0010405 - arabinogalactan protein metabolic process (biological process)
GO:0018258 - protein O-linked glycosylation via hydroxyproline (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:1990714 - hydroxyproline O-galactosyltransferase activity (molecular function)
InterPro domainsIPR002659 - Glycosyl transferase, family 31
IPR025298 - Domain of unknown function DUF4094


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145914.1 hydroxyproline O-galactosyltransferase HPGT1 isoform X1 [Cucumis sativus]1.0e-17993.22Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK
        MRSKGSNARLS M  RSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLI ELDRLTG G SAISVDDTLKII CREQQKKLLALEM+LA ARQEGF+VK
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK

Query:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF
        HSRETNETK+PLVVIGV+TRFGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIG+SPNRGDSLDRAIDDEN QY+DFIIHNDHVEA EELSKKAKLF
Subjt:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
        FAYA+DKW+AEFYAKVNDDVYINIDALGSTLA+ LDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDV+TGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

XP_008437561.1 PREDICTED: LOW QUALITY PROTEIN: hydroxyproline O-galactosyltransferase HPGT1 [Cucumis melo]3.5e-18093.81Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK
        MRSKGSNARLS M  RSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLI ELDRLTGQG SAISVDDTLKII CREQQKKLLALEM+LA ARQEGF+VK
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK

Query:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF
        HSRETNETKIPLVVIGV+TRFGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIG+SPNRGDSLDRAIDDEN QY+DFIIHNDHVEA EELSKKAK F
Subjt:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
        FAYAVDKW+AEFYAKVNDDVYINIDALGSTLA+ LDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

XP_022922391.1 hydroxyproline O-galactosyltransferase HPGT1 [Cucurbita moschata]5.9e-18093.51Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK
        MRSKGSNARLSSM  RSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLI ELDRLTGQG SAISVDDTLKII CREQQKKLLALEM+LA ARQEGF VK
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK

Query:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF
        HSRETNETKIPLVVIGV+TRFGRK NRDAIRKAWMGTGASLRKMENQKGIIARFVIG+SPNRGDSLDRAIDDEN QY+DFIIHNDHVEA EELSKKAKLF
Subjt:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
        FAYAVDKWDAEFYAKVNDDVYINID LGSTLA+ LDKPRVY+GCMKSGEVFSEP+HKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINRSLLRS
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLD TYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

XP_023551313.1 hydroxyproline O-galactosyltransferase HPGT1 [Cucurbita pepo subsp. pepo]7.8e-18093.51Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK
        MRSKGSNARLSSM  RSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLI ELDRLTGQG SAISVDDTLKII CRE+QKKLLALEM+LA ARQEGF VK
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK

Query:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF
        HSRETNETKIPLVVIGV+TRFGRK NRDAIRKAWMGTGASLRKMENQKGIIARFVIG+SPNRGDSLDRAIDDEN QY+DFIIHNDHVEA EELSKKAKLF
Subjt:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
        FAYAVDKWDAEFYAKVNDDVYINID LGSTLA+ LDKPRVY+GCMKSGEVFSEPSHKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINRSLLRS
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLD TYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

XP_038874869.1 hydroxyproline O-galactosyltransferase HPGT1 [Benincasa hispida]2.4e-18194.4Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK
        MRSKGSNARLS M  RSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLI ELDRLTGQG SAISVDDTLKII CREQQKKLLALEM+LA ARQEGF VK
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK

Query:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF
        HSRETNETKIPLVVIGV+T FGRKNNRDAIRKAWMGTGASLRKME+QKGIIARFVIG+SPNRGDSLDRAIDDENRQY+DFIIHN+HVEA EELSKKAKLF
Subjt:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
        FAYAVDKWDAEFYAKVNDDVYINIDALGSTLA+ LDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVTTG+WFIGLDVTYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

TrEMBL top hitse value%identityAlignment
A0A0A0KJY1 Hexosyltransferase4.9e-18093.22Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK
        MRSKGSNARLS M  RSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLI ELDRLTG G SAISVDDTLKII CREQQKKLLALEM+LA ARQEGF+VK
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK

Query:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF
        HSRETNETK+PLVVIGV+TRFGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIG+SPNRGDSLDRAIDDEN QY+DFIIHNDHVEA EELSKKAKLF
Subjt:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
        FAYA+DKW+AEFYAKVNDDVYINIDALGSTLA+ LDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDV+TGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

A0A1S3ATZ9 Hexosyltransferase1.7e-18093.81Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK
        MRSKGSNARLS M  RSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLI ELDRLTGQG SAISVDDTLKII CREQQKKLLALEM+LA ARQEGF+VK
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK

Query:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF
        HSRETNETKIPLVVIGV+TRFGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIG+SPNRGDSLDRAIDDEN QY+DFIIHNDHVEA EELSKKAK F
Subjt:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
        FAYAVDKW+AEFYAKVNDDVYINIDALGSTLA+ LDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

A0A6J1DD78 Hexosyltransferase9.3e-17991.74Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK
        MRSKGSNARLS M  RSR+PTLLLSMFATFASIYVAGRLWQDAENRVYLI ELDRLTGQG SAISVDDTLKII CREQQKKL+ALEM+LA ARQEGF VK
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK

Query:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF
        HSRETNET+IPLVVIGV+TRFGRKNNRDAIRKAWMGTGASL+KMENQKGIIARFVIG+S NRGDSLDRAIDDENRQY+DF+IHNDHVEA EEL KKAKLF
Subjt:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
        FAYAVDKWDAEFYAKVNDDVYINIDALGSTLA+ LDKPRVYVGCMKSGEVFSEP HKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRS+LRS
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWF+GLDV YIDEGKFCCSSW+AGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

A0A6J1E406 Hexosyltransferase2.9e-18093.51Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK
        MRSKGSNARLSSM  RSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLI ELDRLTGQG SAISVDDTLKII CREQQKKLLALEM+LA ARQEGF VK
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK

Query:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF
        HSRETNETKIPLVVIGV+TRFGRK NRDAIRKAWMGTGASLRKMENQKGIIARFVIG+SPNRGDSLDRAIDDEN QY+DFIIHNDHVEA EELSKKAKLF
Subjt:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
        FAYAVDKWDAEFYAKVNDDVYINID LGSTLA+ LDKPRVY+GCMKSGEVFSEP+HKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINRSLLRS
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLD TYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

A0A6J1IBW9 Hexosyltransferase2.4e-17992.63Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK
        MRSKGSNARLSSM  RSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLI ELDRLTGQG SAISVDDTLKII CREQQKKLLALEM+LA ARQEGF VK
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVK

Query:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF
        HSRETNETK+PLVVIG++TRFGRK NRDAIRKAWMGTGASLRKME QKGIIARFVIG+SPNRGDSLDRAIDDEN QY+DFIIHNDHVEA EELSKKAKLF
Subjt:  HSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS
        FAYAVD+WDAEFYAKVNDDVYINID LGSTLA+ LDKPRVY+GCMKSGEVFSEPSHKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINRSLLRS
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRS

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLD TYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

SwissProt top hitse value%identityAlignment
Q5XEZ1 Hydroxyproline O-galactosyltransferase HPGT38.9e-8646.36Show/hide
Query:  ARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVKHSRETNE
        AR S  +  S  P+++++ F+  A +YVAGRLWQDAENRV L N L +   Q    ++VDD L ++ C++ +++++  EMEL  A+ +G+       ++ 
Subjt:  ARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVKHSRETNE

Query:  TKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDK
         K  L VIGV + FG    R+  R ++M  G +LRK+E ++GI+ RFVIG+SPNRGDSLDR ID+EN+   DF+I  +H EA EEL+KK K FF+ AV  
Subjt:  TKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDK

Query:  WDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVT
        WDAEFY KV+D++ ++++ L   L +   +   Y+GCMKSGEV +E   KWYEP+WWKFGD+K+YFRHA+G + ++SK LA++++IN   L++YA DD +
Subjt:  WDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVT

Query:  TGSWFIGLDVTYIDEGKFCCSSWSAGAICA
         GSW IG+  TYID+ + CCSS     +C+
Subjt:  TGSWFIGLDVTYIDEGKFCCSSWSAGAICA

Q94A05 Hydroxyproline O-galactosyltransferase HPGT22.4e-8345.91Show/hide
Query:  PTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVKHSRETNETKIPLVVIGVLT
        P+L+L+ F+  A +YVAGRLWQDA+ R  L   L     Q    ++V+D L ++ C++ +++++  EMELA A+ +G+  K    ++  K  L VIGV T
Subjt:  PTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVKHSRETNETKIPLVVIGVLT

Query:  RFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDKWDAEFYAKVNDD
         FG    R+  R +WM    +L+K+E ++G++ RFVIG+S NRGDSLDR ID+ENR   DF+I  +H EA EEL KK K F++ AV  WDAEFY KV+D+
Subjt:  RFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDKWDAEFYAKVNDD

Query:  VYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVTTGSWFIGLDVTY
        V ++++ + + L +   +   Y+GCMKSG+V +E   +WYEP+WWKFGD K+YFRHA+G + ++SK LA++++IN  LL++YA DD T GSW IG+  TY
Subjt:  VYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVTTGSWFIGLDVTY

Query:  IDEGKFCCSSWSAGAICA
        ID+ + CCSS     +C+
Subjt:  IDEGKFCCSSWSAGAICA

Q94F27 Hydroxyproline O-galactosyltransferase HPGT11.3e-14071.35Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGF---
        M  KGS+ RLSS    SRI TLLL MFATFAS YVAGRLWQ+++ RV+LINELDR+TGQG SAISVDDTLKII CREQ+K L ALEMEL++ARQEGF   
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGF---

Query:  SVKHSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKA
        S K +  T   K PLVVIG++T  G K  RDA+R+AWMGTGASL+K+E++KG+IARFVIG+S N+GDS+D++ID EN Q DDFII +D VEA EE SKK 
Subjt:  SVKHSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKA

Query:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL
        KLFFAYA D+WDA+FYAK  D++Y+NIDALG+TLAA+L+ PR Y+GCMKSGEVFSEP+HKWYEP+WWKFGDKK YFRHA GEMYVI+ ALA+F+SINR +
Subjt:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL

Query:  LRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        L SYAHDDV+TGSWF+GLDV ++DEGKFCCS+WS+ AICAGV
Subjt:  LRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

Q9MAP8 Beta-1,6-galactosyltransferase GALT31A1.9e-6439.72Show/hide
Query:  LSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAIS-VDDTLKIITCREQQKKLLALEMELATAR------QEGFSVKHS
        +SS      +   LL+ F T   I  A     D    +  + + +   G   S +S   D +K +      K + +LE+ELATAR      ++G      
Subjt:  LSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAIS-VDDTLKIITCREQQKKLLALEMELATAR------QEGFSVKHS

Query:  RETNETKI---PLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKL
           +++KI      V+G++T F  +  RD+IR  W+  G  L+++E +KGII RFVIG S + G  LD  I+ E  Q+ DF   N H+E   ELS K ++
Subjt:  RETNETKI---PLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKL

Query:  FFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDK-KTYFRHASGEMYVISKALAKFISINRSLL
        +F+ AV KWDA+FY KV+DDV++N+  LGSTLA +  KPRVY+GCMKSG V ++   K++EP++WKFG++   YFRHA+G++Y ISK LA +IS+NR LL
Subjt:  FFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDK-KTYFRHASGEMYVISKALAKFISINRSLL

Query:  RSYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-------------------WSAGAICAGV
          YA++DV+ GSWFIGLDV +ID+   CC +                   WS   IC  V
Subjt:  RSYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-------------------WSAGAICAGV

Q9ZV71 Probable beta-1,3-galactosyltransferase 32.5e-6447.08Show/hide
Query:  KKLLALEMELATARQEGFSVKH----SRETNETKIP-----LVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAI
        K + +LEMELA AR    S+ +    S +  + ++P     L+V+G+ T F  +  RD++R  WM +G   +K+E +KGII RFVIG S   G  LDR+I
Subjt:  KKLLALEMELATARQEGFSVKH----SRETNETKIP-----LVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAI

Query:  DDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDK-
        + E++++ DF +  DHVE   ELS K K +F+ AV KWDAEFY KV+DDV++NI  LG TL  +  K RVY+GCMKSG V S+   +++EP++WKFG+  
Subjt:  DDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDK-

Query:  KTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-----W--SAGAIC
          YFRHA+G++Y IS+ LA +IS+N+ +L  YA++DVT G+WFIGLDVT+ID+ + CC +     W   AG IC
Subjt:  KTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-----W--SAGAIC

Arabidopsis top hitse value%identityAlignment
AT1G05170.1 Galactosyltransferase family protein8.0e-6638.38Show/hide
Query:  SSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIIT--CREQQKK-------------------------LL
        SS +F SR  T+LL + +    ++   R+W   E++           G  H +++  + LK+++  C  + K+                         + 
Subjt:  SSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIIT--CREQQKK-------------------------LL

Query:  ALEMELATARQEGFSVKH---------SRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDEN
        +LEMELA AR    S+++          ++  E +  L+V+G+ T F  +  RD+IR  WM  G   +++E +KGII RFVIG S   G  LDRAI+ E+
Subjt:  ALEMELATARQEGFSVKH---------SRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDEN

Query:  RQYDDFIIHNDHVEASEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDK-KTYF
        R++ DF +  DHVE   ELS K K +F+ A   WDA+FY KV+DDV++NI  LG TL  +  KPRVY+GCMKSG V S+   +++EP++WKFG+    YF
Subjt:  RQYDDFIIHNDHVEASEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDK-KTYF

Query:  RHASGEMYVISKALAKFISINRSLLRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-----W--SAGAIC
        RHA+G++Y IS+ LA +ISIN+ +L  YA++DV+ G+WFIG+DV +ID+ + CC +     W   AG IC
Subjt:  RHASGEMYVISKALAKFISINRSLLRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-----W--SAGAIC

AT2G25300.1 Galactosyltransferase family protein6.3e-8746.36Show/hide
Query:  ARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVKHSRETNE
        AR S  +  S  P+++++ F+  A +YVAGRLWQDAENRV L N L +   Q    ++VDD L ++ C++ +++++  EMEL  A+ +G+       ++ 
Subjt:  ARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVKHSRETNE

Query:  TKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDK
         K  L VIGV + FG    R+  R ++M  G +LRK+E ++GI+ RFVIG+SPNRGDSLDR ID+EN+   DF+I  +H EA EEL+KK K FF+ AV  
Subjt:  TKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDK

Query:  WDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVT
        WDAEFY KV+D++ ++++ L   L +   +   Y+GCMKSGEV +E   KWYEP+WWKFGD+K+YFRHA+G + ++SK LA++++IN   L++YA DD +
Subjt:  WDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVT

Query:  TGSWFIGLDVTYIDEGKFCCSSWSAGAICA
         GSW IG+  TYID+ + CCSS     +C+
Subjt:  TGSWFIGLDVTYIDEGKFCCSSWSAGAICA

AT4G32120.1 Galactosyltransferase family protein1.7e-8445.91Show/hide
Query:  PTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVKHSRETNETKIPLVVIGVLT
        P+L+L+ F+  A +YVAGRLWQDA+ R  L   L     Q    ++V+D L ++ C++ +++++  EMELA A+ +G+  K    ++  K  L VIGV T
Subjt:  PTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVKHSRETNETKIPLVVIGVLT

Query:  RFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDKWDAEFYAKVNDD
         FG    R+  R +WM    +L+K+E ++G++ RFVIG+S NRGDSLDR ID+ENR   DF+I  +H EA EEL KK K F++ AV  WDAEFY KV+D+
Subjt:  RFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDKWDAEFYAKVNDD

Query:  VYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVTTGSWFIGLDVTY
        V ++++ + + L +   +   Y+GCMKSG+V +E   +WYEP+WWKFGD K+YFRHA+G + ++SK LA++++IN  LL++YA DD T GSW IG+  TY
Subjt:  VYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVTTGSWFIGLDVTY

Query:  IDEGKFCCSSWSAGAICA
        ID+ + CCSS     +C+
Subjt:  IDEGKFCCSSWSAGAICA

AT5G53340.1 Galactosyltransferase family protein8.9e-14271.35Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGF---
        M  KGS+ RLSS    SRI TLLL MFATFAS YVAGRLWQ+++ RV+LINELDR+TGQG SAISVDDTLKII CREQ+K L ALEMEL++ARQEGF   
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGF---

Query:  SVKHSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKA
        S K +  T   K PLVVIG++T  G K  RDA+R+AWMGTGASL+K+E++KG+IARFVIG+S N+GDS+D++ID EN Q DDFII +D VEA EE SKK 
Subjt:  SVKHSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKA

Query:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL
        KLFFAYA D+WDA+FYAK  D++Y+NIDALG+TLAA+L+ PR Y+GCMKSGEVFSEP+HKWYEP+WWKFGDKK YFRHA GEMYVI+ ALA+F+SINR +
Subjt:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL

Query:  LRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        L SYAHDDV+TGSWF+GLDV ++DEGKFCCS+WS+ AICAGV
Subjt:  LRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

AT5G53340.2 Galactosyltransferase family protein1.3e-14071.35Show/hide
Query:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGF---
        M  KGS+ RLSS    SRI TLLL MFATFAS YVAGRLWQ+++ RV+LINELDR+TGQG SAISVDDTLKII CREQ+K L ALEMEL++ARQEGF   
Subjt:  MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGF---

Query:  SVKHSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKA
        S K +  T   K PLVVIG++T  G K  RDA+R+AWMGTGASL+K+E++KG+IARFVIG+S N+GDS+D++ID EN Q DDFII +D VEA EE SKK 
Subjt:  SVKHSRETNETKIPLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKA

Query:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL
        KLFFAYA D+WDA+FYAK  D++Y+NIDALG+TLAA+L+ PR Y+GCMKSGEVFSEP+HKWYEP+WWKFGDKK YFRHA GEMYVI+ ALA+F+SINR +
Subjt:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL

Query:  LRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        L SYAHDDV+TGSWF+GLDV ++DEGKFCCS+WS+ AICAGV
Subjt:  LRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGAGTAAGGGATCGAATGCTCGCCTCTCGAGCATGGCTTTTCGATCCCGAATTCCTACCCTTTTGCTCTCCATGTTCGCTACCTTCGCTTCGATCTACGTCGCCGG
CCGGCTATGGCAGGATGCGGAGAATAGGGTTTACTTGATTAATGAACTTGATAGGCTAACTGGTCAGGGACACTCTGCCATTTCGGTGGATGATACATTAAAAATAATAA
CCTGCAGGGAACAACAGAAGAAGTTGTTGGCGCTTGAGATGGAATTGGCTACGGCTAGACAGGAGGGTTTTTCGGTGAAGCATTCAAGAGAGACTAATGAAACAAAGATC
CCCTTGGTTGTAATTGGAGTCCTCACTAGATTTGGCCGTAAAAACAACAGAGATGCAATTCGTAAAGCATGGATGGGAACTGGCGCTTCTTTGAGAAAAATGGAGAATCA
GAAGGGCATAATTGCTAGATTTGTCATTGGAAAAAGTCCAAACCGTGGGGACAGTTTAGACAGGGCCATTGACGATGAAAACAGACAATATGATGATTTCATTATACATA
ACGACCATGTGGAGGCGTCGGAGGAGCTTTCCAAGAAGGCCAAGCTTTTTTTTGCCTATGCTGTTGATAAATGGGATGCTGAGTTTTATGCCAAGGTCAATGATGATGTA
TATATAAATATTGATGCCCTAGGGAGCACACTTGCTGCTAACTTGGACAAACCTCGTGTCTATGTTGGATGCATGAAATCTGGTGAAGTATTCTCAGAACCGAGCCATAA
ATGGTACGAACCAGATTGGTGGAAGTTTGGTGATAAAAAAACATACTTCCGTCATGCTTCTGGCGAAATGTATGTCATATCAAAAGCTCTGGCCAAGTTTATTTCAATAA
ACAGATCTCTTCTCCGTTCTTACGCCCACGACGATGTCACGACTGGCTCCTGGTTTATCGGGCTTGATGTCACATACATTGATGAGGGAAAGTTTTGCTGCTCTTCTTGG
TCTGCAGGTGCTATTTGTGCGGGTGTCTGA
mRNA sequenceShow/hide mRNA sequence
AAATTGTTATTTTCGAACAAAATTCAGATTTTGTAGGTTCGAAATTTAGGAAGAAATTGACTTTCCTTAGAACATTACCACAGCAAAACGTAACAGTGCTTCAGTTTTGT
GTTTGAGCAATTAGTTGCGTTTGTTGTCACTTACCAAATTTGATTATGAGATTTTAAACTGCACAATTTCAATCTGTTCATCACCATCTCGGAAATGGACTGAAAATTTT
CACCAAATCGCAAAATCAATCTGATTCTTCTTTGCTCTCATCGAATTTCTCGTCCAGCTTCGTTTTCTCTGAAATCAAATCGATTCCGCAGTTCAATTTCAACCCCGAAG
CAAGATTCAGACTCGATTTTGTGATCTTCAAGATCTGATAAATGCGTCAGGTTCATTTTAGGGTTTGAAGTTGAAGACGAAGCCGATCTGTTGCTGTAAAGATGCGGAGT
AAGGGATCGAATGCTCGCCTCTCGAGCATGGCTTTTCGATCCCGAATTCCTACCCTTTTGCTCTCCATGTTCGCTACCTTCGCTTCGATCTACGTCGCCGGCCGGCTATG
GCAGGATGCGGAGAATAGGGTTTACTTGATTAATGAACTTGATAGGCTAACTGGTCAGGGACACTCTGCCATTTCGGTGGATGATACATTAAAAATAATAACCTGCAGGG
AACAACAGAAGAAGTTGTTGGCGCTTGAGATGGAATTGGCTACGGCTAGACAGGAGGGTTTTTCGGTGAAGCATTCAAGAGAGACTAATGAAACAAAGATCCCCTTGGTT
GTAATTGGAGTCCTCACTAGATTTGGCCGTAAAAACAACAGAGATGCAATTCGTAAAGCATGGATGGGAACTGGCGCTTCTTTGAGAAAAATGGAGAATCAGAAGGGCAT
AATTGCTAGATTTGTCATTGGAAAAAGTCCAAACCGTGGGGACAGTTTAGACAGGGCCATTGACGATGAAAACAGACAATATGATGATTTCATTATACATAACGACCATG
TGGAGGCGTCGGAGGAGCTTTCCAAGAAGGCCAAGCTTTTTTTTGCCTATGCTGTTGATAAATGGGATGCTGAGTTTTATGCCAAGGTCAATGATGATGTATATATAAAT
ATTGATGCCCTAGGGAGCACACTTGCTGCTAACTTGGACAAACCTCGTGTCTATGTTGGATGCATGAAATCTGGTGAAGTATTCTCAGAACCGAGCCATAAATGGTACGA
ACCAGATTGGTGGAAGTTTGGTGATAAAAAAACATACTTCCGTCATGCTTCTGGCGAAATGTATGTCATATCAAAAGCTCTGGCCAAGTTTATTTCAATAAACAGATCTC
TTCTCCGTTCTTACGCCCACGACGATGTCACGACTGGCTCCTGGTTTATCGGGCTTGATGTCACATACATTGATGAGGGAAAGTTTTGCTGCTCTTCTTGGTCTGCAGGT
GCTATTTGTGCGGGTGTCTGATTTGCTTGCTTGAAGATCCCTGAAATACAAAACTGAAGAGAACACAGAACTAATTTATAGCTCTTTTATGTCATTGTGTCCTTGTGTGT
GGCTAGGCGAAAGAAGGTAGCTCTTCCAACACTACCTTCGGATTCTTACGACAACATCGAATTCTTGAAAGACTGGCCAGAGATTCAACAGTTTGTAAGATGTGTTACAC
AGAAACATTACCAGTTGAAGGTTCCTTGGATTTCACCTCTCTGACATTCGTTCTTATATGTGATTTTGCTGTTCATTAGCACATCAGATGTATGTTAATTCTTTCAGGCG
AGTTGGACCTTGAGAAATGTTGGTTTAAAGGCACATTTTCTATGAAAATAGCACCTATTATCGAGCA
Protein sequenceShow/hide protein sequence
MRSKGSNARLSSMAFRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLINELDRLTGQGHSAISVDDTLKIITCREQQKKLLALEMELATARQEGFSVKHSRETNETKI
PLVVIGVLTRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGKSPNRGDSLDRAIDDENRQYDDFIIHNDHVEASEELSKKAKLFFAYAVDKWDAEFYAKVNDDV
YINIDALGSTLAANLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRSYAHDDVTTGSWFIGLDVTYIDEGKFCCSSW
SAGAICAGV