CuGenDBv2

Tan0012036 (gene) of Snake gourd v1 genome

Gene ID: Tan0012036
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Hexosyltransferase
Genome location: LG07:8180867..8185693
RNA-Seq Expression: Tan0012036
Synteny: Tan0012036
Gene Ontology terms:
GO:0010405 - arabinogalactan protein metabolic process (biological process)
GO:0018258 - protein O-linked glycosylation via hydroxyproline (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:1990714 - hydroxyproline O-galactosyltransferase activity (molecular function)
InterPro domains:
IPR002659 - Glycosyl transferase, family 31
IPR025298 - Domain of unknown function DUF4094
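The genome location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(text):
    """Split a 'LG07:8180867..8185693' style string into (sequence_id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seq_id, start, end = parse_location("LG07:8180867..8185693")
print(seq_id, start, end, span_length(start, end))
```

Under that assumption the gene spans 4827 bp on LG07, which is longer than the mRNA listed further down and would be consistent with the presence of introns.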


Homology
GenBank top hits | e value | % identity | Alignment
XP_004145914.1 hydroxyproline O-galactosyltransferase HPGT1 isoform X1 [Cucumis sativus] | e value: 7.5e-183 | % identity: 94.4
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        MRSKGSNARLS MPIRSRI TL LSMFATFASIYVAGRLWQDAENRVYLIKELDRLTG GQSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF VK
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF
        HSRETNETK+PLVVIGV+TRFGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIGRSPNRGDSLDRAIDDEN QY+DFIIHNDHVE+PEELSKK KLF
Subjt:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA
        FAYA+DKW+AEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLR+
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDV+TGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
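In these alignment blocks the middle line marks each column: a letter is an identical residue, `+` is a conservative substitution, and a space is a mismatch or gap. As a rough sketch of how the identity of a single segment could be counted from that middle line (the helper name `midline_identity` is hypothetical, and the result covers only the segment passed in, not the whole-alignment % identity reported in the hit line):

```python
def midline_identity(midline):
    """Return (identical, aligned) column counts for one BLAST-style midline segment.

    Assumes the string holds exactly the aligned columns, with nothing
    stripped from either end.
    """
    aligned = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, aligned


# Midline of the last segment of the XP_004145914.1 alignment above.
identical, aligned = midline_identity("YAHDDV+TGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV")
print(f"{identical}/{aligned} = {100 * identical / aligned:.2f}%")
```

Summing the two counts over every segment of a hit, rather than looking at one segment, is what would be comparable to the reported % identity.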

XP_008437561.1 PREDICTED: LOW QUALITY PROTEIN: hydroxyproline O-galactosyltransferase HPGT1 [Cucumis melo] | e value: 2.6e-183 | % identity: 95.28
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        MRSKGSNARLS MPIRSRI TL LSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF VK
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF
        HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIGRSPNRGDSLDRAIDDEN QY+DFIIHNDHVE+PEELSKK K F
Subjt:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA
        FAYAVDKW+AEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLR+
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

XP_022922391.1 hydroxyproline O-galactosyltransferase HPGT1 [Cucurbita moschata] | e value: 1.2e-183 | % identity: 94.99
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        MRSKGSNARLSSMPIRSRI TL LSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF+VK
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF
        HSRETNETKIPLVVIGVITRFGRK NRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDEN QY+DFIIHNDHVE+PEELSKK KLF
Subjt:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA
        FAYAVDKWDAEFYAKVNDDVYINID LGSTLASYLDKPRVY+GCMKSGEVFSEP+HKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINRSLLR+
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLD TYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

XP_023551313.1 hydroxyproline O-galactosyltransferase HPGT1 [Cucurbita pepo subsp. pepo] | e value: 1.5e-183 | % identity: 94.99
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        MRSKGSNARLSSMPIRSRI TL LSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACRE+QKKLLALEM+LA ARQEGF+VK
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF
        HSRETNETKIPLVVIGVITRFGRK NRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDEN QY+DFIIHNDHVE+PEELSKK KLF
Subjt:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA
        FAYAVDKWDAEFYAKVNDDVYINID LGSTLASYLDKPRVY+GCMKSGEVFSEPSHKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINRSLLR+
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLD TYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

XP_038874869.1 hydroxyproline O-galactosyltransferase HPGT1 [Benincasa hispida] | e value: 4.7e-185 | % identity: 95.87
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        MRSKGSNARLS MPIRSRI TL LSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF+VK
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF
        HSRETNETKIPLVVIGVIT FGRKNNRDAIRKAWMGTGASLRKME+QKGIIARFVIGRSPNRGDSLDRAIDDENRQY+DFIIHN+HVE+PEELSKK KLF
Subjt:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA
        FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLR+
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVTTG+WFIGLDVTYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KJY1 Hexosyltransferase | e value: 3.6e-183 | % identity: 94.4
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        MRSKGSNARLS MPIRSRI TL LSMFATFASIYVAGRLWQDAENRVYLIKELDRLTG GQSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF VK
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF
        HSRETNETK+PLVVIGV+TRFGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIGRSPNRGDSLDRAIDDEN QY+DFIIHNDHVE+PEELSKK KLF
Subjt:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA
        FAYA+DKW+AEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLR+
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDV+TGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

A0A1S3ATZ9 Hexosyltransferase | e value: 1.2e-183 | % identity: 95.28
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        MRSKGSNARLS MPIRSRI TL LSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF VK
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF
        HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIGRSPNRGDSLDRAIDDEN QY+DFIIHNDHVE+PEELSKK K F
Subjt:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA
        FAYAVDKW+AEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLR+
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

A0A6J1DD78 Hexosyltransferase | e value: 8.1e-183 | % identity: 93.51
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        MRSKGSNARLS MPIRSR+ TL LSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKL+ALEM+LA ARQEGFLVK
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF
        HSRETNET+IPLVVIGVITRFGRKNNRDAIRKAWMGTGASL+KMENQKGIIARFVIGRS NRGDSLDRAIDDENRQY+DF+IHNDHVE+PEEL KK KLF
Subjt:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA
        FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEP HKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRS+LR+
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWF+GLDV YIDEGKFCCSSW+AGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

A0A6J1E406 Hexosyltransferase | e value: 5.6e-184 | % identity: 94.99
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        MRSKGSNARLSSMPIRSRI TL LSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF+VK
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF
        HSRETNETKIPLVVIGVITRFGRK NRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDEN QY+DFIIHNDHVE+PEELSKK KLF
Subjt:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA
        FAYAVDKWDAEFYAKVNDDVYINID LGSTLASYLDKPRVY+GCMKSGEVFSEP+HKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINRSLLR+
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLD TYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

A0A6J1IBW9 Hexosyltransferase | e value: 4.7e-183 | % identity: 94.1
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        MRSKGSNARLSSMPIRSRI TL LSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF+VK
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF
        HSRETNETK+PLVVIG+ITRFGRK NRDAIRKAWMGTGASLRKME QKGIIARFVIGRSPNRGDSLDRAIDDEN QY+DFIIHNDHVE+PEELSKK KLF
Subjt:  HSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLF

Query:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA
        FAYAVD+WDAEFYAKVNDDVYINID LGSTLASYLDKPRVY+GCMKSGEVFSEPSHKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINRSLLR+
Subjt:  FAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRA

Query:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        YAHDDVT GSWFIGLD TYIDEGKFCCSSWSAGAICAGV
Subjt:  YAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

SwissProt top hits | e value | % identity | Alignment
Q5XEZ1 Hydroxyproline O-galactosyltransferase HPGT3 | e value: 1.7e-84 | % identity: 46.36
Query:  ARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNE
        AR S     S   ++ ++ F+  A +YVAGRLWQDAENRV L   L +   Q    ++VDD L ++ C++ +++++  EMEL +A+ +G+L      ++ 
Subjt:  ARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNE

Query:  TKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFFAYAVDK
         K  L VIGV + FG    R+  R ++M  G +LRK+E ++GI+ RFVIGRSPNRGDSLDR ID+EN+   DF+I  +H E+ EEL+KK K FF+ AV  
Subjt:  TKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFFAYAVDK

Query:  WDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRAYAHDDVT
        WDAEFY KV+D++ ++++ L   L S   +   Y+GCMKSGEV +E   KWYEP+WWKFGD+K+YFRHA+G + ++SK LA++++IN   L+ YA DD +
Subjt:  WDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRAYAHDDVT

Query:  TGSWFIGLDVTYIDEGKFCCSSWSAGAICA
         GSW IG+  TYID+ + CCSS     +C+
Subjt:  TGSWFIGLDVTYIDEGKFCCSSWSAGAICA

Q94A05 Hydroxyproline O-galactosyltransferase HPGT2 | e value: 2.7e-82 | % identity: 44.94
Query:  RSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKH
        R + S ++ +S P      +L L+ F+  A +YVAGRLWQDA+ R  L   L     Q    ++V+D L ++ C++ +++++  EMELA A+ +G+L K 
Subjt:  RSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKH

Query:  SRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFF
           ++  K  L VIGV T FG    R+  R +WM    +L+K+E ++G++ RFVIGRS NRGDSLDR ID+ENR   DF+I  +H E+ EEL KK K F+
Subjt:  SRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFF

Query:  AYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRAY
        + AV  WDAEFY KV+D+V ++++ + + L S   +   Y+GCMKSG+V +E   +WYEP+WWKFGD K+YFRHA+G + ++SK LA++++IN  LL+ Y
Subjt:  AYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRAY

Query:  AHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICA
        A DD T GSW IG+  TYID+ + CCSS     +C+
Subjt:  AHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICA

Q94F27 Hydroxyproline O-galactosyltransferase HPGT1 | e value: 2.8e-140 | % identity: 70.76
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        M  KGS+ RLSS    SRISTL L MFATFAS YVAGRLWQ+++ RV+LI ELDR+TGQG+SAISVDDTLKIIACREQ+K L ALEMEL+ ARQEGF+ K
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSR---ETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKG
          +    T   K PLVVIG++T  G K  RDA+R+AWMGTGASL+K+E++KG+IARFVIGRS N+GDS+D++ID EN Q DDFII +D VE+PEE SKK 
Subjt:  HSR---ETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKG

Query:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL
        KLFFAYA D+WDA+FYAK  D++Y+NIDALG+TLA++L+ PR Y+GCMKSGEVFSEP+HKWYEP+WWKFGDKK YFRHA GEMYVI+ ALA+F+SINR +
Subjt:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL

Query:  LRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        L +YAHDDV+TGSWF+GLDV ++DEGKFCCS+WS+ AICAGV
Subjt:  LRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

Q9MAP8 Beta-1,6-galactosyltransferase GALT31A | e value: 4.7e-63 | % identity: 39.17
Query:  LSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAIS-VDDTLKIIACREQQKKLLALEMELAVAR------QEGFLVKHS
        +SS  +   +    L+ F T   I  A     D    +  + + +   G   S +S   D +K +      K + +LE+ELA AR      ++G      
Subjt:  LSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAIS-VDDTLKIIACREQQKKLLALEMELAVAR------QEGFLVKHS

Query:  RETNETKI---PLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKL
           +++KI      V+G++T F  +  RD+IR  W+  G  L+++E +KGII RFVIG S + G  LD  I+ E  Q+ DF   N H+E   ELS K ++
Subjt:  RETNETKI---PLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKL

Query:  FFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDK-KTYFRHASGEMYVISKALAKFISINRSLL
        +F+ AV KWDA+FY KV+DDV++N+  LGSTLA +  KPRVY+GCMKSG V ++   K++EP++WKFG++   YFRHA+G++Y ISK LA +IS+NR LL
Subjt:  FFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDK-KTYFRHASGEMYVISKALAKFISINRSLL

Query:  RAYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-------------------WSAGAICAGV
          YA++DV+ GSWFIGLDV +ID+   CC +                   WS   IC  V
Subjt:  RAYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-------------------WSAGAICAGV

Q9ZV71 Probable beta-1,3-galactosyltransferase 3 | e value: 4.3e-64 | % identity: 46.79
Query:  ACREQQKKLLALEMELAVAR--QEGFL--VKHSRETNETKIP-----LVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGD
        A +   K + +LEMELA AR  QE  +     S +  + ++P     L+V+G+ T F  +  RD++R  WM +G   +K+E +KGII RFVIG S   G 
Subjt:  ACREQQKKLLALEMELAVAR--QEGFL--VKHSRETNETKIP-----LVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGD

Query:  SLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWW
         LDR+I+ E++++ DF +  DHVE   ELS K K +F+ AV KWDAEFY KV+DDV++NI  LG TL  +  K RVY+GCMKSG V S+   +++EP++W
Subjt:  SLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWW

Query:  KFGDK-KTYFRHASGEMYVISKALAKFISINRSLLRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-----W--SAGAIC
        KFG+    YFRHA+G++Y IS+ LA +IS+N+ +L  YA++DVT G+WFIGLDVT+ID+ + CC +     W   AG IC
Subjt:  KFGDK-KTYFRHASGEMYVISKALAKFISINRSLLRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-----W--SAGAIC

Arabidopsis top hits | e value | % identity | Alignment
AT1G77810.2 Galactosyltransferase family protein | e value: 2.3e-65 | % identity: 46.4
Query:  DDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSL
        D T +++   E  + L      L+  R    +V  S ETN  K   +V+G+ T F  +  RD++R+ WM  G  L ++E +KGI+ +F+IG S      L
Subjt:  DDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSL

Query:  DRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKF
        DRAID E+ Q+ DF +  +HVE   ELS K K+FF+ AV KWDAEFY KV+DDV++N+  L STLA +  KPRVY+GCMKSG V ++ + K++EP++WKF
Subjt:  DRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKF

Query:  G-DKKTYFRHASGEMYVISKALAKFISINRSLLRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-----W--SAGAIC
        G D   YFRHA+G++Y ISK LA +ISIN+ +L  YA++DV+ GSWFIGL+V +ID+  FCC +     W   AG +C
Subjt:  G-DKKTYFRHASGEMYVISKALAKFISINRSLLRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSS-----W--SAGAIC

AT2G25300.1 Galactosyltransferase family protein | e value: 1.2e-85 | % identity: 46.36
Query:  ARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNE
        AR S     S   ++ ++ F+  A +YVAGRLWQDAENRV L   L +   Q    ++VDD L ++ C++ +++++  EMEL +A+ +G+L      ++ 
Subjt:  ARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNE

Query:  TKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFFAYAVDK
         K  L VIGV + FG    R+  R ++M  G +LRK+E ++GI+ RFVIGRSPNRGDSLDR ID+EN+   DF+I  +H E+ EEL+KK K FF+ AV  
Subjt:  TKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFFAYAVDK

Query:  WDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRAYAHDDVT
        WDAEFY KV+D++ ++++ L   L S   +   Y+GCMKSGEV +E   KWYEP+WWKFGD+K+YFRHA+G + ++SK LA++++IN   L+ YA DD +
Subjt:  WDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRAYAHDDVT

Query:  TGSWFIGLDVTYIDEGKFCCSSWSAGAICA
         GSW IG+  TYID+ + CCSS     +C+
Subjt:  TGSWFIGLDVTYIDEGKFCCSSWSAGAICA

AT4G32120.1 Galactosyltransferase family protein | e value: 1.9e-83 | % identity: 44.94
Query:  RSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKH
        R + S ++ +S P      +L L+ F+  A +YVAGRLWQDA+ R  L   L     Q    ++V+D L ++ C++ +++++  EMELA A+ +G+L K 
Subjt:  RSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKH

Query:  SRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFF
           ++  K  L VIGV T FG    R+  R +WM    +L+K+E ++G++ RFVIGRS NRGDSLDR ID+ENR   DF+I  +H E+ EEL KK K F+
Subjt:  SRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFF

Query:  AYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRAY
        + AV  WDAEFY KV+D+V ++++ + + L S   +   Y+GCMKSG+V +E   +WYEP+WWKFGD K+YFRHA+G + ++SK LA++++IN  LL+ Y
Subjt:  AYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRAY

Query:  AHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICA
        A DD T GSW IG+  TYID+ + CCSS     +C+
Subjt:  AHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICA

AT5G53340.1 Galactosyltransferase family protein | e value: 2.0e-141 | % identity: 70.76
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        M  KGS+ RLSS    SRISTL L MFATFAS YVAGRLWQ+++ RV+LI ELDR+TGQG+SAISVDDTLKIIACREQ+K L ALEMEL+ ARQEGF+ K
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSR---ETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKG
          +    T   K PLVVIG++T  G K  RDA+R+AWMGTGASL+K+E++KG+IARFVIGRS N+GDS+D++ID EN Q DDFII +D VE+PEE SKK 
Subjt:  HSR---ETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKG

Query:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL
        KLFFAYA D+WDA+FYAK  D++Y+NIDALG+TLA++L+ PR Y+GCMKSGEVFSEP+HKWYEP+WWKFGDKK YFRHA GEMYVI+ ALA+F+SINR +
Subjt:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL

Query:  LRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        L +YAHDDV+TGSWF+GLDV ++DEGKFCCS+WS+ AICAGV
Subjt:  LRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV

AT5G53340.2 Galactosyltransferase family protein | e value: 2.9e-140 | % identity: 70.76
Query:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK
        M  KGS+ RLSS    SRISTL L MFATFAS YVAGRLWQ+++ RV+LI ELDR+TGQG+SAISVDDTLKIIACREQ+K L ALEMEL+ ARQEGF+ K
Subjt:  MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVK

Query:  HSR---ETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKG
          +    T   K PLVVIG++T  G K  RDA+R+AWMGTGASL+K+E++KG+IARFVIGRS N+GDS+D++ID EN Q DDFII +D VE+PEE SKK 
Subjt:  HSR---ETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKG

Query:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL
        KLFFAYA D+WDA+FYAK  D++Y+NIDALG+TLA++L+ PR Y+GCMKSGEVFSEP+HKWYEP+WWKFGDKK YFRHA GEMYVI+ ALA+F+SINR +
Subjt:  KLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL

Query:  LRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV
        L +YAHDDV+TGSWF+GLDV ++DEGKFCCS+WS+ AICAGV
Subjt:  LRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSSWSAGAICAGV


Sequences
CDS sequence
ATGCGGAGTAAGGGATCGAATGCTCGCCTGTCGAGCATGCCTATTCGATCCCGAATTTCCACCCTTTTTCTCTCCATGTTCGCTACCTTTGCTTCAATCTACGTCGCCGG
CCGGTTGTGGCAGGATGCGGAGAACAGGGTTTACTTGATTAAAGAGCTGGATAGGCTAACTGGTCAGGGACAATCTGCCATTTCAGTGGACGATACATTAAAAATCATAG
CCTGCAGGGAACAACAGAAGAAGTTGTTGGCGCTTGAGATGGAATTGGCTGTTGCTAGACAGGAGGGTTTTTTGGTGAAGCATTCAAGAGAGACTAATGAAACAAAGATC
CCCTTGGTTGTAATTGGAGTCATCACTAGATTTGGCCGGAAAAACAACAGAGATGCAATTCGTAAAGCATGGATGGGAACTGGCGCTTCTTTGAGAAAAATGGAGAATCA
GAAGGGCATAATTGCTAGATTTGTCATTGGAAGAAGTCCAAACCGCGGGGACAGTTTAGACAGGGCCATTGACGATGAAAACAGACAATATGATGATTTTATTATACATA
ATGACCATGTGGAGTCGCCTGAGGAGCTTTCAAAGAAGGGCAAGCTTTTTTTTGCTTATGCCGTTGATAAATGGGATGCCGAATTTTATGCCAAAGTCAATGATGATGTT
TATATAAATATTGATGCCCTAGGGAGCACACTTGCTTCTTACTTGGACAAACCTCGTGTCTATGTTGGGTGCATGAAATCTGGTGAAGTATTCTCAGAACCGAGCCATAA
GTGGTACGAACCAGATTGGTGGAAGTTTGGTGATAAAAAAACATACTTCCGCCATGCTTCTGGCGAAATGTATGTCATATCGAAAGCTCTGGCTAAGTTTATTTCAATAA
ACAGATCTCTTCTCCGTGCTTACGCCCATGACGATGTCACGACCGGTTCCTGGTTTATTGGGCTTGATGTCACGTACATTGATGAAGGGAAGTTTTGCTGCTCTTCTTGG
TCTGCAGGAGCTATTTGTGCAGGTGTCTGA
mRNA sequence
GAAATGTTTTACCCGCCTAACAGTGTTTTTTGTTTTTGTGTTTAAGCAATTAATTGCGTTTTTGTCACTTACCAAATTCGATGATCAGATTTTGAATTTATAATTTCAAT
CTGTTCATCTCCAATTCCATTAATCTCTGGAATCGAGTGAAATTTTTCACCAACCGCAAAATCAATCTTCTCCTTCTTTGCTCTCATCGAATTTCTCATCCAGTCTCAGT
CTTGTCCACTTATCAGCTCTTTGTTCCGAAATCCAATCGATTCTGCGCATCAATTTCATTCCCGAAGCAAGTTTCAGCGTCGATTTTGTGATCTTCAAGATCTGTTAAAT
GCATCAGGTTTGATCGTTTTGGTATTTCAAGGTTGTAGACGAAGCCGATTAGTTGCTGTGAAGATGCGGAGTAAGGGATCGAATGCTCGCCTGTCGAGCATGCCTATTCG
ATCCCGAATTTCCACCCTTTTTCTCTCCATGTTCGCTACCTTTGCTTCAATCTACGTCGCCGGCCGGTTGTGGCAGGATGCGGAGAACAGGGTTTACTTGATTAAAGAGC
TGGATAGGCTAACTGGTCAGGGACAATCTGCCATTTCAGTGGACGATACATTAAAAATCATAGCCTGCAGGGAACAACAGAAGAAGTTGTTGGCGCTTGAGATGGAATTG
GCTGTTGCTAGACAGGAGGGTTTTTTGGTGAAGCATTCAAGAGAGACTAATGAAACAAAGATCCCCTTGGTTGTAATTGGAGTCATCACTAGATTTGGCCGGAAAAACAA
CAGAGATGCAATTCGTAAAGCATGGATGGGAACTGGCGCTTCTTTGAGAAAAATGGAGAATCAGAAGGGCATAATTGCTAGATTTGTCATTGGAAGAAGTCCAAACCGCG
GGGACAGTTTAGACAGGGCCATTGACGATGAAAACAGACAATATGATGATTTTATTATACATAATGACCATGTGGAGTCGCCTGAGGAGCTTTCAAAGAAGGGCAAGCTT
TTTTTTGCTTATGCCGTTGATAAATGGGATGCCGAATTTTATGCCAAAGTCAATGATGATGTTTATATAAATATTGATGCCCTAGGGAGCACACTTGCTTCTTACTTGGA
CAAACCTCGTGTCTATGTTGGGTGCATGAAATCTGGTGAAGTATTCTCAGAACCGAGCCATAAGTGGTACGAACCAGATTGGTGGAAGTTTGGTGATAAAAAAACATACT
TCCGCCATGCTTCTGGCGAAATGTATGTCATATCGAAAGCTCTGGCTAAGTTTATTTCAATAAACAGATCTCTTCTCCGTGCTTACGCCCATGACGATGTCACGACCGGT
TCCTGGTTTATTGGGCTTGATGTCACGTACATTGATGAAGGGAAGTTTTGCTGCTCTTCTTGGTCTGCAGGAGCTATTTGTGCAGGTGTCTGATTGGTTCGCTTGAAGAT
CCTTGATGAGAAGGGAAATACAAGCCTGAAGAGAACACAGAACTAATTTATATCTTTTTGATGTCCTTGTGTGTGACTAGACGAAAGAAGATAGCTCTCCAACGCTGCCT
TCGAATTCTTACGCTAACATCGAATTATCCGAAAGACTGGCCAGAGATTCAACAGTTTGTAAGATGTGCAACACAGAAACATTTAACAGTTGAAGGGGAACAGTACTTTC
TTCATTCTTTGGACTTCACATGATCTTACTGATTCATTTTTATGTGTGATTTTTTCGTTCATTAGCACATCGGAGATATGTTAATTCTTTCAGGCGAGTTGGACCTTGAG
AAATGTAGATTTAAAGGCACAGTTCTATGAAAAGGGCACCTATTAAAC
Protein sequence
MRSKGSNARLSSMPIRSRISTLFLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKI
PLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENRQYDDFIIHNDHVESPEELSKKGKLFFAYAVDKWDAEFYAKVNDDV
YINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSLLRAYAHDDVTTGSWFIGLDVTYIDEGKFCCSSW
SAGAICAGV
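
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The sketch below is an illustration only: it assumes the standard nuclear genetic code, the `translate` helper is hypothetical, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 12 nt of the CDS sequence shown above.
print(translate("ATGCGGAGTAAGGGATCGAATGCTCGCCTG"))
print(translate("GCAGGTGTCTGA"))
```

The two fragments should agree with the start (`MRSKGSNARL`) and end (`AGV`, followed by the stop codon) of the protein sequence; passing the full CDS to `translate` would allow the same comparison over the whole protein.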