CuGenDBv2

HG10014165 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10014165
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: SASA domain-containing protein
Genome location: Chr02:8128888..8130679
RNA-Seq Expression: HG10014165
Synteny: HG10014165
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR005181 - Sialate O-acetylesterase domain
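
The genome location field packs a sequence name and a coordinate range into one string. The following is a minimal Python sketch for splitting it. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr02:8128888..8130679")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1792 bp on Chr02.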


Homology
GenBank top hits (e-value, % identity)
XP_004141392.1 probable carbohydrate esterase At4g34215 [Cucumis sativus] (e-value 3.4e-31, identity 83.72%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLY
        +VGIASGEG YKEGVRRGQFG+DLVNVM VDALGL LEPDGLHL T SQVRLGGLLADAYRRFPSHPLATPLTNAAP+  I+T  +
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLY

XP_008452605.1 PREDICTED: probable carbohydrate esterase At4g34215 [Cucumis melo] (e-value 8.3e-30, identity 75.7%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLF
        +VGIASGEG YKEGVRRGQFG+DLVNVM VDALGL LEPDGLHL TPSQVRLGGLLA AYRRFPSHPLATPLTNAAP+  I +  +LLSI  I TFL  F
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLF

Query:  LQLFLLL
        +  +  L
Subjt:  LQLFLLL

XP_022140685.1 probable carbohydrate esterase At4g34215 [Momordica charantia] (e-value 6.3e-30, identity 85.71%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAP
        +VGIASG+GPYKEGVRRGQFG++L NVM+VDALGL LEPDGLHLNTP+QV+LGGLLADAYRRFPSHPLA+PL NAAP
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAP

XP_022982273.1 probable carbohydrate esterase At4g34215 [Cucurbita maxima] (e-value 1.6e-25, identity 67.65%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVD--ALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPL-ATPLTNAAPLPIIATYLYLLSISTILTFL
        +VGIASGEGPYKEGVRRGQFG++++NVM+VD  ALGLS EPDGLHLNTPSQV+LGG+LADAYRRFP HPL A+PL NAA     + Y + +S+   +TF+
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVD--ALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPL-ATPLTNAAPLPIIATYLYLLSISTILTFL

Query:  LL
         L
Subjt:  LL

XP_038899610.1 probable carbohydrate esterase At4g34215 [Benincasa hispida] (e-value 2.2e-35, identity 80.19%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLF
        +VGIA+GEGPYKEGVRRGQFG+DLVNVM+VDA+GLSLEPDGLHL TPSQV+LGGLLADAYRRFPSHPLATPLTNAA +P+I+T  + LSIS ILT +LLF
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLF

Query:  LQLFLL
        L+L L+
Subjt:  LQLFLL

TrEMBL top hits (e-value, % identity)
A0A0A0L4Z7 SASA domain-containing protein (e-value 1.6e-31, identity 83.72%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLY
        +VGIASGEG YKEGVRRGQFG+DLVNVM VDALGL LEPDGLHL T SQVRLGGLLADAYRRFPSHPLATPLTNAAP+  I+T  +
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLY

A0A1S3BVE3 probable carbohydrate esterase At4g34215 (e-value 4.0e-30, identity 75.7%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLF
        +VGIASGEG YKEGVRRGQFG+DLVNVM VDALGL LEPDGLHL TPSQVRLGGLLA AYRRFPSHPLATPLTNAAP+  I +  +LLSI  I TFL  F
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLF

Query:  LQLFLLL
        +  +  L
Subjt:  LQLFLLL

A0A5A7VA07 Putative carbohydrate esterase (e-value 4.0e-30, identity 75.7%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLF
        +VGIASGEG YKEGVRRGQFG+DLVNVM VDALGL LEPDGLHL TPSQVRLGGLLA AYRRFPSHPLATPLTNAAP+  I +  +LLSI  I TFL  F
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLF

Query:  LQLFLLL
        +  +  L
Subjt:  LQLFLLL

A0A6J1CFT3 probable carbohydrate esterase At4g34215 (e-value 3.1e-30, identity 85.71%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAP
        +VGIASG+GPYKEGVRRGQFG++L NVM+VDALGL LEPDGLHLNTP+QV+LGGLLADAYRRFPSHPLA+PL NAAP
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAP

A0A6J1J4F1 probable carbohydrate esterase At4g34215 (e-value 7.8e-26, identity 67.65%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVD--ALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPL-ATPLTNAAPLPIIATYLYLLSISTILTFL
        +VGIASGEGPYKEGVRRGQFG++++NVM+VD  ALGLS EPDGLHLNTPSQV+LGG+LADAYRRFP HPL A+PL NAA     + Y + +S+   +TF+
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVD--ALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPL-ATPLTNAAPLPIIATYLYLLSISTILTFL

Query:  LL
         L
Subjt:  LL

SwissProt top hits (e-value, % identity)
Q8L9J9 Probable carbohydrate esterase At4g34215 (e-value 5.4e-08, identity 55%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY
        +V IASG G Y + VR  Q G+ L NV+ VDA GL L+ D LHL T +QV+LG  LA AY
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY
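
The reported % identity can be cross-checked against the middle line of an alignment. The Python sketch below assumes the usual BLAST-style convention, in which a letter on the middle line marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. It works from the middle line alone because the Subjt lines shown in this record repeat the query. The function name `percent_identity` is hypothetical.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose middle-line character is a letter."""
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Middle line of the Q8L9J9 alignment, copied from the record with its spacing intact.
# Assumption: the string covers every alignment column (no trimmed leading or trailing spaces).
MIDLINE = "+V IASG G Y + VR  Q G+ L NV+ VDA GL L+ D LHL T +QV+LG  LA AY"

print(len(MIDLINE), round(percent_identity(MIDLINE), 2))
```

For this hit the middle line has 33 letters over 60 columns, which agrees with the 55% identity listed.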

Arabidopsis top hits (e-value, % identity)
AT3G53010.1 Domain of unknown function (DUF303) (e-value 8.8e-14, identity 52.31%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPS
        +V +A+G GPY + VR+ Q   DL NV  VDA GL LEPDGLHL T SQV+LG ++A+++   P+
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPS

AT4G34215.1 Domain of unknown function (DUF303) (e-value 3.8e-09, identity 55%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY
        +V IASG G Y + VR  Q G+ L NV+ VDA GL L+ D LHL T +QV+LG  LA AY
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY

AT4G34215.2 Domain of unknown function (DUF303) (e-value 3.8e-09, identity 55%)
Query:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY
        +V IASG G Y + VR  Q G+ L NV+ VDA GL L+ D LHL T +QV+LG  LA AY
Subjt:  KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY


Sequences
CDS sequence
ATGATTGAGAAAAAGGTTGGTATTGCGTCAGGAGAAGGGCCGTATAAAGAAGGAGTAAGAAGGGGGCAATTTGGAATGGATTTAGTGAACGTGATGAGTGTGGACGCATT
GGGCCTTTCATTGGAACCAGATGGGCTTCACTTAAACACTCCTTCCCAAGTTCGACTGGGTGGGCTTTTAGCCGATGCGTATCGACGATTTCCATCTCACCCACTGGCTA
CCCCATTAACAAACGCTGCTCCATTGCCTATAATTGCAACTTACTTATACTTGCTTTCCATTTCTACGATTCTCACATTTCTGTTGCTATTTCTACAACTATTTCTCTTA
TTATGA
mRNA sequence
ATGATTGAGAAAAAGGTTGGTATTGCGTCAGGAGAAGGGCCGTATAAAGAAGGAGTAAGAAGGGGGCAATTTGGAATGGATTTAGTGAACGTGATGAGTGTGGACGCATT
GGGCCTTTCATTGGAACCAGATGGGCTTCACTTAAACACTCCTTCCCAAGTTCGACTGGGTGGGCTTTTAGCCGATGCGTATCGACGATTTCCATCTCACCCACTGGCTA
CCCCATTAACAAACGCTGCTCCATTGCCTATAATTGCAACTTACTTATACTTGCTTTCCATTTCTACGATTCTCACATTTCTGTTGCTATTTCTACAACTATTTCTCTTA
TTATGA
Protein sequence
MIEKKVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLFLQLFLL
L
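
The CDS and protein sequences above can be checked against each other by translating the CDS. The Python sketch below assumes the standard genetic code (NCBI translation table 1) and reading frame 1, neither of which the record states. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# CDS and protein sequences as listed in this record, with line breaks removed.
CDS = (
    "ATGATTGAGAAAAAGGTTGGTATTGCGTCAGGAGAAGGGCCGTATAAAGAAGGAGTAAGAAGGGGGCAATTTGGAATGGATTTAGTGAACGTGATGAGTGTGGACGCATT"
    "GGGCCTTTCATTGGAACCAGATGGGCTTCACTTAAACACTCCTTCCCAAGTTCGACTGGGTGGGCTTTTAGCCGATGCGTATCGACGATTTCCATCTCACCCACTGGCTA"
    "CCCCATTAACAAACGCTGCTCCATTGCCTATAATTGCAACTTACTTATACTTGCTTTCCATTTCTACGATTCTCACATTTCTGTTGCTATTTCTACAACTATTTCTCTTA"
    "TTATGA"
)
PROTEIN = (
    "MIEKKVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLFLQLFLL"
    "L"
)

print(translate(CDS) == PROTEIN)
```

The mRNA sequence listed in this record is identical to the CDS, so the same check applies to it.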