; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028650 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028650
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSASA domain-containing protein
Genome locationchr8:27479278..27481873
RNA-Seq ExpressionLag0028650
SyntenyLag0028650
Gene Ontology termsNA
InterPro domainsIPR005181 - Sialate O-acetylesterase domain
IPR036514 - SGNH hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016422.1 putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. argyrosperma]5.3e-3157.26Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WDGYIP +SQ   SI RF+  + WE A EPLHW+ID  KTNGVG GM FANELL +   S G IGLVPCAIGG+HLREW+KGT
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKICCFQNYYLNHKIGNAKGF
          YTK+         H  G  KGF
Subjt:  INYTKICCFQNYYLNHKIGNAKGF

XP_022134349.1 probable carbohydrate esterase At4g34215 [Momordica charantia]3.0e-4274.77Show/hide
Query:  MASRGGVRCDKFMAH-CKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKG
        MA RGGV CD    H CKWDGYIP +SQS+TSI+RF+ G +WELAHEPLHW+ID KKTNGVG GM FANELL + NKS G+IGLVPCAIGGT+LREW+KG
Subjt:  MASRGGVRCDKFMAH-CKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKG

Query:  TINYTKI
        TINYTK+
Subjt:  TINYTKI

XP_022141681.1 probable carbohydrate esterase At4g34215 [Momordica charantia]1.9e-3366.04Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WDGYIP +SQS  SILR +  LRWE A EPLHW+ID  KTNG+G GM FANEL  +  KS G+IGLVPCAIGGTHLREWIKGT
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKI
          YTK+
Subjt:  INYTKI

XP_022993914.1 probable carbohydrate esterase At4g34215 [Cucurbita maxima]5.3e-3157.26Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WDGYIP +SQ   SI RF+  + WE A EPLHW+ID  KTNGVG GM FANELL +   S G IGLVPCAIGG+HLREW+KGT
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKICCFQNYYLNHKIGNAKGF
          YTK+         H  G  KGF
Subjt:  INYTKICCFQNYYLNHKIGNAKGF

XP_023550941.1 probable carbohydrate esterase At4g34215 [Cucurbita pepo subsp. pepo]6.2e-3258.06Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WDGYIP +SQ   SI RF+  + WE AHEPLHW+ID  KTNGVG GM FANELL +   S G IGLVPCAIGG+HLREW+KGT
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKICCFQNYYLNHKIGNAKGF
          YTK+         H  G  KGF
Subjt:  INYTKICCFQNYYLNHKIGNAKGF

TrEMBL top hitse value%identityAlignment
A0A6J1BYJ2 probable carbohydrate esterase At4g342151.4e-4274.77Show/hide
Query:  MASRGGVRCDKFMAH-CKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKG
        MA RGGV CD    H CKWDGYIP +SQS+TSI+RF+ G +WELAHEPLHW+ID KKTNGVG GM FANELL + NKS G+IGLVPCAIGGT+LREW+KG
Subjt:  MASRGGVRCDKFMAH-CKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKG

Query:  TINYTKI
        TINYTK+
Subjt:  TINYTKI

A0A6J1CJZ1 probable carbohydrate esterase At4g342159.3e-3466.04Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WDGYIP +SQS  SILR +  LRWE A EPLHW+ID  KTNG+G GM FANEL  +  KS G+IGLVPCAIGGTHLREWIKGT
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKI
          YTK+
Subjt:  INYTKI

A0A6J1CKF9 probable carbohydrate esterase At4g342154.8e-3058.49Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WDG +P + Q   SILRFS    WE A EPLHW+ID  KTNG+G GM FA+E+L +    +G+IGLVPCAIGGTHLREW+KGT
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKI
         NYT++
Subjt:  INYTKI

A0A6J1FFF9 probable carbohydrate esterase At4g342152.5e-3157.26Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WDGYIP +SQ   SI RF+  + WE A EPLHW+ID  KTNGVG GM FANELL +   S G IGLVPCAIGG+HLREW+KGT
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKICCFQNYYLNHKIGNAKGF
          YTK+         H  G  KGF
Subjt:  INYTKICCFQNYYLNHKIGNAKGF

A0A6J1K1G7 probable carbohydrate esterase At4g342152.5e-3157.26Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WDGYIP +SQ   SI RF+  + WE A EPLHW+ID  KTNGVG GM FANELL +   S G IGLVPCAIGG+HLREW+KGT
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKICCFQNYYLNHKIGNAKGF
          YTK+         H  G  KGF
Subjt:  INYTKICCFQNYYLNHKIGNAKGF

SwissProt top hitse value%identityAlignment
Q8L9J9 Probable carbohydrate esterase At4g342154.6e-2247.17Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WD  +P +    +SILR S  LRWE AHEPLH +ID  K  GVG GM FAN +       + +IGLVPCA GGT ++EW +G+
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKI
          Y ++
Subjt:  INYTKI

Arabidopsis top hitse value%identityAlignment
AT3G53010.1 Domain of unknown function (DUF303)2.8e-2250.49Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WDG IP + +S  SILR +  L W+ A EPLH +ID  KTNGVG GM FAN ++       G +GLVPC+IGGT L +W KG 
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INY
          Y
Subjt:  INY

AT4G34215.1 Domain of unknown function (DUF303)3.3e-2347.17Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WD  +P +    +SILR S  LRWE AHEPLH +ID  K  GVG GM FAN +       + +IGLVPCA GGT ++EW +G+
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKI
          Y ++
Subjt:  INYTKI

AT4G34215.2 Domain of unknown function (DUF303)3.3e-2347.17Show/hide
Query:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT
        MA RGGV  D       WD  +P +    +SILR S  LRWE AHEPLH +ID  K  GVG GM FAN +       + +IGLVPCA GGT ++EW +G+
Subjt:  MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGT

Query:  INYTKI
          Y ++
Subjt:  INYTKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAGTCGAGGTGGGGTCAGATGTGATAAATTCATGGCACATTGTAAATGGGATGGATATATCCCTTCCCAATCTCAATCCCAAACATCTATCCTTCGATTTTCGGT
TGGATTAAGATGGGAACTGGCCCATGAACCACTTCATTGGAATATTGATCCCAAAAAGACCAATGGAGTTGGTATTGGTATGACTTTCGCAAATGAGCTTCTGACCGAAA
CTAACAAGAGCACTGGAATTATTGGTCTTGTTCCCTGTGCAATCGGAGGAACCCACCTCAGAGAGTGGATTAAAGGGACCATTAATTACACAAAAATATGTTGTTTTCAA
AATTACTATTTAAATCACAAGATTGGAAACGCCAAAGGTTTCGTCTCGTCCATTGGATTATTGTTGGAGAAGCAGGTCACGCACGCTGCACGTGCCCCGCCGGAGTACGG
CGTTTGCGATTGGGAATACTCCATCAGCGGCAGAAAAGCCGGCGAGTTCCGGTCGGCCTCGAACGACTTCGTCACCGAAAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAGTCGAGGTGGGGTCAGATGTGATAAATTCATGGCACATTGTAAATGGGATGGATATATCCCTTCCCAATCTCAATCCCAAACATCTATCCTTCGATTTTCGGT
TGGATTAAGATGGGAACTGGCCCATGAACCACTTCATTGGAATATTGATCCCAAAAAGACCAATGGAGTTGGTATTGGTATGACTTTCGCAAATGAGCTTCTGACCGAAA
CTAACAAGAGCACTGGAATTATTGGTCTTGTTCCCTGTGCAATCGGAGGAACCCACCTCAGAGAGTGGATTAAAGGGACCATTAATTACACAAAAATATGTTGTTTTCAA
AATTACTATTTAAATCACAAGATTGGAAACGCCAAAGGTTTCGTCTCGTCCATTGGATTATTGTTGGAGAAGCAGGTCACGCACGCTGCACGTGCCCCGCCGGAGTACGG
CGTTTGCGATTGGGAATACTCCATCAGCGGCAGAAAAGCCGGCGAGTTCCGGTCGGCCTCGAACGACTTCGTCACCGAAAACTAA
Protein sequenceShow/hide protein sequence
MASRGGVRCDKFMAHCKWDGYIPSQSQSQTSILRFSVGLRWELAHEPLHWNIDPKKTNGVGIGMTFANELLTETNKSTGIIGLVPCAIGGTHLREWIKGTINYTKICCFQ
NYYLNHKIGNAKGFVSSIGLLLEKQVTHAARAPPEYGVCDWEYSISGRKAGEFRSASNDFVTEN