CuGenDBv2

HG10022113 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10022113
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: 2S albumin
Genome location: Chr05:21013345..21013779
RNA-Seq Expression: HG10022113
Synteny: HG10022113
Gene Ontology terms: GO:0043086 - negative regulation of catalytic activity (biological process)
GO:0000322 - storage vacuole (cellular component)
GO:0033095 - aleurone grain (cellular component)
GO:0015066 - alpha-amylase inhibitor activity (molecular function)
GO:0045735 - nutrient reservoir activity (molecular function)
InterPro domains: IPR000617 - Napin/ Bra allergen
IPR016140 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain
IPR036312 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain superfamily
IPR044723 - AAI/SS protein, conserved domain


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025475.1 2S albumin precursor [Cucumis melo var. makuwa]; e value: 2.4e-45; % identity: 74.64
Query:  IRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLE
        + +IIALLA ALV+ADAHRT I TVEV+EEN  RG  +RCRQMRA+EEIGSC EYL QQSRYVL+MRGI+NQRRRGG  F+ECCREL NV+EECRCELL+
Subjt:  IRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLE

Query:  EIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
        EIA  E+RK  GQE  Q+ Q ARNLPSMCG RPQQCYF
Subjt:  EIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

XP_011650534.1 2S albumin [Cucumis sativus]; e value: 1.1e-45; % identity: 72.46
Query:  IRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLE
        + +IIALLA ALV+ADAHRT I TVEV+EEN GR   +RCRQMRA+EEIGSC +YL QQSRYVL+MRGI+NQRRRGG LF+ECC EL NV+EECRCELL+
Subjt:  IRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLE

Query:  EIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
        EIA  E+R+  GQE  Q+ Q A+NLPSMCG+RPQQCYF
Subjt:  EIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

XP_022993226.1 2S albumin [Cucurbita maxima]; e value: 1.9e-39; % identity: 67.86
Query:  IRTIIALLAAALVVAD--AHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCEL
        + +IIAL A AL+VAD  A+RT I TVEVEE  QGR  ++RCRQM A+EE+ SC++YLRQQSR VL+MRGIEN  RR G  F+ECCRELRNVDEECRC++
Subjt:  IRTIIALLAAALVVAD--AHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCEL

Query:  LEEIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
        LEEIA  E+R+  GQE RQ+ Q+ARNLPSMCGIRPQ+C F
Subjt:  LEEIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

XP_038889520.1 2S albumin-like [Benincasa hispida]; e value: 3.8e-51; % identity: 77.4
Query:  MGRSVVIRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSR--YVLEMRGIENQRRRGGDLFNECCRELRNVDE
        M +   I  I AL    LVVADAHRTII T+EVEEE   + Q +RCRQMRA+EEIG C EYL QQSR  YVLEMRGIENQRRRGG+L NECC ELRNVDE
Subjt:  MGRSVVIRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSR--YVLEMRGIENQRRRGGDLFNECCRELRNVDE

Query:  ECRCELLEEIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
        ECRCELLEEI  MERRKGGGQEERQ+FQRARNLPSMCGIRPQQCYF
Subjt:  ECRCELLEEIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

XP_038904176.1 2S albumin-like [Benincasa hispida]; e value: 3.2e-42; % identity: 74.07
Query:  IIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLEEIA
        I A LA AL+VADAHRT I TVEV+E+NQGR   +RCRQMRA+EEIGSC +YL QQSRY L+MRGI+NQRRR G  F ECC ELRNVDEECRCELLEEIA
Subjt:  IIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLEEIA

Query:  GMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
          E+R+  GQ   Q+ QRARNLPSMCGIRPQQCYF
Subjt:  GMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L7Q2 AAI domain-containing protein; e value: 5.2e-46; % identity: 72.46
Query:  IRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLE
        + +IIALLA ALV+ADAHRT I TVEV+EEN GR   +RCRQMRA+EEIGSC +YL QQSRYVL+MRGI+NQRRRGG LF+ECC EL NV+EECRCELL+
Subjt:  IRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLE

Query:  EIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
        EIA  E+R+  GQE  Q+ Q A+NLPSMCG+RPQQCYF
Subjt:  EIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

A0A5A7SLZ0 2S albumin; e value: 1.2e-45; % identity: 74.64
Query:  IRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLE
        + +IIALLA ALV+ADAHRT I TVEV+EEN  RG  +RCRQMRA+EEIGSC EYL QQSRYVL+MRGI+NQRRRGG  F+ECCREL NV+EECRCELL+
Subjt:  IRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLE

Query:  EIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
        EIA  E+RK  GQE  Q+ Q ARNLPSMCG RPQQCYF
Subjt:  EIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

A0A6J1EW73 2S albumin-like; e value: 3.6e-39; % identity: 68.38
Query:  IIALLAAALVVADAHR-TIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLEEI
        IIAL+  A++V+DA+    I TVEV ++N+ R   +RC QMRA EEIGSC +YL QQSR VL+MRGIENQRRR G +F+ECCRELRNVDE+CRCELLE+I
Subjt:  IIALLAAALVVADAHR-TIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLEEI

Query:  AGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
        A  E RKG  QE RQ+ QRARNLPSMCG+RPQQCYF
Subjt:  AGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

A0A6J1FV75 2S albumin; e value: 6.1e-39; % identity: 67.14
Query:  IRTIIALLAAALVVAD--AHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCEL
        + +IIAL A AL+VAD  A+RT I TVEVEE    RG ++RCRQM A+EE+ SC++YLRQQSR VL+MRGIEN  RR G  F+ECCRELRNVDEECRC++
Subjt:  IRTIIALLAAALVVAD--AHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCEL

Query:  LEEIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
        LEEIA  E+R+  GQE RQ+ Q+ARNLPSMCGIRPQ+C F
Subjt:  LEEIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

A0A6J1JVR2 2S albumin; e value: 9.4e-40; % identity: 67.86
Query:  IRTIIALLAAALVVAD--AHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCEL
        + +IIAL A AL+VAD  A+RT I TVEVEE  QGR  ++RCRQM A+EE+ SC++YLRQQSR VL+MRGIEN  RR G  F+ECCRELRNVDEECRC++
Subjt:  IRTIIALLAAALVVAD--AHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCEL

Query:  LEEIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
        LEEIA  E+R+  GQE RQ+ Q+ARNLPSMCGIRPQ+C F
Subjt:  LEEIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

SwissProt top hits (e value, % identity, alignment)
P04403 2S sulfur-rich seed storage protein 1; e value: 2.0e-10; % identity: 35.66
Query:  IALLAAALVV------ADAHRTIIRTVEVEEENQGRGQQQRCR-QMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGD-LFNECCRELRNVDEECRC
        I++ AAAL+V      A A R  + T  VEEEN     Q+ CR QM+ Q+ +  C+ Y+RQQ    +E    +   RRG +   +ECC +L  +DE CRC
Subjt:  IALLAAALVV------ADAHRTIIRTVEVEEENQGRGQQQRCR-QMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGD-LFNECCRELRNVDEECRC

Query:  ELLEEI---AGMERRKGGGQEERQLFQRARNLPSMCGIRPQQC
        E L  +      E  +  G++ R++ + A N+PS C + P +C
Subjt:  ELLEEI---AGMERRKGGGQEERQLFQRARNLPSMCGIRPQQC

P0C8Y8 2S sulfur-rich seed storage protein 2; e value: 6.8e-11; % identity: 37.41
Query:  SVVIRTIIALLAAALVVADAHRTIIRTV---EVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQR--RRGGD-LFNECCRELRNVD
        SVV   ++ALL   L  A A RT + T    E EE  +GR +QQ   QM  Q+++  C+ YLRQQ    +E    +N R  RRG +   +ECC +L  +D
Subjt:  SVVIRTIIALLAAALVVADAHRTIIRTV---EVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQR--RRGGD-LFNECCRELRNVD

Query:  EECRCELLEEIAGMERRKG--GGQEERQLFQRARNLPSMCGIRPQQC
        E CRCE L  +   +R +    G++ +++ ++A NL S C + PQ+C
Subjt:  EECRCELLEEIAGMERRKG--GGQEERQLFQRARNLPSMCGIRPQQC

P15461 2S seed storage protein; e value: 9.8e-10; % identity: 35.25
Query:  GRGQQQRCRQMRAQEEIGSCQEYLRQQ-------SRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLEEIA---------GMERRKG--GGQE
        G G QQ CR+   Q  +G CQ ++ QQ       +R   +  G + Q++RG  L  +CC EL+NV  EC CE ++E+A           ++R+G  GGQE
Subjt:  GRGQQQRCRQMRAQEEIGSCQEYLRQQ-------SRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLEEIA---------GMERRKG--GGQE

Query:  ERQLFQRARNLPSMCGIRPQQC
             +  +NLP+ C +  QQC
Subjt:  ERQLFQRARNLPSMCGIRPQQC

P93198 2S albumin seed storage protein (Fragment); e value: 1.9e-13; % identity: 38.69
Query:  ALLAAALVVAD--AHRTIIRTVEVEEE-NQGRGQQQRCR-QMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLEE
        ALL A L VA+  A RT I T+E++E+ +  R + + CR Q++ Q+ +  CQ YLRQQSR      G +   +R    F +CC++L  +DE+C+CE L +
Subjt:  ALLAAALVVAD--AHRTIIRTVEVEEE-NQGRGQQQRCR-QMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLEE

Query:  IAGMERRKGG--GQEERQLFQRARNLPSMCGIRPQQC
        +   ++++ G  G+E  ++ Q AR+LP+ CGI  Q+C
Subjt:  IAGMERRKGG--GQEERQLFQRARNLPSMCGIRPQQC

Q39649 2S albumin; e value: 4.4e-42; % identity: 67.14
Query:  IRTIIALLAAALVVAD--AHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCEL
        + +IIAL A AL+VAD  A+RT I TVEVEE  QGR  ++RCRQM A+EE+ SC++YLRQQSR VL+MRGIEN  RR G  F+ECCREL+NVDEECRC++
Subjt:  IRTIIALLAAALVVAD--AHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCEL

Query:  LEEIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
        LEEIA  E+R+  GQE RQ+ Q+ARNLPSMCGIRPQ+C F
Subjt:  LEEIAGMERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF

Arabidopsis top hits (e value, % identity, alignment)
AT4G27140.1 seed storage albumin 1; e value: 5.2e-06; % identity: 26.95
Query:  IIALLAAALVVADA--HRTIIRTVEVEEENQGRGQQQRCR-QMRAQEEIGSCQEYLRQQSR--------YVLEMRGIENQRRRGGDLFNECCRELRNVDE
        + A LA   ++ +A  +RT++   E +  N    + ++CR + + ++ + +CQ+ + QQ+R        +  +M   + Q++    LF +CC ELR  + 
Subjt:  IIALLAAALVVADA--HRTIIRTVEVEEENQGRGQQQRCR-QMRAQEEIGSCQEYLRQQSR--------YVLEMRGIENQRRRGGDLFNECCRELRNVDE

Query:  ECRCELLEEIAGMERRKGGGQ--EERQLFQRARNLPSMCGI
        +C C  L++ A   R +G  Q  + R+++Q A++LP++C I
Subjt:  ECRCELLEEIAGMERRKGGGQ--EERQLFQRARNLPSMCGI

AT5G54740.1 seed storage albumin 5; e value: 3.3e-05; % identity: 30.67
Query:  RTIIALLAAALVVADAHRTIIRT-VEVEEENQ-GRGQQQRC-RQMRAQEEIGSCQEYLR---QQSRYVLEMRGIE---------NQRRRG---GDLFNEC
        + I+     AL +  A+ +I RT VE EE++     QQ +C R+    +++  C++++R   QQ R   E    E         ++   G         C
Subjt:  RTIIALLAAALVVADAHRTIIRT-VEVEEENQ-GRGQQQRC-RQMRAQEEIGSCQEYLR---QQSRYVLEMRGIE---------NQRRRG---GDLFNEC

Query:  CRELRNVDEECRCELLEEIAGMERRKG--GGQEERQLFQRARNLPSMCGI
        C ELR VD+ C C  L++ A   R +G  G Q+ + +FQ A+NLP++C I
Subjt:  CRELRNVDEECRCELLEEIAGMERRKG--GGQEERQLFQRARNLPSMCGI


Sequences
CDS sequence
ATGGGAAGATCGGTAGTGATCAGAACTATCATTGCTCTGTTGGCAGCGGCCTTGGTGGTTGCAGATGCCCACCGCACCATCATCAGGACGGTGGAAGTGGAGGAAGAGAA
CCAAGGGCGGGGGCAGCAGCAGAGGTGCCGGCAAATGAGGGCCCAGGAGGAGATAGGGAGCTGCCAAGAGTACCTGAGGCAGCAGAGCAGATATGTTTTGGAGATGCGTG
GAATTGAAAACCAGAGGAGGAGAGGAGGGGATTTGTTCAATGAGTGTTGCCGTGAACTGAGGAATGTGGATGAGGAATGTAGGTGTGAACTGTTGGAAGAGATTGCTGGG
ATGGAGCGGAGGAAAGGAGGAGGGCAAGAAGAGAGGCAATTGTTTCAGAGAGCTAGAAACTTGCCATCCATGTGTGGAATCCGCCCACAGCAATGCTACTTCTAA
mRNA sequence
ATGGGAAGATCGGTAGTGATCAGAACTATCATTGCTCTGTTGGCAGCGGCCTTGGTGGTTGCAGATGCCCACCGCACCATCATCAGGACGGTGGAAGTGGAGGAAGAGAA
CCAAGGGCGGGGGCAGCAGCAGAGGTGCCGGCAAATGAGGGCCCAGGAGGAGATAGGGAGCTGCCAAGAGTACCTGAGGCAGCAGAGCAGATATGTTTTGGAGATGCGTG
GAATTGAAAACCAGAGGAGGAGAGGAGGGGATTTGTTCAATGAGTGTTGCCGTGAACTGAGGAATGTGGATGAGGAATGTAGGTGTGAACTGTTGGAAGAGATTGCTGGG
ATGGAGCGGAGGAAAGGAGGAGGGCAAGAAGAGAGGCAATTGTTTCAGAGAGCTAGAAACTTGCCATCCATGTGTGGAATCCGCCCACAGCAATGCTACTTCTAA
Protein sequence
MGRSVVIRTIIALLAAALVVADAHRTIIRTVEVEEENQGRGQQQRCRQMRAQEEIGSCQEYLRQQSRYVLEMRGIENQRRRGGDLFNECCRELRNVDEECRCELLEEIAG
MERRKGGGQEERQLFQRARNLPSMCGIRPQQCYF
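
The record's fields can be cross-checked against each other. The following is a minimal Python sketch, not part of CuGenDB: the helper names `span_length` and `translate` are hypothetical, and it assumes the standard genetic code and an inclusive `start..end` coordinate convention. It computes the length of the genome location span and translates a CDS up to its first stop codon, so the result can be compared with the listed protein sequence.

```python
import re

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(location: str) -> int:
    """Length in bp of a 'Chr:start..end' span (inclusive coordinates assumed)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(span_length("Chr05:21013345..21013779"))
    # First ten codons of the CDS listed above.
    print(translate("ATGGGAAGATCGGTAGTGATCAGAACTATC"))
```

Passing the full CDS (with its line breaks removed) to `translate` and comparing the output with the protein sequence is a quick consistency check on a record like this one.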