; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020733 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020733
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCystatin domain-containing protein
Genome locationChr05:1941960..1942619
RNA-Seq ExpressionHG10020733
SyntenyHG10020733
Gene Ontology termsGO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137488.1 uncharacterized protein LOC111008920 isoform X1 [Momordica charantia]5.9e-1841.54Show/hide
Query:  FFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAKPS
        ++D  VR++ +EE + YYKA  ESEGFDVP +P      +  P+  ++   EE+    G AI+ YN +N   FE V+  K N+ +  G L+F+TF+ K +
Subjt:  FFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAKPS

Query:  GTPAHYPTTTFQARVYAGIGKT--EVSFCR
        G P   PTTT QARV AGI     +V  CR
Subjt:  GTPAHYPTTTFQARVYAGIGKT--EVSFCR

XP_022975633.1 uncharacterized protein LOC111475320 [Cucurbita maxima]2.3e-1437.78Show/hide
Query:  FFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVP--ILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAK
        FF      +T +E  EY+ A  ES+GFDVPY+  +   G+  P  +       +E+      AI+ YN +N T FE+V+  K N     G +Y++TF  K
Subjt:  FFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVP--ILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAK

Query:  PSGTPAHYPTTTFQARVYAGI---GKTEVSFCRQK
        P GT A +P+ TFQA+VY  I      EV  CR K
Subjt:  PSGTPAHYPTTTFQARVYAGI---GKTEVSFCRQK

XP_023525925.1 uncharacterized protein LOC111789396 [Cucurbita pepo subsp. pepo]3.9e-1440.77Show/hide
Query:  FDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAKPSG
        ++  VR +  EE S YY+   +++GFDVP +P+    G+ VPI      K  L      AI  YN +N T FE V+  K N   V+G  Y++TF  K  G
Subjt:  FDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAKPSG

Query:  TPAHYPTTTFQARVYAGIGKT-EVSFCRQK
        T   +PTTTF+A+V  GI  T EV  CR K
Subjt:  TPAHYPTTTFQARVYAGIGKT-EVSFCRQK

XP_028071877.1 uncharacterized protein LOC114274199 [Camellia sinensis]7.9e-1538.62Show/hide
Query:  FDNYTDLELEQLALNFFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTAS
        F NY D ++E  +    DG   + TD EY EY +   ES+GFDV  +P     G T P+ N D   ++L  +  LA+++YNE+  T  E V+  KVN A 
Subjt:  FDNYTDLELEQLALNFFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTAS

Query:  VAGLLYFMTFKAKPSGTPAHYPTTTFQARVYAGIGKTEVSFCRQK
        VAG LY++T   + +       T  FQA V+ GI  TEV   R K
Subjt:  VAGLLYFMTFKAKPSGTPAHYPTTTFQARVYAGIGKTEVSFCRQK

XP_028071956.1 uncharacterized protein LOC114274267 [Camellia sinensis]1.4e-1437.24Show/hide
Query:  FDNYTDLELEQLALNFFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTAS
        F NY + ++E  +    DG + + TD EY EY +   ES+GFDV  +P     G T P+ NL+   ++L     LA+++YNE+  T  E V+  K N A 
Subjt:  FDNYTDLELEQLALNFFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTAS

Query:  VAGLLYFMTFKAKPSGTPAHYPTTTFQARVYAGIGKTEVSFCRQK
         AG  Y++TF  + +       TT FQA V+ GI  TEV   R K
Subjt:  VAGLLYFMTFKAKPSGTPAHYPTTTFQARVYAGIGKTEVSFCRQK

TrEMBL top hitse value%identityAlignment
A0A4S4E3W0 Cystatin domain-containing protein2.5e-1438.89Show/hide
Query:  VRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAKPSGTPAH
        + ++TD+EY EY +   +SEGFDV  +P     G  +PI++L+ R   L     LA+++Y E+ +TK E V+F K N     GL Y++TF  K +     
Subjt:  VRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAKPSGTPAH

Query:  YPTTTFQARVYAGIGKTEVSFCRQKL
         P  TFQA V+ GI + EV  CR K+
Subjt:  YPTTTFQARVYAGIGKTEVSFCRQKL

A0A4S4EJD4 Cystatin domain-containing protein6.5e-1537.24Show/hide
Query:  FDNYTDLELEQLALNFFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTAS
        F NY + ++E  +    DG + + TD EY EY +   ES+GFDV  +P     G T P+ NL+   ++L     LA+++YNE+  T  E V+  K N A 
Subjt:  FDNYTDLELEQLALNFFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTAS

Query:  VAGLLYFMTFKAKPSGTPAHYPTTTFQARVYAGIGKTEVSFCRQK
         AG  Y++TF  + +       TT FQA V+ GI  TEV   R K
Subjt:  VAGLLYFMTFKAKPSGTPAHYPTTTFQARVYAGIGKTEVSFCRQK

A0A6J1C8H7 uncharacterized protein LOC1110089522.5e-1438.64Show/hide
Query:  FFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYG--GLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAK
        +F  E R++T EE  EYY A  ++EGFD+P +P     G    I       EEL        A+  +N+QN T FE V+  K    +V G++Y++TF+ K
Subjt:  FFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYG--GLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAK

Query:  PSGTPAHYPTTTFQARVYAG--IGKTEVSFCR
          G+P + PT T QARV  G  IG  +V  CR
Subjt:  PSGTPAHYPTTTFQARVYAG--IGKTEVSFCR

A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X12.8e-1841.54Show/hide
Query:  FFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAKPS
        ++D  VR++ +EE + YYKA  ESEGFDVP +P      +  P+  ++   EE+    G AI+ YN +N   FE V+  K N+ +  G L+F+TF+ K +
Subjt:  FFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAKPS

Query:  GTPAHYPTTTFQARVYAGIGKT--EVSFCR
        G P   PTTT QARV AGI     +V  CR
Subjt:  GTPAHYPTTTFQARVYAGIGKT--EVSFCR

A0A6J1IJT3 uncharacterized protein LOC1114753201.1e-1437.78Show/hide
Query:  FFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVP--ILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAK
        FF      +T +E  EY+ A  ES+GFDVPY+  +   G+  P  +       +E+      AI+ YN +N T FE+V+  K N     G +Y++TF  K
Subjt:  FFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVP--ILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMTFKAK

Query:  PSGTPAHYPTTTFQARVYAGI---GKTEVSFCRQK
        P GT A +P+ TFQA+VY  I      EV  CR K
Subjt:  PSGTPAHYPTTTFQARVYAGI---GKTEVSFCRQK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37435.1 Cystatin/monellin superfamily protein1.2e-0527.94Show/hide
Query:  DGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPI--------LNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMT
        D E +   +EEY    K   +S+GFD+ +     V     P+        L  +  +E +D     +++ +NE + TK+E V F K N    AG++YF+T
Subjt:  DGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPI--------LNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLYFMT

Query:  FKAKPSGTPAHYPTTTFQARVYAGIGKTEVSFCRQK
        F+ K         +  FQA++    G  E+  C  K
Subjt:  FKAKPSGTPAHYPTTTFQARVYAGIGKTEVSFCRQK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCATGTTTTGATAACTACACCGACTTGGAGTTAGAACAACTCGCGTTGAATTTTTTTGATGGGGAAGTTCGTGATTTGACGGATGAGGAGTATTCAGAATACTA
TAAAGCGGAAGCAGAGAGCGAGGGTTTTGATGTTCCATACTATCCTCATATTGCTGTACCTGGGATGACTGTGCCTATACTAAATTTAGATAGGCGCAAGGAAGAGCTTG
ATCACTATGGAGGCTTGGCCATTCAACAGTATAATGAGCAGAATGAAACCAAGTTTGAGATTGTAGAATTTGAGAAGGTGAATACAGCATCAGTGGCGGGTTTGCTTTAT
TTCATGACGTTCAAGGCAAAGCCAAGTGGAACTCCTGCCCACTATCCAACCACAACATTTCAAGCACGAGTGTATGCTGGCATCGGAAAAACAGAGGTATCTTTTTGCAG
GCAAAAGCTCCTATCTGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCATGTTTTGATAACTACACCGACTTGGAGTTAGAACAACTCGCGTTGAATTTTTTTGATGGGGAAGTTCGTGATTTGACGGATGAGGAGTATTCAGAATACTA
TAAAGCGGAAGCAGAGAGCGAGGGTTTTGATGTTCCATACTATCCTCATATTGCTGTACCTGGGATGACTGTGCCTATACTAAATTTAGATAGGCGCAAGGAAGAGCTTG
ATCACTATGGAGGCTTGGCCATTCAACAGTATAATGAGCAGAATGAAACCAAGTTTGAGATTGTAGAATTTGAGAAGGTGAATACAGCATCAGTGGCGGGTTTGCTTTAT
TTCATGACGTTCAAGGCAAAGCCAAGTGGAACTCCTGCCCACTATCCAACCACAACATTTCAAGCACGAGTGTATGCTGGCATCGGAAAAACAGAGGTATCTTTTTGCAG
GCAAAAGCTCCTATCTGCCTGA
Protein sequenceShow/hide protein sequence
MASCFDNYTDLELEQLALNFFDGEVRDLTDEEYSEYYKAEAESEGFDVPYYPHIAVPGMTVPILNLDRRKEELDHYGGLAIQQYNEQNETKFEIVEFEKVNTASVAGLLY
FMTFKAKPSGTPAHYPTTTFQARVYAGIGKTEVSFCRQKLLSA