; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024095 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024095
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCystatin domain-containing protein
Genome locationchr10:492734..494014
RNA-Seq ExpressionLag0024095
SyntenyLag0024095
Gene Ontology termsGO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain
IPR006525 - Cystatin-related, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
THG04121.1 hypothetical protein TEA_024172 [Camellia sinensis var. sinensis]9.3e-1741.8Show/hide
Query:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  FP     G I PI+++      +    ELA+K+YNE+  T LE  + VK N     G  YYITF  K      + P
Subjt:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP

Query:  TTTFQARVFDGMESVEVDFCRL
          TFQA V+DG++ +EV  CRL
Subjt:  TTTFQARVFDGMESVEVDFCRL

THG21948.1 hypothetical protein TEA_004640 [Camellia sinensis var. sinensis]1.9e-1742.06Show/hide
Query:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  FP     G I PI+++      +    ELA+K+YNE+  T LE  + VK N     G  YYITF  K      + P
Subjt:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP

Query:  TTTFQARVFDGMESVEVDFCRLAPSN
          TFQA V+DG++ +EV  CRL  SN
Subjt:  TTTFQARVFDGMESVEVDFCRLAPSN

XP_022137488.1 uncharacterized protein LOC111008920 isoform X1 [Momordica charantia]3.2e-1738.57Show/hide
Query:  FDESFVCSDSDDEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITF
        +D      D   EM +EE  +Y +AI +SEGFDVP FP       I P+  I    EE+      A+K YN +NG S E  + +K N   A G  +++TF
Subjt:  FDESFVCSDSDDEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITF

Query:  KAKLSGTPSNHPTTTFQARVFDGM--ESVEVDFCRLAPSN
        + K +G P + PTTT QARV  G+  +  +V  CR  P+N
Subjt:  KAKLSGTPSNHPTTTFQARVFDGM--ESVEVDFCRLAPSN

XP_028053036.1 uncharacterized protein LOC114257478 [Camellia sinensis]1.9e-1742.06Show/hide
Query:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  FP     G I PI+++      +    ELA+K+YNE+  T LE  + VK N     G  YYITF  K      + P
Subjt:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP

Query:  TTTFQARVFDGMESVEVDFCRLAPSN
          TFQA V+DG++ +EV  CRL  SN
Subjt:  TTTFQARVFDGMESVEVDFCRLAPSN

XP_028091213.1 uncharacterized protein LOC114291567 [Camellia sinensis]1.9e-1742.06Show/hide
Query:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  FP     G I PI+++      +    ELA+K+YNE+  T LE  + VK N     G  YYITF  K      + P
Subjt:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP

Query:  TTTFQARVFDGMESVEVDFCRLAPSN
          TFQA V+DG++ +EV  CRL  SN
Subjt:  TTTFQARVFDGMESVEVDFCRLAPSN

TrEMBL top hitse value%identityAlignment
A0A4S4DMB7 Cystatin domain-containing protein4.5e-1741.8Show/hide
Query:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  FP     G I PI+++      +    ELA+K+YNE+  T LE  + VK N     G  YYITF  K      + P
Subjt:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP

Query:  TTTFQARVFDGMESVEVDFCRL
          TFQA V+DG++ +EV  CRL
Subjt:  TTTFQARVFDGMESVEVDFCRL

A0A4S4EJD4 Cystatin domain-containing protein5.9e-1738.04Show/hide
Query:  MASSSKFEKLDKDSDA--------FDESFVCSDSD--DEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQN
        + SS++ EK  +  D         +D S V  DSD   E TD E+++YN  I +S+GFDV  FP     G   P+ ++    +++    ELA+K+YNE+ 
Subjt:  MASSSKFEKLDKDSDA--------FDESFVCSDSD--DEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQN

Query:  GTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHPTTTFQARVFDGMESVEVDFCRLAPS
         T+LE  + VK N AV AG  YYITF  + +    +  TT FQA V+DG+   EV   RL  S
Subjt:  GTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHPTTTFQARVFDGMESVEVDFCRLAPS

A0A4S4EY14 Cystatin domain-containing protein9.1e-1842.06Show/hide
Query:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  FP     G I PI+++      +    ELA+K+YNE+  T LE  + VK N     G  YYITF  K      + P
Subjt:  EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHP

Query:  TTTFQARVFDGMESVEVDFCRLAPSN
          TFQA V+DG++ +EV  CRL  SN
Subjt:  TTTFQARVFDGMESVEVDFCRLAPSN

A0A6J1C8H7 uncharacterized protein LOC1110089523.8e-1637.72Show/hide
Query:  MASSSKFEKLDKDSDAFDESFVCSDSDD--------EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAG--ELAMKQYNEQN
        MASS+  +  D D D  DE    +  DD        EMT EE  +Y  A+ K+EGFD+P FP+    G I+ I   +   EE+      + A+  +N+QN
Subjt:  MASSSKFEKLDKDSDAFDESFVCSDSDD--------EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAG--ELAMKQYNEQN

Query:  GTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHPTTTFQARVFDGME--SVEVDFCRLAPSNN
        GTS E  + VK       G  YY+TF+ K  G+P N PT T QARV  G+     +V+ CR  PS +
Subjt:  GTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHPTTTFQARVFDGME--SVEVDFCRLAPSNN

A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X11.5e-1738.57Show/hide
Query:  FDESFVCSDSDDEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITF
        +D      D   EM +EE  +Y +AI +SEGFDVP FP       I P+  I    EE+      A+K YN +NG S E  + +K N   A G  +++TF
Subjt:  FDESFVCSDSDDEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEITEYVKVNFAVAAGTHYYITF

Query:  KAKLSGTPSNHPTTTFQARVFDGM--ESVEVDFCRLAPSN
        + K +G P + PTTT QARV  G+  +  +V  CR  P+N
Subjt:  KAKLSGTPSNHPTTTFQARVFDGM--ESVEVDFCRLAPSN

SwissProt top hitse value%identityAlignment
Q9SV54 UPF0725 protein At4g289205.9e-0634.65Show/hide
Query:  SDSDDEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQ--PIMSIKPWKEEIDRAGELAMKQYN-EQNGTSLE---ITEYVKVNFAVAAGTHYYITFK
        S+SD EM  EE + Y   + +S+GFDV YF       GI+  P+     +  +I+  G L +  YN    GT+L+   I +Y   N  V++G +YYIT +
Subjt:  SDSDDEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQ--PIMSIKPWKEEIDRAGELAMKQYN-EQNGTSLE---ITEYVKVNFAVAAGTHYYITFK

Query:  AKLSGTPSNHPTTTFQARVFDGMESVE
        A    T +N P  TFQ  V +  ++ E
Subjt:  AKLSGTPSNHPTTTFQARVFDGMESVE

Arabidopsis top hitse value%identityAlignment
AT4G28920.1 Protein of unknown function (DUF626)4.2e-0734.65Show/hide
Query:  SDSDDEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQ--PIMSIKPWKEEIDRAGELAMKQYN-EQNGTSLE---ITEYVKVNFAVAAGTHYYITFK
        S+SD EM  EE + Y   + +S+GFDV YF       GI+  P+     +  +I+  G L +  YN    GT+L+   I +Y   N  V++G +YYIT +
Subjt:  SDSDDEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQ--PIMSIKPWKEEIDRAGELAMKQYN-EQNGTSLE---ITEYVKVNFAVAAGTHYYITFK

Query:  AKLSGTPSNHPTTTFQARVFDGMESVE
        A    T +N P  TFQ  V +  ++ E
Subjt:  AKLSGTPSNHPTTTFQARVFDGMESVE

AT5G05040.1 Cystatin/monellin superfamily protein3.5e-0629.7Show/hide
Query:  QMASSSKFEKLDKDSDAFDES-FVCSDSDD-EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILG-GIQPI----MSIKPW---KEEIDRAGELAMKQYNEQ
        ++ S  K + +++ SD  DES F   DSDD E   ++ R  +E  GKS GFDV +       G G   +    M  +P    ++ ++R   +A+  YN++
Subjt:  QMASSSKFEKLDKDSDAFDES-FVCSDSDD-EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILG-GIQPI----MSIKPW---KEEIDRAGELAMKQYNEQ

Query:  NGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHPTTTFQARVFDGMESVEVDFCRLAPSN
          +SLE+ + ++ NF  +A    YITF+A  +     + T  +QA V      ++V  C+  PS+
Subjt:  NGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHPTTTFQARVFDGMESVEVDFCRLAPSN

AT5G05060.1 Cystatin/monellin superfamily protein3.5e-0630.12Show/hide
Query:  QMASSSKFEKLDKDSDAFDES-FVCSDSDD-EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILG-GIQPI----MSIKPW---KEEIDRAGELAMKQYNEQ
        ++ S  K + +++ SD  DES F   DSDD E   ++ R  +E  GKS GFDV +       G G   +    M  +P    ++ ++R   +A+  YN++
Subjt:  QMASSSKFEKLDKDSDAFDES-FVCSDSDD-EMTDEEFRQYNEAIGKSEGFDVPYFPNYDILG-GIQPI----MSIKPW---KEEIDRAGELAMKQYNEQ

Query:  NGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHPTTTFQARVFDGMESVEVDFCRLAPSNN
         G  LE+ + ++ NF  +A    YITF+A  +     + T  +QA V      +EV  C   PS++
Subjt:  NGTSLEITEYVKVNFAVAAGTHYYITFKAKLSGTPSNHPTTTFQARVFDGMESVEVDFCRLAPSNN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCACTCCGCGGGCGATCGGAACTTTATCGGCAGATGGCTTCATCATCAAAGTTCGAAAAATTGGACAAGGATTCCGATGCCTTTGATGAATCTTTTGTTTGTAG
TGATAGTGATGATGAGATGACTGATGAAGAGTTTCGACAATATAACGAAGCAATTGGAAAGAGCGAGGGTTTTGATGTTCCATACTTTCCAAATTATGATATATTAGGTG
GAATTCAGCCTATAATGTCTATAAAACCATGGAAGGAAGAAATCGATCGAGCTGGAGAGTTAGCCATGAAACAGTACAATGAGCAAAATGGTACAAGCTTGGAGATTACA
GAATACGTAAAGGTAAATTTTGCGGTAGCGGCTGGTACGCATTATTATATTACTTTCAAGGCAAAGCTGAGTGGAACTCCTTCCAACCATCCAACCACGACATTTCAAGC
TCGGGTGTTCGATGGTATGGAATCTGTTGAAGTAGACTTTTGTAGGCTAGCGCCCTCCAACAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCCACTCCGCGGGCGATCGGAACTTTATCGGCAGATGGCTTCATCATCAAAGTTCGAAAAATTGGACAAGGATTCCGATGCCTTTGATGAATCTTTTGTTTGTAG
TGATAGTGATGATGAGATGACTGATGAAGAGTTTCGACAATATAACGAAGCAATTGGAAAGAGCGAGGGTTTTGATGTTCCATACTTTCCAAATTATGATATATTAGGTG
GAATTCAGCCTATAATGTCTATAAAACCATGGAAGGAAGAAATCGATCGAGCTGGAGAGTTAGCCATGAAACAGTACAATGAGCAAAATGGTACAAGCTTGGAGATTACA
GAATACGTAAAGGTAAATTTTGCGGTAGCGGCTGGTACGCATTATTATATTACTTTCAAGGCAAAGCTGAGTGGAACTCCTTCCAACCATCCAACCACGACATTTCAAGC
TCGGGTGTTCGATGGTATGGAATCTGTTGAAGTAGACTTTTGTAGGCTAGCGCCCTCCAACAACTGA
Protein sequenceShow/hide protein sequence
MDPLRGRSELYRQMASSSKFEKLDKDSDAFDESFVCSDSDDEMTDEEFRQYNEAIGKSEGFDVPYFPNYDILGGIQPIMSIKPWKEEIDRAGELAMKQYNEQNGTSLEIT
EYVKVNFAVAAGTHYYITFKAKLSGTPSNHPTTTFQARVFDGMESVEVDFCRLAPSNN