; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024097 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024097
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCystatin domain-containing protein
Genome locationchr10:496868..497502
RNA-Seq ExpressionLag0024097
SyntenyLag0024097
Gene Ontology termsGO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain
IPR006525 - Cystatin-related, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
THG04121.1 hypothetical protein TEA_024172 [Camellia sinensis var. sinensis]2.9e-1743.44Show/hide
Query:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  F      G I PI  +      +    ELA+K+YNE+  T LE  + VK NL    G  YYITF+ K      + P
Subjt:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP

Query:  TTTFQALVFDGIKSVEVEFCRL
          TFQALV+DGI  +EV  CRL
Subjt:  TTTFQALVFDGIKSVEVEFCRL

THG21948.1 hypothetical protein TEA_004640 [Camellia sinensis var. sinensis]3.5e-1843.65Show/hide
Query:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  F      G I PI  +      +    ELA+K+YNE+  T LE  + VK NL    G  YYITF+ K      + P
Subjt:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP

Query:  TTTFQALVFDGIKSVEVEFCRLAPSN
          TFQALV+DGI  +EV  CRL  SN
Subjt:  TTTFQALVFDGIKSVEVEFCRLAPSN

XP_022975633.1 uncharacterized protein LOC111475320 [Cucurbita maxima]2.9e-1741.21Show/hide
Query:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPIT---YIKPWKEAIDRAGELAMKQYNEQNGT
        MASSS       E++D D + F+      D    +T +E  +Y  A+ +SQGFDVPYF      G I P+    + + +KE    A E A+K YN +NGT
Subjt:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPIT---YIKPWKEAIDRAGELAMKQYNEQNGT

Query:  NLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHPTTTFQALVFDGI---KSVEVEFCRLAPSN
        N E+ + VK N  G  G  YYITFN K  GT +  P+ TFQA V+  I     +EVE CR  PSN
Subjt:  NLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHPTTTFQALVFDGI---KSVEVEFCRLAPSN

XP_028053036.1 uncharacterized protein LOC114257478 [Camellia sinensis]3.5e-1843.65Show/hide
Query:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  F      G I PI  +      +    ELA+K+YNE+  T LE  + VK NL    G  YYITF+ K      + P
Subjt:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP

Query:  TTTFQALVFDGIKSVEVEFCRLAPSN
          TFQALV+DGI  +EV  CRL  SN
Subjt:  TTTFQALVFDGIKSVEVEFCRLAPSN

XP_028091213.1 uncharacterized protein LOC114291567 [Camellia sinensis]3.5e-1843.65Show/hide
Query:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  F      G I PI  +      +    ELA+K+YNE+  T LE  + VK NL    G  YYITF+ K      + P
Subjt:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP

Query:  TTTFQALVFDGIKSVEVEFCRLAPSN
          TFQALV+DGI  +EV  CRL  SN
Subjt:  TTTFQALVFDGIKSVEVEFCRLAPSN

TrEMBL top hitse value%identityAlignment
A0A4S4DMB7 Cystatin domain-containing protein1.4e-1743.44Show/hide
Query:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  F      G I PI  +      +    ELA+K+YNE+  T LE  + VK NL    G  YYITF+ K      + P
Subjt:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP

Query:  TTTFQALVFDGIKSVEVEFCRL
          TFQALV+DGI  +EV  CRL
Subjt:  TTTFQALVFDGIKSVEVEFCRL

A0A4S4EJD4 Cystatin domain-containing protein7.1e-1738.04Show/hide
Query:  MASSSKFEKLDKDSDA--------FEESSVCSDSDG--EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQN
        + SS++ EK  +  D         ++ S V  DSDG  E TD E+++YN  I +S GFDV  F      G   P+  +    + +    ELA+K+YNE+ 
Subjt:  MASSSKFEKLDKDSDA--------FEESSVCSDSDG--EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQN

Query:  GTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHPTTTFQALVFDGIKSVEVEFCRLAPS
         TNLE  + VK N A  AG  YYITF+ + +    +  TT FQA V+DGI   EV   RL  S
Subjt:  GTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHPTTTFQALVFDGIKSVEVEFCRLAPS

A0A4S4EY14 Cystatin domain-containing protein1.7e-1843.65Show/hide
Query:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  F      G I PI  +      +    ELA+K+YNE+  T LE  + VK NL    G  YYITF+ K      + P
Subjt:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHP

Query:  TTTFQALVFDGIKSVEVEFCRLAPSN
          TFQALV+DGI  +EV  CRL  SN
Subjt:  TTTFQALVFDGIKSVEVEFCRLAPSN

A0A6J1IJT3 uncharacterized protein LOC1114753201.4e-1741.21Show/hide
Query:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPIT---YIKPWKEAIDRAGELAMKQYNEQNGT
        MASSS       E++D D + F+      D    +T +E  +Y  A+ +SQGFDVPYF      G I P+    + + +KE    A E A+K YN +NGT
Subjt:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPIT---YIKPWKEAIDRAGELAMKQYNEQNGT

Query:  NLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHPTTTFQALVFDGI---KSVEVEFCRLAPSN
        N E+ + VK N  G  G  YYITFN K  GT +  P+ TFQA V+  I     +EVE CR  PSN
Subjt:  NLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHPTTTFQALVFDGI---KSVEVEFCRLAPSN

A0A6J1IL21 uncharacterized protein LOC1114751789.2e-1741.21Show/hide
Query:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPIT---YIKPWKEAIDRAGELAMKQYNEQNGT
        MASSS       E++D D + F+      D    +T +E  +Y  A+ +SQGFDVPYF      G I P+    +   +KE    A E A+K YN +NGT
Subjt:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPIT---YIKPWKEAIDRAGELAMKQYNEQNGT

Query:  NLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHPTTTFQALVFDGI---KSVEVEFCRLAPSN
        N E+ + VK N AG  G  YYITFN K  GT +   + TFQA V+  I     +EVE CR  PSN
Subjt:  NLEITEYVKVNLAGAAGMHYYITFNAKLSGTPSNHPTTTFQALVFDGI---KSVEVEFCRLAPSN

SwissProt top hitse value%identityAlignment
Q9SV54 UPF0725 protein At4g289204.1e-0635.43Show/hide
Query:  SDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQ--PITYIKPWKEAIDRAGELAMKQYN-EQNGTNLE---ITEYVKVNLAGAAGMHYYITFN
        S+SD EM  EE + Y   + +S GFDV YFR      GI+  P+     +   I+  G L +  YN    GTNL+   I +Y   N+  ++G +YYIT  
Subjt:  SDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQ--PITYIKPWKEAIDRAGELAMKQYN-EQNGTNLE---ITEYVKVNLAGAAGMHYYITFN

Query:  AKLSGTPSNHPTTTFQALVFDGIKSVE
        A    T +N P  TFQ  V +  ++ E
Subjt:  AKLSGTPSNHPTTTFQALVFDGIKSVE

Arabidopsis top hitse value%identityAlignment
AT1G50690.1 Cystatin/monellin superfamily protein4.0e-0429.53Show/hide
Query:  KFEKLDKDSDA-FEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITY------IKPWKEA--IDRAGELAMKQYNEQNGTNLEI
        K +KL+++ +   EE S  S+S      E+ R   E   +S  +D    +   L+    P+ +       KP  +   + R  ++A+++YN+    NLE+
Subjt:  KFEKLDKDSDA-FEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITY------IKPWKEA--IDRAGELAMKQYNEQNGTNLEI

Query:  TEYVKVNLAGAAGMHYYITFNAKLSGTPSNHPTTTFQALV--FDGIKSV
           VK N    AG  +YITF AK     S+    TFQA V    GI++V
Subjt:  TEYVKVNLAGAAGMHYYITFNAKLSGTPSNHPTTTFQALV--FDGIKSV

AT2G37435.1 Cystatin/monellin superfamily protein2.5e-0627.13Show/hide
Query:  DEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITY------IKP--WKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGT
        +EE+    + +  S+GFD+  F  F  +   +P+        ++P   +E +DR    +++ +NE + T  E   ++K N   +AGM Y+ITF  KL   
Subjt:  DEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITY------IKP--WKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFNAKLSGT

Query:  PSNHPTTTFQALVFDGIKSVEVEFCRLAP
         ++  +  FQA +     + E+  C L P
Subjt:  PSNHPTTTFQALVFDGIKSVEVEFCRLAP

AT4G28920.1 Protein of unknown function (DUF626)2.9e-0735.43Show/hide
Query:  SDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQ--PITYIKPWKEAIDRAGELAMKQYN-EQNGTNLE---ITEYVKVNLAGAAGMHYYITFN
        S+SD EM  EE + Y   + +S GFDV YFR      GI+  P+     +   I+  G L +  YN    GTNL+   I +Y   N+  ++G +YYIT  
Subjt:  SDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQ--PITYIKPWKEAIDRAGELAMKQYN-EQNGTNLE---ITEYVKVNLAGAAGMHYYITFN

Query:  AKLSGTPSNHPTTTFQALVFDGIKSVE
        A    T +N P  TFQ  V +  ++ E
Subjt:  AKLSGTPSNHPTTTFQALVFDGIKSVE

AT5G17150.1 Cystatin/monellin superfamily protein1.4e-0427.81Show/hide
Query:  DSDAFEESSVCSDSDGEMTDEEFRQYNE-AIAKSQGFDVPYFRNFDLLGGIQPITYI-------KPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNL
        D D+++     SD +      +  +YNE  I K Q F    F   + L GI PI Y+          +E +     L +K+ N++ G  +E+ E V+V  
Subjt:  DSDAFEESSVCSDSDGEMTDEEFRQYNE-AIAKSQGFDVPYFRNFDLLGGIQPITYI-------KPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNL

Query:  AGAAGMHYYITFNAKLSGTPSNHPTTTFQALVFDGIKSVEVEF---CRLAP
        +G    + YITF A+      N P   +QA V +   + +  F   CR +P
Subjt:  AGAAGMHYYITFNAKLSGTPSNHPTTTFQALVFDGIKSVEVEF---CRLAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCATCATCAAAGTTTGAAAAATTGGACAAGGATTCCGATGCCTTTGAGGAATCTTCTGTTTGTAGTGATAGTGATGGTGAGATGACTGATGAAGAGTTTCGACA
ATATAATGAAGCAATTGCAAAGAGCCAGGGTTTTGATGTTCCATACTTTCGTAATTTTGATTTACTCGGTGGAATTCAGCCTATAACGTATATAAAACCATGGAAGGAAG
CAATCGATCGAGCTGGAGAGTTAGCCATGAAACAGTACAATGAGCAAAATGGTACAAACTTGGAGATTACAGAATACGTAAAGGTGAATTTGGCGGGAGCGGCTGGTATG
CATTATTATATTACTTTCAATGCAAAGCTGAGTGGAACTCCTTCCAACCATCCAACCACGACATTTCAAGCTCTTGTGTTCGATGGTATAAAATCTGTTGAAGTAGAATT
TTGTAGGCTAGCGCCTTCTAACAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCATCATCAAAGTTTGAAAAATTGGACAAGGATTCCGATGCCTTTGAGGAATCTTCTGTTTGTAGTGATAGTGATGGTGAGATGACTGATGAAGAGTTTCGACA
ATATAATGAAGCAATTGCAAAGAGCCAGGGTTTTGATGTTCCATACTTTCGTAATTTTGATTTACTCGGTGGAATTCAGCCTATAACGTATATAAAACCATGGAAGGAAG
CAATCGATCGAGCTGGAGAGTTAGCCATGAAACAGTACAATGAGCAAAATGGTACAAACTTGGAGATTACAGAATACGTAAAGGTGAATTTGGCGGGAGCGGCTGGTATG
CATTATTATATTACTTTCAATGCAAAGCTGAGTGGAACTCCTTCCAACCATCCAACCACGACATTTCAAGCTCTTGTGTTCGATGGTATAAAATCTGTTGAAGTAGAATT
TTGTAGGCTAGCGCCTTCTAACAACTAA
Protein sequenceShow/hide protein sequence
MASSSKFEKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLLGGIQPITYIKPWKEAIDRAGELAMKQYNEQNGTNLEITEYVKVNLAGAAGM
HYYITFNAKLSGTPSNHPTTTFQALVFDGIKSVEVEFCRLAPSNN