; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036143 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036143
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCystatin domain-containing protein
Genome locationscaffold5:48321894..48332865
RNA-Seq ExpressionSpg036143
SyntenySpg036143
Gene Ontology termsGO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain
IPR006525 - Cystatin-related, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
THG21948.1 hypothetical protein TEA_004640 [Camellia sinensis var. sinensis]1.8e-1642.86Show/hide
Query:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  F      G I PI  +      +    ELA+K+YNE+  T LE  + VK NL    G  YYITF  K      + P
Subjt:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHP

Query:  TTTFQARVFDGMESIEVDFCRLAPSN
          TFQA V+DG++ IEV  CRL  SN
Subjt:  TTTFQARVFDGMESIEVDFCRLAPSN

XP_022149809.1 uncharacterized protein LOC111018153 [Momordica charantia]1.4e-1647.12Show/hide
Query:  YFRNFDLPGGIQPITYI-KPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGMESIEVDFCRL
        YFR   L   I PI  I + + EEI    + AIK YN++N TN E+ E  K N+ G+ G+  YITFK K SGTP+ +PTTTFQA+V D      ++ CR+
Subjt:  YFRNFDLPGGIQPITYI-KPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGMESIEVDFCRL

Query:  APSN
         PSN
Subjt:  APSN

XP_022975633.1 uncharacterized protein LOC111475320 [Cucurbita maxima]3.0e-1638.24Show/hide
Query:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWK--------EEIDRAGELAIKQYN
        MASSS       E++D D + F+      D    +T +E  +Y  A+ +SQGFDVPYF      G +     I P K        +E+  +   AIK YN
Subjt:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWK--------EEIDRAGELAIKQYN

Query:  EQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGM---ESIEVDFCRLAPSN
         +NGTN E+ + VK N  G  G  YYITF  K  GT +  P+ TFQA+V+  +   + IEV+ CR  PSN
Subjt:  EQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGM---ESIEVDFCRLAPSN

XP_028053036.1 uncharacterized protein LOC114257478 [Camellia sinensis]1.8e-1642.86Show/hide
Query:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  F      G I PI  +      +    ELA+K+YNE+  T LE  + VK NL    G  YYITF  K      + P
Subjt:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHP

Query:  TTTFQARVFDGMESIEVDFCRLAPSN
          TFQA V+DG++ IEV  CRL  SN
Subjt:  TTTFQARVFDGMESIEVDFCRLAPSN

XP_028091213.1 uncharacterized protein LOC114291567 [Camellia sinensis]1.8e-1642.86Show/hide
Query:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  F      G I PI  +      +    ELA+K+YNE+  T LE  + VK NL    G  YYITF  K      + P
Subjt:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHP

Query:  TTTFQARVFDGMESIEVDFCRLAPSN
          TFQA V+DG++ IEV  CRL  SN
Subjt:  TTTFQARVFDGMESIEVDFCRLAPSN

TrEMBL top hitse value%identityAlignment
A0A4S4EY14 Cystatin domain-containing protein8.6e-1742.86Show/hide
Query:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHP
        EMTD+E+++Y+  I +S+GFDV  F      G I PI  +      +    ELA+K+YNE+  T LE  + VK NL    G  YYITF  K      + P
Subjt:  EMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHP

Query:  TTTFQARVFDGMESIEVDFCRLAPSN
          TFQA V+DG++ IEV  CRL  SN
Subjt:  TTTFQARVFDGMESIEVDFCRLAPSN

A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X13.3e-1637.5Show/hide
Query:  DSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHY
        DSD + +  V      EM +EE  +Y +AI +S+GFDVP F        I P+  I    EE+      AIK YN +NG + E  + +K N   A G  +
Subjt:  DSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHY

Query:  YITFKAKLSGTPSNHPTTTFQARVFDGM--ESIEVDFCRLAPSN
        ++TF+ K +G P + PTTT QARV  G+  +  +V  CR  P+N
Subjt:  YITFKAKLSGTPSNHPTTTFQARVFDGM--ESIEVDFCRLAPSN

A0A6J1D850 uncharacterized protein LOC1110181536.6e-1747.12Show/hide
Query:  YFRNFDLPGGIQPITYI-KPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGMESIEVDFCRL
        YFR   L   I PI  I + + EEI    + AIK YN++N TN E+ E  K N+ G+ G+  YITFK K SGTP+ +PTTTFQA+V D      ++ CR+
Subjt:  YFRNFDLPGGIQPITYI-KPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGMESIEVDFCRL

Query:  APSN
         PSN
Subjt:  APSN

A0A6J1IJT3 uncharacterized protein LOC1114753201.5e-1638.24Show/hide
Query:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWK--------EEIDRAGELAIKQYN
        MASSS       E++D D + F+      D    +T +E  +Y  A+ +SQGFDVPYF      G +     I P K        +E+  +   AIK YN
Subjt:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWK--------EEIDRAGELAIKQYN

Query:  EQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGM---ESIEVDFCRLAPSN
         +NGTN E+ + VK N  G  G  YYITF  K  GT +  P+ TFQA+V+  +   + IEV+ CR  PSN
Subjt:  EQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGM---ESIEVDFCRLAPSN

A0A6J1IL21 uncharacterized protein LOC1114751784.2e-1638.24Show/hide
Query:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWK--------EEIDRAGELAIKQYN
        MASSS       E++D D + F+      D    +T +E  +Y  A+ +SQGFDVPYF      G +     I P K        +E+  +   AIK YN
Subjt:  MASSSKF-----EKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWK--------EEIDRAGELAIKQYN

Query:  EQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGM---ESIEVDFCRLAPSN
         +NGTN E+ + VK N AG  G  YYITF  K  GT +   + TFQA+V+  +   + IEV+ CR  PSN
Subjt:  EQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGM---ESIEVDFCRLAPSN

SwissProt top hitse value%identityAlignment
Q9SV54 UPF0725 protein At4g289205.0e-0636.64Show/hide
Query:  SDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEE------IDRAGELAIKQYN-EQNGTNLE---ITEYVKVNLAGAAGMHYY
        S+SD EM  EE + Y   + +S GFDV YFR      GI+P     P K+E      I+  G L +  YN    GTNL+   I +Y   N+  ++G +YY
Subjt:  SDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEE------IDRAGELAIKQYN-EQNGTNLE---ITEYVKVNLAGAAGMHYY

Query:  ITFKAKLSGTPSNHPTTTFQARVFDGMESIE
        IT +A    T +N P  TFQ  V +  ++ E
Subjt:  ITFKAKLSGTPSNHPTTTFQARVFDGMESIE

Arabidopsis top hitse value%identityAlignment
AT4G28920.1 Protein of unknown function (DUF626)3.6e-0736.64Show/hide
Query:  SDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEE------IDRAGELAIKQYN-EQNGTNLE---ITEYVKVNLAGAAGMHYY
        S+SD EM  EE + Y   + +S GFDV YFR      GI+P     P K+E      I+  G L +  YN    GTNL+   I +Y   N+  ++G +YY
Subjt:  SDSDGEMTDEEFRQYNEAIAKSQGFDVPYFRNFDLPGGIQPITYIKPWKEE------IDRAGELAIKQYN-EQNGTNLE---ITEYVKVNLAGAAGMHYY

Query:  ITFKAKLSGTPSNHPTTTFQARVFDGMESIE
        IT +A    T +N P  TFQ  V +  ++ E
Subjt:  ITFKAKLSGTPSNHPTTTFQARVFDGMESIE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAACGAGTAAGGACACATGTAAATTTACATCGATCCAGAAAATGGTGATTTGTAAATTTACATCGATCCAGAAAATGGCAATTTTGGACCACCCCGATATACAAGG
AGCTGACAAGGACAATCGGGAAGAAATCGGGCTGAGAGATGGACCAAGGAGGCAAAACCGGCAAGTGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTTGGCCCGACCCC
CTGCTCGGCCTCGGCCGACCCTCGGCCCGCTCGCGCGAGCCGAGCCCGTCCTACTCCATTTGGTCCCCACCGCCTTTGGTCGCCTCGGTTTCGCCTGCATTGGAGCCGGT
GTGGCGAGCACCACACCGATGTGCAGGTTTACTGTCTTGCAGGCCACGTCTTCCCCCTCATCTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGTACCCTCTTTCGTA
TTCTTTAACTTCTGTTTCTTGGCCATCTAATTGGCATTGTCGATTGGAAAACCTAAGGATCCACGGCGATAAGGATCCACTCCGCGGGCGATCGGAACTTGATCGGCAGC
TTGCTGGTCTTGGTCTCTGTGATGTTATGTTATGTTATGTTGCTCTGTTTGCTTTACTCTGCTTCCTCCTTTGCTTATCGATTGATTTTGATGGCTTCTCCACCCGAATA
CCGAATACCACGGTTGTTCTGAGTCATATGGCTTCATCATCAAAGTTCGAAAAATTGGAAAAGGATTCCGATGCGTATCTGGAATCTTTTGTTTATAAGGAATTTGAAGA
TGAGATGACTGATGAAGAGTTTCGACAATATAATCGAGCATTTGGAAAGAGCGAGGGTTTTGATGTTCCATACTTTCCTAATAATGACGTATGCGGTATAATTCAGCCTG
TGATGAATATAAAACCATTTAAGAAAGACATTGACCGAGTTGGAGAGTTAGCCATGAAACAGTACAATGAGCAAAATGGTACAAGCTTCGAGATTACAGAATATGTAAAG
GTGAATTCTGGGGCAGCGGCCGGGAAAAATACACTTTTAGTCCCTGAGGGCTTTTTCTTTTCCCAATTCAACGTATCAAATAATCCTCCCAATAATTGGCATCGTCGATT
GGAAAACCTAAGGATCCACTCCGCCGGCGACAAGGATCCACTCCGCGGGCGATCGGAACTTGATCGGCAGATGGCTTCATCATCAAAGTTCGAAAAATTGGACAAGGATT
CCGATGCCTTTGAGGAATCTTCTGTTTGTAGTGATAGTGATGGTGAGATGACTGATGAAGAGTTTCGACAATATAATGAAGCAATTGCAAAGAGCCAGGGTTTTGATGTT
CCATACTTTCGTAATTTTGATTTACCCGGTGGAATTCAGCCTATAACGTATATAAAACCATGGAAGGAAGAAATCGATCGAGCTGGAGAGTTAGCCATTAAACAGTACAA
TGAGCAAAATGGTACAAACTTGGAGATTACAGAATACGTAAAGGTGAATTTGGCGGGAGCGGCTGGTATGCATTATTATATTACTTTCAAGGCAAAGCTGAGTGGAACTC
CTTCCAACCACCCAACCACGACATTTCAAGCTCGGGTGTTCGATGGTATGGAATCTATTGAAGTAGACTTTTGTAGGCTAGCGCCCTCCAACAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTAACGAGTAAGGACACATGTAAATTTACATCGATCCAGAAAATGGTGATTTGTAAATTTACATCGATCCAGAAAATGGCAATTTTGGACCACCCCGATATACAAGG
AGCTGACAAGGACAATCGGGAAGAAATCGGGCTGAGAGATGGACCAAGGAGGCAAAACCGGCAAGTGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTTGGCCCGACCCC
CTGCTCGGCCTCGGCCGACCCTCGGCCCGCTCGCGCGAGCCGAGCCCGTCCTACTCCATTTGGTCCCCACCGCCTTTGGTCGCCTCGGTTTCGCCTGCATTGGAGCCGGT
GTGGCGAGCACCACACCGATGTGCAGGTTTACTGTCTTGCAGGCCACGTCTTCCCCCTCATCTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGTACCCTCTTTCGTA
TTCTTTAACTTCTGTTTCTTGGCCATCTAATTGGCATTGTCGATTGGAAAACCTAAGGATCCACGGCGATAAGGATCCACTCCGCGGGCGATCGGAACTTGATCGGCAGC
TTGCTGGTCTTGGTCTCTGTGATGTTATGTTATGTTATGTTGCTCTGTTTGCTTTACTCTGCTTCCTCCTTTGCTTATCGATTGATTTTGATGGCTTCTCCACCCGAATA
CCGAATACCACGGTTGTTCTGAGTCATATGGCTTCATCATCAAAGTTCGAAAAATTGGAAAAGGATTCCGATGCGTATCTGGAATCTTTTGTTTATAAGGAATTTGAAGA
TGAGATGACTGATGAAGAGTTTCGACAATATAATCGAGCATTTGGAAAGAGCGAGGGTTTTGATGTTCCATACTTTCCTAATAATGACGTATGCGGTATAATTCAGCCTG
TGATGAATATAAAACCATTTAAGAAAGACATTGACCGAGTTGGAGAGTTAGCCATGAAACAGTACAATGAGCAAAATGGTACAAGCTTCGAGATTACAGAATATGTAAAG
GTGAATTCTGGGGCAGCGGCCGGGAAAAATACACTTTTAGTCCCTGAGGGCTTTTTCTTTTCCCAATTCAACGTATCAAATAATCCTCCCAATAATTGGCATCGTCGATT
GGAAAACCTAAGGATCCACTCCGCCGGCGACAAGGATCCACTCCGCGGGCGATCGGAACTTGATCGGCAGATGGCTTCATCATCAAAGTTCGAAAAATTGGACAAGGATT
CCGATGCCTTTGAGGAATCTTCTGTTTGTAGTGATAGTGATGGTGAGATGACTGATGAAGAGTTTCGACAATATAATGAAGCAATTGCAAAGAGCCAGGGTTTTGATGTT
CCATACTTTCGTAATTTTGATTTACCCGGTGGAATTCAGCCTATAACGTATATAAAACCATGGAAGGAAGAAATCGATCGAGCTGGAGAGTTAGCCATTAAACAGTACAA
TGAGCAAAATGGTACAAACTTGGAGATTACAGAATACGTAAAGGTGAATTTGGCGGGAGCGGCTGGTATGCATTATTATATTACTTTCAAGGCAAAGCTGAGTGGAACTC
CTTCCAACCACCCAACCACGACATTTCAAGCTCGGGTGTTCGATGGTATGGAATCTATTGAAGTAGACTTTTGTAGGCTAGCGCCCTCCAACAACTGA
Protein sequenceShow/hide protein sequence
MVTSKDTCKFTSIQKMVICKFTSIQKMAILDHPDIQGADKDNREEIGLRDGPRRQNRQVGRAKTEGVGFLARPPARPRPTLGPLARAEPVLLHLVPTAFGRLGFACIGAG
VASTTPMCRFTVLQATSSPSSTNLPLVAREGQYPLSYSLTSVSWPSNWHCRLENLRIHGDKDPLRGRSELDRQLAGLGLCDVMLCYVALFALLCFLLCLSIDFDGFSTRI
PNTTVVLSHMASSSKFEKLEKDSDAYLESFVYKEFEDEMTDEEFRQYNRAFGKSEGFDVPYFPNNDVCGIIQPVMNIKPFKKDIDRVGELAMKQYNEQNGTSFEITEYVK
VNSGAAAGKNTLLVPEGFFFSQFNVSNNPPNNWHRRLENLRIHSAGDKDPLRGRSELDRQMASSSKFEKLDKDSDAFEESSVCSDSDGEMTDEEFRQYNEAIAKSQGFDV
PYFRNFDLPGGIQPITYIKPWKEEIDRAGELAIKQYNEQNGTNLEITEYVKVNLAGAAGMHYYITFKAKLSGTPSNHPTTTFQARVFDGMESIEVDFCRLAPSNN