; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033468 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033468
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCystatin domain-containing protein
Genome locationscaffold5:2978024..2979747
RNA-Seq ExpressionSpg033468
SyntenySpg033468
Gene Ontology termsGO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain
IPR006525 - Cystatin-related, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
THG04121.1 hypothetical protein TEA_024172 [Camellia sinensis var. sinensis]2.5e-1640.16Show/hide
Query:  NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDH
        +++T++E+KEY     +S+GFDV  +P +   G ++P++NLN + + L     LA+K+YN++  T  E V+ VK N     GF Y++TF  K +   VD 
Subjt:  NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDH

Query:  PTTTFQARVFAGIQETEVDFCR
        P  TFQA V+ GI E EV  CR
Subjt:  PTTTFQARVFAGIQETEVDFCR

XP_022137488.1 uncharacterized protein LOC111008920 isoform X1 [Momordica charantia]8.2e-2043.09Show/hide
Query:  DLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDHP
        ++ EEE   Y  A  +SEGFDVP +P  +   ++ P+  +N   +E+ E  G AIK YN++NG +FE V+ +K N+  A G  +F+TF+ K +G P D P
Subjt:  DLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDHP

Query:  TTTFQARVFAGI--QETEVDFCR
        TTT QARV AGI   + +V  CR
Subjt:  TTTFQARVFAGI--QETEVDFCR

XP_023525925.1 uncharacterized protein LOC111789396 [Cucurbita pepo subsp. pepo]1.4e-1640.71Show/hide
Query:  DCSNSDTDFANGELNVENDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGF
        D  +SD ++ +G + V N    EE   Y      ++GFDVP +P+ +  G+++P+      K  L +    AI  YN +NGTNFE V+ VK N  V +GF
Subjt:  DCSNSDTDFANGELNVENDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGF

Query:  FYFMTFKAKPSGTPVDHPTTTFQARVFAGIQET-EVDFCR
        FY++TF  K  GT  + PTTTF+A+V  GI +T EV  CR
Subjt:  FYFMTFKAKPSGTPVDHPTTTFQARVFAGIQET-EVDFCR

XP_028071891.1 multicystatin-like [Camellia sinensis]1.4e-1642.37Show/hide
Query:  NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDH
        ++ TE E++E+     +S+GFDV  +P +   G   P+ N N   D+L  +  LA+K+YN++  TN E V+ VKVN AV AGF Y++TF  + +   VD 
Subjt:  NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDH

Query:  PTTTFQARVFAGIQETEV
         TT FQA V+ GI  TEV
Subjt:  PTTTFQARVFAGIQETEV

XP_028091213.1 uncharacterized protein LOC114291567 [Camellia sinensis]2.5e-1640.16Show/hide
Query:  NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDH
        +++T++E+KEY     +S+GFDV  +P +   G ++P++NLN + + L     LA+K+YN++  T  E V+ VK N     GF Y++TF  K +   VD 
Subjt:  NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDH

Query:  PTTTFQARVFAGIQETEVDFCR
        P  TFQA V+ GI E EV  CR
Subjt:  PTTTFQARVFAGIQETEVDFCR

TrEMBL top hitse value%identityAlignment
A0A4S4DMB7 Cystatin domain-containing protein1.2e-1640.16Show/hide
Query:  NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDH
        +++T++E+KEY     +S+GFDV  +P +   G ++P++NLN + + L     LA+K+YN++  T  E V+ VK N     GF Y++TF  K +   VD 
Subjt:  NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDH

Query:  PTTTFQARVFAGIQETEVDFCR
        P  TFQA V+ GI E EV  CR
Subjt:  PTTTFQARVFAGIQETEVDFCR

A0A4S4EHW2 Cystatin domain-containing protein2.0e-1641.88Show/hide
Query:  NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDH
        ++ TE E++E+     +S+GFDV  +P +   G   P+ N N   D+L  +  LA+K+YN++  TN E V+ VKVN AV AGF Y++TF  + +   VD 
Subjt:  NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDH

Query:  PTTTFQARVFAGIQETE
         TT FQA V+ GI  TE
Subjt:  PTTTFQARVFAGIQETE

A0A4S4EJD4 Cystatin domain-containing protein2.7e-1638.64Show/hide
Query:  NSDTDFANGELNVENDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYF
        N D      + +  ++ T+ E++EY     +S+GFDV  +P +   G   P+ NLN   D+L     LA+K+YN++  TN E V+ VK N AV AG  Y+
Subjt:  NSDTDFANGELNVENDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYF

Query:  MTFKAKPSGTPVDHPTTTFQARVFAGIQETEV
        +TF  + +   VD  TT FQA V+ GI  TEV
Subjt:  MTFKAKPSGTPVDHPTTTFQARVFAGIQETEV

A0A4S4EY14 Cystatin domain-containing protein3.5e-1638.46Show/hide
Query:  NSDTDFANGELNVE-------NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAV
        N D+D    E+N E       +++T++E+KEY     +S+GFDV  +P +   G ++P++NLN + + L     LA+K+YN++  T  E V+ VK N   
Subjt:  NSDTDFANGELNVE-------NDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAV

Query:  AAGFFYFMTFKAKPSGTPVDHPTTTFQARVFAGIQETEVDFCR
          GF Y++TF  K +   VD P  TFQA V+ GI E EV  CR
Subjt:  AAGFFYFMTFKAKPSGTPVDHPTTTFQARVFAGIQETEVDFCR

A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X14.0e-2043.09Show/hide
Query:  DLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDHP
        ++ EEE   Y  A  +SEGFDVP +P  +   ++ P+  +N   +E+ E  G AIK YN++NG +FE V+ +K N+  A G  +F+TF+ K +G P D P
Subjt:  DLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDHP

Query:  TTTFQARVFAGI--QETEVDFCR
        TTT QARV AGI   + +V  CR
Subjt:  TTTFQARVFAGI--QETEVDFCR

SwissProt top hitse value%identityAlignment
Q9SV54 UPF0725 protein At4g289203.8e-0429.69Show/hide
Query:  NGELNVENDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYN-DQNGTNFEIVEFVKVNT---AVAAGFFYFMTF
        + E ++E D   EE K Y     +S+GFDV Y+ +A +     P+ + N Y  +++ +G L +  YN    GTN +++   K NT    V++G +Y++T 
Subjt:  NGELNVENDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYN-DQNGTNFEIVEFVKVNT---AVAAGFFYFMTF

Query:  KAKPSGTPVDHPTTTFQARVFAGIQETE
        +A  +    ++   TFQ  V    Q +E
Subjt:  KAKPSGTPVDHPTTTFQARVFAGIQETE

Arabidopsis top hitse value%identityAlignment
AT1G63190.1 Cystatin/monellin superfamily protein3.0e-0430.09Show/hide
Query:  SEGFDVPYYPHAHVPGMVLPMLNLNRYKDE-------LDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDHPTTTFQARV-
        S+GFD+ +     V    L  L+ + + DE       L+     A+  YN ++ T FE V+ VK N        + +TF+      P D+    FQ RV 
Subjt:  SEGFDVPYYPHAHVPGMVLPMLNLNRYKDE-------LDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFYFMTFKAKPSGTPVDHPTTTFQARV-

Query:  FAGIQETEVDFCR
         A    TE  FCR
Subjt:  FAGIQETEVDFCR

AT4G28920.1 Protein of unknown function (DUF626)2.7e-0529.69Show/hide
Query:  NGELNVENDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYN-DQNGTNFEIVEFVKVNT---AVAAGFFYFMTF
        + E ++E D   EE K Y     +S+GFDV Y+ +A +     P+ + N Y  +++ +G L +  YN    GTN +++   K NT    V++G +Y++T 
Subjt:  NGELNVENDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYN-DQNGTNFEIVEFVKVNT---AVAAGFFYFMTF

Query:  KAKPSGTPVDHPTTTFQARVFAGIQETE
        +A  +    ++   TFQ  V    Q +E
Subjt:  KAKPSGTPVDHPTTTFQARVFAGIQETE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCGACATCAAAAGTTGAAGACTGCTCCAACTCGGATACTGATTTTGCTAATGGGGAATTGAATGTGGAGAATGATTTGACTGAAGAGGAGTTCAAAGAATATCT
TGCAGCAGAAGCAAAGAGCGAGGGTTTTGATGTTCCATACTATCCGCATGCCCATGTACCTGGTATGGTCTTGCCTATGCTGAATTTAAATCGTTACAAGGATGAGCTTG
ATGAATATGGAGGCTTAGCCATTAAACAGTATAATGACCAAAATGGAACCAACTTTGAGATAGTAGAATTTGTTAAGGTCAATACTGCAGTTGCAGCTGGTTTTTTTTAT
TTCATGACCTTCAAGGCAAAGCCTAGTGGAACCCCTGTCGATCACCCGACCACGACGTTTCAAGCTCGAGTGTTCGCTGGTATTCAAGAAACGGAGGTGGATTTTTGCAG
GAAAGCTGTGATATCTGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCGACATCAAAAGTTGAAGACTGCTCCAACTCGGATACTGATTTTGCTAATGGGGAATTGAATGTGGAGAATGATTTGACTGAAGAGGAGTTCAAAGAATATCT
TGCAGCAGAAGCAAAGAGCGAGGGTTTTGATGTTCCATACTATCCGCATGCCCATGTACCTGGTATGGTCTTGCCTATGCTGAATTTAAATCGTTACAAGGATGAGCTTG
ATGAATATGGAGGCTTAGCCATTAAACAGTATAATGACCAAAATGGAACCAACTTTGAGATAGTAGAATTTGTTAAGGTCAATACTGCAGTTGCAGCTGGTTTTTTTTAT
TTCATGACCTTCAAGGCAAAGCCTAGTGGAACCCCTGTCGATCACCCGACCACGACGTTTCAAGCTCGAGTGTTCGCTGGTATTCAAGAAACGGAGGTGGATTTTTGCAG
GAAAGCTGTGATATCTGGTTGA
Protein sequenceShow/hide protein sequence
MASTSKVEDCSNSDTDFANGELNVENDLTEEEFKEYLAAEAKSEGFDVPYYPHAHVPGMVLPMLNLNRYKDELDEYGGLAIKQYNDQNGTNFEIVEFVKVNTAVAAGFFY
FMTFKAKPSGTPVDHPTTTFQARVFAGIQETEVDFCRKAVISG