CuGenDBv2

Tan0021276 (gene) of Snake gourd v1 genome

Gene ID: Tan0021276
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Cystatin domain-containing protein
Genome location: LG03:2025502..2025959
RNA-Seq Expression: Tan0021276
Synteny: Tan0021276
Gene Ontology terms: GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domains: IPR000010 - Cystatin domain
                  IPR027214 - Cystatin


Homology
GenBank top hits (e value, % identity, alignment)
KAA0040521.1 Cystatin [Cucumis melo var. makuwa]; e value 7.8e-09; identity 43.02%
Query:  MSNVEAVEGNPLPIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFR-TSTTFDIVVEAENNDGINWTYVAKV
        MS+ +    +P+PIKD+D+ ++Q +G FAV E+N+++G NL+F +V+NG        G R     F IV+ A+N +GI WTY AKV
Subjt:  MSNVEAVEGNPLPIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFR-TSTTFDIVVEAENNDGINWTYVAKV

KAE8649716.1 hypothetical protein Csa_012476 [Cucumis sativus]; e value 1.4e-13; identity 49.51%
Query:  PIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGF------RTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFP---PTSLQSFQPI
        PIKDI+ PQVQEIG+  V+++NKK G NLKF RVVNG     L AGF         T +++V+EA N   INWTY  KVS  T   P     + QSF+P+
Subjt:  PIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGF------RTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFP---PTSLQSFQPI

Query:  LKY
        L+Y
Subjt:  LKY

KGN54823.1 hypothetical protein Csa_012607 [Cucumis sativus]; e value 8.4e-11; identity 43.75%
Query:  PIKDIDAPQVQEIGSFAV-NEYNKKEGANLKFKRVVNGLRTTLLLA-GFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPPTSLQSFQPILKY
        PI DI+ P VQ +G  AV N Y+K  G  LKF RVVNGL++   +  GF     + +V+EA+ N+ INWTY  K+  V+G        SF+P+L Y
Subjt:  PIKDIDAPQVQEIGSFAV-NEYNKKEGANLKFKRVVNGLRTTLLLA-GFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPPTSLQSFQPILKY

XP_008442277.1 PREDICTED: cysteine proteinase inhibitor A-like [Cucumis melo]; e value 5.4e-10; identity 43.27%
Query:  PIKDI-DAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLL-----LAGFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPP----TSLQSFQP
        PI+DI D  +VQ++G+  V+++NKK G NLKF RVVNGL  +       L G      +++V+EA     INWTY A+VS      PP     S QSF+P
Subjt:  PIKDI-DAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLL-----LAGFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPP----TSLQSFQP

Query:  ILKY
        +L+Y
Subjt:  ILKY

XP_038891087.1 uncharacterized protein LOC120080488 isoform X2 [Benincasa hispida]; e value 5.1e-08; identity 46.05%
Query:  NPLPIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFRT-STTFDIVVEAENNDGINWTYVAK
        +P+PIKD+ + ++Q  GSFAV E+N+  G  L+F+RV+NGL       G R+    F +V+ A+N+ GINWTY AK
Subjt:  NPLPIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFRT-STTFDIVVEAENNDGINWTYVAK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KZ98 Uncharacterized protein; e value 4.0e-11; identity 43.75%
Query:  PIKDIDAPQVQEIGSFAV-NEYNKKEGANLKFKRVVNGLRTTLLLA-GFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPPTSLQSFQPILKY
        PI DI+ P VQ +G  AV N Y+K  G  LKF RVVNGL++   +  GF     + +V+EA+ N+ INWTY  K+  V+G        SF+P+L Y
Subjt:  PIKDIDAPQVQEIGSFAV-NEYNKKEGANLKFKRVVNGLRTTLLLA-GFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPPTSLQSFQPILKY

A0A0A0L191 Cystatin domain-containing protein; e value 1.3e-09; identity 38.89%
Query:  NVEAVEGNPLPIKDIDAPQVQEIGSFAVNEYNK-KEGANLKFKRVVNGLRTTLLLA-GFRTSTTFDIVVEAENNDGINWTYVAKVSHV-TGFFPPTSLQS
        ++E +     PI DI+ P VQ IG  AV +++    G  LKF RVVNGL++   +  GF     + +V+EA+ N+ INWTY  K+  V +G        S
Subjt:  NVEAVEGNPLPIKDIDAPQVQEIGSFAVNEYNK-KEGANLKFKRVVNGLRTTLLLA-GFRTSTTFDIVVEAENNDGINWTYVAKVSHV-TGFFPPTSLQS

Query:  FQPILKYN
        F+P+L YN
Subjt:  FQPILKYN

A0A1S3B5B9 cysteine proteinase inhibitor A-like; e value 2.6e-10; identity 43.27%
Query:  PIKDI-DAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLL-----LAGFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPP----TSLQSFQP
        PI+DI D  +VQ++G+  V+++NKK G NLKF RVVNGL  +       L G      +++V+EA     INWTY A+VS      PP     S QSF+P
Subjt:  PIKDI-DAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLL-----LAGFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPP----TSLQSFQP

Query:  ILKY
        +L+Y
Subjt:  ILKY

A0A5A7TFU6 Cystatin; e value 3.8e-09; identity 43.02%
Query:  MSNVEAVEGNPLPIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFR-TSTTFDIVVEAENNDGINWTYVAKV
        MS+ +    +P+PIKD+D+ ++Q +G FAV E+N+++G NL+F +V+NG        G R     F IV+ A+N +GI WTY AKV
Subjt:  MSNVEAVEGNPLPIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFR-TSTTFDIVVEAENNDGINWTYVAKV

A0A5A7TJK0 Cysteine proteinase inhibitor A-like; e value 2.6e-10; identity 43.27%
Query:  PIKDI-DAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLL-----LAGFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPP----TSLQSFQP
        PI+DI D  +VQ++G+  V+++NKK G NLKF RVVNGL  +       L G      +++V+EA     INWTY A+VS      PP     S QSF+P
Subjt:  PIKDI-DAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLL-----LAGFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPP----TSLQSFQP

Query:  ILKY
        +L+Y
Subjt:  ILKY

SwissProt top hits (e value, % identity, alignment)
Q41916 Cysteine proteinase inhibitor 5; e value 3.4e-07; identity 42.47%
Query:  PIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFRTSTTFDIVVEAENNDGINWTYVAKV
        PI ++  PQV EIG FAV+EYNK+  + LKF+ VV+G   T +++G    T + + V A + DG++  Y+A V
Subjt:  PIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFRTSTTFDIVVEAENNDGINWTYVAKV

Arabidopsis top hits (e value, % identity, alignment)
AT4G16500.1 Cystatin/monellin superfamily protein; e value 1.2e-04; identity 30.53%
Query:  GNPLPIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPPTSLQSFQPI
        G+  PIK++  P V  +  +A+ E+NK+    L F +VV G  TT +++G    T +D+ + A++  G    Y A V     +    SL+SF+ +
Subjt:  GNPLPIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPPTSLQSFQPI

AT5G47550.1 Cystatin/monellin superfamily protein; e value 2.4e-08; identity 42.47%
Query:  PIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFRTSTTFDIVVEAENNDGINWTYVAKV
        PI ++  PQV EIG FAV+EYNK+  + LKF+ VV+G   T +++G    T + + V A + DG++  Y+A V
Subjt:  PIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFRTSTTFDIVVEAENNDGINWTYVAKV


Sequences
CDS sequence
ATGAGTAATGTGGAAGCAGTTGAGGGTAATCCATTACCGATTAAGGATATCGATGCGCCGCAAGTGCAAGAAATTGGAAGTTTTGCCGTGAATGAATATAACAAGAAAGA
AGGGGCGAATTTGAAGTTCAAACGCGTAGTGAATGGTTTGAGAACAACTCTTTTGCTTGCGGGTTTTCGTACATCAACAACATTTGATATTGTTGTGGAGGCCGAAAACA
ATGATGGAATTAATTGGACATATGTGGCTAAAGTCTCACATGTTACCGGATTCTTTCCCCCTACTTCTCTCCAGTCCTTTCAACCTATCCTTAAATACAACATCTAA
mRNA sequence
GAGGTTTAAAGGAGAGTTATAAATAGTGAAGTTGTGTTTGAATATTTCATACAAATCAAGTATTTTTCAATGAGTAATGTGGAAGCAGTTGAGGGTAATCCATTACCGAT
TAAGGATATCGATGCGCCGCAAGTGCAAGAAATTGGAAGTTTTGCCGTGAATGAATATAACAAGAAAGAAGGGGCGAATTTGAAGTTCAAACGCGTAGTGAATGGTTTGA
GAACAACTCTTTTGCTTGCGGGTTTTCGTACATCAACAACATTTGATATTGTTGTGGAGGCCGAAAACAATGATGGAATTAATTGGACATATGTGGCTAAAGTCTCACAT
GTTACCGGATTCTTTCCCCCTACTTCTCTCCAGTCCTTTCAACCTATCCTTAAATACAACATCTAATACTTCAAATCCTGTCTTTTATATAATATCAATAAAATAATATC
AATTATACTTATCTATTA
Protein sequence
MSNVEAVEGNPLPIKDIDAPQVQEIGSFAVNEYNKKEGANLKFKRVVNGLRTTLLLAGFRTSTTFDIVVEAENNDGINWTYVAKVSHVTGFFPPTSLQSFQPILKYNI
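
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the two short fragments are illustrative choices, with the fragments copied from the start and end of the CDS listed above. It assumes the standard nuclear code applies and that the reading frame starts at the first base.

```python
# Standard genetic code, built in TCAG order (assumption: the standard
# nuclear code is the right one for this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped records
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 nucleotides of the CDS shown in this record.
cds_start = "ATGAGTAATGTGGAAGCAGTTGAGGGTAAT"
cds_end = "CCTATCCTTAAATACAACATCTAA"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS (with its three wrapped lines joined) to `translate` allows the result to be compared with the protein sequence in this record.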