; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008599 (gene) of Snake gourd v1 genome

Gene IDTan0008599
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCysteine proteinase inhibitor, putative
Genome locationLG03:64243275..64243747
RNA-Seq ExpressionTan0008599
SyntenyTan0008599
Gene Ontology termsGO:0010466 - negative regulation of peptidase activity (biological process)
GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3974132.1 hypothetical protein CMV_002505 [Castanea mollissima]2.4e-1857.3Show/hide
Query:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPL
        GGW+ IKN   +PHV++IGE+AVKEYNKE+ + L F SV  G+ QVVAG+NY L++   DGA   + Y A VWEKVWE FR LT F P+
Subjt:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPL

RRT32119.1 hypothetical protein B296_00051057 [Ensete ventricosum]4.8e-1956.99Show/hide
Query:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK
        SS    GGW  IKN N DPHVR+I E+A+ E+NK+   SL    V  G+ QVVAG NY LVL V D + +L KY A VWEK WE+FRQLT FK
Subjt:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK

THU72579.1 hypothetical protein C4D60_Mb04t13660 [Musa balbisiana]5.3e-1856.99Show/hide
Query:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK
        SS    GGW  IKN  DDPHVR+I E+AV E+NK+   SL    VE G+ QVVAG NY LVL V D +    KY A VWEK W  FRQLT FK
Subjt:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK

XP_009394387.1 PREDICTED: cysteine proteinase inhibitor 1 isoform X1 [Musa acuminata subsp. malaccensis]1.2e-1752.08Show/hide
Query:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPLI
        SS    GGW  IK+ N DPHV++I ++AV E+NK+   +L    V  G+ QVV+G NY LVL   DG+    KY A VWEK WEKFRQLT FK L+
Subjt:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPLI

XP_009396391.1 PREDICTED: cysteine proteinase inhibitor 1-like [Musa acuminata subsp. malaccensis]6.9e-1856.99Show/hide
Query:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK
        SS    GGW  IKN  DDPHVR+I E+AV E+NK+   SL    V  G+ QVVAG NY LVL V D +    KY A VWEK WE FRQLT FK
Subjt:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK

TrEMBL top hitse value%identityAlignment
A0A426WY08 Cystatin domain-containing protein2.3e-1956.99Show/hide
Query:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK
        SS    GGW  IKN N DPHVR+I E+A+ E+NK+   SL    V  G+ QVVAG NY LVL V D + +L KY A VWEK WE+FRQLT FK
Subjt:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK

A0A4S8KBV9 Cystatin domain-containing protein2.6e-1856.99Show/hide
Query:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK
        SS    GGW  IKN  DDPHVR+I E+AV E+NK+   SL    VE G+ QVVAG NY LVL V D +    KY A VWEK W  FRQLT FK
Subjt:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK

A0A7N2LTU6 Cystatin domain-containing protein2.6e-1857.3Show/hide
Query:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPL
        GGW+ IKN   +PHV++IGE+AV+EYNKE+ + L F SVE G+ QVVAG NY L+L   +GA   + Y A VWEKVWE FR LT F P+
Subjt:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPL

M0SIZ8 Cystatin domain-containing protein5.7e-1852.08Show/hide
Query:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPLI
        SS    GGW  IK+ N DPHV++I ++AV E+NK+   +L    V  G+ QVV+G NY LVL   DG+    KY A VWEK WEKFRQLT FK L+
Subjt:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPLI

M0SNR0 Cystatin domain-containing protein3.4e-1856.99Show/hide
Query:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK
        SS    GGW  IKN  DDPHVR+I E+AV E+NK+   SL    V  G+ QVVAG NY LVL V D +    KY A VWEK WE FRQLT FK
Subjt:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK

SwissProt top hitse value%identityAlignment
P31726 Cystatin-11.3e-1138.78Show/hide
Query:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPLIFIDEGKSA
        GG +++    +D  ++++  +AV E+N++    L F  +   K QVVAG  Y+L + V DG    + Y A VWEK WE F+QL  FKP   ++EG SA
Subjt:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPLIFIDEGKSA

P37842 Multicystatin3.6e-0934.48Show/hide
Query:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK
        GG   + NPN +   +++  +A+++YNK+    L F    + K+QVVAG+ Y++ L   D A   + Y A +W K WE F+++  FK
Subjt:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK

Q10J94 Cysteine proteinase inhibitor 81.7e-1445.45Show/hide
Query:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKP
        GGW  I +   DPH++++G WAV+ +   +   L F  V SG+QQVV+GMNY LV+   D A     Y A V+E+ W   RQLT FKP
Subjt:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKP

Q10Q46 Cysteine proteinase inhibitor 61.2e-1243.68Show/hide
Query:  GWEEIKNPNDDPHVRDIGEWAVKEYNKEN-GTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLR-KYGADVWEKVWEKFRQLTFF
        GW  IKN  DDPH++++G WA+ E N+ +    L FH V  G+QQVV+GMNY L +    G  ++   YGA V+E+ W   R+L  F
Subjt:  GWEEIKNPNDDPHVRDIGEWAVKEYNKEN-GTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLR-KYGADVWEKVWEKFRQLTFF

Q41916 Cysteine proteinase inhibitor 52.7e-1747.87Show/hide
Query:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKP
        S+  + GGW  I N   DP V +IGE+AV EYNK + + L F +V SG+ QVV+G NY L +  +DG    + Y A VW+K W KFR LT F+P
Subjt:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKP

Arabidopsis top hitse value%identityAlignment
AT2G40880.1 cystatin A4.3e-1031.87Show/hide
Query:  THKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK
        T   GG  +++   +   +  +  +A++E+NK+    L F  +   ++QVVAG  YHL L   +G +  + + A VW K W  F+QL  FK
Subjt:  THKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFK

AT3G12490.1 cystatin B8.2e-0936.36Show/hide
Query:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKP
        GG  ++    +   V  +  +AV E+NK+    L F  V   K+QVVAG  +HL L + + A   + Y A VW K W  F++L  FKP
Subjt:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKP

AT3G12490.2 cystatin B8.2e-0936.36Show/hide
Query:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKP
        GG  ++    +   V  +  +AV E+NK+    L F  V   K+QVVAG  +HL L + + A   + Y A VW K W  F++L  FKP
Subjt:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKP

AT5G12140.1 cystatin-12.8e-0938.2Show/hide
Query:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPL
        GG  +I    +D  V  +  +AV E+NK    +L +  +   K QVVAG  +HL + V DG  N + Y A V EK WE  +QL  F  L
Subjt:  GGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPL

AT5G47550.1 Cystatin/monellin superfamily protein1.9e-1847.87Show/hide
Query:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKP
        S+  + GGW  I N   DP V +IGE+AV EYNK + + L F +V SG+ QVV+G NY L +  +DG    + Y A VW+K W KFR LT F+P
Subjt:  SSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCCACACACAAGTTTGGAGGTTGGGAGGAGATTAAGAATCCAAATGACGACCCACATGTGCGAGATATCGGAGAGTGGGCAGTGAAAGAATATAACAAAGAAAA
TGGGACGTCACTAATATTCCATAGCGTTGAGAGTGGTAAGCAACAGGTGGTGGCTGGAATGAACTACCACCTTGTGTTGTTGGTTGATGATGGTGCAAATAATCTTCGAA
AATATGGGGCTGACGTGTGGGAGAAGGTATGGGAGAAATTTAGGCAGCTCACATTTTTTAAACCTCTCATCTTTATTGATGAGGGAAAAAGCGCTGCTTAA
mRNA sequenceShow/hide mRNA sequence
GCACCCCATTTTCAAAATTTGTCCACACGCCAGTTATCTATTCAAAGTTACGATGAGTTCCACACACAAGTTTGGAGGTTGGGAGGAGATTAAGAATCCAAATGACGACC
CACATGTGCGAGATATCGGAGAGTGGGCAGTGAAAGAATATAACAAAGAAAATGGGACGTCACTAATATTCCATAGCGTTGAGAGTGGTAAGCAACAGGTGGTGGCTGGA
ATGAACTACCACCTTGTGTTGTTGGTTGATGATGGTGCAAATAATCTTCGAAAATATGGGGCTGACGTGTGGGAGAAGGTATGGGAGAAATTTAGGCAGCTCACATTTTT
TAAACCTCTCATCTTTATTGATGAGGGAAAAAGCGCTGCTTAAGTGCTTTTGTACTTTAAGTGCCTATGTTGCACTACCCTTTATCATCTATCTATCCATAATAAAACAA
TAAAATAACTAAAACCTATTATCATTATTTGTA
Protein sequenceShow/hide protein sequence
MSSTHKFGGWEEIKNPNDDPHVRDIGEWAVKEYNKENGTSLIFHSVESGKQQVVAGMNYHLVLLVDDGANNLRKYGADVWEKVWEKFRQLTFFKPLIFIDEGKSAA