; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017901 (gene) of Snake gourd v1 genome

Gene IDTan0017901
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCystatin domain-containing protein
Genome locationLG03:64263502..64263977
RNA-Seq ExpressionTan0017901
SyntenyTan0017901
Gene Ontology termsGO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647756.1 hypothetical protein Csa_003034 [Cucumis sativus]8.6e-1552.69Show/hide
Query:  MSSPIIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISF
        MSS  I GG+   KDPN DPHV+DI EWAV EYNK  G  LT  SI   E Q+VAG+ +R +L   DE      +E  VWEK WE+ R LI+F
Subjt:  MSSPIIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISF

NP_001267677.1 cysteine proteinase inhibitor 5-like [Cucumis sativus]8.6e-1552.69Show/hide
Query:  MSSPIIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISF
        MSS  I GG+   KDPN DPHV+DI EWAV EYNK  G  LT  SI   E Q+VAG+ +R +L   DE      +E  VWEK WE+ R LI+F
Subjt:  MSSPIIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISF

RRT32119.1 hypothetical protein B296_00051057 [Ensete ventricosum]1.0e-1549.44Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI
        GGW  IK+ N DPHVR+I E+A+ E+NK+   +L    +  GE Q+VAG  YR +L V D  G+L  ++  VWEKPWE  R L SFKL+
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI

THU72579.1 hypothetical protein C4D60_Mb04t13660 [Musa balbisiana]1.9e-1448.31Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI
        GGW  IK+  DDPHVR+I E+AV E+NK+   +L    ++ GE Q+VAG  YR +L V D  G    ++  VWEKPW + R L SFKL+
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI

XP_009396391.1 PREDICTED: cysteine proteinase inhibitor 1-like [Musa acuminata subsp. malaccensis]8.6e-1549.44Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI
        GGW  IK+  DDPHVR+I E+AV E+NK+   +L    +  GE Q+VAG  YR +L V D  G    ++  VWEKPWE+ R L SFKL+
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI

TrEMBL top hitse value%identityAlignment
A0A426WY08 Cystatin domain-containing protein4.9e-1649.44Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI
        GGW  IK+ N DPHVR+I E+A+ E+NK+   +L    +  GE Q+VAG  YR +L V D  G+L  ++  VWEKPWE  R L SFKL+
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI

A0A4S8KBV9 Cystatin domain-containing protein9.2e-1548.31Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI
        GGW  IK+  DDPHVR+I E+AV E+NK+   +L    ++ GE Q+VAG  YR +L V D  G    ++  VWEKPW + R L SFKL+
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI

M0SNR0 Cystatin domain-containing protein4.1e-1549.44Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI
        GGW  IK+  DDPHVR+I E+AV E+NK+   +L    +  GE Q+VAG  YR +L V D  G    ++  VWEKPWE+ R L SFKL+
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI

O80389 Cystein proteinase inhibitor4.1e-1552.69Show/hide
Query:  MSSPIIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISF
        MSS  I GG+   KDPN DPHV+DI EWAV EYNK  G  LT  SI   E Q+VAG+ +R +L   DE      +E  VWEK WE+ R LI+F
Subjt:  MSSPIIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISF

Q9FQ13 Cystatin-like protein1.2e-1449.43Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK
        GGW+ I+DP  + HV +I ++AV EYNK++  AL F S++ GE Q+V+G  YR +L+V D     + FE  VWEKPWEH + L SFK
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK

SwissProt top hitse value%identityAlignment
P37842 Multicystatin1.2e-0830.34Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI
        GG   + +PN +   +++  +A+++YNK+    L F    + ++Q+VAGI Y   L   D+ G  +I++ ++W K WE  + ++ FKL+
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI

P86472 Cysteine proteinase inhibitor 11.3e-1037.23Show/hide
Query:  IIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEE--GNLRIFEVEVWEKPWEHSRMLISFKLI
        +  GGW  I++ N    V+D+ ++AV E+NK+    L + S+  G  Q+VAG  YR ++   D    GN   +E  VW+KPW H R L SF+ +
Subjt:  IIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEE--GNLRIFEVEVWEKPWEHSRMLISFKLI

Q10J94 Cysteine proteinase inhibitor 81.7e-1035.63Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK
        GGW  I D   DPH++++  WAV+ +   +   L F  + SGE+Q+V+G+ YR ++   D  G    +   V+E+ W ++R L SFK
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK

Q41916 Cysteine proteinase inhibitor 51.6e-1141.38Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK
        GGW  I +   DP V +I E+AV EYNK +   L F ++ SGE Q+V+G  YR  +   D +G  + +   VW+KPW   R L SF+
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK

Q6TPK4 Cysteine proteinase inhibitor 11.7e-1037.23Show/hide
Query:  IIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEE--GNLRIFEVEVWEKPWEHSRMLISFKLI
        +  GGW  I+  N    V+D+ ++AV E+NK+    L + S+  G  Q+VAG  YR ++   D    GN   +E  VW+KPW H R L SF+ +
Subjt:  IIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEE--GNLRIFEVEVWEKPWEHSRMLISFKLI

Arabidopsis top hitse value%identityAlignment
AT2G31980.1 PHYTOCYSTATIN 21.7e-0523.58Show/hide
Query:  IIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKEN-----------------GRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHS
        ++ GG   + +   +  ++ +  + V+++N++                     L F  + S +KQ+VAG+KY   + V    G+ R+F+  V  +PW HS
Subjt:  IIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKEN-----------------GRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHS

Query:  RMLISF
        + L+ F
Subjt:  RMLISF

AT3G12490.1 cystatin B7.0e-0729.89Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK
        GG  ++    +   V  +  +AV E+NK+    L F  +   ++Q+VAG  +   L +  E G  +++E +VW KPW + + L  FK
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK

AT3G12490.2 cystatin B7.0e-0729.89Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK
        GG  ++    +   V  +  +AV E+NK+    L F  +   ++Q+VAG  +   L +  E G  +++E +VW KPW + + L  FK
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK

AT5G05110.1 Cystatin/monellin family protein2.2e-0836.67Show/hide
Query:  GGWEEIK-DPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI
        GG+ + K D N    + DI  +AV+E+N+     L    +    +Q+VAG  YR  L V  E G  +I+E +VW KPW + + L  FK I
Subjt:  GGWEEIK-DPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLI

AT5G47550.1 Cystatin/monellin superfamily protein1.1e-1241.38Show/hide
Query:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK
        GGW  I +   DP V +I E+AV EYNK +   L F ++ SGE Q+V+G  YR  +   D +G  + +   VW+KPW   R L SF+
Subjt:  GGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCCCCAATTATAAAAGGAGGTTGGGAGGAGATTAAGGATCCAAATGATGACCCACATGTGCGAGATATCGTAGAGTGGGCAGTGAAAGAATATAACAAAGAAAA
TGGGAGAGCACTAACGTTCCATAGCATTAAGAGTGGTGAGAAACAGATGGTGGCTGGAATAAAGTATCGCTTTTTGCTGTTGGTTTATGATGAAGAAGGTAATCTTCGAA
TATTTGAGGTTGAGGTGTGGGAGAAGCCATGGGAGCACTCTAGGATGCTCATATCTTTTAAGCTCATATCTTAA
mRNA sequenceShow/hide mRNA sequence
CCCCATTTTCAAAGTTTGTCCACACGCTAGTTATCTATTAAAAGTTACGATGAGTTCCCCAATTATAAAAGGAGGTTGGGAGGAGATTAAGGATCCAAATGATGACCCAC
ATGTGCGAGATATCGTAGAGTGGGCAGTGAAAGAATATAACAAAGAAAATGGGAGAGCACTAACGTTCCATAGCATTAAGAGTGGTGAGAAACAGATGGTGGCTGGAATA
AAGTATCGCTTTTTGCTGTTGGTTTATGATGAAGAAGGTAATCTTCGAATATTTGAGGTTGAGGTGTGGGAGAAGCCATGGGAGCACTCTAGGATGCTCATATCTTTTAA
GCTCATATCTTAAAAAACCTCTTCTTTCTTGATGAGGGAAAAAGCACTGTTGTTGTATTTTAAGTGCCAATGTTACACTCCCCTTTATCATCTATCTATCCATAATGAAA
CAATAAAATGACTATAACCTATTATTGTTATTTGTA
Protein sequenceShow/hide protein sequence
MSSPIIKGGWEEIKDPNDDPHVRDIVEWAVKEYNKENGRALTFHSIKSGEKQMVAGIKYRFLLLVYDEEGNLRIFEVEVWEKPWEHSRMLISFKLIS