CuGenDBv2

Tan0002228 (gene) of Snake gourd v1 genome

Gene ID: Tan0002228
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Cystatin domain-containing protein
Genome location: LG05:72596478..72597136
RNA-Seq Expression: Tan0002228
Synteny: Tan0002228
Gene Ontology terms: GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domains: IPR000010 - Cystatin domain
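
The genome location string can be split into a linkage group and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'SEQ:START..END' string into (seq_id, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)


def span_length(location):
    """Length of the span, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(parse_location("LG05:72596478..72597136"))
print(span_length("LG05:72596478..72597136"))
```

Under that assumption the gene spans 659 bp on LG05.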


Homology
GenBank top hits (e value, % identity, alignment)
KAF5932436.1 hypothetical protein HYC85_028607 [Camellia sinensis] (e value: 3.7e-15; % identity: 37.3)
Query:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY
        N+ D+  ++D     ++ T ++++EY   + +S+GFDV Y+P +   G TAP+ NLD   ++L  F  LA+KQYN +   N E V+ +K N  + AG  Y
Subjt:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY

Query:  YMTFKAKSSDL----TKTFQARVFAG
        Y+TF  K +D     T  FQA V+ G
Subjt:  YMTFKAKSSDL----TKTFQARVFAG

THG21948.1 hypothetical protein TEA_004640 [Camellia sinensis var. sinensis] (e value: 2.8e-15; % identity: 34.97)
Query:  NFSDEEIIDDTDIDQ---NDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAG
        N+  ++ I+  D+D+   ++MT ++Y+EY   + +S+GFDV  +P +   G   PI+NL+     L     LA+K+YN++ +T  E V+ +K N     G
Subjt:  NFSDEEIIDDTDIDQ---NDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAG

Query:  LFYYMTFKAKSSDLTK----TFQARVFAGIEETELDFCREVLS
          YY+TF  K +D       TFQA V+ GI+E E+  CR  +S
Subjt:  LFYYMTFKAKSSDLTK----TFQARVFAGIEETELDFCREVLS

XP_028053036.1 uncharacterized protein LOC114257478 [Camellia sinensis] (e value: 2.8e-15; % identity: 34.97)
Query:  NFSDEEIIDDTDIDQ---NDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAG
        N+  ++ I+  D+D+   ++MT ++Y+EY   + +S+GFDV  +P +   G   PI+NL+     L     LA+K+YN++ +T  E V+ +K N     G
Subjt:  NFSDEEIIDDTDIDQ---NDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAG

Query:  LFYYMTFKAKSSDLTK----TFQARVFAGIEETELDFCREVLS
          YY+TF  K +D       TFQA V+ GI+E E+  CR  +S
Subjt:  LFYYMTFKAKSSDLTK----TFQARVFAGIEETELDFCREVLS

XP_028071877.1 uncharacterized protein LOC114274199 [Camellia sinensis] (e value: 2.5e-16; % identity: 37.5)
Query:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY
        N+ D ++ DD+D  +++ T  +Y+EY   + +S+GFDV  +P +   G TAP+ N D   ++L  F  LA+K+YN++  TN E V+ +KVN A  AG  Y
Subjt:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY

Query:  YMTFKAKSSDL----TKTFQARVFAGIEETELDFCR
        Y+T   + +      T  FQA V+ GI  TE+   R
Subjt:  YMTFKAKSSDL----TKTFQARVFAGIEETELDFCR

XP_028121389.1 uncharacterized protein LOC114318651 [Camellia sinensis] (e value: 1.3e-15; % identity: 36.36)
Query:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY
        N+ D ++ DD+D   ++ T  +Y+E+   + +S+GFDV  +P +   G T P+ N +   +++  F  LA+K+YN++  TN E V+ +KVN AV AG  Y
Subjt:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY

Query:  YMTFKAKSSDL----TKTFQARVFAGIEETEL
        Y+TF  + +      T  FQA V+ GI  TE+
Subjt:  YMTFKAKSSDL----TKTFQARVFAGIEETEL

TrEMBL top hits (e value, % identity, alignment)
A0A4S4D066 Cystatin domain-containing protein (e value: 6.1e-16; % identity: 36.36)
Query:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY
        N+ D ++ DD+D   ++ T  +Y+E+   + +S+GFDV  +P +   G T P+ N +   +++  F  LA+K+YN++  TN E V+ +KVN AV AG  Y
Subjt:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY

Query:  YMTFKAKSSDL----TKTFQARVFAGIEETEL
        Y+TF  + +      T  FQA V+ GI  TE+
Subjt:  YMTFKAKSSDL----TKTFQARVFAGIEETEL

A0A4S4DMB7 Cystatin domain-containing protein (e value: 3.0e-15; % identity: 35.07)
Query:  IDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFYYMTFKAK
        +D  +   ++MT ++Y+EY   + +S+GFDV  +P +   G   PI+NL+     L     LA+K+YN++ +T  E V+ +K N     G  YY+TF  K
Subjt:  IDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFYYMTFKAK

Query:  SSDLTK----TFQARVFAGIEETELDFCREVLSG
         +D       TFQA V+ GI+E E+  CR  ++G
Subjt:  SSDLTK----TFQARVFAGIEETELDFCREVLSG

A0A4S4EJD4 Cystatin domain-containing protein (e value: 5.2e-15; % identity: 36.09)
Query:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY
        N+ + ++ DD+D   ++ T  +Y+EY   + +S+GFDV  +P +   G T P+ NL+   ++L     LA+K+YN++  TN E V+ +K N AV AG  Y
Subjt:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY

Query:  YMTFKAKSSDL-----TKTFQARVFAGIEETEL
        Y+TF  + +       T  FQA V+ GI  TE+
Subjt:  YMTFKAKSSDL-----TKTFQARVFAGIEETEL

A0A4S4EY14 Cystatin domain-containing protein (e value: 1.4e-15; % identity: 34.97)
Query:  NFSDEEIIDDTDIDQ---NDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAG
        N+  ++ I+  D+D+   ++MT ++Y+EY   + +S+GFDV  +P +   G   PI+NL+     L     LA+K+YN++ +T  E V+ +K N     G
Subjt:  NFSDEEIIDDTDIDQ---NDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAG

Query:  LFYYMTFKAKSSDLTK----TFQARVFAGIEETELDFCREVLS
          YY+TF  K +D       TFQA V+ GI+E E+  CR  +S
Subjt:  LFYYMTFKAKSSDLTK----TFQARVFAGIEETELDFCREVLS

A0A7J7FZM0 Cystatin domain-containing protein (e value: 1.8e-15; % identity: 37.3)
Query:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY
        N+ D+  ++D     ++ T ++++EY   + +S+GFDV Y+P +   G TAP+ NLD   ++L  F  LA+KQYN +   N E V+ +K N  + AG  Y
Subjt:  NFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFY

Query:  YMTFKAKSSDL----TKTFQARVFAG
        Y+TF  K +D     T  FQA V+ G
Subjt:  YMTFKAKSSDL----TKTFQARVFAG

SwissProt top hits (e value, % identity, alignment)
P37842 Multicystatin (e value: 5.2e-04; % identity: 30)
Query:  KEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFYYMTFKAKSSDLTKTFQARV
        K E +     A++ YNQ+N+++ E  + + V   + AG+ YY+TF+A      K ++A++
Subjt:  KEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFYYMTFKAKSSDLTKTFQARV
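
The % identity reported for a hit can be checked against the middle line of its alignment. The sketch below assumes the usual BLAST-style convention, in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap; `percent_identity` is a hypothetical helper name. The length of the aligned query line (gap characters included) is taken as the alignment length.

```python
def percent_identity(query_line, midline):
    """Percent identity from a BLAST-style alignment block.

    Letters on the midline are counted as identical positions; the
    aligned query line (including any '-' gaps) gives the alignment length.
    """
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / len(query_line)


# Query and middle line of the P37842 (Multicystatin) alignment above.
query = "KEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFYYMTFKAKSSDLTKTFQARV"
midline = "K E +     A++ YNQ+N+++ E  + + V   + AG+ YY+TF+A      K ++A++"

print(round(percent_identity(query, midline), 2))
```

For this block the middle line carries 18 letters over 60 aligned positions, which agrees with the listed 30% identity. For hits whose alignment spans several blocks, the counts would need to be summed over all blocks before dividing.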

Arabidopsis top hits (e value, % identity, alignment)
AT2G37435.1 Cystatin/monellin superfamily protein (e value: 1.6e-05; % identity: 28.83)
Query:  EQYREYFAAVAKSEGFDVP---------YYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFYYMTFKAK---S
        E+Y      V  S+GFD+          Y P  +     A  L  +  +E ++     +++ +N+ + T +E V FIK N  V+AG+ Y++TF+ K   +
Subjt:  EQYREYFAAVAKSEGFDVP---------YYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKVNTAVAAGLFYYMTFKAK---S

Query:  SDLTKTFQARV
         D +K FQA++
Subjt:  SDLTKTFQARV

AT5G05040.1 Cystatin/monellin superfamily protein (e value: 9.0e-04; % identity: 27.27)
Query:  DYPNMNTLEINKNFSDEEI-IDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEE--------LNHFGGLAIKQYNQQNE
        + P+   L++ +  SD+E    + D D  +   +  R       KS GFDV +       G  A  L+ D    E        LN    +AI  YN + +
Subjt:  DYPNMNTLEINKNFSDEEI-IDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEE--------LNHFGGLAIKQYNQQNE

Query:  TNFEIVEFIKVNTAVAAGLFYYMTFKA---KSSDLTKTFQARV
        ++ E+V+ ++ N   +A +  Y+TF+A   K  + TK +QA V
Subjt:  TNFEIVEFIKVNTAVAAGLFYYMTFKA---KSSDLTKTFQARV


Sequences
CDS sequence
ATGGCTTCAACATCAAAATTTGATGACTATCCCAATATGAACACTCTAGAAATCAACAAGAATTTCTCTGATGAGGAAATTATTGATGATACTGATATTGATCAGAATGA
TATGACTCTAGAGCAGTATCGAGAATATTTTGCAGCAGTGGCAAAGAGCGAGGGTTTTGATGTTCCATACTATCCTAATACTATTGTACCTGGTATAACTGCACCTATAC
TGAATTTAGATCGCCGCAAGGAAGAGCTTAATCACTTTGGAGGCTTAGCCATTAAACAATACAATCAACAAAATGAAACCAACTTTGAGATTGTCGAATTTATTAAGGTG
AATACTGCTGTTGCGGCTGGTCTTTTTTATTATATGACCTTCAAGGCAAAGTCGAGTGACTTAACAAAGACATTTCAAGCTCGAGTTTTTGCTGGTATCGAAGAAACAGA
GCTGGATTTCTGCAGGGAAGTGTTATCTGGTTCATAG
mRNA sequence
ATGGCTTCAACATCAAAATTTGATGACTATCCCAATATGAACACTCTAGAAATCAACAAGAATTTCTCTGATGAGGAAATTATTGATGATACTGATATTGATCAGAATGA
TATGACTCTAGAGCAGTATCGAGAATATTTTGCAGCAGTGGCAAAGAGCGAGGGTTTTGATGTTCCATACTATCCTAATACTATTGTACCTGGTATAACTGCACCTATAC
TGAATTTAGATCGCCGCAAGGAAGAGCTTAATCACTTTGGAGGCTTAGCCATTAAACAATACAATCAACAAAATGAAACCAACTTTGAGATTGTCGAATTTATTAAGGTG
AATACTGCTGTTGCGGCTGGTCTTTTTTATTATATGACCTTCAAGGCAAAGTCGAGTGACTTAACAAAGACATTTCAAGCTCGAGTTTTTGCTGGTATCGAAGAAACAGA
GCTGGATTTCTGCAGGGAAGTGTTATCTGGTTCATAG
Protein sequence
MASTSKFDDYPNMNTLEINKNFSDEEIIDDTDIDQNDMTLEQYREYFAAVAKSEGFDVPYYPNTIVPGITAPILNLDRRKEELNHFGGLAIKQYNQQNETNFEIVEFIKV
NTAVAAGLFYYMTFKAKSSDLTKTFQARVFAGIEETELDFCREVLSGS
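
The protein sequence can be cross-checked against the CDS by translating it codon by codon. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper name. Only the first 42 nucleotides and the last 36 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not fill a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 42 nt of the CDS listed above.
cds_start = "ATGGCTTCAACATCAAAATTTGATGACTATCCCAATATGAAC"
# Last 36 nt of the CDS listed above (in frame, ending in the TAG stop codon).
cds_end = "CTGGATTTCTGCAGGGAAGTGTTATCTGGTTCATAG"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MASTSKFDDYPNMN and LDFCREVLSGS, matching the start and end of the protein sequence above. Passing the full CDS, with line breaks removed, to the same function would be expected to return the complete protein.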