; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001233 (gene) of Snake gourd v1 genome

Gene IDTan0001233
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCystatin domain-containing protein
Genome locationLG05:72613578..72614934
RNA-Seq ExpressionTan0001233
SyntenyTan0001233
Gene Ontology termsGO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
THG04121.1 hypothetical protein TEA_024172 [Camellia sinensis var. sinensis]1.6e-1742.98Show/hide
Query:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS
        ++MT+++YKEY   + +S+GFDV  +P +F  G+  PI NL+Y    L +   LA++ YNE+  TK EFV+LVK N     GF Y++TF  K     D  
Subjt:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS

Query:  PTTFQARVFAGIEKTEVDFCR
          TFQA V+ GI++ EV  CR
Subjt:  PTTFQARVFAGIEKTEVDFCR

THG21948.1 hypothetical protein TEA_004640 [Camellia sinensis var. sinensis]1.6e-1742.98Show/hide
Query:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS
        ++MT+++YKEY   + +S+GFDV  +P +F  G+  PI NL+Y    L +   LA++ YNE+  TK EFV+LVK N     GF Y++TF  K     D  
Subjt:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS

Query:  PTTFQARVFAGIEKTEVDFCR
          TFQA V+ GI++ EV  CR
Subjt:  PTTFQARVFAGIEKTEVDFCR

XP_022137488.1 uncharacterized protein LOC111008920 isoform X1 [Momordica charantia]1.6e-1741.46Show/hide
Query:  DMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK---SPTDDS
        +M EE+   Y++A+ +SEGFDVP +P T+     +P+  ++   E++    G AI+ YN +NG  FEFV+++K NS  A G L+F+TF+ K   +P D  
Subjt:  DMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK---SPTDDS

Query:  PTTFQARVFAGI--EKTEVDFCR
         TT QARV AGI  +  +V  CR
Subjt:  PTTFQARVFAGI--EKTEVDFCR

XP_028053036.1 uncharacterized protein LOC114257478 [Camellia sinensis]1.6e-1742.98Show/hide
Query:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS
        ++MT+++YKEY   + +S+GFDV  +P +F  G+  PI NL+Y    L +   LA++ YNE+  TK EFV+LVK N     GF Y++TF  K     D  
Subjt:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS

Query:  PTTFQARVFAGIEKTEVDFCR
          TFQA V+ GI++ EV  CR
Subjt:  PTTFQARVFAGIEKTEVDFCR

XP_028091213.1 uncharacterized protein LOC114291567 [Camellia sinensis]1.6e-1742.98Show/hide
Query:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS
        ++MT+++YKEY   + +S+GFDV  +P +F  G+  PI NL+Y    L +   LA++ YNE+  TK EFV+LVK N     GF Y++TF  K     D  
Subjt:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS

Query:  PTTFQARVFAGIEKTEVDFCR
          TFQA V+ GI++ EV  CR
Subjt:  PTTFQARVFAGIEKTEVDFCR

TrEMBL top hitse value%identityAlignment
A0A4S4DMB7 Cystatin domain-containing protein7.9e-1842.98Show/hide
Query:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS
        ++MT+++YKEY   + +S+GFDV  +P +F  G+  PI NL+Y    L +   LA++ YNE+  TK EFV+LVK N     GF Y++TF  K     D  
Subjt:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS

Query:  PTTFQARVFAGIEKTEVDFCR
          TFQA V+ GI++ EV  CR
Subjt:  PTTFQARVFAGIEKTEVDFCR

A0A4S4EHW2 Cystatin domain-containing protein5.7e-1640.52Show/hide
Query:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS
        ++ TE +Y+E+   + +S+GFDV  +P +F  G T P+ N +   + L  +  LA++ YNE+  T  EFV++VKVN  V AGFLY++TF  +  +  D  
Subjt:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS

Query:  PTTFQARVFAGIEKTE
         T FQA V+ GI  TE
Subjt:  PTTFQARVFAGIEKTE

A0A4S4EY14 Cystatin domain-containing protein7.9e-1842.98Show/hide
Query:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS
        ++MT+++YKEY   + +S+GFDV  +P +F  G+  PI NL+Y    L +   LA++ YNE+  TK EFV+LVK N     GF Y++TF  K     D  
Subjt:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK--SPTDDS

Query:  PTTFQARVFAGIEKTEVDFCR
          TFQA V+ GI++ EV  CR
Subjt:  PTTFQARVFAGIEKTEVDFCR

A0A6A4L4Y3 Cystatin domain-containing protein (Fragment)4.4e-1641.46Show/hide
Query:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQN--LDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAKSPTDDS
        ++MT+++++EY   + +S+GFDV ++P +F  G+  P+ +  LD   E L +Y  LA++ YN+    K EFV++VK N    AGFLYF+TF  K P D  
Subjt:  YDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQN--LDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAKSPTDDS

Query:  PTT--FQARVFAGIEKTEVDFCR
          T  FQA V+ GI KT +  CR
Subjt:  PTT--FQARVFAGIEKTEVDFCR

A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X17.9e-1841.46Show/hide
Query:  DMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK---SPTDDS
        +M EE+   Y++A+ +SEGFDVP +P T+     +P+  ++   E++    G AI+ YN +NG  FEFV+++K NS  A G L+F+TF+ K   +P D  
Subjt:  DMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK---SPTDDS

Query:  PTTFQARVFAGI--EKTEVDFCR
         TT QARV AGI  +  +V  CR
Subjt:  PTTFQARVFAGI--EKTEVDFCR

SwissProt top hitse value%identityAlignment
Q9LH42 UPF0725 protein At3g195208.2e-0430.17Show/hide
Query:  DHYDMTEEQYKEYFEALAKSEGFDVPYYPNTFVP-GRTAPIQNLDYQREKL-------NNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFK
        D  D  + Q K Y+  +A+S+GFD+   P   VP G  A + ++D +  +          Y  + +  YN   GT FE +EL+K N  +     Y++T  
Subjt:  DHYDMTEEQYKEYFEALAKSEGFDVPYYPNTFVP-GRTAPIQNLDYQREKL-------NNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFK

Query:  AKSPTDDSPTTFQARV
        A   +     TFQ RV
Subjt:  AKSPTDDSPTTFQARV

Q9SV54 UPF0725 protein At4g289207.4e-0527.35Show/hide
Query:  DMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYN-EQNGTKFEFVELVKVNS---GVAAGFLYFMTFKAKSPTDD
        +M  E+ K Y   + +S+GFDV Y+   +   +  P+++ +     +  +G L +  YN    GT  + + + K N+   GV++G  Y++T +A    ++
Subjt:  DMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYN-EQNGTKFEFVELVKVNS---GVAAGFLYFMTFKAKSPTDD

Query:  SPTTFQARVFAGIEKTE
        SP TFQ  V    + +E
Subjt:  SPTTFQARVFAGIEKTE

Arabidopsis top hitse value%identityAlignment
AT1G63190.1 Cystatin/monellin superfamily protein1.2e-0535.53Show/hide
Query:  REKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAKSPTDDSPTTFQARV-FAGIEKTEVDFCR
        R+ L      A++ YN+++ T+FEFV++VK N       ++ +TF+   P D+    FQ RV  A    TE  FCR
Subjt:  REKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAKSPTDDSPTTFQARV-FAGIEKTEVDFCR

AT1G63200.1 Cystatin/monellin superfamily protein2.1e-0729.6Show/hide
Query:  EEQYKEYFEALAKSEGFDVP---------YYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAKSPT
        EE+     + +  S+GFD+          Y+P      + A     D   + L +    A++ +N ++GT++EFV++VK N   A   ++ +TF+ K P 
Subjt:  EEQYKEYFEALAKSEGFDVP---------YYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAKSPT

Query:  DDSPTTFQARVFAGIE-KTEVDFCR
        DD    FQ RV  G    T   FCR
Subjt:  DDSPTTFQARVFAGIE-KTEVDFCR

AT1G63206.1 Cystatin/monellin superfamily protein2.0e-0529.41Show/hide
Query:  KSEGFDVPYYP-NTFVPGRTAPIQNLDYQR---------EKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAA--GFLYFMTFKAKSPTDDSPTTFQA
        KS+GFD+ +    +    R   + +LDY+          E L      +++ YN +  T++EF+++VK N+ +    G +YF+TF+ + P D+    FQ 
Subjt:  KSEGFDVPYYP-NTFVPGRTAPIQNLDYQR---------EKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAA--GFLYFMTFKAKSPTDDSPTTFQA

Query:  RV
        RV
Subjt:  RV

AT2G37435.1 Cystatin/monellin superfamily protein4.8e-0731.53Show/hide
Query:  EEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQ--------REKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK-SPT
        EE+Y    + +  S+GFD+ +     V     P+   D +        RE ++     ++E +NE + TK+EFV  +K N  V+AG +YF+TF+ K    
Subjt:  EEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQ--------REKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFKAK-SPT

Query:  DDSPTTFQARV
        DD    FQA++
Subjt:  DDSPTTFQARV

AT4G28920.1 Protein of unknown function (DUF626)5.3e-0627.35Show/hide
Query:  DMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYN-EQNGTKFEFVELVKVNS---GVAAGFLYFMTFKAKSPTDD
        +M  E+ K Y   + +S+GFDV Y+   +   +  P+++ +     +  +G L +  YN    GT  + + + K N+   GV++G  Y++T +A    ++
Subjt:  DMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYN-EQNGTKFEFVELVKVNS---GVAAGFLYFMTFKAKSPTDD

Query:  SPTTFQARVFAGIEKTE
        SP TFQ  V    + +E
Subjt:  SPTTFQARVFAGIEKTE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATTTTTGATGCTACACCTGGTGCTCATCGTGGCCGGAATGGTACTGTTGATCATTATGATATGACCGAAGAGCAGTACAAAGAATACTTTGAAGCCTTAGCAAA
GAGTGAGGGTTTTGATGTTCCATACTATCCTAATACTTTTGTACCTGGTAGAACTGCACCTATACAGAATTTAGATTACCAGAGGGAAAAGCTCAATAACTATGGAGGCT
TAGCCATTGAACTGTACAATGAGCAAAATGGAACAAAGTTTGAGTTTGTAGAACTTGTTAAGGTGAATAGTGGAGTTGCAGCTGGTTTCCTTTATTTTATGACCTTCAAG
GCAAAGTCTCCTACAGACGACTCCCCTACAACGTTTCAAGCTCGAGTGTTTGCTGGTATCGAAAAAACAGAGGTGGATTTTTGCAGACAAGCTCTGTTATCTGGTTCTGC
TTGA
mRNA sequenceShow/hide mRNA sequence
GTTGGTTTTGTGATCCCATTCATCAAATCATTACCCTTGTTCTTCACTGGGGTCTTCGTCTTCCAAGCTTTGCCGGCAGCCACCGACCCACCCACCGACGCCGTCTTGCC
GATACTGGGCAAATCACTATGGATATTTTTGATGCTACACCTGGTGCTCATCGTGGCCGGAATGGTACTGTTGATCATTATGATATGACCGAAGAGCAGTACAAAGAATA
CTTTGAAGCCTTAGCAAAGAGTGAGGGTTTTGATGTTCCATACTATCCTAATACTTTTGTACCTGGTAGAACTGCACCTATACAGAATTTAGATTACCAGAGGGAAAAGC
TCAATAACTATGGAGGCTTAGCCATTGAACTGTACAATGAGCAAAATGGAACAAAGTTTGAGTTTGTAGAACTTGTTAAGGTGAATAGTGGAGTTGCAGCTGGTTTCCTT
TATTTTATGACCTTCAAGGCAAAGTCTCCTACAGACGACTCCCCTACAACGTTTCAAGCTCGAGTGTTTGCTGGTATCGAAAAAACAGAGGTGGATTTTTGCAGACAAGC
TCTGTTATCTGGTTCTGCTTGATTGTTCTTCCCATTCATCCTATATATTAAATTAAATATTATCTACCTACCAATCTTTACTTTACTTTACTTTTTGTAATGAAGTCCCC
TGTTTGATTACTGTCTACCTAAGACTTAATTTATTATTTTTTATGTCTCTTATGAACTTAAATGTTGTTGTAGCTAGACTTGTAGTTGTGTCTATTATATGAATTTAAAT
AATGTTGTTCTA
Protein sequenceShow/hide protein sequence
MDIFDATPGAHRGRNGTVDHYDMTEEQYKEYFEALAKSEGFDVPYYPNTFVPGRTAPIQNLDYQREKLNNYGGLAIELYNEQNGTKFEFVELVKVNSGVAAGFLYFMTFK
AKSPTDDSPTTFQARVFAGIEKTEVDFCRQALLSGSA