; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009491 (gene) of Snake gourd v1 genome

Gene IDTan0009491
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCystatin domain-containing protein
Genome locationLG05:85233282..85235147
RNA-Seq ExpressionTan0009491
SyntenyTan0009491
Gene Ontology termsGO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain
IPR006525 - Cystatin-related, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137488.1 uncharacterized protein LOC111008920 isoform X1 [Momordica charantia]2.5e-3152.94Show/hide
Query:  SHGCFEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLI--VPMFAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEV
        S G ++D  REM +EE+N YY+A+ ESEGFDVP+F  +YA  +I  +P+      EV++ A +AIKHYN++NG +FEFV ++ AN Q   G L+F+TF+V
Subjt:  SHGCFEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLI--VPMFAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEV

Query:  KPIGTPPVFPTTTFQARVLLGI-PDTVEVELCRPKP
        K  G PP  PTTT QARVL GI PD  +V+LCRP+P
Subjt:  KPIGTPPVFPTTTFQARVLLGI-PDTVEVELCRPKP

XP_022942206.1 uncharacterized protein LOC111447330 [Cucurbita moschata]2.8e-3056.3Show/hide
Query:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP
        F+D Y  +T +E   YY AV ES+GFDVP+F + YA  LI PM          EV+  A EAIKHYN++NGTNFE V IV AN  G CG +Y+ITF VKP
Subjt:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP

Query:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP
        IGTP  FP+ TFQA+V   IP  D ++VELCRPKP
Subjt:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP

XP_022975599.1 uncharacterized protein LOC111475178 [Cucurbita maxima]6.9e-2956.3Show/hide
Query:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP
        F+D Y  +T +E   Y+ AV ES+GFDVP F   YA GLI PM    F     EV+  A EAIKHYN++NGTNFE V IV AN  G CG +Y+ITF VKP
Subjt:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP

Query:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP
        IGT   F + TFQA+V   IP  D +EVELCRPKP
Subjt:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP

XP_022975633.1 uncharacterized protein LOC111475320 [Cucurbita maxima]9.6e-3157.78Show/hide
Query:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP
        F+D Y  +T +E   Y+ AV ES+GFDVP F   YA GLI PM    F     EV+  A EAIKHYNS+NGTNFE V IV AN +G CG +Y+ITF VKP
Subjt:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP

Query:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP
        IGT   FP+ TFQA+V   IP  D +EVELCRPKP
Subjt:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP

XP_023548353.1 UPF0725 protein At1g02770-like [Cucurbita pepo subsp. pepo]8.1e-3057.04Show/hide
Query:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM-FAPFPS---EVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP
        F+D Y  +T +E   YY AV ES+GFDVP+F   YA  LI PM    FPS   EV+    EAIKHYN++NGTNFE V IV AN  G CG +Y+ITF VKP
Subjt:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM-FAPFPS---EVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP

Query:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP
        IGT   FP  TFQA+V   IP  D ++VELCRPKP
Subjt:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP

TrEMBL top hitse value%identityAlignment
A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X11.2e-3152.94Show/hide
Query:  SHGCFEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLI--VPMFAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEV
        S G ++D  REM +EE+N YY+A+ ESEGFDVP+F  +YA  +I  +P+      EV++ A +AIKHYN++NG +FEFV ++ AN Q   G L+F+TF+V
Subjt:  SHGCFEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLI--VPMFAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEV

Query:  KPIGTPPVFPTTTFQARVLLGI-PDTVEVELCRPKP
        K  G PP  PTTT QARVL GI PD  +V+LCRP+P
Subjt:  KPIGTPPVFPTTTFQARVLLGI-PDTVEVELCRPKP

A0A6J1FN74 uncharacterized protein LOC1114473301.3e-3056.3Show/hide
Query:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP
        F+D Y  +T +E   YY AV ES+GFDVP+F + YA  LI PM          EV+  A EAIKHYN++NGTNFE V IV AN  G CG +Y+ITF VKP
Subjt:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP

Query:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP
        IGTP  FP+ TFQA+V   IP  D ++VELCRPKP
Subjt:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP

A0A6J1FVU0 uncharacterized protein LOC1114473293.4e-2650.75Show/hide
Query:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPMFAPFPS----EVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP
        ++D Y  +T +E   Y+ AV ESEGFDVP+F   ++   I+P+     S    EV+  A +AIKHYN +NGTNFE V IV AN  G CG +Y+ITF VKP
Subjt:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPMFAPFPS----EVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP

Query:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPK
        IGT   FP  TFQA+V   IP  D ++V+LCRPK
Subjt:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPK

A0A6J1IJT3 uncharacterized protein LOC1114753204.6e-3157.78Show/hide
Query:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP
        F+D Y  +T +E   Y+ AV ES+GFDVP F   YA GLI PM    F     EV+  A EAIKHYNS+NGTNFE V IV AN +G CG +Y+ITF VKP
Subjt:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP

Query:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP
        IGT   FP+ TFQA+V   IP  D +EVELCRPKP
Subjt:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP

A0A6J1IL21 uncharacterized protein LOC1114751783.3e-2956.3Show/hide
Query:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP
        F+D Y  +T +E   Y+ AV ES+GFDVP F   YA GLI PM    F     EV+  A EAIKHYN++NGTNFE V IV AN  G CG +Y+ITF VKP
Subjt:  FEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM----FAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKP

Query:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP
        IGT   F + TFQA+V   IP  D +EVELCRPKP
Subjt:  IGTPPVFPTTTFQARVLLGIP--DTVEVELCRPKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G63190.1 Cystatin/monellin superfamily protein2.4e-0830Show/hide
Query:  EDDYREMTDEEINLYYEAVWESEGFDVP--------NFHDSYAIGLIVPMFAPFPSE----VKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYF
        ED+ +   ++E+ L  + +  S+GFD+         N+H +Y   L    F   P      +++ + +A+  YN ++ T FEFV +V AN    C +++ 
Subjt:  EDDYREMTDEEINLYYEAVWESEGFDVP--------NFHDSYAIGLIVPMFAPFPSE----VKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYF

Query:  ITFEVKPIGTPPVFPTTTFQARVLLGIPDTVEVELCRPKP
        ITFEV     P       FQ RV        E   CRPKP
Subjt:  ITFEVKPIGTPPVFPTTTFQARVLLGIPDTVEVELCRPKP

AT1G63200.1 Cystatin/monellin superfamily protein3.1e-1133.09Show/hide
Query:  EDDYREMT-DEEINLYYEAVWESEGFDVP--------NFHDS--YAIGLIVPMFAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFI
        E+D  E+T +EE+ L  + V  S+GFD+         N+H +  ++       F      +K  A+EA+  +N ++GT +EFV +V AN    C +++ I
Subjt:  EDDYREMT-DEEINLYYEAVWESEGFDVP--------NFHDS--YAIGLIVPMFAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFI

Query:  TFEVKPIGTPPVFPTTTFQARVLLGIPDTVEVELCRPKP
        TF+VK    P       FQ RV  G   T     CRPKP
Subjt:  TFEVKPIGTPPVFPTTTFQARVLLGIPDTVEVELCRPKP

AT1G63205.1 Cystatin/monellin superfamily protein2.5e-1334.51Show/hide
Query:  EDDYREMTDEEINLYYEAVWESEGFDV--PNFHDSYAIGLIVPMFAPFPSE---------VKKFAEEAIKHYNSQNGTN-FEFVSIVMANIQGVCGLLYF
        +D+ +   DEEI +  E + +SEGFD+    F   +   L+ P    F  +         +K+F++E++K YN + GTN +EF  +V AN  G CG ++ 
Subjt:  EDDYREMTDEEINLYYEAVWESEGFDV--PNFHDSYAIGLIVPMFAPFPSE---------VKKFAEEAIKHYNSQNGTN-FEFVSIVMANIQGVCGLLYF

Query:  ITFEVKPIGTPPVFPTTTFQARVLLGIPDTVEVELCRPKPKP
        ITF+V     P      TFQAR+        E   CRPKP P
Subjt:  ITFEVKPIGTPPVFPTTTFQARVLLGIPDTVEVELCRPKPKP

AT1G63206.1 Cystatin/monellin superfamily protein2.3e-0639.02Show/hide
Query:  VKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVC---GLLYFITFEVKPIGTPPVFPTTTFQARVLLGIPDTVEVELCRPKP
        +K+ ++ ++K+YNS+  T +EF+ +V AN   +C   G +YFITFEV+    P       FQ RV      T +  LCRPKP
Subjt:  VKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVC---GLLYFITFEVKPIGTPPVFPTTTFQARVLLGIPDTVEVELCRPKP

AT2G37435.1 Cystatin/monellin superfamily protein1.5e-0526.67Show/hide
Query:  EDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM------FAPFPSEVKKFAE----EAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFIT
        +++ +   +EE  L  + V +S+GFD+ +F     +    P+       A  P   ++F +    ++++H+N  + T +EFV  + AN     G++YFIT
Subjt:  EDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPM------FAPFPSEVKKFAE----EAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFIT

Query:  FEVKPIGTPPVFPTTTFQARVLL--GIPDTVEVEL
        FE K +       +  FQA++    G P+ +  EL
Subjt:  FEVKPIGTPPVFPTTTFQARVLL--GIPDTVEVEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCTTCACCCAAATCCCACGGATGTTTTGAAGACGATTACCGCGAGATGACTGACGAAGAGATAAATCTATACTATGAAGCTGTATGGGAAAGCGAGGGTTTTGA
TGTCCCCAATTTTCATGATTCCTATGCCATTGGTCTTATTGTGCCCATGTTTGCCCCGTTCCCATCTGAGGTTAAAAAATTCGCAGAGGAAGCTATAAAGCATTACAACA
GCCAAAATGGAACAAATTTTGAGTTTGTGAGTATTGTAATGGCCAATATCCAAGGTGTCTGTGGTCTCTTATACTTCATCACCTTCGAGGTCAAACCAATTGGAACACCG
CCTGTTTTTCCAACAACCACCTTCCAAGCTCGGGTTTTGCTCGGTATTCCTGATACTGTAGAGGTTGAACTTTGCCGACCAAAACCAAAACCTCTCTAA
mRNA sequenceShow/hide mRNA sequence
CAAAATTAAAACTGGAGAAATTTTAAAAGGAAGTTCTGAGGCGTTCGGAGAAGCGTGGTGTGCGAAGAAACTTGCATTAGGGTTTTCTGTAATTTCGCGAAGAGAAGCAA
TATCGATTGATTGGTTTTGATGGCATCTTCACCCAAATCCCACGGATGTTTTGAAGACGATTACCGCGAGATGACTGACGAAGAGATAAATCTATACTATGAAGCTGTAT
GGGAAAGCGAGGGTTTTGATGTCCCCAATTTTCATGATTCCTATGCCATTGGTCTTATTGTGCCCATGTTTGCCCCGTTCCCATCTGAGGTTAAAAAATTCGCAGAGGAA
GCTATAAAGCATTACAACAGCCAAAATGGAACAAATTTTGAGTTTGTGAGTATTGTAATGGCCAATATCCAAGGTGTCTGTGGTCTCTTATACTTCATCACCTTCGAGGT
CAAACCAATTGGAACACCGCCTGTTTTTCCAACAACCACCTTCCAAGCTCGGGTTTTGCTCGGTATTCCTGATACTGTAGAGGTTGAACTTTGCCGACCAAAACCAAAAC
CTCTCTAAGTGAGCACTAGCAACAAATGCTACCACATCTCTATGAACTTATTTTTACTAGTTGATGTTTGGACTTATTTTTACTAGTTGATGTTTCTACACTTTTATATG
GAACCATGGTTCAAATCCTAATCAATAATGCTATCTCACCAAATTTCTAAATTTTTG
Protein sequenceShow/hide protein sequence
MASSPKSHGCFEDDYREMTDEEINLYYEAVWESEGFDVPNFHDSYAIGLIVPMFAPFPSEVKKFAEEAIKHYNSQNGTNFEFVSIVMANIQGVCGLLYFITFEVKPIGTP
PVFPTTTFQARVLLGIPDTVEVELCRPKPKPL