; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012878 (gene) of Snake gourd v1 genome

Gene IDTan0012878
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPeptidase_S15 domain-containing protein
Genome locationLG09:60405243..60407113
RNA-Seq ExpressionTan0012878
SyntenyTan0012878
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000383 - Xaa-Pro dipeptidyl-peptidase-like domain
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588170.1 hypothetical protein SDJN03_16735, partial [Cucurbita argyrosperma subsp. sororia]7.0e-11393.72Show/hide
Query:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA
        MANCTV+SCKVETSDGVKLHTR+FKPTDEE RENLVVVLVHPYSVLGGCQGLLRGIAGGLAERG+RAVTFDMRGAGKSSGR SLTGFAEIKDVTAVC W 
Subjt:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA

Query:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH
        CE+LSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKA+LQSPKPKLFVMGT+DG TSVKQLQNKLKSAAGR E HLIEGVSH
Subjt:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISSL
        FEMEGPAYDAQMVNLIL FISSL
Subjt:  FEMEGPAYDAQMVNLILHFISSL

KAG7022069.1 hypothetical protein SDJN02_15798 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-11394.17Show/hide
Query:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA
        MANCTV+SCKVETSDGVKLHTR+FKPTDEE RENLVVVLVHPYSVLGGCQGLLRGIAGGLAERG+RAVTFDMRGAGKSSGR SLTGFAEIKDVTAVC W 
Subjt:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA

Query:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH
        CE+LSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKA+LQSPKPKLFVMGT+DGFTSVKQLQNKLKSAAGR E HLIEGVSH
Subjt:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISSL
        FEMEGPAYDAQMVNLIL FISSL
Subjt:  FEMEGPAYDAQMVNLILHFISSL

XP_022933651.1 uncharacterized protein LOC111441007 [Cucurbita moschata]1.1e-11394.62Show/hide
Query:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA
        MANCTV+SCKVETSDGVKLHTRVFKPTDEE RENLVVVLVHPYSVLGGCQGLLRGIAGGLAERG+RAVTFDMRGAGKSSGR SLTGFAEIKDVTAVC W 
Subjt:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA

Query:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH
        CE+LSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKA+LQSPKPKLFVMGT+DGFTSVKQLQNKLKSAAGR E HLIEGVSH
Subjt:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISSL
        FEMEGPAYDAQMVNLIL FISSL
Subjt:  FEMEGPAYDAQMVNLILHFISSL

XP_023530620.1 uncharacterized protein LOC111793117 [Cucurbita pepo subsp. pepo]8.3e-11494.62Show/hide
Query:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA
        MANCTVESCKVETSDGVKLHTRVFKPTDEE RENLVVVLVHPYSVLGGCQGLLRGIAGGLA+RG+RAVTFDMRGAGKSSGR SLTGFAEIKDVTAVC W 
Subjt:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA

Query:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH
        C +LSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKA+LQSPKPKLFVMGT+DGFTSVKQLQNKLKSAAGRVE HLIEGVSH
Subjt:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISSL
        FEMEGPAYDAQMVNLIL FISSL
Subjt:  FEMEGPAYDAQMVNLILHFISSL

XP_038878932.1 uncharacterized protein LOC120071019 [Benincasa hispida]2.0e-11293.27Show/hide
Query:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA
        M+N TVESCKVETSDGVKLHTRVFKP DEEARENL VVLVHPYS+LGGCQGLLRGIA GLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDV AVCKW 
Subjt:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA

Query:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH
        CE LSV RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKA+LQSPKPKLFVMGT+DGFTSVKQLQNKLKSAAGR+E HLIEGVSH
Subjt:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISSL
        FEMEGPAYDAQMVNLI HFISSL
Subjt:  FEMEGPAYDAQMVNLILHFISSL

TrEMBL top hitse value%identityAlignment
A0A0A0M2M4 Peptidase_S15 domain-containing protein3.2e-11192Show/hide
Query:  MANCTVESCKVETSDGVKLHTRVFKPTDEEAR--ENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCK
        M+NCTVESCKVETSDGVKLHTRVFKP DEEA+  ENL VVLVHPYS+LGGCQGLLRGIA GLAERGY+AVTFDMRGAGKSSGRASLTGFAEIKDV AVCK
Subjt:  MANCTVESCKVETSDGVKLHTRVFKPTDEEAR--ENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCK

Query:  WACEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGV
        W CE+LSV+RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGL ASILFGRHHKA+L SPKPKLFVMGT+DGFTSVKQLQNKLKSAAGRVE HLIEGV
Subjt:  WACEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGV

Query:  SHFEMEGPAYDAQMVNLILHFISSL
        SHFEMEGPAYDAQMVNLILHFISSL
Subjt:  SHFEMEGPAYDAQMVNLILHFISSL

A0A5A7TJY6 Abhydrolase_5 domain-containing protein4.6e-11091.11Show/hide
Query:  MANCTVESCKVETSDGVKLHTRVFKPTDEEAR--ENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCK
        M+NCTVESCKVETSDGVKLHTRVFKP DE A+  ENL VVLVHPYS+LGGCQGLLRGIA GLAE+GY+AVTFDMRGAGKSSGRASLTGFAEIKDV AVCK
Subjt:  MANCTVESCKVETSDGVKLHTRVFKPTDEEAR--ENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCK

Query:  WACEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGV
        W CE+LSV+RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGL ASILFGRHHKA+L SPKPKLFVMGT+DGFTSVKQLQNKLKSAAGRVE HLIEGV
Subjt:  WACEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGV

Query:  SHFEMEGPAYDAQMVNLILHFISSL
        SHFEMEGPAYDAQMVNLILHFISSL
Subjt:  SHFEMEGPAYDAQMVNLILHFISSL

A0A6J1DJB7 uncharacterized protein LOC1110214302.2e-11292.38Show/hide
Query:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA
        M+NC VESCKVETSDGVKLH RVFKP DEEARENLVVVLVHPYSVLGGCQGLLRGIA GLAERG+RAVTFDMRGAGKSSG+ASLTGFAEIKDVTAVCKW 
Subjt:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA

Query:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH
        CE+LSV+RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKA+LQSPKPKLFVMGT+DGFTSVKQL+NKL SA GRVE HLIEGVSH
Subjt:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISSL
        FEMEGP+YDAQMVNLILHFISSL
Subjt:  FEMEGPAYDAQMVNLILHFISSL

A0A6J1EZN6 uncharacterized protein LOC1114410075.3e-11494.62Show/hide
Query:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA
        MANCTV+SCKVETSDGVKLHTRVFKPTDEE RENLVVVLVHPYSVLGGCQGLLRGIAGGLAERG+RAVTFDMRGAGKSSGR SLTGFAEIKDVTAVC W 
Subjt:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA

Query:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH
        CE+LSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKA+LQSPKPKLFVMGT+DGFTSVKQLQNKLKSAAGR E HLIEGVSH
Subjt:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISSL
        FEMEGPAYDAQMVNLIL FISSL
Subjt:  FEMEGPAYDAQMVNLILHFISSL

A0A6J1HXF4 uncharacterized protein LOC1114675261.7e-11293.27Show/hide
Query:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA
        M NCTV+SCKVETS+GVKLHTRVFKPTDEE RENLVVVLVHPYSVLGGCQGLLRGIAGGLAERG+RAVTFDMRGAGKSSGR SLTGFAEIKDVTAVC W 
Subjt:  MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWA

Query:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH
        CE+LSVNRILLVGSSAGAPIAGSSVDLIEQVVGY+SLGYPFGLTASILFGRHHKA+LQSPKPKLFVMGT+DGFTSVKQLQNKLKSAAGR E HLIEGVSH
Subjt:  CEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISSL
        FEMEGPAYDAQMVNLIL FISSL
Subjt:  FEMEGPAYDAQMVNLILHFISSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G26750.1 alpha/beta-Hydrolases superfamily protein7.5e-0432.73Show/hide
Query:  DGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRA---SLTGFAEIKDVTAVCKWACEDLSVNRILL
        +G+ +H  +  P+D       +V+L+H +  L       R    GLA RGYRAV  D+RG G S   A   S T F  + D+ AV     ++    ++ +
Subjt:  DGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRA---SLTGFAEIKDVTAVCKWACEDLSVNRILL

Query:  VGSSAGAPIA
        VG   GA IA
Subjt:  VGSSAGAPIA

AT5G19630.1 alpha/beta-Hydrolases superfamily protein9.5e-9271.24Show/hide
Query:  ANCTVESCKVETSDGVKLHTRVFKPTDEEAR----ENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVC
        +N  VESC V++ +GVKLHTR+FKP +E       ENLV+VLVHP+S+LGGCQ LL+GIA  LA +G+++VTFD RGAGKS+GRA+LTGFAE+KDV AVC
Subjt:  ANCTVESCKVETSDGVKLHTRVFKPTDEEAR----ENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVC

Query:  KWACEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEG
        +W C+++  +RILLVGSSAGAPIAGS+V+ +EQVVGYVSLGYPFGL ASILFGRHHKA+L SPKPKLFVMGTQDGFTSV QL+ KLKSA GR E HLIEG
Subjt:  KWACEDLSVNRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEG

Query:  VSHFEMEGPAYDAQMVNLILHFISSL
        VSHF+MEGP YD+Q+ ++I  FISSL
Subjt:  VSHFEMEGPAYDAQMVNLILHFISSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAACTGCACCGTCGAGTCTTGTAAAGTCGAAACCAGTGATGGGGTCAAGCTCCACACGAGGGTTTTCAAGCCAACAGATGAGGAGGCGAGGGAAAATCTGGTGGT
TGTTTTGGTCCATCCCTATTCGGTTTTGGGTGGTTGTCAAGGGCTTTTGAGAGGAATTGCAGGTGGGTTAGCGGAGAGGGGTTATAGGGCTGTGACTTTTGATATGAGGG
GTGCTGGGAAATCGTCTGGAAGGGCTTCTCTGACTGGATTTGCAGAAATTAAGGATGTGACTGCTGTTTGCAAGTGGGCCTGTGAGGATTTGTCTGTTAATCGAATTTTG
TTGGTGGGTTCATCTGCAGGAGCCCCCATTGCAGGCTCATCTGTGGATTTGATAGAACAAGTGGTAGGCTATGTCAGCCTTGGCTATCCCTTTGGCCTAACTGCCTCAAT
TCTTTTTGGAAGACATCACAAAGCCGTTTTACAGTCTCCGAAACCAAAGCTTTTCGTGATGGGCACACAGGACGGGTTCACGAGTGTGAAGCAACTGCAGAACAAGTTAA
AATCTGCAGCAGGACGTGTCGAAATGCATCTAATAGAAGGTGTGAGCCACTTTGAGATGGAAGGCCCTGCATATGATGCTCAAATGGTGAATCTTATCCTTCATTTCATT
TCTTCTTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAACTGCACCGTCGAGTCTTGTAAAGTCGAAACCAGTGATGGGGTCAAGCTCCACACGAGGGTTTTCAAGCCAACAGATGAGGAGGCGAGGGAAAATCTGGTGGT
TGTTTTGGTCCATCCCTATTCGGTTTTGGGTGGTTGTCAAGGGCTTTTGAGAGGAATTGCAGGTGGGTTAGCGGAGAGGGGTTATAGGGCTGTGACTTTTGATATGAGGG
GTGCTGGGAAATCGTCTGGAAGGGCTTCTCTGACTGGATTTGCAGAAATTAAGGATGTGACTGCTGTTTGCAAGTGGGCCTGTGAGGATTTGTCTGTTAATCGAATTTTG
TTGGTGGGTTCATCTGCAGGAGCCCCCATTGCAGGCTCATCTGTGGATTTGATAGAACAAGTGGTAGGCTATGTCAGCCTTGGCTATCCCTTTGGCCTAACTGCCTCAAT
TCTTTTTGGAAGACATCACAAAGCCGTTTTACAGTCTCCGAAACCAAAGCTTTTCGTGATGGGCACACAGGACGGGTTCACGAGTGTGAAGCAACTGCAGAACAAGTTAA
AATCTGCAGCAGGACGTGTCGAAATGCATCTAATAGAAGGTGTGAGCCACTTTGAGATGGAAGGCCCTGCATATGATGCTCAAATGGTGAATCTTATCCTTCATTTCATT
TCTTCTTTGTAG
Protein sequenceShow/hide protein sequence
MANCTVESCKVETSDGVKLHTRVFKPTDEEARENLVVVLVHPYSVLGGCQGLLRGIAGGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVTAVCKWACEDLSVNRIL
LVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAVLQSPKPKLFVMGTQDGFTSVKQLQNKLKSAAGRVEMHLIEGVSHFEMEGPAYDAQMVNLILHFI
SSL