; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021980 (gene) of Snake gourd v1 genome

Gene IDTan0021980
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPhloem filament protein
Genome locationLG02:49830524..49850959
RNA-Seq ExpressionTan0021980
SyntenyTan0021980
Gene Ontology termsGO:0010466 - negative regulation of peptidase activity (biological process)
GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR009994 - Phloem filament PP1
IPR027214 - Cystatin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAC12676.1 phloem filament protein [Cucurbita maxima]1.7e-5855.19Show/hide
Query:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK
        MS  VK C GQ   VEKWIKIPDV+  CV EV K AV +FN++   +LK+D I+EGWY ELG + LKY+L ++A D   R L +EA+V EEKP  ERIRK
Subjt:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK

Query:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF
        L  F+ ++  GH+VG V PP+ E WIKIP+L  PFV +++KF + E+N K  + LK+  IY+GWYAEMG+D+++FRLHVK KDCLGR+RN+E +V +K F
Subjt:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF

Query:  FGIVIKTFESFK
            IK  ESFK
Subjt:  FGIVIKTFESFK

KAG6588840.1 hypothetical protein SDJN03_17405, partial [Cucurbita argyrosperma subsp. sororia]5.7e-5953.95Show/hide
Query:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK
        MS  VK C GQ   VEKWIKIPDV+  CV EV K AV +FN++   +LK++ I+EGW+ ELG + LKY+L ++A D   R L +EA+V EEKP  ERIRK
Subjt:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK

Query:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF
        L  F  ++  GH+VG V PPK E WIKIP+L  PFV++++KF +  +N K+ + LK+  +Y+GWYAEMG+D+++FRLHVK KDCLGR+RN+E +V +KHF
Subjt:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF

Query:  FGIVIKTFESFKPIE
            IK FESFK ++
Subjt:  FGIVIKTFESFKPIE

XP_022927698.1 uncharacterized protein LOC111434515 [Cucurbita moschata]1.5e-5954.88Show/hide
Query:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK
        MS  VK C GQ   VEKWIKIPDV+  CV EV K AV +FN++   +LK++ I+EGW+ ELG + LKY+L ++A D   R L +EA+V EEKP  ERIRK
Subjt:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK

Query:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF
        L  F+ I+  GH+VG V PPK E WIKIP+L  PFV++++KF +  +N K+ + LK+  IY+GWYAEMG+D+++FRLHVK KDCLGR+RN+E +V +KHF
Subjt:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF

Query:  FGIVIKTFESFKPIE
            IK FESFK ++
Subjt:  FGIVIKTFESFKPIE

XP_022989566.1 uncharacterized protein LOC111486626 isoform X1 [Cucurbita maxima]9.8e-5954.42Show/hide
Query:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK
        MS  VK C GQ   VEKWIKIPDV+  CV EV K AV +FN++   +LK+D I+EGWY ELG + LKY+L ++A D   R L +EA+V EEKP  ERIRK
Subjt:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK

Query:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF
        L  F+ ++  GH+VG V PP+ E WIKIP+L  PFV +++KF + E+N K  + LK+  IY+GWYAEMG+D+++FRLHVK KDCLGR+RN+E +V +K F
Subjt:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF

Query:  FGIVIKTFESFKPIE
            IK  ESFK ++
Subjt:  FGIVIKTFESFKPIE

XP_022989567.1 uncharacterized protein LOC111486626 isoform X2 [Cucurbita maxima]9.8e-5954.42Show/hide
Query:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK
        MS  VK C GQ   VEKWIKIPDV+  CV EV K AV +FN++   +LK+D I+EGWY ELG + LKY+L ++A D   R L +EA+V EEKP  ERIRK
Subjt:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK

Query:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF
        L  F+ ++  GH+VG V PP+ E WIKIP+L  PFV +++KF + E+N K  + LK+  IY+GWYAEMG+D+++FRLHVK KDCLGR+RN+E +V +K F
Subjt:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF

Query:  FGIVIKTFESFKPIE
            IK  ESFK ++
Subjt:  FGIVIKTFESFKPIE

TrEMBL top hitse value%identityAlignment
A0A6J1EIQ8 uncharacterized protein LOC1114345179.8e-5756.13Show/hide
Query:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK
        MS   K C  Q  VVEKWIKIPDVN  C+ +V KFAV +FN +    L ++ I+EGWY+ELG + LKY+L +KA D  GRLL +E +V EEKP KERIRK
Subjt:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK

Query:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF
        L  FV+I   GH VG V PPK + WIKIPNL   FVI+++KF V EFN K  D L F+SIYEGWY EMG D ++FRL +K  DCL RV ++E VVF+K+ 
Subjt:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF

Query:  FGIVIKTFESFK
         G  I   ESF+
Subjt:  FGIVIKTFESFK

A0A6J1ELQ9 uncharacterized protein LOC1114345157.3e-6054.88Show/hide
Query:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK
        MS  VK C GQ   VEKWIKIPDV+  CV EV K AV +FN++   +LK++ I+EGW+ ELG + LKY+L ++A D   R L +EA+V EEKP  ERIRK
Subjt:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK

Query:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF
        L  F+ I+  GH+VG V PPK E WIKIP+L  PFV++++KF +  +N K+ + LK+  IY+GWYAEMG+D+++FRLHVK KDCLGR+RN+E +V +KHF
Subjt:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF

Query:  FGIVIKTFESFKPIE
            IK FESFK ++
Subjt:  FGIVIKTFESFKPIE

A0A6J1JG70 uncharacterized protein LOC111486626 isoform X24.7e-5954.42Show/hide
Query:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK
        MS  VK C GQ   VEKWIKIPDV+  CV EV K AV +FN++   +LK+D I+EGWY ELG + LKY+L ++A D   R L +EA+V EEKP  ERIRK
Subjt:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK

Query:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF
        L  F+ ++  GH+VG V PP+ E WIKIP+L  PFV +++KF + E+N K  + LK+  IY+GWYAEMG+D+++FRLHVK KDCLGR+RN+E +V +K F
Subjt:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF

Query:  FGIVIKTFESFKPIE
            IK  ESFK ++
Subjt:  FGIVIKTFESFKPIE

A0A6J1JKF7 uncharacterized protein LOC111486626 isoform X14.7e-5954.42Show/hide
Query:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK
        MS  VK C GQ   VEKWIKIPDV+  CV EV K AV +FN++   +LK+D I+EGWY ELG + LKY+L ++A D   R L +EA+V EEKP  ERIRK
Subjt:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK

Query:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF
        L  F+ ++  GH+VG V PP+ E WIKIP+L  PFV +++KF + E+N K  + LK+  IY+GWYAEMG+D+++FRLHVK KDCLGR+RN+E +V +K F
Subjt:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF

Query:  FGIVIKTFESFKPIE
            IK  ESFK ++
Subjt:  FGIVIKTFESFKPIE

P94012 Phloem filament protein8.1e-5955.19Show/hide
Query:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK
        MS  VK C GQ   VEKWIKIPDV+  CV EV K AV +FN++   +LK+D I+EGWY ELG + LKY+L ++A D   R L +EA+V EEKP  ERIRK
Subjt:  MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRK

Query:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF
        L  F+ ++  GH+VG V PP+ E WIKIP+L  PFV +++KF + E+N K  + LK+  IY+GWYAEMG+D+++FRLHVK KDCLGR+RN+E +V +K F
Subjt:  LQGFVVIVTQGHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHF

Query:  FGIVIKTFESFK
            IK  ESFK
Subjt:  FGIVIKTFESFK

SwissProt top hitse value%identityAlignment
Q41916 Cysteine proteinase inhibitor 56.5e-0529.35Show/hide
Query:  WIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHFFGIVIKTFESFKPIEN
        W  I N+ +P V+++ +F V E+NK++   LKFE++  G    +   +  +RL V   D  G  +N+  +V+ K +  +  +   SF+P  N
Subjt:  WIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHFFGIVIKTFESFKPIEN

Arabidopsis top hitse value%identityAlignment
AT5G47550.1 Cystatin/monellin superfamily protein4.6e-0629.35Show/hide
Query:  WIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHFFGIVIKTFESFKPIEN
        W  I N+ +P V+++ +F V E+NK++   LKFE++  G    +   +  +RL V   D  G  +N+  +V+ K +  +  +   SF+P  N
Subjt:  WIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHFFGIVIKTFESFKPIEN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTGAAGTAAAGGCTTGCGGTGGTCAAGCCGTGGTTGTGGAGAAGTGGATTAAAATCCCTGATGTTAATGAGAAATGTGTGTTGGAGGTTGTAAAGTTTGCAGT
CGCGGAGTTCAACATTAAATTTAAACAAACTCTCAAATTTGATATCATTTATGAAGGTTGGTATGTCGAGTTGGGTGTAGACAAACTAAAGTACAAGCTCCTACTTAAGG
CGCACGACCTTTTTGGACGCTTGCTGAATTTTGAGGCTATTGTAACCGAAGAGAAGCCTCTAAAGGAAAGAATCAGGAAGCTACAAGGTTTCGTCGTCATAGTCACACAA
GGACACCATGTTGGCAATGTTATTCCGCCGAAGCCTGAGATCTGGATTAAAATCCCTAATCTTTGTGAGCCATTCGTGATAGATCTAGCAAAGTTTGTGGTCATAGAATT
CAACAAGAAAAATAATGATAGCCTAAAATTTGAGAGCATTTACGAGGGTTGGTATGCAGAGATGGGCGAAGACCACATGAGGTTTCGTCTCCATGTTAAAGTGAAAGATT
GTCTCGGACGAGTGCGCAACTTTGAGGGTGTTGTGTTCATAAAGCACTTCTTTGGTATAGTAATCAAGACGTTCGAAAGTTTCAAACCTATCGAAAATAAGTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTGAAGTAAAGGCTTGCGGTGGTCAAGCCGTGGTTGTGGAGAAGTGGATTAAAATCCCTGATGTTAATGAGAAATGTGTGTTGGAGGTTGTAAAGTTTGCAGT
CGCGGAGTTCAACATTAAATTTAAACAAACTCTCAAATTTGATATCATTTATGAAGGTTGGTATGTCGAGTTGGGTGTAGACAAACTAAAGTACAAGCTCCTACTTAAGG
CGCACGACCTTTTTGGACGCTTGCTGAATTTTGAGGCTATTGTAACCGAAGAGAAGCCTCTAAAGGAAAGAATCAGGAAGCTACAAGGTTTCGTCGTCATAGTCACACAA
GGACACCATGTTGGCAATGTTATTCCGCCGAAGCCTGAGATCTGGATTAAAATCCCTAATCTTTGTGAGCCATTCGTGATAGATCTAGCAAAGTTTGTGGTCATAGAATT
CAACAAGAAAAATAATGATAGCCTAAAATTTGAGAGCATTTACGAGGGTTGGTATGCAGAGATGGGCGAAGACCACATGAGGTTTCGTCTCCATGTTAAAGTGAAAGATT
GTCTCGGACGAGTGCGCAACTTTGAGGGTGTTGTGTTCATAAAGCACTTCTTTGGTATAGTAATCAAGACGTTCGAAAGTTTCAAACCTATCGAAAATAAGTGTTAA
Protein sequenceShow/hide protein sequence
MSSEVKACGGQAVVVEKWIKIPDVNEKCVLEVVKFAVAEFNIKFKQTLKFDIIYEGWYVELGVDKLKYKLLLKAHDLFGRLLNFEAIVTEEKPLKERIRKLQGFVVIVTQ
GHHVGNVIPPKPEIWIKIPNLCEPFVIDLAKFVVIEFNKKNNDSLKFESIYEGWYAEMGEDHMRFRLHVKVKDCLGRVRNFEGVVFIKHFFGIVIKTFESFKPIENKC