CuGenDBv2

MS018807 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS018807
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Pept_C1 domain-containing protein
Genome location: scaffold14:33079..33345
RNA-Seq Expression: MS018807
Synteny: MS018807
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
  IPR000668 - Peptidase C1A, papain C-terminal
  IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits | e-value | % identity
XP_003561839.2 chymomexicain [Brachypodium distachyon] | 4.5e-10 | 42.86
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ
        CWA+  A +IE +Y I    R    LQ + Q L+DC      N GC+G    S  +AF+W+IKN GI++E+DYPF A++G+C +
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ

XP_022949074.1 zingipain-2-like [Cucurbita moschata] | 2.6e-10 | 40.23
Query:  ITAAGAIESIYNIKY--YDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIRGV
        + A  AIES+YNIK    +   +  Q +PQ LIDC++  P     +GCY  S  +AFRW+I N GI  E  YP+   +G+ ++I  V
Subjt:  ITAAGAIESIYNIKY--YDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIRGV

XP_023521676.1 low-temperature-induced cysteine proteinase-like [Cucurbita pepo subsp. pepo] | 3.8e-25 | 61.9
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ
        CWAI AAGAIES+YNI Y  ++ N LQAAPQHLIDCL P P + G + CYTS++ +AF+W++ NKGIA+EKDY F A +G+CK+
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ

XP_030939750.1 ervatamin-B-like [Quercus lobata] | 3.4e-10 | 45
Query:  ITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK
        +TA GAIE++Y ++  D     L  APQ LIDC    P++   KGCY  +  +AF+W++ N GI+ E+DYPF+ +KGD K
Subjt:  ITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK

XP_030939752.1 ervatamin-B-like [Quercus lobata] | 7.9e-15 | 50.6
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK
        CWAITA  AIE++Y ++  D     L  APQ LIDC    P+N   KGCY  +  +AF+W++KN GI+ E+DYPF+ +KGDCK
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK
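
The % identity column can be checked against an alignment's middle line. The sketch below assumes the usual BLAST-style convention, which this page does not state explicitly: the middle line repeats the residue letter where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The function name `percent_identity` is illustrative only; the two strings are copied from the XP_003561839.2 alignment above.

```python
def percent_identity(query_aligned: str, midline: str) -> float:
    """Return identical columns / alignment length, as a percentage.

    Assumes a BLAST-style midline: a letter marks an identical column,
    '+' a conservative substitution, and a space a mismatch or gap.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    # The aligned query (including any '-' gap characters) spans every column.
    return 100.0 * identical / len(query_aligned)


# Query and middle line of the first GenBank hit (XP_003561839.2).
QUERY = (
    "CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ"
)
MIDLINE = (
    "CWA+  A +IE +Y I    R    LQ + Q L+DC      N GC+G    S  +AF+W+IKN GI++E+DYPF A++G+C +"
)

if __name__ == "__main__":
    # 36 identical columns over 84 aligned positions
    print(f"{percent_identity(QUERY, MIDLINE):.2f}")  # prints 42.86
```

Only the letters on the middle line are counted, so the result does not depend on the subject line being shown.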

TrEMBL top hits | e-value | % identity
A0A6J1GBS9 zingipain-2-like | 1.3e-10 | 40.23
Query:  ITAAGAIESIYNIKY--YDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIRGV
        + A  AIES+YNIK    +   +  Q +PQ LIDC++  P     +GCY  S  +AFRW+I N GI  E  YP+   +G+ ++I  V
Subjt:  ITAAGAIESIYNIKY--YDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIRGV

A0A6P6YDT6 RNA helicase | 4.1e-09 | 41.67
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ
        CWA +A  +IES + IK  +     ++ + Q LIDC +P   N    GCY   + +AF+ VIKN GI  EK YP++A  G+C+Q
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ

A0A7N2MRH3 Pept_C1 domain-containing protein | 3.8e-15 | 50.6
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK
        CWAITA  AIE++Y ++  D     L  APQ LIDC    P+N   KGCY  +  +AF+W++KN GI+ E+DYPF+ +KGDCK
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK

A0A7N2RC65 Pept_C1 domain-containing protein | 1.7e-10 | 45
Query:  ITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK
        +TA GAIE++Y ++  D     L  APQ LIDC    P++   KGCY  +  +AF+W++ N GI+ E+DYPF+ +KGD K
Subjt:  ITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK

I1H7M2 Pept_C1 domain-containing protein | 2.2e-10 | 42.86
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ
        CWA+  A +IE +Y I    R    LQ + Q L+DC      N GC+G    S  +AF+W+IKN GI++E+DYPF A++G+C +
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ

SwissProt top hits | e-value | % identity
P14518 Stem bromelain | 1.1e-08 | 40.96
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK
        CWA  A   +ESIY IK     P     + Q ++DC     +  GCKG +     RAF ++I NKG+A    YP+KA KG CK
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK

P43297 Cysteine proteinase RD21A | 1.3e-09 | 40.7
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIR
        CWA +  GA+E I  I   D     +  + Q L+DC   T  N GC G     +  AF ++IKN GI  +KDYP+K   G C QIR
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIR

P80884 Ananain | 1.5e-08 | 38.55
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK
        CWA  +   +ESIY IK      N +  + Q ++DC      + GCKG +   I +A+ ++I NKG+A    YP+KA KG CK
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK

Q9FMH8 Probable cysteine protease RD21B | 5.1e-09 | 40.7
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIR
        CWA +  GA+E I  I   D     +  + Q L+DC   T  N GC G     +  AF ++IKN GI  E DYP+KA  G C Q R
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIR

Q9R014 Cathepsin J | 6.7e-09 | 36.14
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK
        CWA  AAGAIE     + + +  N    + Q+L+DC     +  G KGC + +  +AF +V+KNKG+  E  YP++ + G C+
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK

Arabidopsis top hits | e-value | % identity
AT1G09850.1 xylem bark cysteine peptidase 3 | 1.4e-09 | 40.48
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ
        CW+ +A GA+E I  I   D     +  + Q LIDC      N GC G     +  AF +VIKN GI  EKDYP++   G CK+
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQ

AT1G20850.1 xylem cysteine peptidase 2 | 1.2e-08 | 36.14
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK
        CWA +   A+E I  I       N    + Q LIDC   T  N GC G     +  AF +++KN G+ +E+DYP+  E+G C+
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK

AT1G29080.1 Papain family cysteine protease | 8.1e-10 | 33.73
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK
        CWA +A  A+E +  I       N +  + Q L+DC +   +N GCKG    +   AF ++IK++GI+ E +YP++ ++G C+
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCK

AT1G47128.1 Granulin repeat cysteine protease family protein | 9.6e-11 | 40.7
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIR
        CWA +  GA+E I  I   D     +  + Q L+DC   T  N GC G     +  AF ++IKN GI  +KDYP+K   G C QIR
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIR

AT5G43060.1 Granulin repeat cysteine protease family protein | 3.6e-10 | 40.7
Query:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIR
        CWA +  GA+E I  I   D     +  + Q L+DC   T  N GC G     +  AF ++IKN GI  E DYP+KA  G C Q R
Subjt:  CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIR


Sequences
CDS sequence
TGTTGGGCAATAACCGCGGCAGGGGCCATCGAATCTATTTACAATATTAAGTATTACGATCGATTCCCAAATGGACTTCAAGCCGCACCTCAACATCTCATAGATTGTCT
TCAGCCTACCCCAGAAAACCCAGGTTGTAAAGGATGCTATACGTCATCAATAGCAAGAGCTTTTCGGTGGGTCATCAAAAATAAAGGAATAGCAAGAGAGAAAGACTACC
CATTTAAAGCAGAAAAAGGAGATTGCAAACAAATTAGAGGGGTAAAA
mRNA sequence
TGTTGGGCAATAACCGCGGCAGGGGCCATCGAATCTATTTACAATATTAAGTATTACGATCGATTCCCAAATGGACTTCAAGCCGCACCTCAACATCTCATAGATTGTCT
TCAGCCTACCCCAGAAAACCCAGGTTGTAAAGGATGCTATACGTCATCAATAGCAAGAGCTTTTCGGTGGGTCATCAAAAATAAAGGAATAGCAAGAGAGAAAGACTACC
CATTTAAAGCAGAAAAAGGAGATTGCAAACAAATTAGAGGGGTAAAA
Protein sequence
CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIRGVK
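
The three sequences can be cross-checked against each other. The sketch below assumes the standard nuclear genetic code and 1-based inclusive genome coordinates; neither is stated on this page. It translates the CDS codon by codon and compares its length with the scaffold14:33079..33345 span. The names `CODON_TABLE` and `translate` are illustrative, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# this is the table that applies to this nuclear gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS and protein copied from the record above.
CDS = (
    "TGTTGGGCAATAACCGCGGCAGGGGCCATCGAATCTATTTACAATATTAAGTATTACGATCGATTCCCAAATGGACTTCAAGCCGCACCTCAACATCTCATAGATTGTCT"
    "TCAGCCTACCCCAGAAAACCCAGGTTGTAAAGGATGCTATACGTCATCAATAGCAAGAGCTTTTCGGTGGGTCATCAAAAATAAAGGAATAGCAAGAGAGAAAGACTACC"
    "CATTTAAAGCAGAAAAAGGAGATTGCAAACAAATTAGAGGGGTAAAA"
)
PROTEIN = (
    "CWAITAAGAIESIYNIKYYDRFPNGLQAAPQHLIDCLQPTPENPGCKGCYTSSIARAFRWVIKNKGIAREKDYPFKAEKGDCKQIRGVK"
)


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, one codon at a time."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


if __name__ == "__main__":
    span = 33345 - 33079 + 1  # inclusive coordinates (assumption)
    print(len(CDS), span, len(PROTEIN))  # prints 267 267 89
    print(translate(CDS) == PROTEIN)     # prints True
```

The CDS begins with TGT (Cys) and ends with AAA (Lys), so this record has neither an ATG start codon nor a stop codon, and the translation yields exactly 89 residues with no trailing `*`.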