; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g28760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g28760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr9:21560312..21561438
RNA-Seq ExpressionMoc09g28760
SyntenyMoc09g28760
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148137.1 uncharacterized protein LOC111016890 [Momordica charantia]1.9e-3353.91Show/hide
Query:  GTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQSGS
        G  +D+   W D D VY P N+GGNHW+ML IDL +G+I VWDS++  T    LE EL+PM  +LP L+H  G+  +RP LP  PW +R+V   PQQS  
Subjt:  GTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQSGS

Query:  GDCGMFCVKYLEYDVTGSNMTSLTQDNI
         DCG+F V+Y EYD TGSNM +LTQDNI
Subjt:  GDCGMFCVKYLEYDVTGSNMTSLTQDNI

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]9.8e-3854.96Show/hide
Query:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS
        +DGT++D  TRW+D+DAVYLPYNIGG HWI++ ID  EGE+IVWDS   MT  P LE EL+PM  ++P L+ R GV + +P +P TPW IR+V+  PQQ 
Subjt:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS

Query:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS
          GDCG+FC+ + EYDVT  +  +LTQ  +S
Subjt:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS

XP_022155154.1 uncharacterized protein LOC111022296 [Momordica charantia]3.3e-3353.72Show/hide
Query:  RWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQSGSGDCGMFCV
        +W +++AVYLP+N+  NHW+M+ ID  EGEI+VWDS+  +T   +LE +L  M  V+P L+H++    +RP LP TPW I +VT TPQQ GSGDCG+FCV
Subjt:  RWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQSGSGDCGMFCV

Query:  KYLEYDVTGSNMTSLTQDNIS
        KY EYDVTG+++ +L Q+N+S
Subjt:  KYLEYDVTGSNMTSLTQDNIS

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia]1.7e-3752.67Show/hide
Query:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS
        +D +++DY  +W +++AVYLP+N+ GNHW+M+ ID  EGEI+VWDS+R +T + +LE +L+ M  V+P+L+H++ V  +RP LP TPW IR+VT TP+Q 
Subjt:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS

Query:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS
         SGDCG+FCVKY EYDVT +++ +L Q+N+S
Subjt:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS

XP_022159362.1 uncharacterized protein LOC111025779 [Momordica charantia]4.7e-5683.21Show/hide
Query:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS
        MDGT+TDYQTRWLDLD VYLPYNIGGNHWIM+HIDLQ+ EII+WD MR MT FP LESELR M VVL ALMHRAGVQVLR TLP  PW I QVT  PQQS
Subjt:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS

Query:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS
        GSGDCGMFCVKY EYDVT SNMTSLTQDNIS
Subjt:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS

TrEMBL top hitse value%identityAlignment
A0A6J1D492 uncharacterized protein LOC1110168909.2e-3453.91Show/hide
Query:  GTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQSGS
        G  +D+   W D D VY P N+GGNHW+ML IDL +G+I VWDS++  T    LE EL+PM  +LP L+H  G+  +RP LP  PW +R+V   PQQS  
Subjt:  GTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQSGS

Query:  GDCGMFCVKYLEYDVTGSNMTSLTQDNI
         DCG+F V+Y EYD TGSNM +LTQDNI
Subjt:  GDCGMFCVKYLEYDVTGSNMTSLTQDNI

A0A6J1DLV0 uncharacterized protein LOC1110216464.7e-3854.96Show/hide
Query:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS
        +DGT++D  TRW+D+DAVYLPYNIGG HWI++ ID  EGE+IVWDS   MT  P LE EL+PM  ++P L+ R GV + +P +P TPW IR+V+  PQQ 
Subjt:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS

Query:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS
          GDCG+FC+ + EYDVT  +  +LTQ  +S
Subjt:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS

A0A6J1DPE8 uncharacterized protein LOC1110222961.6e-3353.72Show/hide
Query:  RWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQSGSGDCGMFCV
        +W +++AVYLP+N+  NHW+M+ ID  EGEI+VWDS+  +T   +LE +L  M  V+P L+H++    +RP LP TPW I +VT TPQQ GSGDCG+FCV
Subjt:  RWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQSGSGDCGMFCV

Query:  KYLEYDVTGSNMTSLTQDNIS
        KY EYDVTG+++ +L Q+N+S
Subjt:  KYLEYDVTGSNMTSLTQDNIS

A0A6J1DQZ3 uncharacterized protein LOC1110234428.1e-3852.67Show/hide
Query:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS
        +D +++DY  +W +++AVYLP+N+ GNHW+M+ ID  EGEI+VWDS+R +T + +LE +L+ M  V+P+L+H++ V  +RP LP TPW IR+VT TP+Q 
Subjt:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS

Query:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS
         SGDCG+FCVKY EYDVT +++ +L Q+N+S
Subjt:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS

A0A6J1DZM8 uncharacterized protein LOC1110257792.3e-5683.21Show/hide
Query:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS
        MDGT+TDYQTRWLDLD VYLPYNIGGNHWIM+HIDLQ+ EII+WD MR MT FP LESELR M VVL ALMHRAGVQVLR TLP  PW I QVT  PQQS
Subjt:  MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQS

Query:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS
        GSGDCGMFCVKY EYDVT SNMTSLTQDNIS
Subjt:  GSGDCGMFCVKYLEYDVTGSNMTSLTQDNIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G08430.1 Ulp1 protease family protein1.7e-0826.02Show/hide
Query:  WLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHR-AGVQVLRPTLPNTPWSIRQVTFTPQQSGSGDCGMFCV
        ++D+D +Y    + GNHW+ L IDL +  I V+DS+  +T    +  +   +  ++PA++      +  R +     W  +++T  P+   + DC ++ +
Subjt:  WLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHR-AGVQVLRPTLPNTPWSIRQVTFTPQQSGSGDCGMFCV

Query:  KYLEYDVTGSNMTSLTQDNISSL
        KY+E    G +   L  +N+ SL
Subjt:  KYLEYDVTGSNMTSLTQDNISSL

AT4G15880.1 Cysteine proteinases superfamily protein4.4e-0426.42Show/hide
Query:  LDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPT---LPNTPWSIRQVTFTPQQSGSGDCGMFC
        +D D +++P +  G HW +  I+ +E +++  DS+          + + PM  +L AL    G +    +   +    W +  V   PQQ    DCGMF 
Subjt:  LDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPT---LPNTPWSIRQVTFTPQQSGSGDCGMFC

Query:  VKYLEY
        +KY+++
Subjt:  VKYLEY

AT5G45570.1 Ulp1 protease family protein3.5e-0926.02Show/hide
Query:  WLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHR-AGVQVLRPTLPNTPWSIRQVTFTPQQSGSGDCGMFCV
        ++D+D +Y    + GNHW+ L IDL    + V+DS+  +T    +  +   +  ++PA++      +  R +     W  +++T  P+    GDC ++ +
Subjt:  WLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHR-AGVQVLRPTLPNTPWSIRQVTFTPQQSGSGDCGMFCV

Query:  KYLEYDVTGSNMTSLTQDNISSL
        KY+E    G +   L  +N+ SL
Subjt:  KYLEYDVTGSNMTSLTQDNISSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGGACGTACACCGACTATCAGACACGATGGCTTGATCTGGATGCTGTTTACCTGCCATACAACATCGGTGGCAACCATTGGATTATGCTACATATCGATCTGCA
GGAGGGTGAGATCATTGTGTGGGATTCGATGAGGTTGATGACACACTTTCCAGCTCTGGAGTCCGAGTTGAGGCCGATGGCTGTTGTCCTACCTGCATTGATGCACAGGG
CCGGTGTTCAGGTACTGAGACCGACACTACCGAATACGCCATGGAGCATTCGTCAAGTAACGTTCACGCCCCAGCAAAGCGGGTCTGGTGACTGTGGGATGTTTTGCGTT
AAATATTTAGAGTATGATGTAACAGGGTCAAATATGACCAGCTTAACTCAGGACAACATTAGCTCGCTTGTTGATGGTATTCGTACTGTTTGCCGTCTACCAACTCTTGC
AACGAACTTCGGAGGCTCGATGGTGTACTCAACAAAGTCGTCGGGTAAAGGCCAGTCCTCCTCATCACCCAATGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGGGACGTACACCGACTATCAGACACGATGGCTTGATCTGGATGCTGTTTACCTGCCATACAACATCGGTGGCAACCATTGGATTATGCTACATATCGATCTGCA
GGAGGGTGAGATCATTGTGTGGGATTCGATGAGGTTGATGACACACTTTCCAGCTCTGGAGTCCGAGTTGAGGCCGATGGCTGTTGTCCTACCTGCATTGATGCACAGGG
CCGGTGTTCAGGTACTGAGACCGACACTACCGAATACGCCATGGAGCATTCGTCAAGTAACGTTCACGCCCCAGCAAAGCGGGTCTGGTGACTGTGGGATGTTTTGCGTT
AAATATTTAGAGTATGATGTAACAGGGTCAAATATGACCAGCTTAACTCAGGACAACATTAGCTCGCTTGTTGATGGTATTCGTACTGTTTGCCGTCTACCAACTCTTGC
AACGAACTTCGGAGGCTCGATGGTGTACTCAACAAAGTCGTCGGGTAAAGGCCAGTCCTCCTCATCACCCAATGGGTAG
Protein sequenceShow/hide protein sequence
MDGTYTDYQTRWLDLDAVYLPYNIGGNHWIMLHIDLQEGEIIVWDSMRLMTHFPALESELRPMAVVLPALMHRAGVQVLRPTLPNTPWSIRQVTFTPQQSGSGDCGMFCV
KYLEYDVTGSNMTSLTQDNISSLVDGIRTVCRLPTLATNFGGSMVYSTKSSGKGQSSSSPNG