; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g15280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g15280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr1:9626225..9627928
RNA-Seq ExpressionMoc01g15280
SyntenyMoc01g15280
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141784.1 uncharacterized protein LOC111012067 [Momordica charantia]1.1e-4253.64Show/hide
Query:  LSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAP
        ++Y+  +HS+Y   W++V+AVY+P N+ GN+W+M+C+DF EGE+VVWDSL ++T D  LE  L+ MRTV+P+LLH   +++V P+L ++ WR RRV SAP
Subjt:  LSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAP

Query:  QQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF
        QQ G  DC IF VK+FEYDVTG++  TL Q+ M  FRRQFA Q+W+N  V+
Subjt:  QQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF

XP_022153247.1 uncharacterized protein LOC111020782 [Momordica charantia]7.3e-4455.28Show/hide
Query:  YDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIML
        YDW    ++I  YV G  SDYD  W + D VY PMNIGGNHW+M+ +D  EG+L VWDSL ++T   +LE ALKPM T++P +LH  GM+++RP L  + 
Subjt:  YDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIML

Query:  WRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF
        WR RR  + PQQ G  DCGIF V+FFEYDVTGS   TL Q  + LFRRQ+AVQ+WA  P F
Subjt:  WRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]3.2e-5553.59Show/hide
Query:  MFTSKKLAQRMDLCARKFTTDDVLFT--------------TPFSVATRM--KYDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVD
        MF   KL  R +LC RKFTT DVL +              +P  +A+R+   YDWE  A S+LSY+ GTHSD D  W DVDAVY+P NIGG HWI++C+D
Subjt:  MFTSKKLAQRMDLCARKFTTDDVLFT--------------TPFSVATRM--KYDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVD

Query:  FEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRR
        F+EGEL+VWDS +++T   +LE  LKPM T++P L+   G+   +P++ +  WR RRV SAPQQ  DGDCGIF + FFEYDVT  +  TLTQ RM  FRR
Subjt:  FEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRR

Query:  QFAVQIWAN
        QFAVQ+WAN
Subjt:  QFAVQIWAN

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia]2.8e-4354.3Show/hide
Query:  LSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAP
        ++Y+  +HSDY   W++V+AVY+P N+ GNHW+M+C+DF EGE+VVWDSL ++TS   LE  LK M TV+P+LLH   +++VRP+L +  WR RRV S P
Subjt:  LSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAP

Query:  QQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF
        +Q   GDCGIF VK+FEYDVT ++  TL Q+ M  FRRQFA Q+W+N  V+
Subjt:  QQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]1.3e-4545.62Show/hide
Query:  EVIDTLFMFTSKKLAQRMDLCARKFTTDDVLFTT-------------PFSVATRMKYDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIM
        E ID+L M T++K+ +   L   +F   DVL +              P  + ++  YDW    ++I  YV G  SDYD  W + D VY  MNIGGNHW+M
Subjt:  EVIDTLFMFTSKKLAQRMDLCARKFTTDDVLFTT-------------PFSVATRMKYDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIM

Query:  VCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRML
        + +D  EG+L VWDSL ++T   +LE ALKPM T++PA+LH  G++++RP+L ++ WR RR  + PQQ G  DC IF V+FFEYDV GS   TL Q  + 
Subjt:  VCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRML

Query:  LFRRQFAVQIWANIPVF
        LFRRQ+AVQ+WA  P F
Subjt:  LFRRQFAVQIWANIPVF

TrEMBL top hitse value%identityAlignment
A0A6J1CJT2 uncharacterized protein LOC1110120675.1e-4353.64Show/hide
Query:  LSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAP
        ++Y+  +HS+Y   W++V+AVY+P N+ GN+W+M+C+DF EGE+VVWDSL ++T D  LE  L+ MRTV+P+LLH   +++V P+L ++ WR RRV SAP
Subjt:  LSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAP

Query:  QQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF
        QQ G  DC IF VK+FEYDVTG++  TL Q+ M  FRRQFA Q+W+N  V+
Subjt:  QQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF

A0A6J1DID7 uncharacterized protein LOC1110207823.5e-4455.28Show/hide
Query:  YDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIML
        YDW    ++I  YV G  SDYD  W + D VY PMNIGGNHW+M+ +D  EG+L VWDSL ++T   +LE ALKPM T++P +LH  GM+++RP L  + 
Subjt:  YDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIML

Query:  WRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF
        WR RR  + PQQ G  DCGIF V+FFEYDVTGS   TL Q  + LFRRQ+AVQ+WA  P F
Subjt:  WRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF

A0A6J1DLV0 uncharacterized protein LOC1110216461.5e-5553.59Show/hide
Query:  MFTSKKLAQRMDLCARKFTTDDVLFT--------------TPFSVATRM--KYDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVD
        MF   KL  R +LC RKFTT DVL +              +P  +A+R+   YDWE  A S+LSY+ GTHSD D  W DVDAVY+P NIGG HWI++C+D
Subjt:  MFTSKKLAQRMDLCARKFTTDDVLFT--------------TPFSVATRM--KYDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVD

Query:  FEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRR
        F+EGEL+VWDS +++T   +LE  LKPM T++P L+   G+   +P++ +  WR RRV SAPQQ  DGDCGIF + FFEYDVT  +  TLTQ RM  FRR
Subjt:  FEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRR

Query:  QFAVQIWAN
        QFAVQ+WAN
Subjt:  QFAVQIWAN

A0A6J1DQZ3 uncharacterized protein LOC1110234421.3e-4354.3Show/hide
Query:  LSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAP
        ++Y+  +HSDY   W++V+AVY+P N+ GNHW+M+C+DF EGE+VVWDSL ++TS   LE  LK M TV+P+LLH   +++VRP+L +  WR RRV S P
Subjt:  LSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAP

Query:  QQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF
        +Q   GDCGIF VK+FEYDVT ++  TL Q+ M  FRRQFA Q+W+N  V+
Subjt:  QQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF

A0A6J1DY60 uncharacterized protein LOC1110252736.4e-4645.62Show/hide
Query:  EVIDTLFMFTSKKLAQRMDLCARKFTTDDVLFTT-------------PFSVATRMKYDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIM
        E ID+L M T++K+ +   L   +F   DVL +              P  + ++  YDW    ++I  YV G  SDYD  W + D VY  MNIGGNHW+M
Subjt:  EVIDTLFMFTSKKLAQRMDLCARKFTTDDVLFTT-------------PFSVATRMKYDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIM

Query:  VCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRML
        + +D  EG+L VWDSL ++T   +LE ALKPM T++PA+LH  G++++RP+L ++ WR RR  + PQQ G  DC IF V+FFEYDV GS   TL Q  + 
Subjt:  VCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRML

Query:  LFRRQFAVQIWANIPVF
        LFRRQ+AVQ+WA  P F
Subjt:  LFRRQFAVQIWANIPVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases8.1e-0927.47Show/hide
Query:  RADFHTPEVIDTLFMFTSKKLAQRMDLCARKFTTDDVLFTTPFSVATRM--KYDWELNAK------SILSYVYGT-HSDYDHAWKDVDAVYMPMNIGGNH
        R    + +V+D L  F S+ L +  D+   K    DVL +   S  TR+  K+   L  K      +++  + G   S+    + + D VYMP N    H
Subjt:  RADFHTPEVIDTLFMFTSKKLAQRMDLCARKFTTDDVLFTTPFSVATRM--KYDWELNAK------SILSYVYGT-HSDYDHAWKDVDAVYMPMNIGGNH

Query:  WIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAV
        W+ +CVD +  ++ + DS + L  D  L A L+P+  +LP L       S    +S+  +   R    PQ     D G+ +V
Subjt:  WIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAV

AT4G08430.1 Ulp1 protease family protein1.5e-1027.82Show/hide
Query:  DVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALL-HHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKF
        DVD +Y  + + GNHW+ + +D  +  + V+DS+ SLT+D E+      + T++PA+L         R S S + W  +R+   P+     DC I+++K+
Subjt:  DVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALL-HHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKF

Query:  FEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANI
         E    G +   L  + M     + AV+++  +
Subjt:  FEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANI

AT5G28235.1 Ulp1 protease family protein2.4e-0536.21Show/hide
Query:  DVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALL
        DVD +Y  + + GNHW+ + +D  +  + V+DS+ SLT+D E+      + T++PA+L
Subjt:  DVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALL

AT5G45570.1 Ulp1 protease family protein9.3e-1330.08Show/hide
Query:  DVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALL-HHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKF
        DVD +Y  + + GNHW+ + +D     + V+DS+ SLT+D E+      + T++PA+L         R S S + W  +R+   P+    GDC I+++K+
Subjt:  DVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALL-HHCGMMSVRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKF

Query:  FEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANI
         E    G +   L  + M   R + AV+++  I
Subjt:  FEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAGGCGTTTAGATGGGATGGAAGATGACCTGAAGGCGATTAGGAAGTACCTGCGGCATCTTTCTAAGGCCGTTGATCCTAACGAGCTTAAGAAATCGCAACGAAA
TGTGCATCTGGAATCTGCAGAATTGGAACAGAACGATTTGGAGGACGTTATCGATGCGACACCGAAAAAGGTTGGGAAATCACCTCATTCAGTTGTCAAATCTGGACCAA
CCACCGTCGAGGTCGTGGCTCCAAAGGATCCCCTGCCCGATGCCCGTACTAGTGATAGGGACCCGGACGCATGTACAGAACGAGATGTGCCGCCCATCGATGTTCAGGAT
CCCATACTGGACGCAGGTACTAGTGTTAGGGACCTGGCCATCGATTTTCAGGATCACCGTCTCGATGCCTGTACTAGTGAGGTGGCGGCCGCAGACAATTTTCTTGCATC
GCAACCCGACGATAAGGACCCGCCTATTGACACGGTCACCGCTGATATCGTTTCTTCGAAGGATCCATCGATTGACGTGCACTCGCAAGCGGAGTTGGTGGTAGCTTGTA
GTGAGTCTGAGGACTCACGAGCTGATTTTCATACTCCGGAAGTTATCGATACACTCTTCATGTTCACGAGCAAGAAGCTCGCGCAGCGGATGGACCTGTGTGCTCGAAAA
TTTACCACCGACGACGTACTCTTTACGACACCTTTTTCCGTGGCGACAAGAATGAAGTACGATTGGGAGCTAAACGCGAAGTCTATATTGAGCTACGTATACGGAACACA
CTCCGACTATGACCATGCATGGAAAGACGTGGACGCAGTCTACATGCCAATGAACATTGGAGGAAACCACTGGATTATGGTTTGTGTCGATTTCGAGGAGGGTGAATTGG
TCGTGTGGGATTCCCTCCTGTCATTAACGTCGGACGTCGAGTTAGAAGCGGCGTTGAAACCCATGCGGACAGTTTTACCAGCATTACTGCATCACTGCGGGATGATGTCT
GTTCGCCCGAGTCTCTCAATCATGCTGTGGCGCGCTCGGCGGGTGGTGTCAGCACCCCAACAGGAAGGTGATGGGGATTGTGGTATATTTGCTGTAAAATTTTTTGAATA
CGATGTAACGGGTTCTAATACAAGAACCCTAACTCAGGATAGAATGTTATTATTTAGGAGGCAGTTCGCAGTCCAAATATGGGCGAACATACCGGTATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAGGCGTTTAGATGGGATGGAAGATGACCTGAAGGCGATTAGGAAGTACCTGCGGCATCTTTCTAAGGCCGTTGATCCTAACGAGCTTAAGAAATCGCAACGAAA
TGTGCATCTGGAATCTGCAGAATTGGAACAGAACGATTTGGAGGACGTTATCGATGCGACACCGAAAAAGGTTGGGAAATCACCTCATTCAGTTGTCAAATCTGGACCAA
CCACCGTCGAGGTCGTGGCTCCAAAGGATCCCCTGCCCGATGCCCGTACTAGTGATAGGGACCCGGACGCATGTACAGAACGAGATGTGCCGCCCATCGATGTTCAGGAT
CCCATACTGGACGCAGGTACTAGTGTTAGGGACCTGGCCATCGATTTTCAGGATCACCGTCTCGATGCCTGTACTAGTGAGGTGGCGGCCGCAGACAATTTTCTTGCATC
GCAACCCGACGATAAGGACCCGCCTATTGACACGGTCACCGCTGATATCGTTTCTTCGAAGGATCCATCGATTGACGTGCACTCGCAAGCGGAGTTGGTGGTAGCTTGTA
GTGAGTCTGAGGACTCACGAGCTGATTTTCATACTCCGGAAGTTATCGATACACTCTTCATGTTCACGAGCAAGAAGCTCGCGCAGCGGATGGACCTGTGTGCTCGAAAA
TTTACCACCGACGACGTACTCTTTACGACACCTTTTTCCGTGGCGACAAGAATGAAGTACGATTGGGAGCTAAACGCGAAGTCTATATTGAGCTACGTATACGGAACACA
CTCCGACTATGACCATGCATGGAAAGACGTGGACGCAGTCTACATGCCAATGAACATTGGAGGAAACCACTGGATTATGGTTTGTGTCGATTTCGAGGAGGGTGAATTGG
TCGTGTGGGATTCCCTCCTGTCATTAACGTCGGACGTCGAGTTAGAAGCGGCGTTGAAACCCATGCGGACAGTTTTACCAGCATTACTGCATCACTGCGGGATGATGTCT
GTTCGCCCGAGTCTCTCAATCATGCTGTGGCGCGCTCGGCGGGTGGTGTCAGCACCCCAACAGGAAGGTGATGGGGATTGTGGTATATTTGCTGTAAAATTTTTTGAATA
CGATGTAACGGGTTCTAATACAAGAACCCTAACTCAGGATAGAATGTTATTATTTAGGAGGCAGTTCGCAGTCCAAATATGGGCGAACATACCGGTATTTTGA
Protein sequenceShow/hide protein sequence
MDRRLDGMEDDLKAIRKYLRHLSKAVDPNELKKSQRNVHLESAELEQNDLEDVIDATPKKVGKSPHSVVKSGPTTVEVVAPKDPLPDARTSDRDPDACTERDVPPIDVQD
PILDAGTSVRDLAIDFQDHRLDACTSEVAAADNFLASQPDDKDPPIDTVTADIVSSKDPSIDVHSQAELVVACSESEDSRADFHTPEVIDTLFMFTSKKLAQRMDLCARK
FTTDDVLFTTPFSVATRMKYDWELNAKSILSYVYGTHSDYDHAWKDVDAVYMPMNIGGNHWIMVCVDFEEGELVVWDSLLSLTSDVELEAALKPMRTVLPALLHHCGMMS
VRPSLSIMLWRARRVVSAPQQEGDGDCGIFAVKFFEYDVTGSNTRTLTQDRMLLFRRQFAVQIWANIPVF