CuGenDBv2

Moc03g27770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g27770
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ulp1-like peptidase
Genome location: chr3:19977645..19979825
RNA-Seq Expression: Moc03g27770
Synteny: Moc03g27770
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains: IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits | e value | % identity
XP_022132578.1 uncharacterized protein LOC111005406 [Momordica charantia] | 8.7e-42 | 63.49
Query:  MFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICID
        M VRKKLE R DLC RKFTTGD VL+N+ RR DG+YACMQS S++ S+VA EYDW+GR ++++ Y DGTH DY  RWM+VD +YLP+NI GKHWVM+CID
Subjt:  MFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICID

Query:  FEEGELIVWDSFMSMTPLSKLEVELK
         EEGE++VWDS   MT    +E  LK
Subjt:  FEEGELIVWDSFMSMTPLSKLEVELK

XP_022148308.1 uncharacterized protein LOC111016993 [Momordica charantia] | 1.0e-58 | 55.91
Query:  STRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYI
        +TR T Y+ + K WF+++L P  WC+ EV+D L M +RKKLE R DLC RKFTTGD VL+N+ RR D +YACMQS S++ S+VA EYDW+GR  +++ Y 
Subjt:  STRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYI

Query:  DGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTP
        DGTH+DY  RWM+VDA+Y+P+NI GKHWVM+CID EEGE++VWDS  SMT    +E  LK M TIIP ++ +  V   +PN+P+ P
Subjt:  DGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTP

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia] | 8.7e-74 | 77.14
Query:  MFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICID
        MFV  KL+LR +LCRRKFTTGD ++SNFLR TDGVY  MQS + IASRVA++YDWEGRA ++LSYIDGTHSD + RWMDVDAVYLPYNIGG HW++ICID
Subjt:  MFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICID

Query:  FEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRVASAPQQDMGGDCGTTAI
        F+EGELIVWDSFM+MTPL +LE ELKPM+TIIP LICR GVHL KPNIPLTPWRIRRV+SAPQQ M GDCG   I
Subjt:  FEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRVASAPQQDMGGDCGTTAI

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia] | 9.1e-31 | 55.96
Query:  LSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRVASAP
        ++YID +HSDY  +W +V+AVYLP+N+ G HWVMICIDF EGE++VWDS  ++T  + LE +LK M T+IP+L+ ++ V   +PN+P+TPWRIRRV S P
Subjt:  LSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRVASAP

Query:  QQDMGGDCG
        +Q   GDCG
Subjt:  QQDMGGDCG

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia] | 5.7e-41 | 44.81
Query:  LDDANNPRSTRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGR
        +DD +   ++R+T    + K WF  +L P+     E IDSL+M   +K+E    L R +F  GD +LSN LRRTDG YA M+    + S+    YDW  +
Subjt:  LDDANNPRSTRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGR

Query:  ARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRV
         RT+  Y+ G  SDY+  W + D VY   NIGG HWVMI ID  EG+L VWDS  ++TPL  LE  LKPM TIIPA++  +G+   +PN+P+ PWR+RR 
Subjt:  ARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRV

Query:  ASAPQQDMGGDC
         + PQQ    DC
Subjt:  ASAPQQDMGGDC

TrEMBL top hits | e value | % identity
A0A6J1BWN0 uncharacterized protein LOC111005406 | 4.2e-42 | 63.49
Query:  MFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICID
        M VRKKLE R DLC RKFTTGD VL+N+ RR DG+YACMQS S++ S+VA EYDW+GR ++++ Y DGTH DY  RWM+VD +YLP+NI GKHWVM+CID
Subjt:  MFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICID

Query:  FEEGELIVWDSFMSMTPLSKLEVELK
         EEGE++VWDS   MT    +E  LK
Subjt:  FEEGELIVWDSFMSMTPLSKLEVELK

A0A6J1D3R7 uncharacterized protein LOC111016993 | 6.5e-59 | 55.91
Query:  STRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYI
        +TR T Y+ + K WF+++L P  WC+ EV+D L M +RKKLE R DLC RKFTTGD VL+N+ RR D +YACMQS S++ S+VA EYDW+GR  +++ Y 
Subjt:  STRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYI

Query:  DGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTP
        DGTH+DY  RWM+VDA+Y+P+NI GKHWVM+CID EEGE++VWDS  SMT    +E  LK M TIIP ++ +  V   +PN+P+ P
Subjt:  DGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTP

A0A6J1DLV0 uncharacterized protein LOC111021646 | 4.2e-74 | 77.14
Query:  MFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICID
        MFV  KL+LR +LCRRKFTTGD ++SNFLR TDGVY  MQS + IASRVA++YDWEGRA ++LSYIDGTHSD + RWMDVDAVYLPYNIGG HW++ICID
Subjt:  MFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICID

Query:  FEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRVASAPQQDMGGDCGTTAI
        F+EGELIVWDSFM+MTPL +LE ELKPM+TIIP LICR GVHL KPNIPLTPWRIRRV+SAPQQ M GDCG   I
Subjt:  FEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRVASAPQQDMGGDCGTTAI

A0A6J1DQZ3 uncharacterized protein LOC111023442 | 4.4e-31 | 55.96
Query:  LSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRVASAP
        ++YID +HSDY  +W +V+AVYLP+N+ G HWVMICIDF EGE++VWDS  ++T  + LE +LK M T+IP+L+ ++ V   +PN+P+TPWRIRRV S P
Subjt:  LSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRVASAP

Query:  QQDMGGDCG
        +Q   GDCG
Subjt:  QQDMGGDCG

A0A6J1DY60 uncharacterized protein LOC111025273 | 2.7e-41 | 44.81
Query:  LDDANNPRSTRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGR
        +DD +   ++R+T    + K WF  +L P+     E IDSL+M   +K+E    L R +F  GD +LSN LRRTDG YA M+    + S+    YDW  +
Subjt:  LDDANNPRSTRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKLELRLDLCRRKFTTGDRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGR

Query:  ARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRV
         RT+  Y+ G  SDY+  W + D VY   NIGG HWVMI ID  EG+L VWDS  ++TPL  LE  LKPM TIIPA++  +G+   +PN+P+ PWR+RR 
Subjt:  ARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRV

Query:  ASAPQQDMGGDC
         + PQQ    DC
Subjt:  ASAPQQDMGGDC

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases | 5.0e-11 | 21.35
Query:  KDTDKPHDGVNEADKDT----DAMEAEPTGRGGEVG-DALNVVEAMETVEAKHQDVGGTLKDSETVETKQSDVLETDHLETTALASVTQQV----TEHPK
        +D + P DGV      T     ++  E T  G +V  D LN V  ++ + +              V    S V      E+   A  + Q+      HP 
Subjt:  KDTDKPHDGVNEADKDT----DAMEAEPTGRGGEVG-DALNVVEAMETVEAKHQDVGGTLKDSETVETKQSDVLETDHLETTALASVTQQV----TEHPK

Query:  KIPVLHTPEVNELYDDFSDTESDELLDIQSQPLESSSHVADDHSEDVLVLSQPFNPPPP---------------------RRGERKRRTPWKLRGSFEVK
         +P       +      +DT   + +        SS   +   +  V ++ + FNPPPP                     R+ +R R    KL G F   
Subjt:  KIPVLHTPEVNELYDDFSDTESDELLDIQSQPLESSSHVADDHSEDVLVLSQPFNPPPP---------------------RRGERKRRTPWKLRGSFEVK

Query:  VGSKRKKVMV-----YNLVADIPKEQDSKFQKWLDDANNPRSTRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKL--------ELRLDLCRRK
           K+ K++V     +  V ++  + + ++Q+ L       S      A    +   DI+  +K  S +V+D L+ F R  L        +LR+D+   K
Subjt:  VGSKRKKVMV-----YNLVADIPKEQDSKFQKWLDDANNPRSTRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKL--------ELRLDLCRRK

Query:  FTTG-DRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMT
        F +   R+   F                  ++     D+   +  V   I    S+  + + + D VY+P+N   KHWV +C+D +  ++ + DS + + 
Subjt:  FTTG-DRVLSNFLRRTDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMT

Query:  PLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRVASAPQQDMGGDCGTTAI-ILSIHGRGVVNSC
          + L  EL+P+  ++P L  +     +  +I L P+ + R    PQ     D G  ++ ++  H  G V  C
Subjt:  PLSKLEVELKPMVTIIPALICRAGVHLNKPNIPLTPWRIRRVASAPQQDMGGDCGTTAI-ILSIHGRGVVNSC


Sequences
CDS sequence
ATGCCTACAACAGCTGGCATGGAAGGTGACTTGAAAAGCATTCGGAAGTTCTTGCGTCGTCTAGTTAAGGGTATACCAGTCGATACTTCAGTTCTTCGAAAGAACAATAG
TGGAGATGGCAACGGCATTGGTGATGGAGATTCTGCAGGACCAATTGGTGGTAGTAAGAGTCCGACTAAGGACACAGACAAACCGCACGACGGTGTAAATGAGGCTGACA
AAGACACCGACGCAATGGAAGCGGAGCCTACGGGCCGAGGAGGTGAGGTTGGGGACGCGTTGAATGTAGTGGAGGCTATGGAAACCGTGGAAGCGAAGCATCAAGATGTC
GGTGGGACTTTGAAAGATTCTGAAACTGTAGAAACGAAGCAGTCCGACGTTCTAGAAACCGACCATTTGGAAACTACGGCATTAGCCAGTGTGACGCAGCAAGTTACAGA
ACATCCGAAGAAGATACCAGTTCTACATACACCCGAGGTAAACGAACTATACGATGACTTTTCGGACACAGAATCGGATGAGCTCCTTGACATCCAGTCACAACCCCTAG
AGAGCTCATCACACGTGGCAGACGATCATTCAGAGGACGTGCTCGTTCTATCCCAGCCATTTAATCCCCCTCCACCTCGTCGGGGTGAACGGAAGAGGAGAACACCATGG
AAGCTTCGAGGGTCCTTCGAAGTTAAGGTTGGAAGTAAACGAAAGAAGGTCATGGTTTACAATCTCGTCGCCGATATACCGAAGGAACAGGACTCCAAATTTCAGAAGTG
GCTGGATGATGCCAATAACCCACGATCAACGCGGACAACATGCTACGCAAAACGAGGTAAGCAATGGTTTCAGGATATTCTAACACCAAGAAAATGGTGTTCATGCGAGG
TAATCGATTCGTTGGTCATGTTCGTGCGTAAAAAACTCGAATTGCGACTGGACTTGTGTCGCCGCAAGTTCACCACCGGTGACCGAGTACTCTCGAATTTTCTTAGACGA
ACGGACGGTGTCTATGCCTGCATGCAATCTCATAGCGCCATTGCTTCAAGGGTTGCGACTGAATATGACTGGGAAGGAAGAGCGAGGACTGTGCTCAGCTACATCGATGG
TACTCACTCGGACTACGAGAAGCGCTGGATGGATGTTGATGCCGTTTATCTACCGTATAACATCGGTGGAAAGCACTGGGTAATGATATGCATCGACTTCGAAGAGGGTG
AGCTCATTGTGTGGGACTCCTTCATGTCCATGACACCACTGTCCAAGTTGGAAGTAGAGCTGAAGCCGATGGTCACTATTATACCAGCACTTATTTGTAGGGCCGGTGTT
CATCTAAACAAGCCGAATATACCACTCACGCCATGGCGCATCCGTAGAGTCGCATCAGCACCCCAGCAGGACATGGGCGGTGATTGCGGCACCACGGCCATAATATTATC
AATTCACGGCCGTGGTGTAGTGAATTCATGCCCGTGGTTGTCTGGATTCAAGGCCGTGGTGTTGAAAATTAAAGCTCATGGTTATCTAGATTCAAGTTCATGGTGTTGA
mRNA sequence
ATGCCTACAACAGCTGGCATGGAAGGTGACTTGAAAAGCATTCGGAAGTTCTTGCGTCGTCTAGTTAAGGGTATACCAGTCGATACTTCAGTTCTTCGAAAGAACAATAG
TGGAGATGGCAACGGCATTGGTGATGGAGATTCTGCAGGACCAATTGGTGGTAGTAAGAGTCCGACTAAGGACACAGACAAACCGCACGACGGTGTAAATGAGGCTGACA
AAGACACCGACGCAATGGAAGCGGAGCCTACGGGCCGAGGAGGTGAGGTTGGGGACGCGTTGAATGTAGTGGAGGCTATGGAAACCGTGGAAGCGAAGCATCAAGATGTC
GGTGGGACTTTGAAAGATTCTGAAACTGTAGAAACGAAGCAGTCCGACGTTCTAGAAACCGACCATTTGGAAACTACGGCATTAGCCAGTGTGACGCAGCAAGTTACAGA
ACATCCGAAGAAGATACCAGTTCTACATACACCCGAGGTAAACGAACTATACGATGACTTTTCGGACACAGAATCGGATGAGCTCCTTGACATCCAGTCACAACCCCTAG
AGAGCTCATCACACGTGGCAGACGATCATTCAGAGGACGTGCTCGTTCTATCCCAGCCATTTAATCCCCCTCCACCTCGTCGGGGTGAACGGAAGAGGAGAACACCATGG
AAGCTTCGAGGGTCCTTCGAAGTTAAGGTTGGAAGTAAACGAAAGAAGGTCATGGTTTACAATCTCGTCGCCGATATACCGAAGGAACAGGACTCCAAATTTCAGAAGTG
GCTGGATGATGCCAATAACCCACGATCAACGCGGACAACATGCTACGCAAAACGAGGTAAGCAATGGTTTCAGGATATTCTAACACCAAGAAAATGGTGTTCATGCGAGG
TAATCGATTCGTTGGTCATGTTCGTGCGTAAAAAACTCGAATTGCGACTGGACTTGTGTCGCCGCAAGTTCACCACCGGTGACCGAGTACTCTCGAATTTTCTTAGACGA
ACGGACGGTGTCTATGCCTGCATGCAATCTCATAGCGCCATTGCTTCAAGGGTTGCGACTGAATATGACTGGGAAGGAAGAGCGAGGACTGTGCTCAGCTACATCGATGG
TACTCACTCGGACTACGAGAAGCGCTGGATGGATGTTGATGCCGTTTATCTACCGTATAACATCGGTGGAAAGCACTGGGTAATGATATGCATCGACTTCGAAGAGGGTG
AGCTCATTGTGTGGGACTCCTTCATGTCCATGACACCACTGTCCAAGTTGGAAGTAGAGCTGAAGCCGATGGTCACTATTATACCAGCACTTATTTGTAGGGCCGGTGTT
CATCTAAACAAGCCGAATATACCACTCACGCCATGGCGCATCCGTAGAGTCGCATCAGCACCCCAGCAGGACATGGGCGGTGATTGCGGCACCACGGCCATAATATTATC
AATTCACGGCCGTGGTGTAGTGAATTCATGCCCGTGGTTGTCTGGATTCAAGGCCGTGGTGTTGAAAATTAAAGCTCATGGTTATCTAGATTCAAGTTCATGGTGTTGA
Protein sequence
MPTTAGMEGDLKSIRKFLRRLVKGIPVDTSVLRKNNSGDGNGIGDGDSAGPIGGSKSPTKDTDKPHDGVNEADKDTDAMEAEPTGRGGEVGDALNVVEAMETVEAKHQDV
GGTLKDSETVETKQSDVLETDHLETTALASVTQQVTEHPKKIPVLHTPEVNELYDDFSDTESDELLDIQSQPLESSSHVADDHSEDVLVLSQPFNPPPPRRGERKRRTPW
KLRGSFEVKVGSKRKKVMVYNLVADIPKEQDSKFQKWLDDANNPRSTRTTCYAKRGKQWFQDILTPRKWCSCEVIDSLVMFVRKKLELRLDLCRRKFTTGDRVLSNFLRR
TDGVYACMQSHSAIASRVATEYDWEGRARTVLSYIDGTHSDYEKRWMDVDAVYLPYNIGGKHWVMICIDFEEGELIVWDSFMSMTPLSKLEVELKPMVTIIPALICRAGV
HLNKPNIPLTPWRIRRVASAPQQDMGGDCGTTAIILSIHGRGVVNSCPWLSGFKAVVLKIKAHGYLDSSSWC