; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g15330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g15330
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr8:11821684..11822795
RNA-Seq ExpressionMoc08g15330
SyntenyMoc08g15330
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132578.1 uncharacterized protein LOC111005406 [Momordica charantia]4.7e-3653.97Show/hide
Query:  MFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIG
        M  R KLEQR DLCSR+FTTGD+VLAN+ R  +GLY  M + +++P++VAAEYDW G+ K+I+ Y DGTH +Y  RW+++DV+Y+P+NI G HW+M+ I 
Subjt:  MFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIG

Query:  LQEGEIIVWDSMRSMTSFPTLESEFR
        L+EGEI+VWDS+  MT+   +E   +
Subjt:  LQEGEIIVWDSMRSMTSFPTLESEFR

XP_022148308.1 uncharacterized protein LOC111016993 [Momordica charantia]1.4e-5140.39Show/hide
Query:  TDSDAEGPGATLQPDLDEGSFNVMVD--EKRKKVMR--------YDPLVNVLSEQVQKFHAWL--ANPNTDRATRKLCYGDWGKTWFRDLITSGKWMMSE
        TD   +  GA    ++D       +   EK+KKV +         D  V V+ ++++     L         ATRK  Y    K WFR+L+  G W  +E
Subjt:  TDSDAEGPGATLQPDLDEGSFNVMVD--EKRKKVMR--------YDPLVNVLSEQVQKFHAWL--ANPNTDRATRKLCYGDWGKTWFRDLITSGKWMMSE

Query:  VIDSLFMFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHW
        V+D LFM  R KLEQR DLCSR+FTTGD+VLAN+ R  + +Y  M + +++P++VAAEYDW G+ ++I+ Y DGTHT+Y  RW+++D +Y+P+NI G HW
Subjt:  VIDSLFMFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHW

Query:  IMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP
        +M+ I L+EGEI+VWDS+ SMT+   +E   + M  ++P ++ +  V  ++P LP
Subjt:  IMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]2.3e-4351.66Show/hide
Query:  MFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIG
        MF  NKL+ R +LC R+FTTGD++++NFLR T+G+Y  M +PN + +RVA++YDW G+  ++LSY+DGTH++   RW+D+D VY+PYNIGG HWI++ I 
Subjt:  MFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIG

Query:  LQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLPNT
          EGE+IVWDS  +MT  P LE E +PM+ ++P L+ R GV + +P +P T
Subjt:  LQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLPNT

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]7.0e-3239.15Show/hide
Query:  NPNTDRATRKLCYGDWGKTWFRDLITSGKWMMSEVIDSLFMFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCK
        +P+TD  +R    G   K+WF  L+     +  E IDSL M T  K+E+ + L   +F  GD++L+N LR T+G Y  M  P  +P++    YDW  + +
Subjt:  NPNTDRATRKLCYGDWGKTWFRDLITSGKWMMSEVIDSLFMFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCK

Query:  TILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP
        TI  Y+ G  ++Y   W + D+VY   NIGGNHW+M+ I L EG++ VWDS++++T    LE   +PM  ++PA++H  G+  LRP LP
Subjt:  TILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP

XP_022159362.1 uncharacterized protein LOC111025779 [Momordica charantia]4.6e-3180.95Show/hide
Query:  MDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP
        MDGTHT+YQ RWLDLDVVY+PYNIGGNHWIM+HI LQ+ EII+WD MRSMT FPTLESE R M VVL ALMHR GVQVLR TLP
Subjt:  MDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP

TrEMBL top hitse value%identityAlignment
A0A6J1BWN0 uncharacterized protein LOC1110054062.3e-3653.97Show/hide
Query:  MFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIG
        M  R KLEQR DLCSR+FTTGD+VLAN+ R  +GLY  M + +++P++VAAEYDW G+ K+I+ Y DGTH +Y  RW+++DV+Y+P+NI G HW+M+ I 
Subjt:  MFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIG

Query:  LQEGEIIVWDSMRSMTSFPTLESEFR
        L+EGEI+VWDS+  MT+   +E   +
Subjt:  LQEGEIIVWDSMRSMTSFPTLESEFR

A0A6J1D3R7 uncharacterized protein LOC1110169935.0e-5240.78Show/hide
Query:  TDSDAEGPGATLQPDLDEGSFNVMVD--EKRKKVMR--------YDPLVNVLSEQVQKFHAWL--ANPNTDRATRKLCYGDWGKTWFRDLITSGKWMMSE
        TD   +  GA    ++D       +   EK+KKV +         D  V V+ ++++     L         ATRK  Y    K WFR+L+  G W  +E
Subjt:  TDSDAEGPGATLQPDLDEGSFNVMVD--EKRKKVMR--------YDPLVNVLSEQVQKFHAWL--ANPNTDRATRKLCYGDWGKTWFRDLITSGKWMMSE

Query:  VIDSLFMFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHW
        V+D LFM  R KLEQR DLCSR+FTTGD+VLAN+ R  + LY  M + +++P++VAAEYDW G+ ++I+ Y DGTHT+Y  RW+++D +Y+P+NI G HW
Subjt:  VIDSLFMFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHW

Query:  IMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP
        +M+ I L+EGEI+VWDS+ SMT+   +E   + M  ++P ++ +  V  ++P LP
Subjt:  IMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP

A0A6J1DLV0 uncharacterized protein LOC1110216461.1e-4351.66Show/hide
Query:  MFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIG
        MF  NKL+ R +LC R+FTTGD++++NFLR T+G+Y  M +PN + +RVA++YDW G+  ++LSY+DGTH++   RW+D+D VY+PYNIGG HWI++ I 
Subjt:  MFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIG

Query:  LQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLPNT
          EGE+IVWDS  +MT  P LE E +PM+ ++P L+ R GV + +P +P T
Subjt:  LQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLPNT

A0A6J1DY60 uncharacterized protein LOC1110252733.4e-3239.15Show/hide
Query:  NPNTDRATRKLCYGDWGKTWFRDLITSGKWMMSEVIDSLFMFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCK
        +P+TD  +R    G   K+WF  L+     +  E IDSL M T  K+E+ + L   +F  GD++L+N LR T+G Y  M  P  +P++    YDW  + +
Subjt:  NPNTDRATRKLCYGDWGKTWFRDLITSGKWMMSEVIDSLFMFTRNKLEQRQDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCK

Query:  TILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP
        TI  Y+ G  ++Y   W + D+VY   NIGGNHW+M+ I L EG++ VWDS++++T    LE   +PM  ++PA++H  G+  LRP LP
Subjt:  TILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP

A0A6J1DZM8 uncharacterized protein LOC1110257792.2e-3180.95Show/hide
Query:  MDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP
        MDGTHT+YQ RWLDLDVVY+PYNIGGNHWIM+HI LQ+ EII+WD MRSMT FPTLESE R M VVL ALMHR GVQVLR TLP
Subjt:  MDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALMHRPGVQVLRPTLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G08430.1 Ulp1 protease family protein5.1e-0431.75Show/hide
Query:  KRW-LDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALM
        K+W +D+D +Y    + GNHW+ L I L +  I V+DS+ S+T+   +  +   ++ ++PA++
Subjt:  KRW-LDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALM

AT5G28235.1 Ulp1 protease family protein5.1e-0430.16Show/hide
Query:  KRW-LDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALM
        K+W +D+D +Y    + GNHW+ L I L +  + V+DS+ S+T+   +  +   ++ ++PA++
Subjt:  KRW-LDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALM

AT5G45570.1 Ulp1 protease family protein6.6e-0430.16Show/hide
Query:  KRW-LDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALM
        K+W +D+D +Y    + GNHW+ L I L    + V+DS+ S+T+   +  +   ++ ++PA++
Subjt:  KRW-LDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPTLESEFRPMVVVLPALM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTACGAGGACTTCACGGACTCTGATGCGGAAGGGCCGGGAGCGACGTTGCAACCAGACCTGGATGAGGGGTCGTTCAATGTCATGGTGGACGAAAAGCGGAAGAA
GGTAATGCGATATGACCCACTAGTCAACGTCCTCTCTGAACAAGTCCAGAAGTTTCATGCTTGGCTGGCGAACCCTAACACTGATCGCGCCACTCGCAAATTATGTTACG
GTGATTGGGGAAAGACATGGTTTCGTGACCTTATCACCTCAGGCAAGTGGATGATGAGTGAGGTGATCGATTCGTTGTTCATGTTCACGCGGAACAAACTCGAGCAACGG
CAGGACTTGTGTTCTCGAAGATTCACCACCGGTGACATTGTCCTTGCGAACTTTCTTCGACTAACAAACGGACTATATCAACGCATGACGGCCCCGAACGCAGTACCTGC
GAGAGTTGCAGCGGAATATGATTGGGCTGGCAAGTGTAAGACCATCCTGAGCTATATGGATGGGACGCACACCAACTATCAGAAACGGTGGCTTGATCTGGATGTTGTTT
ACGTGCCATACAACATCGGTGGCAACCATTGGATTATGCTACACATTGGTCTGCAGGAGGGTGAGATCATTGTGTGGGATTCTATGAGGTCGATGACATCATTTCCAACT
CTGGAGTCCGAGTTTAGGCCGATGGTTGTTGTCCTACCAGCGTTGATGCACAGACCCGGTGTTCAGGTATTGAGGCCGACACTACCGAATACCCATGGCGCATTCATCAA
GTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACTACGAGGACTTCACGGACTCTGATGCGGAAGGGCCGGGAGCGACGTTGCAACCAGACCTGGATGAGGGGTCGTTCAATGTCATGGTGGACGAAAAGCGGAAGAA
GGTAATGCGATATGACCCACTAGTCAACGTCCTCTCTGAACAAGTCCAGAAGTTTCATGCTTGGCTGGCGAACCCTAACACTGATCGCGCCACTCGCAAATTATGTTACG
GTGATTGGGGAAAGACATGGTTTCGTGACCTTATCACCTCAGGCAAGTGGATGATGAGTGAGGTGATCGATTCGTTGTTCATGTTCACGCGGAACAAACTCGAGCAACGG
CAGGACTTGTGTTCTCGAAGATTCACCACCGGTGACATTGTCCTTGCGAACTTTCTTCGACTAACAAACGGACTATATCAACGCATGACGGCCCCGAACGCAGTACCTGC
GAGAGTTGCAGCGGAATATGATTGGGCTGGCAAGTGTAAGACCATCCTGAGCTATATGGATGGGACGCACACCAACTATCAGAAACGGTGGCTTGATCTGGATGTTGTTT
ACGTGCCATACAACATCGGTGGCAACCATTGGATTATGCTACACATTGGTCTGCAGGAGGGTGAGATCATTGTGTGGGATTCTATGAGGTCGATGACATCATTTCCAACT
CTGGAGTCCGAGTTTAGGCCGATGGTTGTTGTCCTACCAGCGTTGATGCACAGACCCGGTGTTCAGGTATTGAGGCCGACACTACCGAATACCCATGGCGCATTCATCAA
GTAA
Protein sequenceShow/hide protein sequence
MDYEDFTDSDAEGPGATLQPDLDEGSFNVMVDEKRKKVMRYDPLVNVLSEQVQKFHAWLANPNTDRATRKLCYGDWGKTWFRDLITSGKWMMSEVIDSLFMFTRNKLEQR
QDLCSRRFTTGDIVLANFLRLTNGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTNYQKRWLDLDVVYVPYNIGGNHWIMLHIGLQEGEIIVWDSMRSMTSFPT
LESEFRPMVVVLPALMHRPGVQVLRPTLPNTHGAFIK