; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g35120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g35120
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionULP_PROTEASE domain-containing protein
Genome locationchr9:26884517..26886547
RNA-Seq ExpressionMoc09g35120
SyntenyMoc09g35120
Gene Ontology termsNA
InterPro domainsIPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031708.1 hypothetical protein E6C27_scaffold139G004940 [Cucumis melo var. makuwa]6.9e-3039.63Show/hide
Query:  GSNVLADELGSHILWQGHL-IAITEK-GDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP
        G  +L+ E+GS +LW  HL I + EK   V   +  L        R+ P+ LR LL E+    S++ + V   VFG  R   ++ + +Q F +M+ I+  
Subjt:  GSNVLADELGSHILWQGHL-IAITEK-GDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP

Query:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA
        CIDA++ +LYK +   G    YKF DAGSIS   +SKE R + L  +L+  +  Q+L+ PYNSG+HW LI ID+S+   Y ++PLRNR++ND  DVV  A
Subjt:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA

Query:  LNKCNKVKLSVWEMPCC
         +  NK K  VW +  C
Subjt:  LNKCNKVKLSVWEMPCC

KAA0063750.1 transposase [Cucumis melo var. makuwa]6.9e-3039.63Show/hide
Query:  GSNVLADELGSHILWQGHL-IAITEK-GDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP
        G  +L+ E+GS +LW  HL I + EK   V   +  L        R+ P+ LR LL E+    S++ + V   VFG  R   ++ + +Q F +M+ I+  
Subjt:  GSNVLADELGSHILWQGHL-IAITEK-GDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP

Query:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA
        CIDA++ +LYK +   G    YKF DAGSIS   +SKE R + L  +L+  +  Q+L+ PYNSG+HW LI ID+S+   Y ++PLRNR++ND  DVV  A
Subjt:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA

Query:  LNKCNKVKLSVWEMPCC
         +  NK K  VW +  C
Subjt:  LNKCNKVKLSVWEMPCC

TYJ96009.1 uncharacterized protein E5676_scaffold2612G00150 [Cucumis melo var. makuwa]1.1e-3037.79Show/hide
Query:  RGSNVLADELGSHILWQGHLIAITE-KGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP
        +G   ++ E+GSHILW   L+     K D     + +  +     ++ P+ LR LLR ++   S + +   +DVFG  R   +  + ++ F  M+ I   
Subjt:  RGSNVLADELGSHILWQGHLIAITE-KGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP

Query:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA
        C+DAYI+YLY ++ +    ++YKFLDAGSIS  + SKE R + LT +L+  +  QLLL PYNSG+HW L+VI+ +K   + I+PL+NR+D D+ +VV R+
Subjt:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA

Query:  LNKCNKVKLSVWEMPCC
         N  NK K   W +  C
Subjt:  LNKCNKVKLSVWEMPCC

TYK08419.1 uncharacterized protein E5676_scaffold654G00340 [Cucumis melo var. makuwa]1.1e-3037.79Show/hide
Query:  RGSNVLADELGSHILWQGHLIAITE-KGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP
        +G   ++ E+GSHILW   L+     K D     + +  +     ++ P+ LR LLR ++   S + +   +DVFG  R   +  + ++ F  M+ I   
Subjt:  RGSNVLADELGSHILWQGHLIAITE-KGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP

Query:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA
        C+DAYI+YLY ++ +    ++YKFLDAGSIS  + SKE R + LT +L+  +  QLLL PYNSG+HW L+VI+ +K   + I+PL+NR+D D+ +VV R+
Subjt:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA

Query:  LNKCNKVKLSVWEMPCC
         N  NK K   W +  C
Subjt:  LNKCNKVKLSVWEMPCC

TYK16521.1 uncharacterized protein E5676_scaffold21G003330 [Cucumis melo var. makuwa]2.2e-3139.17Show/hide
Query:  RGSNVLADELGSHILWQGHL-IAITEKGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP
        +G   ++ E+GSHILW   L IA   K D     + +  +     ++ PI LR LLR ++   S + +   +DVFG  R   +  + ++ F  M+ I   
Subjt:  RGSNVLADELGSHILWQGHL-IAITEKGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP

Query:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA
        C+DAYI+YLY ++ +    ++YKF DAGSIS  + SKE R + LT++L+  +  QLLL PYNSG+HW L+VI+ +K   + I+PL+NR+D D+ +VV R+
Subjt:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA

Query:  LNKCNKVKLSVWEMPCC
         N  NK K  VW +  C
Subjt:  LNKCNKVKLSVWEMPCC

TrEMBL top hitse value%identityAlignment
A0A5A7TVG6 ULP_PROTEASE domain-containing protein3.4e-3039.17Show/hide
Query:  GSNVLADELGSHILWQGHL-IAITEK-GDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP
        G  +L+ E+GS +LW  HL I + EK   V   +  L        R+ P+ LR LL E+    S++ + V   VFG  R   ++ + +Q F +M+ I+  
Subjt:  GSNVLADELGSHILWQGHL-IAITEK-GDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP

Query:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA
        CIDA++ +LYK +   G    YKF DAGS+S   +SKE R + L  +L+  +  Q+L+ PYNSG+HW LI ID+S+   Y ++PLRNR++ND  DVV  A
Subjt:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA

Query:  LNKCNKVKLSVWEMPCC
         +  NK K  VW +  C
Subjt:  LNKCNKVKLSVWEMPCC

A0A5A7V975 Transposase3.4e-3039.63Show/hide
Query:  GSNVLADELGSHILWQGHL-IAITEK-GDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP
        G  +L+ E+GS +LW  HL I + EK   V   +  L        R+ P+ LR LL E+    S++ + V   VFG  R   ++ + +Q F +M+ I+  
Subjt:  GSNVLADELGSHILWQGHL-IAITEK-GDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP

Query:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA
        CIDA++ +LYK +   G    YKF DAGSIS   +SKE R + L  +L+  +  Q+L+ PYNSG+HW LI ID+S+   Y ++PLRNR++ND  DVV  A
Subjt:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA

Query:  LNKCNKVKLSVWEMPCC
         +  NK K  VW +  C
Subjt:  LNKCNKVKLSVWEMPCC

A0A5D3CDJ5 ULP_PROTEASE domain-containing protein5.2e-3137.79Show/hide
Query:  RGSNVLADELGSHILWQGHLIAITE-KGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP
        +G   ++ E+GSHILW   L+     K D     + +  +     ++ P+ LR LLR ++   S + +   +DVFG  R   +  + ++ F  M+ I   
Subjt:  RGSNVLADELGSHILWQGHLIAITE-KGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP

Query:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA
        C+DAYI+YLY ++ +    ++YKFLDAGSIS  + SKE R + LT +L+  +  QLLL PYNSG+HW L+VI+ +K   + I+PL+NR+D D+ +VV R+
Subjt:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA

Query:  LNKCNKVKLSVWEMPCC
         N  NK K   W +  C
Subjt:  LNKCNKVKLSVWEMPCC

A0A5D3CX60 ULP_PROTEASE domain-containing protein1.0e-3139.17Show/hide
Query:  RGSNVLADELGSHILWQGHL-IAITEKGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP
        +G   ++ E+GSHILW   L IA   K D     + +  +     ++ PI LR LLR ++   S + +   +DVFG  R   +  + ++ F  M+ I   
Subjt:  RGSNVLADELGSHILWQGHL-IAITEKGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP

Query:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA
        C+DAYI+YLY ++ +    ++YKF DAGSIS  + SKE R + LT++L+  +  QLLL PYNSG+HW L+VI+ +K   + I+PL+NR+D D+ +VV R+
Subjt:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA

Query:  LNKCNKVKLSVWEMPCC
         N  NK K  VW +  C
Subjt:  LNKCNKVKLSVWEMPCC

A0A5D3D5Q6 ULP_PROTEASE domain-containing protein5.2e-3137.79Show/hide
Query:  RGSNVLADELGSHILWQGHLIAITE-KGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP
        +G   ++ E+GSHILW   L+     K D     + +  +     ++ P+ LR LLR ++   S + +   +DVFG  R   +  + ++ F  M+ I   
Subjt:  RGSNVLADELGSHILWQGHLIAITE-KGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSITDP

Query:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA
        C+DAYI+YLY ++ +    ++YKFLDAGSIS  + SKE R + LT +L+  +  QLLL PYNSG+HW L+VI+ +K   + I+PL+NR+D D+ +VV R+
Subjt:  CIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRA

Query:  LNKCNKVKLSVWEMPCC
         N  NK K   W +  C
Subjt:  LNKCNKVKLSVWEMPCC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCACTCGAGAGTGATGAAGATGATCTCGACTTTTTTGAGAGGATCGAACGTACTAGCTGACGAGTTAGGTTCGCACATATTATGGCAAGGACACCTCATTGCCAT
CACTGAAAAAGGGGATGTTGCAACCAGAGAGGAGGTGCTTGAGCTGTACCCGTTCGAATATGATCGAAGCATTCCAATTGTTCTGCGATGCCTGCTTCGTGAGATGAAGA
TATGTAAATCTCAGGTAATATTGCCAGTGACTAATGACGTGTTCGGATGTACCCGCATCTATAGAGTATGGACGGACATTGTTCAAACCTTTTATGAGATGAAGTCGATA
ACTGATCCGTGCATAGATGCTTATATCATATACTTATACAAGAAGTTGGTGACGAAAGGTGCATCTCATATGTACAAATTTCTTGATGCTGGATCAATATCGACATCAAA
TCTTTCAAAAGAAGGACGAACGGAGAATTTGACAAAACAACTTATGCAAATGGAGTCGGGTCAATTGCTTCTTGCTCCCTACAATAGTGGATCACATTGGATATTGATAG
TCATTGATTACTCGAAGACCATGGTGTATTCAATCAACCCTTTGAGAAATCGTCTGGACAATGATATAATGGATGTCGTCAACCGGGCTTTGAATAAGTGCAACAAAGTA
AAGCTTAGTGTTTGGGAAATGCCTTGTTGTGTCCATAGTGCCAACAACCTGGCTCAACAGAATGTGGGAGCAATCTTCGCACTAGAAATGCTTGTCGAACTGCGTTCGGG
TGAGCTGAAGCATGTTATGGCCGCTTCATTCAAAGTTCCATCATAA
mRNA sequenceShow/hide mRNA sequence
ATGATGCACTCGAGAGTGATGAAGATGATCTCGACTTTTTTGAGAGGATCGAACGTACTAGCTGACGAGTTAGGTTCGCACATATTATGGCAAGGACACCTCATTGCCAT
CACTGAAAAAGGGGATGTTGCAACCAGAGAGGAGGTGCTTGAGCTGTACCCGTTCGAATATGATCGAAGCATTCCAATTGTTCTGCGATGCCTGCTTCGTGAGATGAAGA
TATGTAAATCTCAGGTAATATTGCCAGTGACTAATGACGTGTTCGGATGTACCCGCATCTATAGAGTATGGACGGACATTGTTCAAACCTTTTATGAGATGAAGTCGATA
ACTGATCCGTGCATAGATGCTTATATCATATACTTATACAAGAAGTTGGTGACGAAAGGTGCATCTCATATGTACAAATTTCTTGATGCTGGATCAATATCGACATCAAA
TCTTTCAAAAGAAGGACGAACGGAGAATTTGACAAAACAACTTATGCAAATGGAGTCGGGTCAATTGCTTCTTGCTCCCTACAATAGTGGATCACATTGGATATTGATAG
TCATTGATTACTCGAAGACCATGGTGTATTCAATCAACCCTTTGAGAAATCGTCTGGACAATGATATAATGGATGTCGTCAACCGGGCTTTGAATAAGTGCAACAAAGTA
AAGCTTAGTGTTTGGGAAATGCCTTGTTGTGTCCATAGTGCCAACAACCTGGCTCAACAGAATGTGGGAGCAATCTTCGCACTAGAAATGCTTGTCGAACTGCGTTCGGG
TGAGCTGAAGCATGTTATGGCCGCTTCATTCAAAGTTCCATCATAA
Protein sequenceShow/hide protein sequence
MMHSRVMKMISTFLRGSNVLADELGSHILWQGHLIAITEKGDVATREEVLELYPFEYDRSIPIVLRCLLREMKICKSQVILPVTNDVFGCTRIYRVWTDIVQTFYEMKSI
TDPCIDAYIIYLYKKLVTKGASHMYKFLDAGSISTSNLSKEGRTENLTKQLMQMESGQLLLAPYNSGSHWILIVIDYSKTMVYSINPLRNRLDNDIMDVVNRALNKCNKV
KLSVWEMPCCVHSANNLAQQNVGAIFALEMLVELRSGELKHVMAASFKVPS