; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g18580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g18580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr8:14032017..14040325
RNA-Seq ExpressionMoc08g18580
SyntenyMoc08g18580
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141784.1 uncharacterized protein LOC111012067 [Momordica charantia]1.7e-1963.75Show/hide
Query:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTG
        EIVVWDSL ++T D ++E+ L+ MRT+IP +L K  V+ V PNL + PW IRRVTSAPQQ G  DC IF VKYFEYDVTG
Subjt:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTG

XP_022148308.1 uncharacterized protein LOC111016993 [Momordica charantia]1.6e-1489.58Show/hide
Query:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQP
        EIVVWDSL SMTTDHAMEDHLK M TIIPVMLFKCDVMKV+PNL IQP
Subjt:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQP

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]4.7e-1447.78Show/hide
Query:  WSPVPSRADMCEIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVT
        W  +    D  E++VWDS  +MT    +E  LK M TIIP ++ +  V   +PN+ + PW IRRV+SAPQQ   GDCGIF + +FEYDVT
Subjt:  WSPVPSRADMCEIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVT

XP_022155154.1 uncharacterized protein LOC111022296 [Momordica charantia]9.7e-2062.5Show/hide
Query:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTG
        EIVVWDSL ++T D ++E+ L  MR +IP +L K   + VRPNL + PW I RVTS PQQ G GDCGIF VKYFEYDVTG
Subjt:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTG

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia]4.8e-1962.03Show/hide
Query:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVT
        EIVVWDSL ++T+  ++E+ LK M T+IP +L K  V+ VRPNL + PW IRRVTS P+Q   GDCGIF VKYFEYDVT
Subjt:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVT

TrEMBL top hitse value%identityAlignment
A0A6J1CJT2 uncharacterized protein LOC1110120678.1e-2063.75Show/hide
Query:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTG
        EIVVWDSL ++T D ++E+ L+ MRT+IP +L K  V+ V PNL + PW IRRVTSAPQQ G  DC IF VKYFEYDVTG
Subjt:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTG

A0A6J1D3R7 uncharacterized protein LOC1110169937.8e-1589.58Show/hide
Query:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQP
        EIVVWDSL SMTTDHAMEDHLK M TIIPVMLFKCDVMKV+PNL IQP
Subjt:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQP

A0A6J1DLV0 uncharacterized protein LOC1110216462.3e-1447.78Show/hide
Query:  WSPVPSRADMCEIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVT
        W  +    D  E++VWDS  +MT    +E  LK M TIIP ++ +  V   +PN+ + PW IRRV+SAPQQ   GDCGIF + +FEYDVT
Subjt:  WSPVPSRADMCEIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVT

A0A6J1DPE8 uncharacterized protein LOC1110222964.7e-2062.5Show/hide
Query:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTG
        EIVVWDSL ++T D ++E+ L  MR +IP +L K   + VRPNL + PW I RVTS PQQ G GDCGIF VKYFEYDVTG
Subjt:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTG

A0A6J1DQZ3 uncharacterized protein LOC1110234422.3e-1962.03Show/hide
Query:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVT
        EIVVWDSL ++T+  ++E+ LK M T+IP +L K  V+ VRPNL + PW IRRVTS P+Q   GDCGIF VKYFEYDVT
Subjt:  EIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G08430.1 Ulp1 protease family protein5.0e-0632.98Show/hide
Query:  IVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTGQGRQGRWGGYDQRLW
        I V+DS+ S+TTD  M      + T+IP ML      K R     +  W +R+T  P+ +   DC I+ +KY E    G+   G      Q LW
Subjt:  IVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTGQGRQGRWGGYDQRLW

AT5G45570.1 Ulp1 protease family protein2.9e-0634.15Show/hide
Query:  VWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTGQGRQG
        V+DS+ S+TTD  M      + T+IP ML      K R     +  W +R+T  P+ +  GDC I+ +KY E    G+   G
Subjt:  VWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNLLIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTGQGRQG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACGCGCTGTACGTTTGGTCAGAGGGGAACGCGCTTCTAGCAGGTGCAGGGTGGCACCGCCACCTGCGGCTTTGGGCATGTCATCTGCGGCTGTGGAATGTGCCCG
TCGTACCGACAGCATGCGTTCCCAACTGCAACTACGTATGGGGTGTAATCGCCTGTGGTCCCCTGTTCCTTCTAGGGCGGACATGTGCGAGATCGTAGTATGGGACTCCT
TGACTTCAATGACCACGGATCATGCTATGGAGGATCATTTGAAGGCAATGCGCACCATCATCCCAGTAATGCTCTTTAAATGTGATGTTATGAAAGTCCGACCCAATCTA
CTGATCCAACCATGGTGGATTCGACGAGTTACGTCAGCACCACAGCAGATTGGCGGTGGTGACTGTGGGATTTTCGGCGTCAAGTATTTTGAGTACGATGTAACCGGGCA
GGGGAGACAAGGTCGTTGGGGCGGTTACGACCAGCGGTTGTGGCTGTCGGATCCCCCAGGTGTCAAGTGGGCTTACCGTGGAAAGAATAATTGGAAATCAATGGATGATT
CTAGGGAAGAAAGAGATGTGGTGTGGACTCTATCACTTCTGTCGGACACCATGGGCAACGACTGCTACATACTTGTAGATACAGACCCCGTAGATAGTTACCCATATGGG
GTCGTTCGGGCCCCAAAGCCGAACCCTGTCCCCGTTCCTCTACCGATGCTGGAGGCATCTCGGCCAAAGCCCACGAATATCAAACCTAGGGTATCTGGTCCTGTAATATC
TGTTTCTGGTGCATGGAATGGAGTAAGCCAGTCTACCAGAATCCAAGCATCAACAATTGGCGCCGTCCGTGGGAAACGATACAAGATTAAGCCAAGAACAGAGACGAAGC
TCAAATCCAAGGAAGGTGGGAATGAAGCGAGTGGATACGAACTTAGAGGTCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCACGCGCTGTACGTTTGGTCAGAGGGGAACGCGCTTCTAGCAGGTGCAGGGTGGCACCGCCACCTGCGGCTTTGGGCATGTCATCTGCGGCTGTGGAATGTGCCCG
TCGTACCGACAGCATGCGTTCCCAACTGCAACTACGTATGGGGTGTAATCGCCTGTGGTCCCCTGTTCCTTCTAGGGCGGACATGTGCGAGATCGTAGTATGGGACTCCT
TGACTTCAATGACCACGGATCATGCTATGGAGGATCATTTGAAGGCAATGCGCACCATCATCCCAGTAATGCTCTTTAAATGTGATGTTATGAAAGTCCGACCCAATCTA
CTGATCCAACCATGGTGGATTCGACGAGTTACGTCAGCACCACAGCAGATTGGCGGTGGTGACTGTGGGATTTTCGGCGTCAAGTATTTTGAGTACGATGTAACCGGGCA
GGGGAGACAAGGTCGTTGGGGCGGTTACGACCAGCGGTTGTGGCTGTCGGATCCCCCAGGTGTCAAGTGGGCTTACCGTGGAAAGAATAATTGGAAATCAATGGATGATT
CTAGGGAAGAAAGAGATGTGGTGTGGACTCTATCACTTCTGTCGGACACCATGGGCAACGACTGCTACATACTTGTAGATACAGACCCCGTAGATAGTTACCCATATGGG
GTCGTTCGGGCCCCAAAGCCGAACCCTGTCCCCGTTCCTCTACCGATGCTGGAGGCATCTCGGCCAAAGCCCACGAATATCAAACCTAGGGTATCTGGTCCTGTAATATC
TGTTTCTGGTGCATGGAATGGAGTAAGCCAGTCTACCAGAATCCAAGCATCAACAATTGGCGCCGTCCGTGGGAAACGATACAAGATTAAGCCAAGAACAGAGACGAAGC
TCAAATCCAAGGAAGGTGGGAATGAAGCGAGTGGATACGAACTTAGAGGTCAATAG
Protein sequenceShow/hide protein sequence
MSRAVRLVRGERASSRCRVAPPPAALGMSSAAVECARRTDSMRSQLQLRMGCNRLWSPVPSRADMCEIVVWDSLTSMTTDHAMEDHLKAMRTIIPVMLFKCDVMKVRPNL
LIQPWWIRRVTSAPQQIGGGDCGIFGVKYFEYDVTGQGRQGRWGGYDQRLWLSDPPGVKWAYRGKNNWKSMDDSREERDVVWTLSLLSDTMGNDCYILVDTDPVDSYPYG
VVRAPKPNPVPVPLPMLEASRPKPTNIKPRVSGPVISVSGAWNGVSQSTRIQASTIGAVRGKRYKIKPRTETKLKSKEGGNEASGYELRGQ