; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g28210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g28210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionEnzymatic polyprotein
Genome locationchr11:20582390..20584898
RNA-Seq ExpressionMoc11g28210
SyntenyMoc11g28210
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038280.1 polyprotein [Cucumis melo var. makuwa]8.1e-2474.36Show/hide
Query:  QEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYNRSK
        Q IC+N+  ENKHTTKVIKD DYRKELGTFCKQYDLD GPK+E+KK+++ S+KRLFS+SK+KDPEF +RKRKYYN++K
Subjt:  QEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYNRSK

KAA0066178.1 hypothetical protein E6C27_scaffold21G001880 [Cucumis melo var. makuwa]2.4e-2883.33Show/hide
Query:  QEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYNRSK
        Q IC+N+C+ENKHTTKVIKD +YRKELG FCK+Y LD GPKDERKKR+KSSNKRLFSKSKSKDPEFPRRKRKYYN++K
Subjt:  QEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYNRSK

KAA0066201.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.2e-2755.22Show/hide
Query:  DKRSCYNT---YNDSSLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSK
        D+R    T   + D+S+  N+ T    +       +    Q IC+N+C ENKHTTKVIKD DYRKELGTFCKQY L  GPK+E+KK+KKSS+KRLFS+SK
Subjt:  DKRSCYNT---YNDSSLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSK

Query:  SKDPEFPRRKRKYYNRSKG--PSGNSKVLSKALR
        +KDPEFPRRKRKYYN+ K   PS  + V  K  R
Subjt:  SKDPEFPRRKRKYYNRSKG--PSGNSKVLSKALR

TYK01213.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.2e-2755.22Show/hide
Query:  DKRSCYNT---YNDSSLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSK
        D+R    T   + D+S+  N+ T    +       +    Q IC+N+C ENKHTTKVIKD DYRKELGTFCKQY L  GPK+E+KK+KKSS+KRLFS+SK
Subjt:  DKRSCYNT---YNDSSLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSK

Query:  SKDPEFPRRKRKYYNRSKG--PSGNSKVLSKALR
        +KDPEFPRRKRKYYN+ K   PS  + V  K  R
Subjt:  SKDPEFPRRKRKYYNRSKG--PSGNSKVLSKALR

XP_022151716.1 uncharacterized protein LOC111019629 [Momordica charantia]5.4e-2862.73Show/hide
Query:  SLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYN
        +++ N++T+   +       +    Q+ICVN+CLENKHT KVIK+PDYRKELGTFCKQY LD   ++ERKK+KKSSNKRLFSKSKSKD E PRRKRKYYN
Subjt:  SLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYN

Query:  RSKGPSGNSK
        R+KG    SK
Subjt:  RSKGPSGNSK

TrEMBL top hitse value%identityAlignment
A0A5A7T4I9 Polyprotein3.9e-2474.36Show/hide
Query:  QEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYNRSK
        Q IC+N+  ENKHTTKVIKD DYRKELGTFCKQYDLD GPK+E+KK+++ S+KRLFS+SK+KDPEF +RKRKYYN++K
Subjt:  QEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYNRSK

A0A5A7VEN9 Uncharacterized protein1.2e-2883.33Show/hide
Query:  QEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYNRSK
        Q IC+N+C+ENKHTTKVIKD +YRKELG FCK+Y LD GPKDERKKR+KSSNKRLFSKSKSKDPEFPRRKRKYYN++K
Subjt:  QEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYNRSK

A0A5A7VKK8 Enzymatic polyprotein5.8e-2855.22Show/hide
Query:  DKRSCYNT---YNDSSLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSK
        D+R    T   + D+S+  N+ T    +       +    Q IC+N+C ENKHTTKVIKD DYRKELGTFCKQY L  GPK+E+KK+KKSS+KRLFS+SK
Subjt:  DKRSCYNT---YNDSSLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSK

Query:  SKDPEFPRRKRKYYNRSKG--PSGNSKVLSKALR
        +KDPEFPRRKRKYYN+ K   PS  + V  K  R
Subjt:  SKDPEFPRRKRKYYNRSKG--PSGNSKVLSKALR

A0A5D3BN76 Enzymatic polyprotein5.8e-2855.22Show/hide
Query:  DKRSCYNT---YNDSSLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSK
        D+R    T   + D+S+  N+ T    +       +    Q IC+N+C ENKHTTKVIKD DYRKELGTFCKQY L  GPK+E+KK+KKSS+KRLFS+SK
Subjt:  DKRSCYNT---YNDSSLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSK

Query:  SKDPEFPRRKRKYYNRSKG--PSGNSKVLSKALR
        +KDPEFPRRKRKYYN+ K   PS  + V  K  R
Subjt:  SKDPEFPRRKRKYYNRSKG--PSGNSKVLSKALR

A0A6J1DFI7 uncharacterized protein LOC1110196292.6e-2862.73Show/hide
Query:  SLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYN
        +++ N++T+   +       +    Q+ICVN+CLENKHT KVIK+PDYRKELGTFCKQY LD   ++ERKK+KKSSNKRLFSKSKSKD E PRRKRKYYN
Subjt:  SLIFNNSTSNYAYGNDVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYN

Query:  RSKGPSGNSK
        R+KG    SK
Subjt:  RSKGPSGNSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTACTACCTGGGGAGAAATGGTGCGATCATTTCGAGACGACGGGGTGGCTGGTTGTTTTGCGGCGGGCCGCTGAGTGTTTGGCCACCTTCGACCAGGATGAG
GTTCCATCGGGAGGAGGAGAGAAGGATGTGCTCAGGGTTGATTTGGGTGTCTTCAAGAGATGTTCTCTTGAGAGGCTCGTTAATCATGGCGGAGATCGTCCCAGC
CTAGATCAGGAGAAGATGGTTGAGGATAAACGTAGTTGCTACAATACCTACAACGACTCAAGCCTCATCTTCAACAATTCTACCAGTAACTATGCATACGGAAAT
GATGTCGAAGCTCTATTAGGACTCCGATTTCAAGAGATATGCGTTAATATCTGCCTAGAGAATAAGCATACTACCAAAGTTATCAAAGATCCCGACTACCGTAAG
GAATTGGGAACTTTCTGCAAACAATACGATCTTGACTACGGACCTAAAGATGAAAGGAAGAAAAGAAAGAAATCTTCCAACAAACGACTCTTTAGCAAGAGTAAG
TCAAAGGATCCCGAATTTCCTCGGCGCAAGAGGAAATACTACAACAGGAGCAAGGGACCCAGTGGCAATTCCAAAGTGTTGAGCAAAGCTTTACGCATCAAATGG
TGGGAAAATTTTGACTACTCCTACCTAGAAGCTGAAAAGATAAAAATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTACTACCTGGGGAGAAATGGTGCGATCATTTCGAGACGACGGGGTGGCTGGTTGTTTTGCGGCGGGCCGCTGAGTGTTTGGCCACCTTCGACCAGGATGAG
GTTCCATCGGGAGGAGGAGAGAAGGATGTGCTCAGGGTTGATTTGGGTGTCTTCAAGAGATGTTCTCTTGAGAGGCTCGTTAATCATGGCGGAGATCGTCCCAGC
CTAGATCAGGAGAAGATGGTTGAGGATAAACGTAGTTGCTACAATACCTACAACGACTCAAGCCTCATCTTCAACAATTCTACCAGTAACTATGCATACGGAAAT
GATGTCGAAGCTCTATTAGGACTCCGATTTCAAGAGATATGCGTTAATATCTGCCTAGAGAATAAGCATACTACCAAAGTTATCAAAGATCCCGACTACCGTAAG
GAATTGGGAACTTTCTGCAAACAATACGATCTTGACTACGGACCTAAAGATGAAAGGAAGAAAAGAAAGAAATCTTCCAACAAACGACTCTTTAGCAAGAGTAAG
TCAAAGGATCCCGAATTTCCTCGGCGCAAGAGGAAATACTACAACAGGAGCAAGGGACCCAGTGGCAATTCCAAAGTGTTGAGCAAAGCTTTACGCATCAAATGG
TGGGAAAATTTTGACTACTCCTACCTAGAAGCTGAAAAGATAAAAATCTAG
Protein sequenceShow/hide protein sequence
MLLPGEKWCDHFETTGWLVVLRRAAECLATFDQDEVPSGGGEKDVLRVDLGVFKRCSLERLVNHGGDRPSLDQEKMVEDKRSCYNTYNDSSLIFNNSTSNYAYGN
DVEALLGLRFQEICVNICLENKHTTKVIKDPDYRKELGTFCKQYDLDYGPKDERKKRKKSSNKRLFSKSKSKDPEFPRRKRKYYNRSKGPSGNSKVLSKALRIKW
WENFDYSYLEAEKIKI