CuGenDBv2

Moc02g19310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g19310
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag-protease polyprotein
Genome location: chr2:14370338..14370712
RNA-Seq Expression: Moc02g19310
Synteny: Moc02g19310
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
KAA0032356.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 1.6e-13; %identity: 61.29
Query:  ESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKD
        + +S EAK+L+DF+KY   +FDE   DP  A+ WLSS+ETIFRYM+CP++QK+QCAVFML D
Subjt:  ESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKD

XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]; e value: 4.3e-27; %identity: 78.41
Query:  METLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVAF
        METLQTLVQT VSNQM QLTQ+R S+SIEAKYL+DFKKY   SFD L  DPMLAEAWLS +ETIFRYMRC +EQK+QC VFMLKD AF
Subjt:  METLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVAF

XP_022928602.1 uncharacterized protein LOC111435460 [Cucurbita moschata]; e value: 9.3e-14; %identity: 45.92
Query:  PEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVAFY
        P P +V L  E  Q L+Q   +NQ  +   D  SM+ E KYL+DF+KY    F+    DP+L E+W+ SIETIF +M CP++QK++CA FMLK  A +
Subjt:  PEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVAFY

XP_022967882.1 uncharacterized protein LOC111467258 isoform X1 [Cucurbita maxima]; e value: 1.9e-14; %identity: 45.28
Query:  PVGQTFGQPEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFML
        PV +    P P +V L ME  Q L+Q   +NQ      D  SM+ E KYL+DF+KY    F+    DP+L E+W+ SIETIF +M+CP++QK++CA FML
Subjt:  PVGQTFGQPEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFML

Query:  KDVAFY
        K  A +
Subjt:  KDVAFY

XP_022967883.1 uncharacterized protein LOC111467258 isoform X2 [Cucurbita maxima]; e value: 1.9e-14; %identity: 45.28
Query:  PVGQTFGQPEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFML
        PV +    P P +V L ME  Q L+Q   +NQ      D  SM+ E KYL+DF+KY    F+    DP+L E+W+ SIETIF +M+CP++QK++CA FML
Subjt:  PVGQTFGQPEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFML

Query:  KDVAFY
        K  A +
Subjt:  KDVAFY

TrEMBL top hits (e value, %identity, alignment)
A0A5A7SN11 Gag-protease polyprotein; e value: 7.7e-14; %identity: 62.9
Query:  ESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKD
        + +S EAK+L+DFKKY   +FD    DP  A+ WLSS+ETIFRYM+CPK+QK+QCAVFML D
Subjt:  ESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKD

A0A6J1DSJ6 uncharacterized protein LOC111023512; e value: 2.1e-27; %identity: 78.41
Query:  METLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVAF
        METLQTLVQT VSNQM QLTQ+R S+SIEAKYL+DFKKY   SFD L  DPMLAEAWLS +ETIFRYMRC +EQK+QC VFMLKD AF
Subjt:  METLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVAF

A0A6J1EKD9 uncharacterized protein LOC111435460; e value: 4.5e-14; %identity: 45.92
Query:  PEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVAFY
        P P +V L  E  Q L+Q   +NQ  +   D  SM+ E KYL+DF+KY    F+    DP+L E+W+ SIETIF +M CP++QK++CA FMLK  A +
Subjt:  PEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVAFY

A0A6J1HS13 uncharacterized protein LOC111467258 isoform X1; e value: 9.0e-15; %identity: 45.28
Query:  PVGQTFGQPEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFML
        PV +    P P +V L ME  Q L+Q   +NQ      D  SM+ E KYL+DF+KY    F+    DP+L E+W+ SIETIF +M+CP++QK++CA FML
Subjt:  PVGQTFGQPEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFML

Query:  KDVAFY
        K  A +
Subjt:  KDVAFY

A0A6J1HWE2 uncharacterized protein LOC111467258 isoform X2; e value: 9.0e-15; %identity: 45.28
Query:  PVGQTFGQPEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFML
        PV +    P P +V L ME  Q L+Q   +NQ      D  SM+ E KYL+DF+KY    F+    DP+L E+W+ SIETIF +M+CP++QK++CA FML
Subjt:  PVGQTFGQPEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFML

Query:  KDVAFY
        K  A +
Subjt:  KDVAFY

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
No hits found
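
The %identity values in the hit lists appear to be the number of identical positions in the alignment midline divided by the aligned length. In the midline, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The following is a minimal Python sketch of that calculation for the XP_022156662.1 hit. The helper name `percent_identity` is hypothetical, and the sketch assumes an ungapped alignment, so the query length is taken as the alignment length.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style midline (hypothetical helper).

    Letters in the midline mark identical residues; '+' and spaces are not counted.
    Assumption: the alignment is ungapped, so len(query) is the alignment length.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / len(query), 2)


# Query and midline copied from the XP_022156662.1 alignment in this record.
QUERY = "METLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVAF"
MIDLINE = "METLQTLVQT VSNQM QLTQ+R S+SIEAKYL+DFKKY   SFD L  DPMLAEAWLS +ETIFRYMRC +EQK+QC VFMLKD AF"

print(percent_identity(QUERY, MIDLINE))
```

For this hit the midline has 69 identical positions over 88 aligned residues, which gives the 78.41 reported for XP_022156662.1.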

Sequences
CDS sequence
ATGTCAGACGGAGAATCACCAGTGGGTCAAACTTTTGGACAGCCAGAGCCTACTATAGTATCCTTAATTATGGAGACCTTACAGACACTTGTTCAAACTGTCGTCTCTAA
CCAAATGGTTCAACTGACTCAAGATCGAGAGAGCATGTCAATAGAAGCTAAATATCTGCAGGATTTTAAGAAGTACTATCATCATTCTTTTGACGAACTATTTGCAGATC
CGATGTTGGCAGAGGCTTGGTTATCCTCAATAGAGACTATCTTTCGTTATATGAGGTGTCCGAAGGAGCAAAAATTGCAATGTGCTGTCTTTATGCTAAAAGATGTGGCC
TTTTATGGTGGGAGTTTGCAGAAAGGTTTATCGATATTAGTGTAG
mRNA sequence
ATGTCAGACGGAGAATCACCAGTGGGTCAAACTTTTGGACAGCCAGAGCCTACTATAGTATCCTTAATTATGGAGACCTTACAGACACTTGTTCAAACTGTCGTCTCTAA
CCAAATGGTTCAACTGACTCAAGATCGAGAGAGCATGTCAATAGAAGCTAAATATCTGCAGGATTTTAAGAAGTACTATCATCATTCTTTTGACGAACTATTTGCAGATC
CGATGTTGGCAGAGGCTTGGTTATCCTCAATAGAGACTATCTTTCGTTATATGAGGTGTCCGAAGGAGCAAAAATTGCAATGTGCTGTCTTTATGCTAAAAGATGTGGCC
TTTTATGGTGGGAGTTTGCAGAAAGGTTTATCGATATTAGTGTAG
Protein sequence
MSDGESPVGQTFGQPEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVA
FYGGSLQKGLSILV
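
As a consistency check, the CDS in this record can be translated and compared with the listed protein sequence. The following is a minimal Python sketch. The helper name `translate` is hypothetical, and the use of the standard genetic code (NCBI translation table 1) is an assumption, though a reasonable one for a plant nuclear gene.

```python
from itertools import product

# Standard genetic code (NCBI table 1), amino acids listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon (hypothetical helper)."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# CDS and protein copied from this record, with line breaks removed.
CDS = (
    "ATGTCAGACGGAGAATCACCAGTGGGTCAAACTTTTGGACAGCCAGAGCCTACTATAGTATCCTTAATTATGGAGACCTTACAGACACTTGTTCAAACTGTCGTCTCTAA"
    "CCAAATGGTTCAACTGACTCAAGATCGAGAGAGCATGTCAATAGAAGCTAAATATCTGCAGGATTTTAAGAAGTACTATCATCATTCTTTTGACGAACTATTTGCAGATC"
    "CGATGTTGGCAGAGGCTTGGTTATCCTCAATAGAGACTATCTTTCGTTATATGAGGTGTCCGAAGGAGCAAAAATTGCAATGTGCTGTCTTTATGCTAAAAGATGTGGCC"
    "TTTTATGGTGGGAGTTTGCAGAAAGGTTTATCGATATTAGTGTAG"
)
PROTEIN = (
    "MSDGESPVGQTFGQPEPTIVSLIMETLQTLVQTVVSNQMVQLTQDRESMSIEAKYLQDFKKYYHHSFDELFADPMLAEAWLSSIETIFRYMRCPKEQKLQCAVFMLKDVA"
    "FYGGSLQKGLSILV"
)

print(len(CDS), translate(CDS) == PROTEIN)
```

The CDS is 375 nt long (124 codons plus a stop codon), which agrees with the span chr2:14370338..14370712 if those coordinates are 1-based and inclusive. The mRNA sequence listed for this gene is identical to the CDS.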