; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g18510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g18510
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr2:13820797..13825591
RNA-Seq ExpressionMoc02g18510
SyntenyMoc02g18510
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.3e-9053.87Show/hide
Query:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
        MN+SIVQL ASEKLNG NYS WK+NLNTILVVDDLRFVLTEECPQ PA NANR VREA+DRWVKANDKARVYILASMTDVLAKKH+ + TAK IMDSL+ 
Subjt:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA

Query:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEA------------------------------------------------
        MFG+PS +LRHEA+K++Y + MKEGTSVREHVLDMM+HFN AEVN   IDEA                                                
Subjt:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEA------------------------------------------------

Query:  ---------------------------------------------------------------------------AEKATTGKYDLLVVETCLVEYDTST
                                                                                   AEKAT GKYDLLVVETCLVE D ST
Subjt:  ---------------------------------------------------------------------------AEKATTGKYDLLVVETCLVEYDTST

Query:  WILDSGATNHICFSFQETSSWKKLEEGEIILKVGTGDVVSAKAMGDLKLFFDDRYILLENVL
        WILDSGATNHICFSFQETSSWKKL+EGEI LKVGTG+VVSA+A+GDL LFF DRY++L++VL
Subjt:  WILDSGATNHICFSFQETSSWKKLEEGEIILKVGTGDVVSAKAMGDLKLFFDDRYILLENVL

KAA0048103.1 gag/pol protein [Cucumis melo var. makuwa]7.3e-8651.1Show/hide
Query:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
        MNS IVQL ASEKLN  NY+TWK+NLNTILVVDDLRFVLTEECPQTPA+NANR  REA+DRW+KAN+KARVYILASM+DVLAKKHE L TAKEIMDSLK 
Subjt:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA

Query:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEA------------------------------------------------
        MFG+P  +LRH+A+KY+Y + MKEGTS+REHVL MM+HFN AEVN   IDEA                                                
Subjt:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEA------------------------------------------------

Query:  -----------------------------------------------------------------------------AEKATTGKYDLLVVETCLVEYDT
                                                                                     AEK    KYDLLV+ETCLVE + 
Subjt:  -----------------------------------------------------------------------------AEKATTGKYDLLVVETCLVEYDT

Query:  STWILDSGATNHICFSFQETSSWKKLEEGEIILKVGTGDVVSAKAMGDLKLFFDDRYILLENVL
        STWILDSGATNHICFSFQE SSWK L EG+I LKVGTG++VSAK +GDLKLFF+DRYI+L+NVL
Subjt:  STWILDSGATNHICFSFQETSSWKKLEEGEIILKVGTGDVVSAKAMGDLKLFFDDRYILLENVL

XP_022143540.1 uncharacterized protein LOC111013417 [Momordica charantia]2.2e-82100Show/hide
Query:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
        MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
Subjt:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA

Query:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG
        MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG
Subjt:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG

XP_022157632.1 uncharacterized protein LOC111024294 [Momordica charantia]7.9e-8097.48Show/hide
Query:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
        MNSSIVQL ASEKLNGVNYSTWKNNLNTILVVDDL+FVLTEECPQTPA NANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
Subjt:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA

Query:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG
        MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVN ATIDEAAEKATTG
Subjt:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia]2.1e-8098.11Show/hide
Query:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
        MNSSIVQL ASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPA NANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
Subjt:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA

Query:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG
        MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVN ATIDEAAEKATTG
Subjt:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG

TrEMBL top hitse value%identityAlignment
A0A5A7TWX1 Gag/pol protein3.6e-8651.1Show/hide
Query:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
        MNS IVQL ASEKLN  NY+TWK+NLNTILVVDDLRFVLTEECPQTPA+NANR  REA+DRW+KAN+KARVYILASM+DVLAKKHE L TAKEIMDSLK 
Subjt:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA

Query:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEA------------------------------------------------
        MFG+P  +LRH+A+KY+Y + MKEGTS+REHVL MM+HFN AEVN   IDEA                                                
Subjt:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEA------------------------------------------------

Query:  -----------------------------------------------------------------------------AEKATTGKYDLLVVETCLVEYDT
                                                                                     AEK    KYDLLV+ETCLVE + 
Subjt:  -----------------------------------------------------------------------------AEKATTGKYDLLVVETCLVEYDT

Query:  STWILDSGATNHICFSFQETSSWKKLEEGEIILKVGTGDVVSAKAMGDLKLFFDDRYILLENVL
        STWILDSGATNHICFSFQE SSWK L EG+I LKVGTG++VSAK +GDLKLFF+DRYI+L+NVL
Subjt:  STWILDSGATNHICFSFQETSSWKKLEEGEIILKVGTGDVVSAKAMGDLKLFFDDRYILLENVL

A0A6J1CP29 uncharacterized protein LOC1110134171.1e-82100Show/hide
Query:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
        MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
Subjt:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA

Query:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG
        MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG
Subjt:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG

A0A6J1DUZ9 uncharacterized protein LOC1110242943.8e-8097.48Show/hide
Query:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
        MNSSIVQL ASEKLNGVNYSTWKNNLNTILVVDDL+FVLTEECPQTPA NANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
Subjt:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA

Query:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG
        MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVN ATIDEAAEKATTG
Subjt:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG

A0A6J1DXQ5 uncharacterized protein LOC1110244571.0e-8098.11Show/hide
Query:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
        MNSSIVQL ASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPA NANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
Subjt:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA

Query:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG
        MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVN ATIDEAAEKATTG
Subjt:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTG

E2GK51 Gag/pol protein (Fragment)6.3e-9153.87Show/hide
Query:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA
        MN+SIVQL ASEKLNG NYS WK+NLNTILVVDDLRFVLTEECPQ PA NANR VREA+DRWVKANDKARVYILASMTDVLAKKH+ + TAK IMDSL+ 
Subjt:  MNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKA

Query:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEA------------------------------------------------
        MFG+PS +LRHEA+K++Y + MKEGTSVREHVLDMM+HFN AEVN   IDEA                                                
Subjt:  MFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEA------------------------------------------------

Query:  ---------------------------------------------------------------------------AEKATTGKYDLLVVETCLVEYDTST
                                                                                   AEKAT GKYDLLVVETCLVE D ST
Subjt:  ---------------------------------------------------------------------------AEKATTGKYDLLVVETCLVEYDTST

Query:  WILDSGATNHICFSFQETSSWKKLEEGEIILKVGTGDVVSAKAMGDLKLFFDDRYILLENVL
        WILDSGATNHICFSFQETSSWKKL+EGEI LKVGTG+VVSA+A+GDL LFF DRY++L++VL
Subjt:  WILDSGATNHICFSFQETSSWKKLEEGEIILKVGTGDVVSAKAMGDLKLFFDDRYILLENVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTCTTCCATTAGGTCGCACCGTGAGAGTCCTATAATGCGCCTGCATGTCTGCCCTGGAGCGACCTTACCAATCGGAGGGTGCATGAATAGCTCAATAGTTCAACT
CTTCGCTTCGGAAAAACTAAACGGGGTCAATTATTCAACGTGGAAAAACAATCTCAATACGATACTAGTCGTCGATGACCTACGATTCGTCTTAACTGAGGAATGTCCAC
AAACCCCTGCCGCAAATGCCAACCGAAATGTTCGGGAAGCATTTGATCGATGGGTCAAGGCCAACGATAAGGCCCGTGTCTACATTCTTGCCAGCATGACTGATGTATTG
GCAAAGAAACATGAACCCTTGATGACTGCAAAGGAAATCATGGATTCATTAAAGGCGATGTTTGGGGAACCTTCATCGACCTTGAGGCACGAGGCACTTAAATATGTTTA
CAATGAGCATATGAAGGAGGGGACCTCTGTTAGAGAACATGTCCTGGACATGATGGTCCACTTCAATACTGCTGAAGTGAACGAGGCCACCATTGATGAAGCAGCAGAGA
AAGCAACGACAGGTAAATACGATTTACTAGTTGTTGAAACATGTTTAGTGGAGTATGATACTTCCACCTGGATACTTGATTCAGGAGCCACTAACCATATTTGTTTCTCA
TTTCAGGAAACTAGTTCCTGGAAGAAGCTGGAAGAAGGCGAGATAATTCTCAAAGTTGGAACTGGAGATGTTGTCTCAGCCAAAGCAATGGGAGATTTAAAGTTGTTTTT
TGACGATAGATACATTTTACTAGAAAATGTTTTATCTGCATGGGTGAGAGCAGCTCAACAGCGCTGGCTCAATAAGCCTCCCATTTCAGGGGTAAGACCCGGTAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCTCTTCCATTAGGTCGCACCGTGAGAGTCCTATAATGCGCCTGCATGTCTGCCCTGGAGCGACCTTACCAATCGGAGGGTGCATGAATAGCTCAATAGTTCAACT
CTTCGCTTCGGAAAAACTAAACGGGGTCAATTATTCAACGTGGAAAAACAATCTCAATACGATACTAGTCGTCGATGACCTACGATTCGTCTTAACTGAGGAATGTCCAC
AAACCCCTGCCGCAAATGCCAACCGAAATGTTCGGGAAGCATTTGATCGATGGGTCAAGGCCAACGATAAGGCCCGTGTCTACATTCTTGCCAGCATGACTGATGTATTG
GCAAAGAAACATGAACCCTTGATGACTGCAAAGGAAATCATGGATTCATTAAAGGCGATGTTTGGGGAACCTTCATCGACCTTGAGGCACGAGGCACTTAAATATGTTTA
CAATGAGCATATGAAGGAGGGGACCTCTGTTAGAGAACATGTCCTGGACATGATGGTCCACTTCAATACTGCTGAAGTGAACGAGGCCACCATTGATGAAGCAGCAGAGA
AAGCAACGACAGGTAAATACGATTTACTAGTTGTTGAAACATGTTTAGTGGAGTATGATACTTCCACCTGGATACTTGATTCAGGAGCCACTAACCATATTTGTTTCTCA
TTTCAGGAAACTAGTTCCTGGAAGAAGCTGGAAGAAGGCGAGATAATTCTCAAAGTTGGAACTGGAGATGTTGTCTCAGCCAAAGCAATGGGAGATTTAAAGTTGTTTTT
TGACGATAGATACATTTTACTAGAAAATGTTTTATCTGCATGGGTGAGAGCAGCTCAACAGCGCTGGCTCAATAAGCCTCCCATTTCAGGGGTAAGACCCGGTAGATAA
Protein sequenceShow/hide protein sequence
MVSSIRSHRESPIMRLHVCPGATLPIGGCMNSSIVQLFASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAANANRNVREAFDRWVKANDKARVYILASMTDVL
AKKHEPLMTAKEIMDSLKAMFGEPSSTLRHEALKYVYNEHMKEGTSVREHVLDMMVHFNTAEVNEATIDEAAEKATTGKYDLLVVETCLVEYDTSTWILDSGATNHICFS
FQETSSWKKLEEGEIILKVGTGDVVSAKAMGDLKLFFDDRYILLENVLSAWVRAAQQRWLNKPPISGVRPGR