CuGenDBv2

Moc01g15940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g15940
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr1:10288078..10290859
RNA-Seq Expression: Moc01g15940
Synteny: Moc01g15940
Gene Ontology terms:
  GO:0006807 - nitrogen compound metabolic process (biological process)
  GO:0043170 - macromolecule metabolic process (biological process)
  GO:0044238 - primary metabolic process (biological process)
  GO:0005488 - binding (molecular function)
InterPro domains: NA
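The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field could be parsed as follows. The names `Region` and `parse_location` are hypothetical helpers for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


# Matches strings such as "chr1:10288078..10290859".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string into a Region (hypothetical helper)."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        # Normalise so that start <= end regardless of how it was written.
        start, end = end, start
    return Region(match["seqid"], start, end)


region = parse_location("chr1:10288078..10290859")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the interval for this gene spans 2782 bases.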


Homology
GenBank top hits | e value | % identity | Alignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] | 3.1e-71 | 59.66
Query:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LA  KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK---
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVI+E SQV+FILES+P+SFL FRSNA+MNK+ YTLTTLLNEL T++SLMK   
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK---

Query:  ------------NFQKAAGKGSKPDSTTAAAKKGKAKE
                     F + +  G+K   +++  KK K K+
Subjt:  ------------NFQKAAGKGSKPDSTTAAAKKGKAKE

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | 3.1e-71 | 59.66
Query:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LA  KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK---
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVI+E SQV+FILES+P+SFL FRSNA+MNK+ YTLTTLLNEL T++SLMK   
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK---

Query:  ------------NFQKAAGKGSKPDSTTAAAKKGKAKE
                     F + +  G+K   +++  KK K K+
Subjt:  ------------NFQKAAGKGSKPDSTTAAAKKGKAKE

XP_022158062.1 uncharacterized protein LOC111024637 [Momordica charantia] | 3.9e-74 | 78.46
Query:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MSTSII LL  QKLNDENYKQWKSN+NTIL+IDDLRFVLQEDCPQAP PNATVAVRN+YDRWIKANDKA+V ILASISDVLAKKHE++V  KEIMDSLQS
Subjt:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKL------EYTLTTLLNEL
        MFGQPSSQARHEAL  IYNSRMK+ SSVREHVLNLMVHFNVAESN  VI+EQSQV FILES+PK+FL F SNA ++ L      E TL   + E+
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKL------EYTLTTLLNEL

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia] | 1.5e-78 | 88.44
Query:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MS SIIALLA QKLN ENY+QWKSNLNTILVIDDLRFVLQEDCPQAP+ NATVAVRN YDRWIK+NDKA+VYILASISDVLAKKHEDTVT KEIMDSLQS
Subjt:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNA
        MFGQPS QARHEALKF+YNSRMKEGSSVREHVLNLMVHFNVAESNG VI+EQSQ +FILES+PK+FL F SNA
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNA

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia] | 9.7e-73 | 64.13
Query:  YGSMSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDS
        + +MSTS I LLA+ KLN +NY  WKSNLNTILVIDDLRFVL E+CP AP PNA   VR+ YDRW+KAN+KA+VYILASIS+VL+KKHE   T +EIMDS
Subjt:  YGSMSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDS

Query:  LQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK
        LQ++FGQPS+   H+A+K++YN RMKEGSSVREHVLN+MVHFNVAE N  V+NE SQV FI++S+PKS+  F+ NA+MNK+EY+LTTLLNEL  Y+SL+K
Subjt:  LQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK

Query:  NFQKAAGKGSKPDSTTAAAKKGK
        N      KG + ++  A   K K
Subjt:  NFQKAAGKGSKPDSTTAAAKKGK

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SMH8 Gag/pol protein | 1.5e-71 | 59.66
Query:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LA  KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK---
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVI+E SQV+FILES+P+SFL FRSNA+MNK+ YTLTTLLNEL T++SLMK   
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK---

Query:  ------------NFQKAAGKGSKPDSTTAAAKKGKAKE
                     F + +  G+K   +++  KK K K+
Subjt:  ------------NFQKAAGKGSKPDSTTAAAKKGKAKE

A0A5D3CPJ6 Gag/pol protein | 1.5e-71 | 59.66
Query:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LA  KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK---
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVI+E SQV+FILES+P+SFL FRSNA+MNK+ YTLTTLLNEL T++SLMK   
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK---

Query:  ------------NFQKAAGKGSKPDSTTAAAKKGKAKE
                     F + +  G+K   +++  KK K K+
Subjt:  ------------NFQKAAGKGSKPDSTTAAAKKGKAKE

A0A6J1DW68 uncharacterized protein LOC111024637 | 1.9e-74 | 78.46
Query:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MSTSII LL  QKLNDENYKQWKSN+NTIL+IDDLRFVLQEDCPQAP PNATVAVRN+YDRWIKANDKA+V ILASISDVLAKKHE++V  KEIMDSLQS
Subjt:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKL------EYTLTTLLNEL
        MFGQPSSQARHEAL  IYNSRMK+ SSVREHVLNLMVHFNVAESN  VI+EQSQV FILES+PK+FL F SNA ++ L      E TL   + E+
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKL------EYTLTTLLNEL

A0A6J1DWG6 uncharacterized protein LOC111025021 | 4.7e-73 | 64.13
Query:  YGSMSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDS
        + +MSTS I LLA+ KLN +NY  WKSNLNTILVIDDLRFVL E+CP AP PNA   VR+ YDRW+KAN+KA+VYILASIS+VL+KKHE   T +EIMDS
Subjt:  YGSMSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDS

Query:  LQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK
        LQ++FGQPS+   H+A+K++YN RMKEGSSVREHVLN+MVHFNVAE N  V+NE SQV FI++S+PKS+  F+ NA+MNK+EY+LTTLLNEL  Y+SL+K
Subjt:  LQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMK

Query:  NFQKAAGKGSKPDSTTAAAKKGK
        N      KG + ++  A   K K
Subjt:  NFQKAAGKGSKPDSTTAAAKKGK

A0A6J1DWL0 uncharacterized protein LOC111024734 | 7.5e-79 | 88.44
Query:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MS SIIALLA QKLN ENY+QWKSNLNTILVIDDLRFVLQEDCPQAP+ NATVAVRN YDRWIK+NDKA+VYILASISDVLAKKHEDTVT KEIMDSLQS
Subjt:  MSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVYDRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNA
        MFGQPS QARHEALKF+YNSRMKEGSSVREHVLNLMVHFNVAESNG VI+EQSQ +FILES+PK+FL F SNA
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPKSFLSFRSNA

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGCATCGCAGTCGAGTCAGGTCTTCTGAAGTTCTCGGTCGGGGAGTCGAGAACTGCTCGGCGCCAACCACTAGTCGGTCGGGGGTCGGCTTGGGTCACCAAGCA
CTGACCTCCTTTCTGTTATCGAATCTCGTCGGTGGAAGAGACCTAGTTGGCAGAAGATGCTTGGTCGGCAGAATAGGTCTGGTTGGTGGAAGAGACCTGGTCAAC
AGTAGAGACTTGTTCGGCGAAAGATATCTGGTTGGCAGAGACTTGCTAGGCCGAGACATGGTCATGCCTTGGATTTGGTCATGCCCTAGAGTTTGGTCATGCCCT
GGATCAGGTTGGGTACCTTATCCTGGCAACACTATGGATACGGACAACTCTGTAAATGTTAGAGACGACTCGATGCAGGTCATTGTGAGCTCCATGCTCGGCTTC
GTGTCGCCTTGGGCGCGGACTCCCTACGGAAGCATGTCTACTTCTATTATTGCACTCCTAGCCACGCAAAAACTTAACGACGAGAATTACAAACAGTGGAAATCG
AATCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTATGCCTAACGCCACTGTGGCGGTGCGCAACGTCTAT
GACAGATGGATCAAGGCCAATGACAAGGCCCAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGAAGCACGAGGACACGGTCACTGCTAAGGAGATC
ATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAAGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTACGA
GAACACGTTCTCAACCTGATGGTCCACTTCAACGTGGCTGAGTCGAACGGGGCTGTCATAAACGAGCAGAGTCAGGTCAACTTCATTCTGGAATCTATTCCGAAG
AGTTTCCTGTCATTCCGTAGCAATGCGATTATGAATAAGCTGGAGTACACTCTTACCACGCTTTTAAACGAGCTGCCGACCTACCAGTCTCTTATGAAAAACTTT
CAAAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCCACTACTGCCGCTGCCAAGAAAGGCAAGGCCAAGGAGCCACTAATCACGTTTGTTCTTCATTTCAGGGAA
TTAGTTTCTGGAGGTAGCTTGATGCCGGAGAGATGA
mRNA sequence
ATGCATCGCAGTCGAGTCAGGTCTTCTGAAGTTCTCGGTCGGGGAGTCGAGAACTGCTCGGCGCCAACCACTAGTCGGTCGGGGGTCGGCTTGGGTCACCAAGCA
CTGACCTCCTTTCTGTTATCGAATCTCGTCGGTGGAAGAGACCTAGTTGGCAGAAGATGCTTGGTCGGCAGAATAGGTCTGGTTGGTGGAAGAGACCTGGTCAAC
AGTAGAGACTTGTTCGGCGAAAGATATCTGGTTGGCAGAGACTTGCTAGGCCGAGACATGGTCATGCCTTGGATTTGGTCATGCCCTAGAGTTTGGTCATGCCCT
GGATCAGGTTGGGTACCTTATCCTGGCAACACTATGGATACGGACAACTCTGTAAATGTTAGAGACGACTCGATGCAGGTCATTGTGAGCTCCATGCTCGGCTTC
GTGTCGCCTTGGGCGCGGACTCCCTACGGAAGCATGTCTACTTCTATTATTGCACTCCTAGCCACGCAAAAACTTAACGACGAGAATTACAAACAGTGGAAATCG
AATCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTATGCCTAACGCCACTGTGGCGGTGCGCAACGTCTAT
GACAGATGGATCAAGGCCAATGACAAGGCCCAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGAAGCACGAGGACACGGTCACTGCTAAGGAGATC
ATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAAGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTACGA
GAACACGTTCTCAACCTGATGGTCCACTTCAACGTGGCTGAGTCGAACGGGGCTGTCATAAACGAGCAGAGTCAGGTCAACTTCATTCTGGAATCTATTCCGAAG
AGTTTCCTGTCATTCCGTAGCAATGCGATTATGAATAAGCTGGAGTACACTCTTACCACGCTTTTAAACGAGCTGCCGACCTACCAGTCTCTTATGAAAAACTTT
CAAAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCCACTACTGCCGCTGCCAAGAAAGGCAAGGCCAAGGAGCCACTAATCACGTTTGTTCTTCATTTCAGGGAA
TTAGTTTCTGGAGGTAGCTTGATGCCGGAGAGATGA
Protein sequence
MHRSRVRSSEVLGRGVENCSAPTTSRSGVGLGHQALTSFLLSNLVGGRDLVGRRCLVGRIGLVGGRDLVNSRDLFGERYLVGRDLLGRDMVMPWIWSCPRVWSCP
GSGWVPYPGNTMDTDNSVNVRDDSMQVIVSSMLGFVSPWARTPYGSMSTSIIALLATQKLNDENYKQWKSNLNTILVIDDLRFVLQEDCPQAPMPNATVAVRNVY
DRWIKANDKAQVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVINEQSQVNFILESIPK
SFLSFRSNAIMNKLEYTLTTLLNELPTYQSLMKNFQKAAGKGSKPDSTTAAAKKGKAKEPLITFVLHFRELVSGGSLMPER
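
The protein sequence above appears to be the conceptual translation of the CDS sequence. A minimal sketch of that check follows, assuming the standard genetic code (the record does not say which translation table was used). The function name `translate` is a hypothetical helper, and only the first 15 bases and the final line of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    # Codons containing ambiguous bases are reported as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First five codons and the last line of the CDS shown in this record.
print(translate("ATGCATCGCAGTCGA"))                       # MHRSR
print(translate("TTAGTTTCTGGAGGTAGCTTGATGCCGGAGAGATGA"))  # LVSGGSLMPER*
```

Both fragments agree with the start (MHRSR) and end (LVSGGSLMPER) of the listed protein sequence, with the final TGA codon read as a stop.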