; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g31230 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g31230
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr6:23499181..23507573
RNA-Seq ExpressionMoc06g31230
SyntenyMoc06g31230
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]5.5e-6472.34Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SGT+ +PSSSG+K +KKKK  G+G+K + AA A+  K   A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW

Query:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK
        KRNCPKYLAEKKKA + KYDLLVLETCLVENDDSAWI+DSGATNHVC SFQGISSWRQL+ GEMT++VGTG VVSA+AVG L+ C  K
Subjt:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]7.2e-6472.43Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SGT+ +PSSSG+K +KKKK  G+G+K + AA A+  K   A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW

Query:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELKCY
        KRNCPKYLAEKKKA + KYDLLVLETCLVENDDSAWI+DSGATNHVC SFQGISSW+QL+ GEMT++VGTG VVSA+AVG L+ Y
Subjt:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELKCY

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]9.4e-6472.34Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SGT+ +PSSSG+K +KKKK  G+G+K + AA A+  K   A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW

Query:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK
        KRNCPKYLAEKKKA + KYDLLVLETCLVENDDSAWI+DSGATNHVC SFQGISSWRQL+ GEMT++VGTG VVSA+AVG L+ C  K
Subjt:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]5.5e-6472.34Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SGT+ +PSSSG+K +KKKK  G+G+K + AA A+  K   A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW

Query:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK
        KRNCPKYLAEKKKA + KYDLLVLETCLVENDDSAWI+DSGATNHVC SFQGISSWRQL+ GEMT++VGTG VVSA+AVG L+ C  K
Subjt:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK

TYK26319.1 gag/pol protein [Cucumis melo var. makuwa]5.5e-6472.34Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SGT+ +PSSSG+K +KKKK  G+G+K + AA A+  K   A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW

Query:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK
        KRNCPKYLAEKKKA + KYDLLVLETCLVENDDSAWI+DSGATNHVC SFQGISSWRQL+ GEMT++VGTG VVSA+AVG L+ C  K
Subjt:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein4.5e-6472.34Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SGT+ +PSSSG+K +KKKK  G+G+K + AA A+  K   A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW

Query:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK
        KRNCPKYLAEKKKA + KYDLLVLETCLVENDDSAWI+DSGATNHVC SFQGISSWRQL+ GEMT++VGTG VVSA+AVG L+ C  K
Subjt:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK

A0A5A7TU93 Gag/pol protein3.5e-6472.43Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SGT+ +PSSSG+K +KKKK  G+G+K + AA A+  K   A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW

Query:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELKCY
        KRNCPKYLAEKKKA + KYDLLVLETCLVENDDSAWI+DSGATNHVC SFQGISSW+QL+ GEMT++VGTG VVSA+AVG L+ Y
Subjt:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELKCY

A0A5D3CPJ6 Gag/pol protein2.7e-6472.34Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SGT+ +PSSSG+K +KKKK  G+G+K + AA A+  K   A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW

Query:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK
        KRNCPKYLAEKKKA + KYDLLVLETCLVENDDSAWI+DSGATNHVC SFQGISSWRQL+ GEMT++VGTG VVSA+AVG L+ C  K
Subjt:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK

A0A5D3CSZ6 Gag/pol protein2.7e-6472.34Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SGT+ +PSSSG+K +KKKK  G+G+K + AA A+  K   A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW

Query:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK
        KRNCPKYLAEKKKA + KYDLLVLETCLVENDDSAWI+DSGATNHVC SFQGISSWRQL+ GEMT++VGTG VVSA+AVG L+ C  K
Subjt:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK

A0A5D3DS88 Gag/pol protein2.7e-6472.34Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SGT+ +PSSSG+K +KKKK  G+G+K + AA A+  K   A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHW

Query:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK
        KRNCPKYLAEKKKA + KYDLLVLETCLVENDDSAWI+DSGATNHVC SFQGISSWRQL+ GEMT++VGTG VVSA+AVG L+ C  K
Subjt:  KRNCPKYLAEKKKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELK-CYHK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAAGCTGGAGTACACTCTTACCACGCTCCTAAACGAGCTGCAGACCTACCAATCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAA
GAGGTTCAATAGAGGATCGTCCTCTGGAACCAGGTATGTGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAAGGGTCTAAACCTGACTCAG
CTGCTGTTGCCCAGAAAGGTAAGGTCAATGTTGCAGAGAAAGGAAAGTGTTTCCACCGCAATATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAG
AAGAAAGCCAACGAAGTTAAATATGATTTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTT
TTCATTTCAGGGAATTAGTTCCTGGAGGCAGCTTGACGCCGGAGAGATGACTCTCAAAGTCGGAACGGGAGAGGTTGTCTCAGCTGTGGCGGTAGGGGAGCTCAAGTGCT
ACCACAAGCACGATCTCGAGACTCAAGAGGATAGCGAGAAAGATCCGGTGGTGGTGTTCGAGGGGAACTTACTGAAGAAACGTTCTTCAAAGAAAACAGCGAAGAAGACG
AAGCAGACTGCGCAGACAGCGCCATGGCGCTATGCAGCAGCGCCATGGCGTTGCGGGATAGCACATAGCGCCACGACGCTGCACTGTAGCGCCGGGCGCCGCGGCGCTGC
CCTTAGGTGCCGAGGCGCTGTCCCGGGTGTTTTCGACGCGGTTCCGAAGCTCCGGTTAGCGGTCTTTGGATCTTCGATGTCGTCATCAGGTATCACAACTTGGGTTTACA
ACCCACATAGCGGTCAGTGGAAACATTTTTCCTTGTCTGCAACTTTGGCCTTGCCTTTCTTGGCAGCAGTAGTAGTAGAGTCAAGTTTGGGCCCTTTACCAGCAGCCTTT
ATGTTCTCGAAAGTCTTACTTCCAGAAAAAGAGGGCTCAAACTTAGTTCAAGAGGAAACTCTTCGAAAGAGATTCCAAAATGAAGCTGACCTGACTTTTCTCGTCTATGA
CGGTCCCGTTCGTCTGTGCCACATTGAAGTGGATCATCAGCGAGTCCATGATCTCCTTGGCGGTGACCATGTTCTCGTGGTTCTTAACAAGCACATCAGATATGTTCGCC
AATATGTAGACCTTTGTCTTGTCGTTGGCCTCGATCCATCGGTCATAGGCGTTGTGCATCACTGTAGTGGCGTTAGGCGCATGAGCTTGAGGATAATCCTCTTGCAAGAT
GAACCTAAGATCATGTATCACGAGTATAGTGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATAAGCTGGAGTACACTCTTACCACGCTCCTAAACGAGCTGCAGACCTACCAATCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAA
GAGGTTCAATAGAGGATCGTCCTCTGGAACCAGGTATGTGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAAGGGTCTAAACCTGACTCAG
CTGCTGTTGCCCAGAAAGGTAAGGTCAATGTTGCAGAGAAAGGAAAGTGTTTCCACCGCAATATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAG
AAGAAAGCCAACGAAGTTAAATATGATTTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTT
TTCATTTCAGGGAATTAGTTCCTGGAGGCAGCTTGACGCCGGAGAGATGACTCTCAAAGTCGGAACGGGAGAGGTTGTCTCAGCTGTGGCGGTAGGGGAGCTCAAGTGCT
ACCACAAGCACGATCTCGAGACTCAAGAGGATAGCGAGAAAGATCCGGTGGTGGTGTTCGAGGGGAACTTACTGAAGAAACGTTCTTCAAAGAAAACAGCGAAGAAGACG
AAGCAGACTGCGCAGACAGCGCCATGGCGCTATGCAGCAGCGCCATGGCGTTGCGGGATAGCACATAGCGCCACGACGCTGCACTGTAGCGCCGGGCGCCGCGGCGCTGC
CCTTAGGTGCCGAGGCGCTGTCCCGGGTGTTTTCGACGCGGTTCCGAAGCTCCGGTTAGCGGTCTTTGGATCTTCGATGTCGTCATCAGGTATCACAACTTGGGTTTACA
ACCCACATAGCGGTCAGTGGAAACATTTTTCCTTGTCTGCAACTTTGGCCTTGCCTTTCTTGGCAGCAGTAGTAGTAGAGTCAAGTTTGGGCCCTTTACCAGCAGCCTTT
ATGTTCTCGAAAGTCTTACTTCCAGAAAAAGAGGGCTCAAACTTAGTTCAAGAGGAAACTCTTCGAAAGAGATTCCAAAATGAAGCTGACCTGACTTTTCTCGTCTATGA
CGGTCCCGTTCGTCTGTGCCACATTGAAGTGGATCATCAGCGAGTCCATGATCTCCTTGGCGGTGACCATGTTCTCGTGGTTCTTAACAAGCACATCAGATATGTTCGCC
AATATGTAGACCTTTGTCTTGTCGTTGGCCTCGATCCATCGGTCATAGGCGTTGTGCATCACTGTAGTGGCGTTAGGCGCATGAGCTTGAGGATAATCCTCTTGCAAGAT
GAACCTAAGATCATGTATCACGAGTATAGTGTTTAG
Protein sequenceShow/hide protein sequence
MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSKRFNRGSSSGTRYVPSSSGSKTFKKKKAAGKGSKPDSAAVAQKGKVNVAEKGKCFHRNMDGHWKRNCPKYLAEK
KKANEVKYDLLVLETCLVENDDSAWILDSGATNHVCFSFQGISSWRQLDAGEMTLKVGTGEVVSAVAVGELKCYHKHDLETQEDSEKDPVVVFEGNLLKKRSSKKTAKKT
KQTAQTAPWRYAAAPWRCGIAHSATTLHCSAGRRGAALRCRGAVPGVFDAVPKLRLAVFGSSMSSSGITTWVYNPHSGQWKHFSLSATLALPFLAAVVVESSLGPLPAAF
MFSKVLLPEKEGSNLVQEETLRKRFQNEADLTFLVYDGPVRLCHIEVDHQRVHDLLGGDHVLVVLNKHIRYVRQYVDLCLVVGLDPSVIGVVHHCSGVRRMSLRIILLQD
EPKIMYHEYSV