; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g00540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g00540
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr9:561406..568606
RNA-Seq ExpressionMoc09g00540
SyntenyMoc09g00540
Gene Ontology termsGO:0019538 - protein metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0050789 - regulation of biological process (biological process)
GO:0016020 - membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
GO:0140096 - catalytic activity, acting on a protein (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU44375.1 hypothetical protein TSUD_243070 [Trifolium subterraneum]9.4e-1657.14Show/hide
Query:  DLPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE
        +L L+EYI++FK L DKL A+ KP+ D  KVF +++GLG+KYKEFR A+LSK  YP+FNQF++SL+  EQ+ L EE+
Subjt:  DLPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE

PNX56937.1 hypothetical protein L195_g050142, partial [Trifolium pratense]7.2e-1659.21Show/hide
Query:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE
        L LEEYI++FK + DKLAA+ KPL D  KVF +++GLGNKYKEF+ A+LSK  YP F+QF++SL+  EQ+ L EE+
Subjt:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE

PNX82907.1 hypothetical protein L195_g038944, partial [Trifolium pratense]3.2e-1660.53Show/hide
Query:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE
        L L+EYI++FK + DKLAA+ KPL D  KVF V++GLGNKYK+FR A+LSK  YP+FNQF++SL+  EQ+ L EE+
Subjt:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE

RVX10186.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.7e-1550Show/hide
Query:  STVEVTSP--PSSDLPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE
        ST++  +P    S   L+EY++ FK + D LAA +KP+ D  KVF +A+GLG KY +FR AMLSK  YP++NQFVL+L+ HEQ+ + E E
Subjt:  STVEVTSP--PSSDLPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE

XP_022154021.1 uncharacterized protein LOC111021379 [Momordica charantia]6.9e-1964.56Show/hide
Query:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEEGKA
        L ++EYI++FK L D+L AMKKPLDD +KVF +ARGLG KYK+FRTAMLSK  YP++NQFVL+LKAH+Q    EEE ++
Subjt:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEEGKA

TrEMBL top hitse value%identityAlignment
A0A2K3JSC9 Uncharacterized protein (Fragment)3.5e-1659.21Show/hide
Query:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE
        L LEEYI++FK + DKLAA+ KPL D  KVF +++GLGNKYKEF+ A+LSK  YP F+QF++SL+  EQ+ L EE+
Subjt:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE

A0A2K3LWK7 Uncharacterized protein (Fragment)1.6e-1660.53Show/hide
Query:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE
        L L+EYI++FK + DKLAA+ KPL D  KVF V++GLGNKYK+FR A+LSK  YP+FNQF++SL+  EQ+ L EE+
Subjt:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE

A0A2Z6P7T0 Reverse transcriptase Ty1/copia-type domain-containing protein4.5e-1657.14Show/hide
Query:  DLPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE
        +L L+EYI++FK L DKL A+ KP+ D  KVF +++GLG+KYKEFR A+LSK  YP+FNQF++SL+  EQ+ L EE+
Subjt:  DLPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE

A0A438JMJ0 Retrovirus-related Pol polyprotein from transposon RE11.3e-1550Show/hide
Query:  STVEVTSP--PSSDLPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE
        ST++  +P    S   L+EY++ FK + D LAA +KP+ D  KVF +A+GLG KY +FR AMLSK  YP++NQFVL+L+ HEQ+ + E E
Subjt:  STVEVTSP--PSSDLPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEE

A0A6J1DMG5 uncharacterized protein LOC1110213793.4e-1964.56Show/hide
Query:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEEGKA
        L ++EYI++FK L D+L AMKKPLDD +KVF +ARGLG KYK+FRTAMLSK  YP++NQFVL+LKAH+Q    EEE ++
Subjt:  LPLEEYIKRFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEEGKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATAGTCGAAGAAACAGAGTGCATGCCAAGGAGGCAGCACCGCAGTGTTGCACAACGCCTTAGCGCTAAGGACGGCATCGTGGTGCTGTCAGGAAGGCGGGCGCA
CCGATTTAATTCTACCCGTAATGCCATGACGCTGTTTCAGACAGAGGCTCTCGGTATTGGCAGCATTGAGGCATATCAAGATCGATATATTCAGGAGTCATATGATAAAA
AACCTAGCCTTGTGGGAGAAATCATCGGTGACCGAACAAGAGTGATGAATGAGCCTTATAGTCATTACATAATTGTGGATGAGGGTGCGATAGTGGTCGACACCAGAAGC
GGACATGTAGTGTGCTCCAAAGAAGAGGACTTGGCCTATATACGAGTTGGAGGGATGGATGATTACTTGAGGAGAGACTCCATGAGCTTGCACGTAGGCTATCTTCAACG
TCAACTTTCTGCTATTGTCTCATTTGATCTTGGTTCAAAATTCATCTACAATAATCCTGCTGTTGTAATTTCTCATGGCGAACAACGAGACTGCACTTACAATCCAACCT
TTTCACCAATGTTCCAGCTTAATTTCCATAAATCTAAATACCACCAATTATCTACTGTGGAAGTCACAAGTCCTCCCTCTAGTGATCTACCATTAGAAGAGTACATCAAA
AGGTTCAAAGCGTTGGCTGACAAATTGGCAGCCATGAAAAAGCCTCTTGATGACCCAAGTAAGGTGTTCACTGTAGCTCGAGGACTGGGAAACAAGTACAAAGAATTCAG
AACTGCAATGTTGTCCAAGACTGCCTATCCTACATTCAATCAGTTTGTTCTCTCTTTAAAAGCTCATGAACAGTTGAACCTTCTTGAAGAAGAAGGAAAAGCTTTGTTGA
GAATGCCCCCAATGGCCATTAGTTTTGCACCCATCATGCCAGGAAGATCAACAGTCATGAAGTTTTCAGTTCTTATTGAAATTGTCTCAATAGAACCTTCTTTAATGAAC
ATGGTTCTCGAAGTTGCGATGATTGTACGAGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATAGTCGAAGAAACAGAGTGCATGCCAAGGAGGCAGCACCGCAGTGTTGCACAACGCCTTAGCGCTAAGGACGGCATCGTGGTGCTGTCAGGAAGGCGGGCGCA
CCGATTTAATTCTACCCGTAATGCCATGACGCTGTTTCAGACAGAGGCTCTCGGTATTGGCAGCATTGAGGCATATCAAGATCGATATATTCAGGAGTCATATGATAAAA
AACCTAGCCTTGTGGGAGAAATCATCGGTGACCGAACAAGAGTGATGAATGAGCCTTATAGTCATTACATAATTGTGGATGAGGGTGCGATAGTGGTCGACACCAGAAGC
GGACATGTAGTGTGCTCCAAAGAAGAGGACTTGGCCTATATACGAGTTGGAGGGATGGATGATTACTTGAGGAGAGACTCCATGAGCTTGCACGTAGGCTATCTTCAACG
TCAACTTTCTGCTATTGTCTCATTTGATCTTGGTTCAAAATTCATCTACAATAATCCTGCTGTTGTAATTTCTCATGGCGAACAACGAGACTGCACTTACAATCCAACCT
TTTCACCAATGTTCCAGCTTAATTTCCATAAATCTAAATACCACCAATTATCTACTGTGGAAGTCACAAGTCCTCCCTCTAGTGATCTACCATTAGAAGAGTACATCAAA
AGGTTCAAAGCGTTGGCTGACAAATTGGCAGCCATGAAAAAGCCTCTTGATGACCCAAGTAAGGTGTTCACTGTAGCTCGAGGACTGGGAAACAAGTACAAAGAATTCAG
AACTGCAATGTTGTCCAAGACTGCCTATCCTACATTCAATCAGTTTGTTCTCTCTTTAAAAGCTCATGAACAGTTGAACCTTCTTGAAGAAGAAGGAAAAGCTTTGTTGA
GAATGCCCCCAATGGCCATTAGTTTTGCACCCATCATGCCAGGAAGATCAACAGTCATGAAGTTTTCAGTTCTTATTGAAATTGTCTCAATAGAACCTTCTTTAATGAAC
ATGGTTCTCGAAGTTGCGATGATTGTACGAGGATAA
Protein sequenceShow/hide protein sequence
MKIVEETECMPRRQHRSVAQRLSAKDGIVVLSGRRAHRFNSTRNAMTLFQTEALGIGSIEAYQDRYIQESYDKKPSLVGEIIGDRTRVMNEPYSHYIIVDEGAIVVDTRS
GHVVCSKEEDLAYIRVGGMDDYLRRDSMSLHVGYLQRQLSAIVSFDLGSKFIYNNPAVVISHGEQRDCTYNPTFSPMFQLNFHKSKYHQLSTVEVTSPPSSDLPLEEYIK
RFKALADKLAAMKKPLDDPSKVFTVARGLGNKYKEFRTAMLSKTAYPTFNQFVLSLKAHEQLNLLEEEGKALLRMPPMAISFAPIMPGRSTVMKFSVLIEIVSIEPSLMN
MVLEVAMIVRG