; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g02210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g02210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr3:1641194..1648375
RNA-Seq ExpressionMoc03g02210
SyntenyMoc03g02210
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-4570.39Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F RGS+SGT+S  SSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW

Query:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG
        K NCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI++SGATNH C +  G
Subjt:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-4570.39Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F RGS+SGT+S  SSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW

Query:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG
        K NCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI++SGATNH C +  G
Subjt:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-4570.39Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F RGS+SGT+S  SSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW

Query:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG
        K NCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI++SGATNH C +  G
Subjt:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-4570.39Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F RGS+SGT+S  SSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW

Query:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG
        K NCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI++SGATNH C +  G
Subjt:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-4570.39Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F RGS+SGT+S  SSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW

Query:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG
        K NCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI++SGATNH C +  G
Subjt:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein4.9e-4670.39Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F RGS+SGT+S  SSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW

Query:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG
        K NCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI++SGATNH C +  G
Subjt:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG

A0A5A7TU93 Gag/pol protein4.9e-4670.39Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F RGS+SGT+S  SSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW

Query:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG
        K NCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI++SGATNH C +  G
Subjt:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG

A0A5A7V4M1 Gag/pol protein4.9e-4670.39Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F RGS+SGT+S  SSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW

Query:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG
        K NCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI++SGATNH C +  G
Subjt:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG

A0A5D3CPJ6 Gag/pol protein4.9e-4670.39Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F RGS+SGT+S  SSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW

Query:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG
        K NCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI++SGATNH C +  G
Subjt:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG

A0A5D3DS88 Gag/pol protein4.9e-4670.39Show/hide
Query:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW
        MNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F RGS+SGT+S  SSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N +GHW
Subjt:  MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHW

Query:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG
        K NCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI++SGATNH C +  G
Subjt:  KLNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAAGCTGGAGTACACTCTTACCACGCTCCTAAACGAGCTTCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAA
GAGGTTCAGCAGAGGATCGTCCTCTGGAACCAGGTCTGCGCTCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAG
CTGCTGCTGCCCAGAAAGGCAAGATCAAGGTTGCAGAGAAAGGAAAGTGTTTCCACTACAACATGGACGGGCATTGGAAGCTCAACTGCCCAAAGTACTTGGCCGAGAAG
AAGAAAGCCAACGAAGGTAAATATGATTTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGAATTCAGGAGCCACTAATCACTTCTGCAA
AACACAGAACGGAAGAAGAACAACTCCCAACCATTTTTCTCTCAATAAAAAAGATCTCTCTCAAGCTCTCCCTCTCGTTCCAAAGGATTGCTCCCACCAGCACGATCCCG
AGACCCAAGAGGATAGCGAGGAAGACACAGTGGTGGTGTTCGAGGGAAACTTGTTGAAGAAACCAACGAAAATCCGCTTCCGCTCCGGGATTCACATCCCTTCAGAGTTG
TCATCATACCCGTCTTCTGAAATTGGGGTTTTCCGGCCTTCATGGTTTGATAACTCTACCCTCACTACTAGAGAGCTGCGTTGTCTTCACGTAAAGTATCATATCCCGGA
CTCCGTTAGCCTTCGACTTCCACGTCCCGGCGAAAGCATCGATGGGTCTTCCCCTGGGGAAGTGAATTGGAAGGATGAATTGGTGGTGACTGGCTGGCTCGTGATGACAG
TGGGCTATACTTTCCTGTCCAGACAAACCCCCCGTAACTACGAGCTCACTGCCGAGGTGTTGGACTTCTACAATACTGTTCTCCAGTTACCTAAATCTAAGCGGTGGGGT
CCTGACTTGGTGAAGGAGCAAAACTTGGTGGATTTTGATTTGGCACCCCTTCGTCGGACTCCAGAGATGTCAAGGCGTGCTACTTCCTTGTTCAAGAGGGAACACCAGCG
TAATGTTCGGAGCTCTGAAGCTGGGTCCAATCGAGCCTCTGATGTTGAAGTAACTGACTTGACGGAGGATAATCCTCCATCTCATCAGCATCAAGCTTGGCCTTTGATGC
CTCCACCTGCTCCTCGCTGGCCTGCTTCGTCTCCAAGGGCTATCTACCTCATCCTCTTCTTGTGGTCACTTGGGGGCTTGGAGAATCATCTAGCGGATCGAGTTTTCGAT
ACTCGAATCCTCAGTAGAGCATCAACACCTGTCTCAGTGACCATCATCAGTCTGCCGGAGAACGCCCAACGTGAGCTTGAAATTGAGAAGAGGAGGTCTCGGGCCGAGTT
GGCAGACCTAGAGAGGAGACTTGACGAGGTGAAGACTCAGCTCTCTAATTCTGAGTTTTTGGCGGGAGAATTGAAGAAGACTGTGGAATATGTCAATCTGCAGAATGTCG
TCATAGAGTGCGGGATTTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAAGCTGGAGTACACTCTTACCACGCTCCTAAACGAGCTTCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAA
GAGGTTCAGCAGAGGATCGTCCTCTGGAACCAGGTCTGCGCTCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAG
CTGCTGCTGCCCAGAAAGGCAAGATCAAGGTTGCAGAGAAAGGAAAGTGTTTCCACTACAACATGGACGGGCATTGGAAGCTCAACTGCCCAAAGTACTTGGCCGAGAAG
AAGAAAGCCAACGAAGGTAAATATGATTTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGAATTCAGGAGCCACTAATCACTTCTGCAA
AACACAGAACGGAAGAAGAACAACTCCCAACCATTTTTCTCTCAATAAAAAAGATCTCTCTCAAGCTCTCCCTCTCGTTCCAAAGGATTGCTCCCACCAGCACGATCCCG
AGACCCAAGAGGATAGCGAGGAAGACACAGTGGTGGTGTTCGAGGGAAACTTGTTGAAGAAACCAACGAAAATCCGCTTCCGCTCCGGGATTCACATCCCTTCAGAGTTG
TCATCATACCCGTCTTCTGAAATTGGGGTTTTCCGGCCTTCATGGTTTGATAACTCTACCCTCACTACTAGAGAGCTGCGTTGTCTTCACGTAAAGTATCATATCCCGGA
CTCCGTTAGCCTTCGACTTCCACGTCCCGGCGAAAGCATCGATGGGTCTTCCCCTGGGGAAGTGAATTGGAAGGATGAATTGGTGGTGACTGGCTGGCTCGTGATGACAG
TGGGCTATACTTTCCTGTCCAGACAAACCCCCCGTAACTACGAGCTCACTGCCGAGGTGTTGGACTTCTACAATACTGTTCTCCAGTTACCTAAATCTAAGCGGTGGGGT
CCTGACTTGGTGAAGGAGCAAAACTTGGTGGATTTTGATTTGGCACCCCTTCGTCGGACTCCAGAGATGTCAAGGCGTGCTACTTCCTTGTTCAAGAGGGAACACCAGCG
TAATGTTCGGAGCTCTGAAGCTGGGTCCAATCGAGCCTCTGATGTTGAAGTAACTGACTTGACGGAGGATAATCCTCCATCTCATCAGCATCAAGCTTGGCCTTTGATGC
CTCCACCTGCTCCTCGCTGGCCTGCTTCGTCTCCAAGGGCTATCTACCTCATCCTCTTCTTGTGGTCACTTGGGGGCTTGGAGAATCATCTAGCGGATCGAGTTTTCGAT
ACTCGAATCCTCAGTAGAGCATCAACACCTGTCTCAGTGACCATCATCAGTCTGCCGGAGAACGCCCAACGTGAGCTTGAAATTGAGAAGAGGAGGTCTCGGGCCGAGTT
GGCAGACCTAGAGAGGAGACTTGACGAGGTGAAGACTCAGCTCTCTAATTCTGAGTTTTTGGCGGGAGAATTGAAGAAGACTGTGGAATATGTCAATCTGCAGAATGTCG
TCATAGAGTGCGGGATTTCGTGA
Protein sequenceShow/hide protein sequence
MNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSKRFSRGSSSGTRSALSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKIKVAEKGKCFHYNMDGHWKLNCPKYLAEK
KKANEGKYDLLVLETCLVENDDSAWILNSGATNHFCKTQNGRRTTPNHFSLNKKDLSQALPLVPKDCSHQHDPETQEDSEEDTVVVFEGNLLKKPTKIRFRSGIHIPSEL
SSYPSSEIGVFRPSWFDNSTLTTRELRCLHVKYHIPDSVSLRLPRPGESIDGSSPGEVNWKDELVVTGWLVMTVGYTFLSRQTPRNYELTAEVLDFYNTVLQLPKSKRWG
PDLVKEQNLVDFDLAPLRRTPEMSRRATSLFKREHQRNVRSSEAGSNRASDVEVTDLTEDNPPSHQHQAWPLMPPPAPRWPASSPRAIYLILFLWSLGGLENHLADRVFD
TRILSRASTPVSVTIISLPENAQRELEIEKRRSRAELADLERRLDEVKTQLSNSEFLAGELKKTVEYVNLQNVVIECGIS