; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g17580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g17580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr11:13511656..13515789
RNA-Seq ExpressionMoc11g17580
SyntenyMoc11g17580
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035676.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-2568.54Show/hide
Query:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENIA
        M++SI+ L A++KLNG+NY  WK+NLNTILVVDDL+F+LTEECPQ PT NA RAS++AYDRWI    KA+VYILAS+ DVLAKK+E +A
Subjt:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENIA

XP_022156751.1 uncharacterized protein LOC111023591 [Momordica charantia]2.6e-2574.7Show/hide
Query:  IVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI
        I L A +KLN +NYTQWKTNLNTILVVDDL+FVLTE+CPQAPT NAARASQDAYDRWI    KAK+YILA+I +VLAKK++++
Subjt:  IVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI

XP_022157449.1 uncharacterized protein LOC111024145 [Momordica charantia]5.0e-2973.12Show/hide
Query:  HTYDNMSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI
        + Y + S SII L A EK N ENYTQWKTNLNTILVVDDL+F+LTEECPQAPTPNAARAS+DAYDRWI    KA VYIL SI DVL+KK+E++
Subjt:  HTYDNMSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI

XP_022158202.1 uncharacterized protein LOC111024739 [Momordica charantia]2.7e-2775Show/hide
Query:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI
        MS S+I L A EKLNG+NYTQWKTNLN ILVVDDL+FVLTEEC Q PTPNA RAS+DAYDRWI    KAKVYI ASI DVLAKK++ +
Subjt:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI

XP_038882358.1 uncharacterized protein LOC120073622 [Benincasa hispida]3.3e-2567.42Show/hide
Query:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENIA
        M++SII L  +EKLNG+NY+ WK+NLNTILVVDDL+FVLTEECPQAPT NA R  ++AYDRW+    KA++YILAS+ DVLAKK+E++A
Subjt:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENIA

TrEMBL top hitse value%identityAlignment
A0A5A7T0E9 Gag/pol protein1.2e-2568.54Show/hide
Query:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENIA
        M++SI+ L A++KLNG+NY  WK+NLNTILVVDDL+F+LTEECPQ PT NA RAS++AYDRWI    KA+VYILAS+ DVLAKK+E +A
Subjt:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENIA

A0A6J1DVX8 uncharacterized protein LOC1110235911.2e-2574.7Show/hide
Query:  IVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI
        I L A +KLN +NYTQWKTNLNTILVVDDL+FVLTE+CPQAPT NAARASQDAYDRWI    KAK+YILA+I +VLAKK++++
Subjt:  IVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI

A0A6J1DWI4 uncharacterized protein LOC1110241452.4e-2973.12Show/hide
Query:  HTYDNMSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI
        + Y + S SII L A EK N ENYTQWKTNLNTILVVDDL+F+LTEECPQAPTPNAARAS+DAYDRWI    KA VYIL SI DVL+KK+E++
Subjt:  HTYDNMSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI

A0A6J1DWL0 uncharacterized protein LOC1110247341.6e-2572.41Show/hide
Query:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNEN
        MSASII L A +KLNGENY QWK+NLNTILV+DDL+FVL E+CPQAP  NA  A ++AYDRWI    KAKVYILASI DVLAKK+E+
Subjt:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNEN

A0A6J1DWL4 uncharacterized protein LOC1110247391.3e-2775Show/hide
Query:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI
        MS S+I L A EKLNG+NYTQWKTNLN ILVVDDL+FVLTEEC Q PTPNA RAS+DAYDRWI    KAKVYI ASI DVLAKK++ +
Subjt:  MSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWI----KAKVYILASIFDVLAKKNENI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTGGAGCGACCACCCCTACGGAGGGCTTATTGGGGCGGACCTCTGGGGTCTGAAAATGACGGGTCACACTTATGATAACATGTCTGCATCCATTATT
GTCCTGTTTGCCACCGAAAAACTTAATGGTGAAAATTACACACAATGGAAAACGAACCTTAACACGATACTCGTGGTAGATGATCTTAAGTTCGTTTTAACTGAG
GAGTGTCCTCAGGCTCCCACGCCTAATGCAGCCCGAGCCAGTCAGGATGCCTATGACAGATGGATCAAGGCCAAGGTCTACATCTTGGCAAGCATATTTGATGTG
CTGGCCAAGAAGAATGAGAACATCGCACCGCAAGGGAGATCATGGACTCGTTGCGGGACATGTTTGGACAGCCGAGCAATTATTACAAGAAAGGCAAGGAAGATT
CAAGAGGCTTTCACACTGCACCTTCAAAAGCTTGTTAATGCACAAGAACCAACAAAGAGTTTTGAGCCCGAATTTATTCATAATGTTACTTCAATGAGTCAAGAA
GAGAATGGAGCAAAGATGGCACGGGAAAAGTTGTCTATTTTGAGAGATGGCACGGAGGACAAAAAAAGTGTGCAGATTCGTGAACAGGCTTGTAATGCTCTTGTT
ATGGATAACTTAAGATTGAAACATATTGGAATTCAAGAACTTTTGTTCCTTAACACATCTGGTGTTGCTTCTTTCAAATCTTGCTTTAGGTTTGATGTTAAACCG
ATTCATTGGATTAGTCTTGATCAAGCTAATTCTGGAGTGAGTCGACCCGCGCCGCGATTTCCGACCACCAAGGGAGATCGAATCCGCTCACGAGGGGGGCTGCGC
CCCGGAAAACGCCCCAGGGATTGCCGCCTACTGTTGCCCGACGCCGCGAGCACGCCTGCTGCTGCCCGAAGCCGCGCGCCTATTGCTGTCCGACGCCGCGTGCCG
CTGATGCCGGAAGCGGCGCCACCTGTTGCAGTCCGTCAGAGATGGTGCGTCGTTGCATCGAGGGTCCCCAGCTGTTCAGCCCGCGTCGGGGAGGAACAGGTTGAC
GATAGAAGGAGGAAACAGAAAGAAGGAAAATGGGGGGTGGGTGTTTCACGAAGGAGATGGGTTTGGGGGCTAGGGTTTTCCCTTCACTTCTCTTTTGTTCTAAAC
ATAAAGAAGGCTGACTTGGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTGGAGCGACCACCCCTACGGAGGGCTTATTGGGGCGGACCTCTGGGGTCTGAAAATGACGGGTCACACTTATGATAACATGTCTGCATCCATTATT
GTCCTGTTTGCCACCGAAAAACTTAATGGTGAAAATTACACACAATGGAAAACGAACCTTAACACGATACTCGTGGTAGATGATCTTAAGTTCGTTTTAACTGAG
GAGTGTCCTCAGGCTCCCACGCCTAATGCAGCCCGAGCCAGTCAGGATGCCTATGACAGATGGATCAAGGCCAAGGTCTACATCTTGGCAAGCATATTTGATGTG
CTGGCCAAGAAGAATGAGAACATCGCACCGCAAGGGAGATCATGGACTCGTTGCGGGACATGTTTGGACAGCCGAGCAATTATTACAAGAAAGGCAAGGAAGATT
CAAGAGGCTTTCACACTGCACCTTCAAAAGCTTGTTAATGCACAAGAACCAACAAAGAGTTTTGAGCCCGAATTTATTCATAATGTTACTTCAATGAGTCAAGAA
GAGAATGGAGCAAAGATGGCACGGGAAAAGTTGTCTATTTTGAGAGATGGCACGGAGGACAAAAAAAGTGTGCAGATTCGTGAACAGGCTTGTAATGCTCTTGTT
ATGGATAACTTAAGATTGAAACATATTGGAATTCAAGAACTTTTGTTCCTTAACACATCTGGTGTTGCTTCTTTCAAATCTTGCTTTAGGTTTGATGTTAAACCG
ATTCATTGGATTAGTCTTGATCAAGCTAATTCTGGAGTGAGTCGACCCGCGCCGCGATTTCCGACCACCAAGGGAGATCGAATCCGCTCACGAGGGGGGCTGCGC
CCCGGAAAACGCCCCAGGGATTGCCGCCTACTGTTGCCCGACGCCGCGAGCACGCCTGCTGCTGCCCGAAGCCGCGCGCCTATTGCTGTCCGACGCCGCGTGCCG
CTGATGCCGGAAGCGGCGCCACCTGTTGCAGTCCGTCAGAGATGGTGCGTCGTTGCATCGAGGGTCCCCAGCTGTTCAGCCCGCGTCGGGGAGGAACAGGTTGAC
GATAGAAGGAGGAAACAGAAAGAAGGAAAATGGGGGGTGGGTGTTTCACGAAGGAGATGGGTTTGGGGGCTAGGGTTTTCCCTTCACTTCTCTTTTGTTCTAAAC
ATAAAGAAGGCTGACTTGGGCTAG
Protein sequenceShow/hide protein sequence
MSSWSDHPYGGLIGADLWGLKMTGHTYDNMSASIIVLFATEKLNGENYTQWKTNLNTILVVDDLKFVLTEECPQAPTPNAARASQDAYDRWIKAKVYILASIFDV
LAKKNENIAPQGRSWTRCGTCLDSRAIITRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQACNALV
MDNLRLKHIGIQELLFLNTSGVASFKSCFRFDVKPIHWISLDQANSGVSRPAPRFPTTKGDRIRSRGGLRPGKRPRDCRLLLPDAASTPAAARSRAPIAVRRRVP
LMPEAAPPVAVRQRWCVVASRVPSCSARVGEEQVDDRRRKQKEGKWGVGVSRRRWVWGLGFSLHFSFVLNIKKADLG