; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g25090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g25090
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr9:18809692..18812812
RNA-Seq ExpressionMoc09g25090
SyntenyMoc09g25090
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035676.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-1971.01Show/hide
Query:  SKSIALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKI
        S  + LLA +KLNG+NY  WK NLNTILVVDDLRF+LTEECPQ PT +A RASR+AYDRWI+ANEK ++
Subjt:  SKSIALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKI

KAA0067803.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-1935.74Show/hide
Query:  MSKSIA-LLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITLEITNT-------QSKVVLNEISDEATN
        M++SI  LLA EKLN +NY  WK NLNT LVVDDLRFVL EECPQ    +A RASR AYDRWI+ANEK  + +  + +       +S     EI D    
Subjt:  MSKSIA-LLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITLEITNT-------QSKVVLNEISDEATN

Query:  -------TSTRVADKAITSTRVVDGASTSR---------------------------------------QSHSS------------------QKLIVSRR
               +  + A K I + R+ +G S                                          Q+++S                  Q L    R
Subjt:  -------TSTRVADKAITSTRVVDGASTSR---------------------------------------QSHSS------------------QKLIVSRR

Query:  SGRIMSQLDRYVGLTEIQVVIPDDGVEDPLTYKNA
          RI  Q DRY GL E Q++IPDDG++D LTYK A
Subjt:  SGRIMSQLDRYVGLTEIQVVIPDDGVEDPLTYKNA

XP_022156751.1 uncharacterized protein LOC111023591 [Momordica charantia]1.1e-2153.91Show/hide
Query:  MSKSIALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITLEITNTQSKVVLNEISDEATNTSTRVADK
        MSKSIALLA +KLN +NY QWK NLNTILVVDDLRFVLTE+CPQAPT +AARAS+DAYDRWI+AN+K KI   I  T S V+  +      + S   A +
Subjt:  MSKSIALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITLEITNTQSKVVLNEISDEATNTSTRVADK

Query:  AITSTRVVDGASTSRQSHSSQKLIVSRR
         + S + + G  + +  H + K I + R
Subjt:  AITSTRVVDGASTSRQSHSSQKLIVSRR

XP_022157449.1 uncharacterized protein LOC111024145 [Momordica charantia]2.2e-2267.03Show/hide
Query:  IALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITL--EITNTQSK-----VVLNEISD
        IALLA EK N ENY QWK NLNTILVVDDLRF+LTEECPQAPTP+AARASRDAYDRWI+AN+K  + +   I++  SK     V   EI D
Subjt:  IALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITL--EITNTQSK-----VVLNEISD

XP_022158202.1 uncharacterized protein LOC111024739 [Momordica charantia]8.3e-2280.3Show/hide
Query:  IALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKI
        IALLA+EKLNG+NY QWK NLN ILVVDDLRFVLTEEC Q PTP+A RASRDAYDRWI+AN+K K+
Subjt:  IALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKI

TrEMBL top hitse value%identityAlignment
A0A5A7T0E9 Gag/pol protein4.9e-2071.01Show/hide
Query:  SKSIALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKI
        S  + LLA +KLNG+NY  WK NLNTILVVDDLRF+LTEECPQ PT +A RASR+AYDRWI+ANEK ++
Subjt:  SKSIALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKI

A0A5A7VI85 Gag/pol protein1.9e-1935.74Show/hide
Query:  MSKSIA-LLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITLEITNT-------QSKVVLNEISDEATN
        M++SI  LLA EKLN +NY  WK NLNT LVVDDLRFVL EECPQ    +A RASR AYDRWI+ANEK  + +  + +       +S     EI D    
Subjt:  MSKSIA-LLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITLEITNT-------QSKVVLNEISDEATN

Query:  -------TSTRVADKAITSTRVVDGASTSR---------------------------------------QSHSS------------------QKLIVSRR
               +  + A K I + R+ +G S                                          Q+++S                  Q L    R
Subjt:  -------TSTRVADKAITSTRVVDGASTSR---------------------------------------QSHSS------------------QKLIVSRR

Query:  SGRIMSQLDRYVGLTEIQVVIPDDGVEDPLTYKNA
          RI  Q DRY GL E Q++IPDDG++D LTYK A
Subjt:  SGRIMSQLDRYVGLTEIQVVIPDDGVEDPLTYKNA

A0A6J1DVX8 uncharacterized protein LOC1110235915.3e-2253.91Show/hide
Query:  MSKSIALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITLEITNTQSKVVLNEISDEATNTSTRVADK
        MSKSIALLA +KLN +NY QWK NLNTILVVDDLRFVLTE+CPQAPT +AARAS+DAYDRWI+AN+K KI   I  T S V+  +      + S   A +
Subjt:  MSKSIALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITLEITNTQSKVVLNEISDEATNTSTRVADK

Query:  AITSTRVVDGASTSRQSHSSQKLIVSRR
         + S + + G  + +  H + K I + R
Subjt:  AITSTRVVDGASTSRQSHSSQKLIVSRR

A0A6J1DWI4 uncharacterized protein LOC1110241451.1e-2267.03Show/hide
Query:  IALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITL--EITNTQSK-----VVLNEISD
        IALLA EK N ENY QWK NLNTILVVDDLRF+LTEECPQAPTP+AARASRDAYDRWI+AN+K  + +   I++  SK     V   EI D
Subjt:  IALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITL--EITNTQSK-----VVLNEISD

A0A6J1DWL4 uncharacterized protein LOC1110247394.0e-2280.3Show/hide
Query:  IALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKI
        IALLA+EKLNG+NY QWK NLN ILVVDDLRFVLTEEC Q PTP+A RASRDAYDRWI+AN+K K+
Subjt:  IALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGGCCTGGTGTCGCCCTGGGCGCGGCCTCCCTACGGAAGGTGTTTGCATGGTTCAATACCAAGCATGTCTAAATCTATTGCCTTGCTTGCCGTCGAAAAACTCAA
CGGCGAAAATTACAGACAATGGAAAATGAACCTTAACACAATACTCGTGGTAGATGATCTGAGGTTCGTCTTAACTGAGGAGTGTCCTCAGGCTCCCACGCCTAGTGCAG
CTCGAGCGAGTCGGGATGCCTATGACAGATGGATCCAGGCCAATGAGAAGAAGAAGATCACATTAGAGATCACAAACACACAAAGCAAGGTTGTGTTAAATGAGATTTCC
GATGAAGCTACAAATACATCAACAAGAGTTGCTGATAAAGCTATCACTTCAACAAGAGTTGTTGATGGCGCTAGTACATCACGTCAGTCACATTCATCTCAAAAGTTGAT
AGTGTCTCGACGTAGTGGGAGGATTATGTCACAACTTGATCGTTACGTGGGTTTAACAGAAATCCAGGTCGTCATACCTGATGATGGCGTTGAGGATCCATTGACATACA
AAAATGCAAATGGAAGATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTGGCCTGGTGTCGCCCTGGGCGCGGCCTCCCTACGGAAGGTGTTTGCATGGTTCAATACCAAGCATGTCTAAATCTATTGCCTTGCTTGCCGTCGAAAAACTCAA
CGGCGAAAATTACAGACAATGGAAAATGAACCTTAACACAATACTCGTGGTAGATGATCTGAGGTTCGTCTTAACTGAGGAGTGTCCTCAGGCTCCCACGCCTAGTGCAG
CTCGAGCGAGTCGGGATGCCTATGACAGATGGATCCAGGCCAATGAGAAGAAGAAGATCACATTAGAGATCACAAACACACAAAGCAAGGTTGTGTTAAATGAGATTTCC
GATGAAGCTACAAATACATCAACAAGAGTTGCTGATAAAGCTATCACTTCAACAAGAGTTGTTGATGGCGCTAGTACATCACGTCAGTCACATTCATCTCAAAAGTTGAT
AGTGTCTCGACGTAGTGGGAGGATTATGTCACAACTTGATCGTTACGTGGGTTTAACAGAAATCCAGGTCGTCATACCTGATGATGGCGTTGAGGATCCATTGACATACA
AAAATGCAAATGGAAGATGTTGA
Protein sequenceShow/hide protein sequence
MLGLVSPWARPPYGRCLHGSIPSMSKSIALLAVEKLNGENYRQWKMNLNTILVVDDLRFVLTEECPQAPTPSAARASRDAYDRWIQANEKKKITLEITNTQSKVVLNEIS
DEATNTSTRVADKAITSTRVVDGASTSRQSHSSQKLIVSRRSGRIMSQLDRYVGLTEIQVVIPDDGVEDPLTYKNANGRC