; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g11680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g11680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr9:9922621..9934295
RNA-Seq ExpressionMoc09g11680
SyntenyMoc09g11680
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.3e-2935.37Show/hide
Query:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKT-----------------------------RD
        M++ ++ LL +EKL G+NY+ WK NLNTILVVDDLRFVLT+ECPQ P  +A+R    AYDRW KANDK                              R+
Subjt:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKT-----------------------------RD

Query:  MFGQPSTQARHDALKFIFNAHMKEETSIQR--------------------ESSQLFLGISSEEFPTVPQQYLKTFEKKKKS----------------GKG
        MFGQPS   RH+A+K I+   MKE TS++                     E++Q+   + S     VP Q   +  K + +                 KG
Subjt:  MFGQPSTQARHDALKFIFNAHMKEETSIQR--------------------ESSQLFLGISSEEFPTVPQQYLKTFEKKKKS----------------GKG

Query:  NEADPAIAVAR------------------------KGKA-------KVAKKGKCFNCNVDEYWKRNYPKYLAEKKKVKEGERRVPDQKATNDKY
         E +  +AV +                        KGKA       K A KGKCF+CN D +WKRN PKYLAEKK           +KAT  KY
Subjt:  NEADPAIAVAR------------------------KGKA-------KVAKKGKCFNCNVDEYWKRNYPKYLAEKKKVKEGERRVPDQKATNDKY

KAA0046201.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-3038.1Show/hide
Query:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKTR-----------------------------D
        M++ ++ LL ++KL G+NYT WK NLNTILVV+DLRFVLT+ECPQ P   A+R    AYDRW KAN+K R                              
Subjt:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKTR-----------------------------D

Query:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGIS-SEEFPTVPQQYLKTFEK-----KKKSGKGNEADPAIAVA-----RKG------KAKVAK
        MFGQPS   +H+A+K+I+   +KE TS++     + +  + +E F  +     K  E      +K+  +G+ +   +  +     RKG      K K   
Subjt:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGIS-SEEFPTVPQQYLKTFEK-----KKKSGKGNEADPAIAVA-----RKG------KAKVAK

Query:  KGKCFNCNVDEYWKRNYPKYLAEKKKVKEGE
        KGKC++CN D +W RN PKYLAEKK  KE +
Subjt:  KGKCFNCNVDEYWKRNYPKYLAEKKKVKEGE

KAA0050233.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-2936.52Show/hide
Query:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKT-----------------------------RD
        M++ ++ LL ++KL G+NY  WK NLN ILV+DDLRFVLT+E PQTP  +A++    AYDRW KAN+K                              R+
Subjt:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKT-----------------------------RD

Query:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGISSEEFPTVP-------------QQYLKTFEKKKKSGKGNE--ADPAIA-------VARKGK
        MFGQPS   RH+A+K+I+   MKE TS++     + +  +  E    P             +  + T EKK   G  ++    P++          +K +
Subjt:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGISSEEFPTVP-------------QQYLKTFEKKKKSGKGNE--ADPAIA-------VARKGK

Query:  AKVAKKGKCFNCNVDEYWKRNYPKYLAEKK
         K   KGKC+ CN DE+W RN PKYLAEK+
Subjt:  AKVAKKGKCFNCNVDEYWKRNYPKYLAEKK

KAA0058365.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-3038.71Show/hide
Query:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKTR-----------------------------D
        MSS +I LL  ++L GENY  WK+ LN ILV+ DLRFVL +ECP  P+ +AS+   + YD WTKANDK R                             +
Subjt:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKTR-----------------------------D

Query:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGISSEEFPTVPQQYLKTFEKKKKSGKGNEADPAIAVARKGKAKVAKKGKCFNCNVDEYWKRNY
        MFGQ S Q + +A+K+++NA MKE   ++     + +  +      + Q     F++KK+  +  + DP +A    GKAK+  KGK F+CNVD +WKRN 
Subjt:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGISSEEFPTVPQQYLKTFEKKKKSGKGNEADPAIAVARKGKAKVAKKGKCFNCNVDEYWKRNY

Query:  PKYLAEKKKVKEGERRV
        PKYL +KK+ K     V
Subjt:  PKYLAEKKKVKEGERRV

XP_022158579.1 uncharacterized protein LOC111025033 [Momordica charantia]1.3e-2976.53Show/hide
Query:  KYLSQVRDQFSQFSKYEILQISRAQNVNAAALARLAAAYETDLGKTVPVEILPELSIEVQETIDIDEQGQPEENWMSHLIKYLKDGTPPNEMIETQQL
        KYLSQVR+Q  QFSKYEI QI  AQNVNA  LARLA AYETDLG+TVPVEILP  SIE QE +DID QGQPEENW S LIKYLKDG  P E IE QQL
Subjt:  KYLSQVRDQFSQFSKYEILQISRAQNVNAAALARLAAAYETDLGKTVPVEILPELSIEVQETIDIDEQGQPEENWMSHLIKYLKDGTPPNEMIETQQL

TrEMBL top hitse value%identityAlignment
A0A5A7TXW7 Gag/pol protein7.3e-3138.1Show/hide
Query:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKTR-----------------------------D
        M++ ++ LL ++KL G+NYT WK NLNTILVV+DLRFVLT+ECPQ P   A+R    AYDRW KAN+K R                              
Subjt:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKTR-----------------------------D

Query:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGIS-SEEFPTVPQQYLKTFEK-----KKKSGKGNEADPAIAVA-----RKG------KAKVAK
        MFGQPS   +H+A+K+I+   +KE TS++     + +  + +E F  +     K  E      +K+  +G+ +   +  +     RKG      K K   
Subjt:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGIS-SEEFPTVPQQYLKTFEK-----KKKSGKGNEADPAIAVA-----RKG------KAKVAK

Query:  KGKCFNCNVDEYWKRNYPKYLAEKKKVKEGE
        KGKC++CN D +W RN PKYLAEKK  KE +
Subjt:  KGKCFNCNVDEYWKRNYPKYLAEKKKVKEGE

A0A5A7U2U6 Gag/pol protein1.4e-2936.52Show/hide
Query:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKT-----------------------------RD
        M++ ++ LL ++KL G+NY  WK NLN ILV+DDLRFVLT+E PQTP  +A++    AYDRW KAN+K                              R+
Subjt:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKT-----------------------------RD

Query:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGISSEEFPTVP-------------QQYLKTFEKKKKSGKGNE--ADPAIA-------VARKGK
        MFGQPS   RH+A+K+I+   MKE TS++     + +  +  E    P             +  + T EKK   G  ++    P++          +K +
Subjt:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGISSEEFPTVP-------------QQYLKTFEKKKKSGKGNE--ADPAIA-------VARKGK

Query:  AKVAKKGKCFNCNVDEYWKRNYPKYLAEKK
         K   KGKC+ CN DE+W RN PKYLAEK+
Subjt:  AKVAKKGKCFNCNVDEYWKRNYPKYLAEKK

A0A5D3DIM3 Gag/pol protein5.6e-3138.71Show/hide
Query:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKTR-----------------------------D
        MSS +I LL  ++L GENY  WK+ LN ILV+ DLRFVL +ECP  P+ +AS+   + YD WTKANDK R                             +
Subjt:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKTR-----------------------------D

Query:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGISSEEFPTVPQQYLKTFEKKKKSGKGNEADPAIAVARKGKAKVAKKGKCFNCNVDEYWKRNY
        MFGQ S Q + +A+K+++NA MKE   ++     + +  +      + Q     F++KK+  +  + DP +A    GKAK+  KGK F+CNVD +WKRN 
Subjt:  MFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGISSEEFPTVPQQYLKTFEKKKKSGKGNEADPAIAVARKGKAKVAKKGKCFNCNVDEYWKRNY

Query:  PKYLAEKKKVKEGERRV
        PKYL +KK+ K     V
Subjt:  PKYLAEKKKVKEGERRV

A0A6J1DZU0 Ribonuclease H6.1e-3076.53Show/hide
Query:  KYLSQVRDQFSQFSKYEILQISRAQNVNAAALARLAAAYETDLGKTVPVEILPELSIEVQETIDIDEQGQPEENWMSHLIKYLKDGTPPNEMIETQQL
        KYLSQVR+Q  QFSKYEI QI  AQNVNA  LARLA AYETDLG+TVPVEILP  SIE QE +DID QGQPEENW S LIKYLKDG  P E IE QQL
Subjt:  KYLSQVRDQFSQFSKYEILQISRAQNVNAAALARLAAAYETDLGKTVPVEILPELSIEVQETIDIDEQGQPEENWMSHLIKYLKDGTPPNEMIETQQL

E2GK51 Gag/pol protein (Fragment)6.1e-3035.37Show/hide
Query:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKT-----------------------------RD
        M++ ++ LL +EKL G+NY+ WK NLNTILVVDDLRFVLT+ECPQ P  +A+R    AYDRW KANDK                              R+
Subjt:  MSSFVIVLLTAEKLKGENYTQWKMNLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKT-----------------------------RD

Query:  MFGQPSTQARHDALKFIFNAHMKEETSIQR--------------------ESSQLFLGISSEEFPTVPQQYLKTFEKKKKS----------------GKG
        MFGQPS   RH+A+K I+   MKE TS++                     E++Q+   + S     VP Q   +  K + +                 KG
Subjt:  MFGQPSTQARHDALKFIFNAHMKEETSIQR--------------------ESSQLFLGISSEEFPTVPQQYLKTFEKKKKS----------------GKG

Query:  NEADPAIAVAR------------------------KGKA-------KVAKKGKCFNCNVDEYWKRNYPKYLAEKKKVKEGERRVPDQKATNDKY
         E +  +AV +                        KGKA       K A KGKCF+CN D +WKRN PKYLAEKK           +KAT  KY
Subjt:  NEADPAIAVAR------------------------KGKA-------KVAKKGKCFNCNVDEYWKRNYPKYLAEKKKVKEGERRVPDQKATNDKY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTGTTTGCATGGTTTAATATCAAGGTGCATGGAGAAAGTATTTATAGTATGTGGGAGAAGGATGTGTGGCAACACACCGTATGGCTTCCTCTATTTGTTTGCAT
TGTGAGGTTTCATACACAGCCTGTGTGTCGTCCTAGAATCAATATCAGGGCAAACTCCAGAAATGGATATGGTTTCTTACATTTTGCACCAATATTGCCTTTCTCTATGG
TGGGCTTGTTGGGGCGGACCTCTGGGGTCTCAAAATGCATGTCTAGTTTTGTGATTGTGTTACTTACCGCCGAAAAACTTAAAGGTGAAAACTACACTCAATGGAAAATG
AACTTGAATACAATTCTCGTGGTAGATGATCTAAGGTTCGTCTTGACTAAGGAGTGTCCTCAGACTCCCACACCCGATGCATCCCGAGTTGGTTGGAATGCCTATGACAG
ATGGACCAAGGCCAATGACAAGACCAGAGACATGTTTGGACAACCGTCCACTCAAGCTCGGCACGATGCTCTCAAGTTCATTTTCAATGCCCACATGAAAGAAGAAACAT
CAATACAAAGGGAGTCAAGTCAGCTTTTTCTTGGAATCTCTTCCGAAGAATTTCCTACAGTTCCGCAGCAATACTTAAAGACTTTCGAGAAGAAAAAGAAGAGTGGTAAG
GGGAATGAAGCTGACCCTGCTATTGCTGTTGCCCGAAAGGGGAAGGCCAAGGTTGCAAAGAAAGGAAAGTGTTTCAACTGTAACGTGGATGAGTACTGGAAAAGAAACTA
CCCAAAATACTTGGCGGAAAAGAAGAAAGTCAAAGAAGGTGAAAGAAGAGTTCCAGACCAGAAAGCCACGAATGACAAGTACCTTTCTCAGGTAAGAGACCAGTTCAGCC
AATTCTCGAAATATGAGATCCTACAAATTTCTCGTGCTCAAAACGTAAATGCAGCTGCGTTAGCTCGACTAGCTGCGGCTTATGAAACCGACTTGGGAAAAACTGTGCCA
GTAGAAATTTTGCCCGAGCTAAGCATAGAAGTGCAGGAGACAATAGATATCGATGAGCAGGGGCAACCAGAGGAGAACTGGATGAGCCATTTGATCAAATATTTAAAGGA
TGGGACCCCACCTAATGAAATGATAGAGACCCAACAACTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGTGTTTGCATGGTTTAATATCAAGGTGCATGGAGAAAGTATTTATAGTATGTGGGAGAAGGATGTGTGGCAACACACCGTATGGCTTCCTCTATTTGTTTGCAT
TGTGAGGTTTCATACACAGCCTGTGTGTCGTCCTAGAATCAATATCAGGGCAAACTCCAGAAATGGATATGGTTTCTTACATTTTGCACCAATATTGCCTTTCTCTATGG
TGGGCTTGTTGGGGCGGACCTCTGGGGTCTCAAAATGCATGTCTAGTTTTGTGATTGTGTTACTTACCGCCGAAAAACTTAAAGGTGAAAACTACACTCAATGGAAAATG
AACTTGAATACAATTCTCGTGGTAGATGATCTAAGGTTCGTCTTGACTAAGGAGTGTCCTCAGACTCCCACACCCGATGCATCCCGAGTTGGTTGGAATGCCTATGACAG
ATGGACCAAGGCCAATGACAAGACCAGAGACATGTTTGGACAACCGTCCACTCAAGCTCGGCACGATGCTCTCAAGTTCATTTTCAATGCCCACATGAAAGAAGAAACAT
CAATACAAAGGGAGTCAAGTCAGCTTTTTCTTGGAATCTCTTCCGAAGAATTTCCTACAGTTCCGCAGCAATACTTAAAGACTTTCGAGAAGAAAAAGAAGAGTGGTAAG
GGGAATGAAGCTGACCCTGCTATTGCTGTTGCCCGAAAGGGGAAGGCCAAGGTTGCAAAGAAAGGAAAGTGTTTCAACTGTAACGTGGATGAGTACTGGAAAAGAAACTA
CCCAAAATACTTGGCGGAAAAGAAGAAAGTCAAAGAAGGTGAAAGAAGAGTTCCAGACCAGAAAGCCACGAATGACAAGTACCTTTCTCAGGTAAGAGACCAGTTCAGCC
AATTCTCGAAATATGAGATCCTACAAATTTCTCGTGCTCAAAACGTAAATGCAGCTGCGTTAGCTCGACTAGCTGCGGCTTATGAAACCGACTTGGGAAAAACTGTGCCA
GTAGAAATTTTGCCCGAGCTAAGCATAGAAGTGCAGGAGACAATAGATATCGATGAGCAGGGGCAACCAGAGGAGAACTGGATGAGCCATTTGATCAAATATTTAAAGGA
TGGGACCCCACCTAATGAAATGATAGAGACCCAACAACTCTAG
Protein sequenceShow/hide protein sequence
MEVFAWFNIKVHGESIYSMWEKDVWQHTVWLPLFVCIVRFHTQPVCRPRINIRANSRNGYGFLHFAPILPFSMVGLLGRTSGVSKCMSSFVIVLLTAEKLKGENYTQWKM
NLNTILVVDDLRFVLTKECPQTPTPDASRVGWNAYDRWTKANDKTRDMFGQPSTQARHDALKFIFNAHMKEETSIQRESSQLFLGISSEEFPTVPQQYLKTFEKKKKSGK
GNEADPAIAVARKGKAKVAKKGKCFNCNVDEYWKRNYPKYLAEKKKVKEGERRVPDQKATNDKYLSQVRDQFSQFSKYEILQISRAQNVNAAALARLAAAYETDLGKTVP
VEILPELSIEVQETIDIDEQGQPEENWMSHLIKYLKDGTPPNEMIETQQL