; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g27190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g27190
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr9:20357000..20360957
RNA-Seq ExpressionMoc09g27190
SyntenyMoc09g27190
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041279.1 gag/pol protein [Cucumis melo var. makuwa]5.3e-4260.56Show/hide
Query:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI
        F+L +EC   P  +A+ +V+  YDCWIKAND A++YI+AS+S++L KKH+ M+TA++IMDSL  MFGQSS Q +++A+K++YN+RMKEG SV+EHVLN+I
Subjt:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI

Query:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV
        V+FNV + N AV DE+SQVS+ILKSL K+FLQF SN  MNK+
Subjt:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-4264.79Show/hide
Query:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI
        FVL KEC Q P  +AT  VR+ Y+ W KAN++A+ YI+AS+S VL KKH++M+TA+EIMDSL  MFGQ+S Q +  ALK+IYN+RM EG+SV+EHVLN++
Subjt:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI

Query:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV
        VHFNV E N AVIDE SQVSFIL+SL +SFLQF SNA MNK+
Subjt:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV

XP_022155999.1 uncharacterized protein LOC111022974 [Momordica charantia]8.8e-4573.88Show/hide
Query:  QAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLIVHFNVVET
        +AP+P+A  A R  YD WIKAND+AKVYI+ASIS+VL KKH+NM+ A+EIMDSL  MFGQ S QAR +ALKFIYNSRMKEG+S+QEHVLNL+VHFNV E 
Subjt:  QAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLIVHFNVVET

Query:  NKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV
        N AVIDE SQVSFIL+SL KSFLQF SNA MNK+
Subjt:  NKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV

XP_022158062.1 uncharacterized protein LOC111024637 [Momordica charantia]1.5e-4469.01Show/hide
Query:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI
        FVLQ++C QAP P+ATVAVR +YD WIKAND+AKV I+ASIS+VL KKH+N +  KEIMDSL  MFGQ SSQAR +AL  IYNSRMK+ SSV+EHVLNL+
Subjt:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI

Query:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV
        VHFNV E+N  VIDEQSQV FIL+SL K+FL F SNA ++ +
Subjt:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]1.2e-4671.13Show/hide
Query:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI
        FVLQ++C QAPV +ATVAVR  YD WIK+ND+AKVYI+ASIS+VL KKH++ +T KEIMDSL  MFGQ S QAR +ALKF+YNSRMKEGSSV+EHVLNL+
Subjt:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI

Query:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV
        VHFNV E+N  VIDEQSQ SFIL+SL K+FL F SNA   +V
Subjt:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV

TrEMBL top hitse value%identityAlignment
A0A5A7TIU9 Gag/pol protein2.6e-4260.56Show/hide
Query:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI
        F+L +EC   P  +A+ +V+  YDCWIKAND A++YI+AS+S++L KKH+ M+TA++IMDSL  MFGQSS Q +++A+K++YN+RMKEG SV+EHVLN+I
Subjt:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI

Query:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV
        V+FNV + N AV DE+SQVS+ILKSL K+FLQF SN  MNK+
Subjt:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV

A0A5A7V4M1 Gag/pol protein1.2e-4264.79Show/hide
Query:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI
        FVL KEC Q P  +AT  VR+ Y+ W KAN++A+ YI+AS+S VL KKH++M+TA+EIMDSL  MFGQ+S Q +  ALK+IYN+RM EG+SV+EHVLN++
Subjt:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI

Query:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV
        VHFNV E N AVIDE SQVSFIL+SL +SFLQF SNA MNK+
Subjt:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV

A0A6J1DRZ2 uncharacterized protein LOC1110229744.3e-4573.88Show/hide
Query:  QAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLIVHFNVVET
        +AP+P+A  A R  YD WIKAND+AKVYI+ASIS+VL KKH+NM+ A+EIMDSL  MFGQ S QAR +ALKFIYNSRMKEG+S+QEHVLNL+VHFNV E 
Subjt:  QAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLIVHFNVVET

Query:  NKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV
        N AVIDE SQVSFIL+SL KSFLQF SNA MNK+
Subjt:  NKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV

A0A6J1DW68 uncharacterized protein LOC1110246377.3e-4569.01Show/hide
Query:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI
        FVLQ++C QAP P+ATVAVR +YD WIKAND+AKV I+ASIS+VL KKH+N +  KEIMDSL  MFGQ SSQAR +AL  IYNSRMK+ SSV+EHVLNL+
Subjt:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI

Query:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV
        VHFNV E+N  VIDEQSQV FIL+SL K+FL F SNA ++ +
Subjt:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV

A0A6J1DWL0 uncharacterized protein LOC1110247345.9e-4771.13Show/hide
Query:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI
        FVLQ++C QAPV +ATVAVR  YD WIK+ND+AKVYI+ASIS+VL KKH++ +T KEIMDSL  MFGQ S QAR +ALKF+YNSRMKEGSSV+EHVLNL+
Subjt:  FVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKALKFIYNSRMKEGSSVQEHVLNLI

Query:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV
        VHFNV E+N  VIDEQSQ SFIL+SL K+FL F SNA   +V
Subjt:  VHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAACCCCATGGTCGTCTGTCGCAATCCCTCGCCGAAAACGAAGCTTCGCCATACATCATGGTTTCCGGCCGCTCACTTCAAGTCCAGGGGTTTGTCTTGCAAAA
AGAATGTCATCAAGCTCCTGTGCCTGACGCCACTGTGGCAGTGCGCAAGGTGTATGATTGCTGGATCAAGGCCAACGATAGGGCCAAAGTCTACATCATGGCGAGCATAT
CTAATGTGCTTCCTAAGAAGCACAAGAATATGATCACTGCCAAGGAGATCATGGACTCGCTGCATATTATGTTTGGACAATCGTCCTCACAGGCTCGAGAAAAAGCTCTT
AAGTTCATCTATAACTCCCGCATGAAGGAGGGTTCATCAGTACAAGAACACGTTCTCAACTTAATAGTCCACTTTAACGTGGTTGAGACGAACAAGGCTGTCATAGACGA
GCAGAGTCAAGTCAGCTTTATTTTGAAATCTCTTCAGAAGAGTTTTTTGCAATTTTGGAGTAATGCTGCTATGAATAAAGTTGGAATACACCCTTACCATACTCTTAAAT
GA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAACCCCATGGTCGTCTGTCGCAATCCCTCGCCGAAAACGAAGCTTCGCCATACATCATGGTTTCCGGCCGCTCACTTCAAGTCCAGGGGTTTGTCTTGCAAAA
AGAATGTCATCAAGCTCCTGTGCCTGACGCCACTGTGGCAGTGCGCAAGGTGTATGATTGCTGGATCAAGGCCAACGATAGGGCCAAAGTCTACATCATGGCGAGCATAT
CTAATGTGCTTCCTAAGAAGCACAAGAATATGATCACTGCCAAGGAGATCATGGACTCGCTGCATATTATGTTTGGACAATCGTCCTCACAGGCTCGAGAAAAAGCTCTT
AAGTTCATCTATAACTCCCGCATGAAGGAGGGTTCATCAGTACAAGAACACGTTCTCAACTTAATAGTCCACTTTAACGTGGTTGAGACGAACAAGGCTGTCATAGACGA
GCAGAGTCAAGTCAGCTTTATTTTGAAATCTCTTCAGAAGAGTTTTTTGCAATTTTGGAGTAATGCTGCTATGAATAAAGTTGGAATACACCCTTACCATACTCTTAAAT
GA
Protein sequenceShow/hide protein sequence
MKKPHGRLSQSLAENEASPYIMVSGRSLQVQGFVLQKECHQAPVPDATVAVRKVYDCWIKANDRAKVYIMASISNVLPKKHKNMITAKEIMDSLHIMFGQSSSQAREKAL
KFIYNSRMKEGSSVQEHVLNLIVHFNVVETNKAVIDEQSQVSFILKSLQKSFLQFWSNAAMNKVGIHPYHTLK