; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g29760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g29760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr8:21295369..21306855
RNA-Seq ExpressionMoc08g29760
SyntenyMoc08g29760
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026069.1 gag/pol protein [Cucumis melo var. makuwa]6.8e-3267.97Show/hide
Query:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK
        MVHFN+AE NGAVIDE +QV FILESLP+SFLQFRSNAVMNK+ Y LTT+LNELQT++SL+K KGQ+GEANVATS +KFH GSTSG KS+PSSSG+K +K
Subjt:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK

Query:  KKNATGKGPGSDPTTATAKKGKAKVSGK
        KK     G G+    ATAK  K   + K
Subjt:  KKNATGKGPGSDPTTATAKKGKAKVSGK

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]6.8e-3262.42Show/hide
Query:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK
        MVHFN+AE NGAVIDE +QV FILESLP+SFLQFRSNAVMNK+ Y LTTLLNELQT++SL+K KGQ+GEANVATS +KFH GSTSG KS+PSSSG+K +K
Subjt:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK

Query:  KKNATGKGPGSDPTTATAKKGKAKVSGKESVSNATWTGIGNLDVREMTL
        KK     G G+    A AK  K K    + +S  T    G  + R  TL
Subjt:  KKNATGKGPGSDPTTATAKKGKAKVSGKESVSNATWTGIGNLDVREMTL

TYJ96755.1 gag/pol protein [Cucumis melo var. makuwa]6.8e-3258.24Show/hide
Query:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK
        MVHFNMAE N AVIDE +QV FILESLP+SFLQFRSNAVMNK+ Y LTTLLNELQT++SL+K KGQ+GEANVATS +KF+ GS SG KS+P SS +K  K
Subjt:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK

Query:  KKNATGKGPGSDPTTATAK-KGKAKVSGKESVSN---ATWTGIGN---LDVREMTLKVGTEEVVSTVVVG
        KK     G G+    A AK   KAK +     +N   +++ GI +   L+  EMT++VGT  VVS + VG
Subjt:  KKNATGKGPGSDPTTATAK-KGKAKVSGKESVSN---ATWTGIGN---LDVREMTLKVGTEEVVSTVVVG

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-3167.97Show/hide
Query:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK
        MVHFN+AE NGAVIDE +QV FILESLP+SFLQFRSNAVMNK+ Y LTTLLNELQT++SL+K KGQ+GEANVATS +KFH GSTSG KS+PSSSG+K +K
Subjt:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK

Query:  KKNATGKGPGSDPTTATAKKGKAKVSGK
        KK     G G+    A AK  K   + K
Subjt:  KKNATGKGPGSDPTTATAKKGKAKVSGK

XP_022147761.1 uncharacterized protein LOC111016619 [Momordica charantia]4.7e-3340.43Show/hide
Query:  MKEKAPLKFKLPTLTQYDSSTDPIDHLNAYREWEDIYGITEAIRCRVFSFTLTGSTCI------------------------------------------
        MKEK P KFKLP + +YD STDPIDHL+ Y EW DIYGITEAIRCRVFSFTLTGST I                                          
Subjt:  MKEKAPLKFKLPTLTQYDSSTDPIDHLNAYREWEDIYGITEAIRCRVFSFTLTGSTCI------------------------------------------

Query:  ----------------------------------------------------------QVQKYISVGELIHSRHDPERECADRTVKRERLGEKRHGSSWE
                                                                  + Q Y+SVGELIHS+ DP+ + AD   KR+R GEKRHG  WE
Subjt:  ----------------------------------------------------------QVQKYISVGELIHSRHDPERECADRTVKRERLGEKRHGSSWE

Query:  KSDSGHGRLAQQDPQRKFERYTPTVAPLEK
        +SDSG GRL Q+DPQRKFE+YTPT  PLE+
Subjt:  KSDSGHGRLAQQDPQRKFERYTPTVAPLEK

TrEMBL top hitse value%identityAlignment
A0A5A7SLD1 Gag/pol protein3.3e-3267.97Show/hide
Query:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK
        MVHFN+AE NGAVIDE +QV FILESLP+SFLQFRSNAVMNK+ Y LTT+LNELQT++SL+K KGQ+GEANVATS +KFH GSTSG KS+PSSSG+K +K
Subjt:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK

Query:  KKNATGKGPGSDPTTATAKKGKAKVSGK
        KK     G G+    ATAK  K   + K
Subjt:  KKNATGKGPGSDPTTATAKKGKAKVSGK

A0A5A7V6N0 Gag/pol protein3.3e-3262.42Show/hide
Query:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK
        MVHFN+AE NGAVIDE +QV FILESLP+SFLQFRSNAVMNK+ Y LTTLLNELQT++SL+K KGQ+GEANVATS +KFH GSTSG KS+PSSSG+K +K
Subjt:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK

Query:  KKNATGKGPGSDPTTATAKKGKAKVSGKESVSNATWTGIGNLDVREMTL
        KK     G G+    A AK  K K    + +S  T    G  + R  TL
Subjt:  KKNATGKGPGSDPTTATAKKGKAKVSGKESVSNATWTGIGNLDVREMTL

A0A5D3BE74 Gag/pol protein3.3e-3258.24Show/hide
Query:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK
        MVHFNMAE N AVIDE +QV FILESLP+SFLQFRSNAVMNK+ Y LTTLLNELQT++SL+K KGQ+GEANVATS +KF+ GS SG KS+P SS +K  K
Subjt:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK

Query:  KKNATGKGPGSDPTTATAK-KGKAKVSGKESVSN---ATWTGIGN---LDVREMTLKVGTEEVVSTVVVG
        KK     G G+    A AK   KAK +     +N   +++ GI +   L+  EMT++VGT  VVS + VG
Subjt:  KKNATGKGPGSDPTTATAK-KGKAKVSGKESVSN---ATWTGIGN---LDVREMTLKVGTEEVVSTVVVG

A0A5D3CPJ6 Gag/pol protein7.3e-3267.97Show/hide
Query:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK
        MVHFN+AE NGAVIDE +QV FILESLP+SFLQFRSNAVMNK+ Y LTTLLNELQT++SL+K KGQ+GEANVATS +KFH GSTSG KS+PSSSG+K +K
Subjt:  MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATS-KKFHLGSTSGAKSVPSSSGSKTFK

Query:  KKNATGKGPGSDPTTATAKKGKAKVSGK
        KK     G G+    A AK  K   + K
Subjt:  KKNATGKGPGSDPTTATAKKGKAKVSGK

A0A6J1D3B7 uncharacterized protein LOC1110166192.3e-3340.43Show/hide
Query:  MKEKAPLKFKLPTLTQYDSSTDPIDHLNAYREWEDIYGITEAIRCRVFSFTLTGSTCI------------------------------------------
        MKEK P KFKLP + +YD STDPIDHL+ Y EW DIYGITEAIRCRVFSFTLTGST I                                          
Subjt:  MKEKAPLKFKLPTLTQYDSSTDPIDHLNAYREWEDIYGITEAIRCRVFSFTLTGSTCI------------------------------------------

Query:  ----------------------------------------------------------QVQKYISVGELIHSRHDPERECADRTVKRERLGEKRHGSSWE
                                                                  + Q Y+SVGELIHS+ DP+ + AD   KR+R GEKRHG  WE
Subjt:  ----------------------------------------------------------QVQKYISVGELIHSRHDPERECADRTVKRERLGEKRHGSSWE

Query:  KSDSGHGRLAQQDPQRKFERYTPTVAPLEK
        +SDSG GRL Q+DPQRKFE+YTPT  PLE+
Subjt:  KSDSGHGRLAQQDPQRKFERYTPTVAPLEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCACTTCAATATGGCGGAGTCAAACGGGGCCGTCATAGACGAGCAGACTCAGGTCATCTTTATTCTGGAATCTCTTCCGAAGAGTTTCCTACAATTCCGCAGCAA
TGCGGTTATGAATAAGCTGGAATACCCTCTTACCACGCTCTTAAATGAACTACAGACTTACCAGTCTCTTATAAAAAATAAGGGACAAGAAGGAGAGGCAAACGTTGCCA
CCTCGAAAAAGTTCCACCTAGGTTCGACCTCTGGAGCCAAGTCTGTGCCATCTTCTTCTGGAAGTAAGACTTTCAAGAAGAAGAATGCCACTGGTAAGGGGCCTGGATCT
GACCCCACTACTGCTACTGCCAAGAAAGGCAAGGCCAAGGTTTCAGGAAAGGAAAGTGTTTCCAATGCAACATGGACGGGCATTGGAAACCTTGACGTTAGAGAGATGAC
TCTCAAGGTCGGAACAGAAGAGGTCGTCTCAACTGTGGTGGTAGGGGAGGTAGTGATAAACGAGATTTTCGAAGAAGCTACAAACGTGTCAACAAGAGATGTTGATCAAG
CTGACACTTTAACAAGAGTTGTTAATGAAGGCAGCATATCACGTCAGTCACATCCACCTGAATGCCTCGACGTAGTGGAAGGAATGTACCACAACCTGACTGCTACGAGT
TGGGGAGACAACCAGCCATGCACAGAGCAAGTTGAAGATTGGAGGAAACCACAAACCTCAAACCAACGTCCTCGACCGATCAAGCCGGCAAGCAGCTATACGACAACAGA
AGTTGGTCATGTCAAACAAAAGCAAGAGAGCAACCCAGTTGACCGAGTCGGCCTCAGGCATACTACTACCAGTAAAAGAAGACCACCGGATCGATTAGAGGTACGATCAG
AAGGACACCAAGACAAGAAATTTGGTCTCGATGAGCTGATGGACCAGGCAAACTCTCTGTTTACAGAGAAGATAATGAAGGAAAAAGCTCCTCTAAAGTTCAAGCTACCC
ACCCTCACACAATATGACAGTTCGACAGATCCCATCGATCACCTCAATGCATACCGAGAATGGGAAGATATTTACGGCATAACGGAAGCCATTAGATGCAGAGTGTTTTC
TTTCACTCTGACTGGATCGACATGCATCCAAGTCCAGAAATACATTAGTGTTGGAGAATTAATTCACTCAAGGCATGACCCTGAAAGAGAATGTGCGGATCGAACCGTAA
AGAGGGAGAGGCTCGGAGAAAAGCGACATGGATCAAGCTGGGAGAAGTCAGATAGTGGACATGGTCGACTAGCCCAGCAGGATCCTCAACGAAAATTCGAAAGATACACT
CCAACAGTTGCCCCGCTCGAGAAGAAAGTAGAGATCAGACTAAGCAACCGAAAAGAGGAAAGAGAACCAAAGAAGGAGCGGACCCGAAGAGAGGTAAGTGGATCGGGGCC
CAGCAACCCGGGTCGGGTAGCTAAGCAGTGCTCGGCGTCAAGAGATCGTGTGGGTTTACTCGTGGAGGATCACTGGAGGATGCTGTGGGTGTTGGTGAAGCCGTCACGTG
GCTGGGGTGATAACGCAGTTCGTAGGTCGCAGATATGGGCTTGTGCCACTGCTCATGGGCCATTGTTCGAGCCACTTGCTGTGCTGCACATCGCCAGAGAAGGAGGGCGT
GGCGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCCACTTCAATATGGCGGAGTCAAACGGGGCCGTCATAGACGAGCAGACTCAGGTCATCTTTATTCTGGAATCTCTTCCGAAGAGTTTCCTACAATTCCGCAGCAA
TGCGGTTATGAATAAGCTGGAATACCCTCTTACCACGCTCTTAAATGAACTACAGACTTACCAGTCTCTTATAAAAAATAAGGGACAAGAAGGAGAGGCAAACGTTGCCA
CCTCGAAAAAGTTCCACCTAGGTTCGACCTCTGGAGCCAAGTCTGTGCCATCTTCTTCTGGAAGTAAGACTTTCAAGAAGAAGAATGCCACTGGTAAGGGGCCTGGATCT
GACCCCACTACTGCTACTGCCAAGAAAGGCAAGGCCAAGGTTTCAGGAAAGGAAAGTGTTTCCAATGCAACATGGACGGGCATTGGAAACCTTGACGTTAGAGAGATGAC
TCTCAAGGTCGGAACAGAAGAGGTCGTCTCAACTGTGGTGGTAGGGGAGGTAGTGATAAACGAGATTTTCGAAGAAGCTACAAACGTGTCAACAAGAGATGTTGATCAAG
CTGACACTTTAACAAGAGTTGTTAATGAAGGCAGCATATCACGTCAGTCACATCCACCTGAATGCCTCGACGTAGTGGAAGGAATGTACCACAACCTGACTGCTACGAGT
TGGGGAGACAACCAGCCATGCACAGAGCAAGTTGAAGATTGGAGGAAACCACAAACCTCAAACCAACGTCCTCGACCGATCAAGCCGGCAAGCAGCTATACGACAACAGA
AGTTGGTCATGTCAAACAAAAGCAAGAGAGCAACCCAGTTGACCGAGTCGGCCTCAGGCATACTACTACCAGTAAAAGAAGACCACCGGATCGATTAGAGGTACGATCAG
AAGGACACCAAGACAAGAAATTTGGTCTCGATGAGCTGATGGACCAGGCAAACTCTCTGTTTACAGAGAAGATAATGAAGGAAAAAGCTCCTCTAAAGTTCAAGCTACCC
ACCCTCACACAATATGACAGTTCGACAGATCCCATCGATCACCTCAATGCATACCGAGAATGGGAAGATATTTACGGCATAACGGAAGCCATTAGATGCAGAGTGTTTTC
TTTCACTCTGACTGGATCGACATGCATCCAAGTCCAGAAATACATTAGTGTTGGAGAATTAATTCACTCAAGGCATGACCCTGAAAGAGAATGTGCGGATCGAACCGTAA
AGAGGGAGAGGCTCGGAGAAAAGCGACATGGATCAAGCTGGGAGAAGTCAGATAGTGGACATGGTCGACTAGCCCAGCAGGATCCTCAACGAAAATTCGAAAGATACACT
CCAACAGTTGCCCCGCTCGAGAAGAAAGTAGAGATCAGACTAAGCAACCGAAAAGAGGAAAGAGAACCAAAGAAGGAGCGGACCCGAAGAGAGGTAAGTGGATCGGGGCC
CAGCAACCCGGGTCGGGTAGCTAAGCAGTGCTCGGCGTCAAGAGATCGTGTGGGTTTACTCGTGGAGGATCACTGGAGGATGCTGTGGGTGTTGGTGAAGCCGTCACGTG
GCTGGGGTGATAACGCAGTTCGTAGGTCGCAGATATGGGCTTGTGCCACTGCTCATGGGCCATTGTTCGAGCCACTTGCTGTGCTGCACATCGCCAGAGAAGGAGGGCGT
GGCGATTGA
Protein sequenceShow/hide protein sequence
MVHFNMAESNGAVIDEQTQVIFILESLPKSFLQFRSNAVMNKLEYPLTTLLNELQTYQSLIKNKGQEGEANVATSKKFHLGSTSGAKSVPSSSGSKTFKKKNATGKGPGS
DPTTATAKKGKAKVSGKESVSNATWTGIGNLDVREMTLKVGTEEVVSTVVVGEVVINEIFEEATNVSTRDVDQADTLTRVVNEGSISRQSHPPECLDVVEGMYHNLTATS
WGDNQPCTEQVEDWRKPQTSNQRPRPIKPASSYTTTEVGHVKQKQESNPVDRVGLRHTTTSKRRPPDRLEVRSEGHQDKKFGLDELMDQANSLFTEKIMKEKAPLKFKLP
TLTQYDSSTDPIDHLNAYREWEDIYGITEAIRCRVFSFTLTGSTCIQVQKYISVGELIHSRHDPERECADRTVKRERLGEKRHGSSWEKSDSGHGRLAQQDPQRKFERYT
PTVAPLEKKVEIRLSNRKEEREPKKERTRREVSGSGPSNPGRVAKQCSASRDRVGLLVEDHWRMLWVLVKPSRGWGDNAVRRSQIWACATAHGPLFEPLAVLHIAREGGR
GD