; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g08080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g08080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr3:5640526..5645369
RNA-Seq ExpressionMoc03g08080
SyntenyMoc03g08080
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026069.1 gag/pol protein [Cucumis melo var. makuwa]5.0e-2555.8Show/hide
Query:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR-EFED
        M EG SV EHVL MMVHFNV EMN AVI+E S VSFIL+SLP+SFLQFRSNA+MNKI Y LTT LN+LQT+ SL+K KGQ  EAN+  S++ FHR     
Subjt:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR-EFED

Query:  VQHMCLRQIQRNWNAFK---------AISKTTKVVDTA
         + M      + W   K         A +KTTK   TA
Subjt:  VQHMCLRQIQRNWNAFK---------AISKTTKVVDTA

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]8.6e-2570.53Show/hide
Query:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR
        M EG SV EHVL MMVHFNV EMN AVI+E S VSFIL+SLP+SFLQFRSNA+MNKI Y LTT LN+LQT+ SL+K KGQ  EAN+  S++ FHR
Subjt:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR

KAA0045330.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]3.0e-2570.53Show/hide
Query:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR
        M EG SV EHVL MMVHFN+ EMN AVI+E S VSFILKSLP+SFLQFRSNA+MNKI Y LTT LN+LQT+ SL+K KGQ  EAN+  S++ FHR
Subjt:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]8.6e-2570.53Show/hide
Query:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR
        M EG SV EHVL MMVHFNV EMN AVI+E S VSFIL+SLP+SFLQFRSNA+MNKI Y LTT LN+LQT+ SL+K KGQ  EAN+  S++ FHR
Subjt:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]8.6e-2570.53Show/hide
Query:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR
        M EG SV EHVL MMVHFNV EMN AVI+E S VSFIL+SLP+SFLQFRSNA+MNKI Y LTT LN+LQT+ SL+K KGQ  EAN+  S++ FHR
Subjt:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR

TrEMBL top hitse value%identityAlignment
A0A5A7SLD1 Gag/pol protein2.4e-2555.8Show/hide
Query:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR-EFED
        M EG SV EHVL MMVHFNV EMN AVI+E S VSFIL+SLP+SFLQFRSNA+MNKI Y LTT LN+LQT+ SL+K KGQ  EAN+  S++ FHR     
Subjt:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR-EFED

Query:  VQHMCLRQIQRNWNAFK---------AISKTTKVVDTA
         + M      + W   K         A +KTTK   TA
Subjt:  VQHMCLRQIQRNWNAFK---------AISKTTKVVDTA

A0A5A7SMH8 Gag/pol protein4.2e-2570.53Show/hide
Query:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR
        M EG SV EHVL MMVHFNV EMN AVI+E S VSFIL+SLP+SFLQFRSNA+MNKI Y LTT LN+LQT+ SL+K KGQ  EAN+  S++ FHR
Subjt:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR

A0A5A7TQ86 Retrotransposon protein, putative, Ty1-copia subclass1.4e-2570.53Show/hide
Query:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR
        M EG SV EHVL MMVHFN+ EMN AVI+E S VSFILKSLP+SFLQFRSNA+MNKI Y LTT LN+LQT+ SL+K KGQ  EAN+  S++ FHR
Subjt:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR

A0A5A7V4M1 Gag/pol protein4.2e-2570.53Show/hide
Query:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR
        M EG SV EHVL MMVHFNV EMN AVI+E S VSFIL+SLP+SFLQFRSNA+MNKI Y LTT LN+LQT+ SL+K KGQ  EAN+  S++ FHR
Subjt:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR

A0A5D3CPJ6 Gag/pol protein4.2e-2570.53Show/hide
Query:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR
        M EG SV EHVL MMVHFNV EMN AVI+E S VSFIL+SLP+SFLQFRSNA+MNKI Y LTT LN+LQT+ SL+K KGQ  EAN+  S++ FHR
Subjt:  MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANI-VSSKNFHR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGAAGGAACTTCTGTCTGGGAACATGTTCTTGGTATGATGGTCCATTTCAACGTGGTAGAGATGAACAGTGCTGTCATAAATGAGTTTAGCCTCGTCAGC
TTTATTCTGAAATCTCTTCCGAAGAGTTTTCTCCAATTTCGTAGCAATGCTATTATGAACAAAATTGATTACAACCTTACGACCCAACTAAACAAACTCCAAACT
TACCGGTCCCTGATAAAAGCAAAGGGACAAGACAAGGAGGCAAATATTGTATCTTCCAAAAATTTCCACAGAGAGTTTGAGGATGTCCAGCATATGTGCTTACGA
CAAATCCAAAGAAACTGGAACGCGTTCAAAGCTATCAGTAAGACAACAAAAGTTGTTGACACTGCTGGCCCATCAACAAGGGTTGTTGACGGAGCTAGTACATCG
AGTCAACATAATTCTTCTCAAGATCAGAGAAGGCCTCGACGTAGTAGGAGGTATCACGTGCAATGGACAAAAGGCGGGAGTATTTCTACTATAGAGCTTGGTTCT
TGGACTGTAAGATTGAGGTTCCAGCCTCAGATGCAGCGACATTGGAAGGAGAATTCTCATCCCTCACTAAAACATAGCCCACGAACGTGCCTCACTGAAGTGGCT
GACTTCCTAGGAGTTACAACGTCGGAAGGCTCTACACAAACCTGGATTTGGTCCCCAGATGAGATGCCTAAAGTTGAATGGAATCCCCTTCATCAGGCAATATAT
GAGCAGCATATGGACCCTCTGGATGAGTCACGAGAAGTAGGCATGAACCGGTTCTTCACCATATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGAAGGAACTTCTGTCTGGGAACATGTTCTTGGTATGATGGTCCATTTCAACGTGGTAGAGATGAACAGTGCTGTCATAAATGAGTTTAGCCTCGTCAGC
TTTATTCTGAAATCTCTTCCGAAGAGTTTTCTCCAATTTCGTAGCAATGCTATTATGAACAAAATTGATTACAACCTTACGACCCAACTAAACAAACTCCAAACT
TACCGGTCCCTGATAAAAGCAAAGGGACAAGACAAGGAGGCAAATATTGTATCTTCCAAAAATTTCCACAGAGAGTTTGAGGATGTCCAGCATATGTGCTTACGA
CAAATCCAAAGAAACTGGAACGCGTTCAAAGCTATCAGTAAGACAACAAAAGTTGTTGACACTGCTGGCCCATCAACAAGGGTTGTTGACGGAGCTAGTACATCG
AGTCAACATAATTCTTCTCAAGATCAGAGAAGGCCTCGACGTAGTAGGAGGTATCACGTGCAATGGACAAAAGGCGGGAGTATTTCTACTATAGAGCTTGGTTCT
TGGACTGTAAGATTGAGGTTCCAGCCTCAGATGCAGCGACATTGGAAGGAGAATTCTCATCCCTCACTAAAACATAGCCCACGAACGTGCCTCACTGAAGTGGCT
GACTTCCTAGGAGTTACAACGTCGGAAGGCTCTACACAAACCTGGATTTGGTCCCCAGATGAGATGCCTAAAGTTGAATGGAATCCCCTTCATCAGGCAATATAT
GAGCAGCATATGGACCCTCTGGATGAGTCACGAGAAGTAGGCATGAACCGGTTCTTCACCATATAG
Protein sequenceShow/hide protein sequence
MKEGTSVWEHVLGMMVHFNVVEMNSAVINEFSLVSFILKSLPKSFLQFRSNAIMNKIDYNLTTQLNKLQTYRSLIKAKGQDKEANIVSSKNFHREFEDVQHMCLR
QIQRNWNAFKAISKTTKVVDTAGPSTRVVDGASTSSQHNSSQDQRRPRRSRRYHVQWTKGGSISTIELGSWTVRLRFQPQMQRHWKENSHPSLKHSPRTCLTEVA
DFLGVTTSEGSTQTWIWSPDEMPKVEWNPLHQAIYEQHMDPLDESREVGMNRFFTI