; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g18690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g18690
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr8:14104425..14104877
RNA-Seq ExpressionMoc08g18690
SyntenyMoc08g18690
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151295.1 uncharacterized protein LOC111019259 [Momordica charantia]5.9e-5575.33Show/hide
Query:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS
        MS SIIALLA +KLNSENYKQ KSNLNIILVI+DLRFVLQE+ P APA +ATVAV   YDRWIKAND+A+VYI  SIS+VL KKHE+ +TAKEIMDSLQS
Subjt:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS

Query:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID
        MFG  SSQA+HE  KF+YNS M EG S+REHVLNLM+HFN+AE+N+A+ID
Subjt:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID

XP_022152352.1 uncharacterized protein LOC111020095 [Momordica charantia]1.5e-6689.33Show/hide
Query:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS
        MSTSIIALLAAQ+LN ENYKQWKSNLN ILVIDDL+FVLQEDCPQA APNATVAVR  YDRWIKAND+AKVYI ASISDVL KKHEDTITAKEIMDSLQS
Subjt:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS

Query:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID
        MFG  SSQARHEA KFIYNSRM EGSS+REHVLNLMVHFNVAESN AVID
Subjt:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID

XP_022154837.1 uncharacterized protein LOC111022000 [Momordica charantia]5.1e-5186.55Show/hide
Query:  VIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQSMFGPLSSQARHEAFKFIYNSRMNEGSSIRE
        +IDDLRFVLQEDCPQAPAPNAT+AVRN YDRWIKAND+AKVYI +SISDVL KKHEDT+TAKEIMDSLQSMFG  SSQARHEA KF+YNSRM +GSS+RE
Subjt:  VIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQSMFGPLSSQARHEAFKFIYNSRMNEGSSIRE

Query:  HVLNLMVHFNVAESNKAVI
        HVLNLMVHFNVAESN AVI
Subjt:  HVLNLMVHFNVAESNKAVI

XP_022158062.1 uncharacterized protein LOC111024637 [Momordica charantia]1.9e-6182Show/hide
Query:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS
        MSTSII LL AQKLN ENYKQWKSN+N IL+IDDLRFVLQEDCPQAPAPNATVAVRN+YDRWIKAND+AKV I ASISDVL KKHE+++  KEIMDSLQS
Subjt:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS

Query:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID
        MFG  SSQARHEA   IYNSRM + SS+REHVLNLMVHFNVAESN  VID
Subjt:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]1.4e-6485.33Show/hide
Query:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS
        MS SIIALLAAQKLN ENY+QWKSNLN ILVIDDLRFVLQEDCPQAP  NATVAVRN YDRWIK+ND+AKVYI ASISDVL KKHEDT+T KEIMDSLQS
Subjt:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS

Query:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID
        MFG  S QARHEA KF+YNSRM EGSS+REHVLNLMVHFNVAESN  VID
Subjt:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID

TrEMBL top hitse value%identityAlignment
A0A6J1DAT1 uncharacterized protein LOC1110192592.8e-5575.33Show/hide
Query:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS
        MS SIIALLA +KLNSENYKQ KSNLNIILVI+DLRFVLQE+ P APA +ATVAV   YDRWIKAND+A+VYI  SIS+VL KKHE+ +TAKEIMDSLQS
Subjt:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS

Query:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID
        MFG  SSQA+HE  KF+YNS M EG S+REHVLNLM+HFN+AE+N+A+ID
Subjt:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID

A0A6J1DFZ2 uncharacterized protein LOC1110200957.2e-6789.33Show/hide
Query:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS
        MSTSIIALLAAQ+LN ENYKQWKSNLN ILVIDDL+FVLQEDCPQA APNATVAVR  YDRWIKAND+AKVYI ASISDVL KKHEDTITAKEIMDSLQS
Subjt:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS

Query:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID
        MFG  SSQARHEA KFIYNSRM EGSS+REHVLNLMVHFNVAESN AVID
Subjt:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID

A0A6J1DMS3 uncharacterized protein LOC1110220002.5e-5186.55Show/hide
Query:  VIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQSMFGPLSSQARHEAFKFIYNSRMNEGSSIRE
        +IDDLRFVLQEDCPQAPAPNAT+AVRN YDRWIKAND+AKVYI +SISDVL KKHEDT+TAKEIMDSLQSMFG  SSQARHEA KF+YNSRM +GSS+RE
Subjt:  VIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQSMFGPLSSQARHEAFKFIYNSRMNEGSSIRE

Query:  HVLNLMVHFNVAESNKAVI
        HVLNLMVHFNVAESN AVI
Subjt:  HVLNLMVHFNVAESNKAVI

A0A6J1DW68 uncharacterized protein LOC1110246379.1e-6282Show/hide
Query:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS
        MSTSII LL AQKLN ENYKQWKSN+N IL+IDDLRFVLQEDCPQAPAPNATVAVRN+YDRWIKAND+AKV I ASISDVL KKHE+++  KEIMDSLQS
Subjt:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS

Query:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID
        MFG  SSQARHEA   IYNSRM + SS+REHVLNLMVHFNVAESN  VID
Subjt:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID

A0A6J1DWL0 uncharacterized protein LOC1110247346.8e-6585.33Show/hide
Query:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS
        MS SIIALLAAQKLN ENY+QWKSNLN ILVIDDLRFVLQEDCPQAP  NATVAVRN YDRWIK+ND+AKVYI ASISDVL KKHEDT+T KEIMDSLQS
Subjt:  MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQS

Query:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID
        MFG  S QARHEA KF+YNSRM EGSS+REHVLNLMVHFNVAESN  VID
Subjt:  MFGPLSSQARHEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACTTCTATTATTGCACTCCTAGCCGCCCAAAAACTTAACAGCGAGAATTACAAACAATGGAAATCGAATCTAAACATTATTCTTGTGATAGATGATCTTAGGTT
CGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCAGTGCGCAACGTCTATGATAGATGGATCAAGGCCAATGACCAGGCGAAGGTCTACATCT
TTGCGAGCATATCTGATGTGCTGACTAAGAAGCACGAGGACACAATCACCGCTAAGGAGATCATGGACTCACTGCAGAGCATGTTTGGACCACTGTCTTCACAGGCTCGA
CACGAAGCCTTTAAGTTCATTTACAACTCCCGCATGAATGAGGGCTCCTCAATACGAGAACACGTTCTCAACCTGATGGTCCACTTCAACGTGGCAGAGTCGAACAAGGC
TGTCATAGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTACTTCTATTATTGCACTCCTAGCCGCCCAAAAACTTAACAGCGAGAATTACAAACAATGGAAATCGAATCTAAACATTATTCTTGTGATAGATGATCTTAGGTT
CGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCAGTGCGCAACGTCTATGATAGATGGATCAAGGCCAATGACCAGGCGAAGGTCTACATCT
TTGCGAGCATATCTGATGTGCTGACTAAGAAGCACGAGGACACAATCACCGCTAAGGAGATCATGGACTCACTGCAGAGCATGTTTGGACCACTGTCTTCACAGGCTCGA
CACGAAGCCTTTAAGTTCATTTACAACTCCCGCATGAATGAGGGCTCCTCAATACGAGAACACGTTCTCAACCTGATGGTCCACTTCAACGTGGCAGAGTCGAACAAGGC
TGTCATAGACTAG
Protein sequenceShow/hide protein sequence
MSTSIIALLAAQKLNSENYKQWKSNLNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDQAKVYIFASISDVLTKKHEDTITAKEIMDSLQSMFGPLSSQAR
HEAFKFIYNSRMNEGSSIREHVLNLMVHFNVAESNKAVID