; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g01990 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g01990
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr1:1336678..1341620
RNA-Seq ExpressionMoc01g01990
SyntenyMoc01g01990
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151295.1 uncharacterized protein LOC111019259 [Momordica charantia]1.0e-3979.28Show/hide
Query:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN
        ++FVLQE+ P APA +ATVAV   YDRWIKANDKA+VYIL SIS+VLAKKHE+ VTAKEIMDSLQSMFGQSSSQA+HE LKF+YNS MKEG SVREHVLN
Subjt:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN

Query:  LMVHFNVAEWN
        LM+HFN+AE N
Subjt:  LMVHFNVAEWN

XP_022152352.1 uncharacterized protein LOC111020095 [Momordica charantia]2.5e-4993.75Show/hide
Query:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN
        +KFVLQEDCPQA APNATVAVR AYDRWIKANDKAKVYILASISDVLAKKHEDT+TAKEIMDSLQSMFGQ SSQARHEALKFIYNSRMKEG SVREHVLN
Subjt:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN

Query:  LMVHFNVAEWNG
        LMVHFNVAE NG
Subjt:  LMVHFNVAEWNG

XP_022154837.1 uncharacterized protein LOC111022000 [Momordica charantia]4.2e-4991.89Show/hide
Query:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN
        ++FVLQEDCPQAPAPNAT+AVRNAYDRWIKANDKAKVYIL+SISDVLAKKHEDTVTAKEIMDSLQSMFGQ SSQARHEALKF+YNSRMK+G SVREHVLN
Subjt:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN

Query:  LMVHFNVAEWN
        LMVHFNVAE N
Subjt:  LMVHFNVAEWN

XP_022158062.1 uncharacterized protein LOC111024637 [Momordica charantia]1.2e-4386.49Show/hide
Query:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN
        ++FVLQEDCPQAPAPNATVAVRN YDRWIKANDKAKV ILASISDVLAKKHE++V  KEIMDSLQSMFGQ SSQARHEAL  IYNSRMK+  SVREHVLN
Subjt:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN

Query:  LMVHFNVAEWN
        LMVHFNVAE N
Subjt:  LMVHFNVAEWN

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]4.6e-4890.18Show/hide
Query:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN
        ++FVLQEDCPQAP  NATVAVRNAYDRWIK+NDKAKVYILASISDVLAKKHEDTVT KEIMDSLQSMFGQ S QARHEALKF+YNSRMKEG SVREHVLN
Subjt:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN

Query:  LMVHFNVAEWNG
        LMVHFNVAE NG
Subjt:  LMVHFNVAEWNG

TrEMBL top hitse value%identityAlignment
A0A6J1DAT1 uncharacterized protein LOC1110192595.0e-4079.28Show/hide
Query:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN
        ++FVLQE+ P APA +ATVAV   YDRWIKANDKA+VYIL SIS+VLAKKHE+ VTAKEIMDSLQSMFGQSSSQA+HE LKF+YNS MKEG SVREHVLN
Subjt:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN

Query:  LMVHFNVAEWN
        LM+HFN+AE N
Subjt:  LMVHFNVAEWN

A0A6J1DFZ2 uncharacterized protein LOC1110200951.2e-4993.75Show/hide
Query:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN
        +KFVLQEDCPQA APNATVAVR AYDRWIKANDKAKVYILASISDVLAKKHEDT+TAKEIMDSLQSMFGQ SSQARHEALKFIYNSRMKEG SVREHVLN
Subjt:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN

Query:  LMVHFNVAEWNG
        LMVHFNVAE NG
Subjt:  LMVHFNVAEWNG

A0A6J1DMS3 uncharacterized protein LOC1110220002.0e-4991.89Show/hide
Query:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN
        ++FVLQEDCPQAPAPNAT+AVRNAYDRWIKANDKAKVYIL+SISDVLAKKHEDTVTAKEIMDSLQSMFGQ SSQARHEALKF+YNSRMK+G SVREHVLN
Subjt:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN

Query:  LMVHFNVAEWN
        LMVHFNVAE N
Subjt:  LMVHFNVAEWN

A0A6J1DW68 uncharacterized protein LOC1110246375.7e-4486.49Show/hide
Query:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN
        ++FVLQEDCPQAPAPNATVAVRN YDRWIKANDKAKV ILASISDVLAKKHE++V  KEIMDSLQSMFGQ SSQARHEAL  IYNSRMK+  SVREHVLN
Subjt:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN

Query:  LMVHFNVAEWN
        LMVHFNVAE N
Subjt:  LMVHFNVAEWN

A0A6J1DWL0 uncharacterized protein LOC1110247342.2e-4890.18Show/hide
Query:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN
        ++FVLQEDCPQAP  NATVAVRNAYDRWIK+NDKAKVYILASISDVLAKKHEDTVT KEIMDSLQSMFGQ S QARHEALKF+YNSRMKEG SVREHVLN
Subjt:  VKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLN

Query:  LMVHFNVAEWNG
        LMVHFNVAE NG
Subjt:  LMVHFNVAEWNG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGTCGGCTGCTCCGCATCGCGATGACGTTGTGACAGAACTCAGAACCAAACTCGGTGTGGTTTCCTTCCCCACTCGTTTGAGAGGGAATGTCAAGATAGCT
ATGTGCTACCACAAGCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGATCCGGTGGTGGTGTTCGAGGGGAACTCACTGAAGAAACGTTCTTCAAAGTCGTTC
GTGGCGTCGGATTGGGCAAAAAGTTGCAGAAAACGGCAAAGAAGACGAAGCAGACTGCACAGACAGCACACAGCGCCACGGCGCTGCACTGTCGCGTCGCGGCGC
TGTGTAGCACCATGGCGCCATGCTAGGGCGCCGCGGCGCTGCTGCTGCAGCATTTTGCTGCCATTAGGCGCCGAGGCGCTGTCCCGAGTGTCTTTCGACCCGGTT
CCGAAGCTCCGGTTCGCGGTTCGAGGGCGGATGCAGTCGGATTATTGGGGTGGACCTCTGAGGTCCGAAAATGTTGGGTCACACTTACGAGGAGTTGTTAAGTTC
GTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGCCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTAC
ATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATCATGGACTCGCTGCAAAGCATGTTTGGACAATCGTCCTCA
CAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTTCTCAGTGCGAGAACACGTTCTCAACCTGATGGTTCACTTCAATGTGGCT
GAGTGGAACGGAGGCAGCTTGACGCCGGAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGACGTCGGCTGCTCCGCATCGCGATGACGTTGTGACAGAACTCAGAACCAAACTCGGTGTGGTTTCCTTCCCCACTCGTTTGAGAGGGAATGTCAAGATAGCT
ATGTGCTACCACAAGCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGATCCGGTGGTGGTGTTCGAGGGGAACTCACTGAAGAAACGTTCTTCAAAGTCGTTC
GTGGCGTCGGATTGGGCAAAAAGTTGCAGAAAACGGCAAAGAAGACGAAGCAGACTGCACAGACAGCACACAGCGCCACGGCGCTGCACTGTCGCGTCGCGGCGC
TGTGTAGCACCATGGCGCCATGCTAGGGCGCCGCGGCGCTGCTGCTGCAGCATTTTGCTGCCATTAGGCGCCGAGGCGCTGTCCCGAGTGTCTTTCGACCCGGTT
CCGAAGCTCCGGTTCGCGGTTCGAGGGCGGATGCAGTCGGATTATTGGGGTGGACCTCTGAGGTCCGAAAATGTTGGGTCACACTTACGAGGAGTTGTTAAGTTC
GTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGCCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTAC
ATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATCATGGACTCGCTGCAAAGCATGTTTGGACAATCGTCCTCA
CAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTTCTCAGTGCGAGAACACGTTCTCAACCTGATGGTTCACTTCAATGTGGCT
GAGTGGAACGGAGGCAGCTTGACGCCGGAGAGATGA
Protein sequenceShow/hide protein sequence
MTSAAPHRDDVVTELRTKLGVVSFPTRLRGNVKIAMCYHKHDPETQEDSEEDPVVVFEGNSLKKRSSKSFVASDWAKSCRKRQRRRSRLHRQHTAPRRCTVASRR
CVAPWRHARAPRRCCCSILLPLGAEALSRVSFDPVPKLRFAVRGRMQSDYWGGPLRSENVGSHLRGVVKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVY
ILASISDVLAKKHEDTVTAKEIMDSLQSMFGQSSSQARHEALKFIYNSRMKEGFSVREHVLNLMVHFNVAEWNGGSLTPER