CuGenDBv2

Moc06g17850 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g17850
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr6:13961776..13966925
RNA-Seq Expression: Moc06g17850
Synteny: Moc06g17850
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
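
The genome location string can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr6:13961776..13966925")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 5150 bp on chr6.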


Homology
GenBank top hits    e value    % identity    Alignment
XP_022151295.1 uncharacterized protein LOC111019259 [Momordica charantia]    2.7e-34    73.04
Query:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM
        FVLQE+ P APA +ATVAV   Y RWI    KA+VYIL SIS+VLAKKHE+ VTAKEI+DSLQSM GQ SSQA+HE LKFVYNS MKEG S+REHVLNLM
Subjt:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM

Query:  VHFKVAELNGAIIDE
        +HF +AE N AIIDE
Subjt:  VHFKVAELNGAIIDE
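
The reported % identity can be checked against the alignment midline, where a letter marks an identical column, `+` a conservative substitution, and a space a mismatch or gap. The sketch below assumes identity is the number of identical columns divided by the total number of alignment columns, gap columns included; that convention is an assumption, but it reproduces the 73.04 reported for this hit. Only the midline is used, because the Subjt lines as displayed repeat the query. `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midline_blocks: list[str]) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    midline = "".join(midline_blocks)
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Midline blocks of the XP_022151295.1 alignment, copied from the record.
blocks = [
    "FVLQE+ P APA +ATVAV   Y RWI    KA+VYIL SIS+VLAKKHE+ VTAKEI+DSLQSM GQ SSQA+HE LKFVYNS MKEG S+REHVLNLM",
    "+HF +AE N AIIDE",
]

print(f"{percent_identity(blocks):.2f}")
```

Here 84 of 115 columns are identical.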

XP_022152352.1 uncharacterized protein LOC111020095 [Momordica charantia]    2.6e-45    86.09
Query:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM
        FVLQEDCPQA APNATVAVR  Y RWI    KAKVYILASISDVLAKKHEDT+TAKEI+DSLQSM GQPSSQARHEALKF+YNSRMKEGSS+REHVLNLM
Subjt:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM

Query:  VHFKVAELNGAIIDE
        VHF VAE NGA+IDE
Subjt:  VHFKVAELNGAIIDE

XP_022154837.1 uncharacterized protein LOC111022000 [Momordica charantia]    1.3e-44    85.22
Query:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM
        FVLQEDCPQAPAPNAT+AVRN Y RWI    KAKVYIL+SISDVLAKKHEDTVTAKEI+DSLQSM GQPSSQARHEALKFVYNSRMK+GSS+REHVLNLM
Subjt:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM

Query:  VHFKVAELNGAIIDE
        VHF VAE N A+I E
Subjt:  VHFKVAELNGAIIDE

XP_022158062.1 uncharacterized protein LOC111024637 [Momordica charantia]    4.3e-40    79.13
Query:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM
        FVLQEDCPQAPAPNATVAVRN+Y RWI    KAKV ILASISDVLAKKHE++V  KEI+DSLQSM GQPSSQARHEAL  +YNSRMK+ SS+REHVLNLM
Subjt:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM

Query:  VHFKVAELNGAIIDE
        VHF VAE N  +IDE
Subjt:  VHFKVAELNGAIIDE

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]    2.2e-44    85.22
Query:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM
        FVLQEDCPQAP  NATVAVRN Y RWI    KAKVYILASISDVLAKKHEDTVT KEI+DSLQSM GQPS QARHEALKFVYNSRMKEGSS+REHVLNLM
Subjt:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM

Query:  VHFKVAELNGAIIDE
        VHF VAE NG +IDE
Subjt:  VHFKVAELNGAIIDE

TrEMBL top hits    e value    % identity    Alignment
A0A6J1DAT1 uncharacterized protein LOC111019259    1.3e-34    73.04
Query:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM
        FVLQE+ P APA +ATVAV   Y RWI    KA+VYIL SIS+VLAKKHE+ VTAKEI+DSLQSM GQ SSQA+HE LKFVYNS MKEG S+REHVLNLM
Subjt:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM

Query:  VHFKVAELNGAIIDE
        +HF +AE N AIIDE
Subjt:  VHFKVAELNGAIIDE

A0A6J1DFZ2 uncharacterized protein LOC111020095    1.3e-45    86.09
Query:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM
        FVLQEDCPQA APNATVAVR  Y RWI    KAKVYILASISDVLAKKHEDT+TAKEI+DSLQSM GQPSSQARHEALKF+YNSRMKEGSS+REHVLNLM
Subjt:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM

Query:  VHFKVAELNGAIIDE
        VHF VAE NGA+IDE
Subjt:  VHFKVAELNGAIIDE

A0A6J1DMS3 uncharacterized protein LOC111022000    6.3e-45    85.22
Query:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM
        FVLQEDCPQAPAPNAT+AVRN Y RWI    KAKVYIL+SISDVLAKKHEDTVTAKEI+DSLQSM GQPSSQARHEALKFVYNSRMK+GSS+REHVLNLM
Subjt:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM

Query:  VHFKVAELNGAIIDE
        VHF VAE N A+I E
Subjt:  VHFKVAELNGAIIDE

A0A6J1DW68 uncharacterized protein LOC111024637    2.1e-40    79.13
Query:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM
        FVLQEDCPQAPAPNATVAVRN+Y RWI    KAKV ILASISDVLAKKHE++V  KEI+DSLQSM GQPSSQARHEAL  +YNSRMK+ SS+REHVLNLM
Subjt:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM

Query:  VHFKVAELNGAIIDE
        VHF VAE N  +IDE
Subjt:  VHFKVAELNGAIIDE

A0A6J1DWL0 uncharacterized protein LOC111024734    1.1e-44    85.22
Query:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM
        FVLQEDCPQAP  NATVAVRN Y RWI    KAKVYILASISDVLAKKHEDTVT KEI+DSLQSM GQPS QARHEALKFVYNSRMKEGSS+REHVLNLM
Subjt:  FVLQEDCPQAPAPNATVAVRNVYGRWI----KAKVYILASISDVLAKKHEDTVTAKEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLM

Query:  VHFKVAELNGAIIDE
        VHF VAE NG +IDE
Subjt:  VHFKVAELNGAIIDE

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGAGGAGTTTTTCTCACAGTCAGGCGAACCAAGCTAAATTCTTCCATAAGGCTAATTCTACCCTTGACATGGAGAAAGTTGAATTGGAGACGGAGATTGAAGCT
CTTTGTGCTGACGAGGCTCTTCGTGACCGTGAAGAGTCACCTACACAGCAACCGATCTATGATCCTCTCATGGGTAGTACATACGTAGGTGCTCGGCATTCCACG
GGTGGCGAAGGAGCTTTCCTTTCAAATCTTCAAGCGTGTCGCCGGAGAAGCACGTCGTTAGAGAAGAAGGAGGTAGGGGGTAAGTGGCAGCTAGTGCAAGGATGG
GTTAAGGATGAAGACAACAGGGGTAGCGCGCATGGGCGCTGGGGCGGACAATGGCGCGCAGCCGCTAGGGCAGGCATGGACGTGTGGGTATGTGCGGTAGACTGG
GGCGCTAGGCAGGCGTGGGCACGCGGGCATGAGCAGCAGGCGGGGTGCCAAAAACCGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACT
GTGGCAGTGCGCAATGTCTATGGCAGGTGGATCAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCT
AAGGAGATCATCGACTCGCTGCAGAGCATGTTAGGACAACCGTCCTCACAGGCTCGGCATGAAGCCCTTAAGTTCGTTTACAACTCCCGCATGAAGGAGGGTTCC
TCATTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAAAGTGGCTGAGTTGAACGGGGCCATCATAGACGAGTAG
mRNA sequence
ATGAGGAGTTTTTCTCACAGTCAGGCGAACCAAGCTAAATTCTTCCATAAGGCTAATTCTACCCTTGACATGGAGAAAGTTGAATTGGAGACGGAGATTGAAGCT
CTTTGTGCTGACGAGGCTCTTCGTGACCGTGAAGAGTCACCTACACAGCAACCGATCTATGATCCTCTCATGGGTAGTACATACGTAGGTGCTCGGCATTCCACG
GGTGGCGAAGGAGCTTTCCTTTCAAATCTTCAAGCGTGTCGCCGGAGAAGCACGTCGTTAGAGAAGAAGGAGGTAGGGGGTAAGTGGCAGCTAGTGCAAGGATGG
GTTAAGGATGAAGACAACAGGGGTAGCGCGCATGGGCGCTGGGGCGGACAATGGCGCGCAGCCGCTAGGGCAGGCATGGACGTGTGGGTATGTGCGGTAGACTGG
GGCGCTAGGCAGGCGTGGGCACGCGGGCATGAGCAGCAGGCGGGGTGCCAAAAACCGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACT
GTGGCAGTGCGCAATGTCTATGGCAGGTGGATCAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCT
AAGGAGATCATCGACTCGCTGCAGAGCATGTTAGGACAACCGTCCTCACAGGCTCGGCATGAAGCCCTTAAGTTCGTTTACAACTCCCGCATGAAGGAGGGTTCC
TCATTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAAAGTGGCTGAGTTGAACGGGGCCATCATAGACGAGTAG
Protein sequence
MRSFSHSQANQAKFFHKANSTLDMEKVELETEIEALCADEALRDREESPTQQPIYDPLMGSTYVGARHSTGGEGAFLSNLQACRRRSTSLEKKEVGGKWQLVQGW
VKDEDNRGSAHGRWGGQWRAAARAGMDVWVCAVDWGARQAWARGHEQQAGCQKPFVLQEDCPQAPAPNATVAVRNVYGRWIKAKVYILASISDVLAKKHEDTVTA
KEIIDSLQSMLGQPSSQARHEALKFVYNSRMKEGSSLREHVLNLMVHFKVAELNGAIIDE
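
The relationship between the CDS and the protein sequence can be illustrated by translating codons with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code applies; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration. To keep the example short, it translates only the first and last lines of the CDS as displayed above, each of which holds a whole number of codons.

```python
from itertools import product

# Standard genetic code, enumerated with bases in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the CDS, copied from the record.
first_line = (
    "ATGAGGAGTTTTTCTCACAGTCAGGCGAACCAAGCTAAATTCTTCCATAAGGCTAATTCT"
    "ACCCTTGACATGGAGAAAGTTGAATTGGAGACGGAGATTGAAGCT"
)
last_line = (
    "TCATTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAAAGTGGCTGAGTTGAACGGG"
    "GCCATCATAGACGAGTAG"
)

print(translate(first_line))  # start of the protein
print(translate(last_line))   # end of the protein, followed by the stop
```

The two translations correspond to the start (`MRSFSHSQ...`) and the end (`...AELNGAIIDE`) of the protein sequence listed above; the trailing `*` is the TAG stop codon.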