; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g19150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g19150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr3:12794820..12797223
RNA-Seq ExpressionMoc03g19150
SyntenyMoc03g19150
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151295.1 uncharacterized protein LOC111019259 [Momordica charantia]1.3e-3376.64Show/hide
Query:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS
        MS SIIALL  +KLN ENYKQ KSNLN ILVI+DLRFVLQE+ P AP  +ATVAV   YDRWIKANDKA+VYIL SIS+VLAKK E+ V AKEIMDSLQS
Subjt:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS

Query:  MFGQPSS
        MFGQ SS
Subjt:  MFGQPSS

XP_022152352.1 uncharacterized protein LOC111020095 [Momordica charantia]2.4e-4388.79Show/hide
Query:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS
        MSTSIIALL AQ+LNGENYKQWKSNLNTILVIDDL+FVLQEDCPQA   NATVAV   YDRWIKANDKAKVYILASISDVLAKK EDT+ AKEIMDSLQS
Subjt:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS

Query:  MFGQPSS
        MFGQPSS
Subjt:  MFGQPSS

XP_022158062.1 uncharacterized protein LOC111024637 [Momordica charantia]9.1e-4387.85Show/hide
Query:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS
        MSTSII LL AQKLN ENYKQWKSN+NTIL+IDDLRFVLQEDCPQAP  NATVAV NIYDRWIKANDKAKV ILASISDVLAKK E++VI KEIMDSLQS
Subjt:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS

Query:  MFGQPSS
        MFGQPSS
Subjt:  MFGQPSS

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]5.7e-4591.51Show/hide
Query:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS
        MS SIIALL AQKLNGENY+QWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAV N YDRWIK+NDKAKVYILASISDVLAKK EDTV  KEIMDSLQS
Subjt:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS

Query:  MFGQPS
        MFGQPS
Subjt:  MFGQPS

XP_038882358.1 uncharacterized protein LOC120073622 [Benincasa hispida]2.7e-3469.81Show/hide
Query:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS
        M++SII LLT++KLNG+NY  WKSNLNTILV+DDLRFVL E+CPQAP SNA   V   YDRW+KAN+KA++YILAS+SDVLAKK E    AKEI+DSL+ 
Subjt:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS

Query:  MFGQPS
        +FGQPS
Subjt:  MFGQPS

TrEMBL top hitse value%identityAlignment
A0A6J1DAT1 uncharacterized protein LOC1110192596.4e-3476.64Show/hide
Query:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS
        MS SIIALL  +KLN ENYKQ KSNLN ILVI+DLRFVLQE+ P AP  +ATVAV   YDRWIKANDKA+VYIL SIS+VLAKK E+ V AKEIMDSLQS
Subjt:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS

Query:  MFGQPSS
        MFGQ SS
Subjt:  MFGQPSS

A0A6J1DFZ2 uncharacterized protein LOC1110200951.2e-4388.79Show/hide
Query:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS
        MSTSIIALL AQ+LNGENYKQWKSNLNTILVIDDL+FVLQEDCPQA   NATVAV   YDRWIKANDKAKVYILASISDVLAKK EDT+ AKEIMDSLQS
Subjt:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS

Query:  MFGQPSS
        MFGQPSS
Subjt:  MFGQPSS

A0A6J1DW68 uncharacterized protein LOC1110246374.4e-4387.85Show/hide
Query:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS
        MSTSII LL AQKLN ENYKQWKSN+NTIL+IDDLRFVLQEDCPQAP  NATVAV NIYDRWIKANDKAKV ILASISDVLAKK E++VI KEIMDSLQS
Subjt:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS

Query:  MFGQPSS
        MFGQPSS
Subjt:  MFGQPSS

A0A6J1DWL0 uncharacterized protein LOC1110247342.8e-4591.51Show/hide
Query:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS
        MS SIIALL AQKLNGENY+QWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAV N YDRWIK+NDKAKVYILASISDVLAKK EDTV  KEIMDSLQS
Subjt:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS

Query:  MFGQPS
        MFGQPS
Subjt:  MFGQPS

E2GK51 Gag/pol protein (Fragment)1.4e-3368.87Show/hide
Query:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS
        M+TSI+ LL ++KLNG+NY  WKSNLNTILV+DDLRFVL E+CPQAP  NA   V   YDRW+KANDKA+VYILAS++DVLAKK +    AK IMDSL+ 
Subjt:  MSTSIIALLTAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQS

Query:  MFGQPS
        MFGQPS
Subjt:  MFGQPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGGCTGGCGGTGGCGCGCGCGCAGTGAGGCTAGCACTAGGGGTGGGCGCGCGCATGGGCGGGCCCTCGTGCTGGAACTCATGGCTAGTAGGTGTGCGTTGTGGGCG
TAGGCAGCAGGCATGCGAGGCGAAGCACGCAGGCGGTAGAGGGGGGCGTGCGGTGTGGGCGGCAGACAGCACCGTGCATGAGAGGTTTTCGGGCATAGTGCCGCTGCATT
ATAGCGCTGCAACGCTACGGCTGTCGCTTTTTGCTGCAGCAGCGCCGTGCATGTCTACTTCTATTATTGCACTCCTAACCGCGCAAAAACTTAACGGTGAGAATTACAAA
CAATGGAAATCGAATCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAAGATTGTCCTCAAGCTCCTGTGTCTAACGCCACTGTGGCGGTGTGCAA
CATCTATGACAGATGGATCAAGGCCAATGATAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGAAGCAAGAGGACACGGTCATCGCTAAAGAGA
TCATGGACTCGCTACAGAGCATGTTTGGACAACCGTCCTCAGGCTCGACACGAAGCCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGGGCTGGCGGTGGCGCGCGCGCAGTGAGGCTAGCACTAGGGGTGGGCGCGCGCATGGGCGGGCCCTCGTGCTGGAACTCATGGCTAGTAGGTGTGCGTTGTGGGCG
TAGGCAGCAGGCATGCGAGGCGAAGCACGCAGGCGGTAGAGGGGGGCGTGCGGTGTGGGCGGCAGACAGCACCGTGCATGAGAGGTTTTCGGGCATAGTGCCGCTGCATT
ATAGCGCTGCAACGCTACGGCTGTCGCTTTTTGCTGCAGCAGCGCCGTGCATGTCTACTTCTATTATTGCACTCCTAACCGCGCAAAAACTTAACGGTGAGAATTACAAA
CAATGGAAATCGAATCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAAGATTGTCCTCAAGCTCCTGTGTCTAACGCCACTGTGGCGGTGTGCAA
CATCTATGACAGATGGATCAAGGCCAATGATAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGAAGCAAGAGGACACGGTCATCGCTAAAGAGA
TCATGGACTCGCTACAGAGCATGTTTGGACAACCGTCCTCAGGCTCGACACGAAGCCCTTAA
Protein sequenceShow/hide protein sequence
MRAGGGARAVRLALGVGARMGGPSCWNSWLVGVRCGRRQQACEAKHAGGRGGRAVWAADSTVHERFSGIVPLHYSAATLRLSLFAAAAPCMSTSIIALLTAQKLNGENYK
QWKSNLNTILVIDDLRFVLQEDCPQAPVSNATVAVCNIYDRWIKANDKAKVYILASISDVLAKKQEDTVIAKEIMDSLQSMFGQPSSGSTRSP