; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g13580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g13580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr7:10206202..10209026
RNA-Seq ExpressionMoc07g13580
SyntenyMoc07g13580
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151295.1 uncharacterized protein LOC111019259 [Momordica charantia]3.0e-5578.15Show/hide
Query:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MS SIIALLA +KLN ENYKQ KSNLN ILVI+DLRFVLQE+ P APA +ATVAV   YD+WIKANDKA+VYIL SIS+VLAKKHE+ VTAKEIMDSLQS
Subjt:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE
        MFGQ SSQA+HE LKFVYNS M EG SVRE++LNLM+HFN+AE+N AIIDE
Subjt:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE

XP_022152352.1 uncharacterized protein LOC111020095 [Momordica charantia]1.7e-7192.72Show/hide
Query:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MSTSIIALLAAQ+LNGENYKQWKSNLNTILVIDDL+FVLQEDCPQA APNATVAVR AYD+WIKANDKAKVYILASISDVLAKKHEDT+TAKEIMDSLQS
Subjt:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE
        MFGQPSSQARHEALKF+YNSRM EGSSVRE++LNLMVHFNVAESNGA+IDE
Subjt:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia]3.3e-5470.2Show/hide
Query:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M++SI+ LLA++KLNG NY  WK+NLNTILV+DDLRFVL E+CPQ PA NA   VR A+D+W+KANDKA+VYILAS++DVLAKKHE  +TAKEIMDSL++
Subjt:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE
        MFG+PSS  RHEALK+VYN  M EG+SVRE++L++MVHFN AE NGA IDE
Subjt:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE

XP_022158062.1 uncharacterized protein LOC111024637 [Momordica charantia]1.0e-6384.77Show/hide
Query:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MSTSII LL AQKLN ENYKQWKSN+NTIL+IDDLRFVLQEDCPQAPAPNATVAVRN YD+WIKANDKAKV ILASISDVLAKKHE++V  KEIMDSLQS
Subjt:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE
        MFGQPSSQARHEAL  +YNSRM + SSVRE++LNLMVHFNVAESN  +IDE
Subjt:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]5.6e-7091.39Show/hide
Query:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MS SIIALLAAQKLNGENY+QWKSNLNTILVIDDLRFVLQEDCPQAP  NATVAVRNAYD+WIK+NDKAKVYILASISDVLAKKHEDTVT KEIMDSLQS
Subjt:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE
        MFGQPS QARHEALKFVYNSRM EGSSVRE++LNLMVHFNVAESNG +IDE
Subjt:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE

TrEMBL top hitse value%identityAlignment
A0A6J1DAT1 uncharacterized protein LOC1110192591.4e-5578.15Show/hide
Query:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MS SIIALLA +KLN ENYKQ KSNLN ILVI+DLRFVLQE+ P APA +ATVAV   YD+WIKANDKA+VYIL SIS+VLAKKHE+ VTAKEIMDSLQS
Subjt:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE
        MFGQ SSQA+HE LKFVYNS M EG SVRE++LNLM+HFN+AE+N AIIDE
Subjt:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE

A0A6J1DFZ2 uncharacterized protein LOC1110200958.5e-7292.72Show/hide
Query:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MSTSIIALLAAQ+LNGENYKQWKSNLNTILVIDDL+FVLQEDCPQA APNATVAVR AYD+WIKANDKAKVYILASISDVLAKKHEDT+TAKEIMDSLQS
Subjt:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE
        MFGQPSSQARHEALKF+YNSRM EGSSVRE++LNLMVHFNVAESNGA+IDE
Subjt:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE

A0A6J1DW68 uncharacterized protein LOC1110246375.0e-6484.77Show/hide
Query:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MSTSII LL AQKLN ENYKQWKSN+NTIL+IDDLRFVLQEDCPQAPAPNATVAVRN YD+WIKANDKAKV ILASISDVLAKKHE++V  KEIMDSLQS
Subjt:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE
        MFGQPSSQARHEAL  +YNSRM + SSVRE++LNLMVHFNVAESN  +IDE
Subjt:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE

A0A6J1DWL0 uncharacterized protein LOC1110247342.7e-7091.39Show/hide
Query:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        MS SIIALLAAQKLNGENY+QWKSNLNTILVIDDLRFVLQEDCPQAP  NATVAVRNAYD+WIK+NDKAKVYILASISDVLAKKHEDTVT KEIMDSLQS
Subjt:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE
        MFGQPS QARHEALKFVYNSRM EGSSVRE++LNLMVHFNVAESNG +IDE
Subjt:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE

A0A6J1DXQ5 uncharacterized protein LOC1110244571.6e-5470.2Show/hide
Query:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M++SI+ LLA++KLNG NY  WK+NLNTILV+DDLRFVL E+CPQ PA NA   VR A+D+W+KANDKA+VYILAS++DVLAKKHE  +TAKEIMDSL++
Subjt:  MSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE
        MFG+PSS  RHEALK+VYN  M EG+SVRE++L++MVHFN AE NGA IDE
Subjt:  MFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTCACTCGTTTTCTGCAGAAGTTCTACAATGGTTACAGAACAAGATGAGAACAGAACTCCAATTCTTCTCTCTCAATTTGATCTCTCAAACTCTCCCTCAC
ATTCCAGAGAATTGCTCCCACAAGCACGATCTCGAGACCCAAGAGGATAGCAAGGAAGATCGTTTGGTGGTGTTCGTTGAGAAATCATTGAAGAAACGTTCTTCA
AAGAAAACAGCGAAGAAGACGAAGCAGACTGCGCAGACGGCGCTATGGCGCTACGCAGCAGTGCCATGGCGCCGTGCCTGTGCGCCGCGGCGCTGCTGCTGCGGC
ATTTTGCTGCAGCAGCGCGGTGGCACTGCCTTTAGGCGCCGAGGCACTGTCCCGGGTGTTTTTCGGCGCGTTTCCGTGGCTCTGGTTGTTTTTACTTTCAACATG
TCTACTTCCATTATTGCACTCCTAGCCGCTCAAAAACTTAACGGCGAGAATTACAAACAATGGAAATCGAATCTAAATACTATTCTCGTGATAGATGATCTTAGG
TTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGCCTATGATAAGTGGATCAAGGCCAATGACAAGGCCAAGGTC
TACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCATGAGGACACGGTCACCGCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCC
TCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTATAACTCCCGCATGAATGAGGGTTCCTCAGTGCGAGAAAACATTCTCAACCTGATGGTCCACTTCAATGTG
GCTGAGTCGAACGGGGCCATCATAGACGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTCACTCGTTTTCTGCAGAAGTTCTACAATGGTTACAGAACAAGATGAGAACAGAACTCCAATTCTTCTCTCTCAATTTGATCTCTCAAACTCTCCCTCAC
ATTCCAGAGAATTGCTCCCACAAGCACGATCTCGAGACCCAAGAGGATAGCAAGGAAGATCGTTTGGTGGTGTTCGTTGAGAAATCATTGAAGAAACGTTCTTCA
AAGAAAACAGCGAAGAAGACGAAGCAGACTGCGCAGACGGCGCTATGGCGCTACGCAGCAGTGCCATGGCGCCGTGCCTGTGCGCCGCGGCGCTGCTGCTGCGGC
ATTTTGCTGCAGCAGCGCGGTGGCACTGCCTTTAGGCGCCGAGGCACTGTCCCGGGTGTTTTTCGGCGCGTTTCCGTGGCTCTGGTTGTTTTTACTTTCAACATG
TCTACTTCCATTATTGCACTCCTAGCCGCTCAAAAACTTAACGGCGAGAATTACAAACAATGGAAATCGAATCTAAATACTATTCTCGTGATAGATGATCTTAGG
TTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGCCTATGATAAGTGGATCAAGGCCAATGACAAGGCCAAGGTC
TACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCATGAGGACACGGTCACCGCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCC
TCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTATAACTCCCGCATGAATGAGGGTTCCTCAGTGCGAGAAAACATTCTCAACCTGATGGTCCACTTCAATGTG
GCTGAGTCGAACGGGGCCATCATAGACGAGTAG
Protein sequenceShow/hide protein sequence
MRHSFSAEVLQWLQNKMRTELQFFSLNLISQTLPHIPENCSHKHDLETQEDSKEDRLVVFVEKSLKKRSSKKTAKKTKQTAQTALWRYAAVPWRRACAPRRCCCG
ILLQQRGGTAFRRRGTVPGVFRRVSVALVVFTFNMSTSIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDKWIKANDKAKV
YILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFVYNSRMNEGSSVRENILNLMVHFNVAESNGAIIDE