; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g17410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g17410
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag protease polyprotein
Genome locationchr7:12474016..12474710
RNA-Seq ExpressionMoc07g17410
SyntenyMoc07g17410
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0004518 - nuclease activity (molecular function)
GO:0005488 - binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051724.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-1654.84Show/hide
Query:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKD-DALLWTQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE
        +P +FDG   DPT  + WLSS+ETIFRYM   E+QKVQCAVFML D DA    + EFLNL+QG+ +VK+++ EF  LS FA E++ TEA + +
Subjt:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKD-DALLWTQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE

XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]1.1e-2557.03Show/hide
Query:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLW------------------------------------TQAEFLNLKQGNR
        DPRSFDGL VDP L +AWLS METIFRYM  LEEQKVQC VFMLKDDA LW                                     Q EFLNLKQ NR
Subjt:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLW------------------------------------TQAEFLNLKQGNR

Query:  SVKEFEREFTKLSHFALELVDTEAKKTE
        SV+E++REFTKLS FA ELVDTEA K E
Subjt:  SVKEFEREFTKLSHFALELVDTEAKKTE

XP_038875108.1 uncharacterized protein LOC120067638 [Benincasa hispida]1.2e-1645.9Show/hide
Query:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLW-----------------------------------TQAEFLNLKQGNRS
        DPRSFDG   DPT  + WLSS+ETIFR+M   EE K+QCAVFML  +A +W                                    QAEFLNL+QG  S
Subjt:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLW-----------------------------------TQAEFLNLKQGNRS

Query:  VKEFEREFTKLSHFALELVDTE
        V+++E+EF KLSHF+ ELV TE
Subjt:  VKEFEREFTKLSHFALELVDTE

XP_038883046.1 uncharacterized protein LOC120074107 [Benincasa hispida]1.2e-1639.88Show/hide
Query:  QTPRQMENPPVGQTSE--QAEATIASLMMETLH----TLCDPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLW---------
        QT  Q E  P+ Q  +       +  L +E  H       DPRSFDG   DPT  K WLSS+ETIFR+M   EE K+QC VFML  +  +W         
Subjt:  QTPRQMENPPVGQTSE--QAEATIASLMMETLH----TLCDPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLW---------

Query:  ---------------------------TQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE
                                    QAEFLNLKQG  S++++E+EF KLSHF  ELV TEA +TE
Subjt:  ---------------------------TQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE

XP_038887018.1 uncharacterized protein LOC120077183 [Benincasa hispida]9.0e-1745.31Show/hide
Query:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLW------------------------------------TQAEFLNLKQGNR
        D RSFDG   DPT  K WLSS+ETIF +M  LEE K+QCAVFML  +A +W                                     Q EFLNLKQ   
Subjt:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLW------------------------------------TQAEFLNLKQGNR

Query:  SVKEFEREFTKLSHFALELVDTEAKKTE
        SV+E+E+EF KLSHF+LELV  EA +T+
Subjt:  SVKEFEREFTKLSHFALELVDTEAKKTE

TrEMBL top hitse value%identityAlignment
A0A5A7TGL9 Reverse transcriptase7.4e-1747.57Show/hide
Query:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLWTQA-----------EFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAK
        +P +FDG   DPT  + WLSS+ETIFRYM   E+QKVQCAVFML D    W +            EFLNL+QG+ +V++++ +F  LS FALE++ TEA 
Subjt:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLWTQA-----------EFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAK

Query:  KTE
        + +
Subjt:  KTE

A0A5A7U8R8 Reverse transcriptase5.7e-1754.84Show/hide
Query:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKD-DALLWTQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE
        +P +FDG   DPT  + WLSS+ETIFRYM   E+QKVQCAVFML D DA    + EFLNL+QG+ +VK+++ EF  LS FA E++ TEA + +
Subjt:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKD-DALLWTQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE

A0A5A7VFT1 Ty3-gypsy retrotransposon protein7.4e-1753.76Show/hide
Query:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKD-DALLWTQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE
        +P +FDG   DPT  + WLSS+ETIFRYM   E+QKVQC VFML D DA    Q EFLNL+QG+ +V++++ EF  LS FA E++ TEA + +
Subjt:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKD-DALLWTQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE

A0A5D3E3C0 Ty3-gypsy retrotransposon protein7.4e-1753.76Show/hide
Query:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKD-DALLWTQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE
        +P +FDG   DPT  + WLSS+ETIFRYM   E+QKVQC VFML D DA    Q EFLNL+QG+ +V++++ EF  LS FA E++ TEA + +
Subjt:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKD-DALLWTQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE

A0A6J1DSJ6 uncharacterized protein LOC1110235125.1e-2657.03Show/hide
Query:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLW------------------------------------TQAEFLNLKQGNR
        DPRSFDGL VDP L +AWLS METIFRYM  LEEQKVQC VFMLKDDA LW                                     Q EFLNLKQ NR
Subjt:  DPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEEQKVQCAVFMLKDDALLW------------------------------------TQAEFLNLKQGNR

Query:  SVKEFEREFTKLSHFALELVDTEAKKTE
        SV+E++REFTKLS FA ELVDTEA K E
Subjt:  SVKEFEREFTKLSHFALELVDTEAKKTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGACCCTGAGACAATGGAGCCGACGGTGAATAACCAAACTTCAGGTAAGACGGAGAATCCATCAATGGTTCAACCTACTGGTCAGATGGGGAATCTACCA
CTGGGTCAAACTCCTAGACAAATGGAGAATCCACCAGTGGGTCAAACTTCTGAACAAGCAGAGGCTACTATAGCGTCTTTGATGATGGAGACTCTACATACACTT
TGCGATCCTCGCTCTTTTGATGGACTATTTGTTGATCCAACGTTAACAAAGGCTTGGTTGTCGTCGATGGAAACCATTTTTCGTTATATGAGCTATCTGGAGGAA
CAAAAAGTGCAGTGTGCTGTCTTTATGCTAAAAGATGATGCCCTTTTGTGGACACAAGCAGAGTTTCTAAACCTAAAGCAAGGTAACAGATCAGTGAAGGAATTT
GAGAGGGAATTCACAAAATTGTCTCATTTTGCCCTTGAACTAGTAGACACGGAGGCGAAGAAGACCGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTGACCCTGAGACAATGGAGCCGACGGTGAATAACCAAACTTCAGGTAAGACGGAGAATCCATCAATGGTTCAACCTACTGGTCAGATGGGGAATCTACCA
CTGGGTCAAACTCCTAGACAAATGGAGAATCCACCAGTGGGTCAAACTTCTGAACAAGCAGAGGCTACTATAGCGTCTTTGATGATGGAGACTCTACATACACTT
TGCGATCCTCGCTCTTTTGATGGACTATTTGTTGATCCAACGTTAACAAAGGCTTGGTTGTCGTCGATGGAAACCATTTTTCGTTATATGAGCTATCTGGAGGAA
CAAAAAGTGCAGTGTGCTGTCTTTATGCTAAAAGATGATGCCCTTTTGTGGACACAAGCAGAGTTTCTAAACCTAAAGCAAGGTAACAGATCAGTGAAGGAATTT
GAGAGGGAATTCACAAAATTGTCTCATTTTGCCCTTGAACTAGTAGACACGGAGGCGAAGAAGACCGAATGA
Protein sequenceShow/hide protein sequence
MFDPETMEPTVNNQTSGKTENPSMVQPTGQMGNLPLGQTPRQMENPPVGQTSEQAEATIASLMMETLHTLCDPRSFDGLFVDPTLTKAWLSSMETIFRYMSYLEE
QKVQCAVFMLKDDALLWTQAEFLNLKQGNRSVKEFEREFTKLSHFALELVDTEAKKTE