CuGenDBv2

Moc04g20470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g20470
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr4:14839866..14842404
RNA-Seq Expression: Moc04g20470
Synteny: Moc04g20470
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] | 2.9e-32 | 70.64
Query:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY
        LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y
Subjt:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY

Query:  TLTTHLNEL
        TLTT LNEL
Subjt:  TLTTHLNEL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] | 2.9e-32 | 70.64
Query:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY
        LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y
Subjt:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY

Query:  TLTTHLNEL
        TLTT LNEL
Subjt:  TLTTHLNEL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | 2.9e-32 | 70.64
Query:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY
        LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y
Subjt:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY

Query:  TLTTHLNEL
        TLTT LNEL
Subjt:  TLTTHLNEL

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa] | 2.9e-32 | 70.64
Query:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY
        LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y
Subjt:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY

Query:  TLTTHLNEL
        TLTT LNEL
Subjt:  TLTTHLNEL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | 2.9e-32 | 70.64
Query:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY
        LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y
Subjt:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY

Query:  TLTTHLNEL
        TLTT LNEL
Subjt:  TLTTHLNEL

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SMH8 Gag/pol protein | 1.4e-32 | 70.64
Query:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY
        LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y
Subjt:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY

Query:  TLTTHLNEL
        TLTT LNEL
Subjt:  TLTTHLNEL

A0A5A7TU93 Gag/pol protein | 1.4e-32 | 70.64
Query:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY
        LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y
Subjt:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY

Query:  TLTTHLNEL
        TLTT LNEL
Subjt:  TLTTHLNEL

A0A5A7TWB9 Gag/pol protein | 1.4e-32 | 70.64
Query:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY
        LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y
Subjt:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY

Query:  TLTTHLNEL
        TLTT LNEL
Subjt:  TLTTHLNEL

A0A5A7V4M1 Gag/pol protein | 1.4e-32 | 70.64
Query:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY
        LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y
Subjt:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY

Query:  TLTTHLNEL
        TLTT LNEL
Subjt:  TLTTHLNEL

A0A5D3CPJ6 Gag/pol protein | 1.4e-32 | 70.64
Query:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY
        LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y
Subjt:  LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY

Query:  TLTTHLNEL
        TLTT LNEL
Subjt:  TLTTHLNEL
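
The % identity column can be checked against the alignment itself. In BLAST-style output the middle line repeats the residue where query and subject are identical, shows + for a conservative substitution, and is blank otherwise. The Python sketch below counts the identical positions over all aligned columns (gap columns included), using the two alignment blocks listed for the hits above. The function name percent_identity is illustrative and not part of CuGenDB, and the exact formula the site uses is an assumption.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity of a BLAST-style pairwise alignment.

    query_rows: the Query lines of each alignment block (gaps written as '-').
    midline_rows: the corresponding middle lines; a letter marks an identical pair.
    """
    # Alignment length = number of columns, including gap columns.
    aligned_columns = sum(len(row) for row in query_rows)
    # Only letters on the middle line are identities ('+' and ' ' are not).
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / aligned_columns


# Rows copied from the alignment blocks above (labels and padding removed).
query_rows = [
    "LAREH---VTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIMESLPKSFMQFRSNAVMNKLEY",
    "TLTTHLNEL",
]
midline_rows = [
    "LA++H   +TA+EI+ S Q +FGQ S Q +H+ALK+IYN+RM EG+SVRE VLN+MV+FNVAEMNG VIDE SQVSFI+ESLP+SF+QFRSNAVMNK+ Y",
    "TLTT LNEL",
]

identity = percent_identity(query_rows, midline_rows)
print(f"{identity:.2f}")  # 77 identical positions over 109 columns
```

Under these assumptions the result agrees with the 70.64 reported in the tables.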

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGTCGGGGGTCGATGCGAAGAGTCCTTTGGAAGGAAAGCTATTGGGGCCTTGGGTAAAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTTGGCACGAGA
ACATGTCACTGCCAAGGAGATCATATACTCGTTCCAGAGCATTTTTGGGCAACCATCCTCACAAGCTCGACATGAAGCCCTTAAGTTCATTTATAACTCCCGCATGAAAG
AGGGCTCGTCAGTGCGAGAACAAGTTCTCAACCTGATGGTCTACTTCAATGTGGCGGAGATGAATGGGGATGTCATAGACGAGCTAAGTCAGGTCAGCTTTATTATGGAA
TCTCTTCCGAAGAGTTTCATGCAATTCCGCAGCAATGCAGTTATGAATAAGCTAGAGTACACTCTTACCACACATTTAAACGAGTTGTAG
mRNA sequence
ATGGTCGGGGGTCGATGCGAAGAGTCCTTTGGAAGGAAAGCTATTGGGGCCTTGGGTAAAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTTGGCACGAGA
ACATGTCACTGCCAAGGAGATCATATACTCGTTCCAGAGCATTTTTGGGCAACCATCCTCACAAGCTCGACATGAAGCCCTTAAGTTCATTTATAACTCCCGCATGAAAG
AGGGCTCGTCAGTGCGAGAACAAGTTCTCAACCTGATGGTCTACTTCAATGTGGCGGAGATGAATGGGGATGTCATAGACGAGCTAAGTCAGGTCAGCTTTATTATGGAA
TCTCTTCCGAAGAGTTTCATGCAATTCCGCAGCAATGCAGTTATGAATAAGCTAGAGTACACTCTTACCACACATTTAAACGAGTTGTAG
Protein sequence
MVGGRCEESFGRKAIGALGKNGQGPIDGEVIGALAREHVTAKEIIYSFQSIFGQPSSQARHEALKFIYNSRMKEGSSVREQVLNLMVYFNVAEMNGDVIDELSQVSFIME
SLPKSFMQFRSNAVMNKLEYTLTTHLNEL
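
As a consistency check, the protein sequence should be the translation of the CDS. The Python sketch below assumes the standard genetic code (NCBI translation table 1) and uses a hypothetical helper named translate. It is applied to the first eight codons of the CDS and to the last line of the CDS, which starts on a codon boundary.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt of the CDS shown above.
cds_start = "ATGGTCGGGGGTCGATGCGAAGAG"
# Last line of the CDS shown above (90 nt, ending in the stop codon TAG).
cds_end = (
    "TCTCTTCCGAAGAGTTTCATGCAATTCCGCAGCAATGCAGTTATGAATAAGCTAGAGTAC"
    "ACTCTTACCACACATTTAAACGAGTTGTAG"
)

print(translate(cds_start))  # MVGGRCEE
print(translate(cds_end))    # SLPKSFMQFRSNAVMNKLEYTLTTHLNEL
```

Both outputs match the corresponding start and end of the protein sequence listed above.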