; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g10620 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g10620
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr1:6549646..6552037
RNA-Seq ExpressionMoc01g10620
SyntenyMoc01g10620
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-7667.89Show/hide
Query:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
        M+++ + +LAA KL+G NY  WK+ +NT+L+IDDLRFVL E   Q PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG
        MFGQ S Q +H+ALK++YN RM EG+SVREHVLN+MVHFNV+E NGAVIDE SQVSFILESLP+SFL F SN VMNK+ YTLTTLLNELQT++S MK KG
Subjt:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG

Query:  QEGEANLATS-KRFNRGS
        Q+GEAN+ATS ++F+RGS
Subjt:  QEGEANLATS-KRFNRGS

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-7667.89Show/hide
Query:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
        M+++ + +LAA KL+G NY  WK+ +NT+L+IDDLRFVL E   Q PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG
        MFGQ S Q +H+ALK++YN RM EG+SVREHVLN+MVHFNV+E NGAVIDE SQVSFILESLP+SFL F SN VMNK+ YTLTTLLNELQT++S MK KG
Subjt:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG

Query:  QEGEANLATS-KRFNRGS
        Q+GEAN+ATS ++F+RGS
Subjt:  QEGEANLATS-KRFNRGS

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-7667.89Show/hide
Query:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
        M+++ + +LAA KL+G NY  WK+ +NT+L+IDDLRFVL E   Q PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG
        MFGQ S Q +H+ALK++YN RM EG+SVREHVLN+MVHFNV+E NGAVIDE SQVSFILESLP+SFL F SN VMNK+ YTLTTLLNELQT++S MK KG
Subjt:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG

Query:  QEGEANLATS-KRFNRGS
        Q+GEAN+ATS ++F+RGS
Subjt:  QEGEANLATS-KRFNRGS

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-7667.89Show/hide
Query:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
        M+++ + +LAA KL+G NY  WK+ +NT+L+IDDLRFVL E   Q PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG
        MFGQ S Q +H+ALK++YN RM EG+SVREHVLN+MVHFNV+E NGAVIDE SQVSFILESLP+SFL F SN VMNK+ YTLTTLLNELQT++S MK KG
Subjt:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG

Query:  QEGEANLATS-KRFNRGS
        Q+GEAN+ATS ++F+RGS
Subjt:  QEGEANLATS-KRFNRGS

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]7.3e-7990.7Show/hide
Query:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
        MSASIIALLAAQKL+G+NY+QWKSNLNTIL+IDDLRFVLQE   QAP  NATVAVRNAYDRWIK+NDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
Subjt:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSN
        MFGQPS QARHEALKFVYN RMKEGSSVREHVLNLMVHFNV+ESNG VIDEQSQ SFILESLPK+FLPFHSN
Subjt:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSN

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein2.2e-7667.89Show/hide
Query:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
        M+++ + +LAA KL+G NY  WK+ +NT+L+IDDLRFVL E   Q PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG
        MFGQ S Q +H+ALK++YN RM EG+SVREHVLN+MVHFNV+E NGAVIDE SQVSFILESLP+SFL F SN VMNK+ YTLTTLLNELQT++S MK KG
Subjt:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG

Query:  QEGEANLATS-KRFNRGS
        Q+GEAN+ATS ++F+RGS
Subjt:  QEGEANLATS-KRFNRGS

A0A5A7TU93 Gag/pol protein2.2e-7667.89Show/hide
Query:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
        M+++ + +LAA KL+G NY  WK+ +NT+L+IDDLRFVL E   Q PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG
        MFGQ S Q +H+ALK++YN RM EG+SVREHVLN+MVHFNV+E NGAVIDE SQVSFILESLP+SFL F SN VMNK+ YTLTTLLNELQT++S MK KG
Subjt:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG

Query:  QEGEANLATS-KRFNRGS
        Q+GEAN+ATS ++F+RGS
Subjt:  QEGEANLATS-KRFNRGS

A0A5A7TWB9 Gag/pol protein2.2e-7667.89Show/hide
Query:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
        M+++ + +LAA KL+G NY  WK+ +NT+L+IDDLRFVL E   Q PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG
        MFGQ S Q +H+ALK++YN RM EG+SVREHVLN+MVHFNV+E NGAVIDE SQVSFILESLP+SFL F SN VMNK+ YTLTTLLNELQT++S MK KG
Subjt:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG

Query:  QEGEANLATS-KRFNRGS
        Q+GEAN+ATS ++F+RGS
Subjt:  QEGEANLATS-KRFNRGS

A0A5D3CPJ6 Gag/pol protein2.2e-7667.89Show/hide
Query:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
        M+++ + +LAA KL+G NY  WK+ +NT+L+IDDLRFVL E   Q PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG
        MFGQ S Q +H+ALK++YN RM EG+SVREHVLN+MVHFNV+E NGAVIDE SQVSFILESLP+SFL F SN VMNK+ YTLTTLLNELQT++S MK KG
Subjt:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKG

Query:  QEGEANLATS-KRFNRGS
        Q+GEAN+ATS ++F+RGS
Subjt:  QEGEANLATS-KRFNRGS

A0A6J1DWL0 uncharacterized protein LOC1110247343.6e-7990.7Show/hide
Query:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
        MSASIIALLAAQKL+G+NY+QWKSNLNTIL+IDDLRFVLQE   QAP  NATVAVRNAYDRWIK+NDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS
Subjt:  MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSN
        MFGQPS QARHEALKFVYN RMKEGSSVREHVLNLMVHFNV+ESNG VIDEQSQ SFILESLPK+FLPFHSN
Subjt:  MFGQPSSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCAGACTACGCAGACAGCGTCGTGGCGCTATGCAACAGCGCCATGGCGCTGTGGGGACAACACACAGCGCCACGGCGCTGCTACTGCGGCATTTTGCTTA
GGCGCCGAGGCGCTGTCCCGGTTTTTTTTCGTGGGAGAAGGACGTGTGACAACACATCCTGCGGTCTCCGCCATTGGTTTGCACCGTGAGGTTTCATACATAACC
TGCGTGTCGTCCAAGAGCGACCATCCCTACGGAGGGTTCATTGATTATTGGGGTGGATCTCTGAGGTCCGAAAATGACGAGTTACACTTACAGGGAGTTGTTAAC
ATGTCTGCTTCCATTATTGCACTCCTAGCCGCTCAAAAACTTAGCGGCAAGAATTACAAACAATGGAAATCGAATCTAAACACTATTCTCCTGATAGATGATCTT
AGGTTCGTCTTGCAAGAGTATTATTCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGAAACGCCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAG
GTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCACTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCG
TCCTCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTACAACTGTCGCATGAAGGAGGGCTCCTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAAC
GTGTCTGAGTCGAACGGGGCCGTCATAGACGAGCAGAGTCAGGTCAGCTTCATTCTGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCACAGCAATGAGGTTATG
AATAAGCTGGAGTACACTCTTACCACGCTCTTAAACGAGCTGCAGACCTACCAGTCTTTTATGAAGAGTAAGGGACAAGAAGGGGAGGCAAATCTTGCCACCTCA
AAAAGGTTCAACCGAGGTTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCAGACTACGCAGACAGCGTCGTGGCGCTATGCAACAGCGCCATGGCGCTGTGGGGACAACACACAGCGCCACGGCGCTGCTACTGCGGCATTTTGCTTA
GGCGCCGAGGCGCTGTCCCGGTTTTTTTTCGTGGGAGAAGGACGTGTGACAACACATCCTGCGGTCTCCGCCATTGGTTTGCACCGTGAGGTTTCATACATAACC
TGCGTGTCGTCCAAGAGCGACCATCCCTACGGAGGGTTCATTGATTATTGGGGTGGATCTCTGAGGTCCGAAAATGACGAGTTACACTTACAGGGAGTTGTTAAC
ATGTCTGCTTCCATTATTGCACTCCTAGCCGCTCAAAAACTTAGCGGCAAGAATTACAAACAATGGAAATCGAATCTAAACACTATTCTCCTGATAGATGATCTT
AGGTTCGTCTTGCAAGAGTATTATTCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGAAACGCCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAG
GTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCACTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCG
TCCTCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTACAACTGTCGCATGAAGGAGGGCTCCTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAAC
GTGTCTGAGTCGAACGGGGCCGTCATAGACGAGCAGAGTCAGGTCAGCTTCATTCTGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCACAGCAATGAGGTTATG
AATAAGCTGGAGTACACTCTTACCACGCTCTTAAACGAGCTGCAGACCTACCAGTCTTTTATGAAGAGTAAGGGACAAGAAGGGGAGGCAAATCTTGCCACCTCA
AAAAGGTTCAACCGAGGTTCGTGA
Protein sequenceShow/hide protein sequence
MKQTTQTASWRYATAPWRCGDNTQRHGAATAAFCLGAEALSRFFFVGEGRVTTHPAVSAIGLHREVSYITCVSSKSDHPYGGFIDYWGGSLRSENDELHLQGVVN
MSASIIALLAAQKLSGKNYKQWKSNLNTILLIDDLRFVLQEYYSQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTTKEIMDSLQSMFGQP
SSQARHEALKFVYNCRMKEGSSVREHVLNLMVHFNVSESNGAVIDEQSQVSFILESLPKSFLPFHSNEVMNKLEYTLTTLLNELQTYQSFMKSKGQEGEANLATS
KRFNRGS