; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g31780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g31780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr9:23976972..23994430
RNA-Seq ExpressionMoc09g31780
SyntenyMoc09g31780
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-4938.57Show/hide
Query:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM
        +++ + +LA +KLN  NY  WK+ +NT+L+IDDL FVL E+CPQ PA NAT                               K    +T +EIMDSLQ M
Subjt:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM

Query:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------
        FGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAV DE SQVSFILESLP+SFL   SNAVM                          
Subjt:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------

Query:  ---------------------------------------------------------------------------------------------------M
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------M

Query:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV
        ENDDSAWI+DS A+NHVCSSFQGISSWRQLE GEMT +VGTG VVS +AV
Subjt:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-4938.57Show/hide
Query:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM
        +++ + +LA +KLN  NY  WK+ +NT+L+IDDL FVL E+CPQ PA NAT                               K    +T +EIMDSLQ M
Subjt:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM

Query:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------
        FGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAV DE SQVSFILESLP+SFL   SNAVM                          
Subjt:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------

Query:  ---------------------------------------------------------------------------------------------------M
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------M

Query:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV
        ENDDSAWI+DS A+NHVCSSFQGISSWRQLE GEMT +VGTG VVS +AV
Subjt:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-4938.57Show/hide
Query:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM
        +++ + +LA +KLN  NY  WK+ +NT+L+IDDL FVL E+CPQ PA NAT                               K    +T +EIMDSLQ M
Subjt:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM

Query:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------
        FGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAV DE SQVSFILESLP+SFL   SNAVM                          
Subjt:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------

Query:  ---------------------------------------------------------------------------------------------------M
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------M

Query:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV
        ENDDSAWI+DS A+NHVCSSFQGISSWRQLE GEMT +VGTG VVS +AV
Subjt:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV

XP_022158062.1 uncharacterized protein LOC111024637 [Momordica charantia]1.9e-5559.21Show/hide
Query:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNATVV-----------------------------KRGDTVTGKEIMDSLQSM
        STSII LL  +KLN ENYKQWKSN+NTIL+IDDL FVLQEDCPQAPAPNATV                              K  ++V  KEIMDSLQSM
Subjt:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNATVV-----------------------------KRGDTVTGKEIMDSLQSM

Query:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVMMENDDSAWILDSEASNHVCSSFQGIS
        FGQPSSQARHEAL  IYNSRMK+ SSVREHVLNLMVHFNV ESN  V DEQSQV FILESLPK+FLP CSNA                          I 
Subjt:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVMMENDDSAWILDSEASNHVCSSFQGIS

Query:  SWRQLEAGEMTFKVGTGEVVSTVAVVIN
        S RQ +A EMT KV  GEVVS VAV  +
Subjt:  SWRQLEAGEMTFKVGTGEVVSTVAVVIN

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]4.8e-5470.93Show/hide
Query:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNATVV-----------------------------KRGDTVTGKEIMDSLQSM
        S SIIALLA +KLN ENY+QWKSNLNTILVIDDL FVLQEDCPQAP  NATV                              K  DTVT KEIMDSLQSM
Subjt:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNATVV-----------------------------KRGDTVTGKEIMDSLQSM

Query:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNA
        FGQPS QARHEALKF+YNSRMKEGSSVREHVLNLMVHFNV ESNG V DEQSQ SFILESLPK+FLP  SNA
Subjt:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNA

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein5.9e-5038.57Show/hide
Query:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM
        +++ + +LA +KLN  NY  WK+ +NT+L+IDDL FVL E+CPQ PA NAT                               K    +T +EIMDSLQ M
Subjt:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM

Query:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------
        FGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAV DE SQVSFILESLP+SFL   SNAVM                          
Subjt:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------

Query:  ---------------------------------------------------------------------------------------------------M
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------M

Query:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV
        ENDDSAWI+DS A+NHVCSSFQGISSWRQLE GEMT +VGTG VVS +AV
Subjt:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV

A0A5A7TWB9 Gag/pol protein5.9e-5038.57Show/hide
Query:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM
        +++ + +LA +KLN  NY  WK+ +NT+L+IDDL FVL E+CPQ PA NAT                               K    +T +EIMDSLQ M
Subjt:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM

Query:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------
        FGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAV DE SQVSFILESLP+SFL   SNAVM                          
Subjt:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------

Query:  ---------------------------------------------------------------------------------------------------M
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------M

Query:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV
        ENDDSAWI+DS A+NHVCSSFQGISSWRQLE GEMT +VGTG VVS +AV
Subjt:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV

A0A5D3CPJ6 Gag/pol protein5.9e-5038.57Show/hide
Query:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM
        +++ + +LA +KLN  NY  WK+ +NT+L+IDDL FVL E+CPQ PA NAT                               K    +T +EIMDSLQ M
Subjt:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNAT-----------------------------VVKRGDTVTGKEIMDSLQSM

Query:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------
        FGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAV DE SQVSFILESLP+SFL   SNAVM                          
Subjt:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVM--------------------------

Query:  ---------------------------------------------------------------------------------------------------M
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------M

Query:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV
        ENDDSAWI+DS A+NHVCSSFQGISSWRQLE GEMT +VGTG VVS +AV
Subjt:  ENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEMTFKVGTGEVVSTVAV

A0A6J1DW68 uncharacterized protein LOC1110246379.4e-5659.21Show/hide
Query:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNATVV-----------------------------KRGDTVTGKEIMDSLQSM
        STSII LL  +KLN ENYKQWKSN+NTIL+IDDL FVLQEDCPQAPAPNATV                              K  ++V  KEIMDSLQSM
Subjt:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNATVV-----------------------------KRGDTVTGKEIMDSLQSM

Query:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVMMENDDSAWILDSEASNHVCSSFQGIS
        FGQPSSQARHEAL  IYNSRMK+ SSVREHVLNLMVHFNV ESN  V DEQSQV FILESLPK+FLP CSNA                          I 
Subjt:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVMMENDDSAWILDSEASNHVCSSFQGIS

Query:  SWRQLEAGEMTFKVGTGEVVSTVAVVIN
        S RQ +A EMT KV  GEVVS VAV  +
Subjt:  SWRQLEAGEMTFKVGTGEVVSTVAVVIN

A0A6J1DWL0 uncharacterized protein LOC1110247342.3e-5470.93Show/hide
Query:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNATVV-----------------------------KRGDTVTGKEIMDSLQSM
        S SIIALLA +KLN ENY+QWKSNLNTILVIDDL FVLQEDCPQAP  NATV                              K  DTVT KEIMDSLQSM
Subjt:  STSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNATVV-----------------------------KRGDTVTGKEIMDSLQSM

Query:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNA
        FGQPS QARHEALKF+YNSRMKEGSSVREHVLNLMVHFNV ESNG V DEQSQ SFILESLPK+FLP  SNA
Subjt:  FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCTGCAGCACTGCGGCGCTAGGAGACAGCACCTCGGCGCTGTCCTAGTTGCCGAAAAAGGGGTTTTGTCCAGTCGATTTTGTGTCCTTTTTCAGAGGGTTTCGTG
GCAGTTTCAAGGGGAAGGCTCGGGCATTGGCAAAGGCATTGAGGCTCATCAAGATCAACACTTTCGGGGATCCCACAATCCGGTTTTTGTTGCTAGAGTCATAGCAAGGA
AGTCTTGTGGTGGTGTTCTATTATTCGTGCGATTGTATTCGTTTGTACGTGTCAAACCAAAATCTGGGTTTGAGGATGAAGAATCCTCAACGGTCTTCAAACAATTTTTC
CAGCAAGCGCCATGGCGTTGCACTGTAGCACTACGGTTCTATGCTGTAGTGACTGCTAGAATATCTGCACTAGTGTTGCAATGCTGCCCTATGGCGCCGCGGCGTTGTCC
CAACACGTCAACTTCTATTATTGCTTTGTTAGCCACTGAAAAACTTAACAGCGAGAACTACAAACAATGGAAATCGAACCTAAACACTATACTAGTGATAGATGATCTTA
TGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCACCTAATGCCACTGTAGTGAAGCGTGGGGACACAGTCACCGGTAAGGAGATCATGGACTCGCTGCAGAGCATG
TTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCA
CTTCAACGTGAAAGAGTCGAATGGGGCCGTCAGAGACGAGCAAAGTCAAGTCAGCTTCATTTTGGAATCTCTTCCGAAGAGTTTCCTACCATCCTGCAGTAATGCAGTTA
TGATGGAGAATGATGACTCCGCTTGGATATTAGATTCAGAAGCCAGTAATCACGTTTGTTCTTCGTTTCAGGGAATTAGTTCCTGGAGGCAGCTTGAAGCCGGAGAGATG
ACATTCAAGGTCGGAACGGGAGAGGTTGTCTCAACTGTGGCGGTAGTGATAAATGAGATTTCTAAAGAGGCTACAAATACGTCAACAAGAGTTGTTGATCAAGCTTACAC
TTTAACAAAAATTGTTGGTGAAGCTAGCACGTCACATCAGTCACATGAACCTCAAGCATTGAGGGTGCCTCGACATAGTGGGAGAATTGTGTCACAAACTGACCGCTACG
TGGGTTTAACTGAAACCCAAGTGGTCATACCTGATGACGTGTCGAGGATCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCTGCAGCACTGCGGCGCTAGGAGACAGCACCTCGGCGCTGTCCTAGTTGCCGAAAAAGGGGTTTTGTCCAGTCGATTTTGTGTCCTTTTTCAGAGGGTTTCGTG
GCAGTTTCAAGGGGAAGGCTCGGGCATTGGCAAAGGCATTGAGGCTCATCAAGATCAACACTTTCGGGGATCCCACAATCCGGTTTTTGTTGCTAGAGTCATAGCAAGGA
AGTCTTGTGGTGGTGTTCTATTATTCGTGCGATTGTATTCGTTTGTACGTGTCAAACCAAAATCTGGGTTTGAGGATGAAGAATCCTCAACGGTCTTCAAACAATTTTTC
CAGCAAGCGCCATGGCGTTGCACTGTAGCACTACGGTTCTATGCTGTAGTGACTGCTAGAATATCTGCACTAGTGTTGCAATGCTGCCCTATGGCGCCGCGGCGTTGTCC
CAACACGTCAACTTCTATTATTGCTTTGTTAGCCACTGAAAAACTTAACAGCGAGAACTACAAACAATGGAAATCGAACCTAAACACTATACTAGTGATAGATGATCTTA
TGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCACCTAATGCCACTGTAGTGAAGCGTGGGGACACAGTCACCGGTAAGGAGATCATGGACTCGCTGCAGAGCATG
TTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCA
CTTCAACGTGAAAGAGTCGAATGGGGCCGTCAGAGACGAGCAAAGTCAAGTCAGCTTCATTTTGGAATCTCTTCCGAAGAGTTTCCTACCATCCTGCAGTAATGCAGTTA
TGATGGAGAATGATGACTCCGCTTGGATATTAGATTCAGAAGCCAGTAATCACGTTTGTTCTTCGTTTCAGGGAATTAGTTCCTGGAGGCAGCTTGAAGCCGGAGAGATG
ACATTCAAGGTCGGAACGGGAGAGGTTGTCTCAACTGTGGCGGTAGTGATAAATGAGATTTCTAAAGAGGCTACAAATACGTCAACAAGAGTTGTTGATCAAGCTTACAC
TTTAACAAAAATTGTTGGTGAAGCTAGCACGTCACATCAGTCACATGAACCTCAAGCATTGAGGGTGCCTCGACATAGTGGGAGAATTGTGTCACAAACTGACCGCTACG
TGGGTTTAACTGAAACCCAAGTGGTCATACCTGATGACGTGTCGAGGATCCATTGA
Protein sequenceShow/hide protein sequence
MALQHCGARRQHLGAVLVAEKGVLSSRFCVLFQRVSWQFQGEGSGIGKGIEAHQDQHFRGSHNPVFVARVIARKSCGGVLLFVRLYSFVRVKPKSGFEDEESSTVFKQFF
QQAPWRCTVALRFYAVVTARISALVLQCCPMAPRRCPNTSTSIIALLATEKLNSENYKQWKSNLNTILVIDDLMFVLQEDCPQAPAPNATVVKRGDTVTGKEIMDSLQSM
FGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVKESNGAVRDEQSQVSFILESLPKSFLPSCSNAVMMENDDSAWILDSEASNHVCSSFQGISSWRQLEAGEM
TFKVGTGEVVSTVAVVINEISKEATNTSTRVVDQAYTLTKIVGEASTSHQSHEPQALRVPRHSGRIVSQTDRYVGLTETQVVIPDDVSRIH