; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g19590 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g19590
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:15417703..15421361
RNA-Seq ExpressionMoc06g19590
SyntenyMoc06g19590
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR025312 - Domain of unknown function DUF4216
IPR025452 - Domain of unknown function DUF4218


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035742.1 uncharacterized protein E6C27_scaffold403G00130 [Cucumis melo var. makuwa]2.7e-2437.91Show/hide
Query:  KSLRTLKQYVKNKARLEGSIAEAIIANEALTFCSMYLDGIETELNRPIRNDDMVDLRESKE-HLPVFSQNARPIGGSQLKELLPAELLRAHW-----CD-
        +SLR LK+YV+NKAR EGSIAEA + NE+L FCSMYL GIET  NR  RN+D +D  E     L +FSQ+ R +GG+  ++L P EL ++HW     CD 
Subjt:  KSLRTLKQYVKNKARLEGSIAEAIIANEALTFCSMYLDGIETELNRPIRNDDMVDLRESKE-HLPVFSQNARPIGGSQLKELLPAELLRAHW-----CD-

Query:  -------------------------------WYDTNS------------------------------KKNRILQYHNFTSIDTRYLWCQNERFILVSQAQ
                                       W+   +                              +KNRI     FTS++TR  W ++E FILVSQA 
Subjt:  -------------------------------WYDTNS------------------------------KKNRILQYHNFTSIDTRYLWCQNERFILVSQAQ

Query:  QVFYIEDLKLG
        QVFY++D KLG
Subjt:  QVFYIEDLKLG

TYK21879.1 uncharacterized protein E5676_scaffold494G00120 [Cucumis melo var. makuwa]4.6e-2437.91Show/hide
Query:  KSLRTLKQYVKNKARLEGSIAEAIIANEALTFCSMYLDGIETELNRPIRNDDMVDLRESKE-HLPVFSQNARPIGGSQLKELLPAELLRAHW-----CD-
        +SL  LK+YV+NKAR EGSIAEA + NE+L FCSMYL GIET  NR  RN+D +D  E     L +FSQ+ R +GG+  ++L P EL ++HW     CD 
Subjt:  KSLRTLKQYVKNKARLEGSIAEAIIANEALTFCSMYLDGIETELNRPIRNDDMVDLRESKE-HLPVFSQNARPIGGSQLKELLPAELLRAHW-----CD-

Query:  -------------------------------WYDTNS------------------------------KKNRILQYHNFTSIDTRYLWCQNERFILVSQAQ
                                       W+   +                              +KNRI     FTSI+TR  W ++E FILVSQA 
Subjt:  -------------------------------WYDTNS------------------------------KKNRILQYHNFTSIDTRYLWCQNERFILVSQAQ

Query:  QVFYIEDLKLG
        QVFY++D KLG
Subjt:  QVFYIEDLKLG

XP_022152232.1 uncharacterized protein LOC111020001 [Momordica charantia]1.6e-2987.95Show/hide
Query:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEVS
        M LMIMKRSI ETFRGSI+EGTNAKGFLKEM+QY TKND AE STLMAKLTSSRYVGKGNIREYIM++SNVATKLK LKLEVS
Subjt:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEVS

XP_022155096.1 uncharacterized protein LOC111022228 [Momordica charantia]1.1e-3081.52Show/hide
Query:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEVSEDFLVEFLV
        MCLMIMKRSI ETFRGSIVEGTNAK FLKEMKQY TKND AE STLM KLTSSRYVGKGNIREY M++S+VATKLK LKL+VSE+FLV  ++
Subjt:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEVSEDFLVEFLV

XP_022156979.1 uncharacterized protein LOC111023808 [Momordica charantia]2.5e-3090.24Show/hide
Query:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEV
        MCLMIMKRSI ETFRGSIVEGTNAKGFLKEM+QY TKND AE STLMAKLTSSRYVGKGNIREYIM++SNVATKLK LKLEV
Subjt:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEV

TrEMBL top hitse value%identityAlignment
A0A5A7SWV9 Uncharacterized protein1.3e-2437.91Show/hide
Query:  KSLRTLKQYVKNKARLEGSIAEAIIANEALTFCSMYLDGIETELNRPIRNDDMVDLRESKE-HLPVFSQNARPIGGSQLKELLPAELLRAHW-----CD-
        +SLR LK+YV+NKAR EGSIAEA + NE+L FCSMYL GIET  NR  RN+D +D  E     L +FSQ+ R +GG+  ++L P EL ++HW     CD 
Subjt:  KSLRTLKQYVKNKARLEGSIAEAIIANEALTFCSMYLDGIETELNRPIRNDDMVDLRESKE-HLPVFSQNARPIGGSQLKELLPAELLRAHW-----CD-

Query:  -------------------------------WYDTNS------------------------------KKNRILQYHNFTSIDTRYLWCQNERFILVSQAQ
                                       W+   +                              +KNRI     FTS++TR  W ++E FILVSQA 
Subjt:  -------------------------------WYDTNS------------------------------KKNRILQYHNFTSIDTRYLWCQNERFILVSQAQ

Query:  QVFYIEDLKLG
        QVFY++D KLG
Subjt:  QVFYIEDLKLG

A0A5D3DEV9 Uncharacterized protein2.2e-2437.91Show/hide
Query:  KSLRTLKQYVKNKARLEGSIAEAIIANEALTFCSMYLDGIETELNRPIRNDDMVDLRESKE-HLPVFSQNARPIGGSQLKELLPAELLRAHW-----CD-
        +SL  LK+YV+NKAR EGSIAEA + NE+L FCSMYL GIET  NR  RN+D +D  E     L +FSQ+ R +GG+  ++L P EL ++HW     CD 
Subjt:  KSLRTLKQYVKNKARLEGSIAEAIIANEALTFCSMYLDGIETELNRPIRNDDMVDLRESKE-HLPVFSQNARPIGGSQLKELLPAELLRAHW-----CD-

Query:  -------------------------------WYDTNS------------------------------KKNRILQYHNFTSIDTRYLWCQNERFILVSQAQ
                                       W+   +                              +KNRI     FTSI+TR  W ++E FILVSQA 
Subjt:  -------------------------------WYDTNS------------------------------KKNRILQYHNFTSIDTRYLWCQNERFILVSQAQ

Query:  QVFYIEDLKLG
        QVFY++D KLG
Subjt:  QVFYIEDLKLG

A0A6J1DFM1 uncharacterized protein LOC1110200017.9e-3087.95Show/hide
Query:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEVS
        M LMIMKRSI ETFRGSI+EGTNAKGFLKEM+QY TKND AE STLMAKLTSSRYVGKGNIREYIM++SNVATKLK LKLEVS
Subjt:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEVS

A0A6J1DQP2 uncharacterized protein LOC1110222285.5e-3181.52Show/hide
Query:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEVSEDFLVEFLV
        MCLMIMKRSI ETFRGSIVEGTNAK FLKEMKQY TKND AE STLM KLTSSRYVGKGNIREY M++S+VATKLK LKL+VSE+FLV  ++
Subjt:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEVSEDFLVEFLV

A0A6J1DV67 uncharacterized protein LOC1110238081.2e-3090.24Show/hide
Query:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEV
        MCLMIMKRSI ETFRGSIVEGTNAKGFLKEM+QY TKND AE STLMAKLTSSRYVGKGNIREYIM++SNVATKLK LKLEV
Subjt:  MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCTGATGATCATGAAGCGCTCAATTCTAGAAACGTTTAGAGGCTCTATTGTTGAGGGAACGAATGCCAAAGGTTTTTTAAAGGAAATGAAGCAGTACATTACCAA
AAACGATATGGCAGAGGTGAGTACCCTTATGGCAAAACTCACCTCTTCAAGATACGTTGGTAAAGGAAACATCAGGGAATACATAATGAAAATATCAAATGTTGCAACAA
AACTTAAAACACTGAAGTTGGAAGTTTCTGAAGACTTTTTAGTGGAATTTTTGGTGCTAGCATTCATGATTAAGAGTCTGAGGACACTGAAACAATATGTGAAAAATAAA
GCTCGACTTGAAGGTTCTATAGCAGAAGCTATCATTGCGAATGAAGCATTGACGTTTTGCTCAATGTATCTGGATGGAATTGAAACCGAATTAAATAGGCCCATTCGAAA
TGATGACATGGTGGACTTAAGAGAATCAAAGGAACATCTTCCTGTATTTTCACAAAATGCACGTCCAATCGGTGGTTCACAACTGAAGGAATTATTGCCTGCTGAGTTGC
TAAGGGCGCATTGGTGTGATTGGTATGATACGAATTCTAAGAAAAATCGTATTCTTCAGTACCATAATTTTACGAGCATAGACACACGTTATTTGTGGTGTCAGAACGAA
CGATTCATACTTGTCTCCCAAGCACAACAAGTATTTTATATTGAAGATCTGAAGTTAGGAACAATGTTGTTAGATAACGACTATGATGGAGAACTAGGAGTTCGCAGAGA
ATCTCGAGGAGTCGTGATGATAGTTGCAAATGGAGGTAATCCCATACCAATTTCGTGGACTGCAAAAGCAGCAAACCTGTTGGGTATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGTCTGATGATCATGAAGCGCTCAATTCTAGAAACGTTTAGAGGCTCTATTGTTGAGGGAACGAATGCCAAAGGTTTTTTAAAGGAAATGAAGCAGTACATTACCAA
AAACGATATGGCAGAGGTGAGTACCCTTATGGCAAAACTCACCTCTTCAAGATACGTTGGTAAAGGAAACATCAGGGAATACATAATGAAAATATCAAATGTTGCAACAA
AACTTAAAACACTGAAGTTGGAAGTTTCTGAAGACTTTTTAGTGGAATTTTTGGTGCTAGCATTCATGATTAAGAGTCTGAGGACACTGAAACAATATGTGAAAAATAAA
GCTCGACTTGAAGGTTCTATAGCAGAAGCTATCATTGCGAATGAAGCATTGACGTTTTGCTCAATGTATCTGGATGGAATTGAAACCGAATTAAATAGGCCCATTCGAAA
TGATGACATGGTGGACTTAAGAGAATCAAAGGAACATCTTCCTGTATTTTCACAAAATGCACGTCCAATCGGTGGTTCACAACTGAAGGAATTATTGCCTGCTGAGTTGC
TAAGGGCGCATTGGTGTGATTGGTATGATACGAATTCTAAGAAAAATCGTATTCTTCAGTACCATAATTTTACGAGCATAGACACACGTTATTTGTGGTGTCAGAACGAA
CGATTCATACTTGTCTCCCAAGCACAACAAGTATTTTATATTGAAGATCTGAAGTTAGGAACAATGTTGTTAGATAACGACTATGATGGAGAACTAGGAGTTCGCAGAGA
ATCTCGAGGAGTCGTGATGATAGTTGCAAATGGAGGTAATCCCATACCAATTTCGTGGACTGCAAAAGCAGCAAACCTGTTGGGTATCTAG
Protein sequenceShow/hide protein sequence
MCLMIMKRSILETFRGSIVEGTNAKGFLKEMKQYITKNDMAEVSTLMAKLTSSRYVGKGNIREYIMKISNVATKLKTLKLEVSEDFLVEFLVLAFMIKSLRTLKQYVKNK
ARLEGSIAEAIIANEALTFCSMYLDGIETELNRPIRNDDMVDLRESKEHLPVFSQNARPIGGSQLKELLPAELLRAHWCDWYDTNSKKNRILQYHNFTSIDTRYLWCQNE
RFILVSQAQQVFYIEDLKLGTMLLDNDYDGELGVRRESRGVVMIVANGGNPIPISWTAKAANLLGI