CuGenDBv2

Moc08g25530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g25530
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Transposon Ty3-I Gag-Pol polyprotein
Genome location: chr8:18420567..18424605
RNA-Seq Expression: Moc08g25530
Synteny: Moc08g25530
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
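
The genome location field packs a sequence name and two coordinates into one string. A minimal sketch of splitting it apart is shown here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr8:18420567..18424605")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4039 bp on chr8.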


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0040926.1 uncharacterized protein E6C27_scaffold125G00500 [Cucumis melo var. makuwa] (e-value: 1.1e-25; identity: 71.91%)
Query:  QANSGGTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNESLQPKGK
        Q+ + GTKSVAEYY+EM+T++ + +I+E EEDTMSRFLGGLN+EIAH VDRNPP  +EDMYHYA+KIE QLKEEKE SKRNES+Q +GK
Subjt:  QANSGGTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNESLQPKGK

KAA0057648.1 putative polyprotein [Cucumis melo var. makuwa] (e-value: 1.3e-23; identity: 71.08%)
Query:  TKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNESLQPKGK
        ++ VAEYY+EM+T++ + +I+E+EEDTMSRFLGGLN+EIAH VDRNPP  +EDMYHYA+KIE QLKEEKE SKRNES+Q +GK
Subjt:  TKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNESLQPKGK

XP_022153198.1 uncharacterized protein LOC111020753 [Momordica charantia] (e-value: 3.4e-32; identity: 73.87%)
Query:  QANSGGTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSK---------------------
        QA   GTKSVAEYYQEM+T+MLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHY VKIEDQLKEEKEYSK                     
Subjt:  QANSGGTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSK---------------------

Query:  -RNESLQPKGK
         RNESLQPK K
Subjt:  -RNESLQPKGK

XP_022932136.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111438459, partial [Cucurbita moschata] (e-value: 7.4e-27; identity: 83.33%)
Query:  GTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNES
        GTKSVAEYYQEM+T+M +  +REEEEDTMSRFLGGLNREIAH VDRNPPPYLEDM HYA+KIEDQLKEEKE+SKR  S
Subjt:  GTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNES

XP_023544048.1 uncharacterized protein LOC111803745 [Cucurbita pepo subsp. pepo] (e-value: 6.7e-28; identity: 84.62%)
Query:  GTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNES
        GTKSVAEYYQEM+T+M +  +REEEEDTMSRFLGGLNREIAH VDRNPPPYLEDMYHYA+KIEDQLKEEKE+SKR  S
Subjt:  GTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNES

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TFV5 Retrotrans_gag domain-containing protein (e-value: 5.2e-26; identity: 71.91%)
Query:  QANSGGTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNESLQPKGK
        Q+ + GTKSVAEYY+EM+T++ + +I+E EEDTMSRFLGGLN+EIAH VDRNPP  +EDMYHYA+KIE QLKEEKE SKRNES+Q +GK
Subjt:  QANSGGTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNESLQPKGK

A0A5A7UR86 Putative polyprotein (e-value: 6.3e-24; identity: 71.08%)
Query:  TKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNESLQPKGK
        ++ VAEYY+EM+T++ + +I+E+EEDTMSRFLGGLN+EIAH VDRNPP  +EDMYHYA+KIE QLKEEKE SKRNES+Q +GK
Subjt:  TKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNESLQPKGK

A0A5D3C8C6 CCHC-type domain-containing protein (e-value: 2.7e-22; identity: 59.43%)
Query:  GTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSK----------------------RNES
        GTKSVAEYY+EM+T++ + +I+E+EEDTMSRFLGGLN+EIAH VDRNPP  +EDMYHYA+KIE QLKEEKE SK                      RNES
Subjt:  GTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSK----------------------RNES

Query:  LQPKGK
        +Q +GK
Subjt:  LQPKGK

A0A6J1DGU9 uncharacterized protein LOC111020753 (e-value: 1.7e-32; identity: 73.87%)
Query:  QANSGGTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSK---------------------
        QA   GTKSVAEYYQEM+T+MLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHY VKIEDQLKEEKEYSK                     
Subjt:  QANSGGTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSK---------------------

Query:  -RNESLQPKGK
         RNESLQPK K
Subjt:  -RNESLQPKGK

A0A6J1EVI6 LOW QUALITY PROTEIN: uncharacterized protein LOC111438459 (e-value: 3.6e-27; identity: 83.33%)
Query:  GTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNES
        GTKSVAEYYQEM+T+M +  +REEEEDTMSRFLGGLNREIAH VDRNPPPYLEDM HYA+KIEDQLKEEKE+SKR  S
Subjt:  GTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGGLNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNES

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGAGGGTGGGGCCCCTTGTTCAAGTCCCAGAGTCAGCATTTAAGGGAACACTCATCTACTCCCCTAAAGTCAGAGAGGAGTGAATTCCATCTTGTGAAGTTA
TGTTCCCAGCTCCCCACTCGGTCTCGTCCCCAAAGTGGTAGGAATGTTGAGTCGGCAACTTGGGCCACTCTCACCCATACAGATCAAAGGACGAACCTTCACGGG
ACACCCCCACTCGCATGTCTCCACACGAACGACCTGGATCGAGTCATCTGTAACCTTTACAGAGCGGGCCGTATCCATAGTGTTGCCAGGATAAGGCAGCTTGAA
GCCGGAGAGATGACTATCAAAGTCGGAACGGGAGAAGTCGTCTTAACTGTGGCAGTAGGAAAGCTTAAGTTGTTTTTGAAAGGATCAGATCCTGCAGTGGAAACG
CGTGAACGCGTGTGCTTCGATCTTGTTTTAGAATTACAACAAAGAAAAGACACTATATTGTACACCTCGTATGATGCAAATCCAACCACGGATCCATTGACTGTA
CCCGGGGGACCAATTACAAGAAGCAAAGCAAAGAAGATTCAAGAGGTTTTCATAATACATCTTCAAAGGCTAGCTAATGCACACGAGGAGACAAAGATTTCTGAG
GCAAAAATTCTTTACAATGTTAATTTAATGAGTCAAGAAGAGAATGGAGCAAAGATGGCACGGGAAAAGTTGTCTATTTTGAGAGATGGCACGGAGGACAAAAAA
AGTGTGCAGATTGGTGAACAGGTGCCTTTGCAGATTGCACCTCTTGGGAGTTTGTTTGATGTTAAACCGATTCATTGGATTAGTCTCGATCAAGCTAATTCTGGA
GGAACAAAAAGTGTTGCTGAGTATTATCAAGAGATGAAGACCATAATGCTAAAAACCCAAATCCGAGAAGAAGAGGAAGACACCATGTCTAGATTCCTTGGAGGT
TTGAATCGAGAAATTGCTCATGCTGTTGATAGAAATCCACCGCCTTACCTAGAAGATATGTACCATTATGCTGTCAAAATTGAAGATCAATTGAAGGAAGAAAAG
GAGTATTCAAAAAGGAATGAATCATTGCAACCAAAAGGAAAGTTGTAG
mRNA sequence
ATGAGAGGGTGGGGCCCCTTGTTCAAGTCCCAGAGTCAGCATTTAAGGGAACACTCATCTACTCCCCTAAAGTCAGAGAGGAGTGAATTCCATCTTGTGAAGTTA
TGTTCCCAGCTCCCCACTCGGTCTCGTCCCCAAAGTGGTAGGAATGTTGAGTCGGCAACTTGGGCCACTCTCACCCATACAGATCAAAGGACGAACCTTCACGGG
ACACCCCCACTCGCATGTCTCCACACGAACGACCTGGATCGAGTCATCTGTAACCTTTACAGAGCGGGCCGTATCCATAGTGTTGCCAGGATAAGGCAGCTTGAA
GCCGGAGAGATGACTATCAAAGTCGGAACGGGAGAAGTCGTCTTAACTGTGGCAGTAGGAAAGCTTAAGTTGTTTTTGAAAGGATCAGATCCTGCAGTGGAAACG
CGTGAACGCGTGTGCTTCGATCTTGTTTTAGAATTACAACAAAGAAAAGACACTATATTGTACACCTCGTATGATGCAAATCCAACCACGGATCCATTGACTGTA
CCCGGGGGACCAATTACAAGAAGCAAAGCAAAGAAGATTCAAGAGGTTTTCATAATACATCTTCAAAGGCTAGCTAATGCACACGAGGAGACAAAGATTTCTGAG
GCAAAAATTCTTTACAATGTTAATTTAATGAGTCAAGAAGAGAATGGAGCAAAGATGGCACGGGAAAAGTTGTCTATTTTGAGAGATGGCACGGAGGACAAAAAA
AGTGTGCAGATTGGTGAACAGGTGCCTTTGCAGATTGCACCTCTTGGGAGTTTGTTTGATGTTAAACCGATTCATTGGATTAGTCTCGATCAAGCTAATTCTGGA
GGAACAAAAAGTGTTGCTGAGTATTATCAAGAGATGAAGACCATAATGCTAAAAACCCAAATCCGAGAAGAAGAGGAAGACACCATGTCTAGATTCCTTGGAGGT
TTGAATCGAGAAATTGCTCATGCTGTTGATAGAAATCCACCGCCTTACCTAGAAGATATGTACCATTATGCTGTCAAAATTGAAGATCAATTGAAGGAAGAAAAG
GAGTATTCAAAAAGGAATGAATCATTGCAACCAAAAGGAAAGTTGTAG
Protein sequence
MRGWGPLFKSQSQHLREHSSTPLKSERSEFHLVKLCSQLPTRSRPQSGRNVESATWATLTHTDQRTNLHGTPPLACLHTNDLDRVICNLYRAGRIHSVARIRQLE
AGEMTIKVGTGEVVLTVAVGKLKLFLKGSDPAVETRERVCFDLVLELQQRKDTILYTSYDANPTTDPLTVPGGPITRSKAKKIQEVFIIHLQRLANAHEETKISE
AKILYNVNLMSQEENGAKMAREKLSILRDGTEDKKSVQIGEQVPLQIAPLGSLFDVKPIHWISLDQANSGGTKSVAEYYQEMKTIMLKTQIREEEEDTMSRFLGG
LNREIAHAVDRNPPPYLEDMYHYAVKIEDQLKEEKEYSKRNESLQPKGKL
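
The CDS and protein sequences can be cross-checked by translating the CDS codon by codon. The sketch below does this for the first and last few codons only. It assumes the standard genetic code (NCBI translation table 1), which this page does not state, and the `translate` helper is a hypothetical name rather than anything provided by the database.

```python
BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), ordered T, C, A, G
# for the first, second and third codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAGAGGGTGGGGCCCC"))  # first six codons of the CDS
print(translate("AAAGGAAAGTTGTAG"))     # last five codons, including the stop
```

The two fragments translate to MRGWGP and KGKL, matching the start and end of the protein sequence listed above.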