; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g06140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g06140
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationchr3:4481972..4486367
RNA-Seq ExpressionMoc03g06140
SyntenyMoc03g06140
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia]3.1e-4653.81Show/hide
Query:  IMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP---
        +MLNNAA GAFTKKTFNEIV ILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLA P QPVQ DYCT AP   
Subjt:  IMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP---

Query:  -----------------------------------------------------------------------------------------QYNQRTQTPPV
                                                                                                 QYNQRT+TP V
Subjt:  -----------------------------------------------------------------------------------------QYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQ
        QNNNSNLENMMKEYMARTD VIQ
Subjt:  QNNNSNLENMMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia]1.2e-5848.65Show/hide
Query:  LDHPTKIMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQP-------
        LDHPTK+MLNNAA GAFTKKTFNEIV IL DLASHNELWCSQR + APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K+    P QP       
Subjt:  LDHPTKIMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQP-------

Query:  ----VQSDYCT----------------------------------------------------------------------------LAP---QYNQRTQ
            +   YC+                                                                            + P   QYNQ  +
Subjt:  ----VQSDYCT----------------------------------------------------------------------------LAP---QYNQRTQ

Query:  TP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMP
        TP  P  NNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKA+TLRSG++Y+GP MP
Subjt:  TP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMP

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]2.9e-14558.63Show/hide
Query:  MSTRSFLLPLDPEIEQTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADVPPRDPVDPPVVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIE+TLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMAD+PPRDPVDPP VNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIEQTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADVPPRDPVDPPVVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKIMLNNAAKGAFTKKTFNEIVGILNDLASHNELW
                                                              IEHFFRGLDHPTK+MLNNAA GAFTKKTFNEIV ILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKIMLNNAAKGAFTKKTFNEIVGILNDLASHNELW

Query:  CSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP------------------------------------
        CSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT  QPVQSDYCT AP                                    
Subjt:  CSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP------------------------------------

Query:  ----------------QYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRS
                        QYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPGHTELP+REGKEQCKA+TLRS
Subjt:  ----------------QYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRS

Query:  GMAYDGPTMPTTDVQIPSTEPTVKILE
        G+ YDGPTMPTTDVQIPST+PTVKI E
Subjt:  GMAYDGPTMPTTDVQIPSTEPTVKILE

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia]5.5e-4386.79Show/hide
Query:  YNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMPTTDVQIP
        YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EGKE CKA+TLRSG+ Y+ PTMPTTDVQI 
Subjt:  YNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMPTTDVQIP

Query:  STEPTV
        STEPT+
Subjt:  STEPTV

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]5.1e-8963.33Show/hide
Query:  IMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP---
        +MLNNAA GAFTKKTFNEIV ILNDLASHNELWCSQR RAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATP QPVQSDYCT AP   
Subjt:  IMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP---

Query:  -----------------------------------------------------------------------------------------QYNQRTQTPPV
                                                                                                 +YNQRTQTPPV
Subjt:  -----------------------------------------------------------------------------------------QYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMPTTDVQIPSTEPTVKILE
        QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKA+TLRSG+AYD PTMPT DVQIPST PTVKI E
Subjt:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMPTTDVQIPSTEPTVKILE

TrEMBL top hitse value%identityAlignment
A0A6J1CR45 uncharacterized protein LOC1110134641.5e-4653.81Show/hide
Query:  IMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP---
        +MLNNAA GAFTKKTFNEIV ILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLA P QPVQ DYCT AP   
Subjt:  IMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP---

Query:  -----------------------------------------------------------------------------------------QYNQRTQTPPV
                                                                                                 QYNQRT+TP V
Subjt:  -----------------------------------------------------------------------------------------QYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQ
        QNNNSNLENMMKEYMARTD VIQ
Subjt:  QNNNSNLENMMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC1110185145.9e-5948.65Show/hide
Query:  LDHPTKIMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQP-------
        LDHPTK+MLNNAA GAFTKKTFNEIV IL DLASHNELWCSQR + APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K+    P QP       
Subjt:  LDHPTKIMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQP-------

Query:  ----VQSDYCT----------------------------------------------------------------------------LAP---QYNQRTQ
            +   YC+                                                                            + P   QYNQ  +
Subjt:  ----VQSDYCT----------------------------------------------------------------------------LAP---QYNQRTQ

Query:  TP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMP
        TP  P  NNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKA+TLRSG++Y+GP MP
Subjt:  TP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMP

A0A6J1DW02 uncharacterized protein LOC1110248971.4e-14558.63Show/hide
Query:  MSTRSFLLPLDPEIEQTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADVPPRDPVDPPVVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIE+TLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMAD+PPRDPVDPP VNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIEQTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADVPPRDPVDPPVVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKIMLNNAAKGAFTKKTFNEIVGILNDLASHNELW
                                                              IEHFFRGLDHPTK+MLNNAA GAFTKKTFNEIV ILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKIMLNNAAKGAFTKKTFNEIVGILNDLASHNELW

Query:  CSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP------------------------------------
        CSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT  QPVQSDYCT AP                                    
Subjt:  CSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP------------------------------------

Query:  ----------------QYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRS
                        QYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPGHTELP+REGKEQCKA+TLRS
Subjt:  ----------------QYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRS

Query:  GMAYDGPTMPTTDVQIPSTEPTVKILE
        G+ YDGPTMPTTDVQIPST+PTVKI E
Subjt:  GMAYDGPTMPTTDVQIPSTEPTVKILE

A0A6J1DYG0 uncharacterized protein LOC1110257642.5e-8963.33Show/hide
Query:  IMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP---
        +MLNNAA GAFTKKTFNEIV ILNDLASHNELWCSQR RAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATP QPVQSDYCT AP   
Subjt:  IMLNNAAKGAFTKKTFNEIVGILNDLASHNELWCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAP---

Query:  -----------------------------------------------------------------------------------------QYNQRTQTPPV
                                                                                                 +YNQRTQTPPV
Subjt:  -----------------------------------------------------------------------------------------QYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMPTTDVQIPSTEPTVKILE
        QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKA+TLRSG+AYD PTMPT DVQIPST PTVKI E
Subjt:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMPTTDVQIPSTEPTVKILE

A0A6J1E110 uncharacterized protein LOC1110254241.6e-4387.74Show/hide
Query:  YNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMPTTDVQIP
        YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EGKE CKA+TLRSG+ YD PTMPTTDVQI 
Subjt:  YNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMPTTDVQIP

Query:  STEPTV
        STEPT+
Subjt:  STEPTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACAGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGA
TGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATT
GAGCAGACCCTTCGAAAAACTAGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGT
ACGAGCACATCAATGGCAGATGTTCCACCTCGTGATCCGGTTGATCCACCTGTTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATC
CAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATAGAACATTTCTTTAGAGGTTTAGATCAT
CCTACTAAGATAATGCTAAACAATGCTGCCAAAGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGGCATCCTAAATGACTTAGCTTCGCACAACGAACTA
TGGTGTTCGCAAAGACCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAAC
CAGAGGCTGAAAGAGATGGCGTTGGGAATAAAGAATCCATTAGCCACGCCGACACAACCTGTGCAGTCAGATTATTGCACTCTTGCCCCTCAGTACAATCAGAGA
ACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCTCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATG
AGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGAAAAGAA
CAGTGCAAAGCTATCACCCTTAGGAGTGGAATGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAACTGTAAAGATTCTA
GAGAGAGACTTTGAAGAGTGCTCTGCTATAACTAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAG
ATAGTAGAAGGACCGGAAGATGTGACTAATCATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAGCAGATATCCGAGGGATAAGCCCGACCTTTTGCAT
GCACAAAATTCTATTGGAAGAAGATGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACAGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGA
TGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATT
GAGCAGACCCTTCGAAAAACTAGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGT
ACGAGCACATCAATGGCAGATGTTCCACCTCGTGATCCGGTTGATCCACCTGTTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATC
CAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATAGAACATTTCTTTAGAGGTTTAGATCAT
CCTACTAAGATAATGCTAAACAATGCTGCCAAAGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGGCATCCTAAATGACTTAGCTTCGCACAACGAACTA
TGGTGTTCGCAAAGACCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAAC
CAGAGGCTGAAAGAGATGGCGTTGGGAATAAAGAATCCATTAGCCACGCCGACACAACCTGTGCAGTCAGATTATTGCACTCTTGCCCCTCAGTACAATCAGAGA
ACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCTCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATG
AGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGAAAAGAA
CAGTGCAAAGCTATCACCCTTAGGAGTGGAATGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAACTGTAAAGATTCTA
GAGAGAGACTTTGAAGAGTGCTCTGCTATAACTAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAG
ATAGTAGAAGGACCGGAAGATGTGACTAATCATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAGCAGATATCCGAGGGATAAGCCCGACCTTTTGCAT
GCACAAAATTCTATTGGAAGAAGATGCTAA
Protein sequenceShow/hide protein sequence
MGGARRLGSLQKNRFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLGKLKCMSTRSFLLPLDPEIEQTLRKTRKEQRLRKQLEKQKEREGEISPESEVES
TSTSMADVPPRDPVDPPVVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIIEHFFRGLDHPTKIMLNNAAKGAFTKKTFNEIVGILNDLASHNEL
WCSQRPRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPTQPVQSDYCTLAPQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASM
RNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAITLRSGMAYDGPTMPTTDVQIPSTEPTVKILERDFEECSAITSLNPVMFDEFYDLLVTEIEEELDK
IVEGPEDVTNHFGEAQKGHWMDDSRYPRDKPDLLHAQNSIGRRC