; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g19930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g19930
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed DNA polymerase
Genome locationchr2:14800213..14812044
RNA-Seq ExpressionMoc02g19930
SyntenyMoc02g19930
Gene Ontology termsGO:0016740 - transferase activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia]1.4e-3889.01Show/hide
Query:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPADFIILD
        RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLK MALGIKNPLA PIQPVQ DYCTPAPVCQVNDLICSFC+ENHIYDNCPHNPA    ++
Subjt:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPADFIILD

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]1.2e-7450.54Show/hide
Query:  STRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATA
        STRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAATA
Subjt:  STRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATA

Query:  FQNFDSGIVNPIPAHANFELKPMIFQI-------------------------------------------------------------------------
        FQNFDSGIVNPIPAH NFELKPM+FQ+                                                                         
Subjt:  FQNFDSGIVNPIPAHANFELKPMIFQI-------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLIC
           SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLK MALGIKNPLAT IQPVQSDYCT APVCQVNDLIC
Subjt:  ---SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLIC

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]6.8e-3890.91Show/hide
Query:  QISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPA
        Q SRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLK M LG+KNPLATPIQPVQSDYCTPAPVCQVNDLICSFC+ENHIYD CPHNPA
Subjt:  QISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPA

XP_030479560.1 uncharacterized protein LOC115696816 [Cannabis sativa]1.2e-2937.88Show/hide
Query:  LKPMIFQISRAAPKKQDPAGVLAL-DIATSMQK----EMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPAD
        LK +   IS     +Q P  V  L DI T  ++    E V + +    M LGI   + T +    SD     P  ++ D++          D    +P D
Subjt:  LKPMIFQISRAAPKKQDPAGVLAL-DIATSMQK----EMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPAD

Query:  FIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPVMFDEFY-----DSLVTEIEEELDKIAEGPE
        FIILD EAD +VPIILGRPFLAT  T+ +V+ GE+TMRVN++++ FNV +AM+ P + ++CS ++ ++ ++ + F+     D  V    E+L+ ++E  E
Subjt:  FIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPVMFDEFY-----DSLVTEIEEELDKIAEGPE

Query:  -DVA-----NPIEK---------IQKEECKSLLPSIVAPPTLEQKPLPSHLKYAYLGDNDTLPV
          VA      P  K         +++   K   PSI  PP LE KPLPSHLKYAYLGDN+TLPV
Subjt:  -DVA-----NPIEK---------IQKEECKSLLPSIVAPPTLEQKPLPSHLKYAYLGDNDTLPV

XP_030504461.1 uncharacterized protein LOC115719521 [Cannabis sativa]8.9e-3046.99Show/hide
Query:  ADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPVMFDEFY-----DSLVTEIEEELDKIAEG
        ADF ILD EAD +VPIILGRPFLATG T+ +V+ GE+TMRVN+++V FNV +AM+ P + EECS ++ ++ ++ + F+     D  V    E+L+ ++E 
Subjt:  ADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPVMFDEFY-----DSLVTEIEEELDKIAEG

Query:  PEDVANPIEKIQ---------------KEECKSLLPSIVAPPTLEQKPLPSHLKYAYLGDNDTLPV
         E   + +E +Q               +   K L PSI  PP LE KPLPSHLKYAYLG+N+TLPV
Subjt:  PEDVANPIEKIQ---------------KEECKSLLPSIVAPPTLEQKPLPSHLKYAYLGDNDTLPV

TrEMBL top hitse value%identityAlignment
A0A2G9GK35 Reverse transcriptase2.4e-2846.34Show/hide
Query:  PADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSL-----NPVMFDEFYDSLVTEIEEELDKIAE
        PADF++LD E D++VPIILGRPFLATG T+ +V+KGE+TMRV ++++ FNV  AMK P + +EC A+N       N  + ++  D L   + + LD+  E
Subjt:  PADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSL-----NPVMFDEFYDSLVTEIEEELDKIAE

Query:  GPEDV-----------ANPIEKIQK-EECKSLLPSIVAPPTLEQKPLPSHLKYAYLGDNDTLPV
           +V           +  +E +++    K L PSI  PPTLE KPLPSHL YAYLG++DTLPV
Subjt:  GPEDV-----------ANPIEKIQK-EECKSLLPSIVAPPTLEQKPLPSHLKYAYLGDNDTLPV

A0A2G9HH15 Reverse transcriptase1.5e-2745.12Show/hide
Query:  PADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAIN---------SLNPVMFDEFYDSLVTEIEEELD
        PADF++LD E D++VPIILGRPFLATG T+ +V+KGE+TMRV ++++ FNV  AMK P + +EC A++         S+     D    +L+  ++EE +
Subjt:  PADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAIN---------SLNPVMFDEFYDSLVTEIEEELD

Query:  KIAEGPEDV-------ANPIEKIQK-EECKSLLPSIVAPPTLEQKPLPSHLKYAYLGDNDTLPV
        +  E  + +       +  +E +++    K L PSI  PPTLE KPLPSHL YAYLG++DTLPV
Subjt:  KIAEGPEDV-------ANPIEKIQK-EECKSLLPSIVAPPTLEQKPLPSHLKYAYLGDNDTLPV

A0A6J1CR45 uncharacterized protein LOC1110134646.6e-3989.01Show/hide
Query:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPADFIILD
        RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLK MALGIKNPLA PIQPVQ DYCTPAPVCQVNDLICSFC+ENHIYDNCPHNPA    ++
Subjt:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPADFIILD

A0A6J1DW02 uncharacterized protein LOC1110248975.8e-7550.54Show/hide
Query:  STRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATA
        STRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAATA
Subjt:  STRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATA

Query:  FQNFDSGIVNPIPAHANFELKPMIFQI-------------------------------------------------------------------------
        FQNFDSGIVNPIPAH NFELKPM+FQ+                                                                         
Subjt:  FQNFDSGIVNPIPAHANFELKPMIFQI-------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLIC
           SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLK MALGIKNPLAT IQPVQSDYCT APVCQVNDLIC
Subjt:  ---SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLIC

A0A6J1DYG0 uncharacterized protein LOC1110257643.3e-3890.91Show/hide
Query:  QISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPA
        Q SRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLK M LG+KNPLATPIQPVQSDYCTPAPVCQVNDLICSFC+ENHIYD CPHNPA
Subjt:  QISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCTCCATCTCTTTTCGGCTCTATCGTTTTTGTCTCTATCCTTTCATCTCTTTGTACTTCGCCGCGACGTCTATACTCCTCAGCCAAAACTTCTTTTATTAATATC
AGCTGTGTTCCGAATTTTACCTGTTCTTATTTTGCTTTGTACTCCACAGCTGCAAGTCTTTAAAGCCTTTAAATTTTCCAAGTCTCAATTCACACCTTACCTCGAACTCG
TAAGACCAAGTGTCAGATCTGAAAACTTTGAGGTCGGACTCTCGAGTTATAGTGCCTTGCCATTCGATTTTGGAGGGGTAATGAAAATCCGCGCCGGTACATTGTTCCGT
CTCAAAGCAAGTACCAAAGTTGCTTTCCTTGTCAATTTTTTGGCTTCCCTCGGGTCCATTGGAATATTGCCTTCTCGGAAGTTTCTAATCAGATCCATCCATGACAGAGG
CTGGGAGTCGACTTCCATCATATCCGGCTCCACGATGGACGGGTTGTACAAGATTTCCACCAAGACCGATCACGCCAGGACCATTTGGGAGTGCAAATTAATCAAAAGAA
GCAAAAAGACGAAAAAAGCTTCATGGGAGGCGCCAGGCGCCTGCGTGCCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCCATGCGTTTT
GGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCACGAGCACAAGATCTTTTCTTCTACCCCTTGA
CCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAG
AGAGTACAAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATC
CAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCT
TAAACCAATGATATTCCAAATATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACTTCGATGCAAAAAGAGATGGTTACAATGA
ACCAGAGGCTGAAAGGGATGGCGTTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGAT
CTCATTTGTTCATTTTGCAATGAAAACCATATTTATGATAATTGTCCACATAACCCTGCGGACTTTATCATTCTAGATTGCGAGGCAGACTTAGACGTTCCCATTATTTT
GGGAAGACCATTTCTAGCCACTGGGGATACAGTTTTTAATGTGAGGAAGGGAGAAATTACAATGAGGGTAAATAATGAAGAAGTTAAATTTAACGTTCTAGATGCCATGA
AATTACCAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTCGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAG
ATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGCACCACCCACGTTGGAGCAGAAGCC
ATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTACCAGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCAGTTT
TACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGTACTGACACTGATCCTTGTGCTAGTACCCGGTCTGTCAGATGTAAAAGTCTTAGTGTA
AGGAGGGAGTGTGCAGATTCCTTAGGGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGATGGAAAAACCTACATGGGAGGCGCCAGGCGCCTGGGAAGCCT
GCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACACGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGT
TTGAGCACGATTTGGATTTGGAGAAGGCCACTGTAAGTCTTAACAAGTCAATACTGAAAAGGTTGCTTATTTGCATTGTTGAGTGGTTAAACCGAGGAGAGGAGCGCCTT
AGACATAAGGGTTCCTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCTCCATCTCTTTTCGGCTCTATCGTTTTTGTCTCTATCCTTTCATCTCTTTGTACTTCGCCGCGACGTCTATACTCCTCAGCCAAAACTTCTTTTATTAATATC
AGCTGTGTTCCGAATTTTACCTGTTCTTATTTTGCTTTGTACTCCACAGCTGCAAGTCTTTAAAGCCTTTAAATTTTCCAAGTCTCAATTCACACCTTACCTCGAACTCG
TAAGACCAAGTGTCAGATCTGAAAACTTTGAGGTCGGACTCTCGAGTTATAGTGCCTTGCCATTCGATTTTGGAGGGGTAATGAAAATCCGCGCCGGTACATTGTTCCGT
CTCAAAGCAAGTACCAAAGTTGCTTTCCTTGTCAATTTTTTGGCTTCCCTCGGGTCCATTGGAATATTGCCTTCTCGGAAGTTTCTAATCAGATCCATCCATGACAGAGG
CTGGGAGTCGACTTCCATCATATCCGGCTCCACGATGGACGGGTTGTACAAGATTTCCACCAAGACCGATCACGCCAGGACCATTTGGGAGTGCAAATTAATCAAAAGAA
GCAAAAAGACGAAAAAAGCTTCATGGGAGGCGCCAGGCGCCTGCGTGCCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCCATGCGTTTT
GGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCACGAGCACAAGATCTTTTCTTCTACCCCTTGA
CCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAG
AGAGTACAAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATC
CAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCT
TAAACCAATGATATTCCAAATATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACTTCGATGCAAAAAGAGATGGTTACAATGA
ACCAGAGGCTGAAAGGGATGGCGTTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGAT
CTCATTTGTTCATTTTGCAATGAAAACCATATTTATGATAATTGTCCACATAACCCTGCGGACTTTATCATTCTAGATTGCGAGGCAGACTTAGACGTTCCCATTATTTT
GGGAAGACCATTTCTAGCCACTGGGGATACAGTTTTTAATGTGAGGAAGGGAGAAATTACAATGAGGGTAAATAATGAAGAAGTTAAATTTAACGTTCTAGATGCCATGA
AATTACCAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTCGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAG
ATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGCACCACCCACGTTGGAGCAGAAGCC
ATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTACCAGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCAGTTT
TACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGTACTGACACTGATCCTTGTGCTAGTACCCGGTCTGTCAGATGTAAAAGTCTTAGTGTA
AGGAGGGAGTGTGCAGATTCCTTAGGGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGATGGAAAAACCTACATGGGAGGCGCCAGGCGCCTGGGAAGCCT
GCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACACGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGT
TTGAGCACGATTTGGATTTGGAGAAGGCCACTGTAAGTCTTAACAAGTCAATACTGAAAAGGTTGCTTATTTGCATTGTTGAGTGGTTAAACCGAGGAGAGGAGCGCCTT
AGACATAAGGGTTCCTCTTAA
Protein sequenceShow/hide protein sequence
MLLHLFSALSFLSLSFHLFVLRRDVYTPQPKLLLLISAVFRILPVLILLCTPQLQVFKAFKFSKSQFTPYLELVRPSVRSENFEVGLSSYSALPFDFGGVMKIRAGTLFR
LKASTKVAFLVNFLASLGSIGILPSRKFLIRSIHDRGWESTSIISGSTMDGLYKISTKTDHARTIWECKLIKRSKKTKKASWEAPGACVPAENSFSSNFALNETRLPMRF
GGSNRCIRVEEVFHYQFEHDLGKLKCTSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHI
QMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMIFQISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKGMALGIKNPLATPIQPVQSDYCTPAPVCQVND
LICSFCNENHIYDNCPHNPADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPVMFDEFYDSLVTEIEEELDK
IAEGPEDVANPIEKIQKEECKSLLPSIVAPPTLEQKPLPSHLKYAYLGDNDTLPVREVVQHIYNLRASLDFAVLPSWPPALAAILGHPSPSTDTDPCASTRSVRCKSLSV
RRECADSLGDHLGVQINQKKQKDGKTYMGGARRLGSLQKNWFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLDLEKATVSLNKSILKRLLICIVEWLNRGEERL
RHKGSS