; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g18580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g18580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed DNA polymerase
Genome locationchr3:12280912..12282939
RNA-Seq ExpressionMoc03g18580
SyntenyMoc03g18580
Gene Ontology termsGO:0016740 - transferase activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017227899.1 PREDICTED: uncharacterized protein LOC108203467 [Daucus carota subsp. sativus]7.1e-6152.49Show/hide
Query:  PPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR-------------------------------------------SH
        PP+PQR QK+ QD Q  K FLDVLKQLHINIP VEALEQM NY K +KDILTKKR                                             
Subjt:  PPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR-------------------------------------------SH

Query:  ALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQLTMRV
        ALCDLG SINLMP+SV+++LG GEVRPT VTLQLAD  + H EGKIEDVLV+VDKFIFPADFI++DYE + ++PIIL RPFL+TGR LIDV NG+LTMRV
Subjt:  ALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQLTMRV

Query:  NDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELTIAYEHVGDTVD
         D++VTF VF  +K+  DVE+CS + + D+L+S ++   +  D LE+ +    +   D V+
Subjt:  NDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELTIAYEHVGDTVD

XP_022147186.1 uncharacterized protein LOC111016198 [Momordica charantia]1.9e-5869.66Show/hide
Query:  KKRSHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQ
        K   H LCDLG  INL+PL VYQ+LG GE RPT VTLQLAD  ITH EGK EDVLVQVDKFIFPADFII+DYEVN +IPIIL RPFLSTGR LIDVHNG+
Subjt:  KKRSHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQ

Query:  LTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELTIAYEHVGDTVDTGASQPTGGSND
        LTMRVNDQQVTF +FN +KF AD+EECSLLRLADDL S+E+Q E LLD+LEEE+T  +E           +P   S+D
Subjt:  LTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELTIAYEHVGDTVDTGASQPTGGSND

XP_022156989.1 uncharacterized protein LOC111023818 [Momordica charantia]5.0e-7561.98Show/hide
Query:  RKSVETPGKYRRPPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR--------------------------------
        +K V+ P +++ PPPYPQRLQKKNQD Q    FL+VLKQLHINIP +EALEQM NY K LKDIL KKR                                
Subjt:  RKSVETPGKYRRPPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR--------------------------------

Query:  -----------SHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRM
                    HALCDLG SINLMPLSVYQ+LG GE RP  VTLQLAD  IT+LEGKIEDVLVQVDKFIFPADFII+DYE + +IPIIL RPFLSTGR 
Subjt:  -----------SHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRM

Query:  LIDVHNGQLTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELT
        LIDVHNG+LT+RVNDQQVT  +FN +K+  DVEECS LR+ADDL+S EIQ E LL++LE+ELT
Subjt:  LIDVHNGQLTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELT

XP_024028757.1 uncharacterized protein LOC112093792 [Morus notabilis]8.6e-5950.74Show/hide
Query:  RRPPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR------------------------------------------
        R PPP+PQR Q + QD Q  + FLDVLKQLHINIP VEALEQM +Y K +KDILTKKR                                          
Subjt:  RRPPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR------------------------------------------

Query:  -SHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQLT
           ALCDLG SINLMP+S++++LG GEV PT VTLQLAD    H EGKIEDVLV+VDKFIFPADFI++DYE + ++PIIL RPFL+TG+ LIDV  G+LT
Subjt:  -SHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQLT

Query:  MRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELTIAYEHVGDTVDTGASQPTG
        MRV+DQQVTF VF  ++F  +VEECS + + D L++ E +K      + EE  I  E   D  D   S+  G
Subjt:  MRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELTIAYEHVGDTVDTGASQPTG

XP_030502183.1 uncharacterized protein LOC115717351 [Cannabis sativa]2.5e-5854.1Show/hide
Query:  PPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR-------------------------------------------S
        P P+PQR    N++D   K FLDVLKQLHINIP VEALEQMSNY K LKDILTKKR                                            
Subjt:  PPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR-------------------------------------------S

Query:  HALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQLTMR
         ALCDLG SINLMP+S++++LG GE RPT VTLQLAD  + H EGKIEDVLVQVDKFIFPADFII+DYE +  +PIIL R FL+TGR LIDV N +LTMR
Subjt:  HALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQLTMR

Query:  VNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLD
        VNDQ+VTF VFN ++F  ++EECS + + D +++++  KE   D
Subjt:  VNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLD

TrEMBL top hitse value%identityAlignment
A0A2G9HWF8 Reverse transcriptase5.3e-5450.79Show/hide
Query:  KSVETPGKYRRP----PPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR-----------------------------
        K VE P +  +P    PP+PQRLQK+    Q LK FL+V K+LHINIP  EALEQM +Y K +KDIL+KKR                             
Subjt:  KSVETPGKYRRP----PPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR-----------------------------

Query:  -SHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQLT
           ALCDLG SINLMP S+Y+ LG  E +PT +TLQLAD  +T+ +G IED+LV+VDKFIFPADF+++D EV++++PIIL RPFL+TGR LIDV  G+LT
Subjt:  -SHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQLT

Query:  MRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEEL
        MRV DQQ+TF VF  +KF  + +EC  + L D+L  K+   E  LD LE  L
Subjt:  MRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEEL

A0A6J1D1L0 uncharacterized protein LOC1110161989.3e-5969.66Show/hide
Query:  KKRSHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQ
        K   H LCDLG  INL+PL VYQ+LG GE RPT VTLQLAD  ITH EGK EDVLVQVDKFIFPADFII+DYEVN +IPIIL RPFLSTGR LIDVHNG+
Subjt:  KKRSHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQ

Query:  LTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELTIAYEHVGDTVDTGASQPTGGSND
        LTMRVNDQQVTF +FN +KF AD+EECSLLRLADDL S+E+Q E LLD+LEEE+T  +E           +P   S+D
Subjt:  LTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELTIAYEHVGDTVDTGASQPTGGSND

A0A6J1D3P6 uncharacterized protein LOC1110170142.4e-5450Show/hide
Query:  RKKKHQVHRKSVETPGKYRRPPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR------------------------
        +K+K   H      P KYR  PPYP+RLQKK Q+ Q  K  LDVLKQLH+NIP VEALEQ+ NY + LK+IL KKR                        
Subjt:  RKKKHQVHRKSVETPGKYRRPPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR------------------------

Query:  -------------------SHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILER
                             ALCDLG SINL+PLS+Y +LG GE RPT VTLQLAD  +TH EGKIEDVLVQVDKFIFP DFII+DY+ + ++ II+ R
Subjt:  -------------------SHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILER

Query:  PFLSTGRMLIDVHNGQLTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQ
        PFL+T R L++VH G+LTMRV DQ+V F V+  + F A  EEC ++++ D+ L KE++
Subjt:  PFLSTGRMLIDVHNGQLTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQ

A0A6J1DV77 uncharacterized protein LOC1110238182.4e-7561.98Show/hide
Query:  RKSVETPGKYRRPPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR--------------------------------
        +K V+ P +++ PPPYPQRLQKKNQD Q    FL+VLKQLHINIP +EALEQM NY K LKDIL KKR                                
Subjt:  RKSVETPGKYRRPPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKR--------------------------------

Query:  -----------SHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRM
                    HALCDLG SINLMPLSVYQ+LG GE RP  VTLQLAD  IT+LEGKIEDVLVQVDKFIFPADFII+DYE + +IPIIL RPFLSTGR 
Subjt:  -----------SHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRM

Query:  LIDVHNGQLTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELT
        LIDVHNG+LT+RVNDQQVT  +FN +K+  DVEECS LR+ADDL+S EIQ E LL++LE+ELT
Subjt:  LIDVHNGQLTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELT

A0A6J1E1F3 uncharacterized protein LOC1110250654.8e-5563.49Show/hide
Query:  PRKKKHQVHRKSVETPGKYRRPPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKRS--------HALCDLGVSINLMP
        P K+K   H  +   P +Y   PPYP+RLQKK ++ Q  K FLDVLKQLH+NIP VEALEQM NY + LK+IL KKR+         ALCDLG +INLMP
Subjt:  PRKKKHQVHRKSVETPGKYRRPPPYPQRLQKKNQDDQVLKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKRS--------HALCDLGVSINLMP

Query:  LSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQLTMRVND
        LS+Y +LG GE RPT+VTLQLAD  ITH EGKIEDVLV VDKF FPADFII+DY+ + ++PIIL RPFL+TGR L+DVH G+LTMRV D
Subjt:  LSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEVNNKIPIILERPFLSTGRMLIDVHNGQLTMRVND

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGATTTGGGAGGTCGGCCGGCAGCTCTCAAAGCGATTCTTTGGGATACCTGCAAGGAGGAAAGTAGAGTCCGAGCTCAACAGATCATCGTGCTCGGATTGGGCTCT
CCCAAGCGACCTCCGGTCCCAGCTGAGGACAGTGTCAATCCGAATCACCACGTGGCATACCCTTGTGGATGCGTGGACCCGATACGGGTCTCGAATCCCTTACCGCCCAA
GAAAAAAGAAGCACCAAGTGCACCGGAAAAGTGTGGAAACACCAGGGAAATATAGACGACCTCCTCCCTATCCTCAAAGGCTTCAAAAGAAAAACCAAGATGATCAAGTA
CTTAAGCATTTTTTGGATGTGTTGAAGCAACTCCACATCAATATACCCTCAGTTGAGGCTCTTGAGCAGATGTCTAACTATGCAAAAATTTTGAAGGATATCTTGACTAA
GAAAAGGAGCCATGCTCTATGCGATTTGGGTGTAAGCATAAACCTTATGCCATTATCAGTATATCAGAGGTTGGGTACTGGTGAAGTAAGACCCACCATGGTGACACTAC
AACTAGCTGACTGGTTAATCACTCACCTAGAGGGCAAGATCGAAGATGTGTTAGTACAGGTGGACAAGTTCATTTTCCCAGCTGATTTTATCATCATCGATTATGAAGTA
AACAACAAGATCCCAATAATTTTGGAGAGGCCTTTTCTATCCACCGGTAGAATGCTAATAGATGTGCATAATGGGCAGTTAACTATGAGAGTAAACGATCAACAGGTTAC
CTTTTTTGTTTTTAATTTTGTTAAATTCCTTGCTGATGTAGAAGAATGTTCTCTCTTAAGACTTGCAGATGACTTGCTGAGTAAGGAAATACAAAAGGAGACCTTGTTGG
ATCGCTTGGAGGAAGAACTTACCATAGCCTATGAACATGTAGGGGACACTGTCGATACTGGAGCATCTCAGCCCACGGGCGGCTCCAACGATGAAGAAGTAGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGACGATTTGGGAGGTCGGCCGGCAGCTCTCAAAGCGATTCTTTGGGATACCTGCAAGGAGGAAAGTAGAGTCCGAGCTCAACAGATCATCGTGCTCGGATTGGGCTCT
CCCAAGCGACCTCCGGTCCCAGCTGAGGACAGTGTCAATCCGAATCACCACGTGGCATACCCTTGTGGATGCGTGGACCCGATACGGGTCTCGAATCCCTTACCGCCCAA
GAAAAAAGAAGCACCAAGTGCACCGGAAAAGTGTGGAAACACCAGGGAAATATAGACGACCTCCTCCCTATCCTCAAAGGCTTCAAAAGAAAAACCAAGATGATCAAGTA
CTTAAGCATTTTTTGGATGTGTTGAAGCAACTCCACATCAATATACCCTCAGTTGAGGCTCTTGAGCAGATGTCTAACTATGCAAAAATTTTGAAGGATATCTTGACTAA
GAAAAGGAGCCATGCTCTATGCGATTTGGGTGTAAGCATAAACCTTATGCCATTATCAGTATATCAGAGGTTGGGTACTGGTGAAGTAAGACCCACCATGGTGACACTAC
AACTAGCTGACTGGTTAATCACTCACCTAGAGGGCAAGATCGAAGATGTGTTAGTACAGGTGGACAAGTTCATTTTCCCAGCTGATTTTATCATCATCGATTATGAAGTA
AACAACAAGATCCCAATAATTTTGGAGAGGCCTTTTCTATCCACCGGTAGAATGCTAATAGATGTGCATAATGGGCAGTTAACTATGAGAGTAAACGATCAACAGGTTAC
CTTTTTTGTTTTTAATTTTGTTAAATTCCTTGCTGATGTAGAAGAATGTTCTCTCTTAAGACTTGCAGATGACTTGCTGAGTAAGGAAATACAAAAGGAGACCTTGTTGG
ATCGCTTGGAGGAAGAACTTACCATAGCCTATGAACATGTAGGGGACACTGTCGATACTGGAGCATCTCAGCCCACGGGCGGCTCCAACGATGAAGAAGTAGAAGATTGA
Protein sequenceShow/hide protein sequence
MTIWEVGRQLSKRFFGIPARRKVESELNRSSCSDWALPSDLRSQLRTVSIRITTWHTLVDAWTRYGSRIPYRPRKKKHQVHRKSVETPGKYRRPPPYPQRLQKKNQDDQV
LKHFLDVLKQLHINIPSVEALEQMSNYAKILKDILTKKRSHALCDLGVSINLMPLSVYQRLGTGEVRPTMVTLQLADWLITHLEGKIEDVLVQVDKFIFPADFIIIDYEV
NNKIPIILERPFLSTGRMLIDVHNGQLTMRVNDQQVTFFVFNFVKFLADVEECSLLRLADDLLSKEIQKETLLDRLEEELTIAYEHVGDTVDTGASQPTGGSNDEEVED