; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g03980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g03980
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed DNA polymerase
Genome locationchr1:2584832..2585830
RNA-Seq ExpressionMoc01g03980
SyntenyMoc01g03980
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156989.1 uncharacterized protein LOC111023818 [Momordica charantia]9.1e-7455.15Show/hide
Query:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI
        MPNYVKFL DILAKKRRL EFE VALTK+ SAILTGKL QKMGDP SFTIP+ IGGKNVG+ALCDLGASINL+ LS+YQK  IG+ARP T+TLQLADRSI
Subjt:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI

Query:  TRLEGKIEDVLLQIS----------------------WKRSCASSSR--------------REKEAKLLKLNEGGTPADI-----------VFSRTFEPL
        T LEGKIEDVL+Q+                         R   S+ R               +++  L   N    P D+           + S   +  
Subjt:  TRLEGKIEDVLLQIS----------------------WKRSCASSSR--------------REKEAKLLKLNEGGTPADI-----------VFSRTFEPL

Query:  EL------------KDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISWTIADIKGINTSYCVHKNNL
        EL            KDR +APLQPSV KAP LE+K LPSHLKYAYLGE +TLP+ IA DLAEEKE RL+E+L+NH++ I WT+ADIKGI+ SYC+HK NL
Subjt:  EL------------KDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISWTIADIKGINTSYCVHKNNL

Query:  E
        E
Subjt:  E

XP_023521781.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785639, partial [Cucurbita pepo subsp. pepo]5.9e-5741.56Show/hide
Query:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI
        MPNYVKFL D+L  +R+ EEF+ V L ++CSAIL  K+  K  DP SFTIP+SIGGK +G ALCDLG+SINL+ LS+Y+K  IG+ARPTT+TLQLADRS 
Subjt:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI

Query:  TRLEGKIEDVLLQI---------------------------------------------------------------SWKRSCASSSRREKEAKLLKLNE
        T  EGKIED+L+Q+                                                               +  + C++     ++    + ++
Subjt:  TRLEGKIEDVLLQI---------------------------------------------------------------SWKRSCASSSRREKEAKLLKLNE

Query:  G--GTPADI-------------VFSRTFEPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISW
        G  G   D               F+RTFE LE + R+ +P++PS+E+AP L++K LP +LKYAYLG+ KTLPIII+  L+  +E  LLE LK H+  I W
Subjt:  G--GTPADI-------------VFSRTFEPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISW

Query:  TIADIKGINTSYCVHKNNLE
        T+ADIKGI+ S C+HK  LE
Subjt:  TIADIKGINTSYCVHKNNLE

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]4.5e-5741.88Show/hide
Query:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI
        MPNYVKFL D+L  +R+ EEF+ V L ++CSAIL  K+  K  DP SFTIP+SIGGK +G ALCDLG+SINL+ LS+Y+K  IG+ARPTT+TLQLADRS 
Subjt:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI

Query:  TRLEGKIEDVLLQI---------------------------------------------------------------SWKRSCASSSRREKEAKLLKLNE
        T  EGKIED+L+Q+                                                               +    C++     ++    + ++
Subjt:  TRLEGKIEDVLLQI---------------------------------------------------------------SWKRSCASSSRREKEAKLLKLNE

Query:  G--GTPADI-------------VFSRTFEPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISW
        G  G   D               FSRTFE LE + R+ +P++PS+E+AP L++K LP +LKYAYLG+ KTLPIII+  L+  +E  LLE LK H+  I W
Subjt:  G--GTPADI-------------VFSRTFEPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISW

Query:  TIADIKGINTSYCVHKNNLE
        T+ADIKGI+ S C+HK  LE
Subjt:  TIADIKGINTSYCVHKNNLE

XP_030485610.1 uncharacterized protein LOC115702304 [Cannabis sativa]2.0e-5743.28Show/hide
Query:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI
        MP YVKFL DIL KKRRL EFETVALT+ CSA+L  K+  K+ DP SFTIP+SIGG+NVG ALCDLGASINL+ +S ++K  IG+ARPTT+TLQLADRS+
Subjt:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI

Query:  TRLEGKIEDVLLQIS-------------------------------------------------------------------------------------
           EGKIEDVL+Q+                                                                                      
Subjt:  TRLEGKIEDVLLQIS-------------------------------------------------------------------------------------

Query:  -WKRSCASSSRREKEAKLLKLNEGGTPADI-------VFSRTFEPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKES
         WK     SS  + EA  L  +E    A +        F R+FE LELK+    P +PS+++ P LE+K LPSHLKYAYLGE + LPIII+  L  E E 
Subjt:  -WKRSCASSSRREKEAKLLKLNEGGTPADI-------VFSRTFEPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKES

Query:  RLLELLKNHRRGISWTIADIKGINTSYCVHKNNLE
         LLE+LK H+R I WT+ADIKGI+ + C HK  LE
Subjt:  RLLELLKNHRRGISWTIADIKGINTSYCVHKNNLE

XP_030502183.1 uncharacterized protein LOC115717351 [Cannabis sativa]6.6e-5641.44Show/hide
Query:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI
        M NYVKFL DIL KKRRL EFETVALT+ CSA+L  K+  K+ DP SFTIP SIGG++VG ALCDLGASINL+ +S+++K  IG+ARPTT+TLQLADRS+
Subjt:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI

Query:  TRLEGKIEDVLLQIS----------------------WKRSCASSSRR----EKEAKLLKLNEGGTPADIV-----------------------------
           EGKIEDVL+Q+                         RS  ++ R     + E   +++N+     ++                              
Subjt:  TRLEGKIEDVLLQIS----------------------WKRSCASSSRR----EKEAKLLKLNEGGTPADIV-----------------------------

Query:  ------------------------------------FSRTFEPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRL
                                            F + FE LELK+    P +PS ++ P LE+K LPSHLKYAYLGE  TLP+IIA +L  E E  L
Subjt:  ------------------------------------FSRTFEPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRL

Query:  LELLKNHRRGISWTIADIKGINTSYCVHKNNLE
        LE+LK H++ I WT+ADI+GI+ + C HK  LE
Subjt:  LELLKNHRRGISWTIADIKGINTSYCVHKNNLE

TrEMBL top hitse value%identityAlignment
A0A2G9GK35 Reverse transcriptase5.4e-4838.79Show/hide
Query:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI
        MP+YVKF+ DIL+KKRRL ++ETVALT++CSAI+  KL  K+ DP SFTIP +IG    G ALCDLGASINL+  S+Y+   +G+A+PT+ITLQLADRS+
Subjt:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI

Query:  TRLEGKIEDVLLQI------------------------------------------------------------------------------SWKRSCAS
        T  +G IED+L+++                                                                              +   S A 
Subjt:  TRLEGKIEDVLLQI------------------------------------------------------------------------------SWKRSCAS

Query:  SSRREKEAKLLKLNEGGTPAD------IVFSRTFEPLELKDRER-AP---LQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLEL
         S    E  LL L +     D      +  S+ F+   ++  ER AP   L+PS+E+ P LE+K LPSHL YAYLGE+ TLP+II+  L++ +  +LL +
Subjt:  SSRREKEAKLLKLNEGGTPAD------IVFSRTFEPLELKDRER-AP---LQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLEL

Query:  LKNHRRGISWTIADIKGINTSYCVHKNNLE
        L+NH+  I WTI DIKGI+ S+C+HK  LE
Subjt:  LKNHRRGISWTIADIKGINTSYCVHKNNLE

A0A6J1D1L0 uncharacterized protein LOC1110161981.3e-5245.89Show/hide
Query:  MGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSITRLEGKIEDVLLQIS----------------------WKRS
        MGD  SFTIPMSIGGKNVG+ LCDLGA INLI L +YQK  IG+ARPTT+TLQLADRSIT  EGK EDVL+Q+                         R 
Subjt:  MGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSITRLEGKIEDVLLQIS----------------------WKRS

Query:  CASSSR---------------------------------------------------------------------REKEAKLLKLNEGGTPADIVFSRTF
          S+ R                                                                     +EKEAKL++       +D V+ + F
Subjt:  CASSSR---------------------------------------------------------------------REKEAKLLKLNEGGTPADIVFSRTF

Query:  EPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISWTIADIKGINTSYCVHKNNLE
        E  ELKDRE+A LQ SVEKA  LE+K LP+HLKYAYLG+A+TL I IA DLAE+KE RL+E+LKNH+R I WTIADIK IN SYC+HK NLE
Subjt:  EPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISWTIADIKGINTSYCVHKNNLE

A0A6J1DAJ9 uncharacterized protein LOC1110188984.3e-5345.62Show/hide
Query:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI
        M  YVKFL D+L KK  L E ETV LTK+CS ILT K+ +K+ DP SFTIP+SIGG  +G ALCDLGASINL+ LS+Y++  +G  RPTT+TLQL+DRSI
Subjt:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI

Query:  TRLEGKIEDVLLQI-------------------------------------------SWKRSCASSSRREKEAKLLKLNEGGTPADIV--FSRTFEPLEL
           EGKI+ V++Q+                                           S +RSC    +        + N G    D+   +SR  E L+L
Subjt:  TRLEGKIEDVLLQI-------------------------------------------SWKRSCASSSRREKEAKLLKLNEGGTPADIV--FSRTFEPLEL

Query:  KDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISWTIADIKG
        +D  R  L+PS+E+ P LE+K L  HLKYAYLG + TLPIII  DL  EKE+ LL +L+ HR+ I WTIADI+G
Subjt:  KDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISWTIADIKG

A0A6J1DV77 uncharacterized protein LOC1110238184.4e-7455.15Show/hide
Query:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI
        MPNYVKFL DILAKKRRL EFE VALTK+ SAILTGKL QKMGDP SFTIP+ IGGKNVG+ALCDLGASINL+ LS+YQK  IG+ARP T+TLQLADRSI
Subjt:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI

Query:  TRLEGKIEDVLLQIS----------------------WKRSCASSSR--------------REKEAKLLKLNEGGTPADI-----------VFSRTFEPL
        T LEGKIEDVL+Q+                         R   S+ R               +++  L   N    P D+           + S   +  
Subjt:  TRLEGKIEDVLLQIS----------------------WKRSCASSSR--------------REKEAKLLKLNEGGTPADI-----------VFSRTFEPL

Query:  EL------------KDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISWTIADIKGINTSYCVHKNNL
        EL            KDR +APLQPSV KAP LE+K LPSHLKYAYLGE +TLP+ IA DLAEEKE RL+E+L+NH++ I WT+ADIKGI+ SYC+HK NL
Subjt:  EL------------KDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGISWTIADIKGINTSYCVHKNNL

Query:  E
        E
Subjt:  E

A0A6J1DYF9 uncharacterized protein LOC1110246745.4e-5672.94Show/hide
Query:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI
        MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI
Subjt:  MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSI

Query:  TRLEGKIEDVLLQISWK-RSCASSSRREKEAKLLKLNEGGTPADIVFSRTFEPLELKDRERAPLQPSVEK
        TRLEGKIEDVLLQ+++   +        +E  LL++      AD + +   +  EL D+    L+   EK
Subjt:  TRLEGKIEDVLLQISWK-RSCASSSRREKEAKLLKLNEGGTPADIVFSRTFEPLELKDRERAPLQPSVEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAATTATGTTAAATTCCTCAATGACATTCTAGCCAAGAAGAGGAGGCTAGAGGAGTTTGAAACAGTAGCTCTAACAAAGAAATGCAGTGCCATCTTAACAGGTAA
ACTTCATCAGAAAATGGGAGATCCATGGAGCTTCACTATCCCTATGTCAATAGGGGGCAAGAATGTAGGAAATGCCCTATGTGACTTAGGCGCGAGCATAAACTTGATCT
CTTTGTCCTTGTATCAGAAGTTTACAATAGGGAAGGCACGCCCCACCACAATAACACTGCAGCTGGCTGATAGGTCCATCACACGTCTAGAGGGAAAGATAGAAGATGTG
TTATTACAGATCAGTTGGAAGAGGAGCTGCGCATCATCTTCGAGAAGGGAGAAAGAAGCAAAATTGCTCAAGCTCAATGAAGGAGGGACACCCGCAGATATAGTGTTCAG
CAGGACATTTGAACCACTGGAATTAAAAGACAGAGAACGAGCACCGCTACAACCCTCAGTGGAAAAGGCACCTAACCTAGAAATGAAAGCTCTGCCGTCACACCTGAAGT
ATGCCTATTTGGGTGAAGCTAAAACACTACCGATCATTATTGCAGTTGATCTGGCTGAAGAAAAAGAATCTCGACTGTTGGAATTGTTGAAGAATCATAGGAGAGGCATC
AGTTGGACTATAGCAGACATCAAGGGGATCAATACAAGCTACTGTGTGCATAAAAACAACTTAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCAAATTATGTTAAATTCCTCAATGACATTCTAGCCAAGAAGAGGAGGCTAGAGGAGTTTGAAACAGTAGCTCTAACAAAGAAATGCAGTGCCATCTTAACAGGTAA
ACTTCATCAGAAAATGGGAGATCCATGGAGCTTCACTATCCCTATGTCAATAGGGGGCAAGAATGTAGGAAATGCCCTATGTGACTTAGGCGCGAGCATAAACTTGATCT
CTTTGTCCTTGTATCAGAAGTTTACAATAGGGAAGGCACGCCCCACCACAATAACACTGCAGCTGGCTGATAGGTCCATCACACGTCTAGAGGGAAAGATAGAAGATGTG
TTATTACAGATCAGTTGGAAGAGGAGCTGCGCATCATCTTCGAGAAGGGAGAAAGAAGCAAAATTGCTCAAGCTCAATGAAGGAGGGACACCCGCAGATATAGTGTTCAG
CAGGACATTTGAACCACTGGAATTAAAAGACAGAGAACGAGCACCGCTACAACCCTCAGTGGAAAAGGCACCTAACCTAGAAATGAAAGCTCTGCCGTCACACCTGAAGT
ATGCCTATTTGGGTGAAGCTAAAACACTACCGATCATTATTGCAGTTGATCTGGCTGAAGAAAAAGAATCTCGACTGTTGGAATTGTTGAAGAATCATAGGAGAGGCATC
AGTTGGACTATAGCAGACATCAAGGGGATCAATACAAGCTACTGTGTGCATAAAAACAACTTAGAATAG
Protein sequenceShow/hide protein sequence
MPNYVKFLNDILAKKRRLEEFETVALTKKCSAILTGKLHQKMGDPWSFTIPMSIGGKNVGNALCDLGASINLISLSLYQKFTIGKARPTTITLQLADRSITRLEGKIEDV
LLQISWKRSCASSSRREKEAKLLKLNEGGTPADIVFSRTFEPLELKDRERAPLQPSVEKAPNLEMKALPSHLKYAYLGEAKTLPIIIAVDLAEEKESRLLELLKNHRRGI
SWTIADIKGINTSYCVHKNNLE