CuGenDBv2

Moc03g19090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g19090
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: DNA-directed DNA polymerase
Genome location: chr3:12691678..12700069
RNA-Seq Expression: Moc03g19090
Synteny: Moc03g19090
Gene Ontology terms: NA
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
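
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the field can be parsed and the genomic span measured as below. `parse_location` is a hypothetical helper name used only for illustration; it is not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a string such as 'chr3:12691678..12700069' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr3:12691678..12700069")

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 8392 bp on chr3.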


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022150097.1 uncharacterized protein LOC111018357 [Momordica charantia] (e-value: 2.6e-56; identity: 78.23%)
Query:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT
        TVTLQFADR IRK +GKI+DVL+K+DK  FP +FIILDCE DL++PIILG PFL TGDTIFN+RKGEITMKVNDEQV FNVLDAMRL DEIEECS IQIT
Subjt:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT

Query:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEKFEFGNLNKD
        N VSMEEFCDLV VGLEQELE  +KE +AD   SPMEKFEFG+L+ D
Subjt:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEKFEFGNLNKD

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia] (e-value: 4.6e-37; identity: 58.7%)
Query:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT
        TVTLQ ADR I KP+GKI+DVLVK+DKFIFPT+FIILDCEAD +VPIILGRPFL TG+T+ +++KGE+TM+V+D++V FN+LDAM+  D++EEC+ I I 
Subjt:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT

Query:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEK
          ++  E  DL+   +E ELE  EKEG    ++   EK
Subjt:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEK

XP_024028757.1 uncharacterized protein LOC112093792 [Morus notabilis] (e-value: 3.9e-36; identity: 37.79%)
Query:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT
        TVTLQ ADR    P+GKI+DVLV++DKFIFP +FI+LD EAD EVPIILGRPFL TG T+ +++KGE+TM+V+D+QV FNV  AMR +DE+EECSA+ + 
Subjt:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT

Query:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEKFE-----------FGNLNKDSH------------QSIRTKTLACSFKYAFLGENETLPVIIS
        + +   EF       L  E + ++ E + D +   + + E           F +L+  +               +  + L    +YA+LG+++TLPVII+
Subjt:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEKFE-----------FGNLNKDSH------------QSIRTKTLACSFKYAFLGENETLPVIIS

Query:  TTLPNEHEILLLQLYKLYGSS-GTLALEVWVRHPVLEFSKSKLEEEDTFVNNEIEKEKKYVP
        + L +  EI LL++ K +  + G    ++    P +   K  L+E     +N +E++++  P
Subjt:  TTLPNEHEILLLQLYKLYGSS-GTLALEVWVRHPVLEFSKSKLEEEDTFVNNEIEKEKKYVP

XP_030485610.1 uncharacterized protein LOC115702304 [Cannabis sativa] (e-value: 1.6e-37; identity: 45.83%)
Query:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT
        TVTLQ ADR +  P+GKI+DVLV++DKFIFP +FIILD EAD EVPIILGRPFL TG  + +++ GE+TM+VND++V FNV +AMR  DEIEECS + + 
Subjt:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT

Query:  NPVSMEEFCDLV-----TVGLEQELEAVEKEGDADAS------SSPMEKFEFGNLN-KDSH-----------QSIRTKTLACSFKYAFLGENETLPVIIS
        + +  E F   V      +   ++LEA+ ++ +   +       SP  K  F +L  K+S+             +  K L    KYA+LGENE LP+IIS
Subjt:  NPVSMEEFCDLV-----TVGLEQELEAVEKEGDADAS------SSPMEKFEFGNLN-KDSH-----------QSIRTKTLACSFKYAFLGENETLPVIIS

Query:  TTLPNEHEILLLQLYK
          L  E E LLL++ K
Subjt:  TTLPNEHEILLLQLYK

XP_030508913.1 uncharacterized protein LOC115723563 [Cannabis sativa] (e-value: 3.9e-36; identity: 48.51%)
Query:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT
        TVTLQ ADR +  P+GKI+DVLV++DKFIFP +FIILD EAD EVPIILGR FL TG T+ +++KGE+TM+VND+QV FNV +AMR  DEIEECS I   
Subjt:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT

Query:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEKFEFGNLN----KDSHQ---SIRTKTLACSFKYAFLGENETLPVIISTTLPNEHEILLLQLYK
                        E ++  VE +     S  P E  E    N    K S Q    +  K L    KY +LG+ ETLPVIIS  L  + E LL+ + K
Subjt:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEKFEFGNLN----KDSHQ---SIRTKTLACSFKYAFLGENETLPVIISTTLPNEHEILLLQLYK

Query:  LY
         Y
Subjt:  LY

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1CPJ3 uncharacterized protein LOC111012947 (e-value: 4.7e-35; identity: 57.25%)
Query:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT
        TVTL  ADR I KP+GKI+DVLVK+DKFIFP +FIILDCEAD +VPIILGRPFL TG+T+ +++KGE+TM+V+D++V FN+LDAM+  D+ EEC  I I 
Subjt:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT

Query:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEK
          ++  E  DL+   +E ELE  EKEG    ++   EK
Subjt:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEK

A0A6J1D9S6 uncharacterized protein LOC111018357 (e-value: 1.3e-56; identity: 78.23%)
Query:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT
        TVTLQFADR IRK +GKI+DVL+K+DK  FP +FIILDCE DL++PIILG PFL TGDTIFN+RKGEITMKVNDEQV FNVLDAMRL DEIEECS IQIT
Subjt:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT

Query:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEKFEFGNLNKD
        N VSMEEFCDLV VGLEQELE  +KE +AD   SPMEKFEFG+L+ D
Subjt:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEKFEFGNLNKD

A0A6J1DBX0 uncharacterized protein LOC111018901 (e-value: 3.4e-33; identity: 63.08%)
Query:  MTVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQI
        +TVTLQ A+R I+KP+GKI+DVLVK+DKFIFP +FI        EVPIILGRPFL TG T+FN+RKGEITMKVNDEQV FNVLDAMRL D++EECS I  
Subjt:  MTVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQI

Query:  TNPVSMEEFCDLVTVGLEQELEAVEKEGDA
             MEE   ++   LE +LE  EKE  A
Subjt:  TNPVSMEEFCDLVTVGLEQELEAVEKEGDA

A0A6J1DY39 uncharacterized protein LOC111025653 (e-value: 2.2e-37; identity: 58.7%)
Query:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT
        TVTLQ ADR I KP+GKI+DVLVK+DKFIFPT+FIILDCEAD +VPIILGRPFL TG+T+ +++KGE+TM+V+D++V FN+LDAM+  D++EEC+ I I 
Subjt:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT

Query:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEK
          ++  E  DL+   +E ELE  EKEG    ++   EK
Subjt:  NPVSMEEFCDLVTVGLEQELEAVEKEGDADASSSPMEK

A0A6J1DZC3 uncharacterized protein LOC111024449 (e-value: 2.7e-35; identity: 59.84%)
Query:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT
        TVTLQ ADR I KP+GKI+DVLVK+DKFIFP +FIIL+CEAD +VPIILGRPFL TG+T+ +++KGE+TM V+D++V FN+LDAM+  D++EEC+ I I 
Subjt:  TVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQIT

Query:  NPVSMEEFCDLVTVGLEQELEAVEKEG
          ++  E  DL+   +E +LE  EKEG
Subjt:  NPVSMEEFCDLVTVGLEQELEAVEKEG

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGACTGTAACCCTCCAATTTGCAGATAGGCCTATTAGAAAACCCAAGGGAAAAATAGATGACGTACTAGTCAAGATTGATAAATTTATCTTCCCTACAAATTTCATTAT
TTTAGATTGTGAAGCAGACTTGGAAGTCCCTATCATACTTGGAAGACCGTTTCTTATAACAGGAGATACAATTTTCAACCTAAGGAAGGGAGAAATCACAATGAAGGTGA
ATGACGAGCAAGTAATATTCAACGTATTAGATGCTATGCGGCTATCAGATGAAATAGAAGAATGCTCGGCTATTCAGATAACTAATCCTGTGTCTATGGAAGAATTTTGT
GATTTGGTGACTGTAGGTTTAGAGCAAGAGTTAGAAGCAGTAGAGAAAGAAGGAGATGCAGACGCTTCCTCGTCACCAATGGAAAAGTTTGAGTTTGGAAACCTGAACAA
GGATAGCCATCAATCAATTAGAACAAAAACTCTTGCTTGCTCATTTAAATATGCATTCTTAGGAGAGAATGAAACTTTACCTGTAATCATCTCTACAACTCTACCTAATG
AGCATGAAATTTTGTTATTGCAGCTTTACAAGTTATATGGAAGTTCTGGAACTCTTGCTCTCGAGGTATGGGTTCGACACCCTGTTCTGGAATTCTCAAAATCGAAGCTT
GAGGAAGAAGATACCTTTGTCAATAATGAGATTGAAAAGGAGAAAAAATATGTGCCAACCCCATCGAATGAACATCATAGTAAGGGAAATGTGGCTGGCATATCAGAATG
TCAATTTCAAACATCGGAGGCTACTGTTCCAGGTGTTTTTAGGCCATCGTCATCTCCTTTGGTTTATTATAAGAGGCGGTCATTATGCACAAGTGAGGTGGACAAGGAGA
ACACTATTGAGACTGACCCTAGCATGGTTGATTTTTTTCATTCCAATGAAACTTCTCCACAGGTTCGATCTCCATTTTTCCAAGGTCCTAATAGTTATGCACAAGAGAAA
GAATATTCTGATGGGCCAATGGTTAATCTATATAATATCTTGAATTATTTGGCCTTTAATTCAAATGATTTTGTGAGTGGAGATCATAAAGTAAAGGGAGTGTATCAGTA
G
mRNA sequence
ATGACTGTAACCCTCCAATTTGCAGATAGGCCTATTAGAAAACCCAAGGGAAAAATAGATGACGTACTAGTCAAGATTGATAAATTTATCTTCCCTACAAATTTCATTAT
TTTAGATTGTGAAGCAGACTTGGAAGTCCCTATCATACTTGGAAGACCGTTTCTTATAACAGGAGATACAATTTTCAACCTAAGGAAGGGAGAAATCACAATGAAGGTGA
ATGACGAGCAAGTAATATTCAACGTATTAGATGCTATGCGGCTATCAGATGAAATAGAAGAATGCTCGGCTATTCAGATAACTAATCCTGTGTCTATGGAAGAATTTTGT
GATTTGGTGACTGTAGGTTTAGAGCAAGAGTTAGAAGCAGTAGAGAAAGAAGGAGATGCAGACGCTTCCTCGTCACCAATGGAAAAGTTTGAGTTTGGAAACCTGAACAA
GGATAGCCATCAATCAATTAGAACAAAAACTCTTGCTTGCTCATTTAAATATGCATTCTTAGGAGAGAATGAAACTTTACCTGTAATCATCTCTACAACTCTACCTAATG
AGCATGAAATTTTGTTATTGCAGCTTTACAAGTTATATGGAAGTTCTGGAACTCTTGCTCTCGAGGTATGGGTTCGACACCCTGTTCTGGAATTCTCAAAATCGAAGCTT
GAGGAAGAAGATACCTTTGTCAATAATGAGATTGAAAAGGAGAAAAAATATGTGCCAACCCCATCGAATGAACATCATAGTAAGGGAAATGTGGCTGGCATATCAGAATG
TCAATTTCAAACATCGGAGGCTACTGTTCCAGGTGTTTTTAGGCCATCGTCATCTCCTTTGGTTTATTATAAGAGGCGGTCATTATGCACAAGTGAGGTGGACAAGGAGA
ACACTATTGAGACTGACCCTAGCATGGTTGATTTTTTTCATTCCAATGAAACTTCTCCACAGGTTCGATCTCCATTTTTCCAAGGTCCTAATAGTTATGCACAAGAGAAA
GAATATTCTGATGGGCCAATGGTTAATCTATATAATATCTTGAATTATTTGGCCTTTAATTCAAATGATTTTGTGAGTGGAGATCATAAAGTAAAGGGAGTGTATCAGTA
G
Protein sequence
MTVTLQFADRPIRKPKGKIDDVLVKIDKFIFPTNFIILDCEADLEVPIILGRPFLITGDTIFNLRKGEITMKVNDEQVIFNVLDAMRLSDEIEECSAIQITNPVSMEEFC
DLVTVGLEQELEAVEKEGDADASSSPMEKFEFGNLNKDSHQSIRTKTLACSFKYAFLGENETLPVIISTTLPNEHEILLLQLYKLYGSSGTLALEVWVRHPVLEFSKSKL
EEEDTFVNNEIEKEKKYVPTPSNEHHSKGNVAGISECQFQTSEATVPGVFRPSSSPLVYYKRRSLCTSEVDKENTIETDPSMVDFFHSNETSPQVRSPFFQGPNSYAQEK
EYSDGPMVNLYNILNYLAFNSNDFVSGDHKVKGVYQ
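
The protein sequence above is expected to be the conceptual translation of the CDS sequence. A minimal sketch for checking this, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is an illustrative helper written for this example, and only the first 36 bases of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 bases of the CDS sequence listed in this record.
cds_start = "ATGACTGTAACCCTCCAATTTGCAGATAGGCCTATT"
print(translate(cds_start))  # prints MTVTLQFADRPI
```

The output matches the first twelve residues of the protein sequence. Pasting the full CDS, line breaks included, into `translate` would allow the same comparison over the whole record.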