; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g14310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g14310
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr2:10597786..10604841
RNA-Seq ExpressionMoc02g14310
SyntenyMoc02g14310
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571397.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.8e-3086.08Show/hide
Query:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        MGLGLDPFITNALVDMYAKCGS+ EAE  F SSVWKDT CWNSMISMYA HGKAEEAL+ FETMM NDI PNYVTFVSV
Subjt:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

XP_022155466.1 pentatricopeptide repeat-containing protein At4g39530 isoform X2 [Momordica charantia]2.1e-3493.67Show/hide
Query:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        MGLGLDPFITNALVDMYAKCGSM EAE MFYSSVWKD ACWNSMISMYAAHGKAE+ALK+FETMMRNDINPNYVTFVSV
Subjt:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

XP_022928040.1 pentatricopeptide repeat-containing protein At4g39530 [Cucurbita moschata]7.0e-3084.81Show/hide
Query:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        MGLGLD FITNALVDMYAKCGS+ EAE  F SSVWKDT CWNSMISMYA HGKA+EAL+MFETMM NDI PNYVTFVSV
Subjt:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

XP_022971725.1 pentatricopeptide repeat-containing protein At4g39530 [Cucurbita maxima]1.6e-2987.34Show/hide
Query:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        MGLGLDPFITNALVDMYAKCGS+ EAE  F SSVWKDT CWNSMISMYA HGKAEEAL MFETMM NDI+PNYVTFVSV
Subjt:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

XP_038900776.1 pentatricopeptide repeat-containing protein At4g39530 [Benincasa hispida]1.4e-3086.08Show/hide
Query:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        +GLGLDPFITNALVDMYAKCGS+ EAE  F SSVWKDTACWNSMISMYA HGKAE+AL+MFE MM NDINPNYVTFVSV
Subjt:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

TrEMBL top hitse value%identityAlignment
A0A0A0LM26 Uncharacterized protein1.3e-2983.54Show/hide
Query:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        MGL  DPFITNALVDMYAKCGS+ EAE +F SSVWKDTACWNSMISMYA HGK EEAL+MFETM+ N+INPNYVTFVSV
Subjt:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

A0A1S4E1S3 pentatricopeptide repeat-containing protein At4g395305.4e-2882.28Show/hide
Query:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        MGL  DPFITNALVDMYAKCGS+ EAE +F SSV KDTACWNSMISMYA HGK EEAL+MFE M+ NDINPNYVTFVSV
Subjt:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

A0A6J1DRQ9 pentatricopeptide repeat-containing protein At4g39530 isoform X21.0e-3493.67Show/hide
Query:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        MGLGLDPFITNALVDMYAKCGSM EAE MFYSSVWKD ACWNSMISMYAAHGKAE+ALK+FETMMRNDINPNYVTFVSV
Subjt:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

A0A6J1EQJ5 pentatricopeptide repeat-containing protein At4g395303.4e-3084.81Show/hide
Query:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        MGLGLD FITNALVDMYAKCGS+ EAE  F SSVWKDT CWNSMISMYA HGKA+EAL+MFETMM NDI PNYVTFVSV
Subjt:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

A0A6J1I6J1 pentatricopeptide repeat-containing protein At4g395307.5e-3087.34Show/hide
Query:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        MGLGLDPFITNALVDMYAKCGS+ EAE  F SSVWKDT CWNSMISMYA HGKAEEAL MFETMM NDI+PNYVTFVSV
Subjt:  MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

SwissProt top hitse value%identityAlignment
Q56X05 Pentatricopeptide repeat-containing protein At1g061435.0e-1553.85Show/hide
Query:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        G  LD +I +ALVDMY+KCGS+  A  +F++   K+  CWNS+I   AAHG A+EALKMF  M    + PN VTFVSV
Subjt:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

Q9C501 Pentatricopeptide repeat-containing protein At1g333503.9e-1541.35Show/hide
Query:  LGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMR---NDINPNYVTFVSVCYQLVVILHNHLLSLEKSKG
        L  D F++N+LVD+Y KCG++ EA ++F  +  K    WNSMI+ +A HG++EEA+ +FE MM+   NDI P+++TF+ +   L    H  L+S  K +G
Subjt:  LGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMR---NDINPNYVTFVSVCYQLVVILHNHLLSLEKSKG

Query:  RKDI
          D+
Subjt:  RKDI

Q9FWA6 Pentatricopeptide repeat-containing protein At3g02330, mitochondrial5.0e-1548.65Show/hide
Query:  DPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        D +I + LVDMY+KCG + ++  MF  S+ +D   WN+MI  YA HGK EEA+++FE M+  +I PN+VTF+S+
Subjt:  DPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

Q9LVF9 Pentatricopeptide repeat-containing protein At3g214701.7e-1550Show/hide
Query:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        G+ L+ F++NAL+DMYAKCG +  A ++F S   +  AC NSMIS  A HGK +EAL+MF TM   D+ P+ +TF++V
Subjt:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

Q9SVA5 Pentatricopeptide repeat-containing protein At4g395304.4e-1956.41Show/hide
Query:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        GL  +P+ITNAL+DMYAKCGS  +A   F S+  +D  CWNS+IS YA HG+ ++AL+M E MM   I PNY+TFV V
Subjt:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

Arabidopsis top hitse value%identityAlignment
AT1G06150.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.6e-1653.85Show/hide
Query:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        G  LD +I +ALVDMY+KCGS+  A  +F++   K+  CWNS+I   AAHG A+EALKMF  M    + PN VTFVSV
Subjt:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

AT1G33350.1 Pentatricopeptide repeat (PPR) superfamily protein2.7e-1641.35Show/hide
Query:  LGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMR---NDINPNYVTFVSVCYQLVVILHNHLLSLEKSKG
        L  D F++N+LVD+Y KCG++ EA ++F  +  K    WNSMI+ +A HG++EEA+ +FE MM+   NDI P+++TF+ +   L    H  L+S  K +G
Subjt:  LGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMR---NDINPNYVTFVSVCYQLVVILHNHLLSLEKSKG

Query:  RKDI
          D+
Subjt:  RKDI

AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein3.6e-1648.65Show/hide
Query:  DPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        D +I + LVDMY+KCG + ++  MF  S+ +D   WN+MI  YA HGK EEA+++FE M+  +I PN+VTF+S+
Subjt:  DPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

AT3G21470.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.2e-1650Show/hide
Query:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        G+ L+ F++NAL+DMYAKCG +  A ++F S   +  AC NSMIS  A HGK +EAL+MF TM   D+ P+ +TF++V
Subjt:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV

AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-2056.41Show/hide
Query:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV
        GL  +P+ITNAL+DMYAKCGS  +A   F S+  +D  CWNS+IS YA HG+ ++AL+M E MM   I PNY+TFV V
Subjt:  GLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTAGGACTAGACCCTTTCATCACTAATGCCCTTGTGGATATGTATGCCAAGTGTGGGAGCATGGGAGAAGCTGAAAATATGTTTTACTCCTCGGTATGGAAAGA
TACGGCATGCTGGAACTCCATGATTTCGATGTATGCAGCACACGGAAAAGCAGAAGAAGCTCTTAAGATGTTTGAAACAATGATGAGAAATGACATAAATCCCAATTATG
TCACTTTTGTGAGTGTGTGCTATCAGCTTGTAGTCATATTACATAATCATCTACTCAGCTTAGAGAAATCAAAGGGTAGGAAGGACATCTGTAGTACTCACCAACCTAAT
CCGAGTCAAGCTGATTCAACGAAAGGTATCGACCGACCTGAGGAATCCGACCTCAGGCTCCCTGCTCGAAGTACTTTCAACTCATCTTCCACGTCTTCTTCCAACCGAGC
CGAGAGCATTCTTATGAAGACTCTACTCAAATCTCAAGACTTATGGGAGTTGGTGGAACAAGGCTATGCAGATCTCGACGACGATCAAGGCAGAGCAATGGCAATAGTCA
GTCAAATGCGGTCCTATGGCGAGACGATTACCGACCGAACCATTGTTGAAAAGCTCATGAGTGGAGGATTAACATATCGCTCGAAAAAAACCGAAGAAAAAGCATTTGAG
GTGAAAGAGACAGCCTCCAAATTCGGAGAAGGTGATCATTCGGTGAATCGAGGTCGCGGAAGAGGAGGATTTCGTGGTCGATTCGTGGTTTCGGAAGAGGCAGAGGAAGA
TTCGAAGGACAAAGGCAGTCCTACGAGCAAACAAGCAACGAGAATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTAGGACTAGACCCTTTCATCACTAATGCCCTTGTGGATATGTATGCCAAGTGTGGGAGCATGGGAGAAGCTGAAAATATGTTTTACTCCTCGGTATGGAAAGA
TACGGCATGCTGGAACTCCATGATTTCGATGTATGCAGCACACGGAAAAGCAGAAGAAGCTCTTAAGATGTTTGAAACAATGATGAGAAATGACATAAATCCCAATTATG
TCACTTTTGTGAGTGTGTGCTATCAGCTTGTAGTCATATTACATAATCATCTACTCAGCTTAGAGAAATCAAAGGGTAGGAAGGACATCTGTAGTACTCACCAACCTAAT
CCGAGTCAAGCTGATTCAACGAAAGGTATCGACCGACCTGAGGAATCCGACCTCAGGCTCCCTGCTCGAAGTACTTTCAACTCATCTTCCACGTCTTCTTCCAACCGAGC
CGAGAGCATTCTTATGAAGACTCTACTCAAATCTCAAGACTTATGGGAGTTGGTGGAACAAGGCTATGCAGATCTCGACGACGATCAAGGCAGAGCAATGGCAATAGTCA
GTCAAATGCGGTCCTATGGCGAGACGATTACCGACCGAACCATTGTTGAAAAGCTCATGAGTGGAGGATTAACATATCGCTCGAAAAAAACCGAAGAAAAAGCATTTGAG
GTGAAAGAGACAGCCTCCAAATTCGGAGAAGGTGATCATTCGGTGAATCGAGGTCGCGGAAGAGGAGGATTTCGTGGTCGATTCGTGGTTTCGGAAGAGGCAGAGGAAGA
TTCGAAGGACAAAGGCAGTCCTACGAGCAAACAAGCAACGAGAATGTAG
Protein sequenceShow/hide protein sequence
MGLGLDPFITNALVDMYAKCGSMGEAENMFYSSVWKDTACWNSMISMYAAHGKAEEALKMFETMMRNDINPNYVTFVSVCYQLVVILHNHLLSLEKSKGRKDICSTHQPN
PSQADSTKGIDRPEESDLRLPARSTFNSSSTSSSNRAESILMKTLLKSQDLWELVEQGYADLDDDQGRAMAIVSQMRSYGETITDRTIVEKLMSGGLTYRSKKTEEKAFE
VKETASKFGEGDHSVNRGRGRGGFRGRFVVSEEAEEDSKDKGSPTSKQATRM