; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g39200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g39200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr4:29157731..29158397
RNA-Seq ExpressionMoc04g39200
SyntenyMoc04g39200
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578360.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]7.7e-4281.65Show/hide
Query:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG
        M+PH+ +LATRVSR +LS       AG+WTPSLEQNLHRLGFR+TLNPSLVSQVIDPHLL+HHSLALGFFNWASQQPGFAHNSESYKSVLK LS SRQ+G
Subjt:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG

Query:  AIHSLLKQM
        AIHSLLKQ+
Subjt:  AIHSLLKQM

XP_022133909.1 pentatricopeptide repeat-containing protein At5g14080 [Momordica charantia]1.5e-5099.02Show/hide
Query:  MEPHVPKLATRVSRAMLSAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLK
        MEPHVPKLATRVSRAMLSAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLK
Subjt:  MEPHVPKLATRVSRAMLSAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLK

Query:  QM
        Q+
Subjt:  QM

XP_022939514.1 pentatricopeptide repeat-containing protein At5g14080 isoform X1 [Cucurbita moschata]7.7e-4281.65Show/hide
Query:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG
        M+PH+ +LATRVSR +LS       AG+WTPSLEQNLHRLGFR+TLNPSLVSQVIDPHLL+HHSLALGFFNWASQQPGFAHNSESYKSVLK LS SRQ+G
Subjt:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG

Query:  AIHSLLKQM
        AIHSLLKQ+
Subjt:  AIHSLLKQM

XP_022939515.1 pentatricopeptide repeat-containing protein At5g14080 isoform X2 [Cucurbita moschata]2.6e-4278.95Show/hide
Query:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG
        M+PH+ +LATRVSR +LS       AG+WTPSLEQNLHRLGFR+TLNPSLVSQVIDPHLL+HHSLALGFFNWASQQPGFAHNSESYKSVLK LS SRQ+G
Subjt:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG

Query:  AIHSLLKQMMPGLT
        AIHSLLKQ +  +T
Subjt:  AIHSLLKQMMPGLT

XP_038884953.1 pentatricopeptide repeat-containing protein At5g14080 [Benincasa hispida]8.2e-4483.49Show/hide
Query:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG
        M+PH+P+LATRVSRA+LS       AG+WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNS+SYKS+LK LS SRQ+G
Subjt:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG

Query:  AIHSLLKQM
        AIHSLLKQ+
Subjt:  AIHSLLKQM

TrEMBL top hitse value%identityAlignment
A0A6J1BXA4 pentatricopeptide repeat-containing protein At5g140807.5e-5199.02Show/hide
Query:  MEPHVPKLATRVSRAMLSAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLK
        MEPHVPKLATRVSRAMLSAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLK
Subjt:  MEPHVPKLATRVSRAMLSAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLK

Query:  QM
        Q+
Subjt:  QM

A0A6J1FHE2 pentatricopeptide repeat-containing protein At5g14080 isoform X13.7e-4281.65Show/hide
Query:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG
        M+PH+ +LATRVSR +LS       AG+WTPSLEQNLHRLGFR+TLNPSLVSQVIDPHLL+HHSLALGFFNWASQQPGFAHNSESYKSVLK LS SRQ+G
Subjt:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG

Query:  AIHSLLKQM
        AIHSLLKQ+
Subjt:  AIHSLLKQM

A0A6J1FMX2 pentatricopeptide repeat-containing protein At5g14080 isoform X21.3e-4278.95Show/hide
Query:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG
        M+PH+ +LATRVSR +LS       AG+WTPSLEQNLHRLGFR+TLNPSLVSQVIDPHLL+HHSLALGFFNWASQQPGFAHNSESYKSVLK LS SRQ+G
Subjt:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG

Query:  AIHSLLKQMMPGLT
        AIHSLLKQ +  +T
Subjt:  AIHSLLKQMMPGLT

A0A6J1JR00 pentatricopeptide repeat-containing protein At5g14080 isoform X26.3e-4278.07Show/hide
Query:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG
        M+PH+ +LATRVSR +LS       AG+WTPSLEQNLHRLGFR+TLNPSLVSQVIDPHLL+HHSLALGFFNWASQQPGFAHNSESYKSVLK LS SRQ+G
Subjt:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG

Query:  AIHSLLKQMMPGLT
        AIH LLKQ +  +T
Subjt:  AIHSLLKQMMPGLT

A0A6J1JUL2 pentatricopeptide repeat-containing protein At5g14080 isoform X11.4e-4180.73Show/hide
Query:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG
        M+PH+ +LATRVSR +LS       AG+WTPSLEQNLHRLGFR+TLNPSLVSQVIDPHLL+HHSLALGFFNWASQQPGFAHNSESYKSVLK LS SRQ+G
Subjt:  MEPHVPKLATRVSRAMLS-------AGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYG

Query:  AIHSLLKQM
        AIH LLKQ+
Subjt:  AIHSLLKQM

SwissProt top hitse value%identityAlignment
P0C8A0 Pentatricopeptide repeat-containing protein At3g497303.5e-0532.61Show/hide
Query:  RVSRAMLSAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQM
        ++ R + +  +  P LE  L+  G    L P L+ +V+        +L   FF WA++QPG+ H+ E  KS++  LS  RQ+GA+  L+++M
Subjt:  RVSRAMLSAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQM

Q8GYP6 Pentatricopeptide repeat-containing protein At1g189001.7e-0735.37Show/hide
Query:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM
        W P+ E+ L  LG R  ++    +QV+    +  +  ALGFF W  +QPGF H+  +Y +++  L  ++Q+GAI+ LL +M+
Subjt:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM

Q9FH87 Putative pentatricopeptide repeat-containing protein At5g658202.4e-0637.97Show/hide
Query:  PSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQM
        P LE  L+  G    L P L+ +V++       +L   FF WA++QP + H+ E YKS++K LS  RQ+GA+  L+++M
Subjt:  PSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQM

Q9FMU2 Pentatricopeptide repeat-containing protein At5g140807.2e-2751.75Show/hide
Query:  KLATRVSRAML-------SAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLL
        +LA R+ R +L       +A  W+P +EQ+LH LGFR +++PSLV++VIDP LL HHSLALGFFNWA+QQPG++H+S SY S+ K LS SRQ+ A+ +L 
Subjt:  KLATRVSRAML-------SAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLL

Query:  KQMMPGLTIERSTV
        KQ+     +  S+V
Subjt:  KQMMPGLTIERSTV

Q9SSF9 Pentatricopeptide repeat-containing protein At1g747502.9e-0734.15Show/hide
Query:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM
        W  + E+ LH  GFR  ++    +QV+    + +++ ALGFF W  +QPGF H+  +Y +++  L  ++Q+G I+ LL +M+
Subjt:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-0835.37Show/hide
Query:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM
        W P+ E+ L  LG R  ++    +QV+    +  +  ALGFF W  +QPGF H+  +Y +++  L  ++Q+GAI+ LL +M+
Subjt:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein1.2e-0835.37Show/hide
Query:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM
        W P+ E+ L  LG R  ++    +QV+    +  +  ALGFF W  +QPGF H+  +Y +++  L  ++Q+GAI+ LL +M+
Subjt:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM

AT1G18900.3 Pentatricopeptide repeat (PPR) superfamily protein1.2e-0835.37Show/hide
Query:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM
        W P+ E+ L  LG R  ++    +QV+    +  +  ALGFF W  +QPGF H+  +Y +++  L  ++Q+GAI+ LL +M+
Subjt:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM

AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-0834.15Show/hide
Query:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM
        W  + E+ LH  GFR  ++    +QV+    + +++ ALGFF W  +QPGF H+  +Y +++  L  ++Q+G I+ LL +M+
Subjt:  WTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMM

AT5G14080.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.1e-2851.75Show/hide
Query:  KLATRVSRAML-------SAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLL
        +LA R+ R +L       +A  W+P +EQ+LH LGFR +++PSLV++VIDP LL HHSLALGFFNWA+QQPG++H+S SY S+ K LS SRQ+ A+ +L 
Subjt:  KLATRVSRAML-------SAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLL

Query:  KQMMPGLTIERSTV
        KQ+     +  S+V
Subjt:  KQMMPGLTIERSTV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCCCATGTACCTAAATTGGCTACTCGAGTCAGCAGAGCTATGCTTTCCGCTGGAGCATGGACGCCTTCACTGGAGCAGAATTTGCATCGACTCGGTTTCCGCCA
AACGCTAAATCCATCTCTCGTATCTCAAGTAATCGACCCGCACCTTCTCACTCATCACTCCCTCGCTCTCGGCTTCTTCAATTGGGCTTCTCAGCAACCCGGTTTCGCCC
ACAATTCGGAATCCTACAAGTCGGTTCTTAAGTGTCTCTCTTTCTCGCGCCAATATGGGGCCATCCATAGTCTCTTAAAACAGATGATGCCAGGACTAACAATCGAGAGA
TCAACGGTTCTATTATTGCCACATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACCCCATGTACCTAAATTGGCTACTCGAGTCAGCAGAGCTATGCTTTCCGCTGGAGCATGGACGCCTTCACTGGAGCAGAATTTGCATCGACTCGGTTTCCGCCA
AACGCTAAATCCATCTCTCGTATCTCAAGTAATCGACCCGCACCTTCTCACTCATCACTCCCTCGCTCTCGGCTTCTTCAATTGGGCTTCTCAGCAACCCGGTTTCGCCC
ACAATTCGGAATCCTACAAGTCGGTTCTTAAGTGTCTCTCTTTCTCGCGCCAATATGGGGCCATCCATAGTCTCTTAAAACAGATGATGCCAGGACTAACAATCGAGAGA
TCAACGGTTCTATTATTGCCACATTGA
Protein sequenceShow/hide protein sequence
MEPHVPKLATRVSRAMLSAGAWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSESYKSVLKCLSFSRQYGAIHSLLKQMMPGLTIER
STVLLLPH