; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000483 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000483
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold44:1997391..1997769
RNA-Seq ExpressionMS000483
SyntenyMS000483
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576664.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]3.4e-4064.96Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--
        R CS+A++ASV D+IFG+ IK+G+FAS+VCVGC LIDMF K RGDLVSAF+VFEKMP R A+TWTLM+T FM+FG   EA+++FLDMIL+G EPDR    
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--

Query:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL
                  +LSL QQLHSQAI  GLTLD CVGCCL
Subjt:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL

KAG7014714.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-4064.96Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--
        R CS+A++ASV D+IFG+ IK+G+FAS+VCVGC LIDMF K RGDLVSAF+VFEKMP R A+TWTLM+T FM+FG   EA+++FLDMIL+G EPDR    
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--

Query:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL
                  +LSL QQLHSQAI  GLTLD CVGCCL
Subjt:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL

XP_022141097.1 pentatricopeptide repeat-containing protein At3g49170, chloroplastic [Momordica charantia]6.9e-4167.88Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--
        R  S A+Y+SV D+IFGF IKSG+FAS+VCVGCALIDM+ K RGDLVSAFKVFEKMP R A+TWTLM+T +M+FG   EA+NLFLDMIL+G EPDR    
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--

Query:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL
                  ILSL QQLHSQAI  GLTLD CVGCCL
Subjt:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL

XP_022922533.1 pentatricopeptide repeat-containing protein At3g49170, chloroplastic isoform X1 [Cucurbita moschata]3.4e-4064.96Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--
        R CS+A++ASV D+IFG+ IK+G+FAS+VCVGC LIDMF K RGDLVSAF+VFEKMP R A+TWTLM+T FM+FG   EA+++FLDMIL+G EPDR    
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--

Query:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL
                  +LSL QQLHSQAI  GLTLD CVGCCL
Subjt:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL

XP_022922534.1 pentatricopeptide repeat-containing protein At3g49170, chloroplastic isoform X2 [Cucurbita moschata]3.4e-4064.96Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--
        R CS+A++ASV D+IFG+ IK+G+FAS+VCVGC LIDMF K RGDLVSAF+VFEKMP R A+TWTLM+T FM+FG   EA+++FLDMIL+G EPDR    
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--

Query:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL
                  +LSL QQLHSQAI  GLTLD CVGCCL
Subjt:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL

TrEMBL top hitse value%identityAlignment
A0A6J1CIW7 pentatricopeptide repeat-containing protein At3g49170, chloroplastic3.4e-4167.88Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--
        R  S A+Y+SV D+IFGF IKSG+FAS+VCVGCALIDM+ K RGDLVSAFKVFEKMP R A+TWTLM+T +M+FG   EA+NLFLDMIL+G EPDR    
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--

Query:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL
                  ILSL QQLHSQAI  GLTLD CVGCCL
Subjt:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL

A0A6J1E3N3 pentatricopeptide repeat-containing protein At3g49170, chloroplastic isoform X21.7e-4064.96Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--
        R CS+A++ASV D+IFG+ IK+G+FAS+VCVGC LIDMF K RGDLVSAF+VFEKMP R A+TWTLM+T FM+FG   EA+++FLDMIL+G EPDR    
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--

Query:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL
                  +LSL QQLHSQAI  GLTLD CVGCCL
Subjt:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL

A0A6J1E6W5 pentatricopeptide repeat-containing protein At3g49170, chloroplastic isoform X11.7e-4064.96Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--
        R CS+A++ASV D+IFG+ IK+G+FAS+VCVGC LIDMF K RGDLVSAF+VFEKMP R A+TWTLM+T FM+FG   EA+++FLDMIL+G EPDR    
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--

Query:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL
                  +LSL QQLHSQAI  GLTLD CVGCCL
Subjt:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL

A0A6J1J7R2 pentatricopeptide repeat-containing protein At3g49170, chloroplastic isoform X21.4e-3964.23Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--
        R CS+A++ASV D+IFG+ IK+ +FAS+VCVGC LIDMF K RGDLVSAF+VFEKMP R A+TWTLM+T FM+FG   EA+++FLDMIL+G EPDR    
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--

Query:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL
                  +LSL QQLHSQAI  GLTLD CVGCCL
Subjt:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL

A0A6J1JAX9 pentatricopeptide repeat-containing protein At3g49170, chloroplastic isoform X11.4e-3964.23Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--
        R CS+A++ASV D+IFG+ IK+ +FAS+VCVGC LIDMF K RGDLVSAF+VFEKMP R A+TWTLM+T FM+FG   EA+++FLDMIL+G EPDR    
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLE--

Query:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL
                  +LSL QQLHSQAI  GLTLD CVGCCL
Subjt:  ---------TILSLEQQLHSQAICDGLTLDCCVGCCL

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210657.3e-0938.71Show/hide
Query:  VSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILSL
        + +TI    I+SG F S + V  +L+ ++A + GD+ SA+KVF+KMP +  + W  ++  F + G PEEA+ L+ +M   G +PD   TI+SL
Subjt:  VSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILSL

P0C8Q2 Pentatricopeptide repeat-containing protein At4g19191, mitochondrial1.6e-0837.14Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETI
        + C+        + +    IKS F+ S+V VG A +DMF K    +  A KVFE+MP R A TW  M++ F + G+ ++A +LF +M LN   PD + T+
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETI

Query:  LSLEQ
        ++L Q
Subjt:  LSLEQ

Q5G1T1 Pentatricopeptide repeat-containing protein At3g49170, chloroplastic4.0e-2345.26Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDR--LE
        R CS++ +  V     GF +K+G F S+VCVGC+LIDMF K      +A+KVF+KM     +TWTLM+T  M+ G P EA+  FLDM+L+G E D+  L 
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDR--LE

Query:  TI---------LSLEQQLHSQAICDGLTLDCCVGCCL
        ++         LSL +QLHS AI  GL  D  V C L
Subjt:  TI---------LSLEQQLHSQAICDGLTLDCCVGCCL

Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic1.2e-0831.03Show/hide
Query:  CSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILS
        C++    +    I  + IK+   A++V VG AL+DM+AK  G L  + KVF+++P +  ITW +++  +   GN +EA++L   M++ G +P+ +  I  
Subjt:  CSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILS

Query:  LEQQLHSQAICDGLTL
             HS  + +GL +
Subjt:  LEQQLHSQAICDGLTL

Q9FM64 Pentatricopeptide repeat-containing protein At5g55740, chloroplastic2.9e-1033.58Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDR--LE
        + C + K++     + G+ +KSG     V V  +L DM+ K  G L  A KVF+++P R A+ W  ++  +++ G  EEA+ LF DM   G EP R  + 
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDR--LE

Query:  TILSLE---------QQLHSQAICDGLTLDCCVGCCL
        T LS           +Q H+ AI +G+ LD  +G  L
Subjt:  TILSLE---------QQLHSQAICDGLTLDCCVGCCL

Arabidopsis top hitse value%identityAlignment
AT3G49170.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.8e-2445.26Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDR--LE
        R CS++ +  V     GF +K+G F S+VCVGC+LIDMF K      +A+KVF+KM     +TWTLM+T  M+ G P EA+  FLDM+L+G E D+  L 
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDR--LE

Query:  TI---------LSLEQQLHSQAICDGLTLDCCVGCCL
        ++         LSL +QLHS AI  GL  D  V C L
Subjt:  TI---------LSLEQQLHSQAICDGLTLDCCVGCCL

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.8e-1031.03Show/hide
Query:  CSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILS
        C++    +    I  + IK+   A++V VG AL+DM+AK  G L  + KVF+++P +  ITW +++  +   GN +EA++L   M++ G +P+ +  I  
Subjt:  CSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILS

Query:  LEQQLHSQAICDGLTL
             HS  + +GL +
Subjt:  LEQQLHSQAICDGLTL

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.2e-1038.71Show/hide
Query:  VSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILSL
        + +TI    I+SG F S + V  +L+ ++A + GD+ SA+KVF+KMP +  + W  ++  F + G PEEA+ L+ +M   G +PD   TI+SL
Subjt:  VSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILSL

AT4G21065.2 Tetratricopeptide repeat (TPR)-like superfamily protein5.2e-1038.71Show/hide
Query:  VSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILSL
        + +TI    I+SG F S + V  +L+ ++A + GD+ SA+KVF+KMP +  + W  ++  F + G PEEA+ L+ +M   G +PD   TI+SL
Subjt:  VSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILSL

AT5G55740.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-1133.58Show/hide
Query:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDR--LE
        + C + K++     + G+ +KSG     V V  +L DM+ K  G L  A KVF+++P R A+ W  ++  +++ G  EEA+ LF DM   G EP R  + 
Subjt:  RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDR--LE

Query:  TILSLE---------QQLHSQAICDGLTLDCCVGCCL
        T LS           +Q H+ AI +G+ LD  +G  L
Subjt:  TILSLE---------QQLHSQAICDGLTLDCCVGCCL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CGCGTGTGTTCGAGTGCCAAATATGCATCGGTGAGTGACACTATTTTTGGGTTTACCATTAAAAGTGGGTTCTTTGCCTCTAATGTATGTGTTGGGTGTGCTTTAATTGA
CATGTTTGCAAAGGACCGCGGCGACTTGGTTTCAGCTTTTAAGGTGTTTGAGAAAATGCCTGGAAGAATCGCAATTACTTGGACTCTGATGGTTACTTGGTTTATGAAAT
TTGGCAACCCGGAAGAAGCAGTTAATTTGTTTTTGGATATGATATTGAATGGAAAAGAGCCAGACAGATTAGAAACTATCCTATCGCTAGAGCAGCAGTTGCATTCTCAA
GCCATATGCGATGGGTTGACTCTGGATTGCTGTGTTGGTTGTTGTCTA
mRNA sequenceShow/hide mRNA sequence
CGCGTGTGTTCGAGTGCCAAATATGCATCGGTGAGTGACACTATTTTTGGGTTTACCATTAAAAGTGGGTTCTTTGCCTCTAATGTATGTGTTGGGTGTGCTTTAATTGA
CATGTTTGCAAAGGACCGCGGCGACTTGGTTTCAGCTTTTAAGGTGTTTGAGAAAATGCCTGGAAGAATCGCAATTACTTGGACTCTGATGGTTACTTGGTTTATGAAAT
TTGGCAACCCGGAAGAAGCAGTTAATTTGTTTTTGGATATGATATTGAATGGAAAAGAGCCAGACAGATTAGAAACTATCCTATCGCTAGAGCAGCAGTTGCATTCTCAA
GCCATATGCGATGGGTTGACTCTGGATTGCTGTGTTGGTTGTTGTCTA
Protein sequenceShow/hide protein sequence
RVCSSAKYASVSDTIFGFTIKSGFFASNVCVGCALIDMFAKDRGDLVSAFKVFEKMPGRIAITWTLMVTWFMKFGNPEEAVNLFLDMILNGKEPDRLETILSLEQQLHSQ
AICDGLTLDCCVGCCL