; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS022259 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS022259
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold47:2061628..2062552
RNA-Seq ExpressionMS022259
SyntenyMS022259
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572189.1 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.3e-10483.13Show/hide
Query:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P  LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQVSLYADI++VLASNGLFEQVQIIHSY K ETDLAPEI+GFN LL+ALV+YNLG+LAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSM
        TSFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEEDEAA   SM
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSM

XP_022135810.1 pentatricopeptide repeat-containing protein At1g62350-like [Momordica charantia]8.4e-13199.6Show/hide
Query:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        MNFLAIPPSPTTIFSSP KLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR
        TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata]1.0e-10483.87Show/hide
Query:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P  LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQVSLYADI++VLASNGLFEQVQIIHSY K ETDLAPEI+GFN LL+ALVSYNLGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS
        TSFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEEDEAA   S
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS

XP_023554588.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo]6.1e-10583.47Show/hide
Query:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PSP TI S P  LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQ+SLYADI++VLASNGLFE VQIIHSYLK ETDLAPEI+GFN LL+ALVSYNLGELAMESYYLMKKVGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS
        TSFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEED+ A   S
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS

XP_038888189.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Benincasa hispida]6.9e-10986.29Show/hide
Query:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FL  PPSP TI S   KLLSSG ++SCL LGRAE YR+VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVRNEHWYKPQVSLYADII VLASNGLFE+VQIIHSYLK ETDLAPEIDGFNALL+ALVS+NLGELAMESYYLMK++GCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS
         SFRIVIKGLES  EAVDLR VKQDAQK+YG+ LEFLEEE+EAAIA S
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS

TrEMBL top hitse value%identityAlignment
A0A0A0K6A9 Uncharacterized protein3.6e-10382.4Show/hide
Query:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA P SP TIFS   K  SS   + CL+LGRAE Y RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDM+AVL
Subjt:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR EHWYKPQVSLYADIITVLASNGLFE+VQII SY+K E DLAPEIDGFNALL+ALVS+NLGELAMESYYLMK VGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR
         SFRIVIKGLES  EAVDLR VKQDAQ++YG+ LEFLEEE+E A ATS++
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR

A0A5A7UZI6 Pentatricopeptide repeat-containing protein4.0e-10281.93Show/hide
Query:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FL   PSP TI S P KL SS  K  CL+LGRAE Y RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDM+AVL
Subjt:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFE+VQII SY+K ETDLAPEIDGFNALL+ALV +NLG+LAMESYYLMK+VGCEP+K
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSM
         SFRIVIKGLE   EAVDLR VKQDAQK+YG+ LEFLEE +E A A S+
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSM

A0A6J1C1T5 pentatricopeptide repeat-containing protein At1g62350-like4.1e-13199.6Show/hide
Query:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        MNFLAIPPSPTTIFSSP KLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR
        TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR

A0A6J1GL50 protein THYLAKOID ASSEMBLY 8-like, chloroplastic5.0e-10583.87Show/hide
Query:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P  LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQVSLYADI++VLASNGLFEQVQIIHSY K ETDLAPEI+GFN LL+ALVSYNLGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS
        TSFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEEDEAA   S
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic1.1e-10483.87Show/hide
Query:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P  LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQVSLYADI++VLASNGLFEQVQIIHSYLK ETDLAPEI+GFN LL+ALVSYNLGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS
         SFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEEDEAA   S
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351303.5e-0727.06Show/hide
Query:  LQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIH
        L K +  + EAI   Q +KR        DR   +  +  L  ++        + ++  ++ K++ ++R+ H  KP +  Y  ++   A  GL E+ + I 
Subjt:  LQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIH

Query:  SYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVI-----KGLESTVEAV
          L+ E  L P++  +NAL+ +         A E + LM+ +GCEPD+ S+ I++      GL S  EAV
Subjt:  SYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVI-----KGLESTVEAV

Q1PFH7 Pentatricopeptide repeat-containing protein At1g623501.7e-1733.7Show/hide
Query:  LSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVE
        +S E + A + LKR++    +LDR   S +SRLLK D+++VL E  RQN+  L +K++E VR E WY+P +  Y D++ +LA N   ++ + +   LK E
Subjt:  LSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVE

Query:  TDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLR-IVKQDAQKIYGDLLEFLEEED
          L  +   F  L+R  +   L   AM  Y  M++    P    FR+++KGL   V   +LR  VK D  +++  ++ +   ED
Subjt:  TDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLR-IVKQDAQKIYGDLLEFLEEED

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic6.0e-1537.02Show/hide
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKP-QVSLYADIITVLASNG
        G  +NR PL KGR LS EAIQ++QSLKR       L       + RL+K D+++VLRELLRQ+ C LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKP-QVSLYADIITVLASNG

Query:  LFEQVQIIHSYLKVETDLAPEIDGFN---------ALLRALVSYNLGELAMESYYLMKKVG-----CEPDKTSFRIVIKGL
         F+++            L  EIDG +          L+RA+V     E  +  Y LM++ G      E D+    ++ KGL
Subjt:  LFEQVQIIHSYLKVETDLAPEIDGFN---------ALLRALVSYNLGELAMESYYLMKKVG-----CEPDKTSFRIVIKGL

Q9SCP4 Pentatricopeptide repeat-containing protein At3g531701.0e-0628.32Show/hide
Query:  MMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVG
        ++  L E +++N    ALK+F  +R +HWY+P+   Y  +  VL +    +Q  ++   +  E  L P ID + +L+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVG

Query:  -CEPDKTSFRIVI
         C+PD  +F ++I
Subjt:  -CEPDKTSFRIVI

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic1.8e-1935.19Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQV
        R PL +G+ L   EA+  +  LKR+K D ++LD+   + + RLLK DM+AV+ EL RQ E  LA+K+FE ++ + WY+P V +Y D+I  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQV

Query:  QIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGL
          +   +K E +L P+   +  ++R  +       AM  Y  M K    P++  FR+++KGL
Subjt:  QIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGL

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-1833.7Show/hide
Query:  LSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVE
        +S E + A + LKR++    +LDR   S +SRLLK D+++VL E  RQN+  L +K++E VR E WY+P +  Y D++ +LA N   ++ + +   LK E
Subjt:  LSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVE

Query:  TDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLR-IVKQDAQKIYGDLLEFLEEED
          L  +   F  L+R  +   L   AM  Y  M++    P    FR+++KGL   V   +LR  VK D  +++  ++ +   ED
Subjt:  TDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLR-IVKQDAQKIYGDLLEFLEEED

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.5e-0827.06Show/hide
Query:  LQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIH
        L K +  + EAI   Q +KR        DR   +  +  L  ++        + ++  ++ K++ ++R+ H  KP +  Y  ++   A  GL E+ + I 
Subjt:  LQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIH

Query:  SYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVI-----KGLESTVEAV
          L+ E  L P++  +NAL+ +         A E + LM+ +GCEPD+ S+ I++      GL S  EAV
Subjt:  SYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVI-----KGLESTVEAV

AT3G27750.1 FUNCTIONS IN: molecular_function unknown4.3e-1637.02Show/hide
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKP-QVSLYADIITVLASNG
        G  +NR PL KGR LS EAIQ++QSLKR       L       + RL+K D+++VLRELLRQ+ C LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKP-QVSLYADIITVLASNG

Query:  LFEQVQIIHSYLKVETDLAPEIDGFN---------ALLRALVSYNLGELAMESYYLMKKVG-----CEPDKTSFRIVIKGL
         F+++            L  EIDG +          L+RA+V     E  +  Y LM++ G      E D+    ++ KGL
Subjt:  LFEQVQIIHSYLKVETDLAPEIDGFN---------ALLRALVSYNLGELAMESYYLMKKVG-----CEPDKTSFRIVIKGL

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-2035.19Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQV
        R PL +G+ L   EA+  +  LKR+K D ++LD+   + + RLLK DM+AV+ EL RQ E  LA+K+FE ++ + WY+P V +Y D+I  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQV

Query:  QIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGL
          +   +K E +L P+   +  ++R  +       AM  Y  M K    P++  FR+++KGL
Subjt:  QIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain4.2e-5649.41Show/hide
Query:  SSPDKLLSSGEKSSCLRLGRAEEYRR---VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR---------------VKNDLQQLDRVYDSKISRLLKFDM
        S  D +L  G K      GR +  +R   + MR  S+NRKPLQ+GR LSIEAIQAVQ+LKR                 +    LDRV  SK  RLLKFDM
Subjt:  SSPDKLLSSGEKSSCLRLGRAEEYRR---VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR---------------VKNDLQQLDRVYDSKISRLLKFDM

Query:  MAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGC
        +AVLRELLRQNEC LALKVFE++R E+WYKPQV +Y D+ITV+A N L E+V  ++S +K E  L  EI+ FN LL  L+++ L +L M+ Y  M+ +G 
Subjt:  MAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGC

Query:  EPDKTSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSM
        EPD+ SFR+++ GLES  E     IV+QDA + YG+ LEF+EE++E +  TS+
Subjt:  EPDKTSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTCTTAGCAATTCCTCCGTCGCCGACGACGATTTTCAGTTCGCCGGACAAGTTACTGAGCTCCGGCGAGAAATCATCTTGCCTCCGACTAGGTAGGGCGGAGGA
ATATCGGAGAGTGACAATGAGAGGCGGAAGCGAGAACCGGAAGCCGTTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAACGAGTCA
AGAACGATCTACAACAATTGGACCGAGTGTATGATTCCAAAATTAGCCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTGCTGCGCCAGAACGAGTGTCTT
TTGGCTCTCAAGGTTTTTGAAGATGTTAGAAATGAACACTGGTACAAACCTCAGGTCTCGCTGTATGCTGATATTATCACAGTATTGGCTAGCAATGGATTATTCGAACA
AGTACAAATTATTCATTCCTACTTGAAAGTAGAAACTGACTTAGCGCCTGAAATTGACGGTTTTAACGCTCTTTTGAGGGCTTTGGTTAGTTATAATTTAGGTGAGCTTG
CGATGGAGTCCTATTACTTGATGAAAAAAGTGGGTTGCGAGCCAGATAAGACTTCTTTCAGGATAGTCATAAAAGGATTGGAATCAACGGTAGAGGCAGTTGATTTAAGA
ATTGTGAAGCAGGATGCACAAAAGATTTATGGTGATTTACTTGAGTTTCTAGAGGAAGAGGATGAGGCAGCTATAGCCACTTCTATGCGC
mRNA sequenceShow/hide mRNA sequence
ATGAATTTCTTAGCAATTCCTCCGTCGCCGACGACGATTTTCAGTTCGCCGGACAAGTTACTGAGCTCCGGCGAGAAATCATCTTGCCTCCGACTAGGTAGGGCGGAGGA
ATATCGGAGAGTGACAATGAGAGGCGGAAGCGAGAACCGGAAGCCGTTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAACGAGTCA
AGAACGATCTACAACAATTGGACCGAGTGTATGATTCCAAAATTAGCCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTGCTGCGCCAGAACGAGTGTCTT
TTGGCTCTCAAGGTTTTTGAAGATGTTAGAAATGAACACTGGTACAAACCTCAGGTCTCGCTGTATGCTGATATTATCACAGTATTGGCTAGCAATGGATTATTCGAACA
AGTACAAATTATTCATTCCTACTTGAAAGTAGAAACTGACTTAGCGCCTGAAATTGACGGTTTTAACGCTCTTTTGAGGGCTTTGGTTAGTTATAATTTAGGTGAGCTTG
CGATGGAGTCCTATTACTTGATGAAAAAAGTGGGTTGCGAGCCAGATAAGACTTCTTTCAGGATAGTCATAAAAGGATTGGAATCAACGGTAGAGGCAGTTGATTTAAGA
ATTGTGAAGCAGGATGCACAAAAGATTTATGGTGATTTACTTGAGTTTCTAGAGGAAGAGGATGAGGCAGCTATAGCCACTTCTATGCGC
Protein sequenceShow/hide protein sequence
MNFLAIPPSPTTIFSSPDKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECL
LALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLR
IVKQDAQKIYGDLLEFLEEEDEAAIATSMR