; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0305 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0305
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationMC04:2416911..2421930
RNA-Seq ExpressionMC04g0305
SyntenyMC04g0305
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572189.1 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]5.93e-13483.13Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQVSLYADI++VLASNGLFEQVQIIHSY K ETDLAPEI+GFN LL+ALV+YNLG+LAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSM
        TSFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEEDEAA   SM
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSM

XP_022135810.1 pentatricopeptide repeat-containing protein At1g62350-like [Momordica charantia]5.54e-169100Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR
        TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata]4.17e-13483.87Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQVSLYADI++VLASNGLFEQVQIIHSY K ETDLAPEI+GFN LL+ALVSYNLGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS
        TSFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEEDEAA   S
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS

XP_023554588.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo]2.07e-13483.47Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PSPT I S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQ+SLYADI++VLASNGLFE VQIIHSYLK ETDLAPEI+GFN LL+ALVSYNLGELAMESYYLMKKVGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS
        TSFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEED+ A   S
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS

XP_038888189.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Benincasa hispida]1.37e-13986.29Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FL  PPSPT I S  +KLLSSG ++SCL LGRAE YR+VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVRNEHWYKPQVSLYADII VLASNGLFE+VQIIHSYLK ETDLAPEIDGFNALL+ALVS+NLGELAMESYYLMK++GCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS
         SFRIVIKGLES  EAVDLR VKQDAQK+YG+ LEFLEEE+EAAIA S
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS

TrEMBL top hitse value%identityAlignment
A0A0A0K6A9 Uncharacterized protein2.24e-13182.4Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA P SPT IFS   K  SS   + CL+LGRAE Y RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDM+AVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR EHWYKPQVSLYADIITVLASNGLFE+VQII SY+K E DLAPEIDGFNALL+ALVS+NLGELAMESYYLMK VGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR
         SFRIVIKGLES  EAVDLR VKQDAQ++YG+ LEFLEEE+E A ATS++
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR

A0A5A7UZI6 Pentatricopeptide repeat-containing protein5.26e-13081.93Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FL   PSPT I S P KL SS  K  CL+LGRAE Y RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDM+AVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFE+VQII SY+K ETDLAPEIDGFNALL+ALV +NLG+LAMESYYLMK+VGCEP+K
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSM
         SFRIVIKGLE   EAVDLR VKQDAQK+YG+ LEFLEE +E A A S+
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSM

A0A6J1C1T5 pentatricopeptide repeat-containing protein At1g62350-like2.68e-169100Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR
        TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATSMR

A0A6J1GL50 protein THYLAKOID ASSEMBLY 8-like, chloroplastic2.02e-13483.87Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQVSLYADI++VLASNGLFEQVQIIHSY K ETDLAPEI+GFN LL+ALVSYNLGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS
        TSFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEEDEAA   S
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic5.79e-13483.87Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQVSLYADI++VLASNGLFEQVQIIHSYLK ETDLAPEI+GFN LL+ALVSYNLGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDK

Query:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS
         SFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEEDEAA   S
Subjt:  TSFRIVIKGLESTVEAVDLRIVKQDAQKIYGDLLEFLEEEDEAAIATS

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351303.5e-0727.06Show/hide
Query:  LQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIH
        L K +  + EAI   Q +KR        DR   +  +  L  ++        + ++  ++ K++ ++R+ H  KP +  Y  ++   A  GL E+ + I 
Subjt:  LQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIH

Query:  SYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVI-----KGLESTVEAV
          L+ E  L P++  +NAL+ +         A E + LM+ +GCEPD+ S+ I++      GL S  EAV
Subjt:  SYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVI-----KGLESTVEAV

Q1PFH7 Pentatricopeptide repeat-containing protein At1g623501.7e-1733.7Show/hide
Query:  LSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVE
        +S E + A + LKR++    +LDR   S +SRLLK D+++VL E  RQN+  L +K++E VR E WY+P +  Y D++ +LA N   ++ + +   LK E
Subjt:  LSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVE

Query:  TDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLR-IVKQDAQKIYGDLLEFLEEED
          L  +   F  L+R  +   L   AM  Y  M++    P    FR+++KGL   V   +LR  VK D  +++  ++ +   ED
Subjt:  TDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLR-IVKQDAQKIYGDLLEFLEEED

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic6.0e-1537.02Show/hide
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKP-QVSLYADIITVLASNG
        G  +NR PL KGR LS EAIQ++QSLKR       L       + RL+K D+++VLRELLRQ+ C LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKP-QVSLYADIITVLASNG

Query:  LFEQVQIIHSYLKVETDLAPEIDGFN---------ALLRALVSYNLGELAMESYYLMKKVG-----CEPDKTSFRIVIKGL
         F+++            L  EIDG +          L+RA+V     E  +  Y LM++ G      E D+    ++ KGL
Subjt:  LFEQVQIIHSYLKVETDLAPEIDGFN---------ALLRALVSYNLGELAMESYYLMKKVG-----CEPDKTSFRIVIKGL

Q9SCP4 Pentatricopeptide repeat-containing protein At3g531701.0e-0628.32Show/hide
Query:  MMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVG
        ++  L E +++N    ALK+F  +R +HWY+P+   Y  +  VL +    +Q  ++   +  E  L P ID + +L+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVG

Query:  -CEPDKTSFRIVI
         C+PD  +F ++I
Subjt:  -CEPDKTSFRIVI

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic1.8e-1935.19Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQV
        R PL +G+ L   EA+  +  LKR+K D ++LD+   + + RLLK DM+AV+ EL RQ E  LA+K+FE ++ + WY+P V +Y D+I  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQV

Query:  QIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGL
          +   +K E +L P+   +  ++R  +       AM  Y  M K    P++  FR+++KGL
Subjt:  QIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGL

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-1833.7Show/hide
Query:  LSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVE
        +S E + A + LKR++    +LDR   S +SRLLK D+++VL E  RQN+  L +K++E VR E WY+P +  Y D++ +LA N   ++ + +   LK E
Subjt:  LSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVE

Query:  TDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLR-IVKQDAQKIYGDLLEFLEEED
          L  +   F  L+R  +   L   AM  Y  M++    P    FR+++KGL   V   +LR  VK D  +++  ++ +   ED
Subjt:  TDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLR-IVKQDAQKIYGDLLEFLEEED

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.5e-0827.06Show/hide
Query:  LQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIH
        L K +  + EAI   Q +KR        DR   +  +  L  ++        + ++  ++ K++ ++R+ H  KP +  Y  ++   A  GL E+ + I 
Subjt:  LQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIH

Query:  SYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVI-----KGLESTVEAV
          L+ E  L P++  +NAL+ +         A E + LM+ +GCEPD+ S+ I++      GL S  EAV
Subjt:  SYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVI-----KGLESTVEAV

AT3G27750.1 FUNCTIONS IN: molecular_function unknown4.3e-1637.02Show/hide
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKP-QVSLYADIITVLASNG
        G  +NR PL KGR LS EAIQ++QSLKR       L       + RL+K D+++VLRELLRQ+ C LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKP-QVSLYADIITVLASNG

Query:  LFEQVQIIHSYLKVETDLAPEIDGFN---------ALLRALVSYNLGELAMESYYLMKKVG-----CEPDKTSFRIVIKGL
         F+++            L  EIDG +          L+RA+V     E  +  Y LM++ G      E D+    ++ KGL
Subjt:  LFEQVQIIHSYLKVETDLAPEIDGFN---------ALLRALVSYNLGELAMESYYLMKKVG-----CEPDKTSFRIVIKGL

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-2035.19Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQV
        R PL +G+ L   EA+  +  LKR+K D ++LD+   + + RLLK DM+AV+ EL RQ E  LA+K+FE ++ + WY+P V +Y D+I  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQV

Query:  QIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGL
          +   +K E +L P+   +  ++R  +       AM  Y  M K    P++  FR+++KGL
Subjt:  QIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain3.2e-5652.23Show/hide
Query:  VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR---------------VKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWY
        + MR  S+NRKPLQ+GR LSIEAIQAVQ+LKR                 +    LDRV  SK  RLLKFDM+AVLRELLRQNEC LALKVFE++R E+WY
Subjt:  VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR---------------VKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALKVFEDVRNEHWY

Query:  KPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLRIVKQD
        KPQV +Y D+ITV+A N L E+V  ++S +K E  L  EI+ FN LL  L+++ L +L M+ Y  M+ +G EPD+ SFR+++ GLES  E     IV+QD
Subjt:  KPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLRIVKQD

Query:  AQKIYGDLLEFLEEEDEAAIATSM
        A + YG+ LEF+EE++E +  TS+
Subjt:  AQKIYGDLLEFLEEEDEAAIATSM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTCTTAGCAATTCCTCCGTCGCCGACGACGATTTTCAGTTCGCCGCACAAGTTACTGAGCTCCGGCGAGAAATCATCTTGCCTCCGACTAGGTAGGGCGGAGGA
ATATCGGAGAGTGACAATGAGAGGCGGAAGCGAGAACCGGAAGCCGTTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAACGAGTCA
AGAACGATCTACAACAATTGGACCGAGTGTATGATTCCAAAATTAGCCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTGCTGCGCCAGAACGAGTGTCTT
TTGGCTCTCAAGGTTTTTGAAGATGTTAGAAATGAACACTGGTACAAACCTCAGGTCTCGCTGTATGCTGATATTATCACAGTATTGGCTAGCAATGGATTATTCGAACA
AGTACAAATTATTCATTCCTACTTGAAAGTAGAAACTGACTTAGCGCCTGAAATTGACGGTTTTAACGCTCTTTTGAGGGCTTTGGTTAGTTATAATTTAGGTGAGCTTG
CGATGGAGTCCTATTACTTGATGAAAAAAGTGGGTTGCGAGCCAGATAAGACTTCTTTCAGGATAGTCATAAAAGGATTGGAATCAACGGTAGAGGCAGTTGATTTAAGA
ATTGTGAAGCAGGATGCACAAAAGATTTATGGTGATTTACTTGAGTTTCTAGAGGAAGAGGATGAGGCAGCTATAGCCACTTCTATGCGCTGA
mRNA sequenceShow/hide mRNA sequence
GTAATAGGGGCGGGACTTCACCCGGAGAGAGCGCGAACGATGAATTTCTTAGCAATTCCTCCGTCGCCGACGACGATTTTCAGTTCGCCGCACAAGTTACTGAGCTCCGG
CGAGAAATCATCTTGCCTCCGACTAGGTAGGGCGGAGGAATATCGGAGAGTGACAATGAGAGGCGGAAGCGAGAACCGGAAGCCGTTGCAGAAGGGGAGGAACCTCAGCA
TCGAAGCAATTCAAGCGGTACAGTCGTTGAAACGAGTCAAGAACGATCTACAACAATTGGACCGAGTGTATGATTCCAAAATTAGCCGCTTATTGAAGTTCGATATGATG
GCTGTCCTTCGCGAGCTGCTGCGCCAGAACGAGTGTCTTTTGGCTCTCAAGGTTTTTGAAGATGTTAGAAATGAACACTGGTACAAACCTCAGGTCTCGCTGTATGCTGA
TATTATCACAGTATTGGCTAGCAATGGATTATTCGAACAAGTACAAATTATTCATTCCTACTTGAAAGTAGAAACTGACTTAGCGCCTGAAATTGACGGTTTTAACGCTC
TTTTGAGGGCTTTGGTTAGTTATAATTTAGGTGAGCTTGCGATGGAGTCCTATTACTTGATGAAAAAAGTGGGTTGCGAGCCAGATAAGACTTCTTTCAGGATAGTCATA
AAAGGATTGGAATCAACGGTAGAGGCAGTTGATTTAAGAATTGTGAAGCAGGATGCACAAAAGATTTATGGTGATTTACTTGAGTTTCTAGAGGAAGAGGATGAGGCAGC
TATAGCCACTTCTATGCGCTGAGCTGCAGAATCTAGATCCCAATCATGGAAGCCAATATATGCACAGATTGAACTTGTGGCTAGATGTAGTTCGTCTGGGTTTCCTTTGA
AGAAACCTGAAGAGACCAAGAAAGTTGCCTTGAACATGAATTTATGCACACATGGCAGTGAAATATTTGTGCTTCCCATGCTGCTGGGATCCAAATATTTCACGCAATTG
GTACTGTCGTTAGTCTATGCAGAGAGAGCTGTACATCTTGCAATGCAACATACTTTAGCCTTCTGTTGTGAGTATGAGAATTTCATTAAGTCCACCGAAGATGGAAATAT
AAGGAAAATAACTGCATTATAATGCTAGGTGCATGATACTGGGTTTCTTTTTCCTTCCACAGTGGGCATAATTATATGTAGAATACAAGGACAAAACCGGGTTCTGTAGA
GCTACATTGCTTTTGTAAAACTATACATAGTTCAGATGAAAAATGCAGGCTCTATGCGACTGCTAATTTTGTGCAGTCTAATTCTGATGCGCGTTTACACATCAAGAGAT
TTAACATGCCACTGCCAGTGTAATACTAATGCTTGAGTTTCTAGTTATATTTCCTTGAAGAAGGCCTAAAATTTTGATTTAGATTCCTCAATTGCATGACAATGTCAAAA
CTGTTTCCCTTCAAATGTCTGCCCGAACTGCCAGTTTCCAGGTGCAACACCATAGGAAGTAAGTGCTCTTCCACTGCTAGAGGTAACTTCAAAAGAGAGAGGTTGCCCAT
GAAGATTGACATTGCTTTGCCAGTTTTGTCCCCAGTTCCTTGCCAGTGGTATCCATCCTGTTCTTGATCCCTTCACTTTCACTGCCACCAATTCACCGTCCATTCCGACA
TTGGTAATCAGAACTTGAAAGAAGCGAGAATTGCCACGGACTGTGAATCTCAATCCACCACTCCTGTCACACCTCACCCTACACACATTTTAGACACGGTTTTTCCCTTA
GCCGAAGATGACAAAACGTTCGAATCTATATGCATTTAGATTGATATTACCTCCTGTATTGAACTGGCACAATATCTGCTTTTCTCTCTGCTATTTCAGCAAACGCTGCC
TCTGACATCTCAAAGTGTTCTTGGGGAAAATTGCACCACCCACCATAATCAGAGGATAAGCCATAATTGGGAGGACAGAAGTCCGTCGCTGTTAGAATCACGGTCGGGCT
TCCTTGCAAGCACCATAAGATGTGGTCAACACATCTAAGCTCATAGCAAGCTCCACAGTAGCAGATTTCCACTCTTCATCGTTTGCCACATAAATGTTGGGTGTTCTCAG
AAGAACCAAGCAGACTATTACAGCTACAAAGGCAACCATGTTCAGAAGAATATAAGGCCAGGGTCTTGAAGAAGCCTGAGGCAGCACTATCCGGAAACAGGAAAAACTTT
GACAAATCTACAACATCATAGCCCCTCGTTCACATTGGTGGCGCGTTTAACGAGGCAGAGGAAAAGACACACACATTTCATTGTTTACTGTCCGCCGAGAATGGGGGCTC
AAAGAAGCGACCTTGGAGCCACAGCCACCATTTTTGCAGATATCGAGGTGGAAGGAAGGGTTCGAAGCTTTATGCTGATGGCGGCACGGCAGACAAGAGAGCAGCAGAAA
GCCATAGAAGCAGGTATGTTTATGTTTATGTAATTTATATTTTAAGATCCCATGGCCTTCTTTCTTGACTTGAACGAGAGAATAAAAAGAATCAGAAGAAGCTATAAAGA
ACCCAAAAAAGGGGTCTGTTTTGGGTCTTTTGATTGGAGTGGAGAAGGACCCGAACGAGTTTAAGAAAATGAAAAACAGTGGGCTTTTTTTTTTTTCTCTTGGCTCTTCT
TTTCATAACAAAAAAAGGTAGTGAGTGGGTCCCACTGTTGGAAGACGTGGATTGCCGCTGCCGGGTAATGAAAGCCCGTTTCCTTTGGCGGGAAACGGTGCTGGATGGCC
TTTTCTTTTTTTTTTTTTTAAATCTTTTATTTCTTTTGAGCCATTTACATAAATGATAAATAATAGTATGCATTTTAAAATATAACAATAAACTTATACTCAAATCCACC
AAAAATATAACTTTTTTTTTTCTTTTTATCGACTTGGTTAATTTTCTGCTTTGTAGGCTGCTTTTTTTGTCCCTGGAAATTTGTTTGACATCTGAAGATATTTCTTTCTT
TTCTTGTTTTTGAAAAACCTTTATAACAGTTAAAAAATTACTATAAATATAACGATTTTTTTTTTCCTTCTCTTGTAAAAAATAACTGGAGATTTCAAAATGCGAAAACA
CAAAAATAATACTGTAACCAAAATATAGTAGGACAAAACAATTGTTATGTATAGTAGGACAAAACGATTGTTATGTGAGAATAGTTGGGGTCAAACATTCAGAAAAAAGA
CCATTTAAGTTTCATTTAAGTACTAGGAATCAAAAAGAAATCAATTAGAATAAAAGCGATCTGTCATATAATTTACTATACATTTTTTGAAATCATAATCAGAAACTAAC
TTGATCTATATGGTTTGGATTACTTTCAATGTGGTTTTTACAATTATAAAAGTTAAAATTTGATTCTTATAGTTTGATCTACTATCAATTGGAAATGAACAAGTGTGTTC
TATTATAAAGATTTAGGATCAAGTAAAAAAATAAAAGAAAATATGTATTGGTAGAGTTTAACCACCGGGACCAAATTAAAACTAAGCTCGAATCATAAAAATTAAATTAA
AACTCAGGAAACCATAGGAATAAAACAAATAATTTAACCAAAAAAAAAACATACAAATATGACTTCTTCTTGCTATATATATTTCTACTACATAAAAATAATAATTTCAA
AATCCAGAAGATTGCCTTCCCATTTCCTACCCTAATTTCTAATCAACGGTAGGAGTGAAAATTTCATATTCACTGTTTGACTATTCTGATCTCTTTAAGTCTTCCAAAAT
TCATACCAAAATTGCTGTTCATAAGATATGACATAAAAATAATTAAAGAATAGGTGTCTGTGGGGAAAGAAGTTTAAGTTAGGTTATAATATCTTTCAAACTAACAATGC
AGATGATGTCTTCTTCTGTATTGTCTTCAATCATATTTCTGTTGTGATCTTGTTGTTCTTGCTTGTTCTTGCTGCTCTTCTTGTTATTGTTATTGCTATTGCTGTGCCAA
TTGTTCTTGTGAAAGTTGTAGGAGTTGAGTGTTGTTTGATCAAGAGTCCCTTCTCTGTATACAAAATTACCCCCTTCTTCGTCCATTCGTCGCCACTTCCTGTTTCGATC
CTGGTTTCGGTACAATGGCAGTCCTCTACAAAATAAACAACAAATATATGTATGTATTCTATCTTTGTAACCACTATTTAGATGCTTTAAATACTATTTTAGTCCATCAA
CTTTCGAAACATTTATTTTGATTCATGTACTTCTGAAAAGTGACTGTTTCGGTCCTTCATTTATCATAATTTCAACACGACATTTCGATCACTAAGATCTAGTTTGATAA
CTGTTTTGTTTTTT
Protein sequenceShow/hide protein sequence
MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECL
LALKVFEDVRNEHWYKPQVSLYADIITVLASNGLFEQVQIIHSYLKVETDLAPEIDGFNALLRALVSYNLGELAMESYYLMKKVGCEPDKTSFRIVIKGLESTVEAVDLR
IVKQDAQKIYGDLLEFLEEEDEAAIATSMR