; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G02280 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G02280
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr7:1970071..1971021
RNA-Seq ExpressionCSPI07G02280
SyntenyCSPI07G02280
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK08129.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]4.3e-11991.94Show/hide
Query:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
        MSFL TL SPTI SPPLKLPSSV   CCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
Subjt:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADIITVLA N LFERVQIILSYMKAE DLAP+IDGFNALLK LV HNLG+LAMESYYLMK+VGCEP+KA
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA

Query:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSI
        SFRIVIKGLE KGEAVDLRTVKQDAQ+LYGESLEFLEE EEGATA SI
Subjt:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSI

XP_008451190.1 PREDICTED: pentatricopeptide repeat-containing protein At3g46870-like [Cucumis melo]1.5e-11992.34Show/hide
Query:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
        MSFL TL SPTI SPPLKLPSSV   CCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
Subjt:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADIITVLA N LFERVQIILSYMKAE DLAP+IDGFNALLKALV HNLG+LAMESYYLMK+VGCEP+KA
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA

Query:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSI
        SFRIVIKGLE KGEAVDLRTVKQDAQ+LYGESLEFLEE EEGATA SI
Subjt:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSI

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata]1.6e-10282.59Show/hide
Query:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
        MSFLAT  S TI SPP  L  S      L+LGR EGY RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKDLQQLDRVYDSKIRRLLKFDM+AVLR
Subjt:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI++VLA N LFE+VQII SY KAE DLAP+I+GFN LLKALVS+NLGELAMESYYLMK+VGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA

Query:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATS
        SFRIVIKGLES  E+VDLRTVK+DAQ LYGESLEFLEEE+E AT  S
Subjt:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATS

XP_031744771.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucumis sativus]4.8e-12697.19Show/hide
Query:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
        MSFLAT SSPTIFSP LK PSSV TACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
Subjt:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
        ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLA N LFERVQIILSYMKAEADLAP+IDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA

Query:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSIQ
        SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSIQ
Subjt:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSIQ

XP_038888189.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Benincasa hispida]2.8e-11087.85Show/hide
Query:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
        MSFL    SPTI SP  KL SS   A CL LGRAEGY +VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDM+AVLR
Subjt:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADII VLA N LFERVQII SY+KAE DLAP+IDGFNALLKALVSHNLGELAMESYYLMK++GCEPDKA
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA

Query:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATS
        SFRIVIKGLESKGEAVDLRTVKQDAQ+LYGESLEFLEEEEE A A S
Subjt:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATS

TrEMBL top hitse value%identityAlignment
A0A0A0K6A9 Uncharacterized protein2.3e-12697.19Show/hide
Query:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
        MSFLAT SSPTIFSP LK PSSV TACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
Subjt:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
        ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLA N LFERVQIILSYMKAEADLAP+IDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA

Query:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSIQ
        SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSIQ
Subjt:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSIQ

A0A1S3BRZ8 pentatricopeptide repeat-containing protein At3g46870-like7.2e-12092.34Show/hide
Query:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
        MSFL TL SPTI SPPLKLPSSV   CCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
Subjt:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADIITVLA N LFERVQIILSYMKAE DLAP+IDGFNALLKALV HNLG+LAMESYYLMK+VGCEP+KA
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA

Query:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSI
        SFRIVIKGLE KGEAVDLRTVKQDAQ+LYGESLEFLEE EEGATA SI
Subjt:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSI

A0A5A7UZI6 Pentatricopeptide repeat-containing protein7.2e-12092.34Show/hide
Query:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
        MSFL TL SPTI SPPLKLPSSV   CCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
Subjt:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADIITVLA N LFERVQIILSYMKAE DLAP+IDGFNALLKALV HNLG+LAMESYYLMK+VGCEP+KA
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA

Query:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSI
        SFRIVIKGLE KGEAVDLRTVKQDAQ+LYGESLEFLEE EEGATA SI
Subjt:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSI

A0A5D3CCT4 Pentatricopeptide repeat-containing protein2.1e-11991.94Show/hide
Query:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
        MSFL TL SPTI SPPLKLPSSV   CCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
Subjt:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADIITVLA N LFERVQIILSYMKAE DLAP+IDGFNALLK LV HNLG+LAMESYYLMK+VGCEP+KA
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA

Query:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSI
        SFRIVIKGLE KGEAVDLRTVKQDAQ+LYGESLEFLEE EEGATA SI
Subjt:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATSI

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic8.0e-10382.59Show/hide
Query:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR
        MSFLAT  S TI SPP  L  S      L+LGR EGY RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKDLQQLDRVYDSKIRRLLKFDM+AVLR
Subjt:  MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI++VLA N LFE+VQII SY+KAE DLAP+I+GFN LLKALVS+NLGELAMESYYLMK+VGCEPDKA
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKA

Query:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATS
        SFRIVIKGLES  E+VDLR VK+DAQ LYGESLEFLEEE+E AT  S
Subjt:  SFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEEEGATATS

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351303.5e-0729.36Show/hide
Query:  KEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGLESKGEAVDLR
        + H  KP +  Y  ++   A   L E+ + I   ++ E  L PD+  +NAL+++         A E + LM+ +GCEPD+AS+ I++      G   D  
Subjt:  KEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGLESKGEAVDLR

Query:  TVKQDAQRL
         V ++ +RL
Subjt:  TVKQDAQRL

Q1PFH7 Pentatricopeptide repeat-containing protein At1g623502.5e-1633.55Show/hide
Query:  LSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAE
        +S E + A + LKR +    +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N+  +  + +   +K E
Subjt:  LSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAE

Query:  ADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGL
         ++  D   F  L++  + + L   AM  Y  M++    P    FR+++KGL
Subjt:  ADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGL

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic6.5e-1737.29Show/hide
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIITVLACNE
        G  +NR PL KGR LS EAIQ++QSLKR  +    L       +RRL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADI+  L  N+
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIITVLACNE

Query:  LFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVG-----CEPDKASFRIVIKGLESKGE
         F+ +  ++  +    D   D      L++A+V     E  +  Y LM++ G      E D+    ++ KGL   GE
Subjt:  LFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVG-----CEPDKASFRIVIKGLESKGE

Q9SCP4 Pentatricopeptide repeat-containing protein At3g531705.5e-0826.85Show/hide
Query:  IEAIQAVQSLKRTKKDLQQLDRVYDS-KIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEA
        ++  + +  + RT   ++ ++R  +S K   L    +L  L E +++N    ALK+F  +RK+HWY+P+   Y  +  VL   +  ++  ++   M +E 
Subjt:  IEAIQAVQSLKRTKKDLQQLDRVYDS-KIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEA

Query:  DLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVG-CEPDKASFRIVI
         L P ID + +L+       L + A  +   MK V  C+PD  +F ++I
Subjt:  DLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVG-CEPDKASFRIVI

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic3.7e-2034.2Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERV
        R PL +G+ L   EA+  +  LKR K+D ++LD+   + + RLLK DMLAV+ EL RQ E +LA+K+FE ++K+ WY+P V +Y D+I  LA ++  +  
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERV

Query:  QIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEE
          +   MK E +L PD   +  +++  +       AM  Y  M      P++  FR+++KGL      +    VK+D + L+ E   +   EE
Subjt:  QIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEE

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-1733.55Show/hide
Query:  LSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAE
        +S E + A + LKR +    +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N+  +  + +   +K E
Subjt:  LSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAE

Query:  ADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGL
         ++  D   F  L++  + + L   AM  Y  M++    P    FR+++KGL
Subjt:  ADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGL

AT3G27750.1 FUNCTIONS IN: molecular_function unknown4.6e-1837.29Show/hide
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIITVLACNE
        G  +NR PL KGR LS EAIQ++QSLKR  +    L       +RRL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADI+  L  N+
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIITVLACNE

Query:  LFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVG-----CEPDKASFRIVIKGLESKGE
         F+ +  ++  +    D   D      L++A+V     E  +  Y LM++ G      E D+    ++ KGL   GE
Subjt:  LFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVG-----CEPDKASFRIVIKGLESKGE

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein2.6e-2134.2Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERV
        R PL +G+ L   EA+  +  LKR K+D ++LD+   + + RLLK DMLAV+ EL RQ E +LA+K+FE ++K+ WY+P V +Y D+I  LA ++  +  
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERV

Query:  QIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEE
          +   MK E +L PD   +  +++  +       AM  Y  M      P++  FR+++KGL      +    VK+D + L+ E   +   EE
Subjt:  QIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGLESKGEAVDLRTVKQDAQRLYGESLEFLEEEE

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.9e-0926.85Show/hide
Query:  IEAIQAVQSLKRTKKDLQQLDRVYDS-KIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEA
        ++  + +  + RT   ++ ++R  +S K   L    +L  L E +++N    ALK+F  +RK+HWY+P+   Y  +  VL   +  ++  ++   M +E 
Subjt:  IEAIQAVQSLKRTKKDLQQLDRVYDS-KIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEA

Query:  DLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVG-CEPDKASFRIVI
         L P ID + +L+       L + A  +   MK V  C+PD  +F ++I
Subjt:  DLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVG-CEPDKASFRIVI

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain4.4e-6156.25Show/hide
Query:  VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR---------------TKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWY
        + MR  S+NRKPLQ+GR LSIEAIQAVQ+LKR               T      LDRV  SK RRLLKFDM+AVLRELLRQNECSLALKVFE++RKE+WY
Subjt:  VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR---------------TKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSLALKVFEDVRKEHWY

Query:  KPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGLESKGEAVDLRTVKQD
        KPQV +Y D+ITV+A N L E V  + S MK+E  L  +I+ FN LL  L++H L +L M+ Y  M+ +G EPD+ASFR+++ GLES GE      V+QD
Subjt:  KPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGLESKGEAVDLRTVKQD

Query:  AQRLYGESLEFLEEEEEGATATSI
        A   YGESLEF+EE+EE ++ TS+
Subjt:  AQRLYGESLEFLEEEEEGATATSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTCTTAGCAACTCTTTCGTCGCCGACGATTTTCAGTCCGCCACTCAAGTTACCAAGCTCTGTCGAGACAGCATGCTGCCTGCAACTAGGTAGGGCGGAGGGATA
TCCGAGAGTGACAATGAGAGGCGGAAGTGAAAACCGGAAGCCACTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCCGTGCAATCGTTGAAGCGAACGAAGA
AAGATTTACAACAGTTGGACCGAGTGTATGATTCCAAAATTAGGCGCTTATTGAAGTTTGATATGTTGGCTGTTCTTCGCGAGCTCCTCCGCCAGAACGAGTGTTCTTTG
GCTCTTAAGGTTTTCGAAGATGTTAGAAAAGAACACTGGTACAAGCCTCAGGTCTCGCTGTATGCTGATATTATTACAGTATTGGCTTGCAATGAATTGTTCGAACGAGT
ACAAATTATTCTTTCGTACATGAAAGCAGAGGCTGATTTAGCACCTGACATTGACGGGTTTAACGCTCTTTTGAAGGCATTGGTTAGTCATAATTTAGGTGAACTGGCGA
TGGAGTCGTATTATTTGATGAAGGATGTAGGTTGTGAGCCAGATAAGGCTTCTTTCAGGATTGTCATAAAAGGATTGGAATCAAAGGGAGAGGCAGTTGATTTAAGAACT
GTGAAACAGGATGCACAAAGACTTTATGGTGAATCACTCGAGTTTTTGGAGGAAGAAGAAGAGGGAGCTACAGCCACATCTATACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTTCTTAGCAACTCTTTCGTCGCCGACGATTTTCAGTCCGCCACTCAAGTTACCAAGCTCTGTCGAGACAGCATGCTGCCTGCAACTAGGTAGGGCGGAGGGATA
TCCGAGAGTGACAATGAGAGGCGGAAGTGAAAACCGGAAGCCACTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCCGTGCAATCGTTGAAGCGAACGAAGA
AAGATTTACAACAGTTGGACCGAGTGTATGATTCCAAAATTAGGCGCTTATTGAAGTTTGATATGTTGGCTGTTCTTCGCGAGCTCCTCCGCCAGAACGAGTGTTCTTTG
GCTCTTAAGGTTTTCGAAGATGTTAGAAAAGAACACTGGTACAAGCCTCAGGTCTCGCTGTATGCTGATATTATTACAGTATTGGCTTGCAATGAATTGTTCGAACGAGT
ACAAATTATTCTTTCGTACATGAAAGCAGAGGCTGATTTAGCACCTGACATTGACGGGTTTAACGCTCTTTTGAAGGCATTGGTTAGTCATAATTTAGGTGAACTGGCGA
TGGAGTCGTATTATTTGATGAAGGATGTAGGTTGTGAGCCAGATAAGGCTTCTTTCAGGATTGTCATAAAAGGATTGGAATCAAAGGGAGAGGCAGTTGATTTAAGAACT
GTGAAACAGGATGCACAAAGACTTTATGGTGAATCACTCGAGTTTTTGGAGGAAGAAGAAGAGGGAGCTACAGCCACATCTATACAATGA
Protein sequenceShow/hide protein sequence
MSFLATLSSPTIFSPPLKLPSSVETACCLQLGRAEGYPRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRTKKDLQQLDRVYDSKIRRLLKFDMLAVLRELLRQNECSL
ALKVFEDVRKEHWYKPQVSLYADIITVLACNELFERVQIILSYMKAEADLAPDIDGFNALLKALVSHNLGELAMESYYLMKDVGCEPDKASFRIVIKGLESKGEAVDLRT
VKQDAQRLYGESLEFLEEEEEGATATSIQ