CuGenDBv2

CSPI04G02700 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G02700
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr4:1643322..1645782
RNA-Seq Expression: CSPI04G02700
Synteny: CSPI04G02700
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like
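
The genome location above can be read programmatically. The following is a minimal sketch, assuming the `Chr:start..end` string uses 1-based, inclusive coordinates (an assumption, not something this record states); `parse_location` and `span_length` are hypothetical helper names.

```python
import re

# Pattern for strings such as "Chr4:1643322..1645782".
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return match["chrom"], start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr4:1643322..1645782")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 2461 bp on Chr4.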


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044536.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 3.3e-119; identity: 90.87%)
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLHF QLGLRQNLTNRSL C TAAPPPNIICGLRKG  +PLG SRVPSNEAIQAVQSLKLAK TSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMIM+MGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERA +TYRLMIASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTK
        LIKNLEKF+EEFA+VVK+DC+EYLDNPQKFFND GQKLTTK
Subjt:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTK

KGN53003.1 hypothetical protein Csa_015115 [Cucumis sativus] (e value: 6.8e-133; identity: 99.59%)
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTKVRIL
        LIKNLEKFREEFAVVVK+DCNEYLDNPQKFFNDNGQKLTTKVRIL
Subjt:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTKVRIL

XP_004152311.2 pentatricopeptide repeat-containing protein At1g62350 [Cucumis sativus] (e value: 6.8e-133; identity: 99.59%)
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTKVRIL
        LIKNLEKFREEFAVVVK+DCNEYLDNPQKFFNDNGQKLTTKVRIL
Subjt:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTKVRIL

XP_008454079.1 PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Cucumis melo] (e value: 1.6e-118; identity: 90.46%)
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLHF +LGLRQNLTNRSL C TAAPPPNIICGLRKG  +PLG SRVPSNEAIQAVQSLKLAK TSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMIM+MGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERA +TYRLMIASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTK
        LIKNLEKF+EEFA+VVK+DC EYLDNPQKFFND GQKLTTK
Subjt:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTK

XP_038878176.1 pentatricopeptide repeat-containing protein At1g62350-like [Benincasa hispida] (e value: 1.0e-109; identity: 83.67%)
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQ HF QLGLRQ+LTN SL C TAAPPPNIICGLRKG  +PLG SRVPS EAIQAVQSLKLAKSTSKMEDVIN+KL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKF+QNEEW+EPDLRLYHGMI++ GKNKMIEMAEE+FHKL+KDGLEPD RAFNEMMGAYLQVDM+ERAVETY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTKVRIL
        LIKNLEKFREEFA VVK++CN YLD+P+KF ND  QK T K RIL
Subjt:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTKVRIL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KU23 Uncharacterized protein (e value: 3.3e-133; identity: 99.59%)
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTKVRIL
        LIKNLEKFREEFAVVVK+DCNEYLDNPQKFFNDNGQKLTTKVRIL
Subjt:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTKVRIL

A0A1S3BX97 pentatricopeptide repeat-containing protein At1g62350-like (e value: 7.8e-119; identity: 90.46%)
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLHF +LGLRQNLTNRSL C TAAPPPNIICGLRKG  +PLG SRVPSNEAIQAVQSLKLAK TSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMIM+MGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERA +TYRLMIASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTK
        LIKNLEKF+EEFA+VVK+DC EYLDNPQKFFND GQKLTTK
Subjt:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTK

A0A5A7TQZ8 Pentatricopeptide repeat-containing protein (e value: 1.6e-119; identity: 90.87%)
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLHF QLGLRQNLTNRSL C TAAPPPNIICGLRKG  +PLG SRVPSNEAIQAVQSLKLAK TSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMIM+MGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERA +TYRLMIASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTK
        LIKNLEKF+EEFA+VVK+DC+EYLDNPQKFFND GQKLTTK
Subjt:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTK

A0A5D3CZ95 Pentatricopeptide repeat-containing protein (e value: 7.8e-119; identity: 90.46%)
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLHF +LGLRQNLTNRSL C TAAPPPNIICGLRKG  +PLG SRVPSNEAIQAVQSLKLAK TSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMIM+MGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERA +TYRLMIASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTK
        LIKNLEKF+EEFA+VVK+DC EYLDNPQKFFND GQKLTTK
Subjt:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTK

A0A6J1IY98 pentatricopeptide repeat-containing protein At1g62350-like (e value: 1.2e-103; identity: 79.59%)
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLH  QLG RQNLTN +L C TA PPPNIICGLRKG  +PLG SRVPS E+IQAVQSLKLAKS SKMEDVIN+KL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFM+NEEW+EPDL LYH MI +MGKNKMIEMAEEVFH  ++DGLEPDTRAFNEMMGAYLQVDM+ERAVETY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTKVRIL
        LIKNLE+FREEFA VVK++C E+LD+P+KF  D  QKL  K++IL
Subjt:  LIKNLEKFREEFAVVVKEDCNEYLDNPQKFFNDNGQKLTTKVRIL

SwissProt top hits (e value, % identity, alignment)
A7LN87 Pentatricopeptide repeat-containing protein PPR5, chloroplastic (e value: 4.9e-09; identity: 26.97%)
Query:  EAIQAVQSL--KLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGL
        EA + V+SL  + A    ++  V++  +  +     F    EL R++     L VF++MQ + W+  D  +Y  +I +MG+   I MA  +F ++R  G 
Subjt:  EAIQAVQSL--KLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGL

Query:  EPDTRAFNEMMGAYL----QVDMIERAVETY-RLMIASGCTPDELTFKILIK
        +PDT  +N ++GA+L    +   + +A+  + ++     C P  +T+ IL++
Subjt:  EPDTRAFNEMMGAYL----QVDMIERAVETY-RLMIASGCTPDELTFKILIK

Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350 (e value: 1.1e-16; identity: 27.75%)
Query:  SNEAIQAVQSLKLAKSTS-KMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDG
        S E + A + LK  ++ S +++  I + + RLLK+DL   L+E QRQN++ L +++++ ++ E W+ PD+  Y  M+M++ +NK ++  ++V+  L+K+ 
Subjt:  SNEAIQAVQSLKLAKSTS-KMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDG

Query:  LEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKEDCNEYL------DNPQKFFNDNGQKLTT
        +  D   F +++  +L  ++   A+  Y  M  S   P  L F++++K L  +  E    VK+D  E        D P+    D+ ++  T
Subjt:  LEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKEDCNEYL------DNPQKFFNDNGQKLTT

Q9FKC3 Pentatricopeptide repeat-containing protein At5g48730, chloroplastic (e value: 1.9e-08; identity: 26.45%)
Query:  SRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLG--RLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHK
        +++ S +AI  +   +  KS      +I  K G  +LL   + ++L E       E ++QVF+ ++ + W++P++ +Y  +I+++GK K  E A E+F +
Subjt:  SRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLG--RLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHK

Query:  LRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIAS-GCTPDELTFKILIKN
        +  +G   +   +  ++ AY +    + A      M +S  C PD  T+ ILIK+
Subjt:  LRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIAS-GCTPDELTFKILIKN

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic (e value: 2.9e-09; identity: 30.56%)
Query:  NIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEP-DLRLYHGMIMLM
        +I CG R  +  PL   R+ S EAIQ++QSLK A  T     +    L RL+K+DL   L EL RQ+   L++ V   ++ E  + P DL LY  ++  +
Subjt:  NIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEP-DLRLYHGMIMLM

Query:  GKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASG-----CTPDELTFKILIKNLEKFRE
         +NK  +  + +  ++       D +A  +++ A +  +  E  V  Y LM  SG        DE   ++L K L +  E
Subjt:  GKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASG-----CTPDELTFKILIKNLEKFRE

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic (e value: 1.8e-19; identity: 28.57%)
Query:  PPPNIICGLRKGSNRPLGL----SRVPSNEAIQAVQSLK-LAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLY
        P P  +   R    RP G      ++   EA+  +  LK L +   K++  I T + RLLK D+   + EL+RQ E  L++++F+ +Q +EW++PD+ +Y
Subjt:  PPPNIICGLRKGSNRPLGL----SRVPSNEAIQAVQSLK-LAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLY

Query:  HGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNL-------EKFREEFAVVVKEDCN
          +I+ + K+K ++ A  ++ K++K+ L PD++ + E++  +L+      A+  Y  M+ S   P+EL F++L+K L        K +++F  +  E   
Subjt:  HGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNL-------EKFREEFAVVVKEDCN

Query:  EYLDNPQKFF
           D P++ F
Subjt:  EYLDNPQKFF

Arabidopsis top hits (e value, % identity, alignment)
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 7.7e-18; identity: 27.75%)
Query:  SNEAIQAVQSLKLAKSTS-KMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDG
        S E + A + LK  ++ S +++  I + + RLLK+DL   L+E QRQN++ L +++++ ++ E W+ PD+  Y  M+M++ +NK ++  ++V+  L+K+ 
Subjt:  SNEAIQAVQSLKLAKSTS-KMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDG

Query:  LEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKEDCNEYL------DNPQKFFNDNGQKLTT
        +  D   F +++  +L  ++   A+  Y  M  S   P  L F++++K L  +  E    VK+D  E        D P+    D+ ++  T
Subjt:  LEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKEDCNEYL------DNPQKFFNDNGQKLTT

AT3G27750.1 FUNCTIONS IN: molecular_function unknown (e value: 2.0e-10; identity: 30.56%)
Query:  NIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEP-DLRLYHGMIMLM
        +I CG R  +  PL   R+ S EAIQ++QSLK A  T     +    L RL+K+DL   L EL RQ+   L++ V   ++ E  + P DL LY  ++  +
Subjt:  NIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEP-DLRLYHGMIMLM

Query:  GKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASG-----CTPDELTFKILIKNLEKFRE
         +NK  +  + +  ++       D +A  +++ A +  +  E  V  Y LM  SG        DE   ++L K L +  E
Subjt:  GKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASG-----CTPDELTFKILIKNLEKFRE

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.3e-20; identity: 28.57%)
Query:  PPPNIICGLRKGSNRPLGL----SRVPSNEAIQAVQSLK-LAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLY
        P P  +   R    RP G      ++   EA+  +  LK L +   K++  I T + RLLK D+   + EL+RQ E  L++++F+ +Q +EW++PD+ +Y
Subjt:  PPPNIICGLRKGSNRPLGL----SRVPSNEAIQAVQSLK-LAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLY

Query:  HGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNL-------EKFREEFAVVVKEDCN
          +I+ + K+K ++ A  ++ K++K+ L PD++ + E++  +L+      A+  Y  M+ S   P+EL F++L+K L        K +++F  +  E   
Subjt:  HGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNL-------EKFREEFAVVVKEDCN

Query:  EYLDNPQKFF
           D P++ F
Subjt:  EYLDNPQKFF

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain (e value: 3.9e-22; identity: 30.95%)
Query:  RKGSNRPLGLSRVPSNEAIQAVQSLKLA----------------KSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLR
        R  + +PL   R+ S EAIQAVQ+LK A                 S++ ++ VI +K  RLLK D+   L EL RQNE  L+L+VF+ ++ E W++P +R
Subjt:  RKGSNRPLGLSRVPSNEAIQAVQSLKLA----------------KSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLR

Query:  LYHGMIMLMGKNKMIEMAEEVFHKLRKD-GLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFRE-EFAVVVKEDCNEYL
        +Y  MI +M  N ++E    ++  ++ + GL  +   FN ++   L   + +  ++ Y  M + G  PD  +F++L+  LE   E   + +V++D +EY 
Subjt:  LYHGMIMLMGKNKMIEMAEEVFHKLRKD-GLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFRE-EFAVVVKEDCNEYL

Query:  DNPQKFFNDN
            +F  ++
Subjt:  DNPQKFFNDN

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.3e-09; identity: 26.45%)
Query:  SRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLG--RLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHK
        +++ S +AI  +   +  KS      +I  K G  +LL   + ++L E       E ++QVF+ ++ + W++P++ +Y  +I+++GK K  E A E+F +
Subjt:  SRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLG--RLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHK

Query:  LRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIAS-GCTPDELTFKILIKN
        +  +G   +   +  ++ AY +    + A      M +S  C PD  T+ ILIK+
Subjt:  LRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIAS-GCTPDELTFKILIKN
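
The % identity values in the hit tables can be related to the alignment rows. In BLAST-style output the unlabeled middle row shows a letter where the two residues are identical, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below assumes the reported figure is identical positions divided by alignment length; `identity_counts` and `percent_identity` are hypothetical helper names, and the caller is assumed to pass the middle row trimmed to exactly the width of the aligned query row.

```python
def identity_counts(midline: str) -> tuple[int, int]:
    """Return (identical positions, alignment length) for one midline.

    A letter marks an identical residue; '+' and ' ' are non-identical.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(midline)


def percent_identity(midline: str) -> float:
    """Percent identity over the alignment columns covered by the midline."""
    identical, length = identity_counts(midline)
    return 100.0 * identical / length


# Middle row of the last alignment block of the KGN53003.1 hit.
block = "LIKNLEKFREEFAVVVK+DCNEYLDNPQKFFNDNGQKLTTKVRIL"
print(identity_counts(block), round(percent_identity(block), 2))
```

For a multi-block alignment, concatenate the middle rows of all blocks before calling `percent_identity`. A single non-identical column over 245 aligned positions, for example, gives 244/245, which is consistent with the 99.59 reported for the closest hits.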


Sequences
CDS sequence
ATGAAATCTACTCTAGTGGGTCCTCTTCAACTCCATTTTCTTCAATTGGGTCTTCGCCAAAACCTCACGAACCGAAGCCTCCGTTGTGGTACGGCAGCTCCACCACCAAA
TATCATTTGTGGCCTCAGAAAGGGCTCGAACAGGCCCTTAGGGTTGTCAAGGGTGCCCTCCAATGAGGCAATTCAAGCAGTTCAATCTCTCAAGCTTGCTAAATCTACCT
CCAAAATGGAAGACGTTATCAATACCAAGCTCGGCAGATTGCTTAAAGCAGACTTGTTTGATGCTCTGTCTGAATTACAAAGGCAAAATGAACTGGAATTATCGCTTCAG
GTCTTCAAATTTATGCAAAATGAAGAATGGTTCGAGCCAGATTTAAGGTTATACCATGGAATGATTATGCTGATGGGAAAGAACAAAATGATTGAAATGGCTGAAGAGGT
TTTTCATAAGTTAAGAAAGGATGGGTTAGAACCAGATACAAGAGCTTTCAATGAAATGATGGGAGCATATCTGCAAGTGGACATGATCGAAAGAGCTGTTGAGACATACA
GATTGATGATAGCTTCAGGTTGTACTCCAGATGAACTGACTTTCAAGATCTTGATAAAGAATCTTGAGAAATTTAGGGAAGAATTTGCTGTAGTGGTGAAGGAAGACTGT
AATGAGTACTTGGATAATCCTCAGAAGTTTTTCAACGATAACGGGCAGAAACTGACAACGAAAGTTCGAATTCTTTAA
mRNA sequence
CCTAAGTAGAAAGTTAGAATTCATAGATGTGTGAGAAAGTATCGTAACATGATTGAAAAAAGAGTCATCTATACAAAAAAGCTTCCAAGGGATACGAAAATTCTATTATT
AAATGGCTTGCTTTTTAAGTGATAAACCAAAACTGAGATAAACCAAAAAAGCTTCGAAGGCTCTTCTATTGCTATCCCGACAAACTGAGATATACCGAAGTCGCAACGCC
GAGAGCGTACACTGCAACCACAGTTCCAACCACATCCTTTCCGGACCGAGTCACGGCGCCGATTCCGGCGATTCCGGCGATTCCCACCGGAACTCGAACCATCACTTTCT
TTCCCACGTTCTTCACCACAGCCTCTGCGAATAGTTCAATCGCCCAACGCTTAAACAGGATCAGTCCTAGTGCTTCGTCTTCACGATCCGAGTCATCGCGATTGACGGCT
CGTAGTGCTGAGTCCACCTGGAGACGAGTCGGGGGAGAGGGAAGCCAGTTGGTGATTAGTGGATGCGGCTGGACGACGGTGAGGAGGTGGTGAAGGTGGTCGGCGGCGGT
GCAGAGTTGATAGGGGAAAACGCCGTTGTATGCGTGCTGTGTTAGGGCAAGGCAGTGGGAGAAGGTTGAATTGCAGGCTGAGTCAAATTCATGCGAGTTGGTAAGTGAGT
TTCTGACTCGTTTCGAAGCTGATATTCCCATTATCTTTCTTCCTCGCTCTCGCTCTCTCCTTCCCTCCACCAATTCTGGTTTTGAAGGACACTTTAATAAATATCTGAAG
ACAAAAGCCTTTTATATCAAGTGTAGTTTTCATATAGATTAGAAAAATATCTTTAAACCTTTCCAATAAGCTTCTCAAAAATTTTAAGAATACCCATACCATTAGATTTC
TGCTACATCTCATCTTCACTGAACTGAGTTTCTCGCAAAAGGGCGGTTGGAGATGAAATCTACTCTAGTGGGTCCTCTTCAACTCCATTTTCTTCAATTGGGTCTTCGCC
AAAACCTCACGAACCGAAGCCTCCGTTGTGGTACGGCAGCTCCACCACCAAATATCATTTGTGGCCTCAGAAAGGGCTCGAACAGGCCCTTAGGGTTGTCAAGGGTGCCC
TCCAATGAGGCAATTCAAGCAGTTCAATCTCTCAAGCTTGCTAAATCTACCTCCAAAATGGAAGACGTTATCAATACCAAGCTCGGCAGATTGCTTAAAGCAGACTTGTT
TGATGCTCTGTCTGAATTACAAAGGCAAAATGAACTGGAATTATCGCTTCAGGTCTTCAAATTTATGCAAAATGAAGAATGGTTCGAGCCAGATTTAAGGTTATACCATG
GAATGATTATGCTGATGGGAAAGAACAAAATGATTGAAATGGCTGAAGAGGTTTTTCATAAGTTAAGAAAGGATGGGTTAGAACCAGATACAAGAGCTTTCAATGAAATG
ATGGGAGCATATCTGCAAGTGGACATGATCGAAAGAGCTGTTGAGACATACAGATTGATGATAGCTTCAGGTTGTACTCCAGATGAACTGACTTTCAAGATCTTGATAAA
GAATCTTGAGAAATTTAGGGAAGAATTTGCTGTAGTGGTGAAGGAAGACTGTAATGAGTACTTGGATAATCCTCAGAAGTTTTTCAACGATAACGGGCAGAAACTGACAA
CGAAAGTTCGAATTCTTTAAACATGCAGTTGGGTTACTTCTGGGAATGTATTTGATTTATTATCTGATCAATGTCTTTGGGATTCTCATAATCCTTGAATACTTTGTAAC
AGCTAGTAATGAATGAAGGTAAACTCTAGTCCTAGAGGTTCTCTCATTAACTAATAACTATGATGCGAGTGGC
Protein sequence
MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQ
VFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKEDC
NEYLDNPQKFFNDNGQKLTTKVRIL
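
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper, not a function provided by the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGAAATCTACTCTAGTGGGTCCTCTTCAA"))  # MKSTLVGPLQ
print(translate("GTTCGAATTCTTTAA"))  # VRIL
```

Passing the full CDS (with line breaks left in) should give the protein sequence listed above if the two records are consistent.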