CuGenDBv2

Cucsat.G16598 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G16598
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Pentatricopeptide repeat-containing protein
Genome location: ctg24:1354641..1356984
RNA-Seq Expression: Cucsat.G16598
Synteny: Cucsat.G16598
Gene Ontology terms:
  GO:0000373 - Group II intron splicing (biological process)
  GO:0009658 - chloroplast organization (biological process)
  GO:0003723 - RNA binding (molecular function)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR044190 - Protein THYLAKOID ASSEMBLY 8-like
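
The genome location in this record follows a `seqid:start..end` pattern. A minimal sketch of splitting such a string and computing the span it covers, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any database API:

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("ctg24:1354641..1356984")
print(seqid, start, end, span_length(start, end))
```

The span is the genomic extent of the gene model, so it can be longer than the CDS listed further down the record.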


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0044536.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | e-value: 8.19e-156 | % identity: 91.29
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLHF QLGLRQNLTNRSL C TAAPPPNIICGLRKG  +PLG SRVPSNEAIQAVQSLKLAK TSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMIM+MGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERA +TYRLMIASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTK
        LIKNLEKF+EEFA+VVKKDC+EYLDNPQKFFND GQKLTTK
Subjt:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTK

KGN53003.1 hypothetical protein Csa_015115 [Cucumis sativus] | e-value: 1.34e-173 | % identity: 100
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL
        LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL
Subjt:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL

XP_004152311.2 pentatricopeptide repeat-containing protein At1g62350 [Cucumis sativus] | e-value: 1.61e-178 | % identity: 100
Query:  MRVGRLEMKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLF
        MRVGRLEMKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLF
Subjt:  MRVGRLEMKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLF

Query:  DALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTP
        DALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTP
Subjt:  DALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTP

Query:  DELTFKILIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL
        DELTFKILIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL
Subjt:  DELTFKILIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL

XP_008454079.1 PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Cucumis melo] | e-value: 6.72e-155 | % identity: 90.87
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLHF +LGLRQNLTNRSL C TAAPPPNIICGLRKG  +PLG SRVPSNEAIQAVQSLKLAK TSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMIM+MGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERA +TYRLMIASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTK
        LIKNLEKF+EEFA+VVKKDC EYLDNPQKFFND GQKLTTK
Subjt:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTK

XP_038878176.1 pentatricopeptide repeat-containing protein At1g62350-like [Benincasa hispida] | e-value: 4.04e-143 | % identity: 84.08
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQ HF QLGLRQ+LTN SL C TAAPPPNIICGLRKG  +PLG SRVPS EAIQAVQSLKLAKSTSKMEDVIN+KL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKF+QNEEW+EPDLRLYHGMI++ GKNKMIEMAEE+FHKL+KDGLEPD RAFNEMMGAYLQVDM+ERAVETY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL
        LIKNLEKFREEFA VVKK+CN YLD+P+KF ND  QK T K RIL
Subjt:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KU23 Uncharacterized protein | e-value: 6.47e-174 | % identity: 100
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL
        LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL
Subjt:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL

A0A1S3BX97 pentatricopeptide repeat-containing protein At1g62350-like | e-value: 3.25e-155 | % identity: 90.87
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLHF +LGLRQNLTNRSL C TAAPPPNIICGLRKG  +PLG SRVPSNEAIQAVQSLKLAK TSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMIM+MGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERA +TYRLMIASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTK
        LIKNLEKF+EEFA+VVKKDC EYLDNPQKFFND GQKLTTK
Subjt:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTK

A0A5A7TQZ8 Pentatricopeptide repeat-containing protein | e-value: 3.96e-156 | % identity: 91.29
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLHF QLGLRQNLTNRSL C TAAPPPNIICGLRKG  +PLG SRVPSNEAIQAVQSLKLAK TSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMIM+MGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERA +TYRLMIASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTK
        LIKNLEKF+EEFA+VVKKDC+EYLDNPQKFFND GQKLTTK
Subjt:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTK

A0A5D3CZ95 Pentatricopeptide repeat-containing protein | e-value: 3.25e-155 | % identity: 90.87
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLHF +LGLRQNLTNRSL C TAAPPPNIICGLRKG  +PLG SRVPSNEAIQAVQSLKLAK TSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMIM+MGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERA +TYRLMIASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTK
        LIKNLEKF+EEFA+VVKKDC EYLDNPQKFFND GQKLTTK
Subjt:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTK

A0A6J1IY98 pentatricopeptide repeat-containing protein At1g62350-like | e-value: 3.17e-135 | % identity: 80
Query:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ
        MKSTL+G LQLH  QLG RQNLTN +L C TA PPPNIICGLRKG  +PLG SRVPS E+IQAVQSLKLAKS SKMEDVIN+KL RLLKADLFDAL+ELQ
Subjt:  MKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQ

Query:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI
        RQNELELSLQVFKFM+NEEW+EPDL LYH MI +MGKNKMIEMAEEVFH  ++DGLEPDTRAFNEMMGAYLQVDM+ERAVETY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKI

Query:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL
        LIKNLE+FREEFA VVKK+C E+LD+P+KF  D  QKL  K++IL
Subjt:  LIKNLEKFREEFAVVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL

SwissProt top hits (e-value, % identity, alignment)
A7LN87 Pentatricopeptide repeat-containing protein PPR5, chloroplastic | e-value: 2.2e-09 | % identity: 25.56
Query:  EAIQAVQSL--KLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGL
        EA + V+SL  + A    ++  V++  +  +     F    EL R++     L VF++MQ + W+  D  +Y  +I +MG+   I MA  +F ++R  G 
Subjt:  EAIQAVQSL--KLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGL

Query:  EPDTRAFNEMMGAYL----QVDMIERAVETY-RLMIASGCTPDELTFKILIKNLEKFREEFAV-VVKKDCNEYLDNPQKF
        +PDT  +N ++GA+L    +   + +A+  + ++     C P  +T+ IL++   +  +   V ++ KD +E + +P  +
Subjt:  EPDTRAFNEMMGAYL----QVDMIERAVETY-RLMIASGCTPDELTFKILIKNLEKFREEFAV-VVKKDCNEYLDNPQKF

Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350 | e-value: 3.2e-16 | % identity: 27.75
Query:  SNEAIQAVQSLKLAKSTS-KMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDG
        S E + A + LK  ++ S +++  I + + RLLK+DL   L+E QRQN++ L +++++ ++ E W+ PD+  Y  M+M++ +NK ++  ++V+  L+K+ 
Subjt:  SNEAIQAVQSLKLAKSTS-KMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDG

Query:  LEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKKDCNEYL------DNPQKFFNDNGQKLTT
        +  D   F +++  +L  ++   A+  Y  M  S   P  L F++++K L  +  E    VK D  E        D P+    D+ ++  T
Subjt:  LEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKKDCNEYL------DNPQKFFNDNGQKLTT

Q9FKC3 Pentatricopeptide repeat-containing protein At5g48730, chloroplastic | e-value: 1.9e-08 | % identity: 26.45
Query:  SRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLG--RLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHK
        +++ S +AI  +   +  KS      +I  K G  +LL   + ++L E       E ++QVF+ ++ + W++P++ +Y  +I+++GK K  E A E+F +
Subjt:  SRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLG--RLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHK

Query:  LRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIAS-GCTPDELTFKILIKN
        +  +G   +   +  ++ AY +    + A      M +S  C PD  T+ ILIK+
Subjt:  LRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIAS-GCTPDELTFKILIKN

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic | e-value: 2.9e-09 | % identity: 30.56
Query:  NIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEP-DLRLYHGMIMLM
        +I CG R  +  PL   R+ S EAIQ++QSLK A  T     +    L RL+K+DL   L EL RQ+   L++ V   ++ E  + P DL LY  ++  +
Subjt:  NIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEP-DLRLYHGMIMLM

Query:  GKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASG-----CTPDELTFKILIKNLEKFRE
         +NK  +  + +  ++       D +A  +++ A +  +  E  V  Y LM  SG        DE   ++L K L +  E
Subjt:  GKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASG-----CTPDELTFKILIKNLEKFRE

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic | e-value: 4.1e-19 | % identity: 30.51
Query:  PPPNIICGLRKGSNRPLGL----SRVPSNEAIQAVQSLK-LAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLY
        P P  +   R    RP G      ++   EA+  +  LK L +   K++  I T + RLLK D+   + EL+RQ E  L++++F+ +Q +EW++PD+ +Y
Subjt:  PPPNIICGLRKGSNRPLGL----SRVPSNEAIQAVQSLK-LAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLY

Query:  HGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNL
          +I+ + K+K ++ A  ++ K++K+ L PD++ + E++  +L+      A+  Y  M+ S   P+EL F++L+K L
Subjt:  HGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNL

Arabidopsis top hits (e-value, % identity, alignment)
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 2.3e-17 | % identity: 27.75
Query:  SNEAIQAVQSLKLAKSTS-KMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDG
        S E + A + LK  ++ S +++  I + + RLLK+DL   L+E QRQN++ L +++++ ++ E W+ PD+  Y  M+M++ +NK ++  ++V+  L+K+ 
Subjt:  SNEAIQAVQSLKLAKSTS-KMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDG

Query:  LEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKKDCNEYL------DNPQKFFNDNGQKLTT
        +  D   F +++  +L  ++   A+  Y  M  S   P  L F++++K L  +  E    VK D  E        D P+    D+ ++  T
Subjt:  LEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFREEFAVVVKKDCNEYL------DNPQKFFNDNGQKLTT

AT3G27750.1 FUNCTIONS IN: molecular_function unknown | e-value: 2.1e-10 | % identity: 30.56
Query:  NIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEP-DLRLYHGMIMLM
        +I CG R  +  PL   R+ S EAIQ++QSLK A  T     +    L RL+K+DL   L EL RQ+   L++ V   ++ E  + P DL LY  ++  +
Subjt:  NIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEP-DLRLYHGMIMLM

Query:  GKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASG-----CTPDELTFKILIKNLEKFRE
         +NK  +  + +  ++       D +A  +++ A +  +  E  V  Y LM  SG        DE   ++L K L +  E
Subjt:  GKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASG-----CTPDELTFKILIKNLEKFRE

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 2.9e-20 | % identity: 30.51
Query:  PPPNIICGLRKGSNRPLGL----SRVPSNEAIQAVQSLK-LAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLY
        P P  +   R    RP G      ++   EA+  +  LK L +   K++  I T + RLLK D+   + EL+RQ E  L++++F+ +Q +EW++PD+ +Y
Subjt:  PPPNIICGLRKGSNRPLGL----SRVPSNEAIQAVQSLK-LAKSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLY

Query:  HGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNL
          +I+ + K+K ++ A  ++ K++K+ L PD++ + E++  +L+      A+  Y  M+ S   P+EL F++L+K L
Subjt:  HGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain | e-value: 6.9e-22 | % identity: 30.95
Query:  RKGSNRPLGLSRVPSNEAIQAVQSLKLA----------------KSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLR
        R  + +PL   R+ S EAIQAVQ+LK A                 S++ ++ VI +K  RLLK D+   L EL RQNE  L+L+VF+ ++ E W++P +R
Subjt:  RKGSNRPLGLSRVPSNEAIQAVQSLKLA----------------KSTSKMEDVINTKLGRLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLR

Query:  LYHGMIMLMGKNKMIEMAEEVFHKLRKD-GLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFRE-EFAVVVKKDCNEYL
        +Y  MI +M  N ++E    ++  ++ + GL  +   FN ++   L   + +  ++ Y  M + G  PD  +F++L+  LE   E   + +V++D +EY 
Subjt:  LYHGMIMLMGKNKMIEMAEEVFHKLRKD-GLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFRE-EFAVVVKKDCNEYL

Query:  DNPQKFFNDN
            +F  ++
Subjt:  DNPQKFFNDN

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 1.3e-09 | % identity: 26.45
Query:  SRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLG--RLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHK
        +++ S +AI  +   +  KS      +I  K G  +LL   + ++L E       E ++QVF+ ++ + W++P++ +Y  +I+++GK K  E A E+F +
Subjt:  SRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLG--RLLKADLFDALSELQRQNELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHK

Query:  LRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIAS-GCTPDELTFKILIKN
        +  +G   +   +  ++ AY +    + A      M +S  C PD  T+ ILIK+
Subjt:  LRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIAS-GCTPDELTFKILIKN


Sequences
CDS sequence
ATGCGAGTTGGGCGGTTGGAGATGAAATCTACTCTAGTGGGTCCTCTTCAACTCCATTTTCTTCAATTGGGTCTTCGCCAAAACCTCACGAACCGAAGCCTCCGTTGTGG
TACGGCAGCTCCACCTCCAAATATCATTTGTGGCCTCAGAAAGGGCTCGAACAGGCCCTTAGGGTTGTCAAGGGTGCCCTCCAATGAGGCAATTCAAGCAGTTCAATCTC
TCAAGCTTGCTAAATCCACCTCCAAAATGGAAGACGTTATCAATACCAAGCTCGGCAGATTGCTTAAAGCAGACTTGTTTGATGCTCTGTCTGAATTACAAAGGCAAAAT
GAACTGGAATTATCGCTTCAGGTCTTCAAATTTATGCAAAATGAAGAATGGTTCGAGCCAGATTTAAGGTTATACCATGGAATGATTATGCTGATGGGAAAGAACAAAAT
GATTGAAATGGCTGAAGAGGTTTTTCATAAGTTAAGAAAGGATGGGTTAGAACCAGATACAAGAGCTTTCAATGAAATGATGGGAGCATATCTGCAAGTGGACATGATCG
AAAGAGCTGTTGAGACATACAGATTGATGATAGCTTCAGGTTGTACTCCAGATGAACTGACTTTCAAGATCTTGATAAAGAATCTTGAGAAATTTAGGGAAGAATTTGCT
GTAGTGGTGAAGAAAGACTGTAATGAGTACTTGGATAATCCTCAGAAGTTTTTCAACGATAACGGGCAGAAACTGACAACGAAAGTTCGAATTCTTTAA
mRNA sequence
ATGCGAGTTGGGCGGTTGGAGATGAAATCTACTCTAGTGGGTCCTCTTCAACTCCATTTTCTTCAATTGGGTCTTCGCCAAAACCTCACGAACCGAAGCCTCCGTTGTGG
TACGGCAGCTCCACCTCCAAATATCATTTGTGGCCTCAGAAAGGGCTCGAACAGGCCCTTAGGGTTGTCAAGGGTGCCCTCCAATGAGGCAATTCAAGCAGTTCAATCTC
TCAAGCTTGCTAAATCCACCTCCAAAATGGAAGACGTTATCAATACCAAGCTCGGCAGATTGCTTAAAGCAGACTTGTTTGATGCTCTGTCTGAATTACAAAGGCAAAAT
GAACTGGAATTATCGCTTCAGGTCTTCAAATTTATGCAAAATGAAGAATGGTTCGAGCCAGATTTAAGGTTATACCATGGAATGATTATGCTGATGGGAAAGAACAAAAT
GATTGAAATGGCTGAAGAGGTTTTTCATAAGTTAAGAAAGGATGGGTTAGAACCAGATACAAGAGCTTTCAATGAAATGATGGGAGCATATCTGCAAGTGGACATGATCG
AAAGAGCTGTTGAGACATACAGATTGATGATAGCTTCAGGTTGTACTCCAGATGAACTGACTTTCAAGATCTTGATAAAGAATCTTGAGAAATTTAGGGAAGAATTTGCT
GTAGTGGTGAAGAAAGACTGTAATGAGTACTTGGATAATCCTCAGAAGTTTTTCAACGATAACGGGCAGAAACTGACAACGAAAGTTCGAATTCTTTAA
Protein sequence
MRVGRLEMKSTLVGPLQLHFLQLGLRQNLTNRSLRCGTAAPPPNIICGLRKGSNRPLGLSRVPSNEAIQAVQSLKLAKSTSKMEDVINTKLGRLLKADLFDALSELQRQN
ELELSLQVFKFMQNEEWFEPDLRLYHGMIMLMGKNKMIEMAEEVFHKLRKDGLEPDTRAFNEMMGAYLQVDMIERAVETYRLMIASGCTPDELTFKILIKNLEKFREEFA
VVVKKDCNEYLDNPQKFFNDNGQKLTTKVRIL
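
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` is a hypothetical helper written for illustration, not a function provided by the database. The input shown is only the first 33 bases of the CDS from this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 bases of the CDS listed in this record.
cds_prefix = "ATGCGAGTTGGGCGGTTGGAGATGAAATCTACT"
print(translate(cds_prefix))  # MRVGRLEMKST
```

Passing the full CDS, with line breaks left in, should reproduce the protein sequence listed above if both were derived from the same gene model.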