; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0015831 (gene) of Chayote v1 genome

Gene IDSed0015831
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG08:19631865..19633421
RNA-Seq ExpressionSed0015831
SyntenySed0015831
Gene Ontology termsGO:1900865 - chloroplast RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK18848.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]1.1e-24682.1Show/hide
Query:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI
        MPLKP KPSISIAQ TQIHA L+TNP   I NPLLG+LV+S  PENG FL+NQML YPSSHNH+TFTYALKACC LHQTQ GL+IHA L+KSGHL DIFI
Subjt:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI

Query:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN
        QNSLLHFYI +GDV SAS IFDS+P+PDVVSWTSIISGLSKLGFEKEAL KFLSMNV PN  T V+ALSACS LRCL++GKAIHGL LR+LNEE+VSL+N
Subjt:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN

Query:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG
        ALL FY RC  L+S ENLF+KMHKRDVVSWTT+IGGYAQSGLCEEAVRVFQNMVHV GEA PNEATL+NVL ACSS+SALH+GQWVHSYINSR DVI+DG
Subjt:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG

Query:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP
        NVGNALINMYVKCG+MEMAI+IF  +EHKDIISWST+ISGLAMNGLG+QAF LFSLM+VHGISPD +TFLGLLSACSHGGLI+QG MVFEAMKDVYN+ P
Subjt:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP

Query:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK
        Q+ HY CMVD+YG+AGLLDEAEAFIKEMPMEAEG VWGA+LHACQIHGNEKKY KV E LL SKGV++G FALLSNTYASCDRWNDAN+VR  MRSRGLK
Subjt:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK

Query:  KIAGCSWIELVDAS
        K+AGCSWIELV+ S
Subjt:  KIAGCSWIELVDAS

XP_008450427.1 PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucumis melo]2.1e-24581.71Show/hide
Query:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI
        MPLKP KPSIS AQ TQIHA L+TNP   I NPLLG+LV+S  PENG FL+NQML YPSSHNH+TFTYALKACC LHQTQ GL+IHA L+KSGHL DIFI
Subjt:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI

Query:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN
        QNSLLHFYI HGDV SAS IFDS+P+PDVVSWTSIISG SKLGFEKEAL KFLSMNV PN  T V+ALSACS LR L++GKAIHGL LR+LNEE+VSL+N
Subjt:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN

Query:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG
        ALL FY RC  L+S ENLF+KMHKRDVVSWTT+IGGYAQSGLCEEAVRVFQNMVH  GEA PNEATL+NVL ACSS+SALH+GQWVHSYINSR DVI+DG
Subjt:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG

Query:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP
        NVGNALINMYVKCG+MEMAI+IFK +EHKDIISWST+ISGLAMNGLG+QAF LFSLM+VHGISPD +TFLGLLSACSHGGLI+QG MVFEAMKDVYN+ P
Subjt:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP

Query:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK
        Q+ HY CMVD+YG+AGLLDEAEAFIKEMPMEAEG VWGA+LHACQIHGNEKKY KV E LL SKGV++G FALLSNTYASCDRWNDAN+VR  MRSRGLK
Subjt:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK

Query:  KIAGCSWIELVDAS
        K+AGCSWIELV+ S
Subjt:  KIAGCSWIELVDAS

XP_011660133.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Cucumis sativus]1.5e-24681.43Show/hide
Query:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI
        MPLKP KPSISIAQ TQIHA L+TNP   I NPLLG+LV+S  PENG FL+NQMLRYPSSHNH+TFTYALKACC LHQTQ GL+IHA L+KSGHL DIFI
Subjt:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI

Query:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN
        QNSLLHFYI  GDV SAS IFDS+PDPDVVSWTSIISGLSKLGFEKEAL KFLSMNV PN  T V+ALSACS LRCL++GKAIHGL +R+LNEE+V L+N
Subjt:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN

Query:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG
        ALL FY RC  L+S ENLF+KM KRDVVSWTT+IGGYAQSGLCEEAVRVFQNMVHV GEA PNEATL+NVL ACSS+SALH+GQWVHSYINSR DVI+DG
Subjt:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG

Query:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP
        NVGNALINMYVKCG+MEMAI+IFK +EHKDI+SWSTIISGLAMNGLG+QAF LFSLM+VHG+SPD +TFLGLLSACSHGGLI+QG MVFEAMKDVYN+ P
Subjt:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP

Query:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK
        QM HY CMVD+YG+AGLLDEAEAFIKEMPMEAEG VWGA+LHACQ+HGNEKKY KV EWLL SKGV++GTFALLSNTYA CDRWNDAN+VR  MRSRGLK
Subjt:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK

Query:  KIAGCSWIELVDASIAL
        K+AG SWIE+VD++  L
Subjt:  KIAGCSWIELVDASIAL

XP_022960642.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita moschata]5.8e-24380.86Show/hide
Query:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI
        MP +PPK  IS AQL+QIHA L+TNP  R+ NPLLGALV S  PENG FL+NQMLR+PSSHNHYTFTYALKAC  LH+T  GL+IHARL+KSGHL DIFI
Subjt:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI

Query:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN
        QNSLLHFYI  GDVPSASR+FDS+PDPDVVSWTSIISGLSKLGF++EAL KFLSMNV PN AT VSALSACS LRC+++GKAIHGL LRSLNEESV+LDN
Subjt:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN

Query:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG
        ALL FY RCGSL+  +NLFD+M +RDVVSWTT+IGGYA +GLCEEAVRVFQNMVH   EA PNEATLINVL ACSSMSALH+GQWVHSYINSR DVI+DG
Subjt:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG

Query:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP
        N+GNALINMYVKCGSM+ AI IFK VEHKDIISWSTIISGLAMNG G+QAFGLFSLM+VHGI+PDA+TFL LLSACSHGGLI+QG MVFEAMKDVYNV P
Subjt:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP

Query:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK
        +M HY CMVD+YG+AGLLDEAEAFIKEMP+EAEG VWGA+LHACQ+HGNE +Y KV +WLLSSK +++GT+ALLSNTYASCDRWNDANEVRD MRSRGLK
Subjt:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK

Query:  KIAGCSWIELVD
        K+AGCSWIEL D
Subjt:  KIAGCSWIELVD

XP_038878297.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Benincasa hispida]1.4e-25283.37Show/hide
Query:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI
        M LKPPKPSI IAQL QIH +L+ NP   ILNPLLG+LV+S  PENG FL+NQMLRYPSSHNH+TFTYALKACC LH+TQ GL+IHA L+KSGHL DIF+
Subjt:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI

Query:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN
        QNSLLHFYI  GDVPSASRIFDS+PDPDV+SWTSIISGLSKLGFEKEAL KFLSMNV PN  T V+ALSACS LRCL++GKAIHGL LRSLNEE+VSLDN
Subjt:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN

Query:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG
        ALL FY RCG L+S E LFD+M KRDVVSWTT+IGGYAQ GLCEEAVRVFQNMVHV GEA PNEATLINVL ACSS+SALH+GQWVHSYINSR DVI+DG
Subjt:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG

Query:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP
        NVGNALINMYVKCG+MEMAI+IFK +EHKDIISWSTIISGLAMNGLG QAFGLFSLM+VHGISPD +TFL LLSACSHGGLI+QG MVFEAMKDVYN+ P
Subjt:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP

Query:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK
        QM HY CMVD+YG+AGLLDEAEAFIKEMPMEAEG VWGA+LHACQIHGNEKKY KV EWLL SKGV++GTFALLSNTYASCDRWNDANEVRDTMRS+GLK
Subjt:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK

Query:  KIAGCSWIELVDASIAL
        K+AGCSWIELVD+S +L
Subjt:  KIAGCSWIELVDASIAL

TrEMBL top hitse value%identityAlignment
A0A0A0LXJ1 Uncharacterized protein7.1e-24781.43Show/hide
Query:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI
        MPLKP KPSISIAQ TQIHA L+TNP   I NPLLG+LV+S  PENG FL+NQMLRYPSSHNH+TFTYALKACC LHQTQ GL+IHA L+KSGHL DIFI
Subjt:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI

Query:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN
        QNSLLHFYI  GDV SAS IFDS+PDPDVVSWTSIISGLSKLGFEKEAL KFLSMNV PN  T V+ALSACS LRCL++GKAIHGL +R+LNEE+V L+N
Subjt:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN

Query:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG
        ALL FY RC  L+S ENLF+KM KRDVVSWTT+IGGYAQSGLCEEAVRVFQNMVHV GEA PNEATL+NVL ACSS+SALH+GQWVHSYINSR DVI+DG
Subjt:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG

Query:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP
        NVGNALINMYVKCG+MEMAI+IFK +EHKDI+SWSTIISGLAMNGLG+QAF LFSLM+VHG+SPD +TFLGLLSACSHGGLI+QG MVFEAMKDVYN+ P
Subjt:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP

Query:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK
        QM HY CMVD+YG+AGLLDEAEAFIKEMPMEAEG VWGA+LHACQ+HGNEKKY KV EWLL SKGV++GTFALLSNTYA CDRWNDAN+VR  MRSRGLK
Subjt:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK

Query:  KIAGCSWIELVDASIAL
        K+AG SWIE+VD++  L
Subjt:  KIAGCSWIELVDASIAL

A0A1S4DY27 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X11.0e-24581.71Show/hide
Query:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI
        MPLKP KPSIS AQ TQIHA L+TNP   I NPLLG+LV+S  PENG FL+NQML YPSSHNH+TFTYALKACC LHQTQ GL+IHA L+KSGHL DIFI
Subjt:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI

Query:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN
        QNSLLHFYI HGDV SAS IFDS+P+PDVVSWTSIISG SKLGFEKEAL KFLSMNV PN  T V+ALSACS LR L++GKAIHGL LR+LNEE+VSL+N
Subjt:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN

Query:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG
        ALL FY RC  L+S ENLF+KMHKRDVVSWTT+IGGYAQSGLCEEAVRVFQNMVH  GEA PNEATL+NVL ACSS+SALH+GQWVHSYINSR DVI+DG
Subjt:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG

Query:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP
        NVGNALINMYVKCG+MEMAI+IFK +EHKDIISWST+ISGLAMNGLG+QAF LFSLM+VHGISPD +TFLGLLSACSHGGLI+QG MVFEAMKDVYN+ P
Subjt:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP

Query:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK
        Q+ HY CMVD+YG+AGLLDEAEAFIKEMPMEAEG VWGA+LHACQIHGNEKKY KV E LL SKGV++G FALLSNTYASCDRWNDAN+VR  MRSRGLK
Subjt:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK

Query:  KIAGCSWIELVDAS
        K+AGCSWIELV+ S
Subjt:  KIAGCSWIELVDAS

A0A5D3D5L6 Pentatricopeptide repeat-containing protein5.5e-24782.1Show/hide
Query:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI
        MPLKP KPSISIAQ TQIHA L+TNP   I NPLLG+LV+S  PENG FL+NQML YPSSHNH+TFTYALKACC LHQTQ GL+IHA L+KSGHL DIFI
Subjt:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI

Query:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN
        QNSLLHFYI +GDV SAS IFDS+P+PDVVSWTSIISGLSKLGFEKEAL KFLSMNV PN  T V+ALSACS LRCL++GKAIHGL LR+LNEE+VSL+N
Subjt:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN

Query:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG
        ALL FY RC  L+S ENLF+KMHKRDVVSWTT+IGGYAQSGLCEEAVRVFQNMVHV GEA PNEATL+NVL ACSS+SALH+GQWVHSYINSR DVI+DG
Subjt:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG

Query:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP
        NVGNALINMYVKCG+MEMAI+IF  +EHKDIISWST+ISGLAMNGLG+QAF LFSLM+VHGISPD +TFLGLLSACSHGGLI+QG MVFEAMKDVYN+ P
Subjt:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP

Query:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK
        Q+ HY CMVD+YG+AGLLDEAEAFIKEMPMEAEG VWGA+LHACQIHGNEKKY KV E LL SKGV++G FALLSNTYASCDRWNDAN+VR  MRSRGLK
Subjt:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK

Query:  KIAGCSWIELVDAS
        K+AGCSWIELV+ S
Subjt:  KIAGCSWIELVDAS

A0A6J1H7Z9 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X12.8e-24380.86Show/hide
Query:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI
        MP +PPK  IS AQL+QIHA L+TNP  R+ NPLLGALV S  PENG FL+NQMLR+PSSHNHYTFTYALKAC  LH+T  GL+IHARL+KSGHL DIFI
Subjt:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI

Query:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN
        QNSLLHFYI  GDVPSASR+FDS+PDPDVVSWTSIISGLSKLGF++EAL KFLSMNV PN AT VSALSACS LRC+++GKAIHGL LRSLNEESV+LDN
Subjt:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN

Query:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG
        ALL FY RCGSL+  +NLFD+M +RDVVSWTT+IGGYA +GLCEEAVRVFQNMVH   EA PNEATLINVL ACSSMSALH+GQWVHSYINSR DVI+DG
Subjt:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG

Query:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP
        N+GNALINMYVKCGSM+ AI IFK VEHKDIISWSTIISGLAMNG G+QAFGLFSLM+VHGI+PDA+TFL LLSACSHGGLI+QG MVFEAMKDVYNV P
Subjt:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP

Query:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK
        +M HY CMVD+YG+AGLLDEAEAFIKEMP+EAEG VWGA+LHACQ+HGNE +Y KV +WLLSSK +++GT+ALLSNTYASCDRWNDANEVRD MRSRGLK
Subjt:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK

Query:  KIAGCSWIELVD
        K+AGCSWIEL D
Subjt:  KIAGCSWIELVD

A0A6J1JC26 pentatricopeptide repeat-containing protein At4g38010-like isoform X14.9e-24080.08Show/hide
Query:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI
        MP KPPK  IS AQL+QIHA L+TNP   + NPLLGAL+ S  PENG FL+NQMLR+PSSHNHYTFTYALKAC  LH+T  GL+IHARL+KSGHL DIFI
Subjt:  MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFI

Query:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN
        QNSLLHFYI  GDV SASR+FDS+PDPDVVSWTSIISGLSKLGF++EAL KFLSMNV PN AT VSALSACS LRCL++GKAIHGL LRSLNEESV+LDN
Subjt:  QNSLLHFYIHHGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDN

Query:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG
        ALL FY RCGSL+  +NLFD+M +RDVVSWTT+IGGYA +GLCEEAVRVFQNMVH   EA PNEATLINVL ACSSMSALH GQWVHSY+NSR D+I+DG
Subjt:  ALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDG

Query:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP
        N+GNALINMYVKCGSME AI IFK VEHKDIISWSTIISGLAMNG G+QAFGLFSLM+VHGI+PDA+TFL LLSACSHGGLI+QG MVFEAMKDVYNV P
Subjt:  NVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVP

Query:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK
        +M  Y CMVD+YG+AGLLDEAEAFIKEMP+EAEG VWGA+LHACQ+HGNE +Y KV +WLLSSK +++GT+ALLSNTYASC RWNDANEVRD MRSRGLK
Subjt:  QMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLK

Query:  KIAGCSWIELVD
        K+AGCSWIEL D
Subjt:  KIAGCSWIELVD

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic1.1e-9235.95Show/hide
Query:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSH-NHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIF
        I  PN+   N L+ A     DP    + F  M+     + N YTF + +KA   +     G  +H   VKS    D+F+ NSL+H Y   GD+ SA ++F
Subjt:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSH-NHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIF

Query:  DSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVENL
         ++ + DVVSW S+I+G  + G   +AL+ F  M   +V  +  T V  LSAC+++R LE G+ +   I  +    +++L NA+L  Y +CGS++  + L
Subjt:  DSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVENL

Query:  FDKMHKRDVVSWTTIIGGYA-------------------------------QSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVH
        FD M ++D V+WTT++ GYA                               Q+G   EA+ VF  +  +    K N+ TL++ L AC+ + AL +G+W+H
Subjt:  FDKMHKRDVVSWTTIIGGYA-------------------------------QSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVH

Query:  SYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM
        SYI  +  + ++ +V +ALI+MY KCG +E +  +F  VE +D+  WS +I GLAM+G G +A  +F  M    + P+ VTF  +  ACSH GL+D+   
Subjt:  SYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM

Query:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA
        +F  M+  Y +VP+  HY C+VD+ GR+G L++A  FI+ MP+     VWGA+L AC+IH N          LL  +  + G   LLSN YA   +W + 
Subjt:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKIAGCSWIEL
        +E+R  MR  GLKK  GCS IE+
Subjt:  NEVRDTMRSRGLKKIAGCSWIEL

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic2.4e-9835.95Show/hide
Query:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGD---------
        I  PN  I N +      SSDP +   L+  M+      N YTF + LK+C +    + G QIH  ++K G   D+++  SL+  Y+ +G          
Subjt:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGD---------

Query:  ----------------------VPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLIL
                              + +A ++FD +P  DVVSW ++ISG ++ G  KEAL+ F  M   NV P+ +T V+ +SAC++   +E+G+ +H  I 
Subjt:  ----------------------VPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLIL

Query:  RSLNEESVSLDNALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHS
              ++ + NAL+  Y +CG L++   LF+++  +DV+SW T+IGGY    L +EA+ +FQ M+  G    PN+ T++++L AC+ + A+ +G+W+H 
Subjt:  RSLNEESVSLDNALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHS

Query:  YINSRPDVIVD-GNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM
        YI+ R   + +  ++  +LI+MY KCG +E A  +F  + HK + SW+ +I G AM+G    +F LFS M   GI PD +TF+GLLSACSH G++D G  
Subjt:  YINSRPDVIVD-GNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM

Query:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA
        +F  M   Y + P++ HY CM+D+ G +GL  EAE  I  M ME +G +W ++L AC++HGN +     +E L+  +  + G++ LLSN YAS  RWN+ 
Subjt:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKIAGCSWIEL
         + R  +  +G+KK+ GCS IE+
Subjt:  NEVRDTMRSRGLKKIAGCSWIEL

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136003.4e-8934.49Show/hide
Query:  NPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIFDSMPDPDVVS
        N ++         E     F  M +     N Y+F   L AC  L+    G+Q+H+ + KS  L D++I ++L+  Y   G+V  A R+FD M D +VVS
Subjt:  NPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIFDSMPDPDVVS

Query:  WTSIISGLSKLGFEKEALDKF---LSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLN-EESVSLDNALLAFYFRCGSLKSVENLFD-------
        W S+I+   + G   EALD F   L   V P+  T  S +SAC+ L  ++VG+ +HG ++++      + L NA +  Y +C  +K    +FD       
Subjt:  WTSIISGLSKLGFEKEALDKF---LSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLN-EESVSLDNALLAFYFRCGSLKSVENLFD-------

Query:  ------------------------KMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDV
                                KM +R+VVSW  +I GY Q+G  EEA+ +F  +        P   +  N+L AC+ ++ LH+G   H ++      
Subjt:  ------------------------KMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDV

Query:  IVDGN-----VGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEA
           G      VGN+LI+MYVKCG +E   ++F+ +  +D +SW+ +I G A NG G +A  LF  M+  G  PD +T +G+LSAC H G +++G   F +
Subjt:  IVDGN-----VGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEA

Query:  MKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVR
        M   + V P   HY CMVD+ GRAG L+EA++ I+EMPM+ +  +WG++L AC++H N      V+E LL  +  + G + LLSN YA   +W D   VR
Subjt:  MKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVR

Query:  DTMRSRGLKKIAGCSWIEL
         +MR  G+ K  GCSWI++
Subjt:  DTMRSRGLKKIAGCSWIEL

Q9SJZ3 Pentatricopeptide repeat-containing protein At2g22410, mitochondrial6.3e-9134.61Show/hide
Query:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRY---PSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASR
        I NPN    N  +     S +P+  F L+ QMLR+    S  +H+T+    K C  L  +  G  I   ++K        + N+ +H +   GD+ +A +
Subjt:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRY---PSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASR

Query:  IFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVE
        +FD  P  D+VSW  +I+G  K+G  ++A+  +  M    V P+  T +  +S+CS L  L  GK  +  +  +    ++ L NAL+  + +CG +    
Subjt:  IFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVE

Query:  NLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNM---------VHVGG--------------------EAKPNEATLINVLFACSSMSALHMGQWVH
         +FD + KR +VSWTT+I GYA+ GL + + ++F +M           +GG                      KP+E T+I+ L ACS + AL +G W+H
Subjt:  NLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNM---------VHVGG--------------------EAKPNEATLINVLFACSSMSALHMGQWVH

Query:  SYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM
         YI  +  + ++  +G +L++MY KCG++  A+ +F  ++ ++ ++++ II GLA++G    A   F+ MI  GI+PD +TF+GLLSAC HGG+I  G  
Subjt:  SYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM

Query:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA
         F  MK  +N+ PQ+ HY  MVD+ GRAGLL+EA+  ++ MPMEA+  VWGA+L  C++HGN +   K ++ LL       G + LL   Y   + W DA
Subjt:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKIAGCSWIEL
           R  M  RG++KI GCS IE+
Subjt:  NEVRDTMRSRGLKKIAGCSWIEL

Q9SZK1 Pentatricopeptide repeat-containing protein At4g380102.7e-9437.71Show/hide
Query:  NPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIFDSMPDPDVVS
        N LL +      P    F +   +    S + +TF    KAC +    + G QIH  + K G   DI++QNSL+HFY   G+  +A ++F  MP  DVVS
Subjt:  NPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIFDSMPDPDVVS

Query:  WTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVENLFDKMHKRDVVSWT
        WT II+G ++ G  KEALD F  M+V PN AT+V  L +  R+ CL +GK IHGLIL+  +  S+   NAL+  Y +C  L     +F ++ K+D VSW 
Subjt:  WTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVENLFDKMHKRDVVSWT

Query:  TIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDI
        ++I G       +EA+ +F +++      KP+   L +VL AC+S+ A+  G+WVH YI +   +  D ++G A+++MY KCG +E A+ IF  +  K++
Subjt:  TIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDI

Query:  ISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKD-VYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPM
         +W+ ++ GLA++G G ++   F  M+  G  P+ VTFL  L+AC H GL+D+G   F  MK   YN+ P++ HY CM+D+  RAGLLDEA   +K MP+
Subjt:  ISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKD-VYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPM

Query:  EAEGQVWGAMLHACQIHGNEKKYAK-VSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKIAGCSWIE
        + + ++ GA+L AC+  G   +  K + +  L  +    G + LLSN +A+  RW+D   +R  M+ +G+ K+ G S+IE
Subjt:  EAEGQVWGAMLHACQIHGNEKKYAK-VSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKIAGCSWIE

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-9935.95Show/hide
Query:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGD---------
        I  PN  I N +      SSDP +   L+  M+      N YTF + LK+C +    + G QIH  ++K G   D+++  SL+  Y+ +G          
Subjt:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGD---------

Query:  ----------------------VPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLIL
                              + +A ++FD +P  DVVSW ++ISG ++ G  KEAL+ F  M   NV P+ +T V+ +SAC++   +E+G+ +H  I 
Subjt:  ----------------------VPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLIL

Query:  RSLNEESVSLDNALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHS
              ++ + NAL+  Y +CG L++   LF+++  +DV+SW T+IGGY    L +EA+ +FQ M+  G    PN+ T++++L AC+ + A+ +G+W+H 
Subjt:  RSLNEESVSLDNALLAFYFRCGSLKSVENLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHS

Query:  YINSRPDVIVD-GNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM
        YI+ R   + +  ++  +LI+MY KCG +E A  +F  + HK + SW+ +I G AM+G    +F LFS M   GI PD +TF+GLLSACSH G++D G  
Subjt:  YINSRPDVIVD-GNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM

Query:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA
        +F  M   Y + P++ HY CM+D+ G +GL  EAE  I  M ME +G +W ++L AC++HGN +     +E L+  +  + G++ LLSN YAS  RWN+ 
Subjt:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKIAGCSWIEL
         + R  +  +G+KK+ GCS IE+
Subjt:  NEVRDTMRSRGLKKIAGCSWIEL

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-9034.49Show/hide
Query:  NPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIFDSMPDPDVVS
        N ++         E     F  M +     N Y+F   L AC  L+    G+Q+H+ + KS  L D++I ++L+  Y   G+V  A R+FD M D +VVS
Subjt:  NPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIFDSMPDPDVVS

Query:  WTSIISGLSKLGFEKEALDKF---LSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLN-EESVSLDNALLAFYFRCGSLKSVENLFD-------
        W S+I+   + G   EALD F   L   V P+  T  S +SAC+ L  ++VG+ +HG ++++      + L NA +  Y +C  +K    +FD       
Subjt:  WTSIISGLSKLGFEKEALDKF---LSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLN-EESVSLDNALLAFYFRCGSLKSVENLFD-------

Query:  ------------------------KMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDV
                                KM +R+VVSW  +I GY Q+G  EEA+ +F  +        P   +  N+L AC+ ++ LH+G   H ++      
Subjt:  ------------------------KMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDV

Query:  IVDGN-----VGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEA
           G      VGN+LI+MYVKCG +E   ++F+ +  +D +SW+ +I G A NG G +A  LF  M+  G  PD +T +G+LSAC H G +++G   F +
Subjt:  IVDGN-----VGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEA

Query:  MKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVR
        M   + V P   HY CMVD+ GRAG L+EA++ I+EMPM+ +  +WG++L AC++H N      V+E LL  +  + G + LLSN YA   +W D   VR
Subjt:  MKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVR

Query:  DTMRSRGLKKIAGCSWIEL
         +MR  G+ K  GCSWI++
Subjt:  DTMRSRGLKKIAGCSWIEL

AT2G22410.1 SLOW GROWTH 14.5e-9234.61Show/hide
Query:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRY---PSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASR
        I NPN    N  +     S +P+  F L+ QMLR+    S  +H+T+    K C  L  +  G  I   ++K        + N+ +H +   GD+ +A +
Subjt:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRY---PSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASR

Query:  IFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVE
        +FD  P  D+VSW  +I+G  K+G  ++A+  +  M    V P+  T +  +S+CS L  L  GK  +  +  +    ++ L NAL+  + +CG +    
Subjt:  IFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVE

Query:  NLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNM---------VHVGG--------------------EAKPNEATLINVLFACSSMSALHMGQWVH
         +FD + KR +VSWTT+I GYA+ GL + + ++F +M           +GG                      KP+E T+I+ L ACS + AL +G W+H
Subjt:  NLFDKMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNM---------VHVGG--------------------EAKPNEATLINVLFACSSMSALHMGQWVH

Query:  SYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM
         YI  +  + ++  +G +L++MY KCG++  A+ +F  ++ ++ ++++ II GLA++G    A   F+ MI  GI+PD +TF+GLLSAC HGG+I  G  
Subjt:  SYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM

Query:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA
         F  MK  +N+ PQ+ HY  MVD+ GRAGLL+EA+  ++ MPMEA+  VWGA+L  C++HGN +   K ++ LL       G + LL   Y   + W DA
Subjt:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKIAGCSWIEL
           R  M  RG++KI GCS IE+
Subjt:  NEVRDTMRSRGLKKIAGCSWIEL

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.1e-9435.95Show/hide
Query:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSH-NHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIF
        I  PN+   N L+ A     DP    + F  M+     + N YTF + +KA   +     G  +H   VKS    D+F+ NSL+H Y   GD+ SA ++F
Subjt:  ITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSH-NHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIF

Query:  DSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVENL
         ++ + DVVSW S+I+G  + G   +AL+ F  M   +V  +  T V  LSAC+++R LE G+ +   I  +    +++L NA+L  Y +CGS++  + L
Subjt:  DSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSM---NVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVENL

Query:  FDKMHKRDVVSWTTIIGGYA-------------------------------QSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVH
        FD M ++D V+WTT++ GYA                               Q+G   EA+ VF  +  +    K N+ TL++ L AC+ + AL +G+W+H
Subjt:  FDKMHKRDVVSWTTIIGGYA-------------------------------QSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVH

Query:  SYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM
        SYI  +  + ++ +V +ALI+MY KCG +E +  +F  VE +D+  WS +I GLAM+G G +A  +F  M    + P+ VTF  +  ACSH GL+D+   
Subjt:  SYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDIISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSM

Query:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA
        +F  M+  Y +VP+  HY C+VD+ GR+G L++A  FI+ MP+     VWGA+L AC+IH N          LL  +  + G   LLSN YA   +W + 
Subjt:  VFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAMLHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKIAGCSWIEL
        +E+R  MR  GLKK  GCS IE+
Subjt:  NEVRDTMRSRGLKKIAGCSWIEL

AT4G38010.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.9e-9537.71Show/hide
Query:  NPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIFDSMPDPDVVS
        N LL +      P    F +   +    S + +TF    KAC +    + G QIH  + K G   DI++QNSL+HFY   G+  +A ++F  MP  DVVS
Subjt:  NPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIHHGDVPSASRIFDSMPDPDVVS

Query:  WTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVENLFDKMHKRDVVSWT
        WT II+G ++ G  KEALD F  M+V PN AT+V  L +  R+ CL +GK IHGLIL+  +  S+   NAL+  Y +C  L     +F ++ K+D VSW 
Subjt:  WTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVENLFDKMHKRDVVSWT

Query:  TIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDI
        ++I G       +EA+ +F +++      KP+   L +VL AC+S+ A+  G+WVH YI +   +  D ++G A+++MY KCG +E A+ IF  +  K++
Subjt:  TIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKDI

Query:  ISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKD-VYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPM
         +W+ ++ GLA++G G ++   F  M+  G  P+ VTFL  L+AC H GL+D+G   F  MK   YN+ P++ HY CM+D+  RAGLLDEA   +K MP+
Subjt:  ISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKD-VYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPM

Query:  EAEGQVWGAMLHACQIHGNEKKYAK-VSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKIAGCSWIE
        + + ++ GA+L AC+  G   +  K + +  L  +    G + LLSN +A+  RW+D   +R  M+ +G+ K+ G S+IE
Subjt:  EAEGQVWGAMLHACQIHGNEKKYAK-VSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKIAGCSWIE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTAAAACCTCCAAAACCTTCAATTTCAATCGCTCAGTTAACTCAAATCCATGCCCATCTCATCACAAATCCAAATGCCCGCATCTTGAACCCATTGCTTGGTGC
TCTCGTCCATTCCAGCGACCCTGAAAATGGCTTCTTTCTCTTCAACCAAATGCTTCGATACCCATCTTCCCACAATCATTACACTTTCACCTATGCCCTCAAAGCTTGTT
GCCGCCTCCATCAAACCCAAGCCGGCCTCCAAATCCATGCTCGTCTTGTCAAATCTGGACACCTTCCTGACATCTTCATCCAAAATTCATTGCTCCATTTCTACATTCAT
CATGGCGATGTTCCTTCTGCTTCTCGAATCTTCGATTCCATGCCTGATCCAGATGTGGTTTCATGGACTTCGATCATTTCAGGGCTTTCCAAGCTTGGTTTTGAAAAGGA
GGCTTTGGATAAGTTCTTGTCTATGAATGTATGCCCTAATTTTGCTACTTTTGTTAGTGCTTTATCTGCTTGTTCTAGGCTTAGATGTCTCGAAGTAGGGAAAGCCATAC
ATGGGTTGATATTGCGGAGTTTGAATGAGGAAAGTGTTAGTTTGGACAATGCCCTTCTAGCTTTCTATTTTAGGTGTGGGTCTTTGAAGAGTGTTGAGAACCTGTTTGAT
AAAATGCATAAGAGAGATGTAGTGTCCTGGACAACAATAATCGGTGGTTATGCACAAAGTGGATTGTGTGAAGAGGCTGTTAGGGTATTTCAAAACATGGTTCATGTGGG
AGGAGAGGCTAAGCCCAATGAGGCCACTCTAATTAATGTATTATTTGCATGTTCTTCCATGTCTGCTCTGCATATGGGTCAGTGGGTGCATTCCTATATCAACTCTAGGC
CTGATGTGATAGTTGATGGAAATGTTGGAAATGCTTTGATTAACATGTATGTAAAATGTGGTAGCATGGAAATGGCAATTATGATCTTCAAAGTTGTTGAACACAAGGAT
ATCATATCATGGAGCACAATAATAAGTGGGCTAGCCATGAATGGCTTAGGCAGGCAAGCTTTTGGTCTCTTCTCACTCATGATAGTTCATGGCATTTCTCCGGACGCTGT
AACGTTTCTTGGGCTATTATCTGCCTGCAGCCATGGTGGGTTGATCGATCAAGGCTCGATGGTGTTTGAAGCCATGAAAGATGTATACAATGTTGTACCTCAGATGAGTC
ATTATGTTTGCATGGTCGACATATATGGAAGGGCTGGGCTTTTAGATGAAGCAGAGGCATTCATAAAGGAGATGCCTATGGAAGCTGAAGGTCAAGTTTGGGGAGCTATG
CTCCATGCTTGTCAAATTCATGGGAATGAGAAGAAGTATGCGAAAGTTAGCGAATGGTTACTTAGCAGCAAGGGGGTCTCAATGGGAACTTTTGCTTTGTTATCAAATAC
TTATGCTAGTTGTGATAGATGGAATGATGCTAATGAAGTTAGAGATACCATGAGAAGTAGAGGGTTGAAGAAAATAGCTGGATGTAGTTGGATTGAACTAGTTGATGCTT
CGATTGCATTGAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCATTAAAACCTCCAAAACCTTCAATTTCAATCGCTCAGTTAACTCAAATCCATGCCCATCTCATCACAAATCCAAATGCCCGCATCTTGAACCCATTGCTTGGTGC
TCTCGTCCATTCCAGCGACCCTGAAAATGGCTTCTTTCTCTTCAACCAAATGCTTCGATACCCATCTTCCCACAATCATTACACTTTCACCTATGCCCTCAAAGCTTGTT
GCCGCCTCCATCAAACCCAAGCCGGCCTCCAAATCCATGCTCGTCTTGTCAAATCTGGACACCTTCCTGACATCTTCATCCAAAATTCATTGCTCCATTTCTACATTCAT
CATGGCGATGTTCCTTCTGCTTCTCGAATCTTCGATTCCATGCCTGATCCAGATGTGGTTTCATGGACTTCGATCATTTCAGGGCTTTCCAAGCTTGGTTTTGAAAAGGA
GGCTTTGGATAAGTTCTTGTCTATGAATGTATGCCCTAATTTTGCTACTTTTGTTAGTGCTTTATCTGCTTGTTCTAGGCTTAGATGTCTCGAAGTAGGGAAAGCCATAC
ATGGGTTGATATTGCGGAGTTTGAATGAGGAAAGTGTTAGTTTGGACAATGCCCTTCTAGCTTTCTATTTTAGGTGTGGGTCTTTGAAGAGTGTTGAGAACCTGTTTGAT
AAAATGCATAAGAGAGATGTAGTGTCCTGGACAACAATAATCGGTGGTTATGCACAAAGTGGATTGTGTGAAGAGGCTGTTAGGGTATTTCAAAACATGGTTCATGTGGG
AGGAGAGGCTAAGCCCAATGAGGCCACTCTAATTAATGTATTATTTGCATGTTCTTCCATGTCTGCTCTGCATATGGGTCAGTGGGTGCATTCCTATATCAACTCTAGGC
CTGATGTGATAGTTGATGGAAATGTTGGAAATGCTTTGATTAACATGTATGTAAAATGTGGTAGCATGGAAATGGCAATTATGATCTTCAAAGTTGTTGAACACAAGGAT
ATCATATCATGGAGCACAATAATAAGTGGGCTAGCCATGAATGGCTTAGGCAGGCAAGCTTTTGGTCTCTTCTCACTCATGATAGTTCATGGCATTTCTCCGGACGCTGT
AACGTTTCTTGGGCTATTATCTGCCTGCAGCCATGGTGGGTTGATCGATCAAGGCTCGATGGTGTTTGAAGCCATGAAAGATGTATACAATGTTGTACCTCAGATGAGTC
ATTATGTTTGCATGGTCGACATATATGGAAGGGCTGGGCTTTTAGATGAAGCAGAGGCATTCATAAAGGAGATGCCTATGGAAGCTGAAGGTCAAGTTTGGGGAGCTATG
CTCCATGCTTGTCAAATTCATGGGAATGAGAAGAAGTATGCGAAAGTTAGCGAATGGTTACTTAGCAGCAAGGGGGTCTCAATGGGAACTTTTGCTTTGTTATCAAATAC
TTATGCTAGTTGTGATAGATGGAATGATGCTAATGAAGTTAGAGATACCATGAGAAGTAGAGGGTTGAAGAAAATAGCTGGATGTAGTTGGATTGAACTAGTTGATGCTT
CGATTGCATTGAACTAA
Protein sequenceShow/hide protein sequence
MPLKPPKPSISIAQLTQIHAHLITNPNARILNPLLGALVHSSDPENGFFLFNQMLRYPSSHNHYTFTYALKACCRLHQTQAGLQIHARLVKSGHLPDIFIQNSLLHFYIH
HGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEKEALDKFLSMNVCPNFATFVSALSACSRLRCLEVGKAIHGLILRSLNEESVSLDNALLAFYFRCGSLKSVENLFD
KMHKRDVVSWTTIIGGYAQSGLCEEAVRVFQNMVHVGGEAKPNEATLINVLFACSSMSALHMGQWVHSYINSRPDVIVDGNVGNALINMYVKCGSMEMAIMIFKVVEHKD
IISWSTIISGLAMNGLGRQAFGLFSLMIVHGISPDAVTFLGLLSACSHGGLIDQGSMVFEAMKDVYNVVPQMSHYVCMVDIYGRAGLLDEAEAFIKEMPMEAEGQVWGAM
LHACQIHGNEKKYAKVSEWLLSSKGVSMGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKIAGCSWIELVDASIALN