; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002825 (gene) of Snake gourd v1 genome

Gene IDTan0002825
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG10:6782881..6785406
RNA-Seq ExpressionTan0002825
SyntenyTan0002825
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597431.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0088.59Show/hide
Query:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT
        MAANLH T+  +P+FIIPTAQNDSKH     FQIGLF+NCKTMD+LRQLH YASKQG IRKQSTVTKLISTCVEMGTLESL YARK FELFQED+E NVT
Subjt:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT

Query:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER
        VF+YNSLIRGYSA+GLC+EA+SLYIQMIE GF+PDNFTFPF+LSACAKTAAF  G QLHGALMKIGLEG+MFVANSLIHLYAEAG+FS ARKVFD M ER
Subjt:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER

Query:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD
        NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELA +IHAYI+ESE+ELNTHMVNALVDMYMKCGE GAA  LY+ECVD
Subjt:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD

Query:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS
        KNLVLCNTIMSN AR+G+  EVLAVLVD+F+VDLRPDRVSLLSAISACGQ+GDYLLG+CCH+++LRNGYE WDNICNAMIDMYMKCGKQEMAYRVFD MS
Subjt:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS

Query:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN
        NKTIVSWNSLLV YVRNRDLES +KIFNEM +KDIVSWNTMV+ALVQESMFEEAIELFREMQTK+IEADRVTMVE+ASACGYLGALELAKWIYAYI+KNN
Subjt:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN

Query:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY
        I CDMLLETALVDMFARCGDS SAMK+FDNMDRKDVSAWTAAIGAMAVDGNGERAIELY+EMLRQGVKPD VVFVNILTACSH GFVEQGQHIFESMKQ+
Subjt:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        GISPQIVHYGCMVDLLGRAGKLEEAL+IIKSM MKPNGIIWGSLLAACRTHKNVE+A FAAERLAE APERTGIHVLLSNIYASAE+WADVANVR+ LK 
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KGV+K PGSSSIEVDGV+HEFTSGDRSHPE C IDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQ LLN+HSEKLAMAYGLIST+KH+PIRVIKNLR 
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCH+F KY+SKVYDREIT+RDNNRFHFFR GSCSCGDYW
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

KAG7028890.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0089.18Show/hide
Query:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT
        MAANLH T+S +P+FI+PTAQND KH   H FQIGLF+NCKTMDELRQLH YASKQG IRKQSTVTKLISTCVEMGTLESL YARK FELFQED+E NVT
Subjt:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT

Query:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER
        VF+YNSLIRGYSA+GLC+EA+SLYIQMIE GF+PDNFTFPF+LSACAKTAAF  G QLHGALMKIGLEG+MFVANSLIHLYAEAG+FS ARKVFD M ER
Subjt:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER

Query:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD
        NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELA +IHAYI+ESE+ELNTHMVNALVDMYMKCGE GAA  LY+ECVD
Subjt:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD

Query:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS
        KNLVLCNTIMSN AR+G+  EVLAVLVD+F+VDLRPDRVSLLSAISACGQ+GDYLLG+CCH+++LRNGYE WDNICNAMIDMYMKCGKQEMAYRVFD MS
Subjt:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS

Query:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN
        NKTIVSWNSLLV YVRNRDLES +KIFNEM +KDIVSWNTMV+ALVQESMFEEAIELFREMQTK+IEADRVTMVE+ASACGYLGALELAKWIYAYI+KNN
Subjt:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN

Query:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY
        I CDMLLETALVDMFARCGDS SAMK+FDNMDRKDVSAWTAAIGAMAVDGNGERAIELY+EMLRQGVKPD VVFVNILTACSH GFVEQGQHIFESMKQ+
Subjt:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        GISPQIVHYGCMVDLLGRAGKLEEALDIIKSM MKPNGIIWGSLLAACRTHKNVE+A FAAERLAE APERTGIHVLLSNIYASAE+WADVANVRL LKE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KGV+K PGSSSIEVDGV+HEFTSGDRSHPE C IDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLN+HSEKLAMAYGLIST+KH+PIRVIKNLR 
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCH+F KY+SKVYDREIT+RDNNRFHFFR GSCSCGDYW
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

XP_011650966.1 pentatricopeptide repeat-containing protein At3g22690 [Cucumis sativus]0.0e+0088.11Show/hide
Query:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT
        MAANL PTIS  PNFI PT QNDSKHTPPHHFQIGLFK+CKT+DEL QLH YA KQG IRKQSTVTKLISTCVEMGT ESL +ARKAFELF ED E NVT
Subjt:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT

Query:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER
        +FMYN LIRGYSAAGL +EAISLY+QMIE GFMPDNFTFPF+LSACAKTAAFVEG QLHGAL+KIGLEGDMFVANSLIHLYAE GEF FARKVFDGMLER
Subjt:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER

Query:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD
        NVVSWTSLICGY+RTD   EAVALFFQMIEAGV+PNSVTMVCVISACAKLKDLELAKR+HAYIEESEMELNTHMVNAL DM+MKCGETGAA+RLYDECVD
Subjt:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD

Query:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS
        KNLVLCNTIMSN+AR+G+ NEVLAVLVD+ ++DLRPDRVSLL AISACGQMGDYLLGKCCH+YSLRNGYE WDNICNAMIDMYMKCGKQEMAYRVFD M 
Subjt:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS

Query:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN
        NKTIVSWNSLLVGY+RN+DLESARKIFNEM +KDIVSWNTM+NALVQESMF+EAIELFREMQ KEI+ DRVTMVE+ASACG LGALELAKW+Y++IVKN 
Subjt:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN

Query:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY
        IYCDMLLETALVDMFARCGD HSAM +F+NM RKDVSAWTAAIGAMAV+GNG+RAIELYNEMLRQGVKPD VVFVNILTACSH GFVEQG+HIFESMKQ+
Subjt:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        G+SPQIVHYGCMVDLLGRAGKLEEALDII+SMPM+PNGIIWGSLLAACRTHKN++MA FAAERLAEVAPE+TGIH+LLSNIYASAEKW DVANVRLQLKE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KGVQKMPGSSSI+VDGV+HEFTSGDRSHPEN  IDMML EIT+RLGD GYVPD+TNVLLDVNEQEKQYLLNRHSEKLA+AYGLIST+KH+PIRV+KNLRM
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCHSF KYISKVY REITVRDNNRFH FRQGSCSCGDYW
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

XP_022974456.1 pentatricopeptide repeat-containing protein At3g22690-like [Cucurbita maxima]0.0e+0088.82Show/hide
Query:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT
        MAANLH T+S +P+FIIPTAQNDSKH     FQIGLF+NCKTMDELRQLH YASKQG IRKQSTVTKLISTCVEMGTLESL YARK FELFQED+E NVT
Subjt:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT

Query:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER
        VF+YNSLIRGYSA+GLC+EA+SLYIQMIE GF+PDNFTFPF+LSACAKTAAF  G QLHGAL KIGLEG+MFVANSLIHLYAEA +FS ARKVFD M ER
Subjt:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER

Query:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD
        NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISAC+KLKDLELA +IHAYIEESE+ELNTHMVNALVDMYMKCGET AA  LY+ECVD
Subjt:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD

Query:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS
        KNLVLCNTIMSNFAR G+  EVL+V+VD+F+VDL+PDRVSLLSAISACGQMGDYLLG+CCH+++LRNGYE WDNICNAMIDMYMKCGKQEMAYRVF  MS
Subjt:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS

Query:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN
        NKTIVSWNSLLV YVRNRDLES +KIFNEML+KDIVSWNTMV+ALVQESMFEEAIELFREMQTKEIEADRVTMVE+ASACGYLGALELAKWIYAYI+KNN
Subjt:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN

Query:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY
        I CDMLLETALVDMFARCGDS SAMK+FDNMDRKDVSAWTAAIGAMAVDGNGERAIELY+EMLRQGVKPD VVFVNILTACSH GFVEQGQ IFESMKQ+
Subjt:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        GISPQIVHYGCMVDLLGRAGKLEEALDIIKSM MKPNGIIWGSLLAACRTHKNVE+A FAAERLAE APER+GIHVLLSNIYASAE+WADVANVRL LKE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KGV+KMPGSSSIEVDGV+HEFTSGDRSHPE C IDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEK YLLN+HSEKLAMAYGLIST+KH+PIRVIKNLR 
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCH+F KY+SKVYDREIT+RDNNRFHFFRQG CSCGDYW
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

XP_023538684.1 pentatricopeptide repeat-containing protein At3g22690 [Cucurbita pepo subsp. pepo]0.0e+0088.35Show/hide
Query:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT
        MAANLH T+S +P+FIIPTAQNDSKH   H FQIGLF+NCKTMDELRQLH YASKQG IRKQSTVTKLISTCVEMGTLESL YARK FELFQE++E  VT
Subjt:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT

Query:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER
        VF+YNSLIRGYSA+GLC+EA+SLYIQMIE GFMPDNFTFPF+LSACAKTAAF  G QLHGALMKIGLEG+MFVANSLIHLYAEA +FS ARKVFD M ER
Subjt:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER

Query:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD
        NVVSWTSLICGYAR+DSSSEAVALFFQMIEAGVRPNSVTMVCVISAC+KLKDLELA +IHAYI+ESE+ELNTHMVNALVDMYMKCGE GAA  LY+ECVD
Subjt:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD

Query:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS
        KNLVLCNTIMSN AR+G+  EVLAVLVD+F+VDLRPDRVSLLSAISACGQMGDYLLG+CCHS++LRNGYE WDNICNAMIDMYMKCGKQEMAY+VFD MS
Subjt:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS

Query:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN
        NKT VSWNSLLV YVRNRDLES +KIFN M +KDIVSWNTMV+AL+QESMF+EAIELFREMQTKEIEADRVTMVE+ASACGYLGALELAKWIYAYI+KNN
Subjt:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN

Query:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY
        I CDMLLETALVDMFARCGDS SAMK+FDNMDRKDVSAWTAAIGAMAVDGNGERAIELY+EMLRQGV+PD VVFVNILTACSH GFVEQGQHIFESMKQ+
Subjt:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        GISPQIVHYGCMVDLLGRAGKLEEALDIIKSM MKPNGIIWGSLLAACRTHKNVE+A FAAERLAE APERTGIHVLLSNIYASAE+WADVANVRL LKE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KGV+K PGSSSIEVDGV+HEFTSGDRSHPE C IDMMLKEI NRLGDAGYVPDLTNVLLDVN+QEKQYLLN+HSEKLAMAYGLIST+KH+PIRVIKNLR 
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCH+F KY+SKVYDREIT+RDNNRFHFFR GSCSCGDYW
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

TrEMBL top hitse value%identityAlignment
A0A0A0LA65 DYW_deaminase domain-containing protein0.0e+0088.11Show/hide
Query:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT
        MAANL PTIS  PNFI PT QNDSKHTPPHHFQIGLFK+CKT+DEL QLH YA KQG IRKQSTVTKLISTCVEMGT ESL +ARKAFELF ED E NVT
Subjt:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT

Query:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER
        +FMYN LIRGYSAAGL +EAISLY+QMIE GFMPDNFTFPF+LSACAKTAAFVEG QLHGAL+KIGLEGDMFVANSLIHLYAE GEF FARKVFDGMLER
Subjt:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER

Query:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD
        NVVSWTSLICGY+RTD   EAVALFFQMIEAGV+PNSVTMVCVISACAKLKDLELAKR+HAYIEESEMELNTHMVNAL DM+MKCGETGAA+RLYDECVD
Subjt:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD

Query:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS
        KNLVLCNTIMSN+AR+G+ NEVLAVLVD+ ++DLRPDRVSLL AISACGQMGDYLLGKCCH+YSLRNGYE WDNICNAMIDMYMKCGKQEMAYRVFD M 
Subjt:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS

Query:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN
        NKTIVSWNSLLVGY+RN+DLESARKIFNEM +KDIVSWNTM+NALVQESMF+EAIELFREMQ KEI+ DRVTMVE+ASACG LGALELAKW+Y++IVKN 
Subjt:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN

Query:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY
        IYCDMLLETALVDMFARCGD HSAM +F+NM RKDVSAWTAAIGAMAV+GNG+RAIELYNEMLRQGVKPD VVFVNILTACSH GFVEQG+HIFESMKQ+
Subjt:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        G+SPQIVHYGCMVDLLGRAGKLEEALDII+SMPM+PNGIIWGSLLAACRTHKN++MA FAAERLAEVAPE+TGIH+LLSNIYASAEKW DVANVRLQLKE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KGVQKMPGSSSI+VDGV+HEFTSGDRSHPEN  IDMML EIT+RLGD GYVPD+TNVLLDVNEQEKQYLLNRHSEKLA+AYGLIST+KH+PIRV+KNLRM
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCHSF KYISKVY REITVRDNNRFH FRQGSCSCGDYW
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

A0A1S3AXK0 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At3g226900.0e+0087.63Show/hide
Query:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT
        MAANL+PTIS +PNFI P  +NDSKHT PHHFQIGLFK+CKTMDEL QLH YA KQG IRKQSTVTKLISTCVEMGT ESL +ARK FELF ED E NVT
Subjt:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT

Query:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER
        +FMYNSLIRGYS AGLC+EAISLY+QMIE GFMPDNFTFPF+LSACAKTAAFVEG QLHGALMKIGLE DMFVANSLIHLYAE GEF FARKVFDGMLER
Subjt:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER

Query:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD
        NVVSWTSLICGY+RT+ S EAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKR+HAYIEESEMELNTHMVNALVDM+MKCGETGAA+RLYDECVD
Subjt:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD

Query:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS
        KNLVLCNTIMSN+AR+G+ NEVLAVLVD+ ++DLRPDRVSLL AISACGQMGDYLLGK CH+YSLRNGYE WDNICNAMIDMYMKCGK EMAYRVFD M 
Subjt:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS

Query:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN
        NKTIVSWNSLLVGY+RN+DLESARK FNEM +KDIVSWNTM+NALVQESMF+EAIELFREMQ KEI+ADRVTMVE+ASACGYLGALELAKW+Y+YIVKN 
Subjt:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN

Query:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY
        I  DMLLET LVDMFARCGD HSAM++F+NMDRKDVSAWTAAIGAMAV+GNG RAIELYNEMLRQGVKPD VVFVNILTACSH GFVEQG+HIFESMKQ+
Subjt:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        GISPQIVHYGCMVDLLGRAGKLEEALDII+SMPMKPNGIIWGSLLAACRTHKN++MA FAAERL EVAPE+TGIH+LLSNIYASAEKW DVANVRLQLKE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KGVQKMPGSSSI+VDGV+HEFTSGDRSHPEN  +DMML EIT+RL D GYVP++TNVLLDVNEQEK YLLNRHSEKLAMAYGLIST+KH+PIRV+KNLRM
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCHSF KYISKVY REI VRDNNRFHFFRQGSCSCGDYW
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

A0A5D3D302 Pentatricopeptide repeat-containing protein0.0e+0087.75Show/hide
Query:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT
        MAANL+PTIS +PNFI P  +NDSKHT PHHFQIGLFK+CKTMDEL QLH YA KQG IRKQSTVTKLISTCVEMGT ESL +ARK FELF ED E NVT
Subjt:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT

Query:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER
        +FMYNSLIRGYS AGLC+EAISLY+QMIE GFMPDNFTFPF+LSACAKTAAFVEG QLHGALMKIGLE DMFVANSLIHLYAE GEF FARKVFDGMLER
Subjt:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER

Query:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD
        NVVSWTSLICGY+RT+ S EAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKR+HAYIEESEMELNTHMVNALVDM+MKCGETGAA+RLYDECVD
Subjt:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD

Query:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS
        KNLVLCNTIMSN+AR+G+ NEVLAVLVD+ ++DLRPDRVSLL AISACGQMGDYLLGK CH+YSLRNGYE WDNICNAMIDMYMKCGK EMAYRVFD M 
Subjt:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS

Query:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN
        NKTIVSWNSLLVGY+RN+DLESARK FNEM +KDIVSWNTM+NALVQESMF+EAIELFREMQ KEI+ADRVTMVE+ASACGYLGALELAKW+Y+YIVKN 
Subjt:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN

Query:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY
        I  DMLLET LVDMFARCGD HSAM++F+NMDRKDVSAWTAAIGAMAV+GNG RAIELYNEMLRQGVKPD VVFVNILTACSH GFVEQG+HIFESMKQ+
Subjt:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        GISPQIVHYGCMVDLLGRAGKLEEALDII+SMPMKPNGIIWGSLLAACRTHKN++MA FAAERL EVAPE+TGIH+LLSNIYASAEKW DVANVRLQLKE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KGVQKMPGSSSI+VDGV+HEFTSGDRSHPEN  +DMML EIT+RL D GYVPD+TNVLLDVNEQEK+YLLNRHSEKLAMAYGLIST+KH+PIRV+KNLRM
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCHSF KYISKVY REI VRDNNRFHFFRQGSCSCGDYW
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

A0A6J1GGL0 pentatricopeptide repeat-containing protein At3g226900.0e+0088.35Show/hide
Query:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT
        MAANLH T+  +P+FIIPTAQNDSKH     FQIGLF+NCKTMDELRQLH YASKQG IRKQSTVTKLISTCVEMGTLESL YARK FELFQED+E NVT
Subjt:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT

Query:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER
        VF+YNSLIRGYSA+GLC+EA+SLYIQMIE GF+PDNFTFPF+LSACAKTAAF  G QLHGALMKIGLEG+MFVANSLIHLYAEAG+FS ARKVFD M ER
Subjt:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER

Query:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD
        NVVSWTSLICGY RTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELA +IHAYI+ESE+ELNTHMVNALVDMYMKCGE GAA  LY+ECVD
Subjt:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD

Query:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS
        KNLVLCNTIMSN AR+G+  EVLAVLVD+F+VDL+PDRVSLLSAISACGQ+GDYLLG+CCH+++LRNGYE WDNICNAMIDMYMKCGKQEMAYRVFD MS
Subjt:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS

Query:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN
        NKTIVSWNSLLV YVRNRDLES +KIFNEM +KDIVSWNTMV++LVQESMFEEAIELFREMQTK+IEADRVTMVE+ASACGYLGALELAKWIYAYI+KNN
Subjt:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN

Query:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY
        I CDMLLETALVDMFARCGDS SAMK+FDNMDRKDVSAWTAAIGAMAVDGNGERAIELY+EMLRQGVKPD VVFVNILTACSH GFVEQGQHIFESMKQ+
Subjt:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        GISPQIVHYGCMVDLLGRAG LEEAL+IIKSM MKPNGIIWGSLLAACRTHKNVE+A FAAERLAE APERTGIHVLLSNIYASAE+WADVANVRL LK 
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KGV+K PGSSSIEVDGV++EFTSGDRSHPE C IDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLN+HSEKLAMAYGLIST+KH+PIRVIKNLR 
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCH+F KY+SKVYDREIT+RDNNRFHFFR GSCSCGDYW
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

A0A6J1IBE6 pentatricopeptide repeat-containing protein At3g22690-like0.0e+0088.82Show/hide
Query:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT
        MAANLH T+S +P+FIIPTAQNDSKH     FQIGLF+NCKTMDELRQLH YASKQG IRKQSTVTKLISTCVEMGTLESL YARK FELFQED+E NVT
Subjt:  MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVT

Query:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER
        VF+YNSLIRGYSA+GLC+EA+SLYIQMIE GF+PDNFTFPF+LSACAKTAAF  G QLHGAL KIGLEG+MFVANSLIHLYAEA +FS ARKVFD M ER
Subjt:  VFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLER

Query:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD
        NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISAC+KLKDLELA +IHAYIEESE+ELNTHMVNALVDMYMKCGET AA  LY+ECVD
Subjt:  NVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVD

Query:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS
        KNLVLCNTIMSNFAR G+  EVL+V+VD+F+VDL+PDRVSLLSAISACGQMGDYLLG+CCH+++LRNGYE WDNICNAMIDMYMKCGKQEMAYRVF  MS
Subjt:  KNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMS

Query:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN
        NKTIVSWNSLLV YVRNRDLES +KIFNEML+KDIVSWNTMV+ALVQESMFEEAIELFREMQTKEIEADRVTMVE+ASACGYLGALELAKWIYAYI+KNN
Subjt:  NKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNN

Query:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY
        I CDMLLETALVDMFARCGDS SAMK+FDNMDRKDVSAWTAAIGAMAVDGNGERAIELY+EMLRQGVKPD VVFVNILTACSH GFVEQGQ IFESMKQ+
Subjt:  IYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        GISPQIVHYGCMVDLLGRAGKLEEALDIIKSM MKPNGIIWGSLLAACRTHKNVE+A FAAERLAE APER+GIHVLLSNIYASAE+WADVANVRL LKE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KGV+KMPGSSSIEVDGV+HEFTSGDRSHPE C IDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEK YLLN+HSEKLAMAYGLIST+KH+PIRVIKNLR 
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCH+F KY+SKVYDREIT+RDNNRFHFFRQG CSCGDYW
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

SwissProt top hitse value%identityAlignment
O81767 Pentatricopeptide repeat-containing protein At4g339905.5e-15337.21Show/hide
Query:  NDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIRGYSAAGLCEEAI
        N+SK     H    LF+ C  +   + LH+      +I+      KL++    +G   ++  AR  F+  Q     N  V+ +N +I GY  AG   E I
Subjt:  NDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIRGYSAAGLCEEAI

Query:  SLY-IQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSE
          + + M+  G  PD  TFP +L AC      ++G ++H   +K G   D++VA SLIHLY+       AR +FD M  R++ SW ++I GY ++ ++ E
Subjt:  SLY-IQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSE

Query:  AVALFFQMIEAGVRP-NSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLS
        A+ L       G+R  +SVT+V ++SAC +  D      IH+Y  +  +E    + N L+D+Y + G     ++++D    ++L+  N+I+  +  N   
Subjt:  AVALFFQMIEAGVRP-NSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLS

Query:  NEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWD-NICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNR
           +++  ++    ++PD ++L+S  S   Q+GD    +    ++LR G+   D  I NA++ MY K G                               
Subjt:  NEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWD-NICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNR

Query:  DLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTK-EIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFAR
         ++SAR +FN +   D++SWNT+++   Q     EAIE++  M+ + EI A++ T V +  AC   GAL     ++  ++KN +Y D+ + T+L DM+ +
Subjt:  DLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTK-EIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFAR

Query:  CGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMK-QYGISPQIVHYGCMVDLL
        CG    A+ LF  + R +   W   I      G+GE+A+ L+ EML +GVKPD + FV +L+ACSH G V++GQ  FE M+  YGI+P + HYGCMVD+ 
Subjt:  CGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMK-QYGISPQIVHYGCMVDLL

Query:  GRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDG
        GRAG+LE AL  IKSM ++P+  IWG+LL+ACR H NV++ K A+E L EV PE  G HVLLSN+YASA KW  V  +R     KG++K PG SS+EVD 
Subjt:  GRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDG

Query:  VVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYD
         V  F +G+++HP   ++   L  +  +L   GYVPD   VL DV + EK+++L  HSE+LA+A+ LI+T     IR+ KNLR+C DCHS TK+ISK+ +
Subjt:  VVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYD

Query:  REITVRDNNRFHFFRQGSCSCGDYW
        REI VRD+NRFH F+ G CSCGDYW
Subjt:  REITVRDNNRFHFFRQGSCSCGDYW

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic8.5e-15439.07Show/hide
Query:  QLHGALMKIGLEGDMFVANSLIHLYA--EAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKDL
        Q HG +++ G   D + A+ L  + A        +ARKVFD + + N  +W +LI  YA       ++  F  M+ E+   PN  T   +I A A++  L
Subjt:  QLHGALMKIGLEGDMFVANSLIHLYA--EAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKDL

Query:  ELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGD
         L + +H    +S +  +  + N+L+  Y  CG+  +A +++    +K++V  N++++ F + G  ++ L +   +   D++   V+++  +SAC ++ +
Subjt:  ELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGD

Query:  YLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEE
           G+   SY   N       + NAM+DMY KCG  E A R+FD+M  K  V+W ++L GY  + D E+AR++ N M +KDIV+WN +++A  Q     E
Subjt:  YLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEE

Query:  AIELFREMQ-TKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNG
        A+ +F E+Q  K ++ +++T+V   SAC  +GALEL +WI++YI K+ I  +  + +AL+ M+++CGD   + ++F++++++DV  W+A IG +A+ G G
Subjt:  AIELFREMQ-TKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNG

Query:  ERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMK-QYGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTH
          A++++ +M    VKP+ V F N+  ACSH G V++ + +F  M+  YGI P+  HY C+VD+LGR+G LE+A+  I++MP+ P+  +WG+LL AC+ H
Subjt:  ERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMK-QYGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTH

Query:  KNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYV
         N+ +A+ A  RL E+ P   G HVLLSNIYA   KW +V+ +R  ++  G++K PG SSIE+DG++HEF SGD +HP +  +   L E+  +L   GY 
Subjt:  KNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYV

Query:  PDLTNVLLDVNEQE-KQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        P+++ VL  + E+E K+  LN HSEKLA+ YGLISTE    IRVIKNLR+C DCHS  K IS++YDREI VRD  RFH FR G CSC D+W
Subjt:  PDLTNVLLDVNEQE-KQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic4.5e-15535.69Show/hide
Query:  LFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPD
        L + C ++ ELRQ+     K G  ++    TKL+S     G+++    A + FE    D +LNV   +Y+++++G++     ++A+  +++M      P 
Subjt:  LFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPD

Query:  NFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRP
         + F ++L  C   A    G ++HG L+K G   D+F    L ++YA+  + + ARKVFD M ER++VSW +++ GY++   +  A+ +   M E  ++P
Subjt:  NFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRP

Query:  NSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLR
        + +T+V V+ A + L+ + + K IH Y   S  +   ++  ALVDMY KCG    A +L+D  +++N+V  N+++  + +N    E + +   +    ++
Subjt:  NSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLR

Query:  PDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDI
        P  VS++ A+ AC  +GD   G+  H  S+  G +   ++ N++I MY KC + + A  +F  + ++T+VSWN++++G+ +N      R I         
Subjt:  PDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDI

Query:  VSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKD
                         +A+  F +M+++ ++ D  T V + +A   L     AKWI+  ++++ +  ++ + TALVDM+A+CG    A  +FD M  + 
Subjt:  VSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKD

Query:  VSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQ-YGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPM
        V+ W A I      G G+ A+EL+ EM +  +KP+ V F+++++ACSH G VE G   F  MK+ Y I   + HYG MVDLLGRAG+L EA D I  MP+
Subjt:  VSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQ-YGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPM

Query:  KPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDI
        KP   ++G++L AC+ HKNV  A+ AAERL E+ P+  G HVLL+NIY +A  W  V  VR+ +  +G++K PG S +E+   VH F SG  +HP++  I
Subjt:  KPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDI

Query:  DMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYDREITVRDNNRFHFFRQGS
           L+++   + +AGYVPD TN++L V    K+ LL+ HSEKLA+++GL++T     I V KNLR+C+DCH+ TKYIS V  REI VRD  RFH F+ G+
Subjt:  DMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYDREITVRDNNRFHFFRQGS

Query:  CSCGDYW
        CSCGDYW
Subjt:  CSCGDYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.1e-16136.53Show/hide
Query:  SASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIR
        S+ P   +P++ +    +  +H  + L  NCKT+  LR +H+   K G       ++KLI  C+     E L YA   F+  QE       + ++N++ R
Subjt:  SASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIR

Query:  GYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLI
        G++ +     A+ LY+ MI +G +P+++TFPF+L +CAK+ AF EG Q+HG ++K+G + D++V  SLI +Y + G    A KVFD    R+VVS+T+LI
Subjt:  GYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLI

Query:  CGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTI
         GYA                                                YIE                          A++L+DE   K++V  N +
Subjt:  CGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTI

Query:  MSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNS
        +S +A  G   E L +  D+ + ++RPD  ++++ +SAC Q G   LG+  H +   +G+ +   I NA+ID+Y KCG                      
Subjt:  MSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNS

Query:  LLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVK--NNIYCDMLL
                 +LE+A  +F  +  KD++SWNT++      ++++EA+ LF+EM       + VTM+ +  AC +LGA+++ +WI+ YI K    +     L
Subjt:  LLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVK--NNIYCDMLL

Query:  ETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQ-YGISPQI
         T+L+DM+A+CGD  +A ++F+++  K +S+W A I   A+ G  + + +L++ M + G++PD + FV +L+ACSH G ++ G+HIF +M Q Y ++P++
Subjt:  ETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQ-YGISPQI

Query:  VHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKM
         HYGCM+DLLG +G  +EA ++I  M M+P+G+IW SLL AC+ H NVE+ +  AE L ++ PE  G +VLLSNIYASA +W +VA  R  L +KG++K+
Subjt:  VHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKM

Query:  PGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHS
        PG SSIE+D VVHEF  GD+ HP N +I  ML+E+   L  AG+VPD + VL ++ E+ K+  L  HSEKLA+A+GLIST+    + ++KNLR+C +CH 
Subjt:  PGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHS

Query:  FTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
         TK ISK+Y REI  RD  RFH FR G CSC DYW
Subjt:  FTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226903.3e-29158.5Show/hide
Query:  NLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFM
        +L P + A+     P+  N SK T      +   KNCKT+DEL+  H   +KQG     ST+TKL++   E+GT ESL +A++ F    E+ E   T FM
Subjt:  NLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFM

Query:  YNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVV
        YNSLIRGY+++GLC EAI L+++M+  G  PD +TFPF LSACAK+ A   G Q+HG ++K+G   D+FV NSL+H YAE GE   ARKVFD M ERNVV
Subjt:  YNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVV

Query:  SWTSLICGYARTDSSSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKN
        SWTS+ICGYAR D + +AV LFF+M+ +  V PNSVTMVCVISACAKL+DLE  ++++A+I  S +E+N  MV+ALVDMYMKC     A+RL+DE    N
Subjt:  SWTSLICGYARTDSSSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKN

Query:  LVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNK
        L LCN + SN+ R GL+ E L V   +    +RPDR+S+LSAIS+C Q+ + L GK CH Y LRNG+E+WDNICNA+IDMYMKC +Q+ A+R+FD MSNK
Subjt:  LVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNK

Query:  TIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKE-IEADRVTMVELASACGYLGALELAKWIYAYIVKNNI
        T+V+WNS++ GYV N ++++A + F  M +K+IVSWNT+++ LVQ S+FEEAIE+F  MQ++E + AD VTM+ +ASACG+LGAL+LAKWIY YI KN I
Subjt:  TIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKE-IEADRVTMVELASACGYLGALELAKWIYAYIVKNNI

Query:  YCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESM-KQY
          D+ L T LVDMF+RCGD  SAM +F+++  +DVSAWTAAIGAMA+ GN ERAIEL+++M+ QG+KPD V FV  LTACSH G V+QG+ IF SM K +
Subjt:  YCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESM-KQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        G+SP+ VHYGCMVDLLGRAG LEEA+ +I+ MPM+PN +IW SLLAACR   NVEMA +AAE++  +APERTG +VLLSN+YASA +W D+A VRL +KE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KG++K PG+SSI++ G  HEFTSGD SHPE  +I+ ML E++ R    G+VPDL+NVL+DV+E+EK ++L+RHSEKLAMAYGLIS+ K   IR++KNLR+
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCHSF K+ SKVY+REI +RDNNRFH+ RQG CSCGD+W
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.8e-16336.53Show/hide
Query:  SASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIR
        S+ P   +P++ +    +  +H  + L  NCKT+  LR +H+   K G       ++KLI  C+     E L YA   F+  QE       + ++N++ R
Subjt:  SASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIR

Query:  GYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLI
        G++ +     A+ LY+ MI +G +P+++TFPF+L +CAK+ AF EG Q+HG ++K+G + D++V  SLI +Y + G    A KVFD    R+VVS+T+LI
Subjt:  GYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLI

Query:  CGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTI
         GYA                                                YIE                          A++L+DE   K++V  N +
Subjt:  CGYARTDSSSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTI

Query:  MSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNS
        +S +A  G   E L +  D+ + ++RPD  ++++ +SAC Q G   LG+  H +   +G+ +   I NA+ID+Y KCG                      
Subjt:  MSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNS

Query:  LLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVK--NNIYCDMLL
                 +LE+A  +F  +  KD++SWNT++      ++++EA+ LF+EM       + VTM+ +  AC +LGA+++ +WI+ YI K    +     L
Subjt:  LLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVK--NNIYCDMLL

Query:  ETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQ-YGISPQI
         T+L+DM+A+CGD  +A ++F+++  K +S+W A I   A+ G  + + +L++ M + G++PD + FV +L+ACSH G ++ G+HIF +M Q Y ++P++
Subjt:  ETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQ-YGISPQI

Query:  VHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKM
         HYGCM+DLLG +G  +EA ++I  M M+P+G+IW SLL AC+ H NVE+ +  AE L ++ PE  G +VLLSNIYASA +W +VA  R  L +KG++K+
Subjt:  VHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKM

Query:  PGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHS
        PG SSIE+D VVHEF  GD+ HP N +I  ML+E+   L  AG+VPD + VL ++ E+ K+  L  HSEKLA+A+GLIST+    + ++KNLR+C +CH 
Subjt:  PGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHS

Query:  FTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
         TK ISK+Y REI  RD  RFH FR G CSC DYW
Subjt:  FTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein3.2e-15635.69Show/hide
Query:  LFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPD
        L + C ++ ELRQ+     K G  ++    TKL+S     G+++    A + FE    D +LNV   +Y+++++G++     ++A+  +++M      P 
Subjt:  LFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIRGYSAAGLCEEAISLYIQMIEVGFMPD

Query:  NFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRP
         + F ++L  C   A    G ++HG L+K G   D+F    L ++YA+  + + ARKVFD M ER++VSW +++ GY++   +  A+ +   M E  ++P
Subjt:  NFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSEAVALFFQMIEAGVRP

Query:  NSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLR
        + +T+V V+ A + L+ + + K IH Y   S  +   ++  ALVDMY KCG    A +L+D  +++N+V  N+++  + +N    E + +   +    ++
Subjt:  NSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLR

Query:  PDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDI
        P  VS++ A+ AC  +GD   G+  H  S+  G +   ++ N++I MY KC + + A  +F  + ++T+VSWN++++G+ +N      R I         
Subjt:  PDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDI

Query:  VSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKD
                         +A+  F +M+++ ++ D  T V + +A   L     AKWI+  ++++ +  ++ + TALVDM+A+CG    A  +FD M  + 
Subjt:  VSWNTMVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKD

Query:  VSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQ-YGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPM
        V+ W A I      G G+ A+EL+ EM +  +KP+ V F+++++ACSH G VE G   F  MK+ Y I   + HYG MVDLLGRAG+L EA D I  MP+
Subjt:  VSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQ-YGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPM

Query:  KPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDI
        KP   ++G++L AC+ HKNV  A+ AAERL E+ P+  G HVLL+NIY +A  W  V  VR+ +  +G++K PG S +E+   VH F SG  +HP++  I
Subjt:  KPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDI

Query:  DMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYDREITVRDNNRFHFFRQGS
           L+++   + +AGYVPD TN++L V    K+ LL+ HSEKLA+++GL++T     I V KNLR+C+DCH+ TKYIS V  REI VRD  RFH F+ G+
Subjt:  DMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYDREITVRDNNRFHFFRQGS

Query:  CSCGDYW
        CSCGDYW
Subjt:  CSCGDYW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.0e-15539.07Show/hide
Query:  QLHGALMKIGLEGDMFVANSLIHLYA--EAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKDL
        Q HG +++ G   D + A+ L  + A        +ARKVFD + + N  +W +LI  YA       ++  F  M+ E+   PN  T   +I A A++  L
Subjt:  QLHGALMKIGLEGDMFVANSLIHLYA--EAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKDL

Query:  ELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGD
         L + +H    +S +  +  + N+L+  Y  CG+  +A +++    +K++V  N++++ F + G  ++ L +   +   D++   V+++  +SAC ++ +
Subjt:  ELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGD

Query:  YLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEE
           G+   SY   N       + NAM+DMY KCG  E A R+FD+M  K  V+W ++L GY  + D E+AR++ N M +KDIV+WN +++A  Q     E
Subjt:  YLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEE

Query:  AIELFREMQ-TKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNG
        A+ +F E+Q  K ++ +++T+V   SAC  +GALEL +WI++YI K+ I  +  + +AL+ M+++CGD   + ++F++++++DV  W+A IG +A+ G G
Subjt:  AIELFREMQ-TKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNG

Query:  ERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMK-QYGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTH
          A++++ +M    VKP+ V F N+  ACSH G V++ + +F  M+  YGI P+  HY C+VD+LGR+G LE+A+  I++MP+ P+  +WG+LL AC+ H
Subjt:  ERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMK-QYGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTH

Query:  KNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYV
         N+ +A+ A  RL E+ P   G HVLLSNIYA   KW +V+ +R  ++  G++K PG SSIE+DG++HEF SGD +HP +  +   L E+  +L   GY 
Subjt:  KNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYV

Query:  PDLTNVLLDVNEQE-KQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        P+++ VL  + E+E K+  LN HSEKLA+ YGLISTE    IRVIKNLR+C DCHS  K IS++YDREI VRD  RFH FR G CSC D+W
Subjt:  PDLTNVLLDVNEQE-KQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)4.4e-29158.45Show/hide
Query:  NLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFM
        +L P + A+     P+  N SK T      +   KNCKT+DEL+  H   +KQG     ST+TKL++   E+GT ESL +A++ F    E+ E   T FM
Subjt:  NLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFM

Query:  YNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVV
        YNSLIRGY+++GLC EAI L+++M+  G  PD +TFPF LSACAK+ A   G Q+HG ++K+G   D+FV NSL+H YAE GE   ARKVFD M ERNVV
Subjt:  YNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVV

Query:  SWTSLICGYARTDSSSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKN
        SWTS+ICGYAR D + +AV LFF+M+ +  V PNSVTMVCVISACAKL+DLE  ++++A+I  S +E+N  MV+ALVDMYMKC     A+RL+DE    N
Subjt:  SWTSLICGYARTDSSSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKN

Query:  LVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNK
        L LCN + SN+ R GL+ E L V   +    +RPDR+S+LSAIS+C Q+ + L GK CH Y LRNG+E+WDNICNA+IDMYMKC +Q+ A+R+FD MSNK
Subjt:  LVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNK

Query:  TIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKE-IEADRVTMVELASACGYLGALELAKWIYAYIVKNNI
        T+V+WNS++ GYV N ++++A + F  M +K+IVSWNT+++ LVQ S+FEEAIE+F  MQ++E + AD VTM+ +ASACG+LGAL+LAKWIY YI KN I
Subjt:  TIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKE-IEADRVTMVELASACGYLGALELAKWIYAYIVKNNI

Query:  YCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESM-KQY
          D+ L T LVDMF+RCGD  SAM +F+++  +DVSAWTAAIGAMA+ GN ERAIEL+++M+ QG+KPD V FV  LTACSH G V+QG+ IF SM K +
Subjt:  YCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESM-KQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        G+SP+ VHYGCMVDLLGRAG LEEA+ +I+ MPM+PN +IW SLLAACR   NVEMA +AAE++  +APERTG +VLLSN+YASA +W D+A VRL +KE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KG++K PG+SSI++ G  HEFTSGD SHPE  +I+ ML E++ R    G+VPDL+NVL+DV+E+EK ++L+RHSEKLAMAYGLIS+ K   IR++KNLR+
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDY
        CSDCHSF K+ SKVY+REI +RDNNRFH+ RQG CSCGD+
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDY

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification2.3e-29258.5Show/hide
Query:  NLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFM
        +L P + A+     P+  N SK T      +   KNCKT+DEL+  H   +KQG     ST+TKL++   E+GT ESL +A++ F    E+ E   T FM
Subjt:  NLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFM

Query:  YNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVV
        YNSLIRGY+++GLC EAI L+++M+  G  PD +TFPF LSACAK+ A   G Q+HG ++K+G   D+FV NSL+H YAE GE   ARKVFD M ERNVV
Subjt:  YNSLIRGYSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVV

Query:  SWTSLICGYARTDSSSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKN
        SWTS+ICGYAR D + +AV LFF+M+ +  V PNSVTMVCVISACAKL+DLE  ++++A+I  S +E+N  MV+ALVDMYMKC     A+RL+DE    N
Subjt:  SWTSLICGYARTDSSSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKN

Query:  LVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNK
        L LCN + SN+ R GL+ E L V   +    +RPDR+S+LSAIS+C Q+ + L GK CH Y LRNG+E+WDNICNA+IDMYMKC +Q+ A+R+FD MSNK
Subjt:  LVLCNTIMSNFARNGLSNEVLAVLVDVFRVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNK

Query:  TIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKE-IEADRVTMVELASACGYLGALELAKWIYAYIVKNNI
        T+V+WNS++ GYV N ++++A + F  M +K+IVSWNT+++ LVQ S+FEEAIE+F  MQ++E + AD VTM+ +ASACG+LGAL+LAKWIY YI KN I
Subjt:  TIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNTMVNALVQESMFEEAIELFREMQTKE-IEADRVTMVELASACGYLGALELAKWIYAYIVKNNI

Query:  YCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESM-KQY
          D+ L T LVDMF+RCGD  SAM +F+++  +DVSAWTAAIGAMA+ GN ERAIEL+++M+ QG+KPD V FV  LTACSH G V+QG+ IF SM K +
Subjt:  YCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESM-KQY

Query:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE
        G+SP+ VHYGCMVDLLGRAG LEEA+ +I+ MPM+PN +IW SLLAACR   NVEMA +AAE++  +APERTG +VLLSN+YASA +W D+A VRL +KE
Subjt:  GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKE

Query:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM
        KG++K PG+SSI++ G  HEFTSGD SHPE  +I+ ML E++ R    G+VPDL+NVL+DV+E+EK ++L+RHSEKLAMAYGLIS+ K   IR++KNLR+
Subjt:  KGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHMPIRVIKNLRM

Query:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        CSDCHSF K+ SKVY+REI +RDNNRFH+ RQG CSCGD+W
Subjt:  CSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCAAATCTTCATCCAACCATCTCTGCATCGCCGAACTTCATCATACCCACCGCTCAGAATGACTCAAAGCATACCCCTCCCCACCATTTTCAAATTGGGTTGTT
CAAAAATTGCAAAACCATGGATGAGCTTCGACAATTGCACTCTTACGCGTCGAAGCAGGGTCGAATTCGTAAACAATCGACTGTAACTAAGCTTATTTCCACTTGCGTGG
AAATGGGCACTTTAGAAAGCTTGGGTTATGCTCGAAAGGCATTCGAGCTCTTCCAGGAAGATGAAGAATTAAACGTTACTGTTTTCATGTACAATTCGTTAATCAGAGGA
TACTCTGCTGCAGGGCTTTGTGAGGAAGCTATTTCGCTGTACATTCAGATGATCGAGGTTGGGTTTATGCCAGACAACTTCACTTTTCCATTTATGTTGAGTGCGTGTGC
TAAGACTGCCGCGTTTGTAGAAGGGGCTCAGCTCCATGGAGCTCTTATGAAGATTGGTTTAGAGGGAGATATGTTTGTTGCCAATTCTCTGATACATCTGTATGCAGAAG
CAGGAGAATTTTCGTTTGCTCGAAAGGTGTTTGATGGAATGCTTGAGAGAAATGTTGTTTCATGGACTAGCTTGATTTGTGGCTATGCTAGGACAGATTCTTCCAGTGAG
GCTGTGGCTTTGTTTTTCCAAATGATCGAGGCTGGTGTTAGACCCAATTCTGTCACAATGGTGTGTGTGATATCGGCTTGTGCCAAGTTGAAAGATCTTGAACTGGCCAA
GAGAATTCATGCTTATATTGAAGAGTCAGAAATGGAGCTTAATACTCATATGGTAAATGCACTTGTGGATATGTACATGAAATGTGGAGAAACTGGTGCTGCAGAGCGAC
TATATGATGAATGTGTTGATAAAAATTTAGTTTTATGTAACACAATCATGTCAAATTTCGCACGCAATGGCTTGTCGAACGAGGTACTTGCTGTCTTGGTAGATGTGTTC
CGAGTAGATCTTCGACCGGATAGAGTTTCATTGTTATCAGCAATCTCAGCGTGTGGACAGATGGGTGACTATCTACTTGGGAAATGCTGCCACAGTTATTCTCTAAGAAA
TGGGTATGAAGCTTGGGATAACATTTGTAATGCAATGATTGACATGTATATGAAGTGTGGAAAACAAGAAATGGCCTACAGAGTTTTTGACAGTATGTCAAATAAGACCA
TTGTGTCGTGGAACTCATTACTTGTTGGTTATGTTAGAAATAGAGATTTAGAGTCAGCTAGAAAGATATTCAACGAGATGCTTAAAAAGGACATAGTGTCTTGGAACACA
ATGGTTAATGCTTTGGTTCAAGAGAGCATGTTTGAGGAAGCAATTGAGCTCTTCCGGGAGATGCAAACAAAGGAAATAGAGGCAGACAGGGTAACAATGGTAGAACTTGC
ATCTGCATGTGGATATCTCGGAGCTCTCGAGCTCGCCAAGTGGATATATGCCTATATTGTAAAGAACAACATCTACTGTGATATGTTGCTTGAGACAGCATTAGTTGATA
TGTTTGCTAGGTGTGGCGACTCTCATAGTGCGATGAAACTGTTCGACAATATGGATAGAAAAGACGTCTCTGCATGGACAGCAGCCATTGGAGCAATGGCTGTGGATGGG
AATGGAGAACGAGCTATAGAACTTTACAATGAGATGCTAAGGCAAGGGGTGAAACCGGATCCAGTAGTTTTTGTGAACATATTAACAGCTTGTAGCCATGTTGGTTTTGT
GGAACAAGGGCAGCACATATTCGAGTCAATGAAGCAATATGGAATCTCCCCACAGATTGTTCATTATGGTTGCATGGTTGATCTATTAGGTCGTGCAGGTAAGTTAGAAG
AAGCCCTAGATATTATAAAGAGCATGCCAATGAAACCCAATGGGATTATATGGGGATCTCTATTGGCTGCATGTCGTACCCATAAAAATGTCGAAATGGCGAAATTTGCA
GCTGAGAGGTTAGCAGAAGTGGCTCCAGAAAGGACTGGGATTCATGTACTTCTATCAAACATATATGCTTCAGCTGAAAAGTGGGCTGATGTTGCTAATGTAAGGCTACA
GTTGAAGGAGAAAGGAGTTCAGAAAATGCCTGGTTCGAGCTCGATAGAAGTCGATGGAGTTGTTCATGAGTTTACTTCAGGTGACAGATCACACCCAGAAAACTGTGACA
TTGACATGATGTTAAAAGAAATTACCAACAGGCTTGGGGATGCTGGTTATGTTCCTGATTTAACCAATGTTCTGCTTGACGTAAACGAGCAAGAGAAACAATATCTACTC
AATCGGCATAGCGAGAAGCTGGCCATGGCTTACGGGCTCATAAGTACAGAAAAGCACATGCCGATTCGTGTTATTAAGAATCTCCGAATGTGCTCAGACTGCCATTCATT
CACCAAATACATTTCAAAAGTGTATGACAGGGAAATAACAGTACGAGATAATAATAGGTTTCACTTCTTTCGACAAGGGTCTTGTTCATGCGGTGATTATTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCAAATCTTCATCCAACCATCTCTGCATCGCCGAACTTCATCATACCCACCGCTCAGAATGACTCAAAGCATACCCCTCCCCACCATTTTCAAATTGGGTTGTT
CAAAAATTGCAAAACCATGGATGAGCTTCGACAATTGCACTCTTACGCGTCGAAGCAGGGTCGAATTCGTAAACAATCGACTGTAACTAAGCTTATTTCCACTTGCGTGG
AAATGGGCACTTTAGAAAGCTTGGGTTATGCTCGAAAGGCATTCGAGCTCTTCCAGGAAGATGAAGAATTAAACGTTACTGTTTTCATGTACAATTCGTTAATCAGAGGA
TACTCTGCTGCAGGGCTTTGTGAGGAAGCTATTTCGCTGTACATTCAGATGATCGAGGTTGGGTTTATGCCAGACAACTTCACTTTTCCATTTATGTTGAGTGCGTGTGC
TAAGACTGCCGCGTTTGTAGAAGGGGCTCAGCTCCATGGAGCTCTTATGAAGATTGGTTTAGAGGGAGATATGTTTGTTGCCAATTCTCTGATACATCTGTATGCAGAAG
CAGGAGAATTTTCGTTTGCTCGAAAGGTGTTTGATGGAATGCTTGAGAGAAATGTTGTTTCATGGACTAGCTTGATTTGTGGCTATGCTAGGACAGATTCTTCCAGTGAG
GCTGTGGCTTTGTTTTTCCAAATGATCGAGGCTGGTGTTAGACCCAATTCTGTCACAATGGTGTGTGTGATATCGGCTTGTGCCAAGTTGAAAGATCTTGAACTGGCCAA
GAGAATTCATGCTTATATTGAAGAGTCAGAAATGGAGCTTAATACTCATATGGTAAATGCACTTGTGGATATGTACATGAAATGTGGAGAAACTGGTGCTGCAGAGCGAC
TATATGATGAATGTGTTGATAAAAATTTAGTTTTATGTAACACAATCATGTCAAATTTCGCACGCAATGGCTTGTCGAACGAGGTACTTGCTGTCTTGGTAGATGTGTTC
CGAGTAGATCTTCGACCGGATAGAGTTTCATTGTTATCAGCAATCTCAGCGTGTGGACAGATGGGTGACTATCTACTTGGGAAATGCTGCCACAGTTATTCTCTAAGAAA
TGGGTATGAAGCTTGGGATAACATTTGTAATGCAATGATTGACATGTATATGAAGTGTGGAAAACAAGAAATGGCCTACAGAGTTTTTGACAGTATGTCAAATAAGACCA
TTGTGTCGTGGAACTCATTACTTGTTGGTTATGTTAGAAATAGAGATTTAGAGTCAGCTAGAAAGATATTCAACGAGATGCTTAAAAAGGACATAGTGTCTTGGAACACA
ATGGTTAATGCTTTGGTTCAAGAGAGCATGTTTGAGGAAGCAATTGAGCTCTTCCGGGAGATGCAAACAAAGGAAATAGAGGCAGACAGGGTAACAATGGTAGAACTTGC
ATCTGCATGTGGATATCTCGGAGCTCTCGAGCTCGCCAAGTGGATATATGCCTATATTGTAAAGAACAACATCTACTGTGATATGTTGCTTGAGACAGCATTAGTTGATA
TGTTTGCTAGGTGTGGCGACTCTCATAGTGCGATGAAACTGTTCGACAATATGGATAGAAAAGACGTCTCTGCATGGACAGCAGCCATTGGAGCAATGGCTGTGGATGGG
AATGGAGAACGAGCTATAGAACTTTACAATGAGATGCTAAGGCAAGGGGTGAAACCGGATCCAGTAGTTTTTGTGAACATATTAACAGCTTGTAGCCATGTTGGTTTTGT
GGAACAAGGGCAGCACATATTCGAGTCAATGAAGCAATATGGAATCTCCCCACAGATTGTTCATTATGGTTGCATGGTTGATCTATTAGGTCGTGCAGGTAAGTTAGAAG
AAGCCCTAGATATTATAAAGAGCATGCCAATGAAACCCAATGGGATTATATGGGGATCTCTATTGGCTGCATGTCGTACCCATAAAAATGTCGAAATGGCGAAATTTGCA
GCTGAGAGGTTAGCAGAAGTGGCTCCAGAAAGGACTGGGATTCATGTACTTCTATCAAACATATATGCTTCAGCTGAAAAGTGGGCTGATGTTGCTAATGTAAGGCTACA
GTTGAAGGAGAAAGGAGTTCAGAAAATGCCTGGTTCGAGCTCGATAGAAGTCGATGGAGTTGTTCATGAGTTTACTTCAGGTGACAGATCACACCCAGAAAACTGTGACA
TTGACATGATGTTAAAAGAAATTACCAACAGGCTTGGGGATGCTGGTTATGTTCCTGATTTAACCAATGTTCTGCTTGACGTAAACGAGCAAGAGAAACAATATCTACTC
AATCGGCATAGCGAGAAGCTGGCCATGGCTTACGGGCTCATAAGTACAGAAAAGCACATGCCGATTCGTGTTATTAAGAATCTCCGAATGTGCTCAGACTGCCATTCATT
CACCAAATACATTTCAAAAGTGTATGACAGGGAAATAACAGTACGAGATAATAATAGGTTTCACTTCTTTCGACAAGGGTCTTGTTCATGCGGTGATTATTGGTAA
Protein sequenceShow/hide protein sequence
MAANLHPTISASPNFIIPTAQNDSKHTPPHHFQIGLFKNCKTMDELRQLHSYASKQGRIRKQSTVTKLISTCVEMGTLESLGYARKAFELFQEDEELNVTVFMYNSLIRG
YSAAGLCEEAISLYIQMIEVGFMPDNFTFPFMLSACAKTAAFVEGAQLHGALMKIGLEGDMFVANSLIHLYAEAGEFSFARKVFDGMLERNVVSWTSLICGYARTDSSSE
AVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIHAYIEESEMELNTHMVNALVDMYMKCGETGAAERLYDECVDKNLVLCNTIMSNFARNGLSNEVLAVLVDVF
RVDLRPDRVSLLSAISACGQMGDYLLGKCCHSYSLRNGYEAWDNICNAMIDMYMKCGKQEMAYRVFDSMSNKTIVSWNSLLVGYVRNRDLESARKIFNEMLKKDIVSWNT
MVNALVQESMFEEAIELFREMQTKEIEADRVTMVELASACGYLGALELAKWIYAYIVKNNIYCDMLLETALVDMFARCGDSHSAMKLFDNMDRKDVSAWTAAIGAMAVDG
NGERAIELYNEMLRQGVKPDPVVFVNILTACSHVGFVEQGQHIFESMKQYGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNVEMAKFA
AERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVVHEFTSGDRSHPENCDIDMMLKEITNRLGDAGYVPDLTNVLLDVNEQEKQYLL
NRHSEKLAMAYGLISTEKHMPIRVIKNLRMCSDCHSFTKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW