; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G24470 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G24470
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr1:19784116..19786366
RNA-Seq ExpressionCSPI01G24470
SyntenyCSPI01G24470
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573373.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0082.29Show/hide
Query:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL
        M HL RSKPI     F NFPATQSRLLNTLS LFSRC S Q LQQIHARF+LHGFHQNPTLS KLIDCYAN GLLN S  VF S+IDPN  L+NAILRNL
Subjt:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL

Query:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE
        TR+GE ERTLLVY++MVAKSMHPDE+TYPFVLRSC   SNV FG+ IHG L+KLG D +D V T L EMYE+CI+FENAHQLFDK SVKDL   SSL TE
Subjt:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE

Query:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA
         PQN NG+ I R+FGRM++E LV DS TF NLLR ++GL+SIQLAKIVHCIAIVS L GDLLV+TAVLSLYSKL SLVDARKLF+K+PEKDRVVWNIMIA
Subjt:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI
        AYAREG+P ECLELF+SMARSGIR+DLFTALPVISSI+QLK  DWGKQTHA+ILRNGSDSQVSVHNSLIDMYCEC  LDSACKIFN +T+K+VISWSAMI
Subjt:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN
        KG VK+G  L ALSLF +MKSDGIQADF+ +INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCG I+MAQRLFEEE++DDKDLIMWN
Subjt:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR
        SMISAHANHGDWSQCF LYN+MKCSNS PDQVTFLGLLTACVNSGLVEKGKEFFKEM ESY CQPSQEHYACMVNLLGRAGLI+EAGELV+NMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMRSFLR+KGLKK PGCSWLEING V EFRVAD+THPRA DIY ILG
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG

Query:  NLELEIKEVREKSPDTL
        NLEL+IKE +E SP+ L
Subjt:  NLELEIKEVREKSPDTL

XP_004145299.2 pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sativus]0.0e+0099.84Show/hide
Query:  MVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFG
        MVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFG
Subjt:  MVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFG

Query:  RMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF
        RM AEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF
Subjt:  RMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF

Query:  KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSL
        KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSL
Subjt:  KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSL

Query:  FSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQC
        FSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQC
Subjt:  FSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQC

Query:  FKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGS
        FKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGS
Subjt:  FKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGS

Query:  KLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPD
        KLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPD
Subjt:  KLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPD

Query:  TLVNPLL
        TLVNPLL
Subjt:  TLVNPLL

XP_008444579.1 PREDICTED: pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucumis melo]0.0e+0094.32Show/hide
Query:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL
        MLHL RSKPIIH+PI LNFPATQSRLLNTLSLLF+RCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLL HSLQVFCS+IDPNLTLFNAILRNL
Subjt:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL

Query:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE
        TRYGESER LLVYQQMVAKSMHPDEETYPF+ RSCSSFSNVGFGRTIHGYLVKLGFD FDVVATALAEMYE+ I FENAHQLFDKRSVKDLGW SSLTTE
Subjt:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE

Query:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA
        G QN NGEGIFRVF RMRAEQLVPDS TF NLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLV TAVLSLYSKLRSLVDAR+LFDKMPEKDRVVWNIMIA
Subjt:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI
        AYAREGKP ECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECK+LDSAC IFNWMTDKSVISWSAMI
Subjt:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN
        KGYVKNGQSLTA SLFSKMKSDGIQADFV MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEE+IDDKDLIMWN
Subjt:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR
        SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGL+EKGKEFFKEMTESYGC PSQEH+ACMVNLLGRAGLISEAGELV+NMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG
        VWGPLLSACKMHPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKW+ VAKMRSFLRNKGLKK PGCS LEING VTEFRVADQTHPRA DIYTILG
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG

Query:  NLELEIKEVREKSPDTLVNPLL
        NLELEIKEVREKS DTLVNPLL
Subjt:  NLELEIKEVREKSPDTLVNPLL

XP_022139869.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Momordica charantia]0.0e+0082.85Show/hide
Query:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL
        MLHL RSKPI     F NFPATQSR LNTLS LFSRC+S Q L+QIHARFILHG HQNP LS +LID YANLGLL  S QVF S+IDP  TL++AILRNL
Subjt:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL

Query:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE
        + +GE ERTLLVY++M AKSMHPDEETYP VLRSC   SNV +GR IHG+LVKLG DL+D  ATALAEMY +CI FEN H LFDK  +KD    +SL +E
Subjt:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE

Query:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA
          QN NG+ IF++FGRMR EQLV DS TF NLLR I GLNSIQLAKIVHC+AI S L GDLLVNTAVLSLYSKL  LV+ARKLFDKMPEKDRVVWNIMIA
Subjt:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI
        AY REG P ECLELFKSMARSGIR+DLFTALPVISSI+QLKCVDWGKQTHAH LRNGSD+QVSVHNSLIDMYCE  ILDSACKIF+WMT+K+VISWSAMI
Subjt:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN
        KG VK+GQSL ALSLFS+MKSDGIQADF+ +INILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEE++DDKDLIMWN
Subjt:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR
        SMISAHANHGDWSQCFK+YN+MKCSNS+PDQVTFLGLLTACVNSGLVEKGKE FKEM E+YGCQPSQEHYACMVNLLGRAGLI++AG LV+NMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMRSFLR+KGLKK PGCSWLEINGHVTEFRVAD+THPRA DIYTILG
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG

Query:  NLELEIKEVREKSPDTL
        NLELEIKE REKSP+ L
Subjt:  NLELEIKEVREKSPDTL

XP_038894029.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benincasa hispida]0.0e+0088.21Show/hide
Query:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL
        MLHL RSKP+IHS IF NFPATQSRLLNTLS LF RC+S QHL+QIHARF+LHGFHQNPTLSSKLIDCYANLGLLN SLQVF S+ +PN T++NAILRNL
Subjt:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL

Query:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE
        TRYGE ERTLLVY+QMVAKSMHPDEETYP VLRSC SFSNVG GR IHGYLVKLGFD FD+VATAL EMYEECI+FE+AHQLFDKRSVKDL   SS TTE
Subjt:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE

Query:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA
         PQN NGEGIF VFGRMR EQLV DS TF NLLRFIAG NSIQLAKIVHCIAIVSKL GDLLVNTAVLSLYSKL SLVDARKLFDKMPE DRVVWNIMIA
Subjt:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI
        AYAREGKPTECL LFKSMARSGIRSD+FTALPVISSI+QLK  DWGKQTHA+ILRNGSDSQVSV+NSLIDMYCEC ILDSACKIFNWM DK+VISWSAMI
Subjt:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN
        KGYVK+G SL ALSLFS MKSDGIQ+DF+ +INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQR+FEEE+IDDKDLIMWN
Subjt:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR
        SMISAHANHGDWSQCFKLYN+MKCSN+KPDQVTFLGLLTACVNSGLVEKGKEF KEMTE+YGCQPSQEHYACMVNLLGRAGLI+EAGELV+NMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKL++MEP+NAGNYILLSNIYAAAGKWD VAKMRSFLR+KGLKK PGCSWLEINGHVTEFRVADQTHPRA DIYTILG
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG

Query:  NLELEIKEVREKSPDTLVNPL
        NLELEIKE REKS + L NPL
Subjt:  NLELEIKEVREKSPDTLVNPL

TrEMBL top hitse value%identityAlignment
A0A0A0M0Z6 Uncharacterized protein0.0e+0099.86Show/hide
Query:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL
        MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL
Subjt:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL

Query:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE
        TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE
Subjt:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE

Query:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA
        GPQNDNGEGIFRVFGRM AEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA
Subjt:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI
        AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI
Subjt:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN
        KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN
Subjt:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR
        SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG
        VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG

Query:  NLELEIKEVREKSPDTLVNPLL
        NLELEIKEVREKSPDTLVNPLL
Subjt:  NLELEIKEVREKSPDTLVNPLL

A0A1S3BBG7 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0094.32Show/hide
Query:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL
        MLHL RSKPIIH+PI LNFPATQSRLLNTLSLLF+RCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLL HSLQVFCS+IDPNLTLFNAILRNL
Subjt:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL

Query:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE
        TRYGESER LLVYQQMVAKSMHPDEETYPF+ RSCSSFSNVGFGRTIHGYLVKLGFD FDVVATALAEMYE+ I FENAHQLFDKRSVKDLGW SSLTTE
Subjt:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE

Query:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA
        G QN NGEGIFRVF RMRAEQLVPDS TF NLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLV TAVLSLYSKLRSLVDAR+LFDKMPEKDRVVWNIMIA
Subjt:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI
        AYAREGKP ECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECK+LDSAC IFNWMTDKSVISWSAMI
Subjt:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN
        KGYVKNGQSLTA SLFSKMKSDGIQADFV MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEE+IDDKDLIMWN
Subjt:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR
        SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGL+EKGKEFFKEMTESYGC PSQEH+ACMVNLLGRAGLISEAGELV+NMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG
        VWGPLLSACKMHPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKW+ VAKMRSFLRNKGLKK PGCS LEING VTEFRVADQTHPRA DIYTILG
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG

Query:  NLELEIKEVREKSPDTLVNPLL
        NLELEIKEVREKS DTLVNPLL
Subjt:  NLELEIKEVREKSPDTLVNPLL

A0A5D3DB69 Pentatricopeptide repeat-containing protein0.0e+0094.32Show/hide
Query:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL
        MLHL RSKPIIH+PI LNFPATQSRLLNTLSLLF+RCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLL HSLQVFCS+IDPNLTLFNAILRNL
Subjt:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL

Query:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE
        TRYGESER LLVYQQMVAKSMHPDEETYPF+ RSCSSFSNVGFGRTIHGYLVKLGFD FDVVATALAEMYE+ I FENAHQLFDKRSVKDLGW SSLTTE
Subjt:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE

Query:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA
        G QN NGEGIFRVF RMRAEQLVPDS TF NLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLV TAVLSLYSKLRSLVDAR+LFDKMPEKDRVVWNIMIA
Subjt:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI
        AYAREGKP ECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECK+LDSAC IFNWMTDKSVISWSAMI
Subjt:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN
        KGYVKNGQSLTA SLFSKMKSDGIQADFV MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEE+IDDKDLIMWN
Subjt:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR
        SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGL+EKGKEFFKEMTESYGC PSQEH+ACMVNLLGRAGLISEAGELV+NMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG
        VWGPLLSACKMHPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKW+ VAKMRSFLRNKGLKK PGCS LEING VTEFRVADQTHPRA DIYTILG
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG

Query:  NLELEIKEVREKSPDTLVNPLL
        NLELEIKEVREKS DTLVNPLL
Subjt:  NLELEIKEVREKSPDTLVNPLL

A0A6J1CE61 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0082.85Show/hide
Query:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL
        MLHL RSKPI     F NFPATQSR LNTLS LFSRC+S Q L+QIHARFILHG HQNP LS +LID YANLGLL  S QVF S+IDP  TL++AILRNL
Subjt:  MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNL

Query:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE
        + +GE ERTLLVY++M AKSMHPDEETYP VLRSC   SNV +GR IHG+LVKLG DL+D  ATALAEMY +CI FEN H LFDK  +KD    +SL +E
Subjt:  TRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTE

Query:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA
          QN NG+ IF++FGRMR EQLV DS TF NLLR I GLNSIQLAKIVHC+AI S L GDLLVNTAVLSLYSKL  LV+ARKLFDKMPEKDRVVWNIMIA
Subjt:  GPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI
        AY REG P ECLELFKSMARSGIR+DLFTALPVISSI+QLKCVDWGKQTHAH LRNGSD+QVSVHNSLIDMYCE  ILDSACKIF+WMT+K+VISWSAMI
Subjt:  AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN
        KG VK+GQSL ALSLFS+MKSDGIQADF+ +INILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEE++DDKDLIMWN
Subjt:  KGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR
        SMISAHANHGDWSQCFK+YN+MKCSNS+PDQVTFLGLLTACVNSGLVEKGKE FKEM E+YGCQPSQEHYACMVNLLGRAGLI++AG LV+NMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMRSFLR+KGLKK PGCSWLEINGHVTEFRVAD+THPRA DIYTILG
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILG

Query:  NLELEIKEVREKSPDTL
        NLELEIKE REKSP+ L
Subjt:  NLELEIKEVREKSPDTL

A0A6J1K3Q8 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0081.97Show/hide
Query:  MLHLHRSKPIIHSPIFL----NFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAI
        M HL RSK I  SPIF     NFPATQSRLLNTLS LFSRC S Q L+QIHARF+LHGFHQNPTLS KLIDCYAN GLLN S  VF S+IDPN TL+NAI
Subjt:  MLHLHRSKPIIHSPIFL----NFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAI

Query:  LRNLTRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSS
        LRNLTR+GE ERTLLVY++MVAKSMHPDE+TYPFVL+SC   SNV FG+ IHG L+KLG D +D V T LAEMY +CI+FENAHQLFDK SVKDL   SS
Subjt:  LRNLTRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSS

Query:  LTTEGPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWN
        L +E PQN NG+ I  + GRM++E LV DS TF NLLR I+GL+SIQLAKIVHCIAIVS L GDLLV+TAVLSLYSKL SLVDARKLF+KMPEKDRVVWN
Subjt:  LTTEGPQNDNGEGIFRVFGRMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWN

Query:  IMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISW
        IMIAAYAREG+P ECLELF+SMARSGIR+DLFTALPVISSI+QLKC DWGKQTHA+ILRNGSDSQVSVHNSLIDMYCEC  L+SACKIFN +T+K+VISW
Subjt:  IMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISW

Query:  SAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDL
        SAMIKG VK+G  L ALSLF  MKSDGIQADF+ +INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCG IEMAQRLFEEE+++DKDL
Subjt:  SAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIK
        IMWNSMISAHANHGDWSQCFKLYN+MKCSNS PDQVTFLGLLTACVNSGLVEKGKEFFKEM ESY CQPSQEHYACMVNLLGRAGLI+EAGELV+NMPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIK

Query:  PDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIY
        PDARVWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMRSFLR+KGLKK PGCSWLEING V EFRVAD+THPRA DIY
Subjt:  PDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIY

Query:  TILGNLELEIKEVREKSPDTL
         ILGNLEL+IKE +E SP+ L
Subjt:  TILGNLELEIKEVREKSPDTL

SwissProt top hitse value%identityAlignment
O81767 Pentatricopeptide repeat-containing protein At4g339901.9e-10932.85Show/hide
Query:  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQ-MVAKSM
        +S+ ++ +  LF  C ++Q  + +HAR ++    QN  +S+KL++ Y  LG +  +   F  + + ++  +N ++    R G S   +  +   M++  +
Subjt:  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQ-MVAKSM

Query:  HPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDV-VATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAE
         PD  T+P VL++C +  +   G  IH   +K GF ++DV VA +L  +Y       NA  LFD+  V+D+G  +++ +   Q+ N +    +   +RA 
Subjt:  HPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDV-VATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAE

Query:  QLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMAR
            DS T  +LL              +H  +I   L  +L V+  ++ LY++   L D +K+FD+M  +D + WN +I AY    +P   + LF+ M  
Subjt:  QLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMAR

Query:  SGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNG-SDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKM
        S I+ D  T + + S ++QL  +   +      LR G     +++ N+++ MY +  ++DSA  +FNW+ +  VISW+ +I GY +NG +  A+ +++ M
Subjt:  SGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNG-SDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKM

Query:  KSDG-IQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKL
        + +G I A+    +++LPA    GAL     LHG  +K GL     + T+L   Y KCG +E A  LF +  I   + + WN++I+ H  HG   +   L
Subjt:  KSDG-IQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKL

Query:  YNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLA
        +  M     KPD +TF+ LL+AC +SGLV++G+  F+ M   YG  PS +HY CMV++ GRAG +  A + +K+M ++PDA +WG LLSAC++H    L 
Subjt:  YNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLA

Query:  EFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEV
        + A+E L  +EP + G ++LLSN+YA+AGKW+GV ++RS    KGL+K PG S +E++  V  F   +QTHP   ++Y  L  L+ ++K +
Subjt:  EFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEV

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic1.0e-11532.6Show/hide
Query:  SLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPF
        +LL  RC+S++ L+QI      +G +Q     +KL+  +   G ++ + +VF  +      L++ +L+   +  + ++ L  + +M    + P    + +
Subjt:  SLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPF

Query:  VLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAEQLVPDSFTFF
        +L+ C   + +  G+ IHG LVK GF L     T L  MY +C +   A ++FD+   +DL   +++     QN        +   M  E L P   T  
Subjt:  VLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAEQLVPDSFTFF

Query:  NLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTA
        ++L  ++ L  I + K +H  A+ S     + ++TA++ +Y+K  SL  AR+LFD M E++ V WN MI AY +   P E + +F+ M   G++    + 
Subjt:  NLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTA

Query:  LPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVI
        +  + + A L  ++ G+  H   +  G D  VSV NSLI MYC+CK +D+A  +F  +  ++++SW+AMI G+ +NG+ + AL+ FS+M+S  ++ D   
Subjt:  LPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVI

Query:  MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPD
         ++++ A   +    + K++HG  M+  L     + TAL+  YAKCG+I +A+ +F  + + ++ +  WN+MI  +  HG      +L+  M+    KP+
Subjt:  MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPD

Query:  QVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEP
         VTFL +++AC +SGLVE G + F  M E+Y  + S +HY  MV+LLGRAG ++EA + +  MP+KP   V+G +L AC++H     AE AAE+L  + P
Subjt:  QVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEP

Query:  RNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKE
         + G ++LL+NIY AA  W+ V ++R  +  +GL+K PGCS +EI   V  F      HP +  IY  L  L   IKE
Subjt:  RNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKE

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic6.2e-12136.5Show/hide
Query:  TLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLID-CYANLGL--LNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDE
        +LSLL + C ++Q L+ IHA+ I  G H      SKLI+ C  +     L +++ VF ++ +PNL ++N + R      +    L +Y  M++  + P+ 
Subjt:  TLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLID-CYANLGL--LNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDE

Query:  ETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAEQLVPD
         T+PFVL+SC+       G+ IHG+++KLG DL   V T+L  MY +    E+AH++FDK   +                                    
Subjt:  ETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAEQLVPD

Query:  SFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRS
                                          D++  TA++  Y+    + +A+KLFD++P KD V WN MI+ YA  G   E LELFK M ++ +R 
Subjt:  SFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRS

Query:  DLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQ
        D  T + V+S+ AQ   ++ G+Q H  I  +G  S + + N+LID+Y +C  L++AC +F  +  K VISW+ +I GY        AL LF +M   G  
Subjt:  DLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQ

Query:  ADFVIMINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMK
         + V M++ILPA  H+GA++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    I  K L  WN+MI   A HG     F L++RM+
Subjt:  ADFVIMINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMK

Query:  CSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAE
            +PD +TF+GLL+AC +SG+++ G+  F+ MT+ Y   P  EHY CM++LLG +GL  EA E++  M ++PD  +W  LL ACKMH   +L E  AE
Subjt:  CSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAE

Query:  KLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKE
         LI +EP N G+Y+LLSNIYA+AG+W+ VAK R+ L +KG+KK+PGCS +EI+  V EF + D+ HPR  +IY +L  +E+ +++
Subjt:  KLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKE

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226901.5e-11132.05Show/hide
Query:  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGL---LNHSLQVF-CSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVA
        QS+           C +I  L+  H      G   + +  +KL+     LG    L+ + +VF  S       ++N+++R     G     +L++ +M+ 
Subjt:  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGL---LNHSLQVF-CSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVA

Query:  KSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGF--DLFDVVATALAEMYEECIEFENAHQLFDKRSVKD-LGWPSSLTTEGPQNDNGEGIFRVFG
          + PD+ T+PF L +C+     G G  IHG +VK+G+  DLF  V  +L   Y EC E ++A ++FD+ S ++ + W S +     ++   + +   F 
Subjt:  KSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGF--DLFDVVATALAEMYEECIEFENAHQLFDKRSVKD-LGWPSSLTTEGPQNDNGEGIFRVFG

Query:  RMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF
         +R E++ P+S T   ++   A L  ++  + V+     S +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R+G   E L +F
Subjt:  RMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF

Query:  KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQ-------
          M  SG+R D  + L  ISS +QL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C   D+A +IF+ M++K+V++W++++ GYV+NG+       
Subjt:  KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQ-------

Query:  ----------------------SL--TALSLFSKMKS-DGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQ
                              SL   A+ +F  M+S +G+ AD V M++I  A  H+GAL+  K+++ Y  K G+     L T L+  +++CG  E A 
Subjt:  ----------------------SL--TALSLFSKMKS-DGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQ

Query:  RLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLI
         +F    + ++D+  W + I A A  G+  +  +L++ M     KPD V F+G LTAC + GLV++GKE F  M + +G  P   HY CMV+LLGRAGL+
Subjt:  RLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLI

Query:  SEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFR
         EA +L+++MP++P+  +W  LL+AC++    ++A +AAEK+  + P   G+Y+LLSN+YA+AG+W+ +AK+R  ++ KGL+K PG S ++I G   EF 
Subjt:  SEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFR

Query:  VADQTHPRAGDIYTILGNLELEIKEVREKS
          D++HP        + N+E  + EV +++
Subjt:  VADQTHPRAGDIYTILGNLELEIKEVREKS

Q9SVP7 Pentatricopeptide repeat-containing protein At4g136503.2e-10931.48Show/hide
Query:  SLLFSRCNSIQHL---QQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEET
        S + S C  I+ L   +Q+H   +  GF  +  + + L+  Y +LG L  +  +F ++   +   +N ++  L++ G  E+ + ++++M    + PD  T
Subjt:  SLLFSRCNSIQHL---QQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEET

Query:  YPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLG-WPSSLTTEGPQNDNGEGIFRVFGRMRAEQLVPDS
           ++ +CS+   +  G+ +H Y  KLGF   + +  AL  +Y +C + E A   F +  V+++  W   L   G  +D     FR+F +M+ E++VP+ 
Subjt:  YPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLG-WPSSLTTEGPQNDNGEGIFRVFGRMRAEQLVPDS

Query:  FTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRSD
        +T+ ++L+    L  ++L + +H   I +    +  V + ++ +Y+KL  L  A  +  +   KD V W  MIA Y +     + L  F+ M   GIRSD
Subjt:  FTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRSD

Query:  LFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQA
               +S+ A L+ +  G+Q HA    +G  S +   N+L+ +Y  C  ++ +   F        I+W+A++ G+ ++G +  AL +F +M  +GI  
Subjt:  LFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQA

Query:  DFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSN
        +     + + A      ++  K +H    K G  S   +  AL+  YAKCGSI  A++ F E  +  K+ + WN++I+A++ HG  S+    +++M  SN
Subjt:  DFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSN

Query:  SKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLI
         +P+ VT +G+L+AC + GLV+KG  +F+ M   YG  P  EHY C+V++L RAGL+S A E ++ MPIKPDA VW  LLSAC +H   ++ EFAA  L+
Subjt:  SKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLI

Query:  NMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEV
         +EP ++  Y+LLSN+YA + KWD     R  ++ KG+KK PG SW+E+   +  F V DQ HP A +I+    +L     E+
Subjt:  NMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEV

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.4e-12236.5Show/hide
Query:  TLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLID-CYANLGL--LNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDE
        +LSLL + C ++Q L+ IHA+ I  G H      SKLI+ C  +     L +++ VF ++ +PNL ++N + R      +    L +Y  M++  + P+ 
Subjt:  TLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLID-CYANLGL--LNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDE

Query:  ETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAEQLVPD
         T+PFVL+SC+       G+ IHG+++KLG DL   V T+L  MY +    E+AH++FDK   +                                    
Subjt:  ETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAEQLVPD

Query:  SFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRS
                                          D++  TA++  Y+    + +A+KLFD++P KD V WN MI+ YA  G   E LELFK M ++ +R 
Subjt:  SFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRS

Query:  DLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQ
        D  T + V+S+ AQ   ++ G+Q H  I  +G  S + + N+LID+Y +C  L++AC +F  +  K VISW+ +I GY        AL LF +M   G  
Subjt:  DLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQ

Query:  ADFVIMINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMK
         + V M++ILPA  H+GA++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    I  K L  WN+MI   A HG     F L++RM+
Subjt:  ADFVIMINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMK

Query:  CSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAE
            +PD +TF+GLL+AC +SG+++ G+  F+ MT+ Y   P  EHY CM++LLG +GL  EA E++  M ++PD  +W  LL ACKMH   +L E  AE
Subjt:  CSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAE

Query:  KLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKE
         LI +EP N G+Y+LLSNIYA+AG+W+ VAK R+ L +KG+KK+PGCS +EI+  V EF + D+ HPR  +IY +L  +E+ +++
Subjt:  KLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKE

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein7.3e-11732.6Show/hide
Query:  SLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPF
        +LL  RC+S++ L+QI      +G +Q     +KL+  +   G ++ + +VF  +      L++ +L+   +  + ++ L  + +M    + P    + +
Subjt:  SLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPF

Query:  VLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAEQLVPDSFTFF
        +L+ C   + +  G+ IHG LVK GF L     T L  MY +C +   A ++FD+   +DL   +++     QN        +   M  E L P   T  
Subjt:  VLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAEQLVPDSFTFF

Query:  NLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTA
        ++L  ++ L  I + K +H  A+ S     + ++TA++ +Y+K  SL  AR+LFD M E++ V WN MI AY +   P E + +F+ M   G++    + 
Subjt:  NLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTA

Query:  LPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVI
        +  + + A L  ++ G+  H   +  G D  VSV NSLI MYC+CK +D+A  +F  +  ++++SW+AMI G+ +NG+ + AL+ FS+M+S  ++ D   
Subjt:  LPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVI

Query:  MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPD
         ++++ A   +    + K++HG  M+  L     + TAL+  YAKCG+I +A+ +F  + + ++ +  WN+MI  +  HG      +L+  M+    KP+
Subjt:  MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPD

Query:  QVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEP
         VTFL +++AC +SGLVE G + F  M E+Y  + S +HY  MV+LLGRAG ++EA + +  MP+KP   V+G +L AC++H     AE AAE+L  + P
Subjt:  QVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEP

Query:  RNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKE
         + G ++LL+NIY AA  W+ V ++R  +  +GL+K PGCS +EI   V  F      HP +  IY  L  L   IKE
Subjt:  RNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKE

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)1.1e-11232.05Show/hide
Query:  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGL---LNHSLQVF-CSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVA
        QS+           C +I  L+  H      G   + +  +KL+     LG    L+ + +VF  S       ++N+++R     G     +L++ +M+ 
Subjt:  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGL---LNHSLQVF-CSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVA

Query:  KSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGF--DLFDVVATALAEMYEECIEFENAHQLFDKRSVKD-LGWPSSLTTEGPQNDNGEGIFRVFG
          + PD+ T+PF L +C+     G G  IHG +VK+G+  DLF  V  +L   Y EC E ++A ++FD+ S ++ + W S +     ++   + +   F 
Subjt:  KSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGF--DLFDVVATALAEMYEECIEFENAHQLFDKRSVKD-LGWPSSLTTEGPQNDNGEGIFRVFG

Query:  RMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF
         +R E++ P+S T   ++   A L  ++  + V+     S +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R+G   E L +F
Subjt:  RMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF

Query:  KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQ-------
          M  SG+R D  + L  ISS +QL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C   D+A +IF+ M++K+V++W++++ GYV+NG+       
Subjt:  KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQ-------

Query:  ----------------------SL--TALSLFSKMKS-DGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQ
                              SL   A+ +F  M+S +G+ AD V M++I  A  H+GAL+  K+++ Y  K G+     L T L+  +++CG  E A 
Subjt:  ----------------------SL--TALSLFSKMKS-DGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQ

Query:  RLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLI
         +F    + ++D+  W + I A A  G+  +  +L++ M     KPD V F+G LTAC + GLV++GKE F  M + +G  P   HY CMV+LLGRAGL+
Subjt:  RLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLI

Query:  SEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFR
         EA +L+++MP++P+  +W  LL+AC++    ++A +AAEK+  + P   G+Y+LLSN+YA+AG+W+ +AK+R  ++ KGL+K PG S ++I G   EF 
Subjt:  SEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFR

Query:  VADQTHPRAGDIYTILGNLELEIKEVREKS
          D++HP        + N+E  + EV +++
Subjt:  VADQTHPRAGDIYTILGNLELEIKEVREKS

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.1e-11232.05Show/hide
Query:  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGL---LNHSLQVF-CSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVA
        QS+           C +I  L+  H      G   + +  +KL+     LG    L+ + +VF  S       ++N+++R     G     +L++ +M+ 
Subjt:  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGL---LNHSLQVF-CSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVA

Query:  KSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGF--DLFDVVATALAEMYEECIEFENAHQLFDKRSVKD-LGWPSSLTTEGPQNDNGEGIFRVFG
          + PD+ T+PF L +C+     G G  IHG +VK+G+  DLF  V  +L   Y EC E ++A ++FD+ S ++ + W S +     ++   + +   F 
Subjt:  KSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGF--DLFDVVATALAEMYEECIEFENAHQLFDKRSVKD-LGWPSSLTTEGPQNDNGEGIFRVFG

Query:  RMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF
         +R E++ P+S T   ++   A L  ++  + V+     S +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R+G   E L +F
Subjt:  RMRAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF

Query:  KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQ-------
          M  SG+R D  + L  ISS +QL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C   D+A +IF+ M++K+V++W++++ GYV+NG+       
Subjt:  KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQ-------

Query:  ----------------------SL--TALSLFSKMKS-DGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQ
                              SL   A+ +F  M+S +G+ AD V M++I  A  H+GAL+  K+++ Y  K G+     L T L+  +++CG  E A 
Subjt:  ----------------------SL--TALSLFSKMKS-DGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQ

Query:  RLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLI
         +F    + ++D+  W + I A A  G+  +  +L++ M     KPD V F+G LTAC + GLV++GKE F  M + +G  P   HY CMV+LLGRAGL+
Subjt:  RLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLI

Query:  SEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFR
         EA +L+++MP++P+  +W  LL+AC++    ++A +AAEK+  + P   G+Y+LLSN+YA+AG+W+ +AK+R  ++ KGL+K PG S ++I G   EF 
Subjt:  SEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFR

Query:  VADQTHPRAGDIYTILGNLELEIKEVREKS
          D++HP        + N+E  + EV +++
Subjt:  VADQTHPRAGDIYTILGNLELEIKEVREKS

AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-11032.85Show/hide
Query:  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQ-MVAKSM
        +S+ ++ +  LF  C ++Q  + +HAR ++    QN  +S+KL++ Y  LG +  +   F  + + ++  +N ++    R G S   +  +   M++  +
Subjt:  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQ-MVAKSM

Query:  HPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDV-VATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAE
         PD  T+P VL++C +  +   G  IH   +K GF ++DV VA +L  +Y       NA  LFD+  V+D+G  +++ +   Q+ N +    +   +RA 
Subjt:  HPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDV-VATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAE

Query:  QLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMAR
            DS T  +LL              +H  +I   L  +L V+  ++ LY++   L D +K+FD+M  +D + WN +I AY    +P   + LF+ M  
Subjt:  QLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMAR

Query:  SGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNG-SDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKM
        S I+ D  T + + S ++QL  +   +      LR G     +++ N+++ MY +  ++DSA  +FNW+ +  VISW+ +I GY +NG +  A+ +++ M
Subjt:  SGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNG-SDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKM

Query:  KSDG-IQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKL
        + +G I A+    +++LPA    GAL     LHG  +K GL     + T+L   Y KCG +E A  LF +  I   + + WN++I+ H  HG   +   L
Subjt:  KSDG-IQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKL

Query:  YNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLA
        +  M     KPD +TF+ LL+AC +SGLV++G+  F+ M   YG  PS +HY CMV++ GRAG +  A + +K+M ++PDA +WG LLSAC++H    L 
Subjt:  YNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLA

Query:  EFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEV
        + A+E L  +EP + G ++LLSN+YA+AGKW+GV ++RS    KGL+K PG S +E++  V  F   +QTHP   ++Y  L  L+ ++K +
Subjt:  EFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCACCTTCATCGATCAAAGCCCATTATTCATAGTCCCATTTTCCTCAACTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCCTCCTTTTCAGTCGATG
CAACTCCATTCAACACCTCCAGCAAATTCATGCCAGGTTCATCCTCCACGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATCGATTGCTATGCCAATCTTGGAC
TCCTCAATCACTCTCTCCAAGTTTTCTGCTCTGTAATCGACCCCAATTTGACTCTTTTCAACGCCATACTGAGAAATTTGACAAGATATGGAGAATCCGAGCGGACCCTG
TTGGTGTATCAACAAATGGTCGCCAAATCTATGCACCCAGATGAAGAAACTTACCCTTTTGTTTTGCGATCATGTTCTTCTTTTTCAAATGTTGGATTTGGGAGGACGAT
TCATGGGTATTTGGTTAAGCTGGGTTTTGATTTGTTTGATGTTGTAGCGACTGCTCTGGCTGAGATGTATGAGGAATGCATTGAATTTGAGAATGCTCATCAACTGTTTG
ATAAAAGATCTGTGAAGGATTTGGGATGGCCGAGTTCCTTGACTACGGAGGGTCCTCAAAATGATAACGGGGAGGGAATTTTTCGGGTTTTTGGAAGAATGAGAGCAGAA
CAATTAGTACCAGACTCATTCACATTCTTCAATCTCTTGAGGTTCATTGCAGGTTTGAACTCAATTCAACTTGCAAAGATTGTTCATTGTATTGCAATTGTGAGCAAATT
GAGTGGAGATTTGTTAGTAAATACTGCTGTGTTGTCTCTTTACTCTAAGTTGCGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGCCAGAGAAAGATCGTGTTG
TATGGAATATAATGATAGCAGCTTATGCTCGGGAAGGTAAACCGACGGAATGCCTTGAGCTCTTCAAGTCCATGGCACGATCAGGGATTAGATCTGATCTATTTACTGCA
CTGCCTGTTATCTCTTCGATTGCACAGTTGAAATGTGTTGATTGGGGGAAACAAACCCATGCCCATATATTGAGGAATGGTTCCGATAGTCAAGTTTCAGTTCATAATTC
TCTCATTGACATGTACTGCGAATGTAAAATTTTAGATTCAGCTTGTAAGATCTTCAACTGGATGACAGACAAGTCTGTAATTTCATGGAGTGCTATGATCAAGGGGTATG
TCAAAAATGGTCAGTCCCTCACTGCATTGTCTCTCTTCTCCAAGATGAAATCTGATGGGATTCAAGCTGATTTTGTTATAATGATCAATATCCTGCCTGCATTTGTTCAC
ATTGGAGCACTTGAAAATGTCAAATATTTACATGGGTACTCAATGAAGCTAGGCCTGACTTCCCTTCCATCACTTAACACAGCCCTTCTGATAACCTATGCAAAATGTGG
GTCCATAGAGATGGCTCAAAGACTATTTGAGGAAGAGAAAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAATCATGGAGACTGGTCCC
AGTGTTTTAAGCTATACAATCGAATGAAGTGCTCAAATTCAAAGCCAGACCAAGTAACATTCTTGGGACTACTAACAGCTTGTGTCAATTCTGGTCTTGTCGAAAAGGGG
AAAGAATTTTTCAAGGAGATGACTGAAAGTTATGGGTGCCAACCAAGCCAAGAGCATTATGCTTGTATGGTTAACCTCTTGGGGAGAGCTGGGCTTATCAGTGAAGCTGG
AGAACTTGTAAAAAACATGCCTATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGATGCATCCTGGGTCCAAGCTTGCAGAGTTCGCAGCCGAGA
AGCTCATTAATATGGAGCCCAGAAATGCAGGGAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGTAAATGGGATGGAGTGGCAAAAATGAGAAGTTTCTTAAGG
AATAAAGGGCTGAAGAAAATCCCTGGTTGTAGTTGGCTGGAGATAAATGGCCATGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGGAGATATATATAC
CATCCTAGGAAACCTTGAACTTGAAATCAAAGAGGTTAGAGAAAAAAGTCCAGATACATTGGTAAATCCTCTTCTATAA
mRNA sequenceShow/hide mRNA sequence
CGAAGTTCTCGAACATCGACTCTTGGTTGAGTTTGAAAAGCCAAACTCCGAGTCCAAAAATGTGCCGCTCATTTCGTCCGACATGCTTCACCTTCATCGATCAAAGCCCA
TTATTCATAGTCCCATTTTCCTCAACTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCCTCCTTTTCAGTCGATGCAACTCCATTCAACACCTCCAGCAAATT
CATGCCAGGTTCATCCTCCACGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATCGATTGCTATGCCAATCTTGGACTCCTCAATCACTCTCTCCAAGTTTTCTG
CTCTGTAATCGACCCCAATTTGACTCTTTTCAACGCCATACTGAGAAATTTGACAAGATATGGAGAATCCGAGCGGACCCTGTTGGTGTATCAACAAATGGTCGCCAAAT
CTATGCACCCAGATGAAGAAACTTACCCTTTTGTTTTGCGATCATGTTCTTCTTTTTCAAATGTTGGATTTGGGAGGACGATTCATGGGTATTTGGTTAAGCTGGGTTTT
GATTTGTTTGATGTTGTAGCGACTGCTCTGGCTGAGATGTATGAGGAATGCATTGAATTTGAGAATGCTCATCAACTGTTTGATAAAAGATCTGTGAAGGATTTGGGATG
GCCGAGTTCCTTGACTACGGAGGGTCCTCAAAATGATAACGGGGAGGGAATTTTTCGGGTTTTTGGAAGAATGAGAGCAGAACAATTAGTACCAGACTCATTCACATTCT
TCAATCTCTTGAGGTTCATTGCAGGTTTGAACTCAATTCAACTTGCAAAGATTGTTCATTGTATTGCAATTGTGAGCAAATTGAGTGGAGATTTGTTAGTAAATACTGCT
GTGTTGTCTCTTTACTCTAAGTTGCGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGCCAGAGAAAGATCGTGTTGTATGGAATATAATGATAGCAGCTTATGC
TCGGGAAGGTAAACCGACGGAATGCCTTGAGCTCTTCAAGTCCATGGCACGATCAGGGATTAGATCTGATCTATTTACTGCACTGCCTGTTATCTCTTCGATTGCACAGT
TGAAATGTGTTGATTGGGGGAAACAAACCCATGCCCATATATTGAGGAATGGTTCCGATAGTCAAGTTTCAGTTCATAATTCTCTCATTGACATGTACTGCGAATGTAAA
ATTTTAGATTCAGCTTGTAAGATCTTCAACTGGATGACAGACAAGTCTGTAATTTCATGGAGTGCTATGATCAAGGGGTATGTCAAAAATGGTCAGTCCCTCACTGCATT
GTCTCTCTTCTCCAAGATGAAATCTGATGGGATTCAAGCTGATTTTGTTATAATGATCAATATCCTGCCTGCATTTGTTCACATTGGAGCACTTGAAAATGTCAAATATT
TACATGGGTACTCAATGAAGCTAGGCCTGACTTCCCTTCCATCACTTAACACAGCCCTTCTGATAACCTATGCAAAATGTGGGTCCATAGAGATGGCTCAAAGACTATTT
GAGGAAGAGAAAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAATCATGGAGACTGGTCCCAGTGTTTTAAGCTATACAATCGAATGAA
GTGCTCAAATTCAAAGCCAGACCAAGTAACATTCTTGGGACTACTAACAGCTTGTGTCAATTCTGGTCTTGTCGAAAAGGGGAAAGAATTTTTCAAGGAGATGACTGAAA
GTTATGGGTGCCAACCAAGCCAAGAGCATTATGCTTGTATGGTTAACCTCTTGGGGAGAGCTGGGCTTATCAGTGAAGCTGGAGAACTTGTAAAAAACATGCCTATCAAA
CCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGATGCATCCTGGGTCCAAGCTTGCAGAGTTCGCAGCCGAGAAGCTCATTAATATGGAGCCCAGAAATGC
AGGGAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGTAAATGGGATGGAGTGGCAAAAATGAGAAGTTTCTTAAGGAATAAAGGGCTGAAGAAAATCCCTGGTT
GTAGTTGGCTGGAGATAAATGGCCATGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGGAGATATATATACCATCCTAGGAAACCTTGAACTTGAAATC
AAAGAGGTTAGAGAAAAAAGTCCAGATACATTGGTAAATCCTCTTCTATAA
Protein sequenceShow/hide protein sequence
MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTL
LVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMRAE
QLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTA
LPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVH
IGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKG
KEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR
NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTLVNPLL