; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi08G001665 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi08G001665
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr8:58881315..58883992
RNA-Seq ExpressionBhi08G001665
SyntenyBhi08G001665
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573373.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0084.66Show/hide
Query:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL
        M HLQRSKP+     FPNFPATQSRLLNTLS LF RC SRQ L+QIHARFVLHGFHQNPTLS KLIDCYAN GLLNLS  VF SI +PNS +YNAILRNL
Subjt:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL

Query:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE
        TR+GE ERTLLVYR+MVAKSMHPDE+TYP VLRSCC  SNV  G+ IHG L+KLG DS+D V T L EMYE+CIDFE+AHQLFDK SVKDL+CWSS  TE
Subjt:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE

Query:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA
        APQNGNG+ I  +FGRM+ E LVTDSLTFINLLR ++G +SIQLAKIVHCIAIVS LCGDLLV+TAVLSLYSKLGSLVDARKLF+K+PE DRVVWNIMIA
Subjt:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA

Query:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI
        AYAREG+P ECL LF+SMARSGIR+D+FTALPVISSISQLK  DWGKQTHA ILRNGSDSQVSV+NSLIDMYCECN LDSACKIFN + +KTVISWSAMI
Subjt:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI

Query:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KG VKHG+ LIALSLF  MKSDGIQ+DFITVINI+PAFV IG LENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQR+FEEER+DDKDLIMWN
Subjt:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCF LYNQMKCSN+ PDQVTFLGLLTACVNSGLVEKGKEF KEM E+Y CQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACKLHPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKWD VAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAEDIY ILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEAREKSLEKL
        NLEL+IKEA+E S EKL
Subjt:  NLELEIKEAREKSLEKL

KAG7012542.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0084.19Show/hide
Query:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL
        M HLQRSKP+     FPNFPATQSRLLNTLS LF RC SRQ L+QIHARFVLHGFHQNPTLS KLIDCYAN GLLNLS  VF SI +PNS +YNAILRNL
Subjt:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL

Query:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE
        TR+GE ERTLLVYR+MVAKSMHPDE+TYP VLRSCC  SNV  G+ IHG L+KLG DS+D V T L EMYE+CIDFE+AHQLFDK SVKDL+CWSS  ++
Subjt:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE

Query:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA
        APQNGNG+ I  +FGRM+ E LVTDSLTFINLLR I+G +SIQLAK+VHCIAIVS LCGDLLV+TAVLSLYSKLGSLVDARKLF+K+PE DRVVWNIMIA
Subjt:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA

Query:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI
        AYAREG+P ECL LF+SMARSGIR+D+FTALPVISSISQLK  DWGKQTHA ILRNGSDSQVSV+NSLIDMYCECN LDSACKIFN + +KTVISWSAMI
Subjt:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI

Query:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KG VKHG+ LIALSLF  MKSDGIQ+DFITVINI+PAFV IG LENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQR+FEEER+DDKDLIMWN
Subjt:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFKLYNQMKCSN+ PDQVTFLGLLTACVNSGLVEKGKEF KEM E Y CQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACKLHPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKWD VAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAEDIY ILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEAREKSLEKLGNPL
        NLEL+IKE +E S EKLG  L
Subjt:  NLELEIKEAREKSLEKLGNPL

XP_008444579.1 PREDICTED: pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucumis melo]0.0e+0087.1Show/hide
Query:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL
        MLHLQRSKP+IH+ I  NFPATQSRLLNTLS LF+RC+S QHL+QIHARF+LHGFHQNPTLSSKLIDCYANLGLL  SLQVF SI +PN T++NAILRNL
Subjt:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL

Query:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE
        TRYGE ER LLVY+QMVAKSMHPDEETYP + RSC SFSNVG GR IHGYLVKLGFDSFD+VATAL EMYE+ I FE+AHQLFDKRSVKDL   SS TTE
Subjt:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE

Query:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA
          QNGNGEGIF VF RMR EQLV DSLTF+NLLRFIAG NSIQLAKIVHCIAIVSKL GDLLV TAVLSLYSKL SLVDAR+LFDKMPE DRVVWNIMIA
Subjt:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA

Query:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI
        AYAREGKP ECL LFKSMARSGIRSD+FTALPVISSI+QLK  DWGKQTHA+ILRNGSDSQVSV+NSLIDMYCEC +LDSAC IFNWM DK+VISWSAMI
Subjt:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI

Query:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KGYVK+G SL A SLFS MKSDGIQ+DF+T+INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQR+FEEERIDDKDLIMWN
Subjt:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFKLYN+MKCSN+KPDQVTFLGLLTACVNSGL+EKGKEF KEMTE+YGC PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKW+EVAKMRSFLR+KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEAREKSLEKLGNPL
        NLELEIKE REKSL+ L NPL
Subjt:  NLELEIKEAREKSLEKLGNPL

XP_022139869.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Momordica charantia]0.0e+0084.96Show/hide
Query:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL
        MLHLQRSKP+     F NFPATQSR LNTLSFLF RCSSRQ L+QIHARF+LHG HQNP LS +LID YANLGLL LS QVF SI +P ST+Y+AILRNL
Subjt:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL

Query:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE
        + +GE ERTLLVYR+M AKSMHPDEETYPSVLRSCC  SNV  GRKIHG+LVKLG D +D  ATAL EMY +CI FE+ H LFDK  +KD ECW+S  +E
Subjt:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE

Query:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA
        A QNGNG+ IF +FGRMR EQLV+DSLTFINLLR I G NSIQLAKIVHC+AI S LCGDLLVNTAVLSLYSKLG LV+ARKLFDKMPE DRVVWNIMIA
Subjt:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA

Query:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI
        AY REG P ECL LFKSMARSGIR+D+FTALPVISSISQLK  DWGKQTHA+ LRNGSD+QVSV+NSLIDMYCE NILDSACKIF+WM +KTVISWSAMI
Subjt:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI

Query:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KG VKHG SL ALSLFS MKSDGIQ+DFITVINILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQR+FEEER+DDKDLIMWN
Subjt:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFK+YNQMKCSN++PDQVTFLGLLTACVNSGLVEKGKE  KEM ENYGCQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACKLHPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKWD VAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVAD+THPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEAREKSLEKLG
        NLELEIKEAREKS EKLG
Subjt:  NLELEIKEAREKSLEKLG

XP_038894029.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benincasa hispida]0.0e+00100Show/hide
Query:  MGTMTNFWNIDSFLNLKSQLRNQTNNVPQLSSHMLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLID
        MGTMTNFWNIDSFLNLKSQLRNQTNNVPQLSSHMLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLID
Subjt:  MGTMTNFWNIDSFLNLKSQLRNQTNNVPQLSSHMLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLID

Query:  CYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALG
        CYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALG
Subjt:  CYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALG

Query:  EMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAV
        EMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAV
Subjt:  EMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAV

Query:  LSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNS
        LSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNS
Subjt:  LSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNS

Query:  LIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNT
        LIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNT
Subjt:  LIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNT

Query:  ALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQ
        ALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQ
Subjt:  ALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQ

Query:  EHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKT
        EHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKT
Subjt:  EHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKT

Query:  PGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEKLGNPL
        PGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEKLGNPL
Subjt:  PGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEKLGNPL

TrEMBL top hitse value%identityAlignment
A0A0A0M0Z6 Uncharacterized protein0.0e+0088.07Show/hide
Query:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL
        MLHL RSKP+IHS IF NFPATQSRLLNTLS LF RC+S QHL+QIHARF+LHGFHQNPTLSSKLIDCYANLGLLN SLQVF S+ +PN T++NAILRNL
Subjt:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL

Query:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE
        TRYGE ERTLLVY+QMVAKSMHPDEETYP VLRSC SFSNVG GR IHGYLVKLGFD FD+VATAL EMYEECI+FE+AHQLFDKRSVKDL   SS TTE
Subjt:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE

Query:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA
         PQN NGEGIF VFGRM  EQLV DS TF NLLRFIAG NSIQLAKIVHCIAIVSKL GDLLVNTAVLSLYSKL SLVDARKLFDKMPE DRVVWNIMIA
Subjt:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA

Query:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI
        AYAREGKPTECL LFKSMARSGIRSD+FTALPVISSI+QLK  DWGKQTHA+ILRNGSDSQVSV+NSLIDMYCEC ILDSACKIFNWM DK+VISWSAMI
Subjt:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI

Query:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KGYVK+G SL ALSLFS MKSDGIQ+DF+ +INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQR+FEEE+IDDKDLIMWN
Subjt:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFKLYN+MKCSN+KPDQVTFLGLLTACVNSGLVEKGKEF KEMTE+YGCQPSQEHYACMVNLLGRAGLI+EAGELV+NMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKL++MEP+NAGNYILLSNIYAAAGKWD VAKMRSFLR+KGLKK PGCSWLEINGHVTEFRVADQTHPRA DIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEAREKSLEKLGNPL
        NLELEIKE REKS + L NPL
Subjt:  NLELEIKEAREKSLEKLGNPL

A0A1S3BBG7 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0087.1Show/hide
Query:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL
        MLHLQRSKP+IH+ I  NFPATQSRLLNTLS LF+RC+S QHL+QIHARF+LHGFHQNPTLSSKLIDCYANLGLL  SLQVF SI +PN T++NAILRNL
Subjt:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL

Query:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE
        TRYGE ER LLVY+QMVAKSMHPDEETYP + RSC SFSNVG GR IHGYLVKLGFDSFD+VATAL EMYE+ I FE+AHQLFDKRSVKDL   SS TTE
Subjt:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE

Query:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA
          QNGNGEGIF VF RMR EQLV DSLTF+NLLRFIAG NSIQLAKIVHCIAIVSKL GDLLV TAVLSLYSKL SLVDAR+LFDKMPE DRVVWNIMIA
Subjt:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA

Query:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI
        AYAREGKP ECL LFKSMARSGIRSD+FTALPVISSI+QLK  DWGKQTHA+ILRNGSDSQVSV+NSLIDMYCEC +LDSAC IFNWM DK+VISWSAMI
Subjt:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI

Query:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KGYVK+G SL A SLFS MKSDGIQ+DF+T+INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQR+FEEERIDDKDLIMWN
Subjt:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFKLYN+MKCSN+KPDQVTFLGLLTACVNSGL+EKGKEF KEMTE+YGC PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKW+EVAKMRSFLR+KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEAREKSLEKLGNPL
        NLELEIKE REKSL+ L NPL
Subjt:  NLELEIKEAREKSLEKLGNPL

A0A5D3DB69 Pentatricopeptide repeat-containing protein0.0e+0087.1Show/hide
Query:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL
        MLHLQRSKP+IH+ I  NFPATQSRLLNTLS LF+RC+S QHL+QIHARF+LHGFHQNPTLSSKLIDCYANLGLL  SLQVF SI +PN T++NAILRNL
Subjt:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL

Query:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE
        TRYGE ER LLVY+QMVAKSMHPDEETYP + RSC SFSNVG GR IHGYLVKLGFDSFD+VATAL EMYE+ I FE+AHQLFDKRSVKDL   SS TTE
Subjt:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE

Query:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA
          QNGNGEGIF VF RMR EQLV DSLTF+NLLRFIAG NSIQLAKIVHCIAIVSKL GDLLV TAVLSLYSKL SLVDAR+LFDKMPE DRVVWNIMIA
Subjt:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA

Query:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI
        AYAREGKP ECL LFKSMARSGIRSD+FTALPVISSI+QLK  DWGKQTHA+ILRNGSDSQVSV+NSLIDMYCEC +LDSAC IFNWM DK+VISWSAMI
Subjt:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI

Query:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KGYVK+G SL A SLFS MKSDGIQ+DF+T+INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQR+FEEERIDDKDLIMWN
Subjt:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFKLYN+MKCSN+KPDQVTFLGLLTACVNSGL+EKGKEF KEMTE+YGC PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKW+EVAKMRSFLR+KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEAREKSLEKLGNPL
        NLELEIKE REKSL+ L NPL
Subjt:  NLELEIKEAREKSLEKLGNPL

A0A6J1CE61 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0084.96Show/hide
Query:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL
        MLHLQRSKP+     F NFPATQSR LNTLSFLF RCSSRQ L+QIHARF+LHG HQNP LS +LID YANLGLL LS QVF SI +P ST+Y+AILRNL
Subjt:  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNL

Query:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE
        + +GE ERTLLVYR+M AKSMHPDEETYPSVLRSCC  SNV  GRKIHG+LVKLG D +D  ATAL EMY +CI FE+ H LFDK  +KD ECW+S  +E
Subjt:  TRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTE

Query:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA
        A QNGNG+ IF +FGRMR EQLV+DSLTFINLLR I G NSIQLAKIVHC+AI S LCGDLLVNTAVLSLYSKLG LV+ARKLFDKMPE DRVVWNIMIA
Subjt:  APQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA

Query:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI
        AY REG P ECL LFKSMARSGIR+D+FTALPVISSISQLK  DWGKQTHA+ LRNGSD+QVSV+NSLIDMYCE NILDSACKIF+WM +KTVISWSAMI
Subjt:  AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMI

Query:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KG VKHG SL ALSLFS MKSDGIQ+DFITVINILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQR+FEEER+DDKDLIMWN
Subjt:  KGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFK+YNQMKCSN++PDQVTFLGLLTACVNSGLVEKGKE  KEM ENYGCQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACKLHPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKWD VAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVAD+THPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEAREKSLEKLG
        NLELEIKEAREKS EKLG
Subjt:  NLELEIKEAREKSLEKLG

A0A6J1K3Q8 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0083.86Show/hide
Query:  MLHLQRSKPVIHSLI----FPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAI
        M HLQRSK +  S I    FPNFPATQSRLLNTLS LF RC SRQ L+QIHARFVLHGFHQNPTLS KLIDCYAN GLLN+S  VF SI +PNST+YNAI
Subjt:  MLHLQRSKPVIHSLI----FPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAI

Query:  LRNLTRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSS
        LRNLTR+GE ERTLLVYR+MVAKSMHPDE+TYP VL+SCC  SNV  G+ IHG L+KLG DS+D V T L EMY +CIDFE+AHQLFDK SVKDL+CWSS
Subjt:  LRNLTRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSS

Query:  FTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWN
          +EAPQNGNG+ I  + GRM+ E LVTDSLTFINLLR I+G +SIQLAKIVHCIAIVS LCGDLLV+TAVLSLYSKLGSLVDARKLF+KMPE DRVVWN
Subjt:  FTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWN

Query:  IMIAAYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISW
        IMIAAYAREG+P ECL LF+SMARSGIR+D+FTALPVISSISQLK  DWGKQTHA ILRNGSDSQVSV+NSLIDMYCECN L+SACKIFN + +KTVISW
Subjt:  IMIAAYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISW

Query:  SAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDL
        SAMIKG VKHG+ LIALSLF  MKSDGIQ+DFITVINI+PAFV IG LENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCIEMAQR+FEEER++DKDL
Subjt:  SAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK
        IMWNSMISAHANHGDWSQCFKLYNQMKCSN+ PDQVTFLGLLTACVNSGLVEKGKEF KEM E+Y CQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK

Query:  PDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLHPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKWD VAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIY

Query:  TILGNLELEIKEAREKSLEKLGNPL
         ILGNLEL+IKEA+E S EKLG  L
Subjt:  TILGNLELEIKEAREKSLEKLGNPL

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic1.9e-12033.53Show/hide
Query:  SFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYPS
        + L +RCSS + L+QI      +G +Q     +KL+  +   G ++ + +VF  I    + +Y+ +L+   +  + ++ L  + +M    + P    +  
Subjt:  SFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYPS

Query:  VLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVA-TALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTF
        +L+ C   + +  G++IHG LVK GF S D+ A T L  MY +C     A ++FD+   +DL  W++      QNG       +   M  E L    +T 
Subjt:  VLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVA-TALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTF

Query:  INLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFT
        +++L  ++    I + K +H  A+ S     + ++TA++ +Y+K GSL  AR+LFD M E + V WN MI AY +   P E + +F+ M   G++    +
Subjt:  INLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFT

Query:  ALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFI
         +  + + + L   + G+  H   +  G D  VSV NSLI MYC+C  +D+A  +F  ++ +T++SW+AMI G+ ++G  + AL+ FS M+S  ++ D  
Subjt:  ALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFI

Query:  TVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKP
        T ++++ A   + +  + K++HG  M+  L     + TAL+  YAKCG I +A+ IF  + + ++ +  WN+MI  +  HG      +L+ +M+    KP
Subjt:  TVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKP

Query:  DQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDME
        + VTFL +++AC +SGLVE G +    M ENY  + S +HY  MV+LLGRAG +NEA + +  MP+KP   V+G +L AC++H     AE AAE+L ++ 
Subjt:  DQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDME

Query:  PKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEA
        P + G ++LL+NIY AA  W++V ++R  +  +GL+KTPGCS +EI   V  F      HP ++ IY  L  L   IKEA
Subjt:  PKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEA

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic3.0e-11835.34Show/hide
Query:  LFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLID-CYANLGL--LNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYP
        L   C + Q L+ IHA+ +  G H      SKLI+ C  +     L  ++ VF +I EPN  I+N + R      +    L +Y  M++  + P+  T+P
Subjt:  LFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLID-CYANLGL--LNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYP

Query:  SVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTF
         VL+SC        G++IHG+++KLG D    V T+L  MY +    E AH++FDK   +D+  +                                   
Subjt:  SVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTF

Query:  INLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFT
                                           TA++  Y+  G + +A+KLFD++P  D V WN MI+ YA  G   E L LFK M ++ +R D  T
Subjt:  INLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFT

Query:  ALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFI
         + V+S+ +Q    + G+Q H +I  +G  S + + N+LID+Y +C  L++AC +F  +  K VISW+ +I GY        AL LF  M   G   + +
Subjt:  ALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFI

Query:  TVINILPAFVHIGVLENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNT
        T+++ILPA  H+G ++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    I  K L  WN+MI   A HG     F L+++M+    
Subjt:  TVINILPAFVHIGVLENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNT

Query:  KPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVD
        +PD +TF+GLL+AC +SG+++ G+   + MT++Y   P  EHY CM++LLG +GL  EA E++  M ++PD  +W  LL ACK+H   +L E  AE L+ 
Subjt:  KPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVD

Query:  MEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEA
        +EP+N G+Y+LLSNIYA+AG+W+EVAK R+ L DKG+KK PGCS +EI+  V EF + D+ HPR  +IY +L  +E+ +++A
Subjt:  MEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEA

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226905.9e-11432.31Show/hide
Query:  QSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYSITEPNST--IYNAILRNLTRYGECERTLLVYRQMV
        QS+           C +   LK  H      G   + +  +KL+     LG    L+ + +VF + +E   T  +YN+++R     G C   +L++ +M+
Subjt:  QSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYSITEPNST--IYNAILRNLTRYGECERTLLVYRQMV

Query:  AKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRM
           + PD+ T+P  L +C      G+G +IHG +VK+G+     V  +L   Y EC + +SA ++FD+ S +++  W+S      +    +    +F RM
Subjt:  AKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRM

Query:  -RVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFK
         R E++  +S+T + ++   A    ++  + V+     S +  + L+ +A++ +Y K  ++  A++LFD+   ++  + N M + Y R+G   E L +F 
Subjt:  -RVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFK

Query:  SMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGH--------
         M  SG+R D  + L  ISS SQL+   WGK  H Y+LRNG +S  ++ N+LIDMY +C+  D+A +IF+ M +KTV++W++++ GYV++G         
Subjt:  SMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGH--------

Query:  ---------------------SLI--ALSLFSSMKS-DGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQR
                             SL   A+ +F SM+S +G+ +D +T+++I  A  H+G L+  K+++ Y  K G+     L T L+  +++CG  E A  
Subjt:  ---------------------SLI--ALSLFSSMKS-DGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQR

Query:  IFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLIN
        IF    + ++D+  W + I A A  G+  +  +L++ M     KPD V F+G LTAC + GLV++GKE    M + +G  P   HY CMV+LLGRAGL+ 
Subjt:  IFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLIN

Query:  EAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRV
        EA +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA+AG+W+++AK+R  +++KGL+K PG S ++I G   EF  
Subjt:  EAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRV

Query:  ADQTHPRAEDIYTIL
         D++HP   +I  +L
Subjt:  ADQTHPRAEDIYTIL

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic6.9e-11533.29Show/hide
Query:  TLSFLFDRCSSRQHL---KQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDE
        TL  +   C+  + L   K++      +GF  +  L SKL   Y N G L  + +VF  +    +  +N ++  L + G+   ++ ++++M++  +  D 
Subjt:  TLSFLFDRCSSRQHL---KQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDE

Query:  ETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTD
         T+  V +S  S  +V  G ++HG+++K GF   + V  +L   Y +    +SA ++FD+ + +D+  W+S       NG  E    VF +M V  +  D
Subjt:  ETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTD

Query:  SLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLC---GDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSG
          T +++    A    I L + VH I +  K C    D   NT +L +YSK G L  A+ +F +M +   V +  MIA YAREG   E + LF+ M   G
Subjt:  SLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLC---GDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSG

Query:  IRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFS-SMKS
        I  D++T   V++  ++ +  D GK+ H +I  N     + V N+L+DMY +C  +  A  +F+ M+ K +ISW+ +I GY K+ ++  ALSLF+  ++ 
Subjt:  IRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFS-SMKS

Query:  DGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQ
             D  TV  +LPA   +   +  + +HGY M+ G  S   +  +L+  YAKCG + +A  +F++  I  KDL+ W  MI+ +  HG   +   L+NQ
Subjt:  DGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQ

Query:  MKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFA
        M+ +  + D+++F+ LL AC +SGLV++G  F   M      +P+ EHYAC+V++L R G + +A   + NMPI PDA +WG LL  C++H   KLAE  
Subjt:  MKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFA

Query:  AEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEKLGNPL
        AEK+ ++EP+N G Y+L++NIYA A KW++V ++R  +  +GL+K PGCSW+EI G V  F   D ++P  E       N+E  +++ R + +E+  +PL
Subjt:  AEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEKLGNPL

Q9SVP7 Pentatricopeptide repeat-containing protein At4g136502.2e-10830.83Show/hide
Query:  KQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGS
        +Q+H   +  GF  +  + + L+  Y +LG L  +  +F ++++ ++  YN ++  L++ G  E+ + ++++M    + PD  T  S++ +C +   +  
Subjt:  KQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGS

Query:  GRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQ
        G+++H Y  KLGF S + +  AL  +Y +C D E+A   F +  V+++  W+          +    F +F +M++E++V +  T+ ++L+       ++
Subjt:  GRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQ

Query:  LAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYF
        L + +H   I +    +  V + ++ +Y+KLG L  A  +  +    D V W  MIA Y +     + L  F+ M   GIRSD       +S+ + L+  
Subjt:  LAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYF

Query:  DWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGV
          G+Q HA    +G  S +   N+L+ +Y  C  ++ +   F   +    I+W+A++ G+ + G++  AL +F  M  +GI ++  T  + + A      
Subjt:  DWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGV

Query:  LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVN
        ++  K +H    K G  S   +  AL+  YAKCG I  A++ F E  +  K+ + WN++I+A++ HG  S+    ++QM  SN +P+ VT +G+L+AC +
Subjt:  LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVN

Query:  SGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIY
         GLV+KG  + + M   YG  P  EHY C+V++L RAGL++ A E ++ MPIKPDA VW  LLSAC +H   ++ EFAA  L+++EP+++  Y+LLSN+Y
Subjt:  SGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIY

Query:  AAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKE
        A + KWD     R  +++KG+KK PG SW+E+   +  F V DQ HP A++I+    +L     E
Subjt:  AAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKE

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-11935.34Show/hide
Query:  LFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLID-CYANLGL--LNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYP
        L   C + Q L+ IHA+ +  G H      SKLI+ C  +     L  ++ VF +I EPN  I+N + R      +    L +Y  M++  + P+  T+P
Subjt:  LFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLID-CYANLGL--LNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYP

Query:  SVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTF
         VL+SC        G++IHG+++KLG D    V T+L  MY +    E AH++FDK   +D+  +                                   
Subjt:  SVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTF

Query:  INLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFT
                                           TA++  Y+  G + +A+KLFD++P  D V WN MI+ YA  G   E L LFK M ++ +R D  T
Subjt:  INLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFT

Query:  ALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFI
         + V+S+ +Q    + G+Q H +I  +G  S + + N+LID+Y +C  L++AC +F  +  K VISW+ +I GY        AL LF  M   G   + +
Subjt:  ALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFI

Query:  TVINILPAFVHIGVLENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNT
        T+++ILPA  H+G ++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    I  K L  WN+MI   A HG     F L+++M+    
Subjt:  TVINILPAFVHIGVLENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNT

Query:  KPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVD
        +PD +TF+GLL+AC +SG+++ G+   + MT++Y   P  EHY CM++LLG +GL  EA E++  M ++PD  +W  LL ACK+H   +L E  AE L+ 
Subjt:  KPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVD

Query:  MEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEA
        +EP+N G+Y+LLSNIYA+AG+W+EVAK R+ L DKG+KK PGCS +EI+  V EF + D+ HPR  +IY +L  +E+ +++A
Subjt:  MEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEA

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-12133.53Show/hide
Query:  SFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYPS
        + L +RCSS + L+QI      +G +Q     +KL+  +   G ++ + +VF  I    + +Y+ +L+   +  + ++ L  + +M    + P    +  
Subjt:  SFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYPS

Query:  VLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVA-TALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTF
        +L+ C   + +  G++IHG LVK GF S D+ A T L  MY +C     A ++FD+   +DL  W++      QNG       +   M  E L    +T 
Subjt:  VLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVA-TALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTF

Query:  INLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFT
        +++L  ++    I + K +H  A+ S     + ++TA++ +Y+K GSL  AR+LFD M E + V WN MI AY +   P E + +F+ M   G++    +
Subjt:  INLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFT

Query:  ALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFI
         +  + + + L   + G+  H   +  G D  VSV NSLI MYC+C  +D+A  +F  ++ +T++SW+AMI G+ ++G  + AL+ FS M+S  ++ D  
Subjt:  ALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFI

Query:  TVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKP
        T ++++ A   + +  + K++HG  M+  L     + TAL+  YAKCG I +A+ IF  + + ++ +  WN+MI  +  HG      +L+ +M+    KP
Subjt:  TVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKP

Query:  DQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDME
        + VTFL +++AC +SGLVE G +    M ENY  + S +HY  MV+LLGRAG +NEA + +  MP+KP   V+G +L AC++H     AE AAE+L ++ 
Subjt:  DQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDME

Query:  PKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEA
        P + G ++LL+NIY AA  W++V ++R  +  +GL+KTPGCS +EI   V  F      HP ++ IY  L  L   IKEA
Subjt:  PKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEA

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)4.2e-11532.31Show/hide
Query:  QSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYSITEPNST--IYNAILRNLTRYGECERTLLVYRQMV
        QS+           C +   LK  H      G   + +  +KL+     LG    L+ + +VF + +E   T  +YN+++R     G C   +L++ +M+
Subjt:  QSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYSITEPNST--IYNAILRNLTRYGECERTLLVYRQMV

Query:  AKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRM
           + PD+ T+P  L +C      G+G +IHG +VK+G+     V  +L   Y EC + +SA ++FD+ S +++  W+S      +    +    +F RM
Subjt:  AKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRM

Query:  -RVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFK
         R E++  +S+T + ++   A    ++  + V+     S +  + L+ +A++ +Y K  ++  A++LFD+   ++  + N M + Y R+G   E L +F 
Subjt:  -RVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFK

Query:  SMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGH--------
         M  SG+R D  + L  ISS SQL+   WGK  H Y+LRNG +S  ++ N+LIDMY +C+  D+A +IF+ M +KTV++W++++ GYV++G         
Subjt:  SMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGH--------

Query:  ---------------------SLI--ALSLFSSMKS-DGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQR
                             SL   A+ +F SM+S +G+ +D +T+++I  A  H+G L+  K+++ Y  K G+     L T L+  +++CG  E A  
Subjt:  ---------------------SLI--ALSLFSSMKS-DGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQR

Query:  IFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLIN
        IF    + ++D+  W + I A A  G+  +  +L++ M     KPD V F+G LTAC + GLV++GKE    M + +G  P   HY CMV+LLGRAGL+ 
Subjt:  IFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLIN

Query:  EAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRV
        EA +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA+AG+W+++AK+R  +++KGL+K PG S ++I G   EF  
Subjt:  EAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRV

Query:  ADQTHPRAEDIYTIL
         D++HP   +I  +L
Subjt:  ADQTHPRAEDIYTIL

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification4.2e-11532.31Show/hide
Query:  QSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYSITEPNST--IYNAILRNLTRYGECERTLLVYRQMV
        QS+           C +   LK  H      G   + +  +KL+     LG    L+ + +VF + +E   T  +YN+++R     G C   +L++ +M+
Subjt:  QSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYSITEPNST--IYNAILRNLTRYGECERTLLVYRQMV

Query:  AKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRM
           + PD+ T+P  L +C      G+G +IHG +VK+G+     V  +L   Y EC + +SA ++FD+ S +++  W+S      +    +    +F RM
Subjt:  AKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRM

Query:  -RVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFK
         R E++  +S+T + ++   A    ++  + V+     S +  + L+ +A++ +Y K  ++  A++LFD+   ++  + N M + Y R+G   E L +F 
Subjt:  -RVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFK

Query:  SMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGH--------
         M  SG+R D  + L  ISS SQL+   WGK  H Y+LRNG +S  ++ N+LIDMY +C+  D+A +IF+ M +KTV++W++++ GYV++G         
Subjt:  SMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGH--------

Query:  ---------------------SLI--ALSLFSSMKS-DGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQR
                             SL   A+ +F SM+S +G+ +D +T+++I  A  H+G L+  K+++ Y  K G+     L T L+  +++CG  E A  
Subjt:  ---------------------SLI--ALSLFSSMKS-DGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQR

Query:  IFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLIN
        IF    + ++D+  W + I A A  G+  +  +L++ M     KPD V F+G LTAC + GLV++GKE    M + +G  P   HY CMV+LLGRAGL+ 
Subjt:  IFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLIN

Query:  EAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRV
        EA +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA+AG+W+++AK+R  +++KGL+K PG S ++I G   EF  
Subjt:  EAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRV

Query:  ADQTHPRAEDIYTIL
         D++HP   +I  +L
Subjt:  ADQTHPRAEDIYTIL

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein4.9e-11633.29Show/hide
Query:  TLSFLFDRCSSRQHL---KQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDE
        TL  +   C+  + L   K++      +GF  +  L SKL   Y N G L  + +VF  +    +  +N ++  L + G+   ++ ++++M++  +  D 
Subjt:  TLSFLFDRCSSRQHL---KQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDE

Query:  ETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTD
         T+  V +S  S  +V  G ++HG+++K GF   + V  +L   Y +    +SA ++FD+ + +D+  W+S       NG  E    VF +M V  +  D
Subjt:  ETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTD

Query:  SLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLC---GDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSG
          T +++    A    I L + VH I +  K C    D   NT +L +YSK G L  A+ +F +M +   V +  MIA YAREG   E + LF+ M   G
Subjt:  SLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLC---GDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSG

Query:  IRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFS-SMKS
        I  D++T   V++  ++ +  D GK+ H +I  N     + V N+L+DMY +C  +  A  +F+ M+ K +ISW+ +I GY K+ ++  ALSLF+  ++ 
Subjt:  IRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFS-SMKS

Query:  DGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQ
             D  TV  +LPA   +   +  + +HGY M+ G  S   +  +L+  YAKCG + +A  +F++  I  KDL+ W  MI+ +  HG   +   L+NQ
Subjt:  DGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQ

Query:  MKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFA
        M+ +  + D+++F+ LL AC +SGLV++G  F   M      +P+ EHYAC+V++L R G + +A   + NMPI PDA +WG LL  C++H   KLAE  
Subjt:  MKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFA

Query:  AEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEKLGNPL
        AEK+ ++EP+N G Y+L++NIYA A KW++V ++R  +  +GL+K PGCSW+EI G V  F   D ++P  E       N+E  +++ R + +E+  +PL
Subjt:  AEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEKLGNPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGACGATGACGAATTTCTGGAACATTGACTCATTCTTGAATTTGAAAAGCCAACTCCGAAACCAAACAAACAATGTGCCGCAGCTTTCGTCCCACATGCTT
CACCTTCAACGATCAAAACCCGTTATTCATAGTCTCATTTTCCCCAACTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCTTCCTCTTCGATCGATGC
AGCTCCCGTCAACACCTCAAGCAGATTCATGCCAGGTTCGTCCTCCATGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATTGACTGCTATGCCAATCTT
GGACTCCTCAATCTCTCCCTCCAAGTTTTCTACTCTATAACCGAACCCAATTCCACTATTTACAACGCCATACTGAGAAATTTGACAAGATATGGAGAATGTGAG
CGGACCTTGTTGGTGTACCGACAAATGGTCGCCAAGTCTATGCACCCAGATGAAGAAACTTACCCTTCTGTTTTGCGATCATGTTGTTCTTTTTCAAATGTCGGA
TCTGGGAGGAAGATTCATGGGTATTTGGTTAAACTGGGTTTTGATTCGTTTGATATGGTAGCTACTGCTTTGGGTGAGATGTACGAGGAATGCATTGATTTTGAG
AGTGCTCATCAACTGTTTGATAAAAGGTCTGTGAAGGATTTGGAATGCTGGAGTTCCTTCACTACGGAAGCTCCTCAAAATGGGAATGGGGAGGGAATTTTTGGG
GTTTTTGGGAGGATGAGAGTAGAACAATTAGTAACAGATTCACTCACATTCATCAATCTCTTGCGGTTCATTGCAGGTTTCAATTCAATTCAACTCGCAAAGATT
GTTCATTGTATTGCAATTGTGAGCAAATTGTGTGGAGATTTGTTAGTAAATACTGCTGTGTTGTCTCTTTACTCAAAGTTAGGTAGCTTAGTAGATGCTAGAAAG
TTATTTGACAAAATGCCTGAGAACGACCGTGTTGTATGGAATATAATGATAGCAGCTTATGCTCGGGAAGGGAAACCGACAGAATGTCTCGCGCTATTTAAGTCC
ATGGCAAGATCAGGGATTAGATCTGATATGTTTACTGCACTTCCTGTTATCTCTTCAATTTCACAGTTGAAATATTTTGACTGGGGCAAACAAACTCATGCATAT
ATATTGAGGAATGGTTCCGACAGTCAAGTTTCAGTTTATAACTCTCTCATCGACATGTACTGCGAATGTAACATTTTAGATTCAGCTTGTAAGATCTTCAATTGG
ATGAAAGACAAGACTGTAATTTCATGGAGTGCTATGATCAAGGGGTATGTCAAACATGGTCACTCCCTCATTGCATTGTCTCTCTTCTCCAGTATGAAATCTGAT
GGTATTCAATCTGACTTTATTACAGTGATCAATATCTTGCCTGCATTTGTTCACATAGGAGTACTTGAAAATGTCAAATATTTACATGGATACTCCATGAAGCTA
GGCCTGACTTCCCTTCCATCCCTTAACACAGCCCTCCTAATCACCTATGCAAAATGTGGGTGTATAGAGATGGCCCAGAGGATATTTGAGGAAGAGAGAATTGAT
GATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAACCATGGAGACTGGTCCCAATGTTTTAAGCTATACAATCAAATGAAGTGCTCAAATACA
AAGCCAGACCAAGTAACATTTCTGGGATTACTAACAGCTTGTGTCAATTCTGGTCTCGTAGAAAAGGGGAAAGAGTTTTTGAAGGAGATGACTGAAAATTATGGC
TGCCAACCAAGTCAAGAGCATTATGCTTGTATGGTTAACCTCTTAGGGAGAGCTGGACTTATCAATGAAGCTGGAGAACTTGTAAGAAACATGCCCATCAAACCC
GATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGTTGCATCCCGGTTCCAAGCTTGCAGAGTTTGCGGCCGAGAAGCTCGTTGATATGGAGCCTAAAAAT
GCAGGGAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGGAAATGGGATGAAGTTGCAAAAATGAGAAGTTTCCTAAGGGATAAAGGGCTCAAGAAAACT
CCTGGTTGTAGTTGGTTGGAGATAAATGGGCATGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATACCATCCTAGGAAACCTT
GAACTTGAAATCAAAGAGGCTAGAGAAAAGAGTCTAGAGAAATTGGGAAATCCTCTATAA
mRNA sequenceShow/hide mRNA sequence
GAAAAGCCGTGGGTTATTAAAACCATATTTTTTTTACTTCAATCAAAATAAGAGCAATAAAGTGGACTAGAGTGAGAACGAGGGAGGGAAGAATGTTTGAAAGTT
TTGTTTCGAAAAAATTGGTCAACCCATAAAAGTCTCCGTACAAAAGATTCTTCTATTTTGAATGGGGACGATGACGAATTTCTGGAACATTGACTCATTCTTGAA
TTTGAAAAGCCAACTCCGAAACCAAACAAACAATGTGCCGCAGCTTTCGTCCCACATGCTTCACCTTCAACGATCAAAACCCGTTATTCATAGTCTCATTTTCCC
CAACTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCTTCCTCTTCGATCGATGCAGCTCCCGTCAACACCTCAAGCAGATTCATGCCAGGTTCGTCCT
CCATGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATTGACTGCTATGCCAATCTTGGACTCCTCAATCTCTCCCTCCAAGTTTTCTACTCTATAACCGA
ACCCAATTCCACTATTTACAACGCCATACTGAGAAATTTGACAAGATATGGAGAATGTGAGCGGACCTTGTTGGTGTACCGACAAATGGTCGCCAAGTCTATGCA
CCCAGATGAAGAAACTTACCCTTCTGTTTTGCGATCATGTTGTTCTTTTTCAAATGTCGGATCTGGGAGGAAGATTCATGGGTATTTGGTTAAACTGGGTTTTGA
TTCGTTTGATATGGTAGCTACTGCTTTGGGTGAGATGTACGAGGAATGCATTGATTTTGAGAGTGCTCATCAACTGTTTGATAAAAGGTCTGTGAAGGATTTGGA
ATGCTGGAGTTCCTTCACTACGGAAGCTCCTCAAAATGGGAATGGGGAGGGAATTTTTGGGGTTTTTGGGAGGATGAGAGTAGAACAATTAGTAACAGATTCACT
CACATTCATCAATCTCTTGCGGTTCATTGCAGGTTTCAATTCAATTCAACTCGCAAAGATTGTTCATTGTATTGCAATTGTGAGCAAATTGTGTGGAGATTTGTT
AGTAAATACTGCTGTGTTGTCTCTTTACTCAAAGTTAGGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGCCTGAGAACGACCGTGTTGTATGGAATAT
AATGATAGCAGCTTATGCTCGGGAAGGGAAACCGACAGAATGTCTCGCGCTATTTAAGTCCATGGCAAGATCAGGGATTAGATCTGATATGTTTACTGCACTTCC
TGTTATCTCTTCAATTTCACAGTTGAAATATTTTGACTGGGGCAAACAAACTCATGCATATATATTGAGGAATGGTTCCGACAGTCAAGTTTCAGTTTATAACTC
TCTCATCGACATGTACTGCGAATGTAACATTTTAGATTCAGCTTGTAAGATCTTCAATTGGATGAAAGACAAGACTGTAATTTCATGGAGTGCTATGATCAAGGG
GTATGTCAAACATGGTCACTCCCTCATTGCATTGTCTCTCTTCTCCAGTATGAAATCTGATGGTATTCAATCTGACTTTATTACAGTGATCAATATCTTGCCTGC
ATTTGTTCACATAGGAGTACTTGAAAATGTCAAATATTTACATGGATACTCCATGAAGCTAGGCCTGACTTCCCTTCCATCCCTTAACACAGCCCTCCTAATCAC
CTATGCAAAATGTGGGTGTATAGAGATGGCCCAGAGGATATTTGAGGAAGAGAGAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGC
CAACCATGGAGACTGGTCCCAATGTTTTAAGCTATACAATCAAATGAAGTGCTCAAATACAAAGCCAGACCAAGTAACATTTCTGGGATTACTAACAGCTTGTGT
CAATTCTGGTCTCGTAGAAAAGGGGAAAGAGTTTTTGAAGGAGATGACTGAAAATTATGGCTGCCAACCAAGTCAAGAGCATTATGCTTGTATGGTTAACCTCTT
AGGGAGAGCTGGACTTATCAATGAAGCTGGAGAACTTGTAAGAAACATGCCCATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGTTGCA
TCCCGGTTCCAAGCTTGCAGAGTTTGCGGCCGAGAAGCTCGTTGATATGGAGCCTAAAAATGCAGGGAATTACATACTGCTTTCGAACATATATGCTGCTGCAGG
GAAATGGGATGAAGTTGCAAAAATGAGAAGTTTCCTAAGGGATAAAGGGCTCAAGAAAACTCCTGGTTGTAGTTGGTTGGAGATAAATGGGCATGTAACTGAGTT
TCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATACCATCCTAGGAAACCTTGAACTTGAAATCAAAGAGGCTAGAGAAAAGAGTCTAGAGAAATT
GGGAAATCCTCTATAACACTTGCATTTTTTTTTAAAAAATTATTATTATTATTATTATTATTATTATCTTCTCATGTGATAGGTTTTACTTATTCCTTTATTTAC
ATCTTCACACATTACATTGTTTACAGACTTCTTGGTCGCTAATACATTTGTATGTCAGTTGAAATATGCTCACTTTGACTTTACATGATTTATTCTAATGTAGAA
CAATATAAATGATCTCATTTCATCAGTATTGATATTGTTATTTAATTTCAAGT
Protein sequenceShow/hide protein sequence
MGTMTNFWNIDSFLNLKSQLRNQTNNVPQLSSHMLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPTLSSKLIDCYANL
GLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFE
SAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARK
LFDKMPENDRVVWNIMIAAYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDSQVSVYNSLIDMYCECNILDSACKIFNW
MKDKTVISWSAMIKGYVKHGHSLIALSLFSSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERID
DKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKP
DARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNL
ELEIKEAREKSLEKLGNPL