; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021379 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021379
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr05:8361555..8363717
RNA-Seq ExpressionHG10021379
SyntenyHG10021379
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573373.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.08Show/hide
Query:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL
        M HLQRSKPI +   FPNFPATQSRLLNTLS LF+RC SRQ LQQIHARFVLHGFHQNPTLS KLIDCYAN GLLNLS  VF +IIDPNS LYNAILRNL
Subjt:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL

Query:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE
        TR+GE ERTLLVY++MVAKSMHPDE+TYP VLRSCC  SNV FG+ +HG L+KLG DS+D V T L EMYE+CIDFE+AHQLFDK SVKDL+CWSSL TE
Subjt:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE

Query:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA
         PQNGNG+ I RLFGRM++E LV DSLTFINLLRS++GL+SI+LAKIVH I IVS LCGDLLV+TA+LSLYSKLGSLVDARKLF+K+PEKDRVVWNIMIA
Subjt:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI
        AYAR G+P ECLELF+SMARSGIR+D+FTALPVISSISQLK  DWGKQTHA++LRNGSDSQVSVHNSLIDMY ECN LDSACKIFN +T+K+VISWSAMI
Subjt:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI

Query:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KG VKHG  LIALSLF RMKSDGIQADFITVINI+PAFV IG LENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQR+FEEER+DDKDLIMWN
Subjt:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHG+WSQCF LYNQMKCSN+ PDQVTFLGLLTACVNSGLVE+GKEFFKEM E+Y CQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
Subjt:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAEDIY ILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEVREKSIDKL
        NLEL+IKE +E S +KL
Subjt:  NLELEIKEVREKSIDKL

XP_008444579.1 PREDICTED: pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucumis melo]0.0e+0088.63Show/hide
Query:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL
        MLHLQRSKPII +PI  NFPATQSRLLNTLS LFNRC+S QHLQQIHARF+LHGFHQNPTLSSKLIDCYANLGLL  SLQVF +IIDPN TL+NAILRNL
Subjt:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL

Query:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE
        TRYGE ER LLVYQQMVAKSMHPDEETYP + RSC SFSNVGFGR +HGYLVKLGFDSFD+VATALAEMYE+ I FE+AHQLFDKRSVKDL   SSLTTE
Subjt:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE

Query:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA
          QNGNGEGIFR+F RMRAEQLVPDSLTF+NLLR IAGLNSI+LAKIVH I IVSKL GDLLV TA+LSLYSKL SLVDAR+LFDKMPEKDRVVWNIMIA
Subjt:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI
        AYAR GKP ECLELFKSMARSGIRSD+FTALPVISSI+QLKCVDWGKQTHAH+LRNGSDSQVSVHNSLIDMY EC +LDSAC IFN MTDKSVISWSAMI
Subjt:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI

Query:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KGYVK+GQSL A SLFS+MKSDGIQADF+T+INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQR+FEEERIDDKDLIMWN
Subjt:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHG+WSQCFKLYN+MKCSN+KPDQVTFLGLLTACVNSGL+E+GKEFFKEMTE+Y C PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Subjt:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR+KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEVREKSIDKL-NPL
        NLELEIKEVREKS+D L NPL
Subjt:  NLELEIKEVREKSIDKL-NPL

XP_022139869.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Momordica charantia]0.0e+0085.42Show/hide
Query:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL
        MLHLQRSKPI +   F NFPATQSR LNTLSFLF+RCSSRQ L+QIHARF+LHG HQNP LS +LID YANLGLL LS QVF +IIDP STLY+AILRNL
Subjt:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL

Query:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE
        + +GE ERTLLVY++M AKSMHPDEETYPSVLRSCC  SNV +GRK+HG+LVKLG D +D  ATALAEMY +CI FE+ H LFDK  +KD ECW+SL +E
Subjt:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE

Query:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA
          QNGNG+ IF+LFGRMR EQLV DSLTFINLLRSI GLNSI+LAKIVH + I S LCGDLLVNTA+LSLYSKLG LV+ARKLFDKMPEKDRVVWNIMIA
Subjt:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI
        AY R G P ECLELFKSMARSGIR+D+FTALPVISSISQLKCVDWGKQTHAH LRNGSD+QVSVHNSLIDMY E NILDSACKIF+ MT+K+VISWSAMI
Subjt:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI

Query:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KG VKHGQSL ALSLFSRMKSDGIQADFITVINILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQR+FEEER+DDKDLIMWN
Subjt:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHG+WSQCFK+YNQMKCSN++PDQVTFLGLLTACVNSGLVE+GKE FKEM ENY CQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIKPDAR
Subjt:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVAD+THPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEVREKSIDKLNPL
        NLELEIKE REKS +KL  L
Subjt:  NLELEIKEVREKSIDKLNPL

XP_023541395.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0084.25Show/hide
Query:  MLHLQRSKPIIQSPI----FPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAI
        M HLQRSKPI QSPI    FPNFPATQSRL NTLS LF+RC SRQ LQQIHARFVLHGFHQNPTLS KLIDCYAN GLLNLS  VF +IIDPNSTLYNAI
Subjt:  MLHLQRSKPIIQSPI----FPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAI

Query:  LRNLTRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSS
        LRNLTR+GE ERTLL+Y++MV KSMHPDE+TYP VLRSCC  S+V FG+ +HG L+KLG DS+D V T LAEMYE+CIDFE+AHQLFDK SVKDL+CWSS
Subjt:  LRNLTRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSS

Query:  LTTETPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWN
        L +E PQNGNG+ I  LFGRM++E +V DSLTFIN LRS++GL+SI+LAKIVH I IVS LCGDLLV+TA+LSLYSKLGSLVDARKLF+KMPEKDRVVWN
Subjt:  LTTETPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWN

Query:  IMIAAYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISW
        IMIAAYAR G+P ECLELF+SMARSGIR+D+FTALPVISSISQLK  DWGKQTHA++LRNGSDSQVSVHNSLIDMY ECN LDSACKIFN +T+K+VISW
Subjt:  IMIAAYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISW

Query:  SAMIKGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDL
        SAMIKG VKHG  LIALSLF RMKSDGIQADFITVINI+PAFV IG LENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQR+FEEER+DDKDL
Subjt:  SAMIKGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDL

Query:  IMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK
        IMWNSMISAHANHG+WSQCFKLY+QMKCSN+ PDQVTFLGLLTACVNSGLVE+GKEFFKEM E+Y CQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK
Subjt:  IMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK

Query:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIY

Query:  TILGNLELEIKEVREKSIDKLNPL
         ILGNLEL+IKE +E S +KL  L
Subjt:  TILGNLELEIKEVREKSIDKLNPL

XP_038894029.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benincasa hispida]0.0e+0092.93Show/hide
Query:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL
        MLHLQRSKP+I S IFPNFPATQSRLLNTLSFLF+RCSSRQHL+QIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFY+I +PNST+YNAILRNL
Subjt:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL

Query:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE
        TRYGECERTLLVY+QMVAKSMHPDEETYPSVLRSCCSFSNVG GRK+HGYLVKLGFDSFDMVATAL EMYEECIDFE AHQLFDKRSVKDLECWSS TTE
Subjt:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE

Query:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA
         PQNGNGEGIF +FGRMR EQLV DSLTFINLLR IAG NSI+LAKIVH I IVSKLCGDLLVNTA+LSLYSKLGSLVDARKLFDKMPE DRVVWNIMIA
Subjt:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI
        AYAR GKPTECL LFKSMARSGIRSDMFTALPVISSISQLK  DWGKQTHA++LRNGSDSQVSV+NSLIDMY ECNILDSACKIFN M DK+VISWSAMI
Subjt:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI

Query:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KGYVKHG SLIALSLFS MKSDGIQ+DFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
Subjt:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHG+WSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVE+GKEF KEMTENY CQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
Subjt:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACKLHPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKWD VAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEVREKSIDKL-NPL
        NLELEIKE REKS++KL NPL
Subjt:  NLELEIKEVREKSIDKL-NPL

TrEMBL top hitse value%identityAlignment
A0A0A0M0Z6 Uncharacterized protein0.0e+0089.88Show/hide
Query:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL
        MLHL RSKPII SPIF NFPATQSRLLNTLS LF+RC+S QHLQQIHARF+LHGFHQNPTLSSKLIDCYANLGLLN SLQVF ++IDPN TL+NAILRNL
Subjt:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL

Query:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE
        TRYGE ERTLLVYQQMVAKSMHPDEETYP VLRSC SFSNVGFGR +HGYLVKLGFD FD+VATALAEMYEECI+FE+AHQLFDKRSVKDL   SSLTTE
Subjt:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE

Query:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA
         PQN NGEGIFR+FGRM AEQLVPDS TF NLLR IAGLNSI+LAKIVH I IVSKL GDLLVNTA+LSLYSKL SLVDARKLFDKMPEKDRVVWNIMIA
Subjt:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI
        AYAR GKPTECLELFKSMARSGIRSD+FTALPVISSI+QLKCVDWGKQTHAH+LRNGSDSQVSVHNSLIDMY EC ILDSACKIFN MTDKSVISWSAMI
Subjt:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI

Query:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KGYVK+GQSL ALSLFS+MKSDGIQADF+ +INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQR+FEEE+IDDKDLIMWN
Subjt:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHG+WSQCFKLYN+MKCSN+KPDQVTFLGLLTACVNSGLVE+GKEFFKEMTE+Y CQPSQEHYACMVNLLGRAGLI+EAGELV+NMPIKPDAR
Subjt:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMRSFLR+KGLKK PGCSWLEINGHVTEFRVADQTHPRA DIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEVREKSIDKL-NPL
        NLELEIKEVREKS D L NPL
Subjt:  NLELEIKEVREKSIDKL-NPL

A0A1S3BBG7 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0088.63Show/hide
Query:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL
        MLHLQRSKPII +PI  NFPATQSRLLNTLS LFNRC+S QHLQQIHARF+LHGFHQNPTLSSKLIDCYANLGLL  SLQVF +IIDPN TL+NAILRNL
Subjt:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL

Query:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE
        TRYGE ER LLVYQQMVAKSMHPDEETYP + RSC SFSNVGFGR +HGYLVKLGFDSFD+VATALAEMYE+ I FE+AHQLFDKRSVKDL   SSLTTE
Subjt:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE

Query:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA
          QNGNGEGIFR+F RMRAEQLVPDSLTF+NLLR IAGLNSI+LAKIVH I IVSKL GDLLV TA+LSLYSKL SLVDAR+LFDKMPEKDRVVWNIMIA
Subjt:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI
        AYAR GKP ECLELFKSMARSGIRSD+FTALPVISSI+QLKCVDWGKQTHAH+LRNGSDSQVSVHNSLIDMY EC +LDSAC IFN MTDKSVISWSAMI
Subjt:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI

Query:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KGYVK+GQSL A SLFS+MKSDGIQADF+T+INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQR+FEEERIDDKDLIMWN
Subjt:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHG+WSQCFKLYN+MKCSN+KPDQVTFLGLLTACVNSGL+E+GKEFFKEMTE+Y C PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Subjt:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR+KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEVREKSIDKL-NPL
        NLELEIKEVREKS+D L NPL
Subjt:  NLELEIKEVREKSIDKL-NPL

A0A5D3DB69 Pentatricopeptide repeat-containing protein0.0e+0088.63Show/hide
Query:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL
        MLHLQRSKPII +PI  NFPATQSRLLNTLS LFNRC+S QHLQQIHARF+LHGFHQNPTLSSKLIDCYANLGLL  SLQVF +IIDPN TL+NAILRNL
Subjt:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL

Query:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE
        TRYGE ER LLVYQQMVAKSMHPDEETYP + RSC SFSNVGFGR +HGYLVKLGFDSFD+VATALAEMYE+ I FE+AHQLFDKRSVKDL   SSLTTE
Subjt:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE

Query:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA
          QNGNGEGIFR+F RMRAEQLVPDSLTF+NLLR IAGLNSI+LAKIVH I IVSKL GDLLV TA+LSLYSKL SLVDAR+LFDKMPEKDRVVWNIMIA
Subjt:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI
        AYAR GKP ECLELFKSMARSGIRSD+FTALPVISSI+QLKCVDWGKQTHAH+LRNGSDSQVSVHNSLIDMY EC +LDSAC IFN MTDKSVISWSAMI
Subjt:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI

Query:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KGYVK+GQSL A SLFS+MKSDGIQADF+T+INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQR+FEEERIDDKDLIMWN
Subjt:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHG+WSQCFKLYN+MKCSN+KPDQVTFLGLLTACVNSGL+E+GKEFFKEMTE+Y C PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Subjt:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR+KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEVREKSIDKL-NPL
        NLELEIKEVREKS+D L NPL
Subjt:  NLELEIKEVREKSIDKL-NPL

A0A6J1CE61 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0085.42Show/hide
Query:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL
        MLHLQRSKPI +   F NFPATQSR LNTLSFLF+RCSSRQ L+QIHARF+LHG HQNP LS +LID YANLGLL LS QVF +IIDP STLY+AILRNL
Subjt:  MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNL

Query:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE
        + +GE ERTLLVY++M AKSMHPDEETYPSVLRSCC  SNV +GRK+HG+LVKLG D +D  ATALAEMY +CI FE+ H LFDK  +KD ECW+SL +E
Subjt:  TRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTE

Query:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA
          QNGNG+ IF+LFGRMR EQLV DSLTFINLLRSI GLNSI+LAKIVH + I S LCGDLLVNTA+LSLYSKLG LV+ARKLFDKMPEKDRVVWNIMIA
Subjt:  TPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIA

Query:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI
        AY R G P ECLELFKSMARSGIR+D+FTALPVISSISQLKCVDWGKQTHAH LRNGSD+QVSVHNSLIDMY E NILDSACKIF+ MT+K+VISWSAMI
Subjt:  AYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMI

Query:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN
        KG VKHGQSL ALSLFSRMKSDGIQADFITVINILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQR+FEEER+DDKDLIMWN
Subjt:  KGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWN

Query:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
        SMISAHANHG+WSQCFK+YNQMKCSN++PDQVTFLGLLTACVNSGLVE+GKE FKEM ENY CQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIKPDAR
Subjt:  SMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG
        VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVAD+THPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILG

Query:  NLELEIKEVREKSIDKLNPL
        NLELEIKE REKS +KL  L
Subjt:  NLELEIKEVREKSIDKLNPL

A0A6J1K3Q8 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0084.39Show/hide
Query:  MLHLQRSKPIIQSPI----FPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAI
        M HLQRSK I QSPI    FPNFPATQSRLLNTLS LF+RC SRQ L+QIHARFVLHGFHQNPTLS KLIDCYAN GLLN+S  VF +IIDPNSTLYNAI
Subjt:  MLHLQRSKPIIQSPI----FPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAI

Query:  LRNLTRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSS
        LRNLTR+GE ERTLLVY++MVAKSMHPDE+TYP VL+SCC  SNV FG+ +HG L+KLG DS+D V T LAEMY +CIDFE+AHQLFDK SVKDL+CWSS
Subjt:  LRNLTRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSS

Query:  LTTETPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWN
        L +E PQNGNG+ I  L GRM++E LV DSLTFINLLRSI+GL+SI+LAKIVH I IVS LCGDLLV+TA+LSLYSKLGSLVDARKLF+KMPEKDRVVWN
Subjt:  LTTETPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWN

Query:  IMIAAYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISW
        IMIAAYAR G+P ECLELF+SMARSGIR+D+FTALPVISSISQLKC DWGKQTHA++LRNGSDSQVSVHNSLIDMY ECN L+SACKIFN +T+K+VISW
Subjt:  IMIAAYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISW

Query:  SAMIKGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDL
        SAMIKG VKHG  LIALSLF  MKSDGIQADFITVINI+PAFV IG LENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCIEMAQR+FEEER++DKDL
Subjt:  SAMIKGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDL

Query:  IMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK
        IMWNSMISAHANHG+WSQCFKLYNQMKCSN+ PDQVTFLGLLTACVNSGLVE+GKEFFKEM E+Y CQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK
Subjt:  IMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK

Query:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIY

Query:  TILGNLELEIKEVREKSIDKLNPL
         ILGNLEL+IKE +E S +KL  L
Subjt:  TILGNLELEIKEVREKSIDKLNPL

SwissProt top hitse value%identityAlignment
O81767 Pentatricopeptide repeat-containing protein At4g339902.8e-11333.48Show/hide
Query:  QSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTLLVYQQ-MVAKSM
        +S+ ++ +  LF  C++ Q  + +HAR V+    QN  +S+KL++ Y  LG + L+   F  I + +   +N ++    R G     +  +   M++  +
Subjt:  QSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTLLVYQQ-MVAKSM

Query:  HPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAEQ
         PD  T+PSVL++C     V  G K+H   +K GF     VA +L  +Y       +A  LFD+  V+D+  W+++ +   Q+GN +    L   +RA  
Subjt:  HPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAEQ

Query:  LVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARS
           DS+T ++LL +            +HS +I   L  +L V+  L+ LY++ G L D +K+FD+M  +D + WN +I AY    +P   + LF+ M  S
Subjt:  LVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARS

Query:  GIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNG-SDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFSRMK
         I+ D  T + + S +SQL  +   +      LR G     +++ N+++ MY +  ++DSA  +FN + +  VISW+ +I GY ++G +  A+ +++ M+
Subjt:  GIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNG-SDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFSRMK

Query:  SDG-IQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLY
         +G I A+  T +++LPA    G L     LHG  +K GL     + T+L   Y KCG +E A  +F +  I   + + WN++I+ H  HG   +   L+
Subjt:  SDG-IQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLY

Query:  NQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAE
         +M     KPD +TF+ LL+AC +SGLV+ G+  F+ M  +Y   PS +HY CMV++ GRAG +  A + +++M ++PDA +WG LLSAC++H    L +
Subjt:  NQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAE

Query:  FAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEV
         A+E L ++EP++ G ++LLSN+YA+AGKW+GV ++RS    KGL+KTPG S +E++  V  F   +QTHP  E++Y  L  L+ ++K +
Subjt:  FAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEV

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic6.2e-12132.87Show/hide
Query:  SKPIIQSPIFPNFPATQSRLLNTLS---------------FLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNST
        S  ++Q    P  P   SR  + LS                L  RCSS + L+QI      +G +Q     +KL+  +   G ++ + +VF  I    + 
Subjt:  SKPIIQSPIFPNFPATQSRLLNTLS---------------FLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNST

Query:  LYNAILRNLTRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVA-TALAEMYEECIDFEHAHQLFDKRSVKD
        LY+ +L+   +  + ++ L  + +M    + P    +  +L+ C   + +  G+++HG LVK GF S D+ A T L  MY +C     A ++FD+   +D
Subjt:  LYNAILRNLTRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVA-TALAEMYEECIDFEHAHQLFDKRSVKD

Query:  LECWSSLTTETPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEK
        L  W+++     QNG       +   M  E L P  +T +++L +++ L  I + K +H   + S     + ++TAL+ +Y+K GSL  AR+LFD M E+
Subjt:  LECWSSLTTETPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEK

Query:  DRVVWNIMIAAYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTD
        + V WN MI AY +   P E + +F+ M   G++    + +  + + + L  ++ G+  H   +  G D  VSV NSLI MY +C  +D+A  +F  +  
Subjt:  DRVVWNIMIAAYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTD

Query:  KSVISWSAMIKGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEER
        ++++SW+AMI G+ ++G+ + AL+ FS+M+S  ++ D  T ++++ A   + +  + K++HG  M+  L     + TAL+  YAKCG I +A+ IF  + 
Subjt:  KSVISWSAMIKGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEER

Query:  IDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELV
        + ++ +  WN+MI  +  HG      +L+ +M+    KP+ VTFL +++AC +SGLVE G + F  M ENY  + S +HY  MV+LLGRAG +NEA + +
Subjt:  IDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELV

Query:  RNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHP
          MP+KP   V+G +L AC++H     AE AAE+L ++ P + G ++LL+NIY AA  W+ V ++R  +  +GL+KTPGCS +EI   V  F      HP
Subjt:  RNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHP

Query:  RAEDIYTILGNLELEIKE
         ++ IY  L  L   IKE
Subjt:  RAEDIYTILGNLELEIKE

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.3e-11835.47Show/hide
Query:  TLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLID-CYANLGL--LNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTLLVYQQMVAKSMHPDE
        +LS L N C + Q L+ IHA+ +  G H      SKLI+ C  +     L  ++ VF  I +PN  ++N + R      +    L +Y  M++  + P+ 
Subjt:  TLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLID-CYANLGL--LNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTLLVYQQMVAKSMHPDE

Query:  ETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAEQLVPD
         T+P VL+SC        G+++HG+++KLG D    V T+L  MY +    E AH++FDK   +D+  +                               
Subjt:  ETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAEQLVPD

Query:  SLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARSGIRS
                                               TAL+  Y+  G + +A+KLFD++P KD V WN MI+ YA  G   E LELFK M ++ +R 
Subjt:  SLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARSGIRS

Query:  DMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFSRMKSDGIQ
        D  T + V+S+ +Q   ++ G+Q H  +  +G  S + + N+LID+Y +C  L++AC +F  +  K VISW+ +I GY        AL LF  M   G  
Subjt:  DMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFSRMKSDGIQ

Query:  ADFITVINILPAFVHIGVLENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMK
         + +T+++ILPA  H+G ++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    I  K L  WN+MI   A HG     F L+++M+
Subjt:  ADFITVINILPAFVHIGVLENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMK

Query:  CSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAE
            +PD +TF+GLL+AC +SG+++ G+  F+ MT++Y   P  EHY CM++LLG +GL  EA E++  M ++PD  +W  LL ACK+H   +L E  AE
Subjt:  CSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAE

Query:  KLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKE
         LI +EP+N G+Y+LLSNIYA+AG+W+ VAK R+ L DKG+KK PGCS +EI+  V EF + D+ HPR  +IY +L  +E+ +++
Subjt:  KLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKE

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226908.1e-11331.93Show/hide
Query:  QSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYAIIDPNST-LYNAILRNLTRYGECERTLLVYQQMVA
        QS+           C +   L+  H      G   + +  +KL+     LG    L+ + +VF       +  +YN+++R     G C   +L++ +M+ 
Subjt:  QSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYAIIDPNST-LYNAILRNLTRYGECERTLLVYQQMVA

Query:  KSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRM-
          + PD+ T+P  L +C      G G ++HG +VK+G+     V  +L   Y EC + + A ++FD+ S +++  W+S+     +    +    LF RM 
Subjt:  KSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRM-

Query:  RAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKS
        R E++ P+S+T + ++ + A L  +   + V++    S +  + L+ +AL+ +Y K  ++  A++LFD+    +  + N M + Y R G   E L +F  
Subjt:  RAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKS

Query:  MARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQ---------
        M  SG+R D  + L  ISS SQL+ + WGK  H +VLRNG +S  ++ N+LIDMY +C+  D+A +IF+ M++K+V++W++++ GYV++G+         
Subjt:  MARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQ---------

Query:  ---------------SLIALSLF--------SRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRI
                        L+  SLF        S    +G+ AD +T+++I  A  H+G L+  K+++ Y  K G+     L T L+  +++CG  E A  I
Subjt:  ---------------SLIALSLF--------SRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRI

Query:  FEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINE
        F    + ++D+  W + I A A  G   +  +L++ M     KPD V F+G LTAC + GLV++GKE F  M + +   P   HY CMV+LLGRAGL+ E
Subjt:  FEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINE

Query:  AGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVA
        A +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA+AG+W+ +AK+R  +++KGL+K PG S ++I G   EF   
Subjt:  AGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVA

Query:  DQTHPRAEDIYTIL
        D++HP   +I  +L
Subjt:  DQTHPRAEDIYTIL

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.1e-11233.68Show/hide
Query:  HGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLV
        +GF  +  L SKL   Y N G L  + +VF  +    +  +N ++  L + G+   ++ ++++M++  +  D  T+  V +S  S  +V  G ++HG+++
Subjt:  HGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLV

Query:  KLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSIT
        K GF   + V  +L   Y +    + A ++FD+ + +D+  W+S+      NG  E    +F +M    +  D  T +++    A    I L + VHSI 
Subjt:  KLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSIT

Query:  IVSKLC---GDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQT
        +  K C    D   NT LL +YSK G L  A+ +F +M ++  V +  MIA YAR G   E ++LF+ M   GI  D++T   V++  ++ + +D GK+ 
Subjt:  IVSKLC---GDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQT

Query:  HAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFS-RMKSDGIQADFITVINILPAFVHIGVLENVK
        H  +  N     + V N+L+DMY +C  +  A  +F+ M  K +ISW+ +I GY K+  +  ALSLF+  ++      D  TV  +LPA   +   +  +
Subjt:  HAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFS-RMKSDGIQADFITVINILPAFVHIGVLENVK

Query:  YLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVE
         +HGY M+ G  S   +  +L+  YAKCG + +A  +F++  I  KDL+ W  MI+ +  HG   +   L+NQM+ +  + D+++F+ LL AC +SGLV+
Subjt:  YLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVE

Query:  RGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGK
         G  FF  M      +P+ EHYAC+V++L R G + +A   + NMPI PDA +WG LL  C++H   KLAE  AEK+ ++EP+N G Y+L++NIYA A K
Subjt:  RGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGK

Query:  WDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEVREKSIDK
        W+ V ++R  +  +GL+K PGCSW+EI G V  F   D ++P  E       N+E  +++VR + I++
Subjt:  WDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEVREKSIDK

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.2e-12035.47Show/hide
Query:  TLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLID-CYANLGL--LNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTLLVYQQMVAKSMHPDE
        +LS L N C + Q L+ IHA+ +  G H      SKLI+ C  +     L  ++ VF  I +PN  ++N + R      +    L +Y  M++  + P+ 
Subjt:  TLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLID-CYANLGL--LNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTLLVYQQMVAKSMHPDE

Query:  ETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAEQLVPD
         T+P VL+SC        G+++HG+++KLG D    V T+L  MY +    E AH++FDK   +D+  +                               
Subjt:  ETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAEQLVPD

Query:  SLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARSGIRS
                                               TAL+  Y+  G + +A+KLFD++P KD V WN MI+ YA  G   E LELFK M ++ +R 
Subjt:  SLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARSGIRS

Query:  DMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFSRMKSDGIQ
        D  T + V+S+ +Q   ++ G+Q H  +  +G  S + + N+LID+Y +C  L++AC +F  +  K VISW+ +I GY        AL LF  M   G  
Subjt:  DMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFSRMKSDGIQ

Query:  ADFITVINILPAFVHIGVLENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMK
         + +T+++ILPA  H+G ++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    I  K L  WN+MI   A HG     F L+++M+
Subjt:  ADFITVINILPAFVHIGVLENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMK

Query:  CSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAE
            +PD +TF+GLL+AC +SG+++ G+  F+ MT++Y   P  EHY CM++LLG +GL  EA E++  M ++PD  +W  LL ACK+H   +L E  AE
Subjt:  CSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAE

Query:  KLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKE
         LI +EP+N G+Y+LLSNIYA+AG+W+ VAK R+ L DKG+KK PGCS +EI+  V EF + D+ HPR  +IY +L  +E+ +++
Subjt:  KLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKE

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein4.4e-12232.87Show/hide
Query:  SKPIIQSPIFPNFPATQSRLLNTLS---------------FLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNST
        S  ++Q    P  P   SR  + LS                L  RCSS + L+QI      +G +Q     +KL+  +   G ++ + +VF  I    + 
Subjt:  SKPIIQSPIFPNFPATQSRLLNTLS---------------FLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNST

Query:  LYNAILRNLTRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVA-TALAEMYEECIDFEHAHQLFDKRSVKD
        LY+ +L+   +  + ++ L  + +M    + P    +  +L+ C   + +  G+++HG LVK GF S D+ A T L  MY +C     A ++FD+   +D
Subjt:  LYNAILRNLTRYGECERTLLVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVA-TALAEMYEECIDFEHAHQLFDKRSVKD

Query:  LECWSSLTTETPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEK
        L  W+++     QNG       +   M  E L P  +T +++L +++ L  I + K +H   + S     + ++TAL+ +Y+K GSL  AR+LFD M E+
Subjt:  LECWSSLTTETPQNGNGEGIFRLFGRMRAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEK

Query:  DRVVWNIMIAAYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTD
        + V WN MI AY +   P E + +F+ M   G++    + +  + + + L  ++ G+  H   +  G D  VSV NSLI MY +C  +D+A  +F  +  
Subjt:  DRVVWNIMIAAYARGGKPTECLELFKSMARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTD

Query:  KSVISWSAMIKGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEER
        ++++SW+AMI G+ ++G+ + AL+ FS+M+S  ++ D  T ++++ A   + +  + K++HG  M+  L     + TAL+  YAKCG I +A+ IF  + 
Subjt:  KSVISWSAMIKGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEER

Query:  IDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELV
        + ++ +  WN+MI  +  HG      +L+ +M+    KP+ VTFL +++AC +SGLVE G + F  M ENY  + S +HY  MV+LLGRAG +NEA + +
Subjt:  IDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELV

Query:  RNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHP
          MP+KP   V+G +L AC++H     AE AAE+L ++ P + G ++LL+NIY AA  W+ V ++R  +  +GL+KTPGCS +EI   V  F      HP
Subjt:  RNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHP

Query:  RAEDIYTILGNLELEIKE
         ++ IY  L  L   IKE
Subjt:  RAEDIYTILGNLELEIKE

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)5.8e-11431.93Show/hide
Query:  QSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYAIIDPNST-LYNAILRNLTRYGECERTLLVYQQMVA
        QS+           C +   L+  H      G   + +  +KL+     LG    L+ + +VF       +  +YN+++R     G C   +L++ +M+ 
Subjt:  QSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYAIIDPNST-LYNAILRNLTRYGECERTLLVYQQMVA

Query:  KSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRM-
          + PD+ T+P  L +C      G G ++HG +VK+G+     V  +L   Y EC + + A ++FD+ S +++  W+S+     +    +    LF RM 
Subjt:  KSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRM-

Query:  RAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKS
        R E++ P+S+T + ++ + A L  +   + V++    S +  + L+ +AL+ +Y K  ++  A++LFD+    +  + N M + Y R G   E L +F  
Subjt:  RAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKS

Query:  MARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQ---------
        M  SG+R D  + L  ISS SQL+ + WGK  H +VLRNG +S  ++ N+LIDMY +C+  D+A +IF+ M++K+V++W++++ GYV++G+         
Subjt:  MARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQ---------

Query:  ---------------SLIALSLF--------SRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRI
                        L+  SLF        S    +G+ AD +T+++I  A  H+G L+  K+++ Y  K G+     L T L+  +++CG  E A  I
Subjt:  ---------------SLIALSLF--------SRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRI

Query:  FEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINE
        F    + ++D+  W + I A A  G   +  +L++ M     KPD V F+G LTAC + GLV++GKE F  M + +   P   HY CMV+LLGRAGL+ E
Subjt:  FEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINE

Query:  AGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVA
        A +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA+AG+W+ +AK+R  +++KGL+K PG S ++I G   EF   
Subjt:  AGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVA

Query:  DQTHPRAEDIYTIL
        D++HP   +I  +L
Subjt:  DQTHPRAEDIYTIL

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification5.8e-11431.93Show/hide
Query:  QSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYAIIDPNST-LYNAILRNLTRYGECERTLLVYQQMVA
        QS+           C +   L+  H      G   + +  +KL+     LG    L+ + +VF       +  +YN+++R     G C   +L++ +M+ 
Subjt:  QSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGL---LNLSLQVFYAIIDPNST-LYNAILRNLTRYGECERTLLVYQQMVA

Query:  KSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRM-
          + PD+ T+P  L +C      G G ++HG +VK+G+     V  +L   Y EC + + A ++FD+ S +++  W+S+     +    +    LF RM 
Subjt:  KSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRM-

Query:  RAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKS
        R E++ P+S+T + ++ + A L  +   + V++    S +  + L+ +AL+ +Y K  ++  A++LFD+    +  + N M + Y R G   E L +F  
Subjt:  RAEQLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKS

Query:  MARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQ---------
        M  SG+R D  + L  ISS SQL+ + WGK  H +VLRNG +S  ++ N+LIDMY +C+  D+A +IF+ M++K+V++W++++ GYV++G+         
Subjt:  MARSGIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQ---------

Query:  ---------------SLIALSLF--------SRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRI
                        L+  SLF        S    +G+ AD +T+++I  A  H+G L+  K+++ Y  K G+     L T L+  +++CG  E A  I
Subjt:  ---------------SLIALSLF--------SRMKSDGIQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRI

Query:  FEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINE
        F    + ++D+  W + I A A  G   +  +L++ M     KPD V F+G LTAC + GLV++GKE F  M + +   P   HY CMV+LLGRAGL+ E
Subjt:  FEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINE

Query:  AGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVA
        A +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA+AG+W+ +AK+R  +++KGL+K PG S ++I G   EF   
Subjt:  AGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVA

Query:  DQTHPRAEDIYTIL
        D++HP   +I  +L
Subjt:  DQTHPRAEDIYTIL

AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-11433.48Show/hide
Query:  QSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTLLVYQQ-MVAKSM
        +S+ ++ +  LF  C++ Q  + +HAR V+    QN  +S+KL++ Y  LG + L+   F  I + +   +N ++    R G     +  +   M++  +
Subjt:  QSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTLLVYQQ-MVAKSM

Query:  HPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAEQ
         PD  T+PSVL++C     V  G K+H   +K GF     VA +L  +Y       +A  LFD+  V+D+  W+++ +   Q+GN +    L   +RA  
Subjt:  HPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAEQ

Query:  LVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARS
           DS+T ++LL +            +HS +I   L  +L V+  L+ LY++ G L D +K+FD+M  +D + WN +I AY    +P   + LF+ M  S
Subjt:  LVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARS

Query:  GIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNG-SDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFSRMK
         I+ D  T + + S +SQL  +   +      LR G     +++ N+++ MY +  ++DSA  +FN + +  VISW+ +I GY ++G +  A+ +++ M+
Subjt:  GIRSDMFTALPVISSISQLKCVDWGKQTHAHVLRNG-SDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFSRMK

Query:  SDG-IQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLY
         +G I A+  T +++LPA    G L     LHG  +K GL     + T+L   Y KCG +E A  +F +  I   + + WN++I+ H  HG   +   L+
Subjt:  SDG-IQADFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLY

Query:  NQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAE
         +M     KPD +TF+ LL+AC +SGLV+ G+  F+ M  +Y   PS +HY CMV++ GRAG +  A + +++M ++PDA +WG LLSAC++H    L +
Subjt:  NQMKCSNTKPDQVTFLGLLTACVNSGLVERGKEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAE

Query:  FAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEV
         A+E L ++EP++ G ++LLSN+YA+AGKW+GV ++RS    KGL+KTPG S +E++  V  F   +QTHP  E++Y  L  L+ ++K +
Subjt:  FAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCACCTTCAACGATCAAAACCCATTATTCAGAGTCCCATTTTCCCCAATTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCTTTCCTCTTCAATCGATG
CAGTTCCCGTCAACACCTGCAGCAAATTCATGCCAGGTTCGTCCTCCATGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATTGATTGCTATGCCAATCTTGGGC
TCCTTAATCTCTCTCTCCAAGTTTTCTACGCTATAATCGACCCCAATTCGACTCTTTACAACGCCATACTAAGAAATTTGACAAGATATGGAGAATGTGAGCGGACCCTG
TTGGTGTACCAACAAATGGTCGCAAAGTCTATGCACCCAGATGAAGAAACTTACCCTTCTGTTTTGCGATCATGTTGTTCTTTTTCAAATGTCGGATTTGGGAGGAAGGT
TCATGGGTACTTGGTTAAGCTGGGTTTTGATTCGTTTGATATGGTAGCTACTGCTCTGGCTGAGATGTACGAGGAATGCATTGATTTTGAGCATGCTCATCAACTGTTTG
ATAAAAGGTCTGTGAAGGATTTGGAATGCTGGAGTTCCTTGACTACGGAGACTCCTCAAAATGGGAATGGGGAGGGAATTTTTCGGCTCTTTGGGAGGATGAGAGCAGAA
CAATTAGTACCAGACTCACTTACATTCATCAATCTCTTGAGGTCCATTGCAGGGTTGAATTCAATTCGACTTGCAAAGATTGTTCATTCTATTACAATTGTGAGCAAATT
ATGTGGAGATTTGTTAGTAAATACTGCTTTGTTGTCTCTTTACTCAAAGTTAGGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGCCAGAGAAAGACCGTGTTG
TATGGAATATAATGATAGCAGCTTACGCCCGGGGAGGGAAACCGACGGAATGTCTCGAGCTTTTCAAGTCCATGGCAAGATCAGGGATTAGATCTGATATGTTTACTGCA
CTTCCTGTTATTTCTTCAATTTCACAGTTGAAATGTGTTGATTGGGGCAAACAAACTCATGCTCATGTATTGAGGAATGGTTCCGACAGTCAAGTTTCAGTTCATAACTC
TCTTATTGACATGTACCGGGAATGTAACATTTTAGATTCAGCTTGTAAGATCTTCAACTGTATGACAGACAAGTCTGTAATTTCATGGAGTGCTATGATTAAGGGGTATG
TCAAACATGGTCAGTCCCTCATTGCATTGTCTCTCTTCTCCAGGATGAAATCTGATGGGATTCAAGCTGATTTCATTACAGTAATCAATATCTTACCTGCATTTGTTCAC
ATAGGAGTACTTGAAAATGTCAAATATTTACATGGGTACTCAATGAAACTAGGCCTGACTTCCCTTCCATCACTTAACACAGCCCTTCTAATCACCTATGCAAAATGTGG
GTGTATAGAGATGGCCCAGAGGATATTTGAGGAAGAGAGAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAACCATGGAGAATGGTCCC
AATGTTTTAAGCTATACAATCAAATGAAGTGCTCAAACACAAAACCAGACCAAGTAACATTTCTGGGATTGCTAACAGCTTGTGTCAATTCCGGTCTTGTAGAAAGGGGG
AAAGAGTTTTTCAAGGAGATGACTGAAAATTATGACTGCCAACCAAGTCAAGAGCATTATGCTTGTATGGTTAACCTCTTAGGGAGAGCTGGGCTTATCAATGAAGCTGG
AGAACTTGTGAGAAACATGCCCATCAAACCCGATGCTCGAGTTTGGGGTCCGTTGTTGAGTGCATGTAAGTTGCATCCTGGGTCCAAGCTTGCAGAGTTTGCGGCCGAGA
AGCTCATCGATATGGAGCCTAAAAATGCAGGGAATTACATACTGCTTTCAAACATATATGCCGCTGCAGGTAAATGGGATGGAGTTGCAAAAATGAGAAGTTTCTTAAGG
GATAAAGGGCTAAAGAAAACCCCTGGTTGTAGTTGGCTGGAGATAAATGGCCATGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATAC
CATCCTAGGAAACCTTGAACTTGAAATCAAAGAAGTTAGAGAAAAGAGTATAGATAAATTAAATCCTCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCACCTTCAACGATCAAAACCCATTATTCAGAGTCCCATTTTCCCCAATTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCTTTCCTCTTCAATCGATG
CAGTTCCCGTCAACACCTGCAGCAAATTCATGCCAGGTTCGTCCTCCATGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATTGATTGCTATGCCAATCTTGGGC
TCCTTAATCTCTCTCTCCAAGTTTTCTACGCTATAATCGACCCCAATTCGACTCTTTACAACGCCATACTAAGAAATTTGACAAGATATGGAGAATGTGAGCGGACCCTG
TTGGTGTACCAACAAATGGTCGCAAAGTCTATGCACCCAGATGAAGAAACTTACCCTTCTGTTTTGCGATCATGTTGTTCTTTTTCAAATGTCGGATTTGGGAGGAAGGT
TCATGGGTACTTGGTTAAGCTGGGTTTTGATTCGTTTGATATGGTAGCTACTGCTCTGGCTGAGATGTACGAGGAATGCATTGATTTTGAGCATGCTCATCAACTGTTTG
ATAAAAGGTCTGTGAAGGATTTGGAATGCTGGAGTTCCTTGACTACGGAGACTCCTCAAAATGGGAATGGGGAGGGAATTTTTCGGCTCTTTGGGAGGATGAGAGCAGAA
CAATTAGTACCAGACTCACTTACATTCATCAATCTCTTGAGGTCCATTGCAGGGTTGAATTCAATTCGACTTGCAAAGATTGTTCATTCTATTACAATTGTGAGCAAATT
ATGTGGAGATTTGTTAGTAAATACTGCTTTGTTGTCTCTTTACTCAAAGTTAGGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGCCAGAGAAAGACCGTGTTG
TATGGAATATAATGATAGCAGCTTACGCCCGGGGAGGGAAACCGACGGAATGTCTCGAGCTTTTCAAGTCCATGGCAAGATCAGGGATTAGATCTGATATGTTTACTGCA
CTTCCTGTTATTTCTTCAATTTCACAGTTGAAATGTGTTGATTGGGGCAAACAAACTCATGCTCATGTATTGAGGAATGGTTCCGACAGTCAAGTTTCAGTTCATAACTC
TCTTATTGACATGTACCGGGAATGTAACATTTTAGATTCAGCTTGTAAGATCTTCAACTGTATGACAGACAAGTCTGTAATTTCATGGAGTGCTATGATTAAGGGGTATG
TCAAACATGGTCAGTCCCTCATTGCATTGTCTCTCTTCTCCAGGATGAAATCTGATGGGATTCAAGCTGATTTCATTACAGTAATCAATATCTTACCTGCATTTGTTCAC
ATAGGAGTACTTGAAAATGTCAAATATTTACATGGGTACTCAATGAAACTAGGCCTGACTTCCCTTCCATCACTTAACACAGCCCTTCTAATCACCTATGCAAAATGTGG
GTGTATAGAGATGGCCCAGAGGATATTTGAGGAAGAGAGAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAACCATGGAGAATGGTCCC
AATGTTTTAAGCTATACAATCAAATGAAGTGCTCAAACACAAAACCAGACCAAGTAACATTTCTGGGATTGCTAACAGCTTGTGTCAATTCCGGTCTTGTAGAAAGGGGG
AAAGAGTTTTTCAAGGAGATGACTGAAAATTATGACTGCCAACCAAGTCAAGAGCATTATGCTTGTATGGTTAACCTCTTAGGGAGAGCTGGGCTTATCAATGAAGCTGG
AGAACTTGTGAGAAACATGCCCATCAAACCCGATGCTCGAGTTTGGGGTCCGTTGTTGAGTGCATGTAAGTTGCATCCTGGGTCCAAGCTTGCAGAGTTTGCGGCCGAGA
AGCTCATCGATATGGAGCCTAAAAATGCAGGGAATTACATACTGCTTTCAAACATATATGCCGCTGCAGGTAAATGGGATGGAGTTGCAAAAATGAGAAGTTTCTTAAGG
GATAAAGGGCTAAAGAAAACCCCTGGTTGTAGTTGGCTGGAGATAAATGGCCATGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATAC
CATCCTAGGAAACCTTGAACTTGAAATCAAAGAAGTTAGAGAAAAGAGTATAGATAAATTAAATCCTCTGTAA
Protein sequenceShow/hide protein sequence
MLHLQRSKPIIQSPIFPNFPATQSRLLNTLSFLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDCYANLGLLNLSLQVFYAIIDPNSTLYNAILRNLTRYGECERTL
LVYQQMVAKSMHPDEETYPSVLRSCCSFSNVGFGRKVHGYLVKLGFDSFDMVATALAEMYEECIDFEHAHQLFDKRSVKDLECWSSLTTETPQNGNGEGIFRLFGRMRAE
QLVPDSLTFINLLRSIAGLNSIRLAKIVHSITIVSKLCGDLLVNTALLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIAAYARGGKPTECLELFKSMARSGIRSDMFTA
LPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVHNSLIDMYRECNILDSACKIFNCMTDKSVISWSAMIKGYVKHGQSLIALSLFSRMKSDGIQADFITVINILPAFVH
IGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGEWSQCFKLYNQMKCSNTKPDQVTFLGLLTACVNSGLVERG
KEFFKEMTENYDCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR
DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEVREKSIDKLNPL