; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011684 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011684
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat
Genome locationChr01:9312176..9314392
RNA-Seq ExpressionHG10011684
SyntenyHG10011684
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015445.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0086.54Show/hide
Query:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
        MSLYR LLRS R SSTSPS+S+AL+IGPLNHH   P PPSSQ SSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
Subjt:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL

Query:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
        PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
Subjt:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV

Query:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN
        GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLV+NNLISGFLNLEN+EKANELFDELKERCLVYDGVVNATFMDWFFN
Subjt:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN

Query:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM
        RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC KLGKFSEAVETFRKVGTQPKSRPFAM
Subjt:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM

Query:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE
        DVAGYNNII RFCEQGMM DAETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFG+ VFGELIKNGK VDCAQILTKMGE
Subjt:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE

Query:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP
        RDPKPDPTCYDVVIRGLCNEGALD SRELLDQI RYGIGLTPTL+EFVKEAFVKAGR EEIERLLN+N  GH PYR PSGPPRI QSQVPPQM PP  PP
Subjt:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP

Query:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGPN---YFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSG
        Q    MAEP+WRPS+NPQA GSY PSSPQM+GP        P++  S + Q +G +Y+ S S QMTGP     G   M  P++  S + Q  G +Y  S 
Subjt:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGPN---YFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSG

Query:  PAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA
        P QM GPNYFQSG+AQ+TRP QP  DP PMEEQHHSQQPPQ+A
Subjt:  PAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA

XP_022984746.1 pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita maxima]0.0e+0086.27Show/hide
Query:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
        MSLYR LLRS R SSTSPS+S++L+IGPLNHH   PIPPSSQ+SSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
Subjt:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL

Query:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
        PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
Subjt:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV

Query:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN
         LEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLV+NNLISGFLNLEN+EKANELFDELKERCLVYDGVVNATFMDWFFN
Subjt:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN

Query:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM
        RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC KLGKFSEAVETFRKVGTQPKSRPFAM
Subjt:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM

Query:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE
        DVAGYNNII RFCEQGMM DAETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFG+ VFGELIKNGK VDCAQILTKMGE
Subjt:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE

Query:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP
        RDPKPDPTCYDVVI+GLCNEGALD SRELLDQI RYGIGLTP L+EFVKEAFVKAGR EEIERLLN+N  GH PYRPPSGPPRI QSQVPPQM PP+ PP
Subjt:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP

Query:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGPN---YFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSG
        Q    MAEP+W+PS+NPQA GS APSSPQM+GP        P++  S + Q  G +Y+ S S QMTGP     G   M  P++  S + Q  G +Y  S 
Subjt:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGPN---YFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSG

Query:  PAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA
        P QM GPNYFQSG+AQ+TRPQQP  DP PMEEQHHSQQPPQ+A
Subjt:  PAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA

XP_023552661.1 pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0084.02Show/hide
Query:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
        MSLYR LLRS R SSTSPS+S+AL+IGPLNHH   PIPPSSQ+SSPISLLH RSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
Subjt:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL

Query:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
        PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
Subjt:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV

Query:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN
        GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLV+NNLISGFLNLEN+EKANELFDELKERCLVYDGVVNATFMDWFFN
Subjt:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN

Query:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM
        RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC KLGKFSEAVETFRKVGTQPKSRPFAM
Subjt:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM

Query:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE
        DVAGYNNII RFCEQGMM DAETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFG+ VFGELIKNGK VDCAQILTKMGE
Subjt:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE

Query:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP
        RDPKPDPTCYDVVIRGLCNEGALD SRELLDQI RYGIGLTPTL+EFVKEAFVKAGR EEIERLLN+N  GH PYRPPSGPPRI QSQVPPQM PP+ PP
Subjt:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP

Query:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGP---------------NYFQGPNYSQSGSAQMSGP---------------------NYSQSGSTQMTG
        Q    MAEP+WRPS+NPQA GSYAPSSPQM+GP               N   G +Y  S S QM+GP                     +Y+ S S QMTG
Subjt:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGP---------------NYFQGPNYSQSGSAQMSGP---------------------NYSQSGSTQMTG

Query:  PNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA
        P     G   M  P++  S + Q  G +Y  S P QM GPNYFQSG+AQ+TRPQQPS DP PMEEQHHSQQPPQ+A
Subjt:  PNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA

XP_023552663.1 pentatricopeptide repeat-containing protein At1g10270-like isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0086.94Show/hide
Query:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
        MSLYR LLRS R SSTSPS+S+AL+IGPLNHH   PIPPSSQ+SSPISLLH RSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
Subjt:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL

Query:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
        PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
Subjt:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV

Query:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN
        GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLV+NNLISGFLNLEN+EKANELFDELKERCLVYDGVVNATFMDWFFN
Subjt:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN

Query:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM
        RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC KLGKFSEAVETFRKVGTQPKSRPFAM
Subjt:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM

Query:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE
        DVAGYNNII RFCEQGMM DAETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFG+ VFGELIKNGK VDCAQILTKMGE
Subjt:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE

Query:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP
        RDPKPDPTCYDVVIRGLCNEGALD SRELLDQI RYGIGLTPTL+EFVKEAFVKAGR EEIERLLN+N  GH PYRPPSGPPRI QSQVPPQM PP+ PP
Subjt:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP

Query:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGPN---YFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSG
        Q    MAEP+WRPS+NPQA GSY PSSPQM+GP        P++  S + Q +G +Y+ S S QMTGP     G   M  P++  S + Q  G +Y  S 
Subjt:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGPN---YFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSG

Query:  PAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA
        P QM GPNYFQSG+AQ+TRPQQPS DP PMEEQHHSQQPPQ+A
Subjt:  PAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA

XP_038905008.1 pentatricopeptide repeat-containing protein At1g10270 [Benincasa hispida]0.0e+0083.56Show/hide
Query:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
        MSLYRFLLRSL  SSTSPSNSR LTIGPLNHHFQEPIPP     SPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
Subjt:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL

Query:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
        PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRY DAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
Subjt:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV

Query:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN
        GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRI EAVDLLREMLNKGHGADSLVFNNLISGFLNLEN+EKANELFDELKERCLVYDGVVNATFMDWFFN
Subjt:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN

Query:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM
        RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC KLGKFSEAVETFRKVGTQPKSRPFAM
Subjt:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM

Query:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE
        DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKI QIDD LRVFNRMVDVGLRVVASFG+ VFGELIKNGKAVDCAQILTKMGE
Subjt:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE

Query:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP
        RDPKPDPTCYDVVIRGLCNEG LD SRELLDQI RYGIGLTPTL+EFVKEAF KAGRHEEIERLLN+N  GH PYRPPSGPPRI QSQVPPQM PPQG P
Subjt:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP

Query:  QMAEPNWRPSLNPQARGSYAPSSPQMSGPNYFQG-----------------------------PNYSQS-----------GSAQMSGPNYSQSGSTQMTG
        QM EPNWRPS+NPQARGSYAPSSPQMSG NYFQ                              PN+  S            S Q+S P+Y Q+GSTQMTG
Subjt:  QMAEPNWRPSLNPQARGSYAPSSPQMSGPNYFQG-----------------------------PNYSQS-----------GSAQMSGPNYSQSGSTQMTG

Query:  PNYFQSGPAQMT----------------------SPNYSES-----------GSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQH
        PNYFQSG AQMT                       PN+  S            S+QM+GP+Y QSG  QM GPNYFQSG++Q+TRPQ PSSDPPPMEEQ+
Subjt:  PNYFQSGPAQMT----------------------SPNYSES-----------GSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQH

Query:  HSQQPPQMA
        HSQQPPQMA
Subjt:  HSQQPPQMA

TrEMBL top hitse value%identityAlignment
A0A1S3C908 pentatricopeptide repeat-containing protein At1g102700.0e+0084.87Show/hide
Query:  MSLYRFLLRSLRPSSTSPSNSRAL-TIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPR
        MS YRFLLRSLR SSTSPS + AL TI PLNHH    IPPSSQTSSPISLL ARSF+FSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPR
Subjt:  MSLYRFLLRSLRPSSTSPSNSRAL-TIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPR

Query:  LPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVD
        LPDSTSALVGPRLNLHNRVQSLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVD
Subjt:  LPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVD

Query:  VGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFF
        VGLE+YRHIIANAPFSPSAVTYRHLTKGLID+GRI EAVDLLREMLNKGHGADSLVFNNLISGFLNL N+ KANELFDELKERCLVYDGVVNATFMDWFF
Subjt:  VGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFF

Query:  NRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFA
        N+GKEKEAMESYKSLLDRQFKM+PATCNVLLEVLLKH KKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC KLGKF+EAVETFRKVGTQPKSRPFA
Subjt:  NRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFA

Query:  MDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMG
        MDVAGYNNIIARFCEQGMM DAETFFAELCSKSLSPDVPTHRTLIESYLKI QIDD LRVFNRMVDVGLRVVASFG+ VFGELIKNGKA DCAQILTKMG
Subjt:  MDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMG

Query:  ERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQM-RPPQG
        ERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQI RYGIGLTPTLEEFVK+AFVKAGRHEEIERLLN+N  GH  YRPPSGPPRI QSQVPPQM RP QG
Subjt:  ERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQM-RPPQG

Query:  PPQMAEPNWRPSLNPQARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTGP-----------NYFQSGPAQMTSPNYSE---------
        PPQMAEPNWRPS+NPQARGSY  SSPQMS P++F      QSG  Q +G NY QSGS QMT P            +    P QM  PN+           
Subjt:  PPQMAEPNWRPSLNPQARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTGP-----------NYFQSGPAQMTSPNYSE---------

Query:  --SGSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA
            S QM+GP+Y QS  +QM G NYFQS + Q+TRPQQPSSD  P+EEQ+HS+QPPQMA
Subjt:  --SGSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA

A0A5A7TJN2 ACT11D09.40.0e+0085.22Show/hide
Query:  MSLYRFLLRSLRPSSTSPSNSRAL-TIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPR
        MS YRFLLRSLR SSTSPS + AL TI PLNHH    IPPSSQTSSPISLL ARSF+FSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPR
Subjt:  MSLYRFLLRSLRPSSTSPSNSRAL-TIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPR

Query:  LPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVD
        LPDSTSALVGPRLNLHNRVQSLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVD
Subjt:  LPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVD

Query:  VGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFF
        VGLE+YRHIIANAPFSPSAVTYRHLTKGLID+GRI EAVDLLREMLNKGHGADSLVFNNLISGFLNL N+ KANELFDELKERCLVYDGVVNATFMDWFF
Subjt:  VGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFF

Query:  NRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFA
        N+GKEKEAMESYKSLLDRQFKM+PATCNVLLEVLLKH KKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC KLGKF+EAVETFRKVGTQPKSRPFA
Subjt:  NRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFA

Query:  MDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMG
        MDVAGYNNIIARFCEQGMM DAETFFAELCSKSLSPDVPTHRTLIESYLKI QIDD LRVFNRMVDVGLRVVASFG+ VFGELIKNGKA DCAQILTKMG
Subjt:  MDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMG

Query:  ERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQM-RPPQG
        ERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQI RYGIGLTPTLEEFVK+AFVKAGRHEEIERLLN+N  GH  YRPPSGPPRI QSQVPPQM RP QG
Subjt:  ERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQM-RPPQG

Query:  PPQMAEPNWRPSLNPQARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTGP-----------NYFQSGPAQMTSPNYSES--------
        PPQMAEPNWRPS+NPQARGSY  SSPQMS P++F      QSG  QM+G NY QSGS QMT P            +    P QM  PN+  S        
Subjt:  PPQMAEPNWRPSLNPQARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTGP-----------NYFQSGPAQMTSPNYSES--------

Query:  -GSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA
          S QM+  ++ QSGP Q  G NYFQSG+AQ+T+PQ  S DPP MEE HHSQQPPQMA
Subjt:  -GSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA

A0A6J1EMZ6 pentatricopeptide repeat-containing protein At1g10270-like0.0e+0086.41Show/hide
Query:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
        MSLYR LLRS R SSTSPS+S+AL+IGPLNHH   P PPSSQ SSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
Subjt:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL

Query:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
        PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
Subjt:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV

Query:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN
        GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLV+NNLISGFLNLEN+EKANELFDELKERCLVYDGVVNATFMDWFFN
Subjt:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN

Query:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM
        RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC KLGKFSEAVETFRKVGTQPKSRPFAM
Subjt:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM

Query:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE
        DVAGYNNII RFCEQGMM DAETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFG+ VFGELIKNGK VDCAQILTKMGE
Subjt:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE

Query:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP
        RDPKPDPTCYDVVIRGLCNEGALD SRELLDQI RYGIGLTPTL+EFVKEAFVKAGR EEIERLLN+N  GH PYR PSGPPRI QSQVPPQM PP  PP
Subjt:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP

Query:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGPN---YFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSG
        Q    MAEP+WRPS+NPQA GSY PSSPQM+GP        P++  S + Q +G +Y+ S S QMTGP     G   M  P++  S + Q  G +Y  S 
Subjt:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGPN---YFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSG

Query:  PAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA
        P QM GP YFQSG+AQ+TRP QP  DP PMEEQHHSQQPPQ+A
Subjt:  PAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA

A0A6J1J312 pentatricopeptide repeat-containing protein At1g10270-like isoform X10.0e+0086.27Show/hide
Query:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
        MSLYR LLRS R SSTSPS+S++L+IGPLNHH   PIPPSSQ+SSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL
Subjt:  MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRL

Query:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
        PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV
Subjt:  PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDV

Query:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN
         LEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLV+NNLISGFLNLEN+EKANELFDELKERCLVYDGVVNATFMDWFFN
Subjt:  GLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFN

Query:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM
        RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC KLGKFSEAVETFRKVGTQPKSRPFAM
Subjt:  RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAM

Query:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE
        DVAGYNNII RFCEQGMM DAETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFG+ VFGELIKNGK VDCAQILTKMGE
Subjt:  DVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGE

Query:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP
        RDPKPDPTCYDVVI+GLCNEGALD SRELLDQI RYGIGLTP L+EFVKEAFVKAGR EEIERLLN+N  GH PYRPPSGPPRI QSQVPPQM PP+ PP
Subjt:  RDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPP

Query:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGPN---YFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSG
        Q    MAEP+W+PS+NPQA GS APSSPQM+GP        P++  S + Q  G +Y+ S S QMTGP     G   M  P++  S + Q  G +Y  S 
Subjt:  Q----MAEPNWRPSLNPQARGSYAPSSPQMSGPN---YFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSG

Query:  PAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA
        P QM GPNYFQSG+AQ+TRPQQP  DP PMEEQHHSQQPPQ+A
Subjt:  PAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA

Q6E438 ACT11D09.40.0e+0085.22Show/hide
Query:  MSLYRFLLRSLRPSSTSPSNSRAL-TIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPR
        MS YRFLLRSLR SSTSPS + AL TI PLNHH    IPPSSQTSSPISLL ARSF+FSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPR
Subjt:  MSLYRFLLRSLRPSSTSPSNSRAL-TIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPR

Query:  LPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVD
        LPDSTSALVGPRLNLHNRVQSLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVD
Subjt:  LPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVD

Query:  VGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFF
        VGLE+YRHIIANAPFSPSAVTYRHLTKGLID+GRI EAVDLLREMLNKGHGADSLVFNNLISGFLNL N+ KANELFDELKERCLVYDGVVNATFMDWFF
Subjt:  VGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFF

Query:  NRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFA
        N+GKEKEAMESYKSLLDRQFKM+PATCNVLLEVLLKH KKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNEC KLGKF+EAVETFRKVGTQPKSRPFA
Subjt:  NRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFA

Query:  MDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMG
        MDVAGYNNIIARFCEQGMM DAETFFAELCSKSLSPDVPTHRTLIESYLKI QIDD LRVFNRMVDVGLRVVASFG+ VFGELIKNGKA DCAQILTKMG
Subjt:  MDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMG

Query:  ERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQM-RPPQG
        ERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQI RYGIGLTPTLEEFVK+AFVKAGRHEEIERLLN+N  GH  YRPPSGPPRI QSQVPPQM RP QG
Subjt:  ERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQM-RPPQG

Query:  PPQMAEPNWRPSLNPQARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTGP-----------NYFQSGPAQMTSPNYSES--------
        PPQMAEPNWRPS+NPQARGSY  SSPQMS P++F      QSG  QM+G NY QSGS QMT P            +    P QM  PN+  S        
Subjt:  PPQMAEPNWRPSLNPQARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTGP-----------NYFQSGPAQMTSPNYSES--------

Query:  -GSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA
          S QM+  ++ QSGP Q  G NYFQSG+AQ+T+PQ  S DPP MEE HHSQQPPQMA
Subjt:  -GSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMA

SwissProt top hitse value%identityAlignment
Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial1.8e-3523.9Show/hide
Query:  GDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHL
        G +  A A+    V    RP + T + +I  +    R S+A+ L      +    P+ V+Y  ++N  C  G   + L+++R  +       S V Y  +
Subjt:  GDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHL

Query:  TKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPA
           L   G   +A+ L  EM  KG  AD + +++LI G  N    +   ++  E+  R ++ D V  +  +D F   GK  EA E Y  ++ R       
Subjt:  TKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPA

Query:  TCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETF
        T N L++   K     EA  +FD M+     P+       T++I++N   K  +  + +  FR++     S+    +   YN ++  FC+ G +  A+  
Subjt:  TCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETF

Query:  FAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALD
        F E+ S+ + P V T+  L++     G+++  L +F +M    + +     + +   +    K  D   +   + ++  KPD   Y+V+I GLC +G+L 
Subjt:  FAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALD

Query:  TSRELLDQIKRYGIGLTPTLEEFVKEAFVKA
         +  L  ++K  G     T ++F     ++A
Subjt:  TSRELLDQIKRYGIGLTPTLEEFVKEAFVKA

Q9LEX5 Pentatricopeptide repeat-containing protein At3g60980, mitochondrial1.7e-5737.89Show/hide
Query:  RVQSLIR-AGDLDAASAVARHSVFSN--TRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAP
        RV  LIR  GDLD A+  AR +VF++  +  T   C +II  M R KR  DA  L++FFFNQ N+ PN   +N +I +   +G V+  L  +   I +  
Subjt:  RVQSLIR-AGDLDAASAVARHSVFSN--TRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAP

Query:  FS--PSAVTYRHLTKGLIDSGRIGEAVDLLR-EMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLV-------------YDGVV---NATF
            PS  ++R LTKGL+ SGR+ +A   LR   +N+    D + +NNLI GFL+L N +KAN +  E K   L+             Y+  V    ATF
Subjt:  FS--PSAVTYRHLTKGLIDSGRIGEAVDLLR-EMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLV-------------YDGVV---NATF

Query:  MDWFFNRGKEKEAMESY-KSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQP
        M+++F +GK+ EAME Y + +L  +  +   T N LL+VLLK+G+K  AW L+ ++LD +       ++SDT  IMV+EC  +G FSEA+ET++K   +P
Subjt:  MDWFFNRGKEKEAMESY-KSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQP

Query:  KSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLS-PDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVA
        K+     D      II RFCE  M+++AE+ F +  +      DV T++T+I++Y+K G+I D ++  N+M+D  L+ V+
Subjt:  KSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLS-PDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVA

Q9LEX6 Pentatricopeptide repeat-containing protein At3g60960, mitochondrial9.0e-5936.82Show/hide
Query:  PPQRDPNA-PRL-PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSV---FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVS
        P  RDP++ P+L P S S +    ++L  RV+++I   +LD AS ++R +V   F   R TVF CN++I AM  AKRY DAI+LF +FFN+S  +PN +S
Subjt:  PPQRDPNA-PRL-PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSV---FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVS

Query:  YNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCL
         + +I AHCD+G VD  LE+YRHI+ +   +P   TY  L K L+D+ R  EA  L R M         +V++ LI GFL++ N  KA+++F+ELK    
Subjt:  YNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCL

Query:  VYDG--------VVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKL
           G        + N +FM+++F +GK++EAME   +L D Q  + P   N +L+VL+KHGKKTEAW LF +M+        +  +S+T +IM       
Subjt:  VYDG--------VVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKL

Query:  GKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSK------SLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRV
          FSE    F +           +    Y  +I   CE G ++DAE  FAE+ +        + PD+   R +I  Y+ +G++DD ++  N+M    LR 
Subjt:  GKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSK------SLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRV

Query:  VA
        +A
Subjt:  VA

Q9M3A8 Pentatricopeptide repeat-containing protein At3g49240, mitochondrial1.8e-9137.36Show/hide
Query:  PISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRR-----DNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFS
        P   L  R  +F++ EEAAAERRRRKRRLR+EPP+++  R        P P ++PN P+LP+S SALVG RL+LHN +  LIR  DL+ A+   RHSV+S
Subjt:  PISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRR-----DNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFS

Query:  NTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL
        N RPT+FT N ++AA  R  +Y  A+     F NQ+ I PN+++YN +  A+ D  + ++ LE Y+  I NAP +PS  T+R L KGL+ +  + +A+++
Subjt:  NTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL

Query:  LREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERC--LVYDGVVNATFMDWFFNRGKEKEAMESYKSLL--DRQFKMIPATCNVLLEVLLKH
          +M  KG   D +V++ L+ G +   + +   +L+ ELKE+    V DGVV    M  +F +  EKEAME Y+  +  + + +M     N +LE L ++
Subjt:  LREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERC--LVYDGVVNATFMDWFFNRGKEKEAMESYKSLL--DRQFKMIPATCNVLLEVLLKH

Query:  GKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPD
        GK  EA  LFD +   H PP   AVN  TFN+MVN     GKF EA+E FR++G   K  P   D   +NN++ + C+  ++ +AE  + E+  K++ PD
Subjt:  GKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPD

Query:  VPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQ-IKR
          T+  L+++  K G+ID+    +  MV+  LR   +  + +  +LIK GK  D       M  +  K D   Y  ++R L   G LD   +++D+ +  
Subjt:  VPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQ-IKR

Query:  YGIGLTPTLEEFVKEAFVKAGRHEEIERLL
          + ++  L+EFVKE   K GR  ++E+L+
Subjt:  YGIGLTPTLEEFVKEAFVKAGRHEEIERLL

Q9SY69 Pentatricopeptide repeat-containing protein At1g102702.5e-21055.83Show/hide
Query:  PSSTSPSNSRALTIGPLNHHFQEP----IPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRD-NYPPPQRDPNAPRLPDSTSAL
        P +T+ +N  +     LN    +P    IP +     P   +  R+ AFSSAEEAAAERRRRKRRLRIEPPLHALRRD + PPP+RDPNAPRLPDSTSAL
Subjt:  PSSTSPSNSRALTIGPLNHHFQEP----IPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRD-NYPPPQRDPNAPRLPDSTSAL

Query:  VGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRH
        VG RLNLHNRVQSLIRA DLDAAS +AR SVFSNTRPTVFTCNAIIAAMYRAKRYS++I+LFQ+FF QSNIVPNVVSYN +INAHCDEG VD  LE+YRH
Subjt:  VGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRH

Query:  IIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEA
        I+ANAPF+PS+VTYRHLTKGL+ +GRIG+A  LLREML+KG  ADS V+NNLI G+L+L + +KA E FDELK +C VYDG+VNATFM+++F +G +KEA
Subjt:  IIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEA

Query:  MESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNN
        MESY+SLLD++F+M P T NVLLEV LK GKK EAW LF++MLDNH PPN  +VNSDT  IMVNEC K+G+FSEA+ TF+KVG++  S+PF MD  GY N
Subjt:  MESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNN

Query:  IIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDP
        I+ RFCEQGM+T+AE FFAE  S+SL  D P+HR +I++YLK  +IDD +++ +RMVDV LRVVA FG+ VFGELIKNGK  + A++LTKMGER+PKPDP
Subjt:  IIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDP

Query:  TCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLL--------NVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGP
        + YDVV+RGLC+  ALD +++++ ++ R+ +G+T  L EF+ E F KAGR EEIE++L        N   SG+ P R P+     P +   P+ R P   
Subjt:  TCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLL--------NVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGP

Query:  PQMAEPN--WRPSLNPQ-ARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQ-MTGPNYSQSGPA
          +   N  W      Q A G+Y  ++ Q    +        QS S Q +G    Q  S     P Y Q       S   S SG  Q  T     Q  P 
Subjt:  PQMAEPN--WRPSLNPQ-ARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQ-MTGPNYSQSGPA

Query:  QMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPP
            P   Q  A Q    QQ  ++  P ++Q  + Q P
Subjt:  QMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPP

Arabidopsis top hitse value%identityAlignment
AT1G10270.1 glutamine-rich protein 231.8e-21155.83Show/hide
Query:  PSSTSPSNSRALTIGPLNHHFQEP----IPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRD-NYPPPQRDPNAPRLPDSTSAL
        P +T+ +N  +     LN    +P    IP +     P   +  R+ AFSSAEEAAAERRRRKRRLRIEPPLHALRRD + PPP+RDPNAPRLPDSTSAL
Subjt:  PSSTSPSNSRALTIGPLNHHFQEP----IPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRD-NYPPPQRDPNAPRLPDSTSAL

Query:  VGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRH
        VG RLNLHNRVQSLIRA DLDAAS +AR SVFSNTRPTVFTCNAIIAAMYRAKRYS++I+LFQ+FF QSNIVPNVVSYN +INAHCDEG VD  LE+YRH
Subjt:  VGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRH

Query:  IIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEA
        I+ANAPF+PS+VTYRHLTKGL+ +GRIG+A  LLREML+KG  ADS V+NNLI G+L+L + +KA E FDELK +C VYDG+VNATFM+++F +G +KEA
Subjt:  IIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEA

Query:  MESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNN
        MESY+SLLD++F+M P T NVLLEV LK GKK EAW LF++MLDNH PPN  +VNSDT  IMVNEC K+G+FSEA+ TF+KVG++  S+PF MD  GY N
Subjt:  MESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNN

Query:  IIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDP
        I+ RFCEQGM+T+AE FFAE  S+SL  D P+HR +I++YLK  +IDD +++ +RMVDV LRVVA FG+ VFGELIKNGK  + A++LTKMGER+PKPDP
Subjt:  IIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDP

Query:  TCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLL--------NVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGP
        + YDVV+RGLC+  ALD +++++ ++ R+ +G+T  L EF+ E F KAGR EEIE++L        N   SG+ P R P+     P +   P+ R P   
Subjt:  TCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKEAFVKAGRHEEIERLL--------NVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGP

Query:  PQMAEPN--WRPSLNPQ-ARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQ-MTGPNYSQSGPA
          +   N  W      Q A G+Y  ++ Q    +        QS S Q +G    Q  S     P Y Q       S   S SG  Q  T     Q  P 
Subjt:  PQMAEPN--WRPSLNPQ-ARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTGPNYFQSGPAQMTSPNYSESGSTQ-MTGPNYSQSGPA

Query:  QMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPP
            P   Q  A Q    QQ  ++  P ++Q  + Q P
Subjt:  QMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPP

AT3G49240.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-9237.36Show/hide
Query:  PISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRR-----DNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFS
        P   L  R  +F++ EEAAAERRRRKRRLR+EPP+++  R        P P ++PN P+LP+S SALVG RL+LHN +  LIR  DL+ A+   RHSV+S
Subjt:  PISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRR-----DNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFS

Query:  NTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL
        N RPT+FT N ++AA  R  +Y  A+     F NQ+ I PN+++YN +  A+ D  + ++ LE Y+  I NAP +PS  T+R L KGL+ +  + +A+++
Subjt:  NTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL

Query:  LREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERC--LVYDGVVNATFMDWFFNRGKEKEAMESYKSLL--DRQFKMIPATCNVLLEVLLKH
          +M  KG   D +V++ L+ G +   + +   +L+ ELKE+    V DGVV    M  +F +  EKEAME Y+  +  + + +M     N +LE L ++
Subjt:  LREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERC--LVYDGVVNATFMDWFFNRGKEKEAMESYKSLL--DRQFKMIPATCNVLLEVLLKH

Query:  GKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPD
        GK  EA  LFD +   H PP   AVN  TFN+MVN     GKF EA+E FR++G   K  P   D   +NN++ + C+  ++ +AE  + E+  K++ PD
Subjt:  GKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPD

Query:  VPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQ-IKR
          T+  L+++  K G+ID+    +  MV+  LR   +  + +  +LIK GK  D       M  +  K D   Y  ++R L   G LD   +++D+ +  
Subjt:  VPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQ-IKR

Query:  YGIGLTPTLEEFVKEAFVKAGRHEEIERLL
          + ++  L+EFVKE   K GR  ++E+L+
Subjt:  YGIGLTPTLEEFVKEAFVKAGRHEEIERLL

AT3G60960.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.4e-6036.82Show/hide
Query:  PPQRDPNA-PRL-PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSV---FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVS
        P  RDP++ P+L P S S +    ++L  RV+++I   +LD AS ++R +V   F   R TVF CN++I AM  AKRY DAI+LF +FFN+S  +PN +S
Subjt:  PPQRDPNA-PRL-PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSV---FSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVS

Query:  YNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCL
         + +I AHCD+G VD  LE+YRHI+ +   +P   TY  L K L+D+ R  EA  L R M         +V++ LI GFL++ N  KA+++F+ELK    
Subjt:  YNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCL

Query:  VYDG--------VVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKL
           G        + N +FM+++F +GK++EAME   +L D Q  + P   N +L+VL+KHGKKTEAW LF +M+        +  +S+T +IM       
Subjt:  VYDG--------VVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKL

Query:  GKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSK------SLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRV
          FSE    F +           +    Y  +I   CE G ++DAE  FAE+ +        + PD+   R +I  Y+ +G++DD ++  N+M    LR 
Subjt:  GKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSK------SLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRV

Query:  VA
        +A
Subjt:  VA

AT3G60980.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-5837.89Show/hide
Query:  RVQSLIR-AGDLDAASAVARHSVFSN--TRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAP
        RV  LIR  GDLD A+  AR +VF++  +  T   C +II  M R KR  DA  L++FFFNQ N+ PN   +N +I +   +G V+  L  +   I +  
Subjt:  RVQSLIR-AGDLDAASAVARHSVFSN--TRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAP

Query:  FS--PSAVTYRHLTKGLIDSGRIGEAVDLLR-EMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLV-------------YDGVV---NATF
            PS  ++R LTKGL+ SGR+ +A   LR   +N+    D + +NNLI GFL+L N +KAN +  E K   L+             Y+  V    ATF
Subjt:  FS--PSAVTYRHLTKGLIDSGRIGEAVDLLR-EMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLV-------------YDGVV---NATF

Query:  MDWFFNRGKEKEAMESY-KSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQP
        M+++F +GK+ EAME Y + +L  +  +   T N LL+VLLK+G+K  AW L+ ++LD +       ++SDT  IMV+EC  +G FSEA+ET++K   +P
Subjt:  MDWFFNRGKEKEAMESY-KSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQP

Query:  KSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLS-PDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVA
        K+     D      II RFCE  M+++AE+ F +  +      DV T++T+I++Y+K G+I D ++  N+M+D  L+ V+
Subjt:  KSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLS-PDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVA

AT5G28380.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.3e-4834.55Show/hide
Query:  YSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLIS
        Y +AI+LF +FFN+S  +PN++S N +I AHCD+G VD  LE+YRHI+ +   +P   TYR LTK L+ + R+ EA D++R M       D  V++ LI 
Subjt:  YSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLIS

Query:  GFLNLENMEKANELFDELK--------ERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNH
        GFL+     +A+++F+ELK                + N +FMD++F +GK++EAME + +L   +  +   + N +L+ L++HG+KTEAW LF  M+   
Subjt:  GFLNLENMEKANELFDELK--------ERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNH

Query:  TPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSK------SLSPDVPTHRTLIESY
             +  +S+T  I+++   K G F E    F +V               Y  +IA  C+QG M +AE  FA++ +          PDV T R +I  Y
Subjt:  TPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSK------SLSPDVPTHRTLIESY

Query:  LKIGQIDDVLRVFNRMVDVGLRVVASFGST
        +K+G++DD ++  N+M    LR ++   +T
Subjt:  LKIGQIDDVLRVFNRMVDVGLRVVASFGST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACTTTACCGCTTCCTCCTTCGCTCTCTCCGCCCCTCTTCCACCTCTCCCTCAAATTCCCGAGCTCTTACCATTGGTCCTCTCAACCACCATTTTCAGGAGCCGAT
TCCACCGTCCTCTCAAACTTCTTCTCCCATCTCGCTCCTCCATGCCCGCTCGTTTGCCTTTTCCTCTGCCGAAGAAGCTGCTGCCGAAAGACGCCGTAGAAAGCGCCGTC
TTCGTATTGAACCACCTCTCCATGCACTTCGTCGCGACAACTACCCGCCCCCACAGCGTGATCCCAATGCTCCTCGTCTTCCTGACTCCACATCCGCTCTTGTGGGGCCG
CGTCTTAACCTTCACAATCGTGTTCAATCCCTGATTCGTGCTGGTGATCTTGATGCGGCCTCTGCGGTCGCTCGCCACTCTGTCTTCTCGAACACGCGGCCGACGGTTTT
CACTTGTAACGCTATTATTGCTGCTATGTATCGGGCTAAGAGGTATAGTGATGCGATTGCACTGTTTCAGTTCTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTGT
CATATAATAATTTGATTAATGCTCATTGCGATGAGGGCCGTGTTGATGTGGGTCTTGAAATTTATCGTCATATTATTGCGAATGCTCCCTTTAGTCCTTCGGCAGTGACT
TATCGGCATTTGACTAAGGGATTGATTGATTCTGGGAGGATTGGGGAGGCTGTGGATCTTCTGCGGGAAATGTTGAATAAAGGGCATGGTGCTGATTCGTTGGTTTTTAA
TAATTTGATTTCTGGGTTTCTAAATTTGGAGAATATGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTGTATGATGGAGTTGTGAATGCTACGT
TCATGGATTGGTTTTTTAATAGAGGGAAAGAAAAGGAGGCCATGGAATCGTACAAGTCATTGCTTGATAGGCAATTCAAGATGATTCCAGCGACTTGCAATGTGCTGTTG
GAGGTTTTGCTTAAGCATGGGAAGAAAACGGAGGCTTGGACCTTATTTGATCAGATGTTGGATAACCACACTCCACCTAATTTCCAAGCAGTCAATTCAGATACGTTCAA
CATAATGGTTAATGAGTGCCTTAAGCTCGGCAAGTTCTCAGAGGCAGTGGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCAAGGCCTTTTGCTATGGACGTTGCTG
GGTATAATAATATCATTGCAAGGTTTTGTGAGCAGGGAATGATGACAGACGCAGAGACTTTCTTTGCTGAACTTTGCTCGAAGTCTTTGTCCCCTGATGTCCCAACTCAT
AGAACACTGATAGAATCTTATTTAAAGATTGGGCAGATTGATGATGTATTGAGAGTTTTTAACAGAATGGTCGATGTTGGTTTGAGAGTTGTTGCTAGCTTCGGAAGCAC
AGTATTTGGTGAATTGATTAAGAATGGCAAGGCAGTTGACTGCGCTCAGATTTTAACAAAAATGGGAGAGCGGGATCCTAAACCAGATCCCACATGCTATGACGTTGTGA
TTAGAGGGCTATGTAACGAAGGTGCGCTGGATACTAGTCGGGAGTTGCTTGACCAGATAAAGAGGTATGGTATTGGTCTCACTCCAACACTTGAGGAATTTGTTAAAGAG
GCGTTTGTAAAGGCTGGTCGGCATGAAGAGATTGAAAGACTGCTAAATGTGAACATATCAGGACACGTTCCATATCGCCCCCCCTCTGGACCCCCAAGAATTCCACAATC
GCAGGTACCACCTCAAATGAGACCGCCTCAAGGACCCCCTCAAATGGCAGAACCAAATTGGCGACCTTCCTTAAACCCTCAAGCCAGAGGAAGTTATGCCCCTTCATCAC
CTCAGATGTCAGGTCCTAATTATTTTCAAGGTCCTAATTATTCTCAATCTGGATCAGCTCAAATGTCAGGTCCTAATTATTCTCAATCAGGATCAACTCAAATGACAGGT
CCTAATTATTTTCAATCAGGACCGGCTCAAATGACAAGTCCTAATTATTCTGAATCAGGATCAACTCAAATGACAGGTCCTAACTATTCTCAATCAGGACCAGCTCAAAT
GCCAGGTCCTAATTATTTTCAATCAGGAGCGGCTCAAGTGACAAGACCGCAACAGCCCTCATCCGATCCGCCCCCAATGGAAGAACAGCATCACTCACAACAACCCCCTC
AAATGGCCAGGGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCACTTTACCGCTTCCTCCTTCGCTCTCTCCGCCCCTCTTCCACCTCTCCCTCAAATTCCCGAGCTCTTACCATTGGTCCTCTCAACCACCATTTTCAGGAGCCGAT
TCCACCGTCCTCTCAAACTTCTTCTCCCATCTCGCTCCTCCATGCCCGCTCGTTTGCCTTTTCCTCTGCCGAAGAAGCTGCTGCCGAAAGACGCCGTAGAAAGCGCCGTC
TTCGTATTGAACCACCTCTCCATGCACTTCGTCGCGACAACTACCCGCCCCCACAGCGTGATCCCAATGCTCCTCGTCTTCCTGACTCCACATCCGCTCTTGTGGGGCCG
CGTCTTAACCTTCACAATCGTGTTCAATCCCTGATTCGTGCTGGTGATCTTGATGCGGCCTCTGCGGTCGCTCGCCACTCTGTCTTCTCGAACACGCGGCCGACGGTTTT
CACTTGTAACGCTATTATTGCTGCTATGTATCGGGCTAAGAGGTATAGTGATGCGATTGCACTGTTTCAGTTCTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTGT
CATATAATAATTTGATTAATGCTCATTGCGATGAGGGCCGTGTTGATGTGGGTCTTGAAATTTATCGTCATATTATTGCGAATGCTCCCTTTAGTCCTTCGGCAGTGACT
TATCGGCATTTGACTAAGGGATTGATTGATTCTGGGAGGATTGGGGAGGCTGTGGATCTTCTGCGGGAAATGTTGAATAAAGGGCATGGTGCTGATTCGTTGGTTTTTAA
TAATTTGATTTCTGGGTTTCTAAATTTGGAGAATATGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTGTATGATGGAGTTGTGAATGCTACGT
TCATGGATTGGTTTTTTAATAGAGGGAAAGAAAAGGAGGCCATGGAATCGTACAAGTCATTGCTTGATAGGCAATTCAAGATGATTCCAGCGACTTGCAATGTGCTGTTG
GAGGTTTTGCTTAAGCATGGGAAGAAAACGGAGGCTTGGACCTTATTTGATCAGATGTTGGATAACCACACTCCACCTAATTTCCAAGCAGTCAATTCAGATACGTTCAA
CATAATGGTTAATGAGTGCCTTAAGCTCGGCAAGTTCTCAGAGGCAGTGGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCAAGGCCTTTTGCTATGGACGTTGCTG
GGTATAATAATATCATTGCAAGGTTTTGTGAGCAGGGAATGATGACAGACGCAGAGACTTTCTTTGCTGAACTTTGCTCGAAGTCTTTGTCCCCTGATGTCCCAACTCAT
AGAACACTGATAGAATCTTATTTAAAGATTGGGCAGATTGATGATGTATTGAGAGTTTTTAACAGAATGGTCGATGTTGGTTTGAGAGTTGTTGCTAGCTTCGGAAGCAC
AGTATTTGGTGAATTGATTAAGAATGGCAAGGCAGTTGACTGCGCTCAGATTTTAACAAAAATGGGAGAGCGGGATCCTAAACCAGATCCCACATGCTATGACGTTGTGA
TTAGAGGGCTATGTAACGAAGGTGCGCTGGATACTAGTCGGGAGTTGCTTGACCAGATAAAGAGGTATGGTATTGGTCTCACTCCAACACTTGAGGAATTTGTTAAAGAG
GCGTTTGTAAAGGCTGGTCGGCATGAAGAGATTGAAAGACTGCTAAATGTGAACATATCAGGACACGTTCCATATCGCCCCCCCTCTGGACCCCCAAGAATTCCACAATC
GCAGGTACCACCTCAAATGAGACCGCCTCAAGGACCCCCTCAAATGGCAGAACCAAATTGGCGACCTTCCTTAAACCCTCAAGCCAGAGGAAGTTATGCCCCTTCATCAC
CTCAGATGTCAGGTCCTAATTATTTTCAAGGTCCTAATTATTCTCAATCTGGATCAGCTCAAATGTCAGGTCCTAATTATTCTCAATCAGGATCAACTCAAATGACAGGT
CCTAATTATTTTCAATCAGGACCGGCTCAAATGACAAGTCCTAATTATTCTGAATCAGGATCAACTCAAATGACAGGTCCTAACTATTCTCAATCAGGACCAGCTCAAAT
GCCAGGTCCTAATTATTTTCAATCAGGAGCGGCTCAAGTGACAAGACCGCAACAGCCCTCATCCGATCCGCCCCCAATGGAAGAACAGCATCACTCACAACAACCCCCTC
AAATGGCCAGGGGGTAG
Protein sequenceShow/hide protein sequence
MSLYRFLLRSLRPSSTSPSNSRALTIGPLNHHFQEPIPPSSQTSSPISLLHARSFAFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGP
RLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVT
YRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENMEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLL
EVLLKHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECLKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTH
RTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGSTVFGELIKNGKAVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIKRYGIGLTPTLEEFVKE
AFVKAGRHEEIERLLNVNISGHVPYRPPSGPPRIPQSQVPPQMRPPQGPPQMAEPNWRPSLNPQARGSYAPSSPQMSGPNYFQGPNYSQSGSAQMSGPNYSQSGSTQMTG
PNYFQSGPAQMTSPNYSESGSTQMTGPNYSQSGPAQMPGPNYFQSGAAQVTRPQQPSSDPPPMEEQHHSQQPPQMARG