; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0011850 (gene) of Chayote v1 genome

Gene IDSed0011850
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG13:1406033..1407689
RNA-Seq ExpressionSed0011850
SyntenySed0011850
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600165.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.5e-24981.44Show/hide
Query:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL
        MLIS + SISPI+SL L CSS PKKSKKER+KLL EKLIRI+KAKE   LPFPKSSSTPLLIHHKP SQ+K QALD+V++DLEAS+ NGVPIDA IFSSL
Subjt:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL

Query:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF
        LETCY+LRA++ GIRIHRLIPTN LRRNVGV SKLLRLYASFG ME AH+VFDEM +RN SAF WNSLISGYAELGLYEDALALYFQMEEEGVEPD FTF
Subjt:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF

Query:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA
        PRVLK+CGGIGS R+GEAVHRH VRSGFAGD+FVLNAL+DMY+KCGD++RARKVFDQI  KD VSWNSMLTGYTRHGLLLEAL+ FD+MI+EG EPDS+A
Subjt:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA

Query:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL
        LST++S++ S K KLHIHGWAIRHG+EWNLSIANSL+ MYAN GKI+RA+WLF+ MP+RDIVSWN+IISAH N+ +ALTYFEVMESL VLPD+VTFVSLL
Subjt:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL

Query:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK
        STCAHL LVK+G KLYS+MKGKY I+PT+EHYACMVNLYGRA +IEEAYRIITNGM +EAGPTVWGALLYAC+LHGNVDIAE+AA +LFE EPDNELNFK
Subjt:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK

Query:  LLMKIYSSAGRSEDERRVRLMMLERGLD
        LLMKIY +AGR EDE+RVRLMM ERGLD
Subjt:  LLMKIYSSAGRSEDERRVRLMMLERGLD

KAG7030831.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-24981.44Show/hide
Query:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL
        MLIS + SISPI+SL L CSS PKKSKKER+KLL EKLIRI+KAKE   LPFPKSSSTPLLIHHKP SQ+K QALD+V++DLEAS+ NGVPIDA IFSSL
Subjt:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL

Query:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF
        LETCY+LRA++ GIRIHRLIPTN LRRNVGV SKLLRLYASFG ME AH+VFDEM +RN SAF WNSLISGYAELGLYEDALALYFQMEEEGVEPD FTF
Subjt:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF

Query:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA
        PRVLK+CGGIGS R+GEAVHRH VRSGFAGD+FVLNAL+DMY+KCGD++RARKVFDQI  KD VSWNSMLTGYTRHGLLLEAL+ FD+MI+EG EPDS+A
Subjt:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA

Query:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL
        LST++S++ S K KLHIHGWAIRHG+EWNLSIANSL+ MYAN GKI+RA+WLF+ MP+RDIVSWN+IISAH N+ +ALTYFEVMESL VLPD+VTFVSLL
Subjt:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL

Query:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK
        STCAHL LVK+G KLYS+MKGKY I+PT+EHYACMVNLYGRA +IEEAYRIITNGM +EAGPTVWGALLYAC+LHGNVDIAE+AA +LFE EPDNELNFK
Subjt:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK

Query:  LLMKIYSSAGRSEDERRVRLMMLERGLD
        LLMKIY +AGR EDE+RVRLMM ERGLD
Subjt:  LLMKIYSSAGRSEDERRVRLMMLERGLD

XP_022942651.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucurbita moschata]9.4e-24981.25Show/hide
Query:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL
        MLIS + SISPI+SL L+CSS PKKSKKER+KLL EKLIRI+KAKE   LPFPKSSSTPLLIHHKP SQ+K QALD+V++DLEAS+ NGVPIDA IFSSL
Subjt:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL

Query:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF
        LETCY+LRA++ GIRIHRLIPTN LRRNVGV SKLLRLYASFG ME AH+VFDEM +RN SAF WNSLISGYAELGLYEDALALYFQMEEEGVEPD FTF
Subjt:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF

Query:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA
        PRVLK+CGGIGS R+GEAVHRH VRSGFAGD+FVLNAL+DMY+KCGD++RARKVFDQI  KD VSWNSMLTGYTRHGLLLEAL+ FD+MI+EG EPDS+A
Subjt:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA

Query:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL
        LST++S++ S K KLHIHGWAIRHG+EWNLSIANSL+ MYAN GKI+RA+WLF+ MP+RDIVSWN+IISAH N+ +ALTYFEVMESL VLPD+VTFVSLL
Subjt:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL

Query:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK
        STCAHL LVK+G KLY+ MKGKY I+PT+EHYACMVNLYGRA +IEEAYRIITNGM +EAGPTVWGALLYAC+LHGNVDIAE+AA +LFE EPDNELNFK
Subjt:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK

Query:  LLMKIYSSAGRSEDERRVRLMMLERGLD
        LLMKIY +AGR EDE+RVRLMM ERGLD
Subjt:  LLMKIYSSAGRSEDERRVRLMMLERGLD

XP_022993795.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucurbita maxima]5.5e-24981.44Show/hide
Query:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL
        MLIS + SISPI+SL L CSS PKKSKKER+KLL EKLIRI+KAKE   LPFPKSSSTPLLIHHKP S++K QALD+V++DLEAS+DNGVPIDA IFSSL
Subjt:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL

Query:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF
        LETCY+LRA++ GIRIHRLIPTN LRRNVGV SKLLRLYASFG ME AH+VFDEM +RN SAF WNSLISGYAELGLYEDALALYFQMEEEGVEPD FTF
Subjt:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF

Query:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA
        PRVLK+CGGIGS R+GEAVHRH VRSGFAGD+FVLNAL+DMY+KCGD++RARKVFDQI  KD VSWNSMLTGYTRHGLLLEAL+ FD+MI+EG EPDS+A
Subjt:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA

Query:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL
        LST++S++ S K KLHIHGWAIRHG+EWNLSIANSL+ MYAN GKI+RA+WLF+ MPQRDIVSWN+IISAH N+ +ALTYFEVMESL VLPD+VTFVSLL
Subjt:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL

Query:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK
        STCAHL LVK+G KLYS+MKGKY I+PT+EHYACMVNLYGRA +IEEAYRII NGM +EAGPTVWGALLYAC+LHGNVDIAE+AA +LFE EPDNELNFK
Subjt:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK

Query:  LLMKIYSSAGRSEDERRVRLMMLERGLD
        LLMKIY +AGR EDE+RVRLMM ERGLD
Subjt:  LLMKIYSSAGRSEDERRVRLMMLERGLD

XP_038901542.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Benincasa hispida]3.2e-24982.01Show/hide
Query:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL
        MLIS Q SISP +SLLL CSSKPKKSKKERKKLL++KL+RI+KA+E   LPFPKSSSTPLLIH KP  QTK QALD++++DLE SVDNG+  D  IFSSL
Subjt:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL

Query:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF
        LE CY+LR+I  GIRIHRLIPTNLLRRNVGV SKLLRLYASFG MEIAH+VFDEM +RN SAF WNSLISGYAELGLYEDALALYFQMEEEGVEPD FTF
Subjt:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF

Query:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA
        PRVLK+CGGI S +IGEAVHRH +RSGFAGDVFVLNAL+DMYSKCG +VRARKVFDQI CKD VSWNSMLTGYTRHGLL EALDIFD+MI+EG EPDS+A
Subjt:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA

Query:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL
        LST+LS++ S K +LHIHGW IRHGVEWNLSIANSL++MYAN GKI+RAKWLFQ MPQ+D VSWNSIISAHFNS EALTYFEVMESL VLPD+VTFVSLL
Subjt:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL

Query:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK
        STCA+LGLVK+G KLYS+MKGKY+I+PT EHYACMVNLYGRA +IEEAYRIIT  M IEAGPTVWGALLYAC+LH NVDIAEIAA RLFELEPDNELNF+
Subjt:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK

Query:  LLMKIYSSAGRSEDERRVRLMMLERGLD
        LLMKIY +AGRSEDE+RV+LMM ERGLD
Subjt:  LLMKIYSSAGRSEDERRVRLMMLERGLD

TrEMBL top hitse value%identityAlignment
A0A0A0L5M0 Uncharacterized protein1.4e-24280.11Show/hide
Query:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL
        MLIS      P SSLLL CSSKPKKSKKER+KLL++KL+RI+KAK+   L FPKSS TPLLIH KP  Q+K QALD+V++DLEAS+DNG+ ID  IFSSL
Subjt:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL

Query:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF
        LE CY+L+AI  GIRIHRLIPTNLLRRNVG+ SKLLRLYASFG ME AH+VFDEM  RNFSAF WNSLISGYAELGLYEDALALYFQMEEEGVEPD FTF
Subjt:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF

Query:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA
        PRVLK+CGGIGS +IGEAVHRH VRSGFAGDVFVLNAL+DMYSKCG +VRARKVFDQI  KD VSWNSMLTGYTRHGL  EALDIFD+MI+EG EPDS+A
Subjt:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA

Query:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL
        LST+LS++ S K KLHIHGW IRHGVEWNLSIANSL++MYA  GK++RAKWLFQ MPQ+D+VSWNSIISAHFNS EALTYFEVMESL V PD VTFVSLL
Subjt:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL

Query:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK
        STCAHLGLVK+G KLY +MKGKY I+PT+EHYACMVNLYGRA MIEEAY+IIT GM IEAGPT+WGALLYAC+LH +VDIAEIAA RLFELEPDNELNF+
Subjt:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK

Query:  LLMKIYSSAGRSEDERRVRLMMLERGLD
        LLMKIY +AGRSEDE+RV+LMM ERGL+
Subjt:  LLMKIYSSAGRSEDERRVRLMMLERGLD

A0A1S4DSF7 pentatricopeptide repeat-containing protein At4g25270, chloroplastic5.7e-24480.11Show/hide
Query:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL
        MLIS Q    P +SLLL CSSKPKKSKKER+KLL++KL+RI+KAK+   L FPKSSSTPLLIH KP  Q+K QALD+V++DLE S+DNG+ ID  IFSSL
Subjt:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL

Query:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF
        LE CY+LRAI  GIRIHRLIPTNLLRRNVG+ SKLLRLYAS G ME AH+VFDEM +RNFSAF WNSLISGYAELGLYEDALALYFQMEEEGVEPD FTF
Subjt:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF

Query:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA
        PRVLK+CGGIGS +IGEAVHRH VRSGFAGDVFVLNAL+DMYSKCG +VRARKVFDQI  KD VSWNSMLTGYTRHGL  EALDIFD+MI+EG +PDS+A
Subjt:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA

Query:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL
        LST+LS++LS K KLHIHGW IRHGVEWNLSIANSL++MYA  GK++RAKWLFQ MPQ+D+VSWNSIISAHFN+ EALTYFEVMESL VLPD VTFVSLL
Subjt:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL

Query:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK
        STCAHLGLVK+G +LYS+MKGKY+I+PT+EHYACMVNLYGRA MIEEAY+IIT GM IEAGPT+WGALLYAC+LH NVDIAEIAA RLFELEPDNELNF+
Subjt:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK

Query:  LLMKIYSSAGRSEDERRVRLMMLERGLD
        LLMKIY +AGRS+DE+RV+LMM ERGL+
Subjt:  LLMKIYSSAGRSEDERRVRLMMLERGLD

A0A5D3DJ70 Pentatricopeptide repeat-containing protein2.6e-24480.3Show/hide
Query:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL
        MLIS Q    P +SLLL CSSKPKKSKKER+KLL++KL+RI+KAK+   L FPKSSSTPLLIH KP  Q+K QALD+V++DLE S+DNG+ ID  IFSSL
Subjt:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL

Query:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF
        LE CY+LRAI  GIRIHRLIPTNLLRRNVG+ SKLLRLYAS G ME AH+VFDEM +RNFSAF WNSLISGYAELGLYEDALALYFQMEEEGVEPD FTF
Subjt:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF

Query:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA
        PRVLK+CGGIGS +IGEAVHRH VRSGFAGDVFVLNAL+DMYSKCG +VRARKVFDQI  KD VSWNSMLTGYTRHGL  EALDIFD+MI+EG +PDS+A
Subjt:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA

Query:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL
        LST+LS++LS K KLHIHGW IRHGVEWNLSIANSL++MYA  GK++RAKWLFQ MPQ+D+VSWNSIISAHFN+ EALTYFEVMESL VLPD VTFVSLL
Subjt:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL

Query:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK
        STCAHLGLVK+G +LYS+MKGKY+I+PT+EHYACMVNLYGRA MIEEAY+IIT GM IEAGPT+WGALLYAC+LH NVDIAEIAA RLFELEPDNELNF+
Subjt:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK

Query:  LLMKIYSSAGRSEDERRVRLMMLERGLD
        LLMKIY +AGRSEDE+RV+LMM ERGL+
Subjt:  LLMKIYSSAGRSEDERRVRLMMLERGLD

A0A6J1FPF9 pentatricopeptide repeat-containing protein At4g25270, chloroplastic4.5e-24981.25Show/hide
Query:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL
        MLIS + SISPI+SL L+CSS PKKSKKER+KLL EKLIRI+KAKE   LPFPKSSSTPLLIHHKP SQ+K QALD+V++DLEAS+ NGVPIDA IFSSL
Subjt:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL

Query:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF
        LETCY+LRA++ GIRIHRLIPTN LRRNVGV SKLLRLYASFG ME AH+VFDEM +RN SAF WNSLISGYAELGLYEDALALYFQMEEEGVEPD FTF
Subjt:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF

Query:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA
        PRVLK+CGGIGS R+GEAVHRH VRSGFAGD+FVLNAL+DMY+KCGD++RARKVFDQI  KD VSWNSMLTGYTRHGLLLEAL+ FD+MI+EG EPDS+A
Subjt:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA

Query:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL
        LST++S++ S K KLHIHGWAIRHG+EWNLSIANSL+ MYAN GKI+RA+WLF+ MP+RDIVSWN+IISAH N+ +ALTYFEVMESL VLPD+VTFVSLL
Subjt:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL

Query:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK
        STCAHL LVK+G KLY+ MKGKY I+PT+EHYACMVNLYGRA +IEEAYRIITNGM +EAGPTVWGALLYAC+LHGNVDIAE+AA +LFE EPDNELNFK
Subjt:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK

Query:  LLMKIYSSAGRSEDERRVRLMMLERGLD
        LLMKIY +AGR EDE+RVRLMM ERGLD
Subjt:  LLMKIYSSAGRSEDERRVRLMMLERGLD

A0A6J1JZI1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic2.7e-24981.44Show/hide
Query:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL
        MLIS + SISPI+SL L CSS PKKSKKER+KLL EKLIRI+KAKE   LPFPKSSSTPLLIHHKP S++K QALD+V++DLEAS+DNGVPIDA IFSSL
Subjt:  MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSL

Query:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF
        LETCY+LRA++ GIRIHRLIPTN LRRNVGV SKLLRLYASFG ME AH+VFDEM +RN SAF WNSLISGYAELGLYEDALALYFQMEEEGVEPD FTF
Subjt:  LETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTF

Query:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA
        PRVLK+CGGIGS R+GEAVHRH VRSGFAGD+FVLNAL+DMY+KCGD++RARKVFDQI  KD VSWNSMLTGYTRHGLLLEAL+ FD+MI+EG EPDS+A
Subjt:  PRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIA

Query:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL
        LST++S++ S K KLHIHGWAIRHG+EWNLSIANSL+ MYAN GKI+RA+WLF+ MPQRDIVSWN+IISAH N+ +ALTYFEVMESL VLPD+VTFVSLL
Subjt:  LSTVLSSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLL

Query:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK
        STCAHL LVK+G KLYS+MKGKY I+PT+EHYACMVNLYGRA +IEEAYRII NGM +EAGPTVWGALLYAC+LHGNVDIAE+AA +LFE EPDNELNFK
Subjt:  STCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFK

Query:  LLMKIYSSAGRSEDERRVRLMMLERGLD
        LLMKIY +AGR EDE+RVRLMM ERGLD
Subjt:  LLMKIYSSAGRSEDERRVRLMMLERGLD

SwissProt top hitse value%identityAlignment
P0C899 Putative pentatricopeptide repeat-containing protein At3g491423.8e-7532.35Show/hide
Query:  LIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAA-IFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERN
        L+H     + + + + S +  LE  +D   P +   +   +L+T   +R +     +H  I    LR N  +  KL+R YAS   +  A KVFDE+ ERN
Subjt:  LIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAA-IFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERN

Query:  FSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIG
            + N +I  Y   G Y + + ++  M    V PD +TFP VLK+C   G+  IG  +H  A + G +  +FV N L+ MY KCG L  AR V D++ 
Subjt:  FSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIG

Query:  CKDAVSWNSMLTGYT--------------------------------------------------------------------RHGLLLEALDIFDRMIR
         +D VSWNS++ GY                                                                     ++ + +EA++++ RM  
Subjt:  CKDAVSWNSMLTGYT--------------------------------------------------------------------RHGLLLEALDIFDRMIR

Query:  EGCEPDSIALSTVL-----SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNS---LEALTYFEV
        +G EPD++++++VL     +S LS  L   IHG+  R  +  NL + N+L+ MYA  G +++A+ +F+NM  RD+VSW ++ISA+  S    +A+  F  
Subjt:  EGCEPDSIALSTVL-----SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNS---LEALTYFEV

Query:  MESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEI
        ++   ++PD++ FV+ L+ C+H GL+++G   + +M   YKI P +EH ACMV+L GRA  ++EAYR I + M++E    VWGALL AC +H + DI  +
Subjt:  MESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEI

Query:  AAGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL
        AA +LF+L P+    + LL  IY+ AGR E+   +R +M  +GL
Subjt:  AAGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL

Q9SB36 Pentatricopeptide repeat-containing protein At4g25270, chloroplastic1.4e-17558.77Show/hide
Query:  PISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPI-DAAIFSSLLETCYRLRA
        P  S   + SS  KK  +  ++L   +  + N       L F K S TPLLI  + + +T+ +ALDSV++DLE S   G+ + +  IF+SLLETCY LRA
Subjt:  PISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPI-DAAIFSSLLETCYRLRA

Query:  IECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTFPRVLKSCGG
        I+ G+R+H LIP  LLR N+G+ SKL+RLYAS G  E+AH+VFD M +R+ S F WNSLISGYAELG YEDA+ALYFQM E+GV+PDRFTFPRVLK+CGG
Subjt:  IECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTFPRVLKSCGG

Query:  IGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIALSTVLSSVL
        IGS +IGEA+HR  V+ GF  DV+VLNAL+ MY+KCGD+V+AR VFD I  KD VSWNSMLTGY  HGLL EALDIF  M++ G EPD +A+S+VL+ VL
Subjt:  IGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIALSTVLSSVL

Query:  SWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLLSTCAHLGLV
        S+K    +HGW IR G+EW LS+AN+L+++Y+  G++ +A ++F  M +RD VSWN+IISAH  +   L YFE M   N  PD +TFVS+LS CA+ G+V
Subjt:  SWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLLSTCAHLGLV

Query:  KKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFKLLMKIYSSA
        + GE+L+S+M  +Y I P MEHYACMVNLYGRA M+EEAY +I   M +EAGPTVWGALLYAC+LHGN DI E+AA RLFELEPDNE NF+LL++IYS A
Subjt:  KKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFKLLMKIYSSA

Query:  GRSEDERRVRLMMLERGLD
         R+ED  RVR MM++RGL+
Subjt:  GRSEDERRVRLMMLERGLD

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220704.5e-7633.52Show/hide
Query:  SSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEM
        S T +++ +K + Q  +     V+ D+   V  G+       +++L +    R +E G ++H  I    LR NV V + LL +YA  G   +A  VFD M
Subjt:  SSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEM

Query:  RERNFSAF-----------------------------VWNSLISGYAELGLYEDALALYFQMEEEG-VEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVR
          R+ S++                              WNS+ISG+ + G    AL ++ +M  +  + PDRFT   VL +C  +    IG+ +H H V 
Subjt:  RERNFSAF-----------------------------VWNSLISGYAELGLYEDALALYFQMEEEG-VEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVR

Query:  SGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCK---------------------------------DAVSWNSMLTGYTRHGLLLEALDIFDRMIRE
        +GF     VLNALI MYS+CG +  AR++ +Q G K                                 D V+W +M+ GY +HG   EA+++F  M+  
Subjt:  SGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCK---------------------------------DAVSWNSMLTGYTRHGLLLEALDIFDRMIRE

Query:  GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMP-QRDIVSWNSIISA---HFNSLEALTYFEVME
        G  P+S  L+ +L   SS+ S      IHG A++ G  +++S++N+L+ MYA  G I  A   F  +  +RD VSW S+I A   H ++ EAL  FE M 
Subjt:  GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMP-QRDIVSWNSIISA---HFNSLEALTYFEVME

Query:  SLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAA
           + PD++T+V + S C H GLV +G + + MMK   KI PT+ HYACMV+L+GRA +++EA   I   M IE     WG+LL AC +H N+D+ ++AA
Subjt:  SLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAA

Query:  GRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMM
         RL  LEP+N   +  L  +YS+ G+ E+  ++R  M
Subjt:  GRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMM

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic8.1e-7836.69Show/hide
Query:  NGVPIDAAIFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQ
        +G+ ID A   S+   C   R I  G  +H +       R    C+ LL +Y+  G ++ A  VF EM +R  S   + S+I+GYA  GL  +A+ L+ +
Subjt:  NGVPIDAAIFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQ

Query:  MEEEGVEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFD
        MEEEG+ PD +T   VL  C        G+ VH     +    D+FV NAL+DMY+KCG +  A  VF ++  KD +SWN+++ GY+++    EAL +F+
Subjt:  MEEEGVEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFD

Query:  RMIRE-GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISA---HFNSLEALTY
         ++ E    PD   ++ VL   +S+ ++     IHG+ +R+G   +  +ANSLV MYA  G +  A  LF ++  +D+VSW  +I+    H    EA+  
Subjt:  RMIRE-GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISA---HFNSLEALTY

Query:  FEVMESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDI
        F  M    +  D ++FVSLL  C+H GLV +G + +++M+ + KI+PT+EHYAC+V++  R   + +AYR I N M I    T+WGALL  C +H +V +
Subjt:  FEVMESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDI

Query:  AEIAAGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL
        AE  A ++FELEP+N   + L+  IY+ A + E  +R+R  + +RGL
Subjt:  AEIAAGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic1.6e-8137.25Show/hide
Query:  FSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPD
        +  L+  C    ++   +R+HR I  N   ++  + +KL+ +Y+  G ++ A KVFD+ R+R  + +VWN+L       G  E+ L LY++M   GVE D
Subjt:  FSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPD

Query:  RFTFPRVLKSCGG----IGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIRE
        RFT+  VLK+C      +     G+ +H H  R G++  V+++  L+DMY++ G +  A  VF  +  ++ VSW++M+  Y ++G   EAL  F  M+RE
Subjt:  RFTFPRVLKSCGG----IGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIRE

Query:  --GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISA---HFNSLEALTYFEVM
             P+S+ + +VL   +S+ + +    IHG+ +R G++  L + ++LV MY   GK++  + +F  M  RD+VSWNS+IS+   H    +A+  FE M
Subjt:  --GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISA---HFNSLEALTYFEVM

Query:  ESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIA
         +    P  VTFVS+L  C+H GLV++G++L+  M   + IKP +EHYACMV+L GRA  ++EA +++ + M  E GP VWG+LL +C +HGNV++AE A
Subjt:  ESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIA

Query:  AGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL
        + RLF LEP N  N+ LL  IY+ A   ++ +RV+ ++  RGL
Subjt:  AGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL

Arabidopsis top hitse value%identityAlignment
AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein3.2e-7733.52Show/hide
Query:  SSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEM
        S T +++ +K + Q  +     V+ D+   V  G+       +++L +    R +E G ++H  I    LR NV V + LL +YA  G   +A  VFD M
Subjt:  SSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEM

Query:  RERNFSAF-----------------------------VWNSLISGYAELGLYEDALALYFQMEEEG-VEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVR
          R+ S++                              WNS+ISG+ + G    AL ++ +M  +  + PDRFT   VL +C  +    IG+ +H H V 
Subjt:  RERNFSAF-----------------------------VWNSLISGYAELGLYEDALALYFQMEEEG-VEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVR

Query:  SGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCK---------------------------------DAVSWNSMLTGYTRHGLLLEALDIFDRMIRE
        +GF     VLNALI MYS+CG +  AR++ +Q G K                                 D V+W +M+ GY +HG   EA+++F  M+  
Subjt:  SGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCK---------------------------------DAVSWNSMLTGYTRHGLLLEALDIFDRMIRE

Query:  GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMP-QRDIVSWNSIISA---HFNSLEALTYFEVME
        G  P+S  L+ +L   SS+ S      IHG A++ G  +++S++N+L+ MYA  G I  A   F  +  +RD VSW S+I A   H ++ EAL  FE M 
Subjt:  GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMP-QRDIVSWNSIISA---HFNSLEALTYFEVME

Query:  SLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAA
           + PD++T+V + S C H GLV +G + + MMK   KI PT+ HYACMV+L+GRA +++EA   I   M IE     WG+LL AC +H N+D+ ++AA
Subjt:  SLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAA

Query:  GRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMM
         RL  LEP+N   +  L  +YS+ G+ E+  ++R  M
Subjt:  GRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMM

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-8237.25Show/hide
Query:  FSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPD
        +  L+  C    ++   +R+HR I  N   ++  + +KL+ +Y+  G ++ A KVFD+ R+R  + +VWN+L       G  E+ L LY++M   GVE D
Subjt:  FSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPD

Query:  RFTFPRVLKSCGG----IGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIRE
        RFT+  VLK+C      +     G+ +H H  R G++  V+++  L+DMY++ G +  A  VF  +  ++ VSW++M+  Y ++G   EAL  F  M+RE
Subjt:  RFTFPRVLKSCGG----IGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIRE

Query:  --GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISA---HFNSLEALTYFEVM
             P+S+ + +VL   +S+ + +    IHG+ +R G++  L + ++LV MY   GK++  + +F  M  RD+VSWNS+IS+   H    +A+  FE M
Subjt:  --GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISA---HFNSLEALTYFEVM

Query:  ESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIA
         +    P  VTFVS+L  C+H GLV++G++L+  M   + IKP +EHYACMV+L GRA  ++EA +++ + M  E GP VWG+LL +C +HGNV++AE A
Subjt:  ESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIA

Query:  AGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL
        + RLF LEP N  N+ LL  IY+ A   ++ +RV+ ++  RGL
Subjt:  AGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-7632.35Show/hide
Query:  LIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAA-IFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERN
        L+H     + + + + S +  LE  +D   P +   +   +L+T   +R +     +H  I    LR N  +  KL+R YAS   +  A KVFDE+ ERN
Subjt:  LIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAA-IFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERN

Query:  FSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIG
            + N +I  Y   G Y + + ++  M    V PD +TFP VLK+C   G+  IG  +H  A + G +  +FV N L+ MY KCG L  AR V D++ 
Subjt:  FSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIG

Query:  CKDAVSWNSMLTGYT--------------------------------------------------------------------RHGLLLEALDIFDRMIR
         +D VSWNS++ GY                                                                     ++ + +EA++++ RM  
Subjt:  CKDAVSWNSMLTGYT--------------------------------------------------------------------RHGLLLEALDIFDRMIR

Query:  EGCEPDSIALSTVL-----SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNS---LEALTYFEV
        +G EPD++++++VL     +S LS  L   IHG+  R  +  NL + N+L+ MYA  G +++A+ +F+NM  RD+VSW ++ISA+  S    +A+  F  
Subjt:  EGCEPDSIALSTVL-----SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNS---LEALTYFEV

Query:  MESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEI
        ++   ++PD++ FV+ L+ C+H GL+++G   + +M   YKI P +EH ACMV+L GRA  ++EAYR I + M++E    VWGALL AC +H + DI  +
Subjt:  MESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEI

Query:  AAGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL
        AA +LF+L P+    + LL  IY+ AGR E+   +R +M  +GL
Subjt:  AAGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein5.8e-7936.69Show/hide
Query:  NGVPIDAAIFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQ
        +G+ ID A   S+   C   R I  G  +H +       R    C+ LL +Y+  G ++ A  VF EM +R  S   + S+I+GYA  GL  +A+ L+ +
Subjt:  NGVPIDAAIFSSLLETCYRLRAIECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQ

Query:  MEEEGVEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFD
        MEEEG+ PD +T   VL  C        G+ VH     +    D+FV NAL+DMY+KCG +  A  VF ++  KD +SWN+++ GY+++    EAL +F+
Subjt:  MEEEGVEPDRFTFPRVLKSCGGIGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFD

Query:  RMIRE-GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISA---HFNSLEALTY
         ++ E    PD   ++ VL   +S+ ++     IHG+ +R+G   +  +ANSLV MYA  G +  A  LF ++  +D+VSW  +I+    H    EA+  
Subjt:  RMIRE-GCEPDSIALSTVL---SSVLSWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISA---HFNSLEALTY

Query:  FEVMESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDI
        F  M    +  D ++FVSLL  C+H GLV +G + +++M+ + KI+PT+EHYAC+V++  R   + +AYR I N M I    T+WGALL  C +H +V +
Subjt:  FEVMESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDI

Query:  AEIAAGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL
        AE  A ++FELEP+N   + L+  IY+ A + E  +R+R  + +RGL
Subjt:  AEIAAGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGL

AT4G25270.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.0e-17658.77Show/hide
Query:  PISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPI-DAAIFSSLLETCYRLRA
        P  S   + SS  KK  +  ++L   +  + N       L F K S TPLLI  + + +T+ +ALDSV++DLE S   G+ + +  IF+SLLETCY LRA
Subjt:  PISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPI-DAAIFSSLLETCYRLRA

Query:  IECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTFPRVLKSCGG
        I+ G+R+H LIP  LLR N+G+ SKL+RLYAS G  E+AH+VFD M +R+ S F WNSLISGYAELG YEDA+ALYFQM E+GV+PDRFTFPRVLK+CGG
Subjt:  IECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTFPRVLKSCGG

Query:  IGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIALSTVLSSVL
        IGS +IGEA+HR  V+ GF  DV+VLNAL+ MY+KCGD+V+AR VFD I  KD VSWNSMLTGY  HGLL EALDIF  M++ G EPD +A+S+VL+ VL
Subjt:  IGSFRIGEAVHRHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIALSTVLSSVL

Query:  SWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLLSTCAHLGLV
        S+K    +HGW IR G+EW LS+AN+L+++Y+  G++ +A ++F  M +RD VSWN+IISAH  +   L YFE M   N  PD +TFVS+LS CA+ G+V
Subjt:  SWKLKLHIHGWAIRHGVEWNLSIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLLSTCAHLGLV

Query:  KKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFKLLMKIYSSA
        + GE+L+S+M  +Y I P MEHYACMVNLYGRA M+EEAY +I   M +EAGPTVWGALLYAC+LHGN DI E+AA RLFELEPDNE NF+LL++IYS A
Subjt:  KKGEKLYSMMKGKYKIKPTMEHYACMVNLYGRARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFKLLMKIYSSA

Query:  GRSEDERRVRLMMLERGLD
         R+ED  RVR MM++RGL+
Subjt:  GRSEDERRVRLMMLERGLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGATCTCTTCACAACAATCAATTTCTCCCATTTCTTCCCTTCTTCTCATCTGTTCATCCAAACCCAAAAAATCCAAAAAAGAAAGAAAAAAACTTCTCAACGAAAA
GCTAATTCGCATAAACAAAGCCAAAGAAATCCCTCATCTTCCTTTCCCCAAATCCTCATCGACCCCACTCTTAATCCACCACAAACCCCTCTCCCAAACCAAATTCCAAG
CTCTCGATTCCGTCGTCAGCGACCTCGAAGCCTCGGTCGACAATGGCGTCCCCATCGACGCCGCAATCTTCTCCTCCCTTTTGGAAACTTGTTACCGCTTGCGCGCCATT
GAATGCGGTATTCGGATTCATCGACTAATACCCACTAATCTTTTGCGCAGAAATGTGGGTGTTTGTTCTAAGCTTCTTCGGTTGTATGCTTCTTTTGGGCGTATGGAGAT
TGCCCATAAGGTGTTTGATGAAATGCGTGAACGAAATTTCTCTGCATTTGTTTGGAATTCTCTTATTTCTGGCTATGCTGAACTTGGTCTTTATGAAGATGCTTTGGCGC
TTTACTTTCAAATGGAGGAAGAAGGTGTCGAACCTGACCGCTTTACGTTCCCTCGCGTACTCAAGTCCTGTGGTGGTATTGGGTCGTTTCGAATCGGGGAGGCTGTGCAC
CGGCATGCCGTTCGTTCGGGATTTGCTGGAGATGTCTTTGTCCTCAACGCGTTGATTGATATGTATTCCAAATGTGGTGATCTTGTGAGAGCTAGGAAGGTTTTTGATCA
GATTGGTTGTAAGGATGCAGTTTCTTGGAACTCCATGCTTACTGGCTACACACGCCATGGGCTTCTCTTGGAGGCATTGGATATCTTTGATCGAATGATTCGAGAAGGTT
GCGAGCCGGATTCGATTGCTTTGTCGACCGTTCTTTCTAGTGTTTTGTCGTGGAAACTCAAGTTACACATTCACGGATGGGCAATTCGACACGGCGTGGAGTGGAATTTG
TCCATTGCTAACTCCTTGGTCATCATGTATGCCAATTATGGTAAGATTGACAGAGCAAAATGGCTGTTTCAGAATATGCCTCAAAGAGACATAGTTTCATGGAACTCCAT
AATATCTGCTCATTTCAATTCCTTAGAAGCTTTGACATATTTTGAAGTTATGGAGAGTCTTAATGTTTTGCCTGACAATGTAACATTTGTGTCATTGTTGTCAACTTGTG
CCCATTTGGGATTGGTGAAGAAAGGGGAAAAATTGTATTCTATGATGAAGGGAAAGTATAAAATAAAGCCAACCATGGAACATTATGCTTGTATGGTGAATCTCTATGGG
AGAGCAAGGATGATTGAAGAAGCTTACCGAATCATTACAAATGGGATGGCGATCGAGGCAGGCCCGACCGTGTGGGGGGCGTTGTTGTATGCGTGCCATCTTCACGGCAA
TGTAGATATCGCCGAGATTGCTGCTGGAAGGCTTTTCGAATTGGAACCGGATAATGAGCTCAATTTCAAGCTTTTGATGAAGATTTACAGCAGTGCGGGAAGATCAGAAG
ATGAGAGGAGAGTGAGATTGATGATGTTGGAACGAGGATTGGATTAG
mRNA sequenceShow/hide mRNA sequence
AACCTATAGAACTCCAAATGAGCGCTTTGAAGCCGCCATTTTTGCTCATTTCCAACGGTTTCTTTTCCCAATGCTGATCTCTTCACAACAATCAATTTCTCCCATTTCTT
CCCTTCTTCTCATCTGTTCATCCAAACCCAAAAAATCCAAAAAAGAAAGAAAAAAACTTCTCAACGAAAAGCTAATTCGCATAAACAAAGCCAAAGAAATCCCTCATCTT
CCTTTCCCCAAATCCTCATCGACCCCACTCTTAATCCACCACAAACCCCTCTCCCAAACCAAATTCCAAGCTCTCGATTCCGTCGTCAGCGACCTCGAAGCCTCGGTCGA
CAATGGCGTCCCCATCGACGCCGCAATCTTCTCCTCCCTTTTGGAAACTTGTTACCGCTTGCGCGCCATTGAATGCGGTATTCGGATTCATCGACTAATACCCACTAATC
TTTTGCGCAGAAATGTGGGTGTTTGTTCTAAGCTTCTTCGGTTGTATGCTTCTTTTGGGCGTATGGAGATTGCCCATAAGGTGTTTGATGAAATGCGTGAACGAAATTTC
TCTGCATTTGTTTGGAATTCTCTTATTTCTGGCTATGCTGAACTTGGTCTTTATGAAGATGCTTTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTCGAACCTGACCG
CTTTACGTTCCCTCGCGTACTCAAGTCCTGTGGTGGTATTGGGTCGTTTCGAATCGGGGAGGCTGTGCACCGGCATGCCGTTCGTTCGGGATTTGCTGGAGATGTCTTTG
TCCTCAACGCGTTGATTGATATGTATTCCAAATGTGGTGATCTTGTGAGAGCTAGGAAGGTTTTTGATCAGATTGGTTGTAAGGATGCAGTTTCTTGGAACTCCATGCTT
ACTGGCTACACACGCCATGGGCTTCTCTTGGAGGCATTGGATATCTTTGATCGAATGATTCGAGAAGGTTGCGAGCCGGATTCGATTGCTTTGTCGACCGTTCTTTCTAG
TGTTTTGTCGTGGAAACTCAAGTTACACATTCACGGATGGGCAATTCGACACGGCGTGGAGTGGAATTTGTCCATTGCTAACTCCTTGGTCATCATGTATGCCAATTATG
GTAAGATTGACAGAGCAAAATGGCTGTTTCAGAATATGCCTCAAAGAGACATAGTTTCATGGAACTCCATAATATCTGCTCATTTCAATTCCTTAGAAGCTTTGACATAT
TTTGAAGTTATGGAGAGTCTTAATGTTTTGCCTGACAATGTAACATTTGTGTCATTGTTGTCAACTTGTGCCCATTTGGGATTGGTGAAGAAAGGGGAAAAATTGTATTC
TATGATGAAGGGAAAGTATAAAATAAAGCCAACCATGGAACATTATGCTTGTATGGTGAATCTCTATGGGAGAGCAAGGATGATTGAAGAAGCTTACCGAATCATTACAA
ATGGGATGGCGATCGAGGCAGGCCCGACCGTGTGGGGGGCGTTGTTGTATGCGTGCCATCTTCACGGCAATGTAGATATCGCCGAGATTGCTGCTGGAAGGCTTTTCGAA
TTGGAACCGGATAATGAGCTCAATTTCAAGCTTTTGATGAAGATTTACAGCAGTGCGGGAAGATCAGAAGATGAGAGGAGAGTGAGATTGATGATGTTGGAACGAGGATT
GGATTAG
Protein sequenceShow/hide protein sequence
MLISSQQSISPISSLLLICSSKPKKSKKERKKLLNEKLIRINKAKEIPHLPFPKSSSTPLLIHHKPLSQTKFQALDSVVSDLEASVDNGVPIDAAIFSSLLETCYRLRAI
ECGIRIHRLIPTNLLRRNVGVCSKLLRLYASFGRMEIAHKVFDEMRERNFSAFVWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTFPRVLKSCGGIGSFRIGEAVH
RHAVRSGFAGDVFVLNALIDMYSKCGDLVRARKVFDQIGCKDAVSWNSMLTGYTRHGLLLEALDIFDRMIREGCEPDSIALSTVLSSVLSWKLKLHIHGWAIRHGVEWNL
SIANSLVIMYANYGKIDRAKWLFQNMPQRDIVSWNSIISAHFNSLEALTYFEVMESLNVLPDNVTFVSLLSTCAHLGLVKKGEKLYSMMKGKYKIKPTMEHYACMVNLYG
RARMIEEAYRIITNGMAIEAGPTVWGALLYACHLHGNVDIAEIAAGRLFELEPDNELNFKLLMKIYSSAGRSEDERRVRLMMLERGLD