; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh12G010470 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh12G010470
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmo_Chr12:9712338..9713897
RNA-Seq ExpressionCmoCh12G010470
SyntenyCmoCh12G010470
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453700.1 PREDICTED: pentatricopeptide repeat-containing protein At1g09190 [Cucumis melo]4.0e-23683.88Show/hide
Query:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQILAHFIS+C + N I YA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFS MKN
Subjt:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
         RIVPD+YTFAPLLKSC+NL +Y LG+CVI EVL RGF CFGSIRIGVVELYVCCEKM+DA K FDEM  RDVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN

Query:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
        +RSLVSWNT ISCLAQ+ RDVEALELFQQ+EEHGF+PDEVTVVTMLPVCSRLGAL+VGQ IHSY +SK +L+ TTMVGNSLIDFYCK GN E AYNIFQK
Subjt:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK

Query:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
        MTCK+VVSWNT+ILGFALNGKGE AIDLFMEM + DVKPNDAT VA+LTACVHSGLLEKGRE+FSSMAE YEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI

Query:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAEMA  ELISLEP NSGNYVLLSN+LAEEGRW++VENVR  MR K+VKKAPG+SASG
Subjt:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG

XP_022937515.1 pentatricopeptide repeat-containing protein At1g09190 [Cucurbita moschata]2.8e-282100Show/hide
Query:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
Subjt:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
        RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
Subjt:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN

Query:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
        DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
Subjt:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK

Query:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
        MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
Subjt:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI

Query:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
        RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
Subjt:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG

XP_022965696.1 pentatricopeptide repeat-containing protein At1g09190 [Cucurbita maxima]1.7e-27195.87Show/hide
Query:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICG  NEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPF+QSLLLFSS+KN
Subjt:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
        RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCE+MDDAQKVFDEMP  DVVVWNLMIRGFCKMGNVDLGLSLFRQMN
Subjt:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN

Query:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
        DRSLVSWNTTISCLAQSGRDVEALELFQQ+EEHGFEPDEVTVVTMLPVCSRLGA+DVGQMIHSYATSKADL+NTTMVGNSLIDFYCKSGNTERAYNIFQK
Subjt:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK

Query:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
        MTCK+VVSWNTMILGFALNGKGELAIDLFMEMG+GD KPND TLVAILTACVHSGLLEKG+EVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
Subjt:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI

Query:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
        RSMPMQPNATLWGALLGACRTHGNLKLAEMA NELISLEPSNSGNYVLLSN LAEE RW+DVENVR SMRGKNVKKAPGRSASG
Subjt:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG

XP_023538021.1 pentatricopeptide repeat-containing protein At1g09190 [Cucurbita pepo subsp. pepo]2.8e-27497.31Show/hide
Query:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGA NEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
Subjt:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
        RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFE FGSIRIGVVELYVCCE+MDDAQKVFDEMP+RDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
Subjt:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN

Query:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
        DRSLVSWNTTISCLAQSGRDVEAL+LFQQ+EEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADL+NTTMVGNSLIDFYCKSGNTE+AYNIFQK
Subjt:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK

Query:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
        MTCK+VVSWNTMILGFALNGKGELAIDLF EMGRGD KPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
Subjt:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI

Query:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
        RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRW+DVENVR SMRGKNVKKAPGRSASG
Subjt:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG

XP_038890516.1 pentatricopeptide repeat-containing protein At1g09190 [Benincasa hispida]8.1e-23783.26Show/hide
Query:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        M+KN   IERRILRLL G KS THLT+IHAHFLRH LHQSNQILAHFISIC A N I+YA+R+FSQS NPNIFLFNS+IKAHSL  PFQQSLLLFSSMKN
Subjt:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
         RIVPDEYTFAPLLKSC+NL +Y LG+CVI EVLRRGF CFGSIRIGVVELYVCCE+M+DA+KVFDEMP RDVVVWNLMIRGFCKMGNVD GL LFRQMN
Subjt:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN

Query:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
        +RSL+SWNT +SCLAQS  D EALELFQQ+EE GF+PDEVTVVTMLPVCSRLGALDVGQ IH+YA+SK D+++ T +GNSL+DFYCK GN ERAYNIFQK
Subjt:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK

Query:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
        MTCK+VVSWNTMILGFALNG GE AIDLFM+MG+ DVKPNDAT VA+LTACVHSGLLEKGRE+FSSMA+KYEI+PKLEHFGCMVDLLGRGGC+EEAH+LI
Subjt:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI

Query:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAEMA  EL SLEP NSGNYVLLSN+LAEEGRW+DVENVR  M+GK+VKKAPG+SASG
Subjt:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG

TrEMBL top hitse value%identityAlignment
A0A0A0LJW6 Uncharacterized protein1.4e-23483.26Show/hide
Query:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQILAHFIS+C + N IAYA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFSSMKN
Subjt:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
         RIVPD+YTFAPLLKSC+NL +Y LG+CVI EV RRGF CFGSIRIGVVELYVCCEKM+DA K+FDEM  RDVVVWNLMIRGFCK GNVD GL LFRQM+
Subjt:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN

Query:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
        +RSLVSWNT ISCLAQ+ RDVEALELFQQ+EEHGF+PDEVTVVTMLPVCSRLGAL+VGQ IHSYA+SK +L+  T VGNSLIDFYCK GN E+AYNIFQK
Subjt:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK

Query:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
        MTCK+VVSWNT+ILGFALNGKGE AIDLFMEM +  +KPNDAT VA+LTACVHSGLLEKGRE+FSSMAE YEI+PKLEHFGCMVDLLGRGGCVEEAH LI
Subjt:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI

Query:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
        +SMPMQPNATLWGA+LGACRTHGNLKLAEMA  ELISLEP NSGNYVLLSN+LAEEGRW++VENVR  MR K+VKKAPG+SASG
Subjt:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG

A0A1S3BXN7 pentatricopeptide repeat-containing protein At1g091901.9e-23683.88Show/hide
Query:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQILAHFIS+C + N I YA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFS MKN
Subjt:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
         RIVPD+YTFAPLLKSC+NL +Y LG+CVI EVL RGF CFGSIRIGVVELYVCCEKM+DA K FDEM  RDVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN

Query:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
        +RSLVSWNT ISCLAQ+ RDVEALELFQQ+EEHGF+PDEVTVVTMLPVCSRLGAL+VGQ IHSY +SK +L+ TTMVGNSLIDFYCK GN E AYNIFQK
Subjt:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK

Query:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
        MTCK+VVSWNT+ILGFALNGKGE AIDLFMEM + DVKPNDAT VA+LTACVHSGLLEKGRE+FSSMAE YEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI

Query:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAEMA  ELISLEP NSGNYVLLSN+LAEEGRW++VENVR  MR K+VKKAPG+SASG
Subjt:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG

A0A5A7U1F2 Pentatricopeptide repeat-containing protein1.9e-23683.88Show/hide
Query:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQILAHFIS+C + N I YA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFS MKN
Subjt:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
         RIVPD+YTFAPLLKSC+NL +Y LG+CVI EVL RGF CFGSIRIGVVELYVCCEKM+DA K FDEM  RDVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN

Query:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
        +RSLVSWNT ISCLAQ+ RDVEALELFQQ+EEHGF+PDEVTVVTMLPVCSRLGAL+VGQ IHSY +SK +L+ TTMVGNSLIDFYCK GN E AYNIFQK
Subjt:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK

Query:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
        MTCK+VVSWNT+ILGFALNGKGE AIDLFMEM + DVKPNDAT VA+LTACVHSGLLEKGRE+FSSMAE YEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI

Query:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAEMA  ELISLEP NSGNYVLLSN+LAEEGRW++VENVR  MR K+VKKAPG+SASG
Subjt:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG

A0A6J1FGY4 pentatricopeptide repeat-containing protein At1g091901.4e-282100Show/hide
Query:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
Subjt:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
        RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
Subjt:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN

Query:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
        DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
Subjt:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK

Query:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
        MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
Subjt:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI

Query:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
        RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
Subjt:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG

A0A6J1HMD5 pentatricopeptide repeat-containing protein At1g091908.4e-27295.87Show/hide
Query:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICG  NEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPF+QSLLLFSS+KN
Subjt:  MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN
        RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCE+MDDAQKVFDEMP  DVVVWNLMIRGFCKMGNVDLGLSLFRQMN
Subjt:  RRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN

Query:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK
        DRSLVSWNTTISCLAQSGRDVEALELFQQ+EEHGFEPDEVTVVTMLPVCSRLGA+DVGQMIHSYATSKADL+NTTMVGNSLIDFYCKSGNTERAYNIFQK
Subjt:  DRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQK

Query:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
        MTCK+VVSWNTMILGFALNGKGELAIDLFMEMG+GD KPND TLVAILTACVHSGLLEKG+EVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI
Subjt:  MTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLI

Query:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG
        RSMPMQPNATLWGALLGACRTHGNLKLAEMA NELISLEPSNSGNYVLLSN LAEE RW+DVENVR SMRGKNVKKAPGRSASG
Subjt:  RSMPMQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG

SwissProt top hitse value%identityAlignment
O80488 Pentatricopeptide repeat-containing protein At1g091901.4e-15958.11Show/hide
Query:  IERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDE
        IER++LRLL GH + T L +IHAH LRH LH SN +LAHFISICG+ +   YANRVFS  QNPN+ +FN+MIK +SL GP  +SL  FSSMK+R I  DE
Subjt:  IERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDE

Query:  YTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSW
        YT+APLLKSCS+L D R GKCV GE++R GF   G IRIGVVELY    +M DAQKVFDEM  R+VVVWNLMIRGFC  G+V+ GL LF+QM++RS+VSW
Subjt:  YTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSW

Query:  NTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVV
        N+ IS L++ GRD EALELF ++ + GF+PDE TVVT+LP+ + LG LD G+ IHS A S     +   VGN+L+DFYCKSG+ E A  IF+KM  +NVV
Subjt:  NTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVV

Query:  SWNTMILGFALNGKGELAIDLFMEM-GRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQ
        SWNT+I G A+NGKGE  IDLF  M   G V PN+AT + +L  C ++G +E+G E+F  M E++++E + EH+G MVDL+ R G + EA   +++MP+ 
Subjt:  SWNTMILGFALNGKGELAIDLFMEM-GRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQ

Query:  PNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS
         NA +WG+LL ACR+HG++KLAE+AA EL+ +EP NSGNYVLLSN+ AEEGRW+DVE VR  M+   ++K+ G+S
Subjt:  PNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS

Q9FFG8 Pentatricopeptide repeat-containing protein At5g442304.2e-9539.52Show/hide
Query:  LTQIHAHFLRHDLHQSNQILAHFI---SICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDEYTFAPLLKSCSNLY
        + QIH H LR  L QS  IL   I   +  G   +  YA RV    Q  N FL+ ++I+ +++ G F +++ ++  M+   I P  +TF+ LLK+C  + 
Subjt:  LTQIHAHFLRHDLHQSNQILAHFI---SICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDEYTFAPLLKSCSNLY

Query:  DYRLGKCVIGEVLR-RGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSWNTTISCLAQSGRD
        D  LG+    +  R RGF CF  +   ++++YV CE +D A+KVFDEMP RDV+ W  +I  + ++GN++    LF  +  + +V+W   ++  AQ+ + 
Subjt:  DYRLGKCVIGEVLR-RGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSWNTTISCLAQSGRD

Query:  VEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALD-VGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVVSWNTMILGFALN
         EALE F ++E+ G   DEVTV   +  C++LGA     + +     S     +  ++G++LID Y K GN E A N+F  M  KNV ++++MILG A +
Subjt:  VEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALD-VGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVVSWNTMILGFALN

Query:  GKGELAIDLFMEM-GRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQPNATLWGALLGA
        G+ + A+ LF  M  + ++KPN  T V  L AC HSGL+++GR+VF SM + + ++P  +H+ CMVDLLGR G ++EA  LI++M ++P+  +WGALLGA
Subjt:  GKGELAIDLFMEM-GRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQPNATLWGALLGA

Query:  CRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS
        CR H N ++AE+AA  L  LEP   GNY+LLSN+ A  G W  V  VR  ++ K +KK P  S
Subjt:  CRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS

Q9FMA1 Pentatricopeptide repeat-containing protein At5g563101.1e-9540.17Show/hide
Query:  LTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHS-LSGPFQQSLLLFSSMKNRRIV--PDEYTFAPLLKSCSNLY
        L Q H + +   L++ N  +A FI  C  +  + YA  VF+    PN +L N+MI+A S L  P   S+ +    K   +   PD +TF  +LK    + 
Subjt:  LTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHS-LSGPFQQSLLLFSSMKNRRIV--PDEYTFAPLLKSCSNLY

Query:  DYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN--DRSLVSWNTTISCLAQSGR
        D   G+ + G+V+  GF+    +  G++++Y  C  + DA+K+FDEM  +DV VWN ++ G+ K+G +D   SL   M    R+ VSW   IS  A+SGR
Subjt:  DYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN--DRSLVSWNTTISCLAQSGR

Query:  DVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVVSWNTMILGFALN
          EA+E+FQ++     EPDEVT++ +L  C+ LG+L++G+ I SY   +  +     + N++ID Y KSGN  +A ++F+ +  +NVV+W T+I G A +
Subjt:  DVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVVSWNTMILGFALN

Query:  GKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQPNATLWGALLGAC
        G G  A+ +F  M +  V+PND T +AIL+AC H G ++ G+ +F+SM  KY I P +EH+GCM+DLLGR G + EA  +I+SMP + NA +WG+LL A 
Subjt:  GKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQPNATLWGALLGAC

Query:  RTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSA
          H +L+L E A +ELI LEP+NSGNY+LL+N+ +  GRW +   +R  M+G  VKK  G S+
Subjt:  RTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSA

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic7.4e-9232.87Show/hide
Query:  LRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGAS---NEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDEYT
        L LL   K+   L  IHA  ++  LH +N  L+  I  C  S     + YA  VF   Q PN+ ++N+M + H+LS     +L L+  M +  ++P+ YT
Subjt:  LRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGAS---NEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDEYT

Query:  FAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSWNT
        F  +LKSC+    ++ G+ + G VL+ G +    +   ++ +YV   +++DA KVFD+ P RDVV +  +I+G+   G ++    LF ++  + +VSWN 
Subjt:  FAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSWNT

Query:  TISCLAQSGRDVEALELFQQ-----------------------------------IEEHGF---------------------------------------
         IS  A++G   EALELF+                                    I++HGF                                       
Subjt:  TISCLAQSGRDVEALELFQQ-----------------------------------IEEHGF---------------------------------------

Query:  ---------------------------EPDEVTVVTMLPVCSRLGALDVGQMIHSYATSK-ADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVV
                                    P++VT++++LP C+ LGA+D+G+ IH Y   +   + N + +  SLID Y K G+ E A+ +F  +  K++ 
Subjt:  ---------------------------EPDEVTVVTMLPVCSRLGALDVGQMIHSYATSK-ADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVV

Query:  SWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQP
        SWN MI GFA++G+ + + DLF  M +  ++P+D T V +L+AC HSG+L+ GR +F +M + Y++ PKLEH+GCM+DLLG  G  +EA  +I  M M+P
Subjt:  SWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQP

Query:  NATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSA
        +  +W +LL AC+ HGN++L E  A  LI +EP N G+YVLLSNI A  GRW +V   R  +  K +KK PG S+
Subjt:  NATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSA

Q9SIL5 Pentatricopeptide repeat-containing protein At2g205402.1e-10238.16Show/hide
Query:  RNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRI-V
        R +E   +  L   KS     +I+A  + H L QS+ ++   +  C    ++ YA R+F+Q  NPN+FL+NS+I+A++ +  +   + ++  +  +   +
Subjt:  RNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRI-V

Query:  PDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSL
        PD +TF  + KSC++L    LGK V G + + G          ++++Y+  + + DA KVFDEM  RDV+ WN ++ G+ ++G +     LF  M D+++
Subjt:  PDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSL

Query:  VSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCK
        VSW   IS     G  VEA++ F++++  G EPDE++++++LP C++LG+L++G+ IH YA  +   +  T V N+LI+ Y K G   +A  +F +M  K
Subjt:  VSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCK

Query:  NVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMP
        +V+SW+TMI G+A +G    AI+ F EM R  VKPN  T + +L+AC H G+ ++G   F  M + Y+IEPK+EH+GC++D+L R G +E A  + ++MP
Subjt:  NVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMP

Query:  MQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS
        M+P++ +WG+LL +CRT GNL +A +A + L+ LEP + GNYVLL+NI A+ G+W+DV  +R  +R +N+KK PG S
Subjt:  MQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.3e-9332.87Show/hide
Query:  LRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGAS---NEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDEYT
        L LL   K+   L  IHA  ++  LH +N  L+  I  C  S     + YA  VF   Q PN+ ++N+M + H+LS     +L L+  M +  ++P+ YT
Subjt:  LRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGAS---NEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDEYT

Query:  FAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSWNT
        F  +LKSC+    ++ G+ + G VL+ G +    +   ++ +YV   +++DA KVFD+ P RDVV +  +I+G+   G ++    LF ++  + +VSWN 
Subjt:  FAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSWNT

Query:  TISCLAQSGRDVEALELFQQ-----------------------------------IEEHGF---------------------------------------
         IS  A++G   EALELF+                                    I++HGF                                       
Subjt:  TISCLAQSGRDVEALELFQQ-----------------------------------IEEHGF---------------------------------------

Query:  ---------------------------EPDEVTVVTMLPVCSRLGALDVGQMIHSYATSK-ADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVV
                                    P++VT++++LP C+ LGA+D+G+ IH Y   +   + N + +  SLID Y K G+ E A+ +F  +  K++ 
Subjt:  ---------------------------EPDEVTVVTMLPVCSRLGALDVGQMIHSYATSK-ADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVV

Query:  SWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQP
        SWN MI GFA++G+ + + DLF  M +  ++P+D T V +L+AC HSG+L+ GR +F +M + Y++ PKLEH+GCM+DLLG  G  +EA  +I  M M+P
Subjt:  SWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQP

Query:  NATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSA
        +  +W +LL AC+ HGN++L E  A  LI +EP N G+YVLLSNI A  GRW +V   R  +  K +KK PG S+
Subjt:  NATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSA

AT1G09190.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.0e-16058.11Show/hide
Query:  IERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDE
        IER++LRLL GH + T L +IHAH LRH LH SN +LAHFISICG+ +   YANRVFS  QNPN+ +FN+MIK +SL GP  +SL  FSSMK+R I  DE
Subjt:  IERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDE

Query:  YTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSW
        YT+APLLKSCS+L D R GKCV GE++R GF   G IRIGVVELY    +M DAQKVFDEM  R+VVVWNLMIRGFC  G+V+ GL LF+QM++RS+VSW
Subjt:  YTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSW

Query:  NTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVV
        N+ IS L++ GRD EALELF ++ + GF+PDE TVVT+LP+ + LG LD G+ IHS A S     +   VGN+L+DFYCKSG+ E A  IF+KM  +NVV
Subjt:  NTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVV

Query:  SWNTMILGFALNGKGELAIDLFMEM-GRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQ
        SWNT+I G A+NGKGE  IDLF  M   G V PN+AT + +L  C ++G +E+G E+F  M E++++E + EH+G MVDL+ R G + EA   +++MP+ 
Subjt:  SWNTMILGFALNGKGELAIDLFMEM-GRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQ

Query:  PNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS
         NA +WG+LL ACR+HG++KLAE+AA EL+ +EP NSGNYVLLSN+ AEEGRW+DVE VR  M+   ++K+ G+S
Subjt:  PNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS

AT2G20540.1 mitochondrial editing factor 211.5e-10338.16Show/hide
Query:  RNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRI-V
        R +E   +  L   KS     +I+A  + H L QS+ ++   +  C    ++ YA R+F+Q  NPN+FL+NS+I+A++ +  +   + ++  +  +   +
Subjt:  RNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRI-V

Query:  PDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSL
        PD +TF  + KSC++L    LGK V G + + G          ++++Y+  + + DA KVFDEM  RDV+ WN ++ G+ ++G +     LF  M D+++
Subjt:  PDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSL

Query:  VSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCK
        VSW   IS     G  VEA++ F++++  G EPDE++++++LP C++LG+L++G+ IH YA  +   +  T V N+LI+ Y K G   +A  +F +M  K
Subjt:  VSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCK

Query:  NVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMP
        +V+SW+TMI G+A +G    AI+ F EM R  VKPN  T + +L+AC H G+ ++G   F  M + Y+IEPK+EH+GC++D+L R G +E A  + ++MP
Subjt:  NVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMP

Query:  MQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS
        M+P++ +WG+LL +CRT GNL +A +A + L+ LEP + GNYVLL+NI A+ G+W+DV  +R  +R +N+KK PG S
Subjt:  MQPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS

AT5G44230.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-9639.52Show/hide
Query:  LTQIHAHFLRHDLHQSNQILAHFI---SICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDEYTFAPLLKSCSNLY
        + QIH H LR  L QS  IL   I   +  G   +  YA RV    Q  N FL+ ++I+ +++ G F +++ ++  M+   I P  +TF+ LLK+C  + 
Subjt:  LTQIHAHFLRHDLHQSNQILAHFI---SICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDEYTFAPLLKSCSNLY

Query:  DYRLGKCVIGEVLR-RGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSWNTTISCLAQSGRD
        D  LG+    +  R RGF CF  +   ++++YV CE +D A+KVFDEMP RDV+ W  +I  + ++GN++    LF  +  + +V+W   ++  AQ+ + 
Subjt:  DYRLGKCVIGEVLR-RGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMNDRSLVSWNTTISCLAQSGRD

Query:  VEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALD-VGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVVSWNTMILGFALN
         EALE F ++E+ G   DEVTV   +  C++LGA     + +     S     +  ++G++LID Y K GN E A N+F  M  KNV ++++MILG A +
Subjt:  VEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALD-VGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVVSWNTMILGFALN

Query:  GKGELAIDLFMEM-GRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQPNATLWGALLGA
        G+ + A+ LF  M  + ++KPN  T V  L AC HSGL+++GR+VF SM + + ++P  +H+ CMVDLLGR G ++EA  LI++M ++P+  +WGALLGA
Subjt:  GKGELAIDLFMEM-GRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQPNATLWGALLGA

Query:  CRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS
        CR H N ++AE+AA  L  LEP   GNY+LLSN+ A  G W  V  VR  ++ K +KK P  S
Subjt:  CRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRS

AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein7.9e-9740.17Show/hide
Query:  LTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHS-LSGPFQQSLLLFSSMKNRRIV--PDEYTFAPLLKSCSNLY
        L Q H + +   L++ N  +A FI  C  +  + YA  VF+    PN +L N+MI+A S L  P   S+ +    K   +   PD +TF  +LK    + 
Subjt:  LTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLFNSMIKAHS-LSGPFQQSLLLFSSMKNRRIV--PDEYTFAPLLKSCSNLY

Query:  DYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN--DRSLVSWNTTISCLAQSGR
        D   G+ + G+V+  GF+    +  G++++Y  C  + DA+K+FDEM  +DV VWN ++ G+ K+G +D   SL   M    R+ VSW   IS  A+SGR
Subjt:  DYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCKMGNVDLGLSLFRQMN--DRSLVSWNTTISCLAQSGR

Query:  DVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVVSWNTMILGFALN
          EA+E+FQ++     EPDEVT++ +L  C+ LG+L++G+ I SY   +  +     + N++ID Y KSGN  +A ++F+ +  +NVV+W T+I G A +
Subjt:  DVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAYNIFQKMTCKNVVSWNTMILGFALN

Query:  GKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQPNATLWGALLGAC
        G G  A+ +F  M +  V+PND T +AIL+AC H G ++ G+ +F+SM  KY I P +EH+GCM+DLLGR G + EA  +I+SMP + NA +WG+LL A 
Subjt:  GKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQPNATLWGALLGAC

Query:  RTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSA
          H +L+L E A +ELI LEP+NSGNY+LL+N+ +  GRW +   +R  M+G  VKK  G S+
Subjt:  RTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTCCAAAACCATTTCCACCGCCCGCCGGCGCCAAGTTTCATACTCTGCAACTCCGTTCTTCTTCTTCTCTCTGTGTATCGTACTAATATTCTCCATAACATGAG
CAAGAACTACCGGAACATCGAGCGGAGAATCCTCCGCCTCCTTTCCGGCCACAAATCCCCAACCCATCTCACTCAAATCCACGCCCACTTCCTCCGTCATGACCTTCACC
AATCCAACCAAATCCTCGCCCATTTCATCTCCATTTGCGGAGCTTCCAACGAAATTGCCTATGCCAATCGCGTCTTTTCCCAATCCCAAAATCCCAACATTTTCCTTTTC
AATTCCATGATCAAAGCCCATTCCCTTTCCGGCCCCTTCCAACAATCCCTTCTCCTGTTTTCCTCCATGAAGAATCGCAGGATTGTTCCTGACGAGTACACTTTCGCGCC
GCTGCTTAAATCGTGTTCGAATCTTTATGATTATCGGCTTGGTAAGTGCGTTATTGGTGAAGTTTTGCGTCGTGGATTTGAGTGTTTTGGGTCTATTCGTATTGGGGTGG
TTGAGTTGTATGTTTGTTGTGAGAAGATGGATGATGCACAGAAGGTGTTCGATGAAATGCCTCGCAGGGATGTGGTTGTTTGGAACTTGATGATTCGTGGGTTTTGCAAG
ATGGGTAATGTTGATTTGGGGTTATCTCTCTTTAGGCAAATGAATGATCGTAGCCTCGTTTCTTGGAACACAACTATTTCCTGTTTAGCTCAAAGTGGGCGTGATGTTGA
AGCTTTGGAACTCTTTCAACAAATAGAAGAACATGGTTTTGAACCAGATGAGGTAACTGTGGTCACAATGTTGCCTGTATGCTCTCGTTTGGGAGCTCTTGATGTTGGAC
AAATGATCCATTCTTATGCAACTTCCAAGGCAGATTTAATGAATACAACAATGGTAGGGAATTCGCTTATCGATTTCTACTGTAAATCTGGCAATACAGAAAGAGCTTAC
AACATTTTCCAGAAAATGACTTGCAAAAATGTTGTTTCATGGAACACAATGATCTTAGGCTTTGCTTTGAATGGGAAGGGAGAGCTTGCCATTGACCTTTTCATGGAGAT
GGGACGAGGGGATGTGAAGCCGAACGACGCGACACTCGTAGCCATCTTGACTGCTTGTGTTCATTCAGGATTGTTAGAGAAGGGTCGAGAGGTGTTTTCTTCAATGGCTG
AGAAGTATGAGATTGAGCCAAAACTTGAACATTTTGGTTGTATGGTTGATCTTTTGGGACGTGGTGGATGCGTGGAGGAGGCTCATAGCTTGATTAGAAGTATGCCAATG
CAGCCAAATGCCACTTTGTGGGGTGCTTTGCTTGGTGCTTGCAGAACTCATGGTAACTTGAAACTTGCAGAAATGGCAGCTAATGAGCTCATCAGTCTTGAACCATCGAA
CTCTGGTAATTATGTGTTGCTATCTAATATATTGGCTGAAGAAGGACGATGGAAAGATGTTGAGAATGTCCGATGCTCGATGAGAGGAAAAAACGTCAAGAAAGCCCCGG
GGCGGAGTGCAAGTGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTCCAAAACCATTTCCACCGCCCGCCGGCGCCAAGTTTCATACTCTGCAACTCCGTTCTTCTTCTTCTCTCTGTGTATCGTACTAATATTCTCCATAACATGAG
CAAGAACTACCGGAACATCGAGCGGAGAATCCTCCGCCTCCTTTCCGGCCACAAATCCCCAACCCATCTCACTCAAATCCACGCCCACTTCCTCCGTCATGACCTTCACC
AATCCAACCAAATCCTCGCCCATTTCATCTCCATTTGCGGAGCTTCCAACGAAATTGCCTATGCCAATCGCGTCTTTTCCCAATCCCAAAATCCCAACATTTTCCTTTTC
AATTCCATGATCAAAGCCCATTCCCTTTCCGGCCCCTTCCAACAATCCCTTCTCCTGTTTTCCTCCATGAAGAATCGCAGGATTGTTCCTGACGAGTACACTTTCGCGCC
GCTGCTTAAATCGTGTTCGAATCTTTATGATTATCGGCTTGGTAAGTGCGTTATTGGTGAAGTTTTGCGTCGTGGATTTGAGTGTTTTGGGTCTATTCGTATTGGGGTGG
TTGAGTTGTATGTTTGTTGTGAGAAGATGGATGATGCACAGAAGGTGTTCGATGAAATGCCTCGCAGGGATGTGGTTGTTTGGAACTTGATGATTCGTGGGTTTTGCAAG
ATGGGTAATGTTGATTTGGGGTTATCTCTCTTTAGGCAAATGAATGATCGTAGCCTCGTTTCTTGGAACACAACTATTTCCTGTTTAGCTCAAAGTGGGCGTGATGTTGA
AGCTTTGGAACTCTTTCAACAAATAGAAGAACATGGTTTTGAACCAGATGAGGTAACTGTGGTCACAATGTTGCCTGTATGCTCTCGTTTGGGAGCTCTTGATGTTGGAC
AAATGATCCATTCTTATGCAACTTCCAAGGCAGATTTAATGAATACAACAATGGTAGGGAATTCGCTTATCGATTTCTACTGTAAATCTGGCAATACAGAAAGAGCTTAC
AACATTTTCCAGAAAATGACTTGCAAAAATGTTGTTTCATGGAACACAATGATCTTAGGCTTTGCTTTGAATGGGAAGGGAGAGCTTGCCATTGACCTTTTCATGGAGAT
GGGACGAGGGGATGTGAAGCCGAACGACGCGACACTCGTAGCCATCTTGACTGCTTGTGTTCATTCAGGATTGTTAGAGAAGGGTCGAGAGGTGTTTTCTTCAATGGCTG
AGAAGTATGAGATTGAGCCAAAACTTGAACATTTTGGTTGTATGGTTGATCTTTTGGGACGTGGTGGATGCGTGGAGGAGGCTCATAGCTTGATTAGAAGTATGCCAATG
CAGCCAAATGCCACTTTGTGGGGTGCTTTGCTTGGTGCTTGCAGAACTCATGGTAACTTGAAACTTGCAGAAATGGCAGCTAATGAGCTCATCAGTCTTGAACCATCGAA
CTCTGGTAATTATGTGTTGCTATCTAATATATTGGCTGAAGAAGGACGATGGAAAGATGTTGAGAATGTCCGATGCTCGATGAGAGGAAAAAACGTCAAGAAAGCCCCGG
GGCGGAGTGCAAGTGGGTAA
Protein sequenceShow/hide protein sequence
MDVQNHFHRPPAPSFILCNSVLLLLSVYRTNILHNMSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGASNEIAYANRVFSQSQNPNIFLF
NSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDEYTFAPLLKSCSNLYDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCEKMDDAQKVFDEMPRRDVVVWNLMIRGFCK
MGNVDLGLSLFRQMNDRSLVSWNTTISCLAQSGRDVEALELFQQIEEHGFEPDEVTVVTMLPVCSRLGALDVGQMIHSYATSKADLMNTTMVGNSLIDFYCKSGNTERAY
NIFQKMTCKNVVSWNTMILGFALNGKGELAIDLFMEMGRGDVKPNDATLVAILTACVHSGLLEKGREVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPM
QPNATLWGALLGACRTHGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWKDVENVRCSMRGKNVKKAPGRSASG