; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011867 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011867
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153107:75198..77997
RNA-Seq ExpressionSgr011867
SyntenySgr011867
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013843.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0075.16Show/hide
Query:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISSDG
        MRHG GGFVAMESRAT TLS+L+DLLLVAS+TK                + EP  +       +H       F + SLS + S     +++   ++   G
Subjt:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISSDG

Query:  VL-SAPITNIP----------PPHTPKSSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEG
         L   P+                HT K     V LD   RS KF+AALEILDHMEELGTSLEL+TYNSVLVALVRKNQVGLALSIFFKLF+AF+ GG EG
Subjt:  VL-SAPITNIP----------PPHTPKSSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEG

Query:  NAAASFAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCL
        +A  SF+FLPNALACNELLVALRK+DMR EFK VFDKLR IR FE N CGYNICIHAFGCWGYLDTSLALFKEMKQ+SLVS SFGPDLCTYNSLIHVLCL
Subjt:  NAAASFAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCL

Query:  VGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILI
        VGK+ DAL+VWEELKGSGHEPDAFTYR+II GCCKSYRMDDAT IFNEMEYNGF+PDTIVYNSLLDGLFKAR+V EACQ FDKM QEGVRASPWTYNILI
Subjt:  VGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILI

Query:  DGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLK
        DGLFRNGRA ASY+LFCDLKKKGQFVDGVTYSII+LQLCKEGLLEEALQLVEEMEARGFV+DL+TVTSLLIAMH+QGQW+GLERLMKHIREGDLVP+VLK
Subjt:  DGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLK

Query:  WKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFD
        WKANMEDS+K+Q+NKRKDYSPLFSPK DLSEIISSRASSV KV++D I ENTEE+D D+WSSSPHVDLLA  A+ST  SLQPF L  G+RVQAKG NSFD
Subjt:  WKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFD

Query:  IDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQ
        IDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNP                               VCPAD+ATYN+IIQGLGKMGRADLASSVL+KLME+
Subjt:  IDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQ

Query:  GGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR
        GG+LDIVMYNTL+NALGKAGRMD+VNKLFEQMR+SGINPDVV+FNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDT LDFLGREIEK R
Subjt:  GGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR

XP_022134897.1 pentatricopeptide repeat-containing protein At4g01570 [Momordica charantia]0.0e+0074.75Show/hide
Query:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTK-------------PYRSPVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISS
        MRHGR GFVAMESRA S+LSQ++DLLLVAS+TK              +  P+ EP  +       +H       F + SLS +    A  ++R   ++  
Subjt:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTK-------------PYRSPVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISS

Query:  DGVL-SAP--ITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAAS
         G L   P  ++++        S  F + LD    S KF+AALEILDHMEELGT LELHTYNS+L+ALVRKNQVGLALSIF KLF+AF NGG  G  A S
Subjt:  DGVL-SAP--ITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAAS

Query:  FAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIK
         +FLP++LACNELLVALR+ADMR EFK VFDKLR IRGFE N CGYNICIHAFGCWGYLDTSLALFKEMKQKS VS+SFGPDLCTYNSLIHVLCLVG++K
Subjt:  FAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIK

Query:  DALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFR
        DAL+VWEELKGSGHEPDAFTYR+II GCCKSYRMDDATMIFNEMEYNGF  DTIVYNSLLDGLFKARKV E CQLFDKM QEGVRASPWTYNILIDGLFR
Subjt:  DALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFR

Query:  NGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANM
        NGRAAA YSLFCDLKKKGQFVDGVTYSI+VLQLCKEGLLEEA+Q VEEME RGF+VDLITV+SLLI MH+QG+WDGLERLMKHIREGDLVP+VLKWKANM
Subjt:  NGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANM

Query:  EDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVN
        EDS+K+QQNKRKDYS LFSPK DL+EIISSRASSV KVD++ IF NTEE    +WS SPHVDLLA  ARSTSHS QPF L +GRRVQAKG NSFDIDMVN
Subjt:  EDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVN

Query:  TFLSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLD
        TFLSIFLAKGKLSLACKLFEIFSDMGV+P                               VCPAD+ATYN+IIQGLGKMGRADLASSVLDKLMEQGG+LD
Subjt:  TFLSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLD

Query:  IVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR
        IVMYNTLINALGKAGRMDEVNKLF+QMR+SGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGC+PNHVTDTTLDFLGREIEK+R
Subjt:  IVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR

XP_022929794.1 pentatricopeptide repeat-containing protein At4g01570 [Cucurbita moschata]0.0e+0075.57Show/hide
Query:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISSDG
        MRHG GGFVAMESRAT TLS+L+DLLLVAS+TK                + EP  +       +H       F + SLS + S     +++   ++   G
Subjt:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISSDG

Query:  VL-SAP--ITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFA
         L   P  ++++        S  F V LD   RS KF+ AL+ILDHMEELGTSLEL+TYNSVLVALVRKNQVGLALSIFFKLF+AF+ GG EG+A  SF+
Subjt:  VL-SAP--ITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFA

Query:  FLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDA
        FLPNALACNELLVALRK+DMR EFK VFDKLR IR FE N CGYNICIHAFGCWGYLDTSLALFKEMKQ+SLVS SF PDLCTYNSLIHVLCLVGK+ DA
Subjt:  FLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDA

Query:  LVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNG
        L+VWEELKGSGHEPDAFTYR+II GCCKSYRMDDAT IFNEMEYNGF+PDTIVYNSLLDGLFKAR+V EACQ FDKM QEGVRASPWTYNILIDGLFRNG
Subjt:  LVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNG

Query:  RAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMED
        RA ASY+LFCDLKKKGQFVDGVTYSII+LQLCKEGLLEEALQLVEEMEARGFV+DL+TVTSLLIAMH+QGQW+GLERLMKHIREGDLVP+VLKWKANMED
Subjt:  RAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMED

Query:  SMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTF
        S+K+Q+NKRKDYSPLFSPK DLSEIISSRASSV KV  D I ENTEE+D D+WSSSPHVDLLA  A+ST  SLQPF L  G+RVQAKG NSFDIDMVNTF
Subjt:  SMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTF

Query:  LSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIV
        LSIFLAKGKLSLACKLFEIFSDMGVNP                               VCPAD+ATYNVIIQGLGKMGRADLASSVL+KLMEQGG+LDIV
Subjt:  LSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIV

Query:  MYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR
        MYNTL+NALGKAGRMD+VNKLFEQMR+SGINPDVV+FNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDT LDFLGREIEK R
Subjt:  MYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR

XP_022992119.1 pentatricopeptide repeat-containing protein At4g01570 [Cucurbita maxima]0.0e+0074.81Show/hide
Query:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISSDG
        MRHG GGFVAMESRAT TLS+L+DLLLVAS+TK                + EP  +       +H       F + SLS + S  A  +++   ++   G
Subjt:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISSDG

Query:  V---LSAPITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFA
            +   ++++        S  F V LD   RS KF+AALEILDHMEELGTSLEL+TYNSVLVALVRKNQVGLALSIFFKLF+AF+ GG EG+A  SF+
Subjt:  V---LSAPITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFA

Query:  FLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDA
        FLPNALACNELLVALRK+DMR EFKMVFDKLR IR FE N CGYNICIHAFGCWGYLDTSLALFKEMKQ+SLVS SFGPDLCTYNSLIHVLCLVGK+ DA
Subjt:  FLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDA

Query:  LVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNG
        L+VWEELKGSGHEPDAFTYR+II GCCKSYRMDDAT IFNEMEYNGF+P+TIVYNSLLDGLFKAR+V EACQ FDKM Q+GVRASPWTYNILIDGLFRNG
Subjt:  LVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNG

Query:  RAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMED
        RA ASYSLFCDLKKKGQFVDGVTYSII+LQLCKEGLLEEALQLVEEMEARGFV+DL+TVTSLLIAM++QGQW+GLERLMKHIREGDLVP+VLKWKANMED
Subjt:  RAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMED

Query:  SMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTF
        S+K+Q+NKRKDYSPLFSPK DLSEIISSRA+SV KV++D I ENTEE+D D+WSSSPHVDLLA  A+ST  SLQ F L  G+RVQ+KG NSFDIDMVNTF
Subjt:  SMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTF

Query:  LSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIV
        LSIFLAKGKLSLACKLF+IFSDMGVNP                               VCPAD+ATYN+IIQGLGKMGRADLASSVL+KLMEQGG+LDIV
Subjt:  LSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIV

Query:  MYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR
        MYNTL+NALGKAGRMD+VNKLFEQMR+SGI PDVV+FNTLIEVHSKAGRFKDAYK+LKMMLDSGCSPNHVTDT LDFLGREIEK R
Subjt:  MYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR

XP_023549441.1 pentatricopeptide repeat-containing protein At4g01570 [Cucurbita pepo subsp. pepo]0.0e+0075.57Show/hide
Query:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTR-QRSLISSD
        MRH  GGFVAMESRAT TLS+L+DLLLVAS+TK                + EP  +       +H       F + SLS + S  A  +++  R+L  S 
Subjt:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTR-QRSLISSD

Query:  GVLSAP--ITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFA
         +   P  ++++        S  F V LD   RS KF+AALEILDHMEELGTSLEL+TYNSVLVALVRKNQVGLALSIFFKLF+AF+ GG EG+A  SF+
Subjt:  GVLSAP--ITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFA

Query:  FLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDA
        FLPNALACNELLVALRK+DMR EFK VFDKLR IR FE N CGYNICIHAFGCWGYLDTSLALFKEMKQ+SLVS SFGPDLCTYNSLIHVLCLVGK+ DA
Subjt:  FLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDA

Query:  LVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNG
        L+VWEELKGSGHEPDAFTYR+II GCCKSYRMDDAT IFNEMEYNGF+PDTIVYNSLLDGLFKAR+V EACQ FDKM QEGVRASPWTYNILIDGLFRNG
Subjt:  LVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNG

Query:  RAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMED
        RA ASYSLFCDLKKKGQFVDGVTYSII+LQLCKEGLLEEALQLVEEMEARGFV+DL+TVTSLLIAMH+QGQW+GLERLMKHIREGDLVP+VLKWKANMED
Subjt:  RAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMED

Query:  SMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTF
        S+K+Q+NKRK+YS LFSPK DLSEIISSRASSV KV++  I ENTEE+D D+WSSSPHVDLLA  A+ST  SLQPF L  G+RV+AKG NSFDIDMVNTF
Subjt:  SMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTF

Query:  LSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIV
        LSIFLAKGKLSLACKLFEIFSDMGVNP                               VCPAD+ATYN+IIQGLGKMGRADLASSVL+KLMEQGG+LDIV
Subjt:  LSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIV

Query:  MYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR
        MYNTL+NALGKAGRMD+VNKLFEQMR+SGINPDVV+FNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDT LDFLGREIEK R
Subjt:  MYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR

TrEMBL top hitse value%identityAlignment
A0A1S3CBH7 pentatricopeptide repeat-containing protein At4g015700.0e+0072.45Show/hide
Query:  MRHG--RGGFVAME--SRATSTLSQLSDLLLVASVTKP-----------YRSPVLEPFSIIHFPYRSLSSSKS-------SVAGLFTRQRSLISSDGVL-
        MRHG  R  F+ +E  SR  STLSQLSDLLLVAS+TK            +  P+  P  +     RSL+ S         S+A  F    S  S    + 
Subjt:  MRHG--RGGFVAME--SRATSTLSQLSDLLLVASVTKP-----------YRSPVLEPFSIIHFPYRSLSSSKS-------SVAGLFTRQRSLISSDGVL-

Query:  --SAPITNIPP-------------PHTPKSSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGL
          S  +  +PP              HT K     V LD   RS K++AALEILDHME+LGTSLEL+TYNSVLVAL+RKNQVGLALSIFFKLF+  NNGG 
Subjt:  --SAPITNIPP-------------PHTPKSSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGL

Query:  EGNAAASFAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVL
        + +AA SF FLPN+LACNELLVALRK DMR EF+ VFDKLRAI  FE N CGYNICI+AFGCWGYLDT+L+LFKEMK+KSLV  SFGPDLCTYNS+I VL
Subjt:  EGNAAASFAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVL

Query:  CLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNI
        CLVGK+KDAL+VWEELKGSGHEPDAFTYR+II GCCKSYRMDDATMIFNEMEYNG IPD IVYNSLL+GLFKARKV+EACQLFDKM QE VRASPWTYNI
Subjt:  CLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNI

Query:  LIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSV
        LIDGLFRNGRA A Y+LFCDLKKKGQFVDGVTYSII+LQLCKEGLLEEALQLVEEMEARGFVVDLIT+TSLLIAMH+QGQW+GLERLMKHIREGDLVP+V
Subjt:  LIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSV

Query:  LKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNS
        LKWK NMEDS+K+Q+NKR+D+S LFSPK DL E+ISSRASS  +V++D   ENTEE D D WSSSPHVD LA  A ST+  LQPF L +GRR+Q KG NS
Subjt:  LKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNS

Query:  FDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLM
        FDI+MVNTFLSIFLAKGKL+LACKLFEIFSDMGVNP                               VCPAD+ATYNVIIQGLGKMGRADLASSVL+KLM
Subjt:  FDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLM

Query:  EQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR
        EQGG+LDIVMYNTLINALGKAGRMD+VNKLF+QMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEK R
Subjt:  EQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR

A0A5A7TE47 Pentatricopeptide repeat-containing protein0.0e+0072.45Show/hide
Query:  MRHG--RGGFVAME--SRATSTLSQLSDLLLVASVTKP-----------YRSPVLEPFSIIHFPYRSLSSSKS-------SVAGLFTRQRSLISSDGVL-
        MRHG  R  F+ +E  SR  STLSQLSDLLLVAS+TK            +  P+  P  +     RSL+ S         S+A  F    S  S    + 
Subjt:  MRHG--RGGFVAME--SRATSTLSQLSDLLLVASVTKP-----------YRSPVLEPFSIIHFPYRSLSSSKS-------SVAGLFTRQRSLISSDGVL-

Query:  --SAPITNIPP-------------PHTPKSSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGL
          S  +  +PP              HT K     V LD   RS K++AALEILDHME+LGTSLEL+TYNSVLVAL+RKNQVGLALSIFFKLF+  NNGG 
Subjt:  --SAPITNIPP-------------PHTPKSSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGL

Query:  EGNAAASFAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVL
        + +AA SF FLPN+LACNELLVALRK DMR EF+ VFDKLRAI  FE N CGYNICI+AFGCWGYLDT+L+LFKEMK+KSLV  SFGPDLCTYNS+I VL
Subjt:  EGNAAASFAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVL

Query:  CLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNI
        CLVGK+KDAL+VWEELKGSGHEPDAFTYR+II GCCKSYRMDDATMIFNEMEYNG IPD IVYNSLL+GLFKARKV+EACQLFDKM QE VRASPWTYNI
Subjt:  CLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNI

Query:  LIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSV
        LIDGLFRNGRA A Y+LFCDLKKKGQFVDGVTYSII+LQLCKEGLLEEALQLVEEMEARGFVVDLIT+TSLLIAMH+QGQW+GLERLMKHIREGDLVP+V
Subjt:  LIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSV

Query:  LKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNS
        LKWK NMEDS+K+Q+NKR+D+S LFSPK DL E+ISSRASS  +V++D   ENTEE D D WSSSPHVD LA  A ST+  LQPF L +GRR+Q KG NS
Subjt:  LKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNS

Query:  FDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLM
        FDI+MVNTFLSIFLAKGKL+LACKLFEIFSDMGVNP                               VCPAD+ATYNVIIQGLGKMGRADLASSVL+KLM
Subjt:  FDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLM

Query:  EQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR
        EQGG+LDIVMYNTLINALGKAGRMD+VNKLF+QMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEK R
Subjt:  EQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR

A0A6J1C0Z4 pentatricopeptide repeat-containing protein At4g015700.0e+0074.75Show/hide
Query:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTK-------------PYRSPVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISS
        MRHGR GFVAMESRA S+LSQ++DLLLVAS+TK              +  P+ EP  +       +H       F + SLS +    A  ++R   ++  
Subjt:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTK-------------PYRSPVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISS

Query:  DGVL-SAP--ITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAAS
         G L   P  ++++        S  F + LD    S KF+AALEILDHMEELGT LELHTYNS+L+ALVRKNQVGLALSIF KLF+AF NGG  G  A S
Subjt:  DGVL-SAP--ITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAAS

Query:  FAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIK
         +FLP++LACNELLVALR+ADMR EFK VFDKLR IRGFE N CGYNICIHAFGCWGYLDTSLALFKEMKQKS VS+SFGPDLCTYNSLIHVLCLVG++K
Subjt:  FAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIK

Query:  DALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFR
        DAL+VWEELKGSGHEPDAFTYR+II GCCKSYRMDDATMIFNEMEYNGF  DTIVYNSLLDGLFKARKV E CQLFDKM QEGVRASPWTYNILIDGLFR
Subjt:  DALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFR

Query:  NGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANM
        NGRAAA YSLFCDLKKKGQFVDGVTYSI+VLQLCKEGLLEEA+Q VEEME RGF+VDLITV+SLLI MH+QG+WDGLERLMKHIREGDLVP+VLKWKANM
Subjt:  NGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANM

Query:  EDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVN
        EDS+K+QQNKRKDYS LFSPK DL+EIISSRASSV KVD++ IF NTEE    +WS SPHVDLLA  ARSTSHS QPF L +GRRVQAKG NSFDIDMVN
Subjt:  EDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVN

Query:  TFLSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLD
        TFLSIFLAKGKLSLACKLFEIFSDMGV+P                               VCPAD+ATYN+IIQGLGKMGRADLASSVLDKLMEQGG+LD
Subjt:  TFLSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLD

Query:  IVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR
        IVMYNTLINALGKAGRMDEVNKLF+QMR+SGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGC+PNHVTDTTLDFLGREIEK+R
Subjt:  IVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR

A0A6J1EPT7 pentatricopeptide repeat-containing protein At4g015700.0e+0075.57Show/hide
Query:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISSDG
        MRHG GGFVAMESRAT TLS+L+DLLLVAS+TK                + EP  +       +H       F + SLS + S     +++   ++   G
Subjt:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISSDG

Query:  VL-SAP--ITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFA
         L   P  ++++        S  F V LD   RS KF+ AL+ILDHMEELGTSLEL+TYNSVLVALVRKNQVGLALSIFFKLF+AF+ GG EG+A  SF+
Subjt:  VL-SAP--ITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFA

Query:  FLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDA
        FLPNALACNELLVALRK+DMR EFK VFDKLR IR FE N CGYNICIHAFGCWGYLDTSLALFKEMKQ+SLVS SF PDLCTYNSLIHVLCLVGK+ DA
Subjt:  FLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDA

Query:  LVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNG
        L+VWEELKGSGHEPDAFTYR+II GCCKSYRMDDAT IFNEMEYNGF+PDTIVYNSLLDGLFKAR+V EACQ FDKM QEGVRASPWTYNILIDGLFRNG
Subjt:  LVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNG

Query:  RAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMED
        RA ASY+LFCDLKKKGQFVDGVTYSII+LQLCKEGLLEEALQLVEEMEARGFV+DL+TVTSLLIAMH+QGQW+GLERLMKHIREGDLVP+VLKWKANMED
Subjt:  RAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMED

Query:  SMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTF
        S+K+Q+NKRKDYSPLFSPK DLSEIISSRASSV KV  D I ENTEE+D D+WSSSPHVDLLA  A+ST  SLQPF L  G+RVQAKG NSFDIDMVNTF
Subjt:  SMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTF

Query:  LSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIV
        LSIFLAKGKLSLACKLFEIFSDMGVNP                               VCPAD+ATYNVIIQGLGKMGRADLASSVL+KLMEQGG+LDIV
Subjt:  LSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIV

Query:  MYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR
        MYNTL+NALGKAGRMD+VNKLFEQMR+SGINPDVV+FNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDT LDFLGREIEK R
Subjt:  MYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR

A0A6J1JWP1 pentatricopeptide repeat-containing protein At4g015700.0e+0074.81Show/hide
Query:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISSDG
        MRHG GGFVAMESRAT TLS+L+DLLLVAS+TK                + EP  +       +H       F + SLS + S  A  +++   ++   G
Subjt:  MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRS-----------PVLEPFSI-------IH-------FPYRSLSSSKSSVAGLFTRQRSLISSDG

Query:  V---LSAPITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFA
            +   ++++        S  F V LD   RS KF+AALEILDHMEELGTSLEL+TYNSVLVALVRKNQVGLALSIFFKLF+AF+ GG EG+A  SF+
Subjt:  V---LSAPITNIPPPHTPKSSIPF-VTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFA

Query:  FLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDA
        FLPNALACNELLVALRK+DMR EFKMVFDKLR IR FE N CGYNICIHAFGCWGYLDTSLALFKEMKQ+SLVS SFGPDLCTYNSLIHVLCLVGK+ DA
Subjt:  FLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDA

Query:  LVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNG
        L+VWEELKGSGHEPDAFTYR+II GCCKSYRMDDAT IFNEMEYNGF+P+TIVYNSLLDGLFKAR+V EACQ FDKM Q+GVRASPWTYNILIDGLFRNG
Subjt:  LVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNG

Query:  RAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMED
        RA ASYSLFCDLKKKGQFVDGVTYSII+LQLCKEGLLEEALQLVEEMEARGFV+DL+TVTSLLIAM++QGQW+GLERLMKHIREGDLVP+VLKWKANMED
Subjt:  RAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMED

Query:  SMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTF
        S+K+Q+NKRKDYSPLFSPK DLSEIISSRA+SV KV++D I ENTEE+D D+WSSSPHVDLLA  A+ST  SLQ F L  G+RVQ+KG NSFDIDMVNTF
Subjt:  SMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTF

Query:  LSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIV
        LSIFLAKGKLSLACKLF+IFSDMGVNP                               VCPAD+ATYN+IIQGLGKMGRADLASSVL+KLMEQGG+LDIV
Subjt:  LSIFLAKGKLSLACKLFEIFSDMGVNP-------------------------------VCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIV

Query:  MYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR
        MYNTL+NALGKAGRMD+VNKLFEQMR+SGI PDVV+FNTLIEVHSKAGRFKDAYK+LKMMLDSGCSPNHVTDT LDFLGREIEK R
Subjt:  MYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVR

SwissProt top hitse value%identityAlignment
Q3EDF8 Pentatricopeptide repeat-containing protein At1g099003.3e-4826.49Show/hide
Query:  PDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMA
        PD+    +LI   C +GK + A  + E L+GSG  PD  TY V+I G CK+  +++A  + + M  +   PD + YN++L  L  + K+ +A ++ D+M 
Subjt:  PDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMA

Query:  QEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERL
        Q        TY ILI+   R+     +  L  +++ +G   D VTY+++V  +CKEG L+EA++ + +M + G   ++IT   +L +M   G+W   E+L
Subjt:  QEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERL

Query:  MKHIREGDLVPSVLKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFEN-TEERDADHWSSSPHVDLLAIFARSTSHSLQPFC
        +  +      PSV+ +   +    +           L     D+ E +         +  + +     +E+  D          +    R  S    P  
Subjt:  MKHIREGDLVPSVLKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFEN-TEERDADHWSSSPHVDLLAIFARSTSHSLQPFC

Query:  LGRGRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINA
                       DI   NT L+     GK+  A ++    S  G +PV    + TYN +I GL K G+   A  +LD++  +    D + Y++L+  
Subjt:  LGRGRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINA

Query:  LGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTT----LDFLGREIEKVRNLHVILKQGQISEARATQ
        L + G++DE  K F +    GI P+ VTFN+++    K+ +   A  FL  M++ GC PN  + T     L + G   E +  L+ +  +G + ++ A Q
Subjt:  LGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTT----LDFLGREIEKVRNLHVILKQGQISEARATQ

Query:  DA
         A
Subjt:  DA

Q8VZE4 Pentatricopeptide repeat-containing protein At4g015705.4e-22953.53Show/hide
Query:  MRHGRGGFVA-----MESRATSTLSQLSDLLLVASVTK---------------PYRSPV---------LEPFSIIHF---------PYRSLSSSKSSV--
        MRHGRG  V+     +     S   QL ++LLVAS++K               P   PV         ++P   + F          Y+  +++ S +  
Subjt:  MRHGRGGFVA-----MESRATSTLSQLSDLLLVASVTK---------------PYRSPV---------LEPFSIIHF---------PYRSLSSSKSSV--

Query:  ----AGLFTRQRSLISS---DGVLSAPITNIPPPHTPKSSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKL
             GL      L+ S   DGV                ++  + LD+  RS KF +AL +LD+MEELG  L    Y+SVL+ALV+K+++ LALSI FKL
Subjt:  ----AGLFTRQRSLISS---DGVLSAPITNIPPPHTPKSSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKL

Query:  FEAFNNGGLEGNAAASF-AFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLV-SESFGPD
         EA +N   +        ++LP  +A NELLV LR+ADMR+EFK VF+KL+ ++ F+ +   YNICIH FGCWG LD +L+LFKEMK++S V   SFGPD
Subjt:  FEAFNNGGLEGNAAASF-AFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLV-SESFGPD

Query:  LCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQE
        +CTYNSLIHVLCL GK KDAL+VW+ELK SGHEPD  TYR++I GCCKSYRMDDA  I+ EM+YNGF+PDTIVYN LLDG  KARKV+EACQLF+KM QE
Subjt:  LCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQE

Query:  GVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMK
        GVRAS WTYNILIDGLFRNGRA A ++LFCDLKKKGQFVD +T+SI+ LQLC+EG LE A++LVEEME RGF VDL+T++SLLI  H+QG+WD  E+LMK
Subjt:  GVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMK

Query:  HIREGDLVPSVLKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGR
        HIREG+LVP+VL+W A +E S+K  Q+K KDY+P+F  KG   +I+S   S     D     E     + D WSSSP++D L   A   +     F L R
Subjt:  HIREGDLVPSVLKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGR

Query:  GRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPV--------------------------------CPADVATYNVIIQGLGKMGR
        G+RV+AK  +SFD+DM+NTFLSI+L+KG LSLACKLFEIF+ MGV  +                                C AD+ATYNVIIQGLGKMGR
Subjt:  GRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPV--------------------------------CPADVATYNVIIQGLGKMGR

Query:  ADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLG
        ADLAS+VLD+L +QGG+LDIVMYNTLINALGKA R+DE  +LF+ M+++GINPDVV++NT+IEV+SKAG+ K+AYK+LK MLD+GC PNHVTDT LD+LG
Subjt:  ADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLG

Query:  REIEKVR
        +E+EK R
Subjt:  REIEKVR

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028609.5e-4823.17Show/hide
Query:  SSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFE---------------AFNNGGLEGNAAASF------
        +S+  + +    +  + ++A  + + ++E G SL++++Y S++ A     +   A+++F K+ E                F   G   N   S       
Subjt:  SSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFE---------------AFNNGGLEGNAAASF------

Query:  -AFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIK
            P+A   N L+   ++  +  E   VF++++A  GF  +   YN  +  +G       ++ +  EM     V   F P + TYNSLI      G + 
Subjt:  -AFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIK

Query:  DALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFR
        +A+ +  ++   G +PD FTY  ++ G  ++ +++ A  IF EM   G  P+   +N+ +       K +E  ++FD++   G+     T+N L+    +
Subjt:  DALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFR

Query:  NGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANM
        NG  +    +F ++K+ G   +  T++ ++    + G  E+A+ +   M   G   DL T  ++L A+ R G W+  E+++  + +G   P+ L +    
Subjt:  NGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANM

Query:  EDSMKHQQNKRKDYSPLFSPKGDL-SEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMV
          S+ H     K+   + S   ++ S +I  RA  +  + +                     DLL    R+ S             ++ +G  S DI  +
Subjt:  EDSMKHQQNKRKDYSPLFSPKGDL-SEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMV

Query:  NTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNS
        N+ +SI+  +  ++ A  + +   + G  P     +ATYN ++    +      +  +L +++ +G   DI+ YNT+I A  +  RM + +++F +MRNS
Subjt:  NTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNS

Query:  GINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVT
        GI PDV+T+NT I  ++    F++A   ++ M+  GC PN  T
Subjt:  GINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVT

Q9M907 Pentatricopeptide repeat-containing protein At3g069208.6e-4927.9Show/hide
Query:  YNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEME
        YN+CI +FG  G +D +   F E++   L      PD  TY S+I VLC   ++ +A+ ++E L+ +   P  + Y  +I G   + + D+A  +     
Subjt:  YNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEME

Query:  YNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQL
          G IP  I YN +L  L K  KV EA ++F++M ++    +  TYNILID L R G+   ++ L   ++K G F +  T +I+V +LCK   L+EA  +
Subjt:  YNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQL

Query:  VEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMEDSMKH--QQNKRKDYSPLFSP--KGDLSEIISSRASSVTKVDMD
         EEM+ +    D IT  SL+  + + G+ D   ++ + + + D   + + + + +++   H  +++  K Y  + +     DL +++++    + K    
Subjt:  VEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMEDSMKH--QQNKRKDYSPLFSP--KGDLSEIISSRASSVTKVDMD

Query:  ----VIFENTEER----DADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPA
             +FE  + R    DA  +S   H  + A FA  T              ++ +G    D    N  +  F   GK++ A +L E     G  P    
Subjt:  ----VIFENTEER----DADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPA

Query:  DVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLD
         V TY  +I GL K+ R D A  + ++   +   L++V+Y++LI+  GK GR+DE   + E++   G+ P++ T+N+L++   KA    +A    + M +
Subjt:  DVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLD

Query:  SGCSPNHVT
          C+PN VT
Subjt:  SGCSPNHVT

Q9SXD8 Pentatricopeptide repeat-containing protein At1g625904.0e-4626.9Show/hide
Query:  YNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEME
        +N  + A       D  ++L ++M++  +V       L TYN LI+  C   +I  AL +  ++   G+EP   T   +++G C   R+ DA  + ++M 
Subjt:  YNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEME

Query:  YNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQL
          G+ PDTI + +L+ GLF   K SEA  L D+M Q G + +  TY ++++GL + G    + +L   ++      D V ++ I+  LCK   +++AL L
Subjt:  YNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQL

Query:  VEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMEDSMKHQQ--NKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVI
         +EME +G   +++T +SL+  +   G+W    +L+  + E  + P+++ + A ++  +K  +     K Y  +   K  +   I +  S V    M   
Subjt:  VEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMEDSMKHQQ--NKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVI

Query:  FENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSF----------DIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPA
         +  ++      S     D++     + +  ++ FC  + +RV+  GT  F          D     T +      G    A K+F+     GV    P 
Subjt:  FENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSF----------DIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPA

Query:  DVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLD
        D+ TY++++ GL   G+ + A  V D + +    LDI +Y T+I  + KAG++D+   LF  +   G+ P+VVT+NT+I         ++AY  LK M +
Subjt:  DVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLD

Query:  SGCSPNHVTDTTL
         G  PN  T  TL
Subjt:  SGCSPNHVTDTTL

Arabidopsis top hitse value%identityAlignment
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.3e-4926.49Show/hide
Query:  PDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMA
        PD+    +LI   C +GK + A  + E L+GSG  PD  TY V+I G CK+  +++A  + + M  +   PD + YN++L  L  + K+ +A ++ D+M 
Subjt:  PDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMA

Query:  QEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERL
        Q        TY ILI+   R+     +  L  +++ +G   D VTY+++V  +CKEG L+EA++ + +M + G   ++IT   +L +M   G+W   E+L
Subjt:  QEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERL

Query:  MKHIREGDLVPSVLKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFEN-TEERDADHWSSSPHVDLLAIFARSTSHSLQPFC
        +  +      PSV+ +   +    +           L     D+ E +         +  + +     +E+  D          +    R  S    P  
Subjt:  MKHIREGDLVPSVLKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFEN-TEERDADHWSSSPHVDLLAIFARSTSHSLQPFC

Query:  LGRGRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINA
                       DI   NT L+     GK+  A ++    S  G +PV    + TYN +I GL K G+   A  +LD++  +    D + Y++L+  
Subjt:  LGRGRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINA

Query:  LGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTT----LDFLGREIEKVRNLHVILKQGQISEARATQ
        L + G++DE  K F +    GI P+ VTFN+++    K+ +   A  FL  M++ GC PN  + T     L + G   E +  L+ +  +G + ++ A Q
Subjt:  LGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTT----LDFLGREIEKVRNLHVILKQGQISEARATQ

Query:  DA
         A
Subjt:  DA

AT1G62590.1 pentatricopeptide (PPR) repeat-containing protein2.8e-4726.9Show/hide
Query:  YNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEME
        +N  + A       D  ++L ++M++  +V       L TYN LI+  C   +I  AL +  ++   G+EP   T   +++G C   R+ DA  + ++M 
Subjt:  YNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEME

Query:  YNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQL
          G+ PDTI + +L+ GLF   K SEA  L D+M Q G + +  TY ++++GL + G    + +L   ++      D V ++ I+  LCK   +++AL L
Subjt:  YNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQL

Query:  VEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMEDSMKHQQ--NKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVI
         +EME +G   +++T +SL+  +   G+W    +L+  + E  + P+++ + A ++  +K  +     K Y  +   K  +   I +  S V    M   
Subjt:  VEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMEDSMKHQQ--NKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVI

Query:  FENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSF----------DIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPA
         +  ++      S     D++     + +  ++ FC  + +RV+  GT  F          D     T +      G    A K+F+     GV    P 
Subjt:  FENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSF----------DIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPA

Query:  DVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLD
        D+ TY++++ GL   G+ + A  V D + +    LDI +Y T+I  + KAG++D+   LF  +   G+ P+VVT+NT+I         ++AY  LK M +
Subjt:  DVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLD

Query:  SGCSPNHVTDTTL
         G  PN  T  TL
Subjt:  SGCSPNHVTDTTL

AT3G06920.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.1e-5027.9Show/hide
Query:  YNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEME
        YN+CI +FG  G +D +   F E++   L      PD  TY S+I VLC   ++ +A+ ++E L+ +   P  + Y  +I G   + + D+A  +     
Subjt:  YNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEME

Query:  YNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQL
          G IP  I YN +L  L K  KV EA ++F++M ++    +  TYNILID L R G+   ++ L   ++K G F +  T +I+V +LCK   L+EA  +
Subjt:  YNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQL

Query:  VEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMEDSMKH--QQNKRKDYSPLFSP--KGDLSEIISSRASSVTKVDMD
         EEM+ +    D IT  SL+  + + G+ D   ++ + + + D   + + + + +++   H  +++  K Y  + +     DL +++++    + K    
Subjt:  VEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANMEDSMKH--QQNKRKDYSPLFSP--KGDLSEIISSRASSVTKVDMD

Query:  ----VIFENTEER----DADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPA
             +FE  + R    DA  +S   H  + A FA  T              ++ +G    D    N  +  F   GK++ A +L E     G  P    
Subjt:  ----VIFENTEER----DADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPA

Query:  DVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLD
         V TY  +I GL K+ R D A  + ++   +   L++V+Y++LI+  GK GR+DE   + E++   G+ P++ T+N+L++   KA    +A    + M +
Subjt:  DVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLD

Query:  SGCSPNHVT
          C+PN VT
Subjt:  SGCSPNHVT

AT4G01570.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.9e-23053.53Show/hide
Query:  MRHGRGGFVA-----MESRATSTLSQLSDLLLVASVTK---------------PYRSPV---------LEPFSIIHF---------PYRSLSSSKSSV--
        MRHGRG  V+     +     S   QL ++LLVAS++K               P   PV         ++P   + F          Y+  +++ S +  
Subjt:  MRHGRGGFVA-----MESRATSTLSQLSDLLLVASVTK---------------PYRSPV---------LEPFSIIHF---------PYRSLSSSKSSV--

Query:  ----AGLFTRQRSLISS---DGVLSAPITNIPPPHTPKSSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKL
             GL      L+ S   DGV                ++  + LD+  RS KF +AL +LD+MEELG  L    Y+SVL+ALV+K+++ LALSI FKL
Subjt:  ----AGLFTRQRSLISS---DGVLSAPITNIPPPHTPKSSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKL

Query:  FEAFNNGGLEGNAAASF-AFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLV-SESFGPD
         EA +N   +        ++LP  +A NELLV LR+ADMR+EFK VF+KL+ ++ F+ +   YNICIH FGCWG LD +L+LFKEMK++S V   SFGPD
Subjt:  FEAFNNGGLEGNAAASF-AFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLV-SESFGPD

Query:  LCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQE
        +CTYNSLIHVLCL GK KDAL+VW+ELK SGHEPD  TYR++I GCCKSYRMDDA  I+ EM+YNGF+PDTIVYN LLDG  KARKV+EACQLF+KM QE
Subjt:  LCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQE

Query:  GVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMK
        GVRAS WTYNILIDGLFRNGRA A ++LFCDLKKKGQFVD +T+SI+ LQLC+EG LE A++LVEEME RGF VDL+T++SLLI  H+QG+WD  E+LMK
Subjt:  GVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMK

Query:  HIREGDLVPSVLKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGR
        HIREG+LVP+VL+W A +E S+K  Q+K KDY+P+F  KG   +I+S   S     D     E     + D WSSSP++D L   A   +     F L R
Subjt:  HIREGDLVPSVLKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGR

Query:  GRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPV--------------------------------CPADVATYNVIIQGLGKMGR
        G+RV+AK  +SFD+DM+NTFLSI+L+KG LSLACKLFEIF+ MGV  +                                C AD+ATYNVIIQGLGKMGR
Subjt:  GRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPV--------------------------------CPADVATYNVIIQGLGKMGR

Query:  ADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLG
        ADLAS+VLD+L +QGG+LDIVMYNTLINALGKA R+DE  +LF+ M+++GINPDVV++NT+IEV+SKAG+ K+AYK+LK MLD+GC PNHVTDT LD+LG
Subjt:  ADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLG

Query:  REIEKVR
        +E+EK R
Subjt:  REIEKVR

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein6.7e-4923.17Show/hide
Query:  SSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFE---------------AFNNGGLEGNAAASF------
        +S+  + +    +  + ++A  + + ++E G SL++++Y S++ A     +   A+++F K+ E                F   G   N   S       
Subjt:  SSIPFVTLDTSTRSSKFNAALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFE---------------AFNNGGLEGNAAASF------

Query:  -AFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIK
            P+A   N L+   ++  +  E   VF++++A  GF  +   YN  +  +G       ++ +  EM     V   F P + TYNSLI      G + 
Subjt:  -AFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIK

Query:  DALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFR
        +A+ +  ++   G +PD FTY  ++ G  ++ +++ A  IF EM   G  P+   +N+ +       K +E  ++FD++   G+     T+N L+    +
Subjt:  DALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLDGLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFR

Query:  NGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANM
        NG  +    +F ++K+ G   +  T++ ++    + G  E+A+ +   M   G   DL T  ++L A+ R G W+  E+++  + +G   P+ L +    
Subjt:  NGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWDGLERLMKHIREGDLVPSVLKWKANM

Query:  EDSMKHQQNKRKDYSPLFSPKGDL-SEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMV
          S+ H     K+   + S   ++ S +I  RA  +  + +                     DLL    R+ S             ++ +G  S DI  +
Subjt:  EDSMKHQQNKRKDYSPLFSPKGDL-SEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLGRGRRVQAKGTNSFDIDMV

Query:  NTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNS
        N+ +SI+  +  ++ A  + +   + G  P     +ATYN ++    +      +  +L +++ +G   DI+ YNT+I A  +  RM + +++F +MRNS
Subjt:  NTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNKLFEQMRNS

Query:  GINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVT
        GI PDV+T+NT I  ++    F++A   ++ M+  GC PN  T
Subjt:  GINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCCATGGAAGAGGTGGTTTCGTTGCCATGGAGTCAAGGGCAACGTCAACTCTGTCTCAATTGTCTGACCTTCTCCTGGTGGCCTCCGTCACAAAACCCTATCGGAG
TCCGGTACTCGAACCCTTCAGCATCATTCACTTCCCTTATCGGAGCCTCTCCTCCTCCAAATCCTCCGTAGCAGGTCTCTTCACCCGTCAAAGAAGCTTGATTTCTTCAG
ATGGTGTTCTCTCAGCCCCAATTACAAACATTCCGCCTCCACATACTCCCAAATCTTCCATACCCTTTGTGACTCTGGATACCTCCACGAGGTCCAGTAAATTCAATGCT
GCTCTCGAAATTCTAGATCACATGGAGGAGTTGGGAACTAGCTTGGAACTCCACACGTATAACTCTGTTCTCGTCGCTCTGGTCAGAAAAAACCAGGTGGGTTTGGCCTT
GTCAATTTTCTTCAAGCTCTTCGAAGCTTTTAATAATGGAGGGCTAGAAGGTAATGCTGCGGCTAGTTTTGCCTTCTTGCCCAATGCCCTTGCTTGTAATGAATTGTTGG
TTGCTCTTAGGAAAGCAGACATGAGGGCCGAGTTCAAAATGGTTTTTGACAAACTTAGAGCAATTAGAGGCTTTGAGTTGAATGCATGCGGTTATAATATATGCATTCAT
GCTTTTGGGTGTTGGGGTTATCTGGATACTTCTCTTGCGCTGTTCAAAGAAATGAAGCAAAAGAGTTTAGTTTCTGAGTCTTTCGGACCGGATTTGTGCACGTACAATAG
CCTTATTCATGTGCTCTGTTTGGTAGGGAAGATCAAGGATGCACTTGTTGTGTGGGAGGAATTAAAAGGATCTGGTCATGAGCCTGATGCCTTCACATACCGTGTCATAA
TTCATGGTTGCTGTAAATCTTACAGAATGGATGATGCTACCATGATTTTTAATGAAATGGAGTACAATGGATTTATCCCAGATACTATTGTTTATAATTCTCTCCTTGAT
GGGTTATTCAAGGCTCGAAAAGTTAGCGAAGCTTGTCAACTTTTTGATAAGATGGCGCAGGAAGGTGTAAGAGCTTCTCCTTGGACATATAATATTCTTATTGATGGATT
GTTTAGGAATGGAAGAGCTGCAGCTAGTTACTCTTTATTCTGTGATTTGAAGAAAAAGGGTCAATTTGTCGATGGTGTTACATACAGCATCATTGTGCTGCAACTGTGTA
AAGAGGGACTACTTGAGGAAGCACTACAATTGGTTGAAGAAATGGAAGCGAGAGGCTTTGTTGTTGATCTAATTACTGTAACATCTCTATTGATTGCAATGCACAGGCAA
GGCCAGTGGGACGGTTTAGAGAGGCTCATGAAGCACATCAGAGAAGGTGATCTGGTACCCAGTGTTCTCAAATGGAAGGCCAATATGGAGGATTCAATGAAACATCAGCA
AAATAAAAGGAAAGACTACTCACCTTTGTTCTCCCCAAAGGGTGATTTGAGTGAGATTATAAGTTCAAGAGCTTCTTCTGTTACTAAAGTTGATATGGACGTGATTTTCG
AAAATACCGAAGAACGGGATGCTGACCATTGGTCATCATCCCCACATGTGGATCTTTTAGCTATTTTTGCGAGGTCCACAAGCCATTCTTTGCAACCATTCTGTCTCGGT
AGGGGGCGACGAGTTCAAGCAAAAGGGACCAACTCCTTCGATATTGATATGGTCAATACATTTTTGTCTATTTTTCTGGCAAAAGGAAAATTAAGCTTAGCCTGCAAGTT
GTTTGAGATCTTCAGTGATATGGGTGTGAACCCAGTTTGTCCAGCCGATGTAGCCACATACAACGTGATAATTCAAGGACTTGGGAAGATGGGTAGAGCGGATCTAGCTA
GTTCTGTTCTTGATAAGCTCATGGAACAGGGTGGCTTTCTCGATATTGTAATGTACAACACCTTGATTAATGCGCTGGGGAAGGCAGGCCGAATGGATGAAGTAAATAAG
CTTTTTGAGCAGATGAGAAACAGTGGAATAAACCCAGATGTTGTTACTTTTAATACACTTATTGAAGTTCACAGCAAAGCAGGTCGGTTTAAGGATGCCTACAAATTTTT
GAAGATGATGCTGGATTCAGGCTGCTCTCCGAATCACGTCACGGATACAACTTTGGATTTTCTAGGGAGGGAGATTGAGAAAGTGAGAAATCTTCACGTAATCTTGAAGC
AGGGGCAAATTAGTGAAGCTCGTGCTACACAGGATGCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGCCATGGAAGAGGTGGTTTCGTTGCCATGGAGTCAAGGGCAACGTCAACTCTGTCTCAATTGTCTGACCTTCTCCTGGTGGCCTCCGTCACAAAACCCTATCGGAG
TCCGGTACTCGAACCCTTCAGCATCATTCACTTCCCTTATCGGAGCCTCTCCTCCTCCAAATCCTCCGTAGCAGGTCTCTTCACCCGTCAAAGAAGCTTGATTTCTTCAG
ATGGTGTTCTCTCAGCCCCAATTACAAACATTCCGCCTCCACATACTCCCAAATCTTCCATACCCTTTGTGACTCTGGATACCTCCACGAGGTCCAGTAAATTCAATGCT
GCTCTCGAAATTCTAGATCACATGGAGGAGTTGGGAACTAGCTTGGAACTCCACACGTATAACTCTGTTCTCGTCGCTCTGGTCAGAAAAAACCAGGTGGGTTTGGCCTT
GTCAATTTTCTTCAAGCTCTTCGAAGCTTTTAATAATGGAGGGCTAGAAGGTAATGCTGCGGCTAGTTTTGCCTTCTTGCCCAATGCCCTTGCTTGTAATGAATTGTTGG
TTGCTCTTAGGAAAGCAGACATGAGGGCCGAGTTCAAAATGGTTTTTGACAAACTTAGAGCAATTAGAGGCTTTGAGTTGAATGCATGCGGTTATAATATATGCATTCAT
GCTTTTGGGTGTTGGGGTTATCTGGATACTTCTCTTGCGCTGTTCAAAGAAATGAAGCAAAAGAGTTTAGTTTCTGAGTCTTTCGGACCGGATTTGTGCACGTACAATAG
CCTTATTCATGTGCTCTGTTTGGTAGGGAAGATCAAGGATGCACTTGTTGTGTGGGAGGAATTAAAAGGATCTGGTCATGAGCCTGATGCCTTCACATACCGTGTCATAA
TTCATGGTTGCTGTAAATCTTACAGAATGGATGATGCTACCATGATTTTTAATGAAATGGAGTACAATGGATTTATCCCAGATACTATTGTTTATAATTCTCTCCTTGAT
GGGTTATTCAAGGCTCGAAAAGTTAGCGAAGCTTGTCAACTTTTTGATAAGATGGCGCAGGAAGGTGTAAGAGCTTCTCCTTGGACATATAATATTCTTATTGATGGATT
GTTTAGGAATGGAAGAGCTGCAGCTAGTTACTCTTTATTCTGTGATTTGAAGAAAAAGGGTCAATTTGTCGATGGTGTTACATACAGCATCATTGTGCTGCAACTGTGTA
AAGAGGGACTACTTGAGGAAGCACTACAATTGGTTGAAGAAATGGAAGCGAGAGGCTTTGTTGTTGATCTAATTACTGTAACATCTCTATTGATTGCAATGCACAGGCAA
GGCCAGTGGGACGGTTTAGAGAGGCTCATGAAGCACATCAGAGAAGGTGATCTGGTACCCAGTGTTCTCAAATGGAAGGCCAATATGGAGGATTCAATGAAACATCAGCA
AAATAAAAGGAAAGACTACTCACCTTTGTTCTCCCCAAAGGGTGATTTGAGTGAGATTATAAGTTCAAGAGCTTCTTCTGTTACTAAAGTTGATATGGACGTGATTTTCG
AAAATACCGAAGAACGGGATGCTGACCATTGGTCATCATCCCCACATGTGGATCTTTTAGCTATTTTTGCGAGGTCCACAAGCCATTCTTTGCAACCATTCTGTCTCGGT
AGGGGGCGACGAGTTCAAGCAAAAGGGACCAACTCCTTCGATATTGATATGGTCAATACATTTTTGTCTATTTTTCTGGCAAAAGGAAAATTAAGCTTAGCCTGCAAGTT
GTTTGAGATCTTCAGTGATATGGGTGTGAACCCAGTTTGTCCAGCCGATGTAGCCACATACAACGTGATAATTCAAGGACTTGGGAAGATGGGTAGAGCGGATCTAGCTA
GTTCTGTTCTTGATAAGCTCATGGAACAGGGTGGCTTTCTCGATATTGTAATGTACAACACCTTGATTAATGCGCTGGGGAAGGCAGGCCGAATGGATGAAGTAAATAAG
CTTTTTGAGCAGATGAGAAACAGTGGAATAAACCCAGATGTTGTTACTTTTAATACACTTATTGAAGTTCACAGCAAAGCAGGTCGGTTTAAGGATGCCTACAAATTTTT
GAAGATGATGCTGGATTCAGGCTGCTCTCCGAATCACGTCACGGATACAACTTTGGATTTTCTAGGGAGGGAGATTGAGAAAGTGAGAAATCTTCACGTAATCTTGAAGC
AGGGGCAAATTAGTGAAGCTCGTGCTACACAGGATGCGTGA
Protein sequenceShow/hide protein sequence
MRHGRGGFVAMESRATSTLSQLSDLLLVASVTKPYRSPVLEPFSIIHFPYRSLSSSKSSVAGLFTRQRSLISSDGVLSAPITNIPPPHTPKSSIPFVTLDTSTRSSKFNA
ALEILDHMEELGTSLELHTYNSVLVALVRKNQVGLALSIFFKLFEAFNNGGLEGNAAASFAFLPNALACNELLVALRKADMRAEFKMVFDKLRAIRGFELNACGYNICIH
AFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKIKDALVVWEELKGSGHEPDAFTYRVIIHGCCKSYRMDDATMIFNEMEYNGFIPDTIVYNSLLD
GLFKARKVSEACQLFDKMAQEGVRASPWTYNILIDGLFRNGRAAASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHRQ
GQWDGLERLMKHIREGDLVPSVLKWKANMEDSMKHQQNKRKDYSPLFSPKGDLSEIISSRASSVTKVDMDVIFENTEERDADHWSSSPHVDLLAIFARSTSHSLQPFCLG
RGRRVQAKGTNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVCPADVATYNVIIQGLGKMGRADLASSVLDKLMEQGGFLDIVMYNTLINALGKAGRMDEVNK
LFEQMRNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKVRNLHVILKQGQISEARATQDA