CuGenDBv2

Lsi03G022800 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi03G022800
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: pentatricopeptide repeat-containing protein At1g09900-like
Genome location: chr03:33685237..33687936
RNA-Seq Expression: Lsi03G022800
Synteny: Lsi03G022800
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily
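
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the record can be split into its parts and the span length computed as follows. The helper names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr03:33685237..33687936")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2700 bp on chr03; with half-open coordinates the result would be one less.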


Homology
GenBank top hits | e-value | % identity
KAG6580429.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia] | 2.0e-217 | 78.83
Query:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI
        +R+ LRPSFLTTLPLSST TPFGAN  EEN RLS NK+ HS+     SF               PTR+T       IKSLPIPSEEGTEIFIMSQK  EI
Subjt:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI

Query:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG
        +NI EFNDLFM+FVSE+EL LALKLLSN+SSYGLVPN RTFSIMIRCYCKKG+L+NA RVL QMLGRG  PNDAT+  LVNAFCKRGK QKALEMVE+VG
Subjt:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG

Query:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV
        R GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELL+EAE+NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Subjt:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV

Query:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS
        L KMKQMNC PDRI+Y+TLL GLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +R+WKEKDLLEDAHQVFEKMK++ QVIDRSTYGLLIQALCS
Subjt:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS

Query:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR
        GN  SEALANLHHMIGKGYSPRAITIDV+VQ+LC    H+G A+EALC+LGHGI FS  SFDLII ELN++GM  SA NVYGLALKRG+ PTK PR
Subjt:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR

KAG7017186.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma] | 7.5e-217 | 78.83
Query:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI
        +RL LRPSFLTTLPLSST TPFGAN  EEN RLS NK+ HS+     SF               PTR+T       IKSLPIPSEEGTEIF MSQK  EI
Subjt:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI

Query:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG
        +NI EFNDLFM+FVSE+EL LALKLLSN+SSYGLVPN RTFSIMIRCYCKKG+LDNA RVL QMLG G  PNDAT+  LVNAFCKRGK QKA EMVE+VG
Subjt:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG

Query:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV
        R GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELL+EAE+NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Subjt:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV

Query:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS
        L KMKQMNC PDRI+Y+TLL GLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +RSWKEKDLLEDAHQVFEKMK++ QVIDRSTYGLLIQALCS
Subjt:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS

Query:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR
        GN  SEALANLHHMIGKGYSPRAITIDV+VQ+LC    H+G A+EALC+LGHGI FS  SFDLII ELN++GM  SA NVYGLALKRG+ PTK PR
Subjt:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR

XP_022935128.1 pentatricopeptide repeat-containing protein At1g09900-like [Cucurbita moschata] | 2.6e-217 | 78.63
Query:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI
        +R+ LRPSFLTTLPLSST TPFGAN  EEN RLS NK+ HS+     SF               PTR+T       IKSLPIPSEEGTEIFIMSQK  EI
Subjt:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI

Query:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG
        +NI EFNDLFM+FVSE+EL LALKLLSN+SSYGLVPN RTFSIMIRCYCKKG+L+NA RVL QMLGRG  PNDAT+  LVNAFCKRGK QKALEMVE+VG
Subjt:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG

Query:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV
        R GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELL+EAE+NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Subjt:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV

Query:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS
        L KMKQMNC PDRI+Y+TLL GLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +R+WKEKDLLEDAHQVFEKMK++ QVIDRSTYGLLIQALCS
Subjt:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS

Query:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR
        GN  SEALANLHHMIGKGYSPRAITIDV+VQ+LC    H+G A+EALC+LGHGI FS  SFDL+I ELN++GM  SA NVYGLALKRG+ PTK PR
Subjt:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR

XP_022982589.1 pentatricopeptide repeat-containing protein At5g64320, mitochondrial-like [Cucurbita maxima] | 4.0e-218 | 79.44
Query:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI
        +RL LRPSFLTTLPLSST TPFGAN  E N R S NK+ HS+     SF               PTR+T      RIKSLPIPSEEGTEIFIMSQK  EI
Subjt:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI

Query:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG
        +NI EFNDLFM+FVSE+EL LALKLLSN++SYGLVPNSRTFSIMIRCYCKKG+LDNA RVL QMLGRG  PNDAT+  LVNAFCKRGK QKALEMVE+VG
Subjt:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG

Query:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV
        R GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELL+EAE NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Subjt:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV

Query:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS
        L KMKQMNC PDRI+Y+TLLHGLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +R+WKEKDLLEDAHQVFEKMK++FQVIDRSTYGLLIQALCS
Subjt:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS

Query:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR
        GN  SEALANLHHMIGKGYSP AITIDV+VQ+LC    H+GSA+EALC+LGHGI FS  SFDLII ELN++GM LSA +VYGLALKRG+ PTK PR
Subjt:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR

XP_038904608.1 pentatricopeptide repeat-containing protein At1g09900-like [Benincasa hispida] | 2.3e-229 | 84.29
Query:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRITRIKSLPIPSEEGTEIFIMSQKHTEIRNISEF
        + L L PSF TT PLS TYTPF AN   E SRLS NKQSHSI T   SFT              PTR+TRIKSLPIPSEEGTEIFIMSQK TEIRN+SEF
Subjt:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRITRIKSLPIPSEEGTEIFIMSQKHTEIRNISEF

Query:  NDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKP
        ND  MDFVSENEL LALKLLSNISSYGLVPNSRTFSIMIR YCKKGEL+ AG+VLEQM+GRGH PNDATV  LVNAFCKRGKTQKALEMVE+VGRIGRKP
Subjt:  NDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKP

Query:  TVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQ
        TV+TYNCLLKGLCYVGRVEEACEMVTEMKKD LIPDIYTYTALMDGLCKVGRSDEAMELL EAE NGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQ
Subjt:  TVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQ

Query:  MNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISE
        +NC PDRITYTTLLHGLIKWGKIRIAL TYKEMVSSGHTIEAKMMNTFMRAL +RSW EKDLLEDAHQVFEKMKDD+QVID+STYGLLIQALCSGNMISE
Subjt:  MNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISE

Query:  ALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR
        ALANLHHMI KGYSPRAI IDVVVQ+LC    H GS +EAL +LGHGIPF   SFDLII+ELNKQ MR SA NVYGLALKRGVNPTKTPR
Subjt:  ALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR
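
In these BLAST-style alignments, the middle line of each triplet marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch, the midline of one segment can be tallied as below. The function name `midline_counts` is hypothetical, and the example uses only the first 19 columns of one alignment row above, so it will not reproduce the reported % identity, which covers the whole alignment.

```python
def midline_counts(query: str, midline: str) -> dict[str, int]:
    """Tally identical, conservative ('+') and other columns of a midline."""
    # Trailing spaces are often stripped when alignments are copied,
    # so pad the midline back out to the query width.
    padded = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in padded)
    positive = padded.count("+")
    return {
        "identical": identical,
        "positive": positive,
        "other": len(padded) - identical - positive,
        "columns": len(padded),
    }


# First 19 columns of a query/midline pair from the first GenBank hit.
counts = midline_counts("RNISEFNDLFMDFVSENEL", "+NI EFNDLFM+FVSE+EL")
print(counts)
print(f"{100 * counts['identical'] / counts['columns']:.1f}% identical in this segment")
```

Summing these counts over every row of a hit gives an approximation of the % identity column, subject to how gap columns are treated.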

TrEMBL top hits | e-value | % identity
A0A6J1DW66 pentatricopeptide repeat-containing protein At1g09900-like | 1.3e-195 | 73.48
Query:  LPLSSTYTPFGANLFEENSRLSTNKQSHS---------------IHTALRSFTPTRIT------RIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFM
        LPLSST      N  EEN RLS NKQSHS               +   L    P RIT      RI SLP PS+EGTE+FI SQK  EI+NISEFNDLF 
Subjt:  LPLSSTYTPFGANLFEENSRLSTNKQSHS---------------IHTALRSFTPTRIT------RIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFM

Query:  DFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTY
        DFVS  EL LAL+LLSNISSYGLVPNSRTFSI IRCYCKKG+LDNA RV +QMLG G  PNDATV  LVNA C+RGK ++ALEMVE+VGRIGRK TV+TY
Subjt:  DFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTY

Query:  NCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMP
        NCLLKGLCYVGRVEEACEMV +MKKD L+PDIYTYTALMDGLCKVGRSDEAMELLNEAEENGL+PSVVTFNTLFNGYCKEGRP+DGI+VL KMKQMNCMP
Subjt:  NCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMP

Query:  DRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANL
        DRI+YTTLLHGLIKWGKIR ALRTYKEMVSSGH++E KMMNTFMRAL +RSWKEKDLLEDAHQVFEKMK++FQVI RSTYG++I ALCSGN ISEA+ANL
Subjt:  DRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANL

Query:  HHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLG---------HGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR
        HHMI KGYSPRAITI+VVV++LC      GS NEAL ++G         H IPFS  S+DLII+ELNKQGM   A  VYGLALKRGV  TK P+
Subjt:  HHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLG---------HGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR

A0A6J1F9P1 pentatricopeptide repeat-containing protein At1g09900-like | 1.3e-217 | 78.63
Query:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI
        +R+ LRPSFLTTLPLSST TPFGAN  EEN RLS NK+ HS+     SF               PTR+T       IKSLPIPSEEGTEIFIMSQK  EI
Subjt:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI

Query:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG
        +NI EFNDLFM+FVSE+EL LALKLLSN+SSYGLVPN RTFSIMIRCYCKKG+L+NA RVL QMLGRG  PNDAT+  LVNAFCKRGK QKALEMVE+VG
Subjt:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG

Query:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV
        R GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELL+EAE+NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Subjt:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV

Query:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS
        L KMKQMNC PDRI+Y+TLL GLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +R+WKEKDLLEDAHQVFEKMK++ QVIDRSTYGLLIQALCS
Subjt:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS

Query:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR
        GN  SEALANLHHMIGKGYSPRAITIDV+VQ+LC    H+G A+EALC+LGHGI FS  SFDL+I ELN++GM  SA NVYGLALKRG+ PTK PR
Subjt:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR

A0A6J1J506 pentatricopeptide repeat-containing protein At5g64320, mitochondrial-like | 1.9e-218 | 79.44
Query:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI
        +RL LRPSFLTTLPLSST TPFGAN  E N R S NK+ HS+     SF               PTR+T      RIKSLPIPSEEGTEIFIMSQK  EI
Subjt:  LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT--------------PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEI

Query:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG
        +NI EFNDLFM+FVSE+EL LALKLLSN++SYGLVPNSRTFSIMIRCYCKKG+LDNA RVL QMLGRG  PNDAT+  LVNAFCKRGK QKALEMVE+VG
Subjt:  RNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVG

Query:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV
        R GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELL+EAE NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Subjt:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV

Query:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS
        L KMKQMNC PDRI+Y+TLLHGLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +R+WKEKDLLEDAHQVFEKMK++FQVIDRSTYGLLIQALCS
Subjt:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS

Query:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR
        GN  SEALANLHHMIGKGYSP AITIDV+VQ+LC    H+GSA+EALC+LGHGI FS  SFDLII ELN++GM LSA +VYGLALKRG+ PTK PR
Subjt:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR

A0A6J5V7D1 PPR_long domain-containing protein | 1.4e-139 | 47.28
Query:  ALTEEMAVIKKFGDEHSNLLDQYERLSFEARLNQAMLERSLSEPRMLRSQPQVLF--AGSVQLPYLITTTNQPRKGRCGSTFNFNNLLNNLLKPILERKG
        A   + A+++ FGD+ S+LLD +ERLS E +LNQAML RSLSEP  +RSQ  +L   A S   P       + R+G       F+ +L  ++KPIL R  
Subjt:  ALTEEMAVIKKFGDEHSNLLDQYERLSFEARLNQAMLERSLSEPRMLRSQPQVLF--AGSVQLPYLITTTNQPRKGRCGSTFNFNNLLNNLLKPILERKG

Query:  RAKKERPHFKNPMDQIEAQFSRVAVMPYELLIQSSLQNQSFLPLKSKHKRMRLRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRS
          KKE P  K+P      +         +LL+ ++L   +  P  +         PL    L      S    +G     E  +L  +        A R 
Subjt:  RAKKERPHFKNPMDQIEAQFSRVAVMPYELLIQSSLQNQSFLPLKSKHKRMRLRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRS

Query:  FTPTRITRIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRG
             I RIK+LP    E + I  + ++    + +SEFN L M  V   E  +AL L + +S+YGLVP+S TFSIMIRCYC+K +LD A RVL  M+  G
Subjt:  FTPTRITRIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRG

Query:  HYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNE
         YPN AT+  L+N+ CKRG+ Q+ALE++EV+GRIG KPTV+ YNCLLKGLCYVGRVE+A EM+  +KKD++ PDIYT+TA+MDG CKVGRSDEAMELL+E
Subjt:  HYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNE

Query:  AEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDL
        A E GL P VVTFNTLFNGYCKEGRP++G+ VL +MK+ NC PD ITY+TLLHGL+KWGK R ALR YKEMV +G  ++ ++MN  +R L +RS KEKDL
Subjt:  AEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDL

Query:  LEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLG----HGIPFSTTSFDLI
        LEDAH+VFEKM+++   ID STYGL+IQ LC    +  A+  L  MIG GYSP  IT + V+++LC      G   EAL +L      G   +  S++ +
Subjt:  LEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLG----HGIPFSTTSFDLI

Query:  INELNKQGMRLSASNVYGLALKRG
        I+ LN++G  L   +VYG A+KRG
Subjt:  INELNKQGMRLSASNVYGLALKRG

A0A6J5VAX5 PPR_long domain-containing protein | 3.3e-141 | 47.15
Query:  ALTEEMAVIKKFGDEHSNLLDQYERLSFEARLNQAMLERSLSEPRMLRSQPQVLF--AGSVQLPYLITTTNQPRKGRCGSTFNFNNLLNNLLKPILERKG
        A   + A+++ FGD+ S+LLD +ERLS E +LNQAML RSLSEP  +RSQ  +L   A S   P       + R+G       F+ +L  ++KPIL R  
Subjt:  ALTEEMAVIKKFGDEHSNLLDQYERLSFEARLNQAMLERSLSEPRMLRSQPQVLF--AGSVQLPYLITTTNQPRKGRCGSTFNFNNLLNNLLKPILERKG

Query:  RAKKERPHFKNPMDQIEAQFSRVAVMPYELLIQSSLQNQSFLPLKSKHKRMRLRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRS
          KKE P  K+P      +         +LL+ ++L   +  P  +         PL    L      S    +G     E  +L  +        A R 
Subjt:  RAKKERPHFKNPMDQIEAQFSRVAVMPYELLIQSSLQNQSFLPLKSKHKRMRLRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRS

Query:  FTPTRITRIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRG
             I RIK+LP    E + I  + ++    + +SEFN L M  V   E  +AL L + +S+YGLVP+S TFSIMIRCYC+K +LD A RVL  M+  G
Subjt:  FTPTRITRIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRG

Query:  HYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNE
         YPN AT+  L+N+ CKRG+ Q+ALE++EV+GRIG KPTV+ YNCLLKGLCYVGRVE+A EM+  +KKD++ PDIYT+TA+MDG CKVGRSDEAMELL+E
Subjt:  HYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNE

Query:  AEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDL
        A E GL P VVTFNTLFNGYCKEGRP++G+ VL +MK+ NC PD ITY+TLLHGL+KWGK R ALR YKEMV +G  ++ ++MN  +R L +RS KEKDL
Subjt:  AEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDL

Query:  LEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLG----HGIPFSTTSFDLI
        LEDAH+VFEKM+++   ID STYGL+IQ LC    +  A+  L  MIG GYSP  IT + V+++LC      G   EAL +L      G   +  S++ +
Subjt:  LEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLG----HGIPFSTTSFDLI

Query:  INELNKQGMRLSASNVYGLALKRGVNPTKTPR
        I+ LN++G  L   +VYG A+KRGV P   P+
Subjt:  INELNKQGMRLSASNVYGLALKRGVNPTKTPR

SwissProt top hits | e-value | % identity
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial | 1.1e-42 | 29.16
Query:  NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGR
        N++ +N +         +  A  LL  +   G  P+  ++S ++  YC+ GELD   +++E M  +G  PN      ++   C+  K  +A E    + R
Subjt:  NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGR

Query:  IGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVL
         G  P    Y  L+ G C  G +  A +   EM    + PD+ TYTA++ G C++G   EA +L +E    GL+P  VTF  L NGYCK G   D   V 
Subjt:  IGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVL

Query:  NKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSG
        N M Q  C P+ +TYTTL+ GL K G +  A     EM   G        N+ +  L K        +E+A ++  + +      D  TY  L+ A C  
Subjt:  NKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSG

Query:  NMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEAL-CLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNP
          + +A   L  M+GKG  P  +T +V++   C  H       + L  +L  GI  + T+F+ ++ +   +    +A+ +Y     RGV P
Subjt:  NMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEAL-CLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNP

Q9FIX3 Pentatricopeptide repeat-containing protein At5g39710 | 1.6e-44 | 30.79
Query:  NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGR
        N+  +N L   +    ++    KLL +++  GL PN  +++++I   C++G +     VL +M  RG+  ++ T   L+  +CK G   +AL M   + R
Subjt:  NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGR

Query:  IGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVL
         G  P+V TY  L+  +C  G +  A E + +M+   L P+  TYT L+DG  + G  +EA  +L E  +NG  PSVVT+N L NG+C  G+  D I VL
Subjt:  IGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVL

Query:  NKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSG
          MK+    PD ++Y+T+L G  +   +  ALR  +EMV  G   +    ++ ++        E+   ++A  ++E+M       D  TY  LI A C  
Subjt:  NKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSG

Query:  NMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGH--GIPFSTTSFDLIIN
          + +AL   + M+ KG  P  +T  V++  L +  S T  A   L  L +   +P   T   LI N
Subjt:  NMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGH--GIPFSTTSFDLIIN

Q9FMF6 Pentatricopeptide repeat-containing protein At5g64320, mitochondrial | 8.5e-46 | 26.87
Query:  KHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEM
        KH  + N   +  L       N ++ AL+LL  +   G VP++ TF+ +I   CK   ++ A +++ +ML RG  P+D T  +L+N  CK G+   A ++
Subjt:  KHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEM

Query:  V--------------------------------EVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAM
                                         ++V   G  P V TYN L+ G    G V  A E++ +M+     P++Y+YT L+DG CK+G+ DEA 
Subjt:  V--------------------------------EVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAM

Query:  ELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSW
         +LNE   +GLKP+ V FN L + +CKE R  + + +  +M +  C PD  T+ +L+ GL +  +I+ AL   ++M+S G        NT + A  +R  
Subjt:  ELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSW

Query:  KEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDL
             +++A ++  +M      +D  TY  LI+ LC    + +A +    M+  G++P  I+ ++++  LC +     +      ++  G      +F+ 
Subjt:  KEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDL

Query:  IINELNKQGMRLSASNVYGLALKRGVNP
        +IN L + G       ++      G+ P
Subjt:  IINELNKQGMRLSASNVYGLALKRGVNP

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic | 3.6e-44 | 27.62
Query:  NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMV-EVVG
        ++S FN L       ++L  A+ +L ++ SYGLVP+ +TF+ +++ Y ++G+LD A R+ EQM+  G   ++ +V  +V+ FCK G+ + AL  + E+  
Subjt:  NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMV-EVVG

Query:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV
        + G  P   T+N L+ GLC  G V+ A E++  M ++   PD+YTY +++ GLCK+G   EA+E+L++       P+ VT+NTL +  CKE +  +   +
Subjt:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV

Query:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS
           +     +PD  T+ +L+ GL      R+A+  ++EM S G   +    N  + +L       K  L++A  + ++M+         TY  LI   C 
Subjt:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS

Query:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNP
         N   EA      M   G S  ++T + ++  LC +     +A     ++  G      +++ ++    + G    A+++       G  P
Subjt:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNP

Q9SR00 Pentatricopeptide repeat-containing protein At3g04760, chloroplastic | 1.7e-46 | 30.11
Query:  FNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRK
        +N +     S  +L LALK+L+ + S    P   T++I+I     +G +D A +++++ML RG  P+  T   ++   CK G   +A EMV  +   G +
Subjt:  FNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRK

Query:  PTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMK
        P V +YN LL+ L   G+ EE  +++T+M  +   P++ TY+ L+  LC+ G+ +EAM LL   +E GL P   +++ L   +C+EGR    I  L  M 
Subjt:  PTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMK

Query:  QMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMIS
           C+PD + Y T+L  L K GK   AL  + ++   G +  +   NT   AL    W   D +   H + E M +     D  TY  +I  LC   M+ 
Subjt:  QMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMIS

Query:  EALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSA
        EA   L  M    + P  +T ++V+   C  H    + N    ++G+G   + T++ ++I  +   G R  A
Subjt:  EALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSA

Arabidopsis top hits | e-value | % identity
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein | 8.2e-44 | 28.8
Query:  EFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGR
        E N+     V   EL    K L N+  +G VP+    + +IR +C+ G+   A ++LE + G G  P+  T   +++ +CK G+   AL    V+ R+  
Subjt:  EFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGR

Query:  KPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKM
         P V TYN +L+ LC  G++++A E++  M +    PD+ TYT L++  C+      AM+LL+E  + G  P VVT+N L NG CKEGR  + I  LN M
Subjt:  KPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKM

Query:  KQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMI
            C P+ IT+  +L  +   G+   A +   +M+  G +      N  +  L +     K LL  A  + EKM       +  +Y  L+   C    +
Subjt:  KQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMI

Query:  SEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQG
          A+  L  M+ +G  P  +T + ++ +LC       +      L   G      +++ +I+ L K G
Subjt:  SEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQG

AT3G04760.1 Pentatricopeptide repeat (PPR-like) superfamily protein | 1.2e-47 | 30.11
Query:  FNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRK
        +N +     S  +L LALK+L+ + S    P   T++I+I     +G +D A +++++ML RG  P+  T   ++   CK G   +A EMV  +   G +
Subjt:  FNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRK

Query:  PTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMK
        P V +YN LL+ L   G+ EE  +++T+M  +   P++ TY+ L+  LC+ G+ +EAM LL   +E GL P   +++ L   +C+EGR    I  L  M 
Subjt:  PTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMK

Query:  QMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMIS
           C+PD + Y T+L  L K GK   AL  + ++   G +  +   NT   AL    W   D +   H + E M +     D  TY  +I  LC   M+ 
Subjt:  QMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMIS

Query:  EALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSA
        EA   L  M    + P  +T ++V+   C  H    + N    ++G+G   + T++ ++I  +   G R  A
Subjt:  EALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSA

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.5e-45 | 27.62
Query:  NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMV-EVVG
        ++S FN L       ++L  A+ +L ++ SYGLVP+ +TF+ +++ Y ++G+LD A R+ EQM+  G   ++ +V  +V+ FCK G+ + AL  + E+  
Subjt:  NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMV-EVVG

Query:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV
        + G  P   T+N L+ GLC  G V+ A E++  M ++   PD+YTY +++ GLCK+G   EA+E+L++       P+ VT+NTL +  CKE +  +   +
Subjt:  RIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV

Query:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS
           +     +PD  T+ +L+ GL      R+A+  ++EM S G   +    N  + +L       K  L++A  + ++M+         TY  LI   C 
Subjt:  LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS

Query:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNP
         N   EA      M   G S  ++T + ++  LC +     +A     ++  G      +++ ++    + G    A+++       G  P
Subjt:  GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNP

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.1e-45 | 30.79
Query:  NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGR
        N+  +N L   +    ++    KLL +++  GL PN  +++++I   C++G +     VL +M  RG+  ++ T   L+  +CK G   +AL M   + R
Subjt:  NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGR

Query:  IGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVL
         G  P+V TY  L+  +C  G +  A E + +M+   L P+  TYT L+DG  + G  +EA  +L E  +NG  PSVVT+N L NG+C  G+  D I VL
Subjt:  IGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVL

Query:  NKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSG
          MK+    PD ++Y+T+L G  +   +  ALR  +EMV  G   +    ++ ++        E+   ++A  ++E+M       D  TY  LI A C  
Subjt:  NKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSG

Query:  NMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGH--GIPFSTTSFDLIIN
          + +AL   + M+ KG  P  +T  V++  L +  S T  A   L  L +   +P   T   LI N
Subjt:  NMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGH--GIPFSTTSFDLIIN

AT5G64320.1 Pentatricopeptide repeat (PPR) superfamily protein | 6.0e-47 | 26.87
Query:  KHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEM
        KH  + N   +  L       N ++ AL+LL  +   G VP++ TF+ +I   CK   ++ A +++ +ML RG  P+D T  +L+N  CK G+   A ++
Subjt:  KHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEM

Query:  V--------------------------------EVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAM
                                         ++V   G  P V TYN L+ G    G V  A E++ +M+     P++Y+YT L+DG CK+G+ DEA 
Subjt:  V--------------------------------EVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAM

Query:  ELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSW
         +LNE   +GLKP+ V FN L + +CKE R  + + +  +M +  C PD  T+ +L+ GL +  +I+ AL   ++M+S G        NT + A  +R  
Subjt:  ELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSW

Query:  KEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDL
             +++A ++  +M      +D  TY  LI+ LC    + +A +    M+  G++P  I+ ++++  LC +     +      ++  G      +F+ 
Subjt:  KEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDL

Query:  IINELNKQGMRLSASNVYGLALKRGVNP
        +IN L + G       ++      G+ P
Subjt:  IINELNKQGMRLSASNVYGLALKRGVNP


Sequences
CDS sequence
ATGCAGGGGCATCAACTTTCAAGGATCATGAAAAACATCGGGAGATCCATGAGCCCTAGTCTTGGGCCTGCTCCTTTAGTCTCCAATAAGCCATCGAGGGAGAATAAACA
AGATTCCTTGCTTGTTGCTTTCAAGCAAGCGATGAAAAGAAAACATACGGTGTTGGTCCAAATAAAAGCACAGTTAATGGCTTTAGCTTCTTCTAAGCTTCTCAGGAGCA
TTGCCTGTTTGGTTTCTGTTGCAGTTGCAGTCGCAGTCGCAGCCAATGCCTTAACAGAAGAAATGGCAGTGATTAAGAAATTTGGGGATGAACACAGCAACTTGTTGGAC
CAATACGAGAGACTGAGCTTTGAAGCCCGACTAAACCAAGCAATGTTGGAGCGGAGTCTTTCGGAGCCGAGGATGCTGAGGTCTCAACCGCAGGTGCTATTTGCTGGCTC
AGTTCAACTACCATACTTGATAACAACTACTAATCAGCCTAGAAAAGGACGTTGTGGATCTACATTCAACTTCAACAACTTACTCAACAACTTACTCAAACCCATTTTGG
AGAGGAAGGGTAGGGCAAAGAAGGAACGCCCACATTTCAAAAACCCCATGGACCAAATTGAAGCTCAGTTCAGTCGCGTCGCCGTGATGCCTTATGAGCTTCTCATCCAA
TCATCTCTGCAAAATCAATCTTTCCTTCCTTTGAAATCCAAACATAAGCGAATGCGTCTCCGTCTCCCCCTTCGCCCCTCTTTCCTCACAACTCTTCCACTCTCTTCCAC
ATATACGCCTTTCGGTGCCAACTTGTTTGAAGAAAACAGCAGGCTAAGCACGAATAAACAATCCCATTCCATCCATACTGCCCTCCGTAGTTTCACTCCCACTAGAATCA
CGAGAATTAAATCACTGCCAATACCGTCGGAAGAAGGGACTGAAATTTTCATCATGTCTCAGAAACACACTGAGATTCGAAACATATCTGAATTCAATGACCTGTTCATG
GATTTCGTCTCAGAAAATGAGCTCCATCTTGCCCTGAAACTGTTGTCCAATATATCATCTTATGGTTTGGTCCCAAATTCTAGAACATTTTCCATCATGATAAGGTGCTA
TTGCAAGAAAGGAGAATTGGATAATGCGGGCAGGGTTTTAGAGCAAATGCTGGGAAGGGGTCATTACCCAAACGATGCCACCGTCGCATTTCTCGTGAATGCTTTCTGCA
AAAGGGGTAAAACGCAGAAAGCTTTAGAAATGGTTGAGGTGGTGGGAAGGATTGGACGCAAGCCGACAGTTAAGACATACAATTGTTTGTTGAAAGGCCTATGCTATGTT
GGGAGAGTGGAAGAGGCATGCGAAATGGTGACGGAAATGAAGAAGGATAGCTTGATACCTGACATTTACACCTATACGGCTCTTATGGATGGCTTGTGTAAGGTAGGGCG
ATCAGACGAGGCAATGGAATTGCTCAATGAAGCAGAGGAAAATGGTTTAAAACCAAGTGTAGTCACTTTCAACACCCTCTTCAATGGCTACTGCAAGGAGGGCAGGCCAG
TGGATGGAATCTATGTCTTGAACAAAATGAAGCAAATGAACTGTATGCCCGATCGCATTACTTACACTACTCTGCTACATGGGCTGATAAAATGGGGTAAAATCCGAATA
GCCTTGAGGACATACAAGGAAATGGTTAGCTCAGGTCACACCATCGAAGCAAAAATGATGAATACCTTCATGAGAGCGTTAAGCAAGAGATCTTGGAAAGAAAAGGATCT
ATTGGAAGATGCCCATCAAGTGTTTGAGAAAATGAAAGACGACTTTCAAGTTATTGATCGGAGTACATATGGCCTGCTGATCCAAGCACTCTGTTCAGGAAACATGATTT
CAGAGGCTCTGGCAAATTTGCATCATATGATTGGAAAAGGGTACTCTCCAAGGGCAATTACCATTGATGTTGTTGTTCAATCGCTTTGTCACACTCACAGCCACACGGGA
AGCGCCAATGAAGCATTGTGTCTTTTGGGGCATGGAATCCCTTTCAGTACAACTTCTTTCGACTTGATAATCAATGAGCTAAATAAACAAGGAATGCGGTTAAGTGCTTC
TAATGTATATGGTTTGGCTCTGAAACGAGGTGTTAACCCCACGAAAACCCCGAGATAA
mRNA sequence
ATGCAGGGGCATCAACTTTCAAGGATCATGAAAAACATCGGGAGATCCATGAGCCCTAGTCTTGGGCCTGCTCCTTTAGTCTCCAATAAGCCATCGAGGGAGAATAAACA
AGATTCCTTGCTTGTTGCTTTCAAGCAAGCGATGAAAAGAAAACATACGGTGTTGGTCCAAATAAAAGCACAGTTAATGGCTTTAGCTTCTTCTAAGCTTCTCAGGAGCA
TTGCCTGTTTGGTTTCTGTTGCAGTTGCAGTCGCAGTCGCAGCCAATGCCTTAACAGAAGAAATGGCAGTGATTAAGAAATTTGGGGATGAACACAGCAACTTGTTGGAC
CAATACGAGAGACTGAGCTTTGAAGCCCGACTAAACCAAGCAATGTTGGAGCGGAGTCTTTCGGAGCCGAGGATGCTGAGGTCTCAACCGCAGGTGCTATTTGCTGGCTC
AGTTCAACTACCATACTTGATAACAACTACTAATCAGCCTAGAAAAGGACGTTGTGGATCTACATTCAACTTCAACAACTTACTCAACAACTTACTCAAACCCATTTTGG
AGAGGAAGGGTAGGGCAAAGAAGGAACGCCCACATTTCAAAAACCCCATGGACCAAATTGAAGCTCAGTTCAGTCGCGTCGCCGTGATGCCTTATGAGCTTCTCATCCAA
TCATCTCTGCAAAATCAATCTTTCCTTCCTTTGAAATCCAAACATAAGCGAATGCGTCTCCGTCTCCCCCTTCGCCCCTCTTTCCTCACAACTCTTCCACTCTCTTCCAC
ATATACGCCTTTCGGTGCCAACTTGTTTGAAGAAAACAGCAGGCTAAGCACGAATAAACAATCCCATTCCATCCATACTGCCCTCCGTAGTTTCACTCCCACTAGAATCA
CGAGAATTAAATCACTGCCAATACCGTCGGAAGAAGGGACTGAAATTTTCATCATGTCTCAGAAACACACTGAGATTCGAAACATATCTGAATTCAATGACCTGTTCATG
GATTTCGTCTCAGAAAATGAGCTCCATCTTGCCCTGAAACTGTTGTCCAATATATCATCTTATGGTTTGGTCCCAAATTCTAGAACATTTTCCATCATGATAAGGTGCTA
TTGCAAGAAAGGAGAATTGGATAATGCGGGCAGGGTTTTAGAGCAAATGCTGGGAAGGGGTCATTACCCAAACGATGCCACCGTCGCATTTCTCGTGAATGCTTTCTGCA
AAAGGGGTAAAACGCAGAAAGCTTTAGAAATGGTTGAGGTGGTGGGAAGGATTGGACGCAAGCCGACAGTTAAGACATACAATTGTTTGTTGAAAGGCCTATGCTATGTT
GGGAGAGTGGAAGAGGCATGCGAAATGGTGACGGAAATGAAGAAGGATAGCTTGATACCTGACATTTACACCTATACGGCTCTTATGGATGGCTTGTGTAAGGTAGGGCG
ATCAGACGAGGCAATGGAATTGCTCAATGAAGCAGAGGAAAATGGTTTAAAACCAAGTGTAGTCACTTTCAACACCCTCTTCAATGGCTACTGCAAGGAGGGCAGGCCAG
TGGATGGAATCTATGTCTTGAACAAAATGAAGCAAATGAACTGTATGCCCGATCGCATTACTTACACTACTCTGCTACATGGGCTGATAAAATGGGGTAAAATCCGAATA
GCCTTGAGGACATACAAGGAAATGGTTAGCTCAGGTCACACCATCGAAGCAAAAATGATGAATACCTTCATGAGAGCGTTAAGCAAGAGATCTTGGAAAGAAAAGGATCT
ATTGGAAGATGCCCATCAAGTGTTTGAGAAAATGAAAGACGACTTTCAAGTTATTGATCGGAGTACATATGGCCTGCTGATCCAAGCACTCTGTTCAGGAAACATGATTT
CAGAGGCTCTGGCAAATTTGCATCATATGATTGGAAAAGGGTACTCTCCAAGGGCAATTACCATTGATGTTGTTGTTCAATCGCTTTGTCACACTCACAGCCACACGGGA
AGCGCCAATGAAGCATTGTGTCTTTTGGGGCATGGAATCCCTTTCAGTACAACTTCTTTCGACTTGATAATCAATGAGCTAAATAAACAAGGAATGCGGTTAAGTGCTTC
TAATGTATATGGTTTGGCTCTGAAACGAGGTGTTAACCCCACGAAAACCCCGAGATAA
Protein sequence
MQGHQLSRIMKNIGRSMSPSLGPAPLVSNKPSRENKQDSLLVAFKQAMKRKHTVLVQIKAQLMALASSKLLRSIACLVSVAVAVAVAANALTEEMAVIKKFGDEHSNLLD
QYERLSFEARLNQAMLERSLSEPRMLRSQPQVLFAGSVQLPYLITTTNQPRKGRCGSTFNFNNLLNNLLKPILERKGRAKKERPHFKNPMDQIEAQFSRVAVMPYELLIQ
SSLQNQSFLPLKSKHKRMRLRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFTPTRITRIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFM
DFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYV
GRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRI
ALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTG
SANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR
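
The relationship between the CDS and protein sequences listed above can be spot-checked with a short translation routine. The following is a minimal sketch, assuming the standard nuclear genetic code; the function name `translate` and the two short fragments (the first and last codons of the CDS listing) are chosen here for illustration and are not part of the database record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the record uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Unknown or ambiguous codons are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 8 codons and last 10 codons of the CDS listed above.
cds_start = "ATGCAGGGGCATCAACTTTCAAGG"
cds_end = "GGTGTTAACCCCACGAAAACCCCGAGATAA"

print(translate(cds_start))  # MQGHQLSR
print(translate(cds_end))    # GVNPTKTPR
```

Both fragments reproduce the corresponding ends of the protein sequence. Passing the full CDS, with line breaks removed, to `translate` gives the same kind of check for the whole record.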