; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019876 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019876
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153424:522620..524361
RNA-Seq ExpressionSgr019876
SyntenySgr019876
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038402.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]5.5e-25579.38Show/hide
Query:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM
        MFS+E LPQSL F+ PL KPT  Q+H  S+ +    R  +  + RN     +SAE+R+ H P+ DNR+ HLMKLLNRSCRAGKHNESLYFLESVVSKG+ 
Subjt:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM

Query:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK
        PDVVLCTKL+KGFFNSRNLKKAVRV EILETYGDPDVYSYNAMISGFSKANQI+ ANQV DRM SRGFSPDIVTYNIMIGSLCSRGKL+ AFEV+DELLK
Subjt:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK

Query:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM
        DGCKPSVITYTILIEATILEG+INEALELFD+LLSRGL PD+YTYNAIIRGICKEGMEDRAV F+RDL+ RGCNPDV+SYNILLRSFLNK +WE+GEKLM
Subjt:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM

Query:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ
        ++MVLSGCEPNVVT+SILISS CREG+V EAVNVL VMKEKGLTPD+YSYDPLISAFCKEGRLDLAIE+L KMVS+GCLPDIVNYNTILATLCKFG A  
Subjt:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ

Query:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA
        ALDIFEKLD+                LW    + +       MI KGID DEITYNSLISCLCRDGLVDEAI LLVDMEATSFQPTVISFNIVLLGMCKA
Subjt:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA

Query:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN
        HR+ EGI LLITMVEKG  PNET+YVLLIEGIAYA WRAEAMELANSLYRLGVI EDSS+RLNKTFPM DVYKGLSLSESKN
Subjt:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN

TYJ96990.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]7.2e-25579.38Show/hide
Query:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM
        MFS+E LPQS  F+ PL KPT  Q+H  S+ +    R  +  + RN     +SAE+R+ H P+ DNR+ HLMKLLNRSCRAGKHNESLYFLESVVSKG+ 
Subjt:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM

Query:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK
        PDVVLCTKL+KGFFNSRNLKKAVRV EILETYGDPDVYSYNAMISGFSKANQI+ ANQV DRM SRGFSPDIVTYNIMIGSLCSRGKL+ AFEV+DELLK
Subjt:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK

Query:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM
        DGCKPSVITYTILIEATILEG+INEALELFD+LLSRGL PD+YTYNAIIRGICKEGMEDRAV F+RDL+ RGCNPDV+SYNILLRSFLNK +WE+GEKLM
Subjt:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM

Query:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ
        ++MVLSGCEPNVVT+SILISS CREG+V EAVNVL VMKEKGLTPD+YSYDPLISAFCKEGRLDLAIE+L KMVS+GCLPDIVNYNTILATLCKFG A  
Subjt:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ

Query:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA
        ALDIFEKLDE                LW    + +       MI KGID DEITYNSLISCLCRDGLVDEAI LLVDMEATSFQPTVISFNIVLLGMCKA
Subjt:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA

Query:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN
        HR+ EGI LLITMVEKG  PNET+YVLLIEGIAYA WRAEAMELANSLYRLGVI EDSS+RLNKTFPM DVYKGLSLSESKN
Subjt:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN

XP_008443759.1 PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis melo]7.2e-25579.55Show/hide
Query:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM
        MFS+E LPQSL F+ PL KPT  Q+H  S+ +    R  +  + RN     +SAE+R+ H P+ DNR+ HLMKLLNRSCRAGKHNESLYFLESVVSKG+ 
Subjt:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM

Query:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK
        PDVVLCTKL+KGFFNSRNLKKAVRV EILETYGDPDVYSYNAMISGFSKANQI+ ANQV DRM SRGFSPDIVTYNIMIGSLCSRGKL  AFEV+DELLK
Subjt:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK

Query:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM
        DGCKPSVITYTILIEATILEG+INEALELFD+LLSRGL PD+YTYNAIIRGICKEGMEDRAV F+RDL+ RGCNPDV+SYNILLRSFLNK +WE+GEKLM
Subjt:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM

Query:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ
        ++MVLSGCEPNVVT+SILISS CREG+V EAVNVL VMKEKGLTPD+YSYDPLISAFCKEGRLDLAIE+L KMVS+GCLPDIVNYNTILATLCKFG A  
Subjt:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ

Query:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA
        ALDIFEKLDE                LW    + +       MI KGID DEITYNSLISCLCRDGLVDEAI LLVDMEATSFQPTVISFNIVLLGMCKA
Subjt:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA

Query:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN
        HR+ EGI LLITMVEKG  PNET+YVLLIEGIAYA WRAEAMELANSLYRLGVI EDSS+RLNKTFPM DVYKGLSLSESKN
Subjt:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN

XP_022158521.1 pentatricopeptide repeat-containing protein At3g04760, chloroplastic-like [Momordica charantia]3.7e-26781.55Show/hide
Query:  MFSTELLPQSLPFSTPLKPTSQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYMPD
        MFS E LP +LPF+ P KPT QTH +SL+SCR+PR+         L+NGASA TR+ HLP FDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKG+ PD
Subjt:  MFSTELLPQSLPFSTPLKPTSQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYMPD

Query:  VVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLKDG
        VVLCTKL+KGFFNSRNLKKAVRV EILE YGDPDVYSYNAMISGFSKANQIE AN+V DRM SRGFSPD+VTYNIMIGSLCSRGKL+  FEV+DELLKDG
Subjt:  VVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLKDG

Query:  CKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLMEN
        CKPSVITYTILI+ATILEG+I++ LELFD++LSRGL PDMYTYNAIIRGICKEGMEDRAVAFIRDL  RGCNPDVISYNILLRSFLNKR+WEEGEKLM++
Subjt:  CKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLMEN

Query:  MVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQAL
        MVLSGCEPNVVT+SILISSLCREGKVGEAVNVL VMKEKGLTPD+YSYDPLISAFCKEGRLDLAIEFLHKM S+GCLPDIVNYNTILATLCKFGSA +AL
Subjt:  MVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQAL

Query:  DIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKAHR
        DIFEKL+E                LW    + +       MISKGID DEITYNSLISCLCRDGLVDEA+ LLVDME+TSF+PTVISFNIVLLGMCKAHR
Subjt:  DIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKAHR

Query:  ILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN
        ILEGI LLITMVEKG LPNET+YVLLIEGIAYA WRAEAMELAN LYRLGVICEDSS+RLNKTFPM  VYKGLSLS  KN
Subjt:  ILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN

XP_038880759.1 pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Benincasa hispida]2.5e-25578.9Show/hide
Query:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGA-SAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGY
        MFS+E LPQSL F+ PL KPT  ++H  SLV+ +          +  L+NGA SAE+RE H  +  NR+ HLMKLLNRSCRAGKHNESLYFLESVVSKG+
Subjt:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGA-SAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGY

Query:  MPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELL
         PDVVLCTKL+KGFFNSRNLKKA+RV EILETYGDPDVYSYNAMISGFSKANQIE AN+V DRM SRGFSPD+VTYNIMIG LCSRGKL+ AFEV+DELL
Subjt:  MPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELL

Query:  KDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKL
        KDGCKPSVITYTILIEATILEG+INEALELFD+LLSRGL PD+YTYNAIIRGICKEGMEDRAV F++ L+ RGCNPDVISYNILLRSFLNK +W +GEKL
Subjt:  KDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKL

Query:  MENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAV
        M++MVL GCEPNVVT+SILISSLCREG+VGEAVNVL VMKEKGLTPD+YSYDPLISAFCKEGRLDLAIE+LHKMVS+GCLPDIVNYNTILATLCKFGSA 
Subjt:  MENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAV

Query:  QALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCK
         ALDIFEKLDE                LW   ++ +       MI KGID DEITYNSLISCLCRDGLVDEAI LLVDMEAT+FQPTVISFNIVLLGMCK
Subjt:  QALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCK

Query:  AHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN
        AHR+ EGI LLITMVEKG +PN+T+YVLLIEGIAYA WRAEAMELAN+LYRLGVICEDSS+RLNKTFPM DVYKGLSLSESKN
Subjt:  AHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN

TrEMBL top hitse value%identityAlignment
A0A0A0M3C6 Uncharacterized protein6.6e-25478.52Show/hide
Query:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM
        MFS+E LPQSL F+ PL KPT  Q+   S+ +C   R  +  H RN     +SAE R+ H P+ DNR+ HLMKLLNRSCRAGKHNESLYFLESVVSKG+ 
Subjt:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM

Query:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK
        PDVVLCTKL+KGFFNSRNLKKA+RV EILETYGDPDVYSYNAMISGFSKANQI+ ANQV DRM SRGFSPD+VTYNIMIGSLCSRGKL+ AFEV+DELLK
Subjt:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK

Query:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM
        DGCKPSVITYTILIEATILEG+INEALELFD+L+SRGL PD+YTYNAIIRGICKEGMEDRA+ F+R L+ RGCNPDV+SYNILLRSFLNK +WE+GE+LM
Subjt:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM

Query:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ
        ++MVLSGCEPNVVT+SILISS CREG+V EAVNVL VMKEKGLTPDSYSYDPLISAFCKEGRLDLAIE+L KMVS+GCLPDIVNYNTILATLCKFG A  
Subjt:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ

Query:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA
        ALD+FEKLDE                LW    + +       MI KGID DEITYNSLISCLCRDGLVDEAI LLVDMEAT FQPTVISFNIVLLGMCKA
Subjt:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA

Query:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN
        HR+ EGI LLITMVEKG LPNET+YVLLIEGIAYA WRAEAMELANSLYRLGVI  DSS+RLNKTFPM DVYKGLSLSESKN
Subjt:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN

A0A1S3B9K3 pentatricopeptide repeat-containing protein At3g04760, chloroplastic3.5e-25579.55Show/hide
Query:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM
        MFS+E LPQSL F+ PL KPT  Q+H  S+ +    R  +  + RN     +SAE+R+ H P+ DNR+ HLMKLLNRSCRAGKHNESLYFLESVVSKG+ 
Subjt:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM

Query:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK
        PDVVLCTKL+KGFFNSRNLKKAVRV EILETYGDPDVYSYNAMISGFSKANQI+ ANQV DRM SRGFSPDIVTYNIMIGSLCSRGKL  AFEV+DELLK
Subjt:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK

Query:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM
        DGCKPSVITYTILIEATILEG+INEALELFD+LLSRGL PD+YTYNAIIRGICKEGMEDRAV F+RDL+ RGCNPDV+SYNILLRSFLNK +WE+GEKLM
Subjt:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM

Query:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ
        ++MVLSGCEPNVVT+SILISS CREG+V EAVNVL VMKEKGLTPD+YSYDPLISAFCKEGRLDLAIE+L KMVS+GCLPDIVNYNTILATLCKFG A  
Subjt:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ

Query:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA
        ALDIFEKLDE                LW    + +       MI KGID DEITYNSLISCLCRDGLVDEAI LLVDMEATSFQPTVISFNIVLLGMCKA
Subjt:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA

Query:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN
        HR+ EGI LLITMVEKG  PNET+YVLLIEGIAYA WRAEAMELANSLYRLGVI EDSS+RLNKTFPM DVYKGLSLSESKN
Subjt:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN

A0A5A7T4J1 Pentatricopeptide repeat-containing protein2.7e-25579.38Show/hide
Query:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM
        MFS+E LPQSL F+ PL KPT  Q+H  S+ +    R  +  + RN     +SAE+R+ H P+ DNR+ HLMKLLNRSCRAGKHNESLYFLESVVSKG+ 
Subjt:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM

Query:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK
        PDVVLCTKL+KGFFNSRNLKKAVRV EILETYGDPDVYSYNAMISGFSKANQI+ ANQV DRM SRGFSPDIVTYNIMIGSLCSRGKL+ AFEV+DELLK
Subjt:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK

Query:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM
        DGCKPSVITYTILIEATILEG+INEALELFD+LLSRGL PD+YTYNAIIRGICKEGMEDRAV F+RDL+ RGCNPDV+SYNILLRSFLNK +WE+GEKLM
Subjt:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM

Query:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ
        ++MVLSGCEPNVVT+SILISS CREG+V EAVNVL VMKEKGLTPD+YSYDPLISAFCKEGRLDLAIE+L KMVS+GCLPDIVNYNTILATLCKFG A  
Subjt:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ

Query:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA
        ALDIFEKLD+                LW    + +       MI KGID DEITYNSLISCLCRDGLVDEAI LLVDMEATSFQPTVISFNIVLLGMCKA
Subjt:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA

Query:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN
        HR+ EGI LLITMVEKG  PNET+YVLLIEGIAYA WRAEAMELANSLYRLGVI EDSS+RLNKTFPM DVYKGLSLSESKN
Subjt:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN

A0A5D3BAX6 Pentatricopeptide repeat-containing protein3.5e-25579.38Show/hide
Query:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM
        MFS+E LPQS  F+ PL KPT  Q+H  S+ +    R  +  + RN     +SAE+R+ H P+ DNR+ HLMKLLNRSCRAGKHNESLYFLESVVSKG+ 
Subjt:  MFSTELLPQSLPFSTPL-KPT-SQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYM

Query:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK
        PDVVLCTKL+KGFFNSRNLKKAVRV EILETYGDPDVYSYNAMISGFSKANQI+ ANQV DRM SRGFSPDIVTYNIMIGSLCSRGKL+ AFEV+DELLK
Subjt:  PDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLK

Query:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM
        DGCKPSVITYTILIEATILEG+INEALELFD+LLSRGL PD+YTYNAIIRGICKEGMEDRAV F+RDL+ RGCNPDV+SYNILLRSFLNK +WE+GEKLM
Subjt:  DGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLM

Query:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ
        ++MVLSGCEPNVVT+SILISS CREG+V EAVNVL VMKEKGLTPD+YSYDPLISAFCKEGRLDLAIE+L KMVS+GCLPDIVNYNTILATLCKFG A  
Subjt:  ENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQ

Query:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA
        ALDIFEKLDE                LW    + +       MI KGID DEITYNSLISCLCRDGLVDEAI LLVDMEATSFQPTVISFNIVLLGMCKA
Subjt:  ALDIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKA

Query:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN
        HR+ EGI LLITMVEKG  PNET+YVLLIEGIAYA WRAEAMELANSLYRLGVI EDSS+RLNKTFPM DVYKGLSLSESKN
Subjt:  HRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN

A0A6J1E150 pentatricopeptide repeat-containing protein At3g04760, chloroplastic-like1.8e-26781.55Show/hide
Query:  MFSTELLPQSLPFSTPLKPTSQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYMPD
        MFS E LP +LPF+ P KPT QTH +SL+SCR+PR+         L+NGASA TR+ HLP FDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKG+ PD
Subjt:  MFSTELLPQSLPFSTPLKPTSQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYMPD

Query:  VVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLKDG
        VVLCTKL+KGFFNSRNLKKAVRV EILE YGDPDVYSYNAMISGFSKANQIE AN+V DRM SRGFSPD+VTYNIMIGSLCSRGKL+  FEV+DELLKDG
Subjt:  VVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLKDG

Query:  CKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLMEN
        CKPSVITYTILI+ATILEG+I++ LELFD++LSRGL PDMYTYNAIIRGICKEGMEDRAVAFIRDL  RGCNPDVISYNILLRSFLNKR+WEEGEKLM++
Subjt:  CKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLMEN

Query:  MVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQAL
        MVLSGCEPNVVT+SILISSLCREGKVGEAVNVL VMKEKGLTPD+YSYDPLISAFCKEGRLDLAIEFLHKM S+GCLPDIVNYNTILATLCKFGSA +AL
Subjt:  MVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQAL

Query:  DIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKAHR
        DIFEKL+E                LW    + +       MISKGID DEITYNSLISCLCRDGLVDEA+ LLVDME+TSF+PTVISFNIVLLGMCKAHR
Subjt:  DIFEKLDECTL------------ELW---EQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKAHR

Query:  ILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN
        ILEGI LLITMVEKG LPNET+YVLLIEGIAYA WRAEAMELAN LYRLGVICEDSS+RLNKTFPM  VYKGLSLS  KN
Subjt:  ILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFPMFDVYKGLSLSESKN

SwissProt top hitse value%identityAlignment
A3KPF8 Pentatricopeptide repeat-containing protein At1g79080, chloroplastic9.0e-7534.09Show/hide
Query:  NESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYG-DPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLC
        ++S   LES+V+ G+ P+V   T+L+     +  LKKA+RV E++ + G  PD  +Y  +++   K   +  A Q++++M   G+  + VTYN ++  LC
Subjt:  NESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYG-DPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLC

Query:  SRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNIL
          G L+ + + ++ L++ G  P+  TY+ L+EA   E   +EA++L D+++ +G  P++ +YN ++ G CKEG  D A+A  R+L  +G   +V+SYNIL
Subjt:  SRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNIL

Query:  LRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKG--LTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPD
        LR      +WEE   L+  M      P+VVTY+ILI+SL   G+  +A+ VL  M +        + SY+P+I+  CKEG++DL ++ L +M+   C P+
Subjt:  LRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKG--LTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPD

Query:  IVNYNTILATLCKFGSAVQ-ALDIFEKLDE----CTLELWEQDQGS-----------RNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDM-E
           YN I  +LC+  S VQ A  I + L      CT + ++    S           +    M   G D D  TY++LI  LC +G+   A+E+L  M E
Subjt:  IVNYNTILATLCKFGSAVQ-ALDIFEKLDE----CTLELWEQDQGS-----------RNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDM-E

Query:  ATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTF
        + + +PTV +FN ++LG+CK  R    + +   MVEK  +PNETTY +L+EGIA+ +    A E+ + L    VI +++  R+   F
Subjt:  ATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTF

Q3EDF8 Pentatricopeptide repeat-containing protein At1g099002.9e-11342.54Show/hide
Query:  LNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGD-PDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIV
        L +  R G+  E   FLE++V  G +PD++ CT L++GF      +KA ++ EILE  G  PDV +YN MISG+ KA +I  A  VLDRMS    SPD+V
Subjt:  LNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGD-PDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIV

Query:  TYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGC
        TYN ++ SLC  GKL  A EV+D +L+  C P VITYTILIEAT  +  +  A++L D++  RG  PD+ TYN ++ GICKEG  D A+ F+ D+   GC
Subjt:  TYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGC

Query:  NPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKM
         P+VI++NI+LRS  +  +W + EKL+ +M+  G  P+VVT++ILI+ LCR+G +G A+++L  M + G  P+S SY+PL+  FCKE ++D AIE+L +M
Subjt:  NPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKM

Query:  VSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE--CT-------------LELWEQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIE
        VS GC PDIV YNT+L  LCK G    A++I  +L    C+              +  +  +  +    M +K +  D ITY+SL+  L R+G VDEAI+
Subjt:  VSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE--CT-------------LELWEQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIE

Query:  LLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRL
           + E    +P  ++FN ++LG+CK+ +    I  L+ M+ +G  PNET+Y +LIEG+AY     EA+EL N L   G++ + S+ ++
Subjt:  LLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRL

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic1.1e-7231.84Show/hide
Query:  CRAGKHNESLYFLESVVSK-GYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYG-DPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYN
        C+ G+  ++L F++ + ++ G+ PD      LV G   + ++K A+ + +++   G DPDVY+YN++ISG  K  +++ A +VLD+M +R  SP+ VTYN
Subjt:  CRAGKHNESLYFLESVVSK-GYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYG-DPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYN

Query:  IMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPD
         +I +LC   +++ A E+   L   G  P V T+  LI+   L      A+ELF+++ S+G  PD +TYN +I  +C +G  D A+  ++ + + GC   
Subjt:  IMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPD

Query:  VISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSN
        VI+YN L+  F    K  E E++ + M + G   N VTY+ LI  LC+  +V +A  +++ M  +G  PD Y+Y+ L++ FC+ G +  A + +  M SN
Subjt:  VISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSN

Query:  GCLPDIVNYNTILATLCKFGSAVQALDIFEKLDECTLELWEQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDM-EATSFQPTVISF
        GC PDIV Y T+++ LCK G    A  +   +                      KGI+L    YN +I  L R     EAI L  +M E     P  +S+
Subjt:  GCLPDIVNYNTILATLCKFGSAVQALDIFEKLDECTLELWEQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDM-EATSFQPTVISF

Query:  NIVLLGMCK-AHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICED
         IV  G+C     I E +  L+ ++EKG++P  ++  +L EG+         ++L N + +     E+
Subjt:  NIVLLGMCK-AHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICED

Q9SFV9 Pentatricopeptide repeat-containing protein At3g07290, mitochondrial3.1e-6729.84Show/hide
Query:  LLNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEIL--ETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPD
        ++N  C+ G    +  F+  ++  G++ D  + T L+ GF    NL+ A++V +++  E    P+  SY+ +I G  +  ++E A  + D+M  +G  P 
Subjt:  LLNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEIL--ETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPD

Query:  IVTYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVR
          TY ++I +LC RG +D AF + DE++  GCKP+V TYT+LI+    +GKI EA  +  K++   + P + TYNA+I G CK+G    A   +  +  R
Subjt:  IVTYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVR

Query:  GCNPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLH
         C P+V ++N L+       K  +   L++ M+ +G  P++V+Y++LI  LCREG +  A  +L+ M    + PD  ++  +I+AFCK+G+ D+A  FL 
Subjt:  GCNPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLH

Query:  KMVSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE-----------CTLELWEQDQGSRNDIRMISK----GIDLDEITYNSLISCLCRDGLVDEA
         M+  G   D V   T++  +CK G    AL I E L +             L++  +    + ++ M+ K    G+    +TY +L+  L R G +  +
Subjt:  KMVSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE-----------CTLELWEQDQGSRNDIRMISK----GIDLDEITYNSLISCLCRDGLVDEA

Query:  IELLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICED
          +L  M+ +   P V  + I++ G+C+  R+ E   LL  M + G  PN  TY ++++G         A+E   ++   G    D
Subjt:  IELLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICED

Q9SR00 Pentatricopeptide repeat-containing protein At3g04760, chloroplastic4.6e-18862.67Show/hide
Query:  ETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIE
        E R+ H  S   R+T ++K+ +RSCR+G + ESL+ LE++V KGY PDV+LCTKL+KGFF  RN+ KAVRV EILE +G PDV++YNA+I+GF K N+I+
Subjt:  ETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIE

Query:  CANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICK
         A +VLDRM S+ FSPD VTYNIMIGSLCSRGKLD A +V+++LL D C+P+VITYTILIEAT+LEG ++EAL+L D++LSRGL PDM+TYN IIRG+CK
Subjt:  CANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICK

Query:  EGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLI
        EGM DRA   +R+L ++GC PDVISYNILLR+ LN+ KWEEGEKLM  M    C+PNVVTYSILI++LCR+GK+ EA+N+L +MKEKGLTPD+YSYDPLI
Subjt:  EGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLI

Query:  SAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE--CT----------LELWEQDQGSR---NDIRMISKGIDLDEIT
        +AFC+EGRLD+AIEFL  M+S+GCLPDIVNYNT+LATLCK G A QAL+IF KL E  C+            LW      R     + M+S GID DEIT
Subjt:  SAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE--CT----------LELWEQDQGSR---NDIRMISKGIDLDEIT

Query:  YNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVI
        YNS+ISCLCR+G+VDEA ELLVDM +  F P+V+++NIVLLG CKAHRI + I +L +MV  G  PNETTY +LIEGI +A +RAEAMELAN L R+  I
Subjt:  YNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVI

Query:  CEDSSRRLNKTFPMFDV
         E S +RL++TFP+ +V
Subjt:  CEDSSRRLNKTFPMFDV

Arabidopsis top hitse value%identityAlignment
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.0e-11442.54Show/hide
Query:  LNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGD-PDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIV
        L +  R G+  E   FLE++V  G +PD++ CT L++GF      +KA ++ EILE  G  PDV +YN MISG+ KA +I  A  VLDRMS    SPD+V
Subjt:  LNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGD-PDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIV

Query:  TYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGC
        TYN ++ SLC  GKL  A EV+D +L+  C P VITYTILIEAT  +  +  A++L D++  RG  PD+ TYN ++ GICKEG  D A+ F+ D+   GC
Subjt:  TYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGC

Query:  NPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKM
         P+VI++NI+LRS  +  +W + EKL+ +M+  G  P+VVT++ILI+ LCR+G +G A+++L  M + G  P+S SY+PL+  FCKE ++D AIE+L +M
Subjt:  NPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKM

Query:  VSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE--CT-------------LELWEQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIE
        VS GC PDIV YNT+L  LCK G    A++I  +L    C+              +  +  +  +    M +K +  D ITY+SL+  L R+G VDEAI+
Subjt:  VSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE--CT-------------LELWEQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIE

Query:  LLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRL
           + E    +P  ++FN ++LG+CK+ +    I  L+ M+ +G  PNET+Y +LIEG+AY     EA+EL N L   G++ + S+ ++
Subjt:  LLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRL

AT1G79080.1 Pentatricopeptide repeat (PPR) superfamily protein6.4e-7634.09Show/hide
Query:  NESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYG-DPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLC
        ++S   LES+V+ G+ P+V   T+L+     +  LKKA+RV E++ + G  PD  +Y  +++   K   +  A Q++++M   G+  + VTYN ++  LC
Subjt:  NESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYG-DPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLC

Query:  SRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNIL
          G L+ + + ++ L++ G  P+  TY+ L+EA   E   +EA++L D+++ +G  P++ +YN ++ G CKEG  D A+A  R+L  +G   +V+SYNIL
Subjt:  SRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNIL

Query:  LRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKG--LTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPD
        LR      +WEE   L+  M      P+VVTY+ILI+SL   G+  +A+ VL  M +        + SY+P+I+  CKEG++DL ++ L +M+   C P+
Subjt:  LRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKG--LTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPD

Query:  IVNYNTILATLCKFGSAVQ-ALDIFEKLDE----CTLELWEQDQGS-----------RNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDM-E
           YN I  +LC+  S VQ A  I + L      CT + ++    S           +    M   G D D  TY++LI  LC +G+   A+E+L  M E
Subjt:  IVNYNTILATLCKFGSAVQ-ALDIFEKLDE----CTLELWEQDQGS-----------RNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDM-E

Query:  ATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTF
        + + +PTV +FN ++LG+CK  R    + +   MVEK  +PNETTY +L+EGIA+ +    A E+ + L    VI +++  R+   F
Subjt:  ATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTF

AT3G04760.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.3e-18962.67Show/hide
Query:  ETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIE
        E R+ H  S   R+T ++K+ +RSCR+G + ESL+ LE++V KGY PDV+LCTKL+KGFF  RN+ KAVRV EILE +G PDV++YNA+I+GF K N+I+
Subjt:  ETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIE

Query:  CANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICK
         A +VLDRM S+ FSPD VTYNIMIGSLCSRGKLD A +V+++LL D C+P+VITYTILIEAT+LEG ++EAL+L D++LSRGL PDM+TYN IIRG+CK
Subjt:  CANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICK

Query:  EGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLI
        EGM DRA   +R+L ++GC PDVISYNILLR+ LN+ KWEEGEKLM  M    C+PNVVTYSILI++LCR+GK+ EA+N+L +MKEKGLTPD+YSYDPLI
Subjt:  EGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLI

Query:  SAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE--CT----------LELWEQDQGSR---NDIRMISKGIDLDEIT
        +AFC+EGRLD+AIEFL  M+S+GCLPDIVNYNT+LATLCK G A QAL+IF KL E  C+            LW      R     + M+S GID DEIT
Subjt:  SAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE--CT----------LELWEQDQGSR---NDIRMISKGIDLDEIT

Query:  YNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVI
        YNS+ISCLCR+G+VDEA ELLVDM +  F P+V+++NIVLLG CKAHRI + I +L +MV  G  PNETTY +LIEGI +A +RAEAMELAN L R+  I
Subjt:  YNSLISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVI

Query:  CEDSSRRLNKTFPMFDV
         E S +RL++TFP+ +V
Subjt:  CEDSSRRLNKTFPMFDV

AT3G07290.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-6829.84Show/hide
Query:  LLNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEIL--ETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPD
        ++N  C+ G    +  F+  ++  G++ D  + T L+ GF    NL+ A++V +++  E    P+  SY+ +I G  +  ++E A  + D+M  +G  P 
Subjt:  LLNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKGFFNSRNLKKAVRVTEIL--ETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPD

Query:  IVTYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVR
          TY ++I +LC RG +D AF + DE++  GCKP+V TYT+LI+    +GKI EA  +  K++   + P + TYNA+I G CK+G    A   +  +  R
Subjt:  IVTYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVR

Query:  GCNPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLH
         C P+V ++N L+       K  +   L++ M+ +G  P++V+Y++LI  LCREG +  A  +L+ M    + PD  ++  +I+AFCK+G+ D+A  FL 
Subjt:  GCNPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLH

Query:  KMVSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE-----------CTLELWEQDQGSRNDIRMISK----GIDLDEITYNSLISCLCRDGLVDEA
         M+  G   D V   T++  +CK G    AL I E L +             L++  +    + ++ M+ K    G+    +TY +L+  L R G +  +
Subjt:  KMVSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDE-----------CTLELWEQDQGSRNDIRMISK----GIDLDEITYNSLISCLCRDGLVDEA

Query:  IELLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICED
          +L  M+ +   P V  + I++ G+C+  R+ E   LL  M + G  PN  TY ++++G         A+E   ++   G    D
Subjt:  IELLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICED

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein7.8e-7431.84Show/hide
Query:  CRAGKHNESLYFLESVVSK-GYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYG-DPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYN
        C+ G+  ++L F++ + ++ G+ PD      LV G   + ++K A+ + +++   G DPDVY+YN++ISG  K  +++ A +VLD+M +R  SP+ VTYN
Subjt:  CRAGKHNESLYFLESVVSK-GYMPDVVLCTKLVKGFFNSRNLKKAVRVTEILETYG-DPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYN

Query:  IMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPD
         +I +LC   +++ A E+   L   G  P V T+  LI+   L      A+ELF+++ S+G  PD +TYN +I  +C +G  D A+  ++ + + GC   
Subjt:  IMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGKINEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPD

Query:  VISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSN
        VI+YN L+  F    K  E E++ + M + G   N VTY+ LI  LC+  +V +A  +++ M  +G  PD Y+Y+ L++ FC+ G +  A + +  M SN
Subjt:  VISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAVNVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSN

Query:  GCLPDIVNYNTILATLCKFGSAVQALDIFEKLDECTLELWEQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDM-EATSFQPTVISF
        GC PDIV Y T+++ LCK G    A  +   +                      KGI+L    YN +I  L R     EAI L  +M E     P  +S+
Subjt:  GCLPDIVNYNTILATLCKFGSAVQALDIFEKLDECTLELWEQDQGSRNDIRMISKGIDLDEITYNSLISCLCRDGLVDEAIELLVDM-EATSFQPTVISF

Query:  NIVLLGMCK-AHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICED
         IV  G+C     I E +  L+ ++EKG++P  ++  +L EG+         ++L N + +     E+
Subjt:  NIVLLGMCK-AHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTCTACTGAACTACTGCCTCAGAGCCTTCCTTTTTCCACCCCATTGAAGCCCACTTCCCAAACACATTGCAGCTCCCTCGTGAGTTGCAGACTTCCCAGGCTCGA
CGACGACAACCACCCCAGAAACCCTCTAAAAAATGGCGCATCTGCTGAAACGAGAGAAGCCCATCTCCCCAGCTTCGACAACAGAGAGACCCATTTGATGAAGCTCCTCA
ACAGGTCCTGCAGAGCTGGGAAACACAACGAGTCACTCTACTTTCTCGAGAGCGTAGTGAGCAAAGGCTATATGCCGGATGTCGTGCTCTGCACGAAGCTCGTTAAAGGG
TTCTTCAATTCGAGAAATTTGAAGAAAGCTGTTCGGGTTACGGAGATTTTGGAAACTTATGGGGACCCAGATGTTTATTCTTACAATGCTATGATTAGTGGCTTTAGTAA
AGCCAATCAGATTGAGTGTGCAAACCAAGTGCTTGATAGAATGAGCAGCAGAGGATTTTCTCCTGATATTGTTACTTATAATATAATGATTGGGAGTTTGTGTAGTAGGG
GGAAGCTTGATCCTGCTTTTGAGGTCATAGATGAGCTTCTGAAGGATGGATGTAAGCCATCTGTGATTACTTACACAATTCTAATAGAAGCAACCATTCTTGAAGGTAAA
ATCAATGAAGCTTTGGAGCTGTTCGACAAATTGCTATCAAGGGGCCTCCATCCCGACATGTATACATACAATGCCATCATTAGGGGAATCTGCAAGGAAGGAATGGAAGA
TCGGGCTGTGGCGTTTATTCGGGATTTAACCGTAAGAGGCTGTAATCCAGATGTGATTTCATACAACATTCTGCTGCGTTCTTTCCTGAACAAAAGAAAATGGGAAGAGG
GAGAAAAGCTTATGGAAAACATGGTTTTAAGTGGCTGTGAGCCGAATGTTGTTACTTACAGCATTTTAATCAGTTCGTTATGTCGTGAAGGGAAAGTCGGGGAAGCTGTG
AATGTGTTGAACGTGATGAAGGAGAAAGGCTTAACCCCAGATTCATATAGCTATGATCCTCTGATTTCTGCCTTCTGCAAAGAAGGGAGATTAGATTTAGCAATAGAGTT
TTTGCACAAAATGGTCTCTAATGGTTGTTTGCCTGATATTGTAAACTACAATACAATTTTGGCTACTCTCTGTAAATTTGGAAGTGCTGTTCAGGCTTTAGACATCTTTG
AGAAGCTAGATGAATGCACTTTGGAACTGTGGGAGCAAGATCAAGGCTCTAGGAATGATATCAGAATGATAAGCAAAGGAATAGATCTTGACGAGATAACGTATAATTCG
CTGATATCGTGTTTGTGTCGGGATGGGTTGGTCGATGAGGCTATTGAGTTGTTGGTAGACATGGAAGCTACCAGCTTCCAGCCAACAGTGATCAGCTTCAACATTGTCCT
TCTGGGAATGTGCAAAGCACATAGGATTCTTGAAGGCATTGGGTTGCTAATAACAATGGTTGAAAAGGGCTACCTGCCAAACGAAACTACCTACGTCTTGTTGATTGAGG
GGATCGCTTATGCCGAGTGGCGAGCGGAGGCTATGGAGTTGGCTAACTCTCTGTACAGATTGGGAGTTATTTGTGAGGATTCTTCTAGGCGTTTGAACAAGACATTTCCA
ATGTTTGACGTTTATAAAGGGTTAAGCTTATCAGAAAGCAAGAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTTCTACTGAACTACTGCCTCAGAGCCTTCCTTTTTCCACCCCATTGAAGCCCACTTCCCAAACACATTGCAGCTCCCTCGTGAGTTGCAGACTTCCCAGGCTCGA
CGACGACAACCACCCCAGAAACCCTCTAAAAAATGGCGCATCTGCTGAAACGAGAGAAGCCCATCTCCCCAGCTTCGACAACAGAGAGACCCATTTGATGAAGCTCCTCA
ACAGGTCCTGCAGAGCTGGGAAACACAACGAGTCACTCTACTTTCTCGAGAGCGTAGTGAGCAAAGGCTATATGCCGGATGTCGTGCTCTGCACGAAGCTCGTTAAAGGG
TTCTTCAATTCGAGAAATTTGAAGAAAGCTGTTCGGGTTACGGAGATTTTGGAAACTTATGGGGACCCAGATGTTTATTCTTACAATGCTATGATTAGTGGCTTTAGTAA
AGCCAATCAGATTGAGTGTGCAAACCAAGTGCTTGATAGAATGAGCAGCAGAGGATTTTCTCCTGATATTGTTACTTATAATATAATGATTGGGAGTTTGTGTAGTAGGG
GGAAGCTTGATCCTGCTTTTGAGGTCATAGATGAGCTTCTGAAGGATGGATGTAAGCCATCTGTGATTACTTACACAATTCTAATAGAAGCAACCATTCTTGAAGGTAAA
ATCAATGAAGCTTTGGAGCTGTTCGACAAATTGCTATCAAGGGGCCTCCATCCCGACATGTATACATACAATGCCATCATTAGGGGAATCTGCAAGGAAGGAATGGAAGA
TCGGGCTGTGGCGTTTATTCGGGATTTAACCGTAAGAGGCTGTAATCCAGATGTGATTTCATACAACATTCTGCTGCGTTCTTTCCTGAACAAAAGAAAATGGGAAGAGG
GAGAAAAGCTTATGGAAAACATGGTTTTAAGTGGCTGTGAGCCGAATGTTGTTACTTACAGCATTTTAATCAGTTCGTTATGTCGTGAAGGGAAAGTCGGGGAAGCTGTG
AATGTGTTGAACGTGATGAAGGAGAAAGGCTTAACCCCAGATTCATATAGCTATGATCCTCTGATTTCTGCCTTCTGCAAAGAAGGGAGATTAGATTTAGCAATAGAGTT
TTTGCACAAAATGGTCTCTAATGGTTGTTTGCCTGATATTGTAAACTACAATACAATTTTGGCTACTCTCTGTAAATTTGGAAGTGCTGTTCAGGCTTTAGACATCTTTG
AGAAGCTAGATGAATGCACTTTGGAACTGTGGGAGCAAGATCAAGGCTCTAGGAATGATATCAGAATGATAAGCAAAGGAATAGATCTTGACGAGATAACGTATAATTCG
CTGATATCGTGTTTGTGTCGGGATGGGTTGGTCGATGAGGCTATTGAGTTGTTGGTAGACATGGAAGCTACCAGCTTCCAGCCAACAGTGATCAGCTTCAACATTGTCCT
TCTGGGAATGTGCAAAGCACATAGGATTCTTGAAGGCATTGGGTTGCTAATAACAATGGTTGAAAAGGGCTACCTGCCAAACGAAACTACCTACGTCTTGTTGATTGAGG
GGATCGCTTATGCCGAGTGGCGAGCGGAGGCTATGGAGTTGGCTAACTCTCTGTACAGATTGGGAGTTATTTGTGAGGATTCTTCTAGGCGTTTGAACAAGACATTTCCA
ATGTTTGACGTTTATAAAGGGTTAAGCTTATCAGAAAGCAAGAACTAG
Protein sequenceShow/hide protein sequence
MFSTELLPQSLPFSTPLKPTSQTHCSSLVSCRLPRLDDDNHPRNPLKNGASAETREAHLPSFDNRETHLMKLLNRSCRAGKHNESLYFLESVVSKGYMPDVVLCTKLVKG
FFNSRNLKKAVRVTEILETYGDPDVYSYNAMISGFSKANQIECANQVLDRMSSRGFSPDIVTYNIMIGSLCSRGKLDPAFEVIDELLKDGCKPSVITYTILIEATILEGK
INEALELFDKLLSRGLHPDMYTYNAIIRGICKEGMEDRAVAFIRDLTVRGCNPDVISYNILLRSFLNKRKWEEGEKLMENMVLSGCEPNVVTYSILISSLCREGKVGEAV
NVLNVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEFLHKMVSNGCLPDIVNYNTILATLCKFGSAVQALDIFEKLDECTLELWEQDQGSRNDIRMISKGIDLDEITYNS
LISCLCRDGLVDEAIELLVDMEATSFQPTVISFNIVLLGMCKAHRILEGIGLLITMVEKGYLPNETTYVLLIEGIAYAEWRAEAMELANSLYRLGVICEDSSRRLNKTFP
MFDVYKGLSLSESKN