CuGenDBv2

Sgr012807 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr012807
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153561:1244..3920
RNA-Seq Expression: Sgr012807
Synteny: Sgr012807
Gene Ontology terms:
GO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_004135750.2 pentatricopeptide repeat-containing protein At4g21300 [Cucumis sativus] | e value: 5.6e-300 | % identity: 70.25
Query:  SQNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMG
        +QNGDLGP++LGMYV TGSL+DAKN+FYTLQLGCTSAWNWMIRGFTMM                              AC  L +VKMGKIVHETVNLMG
Subjt:  SQNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMG

Query:  LKEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----
        LKED FV G   ++ YA                            GYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVL VC  EAMLDLGTQLH     
Subjt:  LKEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----

Query:  ------------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIV
                                AARKLFD  PQ+DLVSWNGIISGYVQNGLM EAEHLFRGMISAG+KPDSITFASFLPCV ELLSLKHCKEIHGYI+
Subjt:  ------------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIV

Query:  RHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCY
        RHAVVLDVFLKSALIDIYFKCRDVE+A+KIL  SS  D VVCT MISGYVLNG N EALEAFRWL+QERMKPTSVTF+S+FPAFAGLAALNLGKELHG  
Subjt:  RHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCY

Query:  LLELHDYKLF-----------------------------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF
        +    D K                                              + G     FRQMGMEGT+YDCVSISGALSACANLPALHYGKEIHG 
Subjt:  LLELHDYKLF-----------------------------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF

Query:  MIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYH
        MIK PLRSD+YAESSLIDMYAKCGNLNFSR VFD MQ +NEVSWNSIISAYGNH                    DHVTFLGIISACGHAGQVDEGI YYH
Subjt:  MIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYH

Query:  LMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKV
        LMT+EYGIPARMEHYAC+ D+FGRAGR+DEA+ETI SMPFPPDAGVWGTLLGACH+HGNVELAEVASK+LFDLDPLNSGYYVLLANVQAGAGKWRKVLKV
Subjt:  LMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKV

Query:  RSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSI
        RSIMKERGVRKVPGYSWIE+NNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLP+HPQ LSKSI
Subjt:  RSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSI

XP_022142608.1 pentatricopeptide repeat-containing protein At4g21300 isoform X1 [Momordica charantia] | e value: 0.0e+00 | % identity: 72.05
Query:  SQNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMG
        SQNGD+GP+ILGMYVLTGSL+DAKNVFY+LQLGCTSAWNWMIRGFT+M                              ACGALNNVKMGKIVHETVNLMG
Subjt:  SQNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMG

Query:  LKEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----
        L++DAFV G   ++ YA                            GYVKNGDS NAIKIFLEMRH EIKPNSVTFACVL VC +EAMLDLGTQLH     
Subjt:  LKEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----

Query:  ------------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIV
                                AARKLFDMMPQ+DLVSWNGIISGYVQNGLMSEAE LFRGM+SAGMKPDSITFASFLPCVTEL SL+HCK IHGYIV
Subjt:  ------------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIV

Query:  RHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH---
        RHAVVLDVFLKSALID+YFKCRDVE+A+KILR SSLVD VVCTAMISGYVLNGMN EALEAFRWLLQ+R+KPTSVTFASVFPAFAGLAALNLGKELH   
Subjt:  RHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH---

Query:  -----------GCYLLELHD-----------YKLFPER----------------QTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF
                   G  +L+++            +    E+                + G     FRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF
Subjt:  -----------GCYLLELHD-----------YKLFPER----------------QTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF

Query:  MIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYH
        MIK PLRSDIYAESSLIDMYAKCGNLNFSR VFD MQ KNEVSWNSIISAYGNH                    DHVTFLGIISACGHAGQVDEGI YYH
Subjt:  MIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYH

Query:  LMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKV
        LMT++YGIPARMEHYACM DLFGRAGR+DEA+ETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKW+KVLKV
Subjt:  LMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKV

Query:  RSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSISETVLQD
        RSIMKERGVRKVPGYSWIE+NNATHMFVAADGSHPLTAQIYSVLDSLLLEL+KEGYVPQLYLP+HPQTLSKS+ ET+LQD
Subjt:  RSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSISETVLQD

XP_023519042.1 pentatricopeptide repeat-containing protein At4g21300 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 3.6e-299 | % identity: 70.38
Query:  QNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTM------------------------------MACGALNNVKMGKIVHETVNLMGL
        +NG LG +ILGMYVLTGSLEDAKN+FYTLQLGC+S WNWMIRGF M                               ACGALN+VKMG+IVHETV+L+GL
Subjt:  QNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTM------------------------------MACGALNNVKMGKIVHETVNLMGL

Query:  KEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH------
        KEDAFV G   ++ YA                            GYVKNG+SGNAIKIFL+MRHSEIKPNSVTFACVL VC  EAMLDLGTQLH      
Subjt:  KEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH------

Query:  -----------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVR
                               AARKLFDMMPQ+DLVSWNGIISGYVQNGLMSEAEHLFRGMISAG+KPDSITFASFLPCV ELLSL+HCKEIHGYIVR
Subjt:  -----------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVR

Query:  HAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH----
        H V LD+FLKSALIDIY KCRDVE+ARKILR SS  D VVCTAMISGYVLNGMN EA+EAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH    
Subjt:  HAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH----

Query:  ----------GCYLLELHD-----------YKLFPER-------------QTGGGHQ---YFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFM
                  G  +L+++            +    ER             Q G   +    FRQMGMEGT YDCVSISGALSACANLPALHYGKEIHGFM
Subjt:  ----------GCYLLELHD-----------YKLFPER-------------QTGGGHQ---YFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFM

Query:  IKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHL
        IK PLRSD+YAESSLIDMYAKCGNLN SR VF+TMQ KNEVSWNSIISAYGNH                    DHVTF+GIISACGHAGQVDEGI YYHL
Subjt:  IKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHL

Query:  MTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVR
        MT+EY IPARMEHYACMVDLFGRAGR+DEA+ETI +MPFPPDAGVWGTLLGACHVHGNVELAEVASK+LFDLDPLNSGYYVLLANVQAGAGKW+KVLKVR
Subjt:  MTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVR

Query:  SIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTL-SKSISETVLQD
        SIMKERGVRK+PGYSWIE+NNATHMFVAADGSHPLTAQIYSVLDSLL ELKKEGYVPQLYLP+HPQ L SKS+SET LQD
Subjt:  SIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTL-SKSISETVLQD

XP_038895274.1 pentatricopeptide repeat-containing protein At4g21300 isoform X1 [Benincasa hispida] | e value: 3.9e-301 | % identity: 70.88
Query:  QNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMGL
        QNGDLGP+ILGMYV TGS EDAKN+FYTLQLG TSAWNWMI+GFTMM                              ACGALN+VKMGKIVHETVNL+GL
Subjt:  QNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMGL

Query:  KEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH------
        KEDAFV G   ++ YA                            GYVKNGDS NAIKIFLEMR+SEIKPNSVTFAC+L VC  EAML LGTQLH      
Subjt:  KEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH------

Query:  -----------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVR
                               AARKLFD MPQ+DLVSWNGIISGYVQNGLMSEAEHLFRGMI+AG+KPDSITFASFLPCV ELLSLKHCKEIHGYIVR
Subjt:  -----------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVR

Query:  HAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYL
        HAVVLDVFLKSALIDIYFKCRDVE+ARKIL  SS  D VVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHG  +
Subjt:  HAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYL

Query:  LELHDYKLF-----------------------------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFM
            D K                                              + G     FRQMG+EGTQYDCVSISGALSACANLPALHYGKEIHG M
Subjt:  LELHDYKLF-----------------------------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFM

Query:  IKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHL
        IK PLRSD+YAESSLIDMYAKCGNL+FSR VFD MQ KNEVSWNSIISAYGNH                    DHVTFLGIISACGHAG+VDEGI YYHL
Subjt:  IKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHL

Query:  MTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVR
        MT+EYGIPA+MEHYAC+VDLFGRAGR+DEA+ETI SMPF PDAGVWGTLLGACHVHGNVELAEVASK+LFDLDPLNSGYYVLLANVQAGAGKW+KVLKVR
Subjt:  MTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVR

Query:  SIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSISETV
        SIMKERGVRK+PGYSWIE+NNATHMFVAADGSHPLTAQIYS+LDSLLLELKKEGYVPQLYLP+HPQ LSKS+SETV
Subjt:  SIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSISETV

XP_038895275.1 pentatricopeptide repeat-containing protein At4g21300 isoform X2 [Benincasa hispida] | e value: 2.2e-304 | % identity: 73.53
Query:  QNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM--ACGALNNVKMGKIVHETVNLMGLKEDAFVEGLCFMECYA------------
        QNGDLGP+ILGMYV TGS EDAKN+FYTLQLG TSAWNWMI+GFTMM  ACGALN+VKMGKIVHETVNL+GLKEDAFV G   ++ YA            
Subjt:  QNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM--ACGALNNVKMGKIVHETVNLMGLKEDAFVEGLCFMECYA------------

Query:  ---------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----------------------------AARKL
                        GYVKNGDS NAIKIFLEMR+SEIKPNSVTFAC+L VC  EAML LGTQLH                             AARKL
Subjt:  ---------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----------------------------AARKL

Query:  FDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARK
        FD MPQ+DLVSWNGIISGYVQNGLMSEAEHLFRGMI+AG+KPDSITFASFLPCV ELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVE+ARK
Subjt:  FDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARK

Query:  ILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLF-------------------
        IL  SS  D VVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHG  +    D K                     
Subjt:  ILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLF-------------------

Query:  ----------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFS
                                 + G     FRQMG+EGTQYDCVSISGALSACANLPALHYGKEIHG MIK PLRSD+YAESSLIDMYAKCGNL+FS
Subjt:  ----------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFS

Query:  RLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHLMTKEYGIPARMEHYACMVDLFGRAGRVD
        R VFD MQ KNEVSWNSIISAYGNH                    DHVTFLGIISACGHAG+VDEGI YYHLMT+EYGIPA+MEHYAC+VDLFGRAGR+D
Subjt:  RLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHLMTKEYGIPARMEHYACMVDLFGRAGRVD

Query:  EAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKERGVRKVPGYSWIEINNATHMFVA
        EA+ETI SMPF PDAGVWGTLLGACHVHGNVELAEVASK+LFDLDPLNSGYYVLLANVQAGAGKW+KVLKVRSIMKERGVRK+PGYSWIE+NNATHMFVA
Subjt:  EAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKERGVRKVPGYSWIEINNATHMFVA

Query:  ADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSISETV
        ADGSHPLTAQIYS+LDSLLLELKKEGYVPQLYLP+HPQ LSKS+SETV
Subjt:  ADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSISETV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LW16 Uncharacterized protein | e value: 2.7e-300 | % identity: 70.25
Query:  SQNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMG
        +QNGDLGP++LGMYV TGSL+DAKN+FYTLQLGCTSAWNWMIRGFTMM                              AC  L +VKMGKIVHETVNLMG
Subjt:  SQNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMG

Query:  LKEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----
        LKED FV G   ++ YA                            GYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVL VC  EAMLDLGTQLH     
Subjt:  LKEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----

Query:  ------------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIV
                                AARKLFD  PQ+DLVSWNGIISGYVQNGLM EAEHLFRGMISAG+KPDSITFASFLPCV ELLSLKHCKEIHGYI+
Subjt:  ------------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIV

Query:  RHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCY
        RHAVVLDVFLKSALIDIYFKCRDVE+A+KIL  SS  D VVCT MISGYVLNG N EALEAFRWL+QERMKPTSVTF+S+FPAFAGLAALNLGKELHG  
Subjt:  RHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCY

Query:  LLELHDYKLF-----------------------------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF
        +    D K                                              + G     FRQMGMEGT+YDCVSISGALSACANLPALHYGKEIHG 
Subjt:  LLELHDYKLF-----------------------------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF

Query:  MIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYH
        MIK PLRSD+YAESSLIDMYAKCGNLNFSR VFD MQ +NEVSWNSIISAYGNH                    DHVTFLGIISACGHAGQVDEGI YYH
Subjt:  MIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYH

Query:  LMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKV
        LMT+EYGIPARMEHYAC+ D+FGRAGR+DEA+ETI SMPFPPDAGVWGTLLGACH+HGNVELAEVASK+LFDLDPLNSGYYVLLANVQAGAGKWRKVLKV
Subjt:  LMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKV

Query:  RSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSI
        RSIMKERGVRKVPGYSWIE+NNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLP+HPQ LSKSI
Subjt:  RSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSI

A0A6J1CLE8 pentatricopeptide repeat-containing protein At4g21300 isoform X3 | e value: 7.4e-298 | % identity: 71.56
Query:  SQNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMG
        SQNGD+GP+ILGMYVLTGSL+DAKNVFY+LQLGCTSAWNWMIRGFT+M                              ACGALNNVKMGKIVHETVNLMG
Subjt:  SQNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMG

Query:  LKEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----
        L++DAFV G   ++ YA                            GYVKNGDS NAIKIFLEMRH EIKPNSVTFACVL VC +EAMLDLGTQLH     
Subjt:  LKEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----

Query:  ------------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIV
                                AARKLFDMMPQ+DLVSWNGIISGYVQNGLMSEAE LFRGM+SAGMKPDSITFASFLPCVTEL SL+HCK IHGYIV
Subjt:  ------------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIV

Query:  RHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH---
        RHAVVLDVFLKSALID+YFKCRDVE+A+KILR SSLVD VVCTAMISGYVLNGMN EALEAFRWLLQ+R+KPTSVTFASVFPAFAGLAALNLGKELH   
Subjt:  RHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH---

Query:  -----------GCYLLELHD-----------YKLFPER----------------QTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF
                   G  +L+++            +    E+                + G     FRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF
Subjt:  -----------GCYLLELHD-----------YKLFPER----------------QTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF

Query:  MIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYH
        MIK PLRSDIYAESSLIDMYAKCGNLNFSR VFD MQ KNEVSWNSIISAYGNH                    DHVTFLGIISACGHAGQVDEGI YYH
Subjt:  MIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYH

Query:  LMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKV
        LMT++YGIPARMEHYACM DLFGRAGR+DEA+ETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKW+KVLKV
Subjt:  LMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKV

Query:  RSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGY
        RSIMKERGVRKVPGYSWIE+NNATHMFVAADGSHPLTAQIYSVLDSLLLEL+KEG+
Subjt:  RSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGY

A0A6J1CNN9 pentatricopeptide repeat-containing protein At4g21300 isoform X1 | e value: 0.0e+00 | % identity: 72.05
Query:  SQNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMG
        SQNGD+GP+ILGMYVLTGSL+DAKNVFY+LQLGCTSAWNWMIRGFT+M                              ACGALNNVKMGKIVHETVNLMG
Subjt:  SQNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMG

Query:  LKEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----
        L++DAFV G   ++ YA                            GYVKNGDS NAIKIFLEMRH EIKPNSVTFACVL VC +EAMLDLGTQLH     
Subjt:  LKEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH-----

Query:  ------------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIV
                                AARKLFDMMPQ+DLVSWNGIISGYVQNGLMSEAE LFRGM+SAGMKPDSITFASFLPCVTEL SL+HCK IHGYIV
Subjt:  ------------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIV

Query:  RHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH---
        RHAVVLDVFLKSALID+YFKCRDVE+A+KILR SSLVD VVCTAMISGYVLNGMN EALEAFRWLLQ+R+KPTSVTFASVFPAFAGLAALNLGKELH   
Subjt:  RHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH---

Query:  -----------GCYLLELHD-----------YKLFPER----------------QTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF
                   G  +L+++            +    E+                + G     FRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF
Subjt:  -----------GCYLLELHD-----------YKLFPER----------------QTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGF

Query:  MIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYH
        MIK PLRSDIYAESSLIDMYAKCGNLNFSR VFD MQ KNEVSWNSIISAYGNH                    DHVTFLGIISACGHAGQVDEGI YYH
Subjt:  MIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYH

Query:  LMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKV
        LMT++YGIPARMEHYACM DLFGRAGR+DEA+ETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKW+KVLKV
Subjt:  LMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKV

Query:  RSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSISETVLQD
        RSIMKERGVRKVPGYSWIE+NNATHMFVAADGSHPLTAQIYSVLDSLLLEL+KEGYVPQLYLP+HPQTLSKS+ ET+LQD
Subjt:  RSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSISETVLQD

A0A6J1EG50 pentatricopeptide repeat-containing protein At4g21300 isoform X1 | e value: 2.5e-298 | % identity: 69.87
Query:  QNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMGL
        +NG LG +ILGMYVL GSLEDAKN+FYTLQLGC+S WNWMIRGFTMM                              ACGALN+VKMG+IVHETV+L+GL
Subjt:  QNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMGL

Query:  KEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH------
        KEDAFV G   ++ YA                            GYVKNGDSGNAIKIFL+MRHSEIKPNSVTFACVL VC  EAMLDLGTQLH      
Subjt:  KEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH------

Query:  -----------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVR
                               AARKLFDMMP++DLVSWNGIISGYVQNGLMSEAE L RGMISAG+KPDSITFASFLPCV E+LSL+HCKEIHGYI+R
Subjt:  -----------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVR

Query:  HAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH----
        H V LDVFLKSALIDIY KCRDVE+ARKILR SS  D VVCTAMISGYVLNGMN EA+EAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH    
Subjt:  HAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH----

Query:  ----------GCYLLELHD-----------YKLFPER----------------QTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFM
                  G  +L+++            +    ER                + G     FRQMGMEGT YDCVSISGALSACANLPALHYGKEIHGFM
Subjt:  ----------GCYLLELHD-----------YKLFPER----------------QTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFM

Query:  IKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHL
        IK PLRSD+YAESSLIDMYAKCGNLN SR VF+TMQ KNEVSWNSIISAYGNH                    DHVTF+GIISACGHAGQVDEGI YYHL
Subjt:  IKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHL

Query:  MTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVR
        MT+EY IPARMEHYACMVDLFGRAGR++EA+ETI +MPFPPDAGVWGTLLGACHVHGNVELAEVASK+LFDLDPLNSGYYVLLANVQAGAGKW+KVLKVR
Subjt:  MTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVR

Query:  SIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTL-SKSISETVLQD
        SIMKERGVRK+PGYSWIE+NNATHMFVAADGSHPLTAQIYSVLDSLL ELKKEGYVPQLYLP+HPQ L SKS+SET LQD
Subjt:  SIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTL-SKSISETVLQD

A0A6J1KKW0 pentatricopeptide repeat-containing protein At4g21300 isoform X1 | e value: 5.1e-299 | % identity: 70.13
Query:  QNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMGL
        +NG LG +ILGMYVLTGSLEDAKN+FYTLQLGC+S WNWMIRGFT+M                              ACGALN+VKMG+IVHETV+L+GL
Subjt:  QNGDLGPKILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMM------------------------------ACGALNNVKMGKIVHETVNLMGL

Query:  KEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH------
        KEDAFV G   ++ YA                            GYVKNGDSGNAIKIFL+MRHSEIKPNSVTFACVL VC  EAMLDLGTQLH      
Subjt:  KEDAFVEGLCFMECYA---------------------------YGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLH------

Query:  -----------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVR
                               AARKLFDMMPQ+DLVSWNGIISGYVQNGLMSEAE LFRGMISAG+KPDSITFASFLPCV ELLSL+HCKEIHGYIVR
Subjt:  -----------------------AARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVR

Query:  HAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH----
        H V LD+FLKSALIDIY KCRDVE+ARKILR SS  D VVCTAMISGYVLNGMN EA+EAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH    
Subjt:  HAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH----

Query:  ----------GCYLLELHD-----------YKLFPER----------------QTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFM
                  G  +L+++            +    ER                + G     FRQMG EGT YDCVSIS ALSACANLPALHYGKEIHGFM
Subjt:  ----------GCYLLELHD-----------YKLFPER----------------QTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFM

Query:  IKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHL
        IK PLRSD+YAESSLIDMYAKCGNLN SR VF+TMQ KNEVSWNSIISAYGNH                    DHVTF+GIISACGHAGQVDEGI YYHL
Subjt:  IKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHL

Query:  MTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVR
        MT+EY IPARMEHYACMVDLFGRAGR+DEA+ETI +MPFPPDAGVWGTLLGACHVHGNVELAEVASK+LFDLDPLNSGYYVLLANVQAGAGKW+KVLKVR
Subjt:  MTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVR

Query:  SIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTL-SKSISETVLQD
        SIMKERGVRK+PGYSWIE+NNATHMFVAADGSHPLTAQIYSVLDSLL ELKKEGYVPQLYLP+HPQ L SKS+SET LQD
Subjt:  SIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTL-SKSISETVLQD

SwissProt top hits (e value, % identity, alignment)
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic | e value: 4.3e-101 | % identity: 34.04
Query:  GYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA-----------------------------ARKLFDMMPQNDLVSWNGII
        G+ K  D   A++ F+ MR+ +++P    F  +L VCG EA L +G ++H                              ARK+FD MP+ DLVSWN I+
Subjt:  GYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA-----------------------------ARKLFDMMPQNDLVSWNGII

Query:  SGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAM
        +GY QNG+   A  + + M    +KP  IT  S LP V+ L  +   KEIHGY +R      V + +AL+D+Y KC  +E AR++       + V   +M
Subjt:  SGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAM

Query:  ISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHG---------------------CYLLELHD-YKLFPERQTG-------
        I  YV N    EA+  F+ +L E +KPT V+      A A L  L  G+ +H                      C   E+     +F + Q+        
Subjt:  ISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHG---------------------CYLLELHD-YKLFPERQTG-------

Query:  ------------GGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWN
                        YF QM     + D  +    ++A A L   H+ K IHG +++S L  +++  ++L+DMYAKCG +  +RL+FD M  ++  +WN
Subjt:  ------------GGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWN

Query:  SIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAG
        ++I  YG H                    + VTFL +ISAC H+G V+ G+  +++M + Y I   M+HY  MVDL GRAGR++EA++ I  MP  P   
Subjt:  SIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAG

Query:  VWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLD
        V+G +LGAC +H NV  AE A++ LF+L+P + GY+VLLAN+   A  W KV +VR  M  +G+RK PG S +EI N  H F +   +HP + +IY+ L+
Subjt:  VWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLD

Query:  SLLLELKKEGYVPQLYL
         L+  +K+ GYVP   L
Subjt:  SLLLELKKEGYVPQLYL

Q9LNU6 Pentatricopeptide repeat-containing protein At1g20230 | e value: 7.0e-104 | % identity: 35.98
Query:  CGALNNVKMGKIVHETVNLMGLKEDAFVEGLCFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHAARKLFDMMP-
        C  L+  K+GK +H    + GL  DAFV+G  F     + Y++ G  G+A K+F  M   ++    VT + +LC    +  L+        R L +M   
Subjt:  CGALNNVKMGKIVHETVNLMGLKEDAFVEGLCFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHAARKLFDMMP-

Query:  --QNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILR
          + ++VSWNGI+SG+ ++G   EA  +F+ +   G  PD +T +S LP V +   L   + IHGY+++  ++ D  + SA+ID+Y K   V     +  
Subjt:  --QNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILR

Query:  PSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLFPERQTGGGHQYFRQMGMEGTQY
           +++A VC A I+G   NG+  +ALE F    ++ M+   V++ S+    AG A    GK++     LEL                 FR+M + G + 
Subjt:  PSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLFPERQTGGGHQYFRQMGMEGTQY

Query:  DCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH-------------------
        + V+I   L AC N+ AL +G+  HGF ++  L  +++  S+LIDMYAKCG +N S++VF+ M  KN V WNS+++ +  H                   
Subjt:  DCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH-------------------

Query:  -DHVTFLGIISACGHAGQVDEGINYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDL
         D ++F  ++SACG  G  DEG  Y+ +M++EYGI  R+EHY+CMV+L GRAG++ EAY+ IK MPF PD+ VWG LL +C +  NV+LAE+A++ LF L
Subjt:  -DHVTFLGIISACGHAGQVDEGINYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDL

Query:  DPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIH
        +P N G YVLL+N+ A  G W +V  +R+ M+  G++K PG SWI++ N  +  +A D SHP   QI   +D +  E++K G+ P L   +H
Subjt:  DPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIH

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g23330 | e value: 8.3e-97 | % identity: 30.88
Query:  ILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFT------------------------------MMACGALNNVKMGKIVHETVNLMGLKEDAFVEG
        ++ +Y     L +A  +F TL+     AW  +IR FT                              + +C  + +++ G+ VH  +  +G+  D +  G
Subjt:  ILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFT------------------------------MMACGALNNVKMGKIVHETVNLMGLKEDAFVEG

Query:  LCFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHAARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMI
           M  YA   +  G   +   +F EM                C+        +   + + R++F++MP+ D+VS+N II+GY Q+G+  +A  + R M 
Subjt:  LCFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHAARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMI

Query:  SAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWL
        +  +KPDS T +S LP  +E + +   KEIHGY++R  +  DV++ S+L+D+Y K   +E + ++       D +   ++++GYV NG   EAL  FR +
Subjt:  SAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWL

Query:  LQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLFPERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPL
        +  ++KP +V F+SV P                                                            ACA+L  LH GK++HG++++   
Subjt:  LQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLFPERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPL

Query:  RSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNHDH--------------------VTFLGIISACGHAGQVDEGINYYHLMTKEY
         S+I+  S+L+DMY+KCGN+  +R +FD M   +EVSW +II  +  H H                    V F+ +++AC H G VDE   Y++ MTK Y
Subjt:  RSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNHDH--------------------VTFLGIISACGHAGQVDEGINYYHLMTKEY

Query:  GIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKE
        G+   +EHYA + DL GRAG+++EAY  I  M   P   VW TLL +C VH N+ELAE  ++ +F +D  N G YVL+ N+ A  G+W+++ K+R  M++
Subjt:  GIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKE

Query:  RGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYV
        +G+RK P  SWIE+ N TH FV+ D SHP   +I   L +++ +++KEGYV
Subjt:  RGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYV

Q9SS83 Pentatricopeptide repeat-containing protein At3g09040, mitochondrial    3.6e-92    30
Query:  ILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMMAC------------------------------GALNNVKMGKIVHETVNLMGLKEDAFV--
        ++  Y+  G L+DA+ +F  +      AWN MI G     C                              G + N+ +G +VH     +GL  + +V  
Subjt:  ILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMMAC------------------------------GALNNVKMGKIVHETVNLMGLKEDAFV--

Query:  -------------------EGL-----CFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA--------------
                           E L      F      GY  NG+S   +++F++M+ S    +  TF  +L  C     L++G+Q H+              
Subjt:  -------------------EGL-----CFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA--------------

Query:  ---------------ARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFL
                       AR++F+ M   D V+WN II  YVQ+   SEA  LF+ M   G+  D    AS L   T +  L   K++H   V+  +  D+  
Subjt:  ---------------ARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFL

Query:  KSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH-------------
         S+LID+Y KC  ++ ARK+         V   A+I+GY  N +  EA+  F+ +L   + P+ +TFA++  A     +L LG + H             
Subjt:  KSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH-------------

Query:  --GCYLLELH--------DYKLFPERQ--------TG--GGH----------QYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRS
          G  LL ++           LF E          TG   GH          +++++M  +G   D  +    L  C+ L +L  G+ IH  +       
Subjt:  --GCYLLELH--------DYKLFPERQ--------TG--GGH----------QYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRS

Query:  DIYAESSLIDMYAKCGNLNFSRLVFDTMQRK-NEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHLMTKEYG
        D    ++LIDMYAKCG++  S  VFD M+R+ N VSWNS+I+ Y  +                    D +TFLG+++AC HAG+V +G   + +M  +YG
Subjt:  DIYAESSLIDMYAKCGNLNFSRLVFDTMQRK-NEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHLMTKEYG

Query:  IPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKER
        I AR++H ACMVDL GR G + EA + I++    PDA +W +LLGAC +HG+    E++++ L +L+P NS  YVLL+N+ A  G W K   +R +M++R
Subjt:  IPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKER

Query:  GVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYV
        GV+KVPGYSWI++   TH+F A D SH    +I   L+ L   +K +  V
Subjt:  GVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYV

Q9STE1 Pentatricopeptide repeat-containing protein At4g21300    6.6e-179    44.97
Query:  NSQNGD--LGPKILGMYVLTGSLEDAKNVFYTLQLGCTS--AWNWMIRGFT------------------------------MMACGALNNVKMGKIVHET
        NS +GD     +ILGMY + GS  D   +FY L L  +S   WN +I  F                               + AC AL N K    + +T
Subjt:  NSQNGD--LGPKILGMYVLTGSLEDAKNVFYTLQLGCTS--AWNWMIRGFT------------------------------MMACGALNNVKMGKIVHET

Query:  VNLMGLKEDAFVEGLCFMECYAY--------------------------GYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA
        V+ +G+  + FV          Y                          GY K G   + IK F  MR  +I PN+VTF CVL VC  + ++DLG QLH 
Subjt:  VNLMGLKEDAFVEGLCFMECYAY--------------------------GYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA

Query:  -----------------------------ARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIH
                                     A KLF MM + D V+WN +ISGYVQ+GLM E+   F  MIS+G+ PD+ITF+S LP V++  +L++CK+IH
Subjt:  -----------------------------ARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIH

Query:  GYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKEL
         YI+RH++ LD+FL SALID YFKCR V +A+ I    + VD VV TAMISGY+ NG+  ++LE FRWL++ ++ P  +T  S+ P    L AL LG+EL
Subjt:  GYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKEL

Query:  H--------------GCYLLELH--------DYKLF-------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKE
        H              GC +++++         Y++F                              FRQMG+ G  YDCVSIS ALSACANLP+  +GK 
Subjt:  H--------------GCYLLELH--------DYKLF-------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKE

Query:  IHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH---------------------DHVTFLGIISACGHAGQVDEG
        IHGFMIK  L SD+Y+ES+LIDMYAKCGNL  +  VF TM+ KN VSWNSII+A GNH                     D +TFL IIS+C H G VDEG
Subjt:  IHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH---------------------DHVTFLGIISACGHAGQVDEG

Query:  INYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWR
        + ++  MT++YGI  + EHYAC+VDLFGRAGR+ EAYET+KSMPFPPDAGVWGTLLGAC +H NVELAEVAS  L DLDP NSGYYVL++N  A A +W 
Subjt:  INYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWR

Query:  KVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSK
         V KVRS+MKER V+K+PGYSWIEIN  TH+FV+ D +HP ++ IYS+L+SLL EL+ EGY+PQ YLP+HP++  K
Subjt:  KVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSK

Arabidopsis top hits    e value    % identity    Alignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein    3.0e-102    34.04
Query:  GYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA-----------------------------ARKLFDMMPQNDLVSWNGII
        G+ K  D   A++ F+ MR+ +++P    F  +L VCG EA L +G ++H                              ARK+FD MP+ DLVSWN I+
Subjt:  GYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA-----------------------------ARKLFDMMPQNDLVSWNGII

Query:  SGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAM
        +GY QNG+   A  + + M    +KP  IT  S LP V+ L  +   KEIHGY +R      V + +AL+D+Y KC  +E AR++       + V   +M
Subjt:  SGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAM

Query:  ISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHG---------------------CYLLELHD-YKLFPERQTG-------
        I  YV N    EA+  F+ +L E +KPT V+      A A L  L  G+ +H                      C   E+     +F + Q+        
Subjt:  ISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHG---------------------CYLLELHD-YKLFPERQTG-------

Query:  ------------GGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWN
                        YF QM     + D  +    ++A A L   H+ K IHG +++S L  +++  ++L+DMYAKCG +  +RL+FD M  ++  +WN
Subjt:  ------------GGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWN

Query:  SIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAG
        ++I  YG H                    + VTFL +ISAC H+G V+ G+  +++M + Y I   M+HY  MVDL GRAGR++EA++ I  MP  P   
Subjt:  SIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAG

Query:  VWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLD
        V+G +LGAC +H NV  AE A++ LF+L+P + GY+VLLAN+   A  W KV +VR  M  +G+RK PG S +EI N  H F +   +HP + +IY+ L+
Subjt:  VWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLD

Query:  SLLLELKKEGYVPQLYL
         L+  +K+ GYVP   L
Subjt:  SLLLELKKEGYVPQLYL

AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein    5.0e-105    35.98
Query:  CGALNNVKMGKIVHETVNLMGLKEDAFVEGLCFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHAARKLFDMMP-
        C  L+  K+GK +H    + GL  DAFV+G  F     + Y++ G  G+A K+F  M   ++    VT + +LC    +  L+        R L +M   
Subjt:  CGALNNVKMGKIVHETVNLMGLKEDAFVEGLCFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHAARKLFDMMP-

Query:  --QNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILR
          + ++VSWNGI+SG+ ++G   EA  +F+ +   G  PD +T +S LP V +   L   + IHGY+++  ++ D  + SA+ID+Y K   V     +  
Subjt:  --QNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILR

Query:  PSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLFPERQTGGGHQYFRQMGMEGTQY
           +++A VC A I+G   NG+  +ALE F    ++ M+   V++ S+    AG A    GK++     LEL                 FR+M + G + 
Subjt:  PSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLFPERQTGGGHQYFRQMGMEGTQY

Query:  DCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH-------------------
        + V+I   L AC N+ AL +G+  HGF ++  L  +++  S+LIDMYAKCG +N S++VF+ M  KN V WNS+++ +  H                   
Subjt:  DCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH-------------------

Query:  -DHVTFLGIISACGHAGQVDEGINYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDL
         D ++F  ++SACG  G  DEG  Y+ +M++EYGI  R+EHY+CMV+L GRAG++ EAY+ IK MPF PD+ VWG LL +C +  NV+LAE+A++ LF L
Subjt:  -DHVTFLGIISACGHAGQVDEGINYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDL

Query:  DPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIH
        +P N G YVLL+N+ A  G W +V  +R+ M+  G++K PG SWI++ N  +  +A D SHP   QI   +D +  E++K G+ P L   +H
Subjt:  DPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIH

AT3G09040.1 Pentatricopeptide repeat (PPR) superfamily protein    2.6e-93    30
Query:  ILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMMAC------------------------------GALNNVKMGKIVHETVNLMGLKEDAFV--
        ++  Y+  G L+DA+ +F  +      AWN MI G     C                              G + N+ +G +VH     +GL  + +V  
Subjt:  ILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMMAC------------------------------GALNNVKMGKIVHETVNLMGLKEDAFV--

Query:  -------------------EGL-----CFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA--------------
                           E L      F      GY  NG+S   +++F++M+ S    +  TF  +L  C     L++G+Q H+              
Subjt:  -------------------EGL-----CFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA--------------

Query:  ---------------ARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFL
                       AR++F+ M   D V+WN II  YVQ+   SEA  LF+ M   G+  D    AS L   T +  L   K++H   V+  +  D+  
Subjt:  ---------------ARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFL

Query:  KSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH-------------
         S+LID+Y KC  ++ ARK+         V   A+I+GY  N +  EA+  F+ +L   + P+ +TFA++  A     +L LG + H             
Subjt:  KSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELH-------------

Query:  --GCYLLELH--------DYKLFPERQ--------TG--GGH----------QYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRS
          G  LL ++           LF E          TG   GH          +++++M  +G   D  +    L  C+ L +L  G+ IH  +       
Subjt:  --GCYLLELH--------DYKLFPERQ--------TG--GGH----------QYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRS

Query:  DIYAESSLIDMYAKCGNLNFSRLVFDTMQRK-NEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHLMTKEYG
        D    ++LIDMYAKCG++  S  VFD M+R+ N VSWNS+I+ Y  +                    D +TFLG+++AC HAG+V +G   + +M  +YG
Subjt:  DIYAESSLIDMYAKCGNLNFSRLVFDTMQRK-NEVSWNSIISAYGNH--------------------DHVTFLGIISACGHAGQVDEGINYYHLMTKEYG

Query:  IPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKER
        I AR++H ACMVDL GR G + EA + I++    PDA +W +LLGAC +HG+    E++++ L +L+P NS  YVLL+N+ A  G W K   +R +M++R
Subjt:  IPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKER

Query:  GVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYV
        GV+KVPGYSWI++   TH+F A D SH    +I   L+ L   +K +  V
Subjt:  GVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYV

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein    5.9e-98    30.88
Query:  ILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFT------------------------------MMACGALNNVKMGKIVHETVNLMGLKEDAFVEG
        ++ +Y     L +A  +F TL+     AW  +IR FT                              + +C  + +++ G+ VH  +  +G+  D +  G
Subjt:  ILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFT------------------------------MMACGALNNVKMGKIVHETVNLMGLKEDAFVEG

Query:  LCFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHAARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMI
           M  YA   +  G   +   +F EM                C+        +   + + R++F++MP+ D+VS+N II+GY Q+G+  +A  + R M 
Subjt:  LCFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHAARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMI

Query:  SAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWL
        +  +KPDS T +S LP  +E + +   KEIHGY++R  +  DV++ S+L+D+Y K   +E + ++       D +   ++++GYV NG   EAL  FR +
Subjt:  SAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWL

Query:  LQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLFPERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPL
        +  ++KP +V F+SV P                                                            ACA+L  LH GK++HG++++   
Subjt:  LQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLFPERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPL

Query:  RSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNHDH--------------------VTFLGIISACGHAGQVDEGINYYHLMTKEY
         S+I+  S+L+DMY+KCGN+  +R +FD M   +EVSW +II  +  H H                    V F+ +++AC H G VDE   Y++ MTK Y
Subjt:  RSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNHDH--------------------VTFLGIISACGHAGQVDEGINYYHLMTKEY

Query:  GIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKE
        G+   +EHYA + DL GRAG+++EAY  I  M   P   VW TLL +C VH N+ELAE  ++ +F +D  N G YVL+ N+ A  G+W+++ K+R  M++
Subjt:  GIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKE

Query:  RGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYV
        +G+RK P  SWIE+ N TH FV+ D SHP   +I   L +++ +++KEGYV
Subjt:  RGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYV

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein    4.7e-180    44.97
Query:  NSQNGD--LGPKILGMYVLTGSLEDAKNVFYTLQLGCTS--AWNWMIRGFT------------------------------MMACGALNNVKMGKIVHET
        NS +GD     +ILGMY + GS  D   +FY L L  +S   WN +I  F                               + AC AL N K    + +T
Subjt:  NSQNGD--LGPKILGMYVLTGSLEDAKNVFYTLQLGCTS--AWNWMIRGFT------------------------------MMACGALNNVKMGKIVHET

Query:  VNLMGLKEDAFVEGLCFMECYAY--------------------------GYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA
        V+ +G+  + FV          Y                          GY K G   + IK F  MR  +I PN+VTF CVL VC  + ++DLG QLH 
Subjt:  VNLMGLKEDAFVEGLCFMECYAY--------------------------GYVKNGDSGNAIKIFLEMRHSEIKPNSVTFACVLCVCGVEAMLDLGTQLHA

Query:  -----------------------------ARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIH
                                     A KLF MM + D V+WN +ISGYVQ+GLM E+   F  MIS+G+ PD+ITF+S LP V++  +L++CK+IH
Subjt:  -----------------------------ARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIH

Query:  GYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKEL
         YI+RH++ LD+FL SALID YFKCR V +A+ I    + VD VV TAMISGY+ NG+  ++LE FRWL++ ++ P  +T  S+ P    L AL LG+EL
Subjt:  GYIVRHAVVLDVFLKSALIDIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKEL

Query:  H--------------GCYLLELH--------DYKLF-------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKE
        H              GC +++++         Y++F                              FRQMG+ G  YDCVSIS ALSACANLP+  +GK 
Subjt:  H--------------GCYLLELH--------DYKLF-------------------PERQTGGGHQYFRQMGMEGTQYDCVSISGALSACANLPALHYGKE

Query:  IHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH---------------------DHVTFLGIISACGHAGQVDEG
        IHGFMIK  L SD+Y+ES+LIDMYAKCGNL  +  VF TM+ KN VSWNSII+A GNH                     D +TFL IIS+C H G VDEG
Subjt:  IHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNH---------------------DHVTFLGIISACGHAGQVDEG

Query:  INYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWR
        + ++  MT++YGI  + EHYAC+VDLFGRAGR+ EAYET+KSMPFPPDAGVWGTLLGAC +H NVELAEVAS  L DLDP NSGYYVL++N  A A +W 
Subjt:  INYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWR

Query:  KVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSK
         V KVRS+MKER V+K+PGYSWIEIN  TH+FV+ D +HP ++ IYS+L+SLL EL+ EGY+PQ YLP+HP++  K
Subjt:  KVLKVRSIMKERGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSK


Sequences
CDS sequence
TTCTAACCGTCGATTGAAAGCAGCCACTGATAAAATACAACAATCCAAGAGCGGGATTGAAAGCAAAAGATTCGAGGGCGGGCTGGGCCTTCGCCTAAATGAACATGATT
TGATCAACGAAGCATGTACTACAAATTTCGTTTTCATTCTACAGATTTCTTCCTCATTTCTCGCGACCGGAGTCTCTCTTTTCAACTCAATCCAACTGCAAAAACCCGAT
AAACCCCACCTTGGTTTTCGACAAATGTGGAAGCGGTACTGGCATCAGCAAACAATCTCACGCCCAGGCCATTGTTACTGGAATAGCCAAAATGGTGATCTGGGTCCAAA
GATATTGGGTATGTACGTGCTTACTGGCAGCCTCGAGGATGCCAAGAACGTGTTTTATACGCTTCAATTGGGATGTACTTCGGCTTGGAATTGGATGATTAGGGGGTTTA
CAATGATGGCCTGTGGTGCTTTGAACAATGTGAAGATGGGTAAGATTGTTCATGAGACTGTTAATTTAATGGGCCTTAAGGAGGATGCCTTCGTGGAAGGACTGTGTTTT
ATGGAATGTTATGCTTATGGTTATGTGAAAAATGGAGACTCAGGCAATGCTATTAAGATCTTTTTGGAAATGAGACATAGTGAAATTAAGCCCAACTCCGTAACCTTTGC
TTGTGTTTTATGTGTTTGTGGTGTGGAGGCAATGCTTGACTTAGGTACTCAACTTCACGCTGCGCGTAAACTGTTTGATATGATGCCACAAAACGACTTGGTGAGTTGGA
ATGGAATAATTTCTGGATACGTACAGAATGGTTTGATGAGTGAGGCTGAGCATTTGTTTCGTGGAATGATATCTGCAGGAATGAAGCCCGACTCGATCACTTTTGCAAGT
TTCCTTCCATGTGTTACTGAGTTGCTGAGTCTCAAACATTGTAAGGAAATTCATGGTTACATAGTAAGACATGCTGTAGTTTTGGATGTGTTCTTGAAAAGTGCTCTAAT
TGATATATACTTCAAGTGCAGGGATGTGGAAATCGCACGAAAAATTTTGCGTCCAAGTAGTTTGGTTGATGCTGTAGTGTGCACGGCCATGATTTCAGGATACGTGCTTA
ATGGGATGAACACAGAAGCATTGGAGGCATTTAGATGGTTGCTGCAGGAGAGAATGAAGCCAACTTCTGTGACTTTTGCGAGTGTCTTTCCAGCTTTTGCTGGTTTGGCC
GCTCTAAACTTAGGAAAGGAATTGCATGGATGCTATTTGCTGGAACTCCATGATTACAAGCTGTTCCCAGAACGGCAGACCGGAGGAGGCCATCAATATTTCCGTCAGAT
GGGAATGGAGGGAACTCAGTATGACTGTGTGAGCATATCTGGTGCACTATCTGCTTGTGCAAACTTACCTGCTCTCCATTATGGAAAAGAGATCCATGGCTTCATGATCA
AAAGCCCATTAAGATCTGACATTTATGCCGAGAGTTCACTGATAGACATGTATGCTAAGTGTGGAAACTTGAATTTCTCCCGGCTAGTGTTTGACACGATGCAACGGAAA
AATGAAGTCTCCTGGAACAGCATCATTTCTGCCTATGGCAACCACGACCATGTCACCTTTCTTGGTATCATATCTGCTTGCGGCCATGCTGGCCAAGTCGATGAAGGAAT
TAACTATTACCACCTCATGACAAAGGAATACGGGATCCCAGCTCGAATGGAGCACTATGCTTGCATGGTCGATTTGTTTGGCCGCGCAGGTCGTGTGGATGAAGCATATG
AAACCATAAAAAGCATGCCATTCCCTCCAGATGCTGGCGTATGGGGAACACTACTTGGGGCATGCCATGTTCATGGCAACGTTGAGCTCGCCGAAGTGGCATCAAAGTAT
CTGTTTGATTTAGACCCTCTAAACTCTGGCTACTACGTATTGCTTGCAAATGTGCAGGCTGGTGCCGGAAAATGGAGGAAGGTGCTTAAAGTACGTAGCATAATGAAAGA
ACGAGGAGTTCGGAAGGTTCCTGGTTATAGCTGGATCGAGATCAACAATGCCACCCACATGTTCGTTGCAGCGGACGGAAGCCATCCGCTCACTGCTCAGATCTATTCTG
TGCTGGATAGTCTTCTTCTAGAACTGAAAAAAGAAGGGTATGTTCCTCAACTCTACCTTCCAATACACCCACAAACTCTCAGTAAATCAATATCAGAAACTGTTTTACAA
GATTGA
mRNA sequence
TTCTAACCGTCGATTGAAAGCAGCCACTGATAAAATACAACAATCCAAGAGCGGGATTGAAAGCAAAAGATTCGAGGGCGGGCTGGGCCTTCGCCTAAATGAACATGATT
TGATCAACGAAGCATGTACTACAAATTTCGTTTTCATTCTACAGATTTCTTCCTCATTTCTCGCGACCGGAGTCTCTCTTTTCAACTCAATCCAACTGCAAAAACCCGAT
AAACCCCACCTTGGTTTTCGACAAATGTGGAAGCGGTACTGGCATCAGCAAACAATCTCACGCCCAGGCCATTGTTACTGGAATAGCCAAAATGGTGATCTGGGTCCAAA
GATATTGGGTATGTACGTGCTTACTGGCAGCCTCGAGGATGCCAAGAACGTGTTTTATACGCTTCAATTGGGATGTACTTCGGCTTGGAATTGGATGATTAGGGGGTTTA
CAATGATGGCCTGTGGTGCTTTGAACAATGTGAAGATGGGTAAGATTGTTCATGAGACTGTTAATTTAATGGGCCTTAAGGAGGATGCCTTCGTGGAAGGACTGTGTTTT
ATGGAATGTTATGCTTATGGTTATGTGAAAAATGGAGACTCAGGCAATGCTATTAAGATCTTTTTGGAAATGAGACATAGTGAAATTAAGCCCAACTCCGTAACCTTTGC
TTGTGTTTTATGTGTTTGTGGTGTGGAGGCAATGCTTGACTTAGGTACTCAACTTCACGCTGCGCGTAAACTGTTTGATATGATGCCACAAAACGACTTGGTGAGTTGGA
ATGGAATAATTTCTGGATACGTACAGAATGGTTTGATGAGTGAGGCTGAGCATTTGTTTCGTGGAATGATATCTGCAGGAATGAAGCCCGACTCGATCACTTTTGCAAGT
TTCCTTCCATGTGTTACTGAGTTGCTGAGTCTCAAACATTGTAAGGAAATTCATGGTTACATAGTAAGACATGCTGTAGTTTTGGATGTGTTCTTGAAAAGTGCTCTAAT
TGATATATACTTCAAGTGCAGGGATGTGGAAATCGCACGAAAAATTTTGCGTCCAAGTAGTTTGGTTGATGCTGTAGTGTGCACGGCCATGATTTCAGGATACGTGCTTA
ATGGGATGAACACAGAAGCATTGGAGGCATTTAGATGGTTGCTGCAGGAGAGAATGAAGCCAACTTCTGTGACTTTTGCGAGTGTCTTTCCAGCTTTTGCTGGTTTGGCC
GCTCTAAACTTAGGAAAGGAATTGCATGGATGCTATTTGCTGGAACTCCATGATTACAAGCTGTTCCCAGAACGGCAGACCGGAGGAGGCCATCAATATTTCCGTCAGAT
GGGAATGGAGGGAACTCAGTATGACTGTGTGAGCATATCTGGTGCACTATCTGCTTGTGCAAACTTACCTGCTCTCCATTATGGAAAAGAGATCCATGGCTTCATGATCA
AAAGCCCATTAAGATCTGACATTTATGCCGAGAGTTCACTGATAGACATGTATGCTAAGTGTGGAAACTTGAATTTCTCCCGGCTAGTGTTTGACACGATGCAACGGAAA
AATGAAGTCTCCTGGAACAGCATCATTTCTGCCTATGGCAACCACGACCATGTCACCTTTCTTGGTATCATATCTGCTTGCGGCCATGCTGGCCAAGTCGATGAAGGAAT
TAACTATTACCACCTCATGACAAAGGAATACGGGATCCCAGCTCGAATGGAGCACTATGCTTGCATGGTCGATTTGTTTGGCCGCGCAGGTCGTGTGGATGAAGCATATG
AAACCATAAAAAGCATGCCATTCCCTCCAGATGCTGGCGTATGGGGAACACTACTTGGGGCATGCCATGTTCATGGCAACGTTGAGCTCGCCGAAGTGGCATCAAAGTAT
CTGTTTGATTTAGACCCTCTAAACTCTGGCTACTACGTATTGCTTGCAAATGTGCAGGCTGGTGCCGGAAAATGGAGGAAGGTGCTTAAAGTACGTAGCATAATGAAAGA
ACGAGGAGTTCGGAAGGTTCCTGGTTATAGCTGGATCGAGATCAACAATGCCACCCACATGTTCGTTGCAGCGGACGGAAGCCATCCGCTCACTGCTCAGATCTATTCTG
TGCTGGATAGTCTTCTTCTAGAACTGAAAAAAGAAGGGTATGTTCCTCAACTCTACCTTCCAATACACCCACAAACTCTCAGTAAATCAATATCAGAAACTGTTTTACAA
GATTGA
Protein sequence
SNRRLKAATDKIQQSKSGIESKRFEGGLGLRLNEHDLINEACTTNFVFILQISSSFLATGVSLFNSIQLQKPDKPHLGFRQMWKRYWHQQTISRPGHCYWNSQNGDLGPK
ILGMYVLTGSLEDAKNVFYTLQLGCTSAWNWMIRGFTMMACGALNNVKMGKIVHETVNLMGLKEDAFVEGLCFMECYAYGYVKNGDSGNAIKIFLEMRHSEIKPNSVTFA
CVLCVCGVEAMLDLGTQLHAARKLFDMMPQNDLVSWNGIISGYVQNGLMSEAEHLFRGMISAGMKPDSITFASFLPCVTELLSLKHCKEIHGYIVRHAVVLDVFLKSALI
DIYFKCRDVEIARKILRPSSLVDAVVCTAMISGYVLNGMNTEALEAFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGCYLLELHDYKLFPERQTGGGHQYFRQM
GMEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKSPLRSDIYAESSLIDMYAKCGNLNFSRLVFDTMQRKNEVSWNSIISAYGNHDHVTFLGIISACGHAGQVDEGI
NYYHLMTKEYGIPARMEHYACMVDLFGRAGRVDEAYETIKSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFDLDPLNSGYYVLLANVQAGAGKWRKVLKVRSIMKE
RGVRKVPGYSWIEINNATHMFVAADGSHPLTAQIYSVLDSLLLELKKEGYVPQLYLPIHPQTLSKSISETVLQD