; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg27253 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg27253
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCarg_Chr14:11036358..11038083
RNA-Seq ExpressionCarg27253
SyntenyCarg27253
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582041.1 Isoleucine--tRNA ligase, cytoplasmic, partial [Cucurbita argyrosperma subsp. sororia]2.0e-28195.41Show/hide
Query:  SSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQILMNYRLFGRAKTLEFFSWSGLQMGY
        SSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQILMNYRLFGRAKTLEFFSWSGLQMGY
Subjt:  SSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQILMNYRLFGRAKTLEFFSWSGLQMGY

Query:  RFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALT
        RFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEP FGCKPDNLVFNNMLYALCKKEPTGELIDTALT
Subjt:  RFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALT

Query:  IFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------------------SGAIDPAVGVFWA
        IFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA                       SGAIDPAVGVFWA
Subjt:  IFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------------------SGAIDPAVGVFWA

Query:  ANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAE
        ANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAE
Subjt:  ANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAE

Query:  RVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMKWESQILQKLCKQGQLGDAYEKLKSM
        RVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMKWESQILQKLCKQGQLGDAYEKLKSM
Subjt:  RVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMKWESQILQKLCKQGQLGDAYEKLKSM

Query:  LEKGFYPPIYVRDAFESAFQKKG
        LEKGFYPPIYVRDAFESAFQKKG
Subjt:  LEKGFYPPIYVRDAFESAFQKKG

KAG7018469.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI
        MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI
Subjt:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI

Query:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN
        LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN
Subjt:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN

Query:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVF
        LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVF
Subjt:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVF

Query:  WAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDD
        WAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDD
Subjt:  WAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDD

Query:  AERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMKWESQILQKLCKQGQLGDAYEKLK
        AERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMKWESQILQKLCKQGQLGDAYEKLK
Subjt:  AERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMKWESQILQKLCKQGQLGDAYEKLK

Query:  SMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS
        SMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS
Subjt:  SMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS

XP_022955543.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita moschata]1.1e-30895.64Show/hide
Query:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI
        MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI
Subjt:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI

Query:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN
        LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEP FGCKPDN
Subjt:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN

Query:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------
        LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA           
Subjt:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------

Query:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD
                    SGAID AVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD
Subjt:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD

Query:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK
        MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK
Subjt:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK

Query:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS
        WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS
Subjt:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS

XP_022979738.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita maxima]8.2e-30494.08Show/hide
Query:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI
        MLSTSTIRNASAYLKFVAYGFSSYSSSTSST KRTAIAPRALARRPTSRTA IPRALDTDAVSSVCSLLSNK+HQTTNLELDHLLKRFKETLSSDFVLQI
Subjt:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI

Query:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN
        LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN
Subjt:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN

Query:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------
        LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRA LVPTRSAVNILIGDLCSLSAKEGA           
Subjt:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------

Query:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD
                    SGAI+PAVGVFWAANR+ALVPS FVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD
Subjt:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD

Query:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK
        MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYS+VDKLMREDGQTDLCLKLEMK
Subjt:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK

Query:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS
        WESQILQKLCKQGQLG AYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHES++RKAS
Subjt:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS

XP_023526018.1 pentatricopeptide repeat-containing protein At4g20090-like isoform X1 [Cucurbita pepo subsp. pepo]1.1e-30093.21Show/hide
Query:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI
        MLSTSTIRNASAYLKFVAYGFSS S STSST KRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQT NLELDHLLKRFKET+SSDFVLQI
Subjt:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI

Query:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN
        LMNYRLFGRAKTLEFFSWS LQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEP FGCKPDN
Subjt:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN

Query:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------
        LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA           
Subjt:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------

Query:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD
                    SGAI+PAVGVFWAANRLALVPS FVIVRLI ELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDL GRMLSQD
Subjt:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD

Query:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK
        MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRK+CVPDHVTYSALIHAYGEARNWSA YSLLK+MLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK
Subjt:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK

Query:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS
        WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHES SRKAS
Subjt:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS

TrEMBL top hitse value%identityAlignment
A0A0A0KU61 Uncharacterized protein3.8e-24680.69Show/hide
Query:  TIRNASAYLKFVA---YGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDT----DAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVL
        TIR++SA LK ++   +G SS+  STS T    AIAPRALARRPTSRTAP PR+ +T    D V+SVCSLLSNKN QT NL+LDHLLKRFK+ LSSDFVL
Subjt:  TIRNASAYLKFVA---YGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDT----DAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVL

Query:  QILMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKP
        QILMNY+L GRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV S+KGRISCRTFSICIRFLGRQGRVREALCLFEEMEP FGCKP
Subjt:  QILMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA---------
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM RAGLVPTR+AVNILIG+LCSLSAKEGA         
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA---------

Query:  --------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLS
                      SGAI+PAVG+FWAAN+L+LVPS+FV V+LISELCRLGQMQEAIRVLKVVE +KLRC EECYS+VM+ALCEHR VDEASDLFGRMLS
Subjt:  --------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLS

Query:  QDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLE
        Q MKPKLAIYN VICMLCKLGNLD AERVF IMN+KRC PDHVTYSALIHAYGE R+WSAAY LLKEMLSLG+SPHFHVYS+VDKLMRE GQ DLCLKLE
Subjt:  QDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLE

Query:  MKWESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKK
        MKWE+QILQKLCKQGQL  AYEK+KSMLEKG  PPIYVRDAFESAFQKK
Subjt:  MKWESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKK

A0A1S3BXL0 pentatricopeptide repeat-containing protein At5g65560-like isoform X15.6e-25879.86Show/hide
Query:  TIRNASAYLKFVA---YGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDT----DAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVL
        TIR++SA LK ++   +GFSS+  STS T K  AIAPRAL RRPTSRTAP PR+ +T    D V+SVCSLLSNKN QT NL+++HLLKRFK+ LSSD VL
Subjt:  TIRNASAYLKFVA---YGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDT----DAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVL

Query:  QILMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKP
        QILMNY+L GRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV S+KGRISCRTFSICIRFLGRQGRVREALCLFEEMEP FGCKP
Subjt:  QILMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA---------
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM RAGLVPTRSA NILIG+LCSLSAKEGA         
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA---------

Query:  --------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLS
                      SGAI+PAVG+FWAAN+L LVPS+FV V+LISELCR+GQMQEAI+VLKVVE +KLRC EECYS+VM+ALCEHR +DEASDLFGRMLS
Subjt:  --------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLS

Query:  QDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLE
        Q MKPKLAIYN VICMLCKLGNLD AERVF IMN+KRC PDHVTYSALIHAYGE RNWSAAY LLKEMLSLG+SPHFHVYS+VDKLMRE GQ DLCLKLE
Subjt:  QDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLE

Query:  MKWESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS
        MKWE+QILQKLCKQGQL  AYEK+KSMLEKG  PPIYVRDAFESAFQKKGKFKIARELLQ MDGVH+HES +R +S
Subjt:  MKWESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS

A0A6J1CV78 pentatricopeptide repeat-containing protein At5g39710-like1.0e-25180.42Show/hide
Query:  RNASAYLKFVAYGFSSYSSS---TSSTRKRTAIAPRALARRPTSRTAPIPRALD----TDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI
        RN S  LKF    FS +SS+   TS+TR   AIAPR  ARRPTSR+AP+PRALD    TD V+SVCSLLSNKNHQTTNL+LD LLKRF E LSSD VL+I
Subjt:  RNASAYLKFVAYGFSSYSSS---TSSTRKRTAIAPRALARRPTSRTAPIPRALD----TDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI

Query:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN
        LMNYR+ GRAKTLEFFSWSGLQMGYRFDESVVEYMADF GRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEP FGCKPDN
Subjt:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN

Query:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------
        LVFNN+LYALCKKE TGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TA+EVF+EM R G VPTRSAVNILIGDLCSLSAKEGA           
Subjt:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------

Query:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD
                    SGAI+PAVGVFWAANR+ALVPS+FV+V+LISELCRLGQMQEAI VLKVVE  KLRC EEC+SIVMQALCE+R+V+EASDLFGRMLSQ 
Subjt:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD

Query:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK
        MKPKLA+YNSVICMLCKLGN+ DAERVFKIMNRKRCVPD VTYSALIHAY E  NWSAAYSLLKEMLSLG+SPHFH+YS VDKLMRE GQ DLCLKLEMK
Subjt:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK

Query:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHE
        WE+QILQKLCKQGQL  AYEKLKSMLEKG +PP YVRDAFE+AFQK GK+KIARELL+ + GVH  E
Subjt:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHE

A0A6J1GU90 pentatricopeptide repeat-containing protein At4g20090-like5.3e-30995.64Show/hide
Query:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI
        MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI
Subjt:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI

Query:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN
        LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEP FGCKPDN
Subjt:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN

Query:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------
        LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA           
Subjt:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------

Query:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD
                    SGAID AVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD
Subjt:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD

Query:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK
        MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK
Subjt:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK

Query:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS
        WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS
Subjt:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS

A0A6J1IX53 pentatricopeptide repeat-containing protein At4g20090-like4.0e-30494.08Show/hide
Query:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI
        MLSTSTIRNASAYLKFVAYGFSSYSSSTSST KRTAIAPRALARRPTSRTA IPRALDTDAVSSVCSLLSNK+HQTTNLELDHLLKRFKETLSSDFVLQI
Subjt:  MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQI

Query:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN
        LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN
Subjt:  LMNYRLFGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDN

Query:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------
        LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRA LVPTRSAVNILIGDLCSLSAKEGA           
Subjt:  LVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGA-----------

Query:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD
                    SGAI+PAVGVFWAANR+ALVPS FVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD
Subjt:  ------------SGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQD

Query:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK
        MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYS+VDKLMREDGQTDLCLKLEMK
Subjt:  MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMK

Query:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS
        WESQILQKLCKQGQLG AYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHES++RKAS
Subjt:  WESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKAS

SwissProt top hitse value%identityAlignment
Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic9.8e-3428.01Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAG
        T++  I  L + G V+EA+ + ++M  T  C P+ + +N ++  LCK+    E  + A  +  +  LPD  +++++I GLC       A+E+F+EMR  G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAG

Query:  LVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDE
          P     N+LI  LCS        G +D A+ +           S      LI   C+  + +EA  +   +EV+ +      Y+ ++  LC+ RRV++
Subjt:  LVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDE

Query:  ASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYS-MVDKLMRE
        A+ L  +M+ +  KP    YNS++   C+ G++  A  + + M    C PD VTY  LI    +A     A  LL+ +   GI+   H Y+ ++  L R+
Subjt:  ASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYS-MVDKLMRE

Query:  DGQTD-LCLKLEMKWESQ----------ILQKLCK-QGQLGDAYEKLKSMLEKGFYP
           T+ + L  EM  +++          + + LC   G + +A + L  +LEKGF P
Subjt:  DGQTD-LCLKLEMKWESQ----------ILQKLCK-QGQLGDAYEKLKSMLEKGFYP

Q9LQ16 Pentatricopeptide repeat-containing protein At1g629101.7e-3325Show/hide
Query:  LQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELI
        +++GY  D   +  + +     K   D   L+  +     +    TF+  I  L    +  EA+ L ++M    GC+PD + +  ++  LCK+   G+ I
Subjt:  LQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELI

Query:  DTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIV
        D AL++ +++E      D   Y+ II GLCK+     A+ +F EM   G+ P     + LI  LC+      AS  +   +          + P+     
Subjt:  DTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIV

Query:  RLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPD
         LI    + G++ EA ++   +    +      YS ++   C H R+DEA  +F  M+S+D  P +  Y+++I   CK   +++   +F+ M+++  V +
Subjt:  RLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPD

Query:  HVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYS-MVDKLMREDGQTDLCLKLEMKWESQ----------ILQKLCKQGQLGDAYEKLKSMLEK
         VTY+ LIH + +AR+   A  + K+M+S+G+ P+   Y+ ++D L +        +  E    S           +++ +CK G++ D +E   ++  K
Subjt:  HVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYS-MVDKLMREDGQTDLCLKLEMKWESQ----------ILQKLCKQGQLGDAYEKLKSMLEK

Query:  GFYPPIYVRDAFESAFQKKGKFKIARELLQTM
        G  P +   +   S F +KG  + A  LL+ M
Subjt:  GFYPPIYVRDAFESAFQKKGKFKIARELLQTM

Q9LSL9 Pentatricopeptide repeat-containing protein At5g655604.9e-3324.41Show/hide
Query:  EFFSWSGLQMGY---------------------RFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEP
        +FF+++ L MGY                     R +E    ++   L   +  D+   L V +   +   + RT+++ I+ L    R  EAL L +EME 
Subjt:  EFFSWSGLQMGY---------------------RFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEP

Query:  TFGCKPDNLVFNNMLYALCK-------KEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSA
        T G KP+   +  ++ +LC        +E  G++++  L       +P+  +Y+ +I G CK G    AV+V + M    L P     N LI   C  + 
Subjt:  TFGCKPDNLVFNNMLYALCK-------KEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSA

Query:  KEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIY
               +  A+GV        ++P       LI   CR G    A R+L ++    L   +  Y+ ++ +LC+ +RV+EA DLF  +  + + P + +Y
Subjt:  KEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIY

Query:  NSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLC-----------LKL
         ++I   CK G +D+A  + + M  K C+P+ +T++ALIH          A  L ++M+ +G+ P     +++   + +DG  D              K 
Subjt:  NSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLC-----------LKL

Query:  EMKWESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTM
        +    +  +Q  C++G+L DA + +  M E G  P ++   +    +   G+   A ++L+ M
Subjt:  EMKWESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTM

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial1.0e-3025.58Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFGTAVEVFDEM
        TF+  I  L    +  EA+ L + M    GC+PD + +  ++  LCK+  T    D A  +  ++E     P    Y+ II GLCK+     A+ +F EM
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFGTAVEVFDEM

Query:  RRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHR
           G+ P     + LI  LC+      AS  +   +          + P  F    LI    + G++ EA ++   +    +  +   YS ++   C H 
Subjt:  RRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHR

Query:  RVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKL
        R+DEA  +F  M+S+   P +  YN++I   CK   +++   VF+ M+++  V + VTY+ LI    +A +   A  + KEM+S G+ P+   Y+ +   
Subjt:  RVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKL

Query:  MREDGQTDLCLKL-----EMKWESQI------LQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTM
        + ++G+ +  + +       K E  I      ++ +CK G++ D ++   ++  KG  P +   +   S F +KG  + A  L + M
Subjt:  MREDGQTDLCLKL-----EMKWESQI------LQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTM

Q9ZUE9 Pentatricopeptide repeat-containing protein At2g060002.7e-3128.31Show/hide
Query:  RTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIE-----LPDKYSYSNIIIGLCKFGRFGTAVEVFD
        +TF+I IR L   G+  +AL L   M   FGC+PD + +N ++   CK       ++ A  +F+ ++      PD  +Y+++I G CK G+   A  + D
Subjt:  RTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIE-----LPDKYSYSNIIIGLCKFGRFGTAVEVFD

Query:  EMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCE
        +M R G+ PT    N+L+       AK G     +   G   +       P       LI   CR+GQ+ +  R+ + +    +      YSI++ ALC 
Subjt:  EMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCE

Query:  HRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVD
          R+ +A +L G++ S+D+ P+  +YN VI   CK G +++A  + + M +K+C PD +T++ LI  +        A S+  +M+++G SP        D
Subjt:  HRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVD

Query:  KLMREDGQTDLCLKLEMKWESQILQKLCKQGQ
        K+          LK  M  E+  L ++ ++GQ
Subjt:  KLMREDGQTDLCLKLEMKWESQILQKLCKQGQ

Arabidopsis top hitse value%identityAlignment
AT1G62910.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-3425Show/hide
Query:  LQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELI
        +++GY  D   +  + +     K   D   L+  +     +    TF+  I  L    +  EA+ L ++M    GC+PD + +  ++  LCK+   G+ I
Subjt:  LQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELI

Query:  DTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIV
        D AL++ +++E      D   Y+ II GLCK+     A+ +F EM   G+ P     + LI  LC+      AS  +   +          + P+     
Subjt:  DTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIV

Query:  RLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPD
         LI    + G++ EA ++   +    +      YS ++   C H R+DEA  +F  M+S+D  P +  Y+++I   CK   +++   +F+ M+++  V +
Subjt:  RLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPD

Query:  HVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYS-MVDKLMREDGQTDLCLKLEMKWESQ----------ILQKLCKQGQLGDAYEKLKSMLEK
         VTY+ LIH + +AR+   A  + K+M+S+G+ P+   Y+ ++D L +        +  E    S           +++ +CK G++ D +E   ++  K
Subjt:  HVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYS-MVDKLMREDGQTDLCLKLEMKWESQ----------ILQKLCKQGQLGDAYEKLKSMLEK

Query:  GFYPPIYVRDAFESAFQKKGKFKIARELLQTM
        G  P +   +   S F +KG  + A  LL+ M
Subjt:  GFYPPIYVRDAFESAFQKKGKFKIARELLQTM

AT2G06000.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-3228.31Show/hide
Query:  RTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIE-----LPDKYSYSNIIIGLCKFGRFGTAVEVFD
        +TF+I IR L   G+  +AL L   M   FGC+PD + +N ++   CK       ++ A  +F+ ++      PD  +Y+++I G CK G+   A  + D
Subjt:  RTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIE-----LPDKYSYSNIIIGLCKFGRFGTAVEVFD

Query:  EMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCE
        +M R G+ PT    N+L+       AK G     +   G   +       P       LI   CR+GQ+ +  R+ + +    +      YSI++ ALC 
Subjt:  EMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCE

Query:  HRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVD
          R+ +A +L G++ S+D+ P+  +YN VI   CK G +++A  + + M +K+C PD +T++ LI  +        A S+  +M+++G SP        D
Subjt:  HRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVD

Query:  KLMREDGQTDLCLKLEMKWESQILQKLCKQGQ
        K+          LK  M  E+  L ++ ++GQ
Subjt:  KLMREDGQTDLCLKLEMKWESQILQKLCKQGQ

AT2G06000.2 Pentatricopeptide repeat (PPR) superfamily protein1.9e-3228.31Show/hide
Query:  RTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIE-----LPDKYSYSNIIIGLCKFGRFGTAVEVFD
        +TF+I IR L   G+  +AL L   M   FGC+PD + +N ++   CK       ++ A  +F+ ++      PD  +Y+++I G CK G+   A  + D
Subjt:  RTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIE-----LPDKYSYSNIIIGLCKFGRFGTAVEVFD

Query:  EMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCE
        +M R G+ PT    N+L+       AK G     +   G   +       P       LI   CR+GQ+ +  R+ + +    +      YSI++ ALC 
Subjt:  EMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCE

Query:  HRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVD
          R+ +A +L G++ S+D+ P+  +YN VI   CK G +++A  + + M +K+C PD +T++ LI  +        A S+  +M+++G SP        D
Subjt:  HRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVD

Query:  KLMREDGQTDLCLKLEMKWESQILQKLCKQGQ
        K+          LK  M  E+  L ++ ++GQ
Subjt:  KLMREDGQTDLCLKLEMKWESQILQKLCKQGQ

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein7.0e-3528.01Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAG
        T++  I  L + G V+EA+ + ++M  T  C P+ + +N ++  LCK+    E  + A  +  +  LPD  +++++I GLC       A+E+F+EMR  G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAG

Query:  LVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDE
          P     N+LI  LCS        G +D A+ +           S      LI   C+  + +EA  +   +EV+ +      Y+ ++  LC+ RRV++
Subjt:  LVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDE

Query:  ASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYS-MVDKLMRE
        A+ L  +M+ +  KP    YNS++   C+ G++  A  + + M    C PD VTY  LI    +A     A  LL+ +   GI+   H Y+ ++  L R+
Subjt:  ASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYS-MVDKLMRE

Query:  DGQTD-LCLKLEMKWESQ----------ILQKLCK-QGQLGDAYEKLKSMLEKGFYP
           T+ + L  EM  +++          + + LC   G + +A + L  +LEKGF P
Subjt:  DGQTD-LCLKLEMKWESQ----------ILQKLCK-QGQLGDAYEKLKSMLEKGFYP

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein3.5e-3424.41Show/hide
Query:  EFFSWSGLQMGY---------------------RFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEP
        +FF+++ L MGY                     R +E    ++   L   +  D+   L V +   +   + RT+++ I+ L    R  EAL L +EME 
Subjt:  EFFSWSGLQMGY---------------------RFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEP

Query:  TFGCKPDNLVFNNMLYALCK-------KEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSA
        T G KP+   +  ++ +LC        +E  G++++  L       +P+  +Y+ +I G CK G    AV+V + M    L P     N LI   C  + 
Subjt:  TFGCKPDNLVFNNMLYALCK-------KEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSA

Query:  KEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIY
               +  A+GV        ++P       LI   CR G    A R+L ++    L   +  Y+ ++ +LC+ +RV+EA DLF  +  + + P + +Y
Subjt:  KEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQEAIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIY

Query:  NSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLC-----------LKL
         ++I   CK G +D+A  + + M  K C+P+ +T++ALIH          A  L ++M+ +G+ P     +++   + +DG  D              K 
Subjt:  NSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLLKEMLSLGISPHFHVYSMVDKLMREDGQTDLC-----------LKL

Query:  EMKWESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTM
        +    +  +Q  C++G+L DA + +  M E G  P ++   +    +   G+   A ++L+ M
Subjt:  EMKWESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGCACGAGCACCATTAGGAATGCTTCGGCGTACCTCAAATTCGTTGCTTATGGCTTCTCTTCATACTCTTCCAGCACTTCATCGACCAGAAAGCGCACTGCCAT
AGCTCCAAGAGCTCTTGCAAGAAGACCCACTTCGCGAACTGCCCCAATCCCTCGGGCTTTGGACACAGATGCCGTCAGTTCAGTATGTTCTTTACTTTCAAACAAAAATC
ACCAAACAACTAATCTCGAACTTGATCATTTATTGAAAAGATTCAAAGAAACCTTGAGCTCGGATTTCGTTCTTCAAATTCTGATGAATTATAGGCTGTTCGGTAGGGCT
AAAACGCTAGAATTCTTCTCCTGGTCTGGATTGCAAATGGGGTATCGGTTTGATGAGTCCGTGGTTGAGTACATGGCTGATTTCTTAGGTAGAAGGAAACTGTTTGATGA
TATGAAATGTCTTTTGGTGACGGTGTCGTCTTATAAAGGTCGGATTTCTTGTCGAACATTTTCAATTTGTATTAGGTTTTTGGGTAGGCAAGGGAGGGTTAGAGAAGCAC
TTTGCTTATTTGAAGAAATGGAGCCAACATTTGGGTGTAAACCTGATAATCTGGTCTTTAATAACATGCTTTATGCACTTTGTAAGAAGGAACCAACAGGGGAATTGATT
GATACTGCTCTAACAATTTTCAGAAGAATTGAATTGCCTGATAAATATTCATACAGTAACATAATTATTGGATTATGTAAATTTGGTAGGTTTGGTACTGCTGTTGAAGT
GTTTGATGAAATGCGTAGGGCAGGTTTGGTACCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCATTGAGTGCCAAAGAAGGGGCTAGCGGAGCCATTG
ATCCTGCAGTTGGAGTTTTTTGGGCAGCTAATAGGCTGGCTTTAGTTCCCAGTACTTTTGTAATAGTTCGGCTCATCTCGGAGCTTTGTCGATTAGGCCAAATGCAAGAA
GCAATTAGAGTATTAAAGGTTGTCGAGGTTAACAAGCTAAGGTGTACTGAAGAGTGTTATTCCATTGTGATGCAAGCATTGTGTGAACATCGTCGAGTAGACGAAGCTAG
TGATCTGTTTGGGAGGATGCTTTCTCAGGATATGAAGCCAAAGTTGGCTATTTACAATTCTGTTATTTGTATGCTATGTAAATTAGGAAATTTGGATGATGCTGAAAGGG
TCTTCAAGATTATGAACAGGAAAAGATGCGTCCCGGATCATGTTACTTACTCGGCGCTAATCCATGCCTACGGTGAAGCTAGGAATTGGTCAGCGGCCTACAGTTTATTG
AAGGAAATGTTGAGTTTAGGCATATCCCCGCATTTTCATGTGTATAGTATGGTGGATAAACTAATGAGGGAAGATGGGCAAACTGATCTGTGCTTGAAGCTGGAAATGAA
GTGGGAATCCCAAATTTTGCAGAAACTTTGTAAACAAGGACAACTGGGGGATGCGTATGAAAAGCTAAAGTCAATGCTTGAAAAGGGTTTTTATCCTCCTATCTATGTAA
GAGATGCTTTCGAGAGCGCGTTTCAAAAGAAGGGTAAGTTTAAGATTGCTCGGGAGTTGCTGCAGACGATGGACGGAGTCCACGAACATGAGTCGGATTCCAGAAAGGCA
TCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAGCACGAGCACCATTAGGAATGCTTCGGCGTACCTCAAATTCGTTGCTTATGGCTTCTCTTCATACTCTTCCAGCACTTCATCGACCAGAAAGCGCACTGCCAT
AGCTCCAAGAGCTCTTGCAAGAAGACCCACTTCGCGAACTGCCCCAATCCCTCGGGCTTTGGACACAGATGCCGTCAGTTCAGTATGTTCTTTACTTTCAAACAAAAATC
ACCAAACAACTAATCTCGAACTTGATCATTTATTGAAAAGATTCAAAGAAACCTTGAGCTCGGATTTCGTTCTTCAAATTCTGATGAATTATAGGCTGTTCGGTAGGGCT
AAAACGCTAGAATTCTTCTCCTGGTCTGGATTGCAAATGGGGTATCGGTTTGATGAGTCCGTGGTTGAGTACATGGCTGATTTCTTAGGTAGAAGGAAACTGTTTGATGA
TATGAAATGTCTTTTGGTGACGGTGTCGTCTTATAAAGGTCGGATTTCTTGTCGAACATTTTCAATTTGTATTAGGTTTTTGGGTAGGCAAGGGAGGGTTAGAGAAGCAC
TTTGCTTATTTGAAGAAATGGAGCCAACATTTGGGTGTAAACCTGATAATCTGGTCTTTAATAACATGCTTTATGCACTTTGTAAGAAGGAACCAACAGGGGAATTGATT
GATACTGCTCTAACAATTTTCAGAAGAATTGAATTGCCTGATAAATATTCATACAGTAACATAATTATTGGATTATGTAAATTTGGTAGGTTTGGTACTGCTGTTGAAGT
GTTTGATGAAATGCGTAGGGCAGGTTTGGTACCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCATTGAGTGCCAAAGAAGGGGCTAGCGGAGCCATTG
ATCCTGCAGTTGGAGTTTTTTGGGCAGCTAATAGGCTGGCTTTAGTTCCCAGTACTTTTGTAATAGTTCGGCTCATCTCGGAGCTTTGTCGATTAGGCCAAATGCAAGAA
GCAATTAGAGTATTAAAGGTTGTCGAGGTTAACAAGCTAAGGTGTACTGAAGAGTGTTATTCCATTGTGATGCAAGCATTGTGTGAACATCGTCGAGTAGACGAAGCTAG
TGATCTGTTTGGGAGGATGCTTTCTCAGGATATGAAGCCAAAGTTGGCTATTTACAATTCTGTTATTTGTATGCTATGTAAATTAGGAAATTTGGATGATGCTGAAAGGG
TCTTCAAGATTATGAACAGGAAAAGATGCGTCCCGGATCATGTTACTTACTCGGCGCTAATCCATGCCTACGGTGAAGCTAGGAATTGGTCAGCGGCCTACAGTTTATTG
AAGGAAATGTTGAGTTTAGGCATATCCCCGCATTTTCATGTGTATAGTATGGTGGATAAACTAATGAGGGAAGATGGGCAAACTGATCTGTGCTTGAAGCTGGAAATGAA
GTGGGAATCCCAAATTTTGCAGAAACTTTGTAAACAAGGACAACTGGGGGATGCGTATGAAAAGCTAAAGTCAATGCTTGAAAAGGGTTTTTATCCTCCTATCTATGTAA
GAGATGCTTTCGAGAGCGCGTTTCAAAAGAAGGGTAAGTTTAAGATTGCTCGGGAGTTGCTGCAGACGATGGACGGAGTCCACGAACATGAGTCGGATTCCAGAAAGGCA
TCATGA
Protein sequenceShow/hide protein sequence
MLSTSTIRNASAYLKFVAYGFSSYSSSTSSTRKRTAIAPRALARRPTSRTAPIPRALDTDAVSSVCSLLSNKNHQTTNLELDHLLKRFKETLSSDFVLQILMNYRLFGRA
KTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSYKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPTFGCKPDNLVFNNMLYALCKKEPTGELI
DTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTAVEVFDEMRRAGLVPTRSAVNILIGDLCSLSAKEGASGAIDPAVGVFWAANRLALVPSTFVIVRLISELCRLGQMQE
AIRVLKVVEVNKLRCTEECYSIVMQALCEHRRVDEASDLFGRMLSQDMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGEARNWSAAYSLL
KEMLSLGISPHFHVYSMVDKLMREDGQTDLCLKLEMKWESQILQKLCKQGQLGDAYEKLKSMLEKGFYPPIYVRDAFESAFQKKGKFKIARELLQTMDGVHEHESDSRKA
S