; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G04560 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G04560
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr7:3406500..3409024
RNA-Seq ExpressionCSPI07G04560
SyntenyCSPI07G04560
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031579.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]2.3e-24988.07Show/hide
Query:  MIRKLRSWNNNLISNLLIQTS----------KTLSLPFSST-PPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALE
        M+RKLRSWNNNLI NLLIQTS          KTLSLPFSST PPQ  ILR +II+IR PKISV+PVLEKWVGDGRAI KPELQYLV+L K+ RRFNHALE
Subjt:  MIRKLRSWNNNLISNLLIQTS----------KTLSLPFSST-PPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALE

Query:  ISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHD
        ISQWMTDRRY+SLS SDAA+RLDLIHSVHGLEHAENYFNSIS RLKTSNVYG+LLGCYVREKS+EKAEAIMQEMRKMGIA TSFAYNVLINLYAQIGQH+
Subjt:  ISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHD

Query:  KIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLY
        KIDLLIEEMK K IPQDIYSIRNLCAAYVAK DISGMEKILKRIEEDSE KADW IYSIAANGYLTAGLETEALSML K E+K+RPNTNK AF+FLLSLY
Subjt:  KIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLY

Query:  ERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILA
        ERTGHKNEVYRVWNTFKPLT++T VPYALMITSLAKLDD+EGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAE VVNQAVV RTPF STWS+LA
Subjt:  ERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILA

Query:  TGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEE
        TGYAEYGHMSKAVEMLKKA+LVGRQNWKPK+ DILEACLDYLEKQGDAETM+EIVRLCKSSGTV KEMYYRLLRTSIAGGKPV+SILEQMKMDGFAADEE
Subjt:  TGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEE

Query:  VDK
         +K
Subjt:  VDK

KAE8645769.1 hypothetical protein Csa_020441 [Cucumis sativus]6.5e-28499.6Show/hide
Query:  MIRKLRSWNNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYL
        MIRKLRSWNNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKI+NIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYL
Subjt:  MIRKLRSWNNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYL

Query:  SLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKT
        SLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKT
Subjt:  SLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKT

Query:  KRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYR
        K IPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYR
Subjt:  KRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYR

Query:  VWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSK
        VWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSK
Subjt:  VWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSK

Query:  AVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL
        AVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL
Subjt:  AVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL

XP_016901686.1 PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis melo]1.4e-25488.65Show/hide
Query:  MIRKLRSWNNNLISNLLIQT----------SKTLSLPFSST-PPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALE
        M+RKLRSWNNNLI NLLIQT          +KTLSLPFSST PPQ  ILR +II+IR PKISV+PVLEKWVGDGRAI KPELQYLV+L K+ RRFNHALE
Subjt:  MIRKLRSWNNNLISNLLIQT----------SKTLSLPFSST-PPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALE

Query:  ISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHD
        ISQWMTDRRYLSLS SDAA+RLDLIHSVHGLEHAENYFNSIS RLKTSNVYG+LL CYVREKS+EKAEAIMQEMRKMGIA TSFAYNVLINLYAQIGQH+
Subjt:  ISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHD

Query:  KIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLY
        KIDLLIEEMK K IPQDIYSIRNLCAAYVAK DISGMEKILKRIEEDSE KADW IYSIAANGYLTAGLETEALSML K E+K+RPNTNK AF+FLLSLY
Subjt:  KIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLY

Query:  ERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILA
        ERTGHKNEVYRVWNTFKPLT++T VPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAE VVNQAVV RTPF STWS+LA
Subjt:  ERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILA

Query:  TGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEE
        TGYAEYGHMSKAVEMLKKA+LVGRQNWKPK+ DILEACLDYLEKQGDAETM+EIVRLCKSSGTV KEMYYRLLRTSIAGGKPV+SILEQMKMDGFAADEE
Subjt:  TGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEE

Query:  VDKILGSKTNL
        VDKILGSKTNL
Subjt:  VDKILGSKTNL

XP_031744657.1 pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucumis sativus]6.5e-28499.6Show/hide
Query:  MIRKLRSWNNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYL
        MIRKLRSWNNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKI+NIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYL
Subjt:  MIRKLRSWNNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYL

Query:  SLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKT
        SLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKT
Subjt:  SLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKT

Query:  KRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYR
        K IPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYR
Subjt:  KRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYR

Query:  VWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSK
        VWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSK
Subjt:  VWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSK

Query:  AVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL
        AVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL
Subjt:  AVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL

XP_038888307.1 pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like isoform X1 [Benincasa hispida]1.0e-22078.75Show/hide
Query:  MIRKLRSWNNNLISNL------------LIQTSKTLSLPFSST-PPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHA
        MI KLRSW NNL+ NL               T   +S PFSST PP    L  KI+ IR PKISV+PVLEKWVGDG AIG  ELQ LV+LMK+ RRFNHA
Subjt:  MIRKLRSWNNNLISNL------------LIQTSKTLSLPFSST-PPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHA

Query:  LEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQ
        L+ISQW+TDRRY SLSPSDAA+R+DLIH VHGLEHAENYFNSIS +LKTSNVYGALL  YVREKS+EKAEAIMQEMRKMGIA TSF YNVLINLYAQIGQ
Subjt:  LEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQ

Query:  HDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLS
        HDKIDLL +EMK K IPQDIY+IRNLCAAYVAK DI G+EKILKRI E SELKADW IYSIAA+GYL+AGLET+ALSMLKK EEK+  N NK AF+FLL+
Subjt:  HDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLS

Query:  LYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSI
        LYERTG K+E+YRVW+TFKPL ++T VPYALMITSL KLDDIEGAERIFQEWESKCT YDFRVLNRLLVAYCRKGLLDKAESVVNQAVV RTP+ STWS+
Subjt:  LYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSI

Query:  LATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAAD
        LATGYAE+G MSKAVEMLKKA+LVGRQ+W+PK  DILEACLDYLE+QGDAETM+EI+RLCKSSG V KE+YYRLLRTSIAGGK V+SILEQMKMDGF+ D
Subjt:  LATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAAD

Query:  EEVDKILGSKTNL
        EEVDKILG+KTNL
Subjt:  EEVDKILGSKTNL

TrEMBL top hitse value%identityAlignment
A0A0A0K2B7 Uncharacterized protein7.6e-23099.75Show/hide
Query:  MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDL
        MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDL
Subjt:  MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDL

Query:  LIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTG
        LIEEMKTK IPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTG
Subjt:  LIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTG

Query:  HKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYA
        HKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYA
Subjt:  HKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYA

Query:  EYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKI
        EYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKI
Subjt:  EYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKI

Query:  LGSKTNL
        LGSKTNL
Subjt:  LGSKTNL

A0A1S4E0D2 pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like6.9e-25588.65Show/hide
Query:  MIRKLRSWNNNLISNLLIQT----------SKTLSLPFSST-PPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALE
        M+RKLRSWNNNLI NLLIQT          +KTLSLPFSST PPQ  ILR +II+IR PKISV+PVLEKWVGDGRAI KPELQYLV+L K+ RRFNHALE
Subjt:  MIRKLRSWNNNLISNLLIQT----------SKTLSLPFSST-PPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALE

Query:  ISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHD
        ISQWMTDRRYLSLS SDAA+RLDLIHSVHGLEHAENYFNSIS RLKTSNVYG+LL CYVREKS+EKAEAIMQEMRKMGIA TSFAYNVLINLYAQIGQH+
Subjt:  ISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHD

Query:  KIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLY
        KIDLLIEEMK K IPQDIYSIRNLCAAYVAK DISGMEKILKRIEEDSE KADW IYSIAANGYLTAGLETEALSML K E+K+RPNTNK AF+FLLSLY
Subjt:  KIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLY

Query:  ERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILA
        ERTGHKNEVYRVWNTFKPLT++T VPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAE VVNQAVV RTPF STWS+LA
Subjt:  ERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILA

Query:  TGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEE
        TGYAEYGHMSKAVEMLKKA+LVGRQNWKPK+ DILEACLDYLEKQGDAETM+EIVRLCKSSGTV KEMYYRLLRTSIAGGKPV+SILEQMKMDGFAADEE
Subjt:  TGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEE

Query:  VDKILGSKTNL
        VDKILGSKTNL
Subjt:  VDKILGSKTNL

A0A5A7SQP0 Putative pentatricopeptide repeat-containing protein1.1e-24988.07Show/hide
Query:  MIRKLRSWNNNLISNLLIQTS----------KTLSLPFSST-PPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALE
        M+RKLRSWNNNLI NLLIQTS          KTLSLPFSST PPQ  ILR +II+IR PKISV+PVLEKWVGDGRAI KPELQYLV+L K+ RRFNHALE
Subjt:  MIRKLRSWNNNLISNLLIQTS----------KTLSLPFSST-PPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALE

Query:  ISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHD
        ISQWMTDRRY+SLS SDAA+RLDLIHSVHGLEHAENYFNSIS RLKTSNVYG+LLGCYVREKS+EKAEAIMQEMRKMGIA TSFAYNVLINLYAQIGQH+
Subjt:  ISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHD

Query:  KIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLY
        KIDLLIEEMK K IPQDIYSIRNLCAAYVAK DISGMEKILKRIEEDSE KADW IYSIAANGYLTAGLETEALSML K E+K+RPNTNK AF+FLLSLY
Subjt:  KIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLY

Query:  ERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILA
        ERTGHKNEVYRVWNTFKPLT++T VPYALMITSLAKLDD+EGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAE VVNQAVV RTPF STWS+LA
Subjt:  ERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILA

Query:  TGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEE
        TGYAEYGHMSKAVEMLKKA+LVGRQNWKPK+ DILEACLDYLEKQGDAETM+EIVRLCKSSGTV KEMYYRLLRTSIAGGKPV+SILEQMKMDGFAADEE
Subjt:  TGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEE

Query:  VDK
         +K
Subjt:  VDK

A0A6J1GLA8 pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like3.0e-21074.36Show/hide
Query:  MIRKLRSWNNNLISNLLIQTSKTLSLP-----------FSSTPPQLA-ILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHAL
        M+ KLRSWN   I NLL +  +++ LP           FSSTPP  +  L  KI+ IR PKISV+PVLEKWVGDGRAIGK ELQ LV LMK  RRFNHAL
Subjt:  MIRKLRSWNNNLISNLLIQTSKTLSLP-----------FSSTPPQLA-ILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHAL

Query:  EISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQH
        EIS+WMTDRRY +LS SDAA+RLDLI  VHGLEHAE+YFNSIS +L+TSN YGALL  YVRE+S+EKAEAIMQEMR +G ATTSF YNVLINLYAQ+GQH
Subjt:  EISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQH

Query:  DKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSL
         KIDLLI+EM+ K IPQDIY++RNL AAYVA ADISGMEKILKRIEE+SE +ADW IYSIAA+GYL+AGLETEALSMLKK EEK+ P  NK AF+FLLSL
Subjt:  DKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSL

Query:  YERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSIL
        YER G K+E+YRVW+TFK   K+  VPYALMITSL KLDDIEGAERIFQEWESKCT YDFRVLNRLLVAYCRKGL DKAES+VN+AV+ RTP+ STWS+L
Subjt:  YERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSIL

Query:  ATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADE
        A GY E+G MSKAVEMLK+A+LVGRQ+WKP Q DILEACL+YLE+QGDAETM+E++RLC+SSG++ KEMYYR LRTSIAGGKPV+SIL QM+MDGF ADE
Subjt:  ATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADE

Query:  EVDKILGSKTN
        EV KILG+KT+
Subjt:  EVDKILGSKTN

A0A6J1HYW6 pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like2.1e-21174.95Show/hide
Query:  MIRKLRSWNNNLISNLLIQTSKTLSLP-----------FSSTPPQLA-ILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHAL
        M+ KLRSWN   I NLL    +++ LP           FSSTPP  +  L  KI+ IR PKISV+PVLEKWVGDGRAIGK ELQ LV LMK  RRFNHAL
Subjt:  MIRKLRSWNNNLISNLLIQTSKTLSLP-----------FSSTPPQLA-ILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHAL

Query:  EISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQH
        EIS+WMTDRRY +LS SDAA+RLDLI  VHGLEHAE+YFNSIS +L+TSN YGALL  YVRE+S+EKAEAIMQEMRK+G ATTSF YNVLINLYAQ+GQH
Subjt:  EISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQH

Query:  DKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSL
         KIDLLI+EM+ K IPQDIY++RNL AAYVA ADISGMEKILKRIEE+SE +ADW IYSIAA+GYL+AGLETEALSMLKK EEK+ P  NK AF+FLLSL
Subjt:  DKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSL

Query:  YERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSIL
        YERTG K+E+YRVW+TFK   K+  VPYALMITSL KLDDIEGAERIFQEWESKCT YDFRVLNRLLVAYCRKGL DKAES+VN+AV+ RTP+ STWS+L
Subjt:  YERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSIL

Query:  ATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADE
        A GY E+G MSKAVEMLK+A+LVGRQ+WKP Q DILEACL+YLE+QGD ETM+E++RLC+SSGTV KEMYYR LRTSIAGGKPV+SIL QM+MDGF+ADE
Subjt:  ATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADE

Query:  EVDKILGSKTN
        EV KILG+KT+
Subjt:  EVDKILGSKTN

SwissProt top hitse value%identityAlignment
O22714 Pentatricopeptide repeat-containing protein At1g607703.4e-6531.06Show/hide
Query:  KISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYV
        ++ V   L +++   + + K E+   +  +++   +  AL++S+ M + R ++ + SD A+ LDL+     +   ENYF  +    KT   YG+LL CY 
Subjt:  KISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYV

Query:  REKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSI
        +E   EKAE ++ +M+++ I  +S +YN L+ LY + G+ +K+  +I+E+K + +  D Y+      A  A  DISG+E++++ +  D  +  DWT YS 
Subjt:  REKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSI

Query:  AANGYLTAGLETEALSMLKKTEEKVRPNTNK--FAFKFLLSLYERTGHKNEVYRVWNTFK-PLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTV
         A+ Y+ AGL  +A   L++ E K   NT +   A++FL++LY R G   EVYR+W + +  + K + V Y  MI  L KL+D+ GAE +F+EW++ C+ 
Subjt:  AANGYLTAGLETEALSMLKKTEEKVRPNTNK--FAFKFLLSLYERTGHKNEVYRVWNTFK-PLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTV

Query:  YDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRS-TWSILATGYAEYGHMSKAVEMLKKAILVGRQN---WKPKQGDILEACLDYLEKQGDAETMD
        YD R++N L+ AY ++GL+ KA  +  +A        + TW I    Y + G M++A+E + KA+ +G+ +   W P   + + A + Y E++ D    +
Subjt:  YDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRS-TWSILATGYAEYGHMSKAVEMLKKAILVGRQN---WKPKQGDILEACLDYLEKQGDAETMD

Query:  EIVRLCKS-SGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKIL
         ++ + K+ +  +  E++  L+RT  A GK   ++  ++KM+    +E   K+L
Subjt:  EIVRLCKS-SGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKIL

Q3E911 Pentatricopeptide repeat-containing protein At5g274603.4e-6532.52Show/hide
Query:  QKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYF-----NSISIRL
        ++I+    P+ SV  +L++ +  G A+   EL+ +   +  S R++ AL++ +WM +++ +  S  D A+RLDLI   HGL+  E YF     +S+S+R+
Subjt:  QKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYF-----NSISIRL

Query:  KTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIE
          S  Y  LL  YV+ K +++AEA+M+++  +G   T   +N ++ LY   GQ++K+ +++  MK  +IP+++ S      A    + ++ +E + K + 
Subjt:  KTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIE

Query:  EDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLT-KETCVPYALMITSLAKLDDIEGAE
         D  ++  W+     AN Y+ +G + +A  +L+   EK+   +N+  + FL++LY   G+K  V R+W   K +  + +CV Y  +++SL K  D+E AE
Subjt:  EDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLT-KETCVPYALMITSLAKLDDIEGAE

Query:  RIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVER--TPFRSTWSILATGYAEYGHMSKAVEMLKKA-ILVGRQNWKPKQGDILEACLDY
        R+F EWE++C  YD RV N LL AY R G + KAES ++  V+ER  TP   TW IL  G+ +  +M KA++ + +  +L+ R +W+P   +I+ A  +Y
Subjt:  RIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVER--TPFRSTWSILATGYAEYGHMSKAVEMLKKA-ILVGRQNWKPKQGDILEACLDY

Query:  LEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMD
         EK+   E     VR     G     +Y  LLR      +P   I E MK+D
Subjt:  LEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMD

Q84JR3 Pentatricopeptide repeat-containing protein At4g21705, mitochondrial3.0e-7433.69Show/hide
Query:  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTS
        L  KI  +  PK SV P L+ WV  G+ +   EL  +VH ++  +RF HALE+S+WM +      SP++ AV LDLI  V+G   AE YF ++  + K  
Subjt:  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTS

Query:  NVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDS
          YGALL CYVR++++EK+    ++M++MG  T+S  YN ++ LY  IGQH+K+  ++EEMK + +  D YS R    A+ A  D+  +   L+ +E   
Subjt:  NVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDS

Query:  ELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETC-VPYALMITSLAKLDDIEGAERIF
        ++  DW  Y++AA  Y+  G    A+ +LK +E ++     +  +  L++LY R G K EV R+W+  K + K      Y  ++ SL K+D +  AE + 
Subjt:  ELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETC-VPYALMITSLAKLDDIEGAERIF

Query:  QEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQ-AVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKA--ILVGRQNWKPKQGDILEACLDYLEK
         EW+S    YDFRV N ++  Y  K + +KAE+++   A   +     +W ++AT YAE G +  A + +K A  + VG + W+P    ++ + L ++  
Subjt:  QEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQ-AVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKA--ILVGRQNWKPKQGDILEACLDYLEK

Query:  QGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSI-AGGKPVISILEQMKMDGFAADEEVDKILGSKT
        +G  + ++  V   ++   V K+MY+ L++  I  GG+ + ++L++MK D    DEE   IL +++
Subjt:  QGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSI-AGGKPVISILEQMKMDGFAADEEVDKILGSKT

Q8LPS6 Pentatricopeptide repeat-containing protein At1g021501.2e-6533.96Show/hide
Query:  QKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDR-RYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSN
        +KI  +  P++    VL +W   GR + K EL  +V  ++  +R N ALE+  WM +R     LS SDAA++LDLI  V G+  AE +F  +    K   
Subjt:  QKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDR-RYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSN

Query:  VYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSE
        VYG+LL  YVR KS EKAEA++  MR  G A     +NV++ LY  + ++DK+D ++ EMK K I  DIYS     ++  +   +  ME + ++++ D  
Subjt:  VYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSE

Query:  LKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKE-TCVPYALMITSLAKLDDIEGAERIFQ
        +  +WT +S  A  Y+  G   +A   L+K E ++    N+  + +LLSLY   G+K E+YRVW+ +K +      + Y  +++SL ++ DIEGAE++++
Subjt:  LKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKE-TCVPYALMITSLAKLDDIEGAERIFQ

Query:  EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAV-VERTPFRSTWSILATGYAEYGHMSKAVEMLKKAILV-GRQNWKPKQGDILEACLDYLEKQG
        EW    + YD R+ N L+ AY +   L+ AE + +  V +   P  STW ILA G+     +S+A+  L+ A    G  NW+PK   +L       E++ 
Subjt:  EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAV-VERTPFRSTWSILATGYAEYGHMSKAVEMLKKAILV-GRQNWKPKQGDILEACLDYLEKQG

Query:  DAETMDEIVRLCKSSGTVMKEMYYRLL
        D  + + ++ L + SG +  + Y  L+
Subjt:  DAETMDEIVRLCKSSGTVMKEMYYRLL

Q9SKU6 Pentatricopeptide repeat-containing protein At2g20710, mitochondrial1.8e-9039.81Show/hide
Query:  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTS
        L++++     P  S++ VL+ W+  G  +   EL  ++ +++   RF+HAL+IS WM++ R   +S  D A+RLDLI  V GL  AE +F +I +  +  
Subjt:  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTS

Query:  NVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDS
        ++YGALL CY  +K L KAE + QEM+++G       YNV++NLY + G++  ++ L+ EM+ + +  DI+++     AY   +D+ GMEK L R E D 
Subjt:  NVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDS

Query:  ELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQ
         L  DW  Y+  ANGY+ AGL  +AL ML+K+E+ V     K A++ L+S Y   G K EVYR+W+ +K L       Y  +I++L K+DDIE  E+I +
Subjt:  ELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQ

Query:  EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVE-RTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGD
        EWE+  +++D R+ + L+  YC+KG+++KAE VVN  V + R    STW  LA GY   G M KAVE  K+AI V +  W+P Q  +L +C+DYLE Q D
Subjt:  EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVE-RTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGD

Query:  AETMDEIVRLCKSSGTV
         E + +I+RL    G +
Subjt:  AETMDEIVRLCKSSGTV

Arabidopsis top hitse value%identityAlignment
AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.2e-6733.96Show/hide
Query:  QKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDR-RYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSN
        +KI  +  P++    VL +W   GR + K EL  +V  ++  +R N ALE+  WM +R     LS SDAA++LDLI  V G+  AE +F  +    K   
Subjt:  QKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDR-RYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSN

Query:  VYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSE
        VYG+LL  YVR KS EKAEA++  MR  G A     +NV++ LY  + ++DK+D ++ EMK K I  DIYS     ++  +   +  ME + ++++ D  
Subjt:  VYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSE

Query:  LKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKE-TCVPYALMITSLAKLDDIEGAERIFQ
        +  +WT +S  A  Y+  G   +A   L+K E ++    N+  + +LLSLY   G+K E+YRVW+ +K +      + Y  +++SL ++ DIEGAE++++
Subjt:  LKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKE-TCVPYALMITSLAKLDDIEGAERIFQ

Query:  EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAV-VERTPFRSTWSILATGYAEYGHMSKAVEMLKKAILV-GRQNWKPKQGDILEACLDYLEKQG
        EW    + YD R+ N L+ AY +   L+ AE + +  V +   P  STW ILA G+     +S+A+  L+ A    G  NW+PK   +L       E++ 
Subjt:  EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAV-VERTPFRSTWSILATGYAEYGHMSKAVEMLKKAILV-GRQNWKPKQGDILEACLDYLEKQG

Query:  DAETMDEIVRLCKSSGTVMKEMYYRLL
        D  + + ++ L + SG +  + Y  L+
Subjt:  DAETMDEIVRLCKSSGTVMKEMYYRLL

AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-6631.06Show/hide
Query:  KISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYV
        ++ V   L +++   + + K E+   +  +++   +  AL++S+ M + R ++ + SD A+ LDL+     +   ENYF  +    KT   YG+LL CY 
Subjt:  KISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYV

Query:  REKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSI
        +E   EKAE ++ +M+++ I  +S +YN L+ LY + G+ +K+  +I+E+K + +  D Y+      A  A  DISG+E++++ +  D  +  DWT YS 
Subjt:  REKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSI

Query:  AANGYLTAGLETEALSMLKKTEEKVRPNTNK--FAFKFLLSLYERTGHKNEVYRVWNTFK-PLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTV
         A+ Y+ AGL  +A   L++ E K   NT +   A++FL++LY R G   EVYR+W + +  + K + V Y  MI  L KL+D+ GAE +F+EW++ C+ 
Subjt:  AANGYLTAGLETEALSMLKKTEEKVRPNTNK--FAFKFLLSLYERTGHKNEVYRVWNTFK-PLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTV

Query:  YDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRS-TWSILATGYAEYGHMSKAVEMLKKAILVGRQN---WKPKQGDILEACLDYLEKQGDAETMD
        YD R++N L+ AY ++GL+ KA  +  +A        + TW I    Y + G M++A+E + KA+ +G+ +   W P   + + A + Y E++ D    +
Subjt:  YDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRS-TWSILATGYAEYGHMSKAVEMLKKAILVGRQN---WKPKQGDILEACLDYLEKQGDAETMD

Query:  EIVRLCKS-SGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKIL
         ++ + K+ +  +  E++  L+RT  A GK   ++  ++KM+    +E   K+L
Subjt:  EIVRLCKS-SGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKIL

AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-9139.81Show/hide
Query:  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTS
        L++++     P  S++ VL+ W+  G  +   EL  ++ +++   RF+HAL+IS WM++ R   +S  D A+RLDLI  V GL  AE +F +I +  +  
Subjt:  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTS

Query:  NVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDS
        ++YGALL CY  +K L KAE + QEM+++G       YNV++NLY + G++  ++ L+ EM+ + +  DI+++     AY   +D+ GMEK L R E D 
Subjt:  NVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDS

Query:  ELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQ
         L  DW  Y+  ANGY+ AGL  +AL ML+K+E+ V     K A++ L+S Y   G K EVYR+W+ +K L       Y  +I++L K+DDIE  E+I +
Subjt:  ELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQ

Query:  EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVE-RTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGD
        EWE+  +++D R+ + L+  YC+KG+++KAE VVN  V + R    STW  LA GY   G M KAVE  K+AI V +  W+P Q  +L +C+DYLE Q D
Subjt:  EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVE-RTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGD

Query:  AETMDEIVRLCKSSGTV
         E + +I+RL    G +
Subjt:  AETMDEIVRLCKSSGTV

AT2G20710.2 Tetratricopeptide repeat (TPR)-like superfamily protein3.2e-7941.27Show/hide
Query:  MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDL
        M++ R   +S  D A+RLDLI  V GL  AE +F +I +  +  ++YGALL CY  +K L KAE + QEM+++G       YNV++NLY + G++  ++ 
Subjt:  MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDL

Query:  LIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTG
        L+ EM+ + +  DI+++     AY   +D+ GMEK L R E D  L  DW  Y+  ANGY+ AGL  +AL ML+K+E+ V     K A++ L+S Y   G
Subjt:  LIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTG

Query:  HKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVE-RTPFRSTWSILATGY
         K EVYR+W+ +K L       Y  +I++L K+DDIE  E+I +EWE+  +++D R+ + L+  YC+KG+++KAE VVN  V + R    STW  LA GY
Subjt:  HKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVE-RTPFRSTWSILATGY

Query:  AEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV
           G M KAVE  K+AI V +  W+P Q  +L +C+DYLE Q D E + +I+RL    G +
Subjt:  AEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV

AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-7533.69Show/hide
Query:  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTS
        L  KI  +  PK SV P L+ WV  G+ +   EL  +VH ++  +RF HALE+S+WM +      SP++ AV LDLI  V+G   AE YF ++  + K  
Subjt:  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTS

Query:  NVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDS
          YGALL CYVR++++EK+    ++M++MG  T+S  YN ++ LY  IGQH+K+  ++EEMK + +  D YS R    A+ A  D+  +   L+ +E   
Subjt:  NVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDS

Query:  ELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETC-VPYALMITSLAKLDDIEGAERIF
        ++  DW  Y++AA  Y+  G    A+ +LK +E ++     +  +  L++LY R G K EV R+W+  K + K      Y  ++ SL K+D +  AE + 
Subjt:  ELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETC-VPYALMITSLAKLDDIEGAERIF

Query:  QEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQ-AVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKA--ILVGRQNWKPKQGDILEACLDYLEK
         EW+S    YDFRV N ++  Y  K + +KAE+++   A   +     +W ++AT YAE G +  A + +K A  + VG + W+P    ++ + L ++  
Subjt:  QEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQ-AVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKA--ILVGRQNWKPKQGDILEACLDYLEK

Query:  QGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSI-AGGKPVISILEQMKMDGFAADEEVDKILGSKT
        +G  + ++  V   ++   V K+MY+ L++  I  GG+ + ++L++MK D    DEE   IL +++
Subjt:  QGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSI-AGGKPVISILEQMKMDGFAADEEVDKILGSKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAAGGAAGCTCCGAAGCTGGAACAACAACCTCATTTCCAATCTCCTTATTCAAACTTCTAAAACCCTTTCCCTCCCCTTCTCTTCCACTCCGCCCCAATTAGCCAT
TCTCCGCCAAAAAATCATAAACATTCGAGCCCCTAAAATTTCAGTTGTTCCGGTACTGGAAAAGTGGGTTGGCGACGGCAGAGCTATTGGAAAACCGGAACTTCAATATC
TTGTTCACCTCATGAAGGACTCTCGCCGTTTCAATCACGCTTTAGAGATATCTCAGTGGATGACTGATCGAAGATACTTGAGTTTATCGCCGAGCGATGCAGCAGTCAGG
CTGGATTTAATCCATAGTGTTCATGGTCTGGAACACGCAGAGAATTACTTCAACAGCATATCTATTCGGTTAAAAACTTCTAATGTTTATGGTGCTCTTCTCGGTTGTTA
TGTGCGAGAGAAATCACTTGAGAAAGCTGAAGCCATCATGCAAGAAATGAGAAAGATGGGCATTGCTACTACGTCCTTTGCTTACAATGTGCTAATTAACCTCTACGCTC
AGATTGGGCAGCATGATAAGATTGATCTACTGATTGAAGAAATGAAAACGAAGAGAATACCTCAAGACATTTACTCAATTAGAAATCTTTGTGCAGCTTATGTTGCTAAG
GCAGATATTTCTGGTATGGAAAAGATTCTCAAAAGGATCGAGGAGGATTCTGAACTCAAAGCTGATTGGACAATTTATTCAATTGCTGCTAATGGGTATCTTACAGCTGG
GTTGGAAACAGAGGCTCTTTCCATGCTAAAGAAAACGGAGGAGAAAGTTCGGCCTAATACAAATAAATTCGCATTTAAGTTTCTTCTGTCCCTTTATGAACGAACAGGTC
ATAAGAACGAAGTTTACAGGGTTTGGAATACCTTCAAACCATTAACTAAAGAAACATGTGTTCCATATGCTTTAATGATCACATCTCTAGCCAAGCTTGATGATATTGAA
GGGGCTGAAAGAATATTCCAGGAGTGGGAATCAAAGTGTACTGTATACGACTTTCGGGTGTTGAATCGACTTCTGGTTGCTTATTGCAGGAAAGGTCTTTTGGATAAGGC
GGAATCAGTTGTTAACCAAGCAGTGGTTGAAAGAACTCCATTCCGCAGCACGTGGAGCATATTAGCCACGGGATATGCAGAATACGGACACATGAGCAAAGCCGTTGAGA
TGTTGAAGAAAGCTATTTTAGTCGGAAGGCAAAATTGGAAACCAAAGCAGGGTGACATTTTGGAAGCTTGTCTGGATTACTTGGAAAAACAAGGAGATGCAGAAACAATG
GATGAAATAGTACGATTATGCAAAAGCTCAGGTACAGTAATGAAGGAGATGTACTACAGATTGCTGAGAACTTCCATAGCAGGGGGTAAACCAGTTATTAGCATTCTTGA
ACAGATGAAGATGGATGGTTTTGCAGCAGATGAAGAGGTAGACAAAATCCTGGGATCTAAGACTAACTTGTAG
mRNA sequenceShow/hide mRNA sequence
GTTTATAGCCATTTTTGTTCGGAGTTCCGACTGAGTTGCTAAACGAAAGCTCTTTTCATTCTCAATTTGATTTTACATATATATCATTCATCAATTCTCATGGTCTTTTA
TCTTTGAATCAAGAAGATGATAAGGAAGCTCCGAAGCTGGAACAACAACCTCATTTCCAATCTCCTTATTCAAACTTCTAAAACCCTTTCCCTCCCCTTCTCTTCCACTC
CGCCCCAATTAGCCATTCTCCGCCAAAAAATCATAAACATTCGAGCCCCTAAAATTTCAGTTGTTCCGGTACTGGAAAAGTGGGTTGGCGACGGCAGAGCTATTGGAAAA
CCGGAACTTCAATATCTTGTTCACCTCATGAAGGACTCTCGCCGTTTCAATCACGCTTTAGAGATATCTCAGTGGATGACTGATCGAAGATACTTGAGTTTATCGCCGAG
CGATGCAGCAGTCAGGCTGGATTTAATCCATAGTGTTCATGGTCTGGAACACGCAGAGAATTACTTCAACAGCATATCTATTCGGTTAAAAACTTCTAATGTTTATGGTG
CTCTTCTCGGTTGTTATGTGCGAGAGAAATCACTTGAGAAAGCTGAAGCCATCATGCAAGAAATGAGAAAGATGGGCATTGCTACTACGTCCTTTGCTTACAATGTGCTA
ATTAACCTCTACGCTCAGATTGGGCAGCATGATAAGATTGATCTACTGATTGAAGAAATGAAAACGAAGAGAATACCTCAAGACATTTACTCAATTAGAAATCTTTGTGC
AGCTTATGTTGCTAAGGCAGATATTTCTGGTATGGAAAAGATTCTCAAAAGGATCGAGGAGGATTCTGAACTCAAAGCTGATTGGACAATTTATTCAATTGCTGCTAATG
GGTATCTTACAGCTGGGTTGGAAACAGAGGCTCTTTCCATGCTAAAGAAAACGGAGGAGAAAGTTCGGCCTAATACAAATAAATTCGCATTTAAGTTTCTTCTGTCCCTT
TATGAACGAACAGGTCATAAGAACGAAGTTTACAGGGTTTGGAATACCTTCAAACCATTAACTAAAGAAACATGTGTTCCATATGCTTTAATGATCACATCTCTAGCCAA
GCTTGATGATATTGAAGGGGCTGAAAGAATATTCCAGGAGTGGGAATCAAAGTGTACTGTATACGACTTTCGGGTGTTGAATCGACTTCTGGTTGCTTATTGCAGGAAAG
GTCTTTTGGATAAGGCGGAATCAGTTGTTAACCAAGCAGTGGTTGAAAGAACTCCATTCCGCAGCACGTGGAGCATATTAGCCACGGGATATGCAGAATACGGACACATG
AGCAAAGCCGTTGAGATGTTGAAGAAAGCTATTTTAGTCGGAAGGCAAAATTGGAAACCAAAGCAGGGTGACATTTTGGAAGCTTGTCTGGATTACTTGGAAAAACAAGG
AGATGCAGAAACAATGGATGAAATAGTACGATTATGCAAAAGCTCAGGTACAGTAATGAAGGAGATGTACTACAGATTGCTGAGAACTTCCATAGCAGGGGGTAAACCAG
TTATTAGCATTCTTGAACAGATGAAGATGGATGGTTTTGCAGCAGATGAAGAGGTAGACAAAATCCTGGGATCTAAGACTAACTTGTAGTTAGTAAAAAAATATTTAGTT
TTTCTTAAATTTTTTGTTCTATGAAATATAGAATGGCATTTTTTACATACATCCTTTAAGCCTC
Protein sequenceShow/hide protein sequence
MIRKLRSWNNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVR
LDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAK
ADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIE
GAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETM
DEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL