CuGenDBv2

Sed0017958 (gene) of Chayote v1 genome

Gene ID: Sed0017958
Organism: Sechium edule (Chayote v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG12:33820390..33822659
RNA-Seq Expression: Sed0017958
Synteny: Sed0017958
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
                  IPR032867 - DYW domain
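
The Genome location field above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this notation, but not stated on this page), the span could be parsed and measured as follows. The helper names `parse_location` and `span_length` are hypothetical and are not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'sequence:start..end' string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq, start, end = match.groups()
    return seq, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seq, start, end = parse_location("LG12:33820390..33822659")
length = span_length(start, end)
print(seq, start, end, length)
```

Under the inclusive-coordinate assumption, the length is the genomic span of the gene model, not the length of the coding sequence.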


Homology
GenBank top hits | e value | % identity | Alignment
KAG6589865.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 0.0e+00 | 90.97
Query:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG
        MA SAPSL+L   SPSSDPPY+LLQ+HPSL+L+SKC+SIRTL QIHAQIIKTGLHNTQFALSKLIEFSAVSR+ DISYA+SLFNSIE+PNLFIWNSMIRG
Subjt:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG

Query:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA
        LS+SLSP+LALVFF RMIH+GVEPNSYTFPF+LKSCAKLASA EGKQIHA VLK+GFV DV+IHTSLINMY QSGE+N AQLVFDQS FRDAISFTALIA
Subjt:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA

Query:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI
        GY LWG+MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEM KANVPPNESTIVSVLSACAQSNALDLGNSM SWIEDRGL SNLKLVNALI
Subjt:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI

Query:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL
        DMYSKCGDLQTARELFDEM ERDVISWNVMIGGYTH  SYKE LALFR MLASGIEPT+ITFLN+LPSCA LGAID+GKWIHAYINKNFNSASTSL TSL
Subjt:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL

Query:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG
        IDMYAKCGNI+AARQVF+GMN KSLAS NAMICGLAMHG A+EA ELFSKMSSDGIEPNEITFVGVLSACKHAG VDLGR FFSSMIQDYKISPKSQHYG
Subjt:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG

Query:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT
        CMIDLLGRAGLFEEAESLIQNME+KPDGAIWGSLLGACR+HGRVELGE VAERLFELEPDN GAYVLLSNIYAGAGKW+DVARIRT+LND+GMKKVPGCT
Subjt:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT

Query:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL
        TI+VDNVVHEFLVGDKVHPQSE+IYKMLEEVDRQLK FGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITI+KNLRVCRNCH+ATKL
Subjt:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL

Query:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        ISKIFNREIIARDRNRFHHFKDGSCSCND+W
Subjt:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW
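
In the alignment blocks on this page, the unlabeled middle line appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, the identity of a single block could be tallied as below. The function name `block_identity` is hypothetical, and the value covers only the one block shown, so it is not expected to match the whole-alignment % identity reported in the table.

```python
def block_identity(query, midline):
    """Return (identical, columns) for one alignment block.

    Assumes a BLAST-style midline: a letter equal to the query residue
    means identity; '+' or a space means a non-identical column.
    """
    columns = len(query)
    # Pad the midline in case trailing spaces were stripped during extraction.
    midline = midline.ljust(columns)
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identical, columns

# Final block of the first GenBank hit, copied from the alignment above.
query = "ISKIFNREIIARDRNRFHHFKDGSCSCNDFW"
midline = "ISKIFNREIIARDRNRFHHFKDGSCSCND+W"

identical, columns = block_identity(query, midline)
percent = 100.0 * identical / columns
print(f"{identical}/{columns} identical ({percent:.2f}%)")
```

Summing `identical` and `columns` over every block of a hit before dividing would give a whole-alignment figure under the same assumption.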

XP_004150015.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sativus] | 0.0e+00 | 91.01
Query:  MAISAPSLLLS------PSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSM
        MA+S+PSLLLS      PSSDPPY++LQ+HPSL+LLSKCQSIRT  QIHA IIKTGLHNT FALSKLIEFSAVSR GDISYAISLFNSIE+PNLFIWNSM
Subjt:  MAISAPSLLLS------PSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSM

Query:  IRGLSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTA
        IRGLSMSLSP LALVFF RMI+SGVEPNSYTFPF+LKSCAKLASAHEGKQIHA VLK+GFV DV+IHTSLINMY QSGEMNNAQLVFDQS FRDAISFTA
Subjt:  IRGLSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTA

Query:  LIAGYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVN
        LIAGYALWG+MDRAR+LFDEMPV+DVVSWNAMIAGYAQ GRSKEALLLFE+M KANVPPNESTIVSVLSACAQSNALDLGNSM SWIEDRGLCSNLKLVN
Subjt:  LIAGYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVN

Query:  ALIDMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLC
        ALIDMYSKCGDLQTARELFD+MLERDVISWNVMIGGYTH  SYKE LALFREMLASG+EPTEITFL+ILPSCAHLGAIDLGKWIHAYINKNFNS STSL 
Subjt:  ALIDMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLC

Query:  TSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQ
        TSLID+YAKCGNI AARQVFDGM IKSLAS NAMICGLAMHG AD+AFELFSKMSSDGIEPNEITFVG+LSACKHAGLVDLG++FFSSM+QDYKISPKSQ
Subjt:  TSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQ

Query:  HYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVP
        HYGCMIDLLGRAGLFEEAESL+QNME+KPDGAIWGSLLGACRDHGRVELGE VAERLFELEPDN GAYVLLSNIYAGAGKW+DVARIRTRLNDRGMKKVP
Subjt:  HYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVP

Query:  GCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSA
        GCTTI+VDNVVHEFLVGDKVHPQSEDIY+MLEEVD QLK+FGFV DTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPI I+KNLRVCRNCHSA
Subjt:  GCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSA

Query:  TKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        TKLISKIFNREIIARDRNRFHHFKDGSCSCND+W
Subjt:  TKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW

XP_022961045.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita moschata] | 0.0e+00 | 91.52
Query:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG
        MA SAPSL+L   SPSSDPPY+LLQDHPSL+L+SKC+SIRTL QIHAQIIKTGLHNTQFALSKLIEFSAVSR+ DISYA+SLFNSIE+PNLFIWNSMIRG
Subjt:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG

Query:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA
        LS+SLSP+LALVFF RMIH+GVEPNSYTFPF+LKSCAKLASA EGKQIHA VLK+GFV DV+IHTSLINMY QSGE+N AQLVFDQS FRDAISFTALIA
Subjt:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA

Query:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI
        GY LWG+MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEM KANVPPNESTIVSVLSACAQSNALDLGNSM SWIEDRGLCSNLKLVNALI
Subjt:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI

Query:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL
        DMYSKCGDLQTARELFDEM ERDVISWNVMIGGYTH  SYKE LALFREMLASGIEPT+ITFLN+LPSCA LGAID+GKWIHAYINKNFNSASTSL TSL
Subjt:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL

Query:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG
        IDMYAKCGNI+AARQVF+GMN KSLAS NAMICGLAMHG A+EA ELFSKMSSDGIEPNEITFVGVLSACKHAG VDLGR FFSSMIQDYKISPKSQHYG
Subjt:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG

Query:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT
        CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGE VAERLFELEPDN GAYVLLSNIYAGAGKW+DVARIRT+LND+GMKKVPGCT
Subjt:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT

Query:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL
        TI+VDNVVHEFLVGDKVH QSE+IYKMLEEVDRQLK FGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITI+KNLRVCRNCH+ATKL
Subjt:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL

Query:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        ISKIFNREIIARDRNRFHHFKDGSCSCND+W
Subjt:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW

XP_022987625.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita maxima] | 0.0e+00 | 91.24
Query:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG
        MA SAPSL+L   SPSSDPPY+LLQDHPSL+L+SKC+SIRTL QIHAQIIKTGLHNTQFALSKLIEFSAVSR+ DISYA+SLFNSIE+PNLFIWNSMIRG
Subjt:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG

Query:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA
        LS+SLSP+LALVFF+RMIH+GVEPNSYTFPF+LKSCA+LASA EGKQIHA VLK+GFV DV+IHTSLINMY QSGE+N AQLVFDQS FRDAISFTALIA
Subjt:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA

Query:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI
        GY LWG+MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEM KANVPPNESTIVSVLSACAQSNALDLGNSM SWIEDRGL SNLKLVNALI
Subjt:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI

Query:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL
        DMYSKCG LQTARELFDEM ERDVISWNVMIGGYTH  SYKE LALFREMLASGIEPT+ITFLN+LPSCA LGAIDLGKWIHAYINKNFNSASTSL TSL
Subjt:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL

Query:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG
        IDMYAKCGNIEAARQVF+GMNIKSLAS NAMICGLAMHG A+EA ELFSK++SDGIEPNEITFVGVLSACKHAG VDLGR FFSSMIQDYKISPKSQHYG
Subjt:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG

Query:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT
        CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGE VAERLFELEPDN GAYVLLSNIYAGAGKW+DVA IRT+LND+GMKKVPGCT
Subjt:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT

Query:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL
        TI+VDNVVHEFLVGDKVHPQSE+IYKMLEEVDRQLK FGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITI+KNLRVCRNCH+ATKL
Subjt:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL

Query:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        ISKIFNREIIARDRNRFHHFKDGSCSCND+W
Subjt:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW

XP_023515625.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita pepo subsp. pepo] | 0.0e+00 | 91.24
Query:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG
        MA SAPSL+L   SPSSDPPY+LLQDHPSL+L+SKC+SIRTL QIHAQIIKTGLHNTQFALSKLIEFSAVSR+ DISYA+SLFNSIE+PNLFIWNSMIRG
Subjt:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG

Query:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA
        LS+SLSP+LALVFF RMIH+GVEPNSYTFPF+LKSCAKLASA EGKQIHA VLK+GFV DV+IHTSLINMY QSGE+N AQLVFDQS FRDAISFTALIA
Subjt:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA

Query:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI
        GY LWG+MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEM KANVPPNESTIVSVLSACAQSNALDLGNSM SWIEDRGL SNLKLVNALI
Subjt:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI

Query:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL
        DMYSKCGDLQTA ELFDEM ERDVISWNVMIGGYTH  SYKE LALFREMLASGIEPT+ITFLN+LPSCA LGAIDLGKWIHAYINKNFNSASTSL TSL
Subjt:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL

Query:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG
        IDMYAKCGNI+AARQVF+GMNIKSLAS NAMICGLAMHG A+EA ELFSKMSS+GIEPNEITFVGVLSACKHAG VDLGR FFSSMIQDYKISPKSQHYG
Subjt:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG

Query:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT
        CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGE VAERLF+LEPDN GAYVLLSNIYAGAGKW+DVARIRT+LND GMKKVPGCT
Subjt:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT

Query:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL
        TI+VDNVVHEFLVGDKVHPQ+E+IYKMLEEVDRQLK FGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITI+KNLRVCRNCH+ATKL
Subjt:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL

Query:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        ISKIFNREIIARDRNRFHHFKDGSCSCND+W
Subjt:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LU28 DYW_deaminase domain-containing protein | 0.0e+00 | 91.01
Query:  MAISAPSLLLS------PSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSM
        MA+S+PSLLLS      PSSDPPY++LQ+HPSL+LLSKCQSIRT  QIHA IIKTGLHNT FALSKLIEFSAVSR GDISYAISLFNSIE+PNLFIWNSM
Subjt:  MAISAPSLLLS------PSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSM

Query:  IRGLSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTA
        IRGLSMSLSP LALVFF RMI+SGVEPNSYTFPF+LKSCAKLASAHEGKQIHA VLK+GFV DV+IHTSLINMY QSGEMNNAQLVFDQS FRDAISFTA
Subjt:  IRGLSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTA

Query:  LIAGYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVN
        LIAGYALWG+MDRAR+LFDEMPV+DVVSWNAMIAGYAQ GRSKEALLLFE+M KANVPPNESTIVSVLSACAQSNALDLGNSM SWIEDRGLCSNLKLVN
Subjt:  LIAGYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVN

Query:  ALIDMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLC
        ALIDMYSKCGDLQTARELFD+MLERDVISWNVMIGGYTH  SYKE LALFREMLASG+EPTEITFL+ILPSCAHLGAIDLGKWIHAYINKNFNS STSL 
Subjt:  ALIDMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLC

Query:  TSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQ
        TSLID+YAKCGNI AARQVFDGM IKSLAS NAMICGLAMHG AD+AFELFSKMSSDGIEPNEITFVG+LSACKHAGLVDLG++FFSSM+QDYKISPKSQ
Subjt:  TSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQ

Query:  HYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVP
        HYGCMIDLLGRAGLFEEAESL+QNME+KPDGAIWGSLLGACRDHGRVELGE VAERLFELEPDN GAYVLLSNIYAGAGKW+DVARIRTRLNDRGMKKVP
Subjt:  HYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVP

Query:  GCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSA
        GCTTI+VDNVVHEFLVGDKVHPQSEDIY+MLEEVD QLK+FGFV DTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPI I+KNLRVCRNCHSA
Subjt:  GCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSA

Query:  TKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        TKLISKIFNREIIARDRNRFHHFKDGSCSCND+W
Subjt:  TKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW

A0A1S3CMX0 pentatricopeptide repeat-containing protein At1g08070, chloroplastic | 0.0e+00 | 90.01
Query:  MAISAPSLLLS------PSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSM
        MA+S+PSLLLS      PSSDPPY++LQ+HP+L+LLSKCQ+IRT  QIHA IIKTGLHNT FALSKLIEFSAVSR GDISYAISLF+SIEDPNLFIWNSM
Subjt:  MAISAPSLLLS------PSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSM

Query:  IRGLSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTA
        IRGLSMSLSP+LALVFF RMI+SGVEPNSYTFPF+LKSCAKLASA EGKQIHA VLK+GFV DV+IHTSLINMY QSGEMNNAQL+FDQS FRDAISFTA
Subjt:  IRGLSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTA

Query:  LIAGYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVN
        LIAGYALWG+MDRAR+LFDEMPV+DVVSWNAMIAGYAQ GRSKEALLLFE+M K NVPPNESTIVSVLSACAQSNALDLGNSM SWIEDRGL SNLKLVN
Subjt:  LIAGYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVN

Query:  ALIDMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLC
        ALIDMYSKCGDL TARELFD+M ERDVISWNVMIGGYTH  SYKE LALFREMLASG+EPTEITFL+ILPSCAHLGAIDLGKWIHAYINKNFNS STSL 
Subjt:  ALIDMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLC

Query:  TSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQ
        TSLID+YAKCGNI AARQVFDGMNIKSLAS NAMICGLAMHG ADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLG + FSSM+QDYKISPKSQ
Subjt:  TSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQ

Query:  HYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVP
        HYGCMIDLLGRAGLFEEAESLIQNME+KPDGAIWGSLLGACRDHGRVELGE VAERLFELEPDN GAYVLLSNIYAGAGKW+DVARIRTRLNDRGMKKVP
Subjt:  HYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVP

Query:  GCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKP
        GCTTI+VDNVVHEFLVGDKVHPQSEDIYKMLEEVD+QLK+FGFV DTSEVLYDMDEEWKEG LSHHSEKLAIAFGLISTKP
Subjt:  GCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKP

A0A5A7TTJ1 Pentatricopeptide repeat-containing protein | 0.0e+00 | 90.33
Query:  MAISAPSLLLS------PSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSM
        MA+S+PSLLLS      PSSDPPY++LQ+HP+L+LLSKCQ+IRT  QIHA IIKTGLHNT FALSKLIEFSAVSR GDISYAISLF+SIEDPNLFIWNSM
Subjt:  MAISAPSLLLS------PSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSM

Query:  IRGLSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTA
        IRGLSMSLSP+LALVFF RMI+SGVEPNSYTFPF+LKSCAKLASA EGKQIHA VLK+GFV DV+IHTSLINMY QSGEMNNAQL+FDQS FRDAISFTA
Subjt:  IRGLSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTA

Query:  LIAGYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVN
        LIAGYALWG+MDRAR+LFDEMPV+DVVSWNAMIAGYAQ GRSKEALLLFE+M K NVPPNESTIVSVLSACAQSNALDLGNSM SWIEDRGL SNLKLVN
Subjt:  LIAGYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVN

Query:  ALIDMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLC
        ALIDMYSKCGDL TARELFD+M ERDVISWNVMIGGYTH  SYKE LALFREMLASG+EPTEITFL+ILPSCAHLGAIDLGKWIHAYINKNFNS STSL 
Subjt:  ALIDMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLC

Query:  TSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQ
        TSLID+YAKCGNI AARQVFDGMNIKSLAS NAMICGLAMHG ADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLG + FSSM+QDYKISPKSQ
Subjt:  TSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQ

Query:  HYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVP
        HYGCMIDLLGRAGLFEEAESLIQNME+KPDGAIWGSLLGACRDHGRVELGE VAERLFELEPDN GAYVLLSNIYAGAGKW+DVARIRTRLNDRGMKKVP
Subjt:  HYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVP

Query:  GCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSA
        GCTTI+VDNVVHEFLVGDKVHPQSEDIYKMLEEVD+QLK+FGFV DTSEVLYDMDEEWKEG LSHHSEKLAIAFGLISTKPGTPI I+KNLRVCRNCHSA
Subjt:  GCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSA

Query:  TKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        TKLISKIFNREIIARDRNRFHHFKDGSCSCND+W
Subjt:  TKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW

A0A6J1HAU9 pentatricopeptide repeat-containing protein At1g08070, chloroplastic | 0.0e+00 | 91.52
Query:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG
        MA SAPSL+L   SPSSDPPY+LLQDHPSL+L+SKC+SIRTL QIHAQIIKTGLHNTQFALSKLIEFSAVSR+ DISYA+SLFNSIE+PNLFIWNSMIRG
Subjt:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG

Query:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA
        LS+SLSP+LALVFF RMIH+GVEPNSYTFPF+LKSCAKLASA EGKQIHA VLK+GFV DV+IHTSLINMY QSGE+N AQLVFDQS FRDAISFTALIA
Subjt:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA

Query:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI
        GY LWG+MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEM KANVPPNESTIVSVLSACAQSNALDLGNSM SWIEDRGLCSNLKLVNALI
Subjt:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI

Query:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL
        DMYSKCGDLQTARELFDEM ERDVISWNVMIGGYTH  SYKE LALFREMLASGIEPT+ITFLN+LPSCA LGAID+GKWIHAYINKNFNSASTSL TSL
Subjt:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL

Query:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG
        IDMYAKCGNI+AARQVF+GMN KSLAS NAMICGLAMHG A+EA ELFSKMSSDGIEPNEITFVGVLSACKHAG VDLGR FFSSMIQDYKISPKSQHYG
Subjt:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG

Query:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT
        CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGE VAERLFELEPDN GAYVLLSNIYAGAGKW+DVARIRT+LND+GMKKVPGCT
Subjt:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT

Query:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL
        TI+VDNVVHEFLVGDKVH QSE+IYKMLEEVDRQLK FGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITI+KNLRVCRNCH+ATKL
Subjt:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL

Query:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        ISKIFNREIIARDRNRFHHFKDGSCSCND+W
Subjt:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW

A0A6J1JHE6 pentatricopeptide repeat-containing protein At1g08070, chloroplastic | 0.0e+00 | 91.24
Query:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG
        MA SAPSL+L   SPSSDPPY+LLQDHPSL+L+SKC+SIRTL QIHAQIIKTGLHNTQFALSKLIEFSAVSR+ DISYA+SLFNSIE+PNLFIWNSMIRG
Subjt:  MAISAPSLLL---SPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRG

Query:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA
        LS+SLSP+LALVFF+RMIH+GVEPNSYTFPF+LKSCA+LASA EGKQIHA VLK+GFV DV+IHTSLINMY QSGE+N AQLVFDQS FRDAISFTALIA
Subjt:  LSMSLSPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIA

Query:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI
        GY LWG+MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEM KANVPPNESTIVSVLSACAQSNALDLGNSM SWIEDRGL SNLKLVNALI
Subjt:  GYALWGFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALI

Query:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL
        DMYSKCG LQTARELFDEM ERDVISWNVMIGGYTH  SYKE LALFREMLASGIEPT+ITFLN+LPSCA LGAIDLGKWIHAYINKNFNSASTSL TSL
Subjt:  DMYSKCGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSL

Query:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG
        IDMYAKCGNIEAARQVF+GMNIKSLAS NAMICGLAMHG A+EA ELFSK++SDGIEPNEITFVGVLSACKHAG VDLGR FFSSMIQDYKISPKSQHYG
Subjt:  IDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYG

Query:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT
        CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGE VAERLFELEPDN GAYVLLSNIYAGAGKW+DVA IRT+LND+GMKKVPGCT
Subjt:  CMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCT

Query:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL
        TI+VDNVVHEFLVGDKVHPQSE+IYKMLEEVDRQLK FGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITI+KNLRVCRNCH+ATKL
Subjt:  TIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKL

Query:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        ISKIFNREIIARDRNRFHHFKDGSCSCND+W
Subjt:  ISKIFNREIIARDRNRFHHFKDGSCSCNDFW

SwissProt top hits | e value | % identity | Alignment
O23337 Pentatricopeptide repeat-containing protein At4g14820 | 6.4e-158 | 38.54
Query:  PPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSI-EDPNLFIWNSMIRGLSMSLSPILALVFFSRM
        PP      +  L+ LS C+S+  + Q+HA I++T +++     S L   S  S   ++SYA+++F+SI   P   ++N  +R LS S  P   ++F+ R+
Subjt:  PPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSI-EDPNLFIWNSMIRGLSMSLSPILALVFFSRM

Query:  IHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGFMDRARKLFDE
         H G   + ++F  +LK+ +K+++  EG ++H    K+  +CD ++ T  ++MY   G +N A+ VFD+   RD +++  +I  Y  +G +D A KLF+E
Subjt:  IHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGFMDRARKLFDE

Query:  MPVRDVVSWNA----MIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDLQTAR
        M   +V+        +++   +TG  +    ++E + + +V  +   + ++++  A +  +D+       +  R    NL +  A++  YSKCG L  A+
Subjt:  MPVRDVVSWNA----MIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDLQTAR

Query:  ELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAA
         +FD+  ++D++ W  MI  Y  +   +E L +F EM  SGI+P  ++  +++ +CA+LG +D  KW+H+ I+ N   +  S+  +LI+MYAKCG ++A 
Subjt:  ELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAA

Query:  RQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFE
        R VF+ M  +++ S ++MI  L+MHG A +A  LF++M  + +EPNE+TFVGVL  C H+GLV+ G++ F+SM  +Y I+PK +HYGCM+DL GRA L  
Subjt:  RQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFE

Query:  EAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLV
        EA  +I++M +  +  IWGSL+ ACR HG +ELG+  A+R+ ELEPD+ GA VL+SNIYA   +WEDV  IR  + ++ + K  G + ID +   HEFL+
Subjt:  EAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLV

Query:  GDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTP------ITIVKNLRVCRNCHSATKLISKIFNR
        GDK H QS +IY  L+EV  +LK+ G+VPD   VL D++EE K+  +  HSEKLA+ FGL++ +          I IVKNLRVC +CH   KL+SK++ R
Subjt:  GDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTP------ITIVKNLRVCRNCHSATKLISKIFNR

Query:  EIIARDRNRFHHFKDGSCSCNDFW
        EII RDR RFH +K+G CSC D+W
Subjt:  EIIARDRNRFHHFKDGSCSCNDFW

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic | 2.3e-168 | 40.35
Query:  LQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSLSPILALVFFSRMI-HSGVEPNSYT
        + L+ +C S+R L Q H  +I+TG  +  ++ SKL   +A+S F  + YA  +F+ I  PN F WN++IR  +    P+L++  F  M+  S   PN YT
Subjt:  LQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSLSPILALVFFSRMI-HSGVEPNSYT

Query:  FPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGFMDRARKLFDEMPVRDVVSWNA
        FPF++K+ A+++S   G+ +H   +K     DV++  SLI+ Y   G+                               +D A K+F  +  +DVVSWN+
Subjt:  FPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGFMDRARKLFDEMPVRDVVSWNA

Query:  MIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDLQTARELFDEMLERDVISWN
        MI G+ Q G   +AL LF++M   +V  +  T+V VLSACA+   L+ G  + S+IE+  +  NL L NA++DMY+KCG ++ A+ LFD M E+D ++W 
Subjt:  MIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDLQTARELFDEMLERDVISWN

Query:  VMIGGYTHTSSYK-------------------------------EGLALFREM-LASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSL
         M+ GY  +  Y+                               E L +F E+ L   ++  +IT ++ L +CA +GA++LG+WIH+YI K+    +  +
Subjt:  VMIGGYTHTSSYK-------------------------------EGLALFREM-LASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSL

Query:  CTSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKS
         ++LI MY+KCG++E +R+VF+ +  + +   +AMI GLAMHG  +EA ++F KM    ++PN +TF  V  AC H GLVD     F  M  +Y I P+ 
Subjt:  CTSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKS

Query:  QHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKV
        +HY C++D+LGR+G  E+A   I+ M + P  ++WG+LLGAC+ H  + L E    RL ELEP N GA+VLLSNIYA  GKWE+V+ +R  +   G+KK 
Subjt:  QHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKV

Query:  PGCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDM-DEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCH
        PGC++I++D ++HEFL GD  HP SE +Y  L EV  +LK  G+ P+ S+VL  + +EE KE +L+ HSEKLAI +GLIST+    I ++KNLRVC +CH
Subjt:  PGCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDM-DEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCH

Query:  SATKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        S  KLIS++++REII RDR RFHHF++G CSCNDFW
Subjt:  SATKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | 5.4e-282 | 63.6
Query:  SAPSLLLSPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVS-RFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSL
        S P   L  SSDPPY  +++HPSL LL  C+++++L  IHAQ+IK GLHNT +ALSKLIEF  +S  F  + YAIS+F +I++PNL IWN+M RG ++S 
Subjt:  SAPSLLLSPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVS-RFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSL

Query:  SPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALW
         P+ AL  +  MI  G+ PNSYTFPFVLKSCAK  +  EG+QIH  VLK+G   D+Y+HTSLI+MYVQ+G + +A  VFD+S  RD +S+TALI GYA  
Subjt:  SPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALW

Query:  GFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSK
        G+++ A+KLFDE+PV+DVVSWNAMI+GYA+TG  KEAL LF++M K NV P+EST+V+V+SACAQS +++LG  +H WI+D G  SNLK+VNALID+YSK
Subjt:  GFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSK

Query:  CGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSA--STSLCTSLIDM
        CG+L+TA  LF+ +  +DVISWN +IGGYTH + YKE L LF+EML SG  P ++T L+ILP+CAHLGAID+G+WIH YI+K       ++SL TSLIDM
Subjt:  CGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSA--STSLCTSLIDM

Query:  YAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMI
        YAKCG+IEAA QVF+ +  KSL+S NAMI G AMHG AD +F+LFS+M   GI+P++ITFVG+LSAC H+G++DLGR  F +M QDYK++PK +HYGCMI
Subjt:  YAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMI

Query:  DLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTID
        DLLG +GLF+EAE +I  MEM+PDG IW SLL AC+ HG VELGES AE L ++EP+N G+YVLLSNIYA AG+W +VA+ R  LND+GMKKVPGC++I+
Subjt:  DLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTID

Query:  VDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISK
        +D+VVHEF++GDK HP++ +IY MLEE++  L+  GFVPDTSEVL +M+EEWKEGAL HHSEKLAIAFGLISTKPGT +TIVKNLRVCRNCH ATKLISK
Subjt:  VDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISK

Query:  IFNREIIARDRNRFHHFKDGSCSCNDFW
        I+ REIIARDR RFHHF+DG CSCND+W
Subjt:  IFNREIIARDRNRFHHFKDGSCSCNDFW

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690 | e value: 2.6e-159 | % identity: 37.06
Query:  LSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSA-VSRFGDISYAISLF-NSIEDPNLFIWNSMIRGLSMSLSPILALVFFSRMIHSGVEPNSYTFP
        L  C++I  L   H  + K GL N    ++KL+  S  +     +S+A  +F NS      F++NS+IRG + S     A++ F RM++SG+ P+ YTFP
Subjt:  LSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSA-VSRFGDISYAISLF-NSIEDPNLFIWNSMIRGLSMSLSPILALVFFSRMIHSGVEPNSYTFP

Query:  FVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGF-----------------------
        F L +CAK  +   G QIH  ++K+G+  D+++  SL++ Y + GE+++A+ VFD+   R+ +S+T++I GYA   F                       
Subjt:  FVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGF-----------------------

Query:  ------------------------------------------------MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNES
                                                        +D A++LFDE    ++   NAM + Y + G ++EAL +F  M  + V P+  
Subjt:  ------------------------------------------------MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNES

Query:  TIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKC-------------------------------GDLQTARELFDEMLERDVISWN
        +++S +S+C+Q   +  G S H ++   G  S   + NALIDMY KC                               G++  A E F+ M E++++SWN
Subjt:  TIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKC-------------------------------GDLQTARELFDEMLERDVISWN

Query:  VMIGGYTHTSSYKEGLALFREMLA-SGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAARQVFDGMNIKSLAS
         +I G    S ++E + +F  M +  G+    +T ++I  +C HLGA+DL KWI+ YI KN       L T+L+DM+++CG+ E+A  +F+ +  + +++
Subjt:  VMIGGYTHTSSYKEGLALFREMLA-SGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAARQVFDGMNIKSLAS

Query:  RNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPD
          A I  +AM G A+ A ELF  M   G++P+ + FVG L+AC H GLV  G+  F SM++ + +SP+  HYGCM+DLLGRAGL EEA  LI++M M+P+
Subjt:  RNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPD

Query:  GAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLVGDKVHPQSEDIYKM
          IW SLL ACR  G VE+    AE++  L P+  G+YVLLSN+YA AG+W D+A++R  + ++G++K PG ++I +    HEF  GD+ HP+  +I  M
Subjt:  GAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLVGDKVHPQSEDIYKM

Query:  LEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDGSCSC
        L+EV ++    G VPD S VL D+DE+ K   LS HSEKLA+A+GLIS+  GT I IVKNLRVC +CHS  K  SK++NREII RD NRFH+ + G CSC
Subjt:  LEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDGSCSC

Query:  NDFW
         DFW
Subjt:  NDFW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g23330 | e value: 1.1e-157 | % identity: 39.2
Query:  SSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKT-GLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSLSPILALVFF
        SS    K L  +P+ ++ SK Q+     Q+HAQ I+T  L +T    S  I  S  +    +  A+ LF +++ P +  W S+IR  +       AL  F
Subjt:  SSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKT-GLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSLSPILALVFF

Query:  SRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQ---SGEMNNAQLVFDQSKFRDAISFTALI-AGYALWGF-MD
          M  SG  P+   FP VLKSC  +     G+ +H F++++G  CD+Y   +L+NMY +    G   +   VFD+   R + S    + A   +  F +D
Subjt:  SRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQ---SGEMNNAQLVFDQSKFRDAISFTALI-AGYALWGF-MD

Query:  RARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDL
          R++F+ MP +DVVS+N +IAGYAQ+G  ++AL +  EM   ++ P+  T+ SVL   ++   +  G  +H ++  +G+ S++ + ++L+DMY+K   +
Subjt:  RARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDL

Query:  QTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGN
        + +  +F  +  RD ISWN ++ GY     Y E L LFR+M+ + ++P  + F +++P+CAHL  + LGK +H Y+ +    ++  + ++L+DMY+KCGN
Subjt:  QTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGN

Query:  IEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRA
        I+AAR++FD MN+    S  A+I G A+HG   EA  LF +M   G++PN++ FV VL+AC H GLVD    +F+SM + Y ++ + +HY  + DLLGRA
Subjt:  IEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRA

Query:  GLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVH
        G  EEA + I  M ++P G++W +LL +C  H  +EL E VAE++F ++ +N GAYVL+ N+YA  G+W+++A++R R+  +G++K P C+ I++ N  H
Subjt:  GLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVH

Query:  EFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISKIFNREI
         F+ GD+ HP  + I + L+ V  Q++  G+V DTS VL+D+DEE K   L  HSE+LA+AFG+I+T+PGT I + KN+R+C +CH A K ISKI  REI
Subjt:  EFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISKIFNREI

Query:  IARDRNRFHHFKDGSCSCNDFW
        I RD +RFHHF  G+CSC D+W
Subjt:  IARDRNRFHHFKDGSCSCNDFW

Arabidopsis top hits | e value | % identity | Alignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 3.8e-283 | % identity: 63.6
Query:  SAPSLLLSPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVS-RFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSL
        S P   L  SSDPPY  +++HPSL LL  C+++++L  IHAQ+IK GLHNT +ALSKLIEF  +S  F  + YAIS+F +I++PNL IWN+M RG ++S 
Subjt:  SAPSLLLSPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVS-RFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSL

Query:  SPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALW
         P+ AL  +  MI  G+ PNSYTFPFVLKSCAK  +  EG+QIH  VLK+G   D+Y+HTSLI+MYVQ+G + +A  VFD+S  RD +S+TALI GYA  
Subjt:  SPILALVFFSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALW

Query:  GFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSK
        G+++ A+KLFDE+PV+DVVSWNAMI+GYA+TG  KEAL LF++M K NV P+EST+V+V+SACAQS +++LG  +H WI+D G  SNLK+VNALID+YSK
Subjt:  GFMDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSK

Query:  CGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSA--STSLCTSLIDM
        CG+L+TA  LF+ +  +DVISWN +IGGYTH + YKE L LF+EML SG  P ++T L+ILP+CAHLGAID+G+WIH YI+K       ++SL TSLIDM
Subjt:  CGDLQTARELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSA--STSLCTSLIDM

Query:  YAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMI
        YAKCG+IEAA QVF+ +  KSL+S NAMI G AMHG AD +F+LFS+M   GI+P++ITFVG+LSAC H+G++DLGR  F +M QDYK++PK +HYGCMI
Subjt:  YAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMI

Query:  DLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTID
        DLLG +GLF+EAE +I  MEM+PDG IW SLL AC+ HG VELGES AE L ++EP+N G+YVLLSNIYA AG+W +VA+ R  LND+GMKKVPGC++I+
Subjt:  DLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTID

Query:  VDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISK
        +D+VVHEF++GDK HP++ +IY MLEE++  L+  GFVPDTSEVL +M+EEWKEGAL HHSEKLAIAFGLISTKPGT +TIVKNLRVCRNCH ATKLISK
Subjt:  VDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISK

Query:  IFNREIIARDRNRFHHFKDGSCSCNDFW
        I+ REIIARDR RFHHF+DG CSCND+W
Subjt:  IFNREIIARDRNRFHHFKDGSCSCNDFW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 1.7e-169 | % identity: 40.35
Query:  LQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSLSPILALVFFSRMI-HSGVEPNSYT
        + L+ +C S+R L Q H  +I+TG  +  ++ SKL   +A+S F  + YA  +F+ I  PN F WN++IR  +    P+L++  F  M+  S   PN YT
Subjt:  LQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSLSPILALVFFSRMI-HSGVEPNSYT

Query:  FPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGFMDRARKLFDEMPVRDVVSWNA
        FPF++K+ A+++S   G+ +H   +K     DV++  SLI+ Y   G+                               +D A K+F  +  +DVVSWN+
Subjt:  FPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGFMDRARKLFDEMPVRDVVSWNA

Query:  MIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDLQTARELFDEMLERDVISWN
        MI G+ Q G   +AL LF++M   +V  +  T+V VLSACA+   L+ G  + S+IE+  +  NL L NA++DMY+KCG ++ A+ LFD M E+D ++W 
Subjt:  MIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDLQTARELFDEMLERDVISWN

Query:  VMIGGYTHTSSYK-------------------------------EGLALFREM-LASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSL
         M+ GY  +  Y+                               E L +F E+ L   ++  +IT ++ L +CA +GA++LG+WIH+YI K+    +  +
Subjt:  VMIGGYTHTSSYK-------------------------------EGLALFREM-LASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSL

Query:  CTSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKS
         ++LI MY+KCG++E +R+VF+ +  + +   +AMI GLAMHG  +EA ++F KM    ++PN +TF  V  AC H GLVD     F  M  +Y I P+ 
Subjt:  CTSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKS

Query:  QHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKV
        +HY C++D+LGR+G  E+A   I+ M + P  ++WG+LLGAC+ H  + L E    RL ELEP N GA+VLLSNIYA  GKWE+V+ +R  +   G+KK 
Subjt:  QHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKV

Query:  PGCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDM-DEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCH
        PGC++I++D ++HEFL GD  HP SE +Y  L EV  +LK  G+ P+ S+VL  + +EE KE +L+ HSEKLAI +GLIST+    I ++KNLRVC +CH
Subjt:  PGCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDM-DEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCH

Query:  SATKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW
        S  KLIS++++REII RDR RFHHF++G CSCNDFW
Subjt:  SATKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885) | e value: 3.5e-159 | % identity: 36.99
Query:  LSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSA-VSRFGDISYAISLF-NSIEDPNLFIWNSMIRGLSMSLSPILALVFFSRMIHSGVEPNSYTFP
        L  C++I  L   H  + K GL N    ++KL+  S  +     +S+A  +F NS      F++NS+IRG + S     A++ F RM++SG+ P+ YTFP
Subjt:  LSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSA-VSRFGDISYAISLF-NSIEDPNLFIWNSMIRGLSMSLSPILALVFFSRMIHSGVEPNSYTFP

Query:  FVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGF-----------------------
        F L +CAK  +   G QIH  ++K+G+  D+++  SL++ Y + GE+++A+ VFD+   R+ +S+T++I GYA   F                       
Subjt:  FVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGF-----------------------

Query:  ------------------------------------------------MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNES
                                                        +D A++LFDE    ++   NAM + Y + G ++EAL +F  M  + V P+  
Subjt:  ------------------------------------------------MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNES

Query:  TIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKC-------------------------------GDLQTARELFDEMLERDVISWN
        +++S +S+C+Q   +  G S H ++   G  S   + NALIDMY KC                               G++  A E F+ M E++++SWN
Subjt:  TIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKC-------------------------------GDLQTARELFDEMLERDVISWN

Query:  VMIGGYTHTSSYKEGLALFREMLA-SGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAARQVFDGMNIKSLAS
         +I G    S ++E + +F  M +  G+    +T ++I  +C HLGA+DL KWI+ YI KN       L T+L+DM+++CG+ E+A  +F+ +  + +++
Subjt:  VMIGGYTHTSSYKEGLALFREMLA-SGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAARQVFDGMNIKSLAS

Query:  RNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPD
          A I  +AM G A+ A ELF  M   G++P+ + FVG L+AC H GLV  G+  F SM++ + +SP+  HYGCM+DLLGRAGL EEA  LI++M M+P+
Subjt:  RNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPD

Query:  GAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLVGDKVHPQSEDIYKM
          IW SLL ACR  G VE+    AE++  L P+  G+YVLLSN+YA AG+W D+A++R  + ++G++K PG ++I +    HEF  GD+ HP+  +I  M
Subjt:  GAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLVGDKVHPQSEDIYKM

Query:  LEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDGSCSC
        L+EV ++    G VPD S VL D+DE+ K   LS HSEKLA+A+GLIS+  GT I IVKNLRVC +CHS  K  SK++NREII RD NRFH+ + G CSC
Subjt:  LEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDGSCSC

Query:  NDF
         DF
Subjt:  NDF

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification | e value: 1.8e-160 | % identity: 37.06
Query:  LSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSA-VSRFGDISYAISLF-NSIEDPNLFIWNSMIRGLSMSLSPILALVFFSRMIHSGVEPNSYTFP
        L  C++I  L   H  + K GL N    ++KL+  S  +     +S+A  +F NS      F++NS+IRG + S     A++ F RM++SG+ P+ YTFP
Subjt:  LSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSA-VSRFGDISYAISLF-NSIEDPNLFIWNSMIRGLSMSLSPILALVFFSRMIHSGVEPNSYTFP

Query:  FVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGF-----------------------
        F L +CAK  +   G QIH  ++K+G+  D+++  SL++ Y + GE+++A+ VFD+   R+ +S+T++I GYA   F                       
Subjt:  FVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGF-----------------------

Query:  ------------------------------------------------MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNES
                                                        +D A++LFDE    ++   NAM + Y + G ++EAL +F  M  + V P+  
Subjt:  ------------------------------------------------MDRARKLFDEMPVRDVVSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNES

Query:  TIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKC-------------------------------GDLQTARELFDEMLERDVISWN
        +++S +S+C+Q   +  G S H ++   G  S   + NALIDMY KC                               G++  A E F+ M E++++SWN
Subjt:  TIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKC-------------------------------GDLQTARELFDEMLERDVISWN

Query:  VMIGGYTHTSSYKEGLALFREMLA-SGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAARQVFDGMNIKSLAS
         +I G    S ++E + +F  M +  G+    +T ++I  +C HLGA+DL KWI+ YI KN       L T+L+DM+++CG+ E+A  +F+ +  + +++
Subjt:  VMIGGYTHTSSYKEGLALFREMLA-SGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAARQVFDGMNIKSLAS

Query:  RNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPD
          A I  +AM G A+ A ELF  M   G++P+ + FVG L+AC H GLV  G+  F SM++ + +SP+  HYGCM+DLLGRAGL EEA  LI++M M+P+
Subjt:  RNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPD

Query:  GAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLVGDKVHPQSEDIYKM
          IW SLL ACR  G VE+    AE++  L P+  G+YVLLSN+YA AG+W D+A++R  + ++G++K PG ++I +    HEF  GD+ HP+  +I  M
Subjt:  GAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLVGDKVHPQSEDIYKM

Query:  LEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDGSCSC
        L+EV ++    G VPD S VL D+DE+ K   LS HSEKLA+A+GLIS+  GT I IVKNLRVC +CHS  K  SK++NREII RD NRFH+ + G CSC
Subjt:  LEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDGSCSC

Query:  NDFW
         DFW
Subjt:  NDFW

AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 4.6e-159 | % identity: 38.54
Query:  PPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSI-EDPNLFIWNSMIRGLSMSLSPILALVFFSRM
        PP      +  L+ LS C+S+  + Q+HA I++T +++     S L   S  S   ++SYA+++F+SI   P   ++N  +R LS S  P   ++F+ R+
Subjt:  PPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSI-EDPNLFIWNSMIRGLSMSLSPILALVFFSRM

Query:  IHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGFMDRARKLFDE
         H G   + ++F  +LK+ +K+++  EG ++H    K+  +CD ++ T  ++MY   G +N A+ VFD+   RD +++  +I  Y  +G +D A KLF+E
Subjt:  IHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGFMDRARKLFDE

Query:  MPVRDVVSWNA----MIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDLQTAR
        M   +V+        +++   +TG  +    ++E + + +V  +   + ++++  A +  +D+       +  R    NL +  A++  YSKCG L  A+
Subjt:  MPVRDVVSWNA----MIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDLQTAR

Query:  ELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAA
         +FD+  ++D++ W  MI  Y  +   +E L +F EM  SGI+P  ++  +++ +CA+LG +D  KW+H+ I+ N   +  S+  +LI+MYAKCG ++A 
Subjt:  ELFDEMLERDVISWNVMIGGYTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAA

Query:  RQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFE
        R VF+ M  +++ S ++MI  L+MHG A +A  LF++M  + +EPNE+TFVGVL  C H+GLV+ G++ F+SM  +Y I+PK +HYGCM+DL GRA L  
Subjt:  RQVFDGMNIKSLASRNAMICGLAMHGLADEAFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFE

Query:  EAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLV
        EA  +I++M +  +  IWGSL+ ACR HG +ELG+  A+R+ ELEPD+ GA VL+SNIYA   +WEDV  IR  + ++ + K  G + ID +   HEFL+
Subjt:  EAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAERLFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLV

Query:  GDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTP------ITIVKNLRVCRNCHSATKLISKIFNR
        GDK H QS +IY  L+EV  +LK+ G+VPD   VL D++EE K+  +  HSEKLA+ FGL++ +          I IVKNLRVC +CH   KL+SK++ R
Subjt:  GDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHHSEKLAIAFGLISTKPGTP------ITIVKNLRVCRNCHSATKLISKIFNR

Query:  EIIARDRNRFHHFKDGSCSCNDFW
        EII RDR RFH +K+G CSC D+W
Subjt:  EIIARDRNRFHHFKDGSCSCNDFW
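
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch of how identities and positives could be counted from one block. The function name `midline_counts` is hypothetical and not part of any database tool, and the per-block figure it gives is not the whole-alignment % identity reported in the hit line.

```python
def midline_counts(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block."""
    # Trailing spaces of the middle line are easily lost when copying; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("middle line is longer than the query line")
    # Letters in the middle line are identical positions.
    identities = sum(ch.isalpha() for ch in midline)
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the AT4G14820.1 alignment, copied from the record above.
query = "EIIARDRNRFHHFKDGSCSCNDFW"
midline = "EII RDR RFH +K+G CSC D+W"

identities, positives, length = midline_counts(query, midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
# prints "16/24 identical, 19/24 positive"
```

Summing these counts over every block of a hit, and dividing by the summed block lengths, would give a figure comparable to the % identity column, assuming the alignment text was copied without losing internal spaces.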


Sequences
CDS sequence
ATGGCGATTTCTGCGCCATCTCTGCTACTTTCACCTTCCTCCGACCCCCCATACAAGCTCCTTCAAGACCACCCATCTCTCCAACTTCTCTCCAAATGCCAATCCATTCG
AACTCTGACTCAAATCCACGCCCAGATCATCAAAACCGGCCTCCACAACACCCAGTTCGCCCTCAGCAAGCTCATCGAGTTCTCTGCCGTCTCGCGCTTCGGCGACATCT
CCTACGCCATCTCGCTCTTTAATTCGATCGAAGACCCCAATTTGTTCATTTGGAATTCCATGATTCGAGGGCTTTCGATGAGTCTTTCGCCGATTCTCGCCTTGGTTTTC
TTTTCCAGAATGATTCATTCTGGGGTTGAGCCGAATTCTTATACTTTTCCTTTTGTTTTGAAGTCTTGTGCGAAACTTGCCTCTGCCCATGAAGGGAAACAGATTCATGC
GTTTGTTTTGAAGGTTGGGTTTGTGTGTGATGTTTATATTCATACTTCGCTTATCAATATGTATGTGCAGAGTGGTGAAATGAACAATGCCCAGTTGGTGTTTGATCAAA
GTAAGTTTAGGGATGCGATTTCTTTCACTGCATTGATTGCGGGTTATGCGTTATGGGGTTTTATGGATCGTGCACGGAAACTGTTCGATGAAATGCCTGTGAGAGATGTG
GTGTCTTGGAATGCTATGATTGCTGGCTATGCGCAAACTGGTCGATCCAAAGAGGCGTTGTTGTTGTTTGAAGAAATGAGCAAAGCAAATGTCCCTCCTAATGAAAGTAC
AATTGTCTCTGTTCTCTCTGCTTGTGCTCAGTCAAATGCTTTGGATTTAGGAAATTCGATGCACTCATGGATTGAAGATCGTGGGCTTTGTTCGAATCTTAAGCTTGTTA
ATGCACTTATCGATATGTACTCGAAGTGCGGTGATCTTCAAACGGCCCGTGAATTGTTTGATGAAATGCTTGAACGAGATGTGATCTCATGGAATGTTATGATTGGAGGT
TACACTCATACGAGCAGCTACAAAGAAGGCTTGGCACTCTTTCGTGAAATGCTAGCCTCAGGTATTGAGCCGACCGAAATAACCTTCCTTAACATTCTTCCATCGTGTGC
TCATTTAGGTGCCATTGACCTTGGTAAGTGGATACATGCATATATAAACAAAAACTTTAACTCGGCCAGCACCTCTCTTTGTACGAGTTTGATAGATATGTATGCTAAAT
GTGGTAATATAGAGGCGGCACGACAGGTCTTTGATGGTATGAATATCAAAAGCTTGGCGTCTCGGAATGCTATGATATGCGGGTTAGCAATGCATGGACTGGCAGATGAG
GCTTTTGAACTTTTCTCGAAAATGTCTAGTGATGGAATTGAACCAAATGAGATAACTTTTGTTGGTGTTTTATCTGCTTGCAAACATGCTGGTCTGGTAGATCTCGGGCG
CCGATTTTTTAGCTCTATGATTCAGGATTATAAAATCTCTCCGAAATCCCAACATTATGGATGCATGATAGATCTTCTTGGGCGAGCAGGGTTGTTTGAGGAAGCAGAGT
CCTTAATACAAAATATGGAAATGAAGCCAGATGGAGCTATTTGGGGTTCCCTTCTCGGCGCATGTAGAGACCATGGACGGGTCGAGTTGGGAGAATCAGTTGCGGAGCGT
CTTTTCGAATTGGAGCCAGATAATGCTGGAGCCTACGTGCTTTTATCCAACATATATGCAGGAGCTGGTAAATGGGAGGATGTTGCAAGAATAAGGACAAGACTGAATGA
TAGGGGAATGAAGAAAGTTCCTGGCTGTACAACCATTGATGTCGATAACGTCGTTCACGAGTTTCTAGTAGGGGACAAAGTTCATCCACAAAGTGAAGATATCTACAAGA
TGTTGGAAGAAGTAGACCGACAATTAAAGATGTTCGGATTCGTGCCAGATACATCCGAGGTACTCTACGACATGGACGAAGAATGGAAAGAAGGCGCTCTAAGCCACCAT
AGTGAGAAACTAGCCATTGCTTTTGGATTGATAAGTACAAAACCAGGAACACCGATTACGATCGTTAAAAATCTTCGCGTATGTCGTAATTGTCATTCTGCTACGAAATT
GATATCGAAGATATTCAATAGAGAGATTATTGCTAGAGATAGAAACCGTTTCCATCATTTCAAAGATGGTTCTTGCTCATGTAACGATTTTTGGTGA
mRNA sequence
GGTCCATACAACTAAATTTCAAATAATAATAACCCTTATCCCTAACACACTGCCACCACCAGAAACCTTCATAGCGCCCAAAAATGGCGATTTCTGCGCCATCTCTGCTA
CTTTCACCTTCCTCCGACCCCCCATACAAGCTCCTTCAAGACCACCCATCTCTCCAACTTCTCTCCAAATGCCAATCCATTCGAACTCTGACTCAAATCCACGCCCAGAT
CATCAAAACCGGCCTCCACAACACCCAGTTCGCCCTCAGCAAGCTCATCGAGTTCTCTGCCGTCTCGCGCTTCGGCGACATCTCCTACGCCATCTCGCTCTTTAATTCGA
TCGAAGACCCCAATTTGTTCATTTGGAATTCCATGATTCGAGGGCTTTCGATGAGTCTTTCGCCGATTCTCGCCTTGGTTTTCTTTTCCAGAATGATTCATTCTGGGGTT
GAGCCGAATTCTTATACTTTTCCTTTTGTTTTGAAGTCTTGTGCGAAACTTGCCTCTGCCCATGAAGGGAAACAGATTCATGCGTTTGTTTTGAAGGTTGGGTTTGTGTG
TGATGTTTATATTCATACTTCGCTTATCAATATGTATGTGCAGAGTGGTGAAATGAACAATGCCCAGTTGGTGTTTGATCAAAGTAAGTTTAGGGATGCGATTTCTTTCA
CTGCATTGATTGCGGGTTATGCGTTATGGGGTTTTATGGATCGTGCACGGAAACTGTTCGATGAAATGCCTGTGAGAGATGTGGTGTCTTGGAATGCTATGATTGCTGGC
TATGCGCAAACTGGTCGATCCAAAGAGGCGTTGTTGTTGTTTGAAGAAATGAGCAAAGCAAATGTCCCTCCTAATGAAAGTACAATTGTCTCTGTTCTCTCTGCTTGTGC
TCAGTCAAATGCTTTGGATTTAGGAAATTCGATGCACTCATGGATTGAAGATCGTGGGCTTTGTTCGAATCTTAAGCTTGTTAATGCACTTATCGATATGTACTCGAAGT
GCGGTGATCTTCAAACGGCCCGTGAATTGTTTGATGAAATGCTTGAACGAGATGTGATCTCATGGAATGTTATGATTGGAGGTTACACTCATACGAGCAGCTACAAAGAA
GGCTTGGCACTCTTTCGTGAAATGCTAGCCTCAGGTATTGAGCCGACCGAAATAACCTTCCTTAACATTCTTCCATCGTGTGCTCATTTAGGTGCCATTGACCTTGGTAA
GTGGATACATGCATATATAAACAAAAACTTTAACTCGGCCAGCACCTCTCTTTGTACGAGTTTGATAGATATGTATGCTAAATGTGGTAATATAGAGGCGGCACGACAGG
TCTTTGATGGTATGAATATCAAAAGCTTGGCGTCTCGGAATGCTATGATATGCGGGTTAGCAATGCATGGACTGGCAGATGAGGCTTTTGAACTTTTCTCGAAAATGTCT
AGTGATGGAATTGAACCAAATGAGATAACTTTTGTTGGTGTTTTATCTGCTTGCAAACATGCTGGTCTGGTAGATCTCGGGCGCCGATTTTTTAGCTCTATGATTCAGGA
TTATAAAATCTCTCCGAAATCCCAACATTATGGATGCATGATAGATCTTCTTGGGCGAGCAGGGTTGTTTGAGGAAGCAGAGTCCTTAATACAAAATATGGAAATGAAGC
CAGATGGAGCTATTTGGGGTTCCCTTCTCGGCGCATGTAGAGACCATGGACGGGTCGAGTTGGGAGAATCAGTTGCGGAGCGTCTTTTCGAATTGGAGCCAGATAATGCT
GGAGCCTACGTGCTTTTATCCAACATATATGCAGGAGCTGGTAAATGGGAGGATGTTGCAAGAATAAGGACAAGACTGAATGATAGGGGAATGAAGAAAGTTCCTGGCTG
TACAACCATTGATGTCGATAACGTCGTTCACGAGTTTCTAGTAGGGGACAAAGTTCATCCACAAAGTGAAGATATCTACAAGATGTTGGAAGAAGTAGACCGACAATTAA
AGATGTTCGGATTCGTGCCAGATACATCCGAGGTACTCTACGACATGGACGAAGAATGGAAAGAAGGCGCTCTAAGCCACCATAGTGAGAAACTAGCCATTGCTTTTGGA
TTGATAAGTACAAAACCAGGAACACCGATTACGATCGTTAAAAATCTTCGCGTATGTCGTAATTGTCATTCTGCTACGAAATTGATATCGAAGATATTCAATAGAGAGAT
TATTGCTAGAGATAGAAACCGTTTCCATCATTTCAAAGATGGTTCTTGCTCATGTAACGATTTTTGGTGA
Protein sequence
MAISAPSLLLSPSSDPPYKLLQDHPSLQLLSKCQSIRTLTQIHAQIIKTGLHNTQFALSKLIEFSAVSRFGDISYAISLFNSIEDPNLFIWNSMIRGLSMSLSPILALVF
FSRMIHSGVEPNSYTFPFVLKSCAKLASAHEGKQIHAFVLKVGFVCDVYIHTSLINMYVQSGEMNNAQLVFDQSKFRDAISFTALIAGYALWGFMDRARKLFDEMPVRDV
VSWNAMIAGYAQTGRSKEALLLFEEMSKANVPPNESTIVSVLSACAQSNALDLGNSMHSWIEDRGLCSNLKLVNALIDMYSKCGDLQTARELFDEMLERDVISWNVMIGG
YTHTSSYKEGLALFREMLASGIEPTEITFLNILPSCAHLGAIDLGKWIHAYINKNFNSASTSLCTSLIDMYAKCGNIEAARQVFDGMNIKSLASRNAMICGLAMHGLADE
AFELFSKMSSDGIEPNEITFVGVLSACKHAGLVDLGRRFFSSMIQDYKISPKSQHYGCMIDLLGRAGLFEEAESLIQNMEMKPDGAIWGSLLGACRDHGRVELGESVAER
LFELEPDNAGAYVLLSNIYAGAGKWEDVARIRTRLNDRGMKKVPGCTTIDVDNVVHEFLVGDKVHPQSEDIYKMLEEVDRQLKMFGFVPDTSEVLYDMDEEWKEGALSHH
SEKLAIAFGLISTKPGTPITIVKNLRVCRNCHSATKLISKIFNREIIARDRNRFHHFKDGSCSCNDFW
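
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below is a minimal, self-contained Python illustration that assumes the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here, and only the first and last codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (assumed), listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon."""
    # Remove line breaks and other whitespace left over from the page layout.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten and last six codons of the CDS sequence in this record.
start = translate("ATGGCGATTTCTGCGCCATCTCTGCTACTT")
end = translate("TGTAACGATTTTTGGTGA")
print(start, end)  # prints "MAISAPSLLL CNDFW*"
```

Both fragments agree with the start (`MAISAPSLLL`) and end (`CNDFW`) of the protein sequence listed above; passing the full CDS with its line breaks to `translate` would extend the same check to the whole gene.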