CuGenDBv2

CSPI01G01000 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G01000
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr1:592524..594881
RNA-Seq Expression: CSPI01G01000
Synteny: CSPI01G01000
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0057818.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]; e value: 0.0e+00; % identity: 94.99
Query:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
        MASIVGCLP TSLTSIT   QFPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
Subjt:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY
        FKRSPDNALLLFKKMHE SVQHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPER IVAWNSMLSGY
Subjt:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLANLE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
        YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
Subjt:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ

Query:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT
        GLANNGEGKMALEFF  MLENDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFID+MPFPPNAVVWRT
Subjt:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT

Query:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
        LLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Subjt:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK

Query:  QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
        QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC DCHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Subjt:  QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

KAG7023094.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 0.0e+00; % identity: 88.7
Query:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
        MASIV CLPN S+TSIT   QFPENPKSLILQ+CKTPKDL+QVHAHLLKTRRL DP I EAVLESAALLLP++IDYALSIFNH+DKPESSAYNVMIRGLA
Subjt:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY
        FK+SP NA+LLFKKMHE SVQHDKFTFSSVLKACSRM+ALREGEQVHALILKSGFKSNEFVENTLI MYANCGQ+GVAR VFDGM +R+ VAWNSMLSGY
Subjt:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFRK+LEL IEFDDVTMISVLMACGRLA+LE+GELIGEYI+SKG+RRN+TLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
        YAQADRCKEAL+LFHEMQK  V  NEVTMVSVLYSCA+LGAYETGKWVH YIK+KKMKLTV+LGTQLIDFYAKCGYIDRSVEVF+ M F NVFTWTALIQ
Subjt:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ

Query:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT
        GLANNGEGKMAL+FF  M EN+VKPNDVTFI VLSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAG LEEAYQFI NMP PPNAVVWRT
Subjt:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT

Query:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
        LLASC+AHKN+EMAEKS +HIT LEPAHSGDYILLSNTYALVGRVEDA+RVRSLIK+KEIKK PGCSLIELDGVVHEFFSEDG+H HSKEIHDALD+MMK
Subjt:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK

Query:  QIKRLGYVPNTDDARLEA-EEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
        +IK LGYVPN +DARLEA EEESKETSVSHHSEKLAIAYGLIRT  +TTIRISKNLRMCRDCHNATK ISQV++R IIVRDRNRFHHFKDGLCSCNDYW
Subjt:  QIKRLGYVPNTDDARLEA-EEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

XP_004138266.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sativus]; e value: 0.0e+00; % identity: 99.71
Query:  MASIVGCLPNTSLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKR
        MASIVGCLPN SLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKR
Subjt:  MASIVGCLPNTSLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKR

Query:  SPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKN
        SPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKN
Subjt:  SPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKN

Query:  GLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ
        GLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ
Subjt:  GLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ

Query:  ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLA
        ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLA
Subjt:  ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLA

Query:  NNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLA
        NNGEGKMALEFF SMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLA
Subjt:  NNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLA

Query:  SCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK
        SCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK
Subjt:  SCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK

Query:  RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
        RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
Subjt:  RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

XP_016903201.1 PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucumis melo]; e value: 0.0e+00; % identity: 94.99
Query:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
        MASIVGCLP TSLTSIT   QFPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
Subjt:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY
        FKRSPDNALLLFKKMHE SVQHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPER IVAWNSMLSGY
Subjt:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLANLE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
        YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
Subjt:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ

Query:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT
        GLANNGEGKMALEFF  MLENDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFID+MPFPPNAVVWRT
Subjt:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT

Query:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
        LLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Subjt:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK

Query:  QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
        QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC DCHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Subjt:  QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

XP_022921781.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita moschata]; e value: 0.0e+00; % identity: 88.56
Query:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
        MASIV CLPN S+TSIT   QFPENPKSLILQ+CKTPKDL+QVHAHLLKTRRL DP I EAVLESAALLLP++IDYALSIFNH+DKPESSAYNVMIRGLA
Subjt:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY
        FK+SP NA+LLFKKMHE SVQHDKFTFSSVLKACSRM+ALREGEQVHALILKSGFK NEFVENTLI MYANCGQ+GVAR VFDGM +R+ VAWNSMLSGY
Subjt:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFRK+LEL IEFDDVTMISVLMACGRLA+LE+GELIGEYI+SKG+RRN+TLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
        YAQADRCKEAL+LFHEMQK  V  NEVTMVSVLYSCA+LGAYETGKWVH YIK+KKMKLTV+LGTQLIDFYAKCGYIDRSVEVF+ M F NVFTWTALIQ
Subjt:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ

Query:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT
        GLANNGEGKMAL+FF  M EN+VKPNDVTFI VLSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAG LEEAYQFI NMP PPNAVVWRT
Subjt:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT

Query:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
        LLASC+AHKN+EMAEKS +HIT LEPAHSGDYILLSNTYALVGRVEDA+RVRSLIK+KEIKK PGCSLIELDGVVHEFFSEDG+H HSKEIHDALD+MMK
Subjt:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK

Query:  QIKRLGYVPNTDDARLEA-EEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
        +IK LGYVPN +DARLEA EEESKETSVSHHSEKLAIAYGLIRT  +TTIRISKNLRMCRDCHNATK ISQV++R IIVRDRNRFHHFKDGLCSCNDYW
Subjt:  QIKRLGYVPNTDDARLEA-EEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LRD6 DYW_deaminase domain-containing protein; e value: 0.0e+00; % identity: 99.71
Query:  MASIVGCLPNTSLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKR
        MASIVGCLPN SLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKR
Subjt:  MASIVGCLPNTSLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKR

Query:  SPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKN
        SPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKN
Subjt:  SPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKN

Query:  GLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ
        GLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ
Subjt:  GLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ

Query:  ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLA
        ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLA
Subjt:  ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLA

Query:  NNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLA
        NNGEGKMALEFF SMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLA
Subjt:  NNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLA

Query:  SCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK
        SCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK
Subjt:  SCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK

Query:  RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
        RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
Subjt:  RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

A0A1S4E4P2 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like; e value: 0.0e+00; % identity: 94.99
Query:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
        MASIVGCLP TSLTSIT   QFPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
Subjt:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY
        FKRSPDNALLLFKKMHE SVQHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPER IVAWNSMLSGY
Subjt:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLANLE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
        YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
Subjt:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ

Query:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT
        GLANNGEGKMALEFF  MLENDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFID+MPFPPNAVVWRT
Subjt:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT

Query:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
        LLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Subjt:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK

Query:  QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
        QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC DCHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Subjt:  QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

A0A5A7URN2 Pentatricopeptide repeat-containing protein; e value: 0.0e+00; % identity: 94.99
Query:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
        MASIVGCLP TSLTSIT   QFPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
Subjt:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY
        FKRSPDNALLLFKKMHE SVQHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPER IVAWNSMLSGY
Subjt:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLANLE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
        YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
Subjt:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ

Query:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT
        GLANNGEGKMALEFF  MLENDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFID+MPFPPNAVVWRT
Subjt:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT

Query:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
        LLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Subjt:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK

Query:  QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
        QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC DCHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Subjt:  QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

A0A5D3BFH8 Pentatricopeptide repeat-containing protein; e value: 0.0e+00; % identity: 94.99
Query:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
        MASIVGCLP TSLTSIT   QFPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
Subjt:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY
        FKRSPDNALLLFKKMHE SVQHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPER IVAWNSMLSGY
Subjt:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLANLE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
        YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
Subjt:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ

Query:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT
        GLANNGEGKMALEFF  MLENDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFID+MPFPPNAVVWRT
Subjt:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT

Query:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
        LLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Subjt:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK

Query:  QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
        QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC DCHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Subjt:  QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

A0A6J1E2C2 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like; e value: 0.0e+00; % identity: 88.56
Query:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA
        MASIV CLPN S+TSIT   QFPENPKSLILQ+CKTPKDL+QVHAHLLKTRRL DP I EAVLESAALLLP++IDYALSIFNH+DKPESSAYNVMIRGLA
Subjt:  MASIVGCLPNTSLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY
        FK+SP NA+LLFKKMHE SVQHDKFTFSSVLKACSRM+ALREGEQVHALILKSGFK NEFVENTLI MYANCGQ+GVAR VFDGM +R+ VAWNSMLSGY
Subjt:  FKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFRK+LEL IEFDDVTMISVLMACGRLA+LE+GELIGEYI+SKG+RRN+TLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ
        YAQADRCKEAL+LFHEMQK  V  NEVTMVSVLYSCA+LGAYETGKWVH YIK+KKMKLTV+LGTQLIDFYAKCGYIDRSVEVF+ M F NVFTWTALIQ
Subjt:  YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQ

Query:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT
        GLANNGEGKMAL+FF  M EN+VKPNDVTFI VLSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAG LEEAYQFI NMP PPNAVVWRT
Subjt:  GLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRT

Query:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
        LLASC+AHKN+EMAEKS +HIT LEPAHSGDYILLSNTYALVGRVEDA+RVRSLIK+KEIKK PGCSLIELDGVVHEFFSEDG+H HSKEIHDALD+MMK
Subjt:  LLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK

Query:  QIKRLGYVPNTDDARLEA-EEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
        +IK LGYVPN +DARLEA EEESKETSVSHHSEKLAIAYGLIRT  +TTIRISKNLRMCRDCHNATK ISQV++R IIVRDRNRFHHFKDGLCSCNDYW
Subjt:  QIKRLGYVPNTDDARLEA-EEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

SwissProt top hits (columns: e value, % identity, alignment)
O23337 Pentatricopeptide repeat-containing protein At4g14820; e value: 6.1e-150; % identity: 37.85
Query:  LQQCKTPKDLQQVHAHLLKT--RRLLDPIITEAVLESAALLLPDTIDYALSIFNHI-DKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTF
        L  CK+   ++Q+HAH+L+T     L+  +    + S+++     + YAL++F+ I   PES  +N  +R L+    P   +L ++++     + D+F+F
Subjt:  LQQCKTPKDLQQVHAHLLKT--RRLLDPIITEAVLESAALLLPDTIDYALSIFNHI-DKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTF

Query:  SSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVT
          +LKA S++ AL EG ++H +  K     + FVE   + MYA+CG+I  AR+VFD M  R +V WN+M+  Y + GL DE  KLF ++ +  +  D++ 
Subjt:  SSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVT

Query:  MISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYA-------------------------------KCGQVDTARKLFDEMDKRDVVAWSA
        + +++ ACGR  N+     I E+++   +R +  L T+L+ MYA                               KCG++D A+ +FD+ +K+D+V W+ 
Subjt:  MISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYA-------------------------------KCGQVDTARKLFDEMDKRDVVAWSA

Query:  MISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWT
        MIS Y ++D  +EAL +F EM    + P+ V+M SV+ +CA LG  +  KWVH  I    ++  +++   LI+ YAKCG +D + +VF++M  +NV +W+
Subjt:  MISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWT

Query:  ALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV
        ++I  L+ +GE   AL  F  M + +V+PN+VTF+GVL  CSH+ LV++G+ +F SM  +++I P++EHYGCMVD+ GRA  L EA + I++MP   N V
Subjt:  ALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV

Query:  VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD
        +W +L+++CR H  +E+ + + + I  LEP H G  +L+SN YA   R ED   +R +++EK + K  G S I+ +G  HEF   D  HK S EI+  LD
Subjt:  VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD

Query:  KMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRT------TIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDG
        +++ ++K  GYVP+     ++ EEE K+  V  HSEKLA+ +GL+             IRI KNLR+C DCH   K +S+V+ER IIVRDR RFH +K+G
Subjt:  KMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRT------TIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDG

Query:  LCSCNDYW
        LCSC DYW
Subjt:  LCSCNDYW

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic; e value: 1.2e-161; % identity: 39.97
Query:  ILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKM-HEKSVQHDKFTFS
        ++++C + + L+Q H H+++T    DP     +   AAL    +++YA  +F+ I KP S A+N +IR  A    P  ++  F  M  E     +K+TF 
Subjt:  ILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKM-HEKSVQHDKFTFS

Query:  SVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTM
         ++KA + + +L  G+ +H + +KS   S+ FV N+LI  Y +CG +  A  VF  + E+ +V+WNSM++G+ + G  D+ ++LF+K+    ++   VTM
Subjt:  SVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTM

Query:  ISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMD-------------------------------KRDVVAWSAM
        + VL AC ++ NLE G  +  YI    +  N TL  +++DMY KCG ++ A++LFD M+                               ++D+VAW+A+
Subjt:  ISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMD-------------------------------KRDVVAWSAM

Query:  ISGYAQADRCKEALNLFHEMQ-KGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWT
        IS Y Q  +  EAL +FHE+Q + N+  N++T+VS L +CA +GA E G+W+H YIKK  +++   + + LI  Y+KCG +++S EVF  +  ++VF W+
Subjt:  ISGYAQADRCKEALNLFHEMQ-KGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWT

Query:  ALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV
        A+I GLA +G G  A++ FY M E +VKPN VTF  V  ACSH  LVD+   LF+ M  ++ I P  +HY C+VD+LGR+G+LE+A +FI+ MP PP+  
Subjt:  ALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV

Query:  VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD
        VW  LL +C+ H N+ +AE +   +  LEP + G ++LLSN YA +G+ E+   +R  ++   +KK PGCS IE+DG++HEF S D  H  S++++  L 
Subjt:  VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD

Query:  KMMKQIKRLGYVPNTDDA-RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCN
        ++M+++K  GY P      ++  EEE KE S++ HSEKLAI YGLI T     IR+ KNLR+C DCH+  K ISQ+++R IIVRDR RFHHF++G CSCN
Subjt:  KMMKQIKRLGYVPNTDDA-RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCN

Query:  DYW
        D+W
Subjt:  DYW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g66520; e value: 1.3e-152; % identity: 40.3
Query:  LQQCKTPKDLQQVHAHLLKTRRLLDP-IITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSS
        LQ+C   ++L+Q+HA +LKT  + D   IT+ +    +    D + YA  +F+  D+P++  +N+MIRG +    P+ +LLL+++M   S  H+ +TF S
Subjt:  LQQCKTPKDLQQVHAHLLKTRRLLDP-IITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSS

Query:  VLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMI
        +LKACS + A  E  Q+HA I K G++++ +  N+LI  YA  G   +A  +FD +PE   V+WNS++ GY K G  D  + LFRK+ E           
Subjt:  VLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMI

Query:  SVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTM
                                                                   ++ ++W+ MISGY QAD  KEAL LFHEMQ  +V P+ V++
Subjt:  SVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTM

Query:  VSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFYSMLENDVKPNDVT
         + L +CA LGA E GKW+H Y+ K ++++   LG  LID YAKCG ++ ++EVFK +  K+V  WTALI G A +G G+ A+  F  M +  +KPN +T
Subjt:  VSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFYSMLENDVKPNDVT

Query:  FIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHS
        F  VL+ACS+  LV++G+ +F SM RD++++P IEHYGC+VD+LGRAG L+EA +FI  MP  PNAV+W  LL +CR HKNIE+ E+  E +  ++P H 
Subjt:  FIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHS

Query:  GDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLE-AEEESKETSVS
        G Y+  +N +A+  + + A   R L+KE+ + K+PGCS I L+G  HEF + D  H   ++I      M ++++  GYVP  ++  L+  +++ +E  V 
Subjt:  GDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLE-AEEESKETSVS

Query:  HHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
         HSEKLAI YGLI+T P T IRI KNLR+C+DCH  TK IS++++R I++RDR RFHHF+DG CSC DYW
Subjt:  HHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic; e value: 2.1e-166; % identity: 39.53
Query:  LPNTSLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLK-----TRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSP
        LP++S         +P   +L  CKT + L+ +HA ++K     T   L  +I   +L      LP    YA+S+F  I +P    +N M RG A    P
Subjt:  LPNTSLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLK-----TRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSP

Query:  DNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQM-------------------------------YANCGQ
         +AL L+  M    +  + +TF  VLK+C++ KA +EG+Q+H  +LK G   + +V  +LI M                               YA+ G 
Subjt:  DNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQM-------------------------------YANCGQ

Query:  IGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCG
        I  A+ +FD +P + +V+WN+M+SGY + G + E ++LF+ +++  +  D+ TM++V+ AC +  ++E+G  +  +I   G   N  +  +LID+Y+KCG
Subjt:  IGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCG

Query:  QVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT--VTLGTQLIDFYA
        +++TA  LF+ +  +DV++W+ +I GY   +  KEAL LF EM +    PN+VTM+S+L +CA LGA + G+W+H YI K+   +T   +L T LID YA
Subjt:  QVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT--VTLGTQLIDFYA

Query:  KCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDI
        KCG I+ + +VF  +  K++ +W A+I G A +G    + + F  M +  ++P+D+TF+G+LSACSH+ ++D GRH+F +M +D+ + P++EHYGCM+D+
Subjt:  KCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDI

Query:  LGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELD
        LG +G  +EA + I+ M   P+ V+W +LL +C+ H N+E+ E   E++ ++EP + G Y+LLSN YA  GR  +  + R+L+ +K +KK+PGCS IE+D
Subjt:  LGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELD

Query:  GVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVF
         VVHEF   D  H  ++EI+  L++M   +++ G+VP+T +   E EEE KE ++ HHSEKLAIA+GLI T P T + I KNLR+CR+CH ATK IS+++
Subjt:  GVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVF

Query:  ERMIIVRDRNRFHHFKDGLCSCNDYW
        +R II RDR RFHHF+DG+CSCNDYW
Subjt:  ERMIIVRDRNRFHHFKDGLCSCNDYW

Q9SR82 Putative pentatricopeptide repeat-containing protein At3g08820 (e-value: 1.7e-147; identity: 38.6%)
Query:  SLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKK
        ++ S T   +  K+LI   C T   L+Q+H  L+      D  +   +L+           Y L  F+H   P    YN +I G          L LF  
Subjt:  SLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKK

Query:  MHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFR
        + +  +    FTF  VLKAC+R  + + G  +H+L++K GF  +     +L+ +Y+  G++  A  +FD +P+RS+V W ++ SGYT +G   E + LF+
Subjt:  MHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFR

Query:  KILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLF
        K++E+ ++ D   ++ VL AC  + +L+ GE I +Y+    +++N+ + T+L+++YAKCG+++ AR +FD M ++D+V WS MI GYA     KE + LF
Subjt:  KILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLF

Query:  HEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEF
         +M + N+ P++ ++V  L SCA LGA + G+W    I + +    + +   LID YAKCG + R  EVFKEM  K++    A I GLA NG  K++   
Subjt:  HEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEF

Query:  FYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMA
        F    +  + P+  TF+G+L  C HA L+  G   FN++   + ++  +EHYGCMVD+ GRAG L++AY+ I +MP  PNA+VW  LL+ CR  K+ ++A
Subjt:  FYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMA

Query:  EKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDA
        E  L+ +  LEP ++G+Y+ LSN Y++ GR ++A  VR ++ +K +KKIPG S IEL+G VHEF ++D  H  S +I+  L+ +  +++ +G+VP T+  
Subjt:  EKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDA

Query:  RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
          + EEE KE  + +HSEKLA+A GLI T     IR+ KNLR+C DCH   K IS++  R I+VRD NRFH F +G CSCNDYW
Subjt:  RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

Arabidopsis top hits (e-value, % identity, alignment)
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value: 1.5e-167; identity: 39.53%)
Query:  LPNTSLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLK-----TRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSP
        LP++S         +P   +L  CKT + L+ +HA ++K     T   L  +I   +L      LP    YA+S+F  I +P    +N M RG A    P
Subjt:  LPNTSLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLK-----TRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSP

Query:  DNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQM-------------------------------YANCGQ
         +AL L+  M    +  + +TF  VLK+C++ KA +EG+Q+H  +LK G   + +V  +LI M                               YA+ G 
Subjt:  DNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQM-------------------------------YANCGQ

Query:  IGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCG
        I  A+ +FD +P + +V+WN+M+SGY + G + E ++LF+ +++  +  D+ TM++V+ AC +  ++E+G  +  +I   G   N  +  +LID+Y+KCG
Subjt:  IGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCG

Query:  QVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT--VTLGTQLIDFYA
        +++TA  LF+ +  +DV++W+ +I GY   +  KEAL LF EM +    PN+VTM+S+L +CA LGA + G+W+H YI K+   +T   +L T LID YA
Subjt:  QVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT--VTLGTQLIDFYA

Query:  KCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDI
        KCG I+ + +VF  +  K++ +W A+I G A +G    + + F  M +  ++P+D+TF+G+LSACSH+ ++D GRH+F +M +D+ + P++EHYGCM+D+
Subjt:  KCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDI

Query:  LGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELD
        LG +G  +EA + I+ M   P+ V+W +LL +C+ H N+E+ E   E++ ++EP + G Y+LLSN YA  GR  +  + R+L+ +K +KK+PGCS IE+D
Subjt:  LGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELD

Query:  GVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVF
         VVHEF   D  H  ++EI+  L++M   +++ G+VP+T +   E EEE KE ++ HHSEKLAIA+GLI T P T + I KNLR+CR+CH ATK IS+++
Subjt:  GVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVF

Query:  ERMIIVRDRNRFHHFKDGLCSCNDYW
        +R II RDR RFHHF+DG+CSCNDYW
Subjt:  ERMIIVRDRNRFHHFKDGLCSCNDYW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value: 8.5e-163; identity: 39.97%)
Query:  ILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKM-HEKSVQHDKFTFS
        ++++C + + L+Q H H+++T    DP     +   AAL    +++YA  +F+ I KP S A+N +IR  A    P  ++  F  M  E     +K+TF 
Subjt:  ILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKM-HEKSVQHDKFTFS

Query:  SVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTM
         ++KA + + +L  G+ +H + +KS   S+ FV N+LI  Y +CG +  A  VF  + E+ +V+WNSM++G+ + G  D+ ++LF+K+    ++   VTM
Subjt:  SVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTM

Query:  ISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMD-------------------------------KRDVVAWSAM
        + VL AC ++ NLE G  +  YI    +  N TL  +++DMY KCG ++ A++LFD M+                               ++D+VAW+A+
Subjt:  ISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMD-------------------------------KRDVVAWSAM

Query:  ISGYAQADRCKEALNLFHEMQ-KGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWT
        IS Y Q  +  EAL +FHE+Q + N+  N++T+VS L +CA +GA E G+W+H YIKK  +++   + + LI  Y+KCG +++S EVF  +  ++VF W+
Subjt:  ISGYAQADRCKEALNLFHEMQ-KGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWT

Query:  ALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV
        A+I GLA +G G  A++ FY M E +VKPN VTF  V  ACSH  LVD+   LF+ M  ++ I P  +HY C+VD+LGR+G+LE+A +FI+ MP PP+  
Subjt:  ALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV

Query:  VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD
        VW  LL +C+ H N+ +AE +   +  LEP + G ++LLSN YA +G+ E+   +R  ++   +KK PGCS IE+DG++HEF S D  H  S++++  L 
Subjt:  VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD

Query:  KMMKQIKRLGYVPNTDDA-RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCN
        ++M+++K  GY P      ++  EEE KE S++ HSEKLAI YGLI T     IR+ KNLR+C DCH+  K ISQ+++R IIVRDR RFHHF++G CSCN
Subjt:  KMMKQIKRLGYVPNTDDA-RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCN

Query:  DYW
        D+W
Subjt:  DYW

AT3G08820.1 Pentatricopeptide repeat (PPR) superfamily protein (e-value: 1.2e-148; identity: 38.6%)
Query:  SLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKK
        ++ S T   +  K+LI   C T   L+Q+H  L+      D  +   +L+           Y L  F+H   P    YN +I G          L LF  
Subjt:  SLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKK

Query:  MHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFR
        + +  +    FTF  VLKAC+R  + + G  +H+L++K GF  +     +L+ +Y+  G++  A  +FD +P+RS+V W ++ SGYT +G   E + LF+
Subjt:  MHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFR

Query:  KILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLF
        K++E+ ++ D   ++ VL AC  + +L+ GE I +Y+    +++N+ + T+L+++YAKCG+++ AR +FD M ++D+V WS MI GYA     KE + LF
Subjt:  KILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLF

Query:  HEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEF
         +M + N+ P++ ++V  L SCA LGA + G+W    I + +    + +   LID YAKCG + R  EVFKEM  K++    A I GLA NG  K++   
Subjt:  HEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEF

Query:  FYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMA
        F    +  + P+  TF+G+L  C HA L+  G   FN++   + ++  +EHYGCMVD+ GRAG L++AY+ I +MP  PNA+VW  LL+ CR  K+ ++A
Subjt:  FYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMA

Query:  EKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDA
        E  L+ +  LEP ++G+Y+ LSN Y++ GR ++A  VR ++ +K +KKIPG S IEL+G VHEF ++D  H  S +I+  L+ +  +++ +G+VP T+  
Subjt:  EKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDA

Query:  RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
          + EEE KE  + +HSEKLA+A GLI T     IR+ KNLR+C DCH   K IS++  R I+VRD NRFH F +G CSCNDYW
Subjt:  RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW

AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein (e-value: 4.4e-151; identity: 37.85%)
Query:  LQQCKTPKDLQQVHAHLLKT--RRLLDPIITEAVLESAALLLPDTIDYALSIFNHI-DKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTF
        L  CK+   ++Q+HAH+L+T     L+  +    + S+++     + YAL++F+ I   PES  +N  +R L+    P   +L ++++     + D+F+F
Subjt:  LQQCKTPKDLQQVHAHLLKT--RRLLDPIITEAVLESAALLLPDTIDYALSIFNHI-DKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTF

Query:  SSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVT
          +LKA S++ AL EG ++H +  K     + FVE   + MYA+CG+I  AR+VFD M  R +V WN+M+  Y + GL DE  KLF ++ +  +  D++ 
Subjt:  SSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVT

Query:  MISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYA-------------------------------KCGQVDTARKLFDEMDKRDVVAWSA
        + +++ ACGR  N+     I E+++   +R +  L T+L+ MYA                               KCG++D A+ +FD+ +K+D+V W+ 
Subjt:  MISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYA-------------------------------KCGQVDTARKLFDEMDKRDVVAWSA

Query:  MISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWT
        MIS Y ++D  +EAL +F EM    + P+ V+M SV+ +CA LG  +  KWVH  I    ++  +++   LI+ YAKCG +D + +VF++M  +NV +W+
Subjt:  MISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWT

Query:  ALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV
        ++I  L+ +GE   AL  F  M + +V+PN+VTF+GVL  CSH+ LV++G+ +F SM  +++I P++EHYGCMVD+ GRA  L EA + I++MP   N V
Subjt:  ALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV

Query:  VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD
        +W +L+++CR H  +E+ + + + I  LEP H G  +L+SN YA   R ED   +R +++EK + K  G S I+ +G  HEF   D  HK S EI+  LD
Subjt:  VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD

Query:  KMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRT------TIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDG
        +++ ++K  GYVP+     ++ EEE K+  V  HSEKLA+ +GL+             IRI KNLR+C DCH   K +S+V+ER IIVRDR RFH +K+G
Subjt:  KMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRT------TIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDG

Query:  LCSCNDYW
        LCSC DYW
Subjt:  LCSCNDYW

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value: 9.4e-154; identity: 40.3%)
Query:  LQQCKTPKDLQQVHAHLLKTRRLLDP-IITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSS
        LQ+C   ++L+Q+HA +LKT  + D   IT+ +    +    D + YA  +F+  D+P++  +N+MIRG +    P+ +LLL+++M   S  H+ +TF S
Subjt:  LQQCKTPKDLQQVHAHLLKTRRLLDP-IITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSS

Query:  VLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMI
        +LKACS + A  E  Q+HA I K G++++ +  N+LI  YA  G   +A  +FD +PE   V+WNS++ GY K G  D  + LFRK+ E           
Subjt:  VLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMI

Query:  SVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTM
                                                                   ++ ++W+ MISGY QAD  KEAL LFHEMQ  +V P+ V++
Subjt:  SVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTM

Query:  VSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFYSMLENDVKPNDVT
         + L +CA LGA E GKW+H Y+ K ++++   LG  LID YAKCG ++ ++EVFK +  K+V  WTALI G A +G G+ A+  F  M +  +KPN +T
Subjt:  VSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFYSMLENDVKPNDVT

Query:  FIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHS
        F  VL+ACS+  LV++G+ +F SM RD++++P IEHYGC+VD+LGRAG L+EA +FI  MP  PNAV+W  LL +CR HKNIE+ E+  E +  ++P H 
Subjt:  FIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHS

Query:  GDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLE-AEEESKETSVS
        G Y+  +N +A+  + + A   R L+KE+ + K+PGCS I L+G  HEF + D  H   ++I      M ++++  GYVP  ++  L+  +++ +E  V 
Subjt:  GDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLE-AEEESKETSVS

Query:  HHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
         HSEKLAI YGLI+T P T IRI KNLR+C+DCH  TK IS++++R I++RDR RFHHF+DG CSC DYW
Subjt:  HHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW


Sequences
CDS sequence
ATGGCGTCGATAGTCGGTTGCCTTCCCAATACATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTTGATTCTTCAGCAATGCAAAACTCCAAAAGACCT
CCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCGACACCATAGATTATG
CCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAGCGATCGCCTGATAATGCCCTTCTCTTGTTCAAG
AAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGCGCTGAGGGAAGGTGAACAGGTCCACGCGTTGAT
TCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTGGGGTTGCACGTCATGTGTTTGATGGAATGCCGG
AAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTTTTTCGAAAAATTTTGGAACTGCGTATTGAATTT
GATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGGTGAGTATATTGTGTCAAAAGGGCTAAGACGAAACAA
TACTCTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCAAGTTGATACCGCTAGAAAGTTGTTCGATGAAATGGATAAAAGAGATGTTGTTGCTTGGAGTGCAA
TGATCTCGGGGTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAATCTGTTCCATGAGATGCAGAAGGGAAATGTATATCCAAACGAGGTAACAATGGTCAGTGTTCTC
TATTCGTGCGCTATGCTTGGAGCATACGAAACAGGTAAGTGGGTTCATTTCTACATCAAAAAGAAGAAGATGAAGCTCACGGTTACTCTTGGAACTCAGCTGATAGATTT
TTATGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGACAGCATTAATTCAAGGTCTTGCCAATAATGGAG
AAGGGAAAATGGCTCTGGAATTCTTTTACTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGCGTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTT
GATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATACTTGGACGTGCTGGGTTTCTTGA
AGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTAGAGCTCATAAAAACATTGAAATGGCAGAAAAAT
CATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTTGGTAGGGTTGAGGATGCAATCAGGGTAAGATCT
TTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTTTTCAGAAGATGGAGAACATAAGCACTCCAAGGA
AATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGACTGGAGGCTGAGGAAGAGAGCAAAGAAACTTCAG
TGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCAAAAAATCTTAGGATGTGTAGGGATTGCCATAAT
GCAACGAAGTTTATATCACAAGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGATGGCCTTTGCTCCTGTAATGACTATTGGTGA
mRNA sequence
TAAAGAGTACTATTTTTTACTCCTCCTTCGTGAAAGAATTCTTTGTCGATTGAAATGTTGGCAGCTCTGTTCCACTGAATTGAACTATCTATAGATGATATATTGCCAAA
TTTAGAATCTTTCGAGCTTCACTTTCATCTCAAATGGCGTCGATAGTCGGTTGCCTTCCCAATACATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTT
GATTCTTCAGCAATGCAAAACTCCAAAAGACCTCCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCG
CAGCTTTACTCCTTCCCGACACCATAGATTATGCCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAG
CGATCGCCTGATAATGCCCTTCTCTTGTTCAAGAAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGC
GCTGAGGGAAGGTGAACAGGTCCACGCGTTGATTCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTG
GGGTTGCACGTCATGTGTTTGATGGAATGCCGGAAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTT
TTTCGAAAAATTTTGGAACTGCGTATTGAATTTGATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGGTGA
GTATATTGTGTCAAAAGGGCTAAGACGAAACAATACTCTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCAAGTTGATACCGCTAGAAAGTTGTTCGATGAAA
TGGATAAAAGAGATGTTGTTGCTTGGAGTGCAATGATCTCGGGGTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAATCTGTTCCATGAGATGCAGAAGGGAAATGTA
TATCCAAACGAGGTAACAATGGTCAGTGTTCTCTATTCGTGCGCTATGCTTGGAGCATACGAAACAGGTAAGTGGGTTCATTTCTACATCAAAAAGAAGAAGATGAAGCT
CACGGTTACTCTTGGAACTCAGCTGATAGATTTTTATGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGA
CAGCATTAATTCAAGGTCTTGCCAATAATGGAGAAGGGAAAATGGCTCTGGAATTCTTTTACTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGC
GTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCAT
GGTTGATATACTTGGACGTGCTGGGTTTCTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTA
GAGCTCATAAAAACATTGAAATGGCAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTT
GGTAGGGTTGAGGATGCAATCAGGGTAAGATCTTTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTT
TTCAGAAGATGGAGAACATAAGCACTCCAAGGAAATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGAC
TGGAGGCTGAGGAAGAGAGCAAAGAAACTTCAGTGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCA
AAAAATCTTAGGATGTGTAGGGATTGCCATAATGCAACGAAGTTTATATCACAAGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGA
TGGCCTTTGCTCCTGTAATGACTATTGGTGAGTCTTTTATAGTTGATGGAACATCCATTGTTAGAGAGAAGTAGATGGAACATCGTGTTGATTGGTTACAATGTCTAAAT
ATAGGCAACTCATTCAGTTATATATCTTCAATGGCAGTATCATTTATG
Protein sequence
MASIVGCLPNTSLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFK
KMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEF
DDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVL
YSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFYSMLENDVKPNDVTFIGVLSACSHACLV
DQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRS
LIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHN
ATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
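
The CDS and protein sequences listed above should correspond under the standard genetic code. The following is a minimal Python sketch of that check, not part of the database record: the function name `translate` and the short test fragments (copied from the first and last codons of the CDS above) are illustrative assumptions, and the codon table is the standard nuclear code written in the usual TCAG order.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in directly. Unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the start and end of the CDS listed above.
cds_start = "ATGGCGTCGATAGTCGGTTGCCTTCCCAATACA"
cds_end = "TGTAATGACTATTGGTGA"

print(translate(cds_start))  # MASIVGCLPNT
print(translate(cds_end))    # CNDYW
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence (after joining its wrapped lines) would extend the same check to the whole record.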