CuGenDBv2

Tan0013977 (gene) of Snake gourd v1 genome

Gene ID: Tan0013977
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG06:14767936..14771965
RNA-Seq Expression: Tan0013977
Synteny: Tan0013977
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e-value | % identity | Alignment
KAG6575982.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 1.3e-255 | 90.12
Query:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI
        A STP+  LPPA++H NPKN VFL QQRAS       + SSSSSSTDGKLM+RSP   RMD+ KL AKEA ERKEEVNRKIASQKAISVILRREATKAVI
Subjt:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI

Query:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL
        ERKRGP NSKKLLPRTVLEALHERITALRWESALKVFELLREQ+WYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMI+EGCEVSHESYTALLSAYSRSGL
Subjt:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR
        LD+AF LL+EMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGCKPDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGK  SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LF+LMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK
        CSLVKAYGQAGK +KIDSVLHIVENS+IMLDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DLVS+AEASK
Subjt:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK

Query:  RTRPDL
        RTRPDL
Subjt:  RTRPDL

XP_022954169.1 pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Cucurbita moschata] | 1.7e-255 | 90.32
Query:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI
        A STP+  LPPA++H NPKN VFL QQRAS       + SSSSSSTDGKL++RSP   RMD+AKL AKEA ERKEEVNRKIASQKAISVILRREATKAVI
Subjt:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI

Query:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL
        ERKRGP NSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMI+EGCEVSHESYTALLSAYSRSGL
Subjt:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR
        LD+AF LL+EMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGCKPDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGK  SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK
        CSLVKAYGQAGK +KIDSVLHIVENS+I LDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DLVS+AEASK
Subjt:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK

Query:  RTRPDL
        RTRPDL
Subjt:  RTRPDL

XP_022991692.1 pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Cucurbita maxima] | 6.4e-255 | 90.12
Query:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI
        A STP+  LPPA++H NPKN VFL QQRAS       + SSSSSSTDGKLM+RSP   RMD+AKL AKEA ERKEEVNRKIASQKAISVILRREATKAVI
Subjt:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI

Query:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL
        ERKRGP NSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMI+EGCEVSHESYTALLSAYSRSGL
Subjt:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR
        LD+AF LL+EMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGCKPDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGK  SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK
        CSLVKAYGQAGK +KI+SVLHIVENS+I LDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DLV +AEASK
Subjt:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK

Query:  RTRPDL
        RTRPDL
Subjt:  RTRPDL

XP_023549189.1 pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Cucurbita pepo subsp. pepo] | 6.4e-255 | 90.12
Query:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI
        A STP+  LPPA++H NPKN VFL QQRAS       + SSSSSSTDGKLM+RSP   RMD+AKL AKEA ERKEEVNRKIASQKAISVILRREATKAVI
Subjt:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI

Query:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL
        ERKRGP NSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMI+EGCEVSHESYTALLSAYSRSGL
Subjt:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR
        LD+AF LL+EMKNSPDCQPDVHTYSILIKSCLQVFAF++AQTLLSDMVTRGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGCKPDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGK  SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK
        CSLVKAYGQAGK +KIDSVLHIVENS+I LDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DLVS+AEASK
Subjt:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK

Query:  RTRPDL
        RTRPDL
Subjt:  RTRPDL

XP_038877300.1 pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Benincasa hispida] | 1.4e-257 | 90.35
Query:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS---------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKA
        A STPTNPLPPAINH NP N VF RQQRA+         SSSSSSS++DGKLMERSP   RMDVAKL AKEA ERKEEVNRKIASQKAISVILRREATKA
Subjt:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS---------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKA

Query:  VIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRS
        VIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPY GMYIKLIVMLGKCKQPEKAYELFQEMI+EGCEVSHESYTALLSAYSRS
Subjt:  VIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRS

Query:  GLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNST
        GLLDKAF +L+EMKNSPDCQPDVHTYSIL+KSCLQVFAFNKAQTLLSDMVTRGIKP+TITYN FIDAYGKAKMFAEMESIL+EMLS+DGCKPDVWTMNST
Subjt:  GLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNST

Query:  LRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCV
        LRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGK  SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAG+LKQME+LFRLMRSERIKPSCV
Subjt:  LRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCV

Query:  TLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEA
        TLCSLV+AYGQAGK EKIDS+L+ VENSDIMLDTVF+NCLVDAYGRM CFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMA HA+EIQ+L+S+AEA
Subjt:  TLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEA

Query:  SKRTRPDL
        SKRTRPDL
Subjt:  SKRTRPDL

TrEMBL top hits | e-value | % identity | Alignment
A0A1S3C6D0 pentatricopeptide repeat-containing protein At5g48730, chloroplastic | 8.4e-253 | 88.87
Query:  NPLPPAINHRNPKNPVFLRQQR----------ASSSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVIERK
        NPLPP INHR+P   +FLR+Q+           SSSSSS+++TDGKL++ SPH  RMDVAKL AKEA+ERKEEVNRKIASQKAISVILRREATKAVIERK
Subjt:  NPLPPAINHRNPKNPVFLRQQR----------ASSSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVIERK

Query:  RGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDK
        RGPNNSKKLLPRTVLEALH+RITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAY+LFQEMI+EGCEVSHESYTALLSAYSRSGLLDK
Subjt:  RGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDK

Query:  AFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFG
        AF +L+EMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVT+GIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGCKPDVWTMNSTLRAFG
Subjt:  AFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFG

Query:  SSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSL
         SGQ+ETMEKCYEKFQ AGIQPNIQTFNILLDSYGK  SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERIKPSCVTLCSL
Subjt:  SSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSL

Query:  VKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASKRTR
        VKAYGQAGK EKI+SVL++VENSDIMLDTVF+NCLVDAYGRM CFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHA+EIQ+L+++AEASKRTR
Subjt:  VKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASKRTR

Query:  PDL
        PDL
Subjt:  PDL

A0A5A7SJF3 Pentatricopeptide repeat-containing protein | 3.2e-252 | 88.67
Query:  NPLPPAINHRNPKNPVFLRQQR----------ASSSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVIERK
        NPLPP INHR+P   +FLR+Q+           SSSSSS+++TDGKL++ SPH  RMDVAKL AKEA+ERKEEVNRKIASQKAISVILRREATKAVIERK
Subjt:  NPLPPAINHRNPKNPVFLRQQR----------ASSSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVIERK

Query:  RGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDK
        RGPNNSKKLLPRTVLEALH+RITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAY+LFQEMI+EGCEVSHESYTALLSAYSRSGLLDK
Subjt:  RGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDK

Query:  AFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFG
        AF +L+EMKNSP+CQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVT+GIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGCKPDVWTMNSTLRAFG
Subjt:  AFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFG

Query:  SSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSL
         SGQ+ETMEKCYEKFQ AGIQPNIQTFNILLDSYGK  SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERIKPSCVTLCSL
Subjt:  SSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSL

Query:  VKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASKRTR
        VKAYGQAGK EKI+SVL++VENSDIMLDTVF+NCLVDAYGRM CFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHA+EIQ+L+++AEASKRTR
Subjt:  VKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASKRTR

Query:  PDL
        PDL
Subjt:  PDL

A0A6J1CRU5 pentatricopeptide repeat-containing protein At5g48730, chloroplastic | 5.8e-254 | 89.96
Query:  STPTNPLPPAINHRNPKNPVFLRQQRAS-SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVIERKRGPNN
        STPTNPLP AINHRNP+ PV L++Q+ S S + S +S++GKLMER PHG RMDVAKL A EASERKEEVNRKIASQKAISVILRREATKAVIE+KRGP N
Subjt:  STPTNPLPPAINHRNPKNPVFLRQQRAS-SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVIERKRGPNN

Query:  SKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLL
        SKKLLP+TVLEALHER++ALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKA ELF EMIDEGCEVS ESYTALLSAYSRSGLLDKAF LL
Subjt:  SKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLL

Query:  DEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQL
        DEM+NSPDCQPDVHTYSILIKSCLQVFAF KAQ LLSDMV RGIKP+TITYNTFIDAYGKAKMFAEMESIL+EMLS+DGCKPDVWTMNSTLRAFGSSGQL
Subjt:  DEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQL

Query:  ETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYG
        ETMEKCYEKFQGAGIQPNIQTFNILLDSYGK GSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLV+AYG
Subjt:  ETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYG

Query:  QAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASKRTRPDL
        QAGK EKID VL IVENSDI LDTVFFNCLVDAYG+MGCFAEMK V+GMMEQRGCKPDKTTYRT+ARAYSDGGM NHAREIQ L+SSAEASK+TRPDL
Subjt:  QAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASKRTRPDL

A0A6J1GS58 pentatricopeptide repeat-containing protein At5g48730, chloroplastic | 8.1e-256 | 90.32
Query:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI
        A STP+  LPPA++H NPKN VFL QQRAS       + SSSSSSTDGKL++RSP   RMD+AKL AKEA ERKEEVNRKIASQKAISVILRREATKAVI
Subjt:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI

Query:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL
        ERKRGP NSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMI+EGCEVSHESYTALLSAYSRSGL
Subjt:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR
        LD+AF LL+EMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGCKPDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGK  SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK
        CSLVKAYGQAGK +KIDSVLHIVENS+I LDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DLVS+AEASK
Subjt:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK

Query:  RTRPDL
        RTRPDL
Subjt:  RTRPDL

A0A6J1JMJ6 pentatricopeptide repeat-containing protein At5g48730, chloroplastic | 3.1e-255 | 90.12
Query:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI
        A STP+  LPPA++H NPKN VFL QQRAS       + SSSSSSTDGKLM+RSP   RMD+AKL AKEA ERKEEVNRKIASQKAISVILRREATKAVI
Subjt:  AASTPTNPLPPAINHRNPKNPVFLRQQRAS-------SSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVI

Query:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL
        ERKRGP NSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMI+EGCEVSHESYTALLSAYSRSGL
Subjt:  ERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR
        LD+AF LL+EMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGCKPDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGK  SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK
        CSLVKAYGQAGK +KI+SVLHIVENS+I LDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DLV +AEASK
Subjt:  CSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASK

Query:  RTRPDL
        RTRPDL
Subjt:  RTRPDL

SwissProt top hits | e-value | % identity | Alignment
Q9FKC3 Pentatricopeptide repeat-containing protein At5g48730, chloroplastic | 6.5e-186 | 67.4
Query:  MAASTPTNPLPPAINHRNPKNPVFL--------RQQRASSSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKA
        ++ ST T+  PP   +R      F         R+   + +S  S++T   L E       ++   +N +++ E KE+ N KIAS+KAIS+ILRREATK+
Subjt:  MAASTPTNPLPPAINHRNPKNPVFL--------RQQRASSSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKA

Query:  VIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRS
        +IE+K+G   SKKLLPRTVLE+LHERITALRWESA++VFELLREQLWY+P  G+Y+KLIVMLGKCKQPEKA+ELFQEMI+EGC V+HE YTAL+SAYSRS
Subjt:  VIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRS

Query:  GLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNST
        G  D AF LL+ MK+S +CQPDVHTYSILIKS LQVFAF+K Q LLSDM  +GI+P+TITYNT IDAYGKAKMF EMES L++ML ED CKPD WTMNST
Subjt:  GLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNST

Query:  LRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCV
        LRAFG +GQ+E ME CYEKFQ +GI+PNI+TFNILLDSYGK+G+Y+KMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAG+LKQMEYLFRLM+SERI PSCV
Subjt:  LRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCV

Query:  TLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSS
        TLCSLV+AYG+A K +KI  VL  +ENSDI LD VFFNCLVDAYGRM  FAEMK VL +ME++G KPDK TYRTM +AY   GM  H +E+  +V S
Subjt:  TLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSS

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic | 8.6e-45 | 27.09
Query:  WESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLLDEMKN---SPD----------
        W+ +L++F+ ++ Q+W +P   +Y  +I +LG+    +K  E+F EM  +G   S  SYTAL++AY R+G  + +  LLD MKN   SP           
Subjt:  WESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLLDEMKN---SPD----------

Query:  ----------------------CQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWT
                               QPD+ TY+ L+ +C      ++A+ +   M   GI P   TY+  ++ +GK +   ++  +L EM S  G  PD+ +
Subjt:  ----------------------CQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWT

Query:  MNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIK
         N  L A+  SG ++     + + Q AG  PN  T+++LL+ +G++G Y+ +  +   M+  +      TYN++I+ FG  G  K++  LF  M  E I+
Subjt:  MNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIK

Query:  PSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREI
        P   T   ++ A G+ G HE    +L  +  +DI+  +  +  +++A+G+   + E       M + G  P   T+ ++  +++ GG+   +  I
Subjt:  PSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREI

Q9SCP4 Pentatricopeptide repeat-containing protein At3g53170 | 2.7e-107 | 45.45
Query:  NAKEASERKEEVNRKIAS-------QKAISVILRREATKAVIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIV
        + K  +ER E++N  + S       +K +S ILR +A    IERK        L P+ VLEAL E I   RW+SALK+F LLR+Q WY P    Y KL  
Subjt:  NAKEASERKEEVNRKIAS-------QKAISVILRREATKAVIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIV

Query:  MLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTIT
        +LG CKQP++A  LF+ M+ EG + + + YT+L+S Y +S LLDKAF  L+ MK+  DC+PDV T+++LI  C ++  F+  ++++ +M   G+  ST+T
Subjt:  MLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTIT

Query:  YNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHY
        YNT ID YGKA MF EMES+L +M+ +    PDV T+NS + ++G+   +  ME  Y +FQ  G+QP+I TFNIL+ S+GK G Y+KM +VM++M+K  +
Subjt:  YNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHY

Query:  SWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMM
        S T VTYN+VI+ FG+AG +++M+ +FR M+ + +KP+ +T CSLV AY +AG   KIDSVL  + NSD++LDT FFNC+++AYG+ G  A MK++   M
Subjt:  SWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMM

Query:  EQRGCKPDKTTYRTMARAYSDGGMANHAREIQ-DLVSSAE
        E+R CKPDK T+ TM + Y+  G+ +  +E++  ++SS E
Subjt:  EQRGCKPDKTTYRTMARAYSDGGMANHAREIQ-DLVSSAE

Q9SQU6 Pentatricopeptide repeat-containing protein At3g06430, chloroplastic | 7.2e-92 | 42.82
Query:  TKAVIERKRGPNNSKKLLPR---------TVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHE
        T+ V +R+    N KK L R         TV E L + I   +W  AL+VF++LREQ +Y+P  G Y+KL+V+LGK  QP +A +LF EM++EG E + E
Subjt:  TKAVIERKRGPNNSKKLLPR---------TVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHE

Query:  SYTALLSAYSRSGLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSED
         YTALL+AY+RS L+D AF +LD+MK+ P CQPDV TYS L+K+C+    F+   +L  +M  R I P+T+T N  +  YG+   F +ME +L +ML   
Subjt:  SYTALLSAYSRSGLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSED

Query:  GCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFR
         CKPDVWTMN  L  FG+ G+++ ME  YEKF+  GI+P  +TFNIL+ SYGK   Y+KMS+VMEYM+K  + WT  TYN +I+AF   G+ K ME  F 
Subjt:  GCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFR

Query:  LMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANH-
         MRSE +K    T C L+  Y  AG   K+ S + +    +I  +T F+N ++ A  +     EM++V   M++R C  D  T+  M  AY   GM +  
Subjt:  LMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANH-

Query:  ---AREIQDLVSSAEASK
            +E Q L+    A+K
Subjt:  ---AREIQDLVSSAEASK

Q9SV96 Pentatricopeptide repeat-containing protein At4g39620, chloroplastic | 1.4e-42 | 29.76
Query:  RWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAY----SRSGLLDKAFLLLDEMKNSPDCQPDVHTY
        +W   L+VF  +++Q WY P  G+Y KLI ++GK  Q   A  LF EM + GC      Y AL++A+     ++  L+K    LD+MK    CQP+V TY
Subjt:  RWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAY----SRSGLLDKAFLLLDEMKNSPDCQPDVHTY

Query:  SILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQ
        +IL+++  Q    ++   L  D+    + P   T+N  +DAYGK  M  EME++L  M S + CKPD                                 
Subjt:  SILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQ

Query:  PNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVE
          I TFN+L+DSYGK   +EKM    + + +     T+ T+N +I  +G+A  + + E++F+ M      PS +T   ++  YG  G   +   +   V 
Subjt:  PNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVE

Query:  NSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAE
         SD +L     N +++ Y R G + E  K+          PD +TY+ + +AY+   M     ++Q L+   E
Subjt:  NSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAE

Arabidopsis top hits | e-value | % identity | Alignment
AT1G74850.1 plastid transcriptionally active 2 | 6.1e-46 | 27.09
Query:  WESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLLDEMKN---SPD----------
        W+ +L++F+ ++ Q+W +P   +Y  +I +LG+    +K  E+F EM  +G   S  SYTAL++AY R+G  + +  LLD MKN   SP           
Subjt:  WESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLLDEMKN---SPD----------

Query:  ----------------------CQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWT
                               QPD+ TY+ L+ +C      ++A+ +   M   GI P   TY+  ++ +GK +   ++  +L EM S  G  PD+ +
Subjt:  ----------------------CQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWT

Query:  MNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIK
         N  L A+  SG ++     + + Q AG  PN  T+++LL+ +G++G Y+ +  +   M+  +      TYN++I+ FG  G  K++  LF  M  E I+
Subjt:  MNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIK

Query:  PSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREI
        P   T   ++ A G+ G HE    +L  +  +DI+  +  +  +++A+G+   + E       M + G  P   T+ ++  +++ GG+   +  I
Subjt:  PSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREI

AT3G06430.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 5.1e-93 | 42.82
Query:  TKAVIERKRGPNNSKKLLPR---------TVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHE
        T+ V +R+    N KK L R         TV E L + I   +W  AL+VF++LREQ +Y+P  G Y+KL+V+LGK  QP +A +LF EM++EG E + E
Subjt:  TKAVIERKRGPNNSKKLLPR---------TVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHE

Query:  SYTALLSAYSRSGLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSED
         YTALL+AY+RS L+D AF +LD+MK+ P CQPDV TYS L+K+C+    F+   +L  +M  R I P+T+T N  +  YG+   F +ME +L +ML   
Subjt:  SYTALLSAYSRSGLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSED

Query:  GCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFR
         CKPDVWTMN  L  FG+ G+++ ME  YEKF+  GI+P  +TFNIL+ SYGK   Y+KMS+VMEYM+K  + WT  TYN +I+AF   G+ K ME  F 
Subjt:  GCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFR

Query:  LMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANH-
         MRSE +K    T C L+  Y  AG   K+ S + +    +I  +T F+N ++ A  +     EM++V   M++R C  D  T+  M  AY   GM +  
Subjt:  LMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANH-

Query:  ---AREIQDLVSSAEASK
            +E Q L+    A+K
Subjt:  ---AREIQDLVSSAEASK

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.1e-108 | 44.92
Query:  NAKEASERKEEVNRKIAS-------QKAISVILRREATKAVIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIV
        + K  +ER E++N  + S       +K +S ILR +A    IERK        L P+ VLEAL E I   RW+SALK+F LLR+Q WY P    Y KL  
Subjt:  NAKEASERKEEVNRKIAS-------QKAISVILRREATKAVIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIV

Query:  MLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTIT
        +LG CKQP++A  LF+ M+ EG + + + YT+L+S Y +S LLDKAF  L+ MK+  DC+PDV T+++LI  C ++  F+  ++++ +M   G+  ST+T
Subjt:  MLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTIT

Query:  YNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHY
        YNT ID YGKA MF EMES+L +M+ +    PDV T+NS + ++G+   +  ME  Y +FQ  G+QP+I TFNIL+ S+GK G Y+KM +VM++M+K  +
Subjt:  YNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHY

Query:  SWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMM
        S T VTYN+VI+ FG+AG +++M+ +FR M+ + +KP+ +T CSLV AY +AG   KIDSVL  + NSD++LDT FFNC+++AYG+ G  A MK++   M
Subjt:  SWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMM

Query:  EQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASKR
        E+R CKPDK T+ TM + Y+  G+ +  +E++  + S++  K+
Subjt:  EQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASKR

AT4G39620.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 9.7e-44 | 29.76
Query:  RWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAY----SRSGLLDKAFLLLDEMKNSPDCQPDVHTY
        +W   L+VF  +++Q WY P  G+Y KLI ++GK  Q   A  LF EM + GC      Y AL++A+     ++  L+K    LD+MK    CQP+V TY
Subjt:  RWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAY----SRSGLLDKAFLLLDEMKNSPDCQPDVHTY

Query:  SILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQ
        +IL+++  Q    ++   L  D+    + P   T+N  +DAYGK  M  EME++L  M S + CKPD                                 
Subjt:  SILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQ

Query:  PNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVE
          I TFN+L+DSYGK   +EKM    + + +     T+ T+N +I  +G+A  + + E++F+ M      PS +T   ++  YG  G   +   +   V 
Subjt:  PNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVE

Query:  NSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAE
         SD +L     N +++ Y R G + E  K+          PD +TY+ + +AY+   M     ++Q L+   E
Subjt:  NSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAE

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein | 4.6e-187 | 67.4
Query:  MAASTPTNPLPPAINHRNPKNPVFL--------RQQRASSSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKA
        ++ ST T+  PP   +R      F         R+   + +S  S++T   L E       ++   +N +++ E KE+ N KIAS+KAIS+ILRREATK+
Subjt:  MAASTPTNPLPPAINHRNPKNPVFL--------RQQRASSSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKA

Query:  VIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRS
        +IE+K+G   SKKLLPRTVLE+LHERITALRWESA++VFELLREQLWY+P  G+Y+KLIVMLGKCKQPEKA+ELFQEMI+EGC V+HE YTAL+SAYSRS
Subjt:  VIERKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRS

Query:  GLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNST
        G  D AF LL+ MK+S +CQPDVHTYSILIKS LQVFAF+K Q LLSDM  +GI+P+TITYNT IDAYGKAKMF EMES L++ML ED CKPD WTMNST
Subjt:  GLLDKAFLLLDEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNST

Query:  LRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCV
        LRAFG +GQ+E ME CYEKFQ +GI+PNI+TFNILLDSYGK+G+Y+KMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAG+LKQMEYLFRLM+SERI PSCV
Subjt:  LRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCV

Query:  TLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSS
        TLCSLV+AYG+A K +KI  VL  +ENSDI LD VFFNCLVDAYGRM  FAEMK VL +ME++G KPDK TYRTM +AY   GM  H +E+  +V S
Subjt:  TLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSS


Sequences
CDS sequence
ATGGCGGCATCCACCCCCACCAACCCTCTCCCGCCAGCAATCAACCATCGAAACCCTAAAAATCCCGTATTTCTTCGTCAACAACGAGCATCTTCTTCTTCTTCTTCTTC
TTCTTCTACAGATGGTAAGCTCATGGAGAGAAGTCCTCACGGAGAACGCATGGATGTGGCAAAGCTGAATGCGAAGGAAGCGAGCGAAAGAAAGGAGGAGGTCAACCGTA
AGATTGCTTCTCAGAAAGCCATTTCGGTGATTTTGCGCAGAGAAGCCACCAAGGCGGTCATTGAGAGGAAGAGAGGCCCCAATAATTCTAAGAAGCTGCTTCCGCGAACT
GTTCTTGAAGCTCTCCATGAGCGAATCACGGCCTTGCGATGGGAGTCTGCGCTCAAGGTTTTTGAACTACTTCGTGAACAATTGTGGTACAGACCTTATGCCGGGATGTA
CATTAAGCTAATTGTCATGCTTGGAAAATGTAAGCAACCCGAAAAGGCCTACGAGCTGTTCCAAGAAATGATTGATGAAGGATGTGAAGTTAGCCATGAGTCATACACCG
CTCTCTTGTCAGCGTATAGCAGGAGTGGCCTTCTTGACAAAGCATTTTTGCTCCTCGACGAGATGAAAAACAGTCCTGATTGCCAGCCTGATGTTCACACTTACTCTATC
CTCATAAAATCATGCTTGCAGGTGTTTGCATTCAACAAAGCACAAACTCTACTCTCTGACATGGTGACTCGAGGAATAAAACCCAGCACTATTACATATAATACCTTCAT
TGATGCTTATGGCAAAGCAAAAATGTTTGCTGAAATGGAGTCCATCCTGGTCGAAATGCTGAGTGAAGATGGTTGTAAGCCTGATGTTTGGACCATGAACTCGACGCTTC
GAGCTTTCGGCAGCAGTGGACAGTTAGAAACCATGGAGAAGTGTTATGAAAAGTTCCAGGGAGCTGGAATCCAACCAAACATTCAGACCTTCAACATCCTTCTGGATTCA
TACGGAAAGACTGGAAGTTATGAGAAAATGAGTGCTGTGATGGAGTATATGCAGAAGTACCATTACTCATGGACAATTGTAACCTACAACGTCGTTATCGACGCGTTTGG
GAGGGCTGGGAATTTGAAACAGATGGAGTATCTATTTAGACTTATGAGATCAGAGAGGATCAAACCGAGTTGTGTAACACTTTGCTCGCTCGTGAAGGCATATGGGCAAG
CGGGAAAGCACGAAAAAATCGACAGTGTACTGCACATAGTTGAGAATTCCGATATAATGTTGGATACCGTTTTTTTCAACTGTCTTGTGGATGCTTACGGGCGGATGGGA
TGTTTTGCAGAGATGAAGAAGGTGCTTGGGATGATGGAGCAGAGAGGATGCAAGCCTGATAAGACTACCTACAGGACCATGGCTAGAGCTTATTCAGATGGGGGAATGGC
TAACCATGCCAGGGAGATCCAAGATCTTGTAAGCTCTGCAGAAGCAAGTAAGAGAACTAGACCTGACTTATGA
mRNA sequence
TGATTATCCATCCACCTAAACGGTCCAAATGGTTATGAAGGGGGCTAGCGAGAAGAGTGAAGATTTTTGTTTCAGGAATGAGGATAAGAGTGGCCTGAACATCTCGGAGC
TTAGTATCCTCTATCCATGGCGGCATCCACCCCCACCAACCCTCTCCCGCCAGCAATCAACCATCGAAACCCTAAAAATCCCGTATTTCTTCGTCAACAACGAGCATCTT
CTTCTTCTTCTTCTTCTTCTTCTACAGATGGTAAGCTCATGGAGAGAAGTCCTCACGGAGAACGCATGGATGTGGCAAAGCTGAATGCGAAGGAAGCGAGCGAAAGAAAG
GAGGAGGTCAACCGTAAGATTGCTTCTCAGAAAGCCATTTCGGTGATTTTGCGCAGAGAAGCCACCAAGGCGGTCATTGAGAGGAAGAGAGGCCCCAATAATTCTAAGAA
GCTGCTTCCGCGAACTGTTCTTGAAGCTCTCCATGAGCGAATCACGGCCTTGCGATGGGAGTCTGCGCTCAAGGTTTTTGAACTACTTCGTGAACAATTGTGGTACAGAC
CTTATGCCGGGATGTACATTAAGCTAATTGTCATGCTTGGAAAATGTAAGCAACCCGAAAAGGCCTACGAGCTGTTCCAAGAAATGATTGATGAAGGATGTGAAGTTAGC
CATGAGTCATACACCGCTCTCTTGTCAGCGTATAGCAGGAGTGGCCTTCTTGACAAAGCATTTTTGCTCCTCGACGAGATGAAAAACAGTCCTGATTGCCAGCCTGATGT
TCACACTTACTCTATCCTCATAAAATCATGCTTGCAGGTGTTTGCATTCAACAAAGCACAAACTCTACTCTCTGACATGGTGACTCGAGGAATAAAACCCAGCACTATTA
CATATAATACCTTCATTGATGCTTATGGCAAAGCAAAAATGTTTGCTGAAATGGAGTCCATCCTGGTCGAAATGCTGAGTGAAGATGGTTGTAAGCCTGATGTTTGGACC
ATGAACTCGACGCTTCGAGCTTTCGGCAGCAGTGGACAGTTAGAAACCATGGAGAAGTGTTATGAAAAGTTCCAGGGAGCTGGAATCCAACCAAACATTCAGACCTTCAA
CATCCTTCTGGATTCATACGGAAAGACTGGAAGTTATGAGAAAATGAGTGCTGTGATGGAGTATATGCAGAAGTACCATTACTCATGGACAATTGTAACCTACAACGTCG
TTATCGACGCGTTTGGGAGGGCTGGGAATTTGAAACAGATGGAGTATCTATTTAGACTTATGAGATCAGAGAGGATCAAACCGAGTTGTGTAACACTTTGCTCGCTCGTG
AAGGCATATGGGCAAGCGGGAAAGCACGAAAAAATCGACAGTGTACTGCACATAGTTGAGAATTCCGATATAATGTTGGATACCGTTTTTTTCAACTGTCTTGTGGATGC
TTACGGGCGGATGGGATGTTTTGCAGAGATGAAGAAGGTGCTTGGGATGATGGAGCAGAGAGGATGCAAGCCTGATAAGACTACCTACAGGACCATGGCTAGAGCTTATT
CAGATGGGGGAATGGCTAACCATGCCAGGGAGATCCAAGATCTTGTAAGCTCTGCAGAAGCAAGTAAGAGAACTAGACCTGACTTATGATAGTAATTTTCTCAGTCCACT
TTTTTTTTTTTTAATTATTTATGAGATTTGCACCAATTTGTTTTTTGTAATTTCAAATCTTAGTATATGTAGTGATTTAAAAAGGAGGAATATTTCAACCGTATTGATAA
AATTATAATAAACAGTAAAAAGAATATAGAGAAATTTTGTAAAGGAAAGAAGAATAAAGAAAAAATATTTCCATGAGAGGTTGAGAGCTGAGGACTTGGATCAAGAAAGG
GTTAGCTTCATAAATGGGAAATAAGTGTTATATTGAACTAGAACTTTCGTACTTATGTCATTTTTAGTTCTTGCTAAAAAAATGTCATTTTTAGTTTGAGTTGTACTTGA
GGCTGCACCATCTATGGTTAATTCATTTTCCA
Protein sequence
MAASTPTNPLPPAINHRNPKNPVFLRQQRASSSSSSSSSTDGKLMERSPHGERMDVAKLNAKEASERKEEVNRKIASQKAISVILRREATKAVIERKRGPNNSKKLLPRT
VLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIDEGCEVSHESYTALLSAYSRSGLLDKAFLLLDEMKNSPDCQPDVHTYSI
LIKSCLQVFAFNKAQTLLSDMVTRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCKPDVWTMNSTLRAFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDS
YGKTGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVKAYGQAGKHEKIDSVLHIVENSDIMLDTVFFNCLVDAYGRMG
CFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLVSSAEASKRTRPDL