; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0015503 (gene) of Chayote v1 genome

Gene IDSed0015503
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG09:29335972..29344919
RNA-Seq ExpressionSed0015503
SyntenySed0015503
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144515.1 pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Momordica charantia]2.0e-23785.17Show/hide
Query:  SSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKLKLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVIEKKRGPN
        S+P+NPLP AINH NP+ P+ +++Q+ S SK  SPA  +  KLMER P   R D AKLK       +EEVNRKIAS+KAISVILRREAT AVIEKKRGP 
Subjt:  SSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKLKLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVIEKKRGPN

Query:  NSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLL
        NSKKLLP+TVLEALHER+SALRW+SALKVF+L REQLWYRPY GMYIKLIVMLGKCKQPEKA ELF +MIDEGCEV  ESYTALLSAYSRSGLLDKAF L
Subjt:  NSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLL

Query:  LDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQ
        LDEM+NSP  +PDVHTYSILIKSCLQVFAFKKAQ LLSDMV RGIKP+TITYNTFIDAYGKAKMFAEMESIL+EMLS+DGC PDVWTMNSTLRAFGSSGQ
Subjt:  LDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQ

Query:  LETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAY
        LETMEKCYEKF GAGIQPNIQTFNILL+SYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAY
Subjt:  LETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAY

Query:  GQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKKTRPDL
        GQAGKR+KI  VL+IVENSDITLDTVFFNCLVDAYG+MGCFAEMK V+GMMEQRGCKPDKTTYRT+ARAYSDGGM NHAREIQ LISS E SKKTRPDL
Subjt:  GQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKKTRPDL

XP_022954169.1 pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Cucurbita moschata]1.1e-23583.2Show/hide
Query:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-----KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVI
        A S+PS  LPPA++H NPK  + + QQRASH  L+SPAP         KL++RSP   R D AKLK       +EEVNRKIAS+KAISVILRREAT AVI
Subjt:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-----KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVI

Query:  EKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGL
        E+KRGP NSKKLLPRTVLEALHERI+ALRW+SALKVF+L REQLWYRPY GMYIKLIVMLGKCKQPEKAYELFQ+MI+EGCEV +ESYTALLSAYSRSGL
Subjt:  EKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLR
        LD+AF LL+EMKNSP  +PDVHTYSILIKSCLQVFAF KAQTLLSDMV+RGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGC PDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKF GAGIQPNIQTFNILL+SYGKA SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSK
        CSLV+AYGQAGKRDKI SVL IVENS+ITLDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DL+S+ E SK
Subjt:  CSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSK

Query:  KTRPDL
        +TRPDL
Subjt:  KTRPDL

XP_022991692.1 pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Cucurbita maxima]5.5e-23583Show/hide
Query:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-----KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVI
        A S+PS  LPPA++H NPK  + + QQRASH   +SPAP         KLM+RSP   R D AKLK       +EEVNRKIAS+KAISVILRREAT AVI
Subjt:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-----KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVI

Query:  EKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGL
        E+KRGP NSKKLLPRTVLEALHERI+ALRW+SALKVF+L REQLWYRPY GMYIKLIVMLGKCKQPEKAYELFQ+MI+EGCEV +ESYTALLSAYSRSGL
Subjt:  EKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLR
        LD+AF LL+EMKNSP  +PDVHTYSILIKSCLQVFAF KAQTLLSDMV+RGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGC PDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKF GAGIQPNIQTFNILL+SYGKA SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSK
        CSLV+AYGQAGKRDKI SVL IVENS+ITLDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DL+ + E SK
Subjt:  CSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSK

Query:  KTRPDL
        +TRPDL
Subjt:  KTRPDL

XP_023549189.1 pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Cucurbita pepo subsp. pepo]3.3e-23583Show/hide
Query:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-----KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVI
        A S+PS  LPPA++H NPK  + + QQRASH   +SPAP         KLM+RSP   R D AKLK       +EEVNRKIAS+KAISVILRREAT AVI
Subjt:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-----KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVI

Query:  EKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGL
        E+KRGP NSKKLLPRTVLEALHERI+ALRW+SALKVF+L REQLWYRPY GMYIKLIVMLGKCKQPEKAYELFQ+MI+EGCEV +ESYTALLSAYSRSGL
Subjt:  EKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLR
        LD+AF LL+EMKNSP  +PDVHTYSILIKSCLQVFAF +AQTLLSDMV+RGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGC PDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKF GAGIQPNIQTFNILL+SYGKA SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSK
        CSLV+AYGQAGKRDKI SVL IVENS+ITLDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DL+S+ E SK
Subjt:  CSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSK

Query:  KTRPDL
        +TRPDL
Subjt:  KTRPDL

XP_038877300.1 pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Benincasa hispida]4.2e-23582.68Show/hide
Query:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-------KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMA
        A S+P+NPLPPAINH NP   +  RQQRA+H K +SP+            KLMERSP   R D AKLK       +EEVNRKIAS+KAISVILRREAT A
Subjt:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-------KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMA

Query:  VIEKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRS
        VIE+KRGPNNSKKLLPRTVLEALHERI+ALRW+SALKVF+L REQLWYRPY GMYIKLIVMLGKCKQPEKAYELFQ+MI+EGCEV +ESYTALLSAYSRS
Subjt:  VIEKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRS

Query:  GLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNST
        GLLDKAF +L+EMKNSP  +PDVHTYSIL+KSCLQVFAF KAQTLLSDMV+RGIKP+TITYN FIDAYGKAKMFAEMESIL+EMLS+DGC PDVWTMNST
Subjt:  GLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNST

Query:  LRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCV
        LRAFGSSGQLETMEKCYEKF GAGIQPNIQTFNILL+SYGKA SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAG+LKQME+LFRLMRSERIKPSCV
Subjt:  LRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCV

Query:  TLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVET
        TLCSLVRAYGQAGK +KI S+L  VENSDI LDTVF+NCLVDAYGRM CFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMA HA+EIQ+LIS+ E 
Subjt:  TLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVET

Query:  SKKTRPDL
        SK+TRPDL
Subjt:  SKKTRPDL

TrEMBL top hitse value%identityAlignment
A0A1S3C6D0 pentatricopeptide repeat-containing protein At5g48730, chloroplastic3.4e-23080.95Show/hide
Query:  SNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL--------KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVIEK
        +NPLPP INH +P   + +R+Q+ ++ K +SPAP            KL++ SP   R D AKLK       +EEVNRKIAS+KAISVILRREAT AVIE+
Subjt:  SNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL--------KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVIEK

Query:  KRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLD
        KRGPNNSKKLLPRTVLEALH+RI+ALRW+SALKVF+L REQLWYRPY GMYIKLIVMLGKCKQPEKAY+LFQ+MI+EGCEV +ESYTALLSAYSRSGLLD
Subjt:  KRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLD

Query:  KAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAF
        KAF +L+EMKNSP  +PDVHTYSILIKSCLQVFAF KAQTLLSDMV++GIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGC PDVWTMNSTLRAF
Subjt:  KAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAF

Query:  GSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCS
        G SGQ+ETMEKCYEKF  AGIQPNIQTFNILL+SYGKA SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERIKPSCVTLCS
Subjt:  GSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCS

Query:  LVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKKT
        LV+AYGQAGK +KI SVL +VENSDI LDTVF+NCLVDAYGRM CFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHA+EIQ+LI++ E SK+T
Subjt:  LVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKKT

Query:  RPDL
        RPDL
Subjt:  RPDL

A0A5A7SJF3 Pentatricopeptide repeat-containing protein2.6e-23080.95Show/hide
Query:  SNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL--------KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVIEK
        +NPLPP INH +P   + +R+Q+ ++ K +SPAP            KL++ SP   R D AKLK       +EEVNRKIAS+KAISVILRREAT AVIE+
Subjt:  SNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL--------KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVIEK

Query:  KRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLD
        KRGPNNSKKLLPRTVLEALH+RI+ALRW+SALKVF+L REQLWYRPY GMYIKLIVMLGKCKQPEKAY+LFQ+MI+EGCEV +ESYTALLSAYSRSGLLD
Subjt:  KRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLD

Query:  KAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAF
        KAF +L+EMKNSP  +PDVHTYSILIKSCLQVFAF KAQTLLSDMV++GIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGC PDVWTMNSTLRAF
Subjt:  KAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAF

Query:  GSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCS
        G SGQ+ETMEKCYEKF  AGIQPNIQTFNILL+SYGKA SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERIKPSCVTLCS
Subjt:  GSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCS

Query:  LVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKKT
        LV+AYGQAGK +KI SVL +VENSDI LDTVF+NCLVDAYGRM CFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHA+EIQ+LI++ E SK+T
Subjt:  LVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKKT

Query:  RPDL
        RPDL
Subjt:  RPDL

A0A6J1CRU5 pentatricopeptide repeat-containing protein At5g48730, chloroplastic9.9e-23885.17Show/hide
Query:  SSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKLKLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVIEKKRGPN
        S+P+NPLP AINH NP+ P+ +++Q+ S SK  SPA  +  KLMER P   R D AKLK       +EEVNRKIAS+KAISVILRREAT AVIEKKRGP 
Subjt:  SSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKLKLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVIEKKRGPN

Query:  NSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLL
        NSKKLLP+TVLEALHER+SALRW+SALKVF+L REQLWYRPY GMYIKLIVMLGKCKQPEKA ELF +MIDEGCEV  ESYTALLSAYSRSGLLDKAF L
Subjt:  NSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLL

Query:  LDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQ
        LDEM+NSP  +PDVHTYSILIKSCLQVFAFKKAQ LLSDMV RGIKP+TITYNTFIDAYGKAKMFAEMESIL+EMLS+DGC PDVWTMNSTLRAFGSSGQ
Subjt:  LDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQ

Query:  LETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAY
        LETMEKCYEKF GAGIQPNIQTFNILL+SYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAY
Subjt:  LETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAY

Query:  GQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKKTRPDL
        GQAGKR+KI  VL+IVENSDITLDTVFFNCLVDAYG+MGCFAEMK V+GMMEQRGCKPDKTTYRT+ARAYSDGGM NHAREIQ LISS E SKKTRPDL
Subjt:  GQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKKTRPDL

A0A6J1GS58 pentatricopeptide repeat-containing protein At5g48730, chloroplastic5.4e-23683.2Show/hide
Query:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-----KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVI
        A S+PS  LPPA++H NPK  + + QQRASH  L+SPAP         KL++RSP   R D AKLK       +EEVNRKIAS+KAISVILRREAT AVI
Subjt:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-----KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVI

Query:  EKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGL
        E+KRGP NSKKLLPRTVLEALHERI+ALRW+SALKVF+L REQLWYRPY GMYIKLIVMLGKCKQPEKAYELFQ+MI+EGCEV +ESYTALLSAYSRSGL
Subjt:  EKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLR
        LD+AF LL+EMKNSP  +PDVHTYSILIKSCLQVFAF KAQTLLSDMV+RGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGC PDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKF GAGIQPNIQTFNILL+SYGKA SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSK
        CSLV+AYGQAGKRDKI SVL IVENS+ITLDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DL+S+ E SK
Subjt:  CSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSK

Query:  KTRPDL
        +TRPDL
Subjt:  KTRPDL

A0A6J1JMJ6 pentatricopeptide repeat-containing protein At5g48730, chloroplastic2.7e-23583Show/hide
Query:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-----KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVI
        A S+PS  LPPA++H NPK  + + QQRASH   +SPAP         KLM+RSP   R D AKLK       +EEVNRKIAS+KAISVILRREAT AVI
Subjt:  AASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKL-----KLMERSPGRERTDAAKLK------EREEVNRKIASRKAISVILRREATMAVI

Query:  EKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGL
        E+KRGP NSKKLLPRTVLEALHERI+ALRW+SALKVF+L REQLWYRPY GMYIKLIVMLGKCKQPEKAYELFQ+MI+EGCEV +ESYTALLSAYSRSGL
Subjt:  EKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGL

Query:  LDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLR
        LD+AF LL+EMKNSP  +PDVHTYSILIKSCLQVFAF KAQTLLSDMV+RGIKP+TITYNTFIDAYGKAKMFAEMESILVEMLS+DGC PDVWTMNSTLR
Subjt:  LDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLR

Query:  AFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL
         FGSSGQLETMEKCYEKF GAGIQPNIQTFNILL+SYGKA SYEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAGNLKQME+LFRLMRSERI+PSCVTL
Subjt:  AFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTL

Query:  CSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSK
        CSLV+AYGQAGKRDKI SVL IVENS+ITLDTVF+NCLVDAYGRM CFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANHAREI DL+ + E SK
Subjt:  CSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSK

Query:  KTRPDL
        +TRPDL
Subjt:  KTRPDL

SwissProt top hitse value%identityAlignment
Q9FKC3 Pentatricopeptide repeat-containing protein At5g48730, chloroplastic3.4e-17965.86Show/hide
Query:  ASSPSNPLPPAINHGNPKTPITVR-----QQRASHSKLKSPAPDDKLKLMERSPGRERTDAAKLKER------EEVNRKIASRKAISVILRREATMAVIE
        ++S S+  P   N    +   TVR      +  +++     + +  L L E    +   +A  + ER      E+ N KIASRKAIS+ILRREAT ++IE
Subjt:  ASSPSNPLPPAINHGNPKTPITVR-----QQRASHSKLKSPAPDDKLKLMERSPGRERTDAAKLKER------EEVNRKIASRKAISVILRREATMAVIE

Query:  KKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLL
        KK+G   SKKLLPRTVLE+LHERI+ALRW+SA++VF+L REQLWY+P VG+Y+KLIVMLGKCKQPEKA+ELFQ+MI+EGC V +E YTAL+SAYSRSG  
Subjt:  KKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLL

Query:  DKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRA
        D AF LL+ MK+S   +PDVHTYSILIKS LQVFAF K Q LLSDM  +GI+P+TITYNT IDAYGKAKMF EMES L++ML ED C PD WTMNSTLRA
Subjt:  DKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRA

Query:  FGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLC
        FG +GQ+E ME CYEKF  +GI+PNI+TFNILL+SYGK+G+Y+KMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAG+LKQMEYLFRLM+SERI PSCVTLC
Subjt:  FGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLC

Query:  SLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSV
        SLVRAYG+A K DKIG VL+ +ENSDI LD VFFNCLVDAYGRM  FAEMK VL +ME++G KPDK TYRTM +AY   GM  H +E+  ++ SV
Subjt:  SLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSV

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic1.0e-4226.33Show/hide
Query:  WDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLLLDEMKN---SP-----------
        W  +L++F   + Q+W +P   +Y  +I +LG+    +K  E+F +M  +G      SYTAL++AY R+G  + +  LLD MKN   SP           
Subjt:  WDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLLLDEMKN---SP-----------

Query:  ---------------------GREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWT
                             G +PD+ TY+ L+ +C       +A+ +   M   GI P   TY+  ++ +GK +   ++  +L EM S  G  PD+ +
Subjt:  ---------------------GREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWT

Query:  MNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIK
         N  L A+  SG ++     + +   AG  PN  T+++LLN +G++G Y+ +  +   M+  +      TYN++I+ FG  G  K++  LF  M  E I+
Subjt:  MNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIK

Query:  PSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREI
        P   T   ++ A G+ G  +    +LQ +  +DI   +  +  +++A+G+   + E       M + G  P   T+ ++  +++ GG+   +  I
Subjt:  PSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREI

Q9SCP4 Pentatricopeptide repeat-containing protein At3g531703.1e-10344.6Show/hide
Query:  ERTDAAKLKEREEVNRKIASRKAISVILRREATMAVIEKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKC
        E+ ++  +  R +V+ K    K +S ILR +A +  IE+K        L P+ VLEAL E I   RW SALK+F+L R+Q WY P    Y KL  +LG C
Subjt:  ERTDAAKLKEREEVNRKIASRKAISVILRREATMAVIEKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKC

Query:  KQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFI
        KQP++A  LF+ M+ EG +   + YT+L+S Y +S LLDKAF  L+ MK+    +PDV T+++LI  C ++  F   ++++ +M   G+  ST+TYNT I
Subjt:  KQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFI

Query:  DAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIV
        D YGKA MF EMES+L +M+ +    PDV T+NS + ++G+   +  ME  Y +F   G+QP+I TFNIL+ S+GKAG Y+KM +VM++M+K  +S T V
Subjt:  DAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIV

Query:  TYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGC
        TYN+VI+ FG+AG +++M+ +FR M+ + +KP+ +T CSLV AY +AG   KI SVL+ + NSD+ LDT FFNC+++AYG+ G  A MK++   ME+R C
Subjt:  TYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGC

Query:  KPDKTTYRTMARAYSDGGMANHAREIQ-DLISSVE
        KPDK T+ TM + Y+  G+ +  +E++  +ISS E
Subjt:  KPDKTTYRTMARAYSDGGMANHAREIQ-DLISSVE

Q9SQU6 Pentatricopeptide repeat-containing protein At3g06430, chloroplastic5.6e-8943.26Show/hide
Query:  IEKKRGP-NNSKKLLPR---------TVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYT
        + ++R P  N KK L R         TV E L + I+  +W  AL+VFD+ REQ +Y+P  G Y+KL+V+LGK  QP +A +LF +M++EG E   E YT
Subjt:  IEKKRGP-NNSKKLLPR---------TVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYT

Query:  ALLSAYSRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCN
        ALL+AY+RS L+D AF +LD+MK+ P  +PDV TYS L+K+C+    F    +L  +M  R I P+T+T N  +  YG+   F +ME +L +ML    C 
Subjt:  ALLSAYSRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCN

Query:  PDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMR
        PDVWTMN  L  FG+ G+++ ME  YEKF   GI+P  +TFNIL+ SYGK   Y+KMS+VMEYM+K  + WT  TYN +I+AF   G+ K ME  F  MR
Subjt:  PDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMR

Query:  SERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGM
        SE +K    T C L+  Y  AG   K+ S +Q+    +I  +T F+N ++ A  +     EM++V   M++R C  D  T+  M  AY   GM
Subjt:  SERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGM

Q9SV96 Pentatricopeptide repeat-containing protein At4g39620, chloroplastic1.2e-3827.82Show/hide
Query:  KLKEREEVNR--KIASRKAISVILRREATMAVIEKKRGPNNSKKLLPRTVLEALHERI-SALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQP
        KL ERE   R  ++  R  +S I  RE  +  ++K        K++       L E +  + +W   L+VF   ++Q WY P  G+Y KLI ++GK  Q 
Subjt:  KLKEREEVNR--KIASRKAISVILRREATMAVIEKKRGPNNSKKLLPRTVLEALHERI-SALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQP

Query:  EKAYELFQQMIDEGCEVRYESYTALLSAY----SRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTF
          A  LF +M + GC      Y AL++A+     ++  L+K    LD+MK     +P+V TY+IL+++  Q     +   L  D+    + P   T+N  
Subjt:  EKAYELFQQMIDEGCEVRYESYTALLSAY----SRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTF

Query:  IDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTI
        +DAYGK  M  EME++L  M S + C PD                                   I TFN+L++SYGK   +EKM    + + +     T+
Subjt:  IDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTI

Query:  VTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRG
         T+N +I  +G+A  + + E++F+ M      PS +T   ++  YG  G   +   + + V  SD  L     N +++ Y R G + E  K+        
Subjt:  VTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRG

Query:  CKPDKTTYRTMARAYSDGGMANHAREIQDLISSVE
          PD +TY+ + +AY+   M     ++Q L+  +E
Subjt:  CKPDKTTYRTMARAYSDGGMANHAREIQDLISSVE

Arabidopsis top hitse value%identityAlignment
AT1G74850.1 plastid transcriptionally active 27.4e-4426.33Show/hide
Query:  WDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLLLDEMKN---SP-----------
        W  +L++F   + Q+W +P   +Y  +I +LG+    +K  E+F +M  +G      SYTAL++AY R+G  + +  LLD MKN   SP           
Subjt:  WDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLLLDEMKN---SP-----------

Query:  ---------------------GREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWT
                             G +PD+ TY+ L+ +C       +A+ +   M   GI P   TY+  ++ +GK +   ++  +L EM S  G  PD+ +
Subjt:  ---------------------GREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWT

Query:  MNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIK
         N  L A+  SG ++     + +   AG  PN  T+++LLN +G++G Y+ +  +   M+  +      TYN++I+ FG  G  K++  LF  M  E I+
Subjt:  MNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIK

Query:  PSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREI
        P   T   ++ A G+ G  +    +LQ +  +DI   +  +  +++A+G+   + E       M + G  P   T+ ++  +++ GG+   +  I
Subjt:  PSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREI

AT3G06430.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.0e-9043.26Show/hide
Query:  IEKKRGP-NNSKKLLPR---------TVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYT
        + ++R P  N KK L R         TV E L + I+  +W  AL+VFD+ REQ +Y+P  G Y+KL+V+LGK  QP +A +LF +M++EG E   E YT
Subjt:  IEKKRGP-NNSKKLLPR---------TVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYT

Query:  ALLSAYSRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCN
        ALL+AY+RS L+D AF +LD+MK+ P  +PDV TYS L+K+C+    F    +L  +M  R I P+T+T N  +  YG+   F +ME +L +ML    C 
Subjt:  ALLSAYSRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCN

Query:  PDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMR
        PDVWTMN  L  FG+ G+++ ME  YEKF   GI+P  +TFNIL+ SYGK   Y+KMS+VMEYM+K  + WT  TYN +I+AF   G+ K ME  F  MR
Subjt:  PDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMR

Query:  SERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGM
        SE +K    T C L+  Y  AG   K+ S +Q+    +I  +T F+N ++ A  +     EM++V   M++R C  D  T+  M  AY   GM
Subjt:  SERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGM

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.7e-10544.06Show/hide
Query:  ERTDAAKLKEREEVNRKIASRKAISVILRREATMAVIEKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKC
        E+ ++  +  R +V+ K    K +S ILR +A +  IE+K        L P+ VLEAL E I   RW SALK+F+L R+Q WY P    Y KL  +LG C
Subjt:  ERTDAAKLKEREEVNRKIASRKAISVILRREATMAVIEKKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKC

Query:  KQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFI
        KQP++A  LF+ M+ EG +   + YT+L+S Y +S LLDKAF  L+ MK+    +PDV T+++LI  C ++  F   ++++ +M   G+  ST+TYNT I
Subjt:  KQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFI

Query:  DAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIV
        D YGKA MF EMES+L +M+ +    PDV T+NS + ++G+   +  ME  Y +F   G+QP+I TFNIL+ S+GKAG Y+KM +VM++M+K  +S T V
Subjt:  DAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIV

Query:  TYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGC
        TYN+VI+ FG+AG +++M+ +FR M+ + +KP+ +T CSLV AY +AG   KI SVL+ + NSD+ LDT FFNC+++AYG+ G  A MK++   ME+R C
Subjt:  TYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGC

Query:  KPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKK
        KPDK T+ TM + Y+  G+ +  +E++  + S +  KK
Subjt:  KPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKK

AT4G39620.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.5e-4027.82Show/hide
Query:  KLKEREEVNR--KIASRKAISVILRREATMAVIEKKRGPNNSKKLLPRTVLEALHERI-SALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQP
        KL ERE   R  ++  R  +S I  RE  +  ++K        K++       L E +  + +W   L+VF   ++Q WY P  G+Y KLI ++GK  Q 
Subjt:  KLKEREEVNR--KIASRKAISVILRREATMAVIEKKRGPNNSKKLLPRTVLEALHERI-SALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQP

Query:  EKAYELFQQMIDEGCEVRYESYTALLSAY----SRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTF
          A  LF +M + GC      Y AL++A+     ++  L+K    LD+MK     +P+V TY+IL+++  Q     +   L  D+    + P   T+N  
Subjt:  EKAYELFQQMIDEGCEVRYESYTALLSAY----SRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTF

Query:  IDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTI
        +DAYGK  M  EME++L  M S + C PD                                   I TFN+L++SYGK   +EKM    + + +     T+
Subjt:  IDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTI

Query:  VTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRG
         T+N +I  +G+A  + + E++F+ M      PS +T   ++  YG  G   +   + + V  SD  L     N +++ Y R G + E  K+        
Subjt:  VTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRG

Query:  CKPDKTTYRTMARAYSDGGMANHAREIQDLISSVE
          PD +TY+ + +AY+   M     ++Q L+  +E
Subjt:  CKPDKTTYRTMARAYSDGGMANHAREIQDLISSVE

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-18065.86Show/hide
Query:  ASSPSNPLPPAINHGNPKTPITVR-----QQRASHSKLKSPAPDDKLKLMERSPGRERTDAAKLKER------EEVNRKIASRKAISVILRREATMAVIE
        ++S S+  P   N    +   TVR      +  +++     + +  L L E    +   +A  + ER      E+ N KIASRKAIS+ILRREAT ++IE
Subjt:  ASSPSNPLPPAINHGNPKTPITVR-----QQRASHSKLKSPAPDDKLKLMERSPGRERTDAAKLKER------EEVNRKIASRKAISVILRREATMAVIE

Query:  KKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLL
        KK+G   SKKLLPRTVLE+LHERI+ALRW+SA++VF+L REQLWY+P VG+Y+KLIVMLGKCKQPEKA+ELFQ+MI+EGC V +E YTAL+SAYSRSG  
Subjt:  KKRGPNNSKKLLPRTVLEALHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLL

Query:  DKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRA
        D AF LL+ MK+S   +PDVHTYSILIKS LQVFAF K Q LLSDM  +GI+P+TITYNT IDAYGKAKMF EMES L++ML ED C PD WTMNSTLRA
Subjt:  DKAFLLLDEMKNSPGREPDVHTYSILIKSCLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRA

Query:  FGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLC
        FG +GQ+E ME CYEKF  +GI+PNI+TFNILL+SYGK+G+Y+KMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAG+LKQMEYLFRLM+SERI PSCVTLC
Subjt:  FGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKAGSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLC

Query:  SLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSV
        SLVRAYG+A K DKIG VL+ +ENSDI LD VFFNCLVDAYGRM  FAEMK VL +ME++G KPDK TYRTM +AY   GM  H +E+  ++ SV
Subjt:  SLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCCTCGAGCCCGAGCAATCCTCTCCCACCAGCAATCAATCATGGAAACCCTAAGACTCCCATAACTGTCCGTCAACAACGAGCGAGCCATTCGAAGCTGAAGTC
CCCTGCCCCAGATGACAAGCTGAAGCTGATGGAGAGAAGCCCTGGGAGAGAACGCACGGACGCGGCAAAGCTGAAGGAAAGGGAAGAGGTGAACAGGAAGATTGCTTCTC
GAAAGGCCATTTCGGTGATTTTGCGCAGAGAAGCCACCATGGCCGTCATTGAGAAGAAGCGAGGCCCTAATAACTCTAAGAAGCTGCTTCCCCGCACTGTTCTTGAAGCT
CTCCATGAACGGATCTCGGCCTTGCGATGGGACTCCGCGCTCAAGGTTTTTGACCTAGCTCGTGAACAATTGTGGTACAGACCTTACGTCGGGATGTACATTAAGCTGAT
TGTAATGCTTGGAAAATGTAAGCAACCGGAAAAGGCCTATGAGCTATTCCAACAAATGATTGATGAAGGATGTGAAGTTCGCTATGAGTCATACACTGCTCTCTTGTCGG
CCTATAGCAGGAGCGGTCTTCTTGACAAAGCATTTTTGCTCCTCGACGAGATGAAAAACAGTCCTGGTCGAGAGCCTGATGTTCACACTTACTCTATCCTCATAAAATCA
TGCTTGCAGGTGTTTGCATTCAAAAAAGCACAAACTCTGCTCTCTGACATGGTGTCTCGAGGAATAAAACCCAGCACTATTACATATAATACCTTCATTGATGCGTATGG
CAAAGCAAAAATGTTTGCTGAAATGGAGTCCATCCTGGTGGAAATGCTGAGTGAAGATGGTTGTAACCCCGATGTTTGGACCATGAACTCGACGCTTCGAGCTTTTGGCA
GCAGTGGACAATTAGAGACCATGGAGAAGTGTTATGAAAAGTTCCTCGGAGCTGGAATCCAACCAAACATTCAGACTTTCAACATCCTTCTGAATTCATATGGCAAGGCT
GGAAGTTATGAGAAAATGAGCGCTGTGATGGAGTACATGCAGAAGTACCATTATTCATGGACAATTGTAACCTATAACGTCGTAATCGACGCGTTCGGGAGGGCTGGGAA
TTTGAAACAGATGGAGTATCTATTTAGACTTATGAGATCAGAGAGGATCAAACCGAGTTGCGTAACACTTTGCTCACTCGTGAGGGCATATGGGCAAGCAGGAAAACGCG
ACAAAATCGGCAGTGTCCTGCAGATAGTTGAGAATTCGGATATAACACTGGATACCGTTTTTTTCAACTGTCTTGTGGACGCTTACGGGCGGATGGGATGTTTTGCAGAG
ATGAAGAAGGTGCTCGGGATGATGGAGCAGAGAGGATGCAAGCCTGATAAGACTACCTACAGGACCATGGCTAGAGCTTACTCAGATGGGGGGATGGCTAATCATGCAAG
GGAAATCCAAGATCTTATAAGCTCTGTAGAAACAAGTAAGAAAACTCGACCCGACTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCCTCGAGCCCGAGCAATCCTCTCCCACCAGCAATCAATCATGGAAACCCTAAGACTCCCATAACTGTCCGTCAACAACGAGCGAGCCATTCGAAGCTGAAGTC
CCCTGCCCCAGATGACAAGCTGAAGCTGATGGAGAGAAGCCCTGGGAGAGAACGCACGGACGCGGCAAAGCTGAAGGAAAGGGAAGAGGTGAACAGGAAGATTGCTTCTC
GAAAGGCCATTTCGGTGATTTTGCGCAGAGAAGCCACCATGGCCGTCATTGAGAAGAAGCGAGGCCCTAATAACTCTAAGAAGCTGCTTCCCCGCACTGTTCTTGAAGCT
CTCCATGAACGGATCTCGGCCTTGCGATGGGACTCCGCGCTCAAGGTTTTTGACCTAGCTCGTGAACAATTGTGGTACAGACCTTACGTCGGGATGTACATTAAGCTGAT
TGTAATGCTTGGAAAATGTAAGCAACCGGAAAAGGCCTATGAGCTATTCCAACAAATGATTGATGAAGGATGTGAAGTTCGCTATGAGTCATACACTGCTCTCTTGTCGG
CCTATAGCAGGAGCGGTCTTCTTGACAAAGCATTTTTGCTCCTCGACGAGATGAAAAACAGTCCTGGTCGAGAGCCTGATGTTCACACTTACTCTATCCTCATAAAATCA
TGCTTGCAGGTGTTTGCATTCAAAAAAGCACAAACTCTGCTCTCTGACATGGTGTCTCGAGGAATAAAACCCAGCACTATTACATATAATACCTTCATTGATGCGTATGG
CAAAGCAAAAATGTTTGCTGAAATGGAGTCCATCCTGGTGGAAATGCTGAGTGAAGATGGTTGTAACCCCGATGTTTGGACCATGAACTCGACGCTTCGAGCTTTTGGCA
GCAGTGGACAATTAGAGACCATGGAGAAGTGTTATGAAAAGTTCCTCGGAGCTGGAATCCAACCAAACATTCAGACTTTCAACATCCTTCTGAATTCATATGGCAAGGCT
GGAAGTTATGAGAAAATGAGCGCTGTGATGGAGTACATGCAGAAGTACCATTATTCATGGACAATTGTAACCTATAACGTCGTAATCGACGCGTTCGGGAGGGCTGGGAA
TTTGAAACAGATGGAGTATCTATTTAGACTTATGAGATCAGAGAGGATCAAACCGAGTTGCGTAACACTTTGCTCACTCGTGAGGGCATATGGGCAAGCAGGAAAACGCG
ACAAAATCGGCAGTGTCCTGCAGATAGTTGAGAATTCGGATATAACACTGGATACCGTTTTTTTCAACTGTCTTGTGGACGCTTACGGGCGGATGGGATGTTTTGCAGAG
ATGAAGAAGGTGCTCGGGATGATGGAGCAGAGAGGATGCAAGCCTGATAAGACTACCTACAGGACCATGGCTAGAGCTTACTCAGATGGGGGGATGGCTAATCATGCAAG
GGAAATCCAAGATCTTATAAGCTCTGTAGAAACAAGTAAGAAAACTCGACCCGACTTGTGATGATAATTTTCTCTGTCCACATCTTTTTTTAAAAAAATGCTGTTTTGCA
CCTTATTGTTAATTTGCAATTACAAATCTATGTAGGGTTTAAAGGAGGAATATTTCAATTGTATTGATAAAATTATAGAATAATCTAGCTCTAGATTTGTAAAATAAAAG
AAAAATAGCAAGGGTAA
Protein sequenceShow/hide protein sequence
MAASSPSNPLPPAINHGNPKTPITVRQQRASHSKLKSPAPDDKLKLMERSPGRERTDAAKLKEREEVNRKIASRKAISVILRREATMAVIEKKRGPNNSKKLLPRTVLEA
LHERISALRWDSALKVFDLAREQLWYRPYVGMYIKLIVMLGKCKQPEKAYELFQQMIDEGCEVRYESYTALLSAYSRSGLLDKAFLLLDEMKNSPGREPDVHTYSILIKS
CLQVFAFKKAQTLLSDMVSRGIKPSTITYNTFIDAYGKAKMFAEMESILVEMLSEDGCNPDVWTMNSTLRAFGSSGQLETMEKCYEKFLGAGIQPNIQTFNILLNSYGKA
GSYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGNLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAGKRDKIGSVLQIVENSDITLDTVFFNCLVDAYGRMGCFAE
MKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANHAREIQDLISSVETSKKTRPDL