; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS025469 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS025469
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionpentatricopeptide repeat-containing protein At2g15820, chloroplastic-like
Genome locationscaffold401:139395..142310
RNA-Seq ExpressionMS025469
SyntenyMS025469
Gene Ontology termsGO:0008380 - RNA splicing (biological process)
GO:0004519 - endonuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR004860 - Homing endonuclease, LAGLIDADG
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR027434 - Homing endonuclease


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607381.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]4.4e-21053.76Show/hide
Query:  NPISVISMSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHF
        N  S  SMSI TSAFAT+TL  SLT   SQ H HFR            T+SAK R +LPRIPAFAS S VE L+YD+DSP++SEE  CSPYS  AEGF  
Subjt:  NPISVISMSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHF

Query:  ENSYASADLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYAL
            ASADLKHLG PALEVKELDEL EQWRRSKLAWLCKELPA KPGTL+RLLNAQRKWM+QDDAAY+IVHCLRIRENETAFRVYKW MQQHWYRFDYAL
Subjt:  ENSYASADLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYAL

Query:  ATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFI
        ATKLADYMGKERKFSKCREVFDDIINQGCVPSEST+                  ++      +QLGGY+PRLSLHNSLF+ALVSKPGD SKH+LKQAEFI
Subjt:  ATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFI

Query:  YHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGC
        YHN+ TTGLELHKDIYGGLIWLHSYQDT+DKERI+SL KEM  AGIEEERE                                   AFVYKMEVYAKVG 
Subjt:  YHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGC

Query:  PMKALEIFREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN--------------------------------------
        PMKA EIFREMEQL   SA AYQ IIGILCK +E+ LAES+ME FIKSNLKPL PAYVDLMN                                      
Subjt:  PMKALEIFREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN--------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGG
                                                                             QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGG
Subjt:  ---------------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGG

Query:  SRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE
         R+SSGD +LKLKGS EGV K+VKSL+ KSM+CKVKRKGRVYWIGLLGSNA WFWKL EPFILDDLKDSLQAD++++EKA+ ET  I FDSQSDSDEE
Subjt:  SRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE

XP_022158727.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Momordica charantia]2.0e-22669.71Show/hide
Query:  TLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPALE
        TLF SLTHSL   HRHFR            TFSAKRRPKLPRI AFAS S V QLLYD+DSPSDSEEHSCSPYSN A+GFHFENS+ASADLKHLGNPALE
Subjt:  TLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPALE

Query:  VKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCR
        VKELDEL EQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCR
Subjt:  VKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCR

Query:  EVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSY---VQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTTGLELHKDI
        EVFDDIINQG VPSEST   F   +  Y    +       +  Y   +QLGGYKPRLSLHNSLFRALVSKPGD SKHYLKQ EFIYHNLVTTGLELHKDI
Subjt:  EVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSY---VQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTTGLELHKDI

Query:  YGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEIFREMEQLK
        YGGLIWLHSYQDTIDKERIVSL KEML AGI EERE                                   AFVYKMEVYAKVG PMK LEIF EM+ L 
Subjt:  YGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEIFREMEQLK

Query:  SA-----SATAYQRIIGILCKFQEIE-----------------------LAESIMEDF---------IKSNLKPLVPAYVDL--------MNQFWPRGHP
         A     S    + ++G+L    EIE                       L   I E +         +  + K +   +  +         +QFWPRGHP
Subjt:  SA-----SATAYQRIIGILCKFQEIE-----------------------LAESIMEDF---------IKSNLKPLVPAYVDL--------MNQFWPRGHP

Query:  AIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDL
        AIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSM CKVKRKGRVYWIGLLGSNA WFWKLTEPFILDD KDSLQADNVDL
Subjt:  AIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDL

Query:  EKASTETEFIKFDSQSDSDEEVFD
        E+ASTETE I FDSQSD DEEV D
Subjt:  EKASTETEFIKFDSQSDSDEEVFD

XP_022949171.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata]4.0e-21154.36Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAFAT+TL  SLT   SQ H HFR            T+SAK R +LPRIPAFAS S VE L+YD+DSP++SEE  CSPYS  AEGF      ASA
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDEL EQWRRSKLAWLCKELPA KPGTL+RLLNAQRKWM+QDDAAYLIVHCLRIRENETAFRVYKW MQQHWYRFDYALATKLADY
Subjt:  DLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSEST+                  S+      +QLGGY+PRLSLHNSLF+ALVSKPGD SKH+LKQAEFIYHNL TT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEI
        GLELHKDIYGGLIWLHSYQDT+DKERI+SL KEM  AGIEEERE                                   AFVYKMEVYAKVG PMKA EI
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEI

Query:  FREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN---------------------------------------------
        FREMEQL S SA AYQ IIGILCKF+E+ LAES+ME FIKSNLKPL PAYVDLMN                                             
Subjt:  FREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGD
                                                                      QFWPRGHP IPNLIHRWLSPRVLAYWYMYGG R+SSGD
Subjt:  --------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGD

Query:  ILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE
         +LKLKGS EGV K+VKSL+ KSM+CKVKRKGRVYWIGLLGSNA WFWKL EPFILDDLKDSLQAD++++EKA+ ET  I FDSQSDSDEE
Subjt:  ILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE

XP_022998786.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima]4.7e-21254.74Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAFAT+TL  SLT   SQ H HFR            T+SAK R +LPRIPAFAS S VE L+YD+DSP++SEE  CSPYSN AE F      ASA
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDEL EQWRRSKLAWLCKELPAHKPGTL+RLLNAQRKWM+QDDAAYLIVHCLRIRENETAFRVYKW MQQHWYRFDYALATKLADY
Subjt:  DLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSEST+                  ++      +QLGGY PRLSLHNSLF+ALVSKPGD SKH+LKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEI
        GLELHKDIYGGLIWLHSYQDT+DKERI+SL KEM  AGIEEERE                                   AFVYKMEVYAKVG PMKALEI
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEI

Query:  FREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN---------------------------------------------
        FREMEQL S S+ AYQ IIGILCKF+E+ LAES+M  FIKSNLKPL PAYVDLMN                                             
Subjt:  FREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGD
                                                                      QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGG R+SSGD
Subjt:  --------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGD

Query:  ILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE
         +LKLKGS EGV K+VKSL+ KSM+CKVKRKGRVYWIGLLGSNA WFWKL EPFILDDLKDSLQADN++LEKA  ET  I FDSQSDSDEE
Subjt:  ILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE

XP_023521219.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo]3.5e-20753.6Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAFAT+TL  SLT S    H HFR            T+SAK R +LPRIPAFAS S VE L++D+DSP++SEE  CSPYS  AEGF      ASA
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDEL EQWRRSKLAWLCKELPAH PGTL+RLLNAQRKWM+QDDAAY+IVHCLRIRENETAFRVYKW MQQHWYRFDYALATKLADY
Subjt:  DLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSEST+                  ++      +QLGGY+PRLSLHNSLF+AL+SKPGD SKH+LKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEI
        GLELHKDIY GLIWLHSYQDT+DKERI+SL KEM  AGIEEERE                                   AFVYKMEVYAKVG PMKA EI
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEI

Query:  FREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN---------------------------------------------
        FREMEQL S SA AYQ IIGILCK +E+ LAES+ME FIKSNLKPL PAYVDLMN                                             
Subjt:  FREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGD
                                                                      QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGG R+SSGD
Subjt:  --------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGD

Query:  ILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE
         +LKLKGS EGV K+VKSL  KSM+CKVKRKGRVYWIGLLGSNA WFWKL EPFILDDLKD LQAD++++EKA  ET  I FDSQSDSDEE
Subjt:  ILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE

TrEMBL top hitse value%identityAlignment
A0A0A0LBL0 LAGLIDADG_2 domain-containing protein4.0e-20151.89Show/hide
Query:  VISMSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSY
        V SMSI TSAF+T+T   SLT SLS  H +F              +S K R +LPRI AFAS SFV+QL+YD DSPS+SEEH  S +SN  +GFHFEN +
Subjt:  VISMSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSY

Query:  ASADLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKL
        AS DLKHLG P LEVKELDEL EQWRRSK+AWLCKELPA KPGT++RLLNAQ+KWM QDDA YLIVHCLRIRENETAFRVYKW MQQHWYRFDYAL+TKL
Subjt:  ASADLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKL

Query:  ADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNL
        ADYMGKERKFSKCREVFDDIINQGCVPSEST+                  ++      +QLGGY+PRLSLH+SLFRALVSKPGD SKH+LKQAEFIYHNL
Subjt:  ADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNL

Query:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKA
        VT+GLELHKD+YGGLIWLHSYQDTID+ERIVSL KEM  AGI+EERE                                   AFVYKMEVYAK+G PMKA
Subjt:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKA

Query:  LEIFREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN------------------------------------------
        LEIFREMEQL S +A AYQ IIGILCKFQ IELAESIM  FI+SNLKPL PAYVDLMN                                          
Subjt:  LEIFREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLS
                                                                         QFWPRG  AIPNLIHRWLSPRVLAYWYMYGG R S
Subjt:  -----------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLS

Query:  SGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE
        SGDILLKLKGSHEGVEK+VKSL+ KS++CKVKRKG +YWIGLLGSNA WFWKL EPFILD LK+S QAD+++L      +E I FDS+SDS EE
Subjt:  SGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE

A0A1S3CPK0 pentatricopeptide repeat-containing protein At2g15820, chloroplastic1.3e-20452.9Show/hide
Query:  VISMSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSY
        V SMSI TSAF+T+TL  SLT SLS  H +F             ++S K R +LPRI AFAS SFV+QL+YD+DSPS+SEEH  SPYSN  +GFHFEN +
Subjt:  VISMSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSY

Query:  ASADLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKL
        AS DLKHLG PALEVKELDEL EQWRRSKLAWLCKELPA KPGT++RLLNAQRKWM QDDA YL VHCLRIRENETAFRVYKW MQQHWYRFDYAL+TKL
Subjt:  ASADLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKL

Query:  ADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNL
        ADYMGKERKFSKCREVFDDIINQGCVPSEST+                  ++      +QLGGY+PRLSLH+SLFRAL+SKPGD SKH+LKQAEFIYHNL
Subjt:  ADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNL

Query:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKA
        VT+GLELHKDIYGGLIWLHSYQDTIDKERIVSL KEM  AGI+EE+E                                   AFVYKMEVYAK+G PMKA
Subjt:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKA

Query:  LEIFREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN------------------------------------------
        LEIFREMEQL S +A AYQ IIGILCKFQEIELAESIM  FI+SNLKPL PAYVD+MN                                          
Subjt:  LEIFREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLS
                                                                         QFWPRG   IPNLIHRWLSPR LAYWYMYGG R S
Subjt:  -----------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLS

Query:  SGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE
        SGDILLKLKGSHEGVEK+VKSL+ KSM+CKVKRKG +YWIGLLGSNA WFWKL EPFILDDLK+S QAD+++L     ETE I FDSQSDS EE
Subjt:  SGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE

A0A6J1DXY9 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like9.5e-22769.71Show/hide
Query:  TLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPALE
        TLF SLTHSL   HRHFR            TFSAKRRPKLPRI AFAS S V QLLYD+DSPSDSEEHSCSPYSN A+GFHFENS+ASADLKHLGNPALE
Subjt:  TLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPALE

Query:  VKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCR
        VKELDEL EQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCR
Subjt:  VKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCR

Query:  EVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSY---VQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTTGLELHKDI
        EVFDDIINQG VPSEST   F   +  Y    +       +  Y   +QLGGYKPRLSLHNSLFRALVSKPGD SKHYLKQ EFIYHNLVTTGLELHKDI
Subjt:  EVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSY---VQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTTGLELHKDI

Query:  YGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEIFREMEQLK
        YGGLIWLHSYQDTIDKERIVSL KEML AGI EERE                                   AFVYKMEVYAKVG PMK LEIF EM+ L 
Subjt:  YGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEIFREMEQLK

Query:  SA-----SATAYQRIIGILCKFQEIE-----------------------LAESIMEDF---------IKSNLKPLVPAYVDL--------MNQFWPRGHP
         A     S    + ++G+L    EIE                       L   I E +         +  + K +   +  +         +QFWPRGHP
Subjt:  SA-----SATAYQRIIGILCKFQEIE-----------------------LAESIMEDF---------IKSNLKPLVPAYVDL--------MNQFWPRGHP

Query:  AIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDL
        AIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSM CKVKRKGRVYWIGLLGSNA WFWKLTEPFILDD KDSLQADNVDL
Subjt:  AIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDL

Query:  EKASTETEFIKFDSQSDSDEEVFD
        E+ASTETE I FDSQSD DEEV D
Subjt:  EKASTETEFIKFDSQSDSDEEVFD

A0A6J1GB98 pentatricopeptide repeat-containing protein At2g15820, chloroplastic1.9e-21154.36Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAFAT+TL  SLT   SQ H HFR            T+SAK R +LPRIPAFAS S VE L+YD+DSP++SEE  CSPYS  AEGF      ASA
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDEL EQWRRSKLAWLCKELPA KPGTL+RLLNAQRKWM+QDDAAYLIVHCLRIRENETAFRVYKW MQQHWYRFDYALATKLADY
Subjt:  DLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSEST+                  S+      +QLGGY+PRLSLHNSLF+ALVSKPGD SKH+LKQAEFIYHNL TT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEI
        GLELHKDIYGGLIWLHSYQDT+DKERI+SL KEM  AGIEEERE                                   AFVYKMEVYAKVG PMKA EI
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEI

Query:  FREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN---------------------------------------------
        FREMEQL S SA AYQ IIGILCKF+E+ LAES+ME FIKSNLKPL PAYVDLMN                                             
Subjt:  FREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGD
                                                                      QFWPRGHP IPNLIHRWLSPRVLAYWYMYGG R+SSGD
Subjt:  --------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGD

Query:  ILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE
         +LKLKGS EGV K+VKSL+ KSM+CKVKRKGRVYWIGLLGSNA WFWKL EPFILDDLKDSLQAD++++EKA+ ET  I FDSQSDSDEE
Subjt:  ILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE

A0A6J1KB64 pentatricopeptide repeat-containing protein At2g15820, chloroplastic2.3e-21254.74Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAFAT+TL  SLT   SQ H HFR            T+SAK R +LPRIPAFAS S VE L+YD+DSP++SEE  CSPYSN AE F      ASA
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDEL EQWRRSKLAWLCKELPAHKPGTL+RLLNAQRKWM+QDDAAYLIVHCLRIRENETAFRVYKW MQQHWYRFDYALATKLADY
Subjt:  DLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSEST+                  ++      +QLGGY PRLSLHNSLF+ALVSKPGD SKH+LKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEI
        GLELHKDIYGGLIWLHSYQDT+DKERI+SL KEM  AGIEEERE                                   AFVYKMEVYAKVG PMKALEI
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------AFVYKMEVYAKVGCPMKALEI

Query:  FREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN---------------------------------------------
        FREMEQL S S+ AYQ IIGILCKF+E+ LAES+M  FIKSNLKPL PAYVDLMN                                             
Subjt:  FREMEQLKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMN---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGD
                                                                      QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGG R+SSGD
Subjt:  --------------------------------------------------------------QFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGD

Query:  ILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE
         +LKLKGS EGV K+VKSL+ KSM+CKVKRKGRVYWIGLLGSNA WFWKL EPFILDDLKDSLQADN++LEKA  ET  I FDSQSDSDEE
Subjt:  ILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEE

SwissProt top hitse value%identityAlignment
Q6ZHJ5 Pentatricopeptide repeat-containing protein OTP51, chloroplastic4.1e-11033.56Show/hide
Query:  PRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKH-LGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQR
        P IPA AS   +E L+ D D   + E+           G     ++A+AD +  + +P L V EL+EL EQWRRS++AWLCKELPA+K  T  R+LNAQR
Subjt:  PRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKH-LGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQR

Query:  KWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNH
        KW+ QDDA Y+ VHCLRIR N+ AFRVY W ++QHW+RF++ALAT++AD +G++ K  KCREVF+ ++ QG VP+EST+             R    +  
Subjt:  KWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNH

Query:  HLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEE---------
             +Q+GGYKPRLSLHNSLFRALVSK G  +K+ LKQAEF+YHN+VTT L++HKD+Y GLIWLHSYQD ID+ERI++L KEM  AG +E         
Subjt:  HLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEE---------

Query:  --------------------------EREAFVYKMEVYAKVGCPMKALEIFREMEQLK-SASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPA
                                    +A+V +ME YA+ G PMK+L++F+EM+      +  +Y +II I+ K  E+++ E +M +FI+S++K L+PA
Subjt:  --------------------------EREAFVYKMEVYAKVGCPMKALEIFREMEQLK-SASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPA

Query:  YVDLM-----------------------------------------------------------------------------------------------
        ++DLM                                                                                               
Subjt:  YVDLM-----------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------NQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSH-EGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWF
                     +QF+ +G P +P LIHRWL+PRVLAYW+M+GGS+L SGDI+LKL G + EGVE++V SL  +S+  KVKRKGR +WIG  GSNA  F
Subjt:  -------------NQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSH-EGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWF

Query:  WKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEEV
        W++ EP +L++    +  +   +    T+      D+ +DSD+++
Subjt:  WKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEEV

Q9FVX2 Pentatricopeptide repeat-containing protein At1g77360, mitochondrial1.0e-0420.54Show/hide
Query:  LADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHN
        L +  GKE    K REVF ++I+ GC P   TY      + +  C          +   +     KP       ++  LV   G  +++ L++A   +  
Subjt:  LADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHN

Query:  LVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEEREAFVYKMEVYAKVGCPMKALEIFREMEQLKSASATAYQRIIGILCKFQEIELA
        +  +G++    ++  LI      + +  + +  ++KEM   G+    ++    +    + G   +A ++FR+M ++    A  Y  +I + C+ +E+E A
Subjt:  LVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEEREAFVYKMEVYAKVGCPMKALEIFREMEQLKSASATAYQRIIGILCKFQEIELA

Query:  ESIMEDFIKSNLKPLVPAYVDLMN
        + + +   K  + P +  +  L+N
Subjt:  ESIMEDFIKSNLKPLVPAYVDLMN

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic6.5e-0719.76Show/hide
Query:  LGNPALEVKELDELLEQWRRS-KLAWLCKELPAHKP-GTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMG
        LGNP++ V       E+ + S  +  L  +L +  P G++ R L+  +  +  +D A +        + + + R++K+  +Q W + +  + T +   +G
Subjt:  LGNPALEVKELDELLEQWRRS-KLAWLCKELPAHKP-GTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMG

Query:  KERKFSKCREVFDDIINQGCVPSESTY-----PYFDCCLPEYACPRMHRGSNHHLQSYV-------------------QLG--------GYKPRLSLHNS
        +E    KC EVFD++ +QG   S  +Y      Y      E +   + R  N  +   +                    LG        G +P +  +N+
Subjt:  KERKFSKCREVFDDIINQGCVPSESTY-----PYFDCCLPEYACPRMHRGSNHHLQSYV-------------------QLG--------GYKPRLSLHNS

Query:  LFRALVSKP-GDFSKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEEREAFVYKMEVYAKVGCPMKALEIFREM
        L  A   +  GD       +AE ++  +   G+      Y  L+   ++      E++  L+ EM   G   +  ++   +E YAK G   +A+ +F +M
Subjt:  LFRALVSKP-GDFSKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEEREAFVYKMEVYAKVGCPMKALEIFREM

Query:  EQLK-SASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMNQFWPRGH-PAIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSH
        +    + +A  Y  ++ +  +    +    +  +   SN  P    Y  L+  F   G+   +  L H  +   +      Y G   + G       G H
Subjt:  EQLK-SASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMNQFWPRGH-PAIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSH

Query:  EGVEKVVKSLKAKSM
        E   K+++ + A  +
Subjt:  EGVEKVVKSLKAKSM

Q9XIL5 Pentatricopeptide repeat-containing protein At2g15820, chloroplastic5.5e-12336.65Show/hide
Query:  PSVCNPISVISMSIC------TSAFATLTLFHSLTHSLSQRHRHFRTFSAKRRPKL--PRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHF
        P++ N  S +  S+        S+++  +L     H++      F + S+ R P L      A  S +FVE L       ++SEE       + A GF  
Subjt:  PSVCNPISVISMSIC------TSAFATLTLFHSLTHSLSQRHRHFRTFSAKRRPKL--PRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHF

Query:  ENSYASADLKHLGNPAL----EVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRF
          S A  D++++    +    EV+EL+EL E+WRRSKLAWLCKE+P HK  TLVRLLNAQ+KW+RQ+DA Y+ VHC+RIRENET FRVY+W  QQ+WYRF
Subjt:  ENSYASADLKHLGNPAL----EVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRF

Query:  DYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYF----------DCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKP
        D+ L TKLA+Y+GKERKF+KCREVFDD++NQG VPSEST+             + CL E AC   +R         +QLGGYKPRLSLHNSLFRALVSK 
Subjt:  DYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYF----------DCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKP

Query:  GDFSKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------
        G      LKQAEFI+HN+VTTGLE+ KDIY GLIWLHS QD +D  RI SL +EM  AG +E +E                                   
Subjt:  GDFSKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------

Query:  AFVYKMEVYAKVGCPMKALEIFREMEQ-LKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYV---------------------------
        AFVYK+E Y+KVG   KA+EIFREME+ +  A+ + Y +II +LCK Q++EL E++M++F +S  KPL+P+++                           
Subjt:  AFVYKMEVYAKVGCPMKALEIFREMEQ-LKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYV---------------------------

Query:  --------------------------------------------------------------DLM-----------------------------------
                                                                      DLM                                   
Subjt:  --------------------------------------------------------------DLM-----------------------------------

Query:  ----------------------------------------------------------------------------------NQFWPRGHPAIPNLIHRW
                                                                                            +WP+G P IP LIHRW
Subjt:  ----------------------------------------------------------------------------------NQFWPRGHPAIPNLIHRW

Query:  LSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKA-STETE
        LSP  LAYWYMY G + SSGDI+L+LKGS EGVEKVVK+L+AKSM C+VK+KG+V+WIGL G+N+  FWKL EP +L++LK+ L+  +  L+     E +
Subjt:  LSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKA-STETE

Query:  FIKFDSQSDSDEE
         I F S SD  ++
Subjt:  FIKFDSQSDSDEE

Arabidopsis top hitse value%identityAlignment
AT1G74850.1 plastid transcriptionally active 24.6e-0819.76Show/hide
Query:  LGNPALEVKELDELLEQWRRS-KLAWLCKELPAHKP-GTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMG
        LGNP++ V       E+ + S  +  L  +L +  P G++ R L+  +  +  +D A +        + + + R++K+  +Q W + +  + T +   +G
Subjt:  LGNPALEVKELDELLEQWRRS-KLAWLCKELPAHKP-GTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRFDYALATKLADYMG

Query:  KERKFSKCREVFDDIINQGCVPSESTY-----PYFDCCLPEYACPRMHRGSNHHLQSYV-------------------QLG--------GYKPRLSLHNS
        +E    KC EVFD++ +QG   S  +Y      Y      E +   + R  N  +   +                    LG        G +P +  +N+
Subjt:  KERKFSKCREVFDDIINQGCVPSESTY-----PYFDCCLPEYACPRMHRGSNHHLQSYV-------------------QLG--------GYKPRLSLHNS

Query:  LFRALVSKP-GDFSKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEEREAFVYKMEVYAKVGCPMKALEIFREM
        L  A   +  GD       +AE ++  +   G+      Y  L+   ++      E++  L+ EM   G   +  ++   +E YAK G   +A+ +F +M
Subjt:  LFRALVSKP-GDFSKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEEREAFVYKMEVYAKVGCPMKALEIFREM

Query:  EQLK-SASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMNQFWPRGH-PAIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSH
        +    + +A  Y  ++ +  +    +    +  +   SN  P    Y  L+  F   G+   +  L H  +   +      Y G   + G       G H
Subjt:  EQLK-SASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYVDLMNQFWPRGH-PAIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSH

Query:  EGVEKVVKSLKAKSM
        E   K+++ + A  +
Subjt:  EGVEKVVKSLKAKSM

AT1G77360.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.4e-0620.54Show/hide
Query:  LADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHN
        L +  GKE    K REVF ++I+ GC P   TY      + +  C          +   +     KP       ++  LV   G  +++ L++A   +  
Subjt:  LADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDFSKHYLKQAEFIYHN

Query:  LVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEEREAFVYKMEVYAKVGCPMKALEIFREMEQLKSASATAYQRIIGILCKFQEIELA
        +  +G++    ++  LI      + +  + +  ++KEM   G+    ++    +    + G   +A ++FR+M ++    A  Y  +I + C+ +E+E A
Subjt:  LVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEEREAFVYKMEVYAKVGCPMKALEIFREMEQLKSASATAYQRIIGILCKFQEIELA

Query:  ESIMEDFIKSNLKPLVPAYVDLMN
        + + +   K  + P +  +  L+N
Subjt:  ESIMEDFIKSNLKPLVPAYVDLMN

AT2G15820.1 endonucleases3.9e-12436.65Show/hide
Query:  PSVCNPISVISMSIC------TSAFATLTLFHSLTHSLSQRHRHFRTFSAKRRPKL--PRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHF
        P++ N  S +  S+        S+++  +L     H++      F + S+ R P L      A  S +FVE L       ++SEE       + A GF  
Subjt:  PSVCNPISVISMSIC------TSAFATLTLFHSLTHSLSQRHRHFRTFSAKRRPKL--PRIPAFASCSFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHF

Query:  ENSYASADLKHLGNPAL----EVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRF
          S A  D++++    +    EV+EL+EL E+WRRSKLAWLCKE+P HK  TLVRLLNAQ+KW+RQ+DA Y+ VHC+RIRENET FRVY+W  QQ+WYRF
Subjt:  ENSYASADLKHLGNPAL----EVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRVYKWTMQQHWYRF

Query:  DYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYF----------DCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKP
        D+ L TKLA+Y+GKERKF+KCREVFDD++NQG VPSEST+             + CL E AC   +R         +QLGGYKPRLSLHNSLFRALVSK 
Subjt:  DYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYF----------DCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKP

Query:  GDFSKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------
        G      LKQAEFI+HN+VTTGLE+ KDIY GLIWLHS QD +D  RI SL +EM  AG +E +E                                   
Subjt:  GDFSKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEERE-----------------------------------

Query:  AFVYKMEVYAKVGCPMKALEIFREMEQ-LKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYV---------------------------
        AFVYK+E Y+KVG   KA+EIFREME+ +  A+ + Y +II +LCK Q++EL E++M++F +S  KPL+P+++                           
Subjt:  AFVYKMEVYAKVGCPMKALEIFREMEQ-LKSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKPLVPAYV---------------------------

Query:  --------------------------------------------------------------DLM-----------------------------------
                                                                      DLM                                   
Subjt:  --------------------------------------------------------------DLM-----------------------------------

Query:  ----------------------------------------------------------------------------------NQFWPRGHPAIPNLIHRW
                                                                                            +WP+G P IP LIHRW
Subjt:  ----------------------------------------------------------------------------------NQFWPRGHPAIPNLIHRW

Query:  LSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKA-STETE
        LSP  LAYWYMY G + SSGDI+L+LKGS EGVEKVVK+L+AKSM C+VK+KG+V+WIGL G+N+  FWKL EP +L++LK+ L+  +  L+     E +
Subjt:  LSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNAIWFWKLTEPFILDDLKDSLQADNVDLEKA-STETE

Query:  FIKFDSQSDSDEE
         I F S SD  ++
Subjt:  FIKFDSQSDSDEE

AT5G08310.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-0420.51Show/hide
Query:  AFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFR
        A+  + W  +Q  YR D      +A  + + R+ +  + +  D++N  C  S   + +F  CL       +   ++       ++G   P    +N L  
Subjt:  AFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFR

Query:  ALVSKPGDFSKHYLKQAEFIYHNL-VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAG-----------------------------IEEER
        A+       SK      E +   L        H D +     L  Y +T   ER +S+  E+L  G                             +EE  
Subjt:  ALVSKPGDFSKHYLKQAEFIYHNL-VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAG-----------------------------IEEER

Query:  EAFVYK-----MEVYAKVGCPMKALEIFREMEQL-KSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKP
            YK     +  + K     KA ++F +M ++  +A    Y  +IG LCK +++E+A S+  +  +S + P
Subjt:  EAFVYK-----MEVYAKVGCPMKALEIFREMEQL-KSASATAYQRIIGILCKFQEIELAESIMEDFIKSNLKP

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-0432.76Show/hide
Query:  ETAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTY
        E+A +V++   +Q WY+ +  +  KL   +GK ++  K  E+F ++IN+GCV +   Y
Subjt:  ETAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACTTGCTGGATCGAGGCCGAGGAATGCTTCCCGAGCCGGCCAGGCCCGTGGGCTCCCCTAATTACAAACCCTATCCGCTCCTCCCTCCCTTTTCGCGCCGCTCC
CCCTCCCTCTTCGGCGTTGCCCTGCCCTCCCTTGCTCCCCTCAGTTTGTAACCCCATTTCTGTCATCTCCATGTCCATTTGCACCTCTGCCTTTGCCACTCTCACTCTTT
TCCATTCTCTCACTCATTCCCTCTCTCAACGCCATCGCCACTTTCGAACATTTTCCGCAAAACGACGACCGAAACTTCCGCGAATTCCTGCCTTCGCTTCATGTTCCTTC
GTCGAACAGTTGTTATACGACCAGGATTCCCCGTCCGACTCTGAGGAGCACTCGTGTTCTCCATACAGTAACAGGGCTGAGGGTTTTCATTTTGAAAATAGTTATGCGTC
GGCGGATTTGAAACACTTGGGAAATCCTGCGCTTGAAGTCAAAGAGCTGGACGAGTTGCTGGAGCAGTGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAG
CGCATAAGCCGGGAACCTTAGTTCGGCTGCTTAATGCTCAGCGGAAATGGATGAGGCAGGATGATGCGGCCTATCTCATCGTGCATTGTTTGCGTATTCGTGAGAACGAG
ACTGCGTTTAGGGTGTACAAGTGGACGATGCAACAACATTGGTACCGATTTGATTATGCTCTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAGCGAAAGTTCTCAAA
GTGTCGGGAGGTATTTGATGATATAATTAATCAGGGATGTGTACCAAGTGAATCTACCTACCCATATTTTGATTGTTGCTTACCTGAGTACGCCTGTCCGAGGATGCATA
GAGGAAGCAATCACCATTTACAATCGTATGTTCAGCTAGGTGGTTACAAACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTTGTGAGCAAACCAGGGGATTTT
TCAAAACATTATCTTAAACAGGCTGAGTTTATATATCACAATTTGGTAACAACTGGACTCGAGTTGCATAAAGATATATATGGTGGTCTAATTTGGCTGCACAGTTATCA
GGATACAATAGACAAAGAAAGGATTGTGTCACTAATGAAAGAAATGCTACCAGCAGGAATCGAGGAAGAAAGAGAGGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGG
TGGGCTGCCCGATGAAAGCTTTGGAGATATTTAGGGAGATGGAGCAGTTGAAGTCGGCAAGTGCTACAGCATATCAGAGAATTATTGGGATTTTATGTAAATTTCAAGAG
ATAGAGCTTGCAGAATCCATCATGGAGGACTTCATAAAGAGTAATTTAAAACCCCTGGTGCCAGCTTATGTTGATCTGATGAATCAGTTTTGGCCACGAGGCCATCCTGC
AATCCCTAATCTAATTCACAGGTGGCTTTCACCTCGTGTTCTTGCATACTGGTATATGTATGGTGGCTCCAGACTATCATCGGGGGATATTTTATTAAAGCTGAAGGGAA
GTCATGAGGGTGTTGAGAAGGTTGTTAAATCTTTGAAAGCAAAGTCCATGAATTGCAAGGTGAAAAGGAAGGGTAGAGTGTATTGGATAGGTTTACTAGGAAGCAATGCC
ATATGGTTCTGGAAACTAACCGAACCTTTCATTCTGGATGACTTAAAAGATAGTCTACAGGCAGACAACGTTGACTTGGAGAAGGCTTCAACTGAGACTGAATTCATCAA
GTTTGATAGCCAATCTGATTCTGATGAGGAGGTTTTTGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCACTTGCTGGATCGAGGCCGAGGAATGCTTCCCGAGCCGGCCAGGCCCGTGGGCTCCCCTAATTACAAACCCTATCCGCTCCTCCCTCCCTTTTCGCGCCGCTCC
CCCTCCCTCTTCGGCGTTGCCCTGCCCTCCCTTGCTCCCCTCAGTTTGTAACCCCATTTCTGTCATCTCCATGTCCATTTGCACCTCTGCCTTTGCCACTCTCACTCTTT
TCCATTCTCTCACTCATTCCCTCTCTCAACGCCATCGCCACTTTCGAACATTTTCCGCAAAACGACGACCGAAACTTCCGCGAATTCCTGCCTTCGCTTCATGTTCCTTC
GTCGAACAGTTGTTATACGACCAGGATTCCCCGTCCGACTCTGAGGAGCACTCGTGTTCTCCATACAGTAACAGGGCTGAGGGTTTTCATTTTGAAAATAGTTATGCGTC
GGCGGATTTGAAACACTTGGGAAATCCTGCGCTTGAAGTCAAAGAGCTGGACGAGTTGCTGGAGCAGTGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAG
CGCATAAGCCGGGAACCTTAGTTCGGCTGCTTAATGCTCAGCGGAAATGGATGAGGCAGGATGATGCGGCCTATCTCATCGTGCATTGTTTGCGTATTCGTGAGAACGAG
ACTGCGTTTAGGGTGTACAAGTGGACGATGCAACAACATTGGTACCGATTTGATTATGCTCTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAGCGAAAGTTCTCAAA
GTGTCGGGAGGTATTTGATGATATAATTAATCAGGGATGTGTACCAAGTGAATCTACCTACCCATATTTTGATTGTTGCTTACCTGAGTACGCCTGTCCGAGGATGCATA
GAGGAAGCAATCACCATTTACAATCGTATGTTCAGCTAGGTGGTTACAAACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTTGTGAGCAAACCAGGGGATTTT
TCAAAACATTATCTTAAACAGGCTGAGTTTATATATCACAATTTGGTAACAACTGGACTCGAGTTGCATAAAGATATATATGGTGGTCTAATTTGGCTGCACAGTTATCA
GGATACAATAGACAAAGAAAGGATTGTGTCACTAATGAAAGAAATGCTACCAGCAGGAATCGAGGAAGAAAGAGAGGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGG
TGGGCTGCCCGATGAAAGCTTTGGAGATATTTAGGGAGATGGAGCAGTTGAAGTCGGCAAGTGCTACAGCATATCAGAGAATTATTGGGATTTTATGTAAATTTCAAGAG
ATAGAGCTTGCAGAATCCATCATGGAGGACTTCATAAAGAGTAATTTAAAACCCCTGGTGCCAGCTTATGTTGATCTGATGAATCAGTTTTGGCCACGAGGCCATCCTGC
AATCCCTAATCTAATTCACAGGTGGCTTTCACCTCGTGTTCTTGCATACTGGTATATGTATGGTGGCTCCAGACTATCATCGGGGGATATTTTATTAAAGCTGAAGGGAA
GTCATGAGGGTGTTGAGAAGGTTGTTAAATCTTTGAAAGCAAAGTCCATGAATTGCAAGGTGAAAAGGAAGGGTAGAGTGTATTGGATAGGTTTACTAGGAAGCAATGCC
ATATGGTTCTGGAAACTAACCGAACCTTTCATTCTGGATGACTTAAAAGATAGTCTACAGGCAGACAACGTTGACTTGGAGAAGGCTTCAACTGAGACTGAATTCATCAA
GTTTGATAGCCAATCTGATTCTGATGAGGAGGTTTTTGATTGA
Protein sequenceShow/hide protein sequence
MTTCWIEAEECFPSRPGPWAPLITNPIRSSLPFRAAPPPSSALPCPPLLPSVCNPISVISMSICTSAFATLTLFHSLTHSLSQRHRHFRTFSAKRRPKLPRIPAFASCSF
VEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPALEVKELDELLEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENE
TAFRVYKWTMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTYPYFDCCLPEYACPRMHRGSNHHLQSYVQLGGYKPRLSLHNSLFRALVSKPGDF
SKHYLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLMKEMLPAGIEEEREAFVYKMEVYAKVGCPMKALEIFREMEQLKSASATAYQRIIGILCKFQE
IELAESIMEDFIKSNLKPLVPAYVDLMNQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGSRLSSGDILLKLKGSHEGVEKVVKSLKAKSMNCKVKRKGRVYWIGLLGSNA
IWFWKLTEPFILDDLKDSLQADNVDLEKASTETEFIKFDSQSDSDEEVFD