; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg013758 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg013758
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold3:42182709..42191610
RNA-Seq ExpressionSpg013758
SyntenySpg013758
Gene Ontology termsGO:0016573 - histone acetylation (biological process)
GO:0000123 - histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR015418 - Chromatin modification-related protein Eaf6


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593074.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]8.2e-18762.32Show/hide
Query:  SITKLRSSSANLMLRLLLMGWNKILIRTEIEPMYYYGKPKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGK
        SI  LRSSSANLM                             RTV IWNSLINGYVKNCVFHGAF RFNEMQQC+I+PD+FTLSTLAKASSEL NV VGK
Subjt:  SITKLRSSSANLMLRLLLMGWNKILIRTEIEPMYYYGKPKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGK

Query:  LMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIG
         +HGK VRLGF+ DIVVANSL SMYFKYGECKE LKLFDEMPERNSGSWNVLLAGYASS DCFYV+EAW+ VVRSM IDG+KPDAFTISSLLPLCG+PIG
Subjt:  LMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIG

Query:  KLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIASRYVFDQIKSKNVYVWTAM--------------------------------------
        +L+YGKELHCYIVKN LDLDFGSNVHLGC LIDMYSRDNKI ASR+VFDQIK +N+YVWTAM                                      
Subjt:  KLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIASRYVFDQIKSKNVYVWTAM--------------------------------------

Query:  --------------------------------------------------------------------------GEEAIRLYNNMVELGIKPDPITVVSV
                                                                                  G+E+IRLYN+M++LGIKPD I VV V
Subjt:  --------------------------------------------------------------------------GEEAIRLYNNMVELGIKPDPITVVSV

Query:  LSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFI
        LSAC RSGL+ E LQ+Y+TAVQ+YR+EPTAEICAV+VDMLG++GELE+AL+FI+TMPVE GP VWGALVSASIRYK+Y+MLELAYR L EL+PENPSNFI
Subjt:  LSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFI

Query:  SLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFHVVADKAHPSIEIEY
        SLSNLYASASRWD+VAELRHTMKERGLRK+PGCSWININ KTHCFH VADK+HPS ++ Y
Subjt:  SLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFHVVADKAHPSIEIEY

KAG7025484.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-23353.97Show/hide
Query:  SRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEM
        +RTV IWNSLINGYVKNCVFHGAF RFNEMQQC+I+PD+FTLSTLAKASSEL NV VG+ +HGK VRLGF+ DIVVANSL SMYFKYGECKE LKLFDEM
Subjt:  SRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEM

Query:  PERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKI
        PERNSGSWNVLLAGYASS DCFYV+EAW+ VVRSM IDG+KPDAFTISSLLPLCG+PIG+L+YGKELHCYIVKN LDLDFGSNVHLGC LIDMYSRDNKI
Subjt:  PERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKI

Query:  IASRYVFDQIKSKNVYVWTAM-------------------------------------------------------------------------------
         ASR+VFDQIK +N+YVWTAM                                                                               
Subjt:  IASRYVFDQIKSKNVYVWTAM-------------------------------------------------------------------------------

Query:  ---------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLG
                                         G+E+IRLYN+M++LGIKPD I VV VLSAC RSGL+ E LQ+Y+TAVQ+YR+EPTAEICAV+VDMLG
Subjt:  ---------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLG

Query:  SIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAK
        ++GELE+AL+FI+TMPVE GP VWGALVSASIRYK+Y+MLELAYR L EL+PENPSNFISLSNLYASASRWD+VAELRHTMKERGLRK+PGCSWININ K
Subjt:  SIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAK

Query:  THCFHVVADKAHPSIEIEY------------PQAREL------------GLAILNSF-------------------------------------------
        THCFH VADK+HPS ++ Y            P + +L            G   + SF                                           
Subjt:  THCFHVVADKAHPSIEIEY------------PQAREL------------GLAILNSF-------------------------------------------

Query:  -------------------------DHMEAEGQKTATNPSAMLAGLLTRRAKLHDELRNIEKQVYDMETNYLQDPSQCGNVLKGFEGFLSASKSTALYGV
                                 D+MEAEGQKTATNPSAMLAGLLTRRAKLHDELR IEKQVYDMETNYLQDPSQCGNVLKGFEGFLSASKS+AL   
Subjt:  -------------------------DHMEAEGQKTATNPSAMLAGLLTRRAKLHDELRNIEKQVYDMETNYLQDPSQCGNVLKGFEGFLSASKSTALYGV

Query:  SIILSLSNFPLNLKRSRKFQLEDRLFSLSSVTSPAYDKESLKNCFLVLKSFEEVAFPRLFRIANSKGFFISQCKPVALRSWNLRFRRNLAEVDVEEWASM
                    LKRSRKFQL+DRLFSLSSVTSPA                EE+A  R                                          
Subjt:  SIILSLSNFPLNLKRSRKFQLEDRLFSLSSVTSPAYDKESLKNCFLVLKSFEEVAFPRLFRIANSKGFFISQCKPVALRSWNLRFRRNLAEVDVEEWASM

Query:  ENILSTYTSGMLSTDSSGFLSYELLKGGLKSLQLDEMVSCNSKEEIVLGKPKKGRPAPRDAKRMRHSSEQDFDYDDDPDLTL
                      D    L     KGG         +  N +     GKPKKGRP PRDAKRMRHSSEQDFDYDDDPDLTL
Subjt:  ENILSTYTSGMLSTDSSGFLSYELLKGGLKSLQLDEMVSCNSKEEIVLGKPKKGRPAPRDAKRMRHSSEQDFDYDDDPDLTL

XP_022143534.1 pentatricopeptide repeat-containing protein At3g12770-like [Momordica charantia]1.3e-18766.67Show/hide
Query:  TVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPE
        T+Y WNSLIN +VKN VFHGAFDRF EMQQC+I+PD FTLSTLAKAS+E+ NVAVGKL+HGKS+RLGFMLDIVVANSL SMYFKYGEC+  LKLFDEMPE
Subjt:  TVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPE

Query:  RNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIAS
        RNS SWNVLLAGYASS DCFYV+E WEVVRSM  DGIKPDAFTISS LPLCGDPIGKLSYGKELHCY+VKN LDLDFGSNVHLGC LIDMYSRDNK+IAS
Subjt:  RNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIAS

Query:  RYVFDQIKSKNVYVWTAM----------------------------------------------------------------------------------
        RYVFDQIKSKN+YVWTAM                                                                                  
Subjt:  RYVFDQIKSKNVYVWTAM----------------------------------------------------------------------------------

Query:  ------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIG
                                      G+EAIRLYN+MVELGIKPDPIT+V VLSACGRSGLVNE LQIY+T VQE+ I+PTAEICAVVVD+LGSIG
Subjt:  ------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIG

Query:  ELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHC
        ELE+ALSFI+TM VE GPGVWGALVSAS++Y+NYEMLELAYR L ELEPEN SNFISLSNLYASASRWDVVAELR+TMKERGLRKA GCSWI+IN KTHC
Subjt:  ELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHC

Query:  FHVVADKAHPSIEIEY
        FH VADKAHPS ++ Y
Subjt:  FHVVADKAHPSIEIEY

XP_023004361.1 pentatricopeptide repeat-containing protein At3g12770-like isoform X1 [Cucurbita maxima]3.7e-18765.51Show/hide
Query:  SRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEM
        +RTV IWNSLINGYVKNCVFHGAF RFNEMQQC+I+PD+FTLSTLAKASSEL NV VGK +HGK VRLGF+ DIVVANSL SMYFKYGECKE LKLFDEM
Subjt:  SRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEM

Query:  PERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKI
        PERNSGSWNVLLAGYASS DCFYV+EAW+ VVRSM IDG+KPDAFTISSLLPLCG+PIGKL+YGKELHCYIVKN LDLDFGSNVHLGC LIDMYSRDNKI
Subjt:  PERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKI

Query:  IASRYVFDQIKSKNVYVWTAM-------------------------------------------------------------------------------
         ASR+VFDQIK +N+YVWTAM                                                                               
Subjt:  IASRYVFDQIKSKNVYVWTAM-------------------------------------------------------------------------------

Query:  ---------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLG
                                         G+E+IRLYN+M++LGIKPD I VV VLSAC RSGL+ E LQ+Y+TAVQ+YR+EPTAEICAV+VDMLG
Subjt:  ---------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLG

Query:  SIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAK
        ++GELE+AL+FI+TMP+E GP VWGALVSASIRYK+Y+MLELAYR L ELEPENPSNFISLSNLYASASRWD+VAELRHTMKERGLRK+PGCSWININ K
Subjt:  SIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAK

Query:  THCFHVVADKAHPSIEIEY
        THCFH VADKAHPS ++ Y
Subjt:  THCFHVVADKAHPSIEIEY

XP_023004362.1 putative pentatricopeptide repeat-containing protein At3g23330 isoform X2 [Cucurbita maxima]6.3e-18765.64Show/hide
Query:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP
        RTV IWNSLINGYVKNCVFHGAF RFNEMQQC+I+PD+FTLSTLAKASSEL NV VGK +HGK VRLGF+ DIVVANSL SMYFKYGECKE LKLFDEMP
Subjt:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP

Query:  ERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKII
        ERNSGSWNVLLAGYASS DCFYV+EAW+ VVRSM IDG+KPDAFTISSLLPLCG+PIGKL+YGKELHCYIVKN LDLDFGSNVHLGC LIDMYSRDNKI 
Subjt:  ERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKII

Query:  ASRYVFDQIKSKNVYVWTAM--------------------------------------------------------------------------------
        ASR+VFDQIK +N+YVWTAM                                                                                
Subjt:  ASRYVFDQIKSKNVYVWTAM--------------------------------------------------------------------------------

Query:  --------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGS
                                        G+E+IRLYN+M++LGIKPD I VV VLSAC RSGL+ E LQ+Y+TAVQ+YR+EPTAEICAV+VDMLG+
Subjt:  --------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGS

Query:  IGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKT
        +GELE+AL+FI+TMP+E GP VWGALVSASIRYK+Y+MLELAYR L ELEPENPSNFISLSNLYASASRWD+VAELRHTMKERGLRK+PGCSWININ KT
Subjt:  IGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKT

Query:  HCFHVVADKAHPSIEIEY
        HCFH VADKAHPS ++ Y
Subjt:  HCFHVVADKAHPSIEIEY

TrEMBL top hitse value%identityAlignment
A0A1S4DYI5 pentatricopeptide repeat-containing protein At3g12770-like1.9e-17361.51Show/hide
Query:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP
        +TV+IWNSLI+GYVKN +FHG F RFNEMQ+CNI+PD+FTLS LAKAS+EL NV VGK +HGKS+RLG + DI+VANSL SMYFKYGECKE LKLF+EMP
Subjt:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP

Query:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA
        ERNSGSWNVLLAGYAS  DCFY +EAWE VR+M +DGIKPDAFTISSLL  CG P GKLSYGKELHCYIVKN LDLD GS  H+ C LIDMYSR+N  IA
Subjt:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA

Query:  SRYVFDQIKSKNVYVWTAM---------------------------------------------------------------------------------
        SRYVFDQIKS+N+YVWTAM                                                                                 
Subjt:  SRYVFDQIKSKNVYVWTAM---------------------------------------------------------------------------------

Query:  -------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSI
                                        EE+IRLYN+MVEL IKPD ITVV+VLSACGR GLV+E LQ+YNTAVQE++IEPTAEICA VVDMLG+I
Subjt:  -------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSI

Query:  GELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTH
        GEL++A SFI+TM VE GP VWGALVSASIRY+NYEMLELAYR+L ELEP+NPSNFISLSNLYASASRWD+VAELR+TMK+RGLRK PGCSWININ  TH
Subjt:  GELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTH

Query:  CFHVVADKAHPSIEIEY
        CFH VAD+ HP   + Y
Subjt:  CFHVVADKAHPSIEIEY

A0A5A7UUQ3 Pentatricopeptide repeat-containing protein1.9e-17361.51Show/hide
Query:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP
        +TV+IWNSLI+GYVKN +FHG F RFNEMQ+CNI+PD+FTLS LAKAS+EL NV VGK +HGKS+RLG + DI+VANSL SMYFKYGECKE LKLF+EMP
Subjt:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP

Query:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA
        ERNSGSWNVLLAGYAS  DCFY +EAWE VR+M +DGIKPDAFTISSLL  CG P GKLSYGKELHCYIVKN LDLD GS  H+ C LIDMYSR+N  IA
Subjt:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA

Query:  SRYVFDQIKSKNVYVWTAM---------------------------------------------------------------------------------
        SRYVFDQIKS+N+YVWTAM                                                                                 
Subjt:  SRYVFDQIKSKNVYVWTAM---------------------------------------------------------------------------------

Query:  -------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSI
                                        EE+IRLYN+MVEL IKPD ITVV+VLSACGR GLV+E LQ+YNTAVQE++IEPTAEICA VVDMLG+I
Subjt:  -------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSI

Query:  GELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTH
        GEL++A SFI+TM VE GP VWGALVSASIRY+NYEMLELAYR+L ELEP+NPSNFISLSNLYASASRWD+VAELR+TMK+RGLRK PGCSWININ  TH
Subjt:  GELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTH

Query:  CFHVVADKAHPSIEIEY
        CFH VAD+ HP   + Y
Subjt:  CFHVVADKAHPSIEIEY

A0A6J1CPK8 pentatricopeptide repeat-containing protein At3g12770-like6.1e-18866.67Show/hide
Query:  TVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPE
        T+Y WNSLIN +VKN VFHGAFDRF EMQQC+I+PD FTLSTLAKAS+E+ NVAVGKL+HGKS+RLGFMLDIVVANSL SMYFKYGEC+  LKLFDEMPE
Subjt:  TVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPE

Query:  RNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIAS
        RNS SWNVLLAGYASS DCFYV+E WEVVRSM  DGIKPDAFTISS LPLCGDPIGKLSYGKELHCY+VKN LDLDFGSNVHLGC LIDMYSRDNK+IAS
Subjt:  RNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIAS

Query:  RYVFDQIKSKNVYVWTAM----------------------------------------------------------------------------------
        RYVFDQIKSKN+YVWTAM                                                                                  
Subjt:  RYVFDQIKSKNVYVWTAM----------------------------------------------------------------------------------

Query:  ------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIG
                                      G+EAIRLYN+MVELGIKPDPIT+V VLSACGRSGLVNE LQIY+T VQE+ I+PTAEICAVVVD+LGSIG
Subjt:  ------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIG

Query:  ELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHC
        ELE+ALSFI+TM VE GPGVWGALVSAS++Y+NYEMLELAYR L ELEPEN SNFISLSNLYASASRWDVVAELR+TMKERGLRKA GCSWI+IN KTHC
Subjt:  ELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHC

Query:  FHVVADKAHPSIEIEY
        FH VADKAHPS ++ Y
Subjt:  FHVVADKAHPSIEIEY

A0A6J1KQ82 putative pentatricopeptide repeat-containing protein At3g23330 isoform X23.0e-18765.64Show/hide
Query:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP
        RTV IWNSLINGYVKNCVFHGAF RFNEMQQC+I+PD+FTLSTLAKASSEL NV VGK +HGK VRLGF+ DIVVANSL SMYFKYGECKE LKLFDEMP
Subjt:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP

Query:  ERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKII
        ERNSGSWNVLLAGYASS DCFYV+EAW+ VVRSM IDG+KPDAFTISSLLPLCG+PIGKL+YGKELHCYIVKN LDLDFGSNVHLGC LIDMYSRDNKI 
Subjt:  ERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKII

Query:  ASRYVFDQIKSKNVYVWTAM--------------------------------------------------------------------------------
        ASR+VFDQIK +N+YVWTAM                                                                                
Subjt:  ASRYVFDQIKSKNVYVWTAM--------------------------------------------------------------------------------

Query:  --------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGS
                                        G+E+IRLYN+M++LGIKPD I VV VLSAC RSGL+ E LQ+Y+TAVQ+YR+EPTAEICAV+VDMLG+
Subjt:  --------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGS

Query:  IGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKT
        +GELE+AL+FI+TMP+E GP VWGALVSASIRYK+Y+MLELAYR L ELEPENPSNFISLSNLYASASRWD+VAELRHTMKERGLRK+PGCSWININ KT
Subjt:  IGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKT

Query:  HCFHVVADKAHPSIEIEY
        HCFH VADKAHPS ++ Y
Subjt:  HCFHVVADKAHPSIEIEY

A0A6J1KRX1 pentatricopeptide repeat-containing protein At3g12770-like isoform X11.8e-18765.51Show/hide
Query:  SRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEM
        +RTV IWNSLINGYVKNCVFHGAF RFNEMQQC+I+PD+FTLSTLAKASSEL NV VGK +HGK VRLGF+ DIVVANSL SMYFKYGECKE LKLFDEM
Subjt:  SRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEM

Query:  PERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKI
        PERNSGSWNVLLAGYASS DCFYV+EAW+ VVRSM IDG+KPDAFTISSLLPLCG+PIGKL+YGKELHCYIVKN LDLDFGSNVHLGC LIDMYSRDNKI
Subjt:  PERNSGSWNVLLAGYASSKDCFYVREAWE-VVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKI

Query:  IASRYVFDQIKSKNVYVWTAM-------------------------------------------------------------------------------
         ASR+VFDQIK +N+YVWTAM                                                                               
Subjt:  IASRYVFDQIKSKNVYVWTAM-------------------------------------------------------------------------------

Query:  ---------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLG
                                         G+E+IRLYN+M++LGIKPD I VV VLSAC RSGL+ E LQ+Y+TAVQ+YR+EPTAEICAV+VDMLG
Subjt:  ---------------------------------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLG

Query:  SIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAK
        ++GELE+AL+FI+TMP+E GP VWGALVSASIRYK+Y+MLELAYR L ELEPENPSNFISLSNLYASASRWD+VAELRHTMKERGLRK+PGCSWININ K
Subjt:  SIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAK

Query:  THCFHVVADKAHPSIEIEY
        THCFH VADKAHPS ++ Y
Subjt:  THCFHVVADKAHPSIEIEY

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210651.7e-6532.43Show/hide
Query:  MSSITKLRSSSA-----NLMLRLLLMGWNKILIRTEI---EPMYY----YGKPKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIM-PDEFTLST
        +SSITKLR   A      + +    +G + I     +    PM Y    + K +    V+IWN+LI GY +      AF  + EM+   ++ PD  T   
Subjt:  MSSITKLRSSSA-----NLMLRLLLMGWNKILIRTEI---EPMYY----YGKPKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIM-PDEFTLST

Query:  LAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAF
        L KA + + +V +G+ +H   +R GF   I V NSL  +Y   G+     K+FD+MPE++  +WN ++ G+A +       EA  +   M   GIKPD F
Subjt:  LAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAF

Query:  TISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIASRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVEL-
        TI SLL  C   IG L+ GK +H Y++K GL      N+H    L+D+Y+R  ++  ++ +FD++  KN   WT++         G+EAI L+  M    
Subjt:  TISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIASRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVEL-

Query:  GIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFL
        G+ P  IT V +L AC   G+V E  + +    +EY+IEP  E    +VD+L   G++++A  +I++MP++    +W  L+ A   + + ++ E A   +
Subjt:  GIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFL

Query:  TELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFHVVADKAHPSIEIEYPQAREL
         +LEP +  +++ LSN+YAS  RW  V ++R  M   G++K PG S + +  + H F ++ DK+HP  +  Y + +E+
Subjt:  TELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFHVVADKAHPSIEIEYPQAREL

Q9CAY1 Putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial1.4e-6732.42Show/hide
Query:  PKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLF
        P+ S+    +N+LI+GY  N     A   F  M++  +  D  T+  L    +  E + +G+ +HG+ V+ G   ++ V NS  +MY K G  +   +LF
Subjt:  PKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLF

Query:  DEMPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDN
        DEMP +   +WN +++GY+ +   + V E +E ++S    G+ PD FT+ S+L  C   +G    G E+   +  NG    F  NV +  + I MY+R  
Subjt:  DEMPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDN

Query:  KIIASRYVFDQIKSKNVYVWTA---------MGEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGS
         +  +R VFD +  K++  WTA         MGE  + L+++M++ GI+PD    V VLSAC  SGL ++ L+++    +EY++EP  E  + +VD+LG 
Subjt:  KIIASRYVFDQIKSKNVYVWTA---------MGEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGS

Query:  IGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKT
         G L+ A+ FI +MPVE    VWGAL+ A   +KN +M ELA+  + E EP N   ++ +SN+Y+ +   + +  +R  M+ER  RK PG S++    + 
Subjt:  IGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKT

Query:  HCFHVVADKAHPSIEIEYPQARELGLAILNSFDHMEAE
        H F +  D++H   E  +    EL  +++    +M+ +
Subjt:  HCFHVVADKAHPSIEIEYPQARELGLAILNSFDHMEAE

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial5.2e-6735.28Show/hide
Query:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP
        R    W +LI+GY ++     A   FN+M +    P+EFTLS++ KA++       G  +HG  V+ GF  ++ V ++L  +Y +YG   +   +FD + 
Subjt:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP

Query:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA
         RN  SWN L+AG+A         +A E+ + ML DG +P  F+ +SL   C    G L  GK +H Y++K+G  L        G +L+DMY++   I  
Subjt:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA

Query:  SRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGEL
        +R +FD++  ++V  W ++         G+EA+  +  M  +GI+P+ I+ +SVL+AC  SGL++E    Y   +++  I P A     VVD+LG  G+L
Subjt:  SRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGEL

Query:  ERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFH
         RAL FI  MP+E    +W AL++A   +KN E+   A   + EL+P++P   + L N+YAS  RW+  A +R  MKE G++K P CSW+ I    H F 
Subjt:  ERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFH

Query:  VVADKAHPSIE
        V  D+ HP  E
Subjt:  VVADKAHPSIE

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233302.2e-7034.47Show/hide
Query:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP
        + V  +N++I GY ++ ++  A     EM   ++ PD FTLS++    SE  +V  GK +HG  +R G   D+ + +SL  MY K    ++  ++F  + 
Subjt:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP

Query:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA
         R+  SWN L+AGY  +       EA  + R M+   +KP A   SS++P C   +  L  GK+LH Y+++ G    FGSN+ +  +L+DMYS+   I A
Subjt:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA

Query:  SRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGEL
        +R +FD++   +   WTA+         G EA+ L+  M   G+KP+ +  V+VL+AC   GLV+E    +N+  + Y +    E  A V D+LG  G+L
Subjt:  SRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGEL

Query:  ERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFH
        E A +FI  M VE    VW  L+S+   +KN E+ E     +  ++ EN   ++ + N+YAS  RW  +A+LR  M+++GLRK P CSWI +  KTH F 
Subjt:  ERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFH

Query:  VVADKAHPSIEIEYPQARELGLAILNSFDHMEAEGQKTATN
        V  D++HPS++    +  E   A++   + ME EG    T+
Subjt:  VVADKAHPSIEIEYPQARELGLAILNSFDHMEAEGQKTATN

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.2e-6836.23Show/hide
Query:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHG--KSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDE
        R+V  + S+I GY +  +   A   F EM++  I PD +T++ +    +    +  GK +H   K   LGF  DI V+N+L  MY K G  +E   +F E
Subjt:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHG--KSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDE

Query:  MPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLID-GIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNK
        M  ++  SWN ++ GY  SK+C Y  EA  +   +L +    PD  T++ +LP C   +     G+E+H YI++NG    + S+ H+  SL+DMY++   
Subjt:  MPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLID-GIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNK

Query:  IIASRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSI
        ++ +  +FD I SK++  WT M         G+EAI L+N M + GI+ D I+ VS+L AC  SGLV+E  + +N    E +IEPT E  A +VDML   
Subjt:  IIASRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSI

Query:  GELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTH
        G+L +A  FI  MP+     +WGAL+     + + ++ E     + ELEPEN   ++ ++N+YA A +W+ V  LR  + +RGLRK PGCSWI I  + +
Subjt:  GELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTH

Query:  CFHVVADKAHPSIE
         F V  D ++P  E
Subjt:  CFHVVADKAHPSIE

Arabidopsis top hitse value%identityAlignment
AT3G11460.1 Pentatricopeptide repeat (PPR) superfamily protein9.6e-6932.42Show/hide
Query:  PKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLF
        P+ S+    +N+LI+GY  N     A   F  M++  +  D  T+  L    +  E + +G+ +HG+ V+ G   ++ V NS  +MY K G  +   +LF
Subjt:  PKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLF

Query:  DEMPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDN
        DEMP +   +WN +++GY+ +   + V E +E ++S    G+ PD FT+ S+L  C   +G    G E+   +  NG    F  NV +  + I MY+R  
Subjt:  DEMPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDN

Query:  KIIASRYVFDQIKSKNVYVWTA---------MGEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGS
         +  +R VFD +  K++  WTA         MGE  + L+++M++ GI+PD    V VLSAC  SGL ++ L+++    +EY++EP  E  + +VD+LG 
Subjt:  KIIASRYVFDQIKSKNVYVWTA---------MGEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGS

Query:  IGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKT
         G L+ A+ FI +MPVE    VWGAL+ A   +KN +M ELA+  + E EP N   ++ +SN+Y+ +   + +  +R  M+ER  RK PG S++    + 
Subjt:  IGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKT

Query:  HCFHVVADKAHPSIEIEYPQARELGLAILNSFDHMEAE
        H F +  D++H   E  +    EL  +++    +M+ +
Subjt:  HCFHVVADKAHPSIEIEYPQARELGLAILNSFDHMEAE

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-7134.47Show/hide
Query:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP
        + V  +N++I GY ++ ++  A     EM   ++ PD FTLS++    SE  +V  GK +HG  +R G   D+ + +SL  MY K    ++  ++F  + 
Subjt:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP

Query:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA
         R+  SWN L+AGY  +       EA  + R M+   +KP A   SS++P C   +  L  GK+LH Y+++ G    FGSN+ +  +L+DMYS+   I A
Subjt:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA

Query:  SRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGEL
        +R +FD++   +   WTA+         G EA+ L+  M   G+KP+ +  V+VL+AC   GLV+E    +N+  + Y +    E  A V D+LG  G+L
Subjt:  SRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGEL

Query:  ERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFH
        E A +FI  M VE    VW  L+S+   +KN E+ E     +  ++ EN   ++ + N+YAS  RW  +A+LR  M+++GLRK P CSWI +  KTH F 
Subjt:  ERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFH

Query:  VVADKAHPSIEIEYPQARELGLAILNSFDHMEAEGQKTATN
        V  D++HPS++    +  E   A++   + ME EG    T+
Subjt:  VVADKAHPSIEIEYPQARELGLAILNSFDHMEAEGQKTATN

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-6835.28Show/hide
Query:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP
        R    W +LI+GY ++     A   FN+M +    P+EFTLS++ KA++       G  +HG  V+ GF  ++ V ++L  +Y +YG   +   +FD + 
Subjt:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMP

Query:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA
         RN  SWN L+AG+A         +A E+ + ML DG +P  F+ +SL   C    G L  GK +H Y++K+G  L        G +L+DMY++   I  
Subjt:  ERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIA

Query:  SRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGEL
        +R +FD++  ++V  W ++         G+EA+  +  M  +GI+P+ I+ +SVL+AC  SGL++E    Y   +++  I P A     VVD+LG  G+L
Subjt:  SRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGEL

Query:  ERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFH
         RAL FI  MP+E    +W AL++A   +KN E+   A   + EL+P++P   + L N+YAS  RW+  A +R  MKE G++K P CSW+ I    H F 
Subjt:  ERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFH

Query:  VVADKAHPSIE
        V  D+ HP  E
Subjt:  VVADKAHPSIE

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein8.7e-7036.23Show/hide
Query:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHG--KSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDE
        R+V  + S+I GY +  +   A   F EM++  I PD +T++ +    +    +  GK +H   K   LGF  DI V+N+L  MY K G  +E   +F E
Subjt:  RTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHG--KSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDE

Query:  MPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLID-GIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNK
        M  ++  SWN ++ GY  SK+C Y  EA  +   +L +    PD  T++ +LP C   +     G+E+H YI++NG    + S+ H+  SL+DMY++   
Subjt:  MPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLID-GIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNK

Query:  IIASRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSI
        ++ +  +FD I SK++  WT M         G+EAI L+N M + GI+ D I+ VS+L AC  SGLV+E  + +N    E +IEPT E  A +VDML   
Subjt:  IIASRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSI

Query:  GELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTH
        G+L +A  FI  MP+     +WGAL+     + + ++ E     + ELEPEN   ++ ++N+YA A +W+ V  LR  + +RGLRK PGCSWI I  + +
Subjt:  GELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTH

Query:  CFHVVADKAHPSIE
         F V  D ++P  E
Subjt:  CFHVVADKAHPSIE

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-6632.43Show/hide
Query:  MSSITKLRSSSA-----NLMLRLLLMGWNKILIRTEI---EPMYY----YGKPKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIM-PDEFTLST
        +SSITKLR   A      + +    +G + I     +    PM Y    + K +    V+IWN+LI GY +      AF  + EM+   ++ PD  T   
Subjt:  MSSITKLRSSSA-----NLMLRLLLMGWNKILIRTEI---EPMYY----YGKPKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIM-PDEFTLST

Query:  LAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAF
        L KA + + +V +G+ +H   +R GF   I V NSL  +Y   G+     K+FD+MPE++  +WN ++ G+A +       EA  +   M   GIKPD F
Subjt:  LAKASSELENVAVGKLMHGKSVRLGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAF

Query:  TISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIASRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVEL-
        TI SLL  C   IG L+ GK +H Y++K GL      N+H    L+D+Y+R  ++  ++ +FD++  KN   WT++         G+EAI L+  M    
Subjt:  TISSLLPLCGDPIGKLSYGKELHCYIVKNGLDLDFGSNVHLGCSLIDMYSRDNKIIASRYVFDQIKSKNVYVWTAM---------GEEAIRLYNNMVEL-

Query:  GIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFL
        G+ P  IT V +L AC   G+V E  + +    +EY+IEP  E    +VD+L   G++++A  +I++MP++    +W  L+ A   + + ++ E A   +
Subjt:  GIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLGSIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFL

Query:  TELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFHVVADKAHPSIEIEYPQAREL
         +LEP +  +++ LSN+YAS  RW  V ++R  M   G++K PG S + +  + H F ++ DK+HP  +  Y + +E+
Subjt:  TELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFHVVADKAHPSIEIEYPQAREL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCAATCACCAAGCTTAGAAGCTCATCCGCCAATCTCATGCTCAGGTTATTACTCATGGGTTGGAACAAAATACTGATAAGAACCGAAATAGAACCTATGTATTA
TTACGGTAAACCCAAGTGTTCGAGGACTGTTTATATATGGAACTCCTTGATTAATGGGTATGTTAAGAATTGTGTGTTCCATGGGGCATTTGATCGTTTTAATGAAATGC
AGCAATGCAACATAATGCCGGATGAGTTTACACTATCCACATTGGCAAAAGCATCCAGCGAGCTTGAGAATGTGGCTGTTGGGAAATTGATGCATGGGAAGAGTGTACGA
CTTGGGTTCATGTTAGATATCGTCGTTGCTAATTCCCTCAGATCGATGTACTTCAAGTATGGGGAGTGCAAAGAATACTTAAAGCTATTTGATGAAATGCCTGAAAGAAA
TTCAGGTTCATGGAATGTCCTGTTAGCTGGCTATGCCAGTTCTAAAGATTGTTTCTATGTTAGAGAAGCATGGGAAGTTGTTAGAAGTATGCTGATAGATGGAATAAAAC
CCGATGCGTTTACTATTTCGAGTCTTCTGCCTTTATGTGGCGATCCCATTGGGAAATTGAGTTATGGGAAAGAGCTCCATTGTTATATAGTGAAGAATGGATTGGATCTT
GATTTTGGCTCAAATGTTCATCTTGGCTGCAGCTTGATTGATATGTACTCAAGGGACAATAAGATCATTGCTTCTAGATATGTATTTGACCAAATCAAAAGTAAAAATGT
TTATGTTTGGACAGCAATGGGCGAGGAAGCCATACGATTGTATAACAATATGGTCGAGCTTGGTATCAAGCCAGACCCGATAACTGTAGTTTCAGTTTTATCCGCTTGTG
GTCGGTCTGGTTTGGTAAACGAAGATCTCCAAATATACAACACTGCAGTTCAGGAATACAGAATCGAACCAACGGCCGAGATCTGTGCTGTTGTGGTTGACATGCTGGGC
AGTATAGGCGAGCTAGAACGAGCATTGAGTTTTATCAGAACAATGCCTGTAGAACTCGGTCCAGGCGTCTGGGGAGCTCTTGTTTCTGCTTCTATAAGATACAAGAACTA
TGAGATGTTAGAATTGGCTTACAGATTCCTGACTGAGTTAGAGCCTGAAAATCCATCTAACTTCATTTCACTGTCAAATTTGTATGCTTCTGCTAGCAGATGGGATGTGG
TGGCTGAGCTTAGACACACCATGAAGGAAAGAGGTCTGAGGAAGGCTCCTGGCTGTAGTTGGATTAACATTAATGCAAAGACTCATTGTTTCCATGTCGTTGCTGATAAA
GCACATCCAAGCATTGAGATTGAATACCCACAGGCTAGGGAGTTGGGATTAGCGATTTTGAACTCATTTGACCATATGGAAGCTGAAGGTCAAAAGACAGCCACAAATCC
GTCCGCAATGCTCGCTGGTCTTCTCACCAGGAGAGCAAAACTCCACGACGAGCTCCGGAATATCGAGAAGCAGGTCTATGACATGGAGACTAATTATCTACAGGATCCAA
GCCAGTGTGGGAATGTATTGAAAGGTTTTGAAGGATTCCTGTCTGCATCTAAGAGTACTGCTCTGTATGGAGTGTCTATAATATTATCTCTCAGTAATTTTCCTTTAAAT
TTGAAGCGGTCGAGAAAGTTTCAGCTTGAAGATAGGCTCTTCTCGTTGTCTTCAGTTACCTCCCCTGCTTATGATAAGGAGAGCTTGAAGAATTGCTTTTTGGTGTTGAA
AAGTTTTGAGGAGGTTGCCTTCCCGAGGCTCTTCAGGATTGCTAATTCCAAGGGCTTCTTCATCAGCCAATGCAAACCTGTAGCTTTGAGGAGCTGGAATCTCCGGTTCA
GGAGGAATTTGGCTGAAGTTGATGTTGAAGAATGGGCTTCCATGGAAAATATCTTATCAACATACACTTCTGGAATGTTGTCAACGGACTCTTCAGGATTCTTAAGTTAT
GAACTCTTGAAGGGAGGGCTGAAGAGCTTGCAGCTGGACGAGATGGTGAGCTGTAACAGTAAAGAAGAAATAGTTTTAGGGAAACCGAAAAAGGGAAGACCGGCCCCAAG
GGATGCAAAGAGAATGCGACATTCAAGCGAGCAAGATTTCGACTATGACGATGATCCAGACTTGACATTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCAATCACCAAGCTTAGAAGCTCATCCGCCAATCTCATGCTCAGGTTATTACTCATGGGTTGGAACAAAATACTGATAAGAACCGAAATAGAACCTATGTATTA
TTACGGTAAACCCAAGTGTTCGAGGACTGTTTATATATGGAACTCCTTGATTAATGGGTATGTTAAGAATTGTGTGTTCCATGGGGCATTTGATCGTTTTAATGAAATGC
AGCAATGCAACATAATGCCGGATGAGTTTACACTATCCACATTGGCAAAAGCATCCAGCGAGCTTGAGAATGTGGCTGTTGGGAAATTGATGCATGGGAAGAGTGTACGA
CTTGGGTTCATGTTAGATATCGTCGTTGCTAATTCCCTCAGATCGATGTACTTCAAGTATGGGGAGTGCAAAGAATACTTAAAGCTATTTGATGAAATGCCTGAAAGAAA
TTCAGGTTCATGGAATGTCCTGTTAGCTGGCTATGCCAGTTCTAAAGATTGTTTCTATGTTAGAGAAGCATGGGAAGTTGTTAGAAGTATGCTGATAGATGGAATAAAAC
CCGATGCGTTTACTATTTCGAGTCTTCTGCCTTTATGTGGCGATCCCATTGGGAAATTGAGTTATGGGAAAGAGCTCCATTGTTATATAGTGAAGAATGGATTGGATCTT
GATTTTGGCTCAAATGTTCATCTTGGCTGCAGCTTGATTGATATGTACTCAAGGGACAATAAGATCATTGCTTCTAGATATGTATTTGACCAAATCAAAAGTAAAAATGT
TTATGTTTGGACAGCAATGGGCGAGGAAGCCATACGATTGTATAACAATATGGTCGAGCTTGGTATCAAGCCAGACCCGATAACTGTAGTTTCAGTTTTATCCGCTTGTG
GTCGGTCTGGTTTGGTAAACGAAGATCTCCAAATATACAACACTGCAGTTCAGGAATACAGAATCGAACCAACGGCCGAGATCTGTGCTGTTGTGGTTGACATGCTGGGC
AGTATAGGCGAGCTAGAACGAGCATTGAGTTTTATCAGAACAATGCCTGTAGAACTCGGTCCAGGCGTCTGGGGAGCTCTTGTTTCTGCTTCTATAAGATACAAGAACTA
TGAGATGTTAGAATTGGCTTACAGATTCCTGACTGAGTTAGAGCCTGAAAATCCATCTAACTTCATTTCACTGTCAAATTTGTATGCTTCTGCTAGCAGATGGGATGTGG
TGGCTGAGCTTAGACACACCATGAAGGAAAGAGGTCTGAGGAAGGCTCCTGGCTGTAGTTGGATTAACATTAATGCAAAGACTCATTGTTTCCATGTCGTTGCTGATAAA
GCACATCCAAGCATTGAGATTGAATACCCACAGGCTAGGGAGTTGGGATTAGCGATTTTGAACTCATTTGACCATATGGAAGCTGAAGGTCAAAAGACAGCCACAAATCC
GTCCGCAATGCTCGCTGGTCTTCTCACCAGGAGAGCAAAACTCCACGACGAGCTCCGGAATATCGAGAAGCAGGTCTATGACATGGAGACTAATTATCTACAGGATCCAA
GCCAGTGTGGGAATGTATTGAAAGGTTTTGAAGGATTCCTGTCTGCATCTAAGAGTACTGCTCTGTATGGAGTGTCTATAATATTATCTCTCAGTAATTTTCCTTTAAAT
TTGAAGCGGTCGAGAAAGTTTCAGCTTGAAGATAGGCTCTTCTCGTTGTCTTCAGTTACCTCCCCTGCTTATGATAAGGAGAGCTTGAAGAATTGCTTTTTGGTGTTGAA
AAGTTTTGAGGAGGTTGCCTTCCCGAGGCTCTTCAGGATTGCTAATTCCAAGGGCTTCTTCATCAGCCAATGCAAACCTGTAGCTTTGAGGAGCTGGAATCTCCGGTTCA
GGAGGAATTTGGCTGAAGTTGATGTTGAAGAATGGGCTTCCATGGAAAATATCTTATCAACATACACTTCTGGAATGTTGTCAACGGACTCTTCAGGATTCTTAAGTTAT
GAACTCTTGAAGGGAGGGCTGAAGAGCTTGCAGCTGGACGAGATGGTGAGCTGTAACAGTAAAGAAGAAATAGTTTTAGGGAAACCGAAAAAGGGAAGACCGGCCCCAAG
GGATGCAAAGAGAATGCGACATTCAAGCGAGCAAGATTTCGACTATGACGATGATCCAGACTTGACATTGTGA
Protein sequenceShow/hide protein sequence
MSSITKLRSSSANLMLRLLLMGWNKILIRTEIEPMYYYGKPKCSRTVYIWNSLINGYVKNCVFHGAFDRFNEMQQCNIMPDEFTLSTLAKASSELENVAVGKLMHGKSVR
LGFMLDIVVANSLRSMYFKYGECKEYLKLFDEMPERNSGSWNVLLAGYASSKDCFYVREAWEVVRSMLIDGIKPDAFTISSLLPLCGDPIGKLSYGKELHCYIVKNGLDL
DFGSNVHLGCSLIDMYSRDNKIIASRYVFDQIKSKNVYVWTAMGEEAIRLYNNMVELGIKPDPITVVSVLSACGRSGLVNEDLQIYNTAVQEYRIEPTAEICAVVVDMLG
SIGELERALSFIRTMPVELGPGVWGALVSASIRYKNYEMLELAYRFLTELEPENPSNFISLSNLYASASRWDVVAELRHTMKERGLRKAPGCSWININAKTHCFHVVADK
AHPSIEIEYPQARELGLAILNSFDHMEAEGQKTATNPSAMLAGLLTRRAKLHDELRNIEKQVYDMETNYLQDPSQCGNVLKGFEGFLSASKSTALYGVSIILSLSNFPLN
LKRSRKFQLEDRLFSLSSVTSPAYDKESLKNCFLVLKSFEEVAFPRLFRIANSKGFFISQCKPVALRSWNLRFRRNLAEVDVEEWASMENILSTYTSGMLSTDSSGFLSY
ELLKGGLKSLQLDEMVSCNSKEEIVLGKPKKGRPAPRDAKRMRHSSEQDFDYDDDPDLTL