; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004782 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004782
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr08:20363081..20365373
RNA-Seq ExpressionHG10004782
SyntenyHG10004782
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020472.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]3.8e-19050.98Show/hide
Query:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE
        MRNA+DVRVLANRYAAQLQLC PQNPSS+SLARTVHAHMI SGFKPRGHLVNRLLDIYWKSSN ++ARQLFDEIP+PDAV RTTLITAYS LGNLNMARE
Subjt:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE

Query:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP
        IFNGTPLNMRDTIFYNAMITG+SH  DG+SAI LFHAMR +NF+PD+FTFTSVLSALALIVD+E QCGQMHGAVVK G GL+SSVLNALLSVY KCASSP
Subjt:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP

Query:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT
        LVSSSSLMASARKLFDEMP+R+ELTWTTLITGYVRNDDL  ARELLDTMTE LGVAWNAMISGYVHHGLFEDALTLFRKMRL+GV+ DEFTYTSVISAC 
Subjt:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT

Query:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL
        NGGFFQLGK++HAYILKNE NPNHDFLLSVSN+LITLYWKY                                                         GL
Subjt:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL

Query:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------
        AQNGFGEE L LFN+MRLDGYEPCDY FA                                                                       
Subjt:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------EKLNLE
                                                                                                      E+L+LE
Subjt:  ----------------------------------------------------------------------------------------------EKLNLE

Query:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        MKK GY+PDTKYVL DM+SEHKEYAL+TH+E+LAVGFGLMKLP GATV+VFKNLRI GDCHNAFKFM
Subjt:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

XP_022144208.1 pentatricopeptide repeat-containing protein At1g25360-like [Momordica charantia]2.2e-19050.85Show/hide
Query:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE
        MRN+L +RV+ANRYAAQLQLC PQNPSS+SLARTVHAHMIASGFKPRGHLVNRLLD+YWKSSN +YARQLFDEIP+PDAV RTTLITAYSA+GNL++AR 
Subjt:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE

Query:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP
        IFNGTPL MRDTIFYNAMITGYSH DDG+SA+ELFHAMR  NFQPD+FTFTSVLSALALIVD+E QC Q+HGAVVK G GL+SSVLNALLSVY KCASSP
Subjt:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP

Query:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT
        LV SSSLMASARKLFDEMPER+ELTWTT+ITGYVRNDDL  A  LLDTMTE LGVAWN+MISGYVHHGLF++ALTLFRKMRLLGVQHDEFTYTSVISAC 
Subjt:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT

Query:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL
        NGGFFQLGKQVHAYILKNE NPNHDF+LSVSNALITLYWK+                                                         GL
Subjt:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL

Query:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------
        AQNGFGEE LKLFNQMRLDGYEPCDY FA                                                                       
Subjt:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------EKLNLE
                                                                                                      E+L+LE
Subjt:  ----------------------------------------------------------------------------------------------EKLNLE

Query:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        MKKLGYIPDTKYVL DM+SEHKEYALSTH+EKLAVGFGLMKLP+GATV+VFKNLRI GDCHNAFKFM
Subjt:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

XP_022951057.1 pentatricopeptide repeat-containing protein At1g25360-like [Cucurbita moschata]1.7e-19051.11Show/hide
Query:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE
        MRNA+DVRVLANRYAAQLQLC PQNPSS+SLARTVHAHMI SGFKPRGHLVNRLLDIYWKSSN +YARQLFDEIP+PDAV RTTLITAYS LGNLNMARE
Subjt:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE

Query:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP
        IFNGTPLNMRDTIFYNAMITG+SH  DG+SAI LFHAMR +NF+PD+FTFTSVLSALALIVD+E QCGQMHGAVVK G GL+SSVLNALLSVY KCASSP
Subjt:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP

Query:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT
        LVSSSSLMASARKLFDEMP+R+ELTWTTLITGYVRNDDL  ARELLDTMTE LGVAWNAMISGYVHHGLFEDALTLFRKMR LGV+ DEFTYTSVISAC 
Subjt:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT

Query:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL
        NGGFFQLGK++HAYILKNE NPNHDFLLSVSN+LITLYWKY                                                         GL
Subjt:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL

Query:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------
        AQNGFGEE L LFN+MRLDGYEPCDY FA                                                                       
Subjt:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------EKLNLE
                                                                                                      E+L+LE
Subjt:  ----------------------------------------------------------------------------------------------EKLNLE

Query:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        MKK GY+PDTKYVL DM+SEHKEYAL+TH+E+LAVGFGLMKLP GATV+VFKNLRI GDCHNAFKFM
Subjt:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

XP_023537093.1 pentatricopeptide repeat-containing protein At1g25360-like [Cucurbita pepo subsp. pepo]2.1e-19353.12Show/hide
Query:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE
        MRNA+DVRVLANRYAAQLQLC PQNPSS+SLARTVHAHMI SGFKPRGHLVNRLLDIYWKSSN +YARQLFDEIP+PDAV RTTLITAYS LGNLNMARE
Subjt:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE

Query:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP
        IFNGTPLNMRDTIFYNAMITG+SH  DG+SAI LFHAMR +NF+PD+FTFTSVLSALALIVD+E QCGQMHGAVVK G GL+SSVLNALLSVY KCASSP
Subjt:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP

Query:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT
        LVSSSSLMASARKLFDEMP+R+ELTWTTLITGYVRNDDL  ARELLDTMTE LGVAWNAMISGYVHHGLFEDALTLFRKMR LGV+ DEFTYTSVISAC 
Subjt:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT

Query:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY--------------------------GLAQNGFGEESLKLFNQMRLDGYEPCDYVFA--
        NGGFFQLGK++HAYILKNE NPNHDFLLSVSN+LITLYWKY                          GLAQNGFGEE L LFN+MRLDGYEPCDY FA  
Subjt:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY--------------------------GLAQNGFGEESLKLFNQMRLDGYEPCDYVFA--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------EKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNE
                                                                       E+L+LEMKK GY+PDTKYVL DM+ EHKEYAL+TH+E
Subjt:  ---------------------------------------------------------------EKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNE

Query:  KLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        +LAVGFGLMKLP GATV+VFKNLRI GDCHNAFKFM
Subjt:  KLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

XP_038886633.1 pentatricopeptide repeat-containing protein At1g25360-like [Benincasa hispida]3.2e-20554.37Show/hide
Query:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE
        MRNALDVRVLANRYAAQLQLC PQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSS+F+ ARQLFDEIPHPDAV RTTLI+AYSALGNLNMAR+
Subjt:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE

Query:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP
        IFNGTPLNMRDTIFYNAMIT YSHKDDG+SAIELFHAMR ANFQPD+FTFTSVLSALALIVDDEHQCGQMHGAVVKFG+GL+SSVLNALLSVY KCASSP
Subjt:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP

Query:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT
        LVSSSSLMASARKLFDEMPER+ELTWTTLITGYVRNDDL  ARELLDTMTEHL VAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISAC 
Subjt:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT

Query:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL
        NGGFFQLGK+VHAYILKNE NPNHDFLLSVSNALITLYWKY                                                         GL
Subjt:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL

Query:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------
        AQNGFGEESLKLFNQMRLDGYEPCDY FA                                                                       
Subjt:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------EKLNLE
                                                                                                      EKLNLE
Subjt:  ----------------------------------------------------------------------------------------------EKLNLE

Query:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        MKKLGYIPDTKYVL DM+SEHKEYALSTH+EKLAVGFGLMKLP+GATV+VFKNLRI GDCHNAFKFM
Subjt:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

TrEMBL top hitse value%identityAlignment
A0A1S4DVG9 pentatricopeptide repeat-containing protein At1g25360-like3.8e-18852.81Show/hide
Query:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE
        MRN LDVRVLANRY AQL LC PQNPSSYSLARTVHAH+IASGFK RGH+VNRL+D+YWKSS+F+YARQLFDEIP PD + RTTLITAYSALGNL MARE
Subjt:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE

Query:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP
        IFN TPL+MRDT+FYNAMITGYSH +DG+SAIELF AMRWANFQPD+FTF SVLSA  LI DDE QCGQMHGAVVK G+GL  +VLN+LLSVY KCASSP
Subjt:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP

Query:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT
        LVSSSSLMASARKLFDEMP+RNE  WTTLITGYVRNDDL  ARE+LDTMTE  G+AWNAMISGY+HHGLFEDALTLFRKMRLLGVQ DE TYTSVISAC 
Subjt:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT

Query:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL
        +GGFF LGKQVHAYILKNE NP+ +FLLSV N LITLYWKY                                                         GL
Subjt:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL

Query:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------
        AQNGFGE++LKLFNQMRLDGYEP DY FA                                                                       
Subjt:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------EKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLR
                                               EKLNLEMKKLGYIPDTKYVL DM+SEHKEYALSTH+EKLAV FGLMKLP+GATV+VFKNLR
Subjt:  ---------------------------------------EKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLR

Query:  IRGDCHNAFKFM
        I GDCHNA KFM
Subjt:  IRGDCHNAFKFM

A0A5A7VDN1 Pentatricopeptide repeat-containing protein9.1e-18249.02Show/hide
Query:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE
        MRN LDVRVLANRY AQL LC PQNPSSYSLARTVHAH+IASGFK RGH+VNRL+D+YWKSS+F+YARQLFDEIP PD + RTTLITAYSALGNL MARE
Subjt:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE

Query:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP
        IFN TPL+MRDT+FYNAMITGYSH +DG+SAIELF AMRWANFQPD+FTF SVLSA  LI DDE QCGQMHGAVVK G+GL  +VLN+LLSVY KCASSP
Subjt:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP

Query:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT
        LVSSSSLMASARKLFDEMP+RNE  WTTLITGYVRNDDL  ARE+LDTMTE  G+AWNAMISGY+HHGLFEDALTLFRKMRLLGVQ DE TYTSVISAC 
Subjt:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT

Query:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL
        +GGFF LGKQVHAYILKNE NP+ +FLLSV N LITLYWKY                                                         GL
Subjt:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL

Query:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------
        AQNGFGE++LKLFNQMRLDGYEP DY FA                                                                       
Subjt:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------EKLNLE
                                                                                                      EKLNLE
Subjt:  ----------------------------------------------------------------------------------------------EKLNLE

Query:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        MKKLGYIPDTKYVL DM+SEHKEYALSTH+EKLAV FGLMKLP+GATV+VFKNLRI GDCHNA KFM
Subjt:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

A0A6J1CRF6 pentatricopeptide repeat-containing protein At1g25360-like1.1e-19050.85Show/hide
Query:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE
        MRN+L +RV+ANRYAAQLQLC PQNPSS+SLARTVHAHMIASGFKPRGHLVNRLLD+YWKSSN +YARQLFDEIP+PDAV RTTLITAYSA+GNL++AR 
Subjt:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE

Query:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP
        IFNGTPL MRDTIFYNAMITGYSH DDG+SA+ELFHAMR  NFQPD+FTFTSVLSALALIVD+E QC Q+HGAVVK G GL+SSVLNALLSVY KCASSP
Subjt:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP

Query:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT
        LV SSSLMASARKLFDEMPER+ELTWTT+ITGYVRNDDL  A  LLDTMTE LGVAWN+MISGYVHHGLF++ALTLFRKMRLLGVQHDEFTYTSVISAC 
Subjt:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT

Query:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL
        NGGFFQLGKQVHAYILKNE NPNHDF+LSVSNALITLYWK+                                                         GL
Subjt:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL

Query:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------
        AQNGFGEE LKLFNQMRLDGYEPCDY FA                                                                       
Subjt:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------EKLNLE
                                                                                                      E+L+LE
Subjt:  ----------------------------------------------------------------------------------------------EKLNLE

Query:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        MKKLGYIPDTKYVL DM+SEHKEYALSTH+EKLAVGFGLMKLP+GATV+VFKNLRI GDCHNAFKFM
Subjt:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

A0A6J1GGL5 pentatricopeptide repeat-containing protein At1g25360-like8.2e-19151.11Show/hide
Query:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE
        MRNA+DVRVLANRYAAQLQLC PQNPSS+SLARTVHAHMI SGFKPRGHLVNRLLDIYWKSSN +YARQLFDEIP+PDAV RTTLITAYS LGNLNMARE
Subjt:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE

Query:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP
        IFNGTPLNMRDTIFYNAMITG+SH  DG+SAI LFHAMR +NF+PD+FTFTSVLSALALIVD+E QCGQMHGAVVK G GL+SSVLNALLSVY KCASSP
Subjt:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP

Query:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT
        LVSSSSLMASARKLFDEMP+R+ELTWTTLITGYVRNDDL  ARELLDTMTE LGVAWNAMISGYVHHGLFEDALTLFRKMR LGV+ DEFTYTSVISAC 
Subjt:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT

Query:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL
        NGGFFQLGK++HAYILKNE NPNHDFLLSVSN+LITLYWKY                                                         GL
Subjt:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL

Query:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------
        AQNGFGEE L LFN+MRLDGYEPCDY FA                                                                       
Subjt:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------EKLNLE
                                                                                                      E+L+LE
Subjt:  ----------------------------------------------------------------------------------------------EKLNLE

Query:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        MKK GY+PDTKYVL DM+SEHKEYAL+TH+E+LAVGFGLMKLP GATV+VFKNLRI GDCHNAFKFM
Subjt:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

A0A6J1KSZ4 pentatricopeptide repeat-containing protein At1g25360-like2.5e-18750.59Show/hide
Query:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE
        MRN +DVRVLANRYAAQLQLC PQN SS+SLARTVHAHMI SGFK RGHLVNRLLDIYWKSSN +YARQLFDEIP+PDAV RTTLITAYS LGNLNMARE
Subjt:  MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMARE

Query:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP
        IFN TPLNMRDTIFYNAMITG+SH  DG+SAI LFHAMR +NF+PD+FTFTSVLSALALIVD+E QCGQMHGAVVK G GL+SSVLNALLSVY KCASSP
Subjt:  IFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSP

Query:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT
        LVSSSSLMASARKLFDEMP+R+ELTWTTLITGYVRNDDL  ARELLDTMTE LGVAWNAMISGYVHHGLFEDALTLFRKMR LGV+ DEFTYTSVISAC 
Subjt:  LVSSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACT

Query:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL
        NGGFFQLGK++HAYILKNE NPNHDFLLSVSN+LITLYWKY                                                         GL
Subjt:  NGGFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKY---------------------------------------------------------GL

Query:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------
        AQNGFGEE L LFN+MRLDGYEPCDY FA                                                                       
Subjt:  AQNGFGEESLKLFNQMRLDGYEPCDYVFA-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------EKLNLE
                                                                                                      E+L+LE
Subjt:  ----------------------------------------------------------------------------------------------EKLNLE

Query:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        MKK GY+PDTKYVL DM+SEHKEYAL+TH+E+LAVGFGLMKLP GATV+VFKNLRI GDCHNAFKFM
Subjt:  MKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

SwissProt top hitse value%identityAlignment
Q9FRI5 Pentatricopeptide repeat-containing protein At1g253609.2e-10747.64Show/hide
Query:  VRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGTP
        VR +ANRYAA L+LC P   +S  LAR VH ++I  GF+PR H++NRL+D+Y KSS   YARQLFDEI  PD + RTT+++ Y A G++ +AR +F   P
Subjt:  VRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGTP

Query:  LNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSS
        + MRDT+ YNAMITG+SH +DGYSAI LF  M+   F+PD FTF SVL+ LAL+ DDE QC Q H A +K G G I+SV NAL+SVY+KCASSP     S
Subjt:  LNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSS

Query:  LMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLG-VAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFF
        L+ SARK+FDE+ E++E +WTT++TGYV+N       ELL+ M +++  VA+NAMISGYV+ G +++AL + R+M   G++ DEFTY SVI AC   G  
Subjt:  LMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLG-VAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFF

Query:  QLGKQVHAYILKNETNPNHDFLLSVSNALITLYWK---------------------------------------------------------YGLAQNGF
        QLGKQVHAY+L+ E     DF     N+L++LY+K                                                          GLA+NGF
Subjt:  QLGKQVHAYILKNETNPNHDFLLSVSNALITLYWK---------------------------------------------------------YGLAQNGF

Query:  GEESLKLFNQMRLDGYEPCDYVFA
        GEE LKLF+ M+ +G+EPCDY F+
Subjt:  GEESLKLFNQMRLDGYEPCDYVFA

Q9FRI5 Pentatricopeptide repeat-containing protein At1g253605.4e-2258.02Show/hide
Query:  YVFAEKLNLEMKKLGYIPDTKYVLQDMKSE-HKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFMLTW
        Y++ + L  EM++LGY+PDT +VL D++S+ HKE  L+TH+EK+AV FGLMKLP G T+++FKNLR  GDCHN F+F L+W
Subjt:  YVFAEKLNLEMKKLGYIPDTKYVLQDMKSE-HKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFMLTW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.4e-4123.58Show/hide
Query:  LDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNG
        + + +L N Y     L S     ++   + +H H++  G     ++   L+ +Y ++     A ++FD+ PH D V  T LI  Y++ G +  A+++F+ 
Subjt:  LDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNG

Query:  TPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSS
         P  ++D + +NAMI+GY+   +   A+ELF  M   N +PDE T  +V+SA A     E    Q+H  +   G G    ++NAL+ +Y+KC        
Subjt:  TPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSS

Query:  SSLMASARKLFDEMPERNELTWTTLITGYVRND----------DLIRARELLDTMT--------EHLGV-------------------------------
           + +A  LF+ +P ++ ++W TLI GY   +          +++R+ E  + +T         HLG                                
Subjt:  SSLMASARKLFDEMPERNELTWTTLITGYVRND----------DLIRARELLDTMT--------EHLGV-------------------------------

Query:  -----------------------AWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFFQLGKQVHAYILKN-ETNP---------
                               +WNAMI G+  HG  + +  LF +MR +G+Q D+ T+  ++SAC++ G   LG+ +   + ++ +  P         
Subjt:  -----------------------AWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFFQLGKQVHAYILKN-ETNP---------

Query:  ---NHDFLLSVSNALITL--------YW--------KYGLAQNG--FGEESLK------------------------------LFNQMRLDGYEPCD---
            H  L   +  +I +         W         +G  + G  F E  +K                              L N   +     C    
Subjt:  ---NHDFLLSVSNALITL--------YW--------KYGLAQNG--FGEESLK------------------------------LFNQMRLDGYEPCD---

Query:  ---------------------YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
                             Y   E++ + ++K G++PDT  VLQ+M+ E KE AL  H+EKLA+ FGL+    G  + + KNLR+  +CH A K +
Subjt:  ---------------------YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220701.4e-4327.13Show/hide
Query:  NRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVR--TTLITAYSALGNLNMAREIFNGTPLNM
        +R+     L +  N     + + +H+H++ +GF   G ++N L+ +Y +      AR+L ++    D  +   T L+  Y  LG++N A+ IF    L  
Subjt:  NRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVR--TTLITAYSALGNLNMAREIFNGTPLNM

Query:  RDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSSLMA
        RD + + AMI GY        AI LF +M     +P+ +T  ++LS  + +    H   Q+HG+ VK G     SV NAL+++YAK  +         + 
Subjt:  RDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSSLMA

Query:  SARKLFDEMP-ERNELTWTTLITGYVRNDDLIRARELLDTM------------------TEHLGVA----------------------WNAMISGYVHHG
        SA + FD +  ER+ ++WT++I    ++     A EL +TM                    H G+                       +  M+  +   G
Subjt:  SARKLFDEMP-ERNELTWTTLITGYVRNDDLIRARELLDTM------------------TEHLGVA----------------------WNAMISGYVHHG

Query:  LFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFFQLGKQV--HAYILKNETNPNHDFLLSVSNAL--------ITLYWKYGLAQNGFGEESLKLFNQ
        L ++A     KM    ++ D  T+ S++SAC       LGK       +L+ E +  +  L ++ +A         I    K G  +   G   +++ ++
Subjt:  LFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFFQLGKQV--HAYILKNETNPNHDFLLSVSNAL--------ITLYWKYGLAQNGFGEESLKLFNQ

Query:  MRLDGYEPCD-------YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        + + G E          Y+  +K+  E+KK+GY+PDT  VL D++ E KE  L  H+EKLA+ FGL+  P+  T+++ KNLR+  DCH A KF+
Subjt:  MRLDGYEPCD-------YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

Q9SKQ4 Pentatricopeptide repeat-containing protein At2g210901.8e-3829.43Show/hide
Query:  VRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFK-PRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGT
        +R+  +  A+ LQ C   +  S    + +H H+  +GFK P   L N L+ +Y K    I A ++FD++   +      +++ Y   G L  AR +F+  
Subjt:  VRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFK-PRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGT

Query:  PLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCG-QMHGAVVKFGVGLISSVL--NALLSVYAKCASSPLV
        P   RD + +N M+ GY+   + + A+  +   R +  + +EF+F  +L+  A +   + Q   Q HG V+    G +S+V+   +++  YAKC      
Subjt:  PLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCG-QMHGAVVKFGVGLISSVL--NALLSVYAKCASSPLV

Query:  SSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNG
             M SA++ FDEM  ++   WTTLI+GY +  D+  A +L   M E   V+W A+I+GYV  G    AL LFRKM  LGV+ ++FT++S + A  + 
Subjt:  SSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNG

Query:  GFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKYG---------------------------LAQNGFGEESLKLFNQM
           + GK++H Y+++    PN      V ++LI +Y K G                           LAQ+G G ++L++ + M
Subjt:  GFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKYG---------------------------LAQNGFGEESLKLFNQM

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic4.1e-3825.15Show/hide
Query:  YAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGTPLNMRDTI
        +++ L+ CS ++       + +H H++  G     ++   L+D+Y K  + + A+++FD +P    V  T +IT Y+  GN+  AR +F+   +  RD +
Subjt:  YAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGTPLNMRDTI

Query:  FYNAMITGYSHKDDGYSAIELFH-AMRWANFQPDEFTFTSVLSALALIVDDEHQCGQ-MHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSSLMASA
         +N MI GY+       A+ LF   +     +PDE T  + LSA + I     + G+ +H  V    + L   V   L+ +Y+KC S         +  A
Subjt:  FYNAMITGYSHKDDGYSAIELFH-AMRWANFQPDEFTFTSVLSALALIVDDEHQCGQ-MHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSSLMASA

Query:  RKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLG-----VAWNAMISGYVHHGLFEDALTLFRKM-----------------RLLG-----
          +F++ P ++ + W  +I GY  +     A  L + M    G     + +   +    H GL  + + +F  M                  LLG     
Subjt:  RKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLG-----VAWNAMISGYVHHGLFEDALTLFRKM-----------------RLLG-----

Query:  -----------VQHDEFTYTSVISACTNGGFFQLGKQVHAYIL-KNETNPNHDFLLSVSNALITLYWKYGLAQNGFGEESL------------KLFNQMR
                   +  D   ++SV+ +C   G F LGK++  Y++  N  N     LLS   A +  Y      +N   E+ +               ++ R
Subjt:  -----------VQHDEFTYTSVISACTNGGFFQLGKQVHAYIL-KNETNPNHDFLLSVSNALITLYWKYGLAQNGFGEESL------------KLFNQMR

Query:  LDGYEPCD----YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
            E       Y    K++  +K  GY+P+T  VLQD++   KE +L  H+E+LA+ +GL+    G+ +K+FKNLR+  DCH   K +
Subjt:  LDGYEPCD----YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.6e-4323.58Show/hide
Query:  LDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNG
        + + +L N Y     L S     ++   + +H H++  G     ++   L+ +Y ++     A ++FD+ PH D V  T LI  Y++ G +  A+++F+ 
Subjt:  LDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNG

Query:  TPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSS
         P  ++D + +NAMI+GY+   +   A+ELF  M   N +PDE T  +V+SA A     E    Q+H  +   G G    ++NAL+ +Y+KC        
Subjt:  TPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSS

Query:  SSLMASARKLFDEMPERNELTWTTLITGYVRND----------DLIRARELLDTMT--------EHLGV-------------------------------
           + +A  LF+ +P ++ ++W TLI GY   +          +++R+ E  + +T         HLG                                
Subjt:  SSLMASARKLFDEMPERNELTWTTLITGYVRND----------DLIRARELLDTMT--------EHLGV-------------------------------

Query:  -----------------------AWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFFQLGKQVHAYILKN-ETNP---------
                               +WNAMI G+  HG  + +  LF +MR +G+Q D+ T+  ++SAC++ G   LG+ +   + ++ +  P         
Subjt:  -----------------------AWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFFQLGKQVHAYILKN-ETNP---------

Query:  ---NHDFLLSVSNALITL--------YW--------KYGLAQNG--FGEESLK------------------------------LFNQMRLDGYEPCD---
            H  L   +  +I +         W         +G  + G  F E  +K                              L N   +     C    
Subjt:  ---NHDFLLSVSNALITL--------YW--------KYGLAQNG--FGEESLK------------------------------LFNQMRLDGYEPCD---

Query:  ---------------------YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
                             Y   E++ + ++K G++PDT  VLQ+M+ E KE AL  H+EKLA+ FGL+    G  + + KNLR+  +CH A K +
Subjt:  ---------------------YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

AT1G25360.1 Pentatricopeptide repeat (PPR) superfamily protein6.5e-10847.64Show/hide
Query:  VRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGTP
        VR +ANRYAA L+LC P   +S  LAR VH ++I  GF+PR H++NRL+D+Y KSS   YARQLFDEI  PD + RTT+++ Y A G++ +AR +F   P
Subjt:  VRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGTP

Query:  LNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSS
        + MRDT+ YNAMITG+SH +DGYSAI LF  M+   F+PD FTF SVL+ LAL+ DDE QC Q H A +K G G I+SV NAL+SVY+KCASSP     S
Subjt:  LNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSS

Query:  LMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLG-VAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFF
        L+ SARK+FDE+ E++E +WTT++TGYV+N       ELL+ M +++  VA+NAMISGYV+ G +++AL + R+M   G++ DEFTY SVI AC   G  
Subjt:  LMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLG-VAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFF

Query:  QLGKQVHAYILKNETNPNHDFLLSVSNALITLYWK---------------------------------------------------------YGLAQNGF
        QLGKQVHAY+L+ E     DF     N+L++LY+K                                                          GLA+NGF
Subjt:  QLGKQVHAYILKNETNPNHDFLLSVSNALITLYWK---------------------------------------------------------YGLAQNGF

Query:  GEESLKLFNQMRLDGYEPCDYVFA
        GEE LKLF+ M+ +G+EPCDY F+
Subjt:  GEESLKLFNQMRLDGYEPCDYVFA

AT1G25360.1 Pentatricopeptide repeat (PPR) superfamily protein3.8e-2358.02Show/hide
Query:  YVFAEKLNLEMKKLGYIPDTKYVLQDMKSE-HKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFMLTW
        Y++ + L  EM++LGY+PDT +VL D++S+ HKE  L+TH+EK+AV FGLMKLP G T+++FKNLR  GDCHN F+F L+W
Subjt:  YVFAEKLNLEMKKLGYIPDTKYVLQDMKSE-HKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFMLTW

AT2G21090.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.3e-3929.43Show/hide
Query:  VRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFK-PRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGT
        +R+  +  A+ LQ C   +  S    + +H H+  +GFK P   L N L+ +Y K    I A ++FD++   +      +++ Y   G L  AR +F+  
Subjt:  VRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFK-PRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGT

Query:  PLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCG-QMHGAVVKFGVGLISSVL--NALLSVYAKCASSPLV
        P   RD + +N M+ GY+   + + A+  +   R +  + +EF+F  +L+  A +   + Q   Q HG V+    G +S+V+   +++  YAKC      
Subjt:  PLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCG-QMHGAVVKFGVGLISSVL--NALLSVYAKCASSPLV

Query:  SSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNG
             M SA++ FDEM  ++   WTTLI+GY +  D+  A +L   M E   V+W A+I+GYV  G    AL LFRKM  LGV+ ++FT++S + A  + 
Subjt:  SSSSLMASARKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNG

Query:  GFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKYG---------------------------LAQNGFGEESLKLFNQM
           + GK++H Y+++    PN      V ++LI +Y K G                           LAQ+G G ++L++ + M
Subjt:  GFFQLGKQVHAYILKNETNPNHDFLLSVSNALITLYWKYG---------------------------LAQNGFGEESLKLFNQM

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein1.0e-4427.13Show/hide
Query:  NRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVR--TTLITAYSALGNLNMAREIFNGTPLNM
        +R+     L +  N     + + +H+H++ +GF   G ++N L+ +Y +      AR+L ++    D  +   T L+  Y  LG++N A+ IF    L  
Subjt:  NRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVR--TTLITAYSALGNLNMAREIFNGTPLNM

Query:  RDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSSLMA
        RD + + AMI GY        AI LF +M     +P+ +T  ++LS  + +    H   Q+HG+ VK G     SV NAL+++YAK  +         + 
Subjt:  RDTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSSLMA

Query:  SARKLFDEMP-ERNELTWTTLITGYVRNDDLIRARELLDTM------------------TEHLGVA----------------------WNAMISGYVHHG
        SA + FD +  ER+ ++WT++I    ++     A EL +TM                    H G+                       +  M+  +   G
Subjt:  SARKLFDEMP-ERNELTWTTLITGYVRNDDLIRARELLDTM------------------TEHLGVA----------------------WNAMISGYVHHG

Query:  LFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFFQLGKQV--HAYILKNETNPNHDFLLSVSNAL--------ITLYWKYGLAQNGFGEESLKLFNQ
        L ++A     KM    ++ D  T+ S++SAC       LGK       +L+ E +  +  L ++ +A         I    K G  +   G   +++ ++
Subjt:  LFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFFQLGKQV--HAYILKNETNPNHDFLLSVSNAL--------ITLYWKYGLAQNGFGEESLKLFNQ

Query:  MRLDGYEPCD-------YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
        + + G E          Y+  +K+  E+KK+GY+PDT  VL D++ E KE  L  H+EKLA+ FGL+  P+  T+++ KNLR+  DCH A KF+
Subjt:  MRLDGYEPCD-------YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.9e-3925.15Show/hide
Query:  YAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGTPLNMRDTI
        +++ L+ CS ++       + +H H++  G     ++   L+D+Y K  + + A+++FD +P    V  T +IT Y+  GN+  AR +F+   +  RD +
Subjt:  YAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGTPLNMRDTI

Query:  FYNAMITGYSHKDDGYSAIELFH-AMRWANFQPDEFTFTSVLSALALIVDDEHQCGQ-MHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSSLMASA
         +N MI GY+       A+ LF   +     +PDE T  + LSA + I     + G+ +H  V    + L   V   L+ +Y+KC S         +  A
Subjt:  FYNAMITGYSHKDDGYSAIELFH-AMRWANFQPDEFTFTSVLSALALIVDDEHQCGQ-MHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSSLMASA

Query:  RKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLG-----VAWNAMISGYVHHGLFEDALTLFRKM-----------------RLLG-----
          +F++ P ++ + W  +I GY  +     A  L + M    G     + +   +    H GL  + + +F  M                  LLG     
Subjt:  RKLFDEMPERNELTWTTLITGYVRNDDLIRARELLDTMTEHLG-----VAWNAMISGYVHHGLFEDALTLFRKM-----------------RLLG-----

Query:  -----------VQHDEFTYTSVISACTNGGFFQLGKQVHAYIL-KNETNPNHDFLLSVSNALITLYWKYGLAQNGFGEESL------------KLFNQMR
                   +  D   ++SV+ +C   G F LGK++  Y++  N  N     LLS   A +  Y      +N   E+ +               ++ R
Subjt:  -----------VQHDEFTYTSVISACTNGGFFQLGKQVHAYIL-KNETNPNHDFLLSVSNALITLYWKYGLAQNGFGEESL------------KLFNQMR

Query:  LDGYEPCD----YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM
            E       Y    K++  +K  GY+P+T  VLQD++   KE +L  H+E+LA+ +GL+    G+ +K+FKNLR+  DCH   K +
Subjt:  LDGYEPCD----YVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHNAFKFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAATGCCTTGGACGTCCGTGTTTTGGCGAACAGGTATGCCGCACAATTGCAACTCTGTTCCCCACAAAATCCCTCTTCATATTCACTTGCTCGGACTGTTCATGC
CCACATGATTGCTTCGGGATTCAAGCCTCGTGGGCACCTTGTCAATCGTCTACTTGATATATACTGGAAATCGTCGAATTTTATTTATGCCCGCCAACTGTTCGACGAAA
TTCCCCACCCAGATGCTGTAGTGAGAACTACATTGATTACGGCGTACTCTGCGTTGGGGAATTTGAATATGGCTAGAGAAATATTCAATGGAACTCCATTGAATATGAGG
GATACTATTTTCTACAATGCAATGATTACTGGGTATTCGCACAAGGATGATGGGTATTCTGCTATTGAATTGTTTCATGCTATGAGATGGGCCAATTTTCAGCCTGATGA
ATTTACATTTACTAGTGTGCTCAGTGCTTTAGCGTTGATTGTTGATGATGAGCACCAGTGTGGCCAAATGCATGGTGCAGTGGTGAAATTTGGAGTTGGGCTTATTTCTT
CAGTGTTGAACGCTCTTCTATCTGTTTATGCTAAGTGTGCTTCTTCACCGTTGGTGTCATCGTCGTCATTGATGGCATCAGCTAGGAAACTGTTTGATGAAATGCCAGAG
AGGAATGAGTTGACATGGACGACTCTGATTACTGGGTATGTTCGGAATGATGATCTAATTAGGGCACGTGAACTTCTTGACACAATGACTGAACATCTGGGTGTAGCATG
GAATGCCATGATCTCTGGATATGTGCATCATGGTCTTTTCGAGGATGCCTTGACATTGTTTAGGAAAATGCGTTTGCTTGGTGTCCAGCACGATGAGTTCACCTATACGA
GCGTGATCAGTGCTTGCACCAATGGTGGTTTTTTTCAACTGGGAAAACAGGTGCATGCTTACATTTTGAAAAATGAGACGAACCCAAATCATGATTTTTTATTGTCTGTG
AGTAATGCATTGATTACTTTATACTGGAAATATGGACTAGCCCAAAATGGATTTGGGGAAGAGAGTTTGAAGCTGTTTAACCAAATGAGGTTAGATGGCTATGAACCTTG
TGATTATGTATTTGCAGAGAAGTTGAATCTTGAAATGAAGAAATTAGGATATATTCCAGACACAAAGTATGTGCTACAAGATATGAAATCTGAACATAAAGAATATGCCT
TATCTACTCACAATGAGAAGCTTGCAGTTGGGTTTGGGCTAATGAAGCTCCCTGAAGGTGCCACTGTCAAGGTTTTCAAGAACCTTCGGATACGTGGAGATTGTCACAAT
GCATTCAAGTTCATGCTCACGTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAATGCCTTGGACGTCCGTGTTTTGGCGAACAGGTATGCCGCACAATTGCAACTCTGTTCCCCACAAAATCCCTCTTCATATTCACTTGCTCGGACTGTTCATGC
CCACATGATTGCTTCGGGATTCAAGCCTCGTGGGCACCTTGTCAATCGTCTACTTGATATATACTGGAAATCGTCGAATTTTATTTATGCCCGCCAACTGTTCGACGAAA
TTCCCCACCCAGATGCTGTAGTGAGAACTACATTGATTACGGCGTACTCTGCGTTGGGGAATTTGAATATGGCTAGAGAAATATTCAATGGAACTCCATTGAATATGAGG
GATACTATTTTCTACAATGCAATGATTACTGGGTATTCGCACAAGGATGATGGGTATTCTGCTATTGAATTGTTTCATGCTATGAGATGGGCCAATTTTCAGCCTGATGA
ATTTACATTTACTAGTGTGCTCAGTGCTTTAGCGTTGATTGTTGATGATGAGCACCAGTGTGGCCAAATGCATGGTGCAGTGGTGAAATTTGGAGTTGGGCTTATTTCTT
CAGTGTTGAACGCTCTTCTATCTGTTTATGCTAAGTGTGCTTCTTCACCGTTGGTGTCATCGTCGTCATTGATGGCATCAGCTAGGAAACTGTTTGATGAAATGCCAGAG
AGGAATGAGTTGACATGGACGACTCTGATTACTGGGTATGTTCGGAATGATGATCTAATTAGGGCACGTGAACTTCTTGACACAATGACTGAACATCTGGGTGTAGCATG
GAATGCCATGATCTCTGGATATGTGCATCATGGTCTTTTCGAGGATGCCTTGACATTGTTTAGGAAAATGCGTTTGCTTGGTGTCCAGCACGATGAGTTCACCTATACGA
GCGTGATCAGTGCTTGCACCAATGGTGGTTTTTTTCAACTGGGAAAACAGGTGCATGCTTACATTTTGAAAAATGAGACGAACCCAAATCATGATTTTTTATTGTCTGTG
AGTAATGCATTGATTACTTTATACTGGAAATATGGACTAGCCCAAAATGGATTTGGGGAAGAGAGTTTGAAGCTGTTTAACCAAATGAGGTTAGATGGCTATGAACCTTG
TGATTATGTATTTGCAGAGAAGTTGAATCTTGAAATGAAGAAATTAGGATATATTCCAGACACAAAGTATGTGCTACAAGATATGAAATCTGAACATAAAGAATATGCCT
TATCTACTCACAATGAGAAGCTTGCAGTTGGGTTTGGGCTAATGAAGCTCCCTGAAGGTGCCACTGTCAAGGTTTTCAAGAACCTTCGGATACGTGGAGATTGTCACAAT
GCATTCAAGTTCATGCTCACGTGGTAA
Protein sequenceShow/hide protein sequence
MRNALDVRVLANRYAAQLQLCSPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVVRTTLITAYSALGNLNMAREIFNGTPLNMR
DTIFYNAMITGYSHKDDGYSAIELFHAMRWANFQPDEFTFTSVLSALALIVDDEHQCGQMHGAVVKFGVGLISSVLNALLSVYAKCASSPLVSSSSLMASARKLFDEMPE
RNELTWTTLITGYVRNDDLIRARELLDTMTEHLGVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACTNGGFFQLGKQVHAYILKNETNPNHDFLLSV
SNALITLYWKYGLAQNGFGEESLKLFNQMRLDGYEPCDYVFAEKLNLEMKKLGYIPDTKYVLQDMKSEHKEYALSTHNEKLAVGFGLMKLPEGATVKVFKNLRIRGDCHN
AFKFMLTW