; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012069 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012069
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr01:17153196..17154872
RNA-Seq ExpressionHG10012069
SyntenyHG10012069
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022963775.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata]6.4e-13089.59Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL
        MNNVYKLHCCIIK+ KQ+DP SLR LLLSCAAAAPESLS+ARYVFSRIPSPDTFA+NTIIRAHSHFFPSHSLS F SMR NGVP D FTFPFVLKAC+RL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG
        QMDLHLHSLI+KYGLDSDIFVQN+LM+VYGC G V+IAVKVF+EMSERDSVSWSTII SFVNNGYASEALALFK MQLEDKVVPDEVTMLSVISAIS LG
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG

Query:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK
        ALELGRWVR+FID+LGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALING AVHGR++
Subjt:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK

XP_022963775.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata]5.3e-0784.38Show/hide
Query:  HGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW
        H  +K+ERKIIIRDRNRFHHFDKGSCSC DYW
Subjt:  HGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW

XP_022967226.1 pentatricopeptide repeat-containing protein At5g48910-like [Cucurbita maxima]3.1e-0784.38Show/hide
Query:  HGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW
        H  + +ERKIIIRDRNRFHHFDKGSCSCHDYW
Subjt:  HGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW

XP_022967226.1 pentatricopeptide repeat-containing protein At5g48910-like [Cucurbita maxima]6.4e-13089.59Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL
        MNNVYKLHCCIIK+ KQ+DP SLR LLLSCAAAAPESLS+ARYVFSRIPSPDTFA+NTIIRAHSHFFPSHSLS F SMR NGVP D FTFPFVLKAC+RL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG
        QMDLHLHSLI+KYGLDSDIFVQN+LM+VYGC G V+IAVKVF+EMSERDSVSWSTII SFVNNGYASEALALFK MQLEDKVVPDEVTMLSVISAIS LG
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG

Query:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK
        ALELGRWVR+FID+LGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALING AVHGR++
Subjt:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK

XP_023553606.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo]1.4e-0784.38Show/hide
Query:  HGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW
        H  +++ERKIIIRDRNRFHHFDKGSCSCHDYW
Subjt:  HGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW

XP_023553606.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo]4.9e-13088.48Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL
        MNNVYKLHCCIIK+ KQNDP SLR LLLSCAAAAPESLSY RYVFSRIPSPDTFA+NTIIR HSH+FPSHSLSYF SMR NGVP D+FTFPFVLKAC+RL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG
        Q DLHLHSLI+KYGLDSDIFVQN+LM+VYGC G V+IAVKVF+EMSERDSVSWSTII SFVNNGYASEALALFK MQLEDKVVPDEVTMLSVISA+S LG
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG

Query:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK
        ALELGRWVR+FID+LGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWT LING AVHGR++
Subjt:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK

XP_038887811.1 pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida]8.1e-13391.04Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL
        M NVYKLHCCIIKSNKQNDP SLRRLLLSC AAAPESLSYARY+FSRIPSPDTFA+NTIIRAHSHFFPSHSLS+FFSMRS+GVPFD FTFPFVLKACSRL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG
        QMDLHLHSLI+KYGLDSD FVQNALM VYGCSG ++IAVKVF++MSERDSVSWSTII+SFVNNG+ASEAL LFKKMQLEDKVVPDEVTML VISAIS LG
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG

Query:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRN
        ALELGRWVRV IDRLGLEISVALGTALIDMFSRCGSIDESVVVFE+MAVRNVLTWTALINGLAVHGR+
Subjt:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRN

XP_038887811.1 pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida]1.8e-0793.1Show/hide
Query:  NKFERKIIIRDRNRFHHFDKGSCSCHDYW
        N+FER IIIRDRNRFHHFDKGSCSCHDYW
Subjt:  NKFERKIIIRDRNRFHHFDKGSCSCHDYW

XP_038887811.1 pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida]2.0e-13189.96Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL
        MNNVYKLHCCIIK+ KQNDP SLR LLLSCAAAAPESLSYARYVFSRIPSPDTFA+NTIIRAHSH+FPSHSLS F SMR NGVP D+FTFPFVLKACSRL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG
        QMDLHLHSLI+KYGLDSDIFVQN+LM+VYGC G V+IAVKVF+EMSERDSVSWST+I SFVNNGYASEALALFK MQLEDKVVPDEVTMLSVISAIS LG
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG

Query:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK
        ALELGRWVR+FID+LGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALING AVHGR++
Subjt:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK

TrEMBL top hitse value%identityAlignment
A0A0A0LC76 DYW_deaminase domain-containing protein1.3e-0689.29Show/hide
Query:  KFERKIIIRDRNRFHHFDKGSCSCHDYW
        +FERKIIIRDRNRFHHF+KG CSCHDYW
Subjt:  KFERKIIIRDRNRFHHFDKGSCSCHDYW

A0A0A0LC76 DYW_deaminase domain-containing protein2.8e-12385.45Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL
        MN+VYKLHC IIKS KQ DPLSLR LLLSC A APESLSY RYVFSRIPSPDTFA NTIIR+HSH FPSHSLSYFF+MRSNG+PFD FTFPFVLKACSRL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG
        Q++LHLHSLI+K+GLDSDIFVQNAL+ VYG  G +++AVKVF+EMSERDSVSWSTIIASF+NNG+ASEALALF+KMQLEDKVVPDEVTMLSVISAIS LG
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG

Query:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRN
         LELGRWVR FI RLGL ISVALGTALIDMFSRCGSIDES+VVFEEMAVRNVLTWT LINGL VHGR+
Subjt:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRN

A0A1S3BGQ3 pentatricopeptide repeat-containing protein At4g21065-like1.3e-0689.29Show/hide
Query:  KFERKIIIRDRNRFHHFDKGSCSCHDYW
        +FERKIIIRDRNRFHHF+KG CSCHDYW
Subjt:  KFERKIIIRDRNRFHHFDKGSCSCHDYW

A0A5A7T8N1 Pentatricopeptide repeat-containing protein1.3e-0689.29Show/hide
Query:  KFERKIIIRDRNRFHHFDKGSCSCHDYW
        +FERKIIIRDRNRFHHF+KG CSCHDYW
Subjt:  KFERKIIIRDRNRFHHFDKGSCSCHDYW

A0A5A7T8N1 Pentatricopeptide repeat-containing protein2.8e-12385.45Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL
        MN+VYKLHC IIKS KQ DPLSLR LLLSC A APESLSY RYVFSRIPSPDTFA NTIIR+HSH FPSHSLSYFF+MRSNG+PFD FTFPFVLKACSRL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG
        Q++LHLHSLI+K+GLDSDIFVQNAL+ VYG  G +++AVKVF+EMSERDSVSWSTIIASF+NNG+ASEALALF+KMQLEDKVVPDEVTMLSVISAIS LG
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG

Query:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRN
         LELGRWVR FI RLGL ISVALGTALIDMFSRCGSIDES+VVFEEMAVRNVLTWT LINGL VHGR+
Subjt:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRN

A0A6J1HIY8 pentatricopeptide repeat-containing protein At4g21065-like2.6e-0784.38Show/hide
Query:  HGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW
        H  +K+ERKIIIRDRNRFHHFDKGSCSC DYW
Subjt:  HGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW

A0A6J1HIY8 pentatricopeptide repeat-containing protein At4g21065-like4.4e-12485.45Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL
        MNNVY+LHC IIKS+KQNDPLSLR LLLSC AAAPESLSYARYVFSRIPSPDT A+NTIIR+HS FFPSHSL YFFSMRSNG+P D FTFPFVLKACSRL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG
        Q++LHLHSLI+KYGLDSDIFVQNAL+ VYG  G +++AVKVF+EMSERDSVSWST+IASF+NNGYASEAL LF+KMQLEDKVVPDEVTMLSVISAIS LG
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG

Query:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRN
         LELGRWVR FI RLGL +SVALGTALIDMFSRCGSIDES+VVFE+MAVRNVLTWTALINGL VHGR+
Subjt:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRN

A0A6J1HRF3 pentatricopeptide repeat-containing protein At5g48910-like2.4e-13088.48Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL
        MNNVYKLHCCIIK+ KQNDP SLR LLLSCAAAAPESLSY RYVFSRIPSPDTFA+NTIIR HSH+FPSHSLSYF SMR NGVP D+FTFPFVLKAC+RL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG
        Q DLHLHSLI+KYGLDSDIFVQN+LM+VYGC G V+IAVKVF+EMSERDSVSWSTII SFVNNGYASEALALFK MQLEDKVVPDEVTMLSVISA+S LG
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG

Query:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK
        ALELGRWVR+FID+LGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWT LING AVHGR++
Subjt:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK

A0A6J1HRF3 pentatricopeptide repeat-containing protein At5g48910-like1.5e-0784.38Show/hide
Query:  HGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW
        H  + +ERKIIIRDRNRFHHFDKGSCSCHDYW
Subjt:  HGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW

A0A6J1HRF3 pentatricopeptide repeat-containing protein At5g48910-like3.1e-13089.59Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL
        MNNVYKLHCCIIK+ KQ+DP SLR LLLSCAAAAPESLS+ARYVFSRIPSPDTFA+NTIIRAHSHFFPSHSLS F SMR NGVP D FTFPFVLKAC+RL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG
        QMDLHLHSLI+KYGLDSDIFVQN+LM+VYGC G V+IAVKVF+EMSERDSVSWSTII SFVNNGYASEALALFK MQLEDKVVPDEVTMLSVISAIS LG
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLG

Query:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK
        ALELGRWVR+FID+LGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALING AVHGR++
Subjt:  ALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK

SwissProt top hitse value%identityAlignment
O49399 Pentatricopeptide repeat-containing protein At4g188402.4e-4232.74Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAA-AAPESLSYARYVFSRIPSPDTFAFNTIIRAHSH-FFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACS
        +  + + H  ++K+   +D  S  +L+   A    P+++SYA  + +RI SP+ F  N++IRA+++   P  +L+ F  M    V  D ++F FVLKAC+
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAA-AAPESLSYARYVFSRIPSPDTFAFNTIIRAHSH-FFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACS

Query:  R---LQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQ------------------
             +    +H L IK GL +D+FV+N L+NVYG SG+ +IA KV + M  RD+VSW++++++++  G   EA ALF +M+                  
Subjt:  R---LQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQ------------------

Query:  -------------------------------------------LEDKV-VPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSR
                                                   L+D    PD  T++SV+SA + LG+L  G WV V+ID+ G+EI   L TAL+DM+S+
Subjt:  -------------------------------------------LEDKV-VPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSR

Query:  CGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK
        CG ID+++ VF   + R+V TW ++I+ L+VHG  K
Subjt:  CGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic3.4e-4135.71Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAH-SHFFPSHSLSYFFSMRSNGVPF-DYFTFPFVLKAC-
        +  + + H  +I++   +DP S  +L    A ++  SL YAR VF  IP P++FA+NT+IRA+ S   P  S+  F  M S    + + +TFPF++KA  
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAH-SHFFPSHSLSYFFSMRSNGVPF-DYFTFPFVLKAC-

Query:  --SRLQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISA
          S L +   LH + +K  + SD+FV N+L++ Y   G +  A KVF  + E+D VSW+++I  FV  G   +AL LFKKM+ ED V    VTM+ V+SA
Subjt:  --SRLQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISA

Query:  ISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNKFERKII
         +++  LE GR V  +I+   + +++ L  A++DM+++CGSI+++  +F+ M  ++ +TWT +++G A+    +  R+++
Subjt:  ISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNKFERKII

P93011 Pentatricopeptide repeat-containing protein At2g337601.3e-4035.5Show/hide
Query:  IIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHF-FPSHSLSYFFSMRSNGVPFDYFTFPFVLKACS-----RLQMDL
        I+    ++  L  + + L+C+A A   ++Y   +F  +P PD F FN++I++ S    P H ++Y+  M S+ V    +TF  V+K+C+     R+   +
Subjt:  IIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHF-FPSHSLSYFFSMRSNGVPFDYFTFPFVLKACS-----RLQMDL

Query:  HLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLGALEL
        H H+++  +GLD+  +VQ AL+  Y   G ++ A +VF+ M E+  V+W+++++ F  NG A EA+ +F +M+ E    PD  T +S++SA +Q GA+ L
Subjt:  HLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLGALEL

Query:  GRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHG
        G WV  +I   GL+++V LGTALI+++SRCG + ++  VF++M   NV  WTA+I+    HG
Subjt:  GRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHG

P93011 Pentatricopeptide repeat-containing protein At2g337609.7e-0468Show/hide
Query:  RKIIIRDRNRFHHFDKGSCSCHDYW
        R+I +RD+ RFHHF  GSCSC DYW
Subjt:  RKIIIRDRNRFHHFDKGSCSCHDYW

Q38959 Pentatricopeptide repeat-containing protein At3g26630, chloroplastic2.6e-4136.7Show/hide
Query:  KLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHS-HFFPSHSLSYFFSMR-SNGVPFDYFTFPFVLKAC---SRL
        ++H  IIK N  ND L +R+ L+S +++  E+  YA  VF+++ SP TF +N +IR+ S +  P  +L  F  M  S+   FD FTFPFV+KAC   S +
Subjt:  KLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHS-HFFPSHSLSYFFSMR-SNGVPFDYFTFPFVLKAC---SRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWST-------------------------------IIASFVNNGYASEA
        ++   +H L IK G  +D+F QN LM++Y   G      KVF++M  R  VSW+T                               +I ++V N    EA
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWST-------------------------------IIASFVNNGYASEA

Query:  LALFKKMQLEDKVVPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHG
          LF++MQ++D V P+E T+++++ A +QLG+L +GRWV  +  + G  +   LGTALIDM+S+CGS+ ++  VF+ M  +++ TW ++I  L VHG
Subjt:  LALFKKMQLEDKVVPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHG

Q9FI80 Pentatricopeptide repeat-containing protein At5g489102.0e-4132.71Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAA--APESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSL---SYFFSMRSNG-VPFDYFTFPFVL
        + ++ ++H   IKS +  D L+   +L  CA +      L YA  +F+++P  + F++NTIIR  S      +L   + F+ M S+  V  + FTFP VL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAA--APESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSL---SYFFSMRSNG-VPFDYFTFPFVL

Query:  KACS---RLQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFV---------------------------------------------QIAVKVFEEMS
        KAC+   ++Q    +H L +KYG   D FV + L+ +Y   GF+                                             + A  +F++M 
Subjt:  KACS---RLQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFV---------------------------------------------QIAVKVFEEMS

Query:  ERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEE
        +R  VSW+T+I+ +  NG+  +A+ +F++M+  D + P+ VT++SV+ AIS+LG+LELG W+ ++ +  G+ I   LG+ALIDM+S+CG I++++ VFE 
Subjt:  ERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEE

Query:  MAVRNVLTWTALINGLAVHGR
        +   NV+TW+A+ING A+HG+
Subjt:  MAVRNVLTWTALINGLAVHGR

Q9FI80 Pentatricopeptide repeat-containing protein At5g489102.3e-0570.37Show/hide
Query:  FERKIIIRDRNRFHHFDKGSCSCHDYW
        ++RKI +RDR RFHHF  GSCSC DYW
Subjt:  FERKIIIRDRNRFHHFDKGSCSCHDYW

Arabidopsis top hitse value%identityAlignment
AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-4235.71Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAH-SHFFPSHSLSYFFSMRSNGVPF-DYFTFPFVLKAC-
        +  + + H  +I++   +DP S  +L    A ++  SL YAR VF  IP P++FA+NT+IRA+ S   P  S+  F  M S    + + +TFPF++KA  
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAH-SHFFPSHSLSYFFSMRSNGVPF-DYFTFPFVLKAC-

Query:  --SRLQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISA
          S L +   LH + +K  + SD+FV N+L++ Y   G +  A KVF  + E+D VSW+++I  FV  G   +AL LFKKM+ ED V    VTM+ V+SA
Subjt:  --SRLQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISA

Query:  ISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNKFERKII
         +++  LE GR V  +I+   + +++ L  A++DM+++CGSI+++  +F+ M  ++ +TWT +++G A+    +  R+++
Subjt:  ISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNKFERKII

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.1e-0662.96Show/hide
Query:  FERKIIIRDRNRFHHFDKGSCSCHDYW
        ++R+II+RDR RFHHF  G CSC+D+W
Subjt:  FERKIIIRDRNRFHHFDKGSCSCHDYW

AT2G33760.1 Pentatricopeptide repeat (PPR) superfamily protein9.2e-4235.5Show/hide
Query:  IIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHF-FPSHSLSYFFSMRSNGVPFDYFTFPFVLKACS-----RLQMDL
        I+    ++  L  + + L+C+A A   ++Y   +F  +P PD F FN++I++ S    P H ++Y+  M S+ V    +TF  V+K+C+     R+   +
Subjt:  IIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHF-FPSHSLSYFFSMRSNGVPFDYFTFPFVLKACS-----RLQMDL

Query:  HLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLGALEL
        H H+++  +GLD+  +VQ AL+  Y   G ++ A +VF+ M E+  V+W+++++ F  NG A EA+ +F +M+ E    PD  T +S++SA +Q GA+ L
Subjt:  HLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLGALEL

Query:  GRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHG
        G WV  +I   GL+++V LGTALI+++SRCG + ++  VF++M   NV  WTA+I+    HG
Subjt:  GRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHG

AT2G33760.1 Pentatricopeptide repeat (PPR) superfamily protein6.9e-0568Show/hide
Query:  RKIIIRDRNRFHHFDKGSCSCHDYW
        R+I +RD+ RFHHF  GSCSC DYW
Subjt:  RKIIIRDRNRFHHFDKGSCSCHDYW

AT3G26630.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.9e-4236.7Show/hide
Query:  KLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHS-HFFPSHSLSYFFSMR-SNGVPFDYFTFPFVLKAC---SRL
        ++H  IIK N  ND L +R+ L+S +++  E+  YA  VF+++ SP TF +N +IR+ S +  P  +L  F  M  S+   FD FTFPFV+KAC   S +
Subjt:  KLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHS-HFFPSHSLSYFFSMR-SNGVPFDYFTFPFVLKAC---SRL

Query:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWST-------------------------------IIASFVNNGYASEA
        ++   +H L IK G  +D+F QN LM++Y   G      KVF++M  R  VSW+T                               +I ++V N    EA
Subjt:  QMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWST-------------------------------IIASFVNNGYASEA

Query:  LALFKKMQLEDKVVPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHG
          LF++MQ++D V P+E T+++++ A +QLG+L +GRWV  +  + G  +   LGTALIDM+S+CGS+ ++  VF+ M  +++ TW ++I  L VHG
Subjt:  LALFKKMQLEDKVVPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHG

AT4G18840.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.7e-4332.74Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAA-AAPESLSYARYVFSRIPSPDTFAFNTIIRAHSH-FFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACS
        +  + + H  ++K+   +D  S  +L+   A    P+++SYA  + +RI SP+ F  N++IRA+++   P  +L+ F  M    V  D ++F FVLKAC+
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAA-AAPESLSYARYVFSRIPSPDTFAFNTIIRAHSH-FFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACS

Query:  R---LQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQ------------------
             +    +H L IK GL +D+FV+N L+NVYG SG+ +IA KV + M  RD+VSW++++++++  G   EA ALF +M+                  
Subjt:  R---LQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQ------------------

Query:  -------------------------------------------LEDKV-VPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSR
                                                   L+D    PD  T++SV+SA + LG+L  G WV V+ID+ G+EI   L TAL+DM+S+
Subjt:  -------------------------------------------LEDKV-VPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSR

Query:  CGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK
        CG ID+++ VF   + R+V TW ++I+ L+VHG  K
Subjt:  CGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNK

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-4232.71Show/hide
Query:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAA--APESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSL---SYFFSMRSNG-VPFDYFTFPFVL
        + ++ ++H   IKS +  D L+   +L  CA +      L YA  +F+++P  + F++NTIIR  S      +L   + F+ M S+  V  + FTFP VL
Subjt:  MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAA--APESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSL---SYFFSMRSNG-VPFDYFTFPFVL

Query:  KACS---RLQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFV---------------------------------------------QIAVKVFEEMS
        KAC+   ++Q    +H L +KYG   D FV + L+ +Y   GF+                                             + A  +F++M 
Subjt:  KACS---RLQMDLHLHSLIIKYGLDSDIFVQNALMNVYGCSGFV---------------------------------------------QIAVKVFEEMS

Query:  ERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEE
        +R  VSW+T+I+ +  NG+  +A+ +F++M+  D + P+ VT++SV+ AIS+LG+LELG W+ ++ +  G+ I   LG+ALIDM+S+CG I++++ VFE 
Subjt:  ERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEISVALGTALIDMFSRCGSIDESVVVFEE

Query:  MAVRNVLTWTALINGLAVHGR
        +   NV+TW+A+ING A+HG+
Subjt:  MAVRNVLTWTALINGLAVHGR

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-0670.37Show/hide
Query:  FERKIIIRDRNRFHHFDKGSCSCHDYW
        ++RKI +RDR RFHHF  GSCSC DYW
Subjt:  FERKIIIRDRNRFHHFDKGSCSCHDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATGTTTACAAACTCCATTGTTGCATAATCAAAAGCAACAAGCAGAATGATCCTCTGTCTCTCCGTCGTCTCCTTCTTTCCTGTGCTGCTGCAGCTCCCGAAAG
CTTATCTTATGCTCGTTATGTATTCTCTCGAATTCCTTCTCCAGATACCTTCGCTTTTAACACCATCATACGAGCACATTCTCACTTCTTTCCTTCTCATTCTTTGTCCT
ATTTCTTTTCCATGCGCTCCAATGGCGTCCCTTTTGATTATTTCACATTCCCTTTTGTTCTCAAGGCATGTTCTCGATTGCAAATGGACCTTCATTTGCATTCCCTTATT
ATTAAGTATGGTTTGGACTCTGACATTTTTGTGCAGAATGCTTTGATGAATGTTTATGGGTGTAGTGGGTTTGTACAGATTGCAGTCAAGGTGTTTGAGGAAATGTCTGA
GAGAGATTCTGTCTCTTGGTCTACTATTATTGCTTCTTTTGTTAATAATGGCTATGCATCTGAGGCTTTGGCCTTGTTCAAGAAAATGCAATTGGAAGATAAAGTAGTGC
CTGATGAGGTAACCATGCTCAGTGTGATATCTGCAATCTCACAATTGGGAGCATTAGAATTGGGTCGCTGGGTTCGAGTGTTTATCGATCGGCTTGGCCTGGAAATATCT
GTTGCTTTAGGCACTGCTCTTATTGACATGTTCTCCAGATGTGGATCCATTGATGAATCAGTTGTTGTATTTGAGGAGATGGCAGTGAGGAATGTGTTGACATGGACCGC
GCTAATCAACGGGCTTGCGGTTCATGGGCGCAACAAATTTGAGAGGAAAATAATCATTCGGGATCGCAATCGGTTTCACCATTTTGATAAAGGATCGTGTTCATGTCATG
ATTATTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACAATGTTTACAAACTCCATTGTTGCATAATCAAAAGCAACAAGCAGAATGATCCTCTGTCTCTCCGTCGTCTCCTTCTTTCCTGTGCTGCTGCAGCTCCCGAAAG
CTTATCTTATGCTCGTTATGTATTCTCTCGAATTCCTTCTCCAGATACCTTCGCTTTTAACACCATCATACGAGCACATTCTCACTTCTTTCCTTCTCATTCTTTGTCCT
ATTTCTTTTCCATGCGCTCCAATGGCGTCCCTTTTGATTATTTCACATTCCCTTTTGTTCTCAAGGCATGTTCTCGATTGCAAATGGACCTTCATTTGCATTCCCTTATT
ATTAAGTATGGTTTGGACTCTGACATTTTTGTGCAGAATGCTTTGATGAATGTTTATGGGTGTAGTGGGTTTGTACAGATTGCAGTCAAGGTGTTTGAGGAAATGTCTGA
GAGAGATTCTGTCTCTTGGTCTACTATTATTGCTTCTTTTGTTAATAATGGCTATGCATCTGAGGCTTTGGCCTTGTTCAAGAAAATGCAATTGGAAGATAAAGTAGTGC
CTGATGAGGTAACCATGCTCAGTGTGATATCTGCAATCTCACAATTGGGAGCATTAGAATTGGGTCGCTGGGTTCGAGTGTTTATCGATCGGCTTGGCCTGGAAATATCT
GTTGCTTTAGGCACTGCTCTTATTGACATGTTCTCCAGATGTGGATCCATTGATGAATCAGTTGTTGTATTTGAGGAGATGGCAGTGAGGAATGTGTTGACATGGACCGC
GCTAATCAACGGGCTTGCGGTTCATGGGCGCAACAAATTTGAGAGGAAAATAATCATTCGGGATCGCAATCGGTTTCACCATTTTGATAAAGGATCGTGTTCATGTCATG
ATTATTGGTGA
Protein sequenceShow/hide protein sequence
MNNVYKLHCCIIKSNKQNDPLSLRRLLLSCAAAAPESLSYARYVFSRIPSPDTFAFNTIIRAHSHFFPSHSLSYFFSMRSNGVPFDYFTFPFVLKACSRLQMDLHLHSLI
IKYGLDSDIFVQNALMNVYGCSGFVQIAVKVFEEMSERDSVSWSTIIASFVNNGYASEALALFKKMQLEDKVVPDEVTMLSVISAISQLGALELGRWVRVFIDRLGLEIS
VALGTALIDMFSRCGSIDESVVVFEEMAVRNVLTWTALINGLAVHGRNKFERKIIIRDRNRFHHFDKGSCSCHDYW