; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022590 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022590
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00000289:1319941..1339027
RNA-Seq ExpressionSgr022590
SyntenySgr022590
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
GO:0042868 - antisense RNA metabolic process (biological process)
GO:0045892 - negative regulation of transcription, DNA-templated (biological process)
GO:0048589 - developmental growth (biological process)
GO:0080156 - mitochondrial mRNA modification (biological process)
GO:0098789 - pre-mRNA cleavage required for polyadenylation (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005847 - mRNA cleavage and polyadenylation specificity factor complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR038192 - Transcription termination and cleavage factor, C-terminal domain superfamily
IPR036093 - NAC domain superfamily
IPR035979 - RNA-binding domain superfamily
IPR032867 - DYW domain
IPR026896 - Transcription termination and cleavage factor, C-terminal domain
IPR025742 - Cleavage stimulation factor subunit 2, hinge domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR003441 - NAC domain
IPR002885 - Pentatricopeptide repeat
IPR000504 - RNA recognition motif domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456696.1 PREDICTED: pentatricopeptide repeat-containing protein At1g34160 [Cucumis melo]2.4e-25077.54Show/hide
Query:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR
        MAYF+LLLQKCSSFS IKQLQANLI NG F F SSRTKLLELCA+S  GDL++A+HIF +IR PSTNDWNA+IRGTALS+DPANAVVWYRAMA SNGPHR
Subjt:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR

Query:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH
        +DALTCSF LKACARALA SEA+Q+HSQL RFGFNADVLLQTTLLD YAKVGDLDLAQKLFDEM +PDIASWNALI+GFAQGSRP++AI +FKRM+E  +
Subjt:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH

Query:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR
        +RPN VTVQ                       +EKLDMNVQVCN+VIDMYAKCGS+DKAYWVFENM                        +   F++LGR
Subjt:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR

Query:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
        SGMSPDAVSYLAVLCACNHAGLVEDG+KLFN MAQRGL PNIKHYGS+VDLLGRAGRLKEAY+IVNS+PFPNMVLWQTLLGACRTYG+VEMAELAS KLV
Subjt:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV

Query:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE
        EMGFISCGDFVLLSNVYA RQRWDDVGRVRDAMR RDVKK PGFSYIE+KGKM+ FVYGDQ+HSSCREIYAKLDEI  RIK+YGY+A+T NVLHDIG+E+
Subjt:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE

Query:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        KENALCYHSEKLAVAFGLTCTEEG PI VIKNLRICGDCH VIKLISKIYNREI++    R
Subjt:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

XP_022133715.1 pentatricopeptide repeat-containing protein At1g34160 [Momordica charantia]3.2e-25580.04Show/hide
Query:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR
        MAYFDLLLQKCSSFSQIKQLQANLITNG+FQ  SSRTKLLELCAIS FGDL HA+ IF HIR P TNDWNAVIRGTALS+DPANAV+WYRAMA S GPHR
Subjt:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR

Query:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH
        VDALTCSFTLKACARALARSEAMQ+HSQL RFGF+AD+LLQTTLLDAYAKVGDLD AQKLFDE+ QPDIASWNALIAGFAQGSRP +AIALFKRM+ED +
Subjt:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH

Query:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR
        +RPNEVTVQ                       +E LDMNVQVCN+VIDMYAKCGSVDKAYWVF+NM                        +   FE+LG 
Subjt:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR

Query:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
        SG+SPDAVSYL VLCACNH GLVEDGVKLFNSM +RGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
Subjt:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV

Query:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE
        EMGFISCGDFVLLSNVYA  QRWDDVGR+RDAMRRRDVKK PGFSY EVKGKMH F YGDQ HSSC EIYAKLDEIKFRIK+ GYAAETGNVLHDIGEE+
Subjt:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE

Query:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        KENALCYHSEKLAVAFGL CTEEGT I VIKNLRIC DCH VIKLISKIYNREI++    R
Subjt:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

XP_031743171.1 pentatricopeptide repeat-containing protein At1g34160 [Cucumis sativus]1.8e-25078.07Show/hide
Query:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR
        MAYF+LLLQKCSSFSQIKQLQANLI NG F F SSRTKLLELCAIS FGDL++A+HIF +I  PSTNDWNAVIRGTALS+DPANAV WYRAMA SNG HR
Subjt:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR

Query:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH
        +DALTCSF LKACARALARSEA+Q+HSQL RFGFNADVLLQTTLLDAYAK+GDLDLAQKLFDEM QPDIASWNALIAGFAQGSRP++AI  FKRM+ D +
Subjt:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH

Query:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR
        +RPN VTVQ                       +EKL+ NVQVCN+VIDMYAKCGS+DKAYWVFENM                        +   FE+LGR
Subjt:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR

Query:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
        SGMSPDAVSYLAVLCACNHAGLVEDG+KLFNSM QRGL PNIKHYGS+VDLLGRAGRLKEAY+IV+S+PFPNMVLWQTLLGACRTYG+VEMAELASRKLV
Subjt:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV

Query:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE
        EMGFISCGDFVLLSNVYA RQRWDDVGRVRDAMRRRDVKK PGFSYIE+KGKM+ FV GDQ+HSSCREIYAKLDEI  RIK+YGY+A+T NVLHDIG+E+
Subjt:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE

Query:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        KENALCYHSEKLAVAFGLTCTEEGTPI VIKNLRICGDCH VIKLISKIY REI++    R
Subjt:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

XP_038885002.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida]3.7e-25981.11Show/hide
Query:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR
        MAYF+LLLQKCSSFSQIKQLQANLI NG FQF SSRTKLLELCAIS FGDL++A+HIF +IR PSTNDWNAVIRGTALS+DPANAV  YRAMA SNGPHR
Subjt:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR

Query:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH
        +DALTCSF LKACARALARSEAMQ+HSQL RFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEM QPDIASWNALIAGFAQGSRP +AI LFKRM+E+ +
Subjt:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH

Query:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR
        +RPNEVTVQ                       +EKLDMNVQVCN+VIDMYAKCGSVDKAYWVFENM                        ++  FE++ R
Subjt:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR

Query:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
        SGMSPDAVSYL+VLCACNHAGLVEDG+KLF+SM QRGLAPNIKHYG++VDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
Subjt:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV

Query:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE
        EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKK PGFSYIEVKGKMH FVYGDQ+H S REIYAKLDEIKFRIK+YGYAAETGNVLHDIG+E+
Subjt:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE

Query:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        KENALCYHSEKLAVAFGLTCTEEGTPI VIKNLRICGDCH VIKLIS+IYNREIV+    R
Subjt:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

XP_038885014.1 pentatricopeptide repeat-containing protein At1g34160 isoform X2 [Benincasa hispida]3.7e-25981.11Show/hide
Query:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR
        MAYF+LLLQKCSSFSQIKQLQANLI NG FQF SSRTKLLELCAIS FGDL++A+HIF +IR PSTNDWNAVIRGTALS+DPANAV  YRAMA SNGPHR
Subjt:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR

Query:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH
        +DALTCSF LKACARALARSEAMQ+HSQL RFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEM QPDIASWNALIAGFAQGSRP +AI LFKRM+E+ +
Subjt:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH

Query:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR
        +RPNEVTVQ                       +EKLDMNVQVCN+VIDMYAKCGSVDKAYWVFENM                        ++  FE++ R
Subjt:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR

Query:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
        SGMSPDAVSYL+VLCACNHAGLVEDG+KLF+SM QRGLAPNIKHYG++VDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
Subjt:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV

Query:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE
        EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKK PGFSYIEVKGKMH FVYGDQ+H S REIYAKLDEIKFRIK+YGYAAETGNVLHDIG+E+
Subjt:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE

Query:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        KENALCYHSEKLAVAFGLTCTEEGTPI VIKNLRICGDCH VIKLIS+IYNREIV+    R
Subjt:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

TrEMBL top hitse value%identityAlignment
A0A0A0KB77 DYW_deaminase domain-containing protein8.9e-25178.07Show/hide
Query:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR
        MAYF+LLLQKCSSFSQIKQLQANLI NG F F SSRTKLLELCAIS FGDL++A+HIF +I  PSTNDWNAVIRGTALS+DPANAV WYRAMA SNG HR
Subjt:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR

Query:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH
        +DALTCSF LKACARALARSEA+Q+HSQL RFGFNADVLLQTTLLDAYAK+GDLDLAQKLFDEM QPDIASWNALIAGFAQGSRP++AI  FKRM+ D +
Subjt:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH

Query:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR
        +RPN VTVQ                       +EKL+ NVQVCN+VIDMYAKCGS+DKAYWVFENM                        +   FE+LGR
Subjt:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR

Query:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
        SGMSPDAVSYLAVLCACNHAGLVEDG+KLFNSM QRGL PNIKHYGS+VDLLGRAGRLKEAY+IV+S+PFPNMVLWQTLLGACRTYG+VEMAELASRKLV
Subjt:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV

Query:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE
        EMGFISCGDFVLLSNVYA RQRWDDVGRVRDAMRRRDVKK PGFSYIE+KGKM+ FV GDQ+HSSCREIYAKLDEI  RIK+YGY+A+T NVLHDIG+E+
Subjt:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE

Query:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        KENALCYHSEKLAVAFGLTCTEEGTPI VIKNLRICGDCH VIKLISKIY REI++    R
Subjt:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

A0A1S3C3U7 pentatricopeptide repeat-containing protein At1g341601.2e-25077.54Show/hide
Query:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR
        MAYF+LLLQKCSSFS IKQLQANLI NG F F SSRTKLLELCA+S  GDL++A+HIF +IR PSTNDWNA+IRGTALS+DPANAVVWYRAMA SNGPHR
Subjt:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR

Query:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH
        +DALTCSF LKACARALA SEA+Q+HSQL RFGFNADVLLQTTLLD YAKVGDLDLAQKLFDEM +PDIASWNALI+GFAQGSRP++AI +FKRM+E  +
Subjt:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH

Query:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR
        +RPN VTVQ                       +EKLDMNVQVCN+VIDMYAKCGS+DKAYWVFENM                        +   F++LGR
Subjt:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR

Query:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
        SGMSPDAVSYLAVLCACNHAGLVEDG+KLFN MAQRGL PNIKHYGS+VDLLGRAGRLKEAY+IVNS+PFPNMVLWQTLLGACRTYG+VEMAELAS KLV
Subjt:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV

Query:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE
        EMGFISCGDFVLLSNVYA RQRWDDVGRVRDAMR RDVKK PGFSYIE+KGKM+ FVYGDQ+HSSCREIYAKLDEI  RIK+YGY+A+T NVLHDIG+E+
Subjt:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE

Query:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        KENALCYHSEKLAVAFGLTCTEEG PI VIKNLRICGDCH VIKLISKIYNREI++    R
Subjt:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

A0A6J1BW13 pentatricopeptide repeat-containing protein At1g341601.6e-25580.04Show/hide
Query:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR
        MAYFDLLLQKCSSFSQIKQLQANLITNG+FQ  SSRTKLLELCAIS FGDL HA+ IF HIR P TNDWNAVIRGTALS+DPANAV+WYRAMA S GPHR
Subjt:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR

Query:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH
        VDALTCSFTLKACARALARSEAMQ+HSQL RFGF+AD+LLQTTLLDAYAKVGDLD AQKLFDE+ QPDIASWNALIAGFAQGSRP +AIALFKRM+ED +
Subjt:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH

Query:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR
        +RPNEVTVQ                       +E LDMNVQVCN+VIDMYAKCGSVDKAYWVF+NM                        +   FE+LG 
Subjt:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR

Query:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
        SG+SPDAVSYL VLCACNH GLVEDGVKLFNSM +RGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
Subjt:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV

Query:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE
        EMGFISCGDFVLLSNVYA  QRWDDVGR+RDAMRRRDVKK PGFSY EVKGKMH F YGDQ HSSC EIYAKLDEIKFRIK+ GYAAETGNVLHDIGEE+
Subjt:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE

Query:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        KENALCYHSEKLAVAFGL CTEEGT I VIKNLRIC DCH VIKLISKIYNREI++    R
Subjt:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

A0A6J1BWU0 cleavage stimulating factor 644.4e-25090.35Show/hide
Query:  IGNIPYDATEEQLIEICQEVGPVVSFRLVIDRETGKPKGYGFCEYKDEETALSARRNLQGYEINGRQLRVDFAENDKGADRNREQGRGGPGLVANAGGPT
        +GNIPYDATEEQLIEICQEVGPVVSFRLVIDRETGKPKGYGFCEYKDEETALSARRNLQGYEINGRQLRVDFAENDKG+DRNREQGRGGPGLVANAGGPT
Subjt:  IGNIPYDATEEQLIEICQEVGPVVSFRLVIDRETGKPKGYGFCEYKDEETALSARRNLQGYEINGRQLRVDFAENDKGADRNREQGRGGPGLVANAGGPT

Query:  AHGESSQHQPIGLHIAITAAAVMAGALGGAQAASNQNILQSATMANDPLTLHLAKLSRSQLTEIMSGLKVMATQNKDLARQLLLARPQLSKALFLSQIML
        AHGESSQHQPIGLHIAITAAAVMAGALGGAQ A+NQNILQSA M NDPLTLHLAKLSRSQLTEIMSGLKVMATQNKDLARQLLLARPQL+KALF SQIML
Subjt:  AHGESSQHQPIGLHIAITAAAVMAGALGGAQAASNQNILQSATMANDPLTLHLAKLSRSQLTEIMSGLKVMATQNKDLARQLLLARPQLSKALFLSQIML

Query:  GMVTPQVLQKPNLRQPSTHPQLPLHDIQQGQPSSLQIQPGLPPLAPNKMQTGFVPKIKETQMSLGPQNPLAPRQFSASPRPPLQSQIPLSHALQGTLTGI
        GMVTPQVLQKPNLRQPSTHPQLP  DIQQGQPSSLQIQPGLPPLAPN+MQTGFVPK+KETQ+SL PQNPLA  QFSAS RPPLQSQI LSHALQGTLTG+
Subjt:  GMVTPQVLQKPNLRQPSTHPQLPLHDIQQGQPSSLQIQPGLPPLAPNKMQTGFVPKIKETQMSLGPQNPLAPRQFSASPRPPLQSQIPLSHALQGTLTGI

Query:  PAGSSLPSINLQGNLSVRQQVQGPTFSPLKQHMHPPSQQYSGHGGAVIPGHNAQIANPEARPSLLPHPSLPDADFQPGPSTAYSASQIVGGDVDKSSQVP
        P  SSLPSINLQGNL VRQQVQ PT S LKQHM PPSQQYSGHGGAVIP HNAQIANPEARPSLLPHPSL DADFQPGPSTAYSASQIVG DVDK S+ P
Subjt:  PAGSSLPSINLQGNLSVRQQVQGPTFSPLKQHMHPPSQQYSGHGGAVIPGHNAQIANPEARPSLLPHPSLPDADFQPGPSTAYSASQIVGGDVDKSSQVP

Query:  LGVDGKKNILHGFSGTINRPAKQMRLEDGKGRSFSAGGLSASLDINGSGQLGVASDPKLAGTQISEKTTSVLPPDVESALLQQVLNLTPEQLNSLPLEQR
        LGVDGKK     FSGTINRP KQMRLEDGKG SFSAGGLSAS+D NGSGQLGVASDP+LA    SEK T++LP DVESALLQQVLNLTPEQLNSLPLEQR
Subjt:  LGVDGKKNILHGFSGTINRPAKQMRLEDGKGRSFSAGGLSASLDINGSGQLGVASDPKLAGTQISEKTTSVLPPDVESALLQQVLNLTPEQLNSLPLEQR

Query:  QQVIQLQQALRRDQIRPS
        QQVIQLQQALRRDQIRPS
Subjt:  QQVIQLQQALRRDQIRPS

A0A6J1H8R5 pentatricopeptide repeat-containing protein At1g341603.5e-24776.65Show/hide
Query:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR
        MAYFDLLLQKCSSFSQIKQLQANLITNG F F SSRTKLLELCAIS FGDL+HA+HIF HI  PST DWNAVIRGTALS++P+NA+ WYR M  SNGPHR
Subjt:  MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHR

Query:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH
        VDALTCSF LKACARALARSE MQ+HSQ+ RFGF+ADVLLQTTLLDAYAKV DLD AQK+FDEM +PDIASWN+LIAGFAQG RPS+AI LFKRM+ED +
Subjt:  VDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEH

Query:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR
        +RPNEVTVQ                       +E LD  VQVCN+VIDMYAKCGSVDKAYWVFENM                        +   FE+LGR
Subjt:  MRPNEVTVQ-----------------------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM------------------------SAGFFEQLGR

Query:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV
        SG+ PDA+SYLAVLCACNHAGL+EDG+KLFNSM QRG+APNIKHYG VVDLLGRAGRLKEAYEIV+SMPFPNMVLWQTLLGACRTYG+V+MAE+ASRKLV
Subjt:  SGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLV

Query:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE
        EMGFISCGDFVLLSNVYA R+RWDDVGRVRDAMRRRDVKK PGFSYIEVKG MH F+YGD++HSSCREIYAKLDEI FRIK+ GY AETGNVLHDI EE+
Subjt:  EMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEE

Query:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        KEN LCYHSEKLAVAFGL+CTEEGTPI VIKNLRICGDCH VIKLISK YNREI +    R
Subjt:  KENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210658.4e-9736.58Show/hide
Query:  SSFSQIKQLQANLITNGQFQFCSSRTK--LLELCAISHFGDLAHAVHIFHHIRVP-STNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDALTCSF
        SS ++++Q+ A  I +G     +   K  +  L ++     +++A  +F  I  P +   WN +IRG A   +  +A   YR M  S G    D  T  F
Subjt:  SSFSQIKQLQANLITNGQFQFCSSRTK--LLELCAISHFGDLAHAVHIFHHIRVP-STNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDALTCSF

Query:  TLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEHMRPNEVTV
         +KA            IHS + R GF + + +Q +LL  YA  GD+  A K+FD+M + D+ +WN++I GFA+  +P EA+AL+  M   + ++P+  T+
Subjt:  TLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEHMRPNEVTV

Query:  QD-----------------------EKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM----------------SAGFFEQL--------GRSGMSPDAV
                                   L  N+   N+++D+YA+CG V++A  +F+ M                  GF ++            G+ P  +
Subjt:  QD-----------------------EKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM----------------SAGFFEQL--------GRSGMSPDAV

Query:  SYLAVLCACNHAGLVEDGVKLFNSMAQR-GLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPF-PNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFIS
        +++ +L AC+H G+V++G + F  M +   + P I+H+G +VDLL RAG++K+AYE + SMP  PN+V+W+TLLGAC  +G+ ++AE A  +++++    
Subjt:  SYLAVLCACNHAGLVEDGVKLFNSMAQR-GLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPF-PNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFIS

Query:  CGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEEKENALC
         GD+VLLSN+YA+ QRW DV ++R  M R  VKK+PG S +EV  ++H F+ GD++H     IYAKL E+  R++S GY  +  NV  D+ EEEKENA+ 
Subjt:  CGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEEKENALC

Query:  YHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        YHSEK+A+AF L  T E +PI V+KNLR+C DCH  IKL+SK+YNREIV+    R
Subjt:  YHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

B8YEK4 Pentatricopeptide repeat-containing protein OGR1, mitochondrial2.0e-13548.42Show/hide
Query:  YFDLLLQKCSSFSQIKQLQANLITNGQF-QFCSSRTKLLELCAIS-HFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGP--
        + + LL + +S     Q  A L+T+G        R + L+  A+S H   L HA+ +   +  P+TND NA +RG A S  PA +++    +AG   P  
Subjt:  YFDLLLQKCSSFSQIKQLQANLITNGQF-QFCSSRTKLLELCAIS-HFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGP--

Query:  -HRVDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRM--
          R DAL+ SF LKA AR       +Q+H+ + R G  ADV L TTLLD+YAK GDL  A+K+FDEM   D+A+WN+L+AG AQG+ P+ A+ALF R+  
Subjt:  -HRVDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRM--

Query:  ---EEDEHMRPNEVTV-----------------------QDEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFE------------NMSAGFFEQLGRSG--
           E      PNEVT+                       +   LD NV+VCN +IDMY+KCGS+ +A  VF             N +       G  G  
Subjt:  ---EEDEHMRPNEVTV-----------------------QDEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFE------------NMSAGFFEQLGRSG--

Query:  ----------MSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFP-NMVLWQTLLGACRTYGNVEM
                  + PD V+YLAVLC CNH+GLV+DG+++FNSM    +APN+KHYG++VDLLGRAGRL EAY+ V SMPFP ++VLWQTLLGA + +G VE+
Subjt:  ----------MSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFP-NMVLWQTLLGACRTYGNVEM

Query:  AELASRKLVEMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGN
        AELA+ KL E+G    GD+VLLSNVYA++ RW DVGRVRD MR  DV+K+PGFSY E+ G MH F+ GD+ H   +EIY  L++I  RI   GY  ET N
Subjt:  AELASRKLVEMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGN

Query:  VLHDIGEEEKENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        VLHDIGEEEK+ ALCYHSEKLA+AFGL  T  G  + VIKNLRICGDCH V KLISK Y R IVI    R
Subjt:  VLHDIGEEEKENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

Q9FX24 Pentatricopeptide repeat-containing protein At1g341607.4e-16256.45Show/hide
Query:  YFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAM----AGSNGP
        Y + ++QKC SFSQIKQLQ++ +T G FQ    R++LLE CAIS FGDL+ AV IF +I  P TNDWNA+IRG A S+ P+ A  WYR+M    + S+  
Subjt:  YFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAM----AGSNGP

Query:  HRVDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEED
         RVDALTCSFTLKACARAL  S   Q+H Q+ R G +AD LL TTLLDAY+K GDL  A KLFDEM   D+ASWNALIAG   G+R SEA+ L+KRM E 
Subjt:  HRVDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEED

Query:  EHMRPNEVTV--------------QDEKL-----DMNVQVCNIVIDMYAKCGSVDKAYWVFE------------NMSAGF------------FEQLGRSG
        E +R +EVTV              + E +     + NV V N  IDMY+KCG VDKAY VFE             M  GF            F++L  +G
Subjt:  EHMRPNEVTV--------------QDEKL-----DMNVQVCNIVIDMYAKCGSVDKAYWVFE------------NMSAGF------------FEQLGRSG

Query:  MSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMP-FPNMVLWQTLLGACRTYGNVEMAELASRKLVE
        + PD VSYLA L AC HAGLVE G+ +FN+MA +G+  N+KHYG VVDLL RAGRL+EA++I+ SM   P+ VLWQ+LLGA   Y +VEMAE+ASR++ E
Subjt:  MSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMP-FPNMVLWQTLLGACRTYGNVEMAELASRKLVE

Query:  MGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEEK
        MG  + GDFVLLSNVYA + RW DVGRVRD M  + VKKIPG SYIE KG +H F   D++H   REIY K+DEI+F+I+  GY A+TG VLHDIGEEEK
Subjt:  MGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEEK

Query:  ENALCYHSEKLAVAFGLTC---TEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVI
        ENALCYHSEKLAVA+GL      +E +P+ VI NLRICGDCH V K ISKIY REI++
Subjt:  ENALCYHSEKLAVAFGLTC---TEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVI

Q9LXY5 Pentatricopeptide repeat-containing protein At3g565503.3e-9334.78Show/hide
Query:  LLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHI-RVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDALT
        +LQ C+S  +++++ +++I NG     S    LL  CA+S  G L+HA  +F H    PST+DWN +IRG + S+ P N++++Y  M  S+   R D  T
Subjt:  LLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHI-RVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDALT

Query:  CSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEED-------
         +F LK+C R  +  + ++IH  + R GF  D ++ T+L+  Y+  G +++A K+FDEM   D+ SWN +I  F+     ++A++++KRM  +       
Subjt:  CSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEED-------

Query:  ---------EHMRPNEVTVQ------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM-----------------------SAGFFEQLGRSGMSPDA
                  H+    + V       D + +  V V N +IDMYAKCGS++ A  VF  M                       +  FF ++  SG+ P+A
Subjt:  ---------EHMRPNEVTVQ------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM-----------------------SAGFFEQLGRSGMSPDA

Query:  VSYLAVLCACNHAGLVEDGVKLFNSM-AQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIV-NSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFI
        +++L +L  C+H GLV++GV+ F  M +Q  L PN+KHYG +VDL GRAG+L+ + E++  S    + VLW+TLLG+C+ + N+E+ E+A +KLV++   
Subjt:  VSYLAVLCACNHAGLVEDGVKLFNSM-AQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIV-NSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFI

Query:  SCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGN-VLHDIGEEEKENA
        + GD+VL++++Y+          +R  +R  D++ +PG+S+IE+  ++H FV  D+ H     IY++L E+  R    GY  E  N     + +    +A
Subjt:  SCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGN-VLHDIGEEEKENA

Query:  LCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVI
           HSEKLA+A+GL  T  GT + + KNLR+C DCH+  K +SK +NREI++
Subjt:  LCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVI

Q9M9G6 Cleavage stimulating factor 648.7e-9445.92Show/hide
Query:  IGNIPYDATEEQLIEICQEVGPVVSFRLVIDRETGKPKGYGFCEYKDEETALSARRNLQGYEINGRQLRVDFAENDKGADRNREQGRGGPGLVANA----
        +GNIPYDATEEQL EIC EVGPVVSFRLV DRETGKPKGYGFCEYKDEETALSARRNLQ YEINGRQLRVDFAENDKG D+ R+Q +GGPGL +      
Subjt:  IGNIPYDATEEQLIEICQEVGPVVSFRLVIDRETGKPKGYGFCEYKDEETALSARRNLQGYEINGRQLRVDFAENDKGADRNREQGRGGPGLVANA----

Query:  -----GGPTAHGESSQHQPIGLHIAITAAAVMAGALGGAQAASNQNILQSATMANDPLTLHLAKLSRSQLTEIMSGLKVMATQNKDLARQLLLARPQLSK
             GGP    +S+ HQP+GLH+A TAA+V+AGALGG Q  S          A+DPL LHLAK+SRSQLTEI+S +K+MATQNK+ ARQLL++RPQL K
Subjt:  -----GGPTAHGESSQHQPIGLHIAITAAAVMAGALGGAQAASNQNILQSATMANDPLTLHLAKLSRSQLTEIMSGLKVMATQNKDLARQLLLARPQLSK

Query:  ALFLSQIMLGMVTPQVLQKPNLRQPSTHPQLPLHDIQQGQPSSLQIQPGLPPLAPNKMQTGFVPKIKETQMSLGPQNPLAPRQFSASPRPPLQSQIPLSH
        A+FL+Q+MLG+V+PQVLQ PN+ Q  +H  +    IQ  Q S   +   LPPLA            +  Q+S  P +    +Q S  P     SQIP   
Subjt:  ALFLSQIMLGMVTPQVLQKPNLRQPSTHPQLPLHDIQQGQPSSLQIQPGLPPLAPNKMQTGFVPKIKETQMSLGPQNPLAPRQFSASPRPPLQSQIPLSH

Query:  ALQGTLTGIPAGSSLPSINLQGNLSVRQQVQGPTFSPLKQHMHPPSQQYSGHGGAVIPGHNAQ--IANPEARPSLLPHPSLPDADFQPGPSTAYSASQIV
             L   P  SS+       N   R QV+       +Q + P S            G+++Q  + N   +PS +PH +LP++  Q G  T        
Subjt:  ALQGTLTGIPAGSSLPSINLQGNLSVRQQVQGPTFSPLKQHMHPPSQQYSGHGGAVIPGHNAQ--IANPEARPSLLPHPSLPDADFQPGPSTAYSASQIV

Query:  GGDVDKSSQVPLGVDGKKNILHG-FSGTINRPAKQMRLEDGKGRSFSAGGLSASLDINGSGQLGVASDPKLAGTQISEKTTSVLPPDVESALLQQVLNLT
                   + ++  K I  G    ++NRP+K M++ED +  S   G +S S+         + +  +   T IS        PDV+S LLQQV+NLT
Subjt:  GGDVDKSSQVPLGVDGKKNILHG-FSGTINRPAKQMRLEDGKGRSFSAGGLSASLDINGSGQLGVASDPKLAGTQISEKTTSVLPPDVESALLQQVLNLT

Query:  PEQLNSLPLEQRQQVIQLQQALRRDQI
        PEQL  L  EQ+Q+V++LQQAL++D +
Subjt:  PEQLNSLPLEQRQQVIQLQQALRRDQI

Arabidopsis top hitse value%identityAlignment
AT1G31920.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.3e-9434.23Show/hide
Query:  LLQKCSSFSQIKQLQANLITNGQFQFCS-SRTKLLELCAISHF-GDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDAL
        LL++C +  + KQ+ A  I    F   S S + +L  CA S +   + +A  IF  I  P T D+N +IRG         A+ +Y  M      +  D  
Subjt:  LLQKCSSFSQIKQLQANLITNGQFQFCS-SRTKLLELCAISHF-GDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDAL

Query:  TCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEHMRPN
        T    LKAC R  +  E  QIH Q+F+ G  ADV +Q +L++ Y + G+++L+  +F+++     ASW+++++  A     SE + LF+ M  + +++  
Subjt:  TCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEHMRPN

Query:  EVTVQDEKL-----------------------DMNVQVCNIVIDMYAKCGSVDKAYWVFENM-----------------------SAGFFEQLGRSGMSP
        E  +    L                       ++N+ V   ++DMY KCG +DKA  +F+ M                       +   F ++ + G+ P
Subjt:  EVTVQDEKL-----------------------DMNVQVCNIVIDMYAKCGSVDKAYWVFENM-----------------------SAGFFEQLGRSGMSP

Query:  DAVSYLAVLCACNHAGLVEDGVKLFNSMAQRG-LAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPF-PNMVLWQTLLGACRTYGNVEMAELASRKLVEMG
        D V Y++VL AC+H+GLV++G ++F  M + G + P  +HYG +VDLLGRAG L+EA E + S+P   N V+W+T L  CR   N+E+ ++A+++L+++ 
Subjt:  DAVSYLAVLCACNHAGLVEDGVKLFNSMAQRG-LAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPF-PNMVLWQTLLGACRTYGNVEMAELASRKLVEMG

Query:  FISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEEKEN
          + GD++L+SN+Y+  Q WDDV R R  +  + +K+ PGFS +E+KGK H FV  D++H  C+EIY  L ++++++K  GY+ +   +L ++ EEEK+ 
Subjt:  FISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEEKEN

Query:  ALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
         L  HS+K+A+AFGL  T  G+ I + +NLR+C DCH   K IS IY REIV+    R
Subjt:  ALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR

AT1G34160.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.3e-16356.45Show/hide
Query:  YFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAM----AGSNGP
        Y + ++QKC SFSQIKQLQ++ +T G FQ    R++LLE CAIS FGDL+ AV IF +I  P TNDWNA+IRG A S+ P+ A  WYR+M    + S+  
Subjt:  YFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAM----AGSNGP

Query:  HRVDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEED
         RVDALTCSFTLKACARAL  S   Q+H Q+ R G +AD LL TTLLDAY+K GDL  A KLFDEM   D+ASWNALIAG   G+R SEA+ L+KRM E 
Subjt:  HRVDALTCSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEED

Query:  EHMRPNEVTV--------------QDEKL-----DMNVQVCNIVIDMYAKCGSVDKAYWVFE------------NMSAGF------------FEQLGRSG
        E +R +EVTV              + E +     + NV V N  IDMY+KCG VDKAY VFE             M  GF            F++L  +G
Subjt:  EHMRPNEVTV--------------QDEKL-----DMNVQVCNIVIDMYAKCGSVDKAYWVFE------------NMSAGF------------FEQLGRSG

Query:  MSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMP-FPNMVLWQTLLGACRTYGNVEMAELASRKLVE
        + PD VSYLA L AC HAGLVE G+ +FN+MA +G+  N+KHYG VVDLL RAGRL+EA++I+ SM   P+ VLWQ+LLGA   Y +VEMAE+ASR++ E
Subjt:  MSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMP-FPNMVLWQTLLGACRTYGNVEMAELASRKLVE

Query:  MGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEEK
        MG  + GDFVLLSNVYA + RW DVGRVRD M  + VKKIPG SYIE KG +H F   D++H   REIY K+DEI+F+I+  GY A+TG VLHDIGEEEK
Subjt:  MGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEEK

Query:  ENALCYHSEKLAVAFGLTC---TEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVI
        ENALCYHSEKLAVA+GL      +E +P+ VI NLRICGDCH V K ISKIY REI++
Subjt:  ENALCYHSEKLAVAFGLTC---TEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVI

AT1G71800.1 cleavage stimulating factor 646.2e-9545.92Show/hide
Query:  IGNIPYDATEEQLIEICQEVGPVVSFRLVIDRETGKPKGYGFCEYKDEETALSARRNLQGYEINGRQLRVDFAENDKGADRNREQGRGGPGLVANA----
        +GNIPYDATEEQL EIC EVGPVVSFRLV DRETGKPKGYGFCEYKDEETALSARRNLQ YEINGRQLRVDFAENDKG D+ R+Q +GGPGL +      
Subjt:  IGNIPYDATEEQLIEICQEVGPVVSFRLVIDRETGKPKGYGFCEYKDEETALSARRNLQGYEINGRQLRVDFAENDKGADRNREQGRGGPGLVANA----

Query:  -----GGPTAHGESSQHQPIGLHIAITAAAVMAGALGGAQAASNQNILQSATMANDPLTLHLAKLSRSQLTEIMSGLKVMATQNKDLARQLLLARPQLSK
             GGP    +S+ HQP+GLH+A TAA+V+AGALGG Q  S          A+DPL LHLAK+SRSQLTEI+S +K+MATQNK+ ARQLL++RPQL K
Subjt:  -----GGPTAHGESSQHQPIGLHIAITAAAVMAGALGGAQAASNQNILQSATMANDPLTLHLAKLSRSQLTEIMSGLKVMATQNKDLARQLLLARPQLSK

Query:  ALFLSQIMLGMVTPQVLQKPNLRQPSTHPQLPLHDIQQGQPSSLQIQPGLPPLAPNKMQTGFVPKIKETQMSLGPQNPLAPRQFSASPRPPLQSQIPLSH
        A+FL+Q+MLG+V+PQVLQ PN+ Q  +H  +    IQ  Q S   +   LPPLA            +  Q+S  P +    +Q S  P     SQIP   
Subjt:  ALFLSQIMLGMVTPQVLQKPNLRQPSTHPQLPLHDIQQGQPSSLQIQPGLPPLAPNKMQTGFVPKIKETQMSLGPQNPLAPRQFSASPRPPLQSQIPLSH

Query:  ALQGTLTGIPAGSSLPSINLQGNLSVRQQVQGPTFSPLKQHMHPPSQQYSGHGGAVIPGHNAQ--IANPEARPSLLPHPSLPDADFQPGPSTAYSASQIV
             L   P  SS+       N   R QV+       +Q + P S            G+++Q  + N   +PS +PH +LP++  Q G  T        
Subjt:  ALQGTLTGIPAGSSLPSINLQGNLSVRQQVQGPTFSPLKQHMHPPSQQYSGHGGAVIPGHNAQ--IANPEARPSLLPHPSLPDADFQPGPSTAYSASQIV

Query:  GGDVDKSSQVPLGVDGKKNILHG-FSGTINRPAKQMRLEDGKGRSFSAGGLSASLDINGSGQLGVASDPKLAGTQISEKTTSVLPPDVESALLQQVLNLT
                   + ++  K I  G    ++NRP+K M++ED +  S   G +S S+         + +  +   T IS        PDV+S LLQQV+NLT
Subjt:  GGDVDKSSQVPLGVDGKKNILHG-FSGTINRPAKQMRLEDGKGRSFSAGGLSASLDINGSGQLGVASDPKLAGTQISEKTTSVLPPDVESALLQQVLNLT

Query:  PEQLNSLPLEQRQQVIQLQQALRRDQI
        PEQL  L  EQ+Q+V++LQQAL++D +
Subjt:  PEQLNSLPLEQRQQVIQLQQALRRDQI

AT3G56550.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-9434.78Show/hide
Query:  LLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHI-RVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDALT
        +LQ C+S  +++++ +++I NG     S    LL  CA+S  G L+HA  +F H    PST+DWN +IRG + S+ P N++++Y  M  S+   R D  T
Subjt:  LLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHI-RVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDALT

Query:  CSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEED-------
         +F LK+C R  +  + ++IH  + R GF  D ++ T+L+  Y+  G +++A K+FDEM   D+ SWN +I  F+     ++A++++KRM  +       
Subjt:  CSFTLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEED-------

Query:  ---------EHMRPNEVTVQ------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM-----------------------SAGFFEQLGRSGMSPDA
                  H+    + V       D + +  V V N +IDMYAKCGS++ A  VF  M                       +  FF ++  SG+ P+A
Subjt:  ---------EHMRPNEVTVQ------DEKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM-----------------------SAGFFEQLGRSGMSPDA

Query:  VSYLAVLCACNHAGLVEDGVKLFNSM-AQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIV-NSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFI
        +++L +L  C+H GLV++GV+ F  M +Q  L PN+KHYG +VDL GRAG+L+ + E++  S    + VLW+TLLG+C+ + N+E+ E+A +KLV++   
Subjt:  VSYLAVLCACNHAGLVEDGVKLFNSM-AQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIV-NSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFI

Query:  SCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGN-VLHDIGEEEKENA
        + GD+VL++++Y+          +R  +R  D++ +PG+S+IE+  ++H FV  D+ H     IY++L E+  R    GY  E  N     + +    +A
Subjt:  SCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGN-VLHDIGEEEKENA

Query:  LCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVI
           HSEKLA+A+GL  T  GT + + KNLR+C DCH+  K +SK +NREI++
Subjt:  LCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVI

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.9e-9836.58Show/hide
Query:  SSFSQIKQLQANLITNGQFQFCSSRTK--LLELCAISHFGDLAHAVHIFHHIRVP-STNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDALTCSF
        SS ++++Q+ A  I +G     +   K  +  L ++     +++A  +F  I  P +   WN +IRG A   +  +A   YR M  S G    D  T  F
Subjt:  SSFSQIKQLQANLITNGQFQFCSSRTK--LLELCAISHFGDLAHAVHIFHHIRVP-STNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDALTCSF

Query:  TLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEHMRPNEVTV
         +KA            IHS + R GF + + +Q +LL  YA  GD+  A K+FD+M + D+ +WN++I GFA+  +P EA+AL+  M   + ++P+  T+
Subjt:  TLKACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEHMRPNEVTV

Query:  QD-----------------------EKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM----------------SAGFFEQL--------GRSGMSPDAV
                                   L  N+   N+++D+YA+CG V++A  +F+ M                  GF ++            G+ P  +
Subjt:  QD-----------------------EKLDMNVQVCNIVIDMYAKCGSVDKAYWVFENM----------------SAGFFEQL--------GRSGMSPDAV

Query:  SYLAVLCACNHAGLVEDGVKLFNSMAQR-GLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPF-PNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFIS
        +++ +L AC+H G+V++G + F  M +   + P I+H+G +VDLL RAG++K+AYE + SMP  PN+V+W+TLLGAC  +G+ ++AE A  +++++    
Subjt:  SYLAVLCACNHAGLVEDGVKLFNSMAQR-GLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPF-PNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFIS

Query:  CGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEEKENALC
         GD+VLLSN+YA+ QRW DV ++R  M R  VKK+PG S +EV  ++H F+ GD++H     IYAKL E+  R++S GY  +  NV  D+ EEEKENA+ 
Subjt:  CGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAAETGNVLHDIGEEEKENALC

Query:  YHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR
        YHSEK+A+AF L  T E +PI V+KNLR+C DCH  IKL+SK+YNREIV+    R
Subjt:  YHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTACTTCGACCTTCTGCTACAGAAATGTTCTTCGTTCTCGCAAATCAAGCAACTCCAAGCAAATCTCATAACCAATGGCCAATTCCAGTTCTGTTCCTCACGCAC
CAAGCTTCTCGAGCTCTGCGCCATCTCCCACTTCGGGGACCTTGCTCATGCCGTCCATATCTTCCACCATATCAGGGTCCCCTCCACCAATGATTGGAACGCCGTCATTC
GAGGCACCGCCCTGAGCGCCGATCCCGCAAATGCTGTTGTCTGGTACAGGGCCATGGCTGGGTCTAATGGGCCTCACAGAGTTGACGCTCTCACCTGCTCGTTTACTCTG
AAAGCCTGTGCCCGCGCGCTGGCTCGTTCTGAAGCGATGCAAATACATTCGCAGCTTTTTCGATTTGGGTTCAATGCAGATGTTCTCCTGCAAACTACCTTGCTTGATGC
GTACGCAAAAGTTGGCGATCTTGATCTTGCCCAGAAGCTGTTCGACGAAATGCGGCAACCAGATATTGCCTCGTGGAACGCATTAATTGCTGGGTTTGCTCAGGGAAGTC
GACCAAGCGAAGCTATAGCCTTGTTTAAGAGAATGGAGGAGGATGAACATATGAGACCCAATGAAGTAACCGTTCAAGACGAGAAGTTAGACATGAATGTGCAAGTTTGT
AACATCGTTATTGATATGTATGCTAAATGTGGATCTGTGGATAAGGCTTATTGGGTGTTTGAGAACATGAGCGCTGGATTTTTTGAACAGTTGGGTCGATCTGGAATGTC
TCCTGATGCAGTATCATATTTAGCCGTGCTATGCGCATGCAACCATGCAGGACTTGTAGAGGATGGGGTTAAGCTGTTCAATTCAATGGCGCAAAGGGGGCTGGCGCCAA
ATATAAAGCATTATGGAAGTGTGGTTGATTTGCTGGGTCGAGCTGGTCGTCTCAAAGAAGCTTATGAAATTGTAAATTCAATGCCTTTCCCTAATATGGTACTCTGGCAG
ACTTTGCTTGGTGCTTGCAGGACTTATGGGAATGTAGAAATGGCAGAACTGGCATCAAGAAAGTTAGTAGAGATGGGATTTATTAGCTGTGGTGATTTTGTGTTGTTGTC
TAATGTCTATGCCACTCGTCAGAGATGGGATGATGTTGGGAGAGTTAGGGATGCCATGAGAAGAAGGGATGTGAAGAAGATACCGGGATTTAGTTACATAGAAGTAAAGG
GTAAGATGCACAACTTTGTATATGGTGATCAAACCCATTCGAGTTGCCGTGAGATCTATGCAAAGCTTGATGAGATCAAGTTCAGGATCAAATCCTACGGATATGCAGCT
GAAACTGGCAATGTATTGCATGATATTGGAGAGGAGGAGAAGGAGAACGCATTATGTTACCACAGTGAGAAGCTTGCCGTGGCTTTTGGATTAACTTGTACAGAAGAGGG
AACCCCGATTCATGTGATAAAGAATTTAAGGATTTGTGGGGATTGTCATGCTGTGATTAAACTAATATCAAAGATTTATAATCGAGAAATTGTTATAATGCAAATGGGGC
GAGTCGACGGAGGAAGAAGGAGGCTGAGAATAACGGAATATAGAACTTGGAACTGGGATAGAGCTCCTTTGGTACGTTCATACAGAAAATGGCTATGGACAACTGCAGTC
TGCAGTGAATCTGATTCTTTGAACTTCGGAACAAATTTCTTCAGCGTCGCTGTCGATTTTGTCCAGGAATACCGAGTTTTGTTTTCTTCATTCCGTGATTTAGGTATTCT
TGGAAAGGGATTAGATGTCAACGGTGTCGTCGGCGGTCATTCTTTGTTAAAAACTGGTGACAGACAATGGTTCTTTTTCAGTCCTAGGGAACGCAGATATCCAAATGCAG
CAAGATTGAGTAGAGCTACAAGACATGGCTACTGGAAGGCAACCGGAAAGGATCGAATAATAAGGTGCAATTCACGTAATGTTGGAGTGAAGAAGACCCTGGTATTCTAT
CAAGGCCGTGCCCCAAATGGTGAGCGCACTGACTGGGTTATGCACGAGTATACCTTGGATGAAGATGAGCTTAAGAGGTGTAAGAATGTAAAGGATTACTATGCGCTCTA
CAAACTTTATAAGAAAAGTGGTCCGGGCCCTAAAAATGGTGAGCAATATGGAGCACCATTTAGAGAAGAAGATTGGGCTGATGGCATATGCCAACATTATAATGACTCTG
ACGGTCAGGAGCCACAAAGAGAGTGGTCTACACTTCTGCCTTCATCAGATGACATTGAGGAGTTCATGAAACAAATTGCAAATGATCCTGCGCTAGAGTTGCCTTCGGTC
AATGATTATGGTCAAGTTGGCTCTGCTGTGCAGGTTGATGACGAAGAAGAAACTGTAAGTACTATGGTTGATGCTTACTCTCGGGATCACATACTTCCTGAAGCGGATAA
AGTATTTCACTTAAGTGGCCAGCCCAGTGATTTGCATGCAAGCTTCGACTTTACCCAGTCAGGTATCTCTGCGTTGCAATCACTTGAGGGTGAGATATCGTCGGCTCCCA
AATGTTGTGAGGACTCCATTATACTACAAGAAGAAGATTTCTTGGAGATTGATGATCTCATTGGTCCCGAACCAACTTCTGTGACAAATGACAACCCCTTGGAAAATGTG
CCACCTTCAGAGTTGGATGGATTGAGTGAGTTAGACCTGTTCCATGATGCAGATATGTTTCTTCGTGACTTGGGACCCTTCAACCATGAAACAATTTTAGATCCGTATTT
GAATGCTCATGATATTGATGTTGCGAACAATTCGAATGGCCATTTGCAATCTGATCCTTATGTAGAAAATCAGATGGACAATCGGTTCTGGAGGAACAATGAAACAGAAA
ATGCGTTTAACTACCCAGAATCACATCGACAGTTTGTTACTCAACCAAATTTAGGTGTGGGATATGAATCTGTAAATTCTGCAGCAGCAGGAACAAGGGAAAACCATAGA
GCAAACGAAGGAGGTGGTTCTGCTAGTTGGTTCTCTTCCAATTTGTGGGCCTTCGTGGAGTCGATACCAACCACTCCTGCGTCAGCTTCAGAGAATGTCAACCGTGCTTT
TGAGCGTATGTCTAGTTTTAGCAGATTGAGACTAAATACCCTAAACACCAATGTCGCCGTAGGTAATCCCGATACAGGCGCAAGGAGAACGGGACAAGCTAACTCGGTTT
TCGTACCTTTGGTTCTAAGGATCTTCTCGGAGCGTTTTCCTTCTTCGTCGCCGTCGCCGTGTTTTGTTCGGAACCTTCAACTCTCATTCGATTTTCCTAAATCTTTGTTC
TCGAAGACGAAGAGCCACTCAGTTTCGAACTCAATTGATCTGCGTTCCATCTGCGGAGCTTCTGTGCAGAGCATCAGTGGGAAAAGGATAATTGGGAATATACCATATGA
TGCTACTGAAGAGCAGCTTATAGAAATCTGCCAAGAAGTTGGGCCCGTAGTGTCTTTCAGATTAGTTATCGACAGAGAAACTGGAAAACCAAAAGGCTACGGGTTTTGTG
AGTACAAGGATGAAGAAACGGCTCTGAGTGCTCGTCGTAATCTTCAAGGTTATGAAATTAATGGCCGGCAGTTAAGAGTTGATTTTGCTGAGAATGACAAGGGTGCGGAC
AGAAATAGGGAACAGGGTCGTGGTGGACCTGGATTGGTTGCAAATGCTGGCGGCCCAACAGCGCATGGGGAATCCAGTCAACATCAACCAATTGGTCTTCATATAGCTAT
AACTGCTGCAGCCGTCATGGCAGGAGCTCTTGGTGGTGCACAAGCTGCTAGTAATCAGAATATTTTGCAGAGTGCAACAATGGCCAATGATCCTTTAACTCTACATCTGG
CTAAACTTTCTAGGAGTCAGCTTACTGAAATTATGTCAGGGCTAAAGGTAATGGCTACACAAAACAAGGATTTGGCTCGTCAGCTATTGCTGGCAAGACCACAATTGTCA
AAAGCTCTTTTCCTGTCACAGATAATGCTTGGAATGGTCACACCACAAGTGTTGCAGAAGCCTAATTTACGGCAGCCTTCTACTCATCCTCAGCTTCCTTTGCATGACAT
TCAGCAGGGTCAGCCATCATCTCTTCAAATTCAACCAGGGCTGCCCCCTCTTGCACCTAACAAGATGCAGACTGGTTTTGTACCTAAAATAAAAGAAACTCAAATGTCTC
TAGGGCCTCAAAATCCTCTGGCTCCGCGTCAGTTTTCTGCATCCCCACGGCCTCCACTTCAGTCTCAAATTCCGCTATCACATGCTTTACAAGGCACTTTGACTGGGATA
CCTGCAGGCTCATCACTTCCTTCCATAAATTTGCAGGGAAACCTATCTGTTAGACAGCAGGTTCAAGGGCCCACCTTCTCTCCTTTAAAGCAGCACATGCATCCCCCTTC
TCAACAATATTCAGGGCATGGTGGAGCTGTAATTCCGGGGCATAATGCTCAGATTGCTAATCCAGAAGCTAGACCTTCTCTTTTGCCTCACCCTTCCTTACCAGATGCAG
ACTTTCAGCCCGGCCCCTCCACAGCGTACAGTGCATCTCAGATAGTGGGCGGTGATGTTGATAAGTCTTCTCAAGTTCCTCTTGGTGTAGATGGTAAAAAAAATATACTT
CATGGCTTTTCAGGAACAATCAATCGCCCTGCAAAGCAGATGAGATTAGAGGATGGAAAAGGCCGCTCTTTCTCAGCTGGGGGTCTTAGTGCATCATTAGATATCAATGG
ATCTGGACAACTTGGAGTTGCCTCAGATCCCAAATTGGCGGGGACGCAAATCTCTGAGAAAACAACTTCAGTGCTTCCACCCGATGTCGAATCCGCACTACTGCAACAAG
TTTTGAATCTAACTCCTGAACAGCTTAACTCGTTGCCGCTCGAACAGAGACAGCAAGTTATTCAGCTCCAGCAAGCACTGCGCAGAGATCAAATTCGGCCATCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCTACTTCGACCTTCTGCTACAGAAATGTTCTTCGTTCTCGCAAATCAAGCAACTCCAAGCAAATCTCATAACCAATGGCCAATTCCAGTTCTGTTCCTCACGCAC
CAAGCTTCTCGAGCTCTGCGCCATCTCCCACTTCGGGGACCTTGCTCATGCCGTCCATATCTTCCACCATATCAGGGTCCCCTCCACCAATGATTGGAACGCCGTCATTC
GAGGCACCGCCCTGAGCGCCGATCCCGCAAATGCTGTTGTCTGGTACAGGGCCATGGCTGGGTCTAATGGGCCTCACAGAGTTGACGCTCTCACCTGCTCGTTTACTCTG
AAAGCCTGTGCCCGCGCGCTGGCTCGTTCTGAAGCGATGCAAATACATTCGCAGCTTTTTCGATTTGGGTTCAATGCAGATGTTCTCCTGCAAACTACCTTGCTTGATGC
GTACGCAAAAGTTGGCGATCTTGATCTTGCCCAGAAGCTGTTCGACGAAATGCGGCAACCAGATATTGCCTCGTGGAACGCATTAATTGCTGGGTTTGCTCAGGGAAGTC
GACCAAGCGAAGCTATAGCCTTGTTTAAGAGAATGGAGGAGGATGAACATATGAGACCCAATGAAGTAACCGTTCAAGACGAGAAGTTAGACATGAATGTGCAAGTTTGT
AACATCGTTATTGATATGTATGCTAAATGTGGATCTGTGGATAAGGCTTATTGGGTGTTTGAGAACATGAGCGCTGGATTTTTTGAACAGTTGGGTCGATCTGGAATGTC
TCCTGATGCAGTATCATATTTAGCCGTGCTATGCGCATGCAACCATGCAGGACTTGTAGAGGATGGGGTTAAGCTGTTCAATTCAATGGCGCAAAGGGGGCTGGCGCCAA
ATATAAAGCATTATGGAAGTGTGGTTGATTTGCTGGGTCGAGCTGGTCGTCTCAAAGAAGCTTATGAAATTGTAAATTCAATGCCTTTCCCTAATATGGTACTCTGGCAG
ACTTTGCTTGGTGCTTGCAGGACTTATGGGAATGTAGAAATGGCAGAACTGGCATCAAGAAAGTTAGTAGAGATGGGATTTATTAGCTGTGGTGATTTTGTGTTGTTGTC
TAATGTCTATGCCACTCGTCAGAGATGGGATGATGTTGGGAGAGTTAGGGATGCCATGAGAAGAAGGGATGTGAAGAAGATACCGGGATTTAGTTACATAGAAGTAAAGG
GTAAGATGCACAACTTTGTATATGGTGATCAAACCCATTCGAGTTGCCGTGAGATCTATGCAAAGCTTGATGAGATCAAGTTCAGGATCAAATCCTACGGATATGCAGCT
GAAACTGGCAATGTATTGCATGATATTGGAGAGGAGGAGAAGGAGAACGCATTATGTTACCACAGTGAGAAGCTTGCCGTGGCTTTTGGATTAACTTGTACAGAAGAGGG
AACCCCGATTCATGTGATAAAGAATTTAAGGATTTGTGGGGATTGTCATGCTGTGATTAAACTAATATCAAAGATTTATAATCGAGAAATTGTTATAATGCAAATGGGGC
GAGTCGACGGAGGAAGAAGGAGGCTGAGAATAACGGAATATAGAACTTGGAACTGGGATAGAGCTCCTTTGGTACGTTCATACAGAAAATGGCTATGGACAACTGCAGTC
TGCAGTGAATCTGATTCTTTGAACTTCGGAACAAATTTCTTCAGCGTCGCTGTCGATTTTGTCCAGGAATACCGAGTTTTGTTTTCTTCATTCCGTGATTTAGGTATTCT
TGGAAAGGGATTAGATGTCAACGGTGTCGTCGGCGGTCATTCTTTGTTAAAAACTGGTGACAGACAATGGTTCTTTTTCAGTCCTAGGGAACGCAGATATCCAAATGCAG
CAAGATTGAGTAGAGCTACAAGACATGGCTACTGGAAGGCAACCGGAAAGGATCGAATAATAAGGTGCAATTCACGTAATGTTGGAGTGAAGAAGACCCTGGTATTCTAT
CAAGGCCGTGCCCCAAATGGTGAGCGCACTGACTGGGTTATGCACGAGTATACCTTGGATGAAGATGAGCTTAAGAGGTGTAAGAATGTAAAGGATTACTATGCGCTCTA
CAAACTTTATAAGAAAAGTGGTCCGGGCCCTAAAAATGGTGAGCAATATGGAGCACCATTTAGAGAAGAAGATTGGGCTGATGGCATATGCCAACATTATAATGACTCTG
ACGGTCAGGAGCCACAAAGAGAGTGGTCTACACTTCTGCCTTCATCAGATGACATTGAGGAGTTCATGAAACAAATTGCAAATGATCCTGCGCTAGAGTTGCCTTCGGTC
AATGATTATGGTCAAGTTGGCTCTGCTGTGCAGGTTGATGACGAAGAAGAAACTGTAAGTACTATGGTTGATGCTTACTCTCGGGATCACATACTTCCTGAAGCGGATAA
AGTATTTCACTTAAGTGGCCAGCCCAGTGATTTGCATGCAAGCTTCGACTTTACCCAGTCAGGTATCTCTGCGTTGCAATCACTTGAGGGTGAGATATCGTCGGCTCCCA
AATGTTGTGAGGACTCCATTATACTACAAGAAGAAGATTTCTTGGAGATTGATGATCTCATTGGTCCCGAACCAACTTCTGTGACAAATGACAACCCCTTGGAAAATGTG
CCACCTTCAGAGTTGGATGGATTGAGTGAGTTAGACCTGTTCCATGATGCAGATATGTTTCTTCGTGACTTGGGACCCTTCAACCATGAAACAATTTTAGATCCGTATTT
GAATGCTCATGATATTGATGTTGCGAACAATTCGAATGGCCATTTGCAATCTGATCCTTATGTAGAAAATCAGATGGACAATCGGTTCTGGAGGAACAATGAAACAGAAA
ATGCGTTTAACTACCCAGAATCACATCGACAGTTTGTTACTCAACCAAATTTAGGTGTGGGATATGAATCTGTAAATTCTGCAGCAGCAGGAACAAGGGAAAACCATAGA
GCAAACGAAGGAGGTGGTTCTGCTAGTTGGTTCTCTTCCAATTTGTGGGCCTTCGTGGAGTCGATACCAACCACTCCTGCGTCAGCTTCAGAGAATGTCAACCGTGCTTT
TGAGCGTATGTCTAGTTTTAGCAGATTGAGACTAAATACCCTAAACACCAATGTCGCCGTAGGTAATCCCGATACAGGCGCAAGGAGAACGGGACAAGCTAACTCGGTTT
TCGTACCTTTGGTTCTAAGGATCTTCTCGGAGCGTTTTCCTTCTTCGTCGCCGTCGCCGTGTTTTGTTCGGAACCTTCAACTCTCATTCGATTTTCCTAAATCTTTGTTC
TCGAAGACGAAGAGCCACTCAGTTTCGAACTCAATTGATCTGCGTTCCATCTGCGGAGCTTCTGTGCAGAGCATCAGTGGGAAAAGGATAATTGGGAATATACCATATGA
TGCTACTGAAGAGCAGCTTATAGAAATCTGCCAAGAAGTTGGGCCCGTAGTGTCTTTCAGATTAGTTATCGACAGAGAAACTGGAAAACCAAAAGGCTACGGGTTTTGTG
AGTACAAGGATGAAGAAACGGCTCTGAGTGCTCGTCGTAATCTTCAAGGTTATGAAATTAATGGCCGGCAGTTAAGAGTTGATTTTGCTGAGAATGACAAGGGTGCGGAC
AGAAATAGGGAACAGGGTCGTGGTGGACCTGGATTGGTTGCAAATGCTGGCGGCCCAACAGCGCATGGGGAATCCAGTCAACATCAACCAATTGGTCTTCATATAGCTAT
AACTGCTGCAGCCGTCATGGCAGGAGCTCTTGGTGGTGCACAAGCTGCTAGTAATCAGAATATTTTGCAGAGTGCAACAATGGCCAATGATCCTTTAACTCTACATCTGG
CTAAACTTTCTAGGAGTCAGCTTACTGAAATTATGTCAGGGCTAAAGGTAATGGCTACACAAAACAAGGATTTGGCTCGTCAGCTATTGCTGGCAAGACCACAATTGTCA
AAAGCTCTTTTCCTGTCACAGATAATGCTTGGAATGGTCACACCACAAGTGTTGCAGAAGCCTAATTTACGGCAGCCTTCTACTCATCCTCAGCTTCCTTTGCATGACAT
TCAGCAGGGTCAGCCATCATCTCTTCAAATTCAACCAGGGCTGCCCCCTCTTGCACCTAACAAGATGCAGACTGGTTTTGTACCTAAAATAAAAGAAACTCAAATGTCTC
TAGGGCCTCAAAATCCTCTGGCTCCGCGTCAGTTTTCTGCATCCCCACGGCCTCCACTTCAGTCTCAAATTCCGCTATCACATGCTTTACAAGGCACTTTGACTGGGATA
CCTGCAGGCTCATCACTTCCTTCCATAAATTTGCAGGGAAACCTATCTGTTAGACAGCAGGTTCAAGGGCCCACCTTCTCTCCTTTAAAGCAGCACATGCATCCCCCTTC
TCAACAATATTCAGGGCATGGTGGAGCTGTAATTCCGGGGCATAATGCTCAGATTGCTAATCCAGAAGCTAGACCTTCTCTTTTGCCTCACCCTTCCTTACCAGATGCAG
ACTTTCAGCCCGGCCCCTCCACAGCGTACAGTGCATCTCAGATAGTGGGCGGTGATGTTGATAAGTCTTCTCAAGTTCCTCTTGGTGTAGATGGTAAAAAAAATATACTT
CATGGCTTTTCAGGAACAATCAATCGCCCTGCAAAGCAGATGAGATTAGAGGATGGAAAAGGCCGCTCTTTCTCAGCTGGGGGTCTTAGTGCATCATTAGATATCAATGG
ATCTGGACAACTTGGAGTTGCCTCAGATCCCAAATTGGCGGGGACGCAAATCTCTGAGAAAACAACTTCAGTGCTTCCACCCGATGTCGAATCCGCACTACTGCAACAAG
TTTTGAATCTAACTCCTGAACAGCTTAACTCGTTGCCGCTCGAACAGAGACAGCAAGTTATTCAGCTCCAGCAAGCACTGCGCAGAGATCAAATTCGGCCATCCTAG
Protein sequenceShow/hide protein sequence
MAYFDLLLQKCSSFSQIKQLQANLITNGQFQFCSSRTKLLELCAISHFGDLAHAVHIFHHIRVPSTNDWNAVIRGTALSADPANAVVWYRAMAGSNGPHRVDALTCSFTL
KACARALARSEAMQIHSQLFRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMRQPDIASWNALIAGFAQGSRPSEAIALFKRMEEDEHMRPNEVTVQDEKLDMNVQVC
NIVIDMYAKCGSVDKAYWVFENMSAGFFEQLGRSGMSPDAVSYLAVLCACNHAGLVEDGVKLFNSMAQRGLAPNIKHYGSVVDLLGRAGRLKEAYEIVNSMPFPNMVLWQ
TLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYATRQRWDDVGRVRDAMRRRDVKKIPGFSYIEVKGKMHNFVYGDQTHSSCREIYAKLDEIKFRIKSYGYAA
ETGNVLHDIGEEEKENALCYHSEKLAVAFGLTCTEEGTPIHVIKNLRICGDCHAVIKLISKIYNREIVIMQMGRVDGGRRRLRITEYRTWNWDRAPLVRSYRKWLWTTAV
CSESDSLNFGTNFFSVAVDFVQEYRVLFSSFRDLGILGKGLDVNGVVGGHSLLKTGDRQWFFFSPRERRYPNAARLSRATRHGYWKATGKDRIIRCNSRNVGVKKTLVFY
QGRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFREEDWADGICQHYNDSDGQEPQREWSTLLPSSDDIEEFMKQIANDPALELPSV
NDYGQVGSAVQVDDEEETVSTMVDAYSRDHILPEADKVFHLSGQPSDLHASFDFTQSGISALQSLEGEISSAPKCCEDSIILQEEDFLEIDDLIGPEPTSVTNDNPLENV
PPSELDGLSELDLFHDADMFLRDLGPFNHETILDPYLNAHDIDVANNSNGHLQSDPYVENQMDNRFWRNNETENAFNYPESHRQFVTQPNLGVGYESVNSAAAGTRENHR
ANEGGGSASWFSSNLWAFVESIPTTPASASENVNRAFERMSSFSRLRLNTLNTNVAVGNPDTGARRTGQANSVFVPLVLRIFSERFPSSSPSPCFVRNLQLSFDFPKSLF
SKTKSHSVSNSIDLRSICGASVQSISGKRIIGNIPYDATEEQLIEICQEVGPVVSFRLVIDRETGKPKGYGFCEYKDEETALSARRNLQGYEINGRQLRVDFAENDKGAD
RNREQGRGGPGLVANAGGPTAHGESSQHQPIGLHIAITAAAVMAGALGGAQAASNQNILQSATMANDPLTLHLAKLSRSQLTEIMSGLKVMATQNKDLARQLLLARPQLS
KALFLSQIMLGMVTPQVLQKPNLRQPSTHPQLPLHDIQQGQPSSLQIQPGLPPLAPNKMQTGFVPKIKETQMSLGPQNPLAPRQFSASPRPPLQSQIPLSHALQGTLTGI
PAGSSLPSINLQGNLSVRQQVQGPTFSPLKQHMHPPSQQYSGHGGAVIPGHNAQIANPEARPSLLPHPSLPDADFQPGPSTAYSASQIVGGDVDKSSQVPLGVDGKKNIL
HGFSGTINRPAKQMRLEDGKGRSFSAGGLSASLDINGSGQLGVASDPKLAGTQISEKTTSVLPPDVESALLQQVLNLTPEQLNSLPLEQRQQVIQLQQALRRDQIRPS