; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G19800 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G19800
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr01:31935417..31938120
RNA-Seq ExpressionClc01G19800
SyntenyClc01G19800
Gene Ontology termsGO:0016554 - cytidine to uridine editing (biological process)
GO:1900865 - chloroplast RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143583.2 pentatricopeptide repeat-containing protein At5g66520 [Cucumis sativus]0.0e+0088.73Show/hide
Query:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL
        MF+L AESPLQSTWA    LL NCSNMKQLKQI AQMIKT I+TEPKLATKFLTLCTSPH GDLLYAQ+VFNGITSPNTFMWNAIIRAY NS+EPELAFL
Subjt:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL

Query:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY
         YQQMLSSSVPHNSYTFPF+L+ACRNL AMGEALQVHGLV KLGFGSDVFALNALLHVY LCG+I+ ARQLFDNIPERD VSWNIMIDGYIKSGDVKTAY
Subjt:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY

Query:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA
        G+FLDMPLKNVVSWTSLISGLVEAG SVEAL+LCYEMQ+AGFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMY+KCG+MEEA
Subjt:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA

Query:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL
          +FGKLK +QKDVY+WTAMIDGFAIHGRGVEALEWFNRM+REGIRPNSITFTAVLRACSY GLVEEGK LF+SM+C YN++PSIEH+GCMVDLLGR+G 
Subjt:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL

Query:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
        LD+AKELIKKMPMKP+AVIWGALLKACWIHRDFL+GSQ+GAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMK+L VPI PGKSS+TLNG+VHEF
Subjt:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF

Query:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII
        LAGHQDHPQMEQI LKLKQ+AERLRQDE YEP TKDLLLDLENEEKET MAQHSEKLAIAFGLINTKPG TIRVIKNLR+CRDCHTVAKL+SQIY REII
Subjt:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII

Query:  MRDRVRFHHFRDGNCSCKDYW
        MRDRVRFHHFRDG+CSCKDYW
Subjt:  MRDRVRFHHFRDGNCSCKDYW

XP_008440725.1 PREDICTED: pentatricopeptide repeat-containing protein At5g66520 [Cucumis melo]0.0e+0088.89Show/hide
Query:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL
        MF+LKAESPLQSTW    +LL NCSNMKQLKQI AQMIKT I++EPKLATKFLTLCTSPH GDLLYAQ+VFNGITSPNT MWNAIIRAY NS EPELAFL
Subjt:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL

Query:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY
        LYQQMLSSSVPHNSYTFPF+LKACRNLSA+GEALQVHGLV KLGFGSDVFALNALLHVY LCG+I YARQ+FDNIPERD VSWNIMIDGYIKSGDVKTAY
Subjt:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY

Query:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA
        GIFLDMP KNVVSWTSLISGLV AGLSV+AL+LCYEMQ+AGFELDGVAIA LLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMY+KCG+MEEA
Subjt:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA

Query:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL
         R+FGKLK DQKDV +WTAMIDGFAIHGRGVEALEWF+ M+REGIRPNSITFTAVLRACSY GLVEEGK LF+SM+CLYNLSPSIEH+GCMVDLLGR+G 
Subjt:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL

Query:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
        L++AKELIK MPMKPNAVIWGA LKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAA+GKWKEAAEVRLKMKNL VPI PGKSSITLNG+VHEF
Subjt:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF

Query:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII
        LAGHQDHPQMEQIHLKLKQ+AERLRQDE YEP TKDLLLDLENEEKET +AQHSEKLAIAFGLINTKPG TIRV+KNLR+CRDCHTVAKL+SQIYCREII
Subjt:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII

Query:  MRDRVRFHHFRDGNCSCKDYW
        MRDRVRFHHFRDG+CSCKDYW
Subjt:  MRDRVRFHHFRDGNCSCKDYW

XP_022949774.1 pentatricopeptide repeat-containing protein At5g66520 [Cucurbita moschata]0.0e+0089.05Show/hide
Query:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL
        MF+LKAESP+QSTWAQTMSLL NCSNMKQLK+IHAQMI+T   TEPKLATK LTLC SPHFGDL YAQ+VFNGI+SP TFMWNA+IRAY NSNEPELAFL
Subjt:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL

Query:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY
        LY+QMLSSSVPHNSYTFPF+LKACRN SAM EALQVHGLV KLGFGSDVFALNALLHVY LCGDI YARQLFDNIPERD+VSWNIMIDGYIKSGDVKTAY
Subjt:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY

Query:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA
        G+FLDMPLKNVVSWTSLISGLVEAGL+VEAL+LC+EMQ+AGFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGVH+DRV GCALVNMYLKCG+MEEA
Subjt:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA

Query:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL
         R FGKLK DQKDVYVWTAMIDGFAIHGRGVEALEWF RM REGIRPNSITFTAVLRACSYAGLVEEGKVLFESM  +YNLSPSIEH+GCMVDLLGRAGL
Subjt:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL

Query:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
        L++AKELIK MPM+PNA+IWGALLKAC IHRDFLVG QIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLR+PIPPGKSSITLNGVVHEF
Subjt:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF

Query:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII
        LAGHQDHPQMEQI  KL QV ERLRQ E YEP TKDLLLDLE+E KET +AQHSEKLAIAFGLINTKPG+TIRV+KNLRVC DCH VAKLISQIY REII
Subjt:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII

Query:  MRDRVRFHHFRDGNCSCKDYW
        MRDRVRFHHFR GNCSC DYW
Subjt:  MRDRVRFHHFRDGNCSCKDYW

XP_022978438.1 pentatricopeptide repeat-containing protein At5g66520 [Cucurbita maxima]0.0e+0089.21Show/hide
Query:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL
        MF+LKAESP+QSTWAQTMSLL NCSNMKQLK+IHAQMI+T   TEPKLATK LTLCTSPHFGDL YAQ+VFNGI+SP TFMWNA+IRAY NSNEPELAFL
Subjt:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL

Query:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY
        LY+QMLSSSVPHNSYTFPF+LKACRN SAM EALQVHGLV KLGFGSDVFALNALLHVY LCGDI YARQLFDNIPERD+VSWNIMIDGYIKSGDVKTAY
Subjt:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY

Query:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA
        G+FLDMPLKNVVSWTSLISGLVEAGL+VEAL+LC+EMQ+AGFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGVH+DRV GCALVNMYLKCG+MEEA
Subjt:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA

Query:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL
         + FGKLK DQKDVYVWTAMIDGFAIHGRGVEALEWF RM REGIRPNSITFTAVLRACSYAGLVEEGKVLFESM  +Y LSPSIEH+GCMVDLLGRAGL
Subjt:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL

Query:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
        L++AKELIK MPMKPNA+IWGALLKAC IHRDFLVG QIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
Subjt:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF

Query:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII
        LAGHQDHPQMEQI  KL QV ERLRQ E YEP TKDLLLDLENE KET +AQHSEKLAIAFGLINTKPG+TIRV+KNLRVC DCH VAKLIS+IY REII
Subjt:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII

Query:  MRDRVRFHHFRDGNCSCKDYW
        MRDRVRFHHFR G+CSCKDYW
Subjt:  MRDRVRFHHFRDGNCSCKDYW

XP_038882528.1 pentatricopeptide repeat-containing protein At5g66520 [Benincasa hispida]0.0e+0091.3Show/hide
Query:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL
        MF+LKA+SPLQSTWAQTMSLL NCSNMKQLK+IHAQMIKTE  TEPKLATK LTLCTSPHFGDL YAQ+VFNGIT PNTFMWNAIIRAY NS EPELAFL
Subjt:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL

Query:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY
        LYQQMLSSSVPHNSYTFPF+LKACRNLSAMGEALQ+HGLV KLGFGSDVFALNALLHVY LCGDI YARQLFDNIP RDVVSWNIMIDGYIKSGDVKTAY
Subjt:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY

Query:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA
        G+FLDMPLKNVVSWTSLISGLVEAG SVEAL+LCYEMQ+AGFELDG+AIASLLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMYLKCG+MEEA
Subjt:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA

Query:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL
         R+FGKLKSDQKDVYVWTAMIDGFAIHG GVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLV EGK LFESM  LYNL PSIEH+GCMVDLLGRAGL
Subjt:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL

Query:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
        LD+AKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATI AAEGKWKEAAEVRLKMKNL V IPPGKSSIT+NGVVHEF
Subjt:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF

Query:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII
        LAG QDHPQME+IHLKLKQ+AERLR+DE YEP TKDLLLDLENEEKET MAQHSEKLAIAFGLINTKPG TIRVIKNLRVC DCH VAKLISQIYCR II
Subjt:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII

Query:  MRDRVRFHHFRDGNCSCKDYW
        MRDRVRFHHFR+GNCSCKDYW
Subjt:  MRDRVRFHHFRDGNCSCKDYW

TrEMBL top hitse value%identityAlignment
A0A0A0KKE0 DYW_deaminase domain-containing protein0.0e+0088.73Show/hide
Query:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL
        MF+L AESPLQSTWA    LL NCSNMKQLKQI AQMIKT I+TEPKLATKFLTLCTSPH GDLLYAQ+VFNGITSPNTFMWNAIIRAY NS+EPELAFL
Subjt:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL

Query:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY
         YQQMLSSSVPHNSYTFPF+L+ACRNL AMGEALQVHGLV KLGFGSDVFALNALLHVY LCG+I+ ARQLFDNIPERD VSWNIMIDGYIKSGDVKTAY
Subjt:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY

Query:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA
        G+FLDMPLKNVVSWTSLISGLVEAG SVEAL+LCYEMQ+AGFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMY+KCG+MEEA
Subjt:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA

Query:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL
          +FGKLK +QKDVY+WTAMIDGFAIHGRGVEALEWFNRM+REGIRPNSITFTAVLRACSY GLVEEGK LF+SM+C YN++PSIEH+GCMVDLLGR+G 
Subjt:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL

Query:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
        LD+AKELIKKMPMKP+AVIWGALLKACWIHRDFL+GSQ+GAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMK+L VPI PGKSS+TLNG+VHEF
Subjt:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF

Query:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII
        LAGHQDHPQMEQI LKLKQ+AERLRQDE YEP TKDLLLDLENEEKET MAQHSEKLAIAFGLINTKPG TIRVIKNLR+CRDCHTVAKL+SQIY REII
Subjt:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII

Query:  MRDRVRFHHFRDGNCSCKDYW
        MRDRVRFHHFRDG+CSCKDYW
Subjt:  MRDRVRFHHFRDGNCSCKDYW

A0A1S3B1S8 pentatricopeptide repeat-containing protein At5g665200.0e+0088.89Show/hide
Query:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL
        MF+LKAESPLQSTW    +LL NCSNMKQLKQI AQMIKT I++EPKLATKFLTLCTSPH GDLLYAQ+VFNGITSPNT MWNAIIRAY NS EPELAFL
Subjt:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL

Query:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY
        LYQQMLSSSVPHNSYTFPF+LKACRNLSA+GEALQVHGLV KLGFGSDVFALNALLHVY LCG+I YARQ+FDNIPERD VSWNIMIDGYIKSGDVKTAY
Subjt:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY

Query:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA
        GIFLDMP KNVVSWTSLISGLV AGLSV+AL+LCYEMQ+AGFELDGVAIA LLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMY+KCG+MEEA
Subjt:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA

Query:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL
         R+FGKLK DQKDV +WTAMIDGFAIHGRGVEALEWF+ M+REGIRPNSITFTAVLRACSY GLVEEGK LF+SM+CLYNLSPSIEH+GCMVDLLGR+G 
Subjt:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL

Query:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
        L++AKELIK MPMKPNAVIWGA LKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAA+GKWKEAAEVRLKMKNL VPI PGKSSITLNG+VHEF
Subjt:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF

Query:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII
        LAGHQDHPQMEQIHLKLKQ+AERLRQDE YEP TKDLLLDLENEEKET +AQHSEKLAIAFGLINTKPG TIRV+KNLR+CRDCHTVAKL+SQIYCREII
Subjt:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII

Query:  MRDRVRFHHFRDGNCSCKDYW
        MRDRVRFHHFRDG+CSCKDYW
Subjt:  MRDRVRFHHFRDGNCSCKDYW

A0A5D3CKZ8 Pentatricopeptide repeat-containing protein0.0e+0088.89Show/hide
Query:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL
        MF+LKAESPLQSTW    +LL NCSNMKQLKQI AQMIKT I++EPKLATKFLTLCTSPH GDLLYAQ+VFNGITSPNT MWNAIIRAY NS EPELAFL
Subjt:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL

Query:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY
        LYQQMLSSSVPHNSYTFPF+LKACRNLSA+GEALQVHGLV KLGFGSDVFALNALLHVY LCG+I YARQ+FDNIPERD VSWNIMIDGYIKSGDVKTAY
Subjt:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY

Query:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA
        GIFLDMP KNVVSWTSLISGLV AGLSV+AL+LCYEMQ+AGFELDGVAIA LLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMY+KCG+MEEA
Subjt:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA

Query:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL
         R+FGKLK DQKDV +WTAMIDGFAIHGRGVEALEWF+ M+REGIRPNSITFTAVLRACSY GLVEEGK LF+SM+CLYNLSPSIEH+GCMVDLLGR+G 
Subjt:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL

Query:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
        L++AKELIK MPMKPNAVIWGA LKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAA+GKWKEAAEVRLKMKNL VPI PGKSSITLNG+VHEF
Subjt:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF

Query:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII
        LAGHQDHPQMEQIHLKLKQ+AERLRQDE YEP TKDLLLDLENEEKET +AQHSEKLAIAFGLINTKPG TIRV+KNLR+CRDCHTVAKL+SQIYCREII
Subjt:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII

Query:  MRDRVRFHHFRDGNCSCKDYW
        MRDRVRFHHFRDG+CSCKDYW
Subjt:  MRDRVRFHHFRDGNCSCKDYW

A0A6J1GDX2 pentatricopeptide repeat-containing protein At5g665200.0e+0089.05Show/hide
Query:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL
        MF+LKAESP+QSTWAQTMSLL NCSNMKQLK+IHAQMI+T   TEPKLATK LTLC SPHFGDL YAQ+VFNGI+SP TFMWNA+IRAY NSNEPELAFL
Subjt:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL

Query:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY
        LY+QMLSSSVPHNSYTFPF+LKACRN SAM EALQVHGLV KLGFGSDVFALNALLHVY LCGDI YARQLFDNIPERD+VSWNIMIDGYIKSGDVKTAY
Subjt:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY

Query:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA
        G+FLDMPLKNVVSWTSLISGLVEAGL+VEAL+LC+EMQ+AGFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGVH+DRV GCALVNMYLKCG+MEEA
Subjt:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA

Query:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL
         R FGKLK DQKDVYVWTAMIDGFAIHGRGVEALEWF RM REGIRPNSITFTAVLRACSYAGLVEEGKVLFESM  +YNLSPSIEH+GCMVDLLGRAGL
Subjt:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL

Query:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
        L++AKELIK MPM+PNA+IWGALLKAC IHRDFLVG QIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLR+PIPPGKSSITLNGVVHEF
Subjt:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF

Query:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII
        LAGHQDHPQMEQI  KL QV ERLRQ E YEP TKDLLLDLE+E KET +AQHSEKLAIAFGLINTKPG+TIRV+KNLRVC DCH VAKLISQIY REII
Subjt:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII

Query:  MRDRVRFHHFRDGNCSCKDYW
        MRDRVRFHHFR GNCSC DYW
Subjt:  MRDRVRFHHFRDGNCSCKDYW

A0A6J1IT43 pentatricopeptide repeat-containing protein At5g665200.0e+0089.21Show/hide
Query:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL
        MF+LKAESP+QSTWAQTMSLL NCSNMKQLK+IHAQMI+T   TEPKLATK LTLCTSPHFGDL YAQ+VFNGI+SP TFMWNA+IRAY NSNEPELAFL
Subjt:  MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFL

Query:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY
        LY+QMLSSSVPHNSYTFPF+LKACRN SAM EALQVHGLV KLGFGSDVFALNALLHVY LCGDI YARQLFDNIPERD+VSWNIMIDGYIKSGDVKTAY
Subjt:  LYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAY

Query:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA
        G+FLDMPLKNVVSWTSLISGLVEAGL+VEAL+LC+EMQ+AGFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGVH+DRV GCALVNMYLKCG+MEEA
Subjt:  GIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA

Query:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL
         + FGKLK DQKDVYVWTAMIDGFAIHGRGVEALEWF RM REGIRPNSITFTAVLRACSYAGLVEEGKVLFESM  +Y LSPSIEH+GCMVDLLGRAGL
Subjt:  FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGL

Query:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
        L++AKELIK MPMKPNA+IWGALLKAC IHRDFLVG QIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF
Subjt:  LDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEF

Query:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII
        LAGHQDHPQMEQI  KL QV ERLRQ E YEP TKDLLLDLENE KET +AQHSEKLAIAFGLINTKPG+TIRV+KNLRVC DCH VAKLIS+IY REII
Subjt:  LAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII

Query:  MRDRVRFHHFRDGNCSCKDYW
        MRDRVRFHHFR G+CSCKDYW
Subjt:  MRDRVRFHHFRDGNCSCKDYW

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic5.1e-14037.29Show/hide
Query:  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPH-NSYT
        +SL+  C +++QLKQ H  MI+T   ++P  A+K   +     F  L YA+KVF+ I  PN+F WN +IRAY +  +P L+   +  M+S S  + N YT
Subjt:  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPH-NSYT

Query:  FPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSG----------------------
        FPF++KA   +S++     +HG+  K   GSDVF  N+L+H Y  CGD++ A ++F  I E+DVVSWN MI+G+++ G                      
Subjt:  FPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSG----------------------

Query:  -------------------------------------------------------------------------------DVKTAYGIFLDMPLKNVVSWT
                                                                                       D + A  +   MP K++V+W 
Subjt:  -------------------------------------------------------------------------------DVKTAYGIFLDMPLKNVVSWT

Query:  SLISGLVEAGLSVEALNLCYEMQ-SAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDV
        +LIS   + G   EAL + +E+Q     +L+ + + S L+ACA +GAL+ GRW+H Y+  +G+ ++     AL++MY KCG++E++  +F  +  +++DV
Subjt:  SLISGLVEAGLSVEALNLCYEMQ-SAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDV

Query:  YVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMK
        +VW+AMI G A+HG G EA++ F +MQ   ++PN +TFT V  ACS+ GLV+E + LF  M   Y + P  +H+ C+VD+LGR+G L+KA + I+ MP+ 
Subjt:  YVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMK

Query:  PNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIH
        P+  +WGALL AC IH +  +       L+E++  + G ++ L+ I A  GKW+  +E+R  M+   +   PG SSI ++G++HEFL+G   HP  E+++
Subjt:  PNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIH

Query:  LKLKQVAERLRQDESYEPVTKDLLLDLENEE-KETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDG
         KL +V E+L+ +  YEP    +L  +E EE KE ++  HSEKLAI +GLI+T+    IRVIKNLRVC DCH+VAKLISQ+Y REII+RDR RFHHFR+G
Subjt:  LKLKQVAERLRQDESYEPVTKDLLLDLENEE-KETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDG

Query:  NCSCKDYW
         CSC D+W
Subjt:  NCSCKDYW

Q9FG16 Pentatricopeptide repeat-containing protein At5g065403.1e-13740.33Show/hide
Query:  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGD-----LLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPH
        ++LL +CS+   LK IH  +++T ++++  +A++ L LC      +     L YA  +F+ I +PN F++N +IR +    EP  AF  Y QML S +  
Subjt:  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGD-----LLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPH

Query:  NSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVV
        ++ TFPF++KA   +  +    Q H  + + GF +DV+  N+L+H+Y  CG I  A ++F  +  RDVVSW  M+ GY K G V+ A  +F +MP +N+ 
Subjt:  NSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVV

Query:  SWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQK
        +W+ +I+G  +     +A++L   M+  G   +   + S++++CA+LGAL+ G   + Y++ + + ++ + G ALV+M+ +CG++E+A  +F  L   + 
Subjt:  SWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQK

Query:  DVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMP
        D   W+++I G A+HG   +A+ +F++M   G  P  +TFTAVL ACS+ GLVE+G  ++E+M+  + + P +EH+GC+VD+LGRAG L +A+  I KM 
Subjt:  DVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMP

Query:  MKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAG-HQDHPQME
        +KPNA I GALL AC I+++  V  ++G  L++V  +HSG Y+ L+ I A  G+W +   +R  MK   V  PPG S I ++G +++F  G  Q HP+M 
Subjt:  MKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAG-HQDHPQME

Query:  QIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFR
        +I  K +++  ++R    Y+  T D   D++ EEKE+++  HSEKLAIA+G++ TKPG TIR++KNLRVC DCHTV KLIS++Y RE+I+RDR RFHHFR
Subjt:  QIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFR

Query:  DGNCSCKDYW
        +G CSC+DYW
Subjt:  DGNCSCKDYW

Q9FI80 Pentatricopeptide repeat-containing protein At5g489101.6e-14942.61Show/hide
Query:  MFSLKAESPLQSTWAQTMSL---LVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSP--HFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEP
        +FS    SP  S  +   SL   + NC  ++ L QIHA  IK+  + +   A + L  C +   H  DL YA K+FN +   N F WN IIR +  S+E 
Subjt:  MFSLKAESPLQSTWAQTMSL---LVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSP--HFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEP

Query:  E--LAFLLYQQMLSSS-VPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLF-DNIPERD------------
        +  +A  L+ +M+S   V  N +TFP VLKAC     + E  Q+HGL  K GFG D F ++ L+ +Y +CG +  AR LF  NI E+D            
Subjt:  E--LAFLLYQQMLSSS-VPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLF-DNIPERD------------

Query:  -VVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVH
         +V WN+MIDGY++ GD K A  +F  M  ++VVSW ++ISG    G   +A+ +  EM+      + V + S+L A + LG+L+ G WLH Y  ++G+ 
Subjt:  -VVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVH

Query:  IDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCL
        ID V G AL++MY KCG +E+A  +F +L   +++V  W+AMI+GFAIHG+  +A++ F +M++ G+RP+ + +  +L ACS+ GLVEEG+  F  M  +
Subjt:  IDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCL

Query:  YNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMK
          L P IEH+GCMVDLLGR+GLLD+A+E I  MP+KP+ VIW ALL AC +  +  +G ++   L+++    SG Y+ L+ + A++G W E +E+RL+MK
Subjt:  YNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMK

Query:  NLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNL
           +   PG S I ++GV+HEF+     HP+ ++I+  L +++++LR    Y P+T  +LL+LE E+KE  +  HSEK+A AFGLI+T PG  IR++KNL
Subjt:  NLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNL

Query:  RVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDGNCSCKDYW
        R+C DCH+  KLIS++Y R+I +RDR RFHHF+DG+CSC DYW
Subjt:  RVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDGNCSCKDYW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665208.1e-19453.91Show/hide
Query:  LQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLL-YAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSS
        L+    +TMS L  CS  ++LKQIHA+M+KT ++ +    TKFL+ C S    D L YAQ VF+G   P+TF+WN +IR +  S+EPE + LLYQ+ML S
Subjt:  LQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLL-YAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSS

Query:  SVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAYGIFLDMPL
        S PHN+YTFP +LKAC NLSA  E  Q+H  + KLG+ +DV+A+N+L++ Y + G+   A  LFD IPE D VSWN +I GY+K+G +  A  +F  M  
Subjt:  SVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAYGIFLDMPL

Query:  KNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLK
        KN +SWT++ISG V+A ++ EAL L +EMQ++  E D V++A+ L+ACA LGAL+QG+W+H YL    + +D V GC L++MY KCGEMEEA  +F  +K
Subjt:  KNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLK

Query:  SDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELI
          +K V  WTA+I G+A HG G EA+  F  MQ+ GI+PN ITFTAVL ACSY GLVEEGK++F SM   YNL P+IEH+GC+VDLLGRAGLLD+AK  I
Subjt:  SDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELI

Query:  KKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHP
        ++MP+KPNAVIWGALLKAC IH++  +G +IG  L+ +D  H GRY+  A I A + KW +AAE R  MK   V   PG S+I+L G  HEFLAG + HP
Subjt:  KKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHP

Query:  QMEQIHLKLKQVAERLRQDESYEPVTKDLLLDL-ENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRF
        ++E+I  K + +  R  ++  Y P  +++LLDL +++E+E  + QHSEKLAI +GLI TKPG  IR++KNLRVC+DCH V KLIS+IY R+I+MRDR RF
Subjt:  QMEQIHLKLKQVAERLRQDESYEPVTKDLLLDL-ENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRF

Query:  HHFRDGNCSCKDYW
        HHFRDG CSC DYW
Subjt:  HHFRDGNCSCKDYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic6.9e-13738.36Show/hide
Query:  TMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLC-TSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSY
        ++SLL NC  ++ L+ IHAQMIK  +       +K +  C  SPHF  L YA  VF  I  PN  +WN + R +  S++P  A  LY  M+S  +  NSY
Subjt:  TMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLC-TSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSY

Query:  TFPFVLKACRNLSAMGEALQVHGLVFKLGFG-------------------------------SDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWN
        TFPFVLK+C    A  E  Q+HG V KLG                                  DV +  AL+  Y   G I  A++LFD IP +DVVSWN
Subjt:  TFPFVLKACRNLSAMGEALQVHGLVFKLGFG-------------------------------SDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWN

Query:  IMIDGYI----------------------------------------------------------------------KSGDVKTAYGIFLDMPLKNVVSW
         MI GY                                                                       K G+++TA G+F  +P K+V+SW
Subjt:  IMIDGYI----------------------------------------------------------------------KSGDVKTAYGIFLDMPLKNVVSW

Query:  TSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLN--NGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQK
         +LI G     L  EAL L  EM  +G   + V + S+L ACA+LGA+D GRW+H Y+     GV        +L++MY KCG++E A ++F  +    K
Subjt:  TSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLN--NGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQK

Query:  DVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMP
         +  W AMI GFA+HGR   + + F+RM++ GI+P+ ITF  +L ACS++G+++ G+ +F +M   Y ++P +EH+GCM+DLLG +GL  +A+E+I  M 
Subjt:  DVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMP

Query:  MKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQ
        M+P+ VIW +LLKAC +H +  +G     +L++++ ++ G Y+ L+ I A+ G+W E A+ R  + +  +   PG SSI ++ VVHEF+ G + HP+  +
Subjt:  MKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQ

Query:  IHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRD
        I+  L+++ E L +   + P T ++L ++E E KE  +  HSEKLAIAFGLI+TKPG  + ++KNLRVCR+CH   KLIS+IY REII RDR RFHHFRD
Subjt:  IHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRD

Query:  GNCSCKDYW
        G CSC DYW
Subjt:  GNCSCKDYW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.9e-13838.36Show/hide
Query:  TMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLC-TSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSY
        ++SLL NC  ++ L+ IHAQMIK  +       +K +  C  SPHF  L YA  VF  I  PN  +WN + R +  S++P  A  LY  M+S  +  NSY
Subjt:  TMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLC-TSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSY

Query:  TFPFVLKACRNLSAMGEALQVHGLVFKLGFG-------------------------------SDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWN
        TFPFVLK+C    A  E  Q+HG V KLG                                  DV +  AL+  Y   G I  A++LFD IP +DVVSWN
Subjt:  TFPFVLKACRNLSAMGEALQVHGLVFKLGFG-------------------------------SDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWN

Query:  IMIDGYI----------------------------------------------------------------------KSGDVKTAYGIFLDMPLKNVVSW
         MI GY                                                                       K G+++TA G+F  +P K+V+SW
Subjt:  IMIDGYI----------------------------------------------------------------------KSGDVKTAYGIFLDMPLKNVVSW

Query:  TSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLN--NGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQK
         +LI G     L  EAL L  EM  +G   + V + S+L ACA+LGA+D GRW+H Y+     GV        +L++MY KCG++E A ++F  +    K
Subjt:  TSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLN--NGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQK

Query:  DVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMP
         +  W AMI GFA+HGR   + + F+RM++ GI+P+ ITF  +L ACS++G+++ G+ +F +M   Y ++P +EH+GCM+DLLG +GL  +A+E+I  M 
Subjt:  DVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMP

Query:  MKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQ
        M+P+ VIW +LLKAC +H +  +G     +L++++ ++ G Y+ L+ I A+ G+W E A+ R  + +  +   PG SSI ++ VVHEF+ G + HP+  +
Subjt:  MKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQ

Query:  IHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRD
        I+  L+++ E L +   + P T ++L ++E E KE  +  HSEKLAIAFGLI+TKPG  + ++KNLRVCR+CH   KLIS+IY REII RDR RFHHFRD
Subjt:  IHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRD

Query:  GNCSCKDYW
        G CSC DYW
Subjt:  GNCSCKDYW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-14137.29Show/hide
Query:  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPH-NSYT
        +SL+  C +++QLKQ H  MI+T   ++P  A+K   +     F  L YA+KVF+ I  PN+F WN +IRAY +  +P L+   +  M+S S  + N YT
Subjt:  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPH-NSYT

Query:  FPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSG----------------------
        FPF++KA   +S++     +HG+  K   GSDVF  N+L+H Y  CGD++ A ++F  I E+DVVSWN MI+G+++ G                      
Subjt:  FPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSG----------------------

Query:  -------------------------------------------------------------------------------DVKTAYGIFLDMPLKNVVSWT
                                                                                       D + A  +   MP K++V+W 
Subjt:  -------------------------------------------------------------------------------DVKTAYGIFLDMPLKNVVSWT

Query:  SLISGLVEAGLSVEALNLCYEMQ-SAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDV
        +LIS   + G   EAL + +E+Q     +L+ + + S L+ACA +GAL+ GRW+H Y+  +G+ ++     AL++MY KCG++E++  +F  +  +++DV
Subjt:  SLISGLVEAGLSVEALNLCYEMQ-SAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDV

Query:  YVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMK
        +VW+AMI G A+HG G EA++ F +MQ   ++PN +TFT V  ACS+ GLV+E + LF  M   Y + P  +H+ C+VD+LGR+G L+KA + I+ MP+ 
Subjt:  YVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMK

Query:  PNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIH
        P+  +WGALL AC IH +  +       L+E++  + G ++ L+ I A  GKW+  +E+R  M+   +   PG SSI ++G++HEFL+G   HP  E+++
Subjt:  PNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIH

Query:  LKLKQVAERLRQDESYEPVTKDLLLDLENEE-KETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDG
         KL +V E+L+ +  YEP    +L  +E EE KE ++  HSEKLAI +GLI+T+    IRVIKNLRVC DCH+VAKLISQ+Y REII+RDR RFHHFR+G
Subjt:  LKLKQVAERLRQDESYEPVTKDLLLDLENEE-KETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDG

Query:  NCSCKDYW
         CSC D+W
Subjt:  NCSCKDYW

AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-13840.33Show/hide
Query:  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGD-----LLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPH
        ++LL +CS+   LK IH  +++T ++++  +A++ L LC      +     L YA  +F+ I +PN F++N +IR +    EP  AF  Y QML S +  
Subjt:  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGD-----LLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPH

Query:  NSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVV
        ++ TFPF++KA   +  +    Q H  + + GF +DV+  N+L+H+Y  CG I  A ++F  +  RDVVSW  M+ GY K G V+ A  +F +MP +N+ 
Subjt:  NSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVV

Query:  SWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQK
        +W+ +I+G  +     +A++L   M+  G   +   + S++++CA+LGAL+ G   + Y++ + + ++ + G ALV+M+ +CG++E+A  +F  L   + 
Subjt:  SWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQK

Query:  DVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMP
        D   W+++I G A+HG   +A+ +F++M   G  P  +TFTAVL ACS+ GLVE+G  ++E+M+  + + P +EH+GC+VD+LGRAG L +A+  I KM 
Subjt:  DVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMP

Query:  MKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAG-HQDHPQME
        +KPNA I GALL AC I+++  V  ++G  L++V  +HSG Y+ L+ I A  G+W +   +R  MK   V  PPG S I ++G +++F  G  Q HP+M 
Subjt:  MKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAG-HQDHPQME

Query:  QIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFR
        +I  K +++  ++R    Y+  T D   D++ EEKE+++  HSEKLAIA+G++ TKPG TIR++KNLRVC DCHTV KLIS++Y RE+I+RDR RFHHFR
Subjt:  QIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFR

Query:  DGNCSCKDYW
        +G CSC+DYW
Subjt:  DGNCSCKDYW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-15042.61Show/hide
Query:  MFSLKAESPLQSTWAQTMSL---LVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSP--HFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEP
        +FS    SP  S  +   SL   + NC  ++ L QIHA  IK+  + +   A + L  C +   H  DL YA K+FN +   N F WN IIR +  S+E 
Subjt:  MFSLKAESPLQSTWAQTMSL---LVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSP--HFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEP

Query:  E--LAFLLYQQMLSSS-VPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLF-DNIPERD------------
        +  +A  L+ +M+S   V  N +TFP VLKAC     + E  Q+HGL  K GFG D F ++ L+ +Y +CG +  AR LF  NI E+D            
Subjt:  E--LAFLLYQQMLSSS-VPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLF-DNIPERD------------

Query:  -VVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVH
         +V WN+MIDGY++ GD K A  +F  M  ++VVSW ++ISG    G   +A+ +  EM+      + V + S+L A + LG+L+ G WLH Y  ++G+ 
Subjt:  -VVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVH

Query:  IDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCL
        ID V G AL++MY KCG +E+A  +F +L   +++V  W+AMI+GFAIHG+  +A++ F +M++ G+RP+ + +  +L ACS+ GLVEEG+  F  M  +
Subjt:  IDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCL

Query:  YNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMK
          L P IEH+GCMVDLLGR+GLLD+A+E I  MP+KP+ VIW ALL AC +  +  +G ++   L+++    SG Y+ L+ + A++G W E +E+RL+MK
Subjt:  YNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMK

Query:  NLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNL
           +   PG S I ++GV+HEF+     HP+ ++I+  L +++++LR    Y P+T  +LL+LE E+KE  +  HSEK+A AFGLI+T PG  IR++KNL
Subjt:  NLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNL

Query:  RVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDGNCSCKDYW
        R+C DCH+  KLIS++Y R+I +RDR RFHHF+DG+CSC DYW
Subjt:  RVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDGNCSCKDYW

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.7e-19553.91Show/hide
Query:  LQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLL-YAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSS
        L+    +TMS L  CS  ++LKQIHA+M+KT ++ +    TKFL+ C S    D L YAQ VF+G   P+TF+WN +IR +  S+EPE + LLYQ+ML S
Subjt:  LQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLL-YAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSS

Query:  SVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAYGIFLDMPL
        S PHN+YTFP +LKAC NLSA  E  Q+H  + KLG+ +DV+A+N+L++ Y + G+   A  LFD IPE D VSWN +I GY+K+G +  A  +F  M  
Subjt:  SVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAYGIFLDMPL

Query:  KNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLK
        KN +SWT++ISG V+A ++ EAL L +EMQ++  E D V++A+ L+ACA LGAL+QG+W+H YL    + +D V GC L++MY KCGEMEEA  +F  +K
Subjt:  KNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLK

Query:  SDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELI
          +K V  WTA+I G+A HG G EA+  F  MQ+ GI+PN ITFTAVL ACSY GLVEEGK++F SM   YNL P+IEH+GC+VDLLGRAGLLD+AK  I
Subjt:  SDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELI

Query:  KKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHP
        ++MP+KPNAVIWGALLKAC IH++  +G +IG  L+ +D  H GRY+  A I A + KW +AAE R  MK   V   PG S+I+L G  HEFLAG + HP
Subjt:  KKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHP

Query:  QMEQIHLKLKQVAERLRQDESYEPVTKDLLLDL-ENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRF
        ++E+I  K + +  R  ++  Y P  +++LLDL +++E+E  + QHSEKLAI +GLI TKPG  IR++KNLRVC+DCH V KLIS+IY R+I+MRDR RF
Subjt:  QMEQIHLKLKQVAERLRQDESYEPVTKDLLLDL-ENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRF

Query:  HHFRDGNCSCKDYW
        HHFRDG CSC DYW
Subjt:  HHFRDGNCSCKDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTCTCTAAAAGCAGAGTCTCCATTACAATCAACATGGGCCCAGACCATGTCTCTGCTTGTAAACTGCTCAAACATGAAGCAATTGAAACAAATTCACGCTCAAAT
GATCAAAACAGAGATCGTCACAGAACCCAAATTAGCTACAAAGTTTCTAACCCTCTGCACTTCACCCCATTTCGGCGATTTGCTTTACGCGCAAAAGGTCTTCAATGGAA
TCACCAGCCCCAACACTTTCATGTGGAACGCCATTATAAGAGCTTACTGTAACAGTAACGAACCAGAATTAGCATTTCTCTTGTATCAGCAAATGCTTTCTTCTTCGGTA
CCGCACAACTCCTACACCTTCCCTTTCGTGCTCAAAGCTTGTCGTAATTTGTCGGCCATGGGTGAGGCCCTCCAAGTTCATGGACTGGTTTTCAAACTGGGATTTGGGTC
GGATGTTTTTGCATTGAATGCTCTGCTTCATGTCTACACTTTGTGTGGTGACATTAATTATGCACGCCAACTGTTTGATAATATTCCTGAAAGAGATGTTGTTTCTTGGA
ACATAATGATTGATGGGTATATCAAATCTGGGGATGTAAAAACGGCTTATGGGATTTTCTTGGACATGCCATTGAAAAATGTGGTCTCGTGGACGTCGCTGATTTCGGGG
CTTGTTGAGGCAGGACTGAGCGTAGAAGCTTTGAATCTTTGTTATGAGATGCAGAGTGCAGGATTTGAACTTGATGGTGTTGCTATTGCGAGTTTGCTTACTGCTTGTGC
AAATCTTGGAGCGTTGGATCAAGGAAGATGGCTCCATTTCTATTTGCTCAACAATGGAGTCCACATCGATCGAGTAACTGGCTGTGCTCTGGTGAATATGTACTTAAAAT
GTGGGGAAATGGAAGAAGCCTTTAGATTGTTTGGGAAACTGAAGAGCGATCAGAAAGATGTGTATGTTTGGACGGCCATGATCGATGGCTTTGCCATTCATGGGCGTGGA
GTGGAAGCTCTGGAATGGTTTAACCGAATGCAGAGAGAAGGAATAAGACCAAATTCCATCACTTTCACTGCAGTTTTAAGGGCCTGTAGCTATGCAGGACTGGTTGAAGA
AGGAAAAGTGTTATTCGAGAGCATGAGATGTCTTTACAACTTGAGCCCATCTATTGAGCATTTTGGGTGTATGGTTGATCTTTTGGGTCGAGCTGGGCTGCTGGATAAAG
CGAAGGAGTTGATCAAGAAGATGCCCATGAAACCTAATGCTGTAATATGGGGAGCTTTGCTAAAGGCCTGTTGGATTCATAGAGATTTTCTGGTGGGTAGCCAAATCGGA
GCCCACCTGGTGGAAGTCGATTCAGATCATAGCGGGCGGTACATTCAGTTGGCTACCATTTTAGCTGCAGAAGGTAAATGGAAAGAAGCAGCTGAAGTGAGGTTGAAGAT
GAAGAATCTGAGAGTCCCAATTCCCCCCGGAAAGAGTTCAATAACTTTGAATGGCGTTGTTCATGAATTTCTTGCTGGGCATCAAGATCATCCACAGATGGAGCAGATTC
ATTTGAAACTGAAACAGGTTGCCGAGAGGCTACGACAAGATGAAAGTTATGAACCTGTAACTAAAGATTTATTACTTGATCTTGAGAATGAGGAGAAAGAGACTACGATG
GCTCAACATAGCGAGAAGTTGGCTATTGCTTTTGGATTGATCAATACGAAACCAGGAGCGACGATTCGAGTTATTAAGAATCTTAGAGTCTGTAGAGATTGTCACACTGT
TGCAAAGCTCATATCTCAAATCTATTGTAGAGAGATTATAATGCGAGATAGAGTTCGATTCCACCATTTTAGAGATGGGAATTGTTCTTGCAAAGATTATTGGTAG
mRNA sequenceShow/hide mRNA sequence
TGCATCCTGTCATTACTTTATCTCCTGCATGCAAGTTTATTAATTTACTCTTAAAATAACTTTGTTATCCTGTAAATTGGTTAAACAATTGTACGCCACAAACAAGACAC
AAAGTGGATGATGATCCCATTCAAATTTTCCAAGAGGAACACCAATATTTCGAGAGTTCATATGAAACCATTGTCATGTTCTCTCTAAAAGCAGAGTCTCCATTACAATC
AACATGGGCCCAGACCATGTCTCTGCTTGTAAACTGCTCAAACATGAAGCAATTGAAACAAATTCACGCTCAAATGATCAAAACAGAGATCGTCACAGAACCCAAATTAG
CTACAAAGTTTCTAACCCTCTGCACTTCACCCCATTTCGGCGATTTGCTTTACGCGCAAAAGGTCTTCAATGGAATCACCAGCCCCAACACTTTCATGTGGAACGCCATT
ATAAGAGCTTACTGTAACAGTAACGAACCAGAATTAGCATTTCTCTTGTATCAGCAAATGCTTTCTTCTTCGGTACCGCACAACTCCTACACCTTCCCTTTCGTGCTCAA
AGCTTGTCGTAATTTGTCGGCCATGGGTGAGGCCCTCCAAGTTCATGGACTGGTTTTCAAACTGGGATTTGGGTCGGATGTTTTTGCATTGAATGCTCTGCTTCATGTCT
ACACTTTGTGTGGTGACATTAATTATGCACGCCAACTGTTTGATAATATTCCTGAAAGAGATGTTGTTTCTTGGAACATAATGATTGATGGGTATATCAAATCTGGGGAT
GTAAAAACGGCTTATGGGATTTTCTTGGACATGCCATTGAAAAATGTGGTCTCGTGGACGTCGCTGATTTCGGGGCTTGTTGAGGCAGGACTGAGCGTAGAAGCTTTGAA
TCTTTGTTATGAGATGCAGAGTGCAGGATTTGAACTTGATGGTGTTGCTATTGCGAGTTTGCTTACTGCTTGTGCAAATCTTGGAGCGTTGGATCAAGGAAGATGGCTCC
ATTTCTATTTGCTCAACAATGGAGTCCACATCGATCGAGTAACTGGCTGTGCTCTGGTGAATATGTACTTAAAATGTGGGGAAATGGAAGAAGCCTTTAGATTGTTTGGG
AAACTGAAGAGCGATCAGAAAGATGTGTATGTTTGGACGGCCATGATCGATGGCTTTGCCATTCATGGGCGTGGAGTGGAAGCTCTGGAATGGTTTAACCGAATGCAGAG
AGAAGGAATAAGACCAAATTCCATCACTTTCACTGCAGTTTTAAGGGCCTGTAGCTATGCAGGACTGGTTGAAGAAGGAAAAGTGTTATTCGAGAGCATGAGATGTCTTT
ACAACTTGAGCCCATCTATTGAGCATTTTGGGTGTATGGTTGATCTTTTGGGTCGAGCTGGGCTGCTGGATAAAGCGAAGGAGTTGATCAAGAAGATGCCCATGAAACCT
AATGCTGTAATATGGGGAGCTTTGCTAAAGGCCTGTTGGATTCATAGAGATTTTCTGGTGGGTAGCCAAATCGGAGCCCACCTGGTGGAAGTCGATTCAGATCATAGCGG
GCGGTACATTCAGTTGGCTACCATTTTAGCTGCAGAAGGTAAATGGAAAGAAGCAGCTGAAGTGAGGTTGAAGATGAAGAATCTGAGAGTCCCAATTCCCCCCGGAAAGA
GTTCAATAACTTTGAATGGCGTTGTTCATGAATTTCTTGCTGGGCATCAAGATCATCCACAGATGGAGCAGATTCATTTGAAACTGAAACAGGTTGCCGAGAGGCTACGA
CAAGATGAAAGTTATGAACCTGTAACTAAAGATTTATTACTTGATCTTGAGAATGAGGAGAAAGAGACTACGATGGCTCAACATAGCGAGAAGTTGGCTATTGCTTTTGG
ATTGATCAATACGAAACCAGGAGCGACGATTCGAGTTATTAAGAATCTTAGAGTCTGTAGAGATTGTCACACTGTTGCAAAGCTCATATCTCAAATCTATTGTAGAGAGA
TTATAATGCGAGATAGAGTTCGATTCCACCATTTTAGAGATGGGAATTGTTCTTGCAAAGATTATTGGTAGAGGGGGCAAAAATTGGATATTCTTTTGTTACTTTCAAGA
GTGTTTTTGCATGATCCAAGTCTCACTTTAAAGTGTATTCTAACATGATTTTGTGTATGTTTGTGTTAATATTAAATATATATAGTTATATTGTGTTTC
Protein sequenceShow/hide protein sequence
MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSV
PHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISG
LVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRG
VEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIG
AHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTM
AQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDGNCSCKDYW