; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G004670 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G004670
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr03:5266274..5297642
RNA-Seq ExpressionLsi03G004670
SyntenyLsi03G004670
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR007258 - Vps52
IPR010908 - Longin domain
IPR011012 - Longin-like domain superfamily
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602718.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0091.74Show/hide
Query:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS
        M  FRAVQLL G S SL KRRF+LP SFLF      + IS RE  SVL  RNSE  EK++VF+N SFIRSY S KNSGNG+SEWTE+IEYLDESGSVIFS
Subjt:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS

Query:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
        GKGVR VEPGLD HVMVGGLKKPFLNASAVAKIVE+V RWKWGPELESQLEKLQFVPNMTHITQ+LK+I+DAEASLSLFRWAKRQSWYSPNDECYGLLFD
Subjt:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD

Query:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
        GLNQ RDFDAIQLLFDE+VRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESM GAGCSLDA
Subjt:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA

Query:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG
        STFELMIPSLAKSGRLDAAMKLFQEMKERN+RP LNV+T+LVDSMGKAGRLDTSMKIYM+MQLLELRPPASMFVSL+ESHVKAGKLD+ALKLWDEMKRAG
Subjt:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG

Query:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
        FRPNFGLYS+VVESHAKSGKLDVAMSIFTEMEKAGFLP PSTYCCLLEMHAAS QVD AMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
Subjt:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE

Query:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY
        MKAMGFSVDVSASDVLMVYIKEG  D+ALRWLQFMGSSGIRTN+FILRQLFESCMKKGMYESA PLLE+YVDSAAKVDLILYTSILAHLVRCQ+E  ERY
Subjt:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY

Query:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR
        LMSILSATRHKAHSFL GLFTG EQRKQPVLSFVREFFQ IDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+R
Subjt:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR

Query:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
        NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
Subjt:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL

Query:  FPSSAPEIRSLSPPKPLISRNSA
        FPSSAPEIRSLSP KPLI RNSA
Subjt:  FPSSAPEIRSLSPPKPLISRNSA

KAG7033406.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0091.62Show/hide
Query:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS
        M  FRAVQLL G S SL KRRF+LP SFLF      + IS RE  SVL  RNSE  EK++VF+N SFIRSY S K+SGNG+SEWTE+IEYLDESGSVIFS
Subjt:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS

Query:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
        GKGVR VEPGLD HVMVGGLKKPFLNASAVAKIVE+V RWKWGPELESQLEKLQFVPNMTHITQ+LK+I+DAEASLSLFRWAKRQSWYSPNDECYGLLFD
Subjt:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD

Query:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
        GLNQ RDFDAIQLLFDE+VRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESM GAGCSLDA
Subjt:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA

Query:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG
        STFELMIPSLAKSGRLDAAMKLFQEMKERN+RP LNV+T+LVDSMGKAGRLDTSMKIYM+MQLLELRPPASMFVSL+ESHVKAGKLD+ALKLWDEMKRAG
Subjt:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG

Query:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
        FRPNFGLYS+VVESHAKSGKLDVAMSIFTEMEKAGFLP PSTYCCLLEMHAAS QVD AMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
Subjt:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE

Query:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY
        MKAMGFSVDVSASDVLMVYIKEG  D+ALRWLQFMGSSGIRTN+FILRQLFESCMKKGMYESA PLLE+YVDSAAKVDLILYTSILAHLVRCQ+E  ERY
Subjt:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY

Query:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR
        LMSILSATRHKAHSFL GLFTG EQRKQPVLSFVREFFQ IDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+R
Subjt:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR

Query:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
        NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
Subjt:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL

Query:  FPSSAPEIRSLSPPKPLISRNSA
        FPSSAPEIRSLSP KPLI RNSA
Subjt:  FPSSAPEIRSLSPPKPLISRNSA

XP_004138818.1 pentatricopeptide repeat-containing protein At1g79490, mitochondrial [Cucumis sativus]0.0e+0093.32Show/hide
Query:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS
        MLPFRAVQLL GSSN LHKRR LL  SFLF      SS  WREA+SVL+PRNSEFLE  HVFNN SF RSYCSGK SGNG  EWTEDIEYLDESGSVIFS
Subjt:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS

Query:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
        GKGVR VEPG+D HVMVGGLKKPFLNASAVAKIVEVV RWKWGPELESQLEKLQFVPNMTHITQ LKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
Subjt:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD

Query:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
        GLNQRRDFDAIQLLFDEVVRDLSSD TVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGA CSLDA
Subjt:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA

Query:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG
        STFELMIP LAKSGRLDAAMKLFQEMKE+ YRPA NVY+SLVDSMGKAGRLDTSMKIYMEMQLLELRP A MFVSLIESHVKAGKLD+ALKLWD+MKRAG
Subjt:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG

Query:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
        F+PNFGLYSMVVESHAKSGKLDVAMS+FTEMEKAGFLPIPSTYCCLLEM AASG VDAAMKLYNSMTNAGLRLGL+TYT+LLTLLANKKLIDIAAKVLLE
Subjt:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE

Query:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY
        MKAMGFSV VSASDVLMVYIKEGS+DSALRWLQFMGSSGIRTNSFI+RQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Subjt:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY

Query:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR
        LMSILS T+HKAHSFLCGLFTGTEQRKQPVLSFVREFFQ IDYELEESSA+YFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+R
Subjt:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR

Query:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
        NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
Subjt:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL

Query:  FPSSAPEIRSLSPPKPLISRNSA
        FPSSAPEIRSLSPPKPLISRNSA
Subjt:  FPSSAPEIRSLSPPKPLISRNSA

XP_008441211.1 PREDICTED: pentatricopeptide repeat-containing protein At1g79490, mitochondrial [Cucumis melo]0.0e+0093.32Show/hide
Query:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS
        MLPFRAVQLL GSSN LHKRR LL  SFLF      SS  WREA S+LKPRNS+FLE  HVFNN SF R YCSGK  GNG  EWTEDIEYLDESGSVIFS
Subjt:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS

Query:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
        GKGVR VEPG+D HVMVGGLKKPFLNASAVAKIVEVV RWKWGPELESQLEKLQFVPNMTHITQ+LKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
Subjt:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD

Query:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
        GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGF+VDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
Subjt:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA

Query:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG
        STFELMIP LAKSGRLDAAMKLFQEMKE+NYRPA N+Y+SLVDSMGKAGRLDTSMKIYMEMQLLELRP A MFVSLIESHVKAGKLD+ALKLWD+MK+AG
Subjt:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG

Query:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
        F+PNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIP TYCCLLEMHA+SG VDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
Subjt:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE

Query:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY
        MKAMGFSV VSASDVLMVYIKEGS+DSALRWLQFMGSSGIRTNSFI+RQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Subjt:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY

Query:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR
        LMSILSAT+H+AHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSA+YFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+R
Subjt:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR

Query:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
        NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
Subjt:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL

Query:  FPSSAPEIRSLSPPKPLISRNSA
        FPSSAPEIRSLSPPKPLISRNSA
Subjt:  FPSSAPEIRSLSPPKPLISRNSA

XP_038884643.1 pentatricopeptide repeat-containing protein At1g79490, mitochondrial [Benincasa hispida]0.0e+0094.9Show/hide
Query:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS
        M PFRAVQLLFGSSNSLHKRRFLL ASFLF      SSIS RE + VLK RNSEFLE   VFNN SF RSYCSGK SGNG SEWTEDIEYLDESGSVIFS
Subjt:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS

Query:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
        GKGVR VEPG+D HVMVGGLKKPFLNASAVAKIVEVV RW+WGPELESQLEKLQFVPNMTHITQ+LKII D EASL+LFRWAKRQSWYS NDECYGLLFD
Subjt:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD

Query:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
        GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
Subjt:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA

Query:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG
        STFELMIPSLAKSGRLDAAMKLFQEMKERNYRP LNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRP AS+F SLIESHVKAGKLD+ALKLWDEMKRAG
Subjt:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG

Query:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
        FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASG VDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
Subjt:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE

Query:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY
        MKA+GFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Subjt:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY

Query:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR
        LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+R
Subjt:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR

Query:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
        NLSVGAALIAV+HTLHRFRKRMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
Subjt:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL

Query:  FPSSAPEIRSLSPPKPLISRNSA
        FPSSAPEIRSLSPPKPLISRNSA
Subjt:  FPSSAPEIRSLSPPKPLISRNSA

TrEMBL top hitse value%identityAlignment
A0A0A0LMG2 Smr domain-containing protein0.0e+0093.32Show/hide
Query:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS
        MLPFRAVQLL GSSN LHKRR LL  SFLF      SS  WREA+SVL+PRNSEFLE  HVFNN SF RSYCSGK SGNG  EWTEDIEYLDESGSVIFS
Subjt:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS

Query:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
        GKGVR VEPG+D HVMVGGLKKPFLNASAVAKIVEVV RWKWGPELESQLEKLQFVPNMTHITQ LKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
Subjt:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD

Query:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
        GLNQRRDFDAIQLLFDEVVRDLSSD TVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGA CSLDA
Subjt:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA

Query:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG
        STFELMIP LAKSGRLDAAMKLFQEMKE+ YRPA NVY+SLVDSMGKAGRLDTSMKIYMEMQLLELRP A MFVSLIESHVKAGKLD+ALKLWD+MKRAG
Subjt:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG

Query:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
        F+PNFGLYSMVVESHAKSGKLDVAMS+FTEMEKAGFLPIPSTYCCLLEM AASG VDAAMKLYNSMTNAGLRLGL+TYT+LLTLLANKKLIDIAAKVLLE
Subjt:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE

Query:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY
        MKAMGFSV VSASDVLMVYIKEGS+DSALRWLQFMGSSGIRTNSFI+RQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Subjt:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY

Query:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR
        LMSILS T+HKAHSFLCGLFTGTEQRKQPVLSFVREFFQ IDYELEESSA+YFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+R
Subjt:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR

Query:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
        NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
Subjt:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL

Query:  FPSSAPEIRSLSPPKPLISRNSA
        FPSSAPEIRSLSPPKPLISRNSA
Subjt:  FPSSAPEIRSLSPPKPLISRNSA

A0A1S3B2G1 pentatricopeptide repeat-containing protein At1g79490, mitochondrial0.0e+0093.32Show/hide
Query:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS
        MLPFRAVQLL GSSN LHKRR LL  SFLF      SS  WREA S+LKPRNS+FLE  HVFNN SF R YCSGK  GNG  EWTEDIEYLDESGSVIFS
Subjt:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS

Query:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
        GKGVR VEPG+D HVMVGGLKKPFLNASAVAKIVEVV RWKWGPELESQLEKLQFVPNMTHITQ+LKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
Subjt:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD

Query:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
        GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGF+VDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
Subjt:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA

Query:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG
        STFELMIP LAKSGRLDAAMKLFQEMKE+NYRPA N+Y+SLVDSMGKAGRLDTSMKIYMEMQLLELRP A MFVSLIESHVKAGKLD+ALKLWD+MK+AG
Subjt:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG

Query:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
        F+PNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIP TYCCLLEMHA+SG VDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
Subjt:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE

Query:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY
        MKAMGFSV VSASDVLMVYIKEGS+DSALRWLQFMGSSGIRTNSFI+RQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Subjt:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY

Query:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR
        LMSILSAT+H+AHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSA+YFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+R
Subjt:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR

Query:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
        NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
Subjt:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL

Query:  FPSSAPEIRSLSPPKPLISRNSA
        FPSSAPEIRSLSPPKPLISRNSA
Subjt:  FPSSAPEIRSLSPPKPLISRNSA

A0A5A7T633 Pentatricopeptide repeat-containing protein0.0e+0093.32Show/hide
Query:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS
        MLPFRAVQLL GSSN LHKRR LL  SFLF      SS  WREA S+LKPRNS+FLE  HVFNN SF R YCSGK  GNG  EWTEDIEYLDESGSVIFS
Subjt:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS

Query:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
        GKGVR VEPG+D HVMVGGLKKPFLNASAVAKIVEVV RWKWGPELESQLEKLQFVPNMTHITQ+LKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
Subjt:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD

Query:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
        GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGF+VDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
Subjt:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA

Query:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG
        STFELMIP LAKSGRLDAAMKLFQEMKE+NYRPA N+Y+SLVDSMGKAGRLDTSMKIYMEMQLLELRP A MFVSLIESHVKAGKLD+ALKLWD+MK+AG
Subjt:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG

Query:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
        F+PNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIP TYCCLLEMHA+SG VDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
Subjt:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE

Query:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY
        MKAMGFSV VSASDVLMVYIKEGS+DSALRWLQFMGSSGIRTNSFI+RQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Subjt:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY

Query:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR
        LMSILSAT+H+AHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSA+YFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+R
Subjt:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR

Query:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
        NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
Subjt:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL

Query:  FPSSAPEIRSLSPPKPLISRNSA
        FPSSAPEIRSLSPPKPLISRNSA
Subjt:  FPSSAPEIRSLSPPKPLISRNSA

A0A6J1H9M8 pentatricopeptide repeat-containing protein At1g79490, mitochondrial0.0e+0091.37Show/hide
Query:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS
        M  FRAVQLL G S SL KRRF+LP SFLF      + IS RE  SVL  RNSE  EK++VF+N SFIRSY S K+SGNG+SEWTE+IEYLDESGSVIFS
Subjt:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS

Query:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
        GKGVR VEPGLD HVMVGGLKKPFLNASAVAKIVE+V RWKWGPELESQLEKLQFVPNMTHITQ+LK+I+DAEASLSLFRWAKRQSWYSPNDECYGLLFD
Subjt:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD

Query:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
        GLN+ RDFDAIQLLFDE+VRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESM GAGCSLDA
Subjt:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA

Query:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG
        STFELMIPSLAKSGRLDAAMKLFQEMKERN+RP LNV+T+LVDSMGKAGRLDTSMKIYM+MQLLELRPPASMFVSL+ESHVKAGKLD+ALKLWDEMKRAG
Subjt:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG

Query:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
        FRPNFGLYS+VVESHAKSGKLDVAMSIFTEMEKAGFLP PSTYCCLLEMHAAS QVD AMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
Subjt:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE

Query:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY
        MKAMGFSVDVSASDVLMVYIKEG  D+ALRWLQFMGSSGIRTN+FILRQLFESCMKKGMYESA PLLE+YVDSAAKVDLILYTSILAHLVRCQ+E  ERY
Subjt:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY

Query:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR
        LMSILSATRHKAHSFL GLFTG EQRKQPVLSFVREFFQ IDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+R
Subjt:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR

Query:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
        NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSD+LMHKLNTL
Subjt:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL

Query:  FPSSAPEIRSLSPPKPLISRNSA
        FPSSAPEIRSLSP KPLI RNSA
Subjt:  FPSSAPEIRSLSPPKPLISRNSA

A0A6J1JM38 pentatricopeptide repeat-containing protein At1g79490, mitochondrial0.0e+0091.49Show/hide
Query:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS
        M  FRAVQLL G S SL KRRF+LP S LF      S IS REA SVL  RNSE  EK++VF+N SFIRSY S K+SGNG+SEWTE+IEYLDESGSVIFS
Subjt:  MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFS

Query:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD
        GKGVR VEPGLD HVMVGGLKKPFLNASAVAKIVE+V RWKWGPELESQLEKLQFVPNMTHITQ+LKII+DAE+SLSLFRWAKRQSWYS NDECYGLLFD
Subjt:  GKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFD

Query:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
        GLNQ+RDFDAIQLLFDE+VRDLS+DGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESM GAGCSLDA
Subjt:  GLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA

Query:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG
        STFELMIPSLAKSGRLDAAMKLFQEMKERN+RP LNV+T+LVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSL+ESHVKAGKLD+ALKLWDEMKRAG
Subjt:  STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAG

Query:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
        FRPNFGLYS+VVESHAKSGKL+VAMSIFTEMEKAGFLP PSTYCCLLEMHAAS QVD AMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE
Subjt:  FRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLE

Query:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY
        MKAMGFSVDVSASDVLMVYIKEG  D+ALRWLQFMGSSGIRTN+FILRQLFESCMKKGMYESA PLLE+YVDSAAKVDLILYTSILAHLVRCQ+E  ERY
Subjt:  MKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY

Query:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR
        LMSILSATRHKAHSFL GLFTG EQRKQPVLSFVREFFQ IDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+R
Subjt:  LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIR

Query:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
        NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL
Subjt:  NLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTL

Query:  FPSSAPEIRSLSPPKPLISRNSA
        FPSSAPEIRSLSP KPLI RNSA
Subjt:  FPSSAPEIRSLSPPKPLISRNSA

SwissProt top hitse value%identityAlignment
O55166 Vacuolar protein sorting-associated protein 52 homolog1.2e-7226.24Show/hide
Query:  EEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQEYIKESDNLVSLHEQIRDCDNILSQMETLLSGFQA
        E D  SD+  L+ +   ++         N  + +V   L  GV LR Y+K VE  L+Q+E  SI++YI+ES+N+ SLH QI  CD +L +ME +L  FQ+
Subjt:  EEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQEYIKESDNLVSLHEQIRDCDNILSQMETLLSGFQA

Query:  EIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVND-EYLRTLEILSKKLVVVEVDPMIKNSKALKDVQPELEKLRQKA
        ++ SIS +I+ LQE+S  M ++L+NR+    KL + V+ ++VP  ++  I++  V +  +L  L+ L  K   V  +   + + A  DV+  L++LR KA
Subjt:  EIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVND-EYLRTLEILSKKLVVVEVDPMIKNSKALKDVQPELEKLRQKA

Query:  VSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKYKNLILPITYSSLSGGKIIPCIYHHRDN
        V+K                                        + +F++QK+++ RKP TN QI  Q+ LLKY+                          
Subjt:  VSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKYKNLILPITYSSLSGGKIIPCIYHHRDN

Query:  DLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQL-DIATSSDLIGVEARSSGLFLRGREPLKNRSA
                               +   FL  + +    EVR  Y++T++K+  +++R+Y+  L K+Q  ++A   DL+GVE  +   F   +  L++R+ 
Subjt:  DLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQL-DIATSSDLIGVEARSSGLFLRGREPLKNRSA

Query:  VFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMDTATSEYLTVYSSLVYYLQNVICSKYN
        +F LG R                     G    P      E++ P L+PH A+    +YP+E LFRS H  L+D +  EYL +                 
Subjt:  VFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMDTATSEYLTVYSSLVYYLQNVICSKYN

Query:  FCDDFF----GEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINIFSFLFDLFSAFQRVEHLATHLHPRSLGCIFYYIQYIHWLTML
         C+ F         +F+ +     ++  +H  S L + YDAI + L                                       CI   +++       
Subjt:  FCDDFF----GEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINIFSFLFDLFSAFQRVEHLATHLHPRSLGCIFYYIQYIHWLTML

Query:  VCPAHNVTAAYTLLGFIFRQDIISIN--LDGVNIALWPRFKMVFDMHLSSLRNANVKTLWEDDVHPHYVMRRYAEFTASLIHLNVEYGDGQLDLNLERLR
             N+ A         ++D+ +++   + V   LWPRF+++ +M++ S+R+ + + L   D  PHY+ RRYAEF+++L+ +N    + +    L +L+
Subjt:  VCPAHNVTAAYTLLGFIFRQDIISIN--LDGVNIALWPRFKMVFDMHLSSLRNANVKTLWEDDVHPHYVMRRYAEFTASLIHLNVEYGDGQLDLNLERLR

Query:  MAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSS-HNFQPLIEKTERTSWTRLFVYGSTEEAGPEGGKI--LVHFEDLLKSNTALFVAT
        + +++ ++++A  FS  K Q VFLINNYDM + VL E          +FQ L+    +     L        + P GG +  +   E L++   A  +  
Subjt:  MAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSS-HNFQPLIEKTERTSWTRLFVYGSTEEAGPEGGKI--LVHFEDLLKSNTALFVAT

Query:  EDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTR
        E+           A V  L++ F S WKA++E + +DV+ SF+NF  G  I++ ALTQL+  Y R
Subjt:  EDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTR

Q8N1B4 Vacuolar protein sorting-associated protein 52 homolog1.6e-7226.01Show/hide
Query:  EEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQEYIKESDNLVSLHEQIRDCDNILSQMETLLSGFQA
        E D  SD+  L+ +   ++         N  + +V   L  GV LR Y+K VE  L+Q+E  SI++YI+ES+N+ SLH QI  CD +L +ME +L  FQ+
Subjt:  EEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQEYIKESDNLVSLHEQIRDCDNILSQMETLLSGFQA

Query:  EIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVND-EYLRTLEILSKKLVVVEVDPMIKNSKALKDVQPELEKLRQKA
        ++ SIS++I+ LQE+S  M ++L+NR+    KL + V+ ++VP  ++  I++  V +  +L  L+ L  K   V  +   + + A  DV+  L++LR KA
Subjt:  EIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVND-EYLRTLEILSKKLVVVEVDPMIKNSKALKDVQPELEKLRQKA

Query:  VSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKYKNLILPITYSSLSGGKIIPCIYHHRDN
        V+K                                        + +F++QK+++ RKP TN QI  Q+ LLKY+                          
Subjt:  VSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKYKNLILPITYSSLSGGKIIPCIYHHRDN

Query:  DLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQL-DIATSSDLIGVEARSSGLFLRGREPLKNRSA
                               +   FL  + +    E+R  Y++T++K+  +++R+Y+  L K+Q  ++A   DL+GVE  +   F   +  L++R+ 
Subjt:  DLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQL-DIATSSDLIGVEARSSGLFLRGREPLKNRSA

Query:  VFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMDTATSEYLTVYSSLVYYLQNVICSKYN
        +F LG                          TR  V    E++ P L+PH A+    +YP+E LFRS H  L+D +  EYL +                 
Subjt:  VFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMDTATSEYLTVYSSLVYYLQNVICSKYN

Query:  FCDDFF----GEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINIFSFLFDLFSAFQRVEHLATHLHPRSLGCIFYYIQYIHWLTML
         C+ F         +F+ +     ++  +H +S L + YDAI + L                                       CI   +++       
Subjt:  FCDDFF----GEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINIFSFLFDLFSAFQRVEHLATHLHPRSLGCIFYYIQYIHWLTML

Query:  VCPAHNVTAAYTLLGFIFRQDIISIN--LDGVNIALWPRFKMVFDMHLSSLRNANVKTLWEDDVHPHYVMRRYAEFTASLIHLNVEYGDGQLDLNLERLR
             N+ A         ++D+ +++   + V   LWPRF+++ +M++ S+R+ + + L   D  PHY+ RRYAEF+++L+ +N    + +    L +L+
Subjt:  VCPAHNVTAAYTLLGFIFRQDIISIN--LDGVNIALWPRFKMVFDMHLSSLRNANVKTLWEDDVHPHYVMRRYAEFTASLIHLNVEYGDGQLDLNLERLR

Query:  MAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSS-HNFQPLIEKTERTSWTRLFVYGSTEEAGPEGGKI--LVHFEDLLKSNTALFVAT
        + +++ ++++A  FS  K Q VFLINNYDM + VL E          +FQ L+    +     L        + P GG +  +   E L++   A  +  
Subjt:  MAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSS-HNFQPLIEKTERTSWTRLFVYGSTEEAGPEGGKI--LVHFEDLLKSNTALFVAT

Query:  EDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTR
        E+           A V  L++ F S WK+++E + +DV+ SF+NF  G  I++ ALTQL+  Y R
Subjt:  EDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTR

Q94KD3 Vacuolar protein sorting-associated protein 52 A3.4e-26958.23Show/hide
Query:  INQSYDEANKTQKNVFDLESFVGDLTVEEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQEYIKESDN
        + Q+  + +  +K  FDL +FVGDL  EED+ S+DISLEGLQQELEEC+ D+        VV NILS G KLREY KGVENNLR+VELDSI++YIKESDN
Subjt:  INQSYDEANKTQKNVFDLESFVGDLTVEEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQEYIKESDN

Query:  LVSLHEQIRDCDNILSQMETLLSGFQAEIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVNDEYLRTLEILSKKLVVV
        LVSLH+QIRDCD+ILSQMETLLSGFQ EIGSIS+DIK+LQEKSMDMGL+LKNR+VAESKLAKFVE+IIVPP+MID+IVDGEVN+EY++TLEILSKKL  V
Subjt:  LVSLHEQIRDCDNILSQMETLLSGFQAEIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVNDEYLRTLEILSKKLVVV

Query:  EVDPMIKNSKALKDVQPELEKLRQKAVSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKYK
        E D  +K+SKALKDV+PELEKLRQKA+SK                                        V+DF+VQKL ALRKPKTNIQILQQSVLLKYK
Subjt:  EVDPMIKNSKALKDVQPELEKLRQKAVSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKYK

Query:  NLILPITYSSLSGGKIIPCIYHHRDNDLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQLDIATSS
                                                         Y+ISFLK+H KEV+ +VR AYIDTMNKVLSAHFRAYIQALEKLQLDIAT+ 
Subjt:  NLILPITYSSLSGGKIIPCIYHHRDNDLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQLDIATSS

Query:  DLIGVEARSSGLFLRGREPLKNRSAVFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMDT
        DLIGVE R++GLF R REPLKNRSAVFALGDRIKI+K                            ++D+PALIPHIAEASS+KYPYEVLFRSLHKLLMDT
Subjt:  DLIGVEARSSGLFLRGREPLKNRSAVFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMDT

Query:  ATSEYLTVYSSLVYYLQNVICSKYNFCDDFFGEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINIFSFLFDLFSAFQRVEHLATHL
        ATSEY+                   FCDDFFGEESIFYEIFAGPF+VIDEHFN +L N +DAIGLMLMI IIH HQ+                      +
Subjt:  ATSEYLTVYSSLVYYLQNVICSKYNFCDDFFGEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINIFSFLFDLFSAFQRVEHLATHL

Query:  HPRSLGCIFYYIQYIHWLTMLVCPAHNVTAAYTLLGFIFRQDIISINLDGVNIALWPRFKMVFDMHLSSLRNANVKTLWEDDVHPHYVMRRYAEFTASLI
          R + C+  Y                                    LD VNI+LWPRFKMVFD HLSSLR+AN+KTLWEDDVHPHYVMRRYAEFTAS I
Subjt:  HPRSLGCIFYYIQYIHWLTMLVCPAHNVTAAYTLLGFIFRQDIISINLDGVNIALWPRFKMVFDMHLSSLRNANVKTLWEDDVHPHYVMRRYAEFTASLI

Query:  HLNVEYGDGQLDLNLERLRMAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSSHNFQPLIEKTERTSWTRLFVYGSTEEAGPEGGKILV
        HLNVEYGDGQLD+NLERLRMA+D L++KLAK F + K Q VFLINNYDMTI+VLK                                  EAGPEGGKI +
Subjt:  HLNVEYGDGQLDLNLERLRMAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSSHNFQPLIEKTERTSWTRLFVYGSTEEAGPEGGKILV

Query:  HFEDLLKSNTALFV------------------ATEDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTRL
        HFE++LKSNT+LFV                  A+ED S NP+R IT+AEVEPLVKDF SRWK AIELM KD+ITSFSNFLCGM+ILRAALTQLLLYYTRL
Subjt:  HFEDLLKSNTALFV------------------ATEDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTRL

Query:  SDCIKRIVGGSALNKDLVSISSIM
        +DCIK+I GGSALN+DLVSI SIM
Subjt:  SDCIKRIVGGSALNKDLVSISSIM

Q9FVV6 Vacuolar protein sorting-associated protein 52 B1.7e-24954.49Show/hide
Query:  DEANKTQKNVFDLESFVGDLTVEEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQ------EYIKESD
        D+ +  +K+  +L +F GDL  EED+ S+DISLEGLQQELEEC+ D+        VV NILS G KLREY KGVENNLR+VELDS++      +YIKESD
Subjt:  DEANKTQKNVFDLESFVGDLTVEEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQ------EYIKESD

Query:  NLVSLHEQIRDCDNILSQMETLLSGFQAEIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVNDEYLRTLEILSKKLVV
         LVSLH+QIRDC+ ILSQMETLLSGFQ EIGSIS+DIK+LQE SMDMGL+LKNR+V ESKLAKFVE+ IVPP+MI +IVDGEVN+EY++TL ILSKKL  
Subjt:  NLVSLHEQIRDCDNILSQMETLLSGFQAEIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVNDEYLRTLEILSKKLVV

Query:  VEVDPMIKNSKALKDVQPELEKLRQKAVSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKY
        VE D  +K+SKALKDV+PELEKLRQKA+SK                                        V+DF+VQKL ALRKPKTNIQILQQSV LKY
Subjt:  VEVDPMIKNSKALKDVQPELEKLRQKAVSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKY

Query:  KNLILPITYSSLSGGKIIPCIYHHRDNDLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQLDIATS
        K                                                 Y+ISFLK+H KEV+ +VR AYIDTMNKVLSAHF++YIQA EKLQLDIATS
Subjt:  KNLILPITYSSLSGGKIIPCIYHHRDNDLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQLDIATS

Query:  SDLIGVEARSSGLFLRGREPLKNRSAVFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMD
        +DLIGV+ RS+GLF R +EPLKNR AVFALG+RI+I+K                            E+D+PALIPHIAEASS+KYPYEVLFRSLHKLLMD
Subjt:  SDLIGVEARSSGLFLRGREPLKNRSAVFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMD

Query:  TATSEYLTVYSSLVYYLQNVICSKYNFCDDFFGEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINIFSFLFDLFSAFQRVEHLATH
        TATSEY+                   FC+DFFGE+SIFYEIFAGPF+VI EH +S+L + YDAIGL+LMI IIH HQ+                      
Subjt:  TATSEYLTVYSSLVYYLQNVICSKYNFCDDFFGEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINIFSFLFDLFSAFQRVEHLATH

Query:  LHPRSLGCIFYYIQYIHWLTMLVCPAHNVTAAYTLLGFIFRQDIISINLDGVNIALWPRFKMVFDMHLSSLRNANVKTLWEDDVHPHYVMRRYAEFTASL
        +  R + C+  Y                                    LD VNI+LWPRFK VFD+H+ SLRNAN+ T+WEDDVHPHY+MRRYAEFTAS 
Subjt:  LHPRSLGCIFYYIQYIHWLTMLVCPAHNVTAAYTLLGFIFRQDIISINLDGVNIALWPRFKMVFDMHLSSLRNANVKTLWEDDVHPHYVMRRYAEFTASL

Query:  IHLNVEYGDGQLDLNLERLRMAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSSHNFQPLIEKTERTSWTRLFVYGSTEEAGPEGGKIL
        IHLNVEYGDGQLD+NLERLRMA+D L++KLAK F + K Q VFLINNYDMTI+VLK                                  EA PEGGKI 
Subjt:  IHLNVEYGDGQLDLNLERLRMAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSSHNFQPLIEKTERTSWTRLFVYGSTEEAGPEGGKIL

Query:  VHFEDLLKSNTALFV------------------ATEDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTR
        +HFE+LLKSNT+LF                   A ED S N +R ITVAEVEPLVKDF SRWK AIELMHKD+IT FSNFLCGM+IL A +TQLLLYYTR
Subjt:  VHFEDLLKSNTALFV------------------ATEDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTR

Query:  LSDCIKRIVGGSALNKDLVSISSIM
        L DCIK+I GGSALN+D+V+  SIM
Subjt:  LSDCIKRIVGGSALNKDLVSISSIM

Q9SAK0 Pentatricopeptide repeat-containing protein At1g79490, mitochondrial0.0e+0078.91Show/hide
Query:  SFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQS
        S +R +CS K   + +S WTE++EYLDESGSV+ SGKG+R VEPGLD HVMVGGLKKP++NAS+VAKIVEVV RWKWGPELE+QL+KLQFVPNM HITQS
Subjt:  SFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQS

Query:  LKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDT
        LKI+ + +A+LSLFRWAK+Q WY P+DECY +LFDGLNQ RDF  IQ LF+E+V+D SS G +SF+AYN+VIQYLAKAEKLEV+FCCFKK  +SG K+DT
Subjt:  LKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDT

Query:  QTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLE
        QTYN+L+ LFLNKGLPYKAFEIYESM      LD ST+EL+IPSLAKSGRLDAA KLFQ+MKER  RP+ +V++SLVDSMGKAGRLDTSMK+YMEMQ   
Subjt:  QTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLE

Query:  LRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNS
         RP A+MFVSLI+S+ KAGKLD+AL+LWDEMK++GFRPNFGLY+M++ESHAKSGKL+VAM++F +MEKAGFLP PSTY CLLEMHA SGQVD+AMK+YNS
Subjt:  LRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNS

Query:  MTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMP
        MTNAGLR GLS+Y +LLTLLANK+L+D+A K+LLEMKAMG+SVDV ASDVLM+YIK+ S+D AL+WL+FMGSSGI+TN+FI+RQLFESCMK G+Y+SA P
Subjt:  MTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMP

Query:  LLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMG
        LLET V SA KVDL+LYTSILAHLVRCQDE KER LMSILSAT+HKAH+F+CGLFTG EQRKQPVL+FVREF+QGIDYELEE +ARYFVNVLLNYL+LMG
Subjt:  LLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMG

Query:  QINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRAT
        QINRARC+WKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRKRMLYYG+VPRRIKLVTGPTLK+V+AQMLSSVESPFEVSKVVLRA 
Subjt:  QINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRAT

Query:  GDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLIS
        G+ VMEWFKKPIVQQFLLNEIPSRSDILMHK+N +FPSSAPE+RS+SPPKPL+S
Subjt:  GDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLIS

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-5126.1Show/hide
Query:  VAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVS
        V  +  V+ R++WGP  E  L+ L    +     Q LK ++D   +L  F W KRQ  +  +   Y  +   L + + F AI  L DE+VRD     TV+
Subjt:  VAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVS

Query:  FSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKER
           YNR+I    +A  L  +   F ++ ++G K D  TY +LI +    G    A ++Y+ M   G S D  T+ ++I  L K+G L AA KLF EM ++
Subjt:  FSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKER

Query:  NYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFT
           P L  Y  ++D                                    H KA    +ALKL+ +M+ AGF P+   YS+V+E     G L+ A ++FT
Subjt:  NYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFT

Query:  EMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSAL
        EM++  ++P    Y  L+++   +G V+ A + Y +M +AGLR  + T  +LL+                                   +++   I  A 
Subjt:  EMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSAL

Query:  RWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATRHKAHSFLCGLFTGTEQRKQP
          LQ M + G+R +      L   C                 D  +K+D+                    +   ++++T H AH FL  +        + 
Subjt:  RWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATRHKAHSFLCGLFTGTEQRKQP

Query:  VLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVP
        V +    F   +  E +  S R  V+ ++++L   GQ   A  +W+VA +  +FP A+       W +++  +S G A+ A+  TL  FRK+ML  G  P
Subjt:  VLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVP

Query:  RRIKLVTG----------PTLKLVVAQMLSSVESPF
         RI +VTG            ++  V ++L+   SPF
Subjt:  RRIKLVTG----------PTLKLVVAQMLSSVESPF

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein1.6e-5126.1Show/hide
Query:  VAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVS
        V  +  V+ R++WGP  E  L+ L    +     Q LK ++D   +L  F W KRQ  +  +   Y  +   L + + F AI  L DE+VRD     TV+
Subjt:  VAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVS

Query:  FSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKER
           YNR+I    +A  L  +   F ++ ++G K D  TY +LI +    G    A ++Y+ M   G S D  T+ ++I  L K+G L AA KLF EM ++
Subjt:  FSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKER

Query:  NYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFT
           P L  Y  ++D                                    H KA    +ALKL+ +M+ AGF P+   YS+V+E     G L+ A ++FT
Subjt:  NYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFT

Query:  EMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSAL
        EM++  ++P    Y  L+++   +G V+ A + Y +M +AGLR  + T  +LL+                                   +++   I  A 
Subjt:  EMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSAL

Query:  RWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATRHKAHSFLCGLFTGTEQRKQP
          LQ M + G+R +      L   C                 D  +K+D+                    +   ++++T H AH FL  +        + 
Subjt:  RWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATRHKAHSFLCGLFTGTEQRKQP

Query:  VLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVP
        V +    F   +  E +  S R  V+ ++++L   GQ   A  +W+VA +  +FP A+       W +++  +S G A+ A+  TL  FRK+ML  G  P
Subjt:  VLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVP

Query:  RRIKLVTG----------PTLKLVVAQMLSSVESPF
         RI +VTG            ++  V ++L+   SPF
Subjt:  RRIKLVTG----------PTLKLVVAQMLSSVESPF

AT1G71270.1 Vps52 / Sac2 family2.4e-27058.23Show/hide
Query:  INQSYDEANKTQKNVFDLESFVGDLTVEEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQEYIKESDN
        + Q+  + +  +K  FDL +FVGDL  EED+ S+DISLEGLQQELEEC+ D+        VV NILS G KLREY KGVENNLR+VELDSI++YIKESDN
Subjt:  INQSYDEANKTQKNVFDLESFVGDLTVEEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQEYIKESDN

Query:  LVSLHEQIRDCDNILSQMETLLSGFQAEIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVNDEYLRTLEILSKKLVVV
        LVSLH+QIRDCD+ILSQMETLLSGFQ EIGSIS+DIK+LQEKSMDMGL+LKNR+VAESKLAKFVE+IIVPP+MID+IVDGEVN+EY++TLEILSKKL  V
Subjt:  LVSLHEQIRDCDNILSQMETLLSGFQAEIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVNDEYLRTLEILSKKLVVV

Query:  EVDPMIKNSKALKDVQPELEKLRQKAVSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKYK
        E D  +K+SKALKDV+PELEKLRQKA+SK                                        V+DF+VQKL ALRKPKTNIQILQQSVLLKYK
Subjt:  EVDPMIKNSKALKDVQPELEKLRQKAVSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKYK

Query:  NLILPITYSSLSGGKIIPCIYHHRDNDLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQLDIATSS
                                                         Y+ISFLK+H KEV+ +VR AYIDTMNKVLSAHFRAYIQALEKLQLDIAT+ 
Subjt:  NLILPITYSSLSGGKIIPCIYHHRDNDLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQLDIATSS

Query:  DLIGVEARSSGLFLRGREPLKNRSAVFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMDT
        DLIGVE R++GLF R REPLKNRSAVFALGDRIKI+K                            ++D+PALIPHIAEASS+KYPYEVLFRSLHKLLMDT
Subjt:  DLIGVEARSSGLFLRGREPLKNRSAVFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMDT

Query:  ATSEYLTVYSSLVYYLQNVICSKYNFCDDFFGEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINIFSFLFDLFSAFQRVEHLATHL
        ATSEY+                   FCDDFFGEESIFYEIFAGPF+VIDEHFN +L N +DAIGLMLMI IIH HQ+                      +
Subjt:  ATSEYLTVYSSLVYYLQNVICSKYNFCDDFFGEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINIFSFLFDLFSAFQRVEHLATHL

Query:  HPRSLGCIFYYIQYIHWLTMLVCPAHNVTAAYTLLGFIFRQDIISINLDGVNIALWPRFKMVFDMHLSSLRNANVKTLWEDDVHPHYVMRRYAEFTASLI
          R + C+  Y                                    LD VNI+LWPRFKMVFD HLSSLR+AN+KTLWEDDVHPHYVMRRYAEFTAS I
Subjt:  HPRSLGCIFYYIQYIHWLTMLVCPAHNVTAAYTLLGFIFRQDIISINLDGVNIALWPRFKMVFDMHLSSLRNANVKTLWEDDVHPHYVMRRYAEFTASLI

Query:  HLNVEYGDGQLDLNLERLRMAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSSHNFQPLIEKTERTSWTRLFVYGSTEEAGPEGGKILV
        HLNVEYGDGQLD+NLERLRMA+D L++KLAK F + K Q VFLINNYDMTI+VLK                                  EAGPEGGKI +
Subjt:  HLNVEYGDGQLDLNLERLRMAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSSHNFQPLIEKTERTSWTRLFVYGSTEEAGPEGGKILV

Query:  HFEDLLKSNTALFV------------------ATEDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTRL
        HFE++LKSNT+LFV                  A+ED S NP+R IT+AEVEPLVKDF SRWK AIELM KD+ITSFSNFLCGM+ILRAALTQLLLYYTRL
Subjt:  HFEDLLKSNTALFV------------------ATEDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTRL

Query:  SDCIKRIVGGSALNKDLVSISSIM
        +DCIK+I GGSALN+DLVSI SIM
Subjt:  SDCIKRIVGGSALNKDLVSISSIM

AT1G71300.1 Vps52 / Sac2 family4.9e-24753.33Show/hide
Query:  DEANKTQKNVFDLESFVGDLTVEEDA--------------------CSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQ
        D+ +  +K+  +L +F GDL  EED+                     S+DISLEGLQQELEEC+ D+        VV NILS G KLREY KGVENNLR+
Subjt:  DEANKTQKNVFDLESFVGDLTVEEDA--------------------CSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQ

Query:  VELDSIQ------EYIKESDNLVSLHEQIRDCDNILSQMETLLSGFQAEIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVD
        VELDS++      +YIKESD LVSLH+QIRDC+ ILSQMETLLSGFQ EIGSIS+DIK+LQE SMDMGL+LKNR+V ESKLAKFVE+ IVPP+MI +IVD
Subjt:  VELDSIQ------EYIKESDNLVSLHEQIRDCDNILSQMETLLSGFQAEIGSISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVD

Query:  GEVNDEYLRTLEILSKKLVVVEVDPMIKNSKALKDVQPELEKLRQKAVSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLH
        GEVN+EY++TL ILSKKL  VE D  +K+SKALKDV+PELEKLRQKA+SK                                        V+DF+VQKL 
Subjt:  GEVNDEYLRTLEILSKKLVVVEVDPMIKNSKALKDVQPELEKLRQKAVSKDSSSTAVVKYVEKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLH

Query:  ALRKPKTNIQILQQSVLLKYKNLILPITYSSLSGGKIIPCIYHHRDNDLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLS
        ALRKPKTNIQILQQSV LKYK                                                 Y+ISFLK+H KEV+ +VR AYIDTMNKVLS
Subjt:  ALRKPKTNIQILQQSVLLKYKNLILPITYSSLSGGKIIPCIYHHRDNDLQEKPDFPKITPAIMNCEFLSLYVISFLKDHSKEVYNEVRTAYIDTMNKVLS

Query:  AHFRAYIQALEKLQLDIATSSDLIGVEARSSGLFLRGREPLKNRSAVFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEA
        AHF++YIQA EKLQLDIATS+DLIGV+ RS+GLF R +EPLKNR AVFALG+RI+I+K                            E+D+PALIPHIAEA
Subjt:  AHFRAYIQALEKLQLDIATSSDLIGVEARSSGLFLRGREPLKNRSAVFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPDVNFIQEVDEPALIPHIAEA

Query:  SSIKYPYEVLFRSLHKLLMDTATSEYLTVYSSLVYYLQNVICSKYNFCDDFFGEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINI
        SS+KYPYEVLFRSLHKLLMDTATSEY+                   FC+DFFGE+SIFYEIFAGPF+VI EH +S+L + YDAIGL+LMI IIH HQ+  
Subjt:  SSIKYPYEVLFRSLHKLLMDTATSEYLTVYSSLVYYLQNVICSKYNFCDDFFGEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMILIIHQHQINI

Query:  FSFLFDLFSAFQRVEHLATHLHPRSLGCIFYYIQYIHWLTMLVCPAHNVTAAYTLLGFIFRQDIISINLDGVNIALWPRFKMVFDMHLSSLRNANVKTLW
                            +  R + C+  Y                                    LD VNI+LWPRFK VFD+H+ SLRNAN+ T+W
Subjt:  FSFLFDLFSAFQRVEHLATHLHPRSLGCIFYYIQYIHWLTMLVCPAHNVTAAYTLLGFIFRQDIISINLDGVNIALWPRFKMVFDMHLSSLRNANVKTLW

Query:  EDDVHPHYVMRRYAEFTASLIHLNVEYGDGQLDLNLERLRMAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSSHNFQPLIEKTERTSW
        EDDVHPHY+MRRYAEFTAS IHLNVEYGDGQLD+NLERLRMA+D L++KLAK F + K Q VFLINNYDMTI+VLK                        
Subjt:  EDDVHPHYVMRRYAEFTASLIHLNVEYGDGQLDLNLERLRMAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSSHNFQPLIEKTERTSW

Query:  TRLFVYGSTEEAGPEGGKILVHFEDLLKSNTALFV------------------ATEDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNF
                  EA PEGGKI +HFE+LLKSNT+LF                   A ED S N +R ITVAEVEPLVKDF SRWK AIELMHKD+IT FSNF
Subjt:  TRLFVYGSTEEAGPEGGKILVHFEDLLKSNTALFV------------------ATEDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNF

Query:  LCGMEILRAALTQLLLYYTRLSDCIKRIVGGSALNKDLVSISSIM
        LCGM+IL A +TQLLLYYTRL DCIK+I GGSALN+D+V+  SIM
Subjt:  LCGMEILRAALTQLLLYYTRLSDCIKRIVGGSALNKDLVSISSIM

AT1G79490.1 Pentatricopeptide repeat (PPR) superfamily protein0.0e+0078.91Show/hide
Query:  SFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQS
        S +R +CS K   + +S WTE++EYLDESGSV+ SGKG+R VEPGLD HVMVGGLKKP++NAS+VAKIVEVV RWKWGPELE+QL+KLQFVPNM HITQS
Subjt:  SFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQS

Query:  LKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDT
        LKI+ + +A+LSLFRWAK+Q WY P+DECY +LFDGLNQ RDF  IQ LF+E+V+D SS G +SF+AYN+VIQYLAKAEKLEV+FCCFKK  +SG K+DT
Subjt:  LKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDT

Query:  QTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLE
        QTYN+L+ LFLNKGLPYKAFEIYESM      LD ST+EL+IPSLAKSGRLDAA KLFQ+MKER  RP+ +V++SLVDSMGKAGRLDTSMK+YMEMQ   
Subjt:  QTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLE

Query:  LRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNS
         RP A+MFVSLI+S+ KAGKLD+AL+LWDEMK++GFRPNFGLY+M++ESHAKSGKL+VAM++F +MEKAGFLP PSTY CLLEMHA SGQVD+AMK+YNS
Subjt:  LRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNS

Query:  MTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMP
        MTNAGLR GLS+Y +LLTLLANK+L+D+A K+LLEMKAMG+SVDV ASDVLM+YIK+ S+D AL+WL+FMGSSGI+TN+FI+RQLFESCMK G+Y+SA P
Subjt:  MTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMP

Query:  LLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMG
        LLET V SA KVDL+LYTSILAHLVRCQDE KER LMSILSAT+HKAH+F+CGLFTG EQRKQPVL+FVREF+QGIDYELEE +ARYFVNVLLNYL+LMG
Subjt:  LLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMG

Query:  QINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRAT
        QINRARC+WKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRKRMLYYG+VPRRIKLVTGPTLK+V+AQMLSSVESPFEVSKVVLRA 
Subjt:  QINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRAT

Query:  GDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLIS
        G+ VMEWFKKPIVQQFLLNEIPSRSDILMHK+N +FPSSAPE+RS+SPPKPL+S
Subjt:  GDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGAGTGTTTCTGTGTGCTGCCGAATTTGCTGGTAATCTTCCATCCTTCAAATGGATCCGAGTCATAGCAAGATTTACTACTGTTCTATATCAAGGGGCGAACGA
GATTGAGAATTTAGCTGATTTATGCTTGAAAAGTGCCCCTCCCTATCACAAATGGTATTTTGAAACATTGGGTAAAAAGACTTTTGGGTTCTTGATGAAAGAAGGGTATA
TTTATTTTGCAATTGCTGATGAAGTTATAGGTAATCAGGCTCTTCTCCAGTTTCTAGAGCAGTTGAGAGATGAATTCAAAAGGGTAGCAAAGAAGGGTTCACGAGGGAGT
TTCTCAAGTATGAACTCCATTAGTCTGCAGGAACAGCTAGTGCCTGCAATTTGTAAGTTAATCACCTCGCTGGAGAATGTCTCCAACAATAGTAAGGATTGGATGGCTGA
AACACCCTCCTCAAACAATGGTCTGTCTCCATCTCCCAGCAACGTGAATGGCCAAGTCGACGTTTTGGCTTCCACGAAAGCCCCTCTGCTGGGGAAGTCAAACAAACCAG
AGAAGAAAAAGGGAAAAGACCATGTGATAACAATGAGAGGTATTGAGATGGAGGAGCACAGAAAATCTACAGATAGAGGTTTGAAGATTGATTCAGTGTCTTTGGAATCC
AGTAATCAGGGTGGATCAGGGTCTTCAATTCCGGTGCAGAAAGACAGCAATCCATTGAGGAGATCGAACTCTCAAAGCATCCGGCGAAGGTGGTGGCGTCATGTGAGGAT
CGCGCTTGCTGTTGATGCTGCTGTATGTCTGATATTGCTCACTTCTGCGGATTTCTGCAACTTAATAGTCACTTTTGCTGGCGATATCTGTTTTGGGCTCTTAACTGATA
ATGATATTTCACTTTGGTTTGGATCCATGTTCGTTCTAATGCTTATTAATTTGGTATCGCTTTCGAGATTCAAGGAGACGATTGGTCGTGATTTGATGTCATATCGAAAT
CACCGACGCCCAGTTGTTTCCATTGTTTTAGATCAATTTCGGAATTCTGAGGCGCAACTGAGGAATTGGACAACAAAGCGGCTCTGCCGGCTCCGGCGTTCGAACCTCTC
TCCGATCTGTAAAACCCTTACACTCCTACCCGATTCTTCCATCTCTTCCAGCTTCTCAAAACATATGCGAGGCGTCGGGTCAATCGCGCTGGCGGACTATCTATTAAGCA
CTCTTGGCCAATTCGGACGGGATGGCCGATACTGCGATCGACCAAGCGGTATAAATCAATCTTATGATGAAGCCAACAAGACCCAGAAGAATGTTTTTGACTTGGAATCA
TTTGTTGGGGACTTGACTGTCGAGGAAGATGCTTGCAGTGATGACATTTCATTGGAAGGACTGCAGCAAGAACTCGAAGAATGTAAAGATGATGATCATGGCTTGAACTA
TCCAGAGACAGTAGTTACGAACATACTTTCTAAAGGTGTAAAACTAAGGGAGTACACAAAGGGAGTTGAAAACAATTTGCGGCAGGTTGAGCTGGATTCCATACAGGAAT
ATATTAAAGAAAGTGATAACTTAGTCTCGCTTCATGAACAAATTCGTGACTGTGACAACATTTTATCACAGATGGAAACTCTTCTCAGTGGTTTCCAGGCCGAAATTGGT
TCTATTAGTGCAGACATTAAGGTCCTGCAGGAGAAGTCAATGGATATGGGTCTGAAGCTCAAGAATCGCAAGGTGGCGGAATCAAAATTGGCAAAGTTTGTCGAAGAGAT
CATTGTCCCTCCAAGGATGATAGACATAATTGTTGATGGAGAGGTGAATGATGAATATTTGAGAACTCTTGAGATCCTAAGTAAGAAACTGGTGGTCGTGGAAGTGGATC
CAATGATAAAAAATTCCAAAGCACTGAAGGATGTACAACCTGAGCTTGAAAAACTCCGGCAGAAAGCAGTGTCTAAGGACTCTAGTAGTACAGCAGTAGTTAAATATGTA
GAAAAGCCATTAAATTTGAAACCACCTTCCAGCTTTACTGTGAGTAGCTTTTATAACACCAGAGATGACTTTGGCAGTTCAGGAACAGTCTTTGACTTCCTTGTCCAGAA
ACTCCATGCTTTGAGAAAACCCAAGACAAATATTCAGATCCTTCAACAGAGTGTTCTTTTGAAGTACAAGAATCTGATTCTACCAATAACCTATTCTTCGTTGTCTGGAG
GGAAAATAATTCCATGTATCTATCACCATCGAGACAACGATCTCCAAGAGAAGCCAGACTTTCCAAAGATAACACCAGCCATAATGAACTGTGAATTCTTGAGCTTGTAT
GTTATTTCTTTCCTCAAGGATCATAGCAAAGAAGTTTATAATGAGGTTCGGACAGCATATATTGATACAATGAACAAGGTTTTGAGTGCACATTTCCGTGCCTACATACA
GGCACTAGAGAAATTACAGTTAGACATAGCGACATCTAGTGATCTGATTGGGGTTGAGGCTAGAAGTTCAGGTCTTTTCTTAAGAGGAAGGGAACCACTGAAGAACCGAT
CTGCTGTTTTTGCACTTGGTGATAGAATAAAGATATTAAAGATTTTTTTTATGAAACGGAAACAAGGCTTTTCATGTGATAAAAACATTGGACTTGCTACACGTCCTGAT
GTAAACTTTATTCAGGAAGTTGACGAACCTGCACTGATTCCACATATTGCTGAAGCCAGTTCAATTAAGTATCCTTACGAAGTTCTCTTTAGGAGCTTGCATAAGCTTCT
AATGGACACTGCCACTTCTGAGTACTTAACAGTTTACTCTTCCTTAGTTTATTATCTTCAGAATGTTATTTGTAGCAAGTATAACTTTTGTGATGATTTTTTTGGGGAAG
AATCTATATTTTATGAGATCTTTGCGGGTCCTTTTGCTGTCATCGACGAACACTTCAATTCAATTCTCCCAAACAGTTATGATGCAATTGGTCTAATGCTCATGATCCTG
ATCATTCATCAGCATCAGATCAACATATTCAGTTTTCTATTTGATCTTTTCTCTGCTTTCCAACGTGTTGAACATCTTGCAACACATCTACATCCTCGTTCACTTGGTTG
CATCTTCTATTACATACAATATATCCATTGGCTGACAATGCTTGTATGTCCAGCTCATAATGTCACGGCGGCGTATACCTTGCTTGGATTCATATTTAGACAAGATATCA
TTTCCATTAATCTCGATGGTGTTAATATTGCTCTTTGGCCTCGTTTTAAGATGGTATTTGACATGCACCTCAGCAGTCTGAGGAATGCAAATGTAAAGACGTTGTGGGAA
GATGATGTGCATCCTCACTATGTCATGAGGCGATACGCTGAGTTTACAGCTTCACTTATCCATCTGAATGTCGAATATGGAGATGGACAGCTTGATTTGAACTTGGAGCG
GCTAAGAATGGCAATTGATGATTTGCTTATCAAGCTTGCCAAAACCTTTTCAAAAGCTAAATCTCAGACAGTGTTTCTTATAAACAACTACGATATGACAATTTCCGTCT
TGAAGGAGTTCATTGGAAAGAAATGGTCTTCTCACAATTTTCAACCCCTCATTGAGAAAACAGAAAGAACTTCTTGGACAAGATTATTTGTTTATGGCTCTACGGAGGAA
GCCGGTCCGGAGGGTGGGAAAATTCTGGTGCACTTCGAGGATCTGCTGAAGAGCAACACAGCCTTGTTTGTGGCAACGGAGGACCTAAGTTCGAATCCTGACCGACCAAT
TACTGTGGCTGAAGTAGAGCCCCTGGTAAAGGACTTTGCGAGCAGATGGAAAGCTGCAATTGAGCTGATGCACAAAGATGTCATCACTTCTTTCAGTAATTTCTTGTGTG
GAATGGAAATCTTAAGGGCAGCTTTGACTCAACTCCTCCTTTATTACACAAGACTCTCGGATTGTATAAAGAGGATCGTCGGTGGCTCTGCTCTAAACAAGGATCTAGTG
TCCATTTCTTCAATTATGTGCCAATGCAGCTGTGTTCATGTTTGGTTATGCAGAACTGCTTTCGTTTTGGTTCATCAGAAAATATCCAGTCAATGCTCGAGGTTGTGGCT
TTACTCTTCTATCAAATTTCAGTGCCAAGACATTACATTACCTGTGTCACCATTAGTTAATTTGCTGTGTACAAGTTTTACCATGGAACCTAATACACAATCTTTTCCCC
TAAATCACCCCCACCAGAAAAATCCGAGCGGAACAGGAAGAAGTCCAGTTAAATTTGAAGTATCAAACCAAAACTCAAGTAGGCCGCCGTATTGTGAATGTGGGCAGGCC
GGCAGGGCAGGTCGAGACTGGGCTGAGCATCAAAAGGAGGCGGAACTGGAAGACAAGACATTACCAGCCGATTGTAAAGCTTTGCCCAGTTCGCTTTGGGCATTCTCATT
CACCCCATTTCGTTTGAGAAGACAATTTCAAACACAACTTCTCTGCCGCCCACCACCTGTTTGTAAAAATGTCTGCAAGGAAATCGGCAGTTCTTCGCAACAAGCATATG
GGTACTTAAGGAAAAGACCCAGTTTCAGTATGCTTCCTTTTCGTGCTGTACAACTACTTTTCGGGTCCTCCAATTCCCTCCACAAGCGCCGATTTCTTCTGCCGGCTTCC
TTCCTCTTTTCATCGATTTCATGGCGCGAAGCGGAATCTGTTTTAAAACCCAGAAACTCTGAGTTTCTTGAGAAGTCACATGTTTTCAATAATCTTAGTTTTATCAGATC
TTATTGTTCTGGAAAGAACAGTGGCAATGGGGCTAGTGAGTGGACTGAGGATATTGAGTATCTAGATGAGTCGGGGAGTGTTATTTTCTCTGGTAAAGGCGTTCGGTTGG
TTGAACCAGGTCTTGATGGTCATGTAATGGTGGGCGGACTCAAAAAGCCCTTTCTGAATGCATCGGCTGTTGCTAAGATAGTTGAGGTTGTAACGAGGTGGAAATGGGGT
CCAGAGTTGGAATCTCAGCTCGAAAAGCTCCAATTTGTTCCAAATATGACGCACATCACTCAGTCATTGAAGATTATCGATGATGCTGAGGCTTCTTTGAGCTTGTTTCG
TTGGGCTAAGAGGCAGTCCTGGTATTCACCAAATGATGAGTGCTATGGGTTGTTGTTTGATGGGTTAAATCAGAGAAGAGATTTTGATGCAATTCAATTGTTGTTTGATG
AGGTTGTTCGTGATTTGAGCAGTGATGGGACTGTCTCATTCAGTGCATATAATCGTGTGATTCAGTACTTGGCTAAAGCTGAGAAATTGGAAGTGTCTTTCTGTTGTTTT
AAGAAGATTCATGATTCAGGTTTCAAGGTTGATACTCAAACATACAATTCTCTTATAACCTTGTTCTTAAACAAGGGTCTGCCTTACAAGGCTTTCGAGATATACGAGAG
CATGGCAGGAGCAGGGTGTTCTTTAGATGCATCTACCTTTGAGCTGATGATACCAAGTTTGGCAAAATCGGGTCGTCTTGATGCAGCAATGAAGCTCTTTCAAGAGATGA
AAGAGAGGAATTATCGTCCCGCCCTGAATGTTTATACATCCCTTGTGGATTCTATGGGGAAAGCTGGGAGGCTCGACACATCGATGAAGATTTACATGGAAATGCAGCTG
CTTGAGCTCAGACCGCCTGCTTCAATGTTTGTTTCCTTAATTGAGTCACATGTGAAGGCTGGGAAATTGGATTCTGCTCTCAAGCTTTGGGATGAGATGAAAAGGGCAGG
TTTTAGGCCTAACTTTGGTTTGTACTCTATGGTTGTTGAGTCACATGCCAAATCAGGGAAACTTGATGTTGCAATGTCTATCTTCACCGAAATGGAGAAAGCTGGATTTC
TTCCCATCCCATCTACTTATTGCTGTCTCTTGGAAATGCACGCAGCGTCGGGACAAGTAGATGCTGCCATGAAACTCTACAACTCTATGACTAATGCAGGTTTGAGGCTC
GGGTTAAGTACGTACACTGCTTTATTGACACTTCTGGCTAATAAGAAGCTTATCGATATTGCTGCAAAAGTTTTACTTGAAATGAAGGCCATGGGATTCTCTGTTGATGT
GAGCGCTAGCGATGTCTTGATGGTGTATATCAAGGAAGGCTCTATCGATTCCGCTTTGAGGTGGCTTCAGTTCATGGGTTCATCTGGAATAAGAACTAATAGCTTTATTC
TCAGGCAATTGTTTGAGTCATGCATGAAGAAAGGGATGTATGAGTCAGCTATGCCTCTCTTAGAAACTTATGTAGATTCTGCTGCTAAAGTTGATCTTATACTCTACACA
TCCATCCTGGCCCATCTTGTAAGGTGTCAAGATGAGCAGAAGGAAAGATATTTGATGTCCATCCTCAGTGCTACAAGACATAAGGCACACTCTTTTTTGTGTGGACTGTT
CACTGGAACAGAACAAAGGAAACAACCAGTTTTGTCTTTTGTAAGGGAGTTTTTTCAGGGCATCGACTATGAGCTGGAAGAGAGCAGTGCAAGATACTTTGTCAATGTCC
TCCTCAATTATCTCATTCTCATGGGACAAATAAATCGAGCTCGTTGTATTTGGAAAGTTGCTTACGAGAATAAGCTCTTCCCAAAAGCCATTGTCTTTGATCAACACATT
GCCTGGTCCCTCGACATTCGGAACTTGTCGGTTGGAGCTGCTCTTATAGCAGTTGTGCACACTCTCCATCGGTTCAGGAAGCGAATGTTGTATTATGGAATAGTTCCGAG
GCGCATAAAATTGGTTACGGGACCTACTCTGAAGCTTGTGGTTGCTCAAATGTTGAGCTCTGTGGAATCCCCATTTGAGGTCAGTAAAGTAGTTCTGAGAGCAACAGGAG
ACTCTGTGATGGAGTGGTTCAAAAAACCAATCGTCCAACAATTCCTTCTGAATGAGATTCCATCAAGATCAGATATCCTAATGCACAAGTTGAATACTCTCTTTCCCAGT
TCAGCACCTGAAATTAGATCTCTTTCACCTCCCAAACCCCTCATTTCCCGGAATTCAGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGGAGTGTTTCTGTGTGCTGCCGAATTTGCTGGTAATCTTCCATCCTTCAAATGGATCCGAGTCATAGCAAGATTTACTACTGTTCTATATCAAGGGGCGAACGA
GATTGAGAATTTAGCTGATTTATGCTTGAAAAGTGCCCCTCCCTATCACAAATGGTATTTTGAAACATTGGGTAAAAAGACTTTTGGGTTCTTGATGAAAGAAGGGTATA
TTTATTTTGCAATTGCTGATGAAGTTATAGGTAATCAGGCTCTTCTCCAGTTTCTAGAGCAGTTGAGAGATGAATTCAAAAGGGTAGCAAAGAAGGGTTCACGAGGGAGT
TTCTCAAGTATGAACTCCATTAGTCTGCAGGAACAGCTAGTGCCTGCAATTTGTAAGTTAATCACCTCGCTGGAGAATGTCTCCAACAATAGTAAGGATTGGATGGCTGA
AACACCCTCCTCAAACAATGGTCTGTCTCCATCTCCCAGCAACGTGAATGGCCAAGTCGACGTTTTGGCTTCCACGAAAGCCCCTCTGCTGGGGAAGTCAAACAAACCAG
AGAAGAAAAAGGGAAAAGACCATGTGATAACAATGAGAGGTATTGAGATGGAGGAGCACAGAAAATCTACAGATAGAGGTTTGAAGATTGATTCAGTGTCTTTGGAATCC
AGTAATCAGGGTGGATCAGGGTCTTCAATTCCGGTGCAGAAAGACAGCAATCCATTGAGGAGATCGAACTCTCAAAGCATCCGGCGAAGGTGGTGGCGTCATGTGAGGAT
CGCGCTTGCTGTTGATGCTGCTGTATGTCTGATATTGCTCACTTCTGCGGATTTCTGCAACTTAATAGTCACTTTTGCTGGCGATATCTGTTTTGGGCTCTTAACTGATA
ATGATATTTCACTTTGGTTTGGATCCATGTTCGTTCTAATGCTTATTAATTTGGTATCGCTTTCGAGATTCAAGGAGACGATTGGTCGTGATTTGATGTCATATCGAAAT
CACCGACGCCCAGTTGTTTCCATTGTTTTAGATCAATTTCGGAATTCTGAGGCGCAACTGAGGAATTGGACAACAAAGCGGCTCTGCCGGCTCCGGCGTTCGAACCTCTC
TCCGATCTGTAAAACCCTTACACTCCTACCCGATTCTTCCATCTCTTCCAGCTTCTCAAAACATATGCGAGGCGTCGGGTCAATCGCGCTGGCGGACTATCTATTAAGCA
CTCTTGGCCAATTCGGACGGGATGGCCGATACTGCGATCGACCAAGCGGTATAAATCAATCTTATGATGAAGCCAACAAGACCCAGAAGAATGTTTTTGACTTGGAATCA
TTTGTTGGGGACTTGACTGTCGAGGAAGATGCTTGCAGTGATGACATTTCATTGGAAGGACTGCAGCAAGAACTCGAAGAATGTAAAGATGATGATCATGGCTTGAACTA
TCCAGAGACAGTAGTTACGAACATACTTTCTAAAGGTGTAAAACTAAGGGAGTACACAAAGGGAGTTGAAAACAATTTGCGGCAGGTTGAGCTGGATTCCATACAGGAAT
ATATTAAAGAAAGTGATAACTTAGTCTCGCTTCATGAACAAATTCGTGACTGTGACAACATTTTATCACAGATGGAAACTCTTCTCAGTGGTTTCCAGGCCGAAATTGGT
TCTATTAGTGCAGACATTAAGGTCCTGCAGGAGAAGTCAATGGATATGGGTCTGAAGCTCAAGAATCGCAAGGTGGCGGAATCAAAATTGGCAAAGTTTGTCGAAGAGAT
CATTGTCCCTCCAAGGATGATAGACATAATTGTTGATGGAGAGGTGAATGATGAATATTTGAGAACTCTTGAGATCCTAAGTAAGAAACTGGTGGTCGTGGAAGTGGATC
CAATGATAAAAAATTCCAAAGCACTGAAGGATGTACAACCTGAGCTTGAAAAACTCCGGCAGAAAGCAGTGTCTAAGGACTCTAGTAGTACAGCAGTAGTTAAATATGTA
GAAAAGCCATTAAATTTGAAACCACCTTCCAGCTTTACTGTGAGTAGCTTTTATAACACCAGAGATGACTTTGGCAGTTCAGGAACAGTCTTTGACTTCCTTGTCCAGAA
ACTCCATGCTTTGAGAAAACCCAAGACAAATATTCAGATCCTTCAACAGAGTGTTCTTTTGAAGTACAAGAATCTGATTCTACCAATAACCTATTCTTCGTTGTCTGGAG
GGAAAATAATTCCATGTATCTATCACCATCGAGACAACGATCTCCAAGAGAAGCCAGACTTTCCAAAGATAACACCAGCCATAATGAACTGTGAATTCTTGAGCTTGTAT
GTTATTTCTTTCCTCAAGGATCATAGCAAAGAAGTTTATAATGAGGTTCGGACAGCATATATTGATACAATGAACAAGGTTTTGAGTGCACATTTCCGTGCCTACATACA
GGCACTAGAGAAATTACAGTTAGACATAGCGACATCTAGTGATCTGATTGGGGTTGAGGCTAGAAGTTCAGGTCTTTTCTTAAGAGGAAGGGAACCACTGAAGAACCGAT
CTGCTGTTTTTGCACTTGGTGATAGAATAAAGATATTAAAGATTTTTTTTATGAAACGGAAACAAGGCTTTTCATGTGATAAAAACATTGGACTTGCTACACGTCCTGAT
GTAAACTTTATTCAGGAAGTTGACGAACCTGCACTGATTCCACATATTGCTGAAGCCAGTTCAATTAAGTATCCTTACGAAGTTCTCTTTAGGAGCTTGCATAAGCTTCT
AATGGACACTGCCACTTCTGAGTACTTAACAGTTTACTCTTCCTTAGTTTATTATCTTCAGAATGTTATTTGTAGCAAGTATAACTTTTGTGATGATTTTTTTGGGGAAG
AATCTATATTTTATGAGATCTTTGCGGGTCCTTTTGCTGTCATCGACGAACACTTCAATTCAATTCTCCCAAACAGTTATGATGCAATTGGTCTAATGCTCATGATCCTG
ATCATTCATCAGCATCAGATCAACATATTCAGTTTTCTATTTGATCTTTTCTCTGCTTTCCAACGTGTTGAACATCTTGCAACACATCTACATCCTCGTTCACTTGGTTG
CATCTTCTATTACATACAATATATCCATTGGCTGACAATGCTTGTATGTCCAGCTCATAATGTCACGGCGGCGTATACCTTGCTTGGATTCATATTTAGACAAGATATCA
TTTCCATTAATCTCGATGGTGTTAATATTGCTCTTTGGCCTCGTTTTAAGATGGTATTTGACATGCACCTCAGCAGTCTGAGGAATGCAAATGTAAAGACGTTGTGGGAA
GATGATGTGCATCCTCACTATGTCATGAGGCGATACGCTGAGTTTACAGCTTCACTTATCCATCTGAATGTCGAATATGGAGATGGACAGCTTGATTTGAACTTGGAGCG
GCTAAGAATGGCAATTGATGATTTGCTTATCAAGCTTGCCAAAACCTTTTCAAAAGCTAAATCTCAGACAGTGTTTCTTATAAACAACTACGATATGACAATTTCCGTCT
TGAAGGAGTTCATTGGAAAGAAATGGTCTTCTCACAATTTTCAACCCCTCATTGAGAAAACAGAAAGAACTTCTTGGACAAGATTATTTGTTTATGGCTCTACGGAGGAA
GCCGGTCCGGAGGGTGGGAAAATTCTGGTGCACTTCGAGGATCTGCTGAAGAGCAACACAGCCTTGTTTGTGGCAACGGAGGACCTAAGTTCGAATCCTGACCGACCAAT
TACTGTGGCTGAAGTAGAGCCCCTGGTAAAGGACTTTGCGAGCAGATGGAAAGCTGCAATTGAGCTGATGCACAAAGATGTCATCACTTCTTTCAGTAATTTCTTGTGTG
GAATGGAAATCTTAAGGGCAGCTTTGACTCAACTCCTCCTTTATTACACAAGACTCTCGGATTGTATAAAGAGGATCGTCGGTGGCTCTGCTCTAAACAAGGATCTAGTG
TCCATTTCTTCAATTATGTGCCAATGCAGCTGTGTTCATGTTTGGTTATGCAGAACTGCTTTCGTTTTGGTTCATCAGAAAATATCCAGTCAATGCTCGAGGTTGTGGCT
TTACTCTTCTATCAAATTTCAGTGCCAAGACATTACATTACCTGTGTCACCATTAGTTAATTTGCTGTGTACAAGTTTTACCATGGAACCTAATACACAATCTTTTCCCC
TAAATCACCCCCACCAGAAAAATCCGAGCGGAACAGGAAGAAGTCCAGTTAAATTTGAAGTATCAAACCAAAACTCAAGTAGGCCGCCGTATTGTGAATGTGGGCAGGCC
GGCAGGGCAGGTCGAGACTGGGCTGAGCATCAAAAGGAGGCGGAACTGGAAGACAAGACATTACCAGCCGATTGTAAAGCTTTGCCCAGTTCGCTTTGGGCATTCTCATT
CACCCCATTTCGTTTGAGAAGACAATTTCAAACACAACTTCTCTGCCGCCCACCACCTGTTTGTAAAAATGTCTGCAAGGAAATCGGCAGTTCTTCGCAACAAGCATATG
GGTACTTAAGGAAAAGACCCAGTTTCAGTATGCTTCCTTTTCGTGCTGTACAACTACTTTTCGGGTCCTCCAATTCCCTCCACAAGCGCCGATTTCTTCTGCCGGCTTCC
TTCCTCTTTTCATCGATTTCATGGCGCGAAGCGGAATCTGTTTTAAAACCCAGAAACTCTGAGTTTCTTGAGAAGTCACATGTTTTCAATAATCTTAGTTTTATCAGATC
TTATTGTTCTGGAAAGAACAGTGGCAATGGGGCTAGTGAGTGGACTGAGGATATTGAGTATCTAGATGAGTCGGGGAGTGTTATTTTCTCTGGTAAAGGCGTTCGGTTGG
TTGAACCAGGTCTTGATGGTCATGTAATGGTGGGCGGACTCAAAAAGCCCTTTCTGAATGCATCGGCTGTTGCTAAGATAGTTGAGGTTGTAACGAGGTGGAAATGGGGT
CCAGAGTTGGAATCTCAGCTCGAAAAGCTCCAATTTGTTCCAAATATGACGCACATCACTCAGTCATTGAAGATTATCGATGATGCTGAGGCTTCTTTGAGCTTGTTTCG
TTGGGCTAAGAGGCAGTCCTGGTATTCACCAAATGATGAGTGCTATGGGTTGTTGTTTGATGGGTTAAATCAGAGAAGAGATTTTGATGCAATTCAATTGTTGTTTGATG
AGGTTGTTCGTGATTTGAGCAGTGATGGGACTGTCTCATTCAGTGCATATAATCGTGTGATTCAGTACTTGGCTAAAGCTGAGAAATTGGAAGTGTCTTTCTGTTGTTTT
AAGAAGATTCATGATTCAGGTTTCAAGGTTGATACTCAAACATACAATTCTCTTATAACCTTGTTCTTAAACAAGGGTCTGCCTTACAAGGCTTTCGAGATATACGAGAG
CATGGCAGGAGCAGGGTGTTCTTTAGATGCATCTACCTTTGAGCTGATGATACCAAGTTTGGCAAAATCGGGTCGTCTTGATGCAGCAATGAAGCTCTTTCAAGAGATGA
AAGAGAGGAATTATCGTCCCGCCCTGAATGTTTATACATCCCTTGTGGATTCTATGGGGAAAGCTGGGAGGCTCGACACATCGATGAAGATTTACATGGAAATGCAGCTG
CTTGAGCTCAGACCGCCTGCTTCAATGTTTGTTTCCTTAATTGAGTCACATGTGAAGGCTGGGAAATTGGATTCTGCTCTCAAGCTTTGGGATGAGATGAAAAGGGCAGG
TTTTAGGCCTAACTTTGGTTTGTACTCTATGGTTGTTGAGTCACATGCCAAATCAGGGAAACTTGATGTTGCAATGTCTATCTTCACCGAAATGGAGAAAGCTGGATTTC
TTCCCATCCCATCTACTTATTGCTGTCTCTTGGAAATGCACGCAGCGTCGGGACAAGTAGATGCTGCCATGAAACTCTACAACTCTATGACTAATGCAGGTTTGAGGCTC
GGGTTAAGTACGTACACTGCTTTATTGACACTTCTGGCTAATAAGAAGCTTATCGATATTGCTGCAAAAGTTTTACTTGAAATGAAGGCCATGGGATTCTCTGTTGATGT
GAGCGCTAGCGATGTCTTGATGGTGTATATCAAGGAAGGCTCTATCGATTCCGCTTTGAGGTGGCTTCAGTTCATGGGTTCATCTGGAATAAGAACTAATAGCTTTATTC
TCAGGCAATTGTTTGAGTCATGCATGAAGAAAGGGATGTATGAGTCAGCTATGCCTCTCTTAGAAACTTATGTAGATTCTGCTGCTAAAGTTGATCTTATACTCTACACA
TCCATCCTGGCCCATCTTGTAAGGTGTCAAGATGAGCAGAAGGAAAGATATTTGATGTCCATCCTCAGTGCTACAAGACATAAGGCACACTCTTTTTTGTGTGGACTGTT
CACTGGAACAGAACAAAGGAAACAACCAGTTTTGTCTTTTGTAAGGGAGTTTTTTCAGGGCATCGACTATGAGCTGGAAGAGAGCAGTGCAAGATACTTTGTCAATGTCC
TCCTCAATTATCTCATTCTCATGGGACAAATAAATCGAGCTCGTTGTATTTGGAAAGTTGCTTACGAGAATAAGCTCTTCCCAAAAGCCATTGTCTTTGATCAACACATT
GCCTGGTCCCTCGACATTCGGAACTTGTCGGTTGGAGCTGCTCTTATAGCAGTTGTGCACACTCTCCATCGGTTCAGGAAGCGAATGTTGTATTATGGAATAGTTCCGAG
GCGCATAAAATTGGTTACGGGACCTACTCTGAAGCTTGTGGTTGCTCAAATGTTGAGCTCTGTGGAATCCCCATTTGAGGTCAGTAAAGTAGTTCTGAGAGCAACAGGAG
ACTCTGTGATGGAGTGGTTCAAAAAACCAATCGTCCAACAATTCCTTCTGAATGAGATTCCATCAAGATCAGATATCCTAATGCACAAGTTGAATACTCTCTTTCCCAGT
TCAGCACCTGAAATTAGATCTCTTTCACCTCCCAAACCCCTCATTTCCCGGAATTCAGCATAA
Protein sequenceShow/hide protein sequence
MKGVFLCAAEFAGNLPSFKWIRVIARFTTVLYQGANEIENLADLCLKSAPPYHKWYFETLGKKTFGFLMKEGYIYFAIADEVIGNQALLQFLEQLRDEFKRVAKKGSRGS
FSSMNSISLQEQLVPAICKLITSLENVSNNSKDWMAETPSSNNGLSPSPSNVNGQVDVLASTKAPLLGKSNKPEKKKGKDHVITMRGIEMEEHRKSTDRGLKIDSVSLES
SNQGGSGSSIPVQKDSNPLRRSNSQSIRRRWWRHVRIALAVDAAVCLILLTSADFCNLIVTFAGDICFGLLTDNDISLWFGSMFVLMLINLVSLSRFKETIGRDLMSYRN
HRRPVVSIVLDQFRNSEAQLRNWTTKRLCRLRRSNLSPICKTLTLLPDSSISSSFSKHMRGVGSIALADYLLSTLGQFGRDGRYCDRPSGINQSYDEANKTQKNVFDLES
FVGDLTVEEDACSDDISLEGLQQELEECKDDDHGLNYPETVVTNILSKGVKLREYTKGVENNLRQVELDSIQEYIKESDNLVSLHEQIRDCDNILSQMETLLSGFQAEIG
SISADIKVLQEKSMDMGLKLKNRKVAESKLAKFVEEIIVPPRMIDIIVDGEVNDEYLRTLEILSKKLVVVEVDPMIKNSKALKDVQPELEKLRQKAVSKDSSSTAVVKYV
EKPLNLKPPSSFTVSSFYNTRDDFGSSGTVFDFLVQKLHALRKPKTNIQILQQSVLLKYKNLILPITYSSLSGGKIIPCIYHHRDNDLQEKPDFPKITPAIMNCEFLSLY
VISFLKDHSKEVYNEVRTAYIDTMNKVLSAHFRAYIQALEKLQLDIATSSDLIGVEARSSGLFLRGREPLKNRSAVFALGDRIKILKIFFMKRKQGFSCDKNIGLATRPD
VNFIQEVDEPALIPHIAEASSIKYPYEVLFRSLHKLLMDTATSEYLTVYSSLVYYLQNVICSKYNFCDDFFGEESIFYEIFAGPFAVIDEHFNSILPNSYDAIGLMLMIL
IIHQHQINIFSFLFDLFSAFQRVEHLATHLHPRSLGCIFYYIQYIHWLTMLVCPAHNVTAAYTLLGFIFRQDIISINLDGVNIALWPRFKMVFDMHLSSLRNANVKTLWE
DDVHPHYVMRRYAEFTASLIHLNVEYGDGQLDLNLERLRMAIDDLLIKLAKTFSKAKSQTVFLINNYDMTISVLKEFIGKKWSSHNFQPLIEKTERTSWTRLFVYGSTEE
AGPEGGKILVHFEDLLKSNTALFVATEDLSSNPDRPITVAEVEPLVKDFASRWKAAIELMHKDVITSFSNFLCGMEILRAALTQLLLYYTRLSDCIKRIVGGSALNKDLV
SISSIMCQCSCVHVWLCRTAFVLVHQKISSQCSRLWLYSSIKFQCQDITLPVSPLVNLLCTSFTMEPNTQSFPLNHPHQKNPSGTGRSPVKFEVSNQNSSRPPYCECGQA
GRAGRDWAEHQKEAELEDKTLPADCKALPSSLWAFSFTPFRLRRQFQTQLLCRPPPVCKNVCKEIGSSSQQAYGYLRKRPSFSMLPFRAVQLLFGSSNSLHKRRFLLPAS
FLFSSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWG
PELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCF
KKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQL
LELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRL
GLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYT
SILAHLVRCQDEQKERYLMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHI
AWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTLFPS
SAPEIRSLSPPKPLISRNSA