; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009393 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009393
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat superfamily protein
Genome locationscaffold813:1023655..1038574
RNA-Seq ExpressionMS009393
SyntenyMS009393
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR029058 - Alpha/Beta hydrolase fold
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058740.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0083.88Show/hide
Query:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
        S+AS   SPPTR L+YLL+TCKSMDQLQQ+HCQAIKTGL+ANPVLQN VM+FCCT ++GD +YA HLFDEIPEPN+F+WNTMIRGYSRLD P+LGVSLYL
Subjt:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL

Query:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF
        EMLRR VKPD YTFPFLFKGFTRDIALEYG++ HGHVLKHGLQ+NVFV TALVQMYLLCG +D ARGVLD CSKADVITWNM+ISAYNK GKFEESR+LF
Subjt:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF

Query:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY
        L M+ KQVL TTVTLVL+LSACSKLKDL+TGK VH YV NC+VE +L+LENALIDMYA CGEMD+ALGIFR+M+NRDIISWT++V+GFTNLGEIDVARNY
Subjt:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY

Query:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR
        FDKMPEKDYVSWTAMI+GY+  NRFKEALELFRNMQ TNV+PDEFTMVS+L ACAHLGALELGEWI+TYI+RNKINND FVRNALIDMYFKCG+VDKA+R
Subjt:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR

Query:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA
        +F+EM+QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASI PDE+TYIGVLSACTHTG+VD+GR++F  MT+QH IEPNIAHYGCLVDLLARAGRLKEA
Subjt:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA

Query:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD
        + V++NMP+K NSIVWGALLAGCRV++EADMAEM    IL+LEP+NGAVYVLLCNIYAACKRWN+LRELRQ MMDKGIKKTPGCSLIEMNG VHEFVAGD
Subjt:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD

Query:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        RSHPQTK I  KL+KMTQDLKLAGYSPDISEVFLD+AEEDKEN+VFRHSEKLAIAFGL+NS PG TIRI KNLRMCMDCH+MAKLVS+VY REVI
Subjt:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

XP_008461137.2 PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g15930 [Cucumis melo]0.0e+0083.74Show/hide
Query:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
        S+AS   SPPTR L+YLL+TCKSMDQLQQ+HCQAIKTGL+ANPVLQN VM+FCCT ++GD +YA HLFDEIPEPN+F+WNTMIRGYSRLD P+LGVSLYL
Subjt:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL

Query:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF
        EMLRR VKPD YTFPFLFKGFTRDIALEYG++ HGHVLKHGLQ+NVFV TALVQMYLLCG +D ARGVLD CSKADVITWNM+ISAYNK GKFEESR+LF
Subjt:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF

Query:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY
        L M+ KQVL TTVTLVL+LSACSKLKDL+TGK VH YV NC+VE +L+LENALIDMYA CGEMD+ALGIFR+M+NRDIISWT++V+GFTNLGEIDVARNY
Subjt:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY

Query:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR
        FDKMPEKDYVSWTAMI+GY+  NRFKEALELFRNMQ TNV+PDEFTMVS+L ACAHLGALELGEWI+TYI+RNKINND FVRNALIDMYFKCG+VDKA+R
Subjt:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR

Query:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA
        +F+EM+QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASI PDE+TYIGVLSACTHTG+VD+GR++F  MT+QH IEPNIAHYGCLVDLLARAGRLKEA
Subjt:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA

Query:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD
        + V++NMP+K NSIVWGALLAGCRV++EADMAEM    IL+LEP+NGAVYVLLCNIYAACKRWN+LRELRQ MMDKGIKK PGCSLIEMNG VHEFVAGD
Subjt:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD

Query:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        RSHPQTK I  KL+KMTQDLKLAGYSPDISEVFLD+AEEDKEN+VFRHSEKLAIAFGL+NS PG TIRI KNLRMCMDCH+MAKLVS+VY REVI
Subjt:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

XP_022145389.1 putative pentatricopeptide repeat-containing protein At3g15930 [Momordica charantia]0.0e+0099.71Show/hide
Query:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
        STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
Subjt:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL

Query:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF
        EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF
Subjt:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF

Query:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY
        LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY
Subjt:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY

Query:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR
        FDKMPEKDYVSWTAMINGYL VNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR
Subjt:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR

Query:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA
        VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA
Subjt:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA

Query:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD
        HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGA YVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD
Subjt:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD

Query:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
Subjt:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

XP_022991386.1 putative pentatricopeptide repeat-containing protein At3g15930 [Cucurbita maxima]0.0e+0085.16Show/hide
Query:  TASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLE
        TAS PLS  T  L+ LL+ C+SMDQLQQIHC+AIKTGL ANPVLQN VM FCCTHE GDLKYA HLFDE+PEPN+F+WNTMIRGYSRLDSPELGVSLYLE
Subjt:  TASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLE

Query:  MLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFL
        MLRR VKPD Y+FPFLFKGFTRDIAL+ G+E HGHVLKHGL SNVFV TALVQMYLLCGL+D ARGVLD+ SKADVI WNMMI+AYNK GKFEESR+LFL
Subjt:  MLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFL

Query:  GMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYF
        GM+EKQVLPTTVTLVLILSACSKLKD KTGK VH  VNNC+VE +L+LENALIDMYAACGEMD+ALGIFRNM+N+DIISWT++V+GFTNLGEIDVARNYF
Subjt:  GMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYF

Query:  DKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRV
        D+MPEKD VSWTAMI+GYL  NRFKEA +LFR+MQ T+V+PDEFTMVSIL ACA LGALELGEWIKTYID+NKINNDAFVRNALIDMYFKCGNVDKA+RV
Subjt:  DKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRV

Query:  FKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAH
        F+EM+QRDKFTWT +IVGLAVNGHGEKALD+FSKML+ASI PD+VTYIGVLSACTHTGMVD+GREFF SMTTQH IEPNI HYGCLVDLLARAGRLKEAH
Subjt:  FKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAH

Query:  QVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDR
        +V++NMP++PNSIVWGALLAGCRVH+EA+MAEM A QIL+LEPENGAVYVLLCNIYAACKRWNDLR+LRQ MMDKGIKK PGCSLIEMNGTVHEFVAGDR
Subjt:  QVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDR

Query:  SHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        SHPQTKEI VKLEKMTQDLK AGYSPDIS+VFLDIAEEDKEN+VFRHSEKLAIAFGL+NS PG TIRIVKNLRMC+DCHS+AKL+S+VY REVI
Subjt:  SHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

XP_031744195.1 putative pentatricopeptide repeat-containing protein At3g15930 [Cucumis sativus]0.0e+0083.45Show/hide
Query:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
        S+A    SPPT  L+ LL+TC+SMDQLQQ+HCQAIK GL+ANPVLQN VMTFCCTHE+GD +YA  LFDEIPEPN+F+WNTMIRGYSRLD P+LGVSLYL
Subjt:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL

Query:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF
        EMLRR VKPD YTFPFLFKGFTRDIALEYG++ HGHVLKHGLQ NVFV TALVQMYLLCG +D ARGV D C KADVITWNM+ISAYNK GKFEESR+LF
Subjt:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF

Query:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY
        L M++KQVLPTTVTLVL+LSACSKLKDL+TGK VH YV NC+VE +L+LENA+IDMYA CGEMD+ALGIFR+M+NRDIISWT++V+GFTNLGEIDVARNY
Subjt:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY

Query:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR
        FDKMPEKDYVSWTAMI+GY+  NRFKEALELFRNMQ TNV+PDEFTMVS+L ACAHLGALELGEWI+TYIDRNKI ND FVRNALIDMYFKCG+VDKA+ 
Subjt:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR

Query:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA
        +F+EM+QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASI PDE+TYIGVLSACTHTG+VD+GR++F  MT+QH IEPNIAHYGCLVDLLARAGRLKEA
Subjt:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA

Query:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD
        ++V+ENMP+K NSIVWGALLAGCRV++E+DMAEM   QIL+LEP+NGAVYVLLCNIYAACKRWNDLRELRQ MMDKGIKKTPGCSLIEMNG VHEFVAGD
Subjt:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD

Query:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        RSHPQTK I  KL+KMTQDLKLAGYSPDISEVFLDIAEEDKEN+VFRHSEKLAIAFGL+NS PG TIRI KNLRMCMDCH+MAKLVS+VY REVI
Subjt:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

TrEMBL top hitse value%identityAlignment
A0A0A0K6A7 DYW_deaminase domain-containing protein0.0e+0079.02Show/hide
Query:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
        S+A    SPPT  L+ LL+TC+SMDQLQQ+HCQAIK GL+ANPVLQN VMTFCCTHE+GD +YA  LFDEIPEPN+F+WNTMIRGYSRLD P+LGVSLYL
Subjt:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL

Query:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF
        EMLRR VKPD YTFPFLFKGFTRDIALEYG++ HGHVLKHGLQ NVFV TALVQMYLLCG +D ARGV D C KADVITWNM+ISAYNK GKFEESR+LF
Subjt:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF

Query:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY
        L M++KQVLPTTVTLVL+LSACSKLKDL+TGK VH YV NC+VE +L+LENA+IDMYA CGEMD+ALGIFR+M+NRDIISWT++V+GFTNLGEIDVARNY
Subjt:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY

Query:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR
        FDKMPEKDYVSWTAMI+GY+  NRFKEALELFRNMQ TNV+PDEFTMVS+L ACAHLGALELGEWI+TYIDRNKI ND FVRNALIDMYFKCG+VDKA+ 
Subjt:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR

Query:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA
        +F+EM+QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASI PDE+TYIGVLSACTHTG+VD+GR++F  MT+QH IEPNIAHYGCLVDLLARAGRLKEA
Subjt:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA

Query:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD
        ++V+ENMP+K NSIVWGALLAGCRV++E+DMAEM   QIL+LEP+NGAVYVLLCNIYAACKRWNDLRELRQ MMDKGIKKTPGCSLIEMNG VHEFVAGD
Subjt:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD

Query:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSE---VYTREVIGS
        RSHPQTK I  KL+KMTQDLKLAGYSPDISEVFLDIAEEDKEN+VFRHSEKLAIAFGL+NS PG TIRI KNLRMCMDCH+MAKLVS+      R ++G+
Subjt:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSE---VYTREVIGS

Query:  YK--ITTLCFSTFFFFSFQLYLKHQEKLISTI-LSRISAGFASGRWD--KPSLVVWGISDKYLPQSIAEEFQK-QNSTTTKLKLIEGAGHMPQEDW
            +  L +S+      +L+LK QEKLIS + L +I+AGFASGRWD    +L+    SDKYLPQSI +EFQK  NSTT +LKL+EGAGHM QEDW
Subjt:  YK--ITTLCFSTFFFFSFQLYLKHQEKLISTI-LSRISAGFASGRWD--KPSLVVWGISDKYLPQSIAEEFQK-QNSTTTKLKLIEGAGHMPQEDW

A0A1S3CDK0 LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g159300.0e+0083.74Show/hide
Query:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
        S+AS   SPPTR L+YLL+TCKSMDQLQQ+HCQAIKTGL+ANPVLQN VM+FCCT ++GD +YA HLFDEIPEPN+F+WNTMIRGYSRLD P+LGVSLYL
Subjt:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL

Query:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF
        EMLRR VKPD YTFPFLFKGFTRDIALEYG++ HGHVLKHGLQ+NVFV TALVQMYLLCG +D ARGVLD CSKADVITWNM+ISAYNK GKFEESR+LF
Subjt:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF

Query:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY
        L M+ KQVL TTVTLVL+LSACSKLKDL+TGK VH YV NC+VE +L+LENALIDMYA CGEMD+ALGIFR+M+NRDIISWT++V+GFTNLGEIDVARNY
Subjt:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY

Query:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR
        FDKMPEKDYVSWTAMI+GY+  NRFKEALELFRNMQ TNV+PDEFTMVS+L ACAHLGALELGEWI+TYI+RNKINND FVRNALIDMYFKCG+VDKA+R
Subjt:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR

Query:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA
        +F+EM+QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASI PDE+TYIGVLSACTHTG+VD+GR++F  MT+QH IEPNIAHYGCLVDLLARAGRLKEA
Subjt:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA

Query:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD
        + V++NMP+K NSIVWGALLAGCRV++EADMAEM    IL+LEP+NGAVYVLLCNIYAACKRWN+LRELRQ MMDKGIKK PGCSLIEMNG VHEFVAGD
Subjt:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD

Query:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        RSHPQTK I  KL+KMTQDLKLAGYSPDISEVFLD+AEEDKEN+VFRHSEKLAIAFGL+NS PG TIRI KNLRMCMDCH+MAKLVS+VY REVI
Subjt:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

A0A5A7UUL4 Putative pentatricopeptide repeat-containing protein0.0e+0083.88Show/hide
Query:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
        S+AS   SPPTR L+YLL+TCKSMDQLQQ+HCQAIKTGL+ANPVLQN VM+FCCT ++GD +YA HLFDEIPEPN+F+WNTMIRGYSRLD P+LGVSLYL
Subjt:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL

Query:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF
        EMLRR VKPD YTFPFLFKGFTRDIALEYG++ HGHVLKHGLQ+NVFV TALVQMYLLCG +D ARGVLD CSKADVITWNM+ISAYNK GKFEESR+LF
Subjt:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF

Query:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY
        L M+ KQVL TTVTLVL+LSACSKLKDL+TGK VH YV NC+VE +L+LENALIDMYA CGEMD+ALGIFR+M+NRDIISWT++V+GFTNLGEIDVARNY
Subjt:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY

Query:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR
        FDKMPEKDYVSWTAMI+GY+  NRFKEALELFRNMQ TNV+PDEFTMVS+L ACAHLGALELGEWI+TYI+RNKINND FVRNALIDMYFKCG+VDKA+R
Subjt:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR

Query:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA
        +F+EM+QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASI PDE+TYIGVLSACTHTG+VD+GR++F  MT+QH IEPNIAHYGCLVDLLARAGRLKEA
Subjt:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA

Query:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD
        + V++NMP+K NSIVWGALLAGCRV++EADMAEM    IL+LEP+NGAVYVLLCNIYAACKRWN+LRELRQ MMDKGIKKTPGCSLIEMNG VHEFVAGD
Subjt:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD

Query:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        RSHPQTK I  KL+KMTQDLKLAGYSPDISEVFLD+AEEDKEN+VFRHSEKLAIAFGL+NS PG TIRI KNLRMCMDCH+MAKLVS+VY REVI
Subjt:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

A0A6J1CV36 putative pentatricopeptide repeat-containing protein At3g159300.0e+0099.71Show/hide
Query:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
        STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
Subjt:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL

Query:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF
        EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF
Subjt:  EMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLF

Query:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY
        LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY
Subjt:  LGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNY

Query:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR
        FDKMPEKDYVSWTAMINGYL VNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR
Subjt:  FDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQR

Query:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA
        VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA
Subjt:  VFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEA

Query:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD
        HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGA YVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD
Subjt:  HQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGD

Query:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
Subjt:  RSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

A0A6J1JQK8 putative pentatricopeptide repeat-containing protein At3g159300.0e+0085.16Show/hide
Query:  TASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLE
        TAS PLS  T  L+ LL+ C+SMDQLQQIHC+AIKTGL ANPVLQN VM FCCTHE GDLKYA HLFDE+PEPN+F+WNTMIRGYSRLDSPELGVSLYLE
Subjt:  TASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLE

Query:  MLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFL
        MLRR VKPD Y+FPFLFKGFTRDIAL+ G+E HGHVLKHGL SNVFV TALVQMYLLCGL+D ARGVLD+ SKADVI WNMMI+AYNK GKFEESR+LFL
Subjt:  MLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFL

Query:  GMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYF
        GM+EKQVLPTTVTLVLILSACSKLKD KTGK VH  VNNC+VE +L+LENALIDMYAACGEMD+ALGIFRNM+N+DIISWT++V+GFTNLGEIDVARNYF
Subjt:  GMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYF

Query:  DKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRV
        D+MPEKD VSWTAMI+GYL  NRFKEA +LFR+MQ T+V+PDEFTMVSIL ACA LGALELGEWIKTYID+NKINNDAFVRNALIDMYFKCGNVDKA+RV
Subjt:  DKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRV

Query:  FKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAH
        F+EM+QRDKFTWT +IVGLAVNGHGEKALD+FSKML+ASI PD+VTYIGVLSACTHTGMVD+GREFF SMTTQH IEPNI HYGCLVDLLARAGRLKEAH
Subjt:  FKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAH

Query:  QVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDR
        +V++NMP++PNSIVWGALLAGCRVH+EA+MAEM A QIL+LEPENGAVYVLLCNIYAACKRWNDLR+LRQ MMDKGIKK PGCSLIEMNGTVHEFVAGDR
Subjt:  QVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDR

Query:  SHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        SHPQTKEI VKLEKMTQDLK AGYSPDIS+VFLDIAEEDKEN+VFRHSEKLAIAFGL+NS PG TIRIVKNLRMC+DCHS+AKL+S+VY REVI
Subjt:  SHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148206.1e-15438.41Show/hide
Query:  MPLSPP-----TRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPE-PNVFLWNTMIRGYSRLDSPELGVSL
        M L PP        +L  L  CKS++ ++Q+H   ++T +  N  L + +     +    +L YA ++F  IP  P   ++N  +R  SR   P   +  
Subjt:  MPLSPP-----TRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPE-PNVFLWNTMIRGYSRLDSPELGVSL

Query:  YLEMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRK
        Y  +     + D ++F  + K  ++  AL  G E HG   K     + FV+T  + MY  CG ++ AR V D  S  DV+TWN MI  Y + G  +E+ K
Subjt:  YLEMLRRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRK

Query:  LFLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVAR
        LF  M++  V+P  + L  I+SAC +  +++  + ++ ++    V     L  AL+ MYA  G MD A   FR MS R++   T++V+G++  G +D A+
Subjt:  LFLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVAR

Query:  NYFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKA
          FD+  +KD V WT MI+ Y+  +  +EAL +F  M  + ++PD  +M S+++ACA+LG L+  +W+ + I  N + ++  + NALI+MY KCG +D  
Subjt:  NYFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKA

Query:  QRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLK
        + VF++M +R+  +W++MI  L+++G    AL +F++M + ++ P+EVT++GVL  C+H+G+V+EG++ F SMT ++NI P + HYGC+VDL  RA  L+
Subjt:  QRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLK

Query:  EAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVA
        EA +V+E+MP+  N ++WG+L++ CR+H E ++ + AA +IL+LEP++    VL+ NIYA  +RW D+R +R+ M +K + K  G S I+ NG  HEF+ 
Subjt:  EAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVA

Query:  GDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPG------FTIRIVKNLRMCMDCHSMAKLVSEVYTR
        GD+ H Q+ EIY KL+++   LKLAGY PD   V +D+ EE+K++ V  HSEKLA+ FGL+N +          IRIVKNLR+C DCH   KLVS+VY R
Subjt:  GDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPG------FTIRIVKNLRMCMDCHSMAKLVSEVYTR

Query:  EVI
        E+I
Subjt:  EVI

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic2.3e-16941.06Show/hide
Query:  SMPLSPPTR----RLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLY
        S P  P T     R + L++ C S+ QL+Q H   I+TG  ++P   + +           L+YA  +FDEIP+PN F WNT+IR Y+    P L +  +
Subjt:  SMPLSPPTR----RLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLY

Query:  LEML-RRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRK
        L+M+      P+ YTFPFL K      +L  G+  HG  +K  + S+VFV  +L+  Y  CG +D A  V  +  + DV++WN MI+ + + G  +++ +
Subjt:  LEML-RRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRK

Query:  LFLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVAR
        LF  M+ + V  + VT+V +LSAC+K+++L+ G+ V  Y+   +V  +L L NA++DMY  CG ++ A  +F  M  +D ++WT+++ G+    + + AR
Subjt:  LFLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVAR

Query:  NYFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQV-TNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDK
           + MP+KD V+W A+I+ Y    +  EAL +F  +Q+  N++ ++ T+VS L+ACA +GALELG WI +YI ++ I  +  V +ALI MY KCG+++K
Subjt:  NYFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQV-TNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDK

Query:  AQRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRL
        ++ VF  + +RD F W+AMI GLA++G G +A+DMF KM +A++ P+ VT+  V  AC+HTG+VDE    F  M + + I P   HY C+VD+L R+G L
Subjt:  AQRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRL

Query:  KEAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFV
        ++A + +E MP+ P++ VWGALL  C++H   ++AEMA  ++L+LEP N   +VLL NIYA   +W ++ ELR+ M   G+KK PGCS IE++G +HEF+
Subjt:  KEAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFV

Query:  AGDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEED-KENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        +GD +HP ++++Y KL ++ + LK  GY P+IS+V   I EE+ KE ++  HSEKLAI +GL++++    IR++KNLR+C DCHS+AKL+S++Y RE+I
Subjt:  AGDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEED-KENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.0e-16439.3Show/hide
Query:  LYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGD-LKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLRRDVKPDGYT
        L LL  CK++  L+ IH Q IK GLH      + ++ FC    H + L YA  +F  I EPN+ +WNTM RG++    P   + LY+ M+   + P+ YT
Subjt:  LYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGD-LKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLRRDVKPDGYT

Query:  FPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMA------------------------RGVLDSCSK-------ADVITWNM
        FPF+ K   +  A + G++ HGHVLK G   +++V T+L+ MY+  G ++ A                        RG +++  K        DV++WN 
Subjt:  FPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMA------------------------RGVLDSCSK-------ADVITWNM

Query:  MISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWT
        MIS Y + G ++E+ +LF  M +  V P   T+V ++SAC++   ++ G+ VH ++++     +L + NALID+Y+ CGE++ A G+             
Subjt:  MISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWT

Query:  SVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDR--NKINNDAF
                          F+++P KD +SW  +I GY  +N +KEAL LF+ M  +   P++ TM+SIL ACAHLGA+++G WI  YID+    + N + 
Subjt:  SVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDR--NKINNDAF

Query:  VRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPN
        +R +LIDMY KCG+++ A +VF  +  +   +W AMI G A++G  + + D+FS+M K  I PD++T++G+LSAC+H+GM+D GR  FR+MT  + + P 
Subjt:  VRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPN

Query:  IAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKK
        + HYGC++DLL  +G  KEA +++  M M+P+ ++W +LL  C++H   ++ E  A  ++++EPEN   YVLL NIYA+  RWN++ + R  + DKG+KK
Subjt:  IAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKK

Query:  TPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCH
         PGCS IE++  VHEF+ GD+ HP+ +EIY  LE+M   L+ AG+ PD SEV  ++ EE KE A+  HSEKLAIAFGL++++PG  + IVKNLR+C +CH
Subjt:  TPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCH

Query:  SMAKLVSEVYTREVI
           KL+S++Y RE+I
Subjt:  SMAKLVSEVYTREVI

Q9LSB8 Putative pentatricopeptide repeat-containing protein At3g159301.4e-22256.96Show/hide
Query:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
        ST +  +S    R + +L  CK+ DQ +Q+H Q+I  G+  NP  Q  +  F C+   G + YA+ LF +IPEP+V +WN MI+G+S++D    GV LYL
Subjt:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL

Query:  EMLRRDVKPDGYTFPFLFKGFTRD-IALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKL
         ML+  V PD +TFPFL  G  RD  AL  GK+ H HV+K GL SN++VQ ALV+MY LCGL+DMARGV D   K DV +WN+MIS YN+  ++EES +L
Subjt:  EMLRRDVKPDGYTFPFLFKGFTRD-IALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKL

Query:  FLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARN
         + M+   V PT+VTL+L+LSACSK+KD    K VH YV+ C+ E SL LENAL++ YAACGEMD A+ IFR+M  RD+ISWTS+V G+   G + +AR 
Subjt:  FLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARN

Query:  YFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQ
        YFD+MP +D +SWT MI+GYL    F E+LE+FR MQ   + PDEFTMVS+L ACAHLG+LE+GEWIKTYID+NKI ND  V NALIDMYFKCG  +KAQ
Subjt:  YFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQ

Query:  RVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKE
        +VF +M+QRDKFTWTAM+VGLA NG G++A+ +F +M   SI PD++TY+GVLSAC H+GMVD+ R+FF  M + H IEP++ HYGC+VD+L RAG +KE
Subjt:  RVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKE

Query:  AHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAG
        A++++  MPM PNSIVWGALL   R+H +  MAE+AA +IL+LEP+NGAVY LLCNIYA CKRW DLRE+R+ ++D  IKKTPG SLIE+NG  HEFVAG
Subjt:  AHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAG

Query:  DRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAE
        D+SH Q++EIY+KLE++ Q+   A Y PD SE+  +  +
Subjt:  DRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAE

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226902.0e-15740.96Show/hide
Query:  QIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLR-RDVKPDGYTFPFLFKGFTRDIAL
        QIH   +K G   +  +QN ++ F    E G+L  A  +FDE+ E NV  W +MI GY+R D  +  V L+  M+R  +V P+  T   +     +   L
Subjt:  QIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLR-RDVKPDGYTFPFLFKGFTRDIAL

Query:  EYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILSACSKLKD
        E G++ +  +   G++ N  + +ALV MY+ C  +D+A+ + D    +++   N M S Y + G   E+  +F  M +  V P  ++++  +S+CS+L++
Subjt:  EYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILSACSKLKD

Query:  LKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKE
        +  GK  H YV     E    + NALIDMY  C   D A  IF  MSN+ +++W S+VAG+   GE+D A   F+ MPEK+ VSW  +I+G +  + F+E
Subjt:  LKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKE

Query:  ALELFRNMQ-VTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHG
        A+E+F +MQ    V  D  TM+SI +AC HLGAL+L +WI  YI++N I  D  +   L+DM+ +CG+ + A  +F  +  RD   WTA I  +A+ G+ 
Subjt:  ALELFRNMQ-VTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHG

Query:  EKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVH
        E+A+++F  M++  + PD V ++G L+AC+H G+V +G+E F SM   H + P   HYGC+VDLL RAG L+EA Q++E+MPM+PN ++W +LLA CRV 
Subjt:  EKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVH

Query:  KEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYS
           +MA  AA +I  L PE    YVLL N+YA+  RWND+ ++R +M +KG++K PG S I++ G  HEF +GD SHP+   I   L++++Q     G+ 
Subjt:  KEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYS

Query:  PDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        PD+S V +D+ E++K   + RHSEKLA+A+GL++S  G TIRIVKNLR+C DCHS AK  S+VY RE+I
Subjt:  PDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.1e-16639.3Show/hide
Query:  LYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGD-LKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLRRDVKPDGYT
        L LL  CK++  L+ IH Q IK GLH      + ++ FC    H + L YA  +F  I EPN+ +WNTM RG++    P   + LY+ M+   + P+ YT
Subjt:  LYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGD-LKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLRRDVKPDGYT

Query:  FPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMA------------------------RGVLDSCSK-------ADVITWNM
        FPF+ K   +  A + G++ HGHVLK G   +++V T+L+ MY+  G ++ A                        RG +++  K        DV++WN 
Subjt:  FPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMA------------------------RGVLDSCSK-------ADVITWNM

Query:  MISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWT
        MIS Y + G ++E+ +LF  M +  V P   T+V ++SAC++   ++ G+ VH ++++     +L + NALID+Y+ CGE++ A G+             
Subjt:  MISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWT

Query:  SVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDR--NKINNDAF
                          F+++P KD +SW  +I GY  +N +KEAL LF+ M  +   P++ TM+SIL ACAHLGA+++G WI  YID+    + N + 
Subjt:  SVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDR--NKINNDAF

Query:  VRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPN
        +R +LIDMY KCG+++ A +VF  +  +   +W AMI G A++G  + + D+FS+M K  I PD++T++G+LSAC+H+GM+D GR  FR+MT  + + P 
Subjt:  VRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPN

Query:  IAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKK
        + HYGC++DLL  +G  KEA +++  M M+P+ ++W +LL  C++H   ++ E  A  ++++EPEN   YVLL NIYA+  RWN++ + R  + DKG+KK
Subjt:  IAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKK

Query:  TPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCH
         PGCS IE++  VHEF+ GD+ HP+ +EIY  LE+M   L+ AG+ PD SEV  ++ EE KE A+  HSEKLAIAFGL++++PG  + IVKNLR+C +CH
Subjt:  TPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCH

Query:  SMAKLVSEVYTREVI
           KL+S++Y RE+I
Subjt:  SMAKLVSEVYTREVI

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-17041.06Show/hide
Query:  SMPLSPPTR----RLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLY
        S P  P T     R + L++ C S+ QL+Q H   I+TG  ++P   + +           L+YA  +FDEIP+PN F WNT+IR Y+    P L +  +
Subjt:  SMPLSPPTR----RLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLY

Query:  LEML-RRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRK
        L+M+      P+ YTFPFL K      +L  G+  HG  +K  + S+VFV  +L+  Y  CG +D A  V  +  + DV++WN MI+ + + G  +++ +
Subjt:  LEML-RRDVKPDGYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRK

Query:  LFLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVAR
        LF  M+ + V  + VT+V +LSAC+K+++L+ G+ V  Y+   +V  +L L NA++DMY  CG ++ A  +F  M  +D ++WT+++ G+    + + AR
Subjt:  LFLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVAR

Query:  NYFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQV-TNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDK
           + MP+KD V+W A+I+ Y    +  EAL +F  +Q+  N++ ++ T+VS L+ACA +GALELG WI +YI ++ I  +  V +ALI MY KCG+++K
Subjt:  NYFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQV-TNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDK

Query:  AQRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRL
        ++ VF  + +RD F W+AMI GLA++G G +A+DMF KM +A++ P+ VT+  V  AC+HTG+VDE    F  M + + I P   HY C+VD+L R+G L
Subjt:  AQRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRL

Query:  KEAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFV
        ++A + +E MP+ P++ VWGALL  C++H   ++AEMA  ++L+LEP N   +VLL NIYA   +W ++ ELR+ M   G+KK PGCS IE++G +HEF+
Subjt:  KEAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFV

Query:  AGDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEED-KENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        +GD +HP ++++Y KL ++ + LK  GY P+IS+V   I EE+ KE ++  HSEKLAI +GL++++    IR++KNLR+C DCHS+AKL+S++Y RE+I
Subjt:  AGDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEED-KENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

AT3G15930.1 Pentatricopeptide repeat (PPR) superfamily protein9.7e-22456.96Show/hide
Query:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL
        ST +  +S    R + +L  CK+ DQ +Q+H Q+I  G+  NP  Q  +  F C+   G + YA+ LF +IPEP+V +WN MI+G+S++D    GV LYL
Subjt:  STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYL

Query:  EMLRRDVKPDGYTFPFLFKGFTRD-IALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKL
         ML+  V PD +TFPFL  G  RD  AL  GK+ H HV+K GL SN++VQ ALV+MY LCGL+DMARGV D   K DV +WN+MIS YN+  ++EES +L
Subjt:  EMLRRDVKPDGYTFPFLFKGFTRD-IALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKL

Query:  FLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARN
         + M+   V PT+VTL+L+LSACSK+KD    K VH YV+ C+ E SL LENAL++ YAACGEMD A+ IFR+M  RD+ISWTS+V G+   G + +AR 
Subjt:  FLGMQEKQVLPTTVTLVLILSACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARN

Query:  YFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQ
        YFD+MP +D +SWT MI+GYL    F E+LE+FR MQ   + PDEFTMVS+L ACAHLG+LE+GEWIKTYID+NKI ND  V NALIDMYFKCG  +KAQ
Subjt:  YFDKMPEKDYVSWTAMINGYLCVNRFKEALELFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQ

Query:  RVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKE
        +VF +M+QRDKFTWTAM+VGLA NG G++A+ +F +M   SI PD++TY+GVLSAC H+GMVD+ R+FF  M + H IEP++ HYGC+VD+L RAG +KE
Subjt:  RVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKE

Query:  AHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAG
        A++++  MPM PNSIVWGALL   R+H +  MAE+AA +IL+LEP+NGAVY LLCNIYA CKRW DLRE+R+ ++D  IKKTPG SLIE+NG  HEFVAG
Subjt:  AHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAG

Query:  DRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAE
        D+SH Q++EIY+KLE++ Q+   A Y PD SE+  +  +
Subjt:  DRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAE

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)1.4e-15840.96Show/hide
Query:  QIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLR-RDVKPDGYTFPFLFKGFTRDIAL
        QIH   +K G   +  +QN ++ F    E G+L  A  +FDE+ E NV  W +MI GY+R D  +  V L+  M+R  +V P+  T   +     +   L
Subjt:  QIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLR-RDVKPDGYTFPFLFKGFTRDIAL

Query:  EYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILSACSKLKD
        E G++ +  +   G++ N  + +ALV MY+ C  +D+A+ + D    +++   N M S Y + G   E+  +F  M +  V P  ++++  +S+CS+L++
Subjt:  EYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILSACSKLKD

Query:  LKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKE
        +  GK  H YV     E    + NALIDMY  C   D A  IF  MSN+ +++W S+VAG+   GE+D A   F+ MPEK+ VSW  +I+G +  + F+E
Subjt:  LKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKE

Query:  ALELFRNMQ-VTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHG
        A+E+F +MQ    V  D  TM+SI +AC HLGAL+L +WI  YI++N I  D  +   L+DM+ +CG+ + A  +F  +  RD   WTA I  +A+ G+ 
Subjt:  ALELFRNMQ-VTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHG

Query:  EKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVH
        E+A+++F  M++  + PD V ++G L+AC+H G+V +G+E F SM   H + P   HYGC+VDLL RAG L+EA Q++E+MPM+PN ++W +LLA CRV 
Subjt:  EKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVH

Query:  KEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYS
           +MA  AA +I  L PE    YVLL N+YA+  RWND+ ++R +M +KG++K PG S I++ G  HEF +GD SHP+   I   L++++Q     G+ 
Subjt:  KEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYS

Query:  PDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        PD+S V +D+ E++K   + RHSEKLA+A+GL++S  G TIRIVKNLR+C DCHS AK  S+VY RE+I
Subjt:  PDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.4e-15840.96Show/hide
Query:  QIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLR-RDVKPDGYTFPFLFKGFTRDIAL
        QIH   +K G   +  +QN ++ F    E G+L  A  +FDE+ E NV  W +MI GY+R D  +  V L+  M+R  +V P+  T   +     +   L
Subjt:  QIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLR-RDVKPDGYTFPFLFKGFTRDIAL

Query:  EYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILSACSKLKD
        E G++ +  +   G++ N  + +ALV MY+ C  +D+A+ + D    +++   N M S Y + G   E+  +F  M +  V P  ++++  +S+CS+L++
Subjt:  EYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILSACSKLKD

Query:  LKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKE
        +  GK  H YV     E    + NALIDMY  C   D A  IF  MSN+ +++W S+VAG+   GE+D A   F+ MPEK+ VSW  +I+G +  + F+E
Subjt:  LKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKE

Query:  ALELFRNMQ-VTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHG
        A+E+F +MQ    V  D  TM+SI +AC HLGAL+L +WI  YI++N I  D  +   L+DM+ +CG+ + A  +F  +  RD   WTA I  +A+ G+ 
Subjt:  ALELFRNMQ-VTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHG

Query:  EKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVH
        E+A+++F  M++  + PD V ++G L+AC+H G+V +G+E F SM   H + P   HYGC+VDLL RAG L+EA Q++E+MPM+PN ++W +LLA CRV 
Subjt:  EKALDMFSKMLKASIWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVH

Query:  KEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYS
           +MA  AA +I  L PE    YVLL N+YA+  RWND+ ++R +M +KG++K PG S I++ G  HEF +GD SHP+   I   L++++Q     G+ 
Subjt:  KEADMAEMAANQILQLEPENGAVYVLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYS

Query:  PDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI
        PD+S V +D+ E++K   + RHSEKLA+A+GL++S  G TIRIVKNLR+C DCHS AK  S+VY RE+I
Subjt:  PDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVNSQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCCACAGCTTCAATGCCCCTCTCTCCACCCACCCGCCGTCTACTCTATCTCCTTCAGACCTGCAAATCCATGGACCAGCTTCAGCAAATTCACTGCCAAGCAATTAAAAC
AGGTCTCCATGCCAACCCAGTTCTCCAAAACGGAGTTATGACCTTCTGTTGTACGCATGAACATGGTGACTTGAAATATGCACATCACCTGTTTGATGAAATTCCTGAAC
CGAATGTGTTTCTCTGGAACACCATGATCAGAGGCTACTCCCGGCTGGATTCTCCCGAGCTCGGAGTTTCTCTGTATTTGGAAATGTTGAGGAGGGATGTTAAGCCTGAT
GGTTACACCTTCCCGTTCCTGTTCAAGGGATTTACAAGAGACATTGCATTAGAATATGGAAAAGAGTTTCATGGCCACGTTCTGAAGCATGGGCTTCAGTCCAATGTGTT
TGTTCAGACTGCTTTAGTGCAAATGTACCTCTTGTGTGGCCTTGTTGATATGGCTCGTGGGGTTCTGGATTCTTGCTCTAAAGCTGATGTGATCACTTGGAATATGATGA
TCTCTGCCTACAACAAAGATGGGAAGTTTGAGGAATCAAGAAAACTTTTTCTTGGTATGCAGGAAAAGCAAGTGCTGCCCACCACGGTGACCCTTGTATTAATCCTGTCA
GCTTGCTCCAAATTGAAGGATTTAAAAACTGGGAAGCTGGTTCATAGATATGTCAACAACTGCCAGGTTGAGTGCAGTTTGATTCTTGAAAATGCTCTGATTGATATGTA
TGCTGCTTGTGGGGAAATGGATGCTGCCCTTGGGATTTTCAGGAATATGAGTAATAGAGATATCATATCTTGGACTAGCGTCGTAGCCGGGTTTACCAACTTGGGAGAAA
TTGATGTTGCTCGGAATTATTTCGACAAGATGCCTGAGAAAGATTACGTTTCATGGACTGCCATGATCAATGGATACCTCTGTGTGAACAGATTCAAAGAAGCATTAGAG
CTATTCCGCAATATGCAAGTAACAAATGTGCAGCCTGATGAGTTCACTATGGTTAGTATTCTGAATGCTTGTGCACATCTGGGAGCCCTTGAGTTAGGAGAATGGATAAA
GACTTATATCGACCGAAACAAGATCAACAACGATGCATTTGTTAGAAATGCTTTAATAGACATGTACTTCAAGTGTGGAAATGTTGACAAAGCACAAAGGGTATTCAAAG
AAATGAACCAGAGAGACAAGTTTACATGGACAGCCATGATAGTTGGCCTTGCAGTTAATGGGCATGGTGAGAAAGCTCTGGATATGTTTTCTAAAATGCTAAAAGCTTCA
ATTTGGCCAGATGAGGTTACTTACATTGGCGTTCTTTCTGCCTGTACTCACACCGGCATGGTAGATGAAGGACGAGAGTTTTTTCGTAGCATGACAACCCAACATAACAT
TGAACCCAATATAGCACACTATGGTTGTCTGGTTGATCTTCTTGCCCGAGCTGGTCGTCTAAAAGAAGCCCATCAAGTTGTAGAGAATATGCCAATGAAACCCAATTCAA
TAGTTTGGGGAGCTCTTCTAGCTGGTTGTAGAGTTCATAAGGAAGCTGATATGGCTGAAATGGCTGCAAACCAGATTCTCCAGTTGGAGCCTGAGAACGGTGCTGTCTAT
GTTCTCCTGTGTAATATATATGCAGCTTGCAAGAGATGGAATGACTTGCGAGAGTTAAGACAGACAATGATGGATAAAGGAATCAAGAAAACACCTGGTTGCAGTTTGAT
AGAGATGAATGGCACGGTTCACGAGTTCGTAGCTGGGGACAGATCACACCCTCAAACAAAGGAAATTTATGTTAAGCTAGAAAAGATGACCCAAGACCTGAAATTAGCAG
GGTATTCACCTGATATCTCAGAAGTGTTCCTTGACATTGCAGAAGAGGATAAAGAGAATGCAGTTTTTCGTCATAGTGAGAAGTTGGCCATTGCTTTTGGACTCGTTAAT
TCACAACCGGGGTTCACAATTAGAATTGTGAAAAACCTTAGAATGTGCATGGATTGTCACAGTATGGCAAAGTTGGTCTCAGAGGTGTATACTAGAGAAGTAATTGGATC
CTACAAAATAACAACATTATGCTTTTCTACTTTCTTTTTTTTTTCTTTTCAGCTTTACTTGAAGCATCAAGAAAAGCTAATTTCAACGATATTAAGTCGGATAAGTGCTG
GCTTTGCTTCTGGAAGATGGGACAAACCAAGTCTTGTTGTTTGGGGCATTTCAGATAAGTATCTTCCCCAGTCCATTGCAGAAGAGTTTCAAAAACAGAACTCAACAACA
ACCAAGCTCAAGTTAATTGAAGGGGCTGGCCATATGCCCCAAGAGGACTGG
mRNA sequenceShow/hide mRNA sequence
TCCACAGCTTCAATGCCCCTCTCTCCACCCACCCGCCGTCTACTCTATCTCCTTCAGACCTGCAAATCCATGGACCAGCTTCAGCAAATTCACTGCCAAGCAATTAAAAC
AGGTCTCCATGCCAACCCAGTTCTCCAAAACGGAGTTATGACCTTCTGTTGTACGCATGAACATGGTGACTTGAAATATGCACATCACCTGTTTGATGAAATTCCTGAAC
CGAATGTGTTTCTCTGGAACACCATGATCAGAGGCTACTCCCGGCTGGATTCTCCCGAGCTCGGAGTTTCTCTGTATTTGGAAATGTTGAGGAGGGATGTTAAGCCTGAT
GGTTACACCTTCCCGTTCCTGTTCAAGGGATTTACAAGAGACATTGCATTAGAATATGGAAAAGAGTTTCATGGCCACGTTCTGAAGCATGGGCTTCAGTCCAATGTGTT
TGTTCAGACTGCTTTAGTGCAAATGTACCTCTTGTGTGGCCTTGTTGATATGGCTCGTGGGGTTCTGGATTCTTGCTCTAAAGCTGATGTGATCACTTGGAATATGATGA
TCTCTGCCTACAACAAAGATGGGAAGTTTGAGGAATCAAGAAAACTTTTTCTTGGTATGCAGGAAAAGCAAGTGCTGCCCACCACGGTGACCCTTGTATTAATCCTGTCA
GCTTGCTCCAAATTGAAGGATTTAAAAACTGGGAAGCTGGTTCATAGATATGTCAACAACTGCCAGGTTGAGTGCAGTTTGATTCTTGAAAATGCTCTGATTGATATGTA
TGCTGCTTGTGGGGAAATGGATGCTGCCCTTGGGATTTTCAGGAATATGAGTAATAGAGATATCATATCTTGGACTAGCGTCGTAGCCGGGTTTACCAACTTGGGAGAAA
TTGATGTTGCTCGGAATTATTTCGACAAGATGCCTGAGAAAGATTACGTTTCATGGACTGCCATGATCAATGGATACCTCTGTGTGAACAGATTCAAAGAAGCATTAGAG
CTATTCCGCAATATGCAAGTAACAAATGTGCAGCCTGATGAGTTCACTATGGTTAGTATTCTGAATGCTTGTGCACATCTGGGAGCCCTTGAGTTAGGAGAATGGATAAA
GACTTATATCGACCGAAACAAGATCAACAACGATGCATTTGTTAGAAATGCTTTAATAGACATGTACTTCAAGTGTGGAAATGTTGACAAAGCACAAAGGGTATTCAAAG
AAATGAACCAGAGAGACAAGTTTACATGGACAGCCATGATAGTTGGCCTTGCAGTTAATGGGCATGGTGAGAAAGCTCTGGATATGTTTTCTAAAATGCTAAAAGCTTCA
ATTTGGCCAGATGAGGTTACTTACATTGGCGTTCTTTCTGCCTGTACTCACACCGGCATGGTAGATGAAGGACGAGAGTTTTTTCGTAGCATGACAACCCAACATAACAT
TGAACCCAATATAGCACACTATGGTTGTCTGGTTGATCTTCTTGCCCGAGCTGGTCGTCTAAAAGAAGCCCATCAAGTTGTAGAGAATATGCCAATGAAACCCAATTCAA
TAGTTTGGGGAGCTCTTCTAGCTGGTTGTAGAGTTCATAAGGAAGCTGATATGGCTGAAATGGCTGCAAACCAGATTCTCCAGTTGGAGCCTGAGAACGGTGCTGTCTAT
GTTCTCCTGTGTAATATATATGCAGCTTGCAAGAGATGGAATGACTTGCGAGAGTTAAGACAGACAATGATGGATAAAGGAATCAAGAAAACACCTGGTTGCAGTTTGAT
AGAGATGAATGGCACGGTTCACGAGTTCGTAGCTGGGGACAGATCACACCCTCAAACAAAGGAAATTTATGTTAAGCTAGAAAAGATGACCCAAGACCTGAAATTAGCAG
GGTATTCACCTGATATCTCAGAAGTGTTCCTTGACATTGCAGAAGAGGATAAAGAGAATGCAGTTTTTCGTCATAGTGAGAAGTTGGCCATTGCTTTTGGACTCGTTAAT
TCACAACCGGGGTTCACAATTAGAATTGTGAAAAACCTTAGAATGTGCATGGATTGTCACAGTATGGCAAAGTTGGTCTCAGAGGTGTATACTAGAGAAGTAATTGGATC
CTACAAAATAACAACATTATGCTTTTCTACTTTCTTTTTTTTTTCTTTTCAGCTTTACTTGAAGCATCAAGAAAAGCTAATTTCAACGATATTAAGTCGGATAAGTGCTG
GCTTTGCTTCTGGAAGATGGGACAAACCAAGTCTTGTTGTTTGGGGCATTTCAGATAAGTATCTTCCCCAGTCCATTGCAGAAGAGTTTCAAAAACAGAACTCAACAACA
ACCAAGCTCAAGTTAATTGAAGGGGCTGGCCATATGCCCCAAGAGGACTGG
Protein sequenceShow/hide protein sequence
STASMPLSPPTRRLLYLLQTCKSMDQLQQIHCQAIKTGLHANPVLQNGVMTFCCTHEHGDLKYAHHLFDEIPEPNVFLWNTMIRGYSRLDSPELGVSLYLEMLRRDVKPD
GYTFPFLFKGFTRDIALEYGKEFHGHVLKHGLQSNVFVQTALVQMYLLCGLVDMARGVLDSCSKADVITWNMMISAYNKDGKFEESRKLFLGMQEKQVLPTTVTLVLILS
ACSKLKDLKTGKLVHRYVNNCQVECSLILENALIDMYAACGEMDAALGIFRNMSNRDIISWTSVVAGFTNLGEIDVARNYFDKMPEKDYVSWTAMINGYLCVNRFKEALE
LFRNMQVTNVQPDEFTMVSILNACAHLGALELGEWIKTYIDRNKINNDAFVRNALIDMYFKCGNVDKAQRVFKEMNQRDKFTWTAMIVGLAVNGHGEKALDMFSKMLKAS
IWPDEVTYIGVLSACTHTGMVDEGREFFRSMTTQHNIEPNIAHYGCLVDLLARAGRLKEAHQVVENMPMKPNSIVWGALLAGCRVHKEADMAEMAANQILQLEPENGAVY
VLLCNIYAACKRWNDLRELRQTMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTKEIYVKLEKMTQDLKLAGYSPDISEVFLDIAEEDKENAVFRHSEKLAIAFGLVN
SQPGFTIRIVKNLRMCMDCHSMAKLVSEVYTREVIGSYKITTLCFSTFFFFSFQLYLKHQEKLISTILSRISAGFASGRWDKPSLVVWGISDKYLPQSIAEEFQKQNSTT
TKLKLIEGAGHMPQEDW