; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014748 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014748
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr12:4346487..4348532
RNA-Seq ExpressionLag0014748
SyntenyLag0014748
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572234.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.72Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI
        M SLTLS SL PF P +LHFPLQ+CETEREAKQ HALSLKTGSLN+PSIS RLLALYA+PRINNLEY +SLFDWI KPTLVSWNM+IKCYIENQRSNDAI
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI

Query:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
         LFC LLCE MPDSFTLPCVLKGCARLSAL+EGKQIHGLILKIG GVDKFVLSSLV++YSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIELAL
Subjt:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
        ELF+EMPEKD+FSWTILVDGLSKSGKL+ ARDVFDRMPTRN VSWNAMINGYMKAG FN ARELFD+MPERN V+WNSMI GYELN+QF QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
         EDISPNHAT+ GA SAA+GL SLG GRWVHSYIVKN F+T+GVLGTSLIEMYSKCGSIK ALRVF SIP+KKLGHWTAIIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
        CRTGLKPHAITFIGVLNACSHAGFA++A RYFK MT+D+GIEPSIEHYGCLID LCRAGYLEEA++TIERMPIKAN VIWMSLLSGSRKHG+ RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
        HHL+DLAPDTTGCYVILSNMYAA GLWEKVRQVRE+MKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKL EMK+KLNVAGH+PDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EE NEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTK IS IYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

XP_004144616.1 pentatricopeptide repeat-containing protein At5g48910 [Cucumis sativus]0.0e+0088.99Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI
        M+S TLS SL PFLP +LHFPLQ+C TEREA QLHALS+KT SLN+PS+SSRLLALYADPRINNL+YA SLFDWI +PTLVSWN++IKCYIENQRSNDAI
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI

Query:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
         LFC LLC+ +PDSFTLPCVLKGCARL AL+EGKQIHGL+LKIGFGVDKFVLSSLVS+YSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIELAL
Subjt:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
        E+FEEMPEKDSFSWTIL+DGLSKSGKLE ARDVFDRMP RN VSWNAMINGYMKAGD N A+ELFDQMPER+LVTWNSMI GYE N+QF +ALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        REDISPN+ TI GA+SAA+G+VSLG GRWVHSYIVK+GF+T+GVLGT LIEMYSKCGS+KSALRVF SIP+KKLGHWT++IVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
        CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMT DYGI+PSIEHYGCLIDVLCRAG+LEEAK+TIERMPIKANKVIW SLLSGSRKHG+IRMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
         HLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVRE+MKKKG++KDPGCSSIEHQGSIHEFIVGD+SHPQTEEIY+KLCEMK KLNVAGH+PDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EE NEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKL+SHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

XP_022135993.1 pentatricopeptide repeat-containing protein At5g48910-like [Momordica charantia]0.0e+0090.01Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI
        M+SL LS SL PFLPR+LHFPLQ+CETERE KQLHALSLKTGS N+PSISSRLLALY DPRINNLEYARSLFDWI +PTLVSWN+++KCY+ENQRSNDAI
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI

Query:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
         LFC LL E +PDSFTLPCVLKGCARLSAL+EGKQIHGLILKIGFGVDKFVLSSLVS+YSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCG+IELAL
Subjt:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
        E+F+EMPE+DSFSWTILVDGLSKSGKLETARDVFDRMPTRN VSWNAMINGYMKAGDFN ARELFDQMPERNLVTWNSMI GYELNRQF QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        RE+ISPNHATI GALSAA+GLVS GKGRWVHS+IVKNGFET+GVLGTSLIEMYSKCGSI SALRVF SIP+KKLGHWTAIIVGLGMHGLV QTLELFDEM
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
        CR GLKPHAITFIG+LNACSHAGFA+DA+ YFKMM +DYGIEPSIEHYGCLIDVLCRAG LEEAKNTIERMPIK NKVIWMSLLSGSRKHG+IRMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
        HHLIDLAPDTTGCY+ILSNMYA AGLWEKVRQVRE+MKKKGIRKDPGCSSIEHQGS+HEFIVGDRSHPQTEEIY+KL EMK+KLNVAGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        E+ NEKE+ELETHSERLAIAFGL+NIKHG+P+RIIKNLRICNDCH V+KL+SHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

XP_022953072.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita moschata]0.0e+0090.16Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI
        M SLTLS SL PF P +LHFPLQ+CETEREAKQ HALSLKTGSLN+PSIS RLLALYA+PRINNLEYA+SLFDWI KPTLVSWNM+IKCYIENQRSNDAI
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI

Query:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
         LFC LLCE MPDSFTLPCVLKGCARLSAL+EGKQIHGLILKIG GVDKFVLSSLV++YSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIELAL
Subjt:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
        ELF+EMPEKD+FSWTILVDGLSKSGKL+ ARDVFDRMPTRN VSWNAMINGYMKAG FN ARELFD+MPERN V+WNSMI GYELN+QF QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
         EDISPNHAT+ GA SAA+GL SLG GRWVHSYIVKN F+T+GVLGTSLIEMYSKCGSIK ALRVF SIP+KKLGHWTAIIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
        CRTGLKPHAITFIGVLNACSHAGFA++A RYFK MT+D+GIEPSIEHYGCLID LCRAGYLEEAK+TIERMPIKAN VIWMSLLSGSRKHG  RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
        HHL+DLAPDTTGCYVILSNMYAA GLWEKVRQVRE+MKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKL EMK+KLNVAGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EE NEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTK IS IYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

XP_038887831.1 pentatricopeptide repeat-containing protein At5g48910-like [Benincasa hispida]0.0e+0089.72Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI
        M++LTLS SL PF+PR+LHFPLQ+CETEREAKQLHALSLK GSLN+PS+SSRLLALYADPRINNLEYA+SLFDWI KPTLVSWN++IKCYIE+QRSNDAI
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI

Query:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
         LFC  LCE +PDSFTLPCVLKGC+RL AL+EGKQIHGL+LKIGFGVDKFVLSSLVS+Y+KCGEIELCRKVFDRMED+DIVSWNSLIDGYARCGEIELAL
Subjt:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
        +L EEMPEKDS SWTILVDGLSKSGKLE ARDVFD+MPTRN VSWNAMINGYMKAG+FN ARELFDQMPERNLVTWNSMI+GYELN+QF QALKL E ML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        REDISPN+ TI GALSAA+GLVSLGKGRWVHSYIVKNGF T GVLGTSLIEMYSKCGS++SAL VF SIP KKLGHWTAIIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
        CRTGL+PHAITFIGVLNACSHAGFAEDAHRYFKMMT+DYGI+PSIEHYGCLIDVLCRAGYLEEAK+TIERMP+KANKVIWMSLLSGSRKHG+IRMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
        HHLIDLAPDTTGCYVILSNMYAAAGLWEKV QVRE+MKKKGIRKDPGCSSIEHQGS+HEFIVGD+SHPQT+EIY+KLCEMK+KL+ AGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EE N+KEAELETHSERLAIAFGLLNI HGSPIRIIKNLRICNDCHAVTKL+SHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

TrEMBL top hitse value%identityAlignment
A0A5A7V0C2 Pentatricopeptide repeat-containing protein0.0e+0088.25Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI
        M+SLTLS SL PFLP +LHFPLQ+C TEREA QLHALS+KT SLN+PS+SS LLALYA P INNL+YA+SLFDWI KPTLVSWN++IKCYIENQRSNDAI
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI

Query:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
         LFC LLC+ MPDSFTLPCVLKGCARL AL+EGKQIHGL+LKIGFGVDKFVLSSLVS+YSKCGEIE+CRKVFDRMEDKD+VSWNSLIDGYARCGEIELAL
Subjt:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
        E+FEEMPEKDSFSWTIL+DGLSKSGKLE ARDVFDRMP RN VSWNAMINGYMKAGD N A+ELFDQMPER+LVTWNSMI GYE N+QF +ALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        REDISPN+ TI GA+SAA+GLVSLG GRWVHSYIVKNGF+T+GVLGT LIEMYSKCGS+KSALRVF  I +KKLGHWT+IIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
        CRTGL+PHAITFIGVLNACSHAGFAEDAHRYFKMMT DYGI+P+IEHYGCLIDVLCRAGYLEEAK+TI+RMPIKANKVIW SLLSGSRKHG+IRMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
         HLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVRE+MK+K IRKDPGCSSIEHQGSIHEFIVGD+SHPQTEEIY+KL EMK KLNVAGH+PDT+QVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EE NEKEAELETHSERLAIAFGLL+IKHGSPIRIIKNLRICNDCHAVTKL+SHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

A0A5D3D6Y7 Pentatricopeptide repeat-containing protein0.0e+0088.55Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI
        M+SLTLS SL PFLP +LHFPLQ+C TEREA QLHALS+KT SLN+PS+SS LLALYA P INNL+YA+SLFDWI KPTLVSWN++IKCYIENQRSNDAI
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI

Query:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
         LFC LLC+ MPDSFTLPCVLKGCARL AL+EGKQIHGL+LKIGFGVDKFVLSSLVS+YSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIELAL
Subjt:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
        E+FEEMPEKDSFSWTIL+DGLSKSGKLE AR VFDRMP RN VSWNAMINGYMKAGD N A+ELFDQMPER+LVTWNSMI GYE N+QF +ALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        REDISPN+ TI GA+SAA+GLVSLG GRWVHSYIVKNGF+T+GVLGT LIEMYSKCGS+KSALRVF  I +KKLGHWT+IIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
        CRTGL+PHAITFIGVLNACSHAGFAEDAHRYFKMMT DYGI+P+IEHYGCLIDVLCRAGYLEEAK+TIERMPIKANKVIW SLLSGSRKHG+IRMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
         HLIDLAPDTTGCYVILSNMYAA GLWEKVRQVRE+MK+KGIRKDPGCSSIEHQGSIHEFIVGD+SHPQTEEIY+KL EMK KLNVAGH+PDT+QVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EE NEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKL+SHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

A0A6J1C6D9 pentatricopeptide repeat-containing protein At5g48910-like0.0e+0090.01Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI
        M+SL LS SL PFLPR+LHFPLQ+CETERE KQLHALSLKTGS N+PSISSRLLALY DPRINNLEYARSLFDWI +PTLVSWN+++KCY+ENQRSNDAI
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI

Query:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
         LFC LL E +PDSFTLPCVLKGCARLSAL+EGKQIHGLILKIGFGVDKFVLSSLVS+YSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCG+IELAL
Subjt:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
        E+F+EMPE+DSFSWTILVDGLSKSGKLETARDVFDRMPTRN VSWNAMINGYMKAGDFN ARELFDQMPERNLVTWNSMI GYELNRQF QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        RE+ISPNHATI GALSAA+GLVS GKGRWVHS+IVKNGFET+GVLGTSLIEMYSKCGSI SALRVF SIP+KKLGHWTAIIVGLGMHGLV QTLELFDEM
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
        CR GLKPHAITFIG+LNACSHAGFA+DA+ YFKMM +DYGIEPSIEHYGCLIDVLCRAG LEEAKNTIERMPIK NKVIWMSLLSGSRKHG+IRMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
        HHLIDLAPDTTGCY+ILSNMYA AGLWEKVRQVRE+MKKKGIRKDPGCSSIEHQGS+HEFIVGDRSHPQTEEIY+KL EMK+KLNVAGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        E+ NEKE+ELETHSERLAIAFGL+NIKHG+P+RIIKNLRICNDCH V+KL+SHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

A0A6J1GM70 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0090.16Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI
        M SLTLS SL PF P +LHFPLQ+CETEREAKQ HALSLKTGSLN+PSIS RLLALYA+PRINNLEYA+SLFDWI KPTLVSWNM+IKCYIENQRSNDAI
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI

Query:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
         LFC LLCE MPDSFTLPCVLKGCARLSAL+EGKQIHGLILKIG GVDKFVLSSLV++YSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIELAL
Subjt:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
        ELF+EMPEKD+FSWTILVDGLSKSGKL+ ARDVFDRMPTRN VSWNAMINGYMKAG FN ARELFD+MPERN V+WNSMI GYELN+QF QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
         EDISPNHAT+ GA SAA+GL SLG GRWVHSYIVKN F+T+GVLGTSLIEMYSKCGSIK ALRVF SIP+KKLGHWTAIIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
        CRTGLKPHAITFIGVLNACSHAGFA++A RYFK MT+D+GIEPSIEHYGCLID LCRAGYLEEAK+TIERMPIKAN VIWMSLLSGSRKHG  RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
        HHL+DLAPDTTGCYVILSNMYAA GLWEKVRQVRE+MKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKL EMK+KLNVAGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EE NEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTK IS IYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

A0A6J1HYS6 pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like0.0e+0089.87Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI
        M SLTLS SL PFLP +LHFPLQ+CETEREAKQ HALS+KTGSLN PSIS RLLALYA+PRINNLEYA+SLFDWI KPTLVSWNM+IKCYIENQRSNDAI
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAI

Query:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
         LFC LLCE MPDSFTLPCVLKGCARLSAL+EGKQIHGLILKIG GVDKFVLSSLV++YSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIELAL
Subjt:  VLFCNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
        ELF+EMPEKD+FSWTILVDGLSKSGKL+ ARDVFDRMPTRN +SWNAMINGYMKAG FN ARELFD+MPERN V+WNSMI GYELN+QF QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
         EDISPNHAT+ GALSAA+GL SLG GRWVHSYIVKN F+T+GVLGTSLIEMYSKCGSIK ALRVF SIP++KLGHWTAIIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
        CRTGLKPHAITFIGVLNACSHAGFAE+A RYFK MT+D+GIEPSIEHYGCLID LCRAGYLEEAK+TIERMPIKAN VIWMSLLSGSRKHG+ RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
        HHL+DLAPDTTGCYVILSNMYAA GLWE  RQVRE+MKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKL EMK+KLNVAGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EE NEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTK IS IYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic2.0e-16139.21Show/hide
Query:  RSLHFPL-QSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAIVLFCNLLCE--LMP
        RS H  L + C + R+ KQ H   ++TG+ + P  +S+L A+ A     +LEYAR +FD IPKP   +WN +I+ Y        +I  F +++ E    P
Subjt:  RSLHFPL-QSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAIVLFCNLLCE--LMP

Query:  DSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELALELFEEMPEKD--
        + +T P ++K  A +S+L  G+ +HG+ +K   G D FV +SL+  Y  CG+++   KVF  +++KD+VSWNS+I+G+ + G  + ALELF++M  +D  
Subjt:  DSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELALELFEEMPEKD--

Query:  ----------------------------------SFSWTI---LVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNL
                                          + + T+   ++D  +K G +E A+ +FD M  ++ V+W  M++GY  + D+  ARE+ + MP++++
Subjt:  ----------------------------------SFSWTI---LVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNL

Query:  VTWNSMINGYELNRQFPQALKLF-EVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRK
        V WN++I+ YE N +  +AL +F E+ L++++  N  T+   LSA A + +L  GRW+HSYI K+G   N  + ++LI MYSKCG ++ +  VF+S+ ++
Subjt:  VTWNSMINGYELNRQFPQALKLF-EVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRK

Query:  KLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMP
         +  W+A+I GL MHG   + +++F +M    +KP+ +TF  V  ACSH G  ++A   F  M  +YGI P  +HY C++DVL R+GYLE+A   IE MP
Subjt:  KLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMP

Query:  IKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEE
        I  +  +W +LL   + H ++ + E A   L++L P   G +V+LSN+YA  G WE V ++R+ M+  G++K+PGCSSIE  G IHEF+ GD +HP +E+
Subjt:  IKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEE

Query:  IYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKS
        +Y KL E+ +KL   G+ P+ +QVL  +EE   KE  L  HSE+LAI +GL++ +    IR+IKNLR+C DCH+V KLIS +Y+REII+RD  RFHHF++
Subjt:  IYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKS

Query:  GSCSCKDFW
        G CSC DFW
Subjt:  GSCSCKDFW

Q9FI80 Pentatricopeptide repeat-containing protein At5g489107.8e-15842.14Show/hide
Query:  SPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYA--DPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSND--AIVL
        SP+ H   P SL   + +C T R+  Q+HA+ +K+G +     ++ +L   A  D    +L+YA  +F+ +P+    SWN II+ + E+       AI L
Subjt:  SPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYA--DPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSND--AIVL

Query:  FCNLLCE--LMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
        F  ++ +  + P+ FT P VLK CA+   ++EGKQIHGL LK GFG D+FV+S+LV +Y  CG ++  R +F                            
Subjt:  FCNLLCE--LMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
          ++ + EKD     ++ D   + G++               V WN MI+GYM+ GD   AR LFD+M +R++V+WN+MI+GY LN  F  A+++F  M 
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        + DI PN+ T+   L A + L SL  G W+H Y   +G   + VLG++LI+MYSKCG I+ A+ VF  +PR+ +  W+A+I G  +HG     ++ F +M
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
         + G++P  + +I +L ACSH G  E+  RYF  M    G+EP IEHYGC++D+L R+G L+EA+  I  MPIK + VIW +LL   R  G++ MG+  A
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
        + L+D+ P  +G YV LSNMYA+ G W +V ++R  MK+K IRKDPGCS I+  G +HEF+V D SHP+ +EI   L E+ DKL +AG+ P TTQVLL L
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EE  +KE  L  HSE++A AFGL++   G PIRI+KNLRIC DCH+  KLIS +Y R+I +RD  RFHHF+ GSCSC D+W
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665209.9e-14539.43Show/hide
Query:  LQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINN-LEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAIVLFCNLLCELMP-DSFTLPC
        LQ C  + E KQ+HA  LKTG +      ++ L+       ++ L YA+ +FD   +P    WN++I+ +  +     +++L+  +LC   P +++T P 
Subjt:  LQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINN-LEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAIVLFCNLLCELMP-DSFTLPC

Query:  VLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELALELFEEMPEKDSFSWTILVD
        +LK C+ LSA EE  QIH  I K+G+  D + ++SL++ Y+  G  +L   +FDR+ + D VSWNS+I GY + G++++AL LF +M EK          
Subjt:  VLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELALELFEEMPEKDSFSWTILVD

Query:  GLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVMLREDISPNHATIHGALSAAA
                             N +SW  MI+GY++A D N+                              +AL+LF  M   D+ P++ ++  ALSA A
Subjt:  GLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVMLREDISPNHATIHGALSAAA

Query:  GLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNAC
         L +L +G+W+HSY+ K     + VLG  LI+MY+KCG ++ AL VF +I +K +  WTA+I G   HG   + +  F EM + G+KP+ ITF  VL AC
Subjt:  GLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNAC

Query:  SHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSN
        S+ G  E+    F  M  DY ++P+IEHYGC++D+L RAG L+EAK  I+ MP+K N VIW +LL   R H +I +GE     LI + P   G YV  +N
Subjt:  SHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSN

Query:  MYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAI
        ++A    W+K  + R +MK++G+ K PGCS+I  +G+ HEF+ GDRSHP+ E+I  K   M+ KL   G+VP+  ++LL L + +E+EA +  HSE+LAI
Subjt:  MYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAI

Query:  AFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
         +GL+  K G+ IRI+KNLR+C DCH VTKLIS IY R+I++RD +RFHHF+ G CSC D+W
Subjt:  AFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic3.8e-16040.68Show/hide
Query:  LTLSPSLHP--FLPRSLHFP------------LQSCETEREAKQLHALSLKTGSLNYPSISSRLLAL-YADPRINNLEYARSLFDWIPKPTLVSWNMIIK
        LT+  S +P  FLP S   P            L +C+T +  + +HA  +K G  N     S+L+      P    L YA S+F  I +P L+ WN + +
Subjt:  LTLSPSLHP--FLPRSLHFP------------LQSCETEREAKQLHALSLKTGSLNYPSISSRLLAL-YADPRINNLEYARSLFDWIPKPTLVSWNMIIK

Query:  CYIENQRSNDAIVLF-CNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLI
         +  +     A+ L+ C +   L+P+S+T P VLK CA+  A +EG+QIHG +LK+G  +D +V +SL+S+Y + G +E   KVFD+   +D+VS+ +LI
Subjt:  CYIENQRSNDAIVLF-CNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLI

Query:  DGYARCGEIELALELFEEMPEKDSFSWTILVDGLSKSGKLETARDVF-DRMPT-------------------------RNPVSW-------------NAM
         GYA  G IE A +LF+E+P KD  SW  ++ G +++G  + A ++F D M T                         R    W             NA+
Subjt:  DGYARCGEIELALELFEEMPEKDSFSWTILVDGLSKSGKLETARDVF-DRMPT-------------------------RNPVSW-------------NAM

Query:  INGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVK--NGFETNGVLG
        I+ Y K G+   A  LF+++P +++++WN++I GY     + +AL LF+ MLR   +PN  T+   L A A L ++  GRW+H YI K   G      L 
Subjt:  INGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVK--NGFETNGVLG

Query:  TSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIE
        TSLI+MY+KCG I++A +VF+SI  K L  W A+I G  MHG  + + +LF  M + G++P  ITF+G+L+ACSH+G  +     F+ MT+DY + P +E
Subjt:  TSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIE

Query:  HYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDP
        HYGC+ID+L  +G  +EA+  I  M ++ + VIW SLL   + HG++ +GE  A +LI + P+  G YV+LSN+YA+AG W +V + R ++  KG++K P
Subjt:  HYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDP

Query:  GCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHA
        GCSSIE    +HEFI+GD+ HP+  EIY  L EM+  L  AG VPDT++VL  +EE   KE  L  HSE+LAIAFGL++ K G+ + I+KNLR+C +CH 
Subjt:  GCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHA

Query:  VTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
         TKLIS IY REII RD +RFHHF+ G CSC D+W
Subjt:  VTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226902.6e-14537.05Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETER---EAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSN
        M++  +SP  + F      F L +C   R      Q+H L +K G      + + L+  YA+     L+ AR +FD + +  +VSW  +I  Y     + 
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETER---EAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSN

Query:  DAIVLFCNLLC--ELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGE
        DA+ LF  ++   E+ P+S T+ CV+  CA+L  LE G++++  I   G  V+  ++S+LV +Y KC  I++ +++FD     ++   N++   Y R G 
Subjt:  DAIVLFCNLLC--ELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGE

Query:  IELALELFEEM------PEKDSF-----------------------------SW----TILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAG
           AL +F  M      P++ S                              SW      L+D   K  + +TA  +FDRM  +  V+WN+++ GY++ G
Subjt:  IELALELFEEM------PEKDSF-----------------------------SW----TILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAG

Query:  DFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLF-EVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSK
        + + A E F+ MPE+N+V+WN++I+G      F +A+++F  +  +E ++ +  T+    SA   L +L   +W++ YI KNG + +  LGT+L++M+S+
Subjt:  DFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLF-EVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSK

Query:  CGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVL
        CG  +SA+ +F+S+  + +  WTA I  + M G  E+ +ELFD+M   GLKP  + F+G L ACSH G  +     F  M + +G+ P   HYGC++D+L
Subjt:  CGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVL

Query:  CRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQG
         RAG LEEA   IE MP++ N VIW SLL+  R  G++ M  YAA  +  LAP+ TG YV+LSN+YA+AG W  + +VR  MK+KG+RK PG SSI+ +G
Subjt:  CRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQG

Query:  SIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIY
          HEF  GD SHP+   I   L E+  + +  GHVPD + VL+ ++E  EK   L  HSE+LA+A+GL++   G+ IRI+KNLR+C+DCH+  K  S +Y
Subjt:  SIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIY

Query:  NREIIIRDGSRFHHFKSGSCSCKDFW
        NREII+RD +RFH+ + G CSC DFW
Subjt:  NREIIIRDGSRFHHFKSGSCSCKDFW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-16140.68Show/hide
Query:  LTLSPSLHP--FLPRSLHFP------------LQSCETEREAKQLHALSLKTGSLNYPSISSRLLAL-YADPRINNLEYARSLFDWIPKPTLVSWNMIIK
        LT+  S +P  FLP S   P            L +C+T +  + +HA  +K G  N     S+L+      P    L YA S+F  I +P L+ WN + +
Subjt:  LTLSPSLHP--FLPRSLHFP------------LQSCETEREAKQLHALSLKTGSLNYPSISSRLLAL-YADPRINNLEYARSLFDWIPKPTLVSWNMIIK

Query:  CYIENQRSNDAIVLF-CNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLI
         +  +     A+ L+ C +   L+P+S+T P VLK CA+  A +EG+QIHG +LK+G  +D +V +SL+S+Y + G +E   KVFD+   +D+VS+ +LI
Subjt:  CYIENQRSNDAIVLF-CNLLCELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLI

Query:  DGYARCGEIELALELFEEMPEKDSFSWTILVDGLSKSGKLETARDVF-DRMPT-------------------------RNPVSW-------------NAM
         GYA  G IE A +LF+E+P KD  SW  ++ G +++G  + A ++F D M T                         R    W             NA+
Subjt:  DGYARCGEIELALELFEEMPEKDSFSWTILVDGLSKSGKLETARDVF-DRMPT-------------------------RNPVSW-------------NAM

Query:  INGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVK--NGFETNGVLG
        I+ Y K G+   A  LF+++P +++++WN++I GY     + +AL LF+ MLR   +PN  T+   L A A L ++  GRW+H YI K   G      L 
Subjt:  INGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVK--NGFETNGVLG

Query:  TSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIE
        TSLI+MY+KCG I++A +VF+SI  K L  W A+I G  MHG  + + +LF  M + G++P  ITF+G+L+ACSH+G  +     F+ MT+DY + P +E
Subjt:  TSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIE

Query:  HYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDP
        HYGC+ID+L  +G  +EA+  I  M ++ + VIW SLL   + HG++ +GE  A +LI + P+  G YV+LSN+YA+AG W +V + R ++  KG++K P
Subjt:  HYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDP

Query:  GCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHA
        GCSSIE    +HEFI+GD+ HP+  EIY  L EM+  L  AG VPDT++VL  +EE   KE  L  HSE+LAIAFGL++ K G+ + I+KNLR+C +CH 
Subjt:  GCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHA

Query:  VTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
         TKLIS IY REII RD +RFHHF+ G CSC D+W
Subjt:  VTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-16239.21Show/hide
Query:  RSLHFPL-QSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAIVLFCNLLCE--LMP
        RS H  L + C + R+ KQ H   ++TG+ + P  +S+L A+ A     +LEYAR +FD IPKP   +WN +I+ Y        +I  F +++ E    P
Subjt:  RSLHFPL-QSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAIVLFCNLLCE--LMP

Query:  DSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELALELFEEMPEKD--
        + +T P ++K  A +S+L  G+ +HG+ +K   G D FV +SL+  Y  CG+++   KVF  +++KD+VSWNS+I+G+ + G  + ALELF++M  +D  
Subjt:  DSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELALELFEEMPEKD--

Query:  ----------------------------------SFSWTI---LVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNL
                                          + + T+   ++D  +K G +E A+ +FD M  ++ V+W  M++GY  + D+  ARE+ + MP++++
Subjt:  ----------------------------------SFSWTI---LVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNL

Query:  VTWNSMINGYELNRQFPQALKLF-EVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRK
        V WN++I+ YE N +  +AL +F E+ L++++  N  T+   LSA A + +L  GRW+HSYI K+G   N  + ++LI MYSKCG ++ +  VF+S+ ++
Subjt:  VTWNSMINGYELNRQFPQALKLF-EVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRK

Query:  KLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMP
         +  W+A+I GL MHG   + +++F +M    +KP+ +TF  V  ACSH G  ++A   F  M  +YGI P  +HY C++DVL R+GYLE+A   IE MP
Subjt:  KLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMP

Query:  IKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEE
        I  +  +W +LL   + H ++ + E A   L++L P   G +V+LSN+YA  G WE V ++R+ M+  G++K+PGCSSIE  G IHEF+ GD +HP +E+
Subjt:  IKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEE

Query:  IYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKS
        +Y KL E+ +KL   G+ P+ +QVL  +EE   KE  L  HSE+LAI +GL++ +    IR+IKNLR+C DCH+V KLIS +Y+REII+RD  RFHHF++
Subjt:  IYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKS

Query:  GSCSCKDFW
        G CSC DFW
Subjt:  GSCSCKDFW

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.9e-14637.05Show/hide
Query:  MVSLTLSPSLHPFLPRSLHFPLQSCETER---EAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSN
        M++  +SP  + F      F L +C   R      Q+H L +K G      + + L+  YA+     L+ AR +FD + +  +VSW  +I  Y     + 
Subjt:  MVSLTLSPSLHPFLPRSLHFPLQSCETER---EAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSN

Query:  DAIVLFCNLLC--ELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGE
        DA+ LF  ++   E+ P+S T+ CV+  CA+L  LE G++++  I   G  V+  ++S+LV +Y KC  I++ +++FD     ++   N++   Y R G 
Subjt:  DAIVLFCNLLC--ELMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGE

Query:  IELALELFEEM------PEKDSF-----------------------------SW----TILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAG
           AL +F  M      P++ S                              SW      L+D   K  + +TA  +FDRM  +  V+WN+++ GY++ G
Subjt:  IELALELFEEM------PEKDSF-----------------------------SW----TILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAG

Query:  DFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLF-EVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSK
        + + A E F+ MPE+N+V+WN++I+G      F +A+++F  +  +E ++ +  T+    SA   L +L   +W++ YI KNG + +  LGT+L++M+S+
Subjt:  DFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLF-EVMLREDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSK

Query:  CGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVL
        CG  +SA+ +F+S+  + +  WTA I  + M G  E+ +ELFD+M   GLKP  + F+G L ACSH G  +     F  M + +G+ P   HYGC++D+L
Subjt:  CGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVL

Query:  CRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQG
         RAG LEEA   IE MP++ N VIW SLL+  R  G++ M  YAA  +  LAP+ TG YV+LSN+YA+AG W  + +VR  MK+KG+RK PG SSI+ +G
Subjt:  CRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQG

Query:  SIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIY
          HEF  GD SHP+   I   L E+  + +  GHVPD + VL+ ++E  EK   L  HSE+LA+A+GL++   G+ IRI+KNLR+C+DCH+  K  S +Y
Subjt:  SIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIY

Query:  NREIIIRDGSRFHHFKSGSCSCKDFW
        NREII+RD +RFH+ + G CSC DFW
Subjt:  NREIIIRDGSRFHHFKSGSCSCKDFW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein5.6e-15942.14Show/hide
Query:  SPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYA--DPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSND--AIVL
        SP+ H   P SL   + +C T R+  Q+HA+ +K+G +     ++ +L   A  D    +L+YA  +F+ +P+    SWN II+ + E+       AI L
Subjt:  SPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYA--DPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSND--AIVL

Query:  FCNLLCE--LMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL
        F  ++ +  + P+ FT P VLK CA+   ++EGKQIHGL LK GFG D+FV+S+LV +Y  CG ++  R +F                            
Subjt:  FCNLLCE--LMPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML
          ++ + EKD     ++ D   + G++               V WN MI+GYM+ GD   AR LFD+M +R++V+WN+MI+GY LN  F  A+++F  M 
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVML

Query:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        + DI PN+ T+   L A + L SL  G W+H Y   +G   + VLG++LI+MYSKCG I+ A+ VF  +PR+ +  W+A+I G  +HG     ++ F +M
Subjt:  REDISPNHATIHGALSAAAGLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA
         + G++P  + +I +L ACSH G  E+  RYF  M    G+EP IEHYGC++D+L R+G L+EA+  I  MPIK + VIW +LL   R  G++ MG+  A
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL
        + L+D+ P  +G YV LSNMYA+ G W +V ++R  MK+K IRKDPGCS I+  G +HEF+V D SHP+ +EI   L E+ DKL +AG+ P TTQVLL L
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCL

Query:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EE  +KE  L  HSE++A AFGL++   G PIRI+KNLRIC DCH+  KLIS +Y R+I +RD  RFHHF+ GSCSC D+W
Subjt:  EEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.1e-14639.43Show/hide
Query:  LQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINN-LEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAIVLFCNLLCELMP-DSFTLPC
        LQ C  + E KQ+HA  LKTG +      ++ L+       ++ L YA+ +FD   +P    WN++I+ +  +     +++L+  +LC   P +++T P 
Subjt:  LQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINN-LEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAIVLFCNLLCELMP-DSFTLPC

Query:  VLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELALELFEEMPEKDSFSWTILVD
        +LK C+ LSA EE  QIH  I K+G+  D + ++SL++ Y+  G  +L   +FDR+ + D VSWNS+I GY + G++++AL LF +M EK          
Subjt:  VLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELALELFEEMPEKDSFSWTILVD

Query:  GLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVMLREDISPNHATIHGALSAAA
                             N +SW  MI+GY++A D N+                              +AL+LF  M   D+ P++ ++  ALSA A
Subjt:  GLSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVMLREDISPNHATIHGALSAAA

Query:  GLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNAC
         L +L +G+W+HSY+ K     + VLG  LI+MY+KCG ++ AL VF +I +K +  WTA+I G   HG   + +  F EM + G+KP+ ITF  VL AC
Subjt:  GLVSLGKGRWVHSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNAC

Query:  SHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSN
        S+ G  E+    F  M  DY ++P+IEHYGC++D+L RAG L+EAK  I+ MP+K N VIW +LL   R H +I +GE     LI + P   G YV  +N
Subjt:  SHAGFAEDAHRYFKMMTEDYGIEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSN

Query:  MYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAI
        ++A    W+K  + R +MK++G+ K PGCS+I  +G+ HEF+ GDRSHP+ E+I  K   M+ KL   G+VP+  ++LL L + +E+EA +  HSE+LAI
Subjt:  MYAAAGLWEKVRQVREIMKKKGIRKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAI

Query:  AFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW
         +GL+  K G+ IRI+KNLR+C DCH VTKLIS IY R+I++RD +RFHHF+ G CSC D+W
Subjt:  AFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREIIIRDGSRFHHFKSGSCSCKDFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTCTCTTACACTTTCGCCTTCCCTCCACCCGTTTCTTCCTCGCAGCCTTCATTTTCCTCTTCAAAGCTGCGAAACTGAACGAGAAGCCAAGCAACTCCACGCTCT
CTCCCTCAAAACAGGCTCCTTGAATTACCCTTCAATATCTTCTCGTCTTTTGGCCCTCTATGCAGATCCCAGAATCAACAATCTCGAATATGCTCGATCCCTCTTTGACT
GGATTCCAAAACCCACTTTGGTTTCTTGGAATATGATCATCAAGTGCTACATCGAGAACCAACGTTCAAATGATGCCATTGTGTTGTTCTGCAACTTGCTCTGTGAGTTA
ATGCCTGATTCCTTTACATTGCCTTGTGTTCTAAAGGGTTGTGCTCGATTGAGTGCACTAGAGGAGGGGAAACAGATTCATGGGTTGATATTGAAAATTGGGTTTGGTGT
GGATAAGTTTGTTTTGAGTAGTTTGGTTAGCCTGTATTCTAAATGTGGTGAGATTGAGCTGTGTAGGAAAGTGTTTGATCGAATGGAAGATAAGGATATTGTCTCATGGA
ATTCTTTGATTGATGGATATGCTAGATGTGGTGAGATTGAACTGGCACTCGAGTTGTTCGAAGAAATGCCAGAGAAGGATTCTTTTTCTTGGACTATTCTGGTTGATGGG
CTTTCGAAAAGTGGGAAGCTCGAGACTGCTAGAGATGTGTTCGATCGAATGCCTACTAGAAATCCTGTATCTTGGAATGCTATGATCAATGGCTACATGAAAGCCGGGGA
TTTTAACAGGGCACGGGAATTATTCGATCAAATGCCAGAGAGAAACCTCGTTACATGGAATTCAATGATCAATGGATATGAACTGAACAGGCAGTTTCCACAAGCCTTGA
AGCTGTTTGAGGTCATGTTGAGAGAAGATATATCACCAAATCATGCCACTATACATGGAGCTCTTTCTGCAGCTGCAGGACTAGTTAGTCTTGGTAAGGGAAGATGGGTT
CATTCCTATATAGTGAAAAATGGATTCGAAACGAATGGTGTGCTCGGCACGTCACTGATAGAAATGTACTCCAAGTGTGGCAGCATTAAGAGTGCCCTCAGAGTTTTTCA
TTCTATACCTAGAAAGAAATTGGGACATTGGACGGCTATAATTGTAGGCTTGGGAATGCATGGTTTGGTAGAGCAAACTCTTGAGCTATTTGATGAAATGTGCAGAACTG
GGTTGAAGCCTCATGCCATTACTTTTATTGGAGTGTTGAATGCTTGTAGTCATGCAGGATTTGCAGAAGATGCCCATCGGTACTTCAAAATGATGACAGAGGATTATGGA
ATTGAACCATCAATCGAACACTACGGTTGCTTAATTGATGTTCTGTGTCGAGCTGGATATCTTGAAGAGGCAAAGAATACCATTGAAAGAATGCCTATCAAAGCAAACAA
AGTAATTTGGATGAGTCTACTAAGTGGTTCAAGGAAACATGGAGACATAAGAATGGGGGAATATGCAGCTCATCATCTGATTGATTTAGCGCCGGATACTACTGGATGTT
ATGTTATTCTCTCGAACATGTATGCAGCAGCTGGCTTGTGGGAAAAAGTTCGGCAAGTGAGAGAAATAATGAAGAAAAAAGGAATCAGAAAGGATCCAGGATGCAGTTCC
ATTGAGCATCAAGGTTCAATCCATGAATTCATTGTGGGAGATAGGTCACATCCTCAAACCGAAGAGATATACGTCAAACTGTGTGAGATGAAAGACAAATTGAATGTAGC
CGGACATGTTCCCGACACGACTCAAGTTCTTTTATGCCTTGAAGAGGGTAATGAGAAAGAAGCAGAACTTGAAACCCATAGTGAGAGGTTGGCAATAGCTTTTGGTCTTC
TTAACATCAAGCATGGAAGTCCTATCCGCATCATAAAGAATCTTCGTATTTGCAACGATTGTCATGCTGTGACTAAACTTATTTCTCATATATACAACCGTGAGATCATT
ATCAGAGATGGTAGTCGATTCCATCACTTTAAAAGTGGGTCTTGTTCTTGTAAAGATTTTTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTCTCTTACACTTTCGCCTTCCCTCCACCCGTTTCTTCCTCGCAGCCTTCATTTTCCTCTTCAAAGCTGCGAAACTGAACGAGAAGCCAAGCAACTCCACGCTCT
CTCCCTCAAAACAGGCTCCTTGAATTACCCTTCAATATCTTCTCGTCTTTTGGCCCTCTATGCAGATCCCAGAATCAACAATCTCGAATATGCTCGATCCCTCTTTGACT
GGATTCCAAAACCCACTTTGGTTTCTTGGAATATGATCATCAAGTGCTACATCGAGAACCAACGTTCAAATGATGCCATTGTGTTGTTCTGCAACTTGCTCTGTGAGTTA
ATGCCTGATTCCTTTACATTGCCTTGTGTTCTAAAGGGTTGTGCTCGATTGAGTGCACTAGAGGAGGGGAAACAGATTCATGGGTTGATATTGAAAATTGGGTTTGGTGT
GGATAAGTTTGTTTTGAGTAGTTTGGTTAGCCTGTATTCTAAATGTGGTGAGATTGAGCTGTGTAGGAAAGTGTTTGATCGAATGGAAGATAAGGATATTGTCTCATGGA
ATTCTTTGATTGATGGATATGCTAGATGTGGTGAGATTGAACTGGCACTCGAGTTGTTCGAAGAAATGCCAGAGAAGGATTCTTTTTCTTGGACTATTCTGGTTGATGGG
CTTTCGAAAAGTGGGAAGCTCGAGACTGCTAGAGATGTGTTCGATCGAATGCCTACTAGAAATCCTGTATCTTGGAATGCTATGATCAATGGCTACATGAAAGCCGGGGA
TTTTAACAGGGCACGGGAATTATTCGATCAAATGCCAGAGAGAAACCTCGTTACATGGAATTCAATGATCAATGGATATGAACTGAACAGGCAGTTTCCACAAGCCTTGA
AGCTGTTTGAGGTCATGTTGAGAGAAGATATATCACCAAATCATGCCACTATACATGGAGCTCTTTCTGCAGCTGCAGGACTAGTTAGTCTTGGTAAGGGAAGATGGGTT
CATTCCTATATAGTGAAAAATGGATTCGAAACGAATGGTGTGCTCGGCACGTCACTGATAGAAATGTACTCCAAGTGTGGCAGCATTAAGAGTGCCCTCAGAGTTTTTCA
TTCTATACCTAGAAAGAAATTGGGACATTGGACGGCTATAATTGTAGGCTTGGGAATGCATGGTTTGGTAGAGCAAACTCTTGAGCTATTTGATGAAATGTGCAGAACTG
GGTTGAAGCCTCATGCCATTACTTTTATTGGAGTGTTGAATGCTTGTAGTCATGCAGGATTTGCAGAAGATGCCCATCGGTACTTCAAAATGATGACAGAGGATTATGGA
ATTGAACCATCAATCGAACACTACGGTTGCTTAATTGATGTTCTGTGTCGAGCTGGATATCTTGAAGAGGCAAAGAATACCATTGAAAGAATGCCTATCAAAGCAAACAA
AGTAATTTGGATGAGTCTACTAAGTGGTTCAAGGAAACATGGAGACATAAGAATGGGGGAATATGCAGCTCATCATCTGATTGATTTAGCGCCGGATACTACTGGATGTT
ATGTTATTCTCTCGAACATGTATGCAGCAGCTGGCTTGTGGGAAAAAGTTCGGCAAGTGAGAGAAATAATGAAGAAAAAAGGAATCAGAAAGGATCCAGGATGCAGTTCC
ATTGAGCATCAAGGTTCAATCCATGAATTCATTGTGGGAGATAGGTCACATCCTCAAACCGAAGAGATATACGTCAAACTGTGTGAGATGAAAGACAAATTGAATGTAGC
CGGACATGTTCCCGACACGACTCAAGTTCTTTTATGCCTTGAAGAGGGTAATGAGAAAGAAGCAGAACTTGAAACCCATAGTGAGAGGTTGGCAATAGCTTTTGGTCTTC
TTAACATCAAGCATGGAAGTCCTATCCGCATCATAAAGAATCTTCGTATTTGCAACGATTGTCATGCTGTGACTAAACTTATTTCTCATATATACAACCGTGAGATCATT
ATCAGAGATGGTAGTCGATTCCATCACTTTAAAAGTGGGTCTTGTTCTTGTAAAGATTTTTGGTAA
Protein sequenceShow/hide protein sequence
MVSLTLSPSLHPFLPRSLHFPLQSCETEREAKQLHALSLKTGSLNYPSISSRLLALYADPRINNLEYARSLFDWIPKPTLVSWNMIIKCYIENQRSNDAIVLFCNLLCEL
MPDSFTLPCVLKGCARLSALEEGKQIHGLILKIGFGVDKFVLSSLVSLYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIELALELFEEMPEKDSFSWTILVDG
LSKSGKLETARDVFDRMPTRNPVSWNAMINGYMKAGDFNRARELFDQMPERNLVTWNSMINGYELNRQFPQALKLFEVMLREDISPNHATIHGALSAAAGLVSLGKGRWV
HSYIVKNGFETNGVLGTSLIEMYSKCGSIKSALRVFHSIPRKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHRYFKMMTEDYG
IEPSIEHYGCLIDVLCRAGYLEEAKNTIERMPIKANKVIWMSLLSGSRKHGDIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREIMKKKGIRKDPGCSS
IEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKDKLNVAGHVPDTTQVLLCLEEGNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVTKLISHIYNREII
IRDGSRFHHFKSGSCSCKDFW