; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009169 (gene) of Snake gourd v1 genome

Gene IDTan0009169
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG02:4345377..4347422
RNA-Seq ExpressionTan0009169
SyntenyTan0009169
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572234.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0090.01Show/hide
Query:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI
        M SLTLSHSLQPF P +LHFPLQNCETEREAKQFHALSLKTGSLNHPSIS RLLALYA+PRINNLEY QSLFD IR+PTLVSWNMLIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI

Query:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL
        +LFCKLLCEFMPDSFTLPCVLKGCARLSAL+EGKQIHGL+LKIG GVDKFVLSSLV+MYSKCGEIELCRKVFDRMEDKD+VSWNSLI GYARCGEIELAL
Subjt:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML
        ELF+EMPEKD+FSWTILVDGLSKSGKLK ARDVFDRMPTRNSV+WNAMINGYMKAG FNTARELF++MPERN V+WNSMITGYELN+QF QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML

Query:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
         EDISPN+AT+LGA SAASGL SLG GRWVHSYIVKN F+T+GVLGTSLIEMYSKCGSIK ALR F+SIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA
        CRTGLKPHAITFIGVLNACSHAGFA++A +YFK MTDD+GIEPSIEHYGCLID LCRAGYLEEA+ TIE MPI+AN VIW SLLSGSRKHGN RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA

Query:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI
         HL+DLAPDTTGCYVILSNMYAA GLWEKVR+VRE+MKKKGI KDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKL EMKEKLN+AGH+PDTTQVLLC+
Subjt:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI

Query:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGL+NIKHGSPIRIIKNLRICNDCHAVTK +S IYNREIIIRDGSRFHHFK+GSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

XP_004144616.1 pentatricopeptide repeat-containing protein At5g48910 [Cucumis sativus]0.0e+0089.57Show/hide
Query:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI
        MLS TLSHSLQPFLP +LHFPLQNC TEREA Q HALS+KT SLNHPS+SSRLLALYADPRINNL+YA SLFD I+EPTLVSWN+LIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI

Query:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL
        +LFCKLLC+F+PDSFTLPCVLKGCARL AL+EGKQIHGLVLKIG GVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKD+VSWNSLI GYARCGEIELAL
Subjt:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML
        E+FEEMPEKDSFSWTIL+DGLSKSGKL+ ARDVFDRMP RNSV+WNAMINGYMKAGD NTA+ELF+QMPER+LVTWNSMITGYE N+QF +ALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML

Query:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        REDISPN  TILGA+SAASG+VSLG GRWVHSYIVK+GF+T+GVLGT LIEMYSKCGS+KSALR F SIPKKKLGHWT++IVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA
        CRTGLKPHAITFIGVLNACSHAGFAEDAH+YFKMMT DYGI+PSIEHYGCLIDVLCRAG+LEEAKDTIE MPI+ANKVIWTSLLSGSRKHGN+RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA

Query:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI
        +HLIDLAPDTTGCYVILSNMYAAAGLWEKVR+VRE+MKKKG+ KDPGCSSIEHQGSIHEFIVGD+SHPQTEEIY+KLCEMK+KLN+AGH+PDTTQVLLC+
Subjt:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI

Query:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGL+NIKHGSPIRIIKNLRICNDCHAVTKL+SHIYNREIIIRDGSRFHHFK+GSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

XP_022135993.1 pentatricopeptide repeat-containing protein At5g48910-like [Momordica charantia]0.0e+0089.87Show/hide
Query:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI
        MLSL LSHSL PFLPR+LHFPLQNCETERE KQ HALSLKTGS NHPSISSRLLALY DPRINNLEYA+SLFD IREPTLVSWN+L+KCY+ENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI

Query:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL
        SLFC+LL EF+PDSFTLPCVLKGCARLSAL+EGKQIHGL+LKIG GVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKD+VSWNSLI GYARCG+IELAL
Subjt:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML
        E+F+EMPE+DSFSWTILVDGLSKSGKL+TARDVFDRMPTRNSV+WNAMINGYMKAGDFNTARELF+QMPERNLVTWNSMITGYELNRQF+QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML

Query:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        RE+ISPN+ATILGALSAASGLVS GKGRWVHS+IVKNGFET+GVLGTSLIEMYSKCGSI SALR F+SIPKKKLGHWTAIIVGLGMHGLV QTLELFDEM
Subjt:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA
        CR GLKPHAITFIG+LNACSHAGFA+DA+ YFKMM DDYGIEPSIEHYGCLIDVLCRAG LEEAK+TIE MPI+ NKVIW SLLSGSRKHGN+RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA

Query:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI
         HLIDLAPDTTGCY+ILSNMYA AGLWEKVR+VRE+MKKKGI KDPGCSSIEHQGS+HEFIVGDRSHPQTEEIY+KL EMKEKLN+AGHVPDTTQVLLC+
Subjt:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI

Query:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
        E+DNEKE+ELETHSERLAIAFGLINIKHG+P+RIIKNLRICNDCH V+KL+SHIYNREIIIRDGSRFHHFK+GSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

XP_022953072.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita moschata]0.0e+0090.46Show/hide
Query:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI
        M SLTLSHSLQPF P +LHFPLQNCETEREAKQFHALSLKTGSLNHPSIS RLLALYA+PRINNLEYAQSLFD IR+PTLVSWNMLIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI

Query:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL
        +LFCKLLCEFMPDSFTLPCVLKGCARLSAL+EGKQIHGL+LKIG GVDKFVLSSLV+MYSKCGEIELCRKVFDRMEDKD+VSWNSLI GYARCGEIELAL
Subjt:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML
        ELF+EMPEKD+FSWTILVDGLSKSGKLK ARDVFDRMPTRNSV+WNAMINGYMKAG FNTARELF++MPERN V+WNSMITGYELN+QF QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML

Query:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
         EDISPN+AT+LGA SAASGL SLG GRWVHSYIVKN F+T+GVLGTSLIEMYSKCGSIK ALR F+SIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA
        CRTGLKPHAITFIGVLNACSHAGFA++A +YFK MTDD+GIEPSIEHYGCLID LCRAGYLEEAKDTIE MPI+AN VIW SLLSGSRKHG+ RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA

Query:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI
         HL+DLAPDTTGCYVILSNMYAA GLWEKVR+VRE+MKKKGI KDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKL EMKEKLN+AGHVPDTTQVLLC+
Subjt:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI

Query:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGL+NIKHGSPIRIIKNLRICNDCHAVTK +S IYNREIIIRDGSRFHHFK+GSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

XP_038887831.1 pentatricopeptide repeat-containing protein At5g48910-like [Benincasa hispida]0.0e+0090.01Show/hide
Query:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI
        ML+LTLSHSLQPF+PR+LHFPLQNCETEREAKQ HALSLK GSLNHPS+SSRLLALYADPRINNLEYAQSLFD I++PTLVSWN+LIKCYIE+QRSNDAI
Subjt:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI

Query:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL
        +LFCK LCEF+PDSFTLPCVLKGC+RL AL+EGKQIHGLVLKIG GVDKFVLSSLVSMY+KCGEIELCRKVFDRMED+DIVSWNSLI GYARCGEIELAL
Subjt:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML
        +L EEMPEKDS SWTILVDGLSKSGKL+ ARDVFD+MPTRNSV+WNAMINGYMKAG+FNTARELF+QMPERNLVTWNSMI+GYELN+QF QALKL E ML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML

Query:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        REDISPN  TILGALSAASGLVSLGKGRWVHSYIVKNGF TEGVLGTSLIEMYSKCGS++SAL  F+SIP+KKLGHWTAIIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA
        CRTGL+PHAITFIGVLNACSHAGFAEDAH+YFKMMTDDYGI+PSIEHYGCLIDVLCRAGYLEEAKDTIE MP++ANKVIW SLLSGSRKHGN+RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA

Query:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI
         HLIDLAPDTTGCYVILSNMYAAAGLWEKV +VRE+MKKKGI KDPGCSSIEHQGS+HEFIVGD+SHPQT+EIY+KLCEMKEKL+ AGHVPDTTQVLLC+
Subjt:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI

Query:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
        EEDN+KEAELETHSERLAIAFGL+NI HGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFK+GSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

TrEMBL top hitse value%identityAlignment
A0A5A7V0C2 Pentatricopeptide repeat-containing protein0.0e+0088.55Show/hide
Query:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI
        MLSLTLSHSLQPFLP +LHFPLQNC TEREA Q HALS+KT SLNHPS+SS LLALYA P INNL+YAQSLFD I++PTLVSWN+LIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI

Query:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL
        +LFCKLLC+FMPDSFTLPCVLKGCARL AL+EGKQIHGLVLKIG GVDKFVLSSLVSMYSKCGEIE+CRKVFDRMEDKD+VSWNSLI GYARCGEIELAL
Subjt:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML
        E+FEEMPEKDSFSWTIL+DGLSKSGKL+ ARDVFDRMP RNSV+WNAMINGYMKAGD NTA+ELF+QMPER+LVTWNSMITGYE N+QF +ALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML

Query:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        REDISPN  TILGA+SAASGLVSLG GRWVHSYIVKNGF+T+GVLGT LIEMYSKCGS+KSALR F  I KKKLGHWT+IIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA
        CRTGL+PHAITFIGVLNACSHAGFAEDAH+YFKMMT DYGI+P+IEHYGCLIDVLCRAGYLEEAKDTI+ MPI+ANKVIWTSLLSGSRKHGN+RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA

Query:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI
        +HLIDLAPDTTGCYVILSNMYAAAGLWEKVR+VRE+MK+K I KDPGCSSIEHQGSIHEFIVGD+SHPQTEEIY+KL EMK+KLN+AGH+PDT+QVLLC+
Subjt:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI

Query:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGL++IKHGSPIRIIKNLRICNDCHAVTKL+SHIYNREIIIRDGSRFHHFK+GSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

A0A5D3D6Y7 Pentatricopeptide repeat-containing protein0.0e+0088.84Show/hide
Query:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI
        MLSLTLSHSLQPFLP +LHFPLQNC TEREA Q HALS+KT SLNHPS+SS LLALYA P INNL+YAQSLFD I++PTLVSWN+LIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI

Query:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL
        +LFCKLLC+FMPDSFTLPCVLKGCARL AL+EGKQIHGLVLKIG GVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKD+VSWNSLI GYARCGEIELAL
Subjt:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML
        E+FEEMPEKDSFSWTIL+DGLSKSGKL+ AR VFDRMP RNSV+WNAMINGYMKAGD NTA+ELF+QMPER+LVTWNSMITGYE N+QF +ALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML

Query:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        REDISPN  TILGA+SAASGLVSLG GRWVHSYIVKNGF+T+GVLGT LIEMYSKCGS+KSALR F  I KKKLGHWT+IIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA
        CRTGL+PHAITFIGVLNACSHAGFAEDAH+YFKMMT DYGI+P+IEHYGCLIDVLCRAGYLEEAKDTIE MPI+ANKVIWTSLLSGSRKHGN+RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA

Query:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI
        +HLIDLAPDTTGCYVILSNMYAA GLWEKVR+VRE+MK+KGI KDPGCSSIEHQGSIHEFIVGD+SHPQTEEIY+KL EMK+KLN+AGH+PDT+QVLLC+
Subjt:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI

Query:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGL+NIKHGSPIRIIKNLRICNDCHAVTKL+SHIYNREIIIRDGSRFHHFK+GSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

A0A6J1C6D9 pentatricopeptide repeat-containing protein At5g48910-like0.0e+0089.87Show/hide
Query:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI
        MLSL LSHSL PFLPR+LHFPLQNCETERE KQ HALSLKTGS NHPSISSRLLALY DPRINNLEYA+SLFD IREPTLVSWN+L+KCY+ENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI

Query:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL
        SLFC+LL EF+PDSFTLPCVLKGCARLSAL+EGKQIHGL+LKIG GVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKD+VSWNSLI GYARCG+IELAL
Subjt:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML
        E+F+EMPE+DSFSWTILVDGLSKSGKL+TARDVFDRMPTRNSV+WNAMINGYMKAGDFNTARELF+QMPERNLVTWNSMITGYELNRQF+QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML

Query:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
        RE+ISPN+ATILGALSAASGLVS GKGRWVHS+IVKNGFET+GVLGTSLIEMYSKCGSI SALR F+SIPKKKLGHWTAIIVGLGMHGLV QTLELFDEM
Subjt:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA
        CR GLKPHAITFIG+LNACSHAGFA+DA+ YFKMM DDYGIEPSIEHYGCLIDVLCRAG LEEAK+TIE MPI+ NKVIW SLLSGSRKHGN+RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA

Query:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI
         HLIDLAPDTTGCY+ILSNMYA AGLWEKVR+VRE+MKKKGI KDPGCSSIEHQGS+HEFIVGDRSHPQTEEIY+KL EMKEKLN+AGHVPDTTQVLLC+
Subjt:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI

Query:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
        E+DNEKE+ELETHSERLAIAFGLINIKHG+P+RIIKNLRICNDCH V+KL+SHIYNREIIIRDGSRFHHFK+GSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

A0A6J1GM70 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0090.46Show/hide
Query:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI
        M SLTLSHSLQPF P +LHFPLQNCETEREAKQFHALSLKTGSLNHPSIS RLLALYA+PRINNLEYAQSLFD IR+PTLVSWNMLIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI

Query:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL
        +LFCKLLCEFMPDSFTLPCVLKGCARLSAL+EGKQIHGL+LKIG GVDKFVLSSLV+MYSKCGEIELCRKVFDRMEDKD+VSWNSLI GYARCGEIELAL
Subjt:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML
        ELF+EMPEKD+FSWTILVDGLSKSGKLK ARDVFDRMPTRNSV+WNAMINGYMKAG FNTARELF++MPERN V+WNSMITGYELN+QF QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML

Query:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
         EDISPN+AT+LGA SAASGL SLG GRWVHSYIVKN F+T+GVLGTSLIEMYSKCGSIK ALR F+SIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA
        CRTGLKPHAITFIGVLNACSHAGFA++A +YFK MTDD+GIEPSIEHYGCLID LCRAGYLEEAKDTIE MPI+AN VIW SLLSGSRKHG+ RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA

Query:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI
         HL+DLAPDTTGCYVILSNMYAA GLWEKVR+VRE+MKKKGI KDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKL EMKEKLN+AGHVPDTTQVLLC+
Subjt:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI

Query:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGL+NIKHGSPIRIIKNLRICNDCHAVTK +S IYNREIIIRDGSRFHHFK+GSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

A0A6J1HYS6 pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like0.0e+0090.01Show/hide
Query:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI
        M SLTLSHSL PFLP +LHFPLQNCETEREAKQFHALS+KTGSLN PSIS RLLALYA+PRINNLEYAQSLFD IR+PTLVSWNMLIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAI

Query:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL
        +LFCKLLCEFMPDSFTLPCVLKGCARLSAL+EGKQIHGL+LKIG GVDKFVLSSLV+MYSKCGEIELCRKVFDRMEDKD+VSWNSLI GYARCGEIELAL
Subjt:  SLFCKLLCEFMPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELAL

Query:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML
        ELF+EMPEKD+FSWTILVDGLSKSGKLK ARDVFDRMPTRNS++WNAMINGYMKAG FNTARELF++MPERN V+WNSMITGYELN+QF QALKLFEVML
Subjt:  ELFEEMPEKDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVML

Query:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM
         EDISPN+AT+LGALSAASGL SLG GRWVHSYIVKN F+T+GVLGTSLIEMYSKCGSIK ALR F+SIPK+KLGHWTAIIVGLGMHGLVEQTLELFDEM
Subjt:  REDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEM

Query:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA
        CRTGLKPHAITFIGVLNACSHAGFAE+A +YFK MTDD+GIEPSIEHYGCLID LCRAGYLEEAKDTIE MPI+AN VIW SLLSGSRKHGN RMGEYAA
Subjt:  CRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAA

Query:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI
         HL+DLAPDTTGCYVILSNMYAA GLWE  R+VRE+MKKKGI KDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKL EMKEKLN+AGHVPDTTQVLLC+
Subjt:  RHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCI

Query:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGL+NIKHGSPIRIIKNLRICNDCHAVTK +S IYNREIIIRDGSRFHHFK+GSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic4.0e-16239.63Show/hide
Query:  RSLHFPL-QNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLCEFM--P
        RS H  L + C + R+ KQ H   ++TG+ + P  +S+L A+ A     +LEYA+ +FD I +P   +WN LI+ Y        +I  F  ++ E    P
Subjt:  RSLHFPL-QNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLCEFM--P

Query:  DSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKD--
        + +T P ++K  A +S+L  G+ +HG+ +K   G D FV +SL+  Y  CG+++   KVF  +++KD+VSWNS+I G+ + G  + ALELF++M  +D  
Subjt:  DSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKD--

Query:  ----------------------------------SFSWTI---LVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNL
                                          + + T+   ++D  +K G ++ A+ +FD M  +++VTW  M++GY  + D+  ARE+ N MP++++
Subjt:  ----------------------------------SFSWTI---LVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNL

Query:  VTWNSMITGYELNRQFAQALKLF-EVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKK
        V WN++I+ YE N +  +AL +F E+ L++++  N  T++  LSA + + +L  GRW+HSYI K+G      + ++LI MYSKCG ++ +   F S+ K+
Subjt:  VTWNSMITGYELNRQFAQALKLF-EVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKK

Query:  KLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMP
         +  W+A+I GL MHG   + +++F +M    +KP+ +TF  V  ACSH G  ++A   F  M  +YGI P  +HY C++DVL R+GYLE+A   IE MP
Subjt:  KLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMP

Query:  IEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEE
        I  +  +W +LL   + H NL + E A   L++L P   G +V+LSN+YA  G WE V E+R+ M+  G+ K+PGCSSIE  G IHEF+ GD +HP +E+
Subjt:  IEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEE

Query:  IYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKN
        +Y KL E+ EKL   G+ P+ +QVL  IEE+  KE  L  HSE+LAI +GLI+ +    IR+IKNLR+C DCH+V KL+S +Y+REII+RD  RFHHF+N
Subjt:  IYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKN

Query:  GSCSCKDFW
        G CSC DFW
Subjt:  GSCSCKDFW

Q9FI80 Pentatricopeptide repeat-containing protein At5g489101.6e-15541.9Show/hide
Query:  PRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYA--DPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSND--AISLFCKLLC-E
        P SL   + NC T R+  Q HA+ +K+G +     ++ +L   A  D    +L+YA  +F+++ +    SWN +I+ + E+       AI+LF +++  E
Subjt:  PRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYA--DPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSND--AISLFCKLLC-E

Query:  FM-PDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPE
        F+ P+ FT P VLK CA+   ++EGKQIHGL LK G G D+FV+S+LV MY  CG ++  R +F                              ++ + E
Subjt:  FM-PDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPE

Query:  KDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVMLREDISPNN
        KD     ++ D   + G++               V WN MI+GYM+ GD   AR LF++M +R++V+WN+MI+GY LN  F  A+++F  M + DI PN 
Subjt:  KDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVMLREDISPNN

Query:  ATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPH
         T++  L A S L SL  G W+H Y   +G   + VLG++LI+MYSKCG I+ A+  FE +P++ +  W+A+I G  +HG     ++ F +M + G++P 
Subjt:  ATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPH

Query:  AITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAP
         + +I +L ACSH G  E+  +YF  M    G+EP IEHYGC++D+L R+G L+EA++ I  MPI+ + VIW +LL   R  GN+ MG+  A  L+D+ P
Subjt:  AITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAP

Query:  DTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEA
          +G YV LSNMYA+ G W +V E+R  MK+K I KDPGCS I+  G +HEF+V D SHP+ +EI   L E+ +KL LAG+ P TTQVLL +EE+ +KE 
Subjt:  DTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEA

Query:  ELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
         L  HSE++A AFGLI+   G PIRI+KNLRIC DCH+  KL+S +Y R+I +RD  RFHHF++GSCSC D+W
Subjt:  ELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665207.6e-14539.12Show/hide
Query:  LQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINN-LEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLCEFMP-DSFTLPC
        LQ C  + E KQ HA  LKTG +      ++ L+       ++ L YAQ +FD    P    WN++I+ +  +     ++ L+ ++LC   P +++T P 
Subjt:  LQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINN-LEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLCEFMP-DSFTLPC

Query:  VLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKDSFSWTILVD
        +LK C+ LSA EE  QIH  + K+G   D + ++SL++ Y+  G  +L   +FDR+ + D VSWNS+I GY + G++++AL LF +M EK          
Subjt:  VLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKDSFSWTILVD

Query:  GLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVMLREDISPNNATILGALSAAS
                             N+++W  MI+GY++A                            ++N+   +AL+LF  M   D+ P+N ++  ALSA +
Subjt:  GLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVMLREDISPNNATILGALSAAS

Query:  GLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNAC
         L +L +G+W+HSY+ K     + VLG  LI+MY+KCG ++ AL  F++I KK +  WTA+I G   HG   + +  F EM + G+KP+ ITF  VL AC
Subjt:  GLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNAC

Query:  SHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSN
        S+ G  E+    F  M  DY ++P+IEHYGC++D+L RAG L+EAK  I+ MP++ N VIW +LL   R H N+ +GE     LI + P   G YV  +N
Subjt:  SHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSN

Query:  MYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAI
        ++A    W+K  E R +MK++G++K PGCS+I  +G+ HEF+ GDRSHP+ E+I  K   M+ KL   G+VP+  ++LL + +D+E+EA +  HSE+LAI
Subjt:  MYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAI

Query:  AFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
         +GLI  K G+ IRI+KNLR+C DCH VTKL+S IY R+I++RD +RFHHF++G CSC D+W
Subjt:  AFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic6.2e-16342.11Show/hide
Query:  LQNCETEREAKQFHALSLKTGSLNHPSISSRLLAL-YADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLF-CKLLCEFMPDSFTLPC
        L NC+T +  +  HA  +K G  N     S+L+      P    L YA S+F  I+EP L+ WN + + +  +     A+ L+ C +    +P+S+T P 
Subjt:  LQNCETEREAKQFHALSLKTGSLNHPSISSRLLAL-YADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLF-CKLLCEFMPDSFTLPC

Query:  VLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKDSFSWTILVD
        VLK CA+  A +EG+QIHG VLK+G  +D +V +SL+SMY + G +E   KVFD+   +D+VS+ +LI GYA  G IE A +LF+E+P KD  SW  ++ 
Subjt:  VLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKDSFSWTILVD

Query:  GLSKSGKLKTARDVF-DRMPT-------------------------RNSVTW-------------NAMINGYMKAGDFNTARELFNQMPERNLVTWNSMI
        G +++G  K A ++F D M T                         R    W             NA+I+ Y K G+  TA  LF ++P +++++WN++I
Subjt:  GLSKSGKLKTARDVF-DRMPT-------------------------RNSVTW-------------NAMINGYMKAGDFNTARELFNQMPERNLVTWNSMI

Query:  TGYELNRQFAQALKLFEVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVK--NGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWT
         GY     + +AL LF+ MLR   +PN+ T+L  L A + L ++  GRW+H YI K   G      L TSLI+MY+KCG I++A + F SI  K L  W 
Subjt:  TGYELNRQFAQALKLFEVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVK--NGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWT

Query:  AIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKV
        A+I G  MHG  + + +LF  M + G++P  ITF+G+L+ACSH+G  +     F+ MT DY + P +EHYGC+ID+L  +G  +EA++ I  M +E + V
Subjt:  AIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKV

Query:  IWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLC
        IW SLL   + HGN+ +GE  A +LI + P+  G YV+LSN+YA+AG W +V + R ++  KG+ K PGCSSIE    +HEFI+GD+ HP+  EIY  L 
Subjt:  IWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLC

Query:  EMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCK
        EM+  L  AG VPDT++VL  +EE+  KE  L  HSE+LAIAFGLI+ K G+ + I+KNLR+C +CH  TKL+S IY REII RD +RFHHF++G CSC 
Subjt:  EMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCK

Query:  DFW
        D+W
Subjt:  DFW

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226904.0e-14637.91Show/hide
Query:  FPLQNCETER---EAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLC--EFMPDS
        F L  C   R      Q H L +K G      + + L+  YA+     L+ A+ +FD + E  +VSW  +I  Y     + DA+ LF +++   E  P+S
Subjt:  FPLQNCETER---EAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLC--EFMPDS

Query:  FTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEM------PE
         T+ CV+  CA+L  LE G++++  +   G  V+  ++S+LV MY KC  I++ +++FD     ++   N++   Y R G    AL +F  M      P+
Subjt:  FTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEM------PE

Query:  KDSF-----------------------------SW----TILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVT
        + S                              SW      L+D   K  +  TA  +FDRM  +  VTWN+++ GY++ G+ + A E F  MPE+N+V+
Subjt:  KDSF-----------------------------SW----TILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVT

Query:  WNSMITGYELNRQFAQALKLF-EVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKL
        WN++I+G      F +A+++F  +  +E ++ +  T++   SA   L +L   +W++ YI KNG + +  LGT+L++M+S+CG  +SA+  F S+  + +
Subjt:  WNSMITGYELNRQFAQALKLF-EVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKL

Query:  GHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIE
          WTA I  + M G  E+ +ELFD+M   GLKP  + F+G L ACSH G  +   + F  M   +G+ P   HYGC++D+L RAG LEEA   IE MP+E
Subjt:  GHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIE

Query:  ANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIY
         N VIW SLL+  R  GN+ M  YAA  +  LAP+ TG YV+LSN+YA+AG W  + +VR  MK+KG+ K PG SSI+ +G  HEF  GD SHP+   I 
Subjt:  ANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIY

Query:  VKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGS
          L E+ ++ +  GHVPD + VL+ ++E  EK   L  HSE+LA+A+GLI+   G+ IRI+KNLR+C+DCH+  K  S +YNREII+RD +RFH+ + G 
Subjt:  VKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGS

Query:  CSCKDFW
        CSC DFW
Subjt:  CSCKDFW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.4e-16442.11Show/hide
Query:  LQNCETEREAKQFHALSLKTGSLNHPSISSRLLAL-YADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLF-CKLLCEFMPDSFTLPC
        L NC+T +  +  HA  +K G  N     S+L+      P    L YA S+F  I+EP L+ WN + + +  +     A+ L+ C +    +P+S+T P 
Subjt:  LQNCETEREAKQFHALSLKTGSLNHPSISSRLLAL-YADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLF-CKLLCEFMPDSFTLPC

Query:  VLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKDSFSWTILVD
        VLK CA+  A +EG+QIHG VLK+G  +D +V +SL+SMY + G +E   KVFD+   +D+VS+ +LI GYA  G IE A +LF+E+P KD  SW  ++ 
Subjt:  VLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKDSFSWTILVD

Query:  GLSKSGKLKTARDVF-DRMPT-------------------------RNSVTW-------------NAMINGYMKAGDFNTARELFNQMPERNLVTWNSMI
        G +++G  K A ++F D M T                         R    W             NA+I+ Y K G+  TA  LF ++P +++++WN++I
Subjt:  GLSKSGKLKTARDVF-DRMPT-------------------------RNSVTW-------------NAMINGYMKAGDFNTARELFNQMPERNLVTWNSMI

Query:  TGYELNRQFAQALKLFEVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVK--NGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWT
         GY     + +AL LF+ MLR   +PN+ T+L  L A + L ++  GRW+H YI K   G      L TSLI+MY+KCG I++A + F SI  K L  W 
Subjt:  TGYELNRQFAQALKLFEVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVK--NGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWT

Query:  AIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKV
        A+I G  MHG  + + +LF  M + G++P  ITF+G+L+ACSH+G  +     F+ MT DY + P +EHYGC+ID+L  +G  +EA++ I  M +E + V
Subjt:  AIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKV

Query:  IWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLC
        IW SLL   + HGN+ +GE  A +LI + P+  G YV+LSN+YA+AG W +V + R ++  KG+ K PGCSSIE    +HEFI+GD+ HP+  EIY  L 
Subjt:  IWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLC

Query:  EMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCK
        EM+  L  AG VPDT++VL  +EE+  KE  L  HSE+LAIAFGLI+ K G+ + I+KNLR+C +CH  TKL+S IY REII RD +RFHHF++G CSC 
Subjt:  EMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCK

Query:  DFW
        D+W
Subjt:  DFW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.8e-16339.63Show/hide
Query:  RSLHFPL-QNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLCEFM--P
        RS H  L + C + R+ KQ H   ++TG+ + P  +S+L A+ A     +LEYA+ +FD I +P   +WN LI+ Y        +I  F  ++ E    P
Subjt:  RSLHFPL-QNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLCEFM--P

Query:  DSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKD--
        + +T P ++K  A +S+L  G+ +HG+ +K   G D FV +SL+  Y  CG+++   KVF  +++KD+VSWNS+I G+ + G  + ALELF++M  +D  
Subjt:  DSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKD--

Query:  ----------------------------------SFSWTI---LVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNL
                                          + + T+   ++D  +K G ++ A+ +FD M  +++VTW  M++GY  + D+  ARE+ N MP++++
Subjt:  ----------------------------------SFSWTI---LVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNL

Query:  VTWNSMITGYELNRQFAQALKLF-EVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKK
        V WN++I+ YE N +  +AL +F E+ L++++  N  T++  LSA + + +L  GRW+HSYI K+G      + ++LI MYSKCG ++ +   F S+ K+
Subjt:  VTWNSMITGYELNRQFAQALKLF-EVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKK

Query:  KLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMP
         +  W+A+I GL MHG   + +++F +M    +KP+ +TF  V  ACSH G  ++A   F  M  +YGI P  +HY C++DVL R+GYLE+A   IE MP
Subjt:  KLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMP

Query:  IEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEE
        I  +  +W +LL   + H NL + E A   L++L P   G +V+LSN+YA  G WE V E+R+ M+  G+ K+PGCSSIE  G IHEF+ GD +HP +E+
Subjt:  IEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEE

Query:  IYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKN
        +Y KL E+ EKL   G+ P+ +QVL  IEE+  KE  L  HSE+LAI +GLI+ +    IR+IKNLR+C DCH+V KL+S +Y+REII+RD  RFHHF+N
Subjt:  IYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKN

Query:  GSCSCKDFW
        G CSC DFW
Subjt:  GSCSCKDFW

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)5.4e-14637.82Show/hide
Query:  FPLQNCETER---EAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLC--EFMPDS
        F L  C   R      Q H L +K G      + + L+  YA+     L+ A+ +FD + E  +VSW  +I  Y     + DA+ LF +++   E  P+S
Subjt:  FPLQNCETER---EAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLC--EFMPDS

Query:  FTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEM------PE
         T+ CV+  CA+L  LE G++++  +   G  V+  ++S+LV MY KC  I++ +++FD     ++   N++   Y R G    AL +F  M      P+
Subjt:  FTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEM------PE

Query:  KDSF-----------------------------SW----TILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVT
        + S                              SW      L+D   K  +  TA  +FDRM  +  VTWN+++ GY++ G+ + A E F  MPE+N+V+
Subjt:  KDSF-----------------------------SW----TILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVT

Query:  WNSMITGYELNRQFAQALKLF-EVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKL
        WN++I+G      F +A+++F  +  +E ++ +  T++   SA   L +L   +W++ YI KNG + +  LGT+L++M+S+CG  +SA+  F S+  + +
Subjt:  WNSMITGYELNRQFAQALKLF-EVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKL

Query:  GHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIE
          WTA I  + M G  E+ +ELFD+M   GLKP  + F+G L ACSH G  +   + F  M   +G+ P   HYGC++D+L RAG LEEA   IE MP+E
Subjt:  GHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIE

Query:  ANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIY
         N VIW SLL+  R  GN+ M  YAA  +  LAP+ TG YV+LSN+YA+AG W  + +VR  MK+KG+ K PG SSI+ +G  HEF  GD SHP+   I 
Subjt:  ANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIY

Query:  VKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGS
          L E+ ++ +  GHVPD + VL+ ++E  EK   L  HSE+LA+A+GLI+   G+ IRI+KNLR+C+DCH+  K  S +YNREII+RD +RFH+ + G 
Subjt:  VKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGS

Query:  CSCKDF
        CSC DF
Subjt:  CSCKDF

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification2.9e-14737.91Show/hide
Query:  FPLQNCETER---EAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLC--EFMPDS
        F L  C   R      Q H L +K G      + + L+  YA+     L+ A+ +FD + E  +VSW  +I  Y     + DA+ LF +++   E  P+S
Subjt:  FPLQNCETER---EAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLC--EFMPDS

Query:  FTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEM------PE
         T+ CV+  CA+L  LE G++++  +   G  V+  ++S+LV MY KC  I++ +++FD     ++   N++   Y R G    AL +F  M      P+
Subjt:  FTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEM------PE

Query:  KDSF-----------------------------SW----TILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVT
        + S                              SW      L+D   K  +  TA  +FDRM  +  VTWN+++ GY++ G+ + A E F  MPE+N+V+
Subjt:  KDSF-----------------------------SW----TILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVT

Query:  WNSMITGYELNRQFAQALKLF-EVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKL
        WN++I+G      F +A+++F  +  +E ++ +  T++   SA   L +L   +W++ YI KNG + +  LGT+L++M+S+CG  +SA+  F S+  + +
Subjt:  WNSMITGYELNRQFAQALKLF-EVMLREDISPNNATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKL

Query:  GHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIE
          WTA I  + M G  E+ +ELFD+M   GLKP  + F+G L ACSH G  +   + F  M   +G+ P   HYGC++D+L RAG LEEA   IE MP+E
Subjt:  GHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIE

Query:  ANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIY
         N VIW SLL+  R  GN+ M  YAA  +  LAP+ TG YV+LSN+YA+AG W  + +VR  MK+KG+ K PG SSI+ +G  HEF  GD SHP+   I 
Subjt:  ANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIY

Query:  VKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGS
          L E+ ++ +  GHVPD + VL+ ++E  EK   L  HSE+LA+A+GLI+   G+ IRI+KNLR+C+DCH+  K  S +YNREII+RD +RFH+ + G 
Subjt:  VKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGS

Query:  CSCKDFW
        CSC DFW
Subjt:  CSCKDFW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-15641.9Show/hide
Query:  PRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYA--DPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSND--AISLFCKLLC-E
        P SL   + NC T R+  Q HA+ +K+G +     ++ +L   A  D    +L+YA  +F+++ +    SWN +I+ + E+       AI+LF +++  E
Subjt:  PRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYA--DPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSND--AISLFCKLLC-E

Query:  FM-PDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPE
        F+ P+ FT P VLK CA+   ++EGKQIHGL LK G G D+FV+S+LV MY  CG ++  R +F                              ++ + E
Subjt:  FM-PDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPE

Query:  KDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVMLREDISPNN
        KD     ++ D   + G++               V WN MI+GYM+ GD   AR LF++M +R++V+WN+MI+GY LN  F  A+++F  M + DI PN 
Subjt:  KDSFSWTILVDGLSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVMLREDISPNN

Query:  ATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPH
         T++  L A S L SL  G W+H Y   +G   + VLG++LI+MYSKCG I+ A+  FE +P++ +  W+A+I G  +HG     ++ F +M + G++P 
Subjt:  ATILGALSAASGLVSLGKGRWVHSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPH

Query:  AITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAP
         + +I +L ACSH G  E+  +YF  M    G+EP IEHYGC++D+L R+G L+EA++ I  MPI+ + VIW +LL   R  GN+ MG+  A  L+D+ P
Subjt:  AITFIGVLNACSHAGFAEDAHQYFKMMTDDYGIEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAP

Query:  DTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEA
          +G YV LSNMYA+ G W +V E+R  MK+K I KDPGCS I+  G +HEF+V D SHP+ +EI   L E+ +KL LAG+ P TTQVLL +EE+ +KE 
Subjt:  DTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSSIEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEA

Query:  ELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW
         L  HSE++A AFGLI+   G PIRI+KNLRIC DCH+  KL+S +Y R+I +RD  RFHHF++GSCSC D+W
Subjt:  ELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREIIIRDGSRFHHFKNGSCSCKDFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTATCTCTTACACTTTCGCATTCCCTCCAACCATTTCTTCCTCGCAGCCTTCATTTTCCTCTTCAAAATTGCGAAACTGAACGAGAAGCAAAGCAATTCCATGCGCT
CTCGCTCAAGACAGGCTCCTTGAATCACCCTTCAATATCTTCTCGTCTTTTGGCCCTCTATGCAGATCCCAGAATCAACAATCTCGAGTATGCGCAGTCCCTTTTTGACA
GGATTCGAGAACCCACTTTGGTATCTTGGAACATGCTCATCAAGTGCTACATCGAGAACCAACGTTCAAATGATGCCATTTCGTTGTTCTGCAAATTGCTCTGTGAGTTC
ATGCCTGATTCTTTTACATTGCCTTGTGTCCTTAAGGGTTGTGCTCGATTGAGTGCACTAGAGGAGGGGAAACAGATTCATGGGTTGGTATTGAAAATTGGGTCTGGTGT
CGATAAGTTTGTTTTGAGTAGTTTGGTTAGTATGTATTCTAAATGTGGTGAGATTGAGTTGTGTAGGAAAGTGTTTGATCGAATGGAAGATAAGGATATAGTATCATGGA
ATTCGTTGATTTATGGATATGCTAGATGTGGTGAAATTGAATTGGCACTTGAGTTGTTTGAAGAAATGCCAGAGAAGGATTCTTTTTCATGGACTATTCTGGTTGATGGG
CTTTCGAAGAGTGGAAAGTTAAAGACTGCTAGAGACGTGTTCGATAGAATGCCTACTCGAAATTCTGTAACTTGGAATGCTATGATTAATGGCTACATGAAAGCTGGGGA
TTTTAACACGGCACGAGAACTATTCAATCAAATGCCAGAGAGAAACCTCGTTACATGGAATTCAATGATCACTGGATATGAACTGAACAGGCAGTTTGCACAAGCCTTAA
AGCTGTTTGAGGTCATGTTGAGAGAAGATATATCACCCAATAATGCCACTATCCTTGGAGCTCTTTCTGCAGCTTCAGGACTGGTTAGTCTTGGTAAGGGAAGATGGGTT
CATTCCTATATAGTGAAAAATGGATTCGAAACAGAGGGTGTACTCGGCACATCGCTGATAGAAATGTACTCCAAATGTGGCAGCATTAAGAGTGCCCTCAGAGCTTTTGA
GTCTATACCTAAAAAGAAATTGGGGCATTGGACGGCTATAATTGTAGGCTTGGGAATGCATGGTTTGGTAGAGCAAACTCTTGAGCTATTTGATGAAATGTGCAGAACTG
GGTTGAAGCCTCATGCCATTACTTTTATTGGAGTGTTAAATGCTTGTAGTCATGCAGGATTTGCGGAAGATGCCCATCAGTACTTTAAAATGATGACAGATGATTATGGA
ATTGAGCCTTCTATTGAACACTATGGTTGCTTGATTGATGTTCTGTGTCGTGCTGGATATCTTGAAGAGGCAAAGGATACCATTGAGACAATGCCTATCGAAGCAAACAA
AGTAATTTGGACGAGTCTACTAAGTGGTTCAAGGAAACATGGAAACTTAAGAATGGGAGAATATGCAGCTCGTCATCTGATTGATTTAGCACCGGATACTACTGGATGTT
ATGTGATTCTTTCGAACATGTACGCTGCAGCTGGCTTGTGGGAAAAAGTACGGGAAGTAAGAGAAATAATGAAGAAAAAAGGAATCAGTAAAGATCCAGGATGCAGTTCC
ATTGAGCATCAAGGTTCAATCCATGAATTCATTGTGGGAGATAGGTCACATCCTCAAACCGAAGAGATATATGTCAAACTGTGTGAGATGAAAGAGAAATTGAATCTAGC
GGGACATGTTCCCGATACGACTCAAGTTCTGTTATGCATTGAAGAGGATAATGAGAAAGAAGCAGAACTTGAAACCCATAGTGAGAGGTTGGCAATAGCTTTTGGTCTTA
TCAATATCAAGCATGGAAGTCCTATCCGTATCATAAAGAATCTTCGTATTTGCAACGATTGTCATGCTGTGACTAAACTTGTTTCTCATATATATAACCGTGAGATCATT
ATCAGAGATGGTAGTCGATTCCATCACTTTAAAAATGGGTCTTGCTCTTGTAAAGATTTTTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTATCTCTTACACTTTCGCATTCCCTCCAACCATTTCTTCCTCGCAGCCTTCATTTTCCTCTTCAAAATTGCGAAACTGAACGAGAAGCAAAGCAATTCCATGCGCT
CTCGCTCAAGACAGGCTCCTTGAATCACCCTTCAATATCTTCTCGTCTTTTGGCCCTCTATGCAGATCCCAGAATCAACAATCTCGAGTATGCGCAGTCCCTTTTTGACA
GGATTCGAGAACCCACTTTGGTATCTTGGAACATGCTCATCAAGTGCTACATCGAGAACCAACGTTCAAATGATGCCATTTCGTTGTTCTGCAAATTGCTCTGTGAGTTC
ATGCCTGATTCTTTTACATTGCCTTGTGTCCTTAAGGGTTGTGCTCGATTGAGTGCACTAGAGGAGGGGAAACAGATTCATGGGTTGGTATTGAAAATTGGGTCTGGTGT
CGATAAGTTTGTTTTGAGTAGTTTGGTTAGTATGTATTCTAAATGTGGTGAGATTGAGTTGTGTAGGAAAGTGTTTGATCGAATGGAAGATAAGGATATAGTATCATGGA
ATTCGTTGATTTATGGATATGCTAGATGTGGTGAAATTGAATTGGCACTTGAGTTGTTTGAAGAAATGCCAGAGAAGGATTCTTTTTCATGGACTATTCTGGTTGATGGG
CTTTCGAAGAGTGGAAAGTTAAAGACTGCTAGAGACGTGTTCGATAGAATGCCTACTCGAAATTCTGTAACTTGGAATGCTATGATTAATGGCTACATGAAAGCTGGGGA
TTTTAACACGGCACGAGAACTATTCAATCAAATGCCAGAGAGAAACCTCGTTACATGGAATTCAATGATCACTGGATATGAACTGAACAGGCAGTTTGCACAAGCCTTAA
AGCTGTTTGAGGTCATGTTGAGAGAAGATATATCACCCAATAATGCCACTATCCTTGGAGCTCTTTCTGCAGCTTCAGGACTGGTTAGTCTTGGTAAGGGAAGATGGGTT
CATTCCTATATAGTGAAAAATGGATTCGAAACAGAGGGTGTACTCGGCACATCGCTGATAGAAATGTACTCCAAATGTGGCAGCATTAAGAGTGCCCTCAGAGCTTTTGA
GTCTATACCTAAAAAGAAATTGGGGCATTGGACGGCTATAATTGTAGGCTTGGGAATGCATGGTTTGGTAGAGCAAACTCTTGAGCTATTTGATGAAATGTGCAGAACTG
GGTTGAAGCCTCATGCCATTACTTTTATTGGAGTGTTAAATGCTTGTAGTCATGCAGGATTTGCGGAAGATGCCCATCAGTACTTTAAAATGATGACAGATGATTATGGA
ATTGAGCCTTCTATTGAACACTATGGTTGCTTGATTGATGTTCTGTGTCGTGCTGGATATCTTGAAGAGGCAAAGGATACCATTGAGACAATGCCTATCGAAGCAAACAA
AGTAATTTGGACGAGTCTACTAAGTGGTTCAAGGAAACATGGAAACTTAAGAATGGGAGAATATGCAGCTCGTCATCTGATTGATTTAGCACCGGATACTACTGGATGTT
ATGTGATTCTTTCGAACATGTACGCTGCAGCTGGCTTGTGGGAAAAAGTACGGGAAGTAAGAGAAATAATGAAGAAAAAAGGAATCAGTAAAGATCCAGGATGCAGTTCC
ATTGAGCATCAAGGTTCAATCCATGAATTCATTGTGGGAGATAGGTCACATCCTCAAACCGAAGAGATATATGTCAAACTGTGTGAGATGAAAGAGAAATTGAATCTAGC
GGGACATGTTCCCGATACGACTCAAGTTCTGTTATGCATTGAAGAGGATAATGAGAAAGAAGCAGAACTTGAAACCCATAGTGAGAGGTTGGCAATAGCTTTTGGTCTTA
TCAATATCAAGCATGGAAGTCCTATCCGTATCATAAAGAATCTTCGTATTTGCAACGATTGTCATGCTGTGACTAAACTTGTTTCTCATATATATAACCGTGAGATCATT
ATCAGAGATGGTAGTCGATTCCATCACTTTAAAAATGGGTCTTGCTCTTGTAAAGATTTTTGGTAA
Protein sequenceShow/hide protein sequence
MLSLTLSHSLQPFLPRSLHFPLQNCETEREAKQFHALSLKTGSLNHPSISSRLLALYADPRINNLEYAQSLFDRIREPTLVSWNMLIKCYIENQRSNDAISLFCKLLCEF
MPDSFTLPCVLKGCARLSALEEGKQIHGLVLKIGSGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIYGYARCGEIELALELFEEMPEKDSFSWTILVDG
LSKSGKLKTARDVFDRMPTRNSVTWNAMINGYMKAGDFNTARELFNQMPERNLVTWNSMITGYELNRQFAQALKLFEVMLREDISPNNATILGALSAASGLVSLGKGRWV
HSYIVKNGFETEGVLGTSLIEMYSKCGSIKSALRAFESIPKKKLGHWTAIIVGLGMHGLVEQTLELFDEMCRTGLKPHAITFIGVLNACSHAGFAEDAHQYFKMMTDDYG
IEPSIEHYGCLIDVLCRAGYLEEAKDTIETMPIEANKVIWTSLLSGSRKHGNLRMGEYAARHLIDLAPDTTGCYVILSNMYAAAGLWEKVREVREIMKKKGISKDPGCSS
IEHQGSIHEFIVGDRSHPQTEEIYVKLCEMKEKLNLAGHVPDTTQVLLCIEEDNEKEAELETHSERLAIAFGLINIKHGSPIRIIKNLRICNDCHAVTKLVSHIYNREII
IRDGSRFHHFKNGSCSCKDFW