; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g03350 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g03350
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr4:2088640..2090685
RNA-Seq ExpressionMoc04g03350
SyntenyMoc04g03350
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572234.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0087.67Show/hide
Query:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
        M SL LSHSL PF P NLHFPLQNCETERE KQ HALSLKTGS NHPSIS RLLALY +PRINNLEY +SLFDWIR+PTLVSWN+L+KCY+ENQRSNDAI
Subjt:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI

Query:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
        +LFC+LL EF+PDSFTLPCVLKGCARLSAL EGKQIHGLILKIG GVDKFVLSSLV+MYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCG+IELAL
Subjt:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL

Query:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
        E+FDEMPE+D+FSWTILVDGLSKSGKL+ ARDVFDRMPTRNSVSWNAMINGYMKAG FNTARELFD+MPERN V+WNSMITGYELN+QF+QALKLFEVML
Subjt:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML

Query:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
         E+ISPNHAT+LGA SAASGL S G GRWVHS+IVKN F+TDGVLGTSLIEMYSKCGSI  ALRVF+SIPKKKLGHWTAIIVGLGMHGLV QTLELFDEM
Subjt:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM

Query:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
        CR GLKPHAITFIG+LNACSHAGFA +A  YFK M DD+GIEPSIEHYGCLID LCRAG LEEA++TIERMPIK N VIWMSLLSGSRKHGN RMGEYAA
Subjt:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
        HHL+DLAPDTTGCY+ILSNMYA  GLWEKVRQVREMMKKKGIRKDPGCSSIEHQGS+HEFIVGDRSHPQTEEIY+KL EMKEKLNVAGH+PDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL

Query:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        E+DNEKE+ELETHSERLAIAFGL+NIKHG+P+RIIKNLRICNDCH V+K +S IYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

XP_004144616.1 pentatricopeptide repeat-containing protein At5g48910 [Cucumis sativus]0.0e+0087.52Show/hide
Query:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
        MLS  LSHSL PFLP NLHFPLQNC TERE  QLHALS+KT S NHPS+SSRLLALY DPRINNL+YA SLFDWI+EPTLVSWNLL+KCY+ENQRSNDAI
Subjt:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI

Query:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
        +LFC+LL +F+PDSFTLPCVLKGCARL AL EGKQIHGL+LKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCG+IELAL
Subjt:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL

Query:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
        E+F+EMPE+DSFSWTIL+DGLSKSGKLE ARDVFDRMP RNSVSWNAMINGYMKAGD NTA+ELFDQMPER+LVTWNSMITGYE N+QF++ALKLFEVML
Subjt:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML

Query:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
        RE+ISPN+ TILGA+SAASG+VS G GRWVHS+IVK+GF+TDGVLGT LIEMYSKCGS+ SALRVF+SIPKKKLGHWT++IVGLGMHGLV QTLELFDEM
Subjt:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM

Query:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
        CR GLKPHAITFIG+LNACSHAGFA+DA+ YFKMM  DYGI+PSIEHYGCLIDVLCRAG LEEAK+TIERMPIK NKVIW SLLSGSRKHGNIRMGEYAA
Subjt:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
         HLIDLAPDTTGCY+ILSNMYA AGLWEKVRQVREMMKKKG++KDPGCSSIEHQGS+HEFIVGD+SHPQTEEIYIKL EMK+KLNVAGH+PDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL

Query:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        E+DNEKE+ELETHSERLAIAFGL+NIKHG+P+RIIKNLRICNDCH V+KLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

XP_022135993.1 pentatricopeptide repeat-containing protein At5g48910-like [Momordica charantia]0.0e+00100Show/hide
Query:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
        MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
Subjt:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI

Query:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
        SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
Subjt:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL

Query:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
        EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
Subjt:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML

Query:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
        REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
Subjt:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM

Query:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
        CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
Subjt:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
        HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL

Query:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

XP_022953072.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita moschata]0.0e+0087.96Show/hide
Query:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
        M SL LSHSL PF P NLHFPLQNCETERE KQ HALSLKTGS NHPSIS RLLALY +PRINNLEYA+SLFDWIR+PTLVSWN+L+KCY+ENQRSNDAI
Subjt:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI

Query:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
        +LFC+LL EF+PDSFTLPCVLKGCARLSAL EGKQIHGLILKIG GVDKFVLSSLV+MYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCG+IELAL
Subjt:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL

Query:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
        E+FDEMPE+D+FSWTILVDGLSKSGKL+ ARDVFDRMPTRNSVSWNAMINGYMKAG FNTARELFD+MPERN V+WNSMITGYELN+QF+QALKLFEVML
Subjt:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML

Query:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
         E+ISPNHAT+LGA SAASGL S G GRWVHS+IVKN F+TDGVLGTSLIEMYSKCGSI  ALRVF+SIPKKKLGHWTAIIVGLGMHGLV QTLELFDEM
Subjt:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM

Query:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
        CR GLKPHAITFIG+LNACSHAGFA +A  YFK M DD+GIEPSIEHYGCLID LCRAG LEEAK+TIERMPIK N VIWMSLLSGSRKHG+ RMGEYAA
Subjt:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
        HHL+DLAPDTTGCY+ILSNMYA  GLWEKVRQVREMMKKKGIRKDPGCSSIEHQGS+HEFIVGDRSHPQTEEIY+KL EMKEKLNVAGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL

Query:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        E+DNEKE+ELETHSERLAIAFGL+NIKHG+P+RIIKNLRICNDCH V+K +S IYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

XP_038887831.1 pentatricopeptide repeat-containing protein At5g48910-like [Benincasa hispida]0.0e+0087.52Show/hide
Query:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
        ML+L LSHSL PF+PRNLHFPLQNCETERE KQLHALSLK GS NHPS+SSRLLALY DPRINNLEYA+SLFDWI++PTLVSWNLL+KCY+E+QRSNDAI
Subjt:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI

Query:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
        +LFC+ L EF+PDSFTLPCVLKGC+RL AL EGKQIHGL+LKIGFGVDKFVLSSLVSMY+KCGEIELCRKVFDRMED+D+VSWNSLIDGYARCG+IELAL
Subjt:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL

Query:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
        ++ +EMPE+DS SWTILVDGLSKSGKLE ARDVFD+MPTRNSVSWNAMINGYMKAG+FNTARELFDQMPERNLVTWNSMI+GYELN+QF+QALKL E ML
Subjt:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML

Query:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
        RE+ISPN+ TILGALSAASGLVS GKGRWVHS+IVKNGF T+GVLGTSLIEMYSKCGS+ SAL VF+SIP+KKLGHWTAIIVGLGMHGLV QTLELFDEM
Subjt:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM

Query:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
        CR GL+PHAITFIG+LNACSHAGFA+DA+ YFKMM DDYGI+PSIEHYGCLIDVLCRAG LEEAK+TIERMP+K NKVIWMSLLSGSRKHGNIRMGEYAA
Subjt:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
        HHLIDLAPDTTGCY+ILSNMYA AGLWEKV QVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGD+SHPQT+EIYIKL EMKEKL+ AGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL

Query:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        E+DN+KE+ELETHSERLAIAFGL+NI HG+P+RIIKNLRICNDCH V+KL+SHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

TrEMBL top hitse value%identityAlignment
A0A5A7V0C2 Pentatricopeptide repeat-containing protein0.0e+0086.49Show/hide
Query:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
        MLSL LSHSL PFLP NLHFPLQNC TERE  QLHALS+KT S NHPS+SS LLALY  P INNL+YA+SLFDWI++PTLVSWNLL+KCY+ENQRSNDAI
Subjt:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI

Query:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
        +LFC+LL +F+PDSFTLPCVLKGCARL AL EGKQIHGL+LKIGFGVDKFVLSSLVSMYSKCGEIE+CRKVFDRMEDKDVVSWNSLIDGYARCG+IELAL
Subjt:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL

Query:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
        EVF+EMPE+DSFSWTIL+DGLSKSGKLE ARDVFDRMP RNSVSWNAMINGYMKAGD NTA+ELFDQMPER+LVTWNSMITGYE N+QF++ALKLFEVML
Subjt:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML

Query:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
        RE+ISPN+ TILGA+SAASGLVS G GRWVHS+IVKNGF+TDGVLGT LIEMYSKCGS+ SALRVF+ I KKKLGHWT+IIVGLGMHGLV QTLELFDEM
Subjt:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM

Query:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
        CR GL+PHAITFIG+LNACSHAGFA+DA+ YFKMM  DYGI+P+IEHYGCLIDVLCRAG LEEAK+TI+RMPIK NKVIW SLLSGSRKHGNIRMGEYAA
Subjt:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
         HLIDLAPDTTGCY+ILSNMYA AGLWEKVRQVREMMK+K IRKDPGCSSIEHQGS+HEFIVGD+SHPQTEEIYIKL EMK+KLNVAGH+PDT+QVLLCL
Subjt:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL

Query:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        E+DNEKE+ELETHSERLAIAFGL++IKHG+P+RIIKNLRICNDCH V+KLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

A0A5D3D6Y7 Pentatricopeptide repeat-containing protein0.0e+0086.78Show/hide
Query:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
        MLSL LSHSL PFLP NLHFPLQNC TERE  QLHALS+KT S NHPS+SS LLALY  P INNL+YA+SLFDWI++PTLVSWNLL+KCY+ENQRSNDAI
Subjt:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI

Query:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
        +LFC+LL +F+PDSFTLPCVLKGCARL AL EGKQIHGL+LKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCG+IELAL
Subjt:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL

Query:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
        EVF+EMPE+DSFSWTIL+DGLSKSGKLE AR VFDRMP RNSVSWNAMINGYMKAGD NTA+ELFDQMPER+LVTWNSMITGYE N+QF++ALKLFEVML
Subjt:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML

Query:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
        RE+ISPN+ TILGA+SAASGLVS G GRWVHS+IVKNGF+TDGVLGT LIEMYSKCGS+ SALRVF+ I KKKLGHWT+IIVGLGMHGLV QTLELFDEM
Subjt:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM

Query:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
        CR GL+PHAITFIG+LNACSHAGFA+DA+ YFKMM  DYGI+P+IEHYGCLIDVLCRAG LEEAK+TIERMPIK NKVIW SLLSGSRKHGNIRMGEYAA
Subjt:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
         HLIDLAPDTTGCY+ILSNMYA  GLWEKVRQVREMMK+KGIRKDPGCSSIEHQGS+HEFIVGD+SHPQTEEIYIKL EMK+KLNVAGH+PDT+QVLLCL
Subjt:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL

Query:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        E+DNEKE+ELETHSERLAIAFGL+NIKHG+P+RIIKNLRICNDCH V+KLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

A0A6J1C6D9 pentatricopeptide repeat-containing protein At5g48910-like0.0e+00100Show/hide
Query:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
        MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
Subjt:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI

Query:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
        SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
Subjt:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL

Query:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
        EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
Subjt:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML

Query:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
        REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
Subjt:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM

Query:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
        CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
Subjt:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
        HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL

Query:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

A0A6J1GM70 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0087.96Show/hide
Query:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
        M SL LSHSL PF P NLHFPLQNCETERE KQ HALSLKTGS NHPSIS RLLALY +PRINNLEYA+SLFDWIR+PTLVSWN+L+KCY+ENQRSNDAI
Subjt:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI

Query:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
        +LFC+LL EF+PDSFTLPCVLKGCARLSAL EGKQIHGLILKIG GVDKFVLSSLV+MYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCG+IELAL
Subjt:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL

Query:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
        E+FDEMPE+D+FSWTILVDGLSKSGKL+ ARDVFDRMPTRNSVSWNAMINGYMKAG FNTARELFD+MPERN V+WNSMITGYELN+QF+QALKLFEVML
Subjt:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML

Query:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
         E+ISPNHAT+LGA SAASGL S G GRWVHS+IVKN F+TDGVLGTSLIEMYSKCGSI  ALRVF+SIPKKKLGHWTAIIVGLGMHGLV QTLELFDEM
Subjt:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM

Query:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
        CR GLKPHAITFIG+LNACSHAGFA +A  YFK M DD+GIEPSIEHYGCLID LCRAG LEEAK+TIERMPIK N VIWMSLLSGSRKHG+ RMGEYAA
Subjt:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
        HHL+DLAPDTTGCY+ILSNMYA  GLWEKVRQVREMMKKKGIRKDPGCSSIEHQGS+HEFIVGDRSHPQTEEIY+KL EMKEKLNVAGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL

Query:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        E+DNEKE+ELETHSERLAIAFGL+NIKHG+P+RIIKNLRICNDCH V+K +S IYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

A0A6J1HYS6 pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like0.0e+0087.67Show/hide
Query:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI
        M SL LSHSLLPFLP NLHFPLQNCETERE KQ HALS+KTGS N PSIS RLLALY +PRINNLEYA+SLFDWIR+PTLVSWN+L+KCY+ENQRSNDAI
Subjt:  MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAI

Query:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL
        +LFC+LL EF+PDSFTLPCVLKGCARLSAL EGKQIHGLILKIG GVDKFVLSSLV+MYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCG+IELAL
Subjt:  SLFCELLSEFIPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELAL

Query:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML
        E+FDEMPE+D+FSWTILVDGLSKSGKL+ ARDVFDRMPTRNS+SWNAMINGYMKAG FNTARELFD+MPERN V+WNSMITGYELN+QF+QALKLFEVML
Subjt:  EVFDEMPERDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVML

Query:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM
         E+ISPNHAT+LGALSAASGL S G GRWVHS+IVKN F+TDGVLGTSLIEMYSKCGSI  ALRVF+SIPK+KLGHWTAIIVGLGMHGLV QTLELFDEM
Subjt:  REEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEM

Query:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA
        CR GLKPHAITFIG+LNACSHAGFA++A  YFK M DD+GIEPSIEHYGCLID LCRAG LEEAK+TIERMPIK N VIWMSLLSGSRKHGN RMGEYAA
Subjt:  CRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL
        HHL+DLAPDTTGCY+ILSNMYA  GLWE  RQVREMMKKKGIRKDPGCSSIEHQGS+HEFIVGDRSHPQTEEIY+KL EMKEKLNVAGHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCL

Query:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        E+DNEKE+ELETHSERLAIAFGL+NIKHG+P+RIIKNLRICNDCH V+K +S IYNREIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic6.4e-16038.22Show/hide
Query:  RNLHFPL-QNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELLSE--FIP
        R+ H  L + C + R++KQ H   ++TG+F+ P  +S+L A+       +LEYAR +FD I +P   +WN L++ Y        +I  F +++SE    P
Subjt:  RNLHFPL-QNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELLSE--FIP

Query:  DSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERD--
        + +T P ++K  A +S+L  G+ +HG+ +K   G D FV +SL+  Y  CG+++   KVF  +++KDVVSWNS+I+G+ + G  + ALE+F +M   D  
Subjt:  DSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERD--

Query:  ----------------------------------SFSWTI---LVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNL
                                          + + T+   ++D  +K G +E A+ +FD M  +++V+W  M++GY  + D+  ARE+ + MP++++
Subjt:  ----------------------------------SFSWTI---LVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNL

Query:  VTWNSMITGYELNRQFSQALKLF-EVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKK
        V WN++I+ YE N + ++AL +F E+ L++ +  N  T++  LSA + + +   GRW+HS+I K+G   +  + ++LI MYSKCG +  +  VF S+ K+
Subjt:  VTWNSMITGYELNRQFSQALKLF-EVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKK

Query:  KLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMP
         +  W+A+I GL MHG   + +++F +M    +KP+ +TF  +  ACSH G  D+A   F  M  +YGI P  +HY C++DVL R+G LE+A   IE MP
Subjt:  KLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMP

Query:  IKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEE
        I P+  +W +LL   + H N+ + E A   L++L P   G +++LSN+YA  G WE V ++R+ M+  G++K+PGCSSIE  G +HEF+ GD +HP +E+
Subjt:  IKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEE

Query:  IYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKS
        +Y KL E+ EKL   G+ P+ +QVL  +E++  KE  L  HSE+LAI +GLI+ +    +R+IKNLR+C DCH+V+KL+S +Y+REII+RD  RFHHF++
Subjt:  IYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKS

Query:  GSCSCKDFW
        G CSC DFW
Subjt:  GSCSCKDFW

Q9FI80 Pentatricopeptide repeat-containing protein At5g489106.0e-15841.9Show/hide
Query:  PRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLL--ALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSND--AISLFCELLS-E
        P +L   + NC T R++ Q+HA+ +K+G       ++ +L     +D    +L+YA  +F+ + +    SWN +++ + E+       AI+LF E++S E
Subjt:  PRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLL--ALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSND--AISLFCELLS-E

Query:  FI-PDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPE
        F+ P+ FT P VLK CA+   + EGKQIHGL LK GFG D+FV+S+LV MY  CG           M+D  V+ + ++I                    E
Subjt:  FI-PDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPE

Query:  RDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVMLREEISPNH
        +D     ++ D   + G++               V WN MI+GYM+ GD   AR LFD+M +R++V+WN+MI+GY LN  F  A+++F  M + +I PN+
Subjt:  RDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVMLREEISPNH

Query:  ATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPH
         T++  L A S L S   G W+H +   +G   D VLG++LI+MYSKCG I  A+ VF+ +P++ +  W+A+I G  +HG  G  ++ F +M + G++P 
Subjt:  ATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPH

Query:  AITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAP
         + +I LL ACSH G  ++   YF  M+   G+EP IEHYGC++D+L R+G L+EA+  I  MPIKP+ VIW +LL   R  GN+ MG+  A+ L+D+ P
Subjt:  AITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAP

Query:  DTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKES
          +G Y+ LSNMYA  G W +V ++R  MK+K IRKDPGCS I+  G +HEF+V D SHP+ +EI   L E+ +KL +AG+ P TTQVLL LE++ +KE+
Subjt:  DTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKES

Query:  ELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
         L  HSE++A AFGLI+   G P+RI+KNLRIC DCH+  KL+S +Y R+I +RD  RFHHF+ GSCSC D+W
Subjt:  ELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665202.7e-14238.37Show/hide
Query:  LQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINN-LEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELLSEFIP-DSFTLPC
        LQ C  + E+KQ+HA  LKTG        ++ L+       ++ L YA+ +FD    P    WNL+++ +  +     ++ L+  +L    P +++T P 
Subjt:  LQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINN-LEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELLSEFIP-DSFTLPC

Query:  VLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERDSFSWTILVD
        +LK C+ LSA +E  QIH  I K+G+  D + ++SL++ Y+  G  +L   +FDR+ + D VSWNS+I GY + G++++AL +F +M E           
Subjt:  VLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERDSFSWTILVD

Query:  GLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVMLREEISPNHATILGALSAAS
                            +N++SW  MI+GY++A D N                               +AL+LF  M   ++ P++ ++  ALSA +
Subjt:  GLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVMLREEISPNHATILGALSAAS

Query:  GLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNAC
         L +  +G+W+HS++ K     D VLG  LI+MY+KCG +  AL VFK+I KK +  WTA+I G   HG   + +  F EM ++G+KP+ ITF  +L AC
Subjt:  GLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNAC

Query:  SHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSN
        S+ G  ++    F  M  DY ++P+IEHYGC++D+L RAG L+EAK  I+ MP+KPN VIW +LL   R H NI +GE     LI + P   G Y+  +N
Subjt:  SHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSN

Query:  MYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAI
        ++A    W+K  + R +MK++G+ K PGCS+I  +G+ HEF+ GDRSHP+ E+I  K   M+ KL   G+VP+  ++LL L DD+E+E+ +  HSE+LAI
Subjt:  MYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAI

Query:  AFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
         +GLI  K G  +RI+KNLR+C DCH V+KL+S IY R+I++RD +RFHHF+ G CSC D+W
Subjt:  AFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic8.1e-16341.96Show/hide
Query:  LQNCETEREVKQLHALSLKTGSFNHPSISSRLLAL-YTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLF-CELLSEFIPDSFTLPC
        L NC+T + ++ +HA  +K G  N     S+L+      P    L YA S+F  I+EP L+ WN + + +  +     A+ L+ C +    +P+S+T P 
Subjt:  LQNCETEREVKQLHALSLKTGSFNHPSISSRLLAL-YTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLF-CELLSEFIPDSFTLPC

Query:  VLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERDSFSWTILVD
        VLK CA+  A  EG+QIHG +LK+G  +D +V +SL+SMY + G +E   KVFD+   +DVVS+ +LI GYA  G IE A ++FDE+P +D  SW  ++ 
Subjt:  VLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERDSFSWTILVD

Query:  GLSKSGKLETARDVF-DRMPT-------------------------RNSVSW-------------NAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI
        G +++G  + A ++F D M T                         R    W             NA+I+ Y K G+  TA  LF+++P +++++WN++I
Subjt:  GLSKSGKLETARDVF-DRMPT-------------------------RNSVSW-------------NAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI

Query:  TGYELNRQFSQALKLFEVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVK--NGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWT
         GY     + +AL LF+ MLR   +PN  T+L  L A + L +   GRW+H +I K   G      L TSLI+MY+KCG I +A +VF SI  K L  W 
Subjt:  TGYELNRQFSQALKLFEVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVK--NGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWT

Query:  AIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKV
        A+I G  MHG    + +LF  M +IG++P  ITF+GLL+ACSH+G  D   H F+ M  DY + P +EHYGC+ID+L  +G  +EA+  I  M ++P+ V
Subjt:  AIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKV

Query:  IWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLS
        IW SLL   + HGN+ +GE  A +LI + P+  G Y++LSN+YA AG W +V + R ++  KG++K PGCSSIE    VHEFI+GD+ HP+  EIY  L 
Subjt:  IWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLS

Query:  EMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCK
        EM+  L  AG VPDT++VL  +E++  KE  L  HSE+LAIAFGLI+ K G  + I+KNLR+C +CH  +KL+S IY REII RD +RFHHF+ G CSC 
Subjt:  EMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCK

Query:  DFW
        D+W
Subjt:  DFW

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226903.1e-14637.77Show/hide
Query:  QLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELL--SEFIPDSFTLPCVLKGCARLSAL
        Q+H L +K G      + + L+  Y +     L+ AR +FD + E  +VSW  ++  Y     + DA+ LF  ++   E  P+S T+ CV+  CA+L  L
Subjt:  QLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELL--SEFIPDSFTLPCVLKGCARLSAL

Query:  DEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEM------PERDSF------------
        + G++++  I   G  V+  ++S+LV MY KC  I++ +++FD     ++   N++   Y R G    AL VF+ M      P+R S             
Subjt:  DEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEM------PERDSF------------

Query:  -----------------SW----TILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQ
                         SW      L+D   K  + +TA  +FDRM  +  V+WN+++ GY++ G+ + A E F+ MPE+N+V+WN++I+G      F +
Subjt:  -----------------SW----TILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQ

Query:  ALKLF-EVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLV
        A+++F  +  +E ++ +  T++   SA   L +    +W++ +I KNG + D  LGT+L++M+S+CG   SA+ +F S+  + +  WTA I  + M G  
Subjt:  ALKLF-EVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLV

Query:  GQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKH
         + +ELFD+M   GLKP  + F+G L ACSH G        F  M+  +G+ P   HYGC++D+L RAG LEEA   IE MP++PN VIW SLL+  R  
Subjt:  GQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKH

Query:  GNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHV
        GN+ M  YAA  +  LAP+ TG Y++LSN+YA AG W  + +VR  MK+KG+RK PG SSI+ +G  HEF  GD SHP+   I   L E+ ++ +  GHV
Subjt:  GNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHV

Query:  PDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        PD + VL+ + D+ EK   L  HSE+LA+A+GLI+   G  +RI+KNLR+C+DCH+ +K  S +YNREII+RD +RFH+ + G CSC DFW
Subjt:  PDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.7e-16441.96Show/hide
Query:  LQNCETEREVKQLHALSLKTGSFNHPSISSRLLAL-YTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLF-CELLSEFIPDSFTLPC
        L NC+T + ++ +HA  +K G  N     S+L+      P    L YA S+F  I+EP L+ WN + + +  +     A+ L+ C +    +P+S+T P 
Subjt:  LQNCETEREVKQLHALSLKTGSFNHPSISSRLLAL-YTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLF-CELLSEFIPDSFTLPC

Query:  VLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERDSFSWTILVD
        VLK CA+  A  EG+QIHG +LK+G  +D +V +SL+SMY + G +E   KVFD+   +DVVS+ +LI GYA  G IE A ++FDE+P +D  SW  ++ 
Subjt:  VLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERDSFSWTILVD

Query:  GLSKSGKLETARDVF-DRMPT-------------------------RNSVSW-------------NAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI
        G +++G  + A ++F D M T                         R    W             NA+I+ Y K G+  TA  LF+++P +++++WN++I
Subjt:  GLSKSGKLETARDVF-DRMPT-------------------------RNSVSW-------------NAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI

Query:  TGYELNRQFSQALKLFEVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVK--NGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWT
         GY     + +AL LF+ MLR   +PN  T+L  L A + L +   GRW+H +I K   G      L TSLI+MY+KCG I +A +VF SI  K L  W 
Subjt:  TGYELNRQFSQALKLFEVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVK--NGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWT

Query:  AIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKV
        A+I G  MHG    + +LF  M +IG++P  ITF+GLL+ACSH+G  D   H F+ M  DY + P +EHYGC+ID+L  +G  +EA+  I  M ++P+ V
Subjt:  AIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKV

Query:  IWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLS
        IW SLL   + HGN+ +GE  A +LI + P+  G Y++LSN+YA AG W +V + R ++  KG++K PGCSSIE    VHEFI+GD+ HP+  EIY  L 
Subjt:  IWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLS

Query:  EMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCK
        EM+  L  AG VPDT++VL  +E++  KE  L  HSE+LAIAFGLI+ K G  + I+KNLR+C +CH  +KL+S IY REII RD +RFHHF+ G CSC 
Subjt:  EMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCK

Query:  DFW
        D+W
Subjt:  DFW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.5e-16138.22Show/hide
Query:  RNLHFPL-QNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELLSE--FIP
        R+ H  L + C + R++KQ H   ++TG+F+ P  +S+L A+       +LEYAR +FD I +P   +WN L++ Y        +I  F +++SE    P
Subjt:  RNLHFPL-QNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELLSE--FIP

Query:  DSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERD--
        + +T P ++K  A +S+L  G+ +HG+ +K   G D FV +SL+  Y  CG+++   KVF  +++KDVVSWNS+I+G+ + G  + ALE+F +M   D  
Subjt:  DSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERD--

Query:  ----------------------------------SFSWTI---LVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNL
                                          + + T+   ++D  +K G +E A+ +FD M  +++V+W  M++GY  + D+  ARE+ + MP++++
Subjt:  ----------------------------------SFSWTI---LVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNL

Query:  VTWNSMITGYELNRQFSQALKLF-EVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKK
        V WN++I+ YE N + ++AL +F E+ L++ +  N  T++  LSA + + +   GRW+HS+I K+G   +  + ++LI MYSKCG +  +  VF S+ K+
Subjt:  VTWNSMITGYELNRQFSQALKLF-EVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKK

Query:  KLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMP
         +  W+A+I GL MHG   + +++F +M    +KP+ +TF  +  ACSH G  D+A   F  M  +YGI P  +HY C++DVL R+G LE+A   IE MP
Subjt:  KLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMP

Query:  IKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEE
        I P+  +W +LL   + H N+ + E A   L++L P   G +++LSN+YA  G WE V ++R+ M+  G++K+PGCSSIE  G +HEF+ GD +HP +E+
Subjt:  IKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEE

Query:  IYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKS
        +Y KL E+ EKL   G+ P+ +QVL  +E++  KE  L  HSE+LAI +GLI+ +    +R+IKNLR+C DCH+V+KL+S +Y+REII+RD  RFHHF++
Subjt:  IYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKS

Query:  GSCSCKDFW
        G CSC DFW
Subjt:  GSCSCKDFW

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)4.1e-14637.68Show/hide
Query:  QLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELL--SEFIPDSFTLPCVLKGCARLSAL
        Q+H L +K G      + + L+  Y +     L+ AR +FD + E  +VSW  ++  Y     + DA+ LF  ++   E  P+S T+ CV+  CA+L  L
Subjt:  QLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELL--SEFIPDSFTLPCVLKGCARLSAL

Query:  DEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEM------PERDSF------------
        + G++++  I   G  V+  ++S+LV MY KC  I++ +++FD     ++   N++   Y R G    AL VF+ M      P+R S             
Subjt:  DEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEM------PERDSF------------

Query:  -----------------SW----TILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQ
                         SW      L+D   K  + +TA  +FDRM  +  V+WN+++ GY++ G+ + A E F+ MPE+N+V+WN++I+G      F +
Subjt:  -----------------SW----TILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQ

Query:  ALKLF-EVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLV
        A+++F  +  +E ++ +  T++   SA   L +    +W++ +I KNG + D  LGT+L++M+S+CG   SA+ +F S+  + +  WTA I  + M G  
Subjt:  ALKLF-EVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLV

Query:  GQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKH
         + +ELFD+M   GLKP  + F+G L ACSH G        F  M+  +G+ P   HYGC++D+L RAG LEEA   IE MP++PN VIW SLL+  R  
Subjt:  GQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKH

Query:  GNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHV
        GN+ M  YAA  +  LAP+ TG Y++LSN+YA AG W  + +VR  MK+KG+RK PG SSI+ +G  HEF  GD SHP+   I   L E+ ++ +  GHV
Subjt:  GNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHV

Query:  PDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDF
        PD + VL+ + D+ EK   L  HSE+LA+A+GLI+   G  +RI+KNLR+C+DCH+ +K  S +YNREII+RD +RFH+ + G CSC DF
Subjt:  PDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDF

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification2.2e-14737.77Show/hide
Query:  QLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELL--SEFIPDSFTLPCVLKGCARLSAL
        Q+H L +K G      + + L+  Y +     L+ AR +FD + E  +VSW  ++  Y     + DA+ LF  ++   E  P+S T+ CV+  CA+L  L
Subjt:  QLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELL--SEFIPDSFTLPCVLKGCARLSAL

Query:  DEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEM------PERDSF------------
        + G++++  I   G  V+  ++S+LV MY KC  I++ +++FD     ++   N++   Y R G    AL VF+ M      P+R S             
Subjt:  DEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEM------PERDSF------------

Query:  -----------------SW----TILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQ
                         SW      L+D   K  + +TA  +FDRM  +  V+WN+++ GY++ G+ + A E F+ MPE+N+V+WN++I+G      F +
Subjt:  -----------------SW----TILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQ

Query:  ALKLF-EVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLV
        A+++F  +  +E ++ +  T++   SA   L +    +W++ +I KNG + D  LGT+L++M+S+CG   SA+ +F S+  + +  WTA I  + M G  
Subjt:  ALKLF-EVMLREEISPNHATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLV

Query:  GQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKH
         + +ELFD+M   GLKP  + F+G L ACSH G        F  M+  +G+ P   HYGC++D+L RAG LEEA   IE MP++PN VIW SLL+  R  
Subjt:  GQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKH

Query:  GNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHV
        GN+ M  YAA  +  LAP+ TG Y++LSN+YA AG W  + +VR  MK+KG+RK PG SSI+ +G  HEF  GD SHP+   I   L E+ ++ +  GHV
Subjt:  GNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHV

Query:  PDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
        PD + VL+ + D+ EK   L  HSE+LA+A+GLI+   G  +RI+KNLR+C+DCH+ +K  S +YNREII+RD +RFH+ + G CSC DFW
Subjt:  PDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein4.3e-15941.9Show/hide
Query:  PRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLL--ALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSND--AISLFCELLS-E
        P +L   + NC T R++ Q+HA+ +K+G       ++ +L     +D    +L+YA  +F+ + +    SWN +++ + E+       AI+LF E++S E
Subjt:  PRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLL--ALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSND--AISLFCELLS-E

Query:  FI-PDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPE
        F+ P+ FT P VLK CA+   + EGKQIHGL LK GFG D+FV+S+LV MY  CG           M+D  V+ + ++I                    E
Subjt:  FI-PDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPE

Query:  RDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVMLREEISPNH
        +D     ++ D   + G++               V WN MI+GYM+ GD   AR LFD+M +R++V+WN+MI+GY LN  F  A+++F  M + +I PN+
Subjt:  RDSFSWTILVDGLSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVMLREEISPNH

Query:  ATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPH
         T++  L A S L S   G W+H +   +G   D VLG++LI+MYSKCG I  A+ VF+ +P++ +  W+A+I G  +HG  G  ++ F +M + G++P 
Subjt:  ATILGALSAASGLVSFGKGRWVHSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPH

Query:  AITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAP
         + +I LL ACSH G  ++   YF  M+   G+EP IEHYGC++D+L R+G L+EA+  I  MPIKP+ VIW +LL   R  GN+ MG+  A+ L+D+ P
Subjt:  AITFIGLLNACSHAGFADDAYHYFKMMMDDYGIEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAP

Query:  DTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKES
          +G Y+ LSNMYA  G W +V ++R  MK+K IRKDPGCS I+  G +HEF+V D SHP+ +EI   L E+ +KL +AG+ P TTQVLL LE++ +KE+
Subjt:  DTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSSIEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKES

Query:  ELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW
         L  HSE++A AFGLI+   G P+RI+KNLRIC DCH+  KL+S +Y R+I +RD  RFHHF+ GSCSC D+W
Subjt:  ELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREIIIRDGSRFHHFKSGSCSCKDFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTCTCTCGCACTTTCGCATTCCCTCCTCCCATTTCTCCCTCGGAACCTCCATTTTCCTCTCCAGAACTGCGAAACTGAACGAGAAGTGAAGCAACTCCACGCTCT
GTCGCTCAAAACAGGTTCCTTCAATCACCCTTCAATATCTTCTCGTCTTTTGGCGCTCTATACAGATCCTAGAATCAACAATCTCGAATATGCTCGGTCCCTTTTTGACT
GGATTCGAGAACCCACTTTGGTTTCTTGGAATCTGCTCGTCAAGTGCTACGTCGAGAACCAACGTTCGAATGATGCCATTTCGTTGTTCTGCGAATTGCTCTCTGAGTTC
ATTCCTGATTCTTTTACACTGCCTTGCGTTCTGAAGGGTTGTGCTCGATTAAGTGCTCTAGACGAGGGGAAACAGATTCATGGGTTGATATTGAAAATTGGGTTTGGGGT
GGATAAGTTTGTTTTGAGTAGTTTGGTTAGTATGTATTCTAAGTGTGGTGAGATTGAGCTGTGTAGGAAAGTGTTTGACCGAATGGAAGATAAGGATGTAGTCTCATGGA
ATTCTTTGATTGATGGATATGCAAGATGTGGTCAAATTGAACTGGCACTGGAGGTGTTTGATGAAATGCCGGAGAGGGATTCTTTTTCATGGACTATTCTGGTTGATGGG
CTTTCGAAAAGCGGGAAGTTAGAGACTGCTAGAGACGTGTTCGATCGAATGCCTACTCGAAATTCTGTATCCTGGAATGCTATGATTAATGGCTACATGAAAGCTGGGGA
TTTTAACACGGCACGAGAATTATTCGATCAGATGCCAGAAAGAAACCTCGTTACATGGAATTCGATGATCACTGGATACGAATTGAACAGGCAGTTTTCACAAGCCTTGA
AGCTGTTTGAGGTTATGTTGAGAGAAGAAATATCACCCAATCATGCCACTATCCTTGGAGCTCTTTCTGCAGCTTCAGGATTGGTTAGTTTTGGTAAGGGAAGATGGGTT
CATTCCTTCATAGTGAAAAATGGATTTGAAACAGATGGCGTGCTTGGCACATCGCTGATAGAAATGTACTCCAAGTGTGGCAGCATTGCGAGCGCCCTCAGAGTTTTCAA
ATCTATACCCAAAAAGAAATTGGGGCATTGGACTGCTATAATTGTAGGCTTGGGAATGCATGGTTTGGTAGGGCAAACTCTTGAGCTATTTGATGAAATGTGCAGAATTG
GGCTGAAGCCTCATGCTATTACTTTTATTGGACTGTTAAATGCTTGTAGTCATGCAGGATTTGCAGACGATGCCTATCATTACTTTAAAATGATGATGGACGATTATGGA
ATTGAACCCTCTATCGAACACTACGGTTGCTTGATCGATGTTCTGTGTCGTGCAGGATGCCTTGAAGAGGCAAAGAATACTATTGAGAGAATGCCCATCAAACCAAACAA
AGTTATTTGGATGAGTCTACTAAGTGGTTCTAGGAAACATGGAAACATAAGAATGGGGGAATATGCGGCTCATCATCTGATTGATCTGGCACCAGATACTACTGGATGTT
ATATTATCCTTTCGAACATGTACGCTGGAGCTGGCTTGTGGGAAAAAGTTCGGCAAGTAAGAGAAATGATGAAGAAAAAAGGAATCAGAAAGGATCCAGGATGCAGTTCC
ATTGAGCATCAAGGTTCAGTTCATGAATTCATTGTGGGTGATAGGTCACATCCTCAAACAGAAGAGATATACATCAAACTGAGTGAGATGAAAGAGAAATTGAACGTAGC
TGGACATGTTCCCGACACAACTCAAGTTCTTTTGTGCCTTGAAGATGATAATGAGAAAGAATCAGAGCTTGAAACCCATAGTGAGAGGTTGGCAATAGCTTTTGGTCTTA
TCAATATCAAGCATGGAAATCCCGTCCGCATCATAAAAAATCTTCGTATTTGCAACGATTGCCACACTGTTAGCAAACTTCTTTCCCATATATATAACCGTGAGATCATT
ATCAGAGACGGTAGTCGATTCCATCACTTTAAAAGTGGGTCTTGTTCTTGTAAAGATTTTTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTGTCTCTCGCACTTTCGCATTCCCTCCTCCCATTTCTCCCTCGGAACCTCCATTTTCCTCTCCAGAACTGCGAAACTGAACGAGAAGTGAAGCAACTCCACGCTCT
GTCGCTCAAAACAGGTTCCTTCAATCACCCTTCAATATCTTCTCGTCTTTTGGCGCTCTATACAGATCCTAGAATCAACAATCTCGAATATGCTCGGTCCCTTTTTGACT
GGATTCGAGAACCCACTTTGGTTTCTTGGAATCTGCTCGTCAAGTGCTACGTCGAGAACCAACGTTCGAATGATGCCATTTCGTTGTTCTGCGAATTGCTCTCTGAGTTC
ATTCCTGATTCTTTTACACTGCCTTGCGTTCTGAAGGGTTGTGCTCGATTAAGTGCTCTAGACGAGGGGAAACAGATTCATGGGTTGATATTGAAAATTGGGTTTGGGGT
GGATAAGTTTGTTTTGAGTAGTTTGGTTAGTATGTATTCTAAGTGTGGTGAGATTGAGCTGTGTAGGAAAGTGTTTGACCGAATGGAAGATAAGGATGTAGTCTCATGGA
ATTCTTTGATTGATGGATATGCAAGATGTGGTCAAATTGAACTGGCACTGGAGGTGTTTGATGAAATGCCGGAGAGGGATTCTTTTTCATGGACTATTCTGGTTGATGGG
CTTTCGAAAAGCGGGAAGTTAGAGACTGCTAGAGACGTGTTCGATCGAATGCCTACTCGAAATTCTGTATCCTGGAATGCTATGATTAATGGCTACATGAAAGCTGGGGA
TTTTAACACGGCACGAGAATTATTCGATCAGATGCCAGAAAGAAACCTCGTTACATGGAATTCGATGATCACTGGATACGAATTGAACAGGCAGTTTTCACAAGCCTTGA
AGCTGTTTGAGGTTATGTTGAGAGAAGAAATATCACCCAATCATGCCACTATCCTTGGAGCTCTTTCTGCAGCTTCAGGATTGGTTAGTTTTGGTAAGGGAAGATGGGTT
CATTCCTTCATAGTGAAAAATGGATTTGAAACAGATGGCGTGCTTGGCACATCGCTGATAGAAATGTACTCCAAGTGTGGCAGCATTGCGAGCGCCCTCAGAGTTTTCAA
ATCTATACCCAAAAAGAAATTGGGGCATTGGACTGCTATAATTGTAGGCTTGGGAATGCATGGTTTGGTAGGGCAAACTCTTGAGCTATTTGATGAAATGTGCAGAATTG
GGCTGAAGCCTCATGCTATTACTTTTATTGGACTGTTAAATGCTTGTAGTCATGCAGGATTTGCAGACGATGCCTATCATTACTTTAAAATGATGATGGACGATTATGGA
ATTGAACCCTCTATCGAACACTACGGTTGCTTGATCGATGTTCTGTGTCGTGCAGGATGCCTTGAAGAGGCAAAGAATACTATTGAGAGAATGCCCATCAAACCAAACAA
AGTTATTTGGATGAGTCTACTAAGTGGTTCTAGGAAACATGGAAACATAAGAATGGGGGAATATGCGGCTCATCATCTGATTGATCTGGCACCAGATACTACTGGATGTT
ATATTATCCTTTCGAACATGTACGCTGGAGCTGGCTTGTGGGAAAAAGTTCGGCAAGTAAGAGAAATGATGAAGAAAAAAGGAATCAGAAAGGATCCAGGATGCAGTTCC
ATTGAGCATCAAGGTTCAGTTCATGAATTCATTGTGGGTGATAGGTCACATCCTCAAACAGAAGAGATATACATCAAACTGAGTGAGATGAAAGAGAAATTGAACGTAGC
TGGACATGTTCCCGACACAACTCAAGTTCTTTTGTGCCTTGAAGATGATAATGAGAAAGAATCAGAGCTTGAAACCCATAGTGAGAGGTTGGCAATAGCTTTTGGTCTTA
TCAATATCAAGCATGGAAATCCCGTCCGCATCATAAAAAATCTTCGTATTTGCAACGATTGCCACACTGTTAGCAAACTTCTTTCCCATATATATAACCGTGAGATCATT
ATCAGAGACGGTAGTCGATTCCATCACTTTAAAAGTGGGTCTTGTTCTTGTAAAGATTTTTGGTAA
Protein sequenceShow/hide protein sequence
MLSLALSHSLLPFLPRNLHFPLQNCETEREVKQLHALSLKTGSFNHPSISSRLLALYTDPRINNLEYARSLFDWIREPTLVSWNLLVKCYVENQRSNDAISLFCELLSEF
IPDSFTLPCVLKGCARLSALDEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDVVSWNSLIDGYARCGQIELALEVFDEMPERDSFSWTILVDG
LSKSGKLETARDVFDRMPTRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNRQFSQALKLFEVMLREEISPNHATILGALSAASGLVSFGKGRWV
HSFIVKNGFETDGVLGTSLIEMYSKCGSIASALRVFKSIPKKKLGHWTAIIVGLGMHGLVGQTLELFDEMCRIGLKPHAITFIGLLNACSHAGFADDAYHYFKMMMDDYG
IEPSIEHYGCLIDVLCRAGCLEEAKNTIERMPIKPNKVIWMSLLSGSRKHGNIRMGEYAAHHLIDLAPDTTGCYIILSNMYAGAGLWEKVRQVREMMKKKGIRKDPGCSS
IEHQGSVHEFIVGDRSHPQTEEIYIKLSEMKEKLNVAGHVPDTTQVLLCLEDDNEKESELETHSERLAIAFGLINIKHGNPVRIIKNLRICNDCHTVSKLLSHIYNREII
IRDGSRFHHFKSGSCSCKDFW