CuGenDBv2

MS013202 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS013202
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Pentatricopeptide repeat (PPR-like) superfamily protein
Genome location: scaffold459:1147299..1149422
RNA-Seq Expression: MS013202
Synteny: MS013202
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6606718.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]; e value: 0.0e+00; % identity: 86.48
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASF FP+ ++NFP HFR YF SH  SITYD ELLDSFD LLRQC+G +HCKQVHSATVVTGAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD
        FDT PFEGLSNLLLWNSIIRANV  GY  E LQLYGKM+  GVL DGFTFPLVLRASSNLG FNLCK+LHCHVVQFGF NHLHVVNEL+GMY KL RMDD
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GASRMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFSKMRMKG+GATAEMLAVVLSVCADLATLD
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGD+RDAEKLFHEM++KNLVSWN+LISSYAESGL DKAFE FS+L +M+  PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKP CLVF KLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASNIVK MPI+PNAY+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHKDTD+AEET  QI NLNSEI GSHMLLSNIF+A  RWEDSARVRISAR KGLK VPG SWIEVKK+VYMFK+GNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

XP_022148095.1 putative pentatricopeptide repeat-containing protein At1g17630 [Momordica charantia]; e value: 0.0e+00; % identity: 99.01
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF
        MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF

Query:  DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDA
        DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRM DA
Subjt:  DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDA

Query:  QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDR
        QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFS+MRMKGIGATAEMLAVVLSVCADLATLDR
Subjt:  QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDR

Query:  GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK
        GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEME+KNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK
Subjt:  GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK

Query:  GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGF
        GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKP CLVFEKLENRDLISWNSMIAGYGF
Subjt:  GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGF

Query:  HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS
        HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS
Subjt:  HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS

Query:  CRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH
        CRMHKDTDIAEETALQISNLNSEI+GSHMLLSNIFAACSRWEDSARVRI ARTKGL IVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH
Subjt:  CRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH

Query:  DFDDSIIE
        DFDDSIIE
Subjt:  DFDDSIIE

XP_022949499.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita moschata]; e value: 0.0e+00; % identity: 86.48
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASF FP+ +INFP HFR YF SH  SITYD +LLDSFD LLRQC+G +HCKQVHSAT+VTGAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD
        FDT PFE L NLLLWNSIIRANV  GY  E LQLYGKM+  GVL DGFTFPLVLRASSNLG FNLCK+LHCHVVQFGF NHLHVVNEL+GMY KL RMDD
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GASRMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFSKMRMKG+GATAEMLAVVLSVCADLATLD
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEM++KNLVSWN+LISSYAESGL DKAFE FS+LEKM   PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKP CLVF KLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASNIVK MPI+PNAY+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHKDTD+AEET  QI NLNSEI GSHMLLSNIF+A  RWEDSARVRISAR KGLK VPG SWIEVKK+VYMFK+GNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

XP_022997825.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita maxima]; e value: 0.0e+00; % identity: 87.18
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITY-DELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASFCFP+ +INFP HFR YF SH  SITY DELLDSFD LLRQC+G +HCKQVHS TVV GAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITY-DELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD
        FDT PFEGLSNLLLWNSIIRANV  GYC E LQLYGKMR  GVL DGFTFPLVLRASSNLG FNLCK LHCHVVQFGFQNHLHVVNELIGMY KL RM D
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GA RMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFSKMRMKG+GATAEMLAVVLSVCADLATLD
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEM++KNLVSWN+LISSYAESGL DKAFE FS+LEKM+  PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKP CLVFEKLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASN+VK MPI+PN Y+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHK TD+AEET  QI NLNSEI GSHMLLSNIF+A  RWEDSARVRISAR KGLK VPG SWIEVKKKVYMFKAGNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

XP_023523771.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita pepo subsp. pepo]; e value: 0.0e+00; % identity: 86.62
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITY-DELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASFCFP+ +INFP HFR+YF S   SITY DELLDSFD LLRQC+G +HCKQVHSATVVTGAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITY-DELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD
        FDT PFEGLSNLLLWNSIIRANV  GY  E LQLYGKMR  GVL DGFTFPLVLRASSNLG FNLCK+LHCHVVQFGFQNHLHVVNEL+GMY KL RMDD
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GASRMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFSKMRMKG+GATAEMLAVVLSVCADL T D
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEM++KNLVSWN+LISSYAESGL DKAFE FS+LEKM+  PEMKP+VITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKP CLVF KLENRDLISWNS+IAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASNIVK MPI+PNAY+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHKDTD+AEET  QI +L+SEI GSHMLLSNI++A  RWEDSARVRISAR KGLK VPG SWIEVKKKVYMFKAGNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LFT1 Uncharacterized protein; e value: 0.0e+00; % identity: 81.97
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDE-LLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        ML ASSYQRFKS SFCFP  SIN        F S   SITYDE L D FDHLLRQC+G QH KQVHSATVVTGA CSAFV+ARLVS+Y+R GLV DARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDE-LLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD
        F +APFE  SN LLWNSIIRANV HGYC E LQLYGKMR  GVLGDGFTFPL+LRASSNLG+FN+CKNLHCHVVQFGFQNHLHV NELIGMYAKL RMDD
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSVVSWNTMVSG+AYNYDV+GASRMF +ME EGVEPNPVTWTSLLSSHARCGHLE TM LF KMRMKG+G TAEMLAVVLSVCADLATL+
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
         GQMIHGY++KGGF DYLFAKNALIT+YGKGG + DAEKLFHEM++KNLVSWNALISS+AESG+ DKA E+ SQLEKM+ YPEMKPNVITWSA+ICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG
        KGLGEESLEVFR+MQLA+VKANSVTI+SV S+CAMLAALNLGREMHGHVIRA MDDN+LVGNGLINMYTKCG+FKP  +VFEKLENRD ISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  F+ MIKSG+ PD VTFIAALSACSHAGLVAEG WLF QM QNF+I+P++EHYACMVDLLGRAGL+EEASNI+KGMP++PNAY+WS+LLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHKDTD+AEE A +ISNLNS+I GSHMLLSNIFAA  RWEDSARVRISAR KGLK VPG SWIEVKKKVYMFKAG +I EGLEKVDEILHDLA QIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
         ++ DD IIE
Subjt:  GHDFDDSIIE

A0A5D3BVI7 Putative pentatricopeptide repeat-containing protein; e value: 0.0e+00; % identity: 80.82
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDE-LLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        ML A SYQRFKS SFCFP  SIN        F S   SITYDE L + FDHLLRQC+G QH KQVHSATVVTGA CSAFV+ARLVS+Y+R GLV DARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDE-LLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD
        F +APFE LSN LLWNSIIRANV HGYC E L LYGKMR  GVLGDGFTFPLVLRASSNLG+ ++CKNLHCHVVQFGFQNHLHV NELIGMYAKL RMDD
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSVVSWNTMVSG+AYNYDV+GASRMF +ME EGVEPNPVTWTSLLSSHARCGHL  TM LF KMRMKG+GATAEMLAVVLSVCADLATL+
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
         GQMIHGY++KGGF DYLFAKNALIT+YGKGGD+ DAEKLFHEM++KNLVSWNALISS+AESG+ DKA E+ SQLEKM+ YPEMKPNVITWS++ICGF+S
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG
        KGLGEESLEVFR+MQLA+VKANSVTI+SV S+CAMLAALNLGRE+HGHVIRA MD+N+LVGNGLINMYTKCG+FKP  LVFEKLENRD ISWNSMIA YG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL   + MIKSG+ PD VTFIAALSACSHAGLVAEG WLF QM QNF+I+P++EHYACMVDLLGRAGL+EEASNI+K MP++PNAY+WS+LLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEG
        SCRMHKDTD+AEE A +ISNLNS+I GSHMLLSNIFAA  RWEDSARVRISAR KGLK VPG SWIEVKKKVY+FKAG + EGLEKVDEILHDLA QIE 
Subjt:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEG

Query:  HDFDDSIIE
        +D DD IIE
Subjt:  HDFDDSIIE

A0A6J1D1Z6 putative pentatricopeptide repeat-containing protein At1g17630; e value: 0.0e+00; % identity: 99.01
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF
        MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF

Query:  DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDA
        DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRM DA
Subjt:  DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDA

Query:  QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDR
        QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFS+MRMKGIGATAEMLAVVLSVCADLATLDR
Subjt:  QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDR

Query:  GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK
        GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEME+KNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK
Subjt:  GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK

Query:  GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGF
        GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKP CLVFEKLENRDLISWNSMIAGYGF
Subjt:  GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGF

Query:  HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS
        HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS
Subjt:  HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS

Query:  CRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH
        CRMHKDTDIAEETALQISNLNSEI+GSHMLLSNIFAACSRWEDSARVRI ARTKGL IVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH
Subjt:  CRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH

Query:  DFDDSIIE
        DFDDSIIE
Subjt:  DFDDSIIE

A0A6J1GD03 putative pentatricopeptide repeat-containing protein At1g17630; e value: 0.0e+00; % identity: 86.48
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASF FP+ +INFP HFR YF SH  SITYD +LLDSFD LLRQC+G +HCKQVHSAT+VTGAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD
        FDT PFE L NLLLWNSIIRANV  GY  E LQLYGKM+  GVL DGFTFPLVLRASSNLG FNLCK+LHCHVVQFGF NHLHVVNEL+GMY KL RMDD
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GASRMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFSKMRMKG+GATAEMLAVVLSVCADLATLD
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEM++KNLVSWN+LISSYAESGL DKAFE FS+LEKM   PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKP CLVF KLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASNIVK MPI+PNAY+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHKDTD+AEET  QI NLNSEI GSHMLLSNIF+A  RWEDSARVRISAR KGLK VPG SWIEVKK+VYMFK+GNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

A0A6J1KAZ0 putative pentatricopeptide repeat-containing protein At1g17630; e value: 0.0e+00; % identity: 87.18
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITY-DELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASFCFP+ +INFP HFR YF SH  SITY DELLDSFD LLRQC+G +HCKQVHS TVV GAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITY-DELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD
        FDT PFEGLSNLLLWNSIIRANV  GYC E LQLYGKMR  GVL DGFTFPLVLRASSNLG FNLCK LHCHVVQFGFQNHLHVVNELIGMY KL RM D
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GA RMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFSKMRMKG+GATAEMLAVVLSVCADLATLD
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEM++KNLVSWN+LISSYAESGL DKAFE FS+LEKM+  PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKP CLVFEKLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASN+VK MPI+PN Y+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHK TD+AEET  QI NLNSEI GSHMLLSNIF+A  RWEDSARVRISAR KGLK VPG SWIEVKKKVYMFKAGNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q9LFL5 Pentatricopeptide repeat-containing protein At5g16860; e value: 7.0e-101; % identity: 33.96
Query:  HLFSITYDELLDSFDHLLRQC---SGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVL
        H  S T D    +F  + + C   S  +  +  H+ ++VTG   + FV   LV++Y+R   + DARKVFD      + +++ WNSII +    G     L
Subjt:  HLFSITYDELLDSFDHLLRQC---SGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVL

Query:  QLYGKM-RRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDG
        +++ +M    G   D  T   VL   ++LG+ +L K LHC  V      ++ V N L+ MYAK G MD+A  VF  M +K VVSWN MV+G++     + 
Subjt:  QLYGKM-RRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDG

Query:  ASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKG
        A R+F KM+ E ++ + VTW++ +S +A+ G     + +  +M   GI      L  VLS CA +  L  G+ IH Y IK   +     KN      G G
Subjt:  ASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKG

Query:  GDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD--VKANSVTISSV
         +              N+V  N LI  YA+    D A  +F  L   +       +V+TW+ +I G++  G   ++LE+  +M   D   + N+ TIS  
Subjt:  GDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD--VKANSVTISSV

Query:  FSVCAMLAALNLGREMHGHVIRALMDD-NILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFI
           CA LAAL +G+++H + +R   +   + V N LI+MY KCG+   A LVF+ +  ++ ++W S++ GYG HG G++AL  FD+M + GF  D VT +
Subjt:  FSVCAMLAALNLGREMHGHVIRALMDD-NILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFI

Query:  AALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGS
          L ACSH+G++ +G   F++M   F + P  EHYAC+VDLLGRAG +  A  +++ MP++P   VW A L+ CR+H   ++ E  A +I+ L S   GS
Subjt:  AALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGS

Query:  HMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNS------------IEGLEKVDEILHDLALQIEGHDFDD
        + LLSN++A   RW+D  R+R   R KG+K  PG SW+E  K    F  G+             ++ ++++ +I +        HD DD
Subjt:  HMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNS------------IEGLEKVDEILHDLALQIEGHDFDD

Q9LNP2 Putative pentatricopeptide repeat-containing protein At1g17630; e value: 1.3e-195; % identity: 48.9
Query:  MLYASSYQ------RFKSASFCF-----PQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACC-SAFVAARLVSVYA
        M++AS +Q       ++  +FCF     P +SI+ P          L S     L   FDHLL  C   Q C+QVH+  +++     S  +AA L+SVYA
Subjt:  MLYASSYQ------RFKSASFCF-----PQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACC-SAFVAARLVSVYA

Query:  RSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELI
        R GL+ DAR VF+T     LS+L LWNSI++ANVSHG     L+LY  MR+ G+ GDG+  PL+LRA   LG F LC+  H  V+Q G + +LHVVNEL+
Subjt:  RSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELI

Query:  GMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVV
         +Y K GRM DA  +F +M +++ +SWN M+ GF+  YD + A ++F  M+ E  +P+ VTWTS+LS H++CG  E  +  F  MRM G   + E LAV 
Subjt:  GMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVV

Query:  LSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVI
         SVCA+L  L   + +HGY+IKGGFE+YL ++NALI VYGK G ++DAE LF ++  K + SWN+LI+S+ ++G  D+A  +FS+LE+M+    +K NV+
Subjt:  LSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVI

Query:  TWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDL
        TW++VI G   +G G++SLE FRQMQ + V ANSVTI  + S+CA L ALNLGRE+HGHVIR  M +NILV N L+NMY KCG      LVFE + ++DL
Subjt:  TWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDL

Query:  ISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQ
        ISWNS+I GYG HG  + AL+ FD+MI SGF PD +  +A LSACSHAGLV +GR +F  M + F ++PQ EHYAC+VDLLGR G ++EAS IVK MP++
Subjt:  ISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQ

Query:  PNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEG-LEKVD
        P   V  ALLNSCRMHK+ DIAE  A Q+S L  E  GS+MLLSNI++A  RWE+SA VR  A+ K LK V G SWIEVKKK Y F +G+ ++   E + 
Subjt:  PNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEG-LEKVD

Query:  EILHDLALQI-------EGHDFDDSI
         +L DL   +       +G++++D +
Subjt:  EILHDLALQI-------EGHDFDDSI

Q9LNU6 Pentatricopeptide repeat-containing protein At1g20230; e value: 1.9e-114; % identity: 34.75
Query:  QVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSF
        Q H+  + +GA    +++A+L++ Y+      DA  V  + P      +  ++S+I A        + + ++ +M   G++ D    P + +  + L +F
Subjt:  QVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSF

Query:  NLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGH
         + K +HC     G      V   +  MY + GRM DA+KVFD+M  K VV+ + ++  +A    ++   R+  +MES G+E N V+W  +LS   R G+
Subjt:  NLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGH

Query:  LEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESG
         +  + +F K+   G       ++ VL    D   L+ G++IHGY+IK G        +A+I +YGK G +     LF++ EM      NA I+  + +G
Subjt:  LEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESG

Query:  LCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNG
        L DKA E+F   ++      M+ NV++W+++I G A  G   E+LE+FR+MQ+A VK N VTI S+   C  +AAL  GR  HG  +R  + DN+ VG+ 
Subjt:  LCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNG

Query:  LINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHY
        LI+MY KCG    + +VF  +  ++L+ WNS++ G+  HG  K+ ++ F+ ++++   PD ++F + LSAC   GL  EG   F  M + + IKP++EHY
Subjt:  LINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHY

Query:  ACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGR
        +CMV+LLGRAG ++EA +++K MP +P++ VW ALLNSCR+  + D+AE  A ++ +L  E  G+++LLSNI+AA   W +   +R    + GLK  PG 
Subjt:  ACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGR

Query:  SWIEVKKKVYMFKAGNSI-----EGLEKVDEILHDL
        SWI+VK +VY   AG+       +  EK+DEI  ++
Subjt:  SWIEVKKKVYMFKAGNSI-----EGLEKVDEILHDL

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690    2.8e-102    31.76
Query:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF
        L+ C      K  H +    G         +LV+    +  R  L F A++VF+ +  E      ++NS+IR   S G C E + L+ +M   G+  D +
Subjt:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF

Query:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN
        TFP  L A +   +      +H  +V+ G+   L V N L+  YA+ G +D A+KVFD+M  ++VVSW +M+ G+A  ++  D     F  +  E V PN
Subjt:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN

Query:  PVTWTSLLSSHARCGHLEGTMALFSKMRMKGI------------------------------------------------GATAEMLAV-----------
         VT   ++S+ A+   LE    +++ +R  GI                                                G T E L V           
Subjt:  PVTWTSLLSSHARCGHLEGTMALFSKMRMKGI------------------------------------------------GATAEMLAV-----------

Query:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY
                +S C+ L  +  G+  HGY+++ GFE +    NALI +Y K      A ++F  M  K +V+WN++++ Y E+G  D A+E F      +  
Subjt:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY

Query:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLV
        PE   N+++W+ +I G     L EE++EVF  MQ  + V A+ VT+ S+ S C  L AL+L + ++ ++ +  +  ++ +G  L++M+++CG+ + A  +
Subjt:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLV

Query:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS
        F  L NRD+ +W + I      G  + A+  FD MI+ G  PD V F+ AL+ACSH GLV +G+ +F  ML+   + P+  HY CMVDLLGRAGL+EEA 
Subjt:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS

Query:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGN
         +++ MP++PN  +W++LL +CR+  + ++A   A +I  L  E  GS++LLSN++A+  RW D A+VR+S + KGL+  PG S I+++ K + F +G+
Subjt:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGN

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600    2.0e-103    31.02
Query:  SITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAP----------FEGLSNL----------------
        S  + +LLDS    ++      + + VH++ + +G     F+  RL+  Y++ G + D R+VFD  P            GL+ L                
Subjt:  SITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAP----------FEGLSNL----------------

Query:  --LLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRL
            WNS++     H  C E L  +  M + G + + ++F  VL A S L   N    +H  + +  F + +++ + L+ MY+K G ++DAQ+VFD+M  
Subjt:  --LLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRL

Query:  KSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYII
        ++VVSWN++++ F  N     A  +F  M    VEP+ VT                                   LA V+S CA L+ +  GQ +HG ++
Subjt:  KSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYII

Query:  KGG-FEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLE
        K     + +   NA + +Y K   I++A  +F  M ++N+++  ++IS YA +     A  +F+++ +         NV++W+A+I G+   G  EE+L 
Subjt:  KGG-FEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLE

Query:  VFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIR------ALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHG
        +F  ++   V     + +++   CA LA L+LG + H HV++      +  +D+I VGN LI+MY KCG  +   LVF K+  RD +SWN+MI G+  +G
Subjt:  VFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIR------ALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHG

Query:  LGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCR
         G +AL  F +M++SG  PD +T I  LSAC HAG V EGR  F  M ++F + P  +HY CMVDLLGRAG +EEA ++++ MP+QP++ +W +LL +C+
Subjt:  LGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCR

Query:  MHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLAL
        +H++  + +  A ++  +     G ++LLSN++A   +WED   VR S R +G+   PG SWI+++   ++F   +     +K    L D+ +
Subjt:  MHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLAL

Arabidopsis top hits    e value    %identity    Alignment
AT1G17630.1 Pentatricopeptide repeat (PPR-like) superfamily protein    9.1e-197    48.9
Query:  MLYASSYQ------RFKSASFCF-----PQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACC-SAFVAARLVSVYA
        M++AS +Q       ++  +FCF     P +SI+ P          L S     L   FDHLL  C   Q C+QVH+  +++     S  +AA L+SVYA
Subjt:  MLYASSYQ------RFKSASFCF-----PQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACC-SAFVAARLVSVYA

Query:  RSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELI
        R GL+ DAR VF+T     LS+L LWNSI++ANVSHG     L+LY  MR+ G+ GDG+  PL+LRA   LG F LC+  H  V+Q G + +LHVVNEL+
Subjt:  RSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELI

Query:  GMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVV
         +Y K GRM DA  +F +M +++ +SWN M+ GF+  YD + A ++F  M+ E  +P+ VTWTS+LS H++CG  E  +  F  MRM G   + E LAV 
Subjt:  GMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVV

Query:  LSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVI
         SVCA+L  L   + +HGY+IKGGFE+YL ++NALI VYGK G ++DAE LF ++  K + SWN+LI+S+ ++G  D+A  +FS+LE+M+    +K NV+
Subjt:  LSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVI

Query:  TWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDL
        TW++VI G   +G G++SLE FRQMQ + V ANSVTI  + S+CA L ALNLGRE+HGHVIR  M +NILV N L+NMY KCG      LVFE + ++DL
Subjt:  TWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDL

Query:  ISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQ
        ISWNS+I GYG HG  + AL+ FD+MI SGF PD +  +A LSACSHAGLV +GR +F  M + F ++PQ EHYAC+VDLLGR G ++EAS IVK MP++
Subjt:  ISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQ

Query:  PNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEG-LEKVD
        P   V  ALLNSCRMHK+ DIAE  A Q+S L  E  GS+MLLSNI++A  RWE+SA VR  A+ K LK V G SWIEVKKK Y F +G+ ++   E + 
Subjt:  PNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEG-LEKVD

Query:  EILHDLALQI-------EGHDFDDSI
         +L DL   +       +G++++D +
Subjt:  EILHDLALQI-------EGHDFDDSI

AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein    1.3e-115    34.75
Query:  QVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSF
        Q H+  + +GA    +++A+L++ Y+      DA  V  + P      +  ++S+I A        + + ++ +M   G++ D    P + +  + L +F
Subjt:  QVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSF

Query:  NLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGH
         + K +HC     G      V   +  MY + GRM DA+KVFD+M  K VV+ + ++  +A    ++   R+  +MES G+E N V+W  +LS   R G+
Subjt:  NLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGH

Query:  LEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESG
         +  + +F K+   G       ++ VL    D   L+ G++IHGY+IK G        +A+I +YGK G +     LF++ EM      NA I+  + +G
Subjt:  LEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESG

Query:  LCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNG
        L DKA E+F   ++      M+ NV++W+++I G A  G   E+LE+FR+MQ+A VK N VTI S+   C  +AAL  GR  HG  +R  + DN+ VG+ 
Subjt:  LCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNG

Query:  LINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHY
        LI+MY KCG    + +VF  +  ++L+ WNS++ G+  HG  K+ ++ F+ ++++   PD ++F + LSAC   GL  EG   F  M + + IKP++EHY
Subjt:  LINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHY

Query:  ACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGR
        +CMV+LLGRAG ++EA +++K MP +P++ VW ALLNSCR+  + D+AE  A ++ +L  E  G+++LLSNI+AA   W +   +R    + GLK  PG 
Subjt:  ACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGR

Query:  SWIEVKKKVYMFKAGNSI-----EGLEKVDEILHDL
        SWI+VK +VY   AG+       +  EK+DEI  ++
Subjt:  SWIEVKKKVYMFKAGNSI-----EGLEKVDEILHDL

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein    1.4e-104    31.02
Query:  SITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAP----------FEGLSNL----------------
        S  + +LLDS    ++      + + VH++ + +G     F+  RL+  Y++ G + D R+VFD  P            GL+ L                
Subjt:  SITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAP----------FEGLSNL----------------

Query:  --LLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRL
            WNS++     H  C E L  +  M + G + + ++F  VL A S L   N    +H  + +  F + +++ + L+ MY+K G ++DAQ+VFD+M  
Subjt:  --LLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRL

Query:  KSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYII
        ++VVSWN++++ F  N     A  +F  M    VEP+ VT                                   LA V+S CA L+ +  GQ +HG ++
Subjt:  KSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYII

Query:  KGG-FEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLE
        K     + +   NA + +Y K   I++A  +F  M ++N+++  ++IS YA +     A  +F+++ +         NV++W+A+I G+   G  EE+L 
Subjt:  KGG-FEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLE

Query:  VFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIR------ALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHG
        +F  ++   V     + +++   CA LA L+LG + H HV++      +  +D+I VGN LI+MY KCG  +   LVF K+  RD +SWN+MI G+  +G
Subjt:  VFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIR------ALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHG

Query:  LGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCR
         G +AL  F +M++SG  PD +T I  LSAC HAG V EGR  F  M ++F + P  +HY CMVDLLGRAG +EEA ++++ MP+QP++ +W +LL +C+
Subjt:  LGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCR

Query:  MHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLAL
        +H++  + +  A ++  +     G ++LLSN++A   +WED   VR S R +G+   PG SWI+++   ++F   +     +K    L D+ +
Subjt:  MHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLAL

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)    2.0e-103    31.76
Query:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF
        L+ C      K  H +    G         +LV+    +  R  L F A++VF+ +  E      ++NS+IR   S G C E + L+ +M   G+  D +
Subjt:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF

Query:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN
        TFP  L A +   +      +H  +V+ G+   L V N L+  YA+ G +D A+KVFD+M  ++VVSW +M+ G+A  ++  D     F  +  E V PN
Subjt:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN

Query:  PVTWTSLLSSHARCGHLEGTMALFSKMRMKGI------------------------------------------------GATAEMLAV-----------
         VT   ++S+ A+   LE    +++ +R  GI                                                G T E L V           
Subjt:  PVTWTSLLSSHARCGHLEGTMALFSKMRMKGI------------------------------------------------GATAEMLAV-----------

Query:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY
                +S C+ L  +  G+  HGY+++ GFE +    NALI +Y K      A ++F  M  K +V+WN++++ Y E+G  D A+E F      +  
Subjt:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY

Query:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLV
        PE   N+++W+ +I G     L EE++EVF  MQ  + V A+ VT+ S+ S C  L AL+L + ++ ++ +  +  ++ +G  L++M+++CG+ + A  +
Subjt:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLV

Query:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS
        F  L NRD+ +W + I      G  + A+  FD MI+ G  PD V F+ AL+ACSH GLV +G+ +F  ML+   + P+  HY CMVDLLGRAGL+EEA 
Subjt:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS

Query:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGN
         +++ MP++PN  +W++LL +CR+  + ++A   A +I  L  E  GS++LLSN++A+  RW D A+VR+S + KGL+  PG S I+++ K + F +G+
Subjt:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGN

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification    2.0e-103    31.76
Query:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF
        L+ C      K  H +    G         +LV+    +  R  L F A++VF+ +  E      ++NS+IR   S G C E + L+ +M   G+  D +
Subjt:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF

Query:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN
        TFP  L A +   +      +H  +V+ G+   L V N L+  YA+ G +D A+KVFD+M  ++VVSW +M+ G+A  ++  D     F  +  E V PN
Subjt:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN

Query:  PVTWTSLLSSHARCGHLEGTMALFSKMRMKGI------------------------------------------------GATAEMLAV-----------
         VT   ++S+ A+   LE    +++ +R  GI                                                G T E L V           
Subjt:  PVTWTSLLSSHARCGHLEGTMALFSKMRMKGI------------------------------------------------GATAEMLAV-----------

Query:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY
                +S C+ L  +  G+  HGY+++ GFE +    NALI +Y K      A ++F  M  K +V+WN++++ Y E+G  D A+E F      +  
Subjt:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY

Query:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLV
        PE   N+++W+ +I G     L EE++EVF  MQ  + V A+ VT+ S+ S C  L AL+L + ++ ++ +  +  ++ +G  L++M+++CG+ + A  +
Subjt:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLV

Query:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS
        F  L NRD+ +W + I      G  + A+  FD MI+ G  PD V F+ AL+ACSH GLV +G+ +F  ML+   + P+  HY CMVDLLGRAGL+EEA 
Subjt:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS

Query:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGN
         +++ MP++PN  +W++LL +CR+  + ++A   A +I  L  E  GS++LLSN++A+  RW D A+VR+S + KGL+  PG S I+++ K + F +G+
Subjt:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVPGRSWIEVKKKVYMFKAGN


Sequences
CDS sequence
ATGCTGTATGCTTCTTCTTATCAGCGATTCAAATCAGCTTCCTTTTGTTTTCCCCAATCCTCAATCAATTTCCCCTTTCATTTTCGGTTCTATTTCAAATCCCATCTCTT
CTCAATCACGTATGACGAGCTCCTTGATTCCTTTGATCATCTTCTTCGGCAATGTAGCGGGACTCAACATTGCAAACAAGTTCATTCTGCAACCGTTGTCACCGGCGCGT
GTTGCTCGGCGTTCGTCGCCGCCCGGCTTGTATCCGTCTACGCCCGTTCTGGGCTTGTTTTCGATGCCCGGAAAGTGTTTGATACTGCGCCATTTGAAGGCTTGTCGAAC
TTGCTCTTATGGAATTCGATTATAAGAGCAAATGTCTCCCATGGGTACTGCGGAGAAGTGCTTCAACTTTATGGGAAAATGAGAAGATTAGGGGTTTTGGGGGATGGGTT
TACTTTTCCTCTGGTTCTGAGGGCTTCTTCCAACTTGGGTAGTTTCAACTTGTGTAAGAATCTTCATTGCCATGTTGTGCAATTTGGATTCCAAAATCATTTGCATGTTG
TGAATGAATTGATCGGAATGTATGCCAAGCTCGGACGAATGGATGATGCCCAGAAAGTGTTTGATAAAATGCGCCTTAAAAGTGTAGTTTCTTGGAACACTATGGTTTCT
GGTTTTGCCTATAATTATGATGTTGATGGTGCTTCTAGGATGTTCCTTAAAATGGAGTCAGAAGGGGTTGAACCGAACCCTGTAACTTGGACTTCGTTGTTGTCTAGTCA
TGCTCGGTGTGGTCATCTTGAAGGAACCATGGCCTTGTTTAGCAAGATGAGGATGAAAGGTATTGGTGCCACTGCTGAAATGCTTGCTGTGGTGTTATCTGTTTGTGCTG
ATTTAGCTACATTGGATAGGGGTCAGATGATTCATGGGTATATAATAAAGGGAGGTTTTGAAGATTATTTGTTCGCCAAAAACGCACTTATAACTGTATATGGAAAAGGA
GGAGACATAAGAGATGCAGAGAAGTTATTTCATGAGATGGAAATGAAGAATCTTGTGAGTTGGAACGCTCTCATTTCCTCCTATGCTGAATCTGGTTTATGTGACAAGGC
TTTTGAAGTGTTTTCTCAGCTGGAGAAAATGGATGTCTATCCAGAGATGAAACCGAACGTCATAACTTGGAGTGCTGTCATTTGTGGATTTGCTTCCAAGGGACTAGGAG
AAGAATCTTTGGAAGTTTTTCGTCAAATGCAGCTTGCAGATGTTAAAGCCAATTCAGTGACGATATCTAGTGTTTTTTCAGTTTGTGCAATGCTGGCAGCTCTGAATCTT
GGTAGGGAAATGCATGGTCACGTGATTAGAGCTCTGATGGATGATAACATATTGGTAGGAAACGGATTGATTAACATGTATACGAAATGTGGAAATTTCAAGCCAGCTTG
TTTAGTGTTCGAAAAACTTGAAAATCGAGATTTGATCTCATGGAACTCAATGATTGCAGGATATGGATTTCATGGACTTGGTAAAGATGCTCTTACAGCTTTTGATCAGA
TGATTAAATCTGGATTTATACCAGATGATGTCACCTTTATTGCTGCGCTATCTGCTTGTAGTCATGCTGGTCTTGTTGCTGAAGGCCGTTGGCTTTTTGATCAGATGCTA
CAAAACTTCAGGATCAAACCTCAGATGGAGCACTATGCGTGCATGGTCGATCTTCTCGGTCGTGCTGGTCTCATTGAAGAAGCAAGTAACATAGTCAAAGGCATGCCAAT
TCAACCCAATGCTTATGTCTGGAGTGCTCTTCTCAACTCTTGCAGAATGCACAAAGATACAGACATTGCAGAAGAAACTGCCCTACAAATTTCCAATCTGAATTCCGAGA
TCATGGGGAGCCATATGTTGCTCTCAAATATATTTGCTGCGTGCTCTAGATGGGAGGATTCTGCAAGGGTGAGGATCTCGGCCAGGACGAAGGGCTTAAAGATAGTTCCT
GGGCGTAGCTGGATTGAGGTGAAGAAGAAGGTTTATATGTTCAAAGCAGGAAACTCAATAGAAGGTCTAGAGAAAGTTGATGAAATTCTTCATGATTTGGCTCTTCAGAT
AGAAGGTCATGATTTTGATGATAGTATTATTGAA
mRNA sequence
ATGCTGTATGCTTCTTCTTATCAGCGATTCAAATCAGCTTCCTTTTGTTTTCCCCAATCCTCAATCAATTTCCCCTTTCATTTTCGGTTCTATTTCAAATCCCATCTCTT
CTCAATCACGTATGACGAGCTCCTTGATTCCTTTGATCATCTTCTTCGGCAATGTAGCGGGACTCAACATTGCAAACAAGTTCATTCTGCAACCGTTGTCACCGGCGCGT
GTTGCTCGGCGTTCGTCGCCGCCCGGCTTGTATCCGTCTACGCCCGTTCTGGGCTTGTTTTCGATGCCCGGAAAGTGTTTGATACTGCGCCATTTGAAGGCTTGTCGAAC
TTGCTCTTATGGAATTCGATTATAAGAGCAAATGTCTCCCATGGGTACTGCGGAGAAGTGCTTCAACTTTATGGGAAAATGAGAAGATTAGGGGTTTTGGGGGATGGGTT
TACTTTTCCTCTGGTTCTGAGGGCTTCTTCCAACTTGGGTAGTTTCAACTTGTGTAAGAATCTTCATTGCCATGTTGTGCAATTTGGATTCCAAAATCATTTGCATGTTG
TGAATGAATTGATCGGAATGTATGCCAAGCTCGGACGAATGGATGATGCCCAGAAAGTGTTTGATAAAATGCGCCTTAAAAGTGTAGTTTCTTGGAACACTATGGTTTCT
GGTTTTGCCTATAATTATGATGTTGATGGTGCTTCTAGGATGTTCCTTAAAATGGAGTCAGAAGGGGTTGAACCGAACCCTGTAACTTGGACTTCGTTGTTGTCTAGTCA
TGCTCGGTGTGGTCATCTTGAAGGAACCATGGCCTTGTTTAGCAAGATGAGGATGAAAGGTATTGGTGCCACTGCTGAAATGCTTGCTGTGGTGTTATCTGTTTGTGCTG
ATTTAGCTACATTGGATAGGGGTCAGATGATTCATGGGTATATAATAAAGGGAGGTTTTGAAGATTATTTGTTCGCCAAAAACGCACTTATAACTGTATATGGAAAAGGA
GGAGACATAAGAGATGCAGAGAAGTTATTTCATGAGATGGAAATGAAGAATCTTGTGAGTTGGAACGCTCTCATTTCCTCCTATGCTGAATCTGGTTTATGTGACAAGGC
TTTTGAAGTGTTTTCTCAGCTGGAGAAAATGGATGTCTATCCAGAGATGAAACCGAACGTCATAACTTGGAGTGCTGTCATTTGTGGATTTGCTTCCAAGGGACTAGGAG
AAGAATCTTTGGAAGTTTTTCGTCAAATGCAGCTTGCAGATGTTAAAGCCAATTCAGTGACGATATCTAGTGTTTTTTCAGTTTGTGCAATGCTGGCAGCTCTGAATCTT
GGTAGGGAAATGCATGGTCACGTGATTAGAGCTCTGATGGATGATAACATATTGGTAGGAAACGGATTGATTAACATGTATACGAAATGTGGAAATTTCAAGCCAGCTTG
TTTAGTGTTCGAAAAACTTGAAAATCGAGATTTGATCTCATGGAACTCAATGATTGCAGGATATGGATTTCATGGACTTGGTAAAGATGCTCTTACAGCTTTTGATCAGA
TGATTAAATCTGGATTTATACCAGATGATGTCACCTTTATTGCTGCGCTATCTGCTTGTAGTCATGCTGGTCTTGTTGCTGAAGGCCGTTGGCTTTTTGATCAGATGCTA
CAAAACTTCAGGATCAAACCTCAGATGGAGCACTATGCGTGCATGGTCGATCTTCTCGGTCGTGCTGGTCTCATTGAAGAAGCAAGTAACATAGTCAAAGGCATGCCAAT
TCAACCCAATGCTTATGTCTGGAGTGCTCTTCTCAACTCTTGCAGAATGCACAAAGATACAGACATTGCAGAAGAAACTGCCCTACAAATTTCCAATCTGAATTCCGAGA
TCATGGGGAGCCATATGTTGCTCTCAAATATATTTGCTGCGTGCTCTAGATGGGAGGATTCTGCAAGGGTGAGGATCTCGGCCAGGACGAAGGGCTTAAAGATAGTTCCT
GGGCGTAGCTGGATTGAGGTGAAGAAGAAGGTTTATATGTTCAAAGCAGGAAACTCAATAGAAGGTCTAGAGAAAGTTGATGAAATTCTTCATGATTTGGCTCTTCAGAT
AGAAGGTCATGATTTTGATGATAGTATTATTGAA
Protein sequence
MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSN
LLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMDDAQKVFDKMRLKSVVSWNTMVS
GFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSKMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKG
GDIRDAEKLFHEMEMKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNL
GREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPACLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQML
QNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIMGSHMLLSNIFAACSRWEDSARVRISARTKGLKIVP
GRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGHDFDDSIIE
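
The CDS and protein sequences listed above should correspond codon by codon. The following is a minimal Python sketch for checking this, assuming the standard genetic code applies to this nuclear gene. The function name `translate` and the two short test fragments (the first 30 and last 18 nucleotides of the CDS as listed) are illustrative choices and not part of the CuGenDB record.

```python
from itertools import product

# Standard genetic code laid out in the conventional T, C, A, G order
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted directly.
    cds = "".join(cds.split()).upper()
    # Ignore any trailing bases that do not form a complete codon.
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Fragments copied from the start and end of the CDS listed above.
cds_start = "ATGCTGTATGCTTCTTCTTATCAGCGATTC"
cds_end = "GATGATAGTATTATTGAA"

print(translate(cds_start))
print(translate(cds_end))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is the intended use. The CDS as listed ends on a GAA (Glu) codon, so no trailing `*` is expected from this record.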