CuGenDBv2

MC08g1094 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC08g1094
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Pentatricopeptide repeat (PPR-like) superfamily protein
Genome location: MC08:9123418..9125541
RNA-Seq Expression: MC08g1094
Synteny: MC08g1094
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606718.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 0.0; % identity 86.2
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASF FP+ ++NFP HFR YF SH  SITYD ELLDSFD LLRQC+G +HCKQVHSATVVTGAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD
        FDT PFEGLSNLLLWNSIIRANV  GY  E LQLYGKM+  GVL DGFTFPLVLRASSNLG FNLCK+LHCHVVQFGF NHLHVVNEL+GMY KL RM D
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GASRMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFS+MRMKG+GATAEMLAVVLSVCADLATLD
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGD+RDAEKLFHEM+VKNLVSWN+LISSYAESGL DKAFE FS+L +M+  PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKPGCLVF KLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASNIVK MPI+PNAY+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHKDTD+AEET  QI NLNSEI GSHMLLSNIF+A  RWEDSARVRI AR KGL  VPG SWIEVKK+VYMFK+GNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

XP_022148095.1 putative pentatricopeptide repeat-containing protein At1g17630 [Momordica charantia]; e value 0.0; % identity 100
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF
        MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF

Query:  DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDA
        DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDA
Subjt:  DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDA

Query:  QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDR
        QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDR
Subjt:  QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDR

Query:  GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK
        GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK
Subjt:  GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK

Query:  GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGF
        GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGF
Subjt:  GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGF

Query:  HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS
        HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS
Subjt:  HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS

Query:  CRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH
        CRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH
Subjt:  CRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH

Query:  DFDDSIIE
        DFDDSIIE
Subjt:  DFDDSIIE

XP_022949499.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita moschata]; e value 0.0; % identity 86.2
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASF FP+ +INFP HFR YF SH  SITYD +LLDSFD LLRQC+G +HCKQVHSAT+VTGAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD
        FDT PFE L NLLLWNSIIRANV  GY  E LQLYGKM+  GVL DGFTFPLVLRASSNLG FNLCK+LHCHVVQFGF NHLHVVNEL+GMY KL RM D
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GASRMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFS+MRMKG+GATAEMLAVVLSVCADLATLD
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEM+VKNLVSWN+LISSYAESGL DKAFE FS+LEKM   PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKPGCLVF KLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASNIVK MPI+PNAY+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHKDTD+AEET  QI NLNSEI GSHMLLSNIF+A  RWEDSARVRI AR KGL  VPG SWIEVKK+VYMFK+GNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

XP_022997825.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita maxima]; e value 0.0; % identity 87.18
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASFCFP+ +INFP HFR YF SH  SITYD ELLDSFD LLRQC+G +HCKQVHS TVV GAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD
        FDT PFEGLSNLLLWNSIIRANV  GYC E LQLYGKMR  GVL DGFTFPLVLRASSNLG FNLCK LHCHVVQFGFQNHLHVVNELIGMY KL RMGD
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GA RMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFS+MRMKG+GATAEMLAVVLSVCADLATLD
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEM+VKNLVSWN+LISSYAESGL DKAFE FS+LEKM+  PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKPGCLVFEKLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASN+VK MPI+PN Y+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHK TD+AEET  QI NLNSEI GSHMLLSNIF+A  RWEDSARVRI AR KGL  VPG SWIEVKKKVYMFKAGNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

XP_023523771.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita pepo subsp. pepo]; e value 0.0; % identity 86.34
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASFCFP+ +INFP HFR+YF S   SITYD ELLDSFD LLRQC+G +HCKQVHSATVVTGAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD
        FDT PFEGLSNLLLWNSIIRANV  GY  E LQLYGKMR  GVL DGFTFPLVLRASSNLG FNLCK+LHCHVVQFGFQNHLHVVNEL+GMY KL RM D
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GASRMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFS+MRMKG+GATAEMLAVVLSVCADL T D
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEM+VKNLVSWN+LISSYAESGL DKAFE FS+LEKM+  PEMKP+VITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKPGCLVF KLENRDLISWNS+IAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASNIVK MPI+PNAY+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHKDTD+AEET  QI +L+SEI GSHMLLSNI++A  RWEDSARVRI AR KGL  VPG SWIEVKKKVYMFKAGNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE
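
The % identity column can be cross-checked against the alignment blocks themselves. The following is a minimal sketch, not part of CuGenDB: it assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), that identity is counted over all alignment columns including gap columns, and that each `Query:` line carries an 8-character label prefix. The function names are illustrative.

```python
PREFIX_LEN = 8  # assumed width of the "Query:  " label and of the midline indent


def strip_label(line: str) -> str:
    """Drop the assumed 8-character label (or indent) from an alignment line."""
    return line[PREFIX_LEN:]


def percent_identity(query: str, midline: str) -> float:
    """Percent identity for one alignment slice, read off the midline.

    A column counts as identical when the midline repeats the query residue.
    Trailing spaces trimmed from the midline are restored by padding.
    """
    if not query:
        raise ValueError("empty alignment")
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


if __name__ == "__main__":
    # First 15 columns of the first GenBank hit: one mismatch (K vs. space).
    q = strip_label("Query:  MLYASSYQRFKSASF")
    m = strip_label("        MLYASSYQRF SASF")
    print(round(percent_identity(q, m), 2))
```

To reproduce the value reported for a whole hit, the query and midline slices of all of its blocks would need to be concatenated before calling `percent_identity`.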

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LFT1 Uncharacterized protein; e value 0.0; % identity 81.69
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDE-LLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        ML ASSYQRFKS SFCFP  SINF         S   SITYDE L D FDHLLRQC+G QH KQVHSATVVTGA CSAFV+ARLVS+Y+R GLV DARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDE-LLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD
        F +APFE  SN LLWNSIIRANV HGYC E LQLYGKMR  GVLGDGFTFPL+LRASSNLG+FN+CKNLHCHVVQFGFQNHLHV NELIGMYAKL RM D
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSVVSWNTMVSG+AYNYDV+GASRMF +ME EGVEPNPVTWTSLLSSHARCGHLE TM LF +MRMKG+G TAEMLAVVLSVCADLATL+
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
         GQMIHGY++KGGF DYLFAKNALIT+YGKGG + DAEKLFHEM+VKNLVSWNALISS+AESG+ DKA E+ SQLEKM+ YPEMKPNVITWSA+ICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG
        KGLGEESLEVFR+MQLA+VKANSVTI+SV S+CAMLAALNLGREMHGHVIRA MDDN+LVGNGLINMYTKCG+FKPG +VFEKLENRD ISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  F+ MIKSG+ PD VTFIAALSACSHAGLVAEG WLF QM QNF+I+P++EHYACMVDLLGRAGL+EEASNI+KGMP++PNAY+WS+LLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHKDTD+AEE A +ISNLNS+I GSHMLLSNIFAA  RWEDSARVRI AR KGL  VPG SWIEVKKKVYMFKAG +I EGLEKVDEILHDLA QIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
         ++ DD IIE
Subjt:  GHDFDDSIIE

A0A5D3BVI7 Putative pentatricopeptide repeat-containing protein; e value 0.0; % identity 80.54
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDE-LLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        ML A SYQRFKS SFCFP  SINF         S   SITYDE L + FDHLLRQC+G QH KQVHSATVVTGA CSAFV+ARLVS+Y+R GLV DARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDE-LLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD
        F +APFE LSN LLWNSIIRANV HGYC E L LYGKMR  GVLGDGFTFPLVLRASSNLG+ ++CKNLHCHVVQFGFQNHLHV NELIGMYAKL RM D
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSVVSWNTMVSG+AYNYDV+GASRMF +ME EGVEPNPVTWTSLLSSHARCGHL  TM LF +MRMKG+GATAEMLAVVLSVCADLATL+
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
         GQMIHGY++KGGF DYLFAKNALIT+YGKGGD+ DAEKLFHEM+VKNLVSWNALISS+AESG+ DKA E+ SQLEKM+ YPEMKPNVITWS++ICGF+S
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG
        KGLGEESLEVFR+MQLA+VKANSVTI+SV S+CAMLAALNLGRE+HGHVIRA MD+N+LVGNGLINMYTKCG+FKPG LVFEKLENRD ISWNSMIA YG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL   + MIKSG+ PD VTFIAALSACSHAGLVAEG WLF QM QNF+I+P++EHYACMVDLLGRAGL+EEASNI+K MP++PNAY+WS+LLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEG
        SCRMHKDTD+AEE A +ISNLNS+I GSHMLLSNIFAA  RWEDSARVRI AR KGL  VPG SWIEVKKKVY+FKAG + EGLEKVDEILHDLA QIE 
Subjt:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEG

Query:  HDFDDSIIE
        +D DD IIE
Subjt:  HDFDDSIIE

A0A6J1D1Z6 putative pentatricopeptide repeat-containing protein At1g17630; e value 0.0; % identity 100
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF
        MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVF

Query:  DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDA
        DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDA
Subjt:  DTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDA

Query:  QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDR
        QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDR
Subjt:  QKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDR

Query:  GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK
        GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK
Subjt:  GQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASK

Query:  GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGF
        GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGF
Subjt:  GLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGF

Query:  HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS
        HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS
Subjt:  HGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNS

Query:  CRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH
        CRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH
Subjt:  CRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGH

Query:  DFDDSIIE
        DFDDSIIE
Subjt:  DFDDSIIE

A0A6J1GD03 putative pentatricopeptide repeat-containing protein At1g17630; e value 0.0; % identity 86.2
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASF FP+ +INFP HFR YF SH  SITYD +LLDSFD LLRQC+G +HCKQVHSAT+VTGAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD
        FDT PFE L NLLLWNSIIRANV  GY  E LQLYGKM+  GVL DGFTFPLVLRASSNLG FNLCK+LHCHVVQFGF NHLHVVNEL+GMY KL RM D
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GASRMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFS+MRMKG+GATAEMLAVVLSVCADLATLD
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEM+VKNLVSWN+LISSYAESGL DKAFE FS+LEKM   PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKPGCLVF KLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASNIVK MPI+PNAY+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHKDTD+AEET  QI NLNSEI GSHMLLSNIF+A  RWEDSARVRI AR KGL  VPG SWIEVKK+VYMFK+GNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

A0A6J1KAZ0 putative pentatricopeptide repeat-containing protein At1g17630; e value 0.0; % identity 87.18
Query:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV
        MLYASSYQRF SASFCFP+ +INFP HFR YF SH  SITYD ELLDSFD LLRQC+G +HCKQVHS TVV GAC SAFVAARLVSVYARSG VFDARKV
Subjt:  MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYD-ELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKV

Query:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD
        FDT PFEGLSNLLLWNSIIRANV  GYC E LQLYGKMR  GVL DGFTFPLVLRASSNLG FNLCK LHCHVVQFGFQNHLHVVNELIGMY KL RMGD
Subjt:  FDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGD

Query:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD
        A+KVFDKMR+KSV+SWNTMVSG+AYNYDV+GA RMFL+ME EGVEPNPVTWTSLLSSHARCGHLE T+ALFS+MRMKG+GATAEMLAVVLSVCADLATLD
Subjt:  AQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLD

Query:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS
        RGQM+HGYI+KGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEM+VKNLVSWN+LISSYAESGL DKAFE FS+LEKM+  PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQLA+VKANSVTISSV S+CAMLAALNLGREMHGHVIRA M+DNILVGNGLINMYTKCG+FKPGCLVFEKLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN
         HGLGKDAL  FD+MIKSGF PDDVTFIAALSACSHAGLVAEGRWLFDQMLQNF+IKPQMEHYACMVDLLGRAGL+EEASN+VK MPI+PN Y+WSALLN
Subjt:  FHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLN

Query:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE
        SCRMHK TD+AEET  QI NLNSEI GSHMLLSNIF+A  RWEDSARVRI AR KGL  VPG SWIEVKKKVYMFKAGNS+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSI-EGLEKVDEILHDLALQIE

Query:  GHDFDDSIIE
          DFDDSIIE
Subjt:  GHDFDDSIIE

SwissProt top hits (e value, % identity, alignment)
Q9LFL5 Pentatricopeptide repeat-containing protein At5g16860; e value 7.7e-100; % identity 33.67
Query:  HLFSITYDELLDSFDHLLRQC---SGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVL
        H  S T D    +F  + + C   S  +  +  H+ ++VTG   + FV   LV++Y+R   + DARKVFD      + +++ WNSII +    G     L
Subjt:  HLFSITYDELLDSFDHLLRQC---SGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVL

Query:  QLYGKM-RRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDG
        +++ +M    G   D  T   VL   ++LG+ +L K LHC  V      ++ V N L+ MYAK G M +A  VF  M +K VVSWN MV+G++     + 
Subjt:  QLYGKM-RRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDG

Query:  ASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKG
        A R+F KM+ E ++ + VTW++ +S +A+ G     + +  +M   GI      L  VLS CA +  L  G+ IH Y IK   +     KN      G G
Subjt:  ASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKG

Query:  GDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD--VKANSVTISSV
         +              N+V  N LI  YA+    D A  +F  L   +       +V+TW+ +I G++  G   ++LE+  +M   D   + N+ TIS  
Subjt:  GDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD--VKANSVTISSV

Query:  FSVCAMLAALNLGREMHGHVIRALMDD-NILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFI
           CA LAAL +G+++H + +R   +   + V N LI+MY KCG+     LVF+ +  ++ ++W S++ GYG HG G++AL  FD+M + GF  D VT +
Subjt:  FSVCAMLAALNLGREMHGHVIRALMDD-NILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFI

Query:  AALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGS
          L ACSH+G++ +G   F++M   F + P  EHYAC+VDLLGRAG +  A  +++ MP++P   VW A L+ CR+H   ++ E  A +I+ L S   GS
Subjt:  AALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGS

Query:  HMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNS------------IEGLEKVDEILHDLALQIEGHDFDD
        + LLSN++A   RW+D  R+R L R KG+   PG SW+E  K    F  G+             ++ ++++ +I +        HD DD
Subjt:  HMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNS------------IEGLEKVDEILHDLALQIEGHDFDD

Q9LNP2 Putative pentatricopeptide repeat-containing protein At1g17630; e value 2.3e-197; % identity 49.17
Query:  MLYASSYQ------RFKSASFCF-----PQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACC-SAFVAARLVSVYA
        M++AS +Q       ++  +FCF     P +SI+ P          L S     L   FDHLL  C   Q C+QVH+  +++     S  +AA L+SVYA
Subjt:  MLYASSYQ------RFKSASFCF-----PQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACC-SAFVAARLVSVYA

Query:  RSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELI
        R GL+ DAR VF+T     LS+L LWNSI++ANVSHG     L+LY  MR+ G+ GDG+  PL+LRA   LG F LC+  H  V+Q G + +LHVVNEL+
Subjt:  RSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELI

Query:  GMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVV
         +Y K GRMGDA  +F +M +++ +SWN M+ GF+  YD + A ++F  M+ E  +P+ VTWTS+LS H++CG  E  +  F  MRM G   + E LAV 
Subjt:  GMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVV

Query:  LSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVI
         SVCA+L  L   + +HGY+IKGGFE+YL ++NALI VYGK G ++DAE LF ++  K + SWN+LI+S+ ++G  D+A  +FS+LE+M+    +K NV+
Subjt:  LSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVI

Query:  TWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDL
        TW++VI G   +G G++SLE FRQMQ + V ANSVTI  + S+CA L ALNLGRE+HGHVIR  M +NILV N L+NMY KCG    G LVFE + ++DL
Subjt:  TWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDL

Query:  ISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQ
        ISWNS+I GYG HG  + AL+ FD+MI SGF PD +  +A LSACSHAGLV +GR +F  M + F ++PQ EHYAC+VDLLGR G ++EAS IVK MP++
Subjt:  ISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQ

Query:  PNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEG-LEKVD
        P   V  ALLNSCRMHK+ DIAE  A Q+S L  E  GS+MLLSNI++A  RWE+SA VR LA+ K L  V G SWIEVKKK Y F +G+ ++   E + 
Subjt:  PNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEG-LEKVD

Query:  EILHDLALQI-------EGHDFDDSI
         +L DL   +       +G++++D +
Subjt:  EILHDLALQI-------EGHDFDDSI

Q9LNU6 Pentatricopeptide repeat-containing protein At1g20230; e value 1.2e-113; % identity 34.43
Query:  QVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSF
        Q H+  + +GA    +++A+L++ Y+      DA  V  + P      +  ++S+I A        + + ++ +M   G++ D    P + +  + L +F
Subjt:  QVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSF

Query:  NLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGH
         + K +HC     G      V   +  MY + GRMGDA+KVFD+M  K VV+ + ++  +A    ++   R+  +MES G+E N V+W  +LS   R G+
Subjt:  NLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGH

Query:  LEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESG
         +  + +F ++   G       ++ VL    D   L+ G++IHGY+IK G        +A+I +YGK G +     LF++ E+      NA I+  + +G
Subjt:  LEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESG

Query:  LCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNG
        L DKA E+F   ++      M+ NV++W+++I G A  G   E+LE+FR+MQ+A VK N VTI S+   C  +AAL  GR  HG  +R  + DN+ VG+ 
Subjt:  LCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNG

Query:  LINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHY
        LI+MY KCG      +VF  +  ++L+ WNS++ G+  HG  K+ ++ F+ ++++   PD ++F + LSAC   GL  EG   F  M + + IKP++EHY
Subjt:  LINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHY

Query:  ACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGR
        +CMV+LLGRAG ++EA +++K MP +P++ VW ALLNSCR+  + D+AE  A ++ +L  E  G+++LLSNI+AA   W +   +R    + GL   PG 
Subjt:  ACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGR

Query:  SWIEVKKKVYMFKAGNSI-----EGLEKVDEILHDL
        SWI+VK +VY   AG+       +  EK+DEI  ++
Subjt:  SWIEVKKKVYMFKAGNSI-----EGLEKVDEILHDL

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690 (e value: 4.5e-100; % identity: 31.33)
Query:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF
        L+ C      K  H +    G         +LV+    +  R  L F A++VF+ +  E      ++NS+IR   S G C E + L+ +M   G+  D +
Subjt:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF

Query:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN
        TFP  L A +   +      +H  +V+ G+   L V N L+  YA+ G +  A+KVFD+M  ++VVSW +M+ G+A  ++  D     F  +  E V PN
Subjt:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN

Query:  PVTWTSLLSSHARCGHLEGTMALFSRMRMKGI------------------------------------------------GATAEMLAV-----------
         VT   ++S+ A+   LE    +++ +R  GI                                                G T E L V           
Subjt:  PVTWTSLLSSHARCGHLEGTMALFSRMRMKGI------------------------------------------------GATAEMLAV-----------

Query:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY
                +S C+ L  +  G+  HGY+++ GFE +    NALI +Y K      A ++F  M  K +V+WN++++ Y E+G  D A+E F      +  
Subjt:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY

Query:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLV
        PE   N+++W+ +I G     L EE++EVF  MQ  + V A+ VT+ S+ S C  L AL+L + ++ ++ +  +  ++ +G  L++M+++CG+ +    +
Subjt:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLV

Query:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS
        F  L NRD+ +W + I      G  + A+  FD MI+ G  PD V F+ AL+ACSH GLV +G+ +F  ML+   + P+  HY CMVDLLGRAGL+EEA 
Subjt:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS

Query:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGN
         +++ MP++PN  +W++LL +CR+  + ++A   A +I  L  E  GS++LLSN++A+  RW D A+VR+  + KGL   PG S I+++ K + F +G+
Subjt:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGN

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 (e value: 1.2e-103; % identity: 31.02)
Query:  SITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAP----------FEGLSNL----------------
        S  + +LLDS    ++      + + VH++ + +G     F+  RL+  Y++ G + D R+VFD  P            GL+ L                
Subjt:  SITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAP----------FEGLSNL----------------

Query:  --LLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRL
            WNS++     H  C E L  +  M + G + + ++F  VL A S L   N    +H  + +  F + +++ + L+ MY+K G + DAQ+VFD+M  
Subjt:  --LLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRL

Query:  KSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYII
        ++VVSWN++++ F  N     A  +F  M    VEP+ VT                                   LA V+S CA L+ +  GQ +HG ++
Subjt:  KSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYII

Query:  KGG-FEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLE
        K     + +   NA + +Y K   I++A  +F  M ++N+++  ++IS YA +     A  +F+++ +         NV++W+A+I G+   G  EE+L 
Subjt:  KGG-FEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLE

Query:  VFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIR------ALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHG
        +F  ++   V     + +++   CA LA L+LG + H HV++      +  +D+I VGN LI+MY KCG  + G LVF K+  RD +SWN+MI G+  +G
Subjt:  VFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIR------ALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHG

Query:  LGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCR
         G +AL  F +M++SG  PD +T I  LSAC HAG V EGR  F  M ++F + P  +HY CMVDLLGRAG +EEA ++++ MP+QP++ +W +LL +C+
Subjt:  LGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCR

Query:  MHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLAL
        +H++  + +  A ++  +     G ++LLSN++A   +WED   VR   R +G+   PG SWI+++   ++F   +     +K    L D+ +
Subjt:  MHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLAL

Arabidopsis top hits (e value, % identity, alignment)
AT1G17630.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 1.7e-198; % identity: 49.17)
Query:  MLYASSYQ------RFKSASFCF-----PQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACC-SAFVAARLVSVYA
        M++AS +Q       ++  +FCF     P +SI+ P          L S     L   FDHLL  C   Q C+QVH+  +++     S  +AA L+SVYA
Subjt:  MLYASSYQ------RFKSASFCF-----PQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACC-SAFVAARLVSVYA

Query:  RSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELI
        R GL+ DAR VF+T     LS+L LWNSI++ANVSHG     L+LY  MR+ G+ GDG+  PL+LRA   LG F LC+  H  V+Q G + +LHVVNEL+
Subjt:  RSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELI

Query:  GMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVV
         +Y K GRMGDA  +F +M +++ +SWN M+ GF+  YD + A ++F  M+ E  +P+ VTWTS+LS H++CG  E  +  F  MRM G   + E LAV 
Subjt:  GMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVV

Query:  LSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVI
         SVCA+L  L   + +HGY+IKGGFE+YL ++NALI VYGK G ++DAE LF ++  K + SWN+LI+S+ ++G  D+A  +FS+LE+M+    +K NV+
Subjt:  LSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVI

Query:  TWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDL
        TW++VI G   +G G++SLE FRQMQ + V ANSVTI  + S+CA L ALNLGRE+HGHVIR  M +NILV N L+NMY KCG    G LVFE + ++DL
Subjt:  TWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDL

Query:  ISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQ
        ISWNS+I GYG HG  + AL+ FD+MI SGF PD +  +A LSACSHAGLV +GR +F  M + F ++PQ EHYAC+VDLLGR G ++EAS IVK MP++
Subjt:  ISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQ

Query:  PNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEG-LEKVD
        P   V  ALLNSCRMHK+ DIAE  A Q+S L  E  GS+MLLSNI++A  RWE+SA VR LA+ K L  V G SWIEVKKK Y F +G+ ++   E + 
Subjt:  PNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEG-LEKVD

Query:  EILHDLALQI-------EGHDFDDSI
         +L DL   +       +G++++D +
Subjt:  EILHDLALQI-------EGHDFDDSI

AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 8.7e-115; % identity: 34.43)
Query:  QVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSF
        Q H+  + +GA    +++A+L++ Y+      DA  V  + P      +  ++S+I A        + + ++ +M   G++ D    P + +  + L +F
Subjt:  QVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSF

Query:  NLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGH
         + K +HC     G      V   +  MY + GRMGDA+KVFD+M  K VV+ + ++  +A    ++   R+  +MES G+E N V+W  +LS   R G+
Subjt:  NLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGH

Query:  LEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESG
         +  + +F ++   G       ++ VL    D   L+ G++IHGY+IK G        +A+I +YGK G +     LF++ E+      NA I+  + +G
Subjt:  LEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESG

Query:  LCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNG
        L DKA E+F   ++      M+ NV++W+++I G A  G   E+LE+FR+MQ+A VK N VTI S+   C  +AAL  GR  HG  +R  + DN+ VG+ 
Subjt:  LCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNG

Query:  LINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHY
        LI+MY KCG      +VF  +  ++L+ WNS++ G+  HG  K+ ++ F+ ++++   PD ++F + LSAC   GL  EG   F  M + + IKP++EHY
Subjt:  LINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHY

Query:  ACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGR
        +CMV+LLGRAG ++EA +++K MP +P++ VW ALLNSCR+  + D+AE  A ++ +L  E  G+++LLSNI+AA   W +   +R    + GL   PG 
Subjt:  ACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGR

Query:  SWIEVKKKVYMFKAGNSI-----EGLEKVDEILHDL
        SWI+VK +VY   AG+       +  EK+DEI  ++
Subjt:  SWIEVKKKVYMFKAGNSI-----EGLEKVDEILHDL

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 8.2e-105; % identity: 31.02)
Query:  SITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAP----------FEGLSNL----------------
        S  + +LLDS    ++      + + VH++ + +G     F+  RL+  Y++ G + D R+VFD  P            GL+ L                
Subjt:  SITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAP----------FEGLSNL----------------

Query:  --LLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRL
            WNS++     H  C E L  +  M + G + + ++F  VL A S L   N    +H  + +  F + +++ + L+ MY+K G + DAQ+VFD+M  
Subjt:  --LLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRL

Query:  KSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYII
        ++VVSWN++++ F  N     A  +F  M    VEP+ VT                                   LA V+S CA L+ +  GQ +HG ++
Subjt:  KSVVSWNTMVSGFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYII

Query:  KGG-FEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLE
        K     + +   NA + +Y K   I++A  +F  M ++N+++  ++IS YA +     A  +F+++ +         NV++W+A+I G+   G  EE+L 
Subjt:  KGG-FEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLE

Query:  VFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIR------ALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHG
        +F  ++   V     + +++   CA LA L+LG + H HV++      +  +D+I VGN LI+MY KCG  + G LVF K+  RD +SWN+MI G+  +G
Subjt:  VFRQMQLADVKANSVTISSVFSVCAMLAALNLGREMHGHVIR------ALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHG

Query:  LGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCR
         G +AL  F +M++SG  PD +T I  LSAC HAG V EGR  F  M ++F + P  +HY CMVDLLGRAG +EEA ++++ MP+QP++ +W +LL +C+
Subjt:  LGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCR

Query:  MHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLAL
        +H++  + +  A ++  +     G ++LLSN++A   +WED   VR   R +G+   PG SWI+++   ++F   +     +K    L D+ +
Subjt:  MHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLAL

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885) (e value: 3.2e-101; % identity: 31.33)
Query:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF
        L+ C      K  H +    G         +LV+    +  R  L F A++VF+ +  E      ++NS+IR   S G C E + L+ +M   G+  D +
Subjt:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF

Query:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN
        TFP  L A +   +      +H  +V+ G+   L V N L+  YA+ G +  A+KVFD+M  ++VVSW +M+ G+A  ++  D     F  +  E V PN
Subjt:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN

Query:  PVTWTSLLSSHARCGHLEGTMALFSRMRMKGI------------------------------------------------GATAEMLAV-----------
         VT   ++S+ A+   LE    +++ +R  GI                                                G T E L V           
Subjt:  PVTWTSLLSSHARCGHLEGTMALFSRMRMKGI------------------------------------------------GATAEMLAV-----------

Query:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY
                +S C+ L  +  G+  HGY+++ GFE +    NALI +Y K      A ++F  M  K +V+WN++++ Y E+G  D A+E F      +  
Subjt:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY

Query:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLV
        PE   N+++W+ +I G     L EE++EVF  MQ  + V A+ VT+ S+ S C  L AL+L + ++ ++ +  +  ++ +G  L++M+++CG+ +    +
Subjt:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLV

Query:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS
        F  L NRD+ +W + I      G  + A+  FD MI+ G  PD V F+ AL+ACSH GLV +G+ +F  ML+   + P+  HY CMVDLLGRAGL+EEA 
Subjt:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS

Query:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGN
         +++ MP++PN  +W++LL +CR+  + ++A   A +I  L  E  GS++LLSN++A+  RW D A+VR+  + KGL   PG S I+++ K + F +G+
Subjt:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGN

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification (e value: 3.2e-101; % identity: 31.33)
Query:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF
        L+ C      K  H +    G         +LV+    +  R  L F A++VF+ +  E      ++NS+IR   S G C E + L+ +M   G+  D +
Subjt:  LRQCSGTQHCKQVHSATVVTGACCSAFVAARLVS----VYARSGLVFDARKVFDTAPFEGLSNLLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGF

Query:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN
        TFP  L A +   +      +H  +V+ G+   L V N L+  YA+ G +  A+KVFD+M  ++VVSW +M+ G+A  ++  D     F  +  E V PN
Subjt:  TFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVSGFA-YNYDVDGASRMFLKMESEGVEPN

Query:  PVTWTSLLSSHARCGHLEGTMALFSRMRMKGI------------------------------------------------GATAEMLAV-----------
         VT   ++S+ A+   LE    +++ +R  GI                                                G T E L V           
Subjt:  PVTWTSLLSSHARCGHLEGTMALFSRMRMKGI------------------------------------------------GATAEMLAV-----------

Query:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY
                +S C+ L  +  G+  HGY+++ GFE +    NALI +Y K      A ++F  M  K +V+WN++++ Y E+G  D A+E F      +  
Subjt:  -------VLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKGGDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVY

Query:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLV
        PE   N+++W+ +I G     L EE++EVF  MQ  + V A+ VT+ S+ S C  L AL+L + ++ ++ +  +  ++ +G  L++M+++CG+ +    +
Subjt:  PEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLAD-VKANSVTISSVFSVCAMLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLV

Query:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS
        F  L NRD+ +W + I      G  + A+  FD MI+ G  PD V F+ AL+ACSH GLV +G+ +F  ML+   + P+  HY CMVDLLGRAGL+EEA 
Subjt:  FEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQMLQNFRIKPQMEHYACMVDLLGRAGLIEEAS

Query:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGN
         +++ MP++PN  +W++LL +CR+  + ++A   A +I  L  E  GS++LLSN++A+  RW D A+VR+  + KGL   PG S I+++ K + F +G+
Subjt:  NIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVPGRSWIEVKKKVYMFKAGN


Sequences
CDS sequence
ATGCTGTATGCTTCTTCTTATCAGCGATTCAAATCAGCTTCCTTTTGTTTTCCCCAATCCTCAATCAATTTCCCCTTTCATTTTCGGTTCTATTTCAAATCCCATCTCTT
CTCAATCACGTATGACGAGCTCCTTGATTCCTTTGATCATCTTCTTCGGCAATGTAGCGGGACTCAACATTGCAAACAAGTTCATTCTGCAACCGTTGTCACCGGCGCGT
GTTGCTCGGCGTTCGTCGCCGCCCGGCTTGTATCCGTCTACGCCCGTTCTGGGCTTGTTTTCGATGCCCGGAAAGTGTTTGATACTGCGCCATTTGAAGGCTTGTCGAAC
TTGCTCTTATGGAATTCGATTATAAGAGCAAATGTCTCCCATGGGTACTGCGGAGAAGTGCTTCAACTTTATGGGAAAATGAGAAGATTAGGGGTTTTGGGGGATGGGTT
TACTTTTCCTCTGGTTCTGAGGGCTTCTTCCAACTTGGGTAGTTTCAACTTGTGTAAGAATCTTCATTGCCATGTTGTGCAATTTGGATTCCAAAATCATTTGCATGTTG
TGAATGAATTGATCGGAATGTATGCCAAGCTCGGACGAATGGGTGATGCCCAGAAAGTGTTTGATAAAATGCGCCTTAAAAGTGTAGTTTCTTGGAACACTATGGTTTCT
GGTTTTGCCTATAATTATGATGTTGATGGTGCTTCTAGGATGTTCCTTAAAATGGAGTCAGAAGGGGTTGAACCGAACCCTGTAACTTGGACTTCGTTGTTGTCTAGTCA
TGCTCGGTGTGGTCATCTTGAAGGAACCATGGCCTTGTTTAGCAGGATGAGGATGAAAGGTATTGGTGCCACTGCTGAAATGCTTGCTGTGGTGTTATCTGTTTGTGCTG
ATTTAGCTACATTGGATAGGGGTCAGATGATTCATGGGTATATAATAAAGGGAGGTTTTGAAGATTATTTGTTCGCCAAAAACGCACTTATAACTGTATATGGAAAAGGA
GGAGACATAAGAGATGCAGAGAAGTTATTTCATGAGATGGAAGTGAAGAATCTTGTGAGTTGGAACGCTCTCATTTCCTCCTATGCTGAATCTGGTTTATGTGACAAGGC
TTTTGAAGTGTTTTCTCAGCTGGAGAAAATGGATGTCTATCCAGAGATGAAACCGAACGTCATAACTTGGAGTGCTGTCATTTGTGGATTTGCTTCCAAGGGACTAGGAG
AAGAATCTTTGGAAGTTTTTCGTCAAATGCAGCTTGCAGATGTTAAAGCGAATTCAGTGACGATATCTAGTGTTTTTTCAGTTTGTGCAATGCTGGCAGCTCTGAATCTT
GGTAGGGAAATGCATGGTCACGTGATTAGAGCTCTGATGGATGATAACATATTGGTAGGAAACGGATTGATTAACATGTATACGAAATGTGGAAATTTCAAGCCAGGTTG
TTTAGTGTTCGAAAAACTTGAAAATCGAGATTTGATCTCATGGAACTCAATGATTGCAGGATATGGATTTCATGGACTTGGTAAAGATGCTCTTACAGCTTTTGATCAGA
TGATTAAATCTGGATTTATACCAGATGATGTCACCTTTATTGCTGCGCTATCTGCTTGTAGTCATGCTGGTCTTGTTGCTGAAGGCCGTTGGCTTTTTGATCAGATGCTA
CAAAACTTCAGGATCAAACCTCAGATGGAGCACTATGCGTGCATGGTCGATCTTCTCGGTCGTGCTGGTCTCATTGAAGAAGCAAGTAACATAGTCAAAGGCATGCCAAT
TCAACCCAATGCTTATGTCTGGAGTGCTCTTCTCAACTCTTGCAGAATGCACAAAGATACAGACATTGCAGAAGAGACTGCCCTACAAATTTCCAATCTGAATTCCGAGA
TCATAGGGAGCCATATGTTGCTCTCAAATATATTTGCTGCATGCTCTAGATGGGAGGATTCTGCAAGGGTGAGGATCTTGGCCAGGACGAAGGGCTTAATGATAGTTCCT
GGGCGTAGCTGGATTGAGGTGAAGAAGAAGGTTTATATGTTCAAAGCAGGAAACTCAATAGAAGGTCTAGAGAAAGTTGATGAAATTCTTCATGATTTGGCTCTTCAGAT
AGAAGGTCATGATTTTGATGATAGTATTATTGAA
mRNA sequence
ATGCTGTATGCTTCTTCTTATCAGCGATTCAAATCAGCTTCCTTTTGTTTTCCCCAATCCTCAATCAATTTCCCCTTTCATTTTCGGTTCTATTTCAAATCCCATCTCTT
CTCAATCACGTATGACGAGCTCCTTGATTCCTTTGATCATCTTCTTCGGCAATGTAGCGGGACTCAACATTGCAAACAAGTTCATTCTGCAACCGTTGTCACCGGCGCGT
GTTGCTCGGCGTTCGTCGCCGCCCGGCTTGTATCCGTCTACGCCCGTTCTGGGCTTGTTTTCGATGCCCGGAAAGTGTTTGATACTGCGCCATTTGAAGGCTTGTCGAAC
TTGCTCTTATGGAATTCGATTATAAGAGCAAATGTCTCCCATGGGTACTGCGGAGAAGTGCTTCAACTTTATGGGAAAATGAGAAGATTAGGGGTTTTGGGGGATGGGTT
TACTTTTCCTCTGGTTCTGAGGGCTTCTTCCAACTTGGGTAGTTTCAACTTGTGTAAGAATCTTCATTGCCATGTTGTGCAATTTGGATTCCAAAATCATTTGCATGTTG
TGAATGAATTGATCGGAATGTATGCCAAGCTCGGACGAATGGGTGATGCCCAGAAAGTGTTTGATAAAATGCGCCTTAAAAGTGTAGTTTCTTGGAACACTATGGTTTCT
GGTTTTGCCTATAATTATGATGTTGATGGTGCTTCTAGGATGTTCCTTAAAATGGAGTCAGAAGGGGTTGAACCGAACCCTGTAACTTGGACTTCGTTGTTGTCTAGTCA
TGCTCGGTGTGGTCATCTTGAAGGAACCATGGCCTTGTTTAGCAGGATGAGGATGAAAGGTATTGGTGCCACTGCTGAAATGCTTGCTGTGGTGTTATCTGTTTGTGCTG
ATTTAGCTACATTGGATAGGGGTCAGATGATTCATGGGTATATAATAAAGGGAGGTTTTGAAGATTATTTGTTCGCCAAAAACGCACTTATAACTGTATATGGAAAAGGA
GGAGACATAAGAGATGCAGAGAAGTTATTTCATGAGATGGAAGTGAAGAATCTTGTGAGTTGGAACGCTCTCATTTCCTCCTATGCTGAATCTGGTTTATGTGACAAGGC
TTTTGAAGTGTTTTCTCAGCTGGAGAAAATGGATGTCTATCCAGAGATGAAACCGAACGTCATAACTTGGAGTGCTGTCATTTGTGGATTTGCTTCCAAGGGACTAGGAG
AAGAATCTTTGGAAGTTTTTCGTCAAATGCAGCTTGCAGATGTTAAAGCGAATTCAGTGACGATATCTAGTGTTTTTTCAGTTTGTGCAATGCTGGCAGCTCTGAATCTT
GGTAGGGAAATGCATGGTCACGTGATTAGAGCTCTGATGGATGATAACATATTGGTAGGAAACGGATTGATTAACATGTATACGAAATGTGGAAATTTCAAGCCAGGTTG
TTTAGTGTTCGAAAAACTTGAAAATCGAGATTTGATCTCATGGAACTCAATGATTGCAGGATATGGATTTCATGGACTTGGTAAAGATGCTCTTACAGCTTTTGATCAGA
TGATTAAATCTGGATTTATACCAGATGATGTCACCTTTATTGCTGCGCTATCTGCTTGTAGTCATGCTGGTCTTGTTGCTGAAGGCCGTTGGCTTTTTGATCAGATGCTA
CAAAACTTCAGGATCAAACCTCAGATGGAGCACTATGCGTGCATGGTCGATCTTCTCGGTCGTGCTGGTCTCATTGAAGAAGCAAGTAACATAGTCAAAGGCATGCCAAT
TCAACCCAATGCTTATGTCTGGAGTGCTCTTCTCAACTCTTGCAGAATGCACAAAGATACAGACATTGCAGAAGAGACTGCCCTACAAATTTCCAATCTGAATTCCGAGA
TCATAGGGAGCCATATGTTGCTCTCAAATATATTTGCTGCATGCTCTAGATGGGAGGATTCTGCAAGGGTGAGGATCTTGGCCAGGACGAAGGGCTTAATGATAGTTCCT
GGGCGTAGCTGGATTGAGGTGAAGAAGAAGGTTTATATGTTCAAAGCAGGAAACTCAATAGAAGGTCTAGAGAAAGTTGATGAAATTCTTCATGATTTGGCTCTTCAGAT
AGAAGGTCATGATTTTGATGATAGTATTATTGAA
Protein sequence
MLYASSYQRFKSASFCFPQSSINFPFHFRFYFKSHLFSITYDELLDSFDHLLRQCSGTQHCKQVHSATVVTGACCSAFVAARLVSVYARSGLVFDARKVFDTAPFEGLSN
LLLWNSIIRANVSHGYCGEVLQLYGKMRRLGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLGRMGDAQKVFDKMRLKSVVSWNTMVS
GFAYNYDVDGASRMFLKMESEGVEPNPVTWTSLLSSHARCGHLEGTMALFSRMRMKGIGATAEMLAVVLSVCADLATLDRGQMIHGYIIKGGFEDYLFAKNALITVYGKG
GDIRDAEKLFHEMEVKNLVSWNALISSYAESGLCDKAFEVFSQLEKMDVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQLADVKANSVTISSVFSVCAMLAALNL
GREMHGHVIRALMDDNILVGNGLINMYTKCGNFKPGCLVFEKLENRDLISWNSMIAGYGFHGLGKDALTAFDQMIKSGFIPDDVTFIAALSACSHAGLVAEGRWLFDQML
QNFRIKPQMEHYACMVDLLGRAGLIEEASNIVKGMPIQPNAYVWSALLNSCRMHKDTDIAEETALQISNLNSEIIGSHMLLSNIFAACSRWEDSARVRILARTKGLMIVP
GRSWIEVKKKVYMFKAGNSIEGLEKVDEILHDLALQIEGHDFDDSIIE
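
The CDS and protein sequences listed above should be related by the standard genetic code. The following is a minimal sketch of that check, not part of the database record: the function name `translate` is hypothetical, the standard (NCBI table 1) code is assumed, and only the first ten codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence (hypothetical helper); '*' marks a stop codon."""
    # Drop line breaks and spaces so a multi-line listing can be pasted in directly.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the MC08g1094 CDS as listed above.
cds_prefix = "ATGCTGTATGCTTCTTCTTATCAGCGATTC"
protein_prefix = translate(cds_prefix)
print(protein_prefix)  # MLYASSYQRF, the first ten residues of the protein listing
```

Pasting the full CDS listing into `translate` should reproduce the full protein listing under the same assumption. The CDS as listed ends in the codon GAA (E), which matches the final residue of the protein, so no terminal stop codon is expected in the output.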