; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0001326 (gene) of Chayote v1 genome

Gene IDSed0001326
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat (PPR-like) superfamily protein
Genome locationLG07:7626756..7628894
RNA-Seq ExpressionSed0001326
SyntenySed0001326
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606718.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.2Show/hide
Query:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV
        MLYASSYQ F S  F FPRF++NFPS+FR YFNSHFSSITYDNELLD FD LLR+CNGI+HCKQ HSATVVTGACSSAFVAARLVSVYARSGFVF ARKV
Subjt:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV

Query:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD
        FD+ PF GLSNLLLWNSIIRANV +GY REAL+LYG+M+N GVLADGFTFPLVL+AS+NLG FNLCK+LHCHVVQFGF NHLHV NEL+GMY KLRRMDD
Subjt:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD

Query:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN
        ARKVFDKM +K ++SWN MVSGYAYNYDVNGA R+FLQME EG+ PNPVTWTSLLSSHARCGHLEET+ LFSKMR KGVGA AEMLAVVLSVCADLATL+
Subjt:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN

Query:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS
        RGQM+HGYIVKGGFEDYLFAKNAL+TVYGKGGD+R+AEKLFHEMKVK+LVSWNSLISSY+ESGLYDKAFEAFS+L +ME  PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQ+ANVKANSVTISSVLS+CAMLAALNLGRE+HGHV RA+M+DNILVGNGLINMYTKCGSFKPGCLVF KLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN
        MHGLGK+AL  FDEM+KSGF+PDDVTFIAALS+CSHAGL+AEGRWLF +ML+N+KIKPQMEHYACMVDLLGRAGLVEEASNIVK MPIKPNAY+WSALLN
Subjt:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN

Query:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE
        SCRMHKDT+LAEET S+I +LNSEITGSHMLLSNIF+ASCRWEDSARVRISARMKGLKKVPGCSWIE+KK+VYMFK+GNS+QE L RVDEILHDLA QIE
Subjt:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE

Query:  SHDFDDCIID
        S DFDD II+
Subjt:  SHDFDDCIID

XP_022148095.1 putative pentatricopeptide repeat-containing protein At1g17630 [Momordica charantia]0.0e+0084.65Show/hide
Query:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV
        MLYASSYQ FKS  FCFP+ SINFP +FRFYF SH  SITYD ELLD FDHLLR+C+G QHCKQ HSATVVTGAC SAFVAARLVSVYARSG VF ARKV
Subjt:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV

Query:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD
        FD+APF GLSNLLLWNSIIRANVS+GYC E L+LYG+MR +GVL DGFTFPLVL+AS+NLGSFNLCKNLHCHVVQFGFQNHLHV NELIGMYAKL RM D
Subjt:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD

Query:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN
        A+KVFDKM LK +VSWN MVSG+AYNYDV+GA R+FL+MESEG+ PNPVTWTSLLSSHARCGHLE TM LFS+MR KG+GA AEMLAVVLSVCADLATL+
Subjt:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN

Query:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS
        RGQMIHGYI+KGGFEDYLFAKNAL+TVYGKGGDIR+AEKLFHEM+VK+LVSWN+LISSY+ESGL DKAFE FSQLEKM+VYPEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
        KGLGEESLEVFRQMQ+A+VKANSVTISSV SVCAMLAALNLGRE+HGHV RA MDDNILVGNGLINMYTKCG+FKPGCLVFEKLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN
         HGLGK+AL  FD+M+KSGF PDDVTFIAALS+CSHAGL+AEGRWLF +ML+N++IKPQMEHYACMVDLLGRAGL+EEASNIVKGMPI+PNAYVWSALLN
Subjt:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN

Query:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE
        SCRMHKDT++AEETA +I +LNSEI GSHMLLSNIFAA  RWEDSARVRI AR KGL  VPG SWIE+KKKVYMFKAGNSI E L +VDEILHDLA QIE
Subjt:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE

Query:  SHDFDDCIID
         HDFDD II+
Subjt:  SHDFDDCIID

XP_022949499.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita moschata]0.0e+0086.06Show/hide
Query:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV
        MLYASSYQ F S  F FPRF+INFPS+FR YFNSHFSSITYDN+LLD FD LLR+CNGI+HCKQ HSAT+VTGACSSAFVAARLVSVYARSGFVF ARKV
Subjt:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV

Query:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD
        FD+ PF  L NLLLWNSIIRANV +GY REAL+LYG+M+N GVLADGFTFPLVL+AS+NLG FNLCK+LHCHVVQFGF NHLHV NEL+GMY KLRRMDD
Subjt:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD

Query:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN
        ARKVFDKM +K ++SWN MVSGYAYNYDVNGA R+FLQME EG+ PNPVTWTSLLSSHARCGHLEET+ LFSKMR KGVGA AEMLAVVLSVCADLATL+
Subjt:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN

Query:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS
        RGQM+HGYIVKGGFEDYLFAKNAL+TVYGKGGDIR+AEKLFHEMKVK+LVSWNSLISSY+ESGLYDKAFEAFS+LEKM   PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQ+ANVKANSVTISSVLS+CAMLAALNLGRE+HGHV RA+M+DNILVGNGLINMYTKCGSFKPGCLVF KLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN
        MHGLGK+AL  FDEM+KSGF+PDDVTFIAALS+CSHAGL+AEGRWLF +ML+N+KIKPQMEHYACMVDLLGRAGLVEEASNIVK MPIKPNAY+WSALLN
Subjt:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN

Query:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE
        SCRMHKDT+LAEET S+I +LNSEITGSHMLLSNIF+ASCRWEDSARVRISARMKGLKKVPGCSWIE+KK+VYMFK+GNS+QE L RVDEILHDLA QIE
Subjt:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE

Query:  SHDFDDCIID
        S DFDD II+
Subjt:  SHDFDDCIID

XP_022997825.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita maxima]0.0e+0086.76Show/hide
Query:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV
        MLYASSYQ F S  FCFPRF+INFPS+FR YFNSHFSSITYD+ELLD FD LLR+CNGI+HCKQ HS TVV GACSSAFVAARLVSVYARSGFVF ARKV
Subjt:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV

Query:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD
        FD+ PF GLSNLLLWNSIIRANV +GYCREAL+LYG+MRN GVLADGFTFPLVL+AS+NLG FNLCK LHCHVVQFGFQNHLHV NELIGMY KLRRM D
Subjt:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD

Query:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN
        ARKVFDKM +K ++SWN MVSGYAYNYDVNGA+R+FLQME EG+ PNPVTWTSLLSSHARCGHLEET+ LFSKMR KGVGA AEMLAVVLSVCADLATL+
Subjt:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN

Query:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS
        RGQM+HGYIVKGGFEDYLFAKNAL+TVYGKGGDIR+AEKLFHEMKVK+LVSWNSLISSY+ESGLYDKAFEAFS+LEKME  PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQ+ANVKANSVTISSVLS+CAMLAALNLGRE+HGHV RA+M+DNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN
        MHGLGK+AL  FDEM+KSGF+PDDVTFIAALS+CSHAGL+AEGRWLF +ML+N+KIKPQMEHYACMVDLLGRAGLVEEASN+VK MPIKPN Y+WSALLN
Subjt:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN

Query:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE
        SCRMHK T+LAEET S+I +LNSEI GSHMLLSNIF+ASCRWEDSARVRISARMKGLKKVPGCSWIE+KKKVYMFKAGNS+QE L RVDEILHDLA QIE
Subjt:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE

Query:  SHDFDDCIID
        S DFDD II+
Subjt:  SHDFDDCIID

XP_023523771.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita pepo subsp. pepo]0.0e+0086.34Show/hide
Query:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV
        MLYASSYQ F S  FCFPRF+INFPS+FR+YFNS FSSITYD+ELLD FD LLR+CNGI+HCKQ HSATVVTGACSSAFVAARLVSVYARSGFVF ARKV
Subjt:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV

Query:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD
        FD+ PF GLSNLLLWNSIIRANV +GY REAL+LYG+MRN GVLADGFTFPLVL+AS+NLG FNLCK+LHCHVVQFGFQNHLHV NEL+GMY KLRRMDD
Subjt:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD

Query:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN
        ARKVFDKM +K ++SWN MVSGYAYNYDVNGA R+FLQME EG+ PNPVTWTSLLSSHARCGHLEET+ LFSKMR KGVGA AEMLAVVLSVCADL T +
Subjt:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN

Query:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS
        RGQM+HGYIVKGGFEDYLFAKNAL+TVYGKGGDIR+AEKLFHEMKVK+LVSWNSLISSY+ESGLYDKAFEAFS+LEKME  PEMKP+VITWSAVICGFAS
Subjt:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQ+ANVKANSVTISSVLS+CAMLAALNLGRE+HGHV RA+M+DNILVGNGLINMYTKCGSFKPGCLVF KLENRDLISWNS+IAGYG
Subjt:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN
        MHGLGK+AL  FDEM+KSGF+PDDVTFIAALS+CSHAGL+AEGRWLF +ML+N+KIKPQMEHYACMVDLLGRAGLVEEASNIVK MPIKPNAY+WSALLN
Subjt:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN

Query:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE
        SCRMHKDT+LAEET S+I  L+SEITGSHMLLSNI++ASCRWEDSARVRISARMKGLKKVPGCSWIE+KKKVYMFKAGNS+QE L RVDEILHDLA QIE
Subjt:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE

Query:  SHDFDDCIID
        S DFDD II+
Subjt:  SHDFDDCIID

TrEMBL top hitse value%identityAlignment
A0A0A0LFT1 Uncharacterized protein0.0e+0080.14Show/hide
Query:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV
        ML ASSYQ FKSV FCFP  SIN        F+S FSSITYD +L DFFDHLLR+CNGIQH KQ HSATVVTGA  SAFV+ARLVS+Y+R G V  ARKV
Subjt:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV

Query:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD
        F SAPF   SN LLWNSIIRANV +GYC EAL+LYG+MRN GVL DGFTFPL+L+AS+NLG+FN+CKNLHCHVVQFGFQNHLHVGNELIGMYAKL RMDD
Subjt:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD

Query:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN
        ARKVFDKM +K +VSWN MVSGYAYNYDVNGA R+F QME EG+ PNPVTWTSLLSSHARCGHLEETM LF KMR KGVG  AEMLAVVLSVCADLATLN
Subjt:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN

Query:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS
         GQMIHGY+VKGGF DYLFAKNAL+T+YGKGG + +AEKLFHEMKVK+LVSWN+LISS++ESG+YDKA E  SQLEKME YPEMKPNVITWSA+ICGFAS
Subjt:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
        KGLGEESLEVFR+MQ+ANVKANSVTI+SVLS+CAMLAALNLGRE+HGHV RA+MDDN+LVGNGLINMYTKCGSFKPG +VFEKLENRD ISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN
         HGLGK+ALA F+ M+KSG++PD VTFIAALS+CSHAGL+AEG WLF +M +N+KI+P++EHYACMVDLLGRAGLVEEASNI+KGMP++PNAY+WS+LLN
Subjt:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN

Query:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE
        SCRMHKDT+LAEE A++I +LNS+ITGSHMLLSNIFAASCRWEDSARVRISAR KGLKKVPG SWIE+KKKVYMFKAG +I E L +VDEILHDLAFQIE
Subjt:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE

Query:  SHDFDDCIID
        +++ DD II+
Subjt:  SHDFDDCIID

A0A5A7U7B1 Putative pentatricopeptide repeat-containing protein0.0e+0079.3Show/hide
Query:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV
        ML A SYQ FKSV FCFP  SIN        F+S FSSITYD +L +FFDHLLR+CNGIQH KQ HSATVVTGA  SAFV+ARLVS+Y+R G V  ARKV
Subjt:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV

Query:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD
        F SAPF  LSN LLWNSIIRANV +GYC EAL LYG+MRN GVL DGFTFPLVL+AS+NLG+ ++CKNLHCHVVQFGFQNHLHVGNELIGMYAKL RMDD
Subjt:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD

Query:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN
        ARKVFDKM +K +VSWN MVSGYAYNYDVNGA R+F QME EG+ PNPVTWTSLLSSHARCGHL ETM LF KMR KGVGA AEMLAVVLSVCADLATLN
Subjt:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN

Query:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS
         GQMIHGY+VKGGF DYLFAKNAL+T+YGKGGD+ +AEKLFHEMKVK+LVSWN+LISS++ESG+YDKA E  SQLEKME YPEMKPNVITWS++ICGF+S
Subjt:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
        KGLGEESLEVFR+MQ+ANVKANSVTI+SVLS+CAMLAALNLGRE+HGHV RA+MD+N+LVGNGLINMYTKCGSFKPG LVFEKLENRD ISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN
         HGLGK+ALA  + M+KSG++PD VTFIAALS+CSHAGL+AEG WLF +M +N+KI+P++EHYACMVDLLGRAGLVEEASNI+K MP++PNAY+WS+LLN
Subjt:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN

Query:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE
        SCRMHKDT+LAEE A++I +LNS+ITGSHMLLSNIFAASCRWEDSARVRISAR+KGLKKVPG SWIE+KKKVY+FKAG +  E L +VDEILHDLAFQIE
Subjt:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE

Query:  SHDFDDCIID
        ++D DD II+
Subjt:  SHDFDDCIID

A0A6J1D1Z6 putative pentatricopeptide repeat-containing protein At1g176300.0e+0084.65Show/hide
Query:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV
        MLYASSYQ FKS  FCFP+ SINFP +FRFYF SH  SITYD ELLD FDHLLR+C+G QHCKQ HSATVVTGAC SAFVAARLVSVYARSG VF ARKV
Subjt:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV

Query:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD
        FD+APF GLSNLLLWNSIIRANVS+GYC E L+LYG+MR +GVL DGFTFPLVL+AS+NLGSFNLCKNLHCHVVQFGFQNHLHV NELIGMYAKL RM D
Subjt:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD

Query:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN
        A+KVFDKM LK +VSWN MVSG+AYNYDV+GA R+FL+MESEG+ PNPVTWTSLLSSHARCGHLE TM LFS+MR KG+GA AEMLAVVLSVCADLATL+
Subjt:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN

Query:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS
        RGQMIHGYI+KGGFEDYLFAKNAL+TVYGKGGDIR+AEKLFHEM+VK+LVSWN+LISSY+ESGL DKAFE FSQLEKM+VYPEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
        KGLGEESLEVFRQMQ+A+VKANSVTISSV SVCAMLAALNLGRE+HGHV RA MDDNILVGNGLINMYTKCG+FKPGCLVFEKLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN
         HGLGK+AL  FD+M+KSGF PDDVTFIAALS+CSHAGL+AEGRWLF +ML+N++IKPQMEHYACMVDLLGRAGL+EEASNIVKGMPI+PNAYVWSALLN
Subjt:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN

Query:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE
        SCRMHKDT++AEETA +I +LNSEI GSHMLLSNIFAA  RWEDSARVRI AR KGL  VPG SWIE+KKKVYMFKAGNSI E L +VDEILHDLA QIE
Subjt:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE

Query:  SHDFDDCIID
         HDFDD II+
Subjt:  SHDFDDCIID

A0A6J1GD03 putative pentatricopeptide repeat-containing protein At1g176300.0e+0086.06Show/hide
Query:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV
        MLYASSYQ F S  F FPRF+INFPS+FR YFNSHFSSITYDN+LLD FD LLR+CNGI+HCKQ HSAT+VTGACSSAFVAARLVSVYARSGFVF ARKV
Subjt:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV

Query:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD
        FD+ PF  L NLLLWNSIIRANV +GY REAL+LYG+M+N GVLADGFTFPLVL+AS+NLG FNLCK+LHCHVVQFGF NHLHV NEL+GMY KLRRMDD
Subjt:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD

Query:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN
        ARKVFDKM +K ++SWN MVSGYAYNYDVNGA R+FLQME EG+ PNPVTWTSLLSSHARCGHLEET+ LFSKMR KGVGA AEMLAVVLSVCADLATL+
Subjt:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN

Query:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS
        RGQM+HGYIVKGGFEDYLFAKNAL+TVYGKGGDIR+AEKLFHEMKVK+LVSWNSLISSY+ESGLYDKAFEAFS+LEKM   PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQ+ANVKANSVTISSVLS+CAMLAALNLGRE+HGHV RA+M+DNILVGNGLINMYTKCGSFKPGCLVF KLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN
        MHGLGK+AL  FDEM+KSGF+PDDVTFIAALS+CSHAGL+AEGRWLF +ML+N+KIKPQMEHYACMVDLLGRAGLVEEASNIVK MPIKPNAY+WSALLN
Subjt:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN

Query:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE
        SCRMHKDT+LAEET S+I +LNSEITGSHMLLSNIF+ASCRWEDSARVRISARMKGLKKVPGCSWIE+KK+VYMFK+GNS+QE L RVDEILHDLA QIE
Subjt:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE

Query:  SHDFDDCIID
        S DFDD II+
Subjt:  SHDFDDCIID

A0A6J1KAZ0 putative pentatricopeptide repeat-containing protein At1g176300.0e+0086.76Show/hide
Query:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV
        MLYASSYQ F S  FCFPRF+INFPS+FR YFNSHFSSITYD+ELLD FD LLR+CNGI+HCKQ HS TVV GACSSAFVAARLVSVYARSGFVF ARKV
Subjt:  MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKV

Query:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD
        FD+ PF GLSNLLLWNSIIRANV +GYCREAL+LYG+MRN GVLADGFTFPLVL+AS+NLG FNLCK LHCHVVQFGFQNHLHV NELIGMY KLRRM D
Subjt:  FDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDD

Query:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN
        ARKVFDKM +K ++SWN MVSGYAYNYDVNGA+R+FLQME EG+ PNPVTWTSLLSSHARCGHLEET+ LFSKMR KGVGA AEMLAVVLSVCADLATL+
Subjt:  ARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLN

Query:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS
        RGQM+HGYIVKGGFEDYLFAKNAL+TVYGKGGDIR+AEKLFHEMKVK+LVSWNSLISSY+ESGLYDKAFEAFS+LEKME  PEMKPNVITWSAVICGFAS
Subjt:  RGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
         G GEESLEVFRQMQ+ANVKANSVTISSVLS+CAMLAALNLGRE+HGHV RA+M+DNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG
Subjt:  KGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYG

Query:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN
        MHGLGK+AL  FDEM+KSGF+PDDVTFIAALS+CSHAGL+AEGRWLF +ML+N+KIKPQMEHYACMVDLLGRAGLVEEASN+VK MPIKPN Y+WSALLN
Subjt:  MHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLN

Query:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE
        SCRMHK T+LAEET S+I +LNSEI GSHMLLSNIF+ASCRWEDSARVRISARMKGLKKVPGCSWIE+KKKVYMFKAGNS+QE L RVDEILHDLA QIE
Subjt:  SCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIE

Query:  SHDFDDCIID
        S DFDD II+
Subjt:  SHDFDDCIID

SwissProt top hitse value%identityAlignment
Q9LFL5 Pentatricopeptide repeat-containing protein At5g168602.6e-11136.24Show/hide
Query:  HFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALEL
        H  S T DN    F        + ++  + AH+ ++VTG  S+ FV   LV++Y+R   +  ARKVFD      + +++ WNSII +    G  + ALE+
Subjt:  HFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALEL

Query:  YGEMRN-VGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAY
        +  M N  G   D  T   VL   A+LG+ +L K LHC  V      ++ VGN L+ MYAK   MD+A  VF  M +K +VSWNAMV+GY+       A 
Subjt:  YGEMRN-VGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAY

Query:  RVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGD
        R+F +M+ E I  + VTW++ +S +A+ G   E + +  +M + G+  N   L  VLS CA +  L  G+ IH Y +K   +         L   G G +
Subjt:  RVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGD

Query:  IRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQM--QVANVKANSVTISSVLS
                  M +      N LI  Y++    D A   F  L   E       +V+TW+ +I G++  G   ++LE+  +M  +    + N+ TIS  L 
Subjt:  IRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQM--QVANVKANSVTISSVLS

Query:  VCAMLAALNLGREIHGHVFRAQMDD-NILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAA
         CA LAAL +G++IH +  R Q +   + V N LI+MY KCGS     LVF+ +  ++ ++W S++ GYGMHG G+EAL IFDEM + GFK D VT +  
Subjt:  VCAMLAALNLGREIHGHVFRAQMDD-NILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAA

Query:  LSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHM
        L +CSH+G+I +G   F  M   + + P  EHYAC+VDLLGRAG +  A  +++ MP++P   VW A L+ CR+H   EL E  A +I +L S   GS+ 
Subjt:  LSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHM

Query:  LLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIES-----------HDFDD
        LLSN++A + RW+D  R+R   R KG+KK PGCSW+E  K    F  G+        + ++L D   +I+            HD DD
Subjt:  LLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIES-----------HDFDD

Q9LNP2 Putative pentatricopeptide repeat-containing protein At1g176301.7e-19248.24Show/hide
Query:  MLYASSYQTFKSVL------FCF-----PRFSINFPSNFRFYFNSHFSSITYDNE--LLDFFDHLLRRCNGIQHCKQAHSATVVTG-ACSSAFVAARLVS
        M++AS +Q    ++      FCF     P  SI+ P        S + S+T +N+  L  +FDHLL  C   Q C+Q H+  +++     S  +AA L+S
Subjt:  MLYASSYQTFKSVL------FCF-----PRFSINFPSNFRFYFNSHFSSITYDNE--LLDFFDHLLRRCNGIQHCKQAHSATVVTG-ACSSAFVAARLVS

Query:  VYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGN
        VYAR G +  AR VF++     LS+L LWNSI++ANVS+G    ALELY  MR  G+  DG+  PL+L+A   LG F LC+  H  V+Q G + +LHV N
Subjt:  VYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGN

Query:  ELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEML
        EL+ +Y K  RM DA  +F +M ++  +SWN M+ G++  YD   A ++F  M+ E   P+ VTWTS+LS H++CG  E+ +  F  MR  G   + E L
Subjt:  ELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEML

Query:  AVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKP
        AV  SVCA+L  L+  + +HGY++KGGFE+YL ++NAL+ VYGK G +++AE LF +++ K + SWNSLI+S+ ++G  D+A   FS+LE+M     +K 
Subjt:  AVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKP

Query:  NVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLEN
        NV+TW++VI G   +G G++SLE FRQMQ + V ANSVTI  +LS+CA L ALNLGREIHGHV R  M +NILV N L+NMY KCG    G LVFE + +
Subjt:  NVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLEN

Query:  RDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGM
        +DLISWNS+I GYGMHG  ++AL++FD M+ SGF PD +  +A LS+CSHAGL+ +GR +FY M + + ++PQ EHYAC+VDLLGR G ++EAS IVK M
Subjt:  RDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGM

Query:  PIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLG
        P++P   V  ALLNSCRMHK+ ++AE  AS++  L  E TGS+MLLSNI++A  RWE+SA VR  A+ K LKKV G SWIE+KKK Y F +G+ +Q    
Subjt:  PIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLG

Query:  RVDEILHDL
         +  +L DL
Subjt:  RVDEILHDL

Q9LNU6 Pentatricopeptide repeat-containing protein At1g202305.5e-12236.12Show/hide
Query:  QAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSF
        QAH+  + +GA +  +++A+L++ Y+       A  V  S P      +  ++S+I A        +++ ++  M + G++ D    P + K  A L +F
Subjt:  QAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSF

Query:  NLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGH
         + K +HC     G      V   +  MY +  RM DARKVFD+M  K +V+ +A++  YA    +    R+  +MES GI  N V+W  +LS   R G+
Subjt:  NLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGH

Query:  LEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESG
         +E + +F K+ + G   +   ++ VL    D   LN G++IHGY++K G        +A++ +YGK G +     LF++ ++      N+ I+  S +G
Subjt:  LEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESG

Query:  LYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNG
        L DKA E F   ++      M+ NV++W+++I G A  G   E+LE+FR+MQVA VK N VTI S+L  C  +AAL  GR  HG   R  + DN+ VG+ 
Subjt:  LYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNG

Query:  LINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEML-ENYKIKPQMEH
        LI+MY KCG      +VF  +  ++L+ WNS++ G+ MHG  KE ++IF+ +M++  KPD ++F + LS+C   GL  EG W +++M+ E Y IKP++EH
Subjt:  LINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEML-ENYKIKPQMEH

Query:  YACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPG
        Y+CMV+LLGRAG ++EA +++K MP +P++ VW ALLNSCR+  + +LAE  A ++F L  E  G+++LLSNI+AA   W +   +R      GLKK PG
Subjt:  YACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPG

Query:  CSWIELKKKVYMFKAGNSIQESLGRVDEILHDLA
        CSWI++K +VY   AG+     + ++ E + +++
Subjt:  CSWIELKKKVYMFKAGNSIQESLGRVDEILHDLA

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127704.3e-10632.73Show/hide
Query:  YFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCRE
        Y NS   S ++   L+D   H  +        KQ H+  +V G   S F+  +L+   +  G + +AR+VFD  P      +  WN+IIR    N + ++
Subjt:  YFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCRE

Query:  ALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHL--KVLVSWNAMVSGYAYNYD
        AL +Y  M+   V  D FTFP +LKA + L    + + +H  V + GF   + V N LI +YAK RR+  AR VF+ + L  + +VSW A+VS YA N +
Subjt:  ALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHL--KVLVSWNAMVSGYAYNYD

Query:  VNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFE---DYLFAKNALL
           A  +F QM    + P+   W +L+S                                VL+    L  L +G+ IH  +VK G E   D L + N   
Subjt:  VNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFE---DYLFAKNALL

Query:  TVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVT
        T+Y K G +  A+ LF +MK                                        PN+I W+A+I G+A  G   E++++F +M   +V+ ++++
Subjt:  TVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVT

Query:  ISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDV
        I+S +S CA + +L   R ++ +V R+   D++ + + LI+M+ KCGS +   LVF++  +RD++ W++MI GYG+HG  +EA++++  M + G  P+DV
Subjt:  ISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDV

Query:  TFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEI
        TF+  L +C+H+G++ EG W F+  + ++KI PQ +HYAC++DLLGRAG +++A  ++K MP++P   VW ALL++C+ H+  EL E  A ++F ++   
Subjt:  TFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEI

Query:  TGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEI
        TG ++ LSN++AA+  W+  A VR+  + KGL K  GCSW+E++ ++  F+ G+   +S  R +EI
Subjt:  TGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEI

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226906.4e-11031.39Show/hide
Query:  LRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSG---FVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFT
        L+ C  I   K  H +    G  +      +LV+     G    + +A++VF+++   G     ++NS+IR   S+G C EA+ L+  M N G+  D +T
Subjt:  LRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSG---FVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFT

Query:  FPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQM-ESEGIVPNP
        FP  L A A   +      +H  +V+ G+   L V N L+  YA+   +D ARKVFD+M  + +VSW +M+ GYA       A  +F +M   E + PN 
Subjt:  FPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQM-ESEGIVPNP

Query:  VTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVV-----------------------------------------------------------
        VT   ++S+ A+   LE    +++ +RN G+  N  M++ +                                                           
Subjt:  VTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVV-----------------------------------------------------------

Query:  -------LSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYP
               +S C+ L  +  G+  HGY+++ GFE +    NAL+ +Y K      A ++F  M  K++V+WNS+++ Y E+G  D A+E F      E  P
Subjt:  -------LSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYP

Query:  EMKPNVITWSAVICGFASKGLGEESLEVFRQMQ-VANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVF
        E   N+++W+ +I G     L EE++EVF  MQ    V A+ VT+ S+ S C  L AL+L + I+ ++ +  +  ++ +G  L++M+++CG  +    +F
Subjt:  EMKPNVITWSAVICGFASKGLGEESLEVFRQMQ-VANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVF

Query:  EKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASN
          L NRD+ +W + I    M G  + A+ +FD+M++ G KPD V F+ AL++CSH GL+ +G+ +FY ML+ + + P+  HY CMVDLLGRAGL+EEA  
Subjt:  EKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASN

Query:  IVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSI
        +++ MP++PN  +W++LL +CR+  + E+A   A +I  L  E TGS++LLSN++A++ RW D A+VR+S + KGL+K PG S I+++ K + F +G+  
Subjt:  IVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSI

Query:  QESLGRVDEILHDLAFQIESH-----DFDDCIIDTE
           +  ++ +L +++ Q  SH     D  + ++D +
Subjt:  QESLGRVDEILHDLAFQIESH-----DFDDCIIDTE

Arabidopsis top hitse value%identityAlignment
AT1G17630.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.2e-19348.24Show/hide
Query:  MLYASSYQTFKSVL------FCF-----PRFSINFPSNFRFYFNSHFSSITYDNE--LLDFFDHLLRRCNGIQHCKQAHSATVVTG-ACSSAFVAARLVS
        M++AS +Q    ++      FCF     P  SI+ P        S + S+T +N+  L  +FDHLL  C   Q C+Q H+  +++     S  +AA L+S
Subjt:  MLYASSYQTFKSVL------FCF-----PRFSINFPSNFRFYFNSHFSSITYDNE--LLDFFDHLLRRCNGIQHCKQAHSATVVTG-ACSSAFVAARLVS

Query:  VYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGN
        VYAR G +  AR VF++     LS+L LWNSI++ANVS+G    ALELY  MR  G+  DG+  PL+L+A   LG F LC+  H  V+Q G + +LHV N
Subjt:  VYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGN

Query:  ELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEML
        EL+ +Y K  RM DA  +F +M ++  +SWN M+ G++  YD   A ++F  M+ E   P+ VTWTS+LS H++CG  E+ +  F  MR  G   + E L
Subjt:  ELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEML

Query:  AVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKP
        AV  SVCA+L  L+  + +HGY++KGGFE+YL ++NAL+ VYGK G +++AE LF +++ K + SWNSLI+S+ ++G  D+A   FS+LE+M     +K 
Subjt:  AVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKP

Query:  NVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLEN
        NV+TW++VI G   +G G++SLE FRQMQ + V ANSVTI  +LS+CA L ALNLGREIHGHV R  M +NILV N L+NMY KCG    G LVFE + +
Subjt:  NVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLEN

Query:  RDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGM
        +DLISWNS+I GYGMHG  ++AL++FD M+ SGF PD +  +A LS+CSHAGL+ +GR +FY M + + ++PQ EHYAC+VDLLGR G ++EAS IVK M
Subjt:  RDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGM

Query:  PIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLG
        P++P   V  ALLNSCRMHK+ ++AE  AS++  L  E TGS+MLLSNI++A  RWE+SA VR  A+ K LKKV G SWIE+KKK Y F +G+ +Q    
Subjt:  PIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLG

Query:  RVDEILHDL
         +  +L DL
Subjt:  RVDEILHDL

AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein3.9e-12336.12Show/hide
Query:  QAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSF
        QAH+  + +GA +  +++A+L++ Y+       A  V  S P      +  ++S+I A        +++ ++  M + G++ D    P + K  A L +F
Subjt:  QAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSF

Query:  NLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGH
         + K +HC     G      V   +  MY +  RM DARKVFD+M  K +V+ +A++  YA    +    R+  +MES GI  N V+W  +LS   R G+
Subjt:  NLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGH

Query:  LEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESG
         +E + +F K+ + G   +   ++ VL    D   LN G++IHGY++K G        +A++ +YGK G +     LF++ ++      N+ I+  S +G
Subjt:  LEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESG

Query:  LYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNG
        L DKA E F   ++      M+ NV++W+++I G A  G   E+LE+FR+MQVA VK N VTI S+L  C  +AAL  GR  HG   R  + DN+ VG+ 
Subjt:  LYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNG

Query:  LINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEML-ENYKIKPQMEH
        LI+MY KCG      +VF  +  ++L+ WNS++ G+ MHG  KE ++IF+ +M++  KPD ++F + LS+C   GL  EG W +++M+ E Y IKP++EH
Subjt:  LINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEML-ENYKIKPQMEH

Query:  YACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPG
        Y+CMV+LLGRAG ++EA +++K MP +P++ VW ALLNSCR+  + +LAE  A ++F L  E  G+++LLSNI+AA   W +   +R      GLKK PG
Subjt:  YACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPG

Query:  CSWIELKKKVYMFKAGNSIQESLGRVDEILHDLA
        CSWI++K +VY   AG+     + ++ E + +++
Subjt:  CSWIELKKKVYMFKAGNSIQESLGRVDEILHDLA

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)4.5e-11131.39Show/hide
Query:  LRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSG---FVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFT
        L+ C  I   K  H +    G  +      +LV+     G    + +A++VF+++   G     ++NS+IR   S+G C EA+ L+  M N G+  D +T
Subjt:  LRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSG---FVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFT

Query:  FPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQM-ESEGIVPNP
        FP  L A A   +      +H  +V+ G+   L V N L+  YA+   +D ARKVFD+M  + +VSW +M+ GYA       A  +F +M   E + PN 
Subjt:  FPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQM-ESEGIVPNP

Query:  VTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVV-----------------------------------------------------------
        VT   ++S+ A+   LE    +++ +RN G+  N  M++ +                                                           
Subjt:  VTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVV-----------------------------------------------------------

Query:  -------LSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYP
               +S C+ L  +  G+  HGY+++ GFE +    NAL+ +Y K      A ++F  M  K++V+WNS+++ Y E+G  D A+E F      E  P
Subjt:  -------LSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYP

Query:  EMKPNVITWSAVICGFASKGLGEESLEVFRQMQ-VANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVF
        E   N+++W+ +I G     L EE++EVF  MQ    V A+ VT+ S+ S C  L AL+L + I+ ++ +  +  ++ +G  L++M+++CG  +    +F
Subjt:  EMKPNVITWSAVICGFASKGLGEESLEVFRQMQ-VANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVF

Query:  EKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASN
          L NRD+ +W + I    M G  + A+ +FD+M++ G KPD V F+ AL++CSH GL+ +G+ +FY ML+ + + P+  HY CMVDLLGRAGL+EEA  
Subjt:  EKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASN

Query:  IVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSI
        +++ MP++PN  +W++LL +CR+  + E+A   A +I  L  E TGS++LLSN++A++ RW D A+VR+S + KGL+K PG S I+++ K + F +G+  
Subjt:  IVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSI

Query:  QESLGRVDEILHDLAFQIESH-----DFDDCIIDTE
           +  ++ +L +++ Q  SH     D  + ++D +
Subjt:  QESLGRVDEILHDLAFQIESH-----DFDDCIIDTE

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification4.5e-11131.39Show/hide
Query:  LRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSG---FVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFT
        L+ C  I   K  H +    G  +      +LV+     G    + +A++VF+++   G     ++NS+IR   S+G C EA+ L+  M N G+  D +T
Subjt:  LRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSG---FVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFT

Query:  FPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQM-ESEGIVPNP
        FP  L A A   +      +H  +V+ G+   L V N L+  YA+   +D ARKVFD+M  + +VSW +M+ GYA       A  +F +M   E + PN 
Subjt:  FPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAYRVFLQM-ESEGIVPNP

Query:  VTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVV-----------------------------------------------------------
        VT   ++S+ A+   LE    +++ +RN G+  N  M++ +                                                           
Subjt:  VTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVV-----------------------------------------------------------

Query:  -------LSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYP
               +S C+ L  +  G+  HGY+++ GFE +    NAL+ +Y K      A ++F  M  K++V+WNS+++ Y E+G  D A+E F      E  P
Subjt:  -------LSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYP

Query:  EMKPNVITWSAVICGFASKGLGEESLEVFRQMQ-VANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVF
        E   N+++W+ +I G     L EE++EVF  MQ    V A+ VT+ S+ S C  L AL+L + I+ ++ +  +  ++ +G  L++M+++CG  +    +F
Subjt:  EMKPNVITWSAVICGFASKGLGEESLEVFRQMQ-VANVKANSVTISSVLSVCAMLAALNLGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVF

Query:  EKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASN
          L NRD+ +W + I    M G  + A+ +FD+M++ G KPD V F+ AL++CSH GL+ +G+ +FY ML+ + + P+  HY CMVDLLGRAGL+EEA  
Subjt:  EKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASN

Query:  IVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSI
        +++ MP++PN  +W++LL +CR+  + E+A   A +I  L  E TGS++LLSN++A++ RW D A+VR+S + KGL+K PG S I+++ K + F +G+  
Subjt:  IVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSI

Query:  QESLGRVDEILHDLAFQIESH-----DFDDCIIDTE
           +  ++ +L +++ Q  SH     D  + ++D +
Subjt:  QESLGRVDEILHDLAFQIESH-----DFDDCIIDTE

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-11236.24Show/hide
Query:  HFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALEL
        H  S T DN    F        + ++  + AH+ ++VTG  S+ FV   LV++Y+R   +  ARKVFD      + +++ WNSII +    G  + ALE+
Subjt:  HFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLSNLLLWNSIIRANVSNGYCREALEL

Query:  YGEMRN-VGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAY
        +  M N  G   D  T   VL   A+LG+ +L K LHC  V      ++ VGN L+ MYAK   MD+A  VF  M +K +VSWNAMV+GY+       A 
Subjt:  YGEMRN-VGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMVSGYAYNYDVNGAY

Query:  RVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGD
        R+F +M+ E I  + VTW++ +S +A+ G   E + +  +M + G+  N   L  VLS CA +  L  G+ IH Y +K   +         L   G G +
Subjt:  RVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGKGGD

Query:  IRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQM--QVANVKANSVTISSVLS
                  M +      N LI  Y++    D A   F  L   E       +V+TW+ +I G++  G   ++LE+  +M  +    + N+ TIS  L 
Subjt:  IRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQM--QVANVKANSVTISSVLS

Query:  VCAMLAALNLGREIHGHVFRAQMDD-NILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAA
         CA LAAL +G++IH +  R Q +   + V N LI+MY KCGS     LVF+ +  ++ ++W S++ GYGMHG G+EAL IFDEM + GFK D VT +  
Subjt:  VCAMLAALNLGREIHGHVFRAQMDD-NILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAA

Query:  LSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHM
        L +CSH+G+I +G   F  M   + + P  EHYAC+VDLLGRAG +  A  +++ MP++P   VW A L+ CR+H   EL E  A +I +L S   GS+ 
Subjt:  LSSCSHAGLIAEGRWLFYEMLENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHM

Query:  LLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIES-----------HDFDD
        LLSN++A + RW+D  R+R   R KG+KK PGCSW+E  K    F  G+        + ++L D   +I+            HD DD
Subjt:  LLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIES-----------HDFDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTATGCTTCTTCTTATCAGACATTCAAATCGGTTTTGTTCTGTTTTCCCCGATTCTCAATCAATTTCCCCTCAAATTTTCGATTCTATTTCAACTCCCATTTTTC
CTCAATCACTTATGACAACGAACTCCTCGATTTCTTCGATCATCTTCTCCGGCGATGCAACGGGATTCAACATTGCAAACAAGCTCATTCCGCCACCGTTGTCACCGGCG
CCTGTTCGTCGGCGTTCGTCGCTGCCCGGCTTGTGTCCGTCTATGCCCGTTCTGGGTTTGTTTTTTATGCCCGGAAAGTGTTTGATTCTGCGCCATTTGCAGGTTTGTCG
AACTTGCTGTTATGGAATTCGATTATAAGAGCAAATGTATCTAATGGGTATTGTAGAGAAGCGCTTGAACTTTATGGGGAAATGAGAAATGTTGGGGTTTTGGCTGATGG
GTTTACTTTTCCCCTGGTTTTGAAGGCTTCTGCCAATTTGGGTTCTTTTAATTTGTGCAAGAATCTTCATTGTCATGTTGTGCAATTTGGGTTTCAAAATCATTTGCATG
TTGGGAATGAATTGATAGGAATGTATGCTAAACTTAGACGAATGGATGATGCCCGGAAAGTGTTTGACAAAATGCATCTGAAAGTTTTAGTTTCTTGGAACGCTATGGTT
TCTGGTTATGCCTATAATTATGATGTTAATGGTGCTTATAGGGTGTTCCTTCAAATGGAGTCGGAAGGGATCGTGCCTAACCCTGTAACTTGGACATCATTGCTGTCGAG
TCATGCGCGGTGCGGTCATCTTGAAGAAACAATGACATTGTTTAGCAAGATGAGGAATAAAGGTGTTGGTGCCAATGCTGAAATGCTTGCTGTGGTGTTATCTGTTTGTG
CTGATTTAGCTACATTGAATAGGGGTCAGATGATTCATGGATATATAGTCAAGGGAGGTTTCGAAGATTACTTGTTCGCCAAAAACGCTCTTTTAACTGTATATGGAAAA
GGAGGAGACATAAGAAATGCAGAGAAGTTATTTCATGAGATGAAAGTAAAAAGTCTTGTGAGTTGGAATTCTCTTATATCTTCCTATTCTGAATCTGGATTATATGACAA
AGCTTTTGAAGCATTTTCTCAGCTTGAGAAAATGGAAGTTTATCCAGAGATGAAGCCTAATGTCATAACTTGGAGTGCAGTCATATGTGGATTTGCTTCCAAGGGATTAG
GAGAAGAATCTTTGGAAGTTTTTCGCCAAATGCAGGTTGCAAATGTAAAGGCGAACTCGGTGACGATATCTAGTGTTTTATCAGTTTGTGCTATGCTAGCAGCTCTAAAT
CTTGGTAGGGAAATACATGGTCATGTCTTTAGAGCTCAGATGGATGATAACATATTGGTAGGAAATGGATTGATTAACATGTATACAAAGTGTGGAAGTTTCAAGCCAGG
CTGTTTGGTGTTTGAAAAACTTGAAAATCGAGATTTGATCTCGTGGAACTCAATGATTGCAGGATATGGAATGCATGGACTTGGTAAAGAAGCTCTTGCGATTTTCGATG
AGATGATGAAATCAGGATTTAAACCAGATGATGTTACCTTTATTGCTGCTCTTTCTTCTTGTAGCCATGCTGGTCTCATTGCCGAAGGCCGTTGGCTTTTTTATGAGATG
CTAGAAAACTATAAGATCAAGCCTCAGATGGAGCACTATGCATGCATGGTCGATCTTCTCGGTCGTGCCGGGCTCGTGGAAGAAGCAAGTAACATAGTCAAGGGCATGCC
AATCAAACCCAATGCTTATGTCTGGAGTGCCCTTCTCAACTCTTGCAGAATGCACAAGGATACAGAACTTGCAGAAGAAACTGCCTCTCGGATTTTCGATCTTAATTCCG
AGATAACGGGGAGCCATATGTTGCTGTCGAATATTTTTGCGGCAAGTTGTAGATGGGAGGATTCTGCAAGGGTGAGGATCTCAGCGAGGATGAAAGGCTTAAAGAAAGTT
CCTGGGTGCAGCTGGATTGAGTTGAAGAAGAAGGTTTATATGTTTAAAGCAGGAAACTCAATACAAGAAAGTTTAGGTAGAGTTGATGAAATTCTTCATGATTTGGCTTT
TCAGATTGAAAGCCATGATTTTGATGATTGTATTATTGACACTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGTATGCTTCTTCTTATCAGACATTCAAATCGGTTTTGTTCTGTTTTCCCCGATTCTCAATCAATTTCCCCTCAAATTTTCGATTCTATTTCAACTCCCATTTTTC
CTCAATCACTTATGACAACGAACTCCTCGATTTCTTCGATCATCTTCTCCGGCGATGCAACGGGATTCAACATTGCAAACAAGCTCATTCCGCCACCGTTGTCACCGGCG
CCTGTTCGTCGGCGTTCGTCGCTGCCCGGCTTGTGTCCGTCTATGCCCGTTCTGGGTTTGTTTTTTATGCCCGGAAAGTGTTTGATTCTGCGCCATTTGCAGGTTTGTCG
AACTTGCTGTTATGGAATTCGATTATAAGAGCAAATGTATCTAATGGGTATTGTAGAGAAGCGCTTGAACTTTATGGGGAAATGAGAAATGTTGGGGTTTTGGCTGATGG
GTTTACTTTTCCCCTGGTTTTGAAGGCTTCTGCCAATTTGGGTTCTTTTAATTTGTGCAAGAATCTTCATTGTCATGTTGTGCAATTTGGGTTTCAAAATCATTTGCATG
TTGGGAATGAATTGATAGGAATGTATGCTAAACTTAGACGAATGGATGATGCCCGGAAAGTGTTTGACAAAATGCATCTGAAAGTTTTAGTTTCTTGGAACGCTATGGTT
TCTGGTTATGCCTATAATTATGATGTTAATGGTGCTTATAGGGTGTTCCTTCAAATGGAGTCGGAAGGGATCGTGCCTAACCCTGTAACTTGGACATCATTGCTGTCGAG
TCATGCGCGGTGCGGTCATCTTGAAGAAACAATGACATTGTTTAGCAAGATGAGGAATAAAGGTGTTGGTGCCAATGCTGAAATGCTTGCTGTGGTGTTATCTGTTTGTG
CTGATTTAGCTACATTGAATAGGGGTCAGATGATTCATGGATATATAGTCAAGGGAGGTTTCGAAGATTACTTGTTCGCCAAAAACGCTCTTTTAACTGTATATGGAAAA
GGAGGAGACATAAGAAATGCAGAGAAGTTATTTCATGAGATGAAAGTAAAAAGTCTTGTGAGTTGGAATTCTCTTATATCTTCCTATTCTGAATCTGGATTATATGACAA
AGCTTTTGAAGCATTTTCTCAGCTTGAGAAAATGGAAGTTTATCCAGAGATGAAGCCTAATGTCATAACTTGGAGTGCAGTCATATGTGGATTTGCTTCCAAGGGATTAG
GAGAAGAATCTTTGGAAGTTTTTCGCCAAATGCAGGTTGCAAATGTAAAGGCGAACTCGGTGACGATATCTAGTGTTTTATCAGTTTGTGCTATGCTAGCAGCTCTAAAT
CTTGGTAGGGAAATACATGGTCATGTCTTTAGAGCTCAGATGGATGATAACATATTGGTAGGAAATGGATTGATTAACATGTATACAAAGTGTGGAAGTTTCAAGCCAGG
CTGTTTGGTGTTTGAAAAACTTGAAAATCGAGATTTGATCTCGTGGAACTCAATGATTGCAGGATATGGAATGCATGGACTTGGTAAAGAAGCTCTTGCGATTTTCGATG
AGATGATGAAATCAGGATTTAAACCAGATGATGTTACCTTTATTGCTGCTCTTTCTTCTTGTAGCCATGCTGGTCTCATTGCCGAAGGCCGTTGGCTTTTTTATGAGATG
CTAGAAAACTATAAGATCAAGCCTCAGATGGAGCACTATGCATGCATGGTCGATCTTCTCGGTCGTGCCGGGCTCGTGGAAGAAGCAAGTAACATAGTCAAGGGCATGCC
AATCAAACCCAATGCTTATGTCTGGAGTGCCCTTCTCAACTCTTGCAGAATGCACAAGGATACAGAACTTGCAGAAGAAACTGCCTCTCGGATTTTCGATCTTAATTCCG
AGATAACGGGGAGCCATATGTTGCTGTCGAATATTTTTGCGGCAAGTTGTAGATGGGAGGATTCTGCAAGGGTGAGGATCTCAGCGAGGATGAAAGGCTTAAAGAAAGTT
CCTGGGTGCAGCTGGATTGAGTTGAAGAAGAAGGTTTATATGTTTAAAGCAGGAAACTCAATACAAGAAAGTTTAGGTAGAGTTGATGAAATTCTTCATGATTTGGCTTT
TCAGATTGAAAGCCATGATTTTGATGATTGTATTATTGACACTGAATGA
Protein sequenceShow/hide protein sequence
MLYASSYQTFKSVLFCFPRFSINFPSNFRFYFNSHFSSITYDNELLDFFDHLLRRCNGIQHCKQAHSATVVTGACSSAFVAARLVSVYARSGFVFYARKVFDSAPFAGLS
NLLLWNSIIRANVSNGYCREALELYGEMRNVGVLADGFTFPLVLKASANLGSFNLCKNLHCHVVQFGFQNHLHVGNELIGMYAKLRRMDDARKVFDKMHLKVLVSWNAMV
SGYAYNYDVNGAYRVFLQMESEGIVPNPVTWTSLLSSHARCGHLEETMTLFSKMRNKGVGANAEMLAVVLSVCADLATLNRGQMIHGYIVKGGFEDYLFAKNALLTVYGK
GGDIRNAEKLFHEMKVKSLVSWNSLISSYSESGLYDKAFEAFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRQMQVANVKANSVTISSVLSVCAMLAALN
LGREIHGHVFRAQMDDNILVGNGLINMYTKCGSFKPGCLVFEKLENRDLISWNSMIAGYGMHGLGKEALAIFDEMMKSGFKPDDVTFIAALSSCSHAGLIAEGRWLFYEM
LENYKIKPQMEHYACMVDLLGRAGLVEEASNIVKGMPIKPNAYVWSALLNSCRMHKDTELAEETASRIFDLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKV
PGCSWIELKKKVYMFKAGNSIQESLGRVDEILHDLAFQIESHDFDDCIIDTE