; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0013162 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0013162
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr1:47948917..47953258
RNA-Seq ExpressionLag0013162
SyntenyLag0013162
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044645 - Pentatricopeptide repeat-containing protein DG1/EMB2279-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604125.1 Calcium-transporting ATPase 2, plasma membrane-type, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.91Show/hide
Query:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA
        MEAL+ NA +PSPKFEP+ EKIKRTLLQKGV+PTPKIVR+L KKEIQKHNRKLKR AERQ  QSPPLSE QKQLI EET FLTLRSEYKEFSKAIEA+PA
Subjt:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLERVNLKELT FRT YNRD+LKKESLRELRKLFEARKLEELQW+LDDDVELKDEWL SENGH DAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMI+SGLQFNEGQLL+ILD LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG ARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL
        VTLGQAGLLKQLLK+IE MRQQPSKKVRN  RK WDPAVEPDLVIYNAILNAC+PTLEWK V+WVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL

Query:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS
        DDCISIFEYMKQKC PNIGTINTMLKV+GR DMFSKAKDL+EEIKRKADHSS S  + SI+PD+YTY SML+AAA A QWEYFENVYREMALSG++LDQS
Subjt:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS

Query:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA
        KHAMLLVEASRAGKWYLLDHAFD ILEAGQIPHPLLFTEMILQLTTQDNY+QAVTLV+ M YAPFQVSERQWT+LFE NTDRICW+NLKKLSDALS+C+A
Subjt:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA

Query:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK
        SEATVSNLSRSLQSLCK GIPEN  QS+  DH   +GLQL GSENMENMKLH D  TD C+ SLD IPVNHASLN     DSK  PWSLS S+GVL T K
Subjt:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK

Query:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHP
        FSD S+NE STFDL D  EDDEEEL++LLD F DSYDSN PSV EILKTWKE+RK DGL+LHP
Subjt:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHP

XP_022950896.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucurbita moschata]0.0e+0086.91Show/hide
Query:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA
        MEAL+ NA +PSPKFEP+ EKIKRTLLQKGV+PTPKIVR+L KKEIQKHNRKLKR AERQ  QSPPLSE QKQLI EET FLTLRSEYKEFSKAIEA+PA
Subjt:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLERVNLKELT FRT YNRD+LKKESLRELRKLFEARKLEELQW+LDDDVELKDEWL SENGH DAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMI+SGLQFNEGQLL+ILD LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG ARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL
        VTLGQAGLLKQLLK+IE MRQQPSKKVRN  RK WDPAVEPDLVIYNAILNAC+PTLEWK V+WVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL

Query:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDAL+EVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS
        DDCISIFEYMKQKC PNIGTINTMLKV+GRNDMFSKAKDL+EEIKRKADHSS SS + SI+PD+YTY SML+AAA A QWEYFENVYREMALSG++LDQS
Subjt:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS

Query:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA
        KHAMLLVEASRAGKWYLLDHAFD ILEAGQIPHPLLFTEMILQLTTQDNY+QAVTLV+ M YAPFQVSERQWT+LFE NTDRICW+NLKKLSDALS+C+A
Subjt:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA

Query:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK
        SEATV NLS SLQSLCK GIPENASQS+A DH   +GLQLPG ENM+NMKLH D  TD C+ SLD IPVNHASLN     DS+  PWSLS S+GVL T K
Subjt:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK

Query:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHP
        FSD S+NE STFDL D  EDDEEEL++LLD F DSYDSN PSV EILKTWKE+RK DGL+LHP
Subjt:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHP

XP_023543129.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0086.79Show/hide
Query:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA
        MEAL+ NA +PSPKFEP+ EKIKRTLLQKGV+PTPKIVR+L KKEIQKHNRKLKR AERQ  QSPPLSE QKQLI EETHFLTLRSEYKEFSKAIEA+PA
Subjt:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLERVNLKELT FRT YNRD+LKKESLRELRKLFEARKLEELQW+LDDDVELK+EWL SENG  DAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMI+SGLQFNEGQLL+ILD LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG ARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL
        VTLGQAGLLKQLLK+IE MRQQPSKKVRN  RK WDPAVEPDLVIYNAILNACIPTLEWK V+WVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL

Query:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDAL+EVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS
        DDCISIFEYMKQKC PNIGTINTMLKV+GRNDMFSKAKDL+EEIKRKADHSS SS + SI+PD+YTY SML+AAA A QWEYFENVYREMALSG++LDQS
Subjt:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS

Query:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA
        KHAMLLVEASRAGKWYLLDHAFD ILEAGQIPHPLLFTE+ILQLTTQDNY+QAVTLV+ M YAPFQVSERQWT++FE NTDRICW+NLKKLSDALS+C+A
Subjt:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA

Query:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK
        SEATVSNLSRSLQ LCK GIPEN SQS+  DH   +GLQLPGSENMENMKLH D  TD C+ SLD IPVNHASLN     DSK  PWSLS S+GVL T K
Subjt:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK

Query:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHP
        FSD S+NE STFDL D  EDDEEEL++LLD F D YDSN PSV EILKTWKE+RK DGL+LHP
Subjt:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHP

XP_038881784.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X1 [Benincasa hispida]0.0e+0087.97Show/hide
Query:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA
        MEAL+ N+PIPSPKFEP+ EKIKRTL+ KGV+PTP+IVR+LRKKEIQK+NRKLKR  ERQADQSPPLSE QKQLIAEETHFLTLRSEYKEFSKAIEAKPA
Subjt:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLERVNLKELT FRTGYNRD+LKKESLRELRKLFEARKLEELQW+LDDDVELK+EWLESEN H DA++RRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMR--------GDGQIYPD
        MRDWKFSRMMIRSGLQFNEGQLL+ILD LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGM+RKPQEALQIF+LMR        GDGQIYPD
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMR--------GDGQIYPD

Query:  MAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSG
        MAAYHSIAVTLGQAGLLKQLLKV+E MRQQPS+KVRNK RKSWDPAVEPDLV+YNAILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSG
Subjt:  MAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSG

Query:  KYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMIL
        KYE +H LFTK+K+SGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDAL+EVEKMKTLSHMKPLVVTFTGMIL
Subjt:  KYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMIL

Query:  SSFDGGHIDDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMAL
        SSFDGGHIDDCISIFEYMKQKC PNIGTIN+MLKVYGRNDMF KAKDLFEEIKRKAD SS SS +PS++PDEYTYGSMLEAAA ALQWEYFENVYREMAL
Subjt:  SSFDGGHIDDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMAL

Query:  SGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLS
        SG++LDQSKHA LLVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTEMIL LTTQDNY+QAVTLV+ MGYAPFQVSERQWT+LFE N DRICW NLKKL 
Subjt:  SGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLS

Query:  DALSNCNASEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFS
        DAL NC+ASEATVSNLSRSLQSLCK GIPEN SQSVA DH   +GLQLPGSEN ENMKLHPDR TD CDESLD IPVNHASLN KV+SDS+ +PWS S S
Subjt:  DALSNCNASEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFS

Query:  EGVLGTDKFSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN
        EGVLGTD+FSD S NE ST DLCD  EDDEE LN+LLD F DSYDSN PSV+EILKTWKE+RKTDGLFLHPLN
Subjt:  EGVLGTDKFSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN

XP_038881786.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X2 [Benincasa hispida]0.0e+0088.79Show/hide
Query:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA
        MEAL+ N+PIPSPKFEP+ EKIKRTL+ KGV+PTP+IVR+LRKKEIQK+NRKLKR  ERQADQSPPLSE QKQLIAEETHFLTLRSEYKEFSKAIEAKPA
Subjt:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLERVNLKELT FRTGYNRD+LKKESLRELRKLFEARKLEELQW+LDDDVELK+EWLESEN H DA++RRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMIRSGLQFNEGQLL+ILD LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGM+RKPQEALQIF+LMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL
        VTLGQAGLLKQLLKV+E MRQQPS+KVRNK RKSWDPAVEPDLV+YNAILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL

Query:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
        FTK+K+SGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDAL+EVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS
        DDCISIFEYMKQKC PNIGTIN+MLKVYGRNDMF KAKDLFEEIKRKAD SS SS +PS++PDEYTYGSMLEAAA ALQWEYFENVYREMALSG++LDQS
Subjt:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS

Query:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA
        KHA LLVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTEMIL LTTQDNY+QAVTLV+ MGYAPFQVSERQWT+LFE N DRICW NLKKL DAL NC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA

Query:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK
        SEATVSNLSRSLQSLCK GIPEN SQSVA DH   +GLQLPGSEN ENMKLHPDR TD CDESLD IPVNHASLN KV+SDS+ +PWS S SEGVLGTD+
Subjt:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK

Query:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN
        FSD S NE ST DLCD  EDDEE LN+LLD F DSYDSN PSV+EILKTWKE+RKTDGLFLHPLN
Subjt:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN

TrEMBL top hitse value%identityAlignment
A0A0A0KFY8 Uncharacterized protein0.0e+0086.36Show/hide
Query:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA
        MEALN+N PIPSPKFEP+T+KIKR LLQKGVYPTP+IVR+LRKKEIQK+NRKLKR AERQ+ QSPPLSE QKQLIAEETHFLTLRSEYKEFSKAIEAKPA
Subjt:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLERVN KELT  RTGYNRDSLKKESLRELRKLFE RKLEE QW LDDDVELK+EWLESEN   D VKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMIRSGLQFNEGQLL+ILD LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL
        VTLGQAGLLKQLLKVIE MRQQPSKKVRNK RKSWDPAVEPDLVIYNAILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE +HNL
Subjt:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL

Query:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK++G+TLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNG+WQDAL+EVEKMKTLSHMKPLVVTFTGMI SSF+GGHI
Subjt:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS
        DDCISIFEYMKQ C PNIGTINTMLKVYGRNDM+SKAKDLFEEIKRKAD SS  S +PS++PDEYTY SMLEAAA +LQWEYFE+VYREMALSG+QLDQS
Subjt:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS

Query:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA
        KHA+LLVEAS+AGKWYLLDHAFD ILEAGQIPHPLLFTEMILQLTTQDNY+QAVTLV+ MGYAPFQVSERQWT+LFE NTDRI  +NLK+L  AL +C+A
Subjt:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA

Query:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK
        SEATVSNLSRSLQSLCK  IPEN SQSVA DH A + LQLP SENMENMKLHPD      DESLD IPV+HASLN KV S+SK +PWS+S S+G LGT +
Subjt:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK

Query:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN
        FSDGS+N  S FDLC   EDDEEELN LLD F D+YDSN P+V+EIL+TWKE+RK DGLFLH LN
Subjt:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN

A0A1S3B127 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0086.01Show/hide
Query:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA
        MEALN+NAPIPSPKFEP+T+KIKR LLQKGVYPTP+IVR+LRKKEIQK+NRKLKR AERQ+DQSPPLSE QKQLIAEETHFLTLRSEYKEFSKAIEAKPA
Subjt:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLERVN  ELT  RTGYNRDSLKKESLRELRKLFE RKLEEL+W LDDDVELK+EWL+SENG  DAVKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMIRSGLQFNEGQLL+ILD LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL
        VTLGQAGLLKQLLKVIE MRQQPSKKVRNK RKSWDPAVEPDLVIYN ILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE +HNL
Subjt:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL

Query:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK+SGETLKANTYRVLVKAFWEEGN +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+WQDAL+EVEKMKTLSHMKPLVVTFTGMILSSF+GGHI
Subjt:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS
        DDCISIFEYMKQ C PNIGTINTMLKVYGRNDMFSKAKDLFEEIK+KAD SS +S +PS++PDEYTY SML+AAA +LQWEYFENVYREMALSG++LDQS
Subjt:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS

Query:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA
        KHA+LLVEAS+AGKWYLLDHAFD ILEAGQIPHPLLFTEMILQLTTQ+NY+QAVTLV+ MGYAPFQVSERQWT+LFE N DRIC +NLK+L DAL +C+A
Subjt:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA

Query:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK
        SEATVSNLSRSLQSLCK GI E+ SQS+A D  A +GL+LP S+NMENMKLHPD+     DESLD IPV+HASLN KV S+S  +PWS S S+GVLGT +
Subjt:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK

Query:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN
        FSD S+NE STFD  D  EDDE ELN+LLD F DSYDSN P+ +EIL+TWKE+RK DGLFLHPLN
Subjt:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN

A0A5D3CLI0 Pentatricopeptide repeat-containing protein0.0e+0086.01Show/hide
Query:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA
        MEALN+NAPIPSPKFEP+T+KIKR LLQKGVYPTP+IVR+LRKKEIQK+NRKLKR AERQ+DQSPPLSE QKQLIAEETHFLTLRSEYKEFSKAIEAKPA
Subjt:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLERVN  ELT  RTGYNRDSLKKESLRELRKLFE RKLEEL+W LDDDVELK+EWL+SENG  DAVKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMIRSGLQFNEGQLL+ILD LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL
        VTLGQAGLLKQLLKVIE MRQQPSKKVRNK RKSWDPAVEPDLVIYN ILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE +HNL
Subjt:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL

Query:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK+SGETLKANTYRVLVKAFWEEGN +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+WQDAL+EVEKMKTLSHMKPLVVTFTGMILSSF+GGHI
Subjt:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS
        DDCISIFEYMKQ C PNIGTINTMLKVYGRNDMFSKAKDLFEEIK+KAD SS +S +PS++PDEYTY SML+AAA +LQWEYFENVYREMALSG++LDQS
Subjt:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS

Query:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA
        KHA+LLVEAS+AGKWYLLDHAFD ILEAGQIPHPLLFTEMILQLTTQ+NY+QAVTLV+ MGYAPFQVSERQWT+LFE N DRIC +NLK+L DAL +C+A
Subjt:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA

Query:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK
        SEATVSNLSRSLQSLCK GI E+ SQS+A D  A +GL+LP S+NMENMKLHPD+     DESLD IPV+HASLN KV S+S  +PWS S S+GVLGT +
Subjt:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK

Query:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN
        FSD S+NE STFD  D  EDDE ELN+LLD F DSYDSN P+ +EIL+TWKE+RK DGLFLHPLN
Subjt:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN

A0A6J1GG29 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0086.91Show/hide
Query:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA
        MEAL+ NA +PSPKFEP+ EKIKRTLLQKGV+PTPKIVR+L KKEIQKHNRKLKR AERQ  QSPPLSE QKQLI EET FLTLRSEYKEFSKAIEA+PA
Subjt:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLERVNLKELT FRT YNRD+LKKESLRELRKLFEARKLEELQW+LDDDVELKDEWL SENGH DAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMI+SGLQFNEGQLL+ILD LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG ARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL
        VTLGQAGLLKQLLK+IE MRQQPSKKVRN  RK WDPAVEPDLVIYNAILNAC+PTLEWK V+WVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL

Query:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDAL+EVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS
        DDCISIFEYMKQKC PNIGTINTMLKV+GRNDMFSKAKDL+EEIKRKADHSS SS + SI+PD+YTY SML+AAA A QWEYFENVYREMALSG++LDQS
Subjt:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS

Query:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA
        KHAMLLVEASRAGKWYLLDHAFD ILEAGQIPHPLLFTEMILQLTTQDNY+QAVTLV+ M YAPFQVSERQWT+LFE NTDRICW+NLKKLSDALS+C+A
Subjt:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA

Query:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK
        SEATV NLS SLQSLCK GIPENASQS+A DH   +GLQLPG ENM+NMKLH D  TD C+ SLD IPVNHASLN     DS+  PWSLS S+GVL T K
Subjt:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK

Query:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHP
        FSD S+NE STFDL D  EDDEEEL++LLD F DSYDSN PSV EILKTWKE+RK DGL+LHP
Subjt:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHP

A0A6J1ILT7 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0086.56Show/hide
Query:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA
        MEAL+ NA +PSPKFEP+ EKIKRTL+QKGV+PTPKIVR+L KKEIQKHNRKLKR AERQ  QSPPLSE QKQLI EETHFLTLRSEYKEFSKAIEA+PA
Subjt:  MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLERVNLKE T FRT YNRD+LKKESLRELRKLFEARKLEELQW+LDDDVELK+EWL SENGH DAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMI+SGLQFNEGQLL+ILD LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG ARKPQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL
        VTLGQAGLLKQLLK+IE MRQQPSKKVRN  RK WDPAVEPDLVIYNAILNACIPTLEWK V+WVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNL

Query:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDAL+EVEKMKTLSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS
        DDCISIFEYMKQKC PNIGTINTMLKV+GRNDMFSKAKDL+EEIKRKADHSS SS + SI+PD+YTY SML+AAA A QWEYFENVYREMALSG++LDQS
Subjt:  DDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQS

Query:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA
        KHAMLLVEASRAGKWYLLDHAFD ILEAGQIPHPLLFTEMILQLTTQDNY+QA+TLV+ M YAPFQVSERQWT+LFE NTDRICW+NLKKLSDALS+C+A
Subjt:  KHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNA

Query:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK
        SEATVSNLS SLQSLCK GIPEN SQS+A DH   +GLQLPGSEN +NMKLH D  TD C+ SLD IPVNHASLN     DSK  PWSLS S+GVL T K
Subjt:  SEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDK

Query:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHP
        FSD S+NE STFDL D  EDDEEEL++LLD F DSY SN PSV EILKTW+E+RK DGL+LHP
Subjt:  FSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHP

SwissProt top hitse value%identityAlignment
Q9FIX3 Pentatricopeptide repeat-containing protein At5g397103.3e-2022.97Show/hide
Query:  KQALSVVEWVYNLKSHSH-SKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKS
        K+ +S  E V+     S  S + F Y  L+     A     AL +F+ M   G + P++  Y+++       G  K L K+ +G +   S  ++      
Subjt:  KQALSVVEWVYNLKSHSH-SKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKS

Query:  WDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIE
            +EP+L+ YN ++N        K V +V T++ + G   +  TY   ++   K G +     +  +M R G T    TY  L+ +  + GN+N A+E
Subjt:  WDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIE

Query:  AVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMKQK-CPPNIGTINTMLKVYGRNDM
         +  M  RG+  +   Y  L       G   +A   + +M   +   P VVT+  +I      G ++D I++ E MK+K   P++ + +T+L  + R+  
Subjt:  AVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMKQK-CPPNIGTINTMLKVYGRNDM

Query:  FSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPH
          +A  +  E+  K            I PD  TY S+++      + +   ++Y EM   G   D+  +  L+      G         + ++E G +P 
Subjt:  FSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPH

Query:  PLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFE
         + ++ +I  L  Q    +A  L+  + Y     S+  +  L E
Subjt:  PLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFE

Q9FJW6 Pentatricopeptide repeat-containing protein At5g67570, chloroplastic1.5e-24654.29Show/hide
Query:  PKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPAG--GLMVGRPWE
        P+FEP+ EKIKR LL+ GV PTPKI+  LRKKEIQKHNR+ KR  E +A+     +E QKQ + EE  F TLR EYK+F+++I  K  G  GLMVG PWE
Subjt:  PKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPAG--GLMVGRPWE

Query:  RLERVNLKELTD--FRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSR
         +ERV LKEL     R   +   LKKE+L+EL+K+ E    ++L+W+LDDDV++++  L+ E    D  KR R +GE +R LVDRLS REI+ + WKF R
Subjt:  RLERVNLKELTD--FRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSR

Query:  MMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL
        MM +SGLQF E Q+L+I+D LG K  WKQA +VV WVY+ K   H +SRFVYTKLL+VLG AR+PQEALQIFN M GD Q+YPDMAAYH IAVTLGQAGL
Subjt:  MMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL

Query:  LKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSG
        LK+LLKVIE MRQ+P+K  +N  +K+WDP +EPDLV+YNAILNAC+PTL+WK V WVF +LRK+GL+PNGATYGL+MEVML+SGK++ VH+ F KMK SG
Subjt:  LKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSG

Query:  ETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFE
        E  KA TY+VLV+A W EG +  A+EAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA+LEV +MK L + +PL +TFTG+I +S +GGH+DDC++IF+
Subjt:  ETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFE

Query:  YMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEI-KRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLV
        YMK KC PNIGT N MLKVYGRNDMFS+AK+LFEEI  RK  H         ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSG+Q+DQ+KHA +L+
Subjt:  YMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEI-KRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLV

Query:  EASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCN-ASEATVS
        EASRAGKW LL+HAFDA+LE G+IPHPL FTE++   T + ++ +A+TL+  +  A FQ+SE +WT LFE + D +   NL KLSD L  C+  SE TVS
Subjt:  EASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCN-ASEATVS

Query:  NLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDKFSDGSD
        NLS+SL+S C      +A   +A D       Q  G +  E++ L                       +T +E D+     +  F+E             
Subjt:  NLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDKFSDGSD

Query:  NEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTD
         E  T  L +   DD+EE          S DS+S SV +ILK W+E  K +
Subjt:  NEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTD

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic8.2e-1922.64Show/hide
Query:  YTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEW
        Y  LL+   +     EA  +F  M  DG I PD+  Y  +  T G+   L++L KV + + +  S                PD+  YN +L A   +   
Subjt:  YTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEW

Query:  KGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLC
        K    VF Q++ +G  PN  TY + + +  +SG+Y+ V  LF +MK S     A TY +L++ F E G     +    DM +  +      Y  +     
Subjt:  KGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLC

Query:  YNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMKQ-KCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIP
          G  +DA   ++ M T + + P    +TG+I +       ++ +  F  M +    P+I T +++L  + R  +  +++ +   +        + SGIP
Subjt:  YNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMKQ-KCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIP

Query:  SIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVK
            +  T+ + +EA     ++E     Y +M  S    D+     +L   S A         F+ +  +  +P  + +  M+      + +D    L++
Subjt:  SIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVK

Query:  AM
         M
Subjt:  AM

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic5.2e-9838.03Show/hide
Query:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGD
        I  L   L+  +I+M +W+FS+ +  + +++ +  ++R++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R+P EAL +F+ M   
Subjt:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSME
           YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+V+YNA+LNAC+   +W+G  WV  QL++ G KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVE--------------
        VML   KY  VH  F KM++S     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L  V               
Subjt:  VMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVE--------------

Query:  -----------KMKTLSHM--KPLVVTFTGMILSSFDGGHIDDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPS
                   ++K +  +  KPLVVT+TG+I +  D G+I +   IF+ MK+ C PN+ T N MLK Y +  +F +A++LF+++    +H   SS   S
Subjt:  -----------KMKTLSHM--KPLVVTFTGMILSSFDGGHIDDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPS

Query:  -IIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVT
         ++PD YT+ +ML+  A   +W+ F   YREM   G+  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   +   + ++  A++
Subjt:  -IIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVT

Q9SR00 Pentatricopeptide repeat-containing protein At3g04760, chloroplastic4.8e-1926.17Show/hide
Query:  LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVR
        L ++G    AL V+     L S +   +   YT L+    +     EAL++ + M   G + PDM  Y++I       G+ K      EGM  +  + VR
Subjt:  LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVR

Query:  NKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGN
        N   K      EPD++ YN +L A +   +W+    + T++      PN  TY + +  + + GK E   NL   MK  G T  A +Y  L+ AF  EG 
Subjt:  NKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGN

Query:  VNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGG-HIDDCISIFEYMKQKCPPNIGTINTMLKV
        ++ AIE +  M   G +     Y  +   LC NG+   AL    K+  +    P   ++  M  + +  G  I     I E M     P+  T N+M+  
Subjt:  VNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGG-HIDDCISIFEYMKQKCPPNIGTINTMLKV

Query:  YGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAG
          R  M  +A +L  +++    H       PS++    TY  +L     A + E   NV   M  +G + +++ + +L+     AG
Subjt:  YGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAG

Arabidopsis top hitse value%identityAlignment
AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein3.7e-9938.03Show/hide
Query:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGD
        I  L   L+  +I+M +W+FS+ +  + +++ +  ++R++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R+P EAL +F+ M   
Subjt:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSME
           YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+V+YNA+LNAC+   +W+G  WV  QL++ G KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVE--------------
        VML   KY  VH  F KM++S     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L  V               
Subjt:  VMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVE--------------

Query:  -----------KMKTLSHM--KPLVVTFTGMILSSFDGGHIDDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPS
                   ++K +  +  KPLVVT+TG+I +  D G+I +   IF+ MK+ C PN+ T N MLK Y +  +F +A++LF+++    +H   SS   S
Subjt:  -----------KMKTLSHM--KPLVVTFTGMILSSFDGGHIDDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPS

Query:  -IIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVT
         ++PD YT+ +ML+  A   +W+ F   YREM   G+  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   +   + ++  A++
Subjt:  -IIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVT

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein9.4e-10340Show/hide
Query:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGD
        I  L   L+  +I+M +W+FS+ +  + +++ +  ++R++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R+P EAL +F+ M   
Subjt:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSME
           YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+V+YNA+LNAC+   +W+G  WV  QL++ G KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVT
        VML   KY  VH  F KM++S     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L  ++K+  +++ KPLVVT
Subjt:  VMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVT

Query:  FTGMILSSFDGGHIDDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPS-IIPDEYTYGSMLEAAACALQWEYFEN
        +TG+I +  D G+I +   IF+ MK+ C PN+ T N MLK Y +  +F +A++LF+++    +H   SS   S ++PD YT+ +ML+  A   +W+ F  
Subjt:  FTGMILSSFDGGHIDDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADHSSLSSGIPS-IIPDEYTYGSMLEAAACALQWEYFEN

Query:  VYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVT
         YREM   G+  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   +   + ++  A++
Subjt:  VYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVT

AT3G04760.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.4e-2026.17Show/hide
Query:  LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVR
        L ++G    AL V+     L S +   +   YT L+    +     EAL++ + M   G + PDM  Y++I       G+ K      EGM  +  + VR
Subjt:  LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVR

Query:  NKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGN
        N   K      EPD++ YN +L A +   +W+    + T++      PN  TY + +  + + GK E   NL   MK  G T  A +Y  L+ AF  EG 
Subjt:  NKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGN

Query:  VNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGG-HIDDCISIFEYMKQKCPPNIGTINTMLKV
        ++ AIE +  M   G +     Y  +   LC NG+   AL    K+  +    P   ++  M  + +  G  I     I E M     P+  T N+M+  
Subjt:  VNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGG-HIDDCISIFEYMKQKCPPNIGTINTMLKV

Query:  YGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAG
          R  M  +A +L  +++    H       PS++    TY  +L     A + E   NV   M  +G + +++ + +L+     AG
Subjt:  YGRNDMFSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAG

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-2122.97Show/hide
Query:  KQALSVVEWVYNLKSHSH-SKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKS
        K+ +S  E V+     S  S + F Y  L+     A     AL +F+ M   G + P++  Y+++       G  K L K+ +G +   S  ++      
Subjt:  KQALSVVEWVYNLKSHSH-SKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNKYRKS

Query:  WDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIE
            +EP+L+ YN ++N        K V +V T++ + G   +  TY   ++   K G +     +  +M R G T    TY  L+ +  + GN+N A+E
Subjt:  WDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIE

Query:  AVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMKQK-CPPNIGTINTMLKVYGRNDM
         +  M  RG+  +   Y  L       G   +A   + +M   +   P VVT+  +I      G ++D I++ E MK+K   P++ + +T+L  + R+  
Subjt:  AVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMKQK-CPPNIGTINTMLKVYGRNDM

Query:  FSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPH
          +A  +  E+  K            I PD  TY S+++      + +   ++Y EM   G   D+  +  L+      G         + ++E G +P 
Subjt:  FSKAKDLFEEIKRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPH

Query:  PLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFE
         + ++ +I  L  Q    +A  L+  + Y     S+  +  L E
Subjt:  PLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFE

AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-24754.29Show/hide
Query:  PKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPAG--GLMVGRPWE
        P+FEP+ EKIKR LL+ GV PTPKI+  LRKKEIQKHNR+ KR  E +A+     +E QKQ + EE  F TLR EYK+F+++I  K  G  GLMVG PWE
Subjt:  PKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPAG--GLMVGRPWE

Query:  RLERVNLKELTD--FRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSR
         +ERV LKEL     R   +   LKKE+L+EL+K+ E    ++L+W+LDDDV++++  L+ E    D  KR R +GE +R LVDRLS REI+ + WKF R
Subjt:  RLERVNLKELTD--FRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSR

Query:  MMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL
        MM +SGLQF E Q+L+I+D LG K  WKQA +VV WVY+ K   H +SRFVYTKLL+VLG AR+PQEALQIFN M GD Q+YPDMAAYH IAVTLGQAGL
Subjt:  MMIRSGLQFNEGQLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL

Query:  LKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSG
        LK+LLKVIE MRQ+P+K  +N  +K+WDP +EPDLV+YNAILNAC+PTL+WK V WVF +LRK+GL+PNGATYGL+MEVML+SGK++ VH+ F KMK SG
Subjt:  LKQLLKVIEGMRQQPSKKVRNKYRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSG

Query:  ETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFE
        E  KA TY+VLV+A W EG +  A+EAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA+LEV +MK L + +PL +TFTG+I +S +GGH+DDC++IF+
Subjt:  ETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFE

Query:  YMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEI-KRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLV
        YMK KC PNIGT N MLKVYGRNDMFS+AK+LFEEI  RK  H         ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSG+Q+DQ+KHA +L+
Subjt:  YMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEI-KRKADHSSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLV

Query:  EASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCN-ASEATVS
        EASRAGKW LL+HAFDA+LE G+IPHPL FTE++   T + ++ +A+TL+  +  A FQ+SE +WT LFE + D +   NL KLSD L  C+  SE TVS
Subjt:  EASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAMGYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCN-ASEATVS

Query:  NLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDKFSDGSD
        NLS+SL+S C      +A   +A D       Q  G +  E++ L                       +T +E D+     +  F+E             
Subjt:  NLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVNHASLNTKVESDSKTAPWSLSFSEGVLGTDKFSDGSD

Query:  NEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTD
         E  T  L +   DD+EE          S DS+S SV +ILK W+E  K +
Subjt:  NEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTTTGAACGCAAATGCGCCAATTCCTTCACCGAAGTTCGAACCAAATACCGAGAAAATCAAGCGTACGCTCCTCCAAAAGGGTGTCTATCCCACTCCCAAAAT
CGTCCGCGCACTCCGCAAGAAAGAAATTCAGAAGCACAATCGCAAACTCAAGCGACGCGCTGAACGACAAGCCGATCAGTCGCCACCACTTTCCGAGCCCCAAAAGCAGC
TAATCGCCGAGGAAACCCATTTTCTGACGTTGAGAAGCGAGTACAAGGAGTTCTCGAAGGCCATTGAGGCAAAACCAGCCGGCGGCTTGATGGTCGGAAGGCCTTGGGAG
AGACTGGAAAGAGTTAACCTTAAAGAACTTACTGATTTTAGAACGGGATACAATCGGGATAGTCTGAAGAAGGAGAGTTTGAGAGAATTGAGGAAGCTGTTTGAGGCTCG
CAAGCTCGAGGAACTGCAGTGGATTTTAGACGACGATGTGGAACTGAAGGACGAGTGGTTAGAGAGTGAAAATGGTCATTGTGATGCCGTGAAACGGAGGCGCGGAGACG
GAGAGGTTATTCGTTTCCTTGTTGACAGGCTCAGTTCGAGGGAAATTTCCATGAGAGACTGGAAATTCTCGAGGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGT
CAGCTGCTTAGAATTTTGGATCTCCTTGGTGCTAAGGGATGTTGGAAACAGGCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAAAGTCACAGTCATTCTAAAAGCAG
ATTTGTCTATACAAAGCTCTTAGCTGTTCTTGGGATGGCAAGGAAACCTCAGGAAGCCCTTCAGATATTCAATTTGATGCGTGGAGATGGCCAGATATATCCCGATATGG
CTGCATATCACAGTATTGCTGTTACGCTGGGTCAAGCTGGTCTCTTGAAACAATTGCTAAAAGTTATTGAAGGCATGAGGCAGCAGCCATCTAAAAAAGTTAGAAACAAG
TACCGCAAATCTTGGGATCCTGCAGTTGAACCCGATCTTGTTATATATAATGCTATTTTGAACGCATGTATTCCCACCCTTGAGTGGAAAGGTGTTCACTGGGTGTTCAC
ACAATTGAGGAAGAGTGGTTTGAAACCTAATGGAGCAACATATGGACTTTCTATGGAGGTAATGCTGAAATCTGGAAAGTATGAGCACGTCCACAACCTTTTTACAAAAA
TGAAGAGGAGTGGAGAAACACTAAAGGCAAACACTTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAATGTCAATGGAGCTATTGAAGCAGTCAGGGATATGGAA
CAAAGAGGAGTAGTTGGATCTGCGAGTGTCTATTATGAACTAGCTTGTTGTCTATGCTACAATGGGAGGTGGCAAGATGCATTACTAGAGGTTGAAAAAATGAAAACACT
ATCACACATGAAACCGTTGGTGGTGACCTTCACTGGCATGATCTTATCTTCCTTTGATGGTGGACATATTGATGATTGTATATCTATTTTCGAGTACATGAAGCAAAAAT
GTCCGCCAAATATAGGGACCATAAATACCATGCTTAAAGTTTATGGCCGAAATGATATGTTTTCTAAAGCTAAAGATTTATTTGAAGAAATAAAGAGAAAAGCAGATCAT
TCCTCCCTGAGTAGCGGTATTCCTTCTATAATCCCAGATGAGTATACGTATGGCTCAATGCTTGAGGCAGCTGCTTGTGCACTTCAATGGGAATATTTTGAGAATGTATA
CAGGGAAATGGCTCTGTCAGGACACCAGCTTGATCAAAGTAAACATGCAATGCTACTTGTGGAAGCATCCAGAGCTGGGAAGTGGTATCTATTAGATCATGCTTTTGACG
CAATCTTGGAAGCTGGTCAAATTCCCCATCCGCTGTTGTTCACAGAAATGATATTGCAACTTACAACTCAAGATAACTATGACCAGGCTGTCACCTTGGTTAAAGCCATG
GGTTATGCTCCATTTCAAGTAAGTGAAAGGCAATGGACAAAACTTTTTGAAGCGAACACGGACAGGATTTGTTGGAGTAACTTGAAGAAACTGTCGGATGCTCTTAGCAA
CTGTAATGCATCAGAAGCCACGGTCTCAAACTTGTCAAGATCACTGCAGTCTCTTTGCAAACCTGGCATACCAGAAAACGCCTCCCAGTCTGTAGCCTACGACCATGGTG
CAAGAAATGGATTACAACTTCCTGGTTCTGAAAACATGGAGAATATGAAGCTTCACCCAGATCGCTTCACAGATCATTGTGACGAGTCATTGGATAAGATTCCTGTTAAC
CATGCTAGCCTGAACACGAAGGTCGAGAGTGACTCAAAGACGGCCCCTTGGTCATTGAGTTTTTCTGAAGGTGTTTTAGGGACTGACAAGTTTTCGGATGGTTCCGACAA
TGAGTTCTCAACTTTTGATTTGTGTGATGGCGGTGAAGATGATGAGGAAGAACTTAACATATTACTTGATAGATTTGGTGATTCTTATGATTCAAACTCGCCTTCTGTCA
GTGAAATACTGAAAACCTGGAAAGAAGACAGGAAAACCGATGGGTTATTTCTCCACCCTTTGAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTTTGAACGCAAATGCGCCAATTCCTTCACCGAAGTTCGAACCAAATACCGAGAAAATCAAGCGTACGCTCCTCCAAAAGGGTGTCTATCCCACTCCCAAAAT
CGTCCGCGCACTCCGCAAGAAAGAAATTCAGAAGCACAATCGCAAACTCAAGCGACGCGCTGAACGACAAGCCGATCAGTCGCCACCACTTTCCGAGCCCCAAAAGCAGC
TAATCGCCGAGGAAACCCATTTTCTGACGTTGAGAAGCGAGTACAAGGAGTTCTCGAAGGCCATTGAGGCAAAACCAGCCGGCGGCTTGATGGTCGGAAGGCCTTGGGAG
AGACTGGAAAGAGTTAACCTTAAAGAACTTACTGATTTTAGAACGGGATACAATCGGGATAGTCTGAAGAAGGAGAGTTTGAGAGAATTGAGGAAGCTGTTTGAGGCTCG
CAAGCTCGAGGAACTGCAGTGGATTTTAGACGACGATGTGGAACTGAAGGACGAGTGGTTAGAGAGTGAAAATGGTCATTGTGATGCCGTGAAACGGAGGCGCGGAGACG
GAGAGGTTATTCGTTTCCTTGTTGACAGGCTCAGTTCGAGGGAAATTTCCATGAGAGACTGGAAATTCTCGAGGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGT
CAGCTGCTTAGAATTTTGGATCTCCTTGGTGCTAAGGGATGTTGGAAACAGGCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAAAGTCACAGTCATTCTAAAAGCAG
ATTTGTCTATACAAAGCTCTTAGCTGTTCTTGGGATGGCAAGGAAACCTCAGGAAGCCCTTCAGATATTCAATTTGATGCGTGGAGATGGCCAGATATATCCCGATATGG
CTGCATATCACAGTATTGCTGTTACGCTGGGTCAAGCTGGTCTCTTGAAACAATTGCTAAAAGTTATTGAAGGCATGAGGCAGCAGCCATCTAAAAAAGTTAGAAACAAG
TACCGCAAATCTTGGGATCCTGCAGTTGAACCCGATCTTGTTATATATAATGCTATTTTGAACGCATGTATTCCCACCCTTGAGTGGAAAGGTGTTCACTGGGTGTTCAC
ACAATTGAGGAAGAGTGGTTTGAAACCTAATGGAGCAACATATGGACTTTCTATGGAGGTAATGCTGAAATCTGGAAAGTATGAGCACGTCCACAACCTTTTTACAAAAA
TGAAGAGGAGTGGAGAAACACTAAAGGCAAACACTTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAATGTCAATGGAGCTATTGAAGCAGTCAGGGATATGGAA
CAAAGAGGAGTAGTTGGATCTGCGAGTGTCTATTATGAACTAGCTTGTTGTCTATGCTACAATGGGAGGTGGCAAGATGCATTACTAGAGGTTGAAAAAATGAAAACACT
ATCACACATGAAACCGTTGGTGGTGACCTTCACTGGCATGATCTTATCTTCCTTTGATGGTGGACATATTGATGATTGTATATCTATTTTCGAGTACATGAAGCAAAAAT
GTCCGCCAAATATAGGGACCATAAATACCATGCTTAAAGTTTATGGCCGAAATGATATGTTTTCTAAAGCTAAAGATTTATTTGAAGAAATAAAGAGAAAAGCAGATCAT
TCCTCCCTGAGTAGCGGTATTCCTTCTATAATCCCAGATGAGTATACGTATGGCTCAATGCTTGAGGCAGCTGCTTGTGCACTTCAATGGGAATATTTTGAGAATGTATA
CAGGGAAATGGCTCTGTCAGGACACCAGCTTGATCAAAGTAAACATGCAATGCTACTTGTGGAAGCATCCAGAGCTGGGAAGTGGTATCTATTAGATCATGCTTTTGACG
CAATCTTGGAAGCTGGTCAAATTCCCCATCCGCTGTTGTTCACAGAAATGATATTGCAACTTACAACTCAAGATAACTATGACCAGGCTGTCACCTTGGTTAAAGCCATG
GGTTATGCTCCATTTCAAGTAAGTGAAAGGCAATGGACAAAACTTTTTGAAGCGAACACGGACAGGATTTGTTGGAGTAACTTGAAGAAACTGTCGGATGCTCTTAGCAA
CTGTAATGCATCAGAAGCCACGGTCTCAAACTTGTCAAGATCACTGCAGTCTCTTTGCAAACCTGGCATACCAGAAAACGCCTCCCAGTCTGTAGCCTACGACCATGGTG
CAAGAAATGGATTACAACTTCCTGGTTCTGAAAACATGGAGAATATGAAGCTTCACCCAGATCGCTTCACAGATCATTGTGACGAGTCATTGGATAAGATTCCTGTTAAC
CATGCTAGCCTGAACACGAAGGTCGAGAGTGACTCAAAGACGGCCCCTTGGTCATTGAGTTTTTCTGAAGGTGTTTTAGGGACTGACAAGTTTTCGGATGGTTCCGACAA
TGAGTTCTCAACTTTTGATTTGTGTGATGGCGGTGAAGATGATGAGGAAGAACTTAACATATTACTTGATAGATTTGGTGATTCTTATGATTCAAACTCGCCTTCTGTCA
GTGAAATACTGAAAACCTGGAAAGAAGACAGGAAAACCGATGGGTTATTTCTCCACCCTTTGAATTAG
Protein sequenceShow/hide protein sequence
MEALNANAPIPSPKFEPNTEKIKRTLLQKGVYPTPKIVRALRKKEIQKHNRKLKRRAERQADQSPPLSEPQKQLIAEETHFLTLRSEYKEFSKAIEAKPAGGLMVGRPWE
RLERVNLKELTDFRTGYNRDSLKKESLRELRKLFEARKLEELQWILDDDVELKDEWLESENGHCDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEG
QLLRILDLLGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARKPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIEGMRQQPSKKVRNK
YRKSWDPAVEPDLVIYNAILNACIPTLEWKGVHWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHNLFTKMKRSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDME
QRGVVGSASVYYELACCLCYNGRWQDALLEVEKMKTLSHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMKQKCPPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADH
SSLSSGIPSIIPDEYTYGSMLEAAACALQWEYFENVYREMALSGHQLDQSKHAMLLVEASRAGKWYLLDHAFDAILEAGQIPHPLLFTEMILQLTTQDNYDQAVTLVKAM
GYAPFQVSERQWTKLFEANTDRICWSNLKKLSDALSNCNASEATVSNLSRSLQSLCKPGIPENASQSVAYDHGARNGLQLPGSENMENMKLHPDRFTDHCDESLDKIPVN
HASLNTKVESDSKTAPWSLSFSEGVLGTDKFSDGSDNEFSTFDLCDGGEDDEEELNILLDRFGDSYDSNSPSVSEILKTWKEDRKTDGLFLHPLN