; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007859 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007859
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr10:16085763..16087832
RNA-Seq ExpressionHG10007859
SyntenyHG10007859
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062552.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0089.84Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MRT D AAALDNFILPSLLKACAQAS    GRELHGFA KNGFA+DVFVCNALMNMYEKCG LVSA LVFDKMPERDVVSWSTMLGCYVRSK+FGEALRL
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V EM FVGVKLSGVALIS+IGVFG LLDMKSGRAVHGYIVRNVGD KMEV LTTALI+MYCK ECL SAQRLFD LS+RSVVSWT MI GCI +CRLVEG
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
        AKNFNRMLEEK+FPNEITLLSLIT CGFV+TLDLGK  HAYLLRNGFGMSLAL TALIDMYGKCGQVGYARALFNGVE+KDVKIWSA+IS YA+VSC+DQ
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
         F LFLEMLDNEVKPNKVTMVSLLS CAE G LDLGKWTHAYI RH LEVDVILETALINMY KCGD+TIARSLFDEAT+RDIHMWNAMMAGFSMHGCG 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
        EALELFSEMESHGV+PNDITFISIFHACSHSGLV +GKKHFNRMVH FGIVPK+EHYGCLVDLLGRAG L+EAH+IIENMPMRPNTI WGALLAACKLHK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETM+HLGMKKEPGLSWIEVNGS+HHFKSGDKTCTQTT+VYEMV EMCIKLREAGYTP
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        NT+ VLLN++EEEKE ALSYHSEKLAMAFGLIS APGTPIRI+KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSCLGYW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

XP_008462708.1 PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucumis melo]0.0e+0089.7Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MRT D AAALDNFILPSLLKACAQAS    GRELHGFA KNGFA+DVFVCNALMNMYEKCG LVSA LVFDKMPERDVVSWSTMLGCYVRSK+FGEALRL
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V EM FVGVKLSGVALIS+IGVFG LLDMKSGRAVHGYI+RNVGD KMEV LTTALI+MYCK ECL SAQRLFD LS+RSVVSWT MI GCI +CRLVEG
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
        AKNFNRMLEEK+FPNEITLLSLIT CGFV+TLDLGK  HAYLLRNGFGMSLAL TALIDMYGKCGQVGYARALFNGVE+KDVKIWSA+IS YA+VSC+DQ
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
         F LFLEMLDNEVKPNKVTMVSLLS CAE G LDLGKWTHAYI RH LEVDVILETALINMY KCGD+TIARSLFDEAT+RDIHMWNAMMAGFSMHGCG 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
        EALELFSEMESHGV+PNDITFISIFHACSHSGLV +GKKHFNRMVH+FGIVPK+EHYGCLVDLLGRAG L+EAH+IIENMPMRPNTI WGALLAACKLHK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETM+HLGMKKEPGLSWIEVNGS+HHFKSGDKTCTQTT+VYEMV EMCIKLREAGYTP
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        NT+ VLLN++EEEKE ALSYHSEKLAMAFGLIS APGTPIRI+KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSCLGYW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

XP_011660280.1 pentatricopeptide repeat-containing protein At3g26782, mitochondrial [Cucumis sativus]0.0e+0087.81Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MR+ D AAALDNFILPSLLKACAQAS G  GRELHGFA KNGFA+DVFVCNALMNMYEKCG LVSARLVFD+MPERDVVSW+TMLGCYVRSK+FGEALRL
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V EM FVGVKLSGVALIS+I VFG LLDMKSGRAVHGYIVRNVGD KMEV +TTALI+MYCKG CL SAQRLFD LS+RSVVSWT MIAGCI +CRL EG
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
        AKNFNRMLEEK+FPNEITLLSLIT CGFV TLDLGK  HAYLLRNGFGMSLAL TALIDMYGKCGQVGYARALFNGV++KDVKIWS +IS YA+VSC+DQ
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
         F LF+EML+N+VKPN VTMVSLLS CAE GALDLGKWTHAYI RH LEVDVILETALINMYAKCGD+TIARSLF+EA +RDI MWN MMAGFSMHGCG 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
        EALELFSEMESHGV+PNDITF+SIFHACSHSGLV +GKK+FN+MVHDFGIVPK+EHYGCLVDLLGRAG LDEAH+IIENMPMRPNTI WGALLAACKLHK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRE M+H GMKKEPGLSWIEV+GS+HHFKSGDK CTQTT+VYEMVTEMCIKLRE+GYTP
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        NT+AVLLN++EEEKE ALSYHSEKLA AFGLIS APGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSC+GYW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

XP_023533718.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo]0.0e+0086.94Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MRTTD AAA+DNFI+PSLLKACAQAS   +GRE+HGFAVKNGF +DVFVCNALMNMYEKCGSLVSA LVFDKMP+RDVVSWSTMLGCYVRSKSFGEA RL
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V EMHFVGV+LS VALIS+IGVFGEL DMKSGRA+HGY+VRNVG+ +MEVPLTTALI+MYCKG+ L SA RLFD LSQR+VVSWT++IAGCI +CR  EG
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
        AKNF+RMLEE I PNEITLLSLIT CGFV  LDLGK LHAYLLRNGFGMSLALATALIDMYGKCGQV YARALFNGVEEKDVKIWSA+IS YA+ SCIDQ
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
        AFGLFL+MLD+EVKPNKVTMVSLLS CAEVGALDLG+WTHAYI RH LEVDV+LETALINMYAKCGDL  ARSLFDEATRRDIHMWNAMMAGFS+HGCG 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
        EALELF +ME HGV+PNDITFIS+FHACSHSGLV +G KHF+RMVH+FGIVPKIEHYGCLVDLLGRA RLD AH IIENMPMRPNTI WGALLAACKLHK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        NL LGEVAARKILELDP+NCGY VLKSNIYAS KRW DVTSVRETM+HLGMKKEPGLSWIEVNGS+HHF+SGDKTCTQT +V+EMVTEMCIKLREAGY P
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        NTSAVLLNVE+EEKE ALSYHSEKLAMAFGLIS APGTPIRI+KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHF EGYCSCLGYW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

XP_038879151.1 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benincasa hispida]0.0e+0090.42Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MRTTD AAALDNFILPSLLKACAQASCGV GRELHGFA+KNGFA DVFVCNALMNMYEKCGSLV ARLVFDKMP+RDVVSWSTMLGCYVRSKS+ EAL L
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V EMHFVGVKLSGVALIS+IG FGELLDMKSGRAVHGYIVRNV D KMEVPLTTALINMYCKGE LESAQRLFD L Q+SVVSWT MIAGCI NCRLVEG
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
        A NFNRMLEE++FPNEITLL+LIT CGFV TLDLGK  HAYLLRN FGMSLAL TALIDMYGKCGQVGYARALFNG+EEKDVKIWSA++  YA+ SCIDQ
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
        AF LFLEMLD+EVKPNKVTMV LLS CAE GAL+LGKWTH YI RH LEVDV+LETALINMYAKCGDLTIARSLFDEAT+RDIHMWNAMMAGFSMHGCG 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
        EALELFSEME +GV+PNDITFIS+FHACSHSGLV DGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAG LDEAH+IIENMPM+PNTI WGALLAACKLHK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRW DVTSVRETM+HLGMKKEPGLSWIEVNGS+HHFKSGDKTCTQTTEVYEMVTEMCIKLRE GYTP
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        NTSAVLLNVEEEEKE  LSYHSEKLAMAFGLIS APGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEG+CSCLGYW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

TrEMBL top hitse value%identityAlignment
A0A0A0LYC2 DYW_deaminase domain-containing protein0.0e+0087.81Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MR+ D AAALDNFILPSLLKACAQAS G  GRELHGFA KNGFA+DVFVCNALMNMYEKCG LVSARLVFD+MPERDVVSW+TMLGCYVRSK+FGEALRL
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V EM FVGVKLSGVALIS+I VFG LLDMKSGRAVHGYIVRNVGD KMEV +TTALI+MYCKG CL SAQRLFD LS+RSVVSWT MIAGCI +CRL EG
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
        AKNFNRMLEEK+FPNEITLLSLIT CGFV TLDLGK  HAYLLRNGFGMSLAL TALIDMYGKCGQVGYARALFNGV++KDVKIWS +IS YA+VSC+DQ
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
         F LF+EML+N+VKPN VTMVSLLS CAE GALDLGKWTHAYI RH LEVDVILETALINMYAKCGD+TIARSLF+EA +RDI MWN MMAGFSMHGCG 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
        EALELFSEMESHGV+PNDITF+SIFHACSHSGLV +GKK+FN+MVHDFGIVPK+EHYGCLVDLLGRAG LDEAH+IIENMPMRPNTI WGALLAACKLHK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRE M+H GMKKEPGLSWIEV+GS+HHFKSGDK CTQTT+VYEMVTEMCIKLRE+GYTP
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        NT+AVLLN++EEEKE ALSYHSEKLA AFGLIS APGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSC+GYW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

A0A1S3CJ58 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like0.0e+0089.7Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MRT D AAALDNFILPSLLKACAQAS    GRELHGFA KNGFA+DVFVCNALMNMYEKCG LVSA LVFDKMPERDVVSWSTMLGCYVRSK+FGEALRL
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V EM FVGVKLSGVALIS+IGVFG LLDMKSGRAVHGYI+RNVGD KMEV LTTALI+MYCK ECL SAQRLFD LS+RSVVSWT MI GCI +CRLVEG
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
        AKNFNRMLEEK+FPNEITLLSLIT CGFV+TLDLGK  HAYLLRNGFGMSLAL TALIDMYGKCGQVGYARALFNGVE+KDVKIWSA+IS YA+VSC+DQ
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
         F LFLEMLDNEVKPNKVTMVSLLS CAE G LDLGKWTHAYI RH LEVDVILETALINMY KCGD+TIARSLFDEAT+RDIHMWNAMMAGFSMHGCG 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
        EALELFSEMESHGV+PNDITFISIFHACSHSGLV +GKKHFNRMVH+FGIVPK+EHYGCLVDLLGRAG L+EAH+IIENMPMRPNTI WGALLAACKLHK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETM+HLGMKKEPGLSWIEVNGS+HHFKSGDKTCTQTT+VYEMV EMCIKLREAGYTP
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        NT+ VLLN++EEEKE ALSYHSEKLAMAFGLIS APGTPIRI+KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSCLGYW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

A0A5A7V2V9 Pentatricopeptide repeat-containing protein0.0e+0089.84Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MRT D AAALDNFILPSLLKACAQAS    GRELHGFA KNGFA+DVFVCNALMNMYEKCG LVSA LVFDKMPERDVVSWSTMLGCYVRSK+FGEALRL
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V EM FVGVKLSGVALIS+IGVFG LLDMKSGRAVHGYIVRNVGD KMEV LTTALI+MYCK ECL SAQRLFD LS+RSVVSWT MI GCI +CRLVEG
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
        AKNFNRMLEEK+FPNEITLLSLIT CGFV+TLDLGK  HAYLLRNGFGMSLAL TALIDMYGKCGQVGYARALFNGVE+KDVKIWSA+IS YA+VSC+DQ
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
         F LFLEMLDNEVKPNKVTMVSLLS CAE G LDLGKWTHAYI RH LEVDVILETALINMY KCGD+TIARSLFDEAT+RDIHMWNAMMAGFSMHGCG 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
        EALELFSEMESHGV+PNDITFISIFHACSHSGLV +GKKHFNRMVH FGIVPK+EHYGCLVDLLGRAG L+EAH+IIENMPMRPNTI WGALLAACKLHK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETM+HLGMKKEPGLSWIEVNGS+HHFKSGDKTCTQTT+VYEMV EMCIKLREAGYTP
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        NT+ VLLN++EEEKE ALSYHSEKLAMAFGLIS APGTPIRI+KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSCLGYW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

A0A6J1HA74 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like0.0e+0086.79Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MRTTD AAA+DNFI+PSLLKACAQAS   +GRE+HGFAVKNGF +DVFVCNALMNMYEKCGSLVSA LVFDKMP+RDVVSWSTMLGCYVRSKSFGEA RL
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V EMHFVGVKLS VALIS+IGVFGEL DMKSGRA+HGY+VRNVG+ ++E+PLTTALI+MYCKG+ L SA RLFD LSQR+VVSWT++IAGCI +CR VEG
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
        AKNF+RMLEE I PNEITLLSLIT CGFV  LDLGK LHAYLLRNGFGMSLALATALIDMYGKCGQV YARALFNGVEEKDVKIWSA+IS YA+ SCIDQ
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
        AF LFL+MLD+EVKPNKVTMVSLLS CAEVGALDLG+WTHAYI RH +EVDV+LETALINMYAKCGDL  AR LFDEATRRDIHMWNAMMAGFS+HGCG 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
        EALELFS+M  HGV+PNDITFIS+FHACSHSGLV +G KHF+RMVH+FGIVPKIEHYGCLVDLLGRA RLD AH IIENMPMRPNTI WGALLAACKLHK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        NLALGEVAARKILELDP+NCGY VLKSNIYAS KRW DVTSVRETM+HLGMKKEPGLSWIEVNGS+HHF+SGDKTCTQT +V+EMVTEMCIKLREAGY P
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        NTSAVLLNVE+EEKE ALSYHSEKLAMAFGLIS APGTPIRI+KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

A0A6J1JKG9 pentatricopeptide repeat-containing protein At4g21065-like0.0e+0084.76Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MRTTD AAA+DNFI+PSLLKACAQAS    GRE+HGFAVKNGF +DVFVCNALMNMYEKCGSLVSA LVFDKMP+RDVVSWSTMLGCYVRSKSFGEA RL
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V EMHFVGVKLS VALIS+IGVFGEL DMKSGRA+HGY+VRNVG  ++E+PLTTALI+MYCKG+ L SA RLF+ LSQR+VVSWT++IAGCI +CR VEG
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
        AKNF+RMLEE I PNEITLLSLIT CGFV  LDLGK LH+YLLRNGFGMSL L TALIDMYGKCGQV YARALFN V+EKDVKIWSA+IS YA+ SCIDQ
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
        AF LFL+MLD+EVKPNKVTMVSLLS CAEVGALDLG+WTHAYI  H +EVD++LETALINMYAKCGDL  ARSLFDEAT+RDIHMWNAMMAGFS+HGCG 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
        EALELFS+ME HGV+PNDITFIS+FHACSHSGLV +G KHF+RMVH+FGIVPKIEHYGCLVDLLGRA RLD AH IIENMPMRPNTI WGALLAACKLHK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        NL LG+VAARKILELDP+NCGY VLKSNIYAS KRW +VTS+RE+M+HLGMKKEPGLSW EVNGS+HHF+SGDKTCTQ  +V+EMVTEMCIKLREAGY P
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        NTSAVLLNVE+EEKE ALSYHSEKLAMAFGLIS APGTPIRI+KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic6.1e-15040.09Show/hide
Query:  DNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGVK
        + +  P L+KA A+ S    G+ LHG AVK+   +DVFV N+L++ Y  CG L SA  VF  + E+DVVSW++M+  +V+  S  +AL L  +M    VK
Subjt:  DNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGVK

Query:  LSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLEE
         S V ++ V+    ++ +++ GR V  YI  N   V + + L  A+++MY K   +E A+RLFD++ ++  V+WT+M+ G                    
Subjt:  LSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLEE

Query:  KIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEM-L
                                                          Y        AR + N + +KD+  W+A+IS Y      ++A  +F E+ L
Subjt:  KIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEM-L

Query:  DNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSEM
           +K N++T+VS LS+CA+VGAL+LG+W H+YI++H + ++  + +ALI+MY+KCGDL  +R +F+   +RD+ +W+AM+ G +MHGCG EA+++F +M
Subjt:  DNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSEM

Query:  ESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVAA
        +   VKPN +TF ++F ACSH+GLV + +  F++M  ++GIVP+ +HY C+VD+LGR+G L++A   IE MP+ P+T  WGALL ACK+H NL L E+A 
Subjt:  ESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVAA

Query:  RKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLNV
         ++LEL+P+N G  VL SNIYA   +W +V+ +R+ M   G+KKEPG S IE++G IH F SGD     + +VY  + E+  KL+  GY P  S VL  +
Subjt:  RKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLNV

Query:  EEEE-KEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        EEEE KE +L+ HSEKLA+ +GLIS      IR++KNLR+C DCH+  KL+S++Y R IIVRDR RFHHF  G CSC  +W
Subjt:  EEEE-KEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic8.8e-15740.93Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MR  D    + NF    LLK C   +    G+E+HG  VK+GF+ D+F    L NMY KC  +  AR VFD+MPERD+VSW+T++  Y ++     AL +
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V  M    +K S + ++SV+     L  +  G+ +HGY +R+  D    V ++TAL++MY K   LE+A++LFD + +R+VVSW SMI   + N    E 
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
           F +ML+E + P +++++  + AC  +  L+ G+ +H   +  G   ++++  +LI MY KC +V  A ++F  ++ + +  W+AMI  +A       
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
        A   F +M    VKP+  T VS++++ AE+      KW H  + R  L+ +V + TAL++MYAKCG + IAR +FD  + R +  WNAM+ G+  HG G 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
         ALELF EM+   +KPN +TF+S+  ACSHSGLV  G K F  M  ++ I   ++HYG +VDLLGRAGRL+EA D I  MP++P    +GA+L AC++HK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        N+   E AA ++ EL+P + GY VL +NIY +A  W  V  VR +M   G++K PG S +E+   +H F SG      + ++Y  + ++   ++EAGY P
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        +T+ V L VE + KE  LS HSEKLA++FGL++   GT I + KNLR+C DCH ATK +S + GR I+VRD  RFHHF  G CSC  YW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic5.9e-15340.38Show/hide
Query:  DNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGVK
        +++  P +LK+CA++     G+++HG  +K G   D++V  +L++MY + G L  A  VFDK P RDVVS+                             
Subjt:  DNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGVK

Query:  LSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLEE
                                                   TALI  Y     +E+AQ+LFD +  + VVSW +MI+G        E  + F  M++ 
Subjt:  LSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLEE

Query:  KIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLD
         + P+E T++++++AC    +++LG+ +H ++  +GFG +L +  ALID+Y KCG++  A  LF  +  KDV  W+ +I  Y +++   +A  LF EML 
Subjt:  KIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLD

Query:  NEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYI--RRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSE
        +   PN VTM+S+L +CA +GA+D+G+W H YI  R   +     L T+LI+MYAKCGD+  A  +F+    + +  WNAM+ GF+MHG    + +LFS 
Subjt:  NEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYI--RRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSE

Query:  MESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVA
        M   G++P+DITF+ +  ACSHSG++  G+  F  M  D+ + PK+EHYGC++DLLG +G   EA ++I  M M P+ + W +LL ACK+H N+ LGE  
Subjt:  MESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVA

Query:  ARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLN
        A  +++++P+N G  VL SNIYASA RWN+V   R  +N  GMKK PG S IE++  +H F  GDK   +  E+Y M+ EM + L +AG+ P+TS VL  
Subjt:  ARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLN

Query:  VEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        +EEE KE AL +HSEKLA+AFGLIS  PGT + IVKNLR+C +CH ATKL+SKIY R II RDR RFHHF +G CSC  YW
Subjt:  VEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226908.5e-15239.47Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        +R  +   + D +  P  L ACA++     G ++HG  VK G+A D+FV N+L++ Y +CG L SAR VFD+M ER+VVSW++M+  Y R     +A+ L
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEM-HFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVE
           M     V  + V ++ VI    +L D+++G  V+ +I RN G +++   + +AL++MY K   ++ A+RLFD     ++    +M +  +      E
Subjt:  VGEM-HFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVE

Query:  GAKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKC-------------------------------GQVG
            FN M++  + P+ I++LS I++C  +R +  GK  H Y+LRNGF     +  ALIDMY KC                               G+V 
Subjt:  GAKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKC-------------------------------GQVG

Query:  YARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLDNE-VKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGD
         A   F  + EK++  W+ +IS     S  ++A  +F  M   E V  + VTM+S+ S+C  +GALDL KW + YI ++ +++DV L T L++M+++CGD
Subjt:  YARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLDNE-VKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGD

Query:  LTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRA
           A S+F+  T RD+  W A +   +M G    A+ELF +M   G+KP+ + F+    ACSH GLV  GK+ F  M+   G+ P+  HYGC+VDLLGRA
Subjt:  LTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRA

Query:  GRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIH
        G L+EA  +IE+MPM PN + W +LLAAC++  N+ +   AA KI  L P+  G  VL SN+YASA RWND+  VR +M   G++K PG S I++ G  H
Subjt:  GRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIH

Query:  HFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTI
         F SGD++  +   +  M+ E+  +    G+ P+ S VL++V+E+EK F LS HSEKLAMA+GLIS   GT IRIVKNLR+C DCH+  K  SK+Y R I
Subjt:  HFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTI

Query:  IVRDRNRFHHFSEGYCSCLGYW
        I+RD NRFH+  +G CSC  +W
Subjt:  IVRDRNRFHHFSEGYCSCLGYW

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic2.1e-15039.21Show/hide
Query:  LDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGV
        +D++    + K+ +       G +LHGF +K+GF     V N+L+  Y K   + SAR VFD+M ERDV+SW++++  YV +    + L +  +M   G+
Subjt:  LDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGV

Query:  KLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLE
        ++    ++SV     +   +  GRAVH   V+       E      L++MY K   L+SA+ +F  +S RSVVS+TSMIAG        E  K F  M E
Subjt:  KLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLE

Query:  EKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEML
        E I P+  T+ +++  C   R LD GK +H ++  N  G  + ++ AL+DMY KCG +  A  +F+ +  KD+  W+ +I  Y+     ++A  LF  +L
Subjt:  EKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEML

Query:  DNE-VKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSE
        + +   P++ T+  +L +CA + A D G+  H YI R+    D  +  +L++MYAKCG L +A  LFD+   +D+  W  M+AG+ MHG G EA+ LF++
Subjt:  DNE-VKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSE

Query:  MESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVA
        M   G++ ++I+F+S+ +ACSHSGLV +G + FN M H+  I P +EHY C+VD+L R G L +A+  IENMP+ P+   WGALL  C++H ++ L E  
Subjt:  MESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVA

Query:  ARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLN
        A K+ EL+P+N GY VL +NIYA A++W  V  +R+ +   G++K PG SWIE+ G ++ F +GD +  +T  +   + ++  ++ E GY+P T   L++
Subjt:  ARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLN

Query:  VEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
         EE EKE AL  HSEKLAMA G+IS   G  IR+ KNLR+C DCH   K +SK+  R I++RD NRFH F +G+CSC G+W
Subjt:  VEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-15440.38Show/hide
Query:  DNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGVK
        +++  P +LK+CA++     G+++HG  +K G   D++V  +L++MY + G L  A  VFDK P RDVVS+                             
Subjt:  DNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGVK

Query:  LSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLEE
                                                   TALI  Y     +E+AQ+LFD +  + VVSW +MI+G        E  + F  M++ 
Subjt:  LSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLEE

Query:  KIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLD
         + P+E T++++++AC    +++LG+ +H ++  +GFG +L +  ALID+Y KCG++  A  LF  +  KDV  W+ +I  Y +++   +A  LF EML 
Subjt:  KIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLD

Query:  NEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYI--RRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSE
        +   PN VTM+S+L +CA +GA+D+G+W H YI  R   +     L T+LI+MYAKCGD+  A  +F+    + +  WNAM+ GF+MHG    + +LFS 
Subjt:  NEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYI--RRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSE

Query:  MESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVA
        M   G++P+DITF+ +  ACSHSG++  G+  F  M  D+ + PK+EHYGC++DLLG +G   EA ++I  M M P+ + W +LL ACK+H N+ LGE  
Subjt:  MESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVA

Query:  ARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLN
        A  +++++P+N G  VL SNIYASA RWN+V   R  +N  GMKK PG S IE++  +H F  GDK   +  E+Y M+ EM + L +AG+ P+TS VL  
Subjt:  ARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLN

Query:  VEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        +EEE KE AL +HSEKLA+AFGLIS  PGT + IVKNLR+C +CH ATKL+SKIY R II RDR RFHHF +G CSC  YW
Subjt:  VEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein6.2e-15840.93Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        MR  D    + NF    LLK C   +    G+E+HG  VK+GF+ D+F    L NMY KC  +  AR VFD+MPERD+VSW+T++  Y ++     AL +
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG
        V  M    +K S + ++SV+     L  +  G+ +HGY +R+  D    V ++TAL++MY K   LE+A++LFD + +R+VVSW SMI   + N    E 
Subjt:  VGEMHFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEG

Query:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ
           F +ML+E + P +++++  + AC  +  L+ G+ +H   +  G   ++++  +LI MY KC +V  A ++F  ++ + +  W+AMI  +A       
Subjt:  AKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQ

Query:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT
        A   F +M    VKP+  T VS++++ AE+      KW H  + R  L+ +V + TAL++MYAKCG + IAR +FD  + R +  WNAM+ G+  HG G 
Subjt:  AFGLFLEMLDNEVKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGT

Query:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK
         ALELF EM+   +KPN +TF+S+  ACSHSGLV  G K F  M  ++ I   ++HYG +VDLLGRAGRL+EA D I  MP++P    +GA+L AC++HK
Subjt:  EALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHK

Query:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP
        N+   E AA ++ EL+P + GY VL +NIY +A  W  V  VR +M   G++K PG S +E+   +H F SG      + ++Y  + ++   ++EAGY P
Subjt:  NLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTP

Query:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
        +T+ V L VE + KE  LS HSEKLA++FGL++   GT I + KNLR+C DCH ATK +S + GR I+VRD  RFHHF  G CSC  YW
Subjt:  NTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)5.1e-15239.55Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        +R  +   + D +  P  L ACA++     G ++HG  VK G+A D+FV N+L++ Y +CG L SAR VFD+M ER+VVSW++M+  Y R     +A+ L
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEM-HFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVE
           M     V  + V ++ VI    +L D+++G  V+ +I RN G +++   + +AL++MY K   ++ A+RLFD     ++    +M +  +      E
Subjt:  VGEM-HFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVE

Query:  GAKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKC-------------------------------GQVG
            FN M++  + P+ I++LS I++C  +R +  GK  H Y+LRNGF     +  ALIDMY KC                               G+V 
Subjt:  GAKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKC-------------------------------GQVG

Query:  YARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLDNE-VKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGD
         A   F  + EK++  W+ +IS     S  ++A  +F  M   E V  + VTM+S+ S+C  +GALDL KW + YI ++ +++DV L T L++M+++CGD
Subjt:  YARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLDNE-VKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGD

Query:  LTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRA
           A S+F+  T RD+  W A +   +M G    A+ELF +M   G+KP+ + F+    ACSH GLV  GK+ F  M+   G+ P+  HYGC+VDLLGRA
Subjt:  LTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRA

Query:  GRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIH
        G L+EA  +IE+MPM PN + W +LLAAC++  N+ +   AA KI  L P+  G  VL SN+YASA RWND+  VR +M   G++K PG S I++ G  H
Subjt:  GRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIH

Query:  HFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTI
         F SGD++  +   +  M+ E+  +    G+ P+ S VL++V+E+EK F LS HSEKLAMA+GLIS   GT IRIVKNLR+C DCH+  K  SK+Y R I
Subjt:  HFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTI

Query:  IVRDRNRFHHFSEGYCSC
        I+RD NRFH+  +G CSC
Subjt:  IVRDRNRFHHFSEGYCSC

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification6.0e-15339.47Show/hide
Query:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL
        +R  +   + D +  P  L ACA++     G ++HG  VK G+A D+FV N+L++ Y +CG L SAR VFD+M ER+VVSW++M+  Y R     +A+ L
Subjt:  MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRL

Query:  VGEM-HFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVE
           M     V  + V ++ VI    +L D+++G  V+ +I RN G +++   + +AL++MY K   ++ A+RLFD     ++    +M +  +      E
Subjt:  VGEM-HFVGVKLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVE

Query:  GAKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKC-------------------------------GQVG
            FN M++  + P+ I++LS I++C  +R +  GK  H Y+LRNGF     +  ALIDMY KC                               G+V 
Subjt:  GAKNFNRMLEEKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKC-------------------------------GQVG

Query:  YARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLDNE-VKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGD
         A   F  + EK++  W+ +IS     S  ++A  +F  M   E V  + VTM+S+ S+C  +GALDL KW + YI ++ +++DV L T L++M+++CGD
Subjt:  YARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLDNE-VKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGD

Query:  LTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRA
           A S+F+  T RD+  W A +   +M G    A+ELF +M   G+KP+ + F+    ACSH GLV  GK+ F  M+   G+ P+  HYGC+VDLLGRA
Subjt:  LTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRA

Query:  GRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIH
        G L+EA  +IE+MPM PN + W +LLAAC++  N+ +   AA KI  L P+  G  VL SN+YASA RWND+  VR +M   G++K PG S I++ G  H
Subjt:  GRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIH

Query:  HFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTI
         F SGD++  +   +  M+ E+  +    G+ P+ S VL++V+E+EK F LS HSEKLAMA+GLIS   GT IRIVKNLR+C DCH+  K  SK+Y R I
Subjt:  HFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTI

Query:  IVRDRNRFHHFSEGYCSCLGYW
        I+RD NRFH+  +G CSC  +W
Subjt:  IVRDRNRFHHFSEGYCSCLGYW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-15139.21Show/hide
Query:  LDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGV
        +D++    + K+ +       G +LHGF +K+GF     V N+L+  Y K   + SAR VFD+M ERDV+SW++++  YV +    + L +  +M   G+
Subjt:  LDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGV

Query:  KLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLE
        ++    ++SV     +   +  GRAVH   V+       E      L++MY K   L+SA+ +F  +S RSVVS+TSMIAG        E  K F  M E
Subjt:  KLSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLE

Query:  EKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEML
        E I P+  T+ +++  C   R LD GK +H ++  N  G  + ++ AL+DMY KCG +  A  +F+ +  KD+  W+ +I  Y+     ++A  LF  +L
Subjt:  EKIFPNEITLLSLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEML

Query:  DNE-VKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSE
        + +   P++ T+  +L +CA + A D G+  H YI R+    D  +  +L++MYAKCG L +A  LFD+   +D+  W  M+AG+ MHG G EA+ LF++
Subjt:  DNE-VKPNKVTMVSLLSSCAEVGALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSE

Query:  MESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVA
        M   G++ ++I+F+S+ +ACSHSGLV +G + FN M H+  I P +EHY C+VD+L R G L +A+  IENMP+ P+   WGALL  C++H ++ L E  
Subjt:  MESHGVKPNDITFISIFHACSHSGLVADGKKHFNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVA

Query:  ARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLN
        A K+ EL+P+N GY VL +NIYA A++W  V  +R+ +   G++K PG SWIE+ G ++ F +GD +  +T  +   + ++  ++ E GY+P T   L++
Subjt:  ARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLGMKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLN

Query:  VEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
         EE EKE AL  HSEKLAMA G+IS   G  IR+ KNLR+C DCH   K +SK+  R I++RD NRFH F +G+CSC G+W
Subjt:  VEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAACGACTGATCCTGCTGCTGCACTTGACAACTTCATTTTGCCTTCACTTCTCAAAGCCTGTGCTCAGGCTTCCTGTGGGGTTTGGGGCAGGGAACTCCATGGTTT
CGCCGTTAAGAACGGGTTTGCCACAGATGTTTTTGTGTGCAACGCTTTGATGAACATGTATGAGAAATGTGGGAGCTTGGTTTCTGCTCGCTTGGTGTTTGACAAAATGC
CCGAGAGAGATGTTGTCTCTTGGAGTACTATGCTCGGGTGCTATGTGCGAAGCAAATCTTTCGGTGAAGCGCTTCGACTCGTAGGAGAGATGCATTTTGTGGGAGTGAAG
CTTAGTGGTGTTGCTTTGATTAGCGTGATTGGTGTATTTGGAGAACTCTTGGATATGAAGTCGGGGAGGGCGGTTCATGGTTACATTGTGAGAAATGTTGGTGATGTGAA
GATGGAAGTTCCTTTGACAACTGCGTTGATCAATATGTATTGCAAGGGTGAGTGTTTGGAATCAGCACAGAGGCTTTTTGACAGTTTATCTCAGAGAAGTGTTGTTTCTT
GGACGTCAATGATAGCAGGTTGTATTCACAATTGCAGATTAGTTGAAGGGGCAAAGAACTTTAATAGAATGCTTGAAGAAAAAATATTTCCTAATGAGATTACATTGCTC
AGTTTGATTACAGCATGTGGTTTTGTGAGAACCTTGGATTTGGGCAAATGTTTGCATGCGTATCTGTTAAGAAATGGGTTTGGTATGTCTCTGGCTTTGGCCACTGCTCT
CATAGACATGTATGGAAAATGTGGGCAAGTTGGATATGCCAGAGCTCTTTTCAATGGTGTTGAGGAAAAAGATGTCAAGATTTGGAGTGCTATGATATCCACTTATGCAT
ACGTGAGTTGCATCGATCAAGCTTTTGGCCTCTTCCTTGAGATGTTAGACAATGAAGTGAAACCAAACAAGGTGACAATGGTTAGCCTTCTTTCTTCATGTGCAGAAGTT
GGAGCCCTTGACCTTGGCAAGTGGACTCATGCTTACATACGTCGTCACGACCTTGAAGTAGACGTCATTCTAGAAACAGCTCTCATCAACATGTATGCCAAATGTGGAGA
TCTAACAATTGCTCGTAGCCTGTTCGATGAAGCCACACGACGGGATATTCACATGTGGAATGCAATGATGGCTGGATTCTCGATGCATGGTTGTGGAACAGAAGCTTTGG
AACTCTTTTCAGAGATGGAGAGCCATGGTGTTAAACCTAATGATATCACATTCATTTCTATTTTTCACGCTTGCAGTCATTCGGGATTGGTAGCAGATGGGAAAAAGCAT
TTCAACAGAATGGTTCACGACTTTGGAATTGTTCCAAAGATTGAGCACTATGGATGCTTGGTGGATCTTCTCGGTCGAGCTGGACGTCTTGACGAAGCTCACGACATCAT
TGAAAACATGCCCATGAGGCCTAACACAATTACATGGGGTGCTCTGCTTGCTGCATGTAAACTACACAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTG
AATTGGACCCACAAAATTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTAGAGAAACAATGAACCATTTAGGG
ATGAAGAAAGAACCAGGACTCAGCTGGATAGAAGTAAATGGTTCAATCCACCACTTCAAATCTGGGGATAAAACATGCACACAAACAACAGAAGTATATGAAATGGTGAC
TGAAATGTGCATCAAGTTGAGAGAGGCAGGATACACTCCAAACACATCAGCAGTTTTGTTAAATGTAGAAGAGGAAGAGAAGGAATTTGCACTCAGTTACCATAGTGAGA
AACTGGCCATGGCATTTGGACTCATAAGCATGGCCCCTGGTACACCCATCCGAATCGTTAAGAATCTGAGGATTTGCGATGATTGTCATGCTGCAACAAAGCTATTATCA
AAGATCTATGGACGGACAATAATAGTTAGAGATCGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTCTGGGTTATTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGAACGACTGATCCTGCTGCTGCACTTGACAACTTCATTTTGCCTTCACTTCTCAAAGCCTGTGCTCAGGCTTCCTGTGGGGTTTGGGGCAGGGAACTCCATGGTTT
CGCCGTTAAGAACGGGTTTGCCACAGATGTTTTTGTGTGCAACGCTTTGATGAACATGTATGAGAAATGTGGGAGCTTGGTTTCTGCTCGCTTGGTGTTTGACAAAATGC
CCGAGAGAGATGTTGTCTCTTGGAGTACTATGCTCGGGTGCTATGTGCGAAGCAAATCTTTCGGTGAAGCGCTTCGACTCGTAGGAGAGATGCATTTTGTGGGAGTGAAG
CTTAGTGGTGTTGCTTTGATTAGCGTGATTGGTGTATTTGGAGAACTCTTGGATATGAAGTCGGGGAGGGCGGTTCATGGTTACATTGTGAGAAATGTTGGTGATGTGAA
GATGGAAGTTCCTTTGACAACTGCGTTGATCAATATGTATTGCAAGGGTGAGTGTTTGGAATCAGCACAGAGGCTTTTTGACAGTTTATCTCAGAGAAGTGTTGTTTCTT
GGACGTCAATGATAGCAGGTTGTATTCACAATTGCAGATTAGTTGAAGGGGCAAAGAACTTTAATAGAATGCTTGAAGAAAAAATATTTCCTAATGAGATTACATTGCTC
AGTTTGATTACAGCATGTGGTTTTGTGAGAACCTTGGATTTGGGCAAATGTTTGCATGCGTATCTGTTAAGAAATGGGTTTGGTATGTCTCTGGCTTTGGCCACTGCTCT
CATAGACATGTATGGAAAATGTGGGCAAGTTGGATATGCCAGAGCTCTTTTCAATGGTGTTGAGGAAAAAGATGTCAAGATTTGGAGTGCTATGATATCCACTTATGCAT
ACGTGAGTTGCATCGATCAAGCTTTTGGCCTCTTCCTTGAGATGTTAGACAATGAAGTGAAACCAAACAAGGTGACAATGGTTAGCCTTCTTTCTTCATGTGCAGAAGTT
GGAGCCCTTGACCTTGGCAAGTGGACTCATGCTTACATACGTCGTCACGACCTTGAAGTAGACGTCATTCTAGAAACAGCTCTCATCAACATGTATGCCAAATGTGGAGA
TCTAACAATTGCTCGTAGCCTGTTCGATGAAGCCACACGACGGGATATTCACATGTGGAATGCAATGATGGCTGGATTCTCGATGCATGGTTGTGGAACAGAAGCTTTGG
AACTCTTTTCAGAGATGGAGAGCCATGGTGTTAAACCTAATGATATCACATTCATTTCTATTTTTCACGCTTGCAGTCATTCGGGATTGGTAGCAGATGGGAAAAAGCAT
TTCAACAGAATGGTTCACGACTTTGGAATTGTTCCAAAGATTGAGCACTATGGATGCTTGGTGGATCTTCTCGGTCGAGCTGGACGTCTTGACGAAGCTCACGACATCAT
TGAAAACATGCCCATGAGGCCTAACACAATTACATGGGGTGCTCTGCTTGCTGCATGTAAACTACACAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTG
AATTGGACCCACAAAATTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTAGAGAAACAATGAACCATTTAGGG
ATGAAGAAAGAACCAGGACTCAGCTGGATAGAAGTAAATGGTTCAATCCACCACTTCAAATCTGGGGATAAAACATGCACACAAACAACAGAAGTATATGAAATGGTGAC
TGAAATGTGCATCAAGTTGAGAGAGGCAGGATACACTCCAAACACATCAGCAGTTTTGTTAAATGTAGAAGAGGAAGAGAAGGAATTTGCACTCAGTTACCATAGTGAGA
AACTGGCCATGGCATTTGGACTCATAAGCATGGCCCCTGGTACACCCATCCGAATCGTTAAGAATCTGAGGATTTGCGATGATTGTCATGCTGCAACAAAGCTATTATCA
AAGATCTATGGACGGACAATAATAGTTAGAGATCGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTCTGGGTTATTGGTAA
Protein sequenceShow/hide protein sequence
MRTTDPAAALDNFILPSLLKACAQASCGVWGRELHGFAVKNGFATDVFVCNALMNMYEKCGSLVSARLVFDKMPERDVVSWSTMLGCYVRSKSFGEALRLVGEMHFVGVK
LSGVALISVIGVFGELLDMKSGRAVHGYIVRNVGDVKMEVPLTTALINMYCKGECLESAQRLFDSLSQRSVVSWTSMIAGCIHNCRLVEGAKNFNRMLEEKIFPNEITLL
SLITACGFVRTLDLGKCLHAYLLRNGFGMSLALATALIDMYGKCGQVGYARALFNGVEEKDVKIWSAMISTYAYVSCIDQAFGLFLEMLDNEVKPNKVTMVSLLSSCAEV
GALDLGKWTHAYIRRHDLEVDVILETALINMYAKCGDLTIARSLFDEATRRDIHMWNAMMAGFSMHGCGTEALELFSEMESHGVKPNDITFISIFHACSHSGLVADGKKH
FNRMVHDFGIVPKIEHYGCLVDLLGRAGRLDEAHDIIENMPMRPNTITWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMNHLG
MKKEPGLSWIEVNGSIHHFKSGDKTCTQTTEVYEMVTEMCIKLREAGYTPNTSAVLLNVEEEEKEFALSYHSEKLAMAFGLISMAPGTPIRIVKNLRICDDCHAATKLLS
KIYGRTIIVRDRNRFHHFSEGYCSCLGYW