; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010138 (gene) of Snake gourd v1 genome

Gene IDTan0010138
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein DOT4
Genome locationLG05:76489696..76492811
RNA-Seq ExpressionTan0010138
SyntenyTan0010138
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057054.1 pentatricopeptide repeat-containing protein DOT4 [Cucumis melo var. makuwa]0.0e+0088.24Show/hide
Query:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL
        MLL AKAP TFW+SPA HDH G +NL+FR SF+F KPNSK SFS+ AYA      PA+ETKSY+DVE ++S KIVEFCEVGDLKNAMELLCSSQNSN DL
Subjt:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL

Query:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN
        + +CSILQLCAE+KSIRDGRRVHSIIES+GV +DGILG KLVFMYVKCG+LKEGRMIFDKLSE KVF+WNLMISEY G+GNYGESINLFK+MLELGI PN
Subjt:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN

Query:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV
        SYTFSSVLKCFAAV+ VEEGRQVHGLI+KLG+NSY TVVNSLISFYF+GRKVR AQKLFDEL DRDVISWNSMISGYVKNGL+D GIEIFI+MLVFGV +
Subjt:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV

Query:  DLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV
        DLATMVNVLVACAN GTLL GK LHSYSIK A LDREV FNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMI GYVREGLSDGAIKLFDEMKSRGV
Subjt:  DLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV

Query:  VPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRES
        VPDVYAVTSILHACAINGNL SG+IVH YIRENNLETNSFVSNAL DMYAKCGSMKDAH +FSH+K+KDVISWNTM+GGYSKNR PNEAL LF EMQ ES
Subjt:  VPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRES

Query:  KPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAG
        KPDGTTVACILPACASLAALDRGREIHGYALRNG SEDKYVVNAL+DMYVKCGLLVLARS FDMI +KDLVSWTVMIAGYGMHGFGSEA+NTF+QMR+ G
Subjt:  KPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAG

Query:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
        I+PDEVSFISILYACSHSGLLDEGWK F+IMKKECQIEP LEHYACMVDLLARTGNLVKA KFI+TMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
Subjt:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF

Query:  ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADERE
        ELEPENTGYYVLLANIYAEAEKWEEVQKLRK+IGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIE LLK+LRSKMKEEGYSPKT +ALLNADERE
Subjt:  ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADERE

Query:  KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSK+ +REI+LRDSSRFHHFKDG CSCRG+W
Subjt:  KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

XP_011657608.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus]0.0e+0088.36Show/hide
Query:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL
        MLL AKAP TFW+SPA +DH G +NL+FR SF+FV P+SK SFS+ AYA      PALETKSY+DVE ++S KIVEFCEVGDLKNAMELLCSSQNSN DL
Subjt:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL

Query:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN
          YCSILQLCAE+KSIRDGRRV SIIES+GV +DGILG KLVFMYVKCG+LKEGRM+FDKLSE K+FLWNLMISEYSGSGNYGESINLFK+MLELGI PN
Subjt:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN

Query:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV
        SYTFSS+LKCFAAV+RVEEGRQVHGLI KLGFNSY TVVNSLISFYF+GRKVR AQKLFDEL DRDVISWNSMISGYVKNGL+D GIEIFI+MLVFGV +
Subjt:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV

Query:  DLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV
        DLATMVNV VACAN+GTLLLGK LHSYSIK A LDREV FNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMI GYVREGLSDGAIKLFDEMKSRGV
Subjt:  DLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV

Query:  VPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRES
        VPDVYAVTSIL+ACAINGNL SGKIVH YIRENNLETNSFVSNAL DMYAKCGSMKDAH +FSH+K+KDVISWNTM+GGY+KN  PNEAL LF EMQRES
Subjt:  VPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRES

Query:  KPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAG
        KPDGTTVACILPACASLAALD+GREIHGYALRNG SEDKYV NA+VDMYVKCGLLVLARSLFDMIP+KDLVSWTVMIAGYGMHG+GSEA+NTF+QMR+ G
Subjt:  KPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAG

Query:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
        IEPDEVSFISILYACSHSGLLDEGWK FNIMKKECQIEP LEHYACMVDLLARTGNLVKA KFI+ MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
Subjt:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF

Query:  ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADERE
        ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKG++NIFVAGDCSKPQAKKIE LLK+LRSKMKEEGYSPKT +ALLNADERE
Subjt:  ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADERE

Query:  KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSK+ +REI+LRDSSRFHHFKDG CSCRGYW
Subjt:  KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

XP_022139839.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Momordica charantia]0.0e+0089.17Show/hide
Query:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL
        MLL+AK+PPTFW+SP  HD HGL+NL+F HSFVF KP SKFSFS SAYACT+IY    +TKSYLD+E +NS +IVEFCEVGDLKNAMELLCSS N+NLDL
Subjt:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL

Query:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN
        ETYCS+LQLCAE+KSIR G+RVHSIIESNGV +DGILGAKLVFMYVKCG+LKE RMIFDKLSE+KVFLWNLMISEY+G+GNY ES+NLFK M+ELGI PN
Subjt:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN

Query:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV
        SYTFSSVLKC AAV+RVE+GR VHG I KLGF+SY TVVNSLISFYF+ +KVRSAQKLFDEL DRDVISWNSMISGYVKNGLED GIEIFI+ML F V V
Subjt:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV

Query:  DLATMVNVLVACANMGTLLLGKALHSYSIKVA-VLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG
        DLATMVNVLVACAN GTLLLGKALHSY+IK A  LDREVMF NTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAI+LFDEMKSRG
Subjt:  DLATMVNVLVACANMGTLLLGKALHSYSIKVA-VLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG

Query:  VVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRE
        VVPDVYAVTSILHACAINGNL+SGKIVH+YIR+NNLETNSFVSNALMDMYAKCGSMKDA S+FSH+K KDVISWNTM+GGYSKN  PNEALNLF EMQRE
Subjt:  VVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRE

Query:  SKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIA
        SKPDGTTVACILPACASLAALDRGREIHGYALRNG S+DKYVVNALVDMYVKCGLLVLARSLFDMI DKDLVSWTVMIAGYGMHGFG+EAV+ F+QMRI+
Subjt:  SKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIA

Query:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
        G+EPDEVSFISILYACSHSGLLDEGWKFFNIMKKEC+IEPKLEHYACMVDLLARTGNLVKA KFI+TMPI+PDATIWGALLCGCRIHHDVK+AEKVAERI
Subjt:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI

Query:  FELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADER
        FELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIE LLKKLRSKMKEEGYSPKTR+ALLNADER
Subjt:  FELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADER

Query:  EKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        EKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSK TTREIVLRDS+RFHHFKDGYCSCRGYW
Subjt:  EKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

XP_023519235.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0088.72Show/hide
Query:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTP-ALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLD
        M L+AKAPPTFW+  A +DH GL+NL+FR S  FVKPNS+ SFS SAYA TE YTP ALE K Y DVE NNS KIV+FCEVGDLKNA+ELLCSSQNSNLD
Subjt:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTP-ALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLD

Query:  LETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINP
        L+TYC ILQLCAEQKSIRDGRRVHSIIESN V +DGILGAKLVFMYVKCG+L+EGRMIFDKLSE+KVFLWNL+ISEYSGSGNYGESINLFK MLELGINP
Subjt:  LETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINP

Query:  NSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVG
        NSYTFSSVLKCFAAV+RVEEGRQVHGLI KLGF SY  VVNSLISFYF+GRKVR A+KLFDE+ DRDVISWNSMISGYVKNGLED GIEIF+RMLVF V 
Subjt:  NSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVG

Query:  VDLATMVNVLVACANMGTLLLGKALHSYSIK-VAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSR
        VDLATMVNVLVACANMGTL LGK LHSYSIK  AVLDR+VMFNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMIAGYVREGLSDGAI+LF+EMKSR
Subjt:  VDLATMVNVLVACANMGTLLLGKALHSYSIK-VAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSR

Query:  GVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQR
        GV+PDVYAV SILHACAINGNLNSGK +H+YIRENNLETNSFVSNALMDMYAKCGSMKDA  +FSH+KRKDVISWNTM+GGYSKNR PNEAL+LF EMQR
Subjt:  GVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQR

Query:  ESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRI
        ESKPDGTTVACILPACASLAALD+GREIHGYALRNG S+DK+V NALVDMYVKCGLLVLAR LFDMI +KDLVSWTVMIAGYGMHG+GSEAVN F+QMR 
Subjt:  ESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRI

Query:  AGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAER
        AGIEPDEVSFISILYACSHSGLLDEGW FFNIMKKECQIEP LEHYACMVDLLARTGNL +A KFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAER
Subjt:  AGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAER

Query:  IFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADE
        IFELEPENTGYYVLLANIYAEAEKWEEVQKLR +IG+RGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIE LL +LRSKMKEEGYSPKTR+ALLNADE
Subjt:  IFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADE

Query:  REKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        REKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMA+FMSK T REIVLRDSSRFHHFKDGYCSCRGYW
Subjt:  REKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

XP_038893908.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Benincasa hispida]0.0e+0089.52Show/hide
Query:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALET--KSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNL
        MLL+AK P TFW+SP  +DH GL++L+FR SFVFVKPNSKFSFS SA+ACTE YTPALET  KSY+DVE +NS KIVEFCE+GDLKNAMELLC SQNS  
Subjt:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALET--KSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNL

Query:  DLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGIN
        DL+TYCSILQLCAEQKSIRDGRRVHSIIESNGV +DGILG KLVFMYVKCG+LKEGR+IFDKLSE KVFLWNLMISEYSG+GNYGESINLFK+MLELGI 
Subjt:  DLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGIN

Query:  PNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGV
        PNSYTFSSVLKC AAV+RVEEGRQVHGLI KLGFNSY TVVNSLISFYF+ RKVR AQKLFDEL DRDVISWNSMISGYVKNGLED GIEIFI+ML F +
Subjt:  PNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGV

Query:  GVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSR
          DLATMVNVLVACANMGTLLLGKALHSY+IK A L++EVMFNNTLLDMYSKCG LNSAIRVFE+MDEKTVVSWTSMI GYVREGLSDGAIKLFDEMKS+
Subjt:  GVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSR

Query:  GVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQR
        G++PDVYAVTSILHACAINGNLNSGKIVH+YIREN LETNSFVSNALMDMYAK GSMKDAH +FSH+KRKDVISWNTM+GGYSKNR PNEALNLF EMQR
Subjt:  GVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQR

Query:  ESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRI
        E KPD TTVACILPACASLAALDRGREIHGYALRNG S+DKYVVNALVDMYVKCGLLVLARSLFDMI +KDLVSWTVMIAGYGMHGFGSEA+NTF+QMRI
Subjt:  ESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRI

Query:  AGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAER
        AGIEPDEVSFISILYACSHSGLLDEGWKF+NIMKKECQIEP LEHYACMVDLLARTGNLVKA KFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE+
Subjt:  AGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAER

Query:  IFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADE
        IFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIE LLK+LRSKMKEEGYSPKTR+ALLNADE
Subjt:  IFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADE

Query:  REKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        REKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSK+ +REI+LRDSSRFH+FKDG CSCRGYW
Subjt:  REKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

TrEMBL top hitse value%identityAlignment
A0A0A0LV30 DYW_deaminase domain-containing protein0.0e+0088.36Show/hide
Query:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL
        MLL AKAP TFW+SPA +DH G +NL+FR SF+FV P+SK SFS+ AYA      PALETKSY+DVE ++S KIVEFCEVGDLKNAMELLCSSQNSN DL
Subjt:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL

Query:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN
          YCSILQLCAE+KSIRDGRRV SIIES+GV +DGILG KLVFMYVKCG+LKEGRM+FDKLSE K+FLWNLMISEYSGSGNYGESINLFK+MLELGI PN
Subjt:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN

Query:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV
        SYTFSS+LKCFAAV+RVEEGRQVHGLI KLGFNSY TVVNSLISFYF+GRKVR AQKLFDEL DRDVISWNSMISGYVKNGL+D GIEIFI+MLVFGV +
Subjt:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV

Query:  DLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV
        DLATMVNV VACAN+GTLLLGK LHSYSIK A LDREV FNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMI GYVREGLSDGAIKLFDEMKSRGV
Subjt:  DLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV

Query:  VPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRES
        VPDVYAVTSIL+ACAINGNL SGKIVH YIRENNLETNSFVSNAL DMYAKCGSMKDAH +FSH+K+KDVISWNTM+GGY+KN  PNEAL LF EMQRES
Subjt:  VPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRES

Query:  KPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAG
        KPDGTTVACILPACASLAALD+GREIHGYALRNG SEDKYV NA+VDMYVKCGLLVLARSLFDMIP+KDLVSWTVMIAGYGMHG+GSEA+NTF+QMR+ G
Subjt:  KPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAG

Query:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
        IEPDEVSFISILYACSHSGLLDEGWK FNIMKKECQIEP LEHYACMVDLLARTGNLVKA KFI+ MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
Subjt:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF

Query:  ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADERE
        ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKG++NIFVAGDCSKPQAKKIE LLK+LRSKMKEEGYSPKT +ALLNADERE
Subjt:  ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADERE

Query:  KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSK+ +REI+LRDSSRFHHFKDG CSCRGYW
Subjt:  KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

A0A1S3B857 pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0088.13Show/hide
Query:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL
        MLL AKAP TFW+SPA HDH G +NL+FR SF+F KPNSK SFS+ AYA      PA+ETKSY+DVE ++S KIVEFCEVGDLKNAMELLCSSQNSN DL
Subjt:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL

Query:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN
        + +CSILQLCAE+KSIRDGRRVHSIIES+GV +DGILG KLVFMYVKCG+LKEGRMIFDKLSE KVF+WNLMISEY G+GNYGESINLFK+MLELGI PN
Subjt:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN

Query:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV
        SYTFSSVLKCFAAV+ VEEGRQVHGLI+KLG+NSY TVVNSLISFYF+GRKVR AQKLFDEL DRDVISWNSMISGYVKNGL+D GIEIFI+MLVFGV +
Subjt:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV

Query:  DLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV
        DLATMVNVLVACAN GTLL GK LHSYSIK A LDREV FNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMI GYVREGLSDGAIKLFDEMKSRGV
Subjt:  DLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV

Query:  VPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRES
        VPDVYAVTSILHACAINGNL SG+IVH YIRENNLETNSFVSNAL DMYAKCGSMKDAH +FSH+K+KDVISWNTM+GGYSKNR PNEAL LF EMQ ES
Subjt:  VPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRES

Query:  KPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAG
        KPDGTTVACILPACASLAALDRGREIHGYALRNG SEDKYVVNAL+DMYVKCGLLVLARS FDMI +KDLVSWTVMIAGYGMHGFGSEA+NTF+QMR+ G
Subjt:  KPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAG

Query:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
        I+PDEVSFISILYACSHSGLLDEGWK F+IMKKECQIEP LEHYACMVDLLARTGNLVKA KFI+TMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
Subjt:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF

Query:  ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADERE
        ELEPENTGYYVLLANIYAEAEKWEEVQKLRK+IGQRGLKKNPGC+WIEIKG+VNIFVAGDCSKPQAKKIE LLK+LRSKMKEEGYSPKT +ALLNADERE
Subjt:  ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADERE

Query:  KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSK+ +REI+LRDSSRFHHFKDG CSCRG+W
Subjt:  KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

A0A5A7UPC9 Pentatricopeptide repeat-containing protein DOT40.0e+0088.24Show/hide
Query:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL
        MLL AKAP TFW+SPA HDH G +NL+FR SF+F KPNSK SFS+ AYA      PA+ETKSY+DVE ++S KIVEFCEVGDLKNAMELLCSSQNSN DL
Subjt:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL

Query:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN
        + +CSILQLCAE+KSIRDGRRVHSIIES+GV +DGILG KLVFMYVKCG+LKEGRMIFDKLSE KVF+WNLMISEY G+GNYGESINLFK+MLELGI PN
Subjt:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN

Query:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV
        SYTFSSVLKCFAAV+ VEEGRQVHGLI+KLG+NSY TVVNSLISFYF+GRKVR AQKLFDEL DRDVISWNSMISGYVKNGL+D GIEIFI+MLVFGV +
Subjt:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV

Query:  DLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV
        DLATMVNVLVACAN GTLL GK LHSYSIK A LDREV FNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMI GYVREGLSDGAIKLFDEMKSRGV
Subjt:  DLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV

Query:  VPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRES
        VPDVYAVTSILHACAINGNL SG+IVH YIRENNLETNSFVSNAL DMYAKCGSMKDAH +FSH+K+KDVISWNTM+GGYSKNR PNEAL LF EMQ ES
Subjt:  VPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRES

Query:  KPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAG
        KPDGTTVACILPACASLAALDRGREIHGYALRNG SEDKYVVNAL+DMYVKCGLLVLARS FDMI +KDLVSWTVMIAGYGMHGFGSEA+NTF+QMR+ G
Subjt:  KPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAG

Query:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
        I+PDEVSFISILYACSHSGLLDEGWK F+IMKKECQIEP LEHYACMVDLLARTGNLVKA KFI+TMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
Subjt:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF

Query:  ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADERE
        ELEPENTGYYVLLANIYAEAEKWEEVQKLRK+IGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIE LLK+LRSKMKEEGYSPKT +ALLNADERE
Subjt:  ELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADERE

Query:  KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSK+ +REI+LRDSSRFHHFKDG CSCRG+W
Subjt:  KEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

A0A6J1CE34 pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0089.17Show/hide
Query:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL
        MLL+AK+PPTFW+SP  HD HGL+NL+F HSFVF KP SKFSFS SAYACT+IY    +TKSYLD+E +NS +IVEFCEVGDLKNAMELLCSS N+NLDL
Subjt:  MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL

Query:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN
        ETYCS+LQLCAE+KSIR G+RVHSIIESNGV +DGILGAKLVFMYVKCG+LKE RMIFDKLSE+KVFLWNLMISEY+G+GNY ES+NLFK M+ELGI PN
Subjt:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN

Query:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV
        SYTFSSVLKC AAV+RVE+GR VHG I KLGF+SY TVVNSLISFYF+ +KVRSAQKLFDEL DRDVISWNSMISGYVKNGLED GIEIFI+ML F V V
Subjt:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV

Query:  DLATMVNVLVACANMGTLLLGKALHSYSIKVA-VLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG
        DLATMVNVLVACAN GTLLLGKALHSY+IK A  LDREVMF NTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAI+LFDEMKSRG
Subjt:  DLATMVNVLVACANMGTLLLGKALHSYSIKVA-VLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG

Query:  VVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRE
        VVPDVYAVTSILHACAINGNL+SGKIVH+YIR+NNLETNSFVSNALMDMYAKCGSMKDA S+FSH+K KDVISWNTM+GGYSKN  PNEALNLF EMQRE
Subjt:  VVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRE

Query:  SKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIA
        SKPDGTTVACILPACASLAALDRGREIHGYALRNG S+DKYVVNALVDMYVKCGLLVLARSLFDMI DKDLVSWTVMIAGYGMHGFG+EAV+ F+QMRI+
Subjt:  SKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIA

Query:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
        G+EPDEVSFISILYACSHSGLLDEGWKFFNIMKKEC+IEPKLEHYACMVDLLARTGNLVKA KFI+TMPI+PDATIWGALLCGCRIHHDVK+AEKVAERI
Subjt:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI

Query:  FELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADER
        FELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIE LLKKLRSKMKEEGYSPKTR+ALLNADER
Subjt:  FELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADER

Query:  EKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        EKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSK TTREIVLRDS+RFHHFKDGYCSCRGYW
Subjt:  EKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

A0A6J1KNK7 pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0088.81Show/hide
Query:  LLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTP-ALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL
        LL+AKAPPTFW+S A +DH GL+NL+FR S  FVKPNS+ SFS SAYA TE YTP ALE K+Y+D E NNS KIV+FCEVGDLKNA+ELLCSSQNSNLDL
Subjt:  LLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTP-ALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDL

Query:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN
        +TYC ILQLCAEQKSIRDGRRVHSIIESN V +DGILGAKLVFMYVKCG+L+EGRMIFDKLSE+KVFLWNLMISEYSGSGNYGESINLFK MLELGINPN
Subjt:  ETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPN

Query:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV
        SYTFSSVLKCFAAV RVEEGRQVHGLI KLGF SY  VVNSLISFYF+GRKVRSA+KLFDE+ DRDVISWNSMISGYVKNGLED GIEIF+RMLVF V V
Subjt:  SYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGV

Query:  DLATMVNVLVACANMGTLLLGKALHSYSIK-VAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG
        DLATMVNVLVACANMGTL LGK LHSYSIK  A LDR+VMFNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMIAGYVREGLSDGAI+LF+EMKSRG
Subjt:  DLATMVNVLVACANMGTLLLGKALHSYSIK-VAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG

Query:  VVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRE
        V+PDVYAV SILHACAINGNLNSGK +H+YI+ENNLETNSFVSNALMDMYAKCGSMKDA  +FSH+KRKDVISWNTM+GGYSKNR PNEAL+LF EMQRE
Subjt:  VVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRE

Query:  SKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIA
        SKPDGTTVACILPACASLAALD+GREIHGYALRNG S DK+V NALVDMYVKCGLLVLARSLFDMI +KDLVSWTVMIAGYGMHG+GSEAVN F+QMRIA
Subjt:  SKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIA

Query:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
        GIEPDEVSFISILYACSHSGLLDEGW FF IMKKECQIEPKLEHYACMVDLLARTGNL +A KFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
Subjt:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI

Query:  FELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADER
        FELEPENTGYYVL+ANIYAEAEKWEEVQKLR +IG+ GLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIE LL +LRSKMKEEGYSPKTR+ALLNADER
Subjt:  FELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADER

Query:  EKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGY
        EKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMA+FMSK T REIVLRDSSRFHHFKDGYCSCRGY
Subjt:  EKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGY

SwissProt top hitse value%identityAlignment
Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic6.1e-17139.02Show/hide
Query:  DLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAK-LVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGI
        D   + ++L+  A+ + +  G+++H+ +   G GVD +  A  LV +Y KCG+      +FD++SER    WN +IS       +  ++  F+ ML+  +
Subjt:  DLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAK-LVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGI

Query:  NPNSYTFSSVLKCFAAVSRVE---EGRQVHGLIWKLG-FNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRM
         P+S+T  SV+   + +   E    G+QVH    + G  NS+I  +N+L++ Y    K+ S++ L      RD+++WN+++S   +N      +E    M
Subjt:  NPNSYTFSSVLKCFAAVSRVE---EGRQVHGLIWKLG-FNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRM

Query:  LVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFD
        ++ GV  D  T+ +VL AC+++  L  GK LH+Y++K   LD      + L+DMY  C  + S  RVF+ M ++ +  W +MIAGY +      A+ LF 
Subjt:  LVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFD

Query:  EM-KSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNL
         M +S G++ +   +  ++ AC  +G  +  + +H ++ +  L+ + FV N LMDMY++ G +  A  IF  ++ +D+++WNTM+ GY  +    +AL L
Subjt:  EM-KSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNL

Query:  FVEMQR------------ESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGY
          +MQ               KP+  T+  ILP+CA+L+AL +G+EIH YA++N  + D  V +ALVDMY KCG L ++R +FD IP K++++W V+I  Y
Subjt:  FVEMQR------------ESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGY

Query:  GMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIK-PDATIWGAL
        GMHG G EA++    M + G++P+EV+FIS+  ACSHSG++DEG + F +MK +  +EP  +HYAC+VDLL R G + +A + +  MP     A  W +L
Subjt:  GMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIK-PDATIWGAL

Query:  LCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSK
        L   RIH+++++ E  A+ + +LEP    +YVLLANIY+ A  W++  ++R+ + ++G++K PGCSWIE    V+ FVAGD S PQ++K+   L+ L  +
Subjt:  LCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSK

Query:  MKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        M++EGY P T   L N +E EKE+ LCGHSEKLA+AFG+LN  PG  IRV KNLRVC DCH   KF+SK   REI+LRD  RFH FK+G CSC  YW
Subjt:  MKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

Q9LFL5 Pentatricopeptide repeat-containing protein At5g168605.9e-17440.98Show/hide
Query:  KSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKL--SERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPNSYTFSSVLKCF
        K+I   + +H  + S G+ +   L + L+  Y+  G L     +  +   S+  V+ WN +I  Y  +G   + + LF  M  L   P++YTF  V K  
Subjt:  KSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKL--SERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPNSYTFSSVLKCF

Query:  AAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRML-VFGVGVDLATMVNVLV
          +S V  G   H L    GF S + V N+L++ Y   R +  A+K+FDE+   DV+SWNS+I  Y K G     +E+F RM   FG   D  T+VNVL 
Subjt:  AAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRML-VFGVGVDLATMVNVLV

Query:  ACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMK--------------
         CA++GT  LGK LH +++   ++ + +   N L+DMY+KCG ++ A  VF  M  K VVSW +M+AGY + G  + A++LF++M+              
Subjt:  ACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMK--------------

Query:  ---------------------SRGVVPDVYAVTSILHACAINGNLNSGKIVHSY-------IRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHI--K
                             S G+ P+   + S+L  CA  G L  GK +H Y       +R+N     + V N L+DMYAKC  +  A ++F  +  K
Subjt:  ---------------------SRGVVPDVYAVTSILHACAINGNLNSGKIVHSY-------IRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHI--K

Query:  RKDVISWNTMVGGYSKNRFPNEALNLFVEMQRE---SKPDGTTVACILPACASLAALDRGREIHGYALRNGCSE-DKYVVNALVDMYVKCGLLVLARSLF
         +DV++W  M+GGYS++   N+AL L  EM  E   ++P+  T++C L ACASLAAL  G++IH YALRN  +    +V N L+DMY KCG +  AR +F
Subjt:  RKDVISWNTMVGGYSKNRFPNEALNLFVEMQRE---SKPDGTTVACILPACASLAALDRGREIHGYALRNGCSE-DKYVVNALVDMYVKCGLLVLARSLF

Query:  DMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARK
        D +  K+ V+WT ++ GYGMHG+G EA+  FD+MR  G + D V+ + +LYACSHSG++D+G ++FN MK    + P  EHYAC+VDLL R G L  A +
Subjt:  DMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARK

Query:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGRVNIFVAGDC
         IE MP++P   +W A L  CRIH  V+L E  AE+I EL   + G Y LL+N+YA A +W++V ++R  +  +G+KK PGCSW+E IKG    FV GD 
Subjt:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGRVNIFVAGDC

Query:  SKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSR
        + P AK+I Q+L     ++K+ GY P+T  AL + D+ EK+  L  HSEKLA+A+G+L  P G  IR+TKNLRVCGDCH    +MS+    +I+LRDSSR
Subjt:  SKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSR

Query:  FHHFKDGYCSCRGYW
        FHHFK+G CSC+GYW
Subjt:  FHHFKDGYCSCRGYW

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic6.3e-17639.85Show/hide
Query:  CEVGDLKNAMELLCSSQNSN--LDLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISE
        C  G L+ AM+LL S Q     +D + + ++++LC  +++  +G +V+SI  S+   +   LG   + M+V+ G L +   +F K+SER +F WN+++  
Subjt:  CEVGDLKNAMELLCSSQNSN--LDLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISE

Query:  YSGSGNYGESINLFKEMLEL-GINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMI
        Y+  G + E++ L+  ML + G+ P+ YTF  VL+    +  +  G++VH  + + G+   I VVN+LI+ Y     V+SA+ LFD +  RD+ISWN+MI
Subjt:  YSGSGNYGESINLFKEMLEL-GINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMI

Query:  SGYVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTS
        SGY +NG+   G+E+F  M    V  DL T+ +V+ AC  +G   LG+ +H+Y I       ++   N+L  MY   G    A ++F +M+ K +VSWT+
Subjt:  SGYVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTS

Query:  MIAGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWN
        MI+GY    L D AI  +  M    V PD   V ++L ACA  G+L++G  +H    +  L +   V+N L++MY+KC  +  A  IF +I RK+VISW 
Subjt:  MIAGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWN

Query:  TMVGGYSKNRFPNEALNLFVEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWT
        +++ G   N    EAL    +M+   +P+  T+   L ACA + AL  G+EIH + LR G   D ++ NAL+DMYV+CG +  A S F+    KD+ SW 
Subjt:  TMVGGYSKNRFPNEALNLFVEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWT

Query:  VMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDAT
        +++ GY   G GS  V  FD+M  + + PDE++FIS+L  CS S ++ +G  +F+ M ++  + P L+HYAC+VDLL R G L +A KFI+ MP+ PD  
Subjt:  VMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDAT

Query:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLK
        +WGALL  CRIHH + L E  A+ IFEL+ ++ GYY+LL N+YA+  KW EV K+R+ + + GL  + GCSW+E+KG+V+ F++ D   PQ K+I  +L+
Subjt:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLK

Query:  KLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSC
            KM E G +  +  + ++  E  ++   CGHSE+ A+AFG++N  PG  I VTKNL +C +CH+  KF+SKT  REI +RD+  FHHFKDG CSC
Subjt:  KLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSC

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.1e-31062.84Show/hide
Query:  FCEVGDLKNAMELLCSSQNSNLDLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEY
        FCE G+L+NA++LLC S   ++D  T CS+LQLCA+ KS++DG+ V + I  NG  +D  LG+KL  MY  CG+LKE   +FD++   K   WN++++E 
Subjt:  FCEVGDLKNAMELLCSSQNSNLDLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEY

Query:  SGSGNYGESINLFKEMLELGINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISG
        + SG++  SI LFK+M+  G+  +SYTFS V K F+++  V  G Q+HG I K GF    +V NSL++FY   ++V SA+K+FDE+ +RDVISWNS+I+G
Subjt:  SGSGNYGESINLFKEMLELGINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISG

Query:  YVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMI
        YV NGL + G+ +F++MLV G+ +DLAT+V+V   CA+   + LG+A+HS  +K A   RE  F NTLLDMYSKCGDL+SA  VF +M +++VVS+TSMI
Subjt:  YVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMI

Query:  AGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTM
        AGY REGL+  A+KLF+EM+  G+ PDVY VT++L+ CA    L+ GK VH +I+EN+L  + FVSNALMDMYAKCGSM++A  +FS ++ KD+ISWNT+
Subjt:  AGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTM

Query:  VGGYSKNRFPNEALNLFVEMQRESK--PDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWT
        +GGYSKN + NEAL+LF  +  E +  PD  TVAC+LPACASL+A D+GREIHGY +RNG   D++V N+LVDMY KCG L+LA  LFD I  KDLVSWT
Subjt:  VGGYSKNRFPNEALNLFVEMQRESK--PDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWT

Query:  VMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDAT
        VMIAGYGMHGFG EA+  F+QMR AGIE DE+SF+S+LYACSHSGL+DEGW+FFNIM+ EC+IEP +EHYAC+VD+LARTG+L+KA +FIE MPI PDAT
Subjt:  VMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDAT

Query:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLK
        IWGALLCGCRIHHDVKLAEKVAE++FELEPENTGYYVL+ANIYAEAEKWE+V++LRK+IGQRGL+KNPGCSWIEIKGRVNIFVAGD S P+ + IE  L+
Subjt:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLK

Query:  KLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRG
        K+R++M EEGYSP T++AL++A+E EKE ALCGHSEKLAMA G+++   GK IRVTKNLRVCGDCHEMAKFMSK T REIVLRDS+RFH FKDG+CSCRG
Subjt:  KLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRG

Query:  YW
        +W
Subjt:  YW

Q9SS60 Pentatricopeptide repeat-containing protein At3g035802.5e-17241.19Show/hide
Query:  FCEVGDLKNAMELLCSSQNSNL--DLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMIS
        F + G    A+E     + S +  D  T+ S+++ CA       G  V+  I   G   D  +G  LV MY + G L   R +FD++  R +  WN +IS
Subjt:  FCEVGDLKNAMELLCSSQNSNL--DLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMIS

Query:  EYSGSGNYGESINLFKEMLELGINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMI
         YS  G Y E++ ++ E+    I P+S+T SSVL  F  +  V++G+ +HG   K G NS + V N L++ Y   R+   A+++FDE+  RD +S+N+MI
Subjt:  EYSGSGNYGESINLFKEMLELGINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMI

Query:  SGYVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTS
         GY+K  + +  + +F+  L      DL T+ +VL AC ++  L L K +++Y +K   +  E    N L+D+Y+KCGD+ +A  VF  M+ K  VSW S
Subjt:  SGYVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTS

Query:  MIAGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWN
        +I+GY++ G    A+KLF  M       D      ++       +L  GK +HS   ++ +  +  VSNAL+DMYAKCG + D+  IFS +   D ++WN
Subjt:  MIAGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWN

Query:  TMVGGYSKNRFPNEALNLFVEMQ---RESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLV
        T++      RF + A  L V  Q    E  PD  T    LP CASLAA   G+EIH   LR G   +  + NAL++MY KCG L  +  +F+ +  +D+V
Subjt:  TMVGGYSKNRFPNEALNLFVEMQ---RESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLV

Query:  SWTVMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKP
        +WT MI  YGM+G G +A+ TF  M  +GI PD V FI+I+YACSHSGL+DEG   F  MK   +I+P +EHYAC+VDLL+R+  + KA +FI+ MPIKP
Subjt:  SWTVMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKP

Query:  DATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQ
        DA+IW ++L  CR   D++ AE+V+ RI EL P++ GY +L +N YA   KW++V  +RK +  + + KNPG SWIE+   V++F +GD S PQ++ I +
Subjt:  DATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQ

Query:  LLKKLRSKMKEEGYSPKTRHALLN-ADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYC
         L+ L S M +EGY P  R    N  +E EK   +CGHSE+LA+AFG+LN  PG  ++V KNLRVCGDCHE+ K +SK   REI++RD++RFH FKDG C
Subjt:  LLKKLRSKMKEEGYSPKTRHALLN-ADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYC

Query:  SCRGYW
        SC+  W
Subjt:  SCRGYW

Arabidopsis top hitse value%identityAlignment
AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.5e-17739.85Show/hide
Query:  CEVGDLKNAMELLCSSQNSN--LDLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISE
        C  G L+ AM+LL S Q     +D + + ++++LC  +++  +G +V+SI  S+   +   LG   + M+V+ G L +   +F K+SER +F WN+++  
Subjt:  CEVGDLKNAMELLCSSQNSN--LDLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISE

Query:  YSGSGNYGESINLFKEMLEL-GINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMI
        Y+  G + E++ L+  ML + G+ P+ YTF  VL+    +  +  G++VH  + + G+   I VVN+LI+ Y     V+SA+ LFD +  RD+ISWN+MI
Subjt:  YSGSGNYGESINLFKEMLEL-GINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMI

Query:  SGYVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTS
        SGY +NG+   G+E+F  M    V  DL T+ +V+ AC  +G   LG+ +H+Y I       ++   N+L  MY   G    A ++F +M+ K +VSWT+
Subjt:  SGYVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTS

Query:  MIAGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWN
        MI+GY    L D AI  +  M    V PD   V ++L ACA  G+L++G  +H    +  L +   V+N L++MY+KC  +  A  IF +I RK+VISW 
Subjt:  MIAGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWN

Query:  TMVGGYSKNRFPNEALNLFVEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWT
        +++ G   N    EAL    +M+   +P+  T+   L ACA + AL  G+EIH + LR G   D ++ NAL+DMYV+CG +  A S F+    KD+ SW 
Subjt:  TMVGGYSKNRFPNEALNLFVEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWT

Query:  VMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDAT
        +++ GY   G GS  V  FD+M  + + PDE++FIS+L  CS S ++ +G  +F+ M ++  + P L+HYAC+VDLL R G L +A KFI+ MP+ PD  
Subjt:  VMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDAT

Query:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLK
        +WGALL  CRIHH + L E  A+ IFEL+ ++ GYY+LL N+YA+  KW EV K+R+ + + GL  + GCSW+E+KG+V+ F++ D   PQ K+I  +L+
Subjt:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLK

Query:  KLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSC
            KM E G +  +  + ++  E  ++   CGHSE+ A+AFG++N  PG  I VTKNL +C +CH+  KF+SKT  REI +RD+  FHHFKDG CSC
Subjt:  KLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSC

AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-17341.19Show/hide
Query:  FCEVGDLKNAMELLCSSQNSNL--DLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMIS
        F + G    A+E     + S +  D  T+ S+++ CA       G  V+  I   G   D  +G  LV MY + G L   R +FD++  R +  WN +IS
Subjt:  FCEVGDLKNAMELLCSSQNSNL--DLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMIS

Query:  EYSGSGNYGESINLFKEMLELGINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMI
         YS  G Y E++ ++ E+    I P+S+T SSVL  F  +  V++G+ +HG   K G NS + V N L++ Y   R+   A+++FDE+  RD +S+N+MI
Subjt:  EYSGSGNYGESINLFKEMLELGINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMI

Query:  SGYVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTS
         GY+K  + +  + +F+  L      DL T+ +VL AC ++  L L K +++Y +K   +  E    N L+D+Y+KCGD+ +A  VF  M+ K  VSW S
Subjt:  SGYVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTS

Query:  MIAGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWN
        +I+GY++ G    A+KLF  M       D      ++       +L  GK +HS   ++ +  +  VSNAL+DMYAKCG + D+  IFS +   D ++WN
Subjt:  MIAGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWN

Query:  TMVGGYSKNRFPNEALNLFVEMQ---RESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLV
        T++      RF + A  L V  Q    E  PD  T    LP CASLAA   G+EIH   LR G   +  + NAL++MY KCG L  +  +F+ +  +D+V
Subjt:  TMVGGYSKNRFPNEALNLFVEMQ---RESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLV

Query:  SWTVMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKP
        +WT MI  YGM+G G +A+ TF  M  +GI PD V FI+I+YACSHSGL+DEG   F  MK   +I+P +EHYAC+VDLL+R+  + KA +FI+ MPIKP
Subjt:  SWTVMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKP

Query:  DATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQ
        DA+IW ++L  CR   D++ AE+V+ RI EL P++ GY +L +N YA   KW++V  +RK +  + + KNPG SWIE+   V++F +GD S PQ++ I +
Subjt:  DATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQ

Query:  LLKKLRSKMKEEGYSPKTRHALLN-ADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYC
         L+ L S M +EGY P  R    N  +E EK   +CGHSE+LA+AFG+LN  PG  ++V KNLRVCGDCHE+ K +SK   REI++RD++RFH FKDG C
Subjt:  LLKKLRSKMKEEGYSPKTRHALLN-ADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYC

Query:  SCRGYW
        SC+  W
Subjt:  SCRGYW

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.3e-17239.02Show/hide
Query:  DLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAK-LVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGI
        D   + ++L+  A+ + +  G+++H+ +   G GVD +  A  LV +Y KCG+      +FD++SER    WN +IS       +  ++  F+ ML+  +
Subjt:  DLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAK-LVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGI

Query:  NPNSYTFSSVLKCFAAVSRVE---EGRQVHGLIWKLG-FNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRM
         P+S+T  SV+   + +   E    G+QVH    + G  NS+I  +N+L++ Y    K+ S++ L      RD+++WN+++S   +N      +E    M
Subjt:  NPNSYTFSSVLKCFAAVSRVE---EGRQVHGLIWKLG-FNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRM

Query:  LVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFD
        ++ GV  D  T+ +VL AC+++  L  GK LH+Y++K   LD      + L+DMY  C  + S  RVF+ M ++ +  W +MIAGY +      A+ LF 
Subjt:  LVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFD

Query:  EM-KSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNL
         M +S G++ +   +  ++ AC  +G  +  + +H ++ +  L+ + FV N LMDMY++ G +  A  IF  ++ +D+++WNTM+ GY  +    +AL L
Subjt:  EM-KSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNL

Query:  FVEMQR------------ESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGY
          +MQ               KP+  T+  ILP+CA+L+AL +G+EIH YA++N  + D  V +ALVDMY KCG L ++R +FD IP K++++W V+I  Y
Subjt:  FVEMQR------------ESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWTVMIAGY

Query:  GMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIK-PDATIWGAL
        GMHG G EA++    M + G++P+EV+FIS+  ACSHSG++DEG + F +MK +  +EP  +HYAC+VDLL R G + +A + +  MP     A  W +L
Subjt:  GMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIK-PDATIWGAL

Query:  LCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSK
        L   RIH+++++ E  A+ + +LEP    +YVLLANIY+ A  W++  ++R+ + ++G++K PGCSWIE    V+ FVAGD S PQ++K+   L+ L  +
Subjt:  LCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLKKLRSK

Query:  MKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW
        M++EGY P T   L N +E EKE+ LCGHSEKLA+AFG+LN  PG  IRV KNLRVC DCH   KF+SK   REI+LRD  RFH FK+G CSC  YW
Subjt:  MKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein8.1e-31262.84Show/hide
Query:  FCEVGDLKNAMELLCSSQNSNLDLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEY
        FCE G+L+NA++LLC S   ++D  T CS+LQLCA+ KS++DG+ V + I  NG  +D  LG+KL  MY  CG+LKE   +FD++   K   WN++++E 
Subjt:  FCEVGDLKNAMELLCSSQNSNLDLETYCSILQLCAEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEY

Query:  SGSGNYGESINLFKEMLELGINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISG
        + SG++  SI LFK+M+  G+  +SYTFS V K F+++  V  G Q+HG I K GF    +V NSL++FY   ++V SA+K+FDE+ +RDVISWNS+I+G
Subjt:  SGSGNYGESINLFKEMLELGINPNSYTFSSVLKCFAAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISG

Query:  YVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMI
        YV NGL + G+ +F++MLV G+ +DLAT+V+V   CA+   + LG+A+HS  +K A   RE  F NTLLDMYSKCGDL+SA  VF +M +++VVS+TSMI
Subjt:  YVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMI

Query:  AGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTM
        AGY REGL+  A+KLF+EM+  G+ PDVY VT++L+ CA    L+ GK VH +I+EN+L  + FVSNALMDMYAKCGSM++A  +FS ++ KD+ISWNT+
Subjt:  AGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTM

Query:  VGGYSKNRFPNEALNLFVEMQRESK--PDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWT
        +GGYSKN + NEAL+LF  +  E +  PD  TVAC+LPACASL+A D+GREIHGY +RNG   D++V N+LVDMY KCG L+LA  LFD I  KDLVSWT
Subjt:  VGGYSKNRFPNEALNLFVEMQRESK--PDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYVKCGLLVLARSLFDMIPDKDLVSWT

Query:  VMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDAT
        VMIAGYGMHGFG EA+  F+QMR AGIE DE+SF+S+LYACSHSGL+DEGW+FFNIM+ EC+IEP +EHYAC+VD+LARTG+L+KA +FIE MPI PDAT
Subjt:  VMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARKFIETMPIKPDAT

Query:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLK
        IWGALLCGCRIHHDVKLAEKVAE++FELEPENTGYYVL+ANIYAEAEKWE+V++LRK+IGQRGL+KNPGCSWIEIKGRVNIFVAGD S P+ + IE  L+
Subjt:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEQLLK

Query:  KLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRG
        K+R++M EEGYSP T++AL++A+E EKE ALCGHSEKLAMA G+++   GK IRVTKNLRVCGDCHEMAKFMSK T REIVLRDS+RFH FKDG+CSCRG
Subjt:  KLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRG

Query:  YW
        +W
Subjt:  YW

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-17540.98Show/hide
Query:  KSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKL--SERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPNSYTFSSVLKCF
        K+I   + +H  + S G+ +   L + L+  Y+  G L     +  +   S+  V+ WN +I  Y  +G   + + LF  M  L   P++YTF  V K  
Subjt:  KSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKL--SERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPNSYTFSSVLKCF

Query:  AAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRML-VFGVGVDLATMVNVLV
          +S V  G   H L    GF S + V N+L++ Y   R +  A+K+FDE+   DV+SWNS+I  Y K G     +E+F RM   FG   D  T+VNVL 
Subjt:  AAVSRVEEGRQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRML-VFGVGVDLATMVNVLV

Query:  ACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMK--------------
         CA++GT  LGK LH +++   ++ + +   N L+DMY+KCG ++ A  VF  M  K VVSW +M+AGY + G  + A++LF++M+              
Subjt:  ACANMGTLLLGKALHSYSIKVAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMK--------------

Query:  ---------------------SRGVVPDVYAVTSILHACAINGNLNSGKIVHSY-------IRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHI--K
                             S G+ P+   + S+L  CA  G L  GK +H Y       +R+N     + V N L+DMYAKC  +  A ++F  +  K
Subjt:  ---------------------SRGVVPDVYAVTSILHACAINGNLNSGKIVHSY-------IRENNLETNSFVSNALMDMYAKCGSMKDAHSIFSHI--K

Query:  RKDVISWNTMVGGYSKNRFPNEALNLFVEMQRE---SKPDGTTVACILPACASLAALDRGREIHGYALRNGCSE-DKYVVNALVDMYVKCGLLVLARSLF
         +DV++W  M+GGYS++   N+AL L  EM  E   ++P+  T++C L ACASLAAL  G++IH YALRN  +    +V N L+DMY KCG +  AR +F
Subjt:  RKDVISWNTMVGGYSKNRFPNEALNLFVEMQRE---SKPDGTTVACILPACASLAALDRGREIHGYALRNGCSE-DKYVVNALVDMYVKCGLLVLARSLF

Query:  DMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARK
        D +  K+ V+WT ++ GYGMHG+G EA+  FD+MR  G + D V+ + +LYACSHSG++D+G ++FN MK    + P  EHYAC+VDLL R G L  A +
Subjt:  DMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKARK

Query:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGRVNIFVAGDC
         IE MP++P   +W A L  CRIH  V+L E  AE+I EL   + G Y LL+N+YA A +W++V ++R  +  +G+KK PGCSW+E IKG    FV GD 
Subjt:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGRVNIFVAGDC

Query:  SKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSR
        + P AK+I Q+L     ++K+ GY P+T  AL + D+ EK+  L  HSEKLA+A+G+L  P G  IR+TKNLRVCGDCH    +MS+    +I+LRDSSR
Subjt:  SKPQAKKIEQLLKKLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSR

Query:  FHHFKDGYCSCRGYW
        FHHFK+G CSC+GYW
Subjt:  FHHFKDGYCSCRGYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACTGATGGCAAAAGCCCCTCCGACGTTCTGGGTATCTCCGGCGAGGCACGATCATCATGGCTTAATGAACTTGGAATTCCGGCATTCTTTCGTCTTTGTCAAACC
CAATTCGAAATTTTCCTTTTCCACTTCGGCTTATGCTTGTACTGAGATCTACACCCCAGCATTGGAAACGAAAAGCTATCTCGATGTTGAACAGAATAACTCCGGCAAAA
TTGTCGAGTTTTGTGAAGTGGGTGATCTAAAAAATGCCATGGAGCTTCTTTGCAGCTCCCAAAATTCCAACCTTGACTTGGAAACTTATTGCTCCATCTTGCAGCTATGT
GCTGAACAGAAATCCATACGAGATGGAAGAAGGGTTCATTCTATAATTGAGTCTAATGGGGTTGGGGTCGATGGAATCTTGGGGGCGAAACTAGTTTTTATGTATGTGAA
GTGTGGAGAATTAAAAGAAGGGAGGATGATATTTGATAAACTTTCTGAAAGGAAGGTTTTCCTCTGGAACCTTATGATCAGTGAGTATTCAGGAAGTGGTAACTATGGAG
AGAGTATAAATTTGTTCAAGGAAATGCTGGAGTTGGGGATAAACCCTAATTCTTATACATTTTCTAGTGTTTTAAAATGTTTTGCAGCAGTTTCACGTGTAGAAGAAGGC
AGGCAGGTTCATGGGTTGATCTGGAAGTTGGGTTTTAATTCATATATCACTGTCGTTAATTCGCTAATTTCTTTCTACTTCATTGGTAGAAAGGTTAGAAGTGCACAGAA
GCTGTTCGATGAATTGCATGACCGAGATGTCATTTCGTGGAACTCTATGATCAGTGGCTATGTTAAGAATGGTTTAGAAGACACAGGAATTGAGATTTTCATAAGGATGT
TAGTTTTTGGTGTTGGCGTTGATTTGGCTACAATGGTCAATGTGCTTGTGGCTTGTGCAAATATGGGCACTCTTTTGTTGGGTAAGGCACTTCATTCTTATTCGATAAAG
GTGGCTGTTCTTGACAGAGAAGTTATGTTCAATAATACTTTACTGGACATGTACTCAAAATGTGGGGATTTGAACAGTGCTATTCGAGTTTTTGAGAAAATGGATGAGAA
AACTGTTGTATCTTGGACTTCGATGATCGCAGGCTATGTCCGTGAAGGTCTATCCGATGGTGCGATCAAGTTGTTTGATGAAATGAAAAGCAGAGGTGTTGTCCCTGATG
TATATGCTGTTACAAGCATTCTTCATGCTTGTGCTATTAATGGCAACCTGAATAGTGGGAAGATTGTACACAGTTACATCAGGGAAAACAACTTGGAAACGAACTCGTTT
GTTAGTAATGCTCTTATGGACATGTATGCTAAATGTGGCAGTATGAAGGATGCTCACAGCATTTTTTCTCACATAAAAAGAAAGGATGTTATATCGTGGAATACCATGGT
CGGAGGCTACTCGAAGAACCGTTTTCCAAATGAAGCTCTTAACTTGTTTGTGGAGATGCAAAGAGAATCAAAGCCTGATGGAACAACAGTGGCTTGCATCCTTCCAGCCT
GTGCAAGCCTTGCAGCTTTGGATAGAGGTAGAGAAATCCACGGATATGCATTAAGAAATGGATGCTCTGAAGACAAATATGTCGTTAATGCACTTGTCGATATGTATGTG
AAGTGTGGGCTATTAGTTCTCGCCCGGTCACTCTTTGATATGATTCCCGATAAGGACCTTGTCTCATGGACAGTGATGATAGCTGGATATGGGATGCATGGCTTTGGTAG
TGAAGCAGTCAATACCTTCGATCAGATGAGAATAGCCGGGATTGAGCCCGATGAAGTATCCTTCATTTCAATTCTTTATGCCTGTAGCCATTCTGGATTGCTTGATGAAG
GATGGAAATTTTTCAATATTATGAAGAAAGAATGTCAGATTGAACCCAAGTTGGAGCACTATGCCTGTATGGTGGATCTTCTTGCCCGAACTGGGAATCTGGTGAAGGCA
CGTAAATTCATAGAAACAATGCCAATCAAACCAGATGCAACAATTTGGGGTGCATTGTTGTGCGGATGCAGGATACACCATGACGTCAAACTAGCAGAGAAAGTTGCAGA
ACGGATCTTTGAGCTAGAACCAGAAAACACAGGATATTATGTACTTTTGGCAAACATCTATGCAGAGGCAGAAAAATGGGAAGAAGTTCAAAAGCTGAGGAAGAAAATTG
GACAACGCGGTTTGAAGAAGAATCCAGGCTGCAGTTGGATAGAGATCAAGGGTAGAGTCAATATCTTTGTAGCTGGAGATTGCTCCAAACCCCAAGCCAAGAAGATAGAG
CAACTTCTTAAAAAACTAAGAAGCAAGATGAAGGAAGAAGGTTACTCTCCAAAAACAAGGCATGCCTTGTTAAATGCAGATGAAAGGGAGAAGGAAGTAGCCCTCTGTGG
GCACAGTGAGAAGTTAGCCATGGCTTTTGGTATGCTGAATCTCCCACCCGGCAAGACTATACGGGTGACTAAAAATCTCCGAGTTTGCGGCGACTGTCATGAAATGGCTA
AGTTCATGTCGAAGACCACCACGAGAGAAATTGTGTTGAGGGATTCCAGTCGTTTTCATCATTTCAAAGATGGATATTGTTCTTGTAGAGGTTACTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATCCAAACTGCATCTGATCCACAAGCTGTATTGTATCTATATATTATATACGTCGGTTCTTCTTTATTGCCATTCACCGCTGTCCACAAGCTTCTGAATCTCGTGCTCCC
ACGTGGCCATCTCACTATTTGTTCTTCCTTTAGTTTGTCCCTATTATTGCTTTCTCTTTTTCTTTAATATCAGAAATTCGTTGCCATTTTTGCTGTTTCTTACTGGGGCT
CTTCGGCAAAAAGCCATGTTACTGATGGCAAAAGCCCCTCCGACGTTCTGGGTATCTCCGGCGAGGCACGATCATCATGGCTTAATGAACTTGGAATTCCGGCATTCTTT
CGTCTTTGTCAAACCCAATTCGAAATTTTCCTTTTCCACTTCGGCTTATGCTTGTACTGAGATCTACACCCCAGCATTGGAAACGAAAAGCTATCTCGATGTTGAACAGA
ATAACTCCGGCAAAATTGTCGAGTTTTGTGAAGTGGGTGATCTAAAAAATGCCATGGAGCTTCTTTGCAGCTCCCAAAATTCCAACCTTGACTTGGAAACTTATTGCTCC
ATCTTGCAGCTATGTGCTGAACAGAAATCCATACGAGATGGAAGAAGGGTTCATTCTATAATTGAGTCTAATGGGGTTGGGGTCGATGGAATCTTGGGGGCGAAACTAGT
TTTTATGTATGTGAAGTGTGGAGAATTAAAAGAAGGGAGGATGATATTTGATAAACTTTCTGAAAGGAAGGTTTTCCTCTGGAACCTTATGATCAGTGAGTATTCAGGAA
GTGGTAACTATGGAGAGAGTATAAATTTGTTCAAGGAAATGCTGGAGTTGGGGATAAACCCTAATTCTTATACATTTTCTAGTGTTTTAAAATGTTTTGCAGCAGTTTCA
CGTGTAGAAGAAGGCAGGCAGGTTCATGGGTTGATCTGGAAGTTGGGTTTTAATTCATATATCACTGTCGTTAATTCGCTAATTTCTTTCTACTTCATTGGTAGAAAGGT
TAGAAGTGCACAGAAGCTGTTCGATGAATTGCATGACCGAGATGTCATTTCGTGGAACTCTATGATCAGTGGCTATGTTAAGAATGGTTTAGAAGACACAGGAATTGAGA
TTTTCATAAGGATGTTAGTTTTTGGTGTTGGCGTTGATTTGGCTACAATGGTCAATGTGCTTGTGGCTTGTGCAAATATGGGCACTCTTTTGTTGGGTAAGGCACTTCAT
TCTTATTCGATAAAGGTGGCTGTTCTTGACAGAGAAGTTATGTTCAATAATACTTTACTGGACATGTACTCAAAATGTGGGGATTTGAACAGTGCTATTCGAGTTTTTGA
GAAAATGGATGAGAAAACTGTTGTATCTTGGACTTCGATGATCGCAGGCTATGTCCGTGAAGGTCTATCCGATGGTGCGATCAAGTTGTTTGATGAAATGAAAAGCAGAG
GTGTTGTCCCTGATGTATATGCTGTTACAAGCATTCTTCATGCTTGTGCTATTAATGGCAACCTGAATAGTGGGAAGATTGTACACAGTTACATCAGGGAAAACAACTTG
GAAACGAACTCGTTTGTTAGTAATGCTCTTATGGACATGTATGCTAAATGTGGCAGTATGAAGGATGCTCACAGCATTTTTTCTCACATAAAAAGAAAGGATGTTATATC
GTGGAATACCATGGTCGGAGGCTACTCGAAGAACCGTTTTCCAAATGAAGCTCTTAACTTGTTTGTGGAGATGCAAAGAGAATCAAAGCCTGATGGAACAACAGTGGCTT
GCATCCTTCCAGCCTGTGCAAGCCTTGCAGCTTTGGATAGAGGTAGAGAAATCCACGGATATGCATTAAGAAATGGATGCTCTGAAGACAAATATGTCGTTAATGCACTT
GTCGATATGTATGTGAAGTGTGGGCTATTAGTTCTCGCCCGGTCACTCTTTGATATGATTCCCGATAAGGACCTTGTCTCATGGACAGTGATGATAGCTGGATATGGGAT
GCATGGCTTTGGTAGTGAAGCAGTCAATACCTTCGATCAGATGAGAATAGCCGGGATTGAGCCCGATGAAGTATCCTTCATTTCAATTCTTTATGCCTGTAGCCATTCTG
GATTGCTTGATGAAGGATGGAAATTTTTCAATATTATGAAGAAAGAATGTCAGATTGAACCCAAGTTGGAGCACTATGCCTGTATGGTGGATCTTCTTGCCCGAACTGGG
AATCTGGTGAAGGCACGTAAATTCATAGAAACAATGCCAATCAAACCAGATGCAACAATTTGGGGTGCATTGTTGTGCGGATGCAGGATACACCATGACGTCAAACTAGC
AGAGAAAGTTGCAGAACGGATCTTTGAGCTAGAACCAGAAAACACAGGATATTATGTACTTTTGGCAAACATCTATGCAGAGGCAGAAAAATGGGAAGAAGTTCAAAAGC
TGAGGAAGAAAATTGGACAACGCGGTTTGAAGAAGAATCCAGGCTGCAGTTGGATAGAGATCAAGGGTAGAGTCAATATCTTTGTAGCTGGAGATTGCTCCAAACCCCAA
GCCAAGAAGATAGAGCAACTTCTTAAAAAACTAAGAAGCAAGATGAAGGAAGAAGGTTACTCTCCAAAAACAAGGCATGCCTTGTTAAATGCAGATGAAAGGGAGAAGGA
AGTAGCCCTCTGTGGGCACAGTGAGAAGTTAGCCATGGCTTTTGGTATGCTGAATCTCCCACCCGGCAAGACTATACGGGTGACTAAAAATCTCCGAGTTTGCGGCGACT
GTCATGAAATGGCTAAGTTCATGTCGAAGACCACCACGAGAGAAATTGTGTTGAGGGATTCCAGTCGTTTTCATCATTTCAAAGATGGATATTGTTCTTGTAGAGGTTAC
TGGTGAATGAAACTGTTAGTAGTTCCATTCCATAAAGGACCCTTCTTGTTTCACCCAATTGGTATTTGCTCTTTTCCCTATTCAAGATATTACACGTTTGCAAATAATTT
GAAATCAAAAAGCTATGCCCACATAGCAGAAAGTCACCCCTCTCATCATCACCAGCCTGATTCTTGCTCTCTGAGGATTACTTATGAATAACAACAGGAAAAACCCAGTT
TGAGAGAAAGAGAGAGGGAGGAGCAAAGTTGAGACA
Protein sequenceShow/hide protein sequence
MLLMAKAPPTFWVSPARHDHHGLMNLEFRHSFVFVKPNSKFSFSTSAYACTEIYTPALETKSYLDVEQNNSGKIVEFCEVGDLKNAMELLCSSQNSNLDLETYCSILQLC
AEQKSIRDGRRVHSIIESNGVGVDGILGAKLVFMYVKCGELKEGRMIFDKLSERKVFLWNLMISEYSGSGNYGESINLFKEMLELGINPNSYTFSSVLKCFAAVSRVEEG
RQVHGLIWKLGFNSYITVVNSLISFYFIGRKVRSAQKLFDELHDRDVISWNSMISGYVKNGLEDTGIEIFIRMLVFGVGVDLATMVNVLVACANMGTLLLGKALHSYSIK
VAVLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGNLNSGKIVHSYIRENNLETNSF
VSNALMDMYAKCGSMKDAHSIFSHIKRKDVISWNTMVGGYSKNRFPNEALNLFVEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGCSEDKYVVNALVDMYV
KCGLLVLARSLFDMIPDKDLVSWTVMIAGYGMHGFGSEAVNTFDQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPKLEHYACMVDLLARTGNLVKA
RKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIE
QLLKKLRSKMKEEGYSPKTRHALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKTTTREIVLRDSSRFHHFKDGYCSCRGYW