; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027061 (gene) of Chayote v1 genome

Gene IDSed0027061
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein DOT4
Genome locationLG08:35899019..35901742
RNA-Seq ExpressionSed0027061
SyntenySed0027061
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019566.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0086.32Show/hide
Query:  LLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTP-ASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL
        LL+AKAP TFW+S AG D+ GL+NLKFR S AFVKP  + SFSN A+A TE YTP A E K + DVEL++S +IV FCE+G LKNA+ELLCS QNS LDL
Subjt:  LLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTP-ASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL

Query:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD
        +TYC +LQLCAE+KSIRDGRRVHSIIESN V+IDGILGAKL+FMYVKCGDL++GRMIFDKLS KKVFLWN M+SEYSGSGNYGESINLFK M ELGI P+
Subjt:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD

Query:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV
        SYTFS VLKCFAAVARVEEG QVHGLICKLGFTS+NAV+NSLISFYFVGRKVRSA+KLFDE+ DRDVISWNSMISGYVKNGLED+GIEIF++ML+F VDV
Subjt:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV

Query:  DLATMVNALVACANMGTLLLGKALHSYSIK-AAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG
        DLATMVN LVACANMGTL LGK LHSYSIK AA +DR+V FNNTLLDMYSKCGDLNSAI+VFE+MDEKTVVSWTSMIAGYVREGLSDGAI+LF+EMKSRG
Subjt:  DLATMVNALVACANMGTLLLGKALHSYSIK-AAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG

Query:  VAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE
        V PDVYAV SILHACA NGNLNSGK +H+YI+EN+LETNS VSNALMDMYAKC SM+DA  VFSHMKRKDVISWNTMIGGYSKN LPNEAL+LF EMQRE
Subjt:  VAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE

Query:  SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIA
        SKPDGTTVA ILPACA+LAALD+GREIHGYALRNGY K+K+V NALVDMYVKCGLLVLARSLFDMIL+KDLVSWTVMIAGYGMHG+GSEAV+ FN+MRIA
Subjt:  SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIA

Query:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
        GIEPDEVSFISILYACSHSGLLDEGW FFNIMKKECQIEP LEHYACMVDLL+RTGNL +A KFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
Subjt:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI

Query:  FELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER
        FELEP+NTGYYVLLANIYAEAEKWEEVQKLR +IGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIELLL +LR+KMKEEGYSPKTRYALLNADER
Subjt:  FELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER

Query:  EKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        EKEVALC HSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE A+FMSK T REIVLRDSSRFHHFKDGYCSCRGYW
Subjt:  EKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

XP_011657608.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus]0.0e+0085.84Show/hide
Query:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL
        MLL AKAP TFW+SPAG+D+ G +NLKFR SF FV P  K+SFS+LAYA      PA ETK++ DVELD S +IV+FCE+G LKNAMELLCS QNS  DL
Subjt:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL

Query:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD
          YCS+LQLCAE+KSIRDGRRV SIIES+GVMIDGILG KLVFMYVKCGDLK+GRM+FDKLS  K+FLWN M+SEYSGSGNYGESINLFK+M ELGI+P+
Subjt:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD

Query:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV
        SYTFS +LKCFAAVARVEEGRQVHGLICKLGF S+N V+NSLISFYFVGRKVR AQKLFDEL DRDVISWNSMISGYVKNGL+D+GIEIF+KML+FGVD+
Subjt:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV

Query:  DLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV
        DLATMVN  VACAN+GTLLLGK LHSYSIKAA +DREV+FNNTLLDMYSKCGDLNSAI+VFE+MDEKTVVSWTSMI GYVREGLSDGAIKLFDEMKSRGV
Subjt:  DLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV

Query:  APDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRES
         PDVYAVTSIL+ACAINGNL SGKIVH YI+EN+LETNS VSNAL DMYAKC SMKDAH VFSHMK+KDVISWNTMIGGY+KN LPNEAL LF EMQRES
Subjt:  APDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRES

Query:  KPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAG
        KPDGTTVA ILPACA+LAALD+GREIHGYALRNGY ++KYV NA+VDMYVKCGLLVLARSLFDMI +KDLVSWTVMIAGYGMHG+GSEA+NTFN+MR+ G
Subjt:  KPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAG

Query:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
        IEPDEVSFISILYACSHSGLLDEGWK FNIMKKECQIEP LEHYACMVDLL+RTGNL KA KFI+ MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
Subjt:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF

Query:  ELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADERE
        ELEP+NTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKG++NIFVAGDCSKPQAKKIELLLK+LRSKMKEEGYSPKT YALLNADERE
Subjt:  ELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADERE

Query:  KEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        KEVALC HSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE AKFMSK+  REI+LRDSSRFHHFKDG CSCRGYW
Subjt:  KEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

XP_022139839.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Momordica charantia]0.0e+0087Show/hide
Query:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL
        MLL+AK+P TFW+SP G D  GL+NLKF HSF F KP  K SFSN AYACT+IY   S+TK++ D+ELD+SA+IV+FCE+G LKNAMELLCS  N+ LDL
Subjt:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL

Query:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD
        ETYCSVLQLCAE+KSIR G+RVHSIIESNGV++DGILGAKLVFMYVKCGDLK+ RMIFDKLS +KVFLWN M+SEY+G+GNY ES+NLFK M ELGI+P+
Subjt:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD

Query:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV
        SYTFS VLKC AAVARVE+GR VHG ICKLGF+S+N V+NSLISFYFV +KVRSAQKLFDEL DRDVISWNSMISGYVKNGLEDKGIEIF+KML F VDV
Subjt:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV

Query:  DLATMVNALVACANMGTLLLGKALHSYSIKAA-VIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG
        DLATMVN LVACAN GTLLLGKALHSY+IKAA  +DREV F NTLLDMYSKCGDLNSAI+VFEKMDEKTVVSWTSMIAGYVREGLSDGAI+LFDEMKSRG
Subjt:  DLATMVNALVACANMGTLLLGKALHSYSIKAA-VIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG

Query:  VAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE
        V PDVYAVTSILHACAINGNL+SGKIVH+YI++N+LETNS VSNALMDMYAKC SMKDA SVFSHMK KDVISWNTMIGGYSKNHLPNEALNLF EMQRE
Subjt:  VAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE

Query:  SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIA
        SKPDGTTVA ILPACA+LAALDRGREIHGYALRNGY K+KYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFG+EAV+ FN+MRI+
Subjt:  SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIA

Query:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
        G+EPDEVSFISILYACSHSGLLDEGWKFFNIMKKEC+IEP+LEHYACMVDLL+RTGNL KA KFI+TMPI+PDATIWGALLCGCRIHHDVK+AEKVAERI
Subjt:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI

Query:  FELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER
        FELEP+NTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER
Subjt:  FELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER

Query:  EKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        EKEVALC HSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE AKFMSK T REIVLRDS+RFHHFKDGYCSCRGYW
Subjt:  EKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

XP_023519235.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0086.45Show/hide
Query:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPAS-ETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLD
        M L+AKAP TFW+  AG D+ GL+NLKFR S AFVKP  + SFSN AYA TE YTPA+ E K + DVEL++S +IV FCE+G LKNA+ELLCS QNS LD
Subjt:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPAS-ETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLD

Query:  LETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEP
        L+TYC +LQLCAE+KSIRDGRRVHSIIESN V+IDGILGAKLVFMYVKCGDL++GRMIFDKLS KKVFLWN ++SEYSGSGNYGESINLFK M ELGI P
Subjt:  LETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEP

Query:  DSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVD
        +SYTFS VLKCFAAVARVEEGRQVHGLICKLGFTS+NAV+NSLISFYFVGRKVR A+KLFDE+ DRDVISWNSMISGYVKNGLED+GIEIF++ML+F VD
Subjt:  DSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVD

Query:  VDLATMVNALVACANMGTLLLGKALHSYSIK-AAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSR
        VDLATMVN LVACANMGTL LGK LHSYSIK AAV+DR+V FNNTLLDMYSKCGDLNSAI+VFE+MDEKTVVSWTSMIAGYVREGLSDGAI+LF+EMKSR
Subjt:  VDLATMVNALVACANMGTLLLGKALHSYSIK-AAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSR

Query:  GVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQR
        GV PDVYAV SILHACAINGNLNSGK +H+YI+EN+LETNS VSNALMDMYAKC SMKDA  VFSHMKRKDVISWNTMIGGYSKN LPNEAL+LF EMQR
Subjt:  GVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQR

Query:  ESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRI
        ESKPDGTTVA ILPACA+LAALD+GREIHGYALRNGY K+K+V NALVDMYVKCGLLVLAR LFDMIL+KDLVSWTVMIAGYGMHG+GSEAVN FN+MR 
Subjt:  ESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRI

Query:  AGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAER
        AGIEPDEVSFISILYACSHSGLLDEGW FFNIMKKECQIEP LEHYACMVDLL+RTGNL +A KFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAER
Subjt:  AGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAER

Query:  IFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADE
        IFELEP+NTGYYVLLANIYAEAEKWEEVQKLR +IG+RGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIELLL +LRSKMKEEGYSPKTRYALLNADE
Subjt:  IFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADE

Query:  REKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        REKEVALC HSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE A+FMSK T REIVLRDSSRFHHFKDGYCSCRGYW
Subjt:  REKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

XP_038893908.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Benincasa hispida]0.0e+0087.36Show/hide
Query:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH---DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKL
        MLL+AK P+TFW+SP G D+ GLL+LKFR SF FVKP  K SFSN A+ACTE YTPA ETK     DVELD+S +IV+FCEMG LKNAMELLC  QNS  
Subjt:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH---DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKL

Query:  DLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIE
        DL+TYCS+LQLCAE+KSIRDGRRVHSIIESNGVMIDGILG KLVFMYVKCGDLK+GR+IFDKLS  KVFLWN M+SEYSG+GNYGESINLFK+M ELGI+
Subjt:  DLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIE

Query:  PDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGV
        P+SYTFS VLKC AAVARVEEGRQVHGLICKLGF S+N V+NSLISFYFV RKVR AQKLFDEL DRDVISWNSMISGYVKNGLEDKGIEIF+KML F +
Subjt:  PDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGV

Query:  DVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSR
        D DLATMVN LVACANMGTLLLGKALHSY+IKAA +++EV FNNTLLDMYSKCG LNSAI+VFE+MDEKTVVSWTSMI GYVREGLSDGAIKLFDEMKS+
Subjt:  DVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSR

Query:  GVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQR
        G+ PDVYAVTSILHACAINGNLNSGKIVH+YI+EN LETNS VSNALMDMYAK  SMKDAH VFSHMKRKDVISWNTMIGGYSKN LPNEALNLF EMQR
Subjt:  GVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQR

Query:  ESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRI
        E KPD TTVA ILPACA+LAALDRGREIHGYALRNGY K+KYVVNALVDMYVKCGLLVLARSLFDMI +KDLVSWTVMIAGYGMHGFGSEA+NTFN+MRI
Subjt:  ESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRI

Query:  AGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAER
        AGIEPDEVSFISILYACSHSGLLDEGWKF+NIMKKECQIEP LEHYACMVDLL+RTGNL KA KFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE+
Subjt:  AGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAER

Query:  IFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADE
        IFELEP+NTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIELLLK+LRSKMKEEGYSPKTRYALLNADE
Subjt:  IFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADE

Query:  REKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        REKEVALC HSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE AKFMSK+  REI+LRDSSRFH+FKDG CSCRGYW
Subjt:  REKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

TrEMBL top hitse value%identityAlignment
A0A0A0LV30 DYW_deaminase domain-containing protein0.0e+0085.84Show/hide
Query:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL
        MLL AKAP TFW+SPAG+D+ G +NLKFR SF FV P  K+SFS+LAYA      PA ETK++ DVELD S +IV+FCE+G LKNAMELLCS QNS  DL
Subjt:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL

Query:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD
          YCS+LQLCAE+KSIRDGRRV SIIES+GVMIDGILG KLVFMYVKCGDLK+GRM+FDKLS  K+FLWN M+SEYSGSGNYGESINLFK+M ELGI+P+
Subjt:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD

Query:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV
        SYTFS +LKCFAAVARVEEGRQVHGLICKLGF S+N V+NSLISFYFVGRKVR AQKLFDEL DRDVISWNSMISGYVKNGL+D+GIEIF+KML+FGVD+
Subjt:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV

Query:  DLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV
        DLATMVN  VACAN+GTLLLGK LHSYSIKAA +DREV+FNNTLLDMYSKCGDLNSAI+VFE+MDEKTVVSWTSMI GYVREGLSDGAIKLFDEMKSRGV
Subjt:  DLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV

Query:  APDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRES
         PDVYAVTSIL+ACAINGNL SGKIVH YI+EN+LETNS VSNAL DMYAKC SMKDAH VFSHMK+KDVISWNTMIGGY+KN LPNEAL LF EMQRES
Subjt:  APDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRES

Query:  KPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAG
        KPDGTTVA ILPACA+LAALD+GREIHGYALRNGY ++KYV NA+VDMYVKCGLLVLARSLFDMI +KDLVSWTVMIAGYGMHG+GSEA+NTFN+MR+ G
Subjt:  KPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAG

Query:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
        IEPDEVSFISILYACSHSGLLDEGWK FNIMKKECQIEP LEHYACMVDLL+RTGNL KA KFI+ MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
Subjt:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF

Query:  ELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADERE
        ELEP+NTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKG++NIFVAGDCSKPQAKKIELLLK+LRSKMKEEGYSPKT YALLNADERE
Subjt:  ELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADERE

Query:  KEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        KEVALC HSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE AKFMSK+  REI+LRDSSRFHHFKDG CSCRGYW
Subjt:  KEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

A0A5A7UPC9 Pentatricopeptide repeat-containing protein DOT40.0e+0085.39Show/hide
Query:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL
        MLL AKAP TFW+SPAG D+ G +NLKFR SF F KP  K+SFS+LAYA      PA ETK++ DVELD S +IV+FCE+G LKNAMELLCS QNS  DL
Subjt:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL

Query:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD
        + +CS+LQLCAE+KSIRDGRRVHSIIES+GV+IDGILG KLVFMYVKCGDLK+GRMIFDKLS  KVF+WN M+SEY G+GNYGESINLFK+M ELGI+P+
Subjt:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD

Query:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV
        SYTFS VLKCFAAVA VEEGRQVHGLI KLG+ S+N V+NSLISFYFVGRKVR AQKLFDEL DRDVISWNSMISGYVKNGL+D+GIEIF+KML+FGV++
Subjt:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV

Query:  DLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV
        DLATMVN LVACAN GTLL GK LHSYSIKAA +DREV+FNNTLLDMYSKCGDLNSAI+VFE+MDEKTVVSWTSMI GYVREGLSDGAIKLFDEMKSRGV
Subjt:  DLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGV

Query:  APDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRES
         PDVYAVTSILHACAINGNL SG+IVH YI+EN+LETNS VSNAL DMYAKC SMKDAH VFSHMK+KDVISWNTMIGGYSKN LPNEAL LF EMQ ES
Subjt:  APDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRES

Query:  KPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAG
        KPDGTTVA ILPACA+LAALDRGREIHGYALRNGY ++KYVVNAL+DMYVKCGLLVLARS FDMIL+KDLVSWTVMIAGYGMHGFGSEA+NTFN+MR+ G
Subjt:  KPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAG

Query:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
        I+PDEVSFISILYACSHSGLLDEGWK F+IMKKECQIEP LEHYACMVDLL+RTGNL KA KFI+TMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF
Subjt:  IEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIF

Query:  ELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADERE
        ELEP+NTGYYVLLANIYAEAEKWEEVQKLRK+IGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIELLLK+LRSKMKEEGYSPKT YALLNADERE
Subjt:  ELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADERE

Query:  KEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        KEVALC HSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE AKFMSK+  REI+LRDSSRFHHFKDG CSCRG+W
Subjt:  KEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

A0A6J1CE34 pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0087Show/hide
Query:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL
        MLL+AK+P TFW+SP G D  GL+NLKF HSF F KP  K SFSN AYACT+IY   S+TK++ D+ELD+SA+IV+FCE+G LKNAMELLCS  N+ LDL
Subjt:  MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL

Query:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD
        ETYCSVLQLCAE+KSIR G+RVHSIIESNGV++DGILGAKLVFMYVKCGDLK+ RMIFDKLS +KVFLWN M+SEY+G+GNY ES+NLFK M ELGI+P+
Subjt:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD

Query:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV
        SYTFS VLKC AAVARVE+GR VHG ICKLGF+S+N V+NSLISFYFV +KVRSAQKLFDEL DRDVISWNSMISGYVKNGLEDKGIEIF+KML F VDV
Subjt:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV

Query:  DLATMVNALVACANMGTLLLGKALHSYSIKAA-VIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG
        DLATMVN LVACAN GTLLLGKALHSY+IKAA  +DREV F NTLLDMYSKCGDLNSAI+VFEKMDEKTVVSWTSMIAGYVREGLSDGAI+LFDEMKSRG
Subjt:  DLATMVNALVACANMGTLLLGKALHSYSIKAA-VIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG

Query:  VAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE
        V PDVYAVTSILHACAINGNL+SGKIVH+YI++N+LETNS VSNALMDMYAKC SMKDA SVFSHMK KDVISWNTMIGGYSKNHLPNEALNLF EMQRE
Subjt:  VAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE

Query:  SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIA
        SKPDGTTVA ILPACA+LAALDRGREIHGYALRNGY K+KYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFG+EAV+ FN+MRI+
Subjt:  SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIA

Query:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
        G+EPDEVSFISILYACSHSGLLDEGWKFFNIMKKEC+IEP+LEHYACMVDLL+RTGNL KA KFI+TMPI+PDATIWGALLCGCRIHHDVK+AEKVAERI
Subjt:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI

Query:  FELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER
        FELEP+NTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER
Subjt:  FELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER

Query:  EKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        EKEVALC HSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE AKFMSK T REIVLRDS+RFHHFKDGYCSCRGYW
Subjt:  EKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

A0A6J1EHU9 pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0086.32Show/hide
Query:  LLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTP-ASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL
        LL+AKAP TFW+S AG D+ GL+NLKFR S AFVKP  + SFSN A+A TE YTP A E K + DVEL++S +IV FCE+G LKNA+ELLCS QNS LDL
Subjt:  LLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTP-ASETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL

Query:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD
        +TYC +LQLCAE+KSIRDGRRVHSIIESN V+IDGILGAKLVFMYVKCGDL++GRMIFDKLS KKVFLWN M+SEYSGSGNYGESINLFK M ELGI P+
Subjt:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD

Query:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV
        SYTFS VLKCFAAVARVEEG QVHGLICKLGFTS+NAV+NSLISFYFVGRKVRSA+KLFDE+ DRDVISWNSMISGYVKNGLED+GIEIF++ML+F VDV
Subjt:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV

Query:  DLATMVNALVACANMGTLLLGKALHSYSIK-AAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG
        DLATMVN LVACANMGTL LGK LHSYSIK AA +DR+V FNNTLLDMYSKCGDLNSAI+VFE+MDEKTVVSWTS+IAGYVREGLSDGAI+LF+EMKSRG
Subjt:  DLATMVNALVACANMGTLLLGKALHSYSIK-AAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG

Query:  VAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE
        V PDVYAV SILHACA NGNLNSGK +H+YI+EN+LETNS VSNALMDMYAKC SM+DA  VFSHMKRKDVISWNTMIGGYSKN LPNEAL+LF EMQRE
Subjt:  VAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE

Query:  SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIA
        SKPDGTTVA ILPACA+LAALD+GREIHGYALRNGY K+K+V NALVDMYVKCGLLVLARSLFDMIL+KDLVSWTVMIAGYGMHG+GSEAV+ FN+MRIA
Subjt:  SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIA

Query:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
        GIEPDEVSFISILYACSHSGLLDEGW FFNIMKKECQIEP LEHYACMVDLL+RTGNL +A KFI+TMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
Subjt:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI

Query:  FELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER
        FELEP+NTGYYVLLANIYAEAEKWEEVQKLR +IGQRGLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIELLL +LRSKMKEEGYSPKTRYALLNADER
Subjt:  FELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER

Query:  EKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        EKEVALC HSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE A+FMSK T REIVLRDSSRFHHFKDGYCSCRGYW
Subjt:  EKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

A0A6J1KNK7 pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0086.3Show/hide
Query:  LLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPAS-ETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL
        LL+AKAP TFW+S AG D+ GL+NLKFR S AFVKP  + SFSN AYA TE YTPA+ E K + D EL++S +IV FCE+G LKNA+ELLCS QNS LDL
Subjt:  LLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPAS-ETKTH-DVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDL

Query:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD
        +TYC +LQLCAE+KSIRDGRRVHSIIESN V+IDGILGAKLVFMYVKCGDL++GRMIFDKLS KKVFLWN M+SEYSGSGNYGESINLFK M ELGI P+
Subjt:  ETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPD

Query:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV
        SYTFS VLKCFAAV RVEEGRQVHGLICKLGFTS+NAV+NSLISFYFVGRKVRSA+KLFDE+ DRDVISWNSMISGYVKNGLED+GIEIF++ML+F VDV
Subjt:  SYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDV

Query:  DLATMVNALVACANMGTLLLGKALHSYSIK-AAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG
        DLATMVN LVACANMGTL LGK LHSYSIK AA +DR+V FNNTLLDMYSKCGDLNSAI+VFE+MDEKTVVSWTSMIAGYVREGLSDGAI+LF+EMKSRG
Subjt:  DLATMVNALVACANMGTLLLGKALHSYSIK-AAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRG

Query:  VAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE
        V PDVYAV SILHACAINGNLNSGK +H+YIKEN+LETNS VSNALMDMYAKC SMKDA  VFSH+KRKDVISWNTMIGGYSKN LPNEAL+LF EMQRE
Subjt:  VAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE

Query:  SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIA
        SKPDGTTVA ILPACA+LAALD+GREIHGYALRNGY ++K+V NALVDMYVKCGLLVLARSLFDMIL+KDLVSWTVMIAGYGMHG+GSEAVN FN+MRIA
Subjt:  SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIA

Query:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
        GIEPDEVSFISILYACSHSGLLDEGW FF IMKKECQIEP+LEHYACMVDLL+RTGNL +A KFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI
Subjt:  GIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERI

Query:  FELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER
        FELEP+NTGYYVL+ANIYAEAEKWEEVQKLR +IG+ GLKKNPGCSWIEIKG+VNIFVAGDCSKPQAKKIELLL +LRSKMKEEGYSPKTRYALLNADER
Subjt:  FELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADER

Query:  EKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGY
        EKEVALC HSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE A+FMSK T REIVLRDSSRFHHFKDGYCSCRGY
Subjt:  EKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGY

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.0e-17440.16Show/hide
Query:  VLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFS
        +L+ C+   S+++ R++  ++  NG+  +     KLV ++ + G + +   +F+ +  K   L++ M+  ++   +  +++  F  M+   +EP  Y F+
Subjt:  VLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFS

Query:  CVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATM
         +LK     A +  G+++HGL+ K GF+     +  L + Y   R+V  A+K+FD + +RD++SWN++++GY +NG+    +E+   M    +     T+
Subjt:  CVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATM

Query:  VNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVY
        V+ L A + +  + +GK +H Y++++   D  V  +  L+DMY+KCG L +A Q+F+ M E+ VVSW SMI  YV+      A+ +F +M   GV P   
Subjt:  VNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVY

Query:  AVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQ-RESKPDG
        +V   LHACA  G+L  G+ +H    E  L+ N SV N+L+ MY KC  +  A S+F  ++ + ++SWN MI G+++N  P +ALN F++M+ R  KPD 
Subjt:  AVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQ-RESKPDG

Query:  TTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPD
         T  S++ A A L+     + IHG  +R+   K  +V  ALVDMY KCG +++AR +FDM+ ++ + +W  MI GYG HGFG  A+  F EM+   I+P+
Subjt:  TTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPD

Query:  EVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEP
         V+F+S++ ACSHSGL++ G K F +MK+   IE  ++HY  MVDLL R G L +A  FI  MP+KP   ++GA+L  C+IH +V  AEK AER+FEL P
Subjt:  EVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEP

Query:  DNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVA
        D+ GY+VLLANIY  A  WE+V ++R  + ++GL+K PGCS +EIK  V+ F +G  + P +KKI   L+KL   +KE GY P T   +L  +   KE  
Subjt:  DNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVA

Query:  LCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        L  HSEKLA++FG+LN   G TI V KNLRVC DCH   K++S  TGREIV+RD  RFHHFK+G CSC  YW
Subjt:  LCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

Q9LFL5 Pentatricopeptide repeat-containing protein At5g168607.2e-17240.49Show/hide
Query:  KSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKL--SVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCF
        K+I   + +H  + S G++    L + L+  Y+  G L     +  +   S   V+ WN ++  Y  +G   + + LF  M  L   PD+YTF  V K  
Subjt:  KSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKL--SVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCF

Query:  AAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKML-IFGVDVDLATMVNALV
          ++ V  G   H L    GF S+  V N+L++ Y   R +  A+K+FDE+   DV+SWNS+I  Y K G     +E+F +M   FG   D  T+VN L 
Subjt:  AAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKML-IFGVDVDLATMVNALV

Query:  ACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMK--------------
         CA++GT  LGK LH +++ + +I + +   N L+DMY+KCG ++ A  VF  M  K VVSW +M+AGY + G  + A++LF++M+              
Subjt:  ACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMK--------------

Query:  ---------------------SRGVAPDVYAVTSILHACAINGNLNSGKIVHSY-------IKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHM--K
                             S G+ P+   + S+L  CA  G L  GK +H Y       +++N     + V N L+DMYAKC  +  A ++F  +  K
Subjt:  ---------------------SRGVAPDVYAVTSILHACAINGNLNSGKIVHSY-------IKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHM--K

Query:  RKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE---SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYK-EKYVVNALVDMYVKCGLLVLARSLF
         +DV++W  MIGGYS++   N+AL L +EM  E   ++P+  T++  L ACA+LAAL  G++IH YALRN       +V N L+DMY KCG +  AR +F
Subjt:  RKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE---SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYK-EKYVVNALVDMYVKCGLLVLARSLF

Query:  DMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARK
        D ++ K+ V+WT ++ GYGMHG+G EA+  F+EMR  G + D V+ + +LYACSHSG++D+G ++FN MK    + P  EHYAC+VDLL R G L  A +
Subjt:  DMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARK

Query:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGRVNIFVAGDC
         IE MP++P   +W A L  CRIH  V+L E  AE+I EL  ++ G Y LL+N+YA A +W++V ++R  +  +G+KK PGCSW+E IKG    FV GD 
Subjt:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGRVNIFVAGDC

Query:  SKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSR
        + P AK+I  +L     ++K+ GY P+T +AL + D+ EK+  L EHSEKLA+A+G+L  P G  IR+TKNLRVCGDCH    +MS+    +I+LRDSSR
Subjt:  SKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSR

Query:  FHHFKDGYCSCRGYW
        FHHFK+G CSC+GYW
Subjt:  FHHFKDGYCSCRGYW

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic5.9e-17438.75Show/hide
Query:  ASETKTHDVELDDSAQIVDFCEMGYLKNAMELLCSYQNSK--LDLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGR
        +S + TH      ++Q+   C  G L+ AM+LL S Q  +  +D + + ++++LC  K++  +G +V+SI  S+   +   LG   + M+V+ G+L    
Subjt:  ASETKTHDVELDDSAQIVDFCEMGYLKNAMELLCSYQNSK--LDLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGR

Query:  MIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQEL-GIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRS
         +F K+S + +F WN ++  Y+  G + E++ L+  M  + G++PD YTF CVL+    +  +  G++VH  + + G+     V+N+LI+ Y     V+S
Subjt:  MIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQEL-GIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRS

Query:  AQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDL
        A+ LFD +  RD+ISWN+MISGY +NG+  +G+E+F  M    VD DL T+ + + AC  +G   LG+ +H+Y I       ++   N+L  MY   G  
Subjt:  AQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDL

Query:  NSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSS
          A ++F +M+ K +VSWT+MI+GY    L D AI  +  M    V PD   V ++L ACA  G+L++G  +H    +  L +   V+N L++MY+KC  
Subjt:  NSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSS

Query:  MKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGL
        +  A  +F ++ RK+VISW ++I G   N+   EAL    +M+   +P+  T+ + L ACA + AL  G+EIH + LR G   + ++ NAL+DMYV+CG 
Subjt:  MKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGL

Query:  LVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRT
        +  A S F+    KD+ SW +++ GY   G GS  V  F+ M  + + PDE++FIS+L  CS S ++ +G  +F+ M ++  + P L+HYAC+VDLL R 
Subjt:  LVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRT

Query:  GNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVN
        G L++A KFI+ MP+ PD  +WGALL  CRIHH + L E  A+ IFEL+  + GYY+LL N+YA+  KW EV K+R+ + + GL  + GCSW+E+KG+V+
Subjt:  GNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVN

Query:  IFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREI
         F++ D   PQ K+I  +L+    KM E G +  +  + ++  E  ++   C HSE+ A+AFG++N  PG  I VTKNL +C +CH+T KF+SKT  REI
Subjt:  IFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREI

Query:  VLRDSSRFHHFKDGYCSC
         +RD+  FHHFKDG CSC
Subjt:  VLRDSSRFHHFKDGYCSC

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0062.96Show/hide
Query:  DDSAQIVDFCEMGYLKNAMELLCSYQNSKLDLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFL
        D + Q+  FCE G L+NA++LLC      +D  T CSVLQLCA+ KS++DG+ V + I  NG +ID  LG+KL  MY  CGDLK+   +FD++ ++K   
Subjt:  DDSAQIVDFCEMGYLKNAMELLCSYQNSKLDLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFL

Query:  WNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVI
        WN +M+E + SG++  SI LFK+M   G+E DSYTFSCV K F+++  V  G Q+HG I K GF   N+V NSL++FY   ++V SA+K+FDE+ +RDVI
Subjt:  WNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVI

Query:  SWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKT
        SWNS+I+GYV NGL +KG+ +FV+ML+ G+++DLAT+V+    CA+   + LG+A+HS  +KA    RE +F NTLLDMYSKCGDL+SA  VF +M +++
Subjt:  SWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKT

Query:  VVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRK
        VVS+TSMIAGY REGL+  A+KLF+EM+  G++PDVY VT++L+ CA    L+ GK VH +IKENDL  +  VSNALMDMYAKC SM++A  VFS M+ K
Subjt:  VVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRK

Query:  DVISWNTMIGGYSKNHLPNEALNLFTEMQRESK--PDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMIL
        D+ISWNT+IGGYSKN   NEAL+LF  +  E +  PD  TVA +LPACA+L+A D+GREIHGY +RNGY+ +++V N+LVDMY KCG L+LA  LFD I 
Subjt:  DVISWNTMIGGYSKNHLPNEALNLFTEMQRESK--PDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMIL

Query:  DKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIET
         KDLVSWTVMIAGYGMHGFG EA+  FN+MR AGIE DE+SF+S+LYACSHSGL+DEGW+FFNIM+ EC+IEP +EHYAC+VD+L+RTG+L KA +FIE 
Subjt:  DKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIET

Query:  MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQA
        MPI PDATIWGALLCGCRIHHDVKLAEKVAE++FELEP+NTGYYVL+ANIYAEAEKWE+V++LRK+IGQRGL+KNPGCSWIEIKGRVNIFVAGD S P+ 
Subjt:  MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQA

Query:  KKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFK
        + IE  L+K+R++M EEGYSP T+YAL++A+E EKE ALC HSEKLAMA G+++   GK IRVTKNLRVCGDCHE AKFMSK T REIVLRDS+RFH FK
Subjt:  KKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFK

Query:  DGYCSCRGYW
        DG+CSCRG+W
Subjt:  DGYCSCRGYW

Q9SS60 Pentatricopeptide repeat-containing protein At3g035803.4e-17441.42Show/hide
Query:  FCEMGYLKNAMELLCSYQNSKL--DLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMS
        F + G    A+E     + SK+  D  T+ SV++ CA       G  V+  I   G   D  +G  LV MY + G L + R +FD++ V+ +  WN ++S
Subjt:  FCEMGYLKNAMELLCSYQNSKL--DLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMS

Query:  EYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMI
         YS  G Y E++ ++ E++   I PDS+T S VL  F  +  V++G+ +HG   K G  S   V N L++ Y   R+   A+++FDE+  RD +S+N+MI
Subjt:  EYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMI

Query:  SGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTS
         GY+K  + ++ + +F++ L      DL T+ + L AC ++  L L K +++Y +KA  +  E    N L+D+Y+KCGD+ +A  VF  M+ K  VSW S
Subjt:  SGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTS

Query:  MIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWN
        +I+GY++ G    A+KLF  M       D      ++       +L  GK +HS   ++ +  + SVSNAL+DMYAKC  + D+  +FS M   D ++WN
Subjt:  MIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWN

Query:  TMIGGYSKNHLPNEALNLFTEMQR-ESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSW
        T+I    +       L + T+M++ E  PD  T    LP CA+LAA   G+EIH   LR GY  E  + NAL++MY KCG L  +  +F+ +  +D+V+W
Subjt:  TMIGGYSKNHLPNEALNLFTEMQR-ESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSW

Query:  TVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDA
        T MI  YGM+G G +A+ TF +M  +GI PD V FI+I+YACSHSGL+DEG   F  MK   +I+P +EHYAC+VDLLSR+  + KA +FI+ MPIKPDA
Subjt:  TVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDA

Query:  TIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLL
        +IW ++L  CR   D++ AE+V+ RI EL PD+ GY +L +N YA   KW++V  +RK +  + + KNPG SWIE+   V++F +GD S PQ++ I   L
Subjt:  TIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLL

Query:  KKLRSKMKEEGYSPKTRYALLN-ADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSC
        + L S M +EGY P  R    N  +E EK   +C HSE+LA+AFG+LN  PG  ++V KNLRVCGDCHE  K +SK  GREI++RD++RFH FKDG CSC
Subjt:  KKLRSKMKEEGYSPKTRYALLN-ADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSC

Query:  RGYW
        +  W
Subjt:  RGYW

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-17540.16Show/hide
Query:  VLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFS
        +L+ C+   S+++ R++  ++  NG+  +     KLV ++ + G + +   +F+ +  K   L++ M+  ++   +  +++  F  M+   +EP  Y F+
Subjt:  VLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFS

Query:  CVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATM
         +LK     A +  G+++HGL+ K GF+     +  L + Y   R+V  A+K+FD + +RD++SWN++++GY +NG+    +E+   M    +     T+
Subjt:  CVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATM

Query:  VNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVY
        V+ L A + +  + +GK +H Y++++   D  V  +  L+DMY+KCG L +A Q+F+ M E+ VVSW SMI  YV+      A+ +F +M   GV P   
Subjt:  VNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVY

Query:  AVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQ-RESKPDG
        +V   LHACA  G+L  G+ +H    E  L+ N SV N+L+ MY KC  +  A S+F  ++ + ++SWN MI G+++N  P +ALN F++M+ R  KPD 
Subjt:  AVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQ-RESKPDG

Query:  TTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPD
         T  S++ A A L+     + IHG  +R+   K  +V  ALVDMY KCG +++AR +FDM+ ++ + +W  MI GYG HGFG  A+  F EM+   I+P+
Subjt:  TTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPD

Query:  EVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEP
         V+F+S++ ACSHSGL++ G K F +MK+   IE  ++HY  MVDLL R G L +A  FI  MP+KP   ++GA+L  C+IH +V  AEK AER+FEL P
Subjt:  EVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEP

Query:  DNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVA
        D+ GY+VLLANIY  A  WE+V ++R  + ++GL+K PGCS +EIK  V+ F +G  + P +KKI   L+KL   +KE GY P T   +L  +   KE  
Subjt:  DNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVA

Query:  LCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW
        L  HSEKLA++FG+LN   G TI V KNLRVC DCH   K++S  TGREIV+RD  RFHHFK+G CSC  YW
Subjt:  LCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-17538.75Show/hide
Query:  ASETKTHDVELDDSAQIVDFCEMGYLKNAMELLCSYQNSK--LDLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGR
        +S + TH      ++Q+   C  G L+ AM+LL S Q  +  +D + + ++++LC  K++  +G +V+SI  S+   +   LG   + M+V+ G+L    
Subjt:  ASETKTHDVELDDSAQIVDFCEMGYLKNAMELLCSYQNSK--LDLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGR

Query:  MIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQEL-GIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRS
         +F K+S + +F WN ++  Y+  G + E++ L+  M  + G++PD YTF CVL+    +  +  G++VH  + + G+     V+N+LI+ Y     V+S
Subjt:  MIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQEL-GIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRS

Query:  AQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDL
        A+ LFD +  RD+ISWN+MISGY +NG+  +G+E+F  M    VD DL T+ + + AC  +G   LG+ +H+Y I       ++   N+L  MY   G  
Subjt:  AQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDL

Query:  NSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSS
          A ++F +M+ K +VSWT+MI+GY    L D AI  +  M    V PD   V ++L ACA  G+L++G  +H    +  L +   V+N L++MY+KC  
Subjt:  NSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSS

Query:  MKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGL
        +  A  +F ++ RK+VISW ++I G   N+   EAL    +M+   +P+  T+ + L ACA + AL  G+EIH + LR G   + ++ NAL+DMYV+CG 
Subjt:  MKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGL

Query:  LVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRT
        +  A S F+    KD+ SW +++ GY   G GS  V  F+ M  + + PDE++FIS+L  CS S ++ +G  +F+ M ++  + P L+HYAC+VDLL R 
Subjt:  LVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRT

Query:  GNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVN
        G L++A KFI+ MP+ PD  +WGALL  CRIHH + L E  A+ IFEL+  + GYY+LL N+YA+  KW EV K+R+ + + GL  + GCSW+E+KG+V+
Subjt:  GNLEKARKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVN

Query:  IFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREI
         F++ D   PQ K+I  +L+    KM E G +  +  + ++  E  ++   C HSE+ A+AFG++N  PG  I VTKNL +C +CH+T KF+SKT  REI
Subjt:  IFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREI

Query:  VLRDSSRFHHFKDGYCSC
         +RD+  FHHFKDG CSC
Subjt:  VLRDSSRFHHFKDGYCSC

AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-17541.42Show/hide
Query:  FCEMGYLKNAMELLCSYQNSKL--DLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMS
        F + G    A+E     + SK+  D  T+ SV++ CA       G  V+  I   G   D  +G  LV MY + G L + R +FD++ V+ +  WN ++S
Subjt:  FCEMGYLKNAMELLCSYQNSKL--DLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMS

Query:  EYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMI
         YS  G Y E++ ++ E++   I PDS+T S VL  F  +  V++G+ +HG   K G  S   V N L++ Y   R+   A+++FDE+  RD +S+N+MI
Subjt:  EYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMI

Query:  SGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTS
         GY+K  + ++ + +F++ L      DL T+ + L AC ++  L L K +++Y +KA  +  E    N L+D+Y+KCGD+ +A  VF  M+ K  VSW S
Subjt:  SGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTS

Query:  MIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWN
        +I+GY++ G    A+KLF  M       D      ++       +L  GK +HS   ++ +  + SVSNAL+DMYAKC  + D+  +FS M   D ++WN
Subjt:  MIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRKDVISWN

Query:  TMIGGYSKNHLPNEALNLFTEMQR-ESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSW
        T+I    +       L + T+M++ E  PD  T    LP CA+LAA   G+EIH   LR GY  E  + NAL++MY KCG L  +  +F+ +  +D+V+W
Subjt:  TMIGGYSKNHLPNEALNLFTEMQR-ESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSW

Query:  TVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDA
        T MI  YGM+G G +A+ TF +M  +GI PD V FI+I+YACSHSGL+DEG   F  MK   +I+P +EHYAC+VDLLSR+  + KA +FI+ MPIKPDA
Subjt:  TVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIETMPIKPDA

Query:  TIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLL
        +IW ++L  CR   D++ AE+V+ RI EL PD+ GY +L +N YA   KW++V  +RK +  + + KNPG SWIE+   V++F +GD S PQ++ I   L
Subjt:  TIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIELLL

Query:  KKLRSKMKEEGYSPKTRYALLN-ADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSC
        + L S M +EGY P  R    N  +E EK   +C HSE+LA+AFG+LN  PG  ++V KNLRVCGDCHE  K +SK  GREI++RD++RFH FKDG CSC
Subjt:  KKLRSKMKEEGYSPKTRYALLN-ADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSC

Query:  RGYW
        +  W
Subjt:  RGYW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein0.0e+0062.96Show/hide
Query:  DDSAQIVDFCEMGYLKNAMELLCSYQNSKLDLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFL
        D + Q+  FCE G L+NA++LLC      +D  T CSVLQLCA+ KS++DG+ V + I  NG +ID  LG+KL  MY  CGDLK+   +FD++ ++K   
Subjt:  DDSAQIVDFCEMGYLKNAMELLCSYQNSKLDLETYCSVLQLCAEKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFL

Query:  WNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVI
        WN +M+E + SG++  SI LFK+M   G+E DSYTFSCV K F+++  V  G Q+HG I K GF   N+V NSL++FY   ++V SA+K+FDE+ +RDVI
Subjt:  WNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCFAAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVI

Query:  SWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKT
        SWNS+I+GYV NGL +KG+ +FV+ML+ G+++DLAT+V+    CA+   + LG+A+HS  +KA    RE +F NTLLDMYSKCGDL+SA  VF +M +++
Subjt:  SWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKT

Query:  VVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRK
        VVS+TSMIAGY REGL+  A+KLF+EM+  G++PDVY VT++L+ CA    L+ GK VH +IKENDL  +  VSNALMDMYAKC SM++A  VFS M+ K
Subjt:  VVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHMKRK

Query:  DVISWNTMIGGYSKNHLPNEALNLFTEMQRESK--PDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMIL
        D+ISWNT+IGGYSKN   NEAL+LF  +  E +  PD  TVA +LPACA+L+A D+GREIHGY +RNGY+ +++V N+LVDMY KCG L+LA  LFD I 
Subjt:  DVISWNTMIGGYSKNHLPNEALNLFTEMQRESK--PDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVKCGLLVLARSLFDMIL

Query:  DKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIET
         KDLVSWTVMIAGYGMHGFG EA+  FN+MR AGIE DE+SF+S+LYACSHSGL+DEGW+FFNIM+ EC+IEP +EHYAC+VD+L+RTG+L KA +FIE 
Subjt:  DKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARKFIET

Query:  MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQA
        MPI PDATIWGALLCGCRIHHDVKLAEKVAE++FELEP+NTGYYVL+ANIYAEAEKWE+V++LRK+IGQRGL+KNPGCSWIEIKGRVNIFVAGD S P+ 
Subjt:  MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQA

Query:  KKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFK
        + IE  L+K+R++M EEGYSP T+YAL++A+E EKE ALC HSEKLAMA G+++   GK IRVTKNLRVCGDCHE AKFMSK T REIVLRDS+RFH FK
Subjt:  KKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFK

Query:  DGYCSCRGYW
        DG+CSCRG+W
Subjt:  DGYCSCRGYW

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.1e-17340.49Show/hide
Query:  KSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKL--SVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCF
        K+I   + +H  + S G++    L + L+  Y+  G L     +  +   S   V+ WN ++  Y  +G   + + LF  M  L   PD+YTF  V K  
Subjt:  KSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKL--SVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCF

Query:  AAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKML-IFGVDVDLATMVNALV
          ++ V  G   H L    GF S+  V N+L++ Y   R +  A+K+FDE+   DV+SWNS+I  Y K G     +E+F +M   FG   D  T+VN L 
Subjt:  AAVARVEEGRQVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKML-IFGVDVDLATMVNALV

Query:  ACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMK--------------
         CA++GT  LGK LH +++ + +I + +   N L+DMY+KCG ++ A  VF  M  K VVSW +M+AGY + G  + A++LF++M+              
Subjt:  ACANMGTLLLGKALHSYSIKAAVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMK--------------

Query:  ---------------------SRGVAPDVYAVTSILHACAINGNLNSGKIVHSY-------IKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHM--K
                             S G+ P+   + S+L  CA  G L  GK +H Y       +++N     + V N L+DMYAKC  +  A ++F  +  K
Subjt:  ---------------------SRGVAPDVYAVTSILHACAINGNLNSGKIVHSY-------IKENDLETNSSVSNALMDMYAKCSSMKDAHSVFSHM--K

Query:  RKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE---SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYK-EKYVVNALVDMYVKCGLLVLARSLF
         +DV++W  MIGGYS++   N+AL L +EM  E   ++P+  T++  L ACA+LAAL  G++IH YALRN       +V N L+DMY KCG +  AR +F
Subjt:  RKDVISWNTMIGGYSKNHLPNEALNLFTEMQRE---SKPDGTTVASILPACATLAALDRGREIHGYALRNGYYK-EKYVVNALVDMYVKCGLLVLARSLF

Query:  DMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARK
        D ++ K+ V+WT ++ GYGMHG+G EA+  F+EMR  G + D V+ + +LYACSHSG++D+G ++FN MK    + P  EHYAC+VDLL R G L  A +
Subjt:  DMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKARK

Query:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGRVNIFVAGDC
         IE MP++P   +W A L  CRIH  V+L E  AE+I EL  ++ G Y LL+N+YA A +W++V ++R  +  +G+KK PGCSW+E IKG    FV GD 
Subjt:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGRVNIFVAGDC

Query:  SKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSR
        + P AK+I  +L     ++K+ GY P+T +AL + D+ EK+  L EHSEKLA+A+G+L  P G  IR+TKNLRVCGDCH    +MS+    +I+LRDSSR
Subjt:  SKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSR

Query:  FHHFKDGYCSCRGYW
        FHHFK+G CSC+GYW
Subjt:  FHHFKDGYCSCRGYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACTGATAGCGAAAGCCCCTTCGACGTTCTGGATATCGCCGGCGGGGTCCGATAATTGTGGCTTATTGAACTTGAAATTCCGGCATTCTTTTGCTTTTGTCAAACC
CATTTTGAAAATTTCCTTTTCGAATTTGGCTTATGCTTGTACAGAGATCTACACCCCAGCATCGGAAACGAAAACCCACGATGTTGAACTGGATGATTCTGCCCAAATTG
TCGACTTTTGTGAAATGGGTTATCTAAAAAATGCTATGGAGCTTCTTTGCAGCTACCAAAATTCCAAACTTGACTTGGAAACTTACTGCTCCGTGTTGCAGCTATGTGCT
GAAAAGAAGTCCATACGAGATGGAAGAAGGGTCCATTCTATAATTGAATCTAATGGAGTTATGATAGATGGAATCTTGGGTGCGAAACTGGTTTTTATGTATGTAAAATG
TGGGGATCTTAAACAAGGGAGGATGATTTTTGATAAACTATCTGTAAAGAAGGTTTTTCTCTGGAACTTTATGATGAGTGAGTATTCTGGAAGTGGCAATTATGGAGAGA
GTATAAATTTGTTCAAGGAAATGCAGGAGTTGGGGATTGAACCTGATTCTTATACATTTTCTTGCGTTTTGAAATGTTTTGCAGCAGTTGCACGTGTAGAAGAGGGTAGA
CAAGTTCATGGGTTGATCTGCAAATTGGGTTTTACTTCTCATAATGCAGTGATTAATTCGCTAATTTCTTTCTACTTTGTGGGTAGAAAGGTTAGAAGTGCACAGAAGCT
GTTCGATGAATTGTGTGACCGAGATGTTATATCATGGAACTCTATGATTAGTGGCTATGTTAAGAATGGTTTAGAAGATAAGGGAATTGAGATTTTTGTGAAGATGTTAA
TTTTTGGTGTTGATGTTGATTTGGCTACAATGGTCAATGCGCTTGTGGCTTGTGCAAATATGGGCACTCTTTTGTTGGGTAAGGCACTTCATTCTTATTCGATAAAGGCA
GCTGTTATTGACAGAGAAGTTAAGTTCAATAATACCTTACTGGACATGTACTCAAAATGTGGGGATTTGAACAGTGCCATTCAGGTTTTTGAGAAAATGGATGAGAAAAC
TGTTGTATCTTGGACTTCGATGATTGCAGGCTATGTTCGTGAAGGTCTATCCGATGGTGCGATCAAGTTGTTTGATGAAATGAAAAGCAGAGGTGTTGCCCCGGATGTTT
ATGCTGTTACAAGCATTCTTCATGCTTGTGCTATTAATGGCAATCTGAATAGTGGAAAGATTGTACACAGCTACATCAAGGAAAACGACTTGGAAACTAACTCGTCTGTT
AGTAATGCTCTTATGGATATGTATGCTAAATGTAGCAGTATGAAGGACGCTCATAGTGTGTTTTCTCACATGAAAAGGAAGGATGTTATATCGTGGAATACCATGATCGG
AGGCTACTCGAAGAATCATCTTCCAAATGAAGCTCTTAACTTGTTTACAGAGATGCAAAGAGAATCAAAGCCTGATGGCACAACAGTGGCCAGCATCCTTCCAGCCTGTG
CGACTCTTGCAGCCTTGGATAGAGGTAGAGAAATCCATGGATATGCATTAAGAAACGGATACTATAAAGAAAAATATGTTGTTAATGCACTTGTCGATATGTATGTGAAA
TGTGGGCTCTTGGTTCTTGCGCGGTCGCTCTTCGATATGATTCTCGATAAGGACCTTGTCTCATGGACAGTGATGATAGCTGGATATGGGATGCATGGCTTTGGTAGTGA
AGCTGTCAATACCTTTAATGAAATGAGAATCGCAGGGATTGAGCCAGATGAAGTATCCTTCATTTCAATTCTTTATGCCTGTAGCCATTCTGGATTGCTTGATGAAGGAT
GGAAATTTTTCAATATTATGAAGAAAGAATGTCAGATTGAACCCGAGTTGGAGCACTATGCTTGCATGGTGGATCTTCTTTCCCGAACCGGGAATCTGGAAAAGGCACGC
AAATTCATTGAAACGATGCCAATCAAACCAGATGCAACAATTTGGGGTGCATTGTTGTGTGGATGCAGGATACACCATGATGTCAAACTAGCAGAGAAAGTTGCAGAAAG
AATCTTCGAGCTAGAACCAGATAACACAGGCTATTATGTACTTTTGGCAAACATCTATGCAGAGGCAGAAAAATGGGAAGAAGTTCAAAAGTTAAGGAAGAAAATCGGAC
AACGCGGTTTGAAGAAGAATCCAGGCTGCAGTTGGATAGAGATCAAGGGCAGGGTCAATATTTTTGTTGCTGGTGATTGCTCCAAACCCCAAGCCAAGAAGATAGAGCTA
CTTCTTAAAAAACTAAGAAGCAAGATGAAGGAAGAAGGTTACTCTCCGAAAACAAGGTATGCTTTGTTAAATGCAGATGAAAGGGAGAAGGAAGTAGCCCTCTGTGAGCA
CAGTGAGAAGCTAGCCATGGCTTTTGGTATGCTGAATCTCCCACCCGGCAAGACTATACGGGTGACTAAAAACCTCCGAGTTTGCGGCGACTGTCATGAGACGGCCAAGT
TCATGTCGAAGACTACTGGACGAGAAATCGTTTTGAGAGATTCGAGTCGTTTTCATCATTTCAAAGATGGATATTGTTCTTGTAGAGGTTACTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTACTGATAGCGAAAGCCCCTTCGACGTTCTGGATATCGCCGGCGGGGTCCGATAATTGTGGCTTATTGAACTTGAAATTCCGGCATTCTTTTGCTTTTGTCAAACC
CATTTTGAAAATTTCCTTTTCGAATTTGGCTTATGCTTGTACAGAGATCTACACCCCAGCATCGGAAACGAAAACCCACGATGTTGAACTGGATGATTCTGCCCAAATTG
TCGACTTTTGTGAAATGGGTTATCTAAAAAATGCTATGGAGCTTCTTTGCAGCTACCAAAATTCCAAACTTGACTTGGAAACTTACTGCTCCGTGTTGCAGCTATGTGCT
GAAAAGAAGTCCATACGAGATGGAAGAAGGGTCCATTCTATAATTGAATCTAATGGAGTTATGATAGATGGAATCTTGGGTGCGAAACTGGTTTTTATGTATGTAAAATG
TGGGGATCTTAAACAAGGGAGGATGATTTTTGATAAACTATCTGTAAAGAAGGTTTTTCTCTGGAACTTTATGATGAGTGAGTATTCTGGAAGTGGCAATTATGGAGAGA
GTATAAATTTGTTCAAGGAAATGCAGGAGTTGGGGATTGAACCTGATTCTTATACATTTTCTTGCGTTTTGAAATGTTTTGCAGCAGTTGCACGTGTAGAAGAGGGTAGA
CAAGTTCATGGGTTGATCTGCAAATTGGGTTTTACTTCTCATAATGCAGTGATTAATTCGCTAATTTCTTTCTACTTTGTGGGTAGAAAGGTTAGAAGTGCACAGAAGCT
GTTCGATGAATTGTGTGACCGAGATGTTATATCATGGAACTCTATGATTAGTGGCTATGTTAAGAATGGTTTAGAAGATAAGGGAATTGAGATTTTTGTGAAGATGTTAA
TTTTTGGTGTTGATGTTGATTTGGCTACAATGGTCAATGCGCTTGTGGCTTGTGCAAATATGGGCACTCTTTTGTTGGGTAAGGCACTTCATTCTTATTCGATAAAGGCA
GCTGTTATTGACAGAGAAGTTAAGTTCAATAATACCTTACTGGACATGTACTCAAAATGTGGGGATTTGAACAGTGCCATTCAGGTTTTTGAGAAAATGGATGAGAAAAC
TGTTGTATCTTGGACTTCGATGATTGCAGGCTATGTTCGTGAAGGTCTATCCGATGGTGCGATCAAGTTGTTTGATGAAATGAAAAGCAGAGGTGTTGCCCCGGATGTTT
ATGCTGTTACAAGCATTCTTCATGCTTGTGCTATTAATGGCAATCTGAATAGTGGAAAGATTGTACACAGCTACATCAAGGAAAACGACTTGGAAACTAACTCGTCTGTT
AGTAATGCTCTTATGGATATGTATGCTAAATGTAGCAGTATGAAGGACGCTCATAGTGTGTTTTCTCACATGAAAAGGAAGGATGTTATATCGTGGAATACCATGATCGG
AGGCTACTCGAAGAATCATCTTCCAAATGAAGCTCTTAACTTGTTTACAGAGATGCAAAGAGAATCAAAGCCTGATGGCACAACAGTGGCCAGCATCCTTCCAGCCTGTG
CGACTCTTGCAGCCTTGGATAGAGGTAGAGAAATCCATGGATATGCATTAAGAAACGGATACTATAAAGAAAAATATGTTGTTAATGCACTTGTCGATATGTATGTGAAA
TGTGGGCTCTTGGTTCTTGCGCGGTCGCTCTTCGATATGATTCTCGATAAGGACCTTGTCTCATGGACAGTGATGATAGCTGGATATGGGATGCATGGCTTTGGTAGTGA
AGCTGTCAATACCTTTAATGAAATGAGAATCGCAGGGATTGAGCCAGATGAAGTATCCTTCATTTCAATTCTTTATGCCTGTAGCCATTCTGGATTGCTTGATGAAGGAT
GGAAATTTTTCAATATTATGAAGAAAGAATGTCAGATTGAACCCGAGTTGGAGCACTATGCTTGCATGGTGGATCTTCTTTCCCGAACCGGGAATCTGGAAAAGGCACGC
AAATTCATTGAAACGATGCCAATCAAACCAGATGCAACAATTTGGGGTGCATTGTTGTGTGGATGCAGGATACACCATGATGTCAAACTAGCAGAGAAAGTTGCAGAAAG
AATCTTCGAGCTAGAACCAGATAACACAGGCTATTATGTACTTTTGGCAAACATCTATGCAGAGGCAGAAAAATGGGAAGAAGTTCAAAAGTTAAGGAAGAAAATCGGAC
AACGCGGTTTGAAGAAGAATCCAGGCTGCAGTTGGATAGAGATCAAGGGCAGGGTCAATATTTTTGTTGCTGGTGATTGCTCCAAACCCCAAGCCAAGAAGATAGAGCTA
CTTCTTAAAAAACTAAGAAGCAAGATGAAGGAAGAAGGTTACTCTCCGAAAACAAGGTATGCTTTGTTAAATGCAGATGAAAGGGAGAAGGAAGTAGCCCTCTGTGAGCA
CAGTGAGAAGCTAGCCATGGCTTTTGGTATGCTGAATCTCCCACCCGGCAAGACTATACGGGTGACTAAAAACCTCCGAGTTTGCGGCGACTGTCATGAGACGGCCAAGT
TCATGTCGAAGACTACTGGACGAGAAATCGTTTTGAGAGATTCGAGTCGTTTTCATCATTTCAAAGATGGATATTGTTCTTGTAGAGGTTACTGGTGAATGAAACTATTA
GTAGCTCCATTACAATAAAGGGCACTTTTTTTTCCACCCAGTTGGCATTTGCTCTTTTCCCTATTCAAGTTTACACCTTTGCAA
Protein sequenceShow/hide protein sequence
MLLIAKAPSTFWISPAGSDNCGLLNLKFRHSFAFVKPILKISFSNLAYACTEIYTPASETKTHDVELDDSAQIVDFCEMGYLKNAMELLCSYQNSKLDLETYCSVLQLCA
EKKSIRDGRRVHSIIESNGVMIDGILGAKLVFMYVKCGDLKQGRMIFDKLSVKKVFLWNFMMSEYSGSGNYGESINLFKEMQELGIEPDSYTFSCVLKCFAAVARVEEGR
QVHGLICKLGFTSHNAVINSLISFYFVGRKVRSAQKLFDELCDRDVISWNSMISGYVKNGLEDKGIEIFVKMLIFGVDVDLATMVNALVACANMGTLLLGKALHSYSIKA
AVIDREVKFNNTLLDMYSKCGDLNSAIQVFEKMDEKTVVSWTSMIAGYVREGLSDGAIKLFDEMKSRGVAPDVYAVTSILHACAINGNLNSGKIVHSYIKENDLETNSSV
SNALMDMYAKCSSMKDAHSVFSHMKRKDVISWNTMIGGYSKNHLPNEALNLFTEMQRESKPDGTTVASILPACATLAALDRGREIHGYALRNGYYKEKYVVNALVDMYVK
CGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGSEAVNTFNEMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPELEHYACMVDLLSRTGNLEKAR
KFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPDNTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGRVNIFVAGDCSKPQAKKIEL
LLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCEHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHETAKFMSKTTGREIVLRDSSRFHHFKDGYCSCRGYW