; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013000 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013000
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr01:26007564..26010251
RNA-Seq ExpressionHG10013000
SyntenyHG10013000
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7028688.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0064.58Show/hide
Query:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG
        MKPPK  SNFN++PET++ELLRSSLLKALSSAKNT+QLR VHS II SGL LSV+FSGKLISKYAQLKDP SSVSVFRTVSPT NVYQWNSIIRALT NG
Subjt:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG

Query:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN
        LFTQALGYYTEMRE KLQPDAYTFPSVINSCARILDLK+GR+VHEHV EMGFESDLYIGNALIDMY RF DL+NARY+F+EMSDRD+VSWNSLISGYCSN
Subjt:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN

Query:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ
        GF EEALDMYHKSRM GM+PDCFTMSSVLLACGSL AV+EG+ +HG IEKIGI GD+  GNGLLSMYFKFERPRE G VF+EMA KDSVTWNTMICGYSQ
Subjt:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ

Query:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ
        LG HEESVKLFM MIDEF PDVL++TST RACGHLGDL++GK+VHKYLIG GYECDTVACNILIDMYAKCGDLLAAQEVF T  CKDSVTWNSLINGYTQ
Subjt:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ

Query:  SGFYKEGM-----MKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH
         G+YK+GM     MK ES+PD VTFVLLLS  SQLADI+ GRGIH DVIK GFE ELIIGN+LLDMYAKCG MDDLLK FSYMR  DIISWNTLIASSVH
Subjt:  SGFYKEGM-----MKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH

Query:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------
        FDDCTVG+RAI+EMRTEGLMPDEAT+LGILP+CSLLAAR+QGKEIH  IFK                                                 
Subjt:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA
                                                                                    T DKSFEQY KV+D LEYL GLMA
Subjt:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA

Query:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKI QREILVRDANRFHLFK+GTCSCGDHW
Subjt:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

TYK17500.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0066.15Show/hide
Query:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG
        MKPPK CSNFNNTPE S+E LRSSLLK LSSAKNT QLRTVHSLIITSGL+LSVIFSGKLISKY+Q+KDP SSVSVFR++SPT+NVY WNSIIRALTHNG
Subjt:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG

Query:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN
        LFTQALGYY EMREKKLQPDA+TFPSVINSCAR+LDL+LG IVH+HV+EMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMS+RD+VSWNSLISGYCSN
Subjt:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN

Query:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ
        GF E+ALDMYHK RMTGM+PD FTMSSVLLACGSLMAVKEGV VHG IEKIGI GDV IGNGLLSMYFKFER REA W+FSEMAVKDSVTWNTMICGY+Q
Subjt:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ

Query:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ
        LGRHEESVKLFMEMID F PD+L+ITST RACG  G+LQ+GKFVHKYLIGSG+ECDTVA NILIDMYAKCGDLLAAQEVF TTKCKDSVTWNSLINGYTQ
Subjt:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ

Query:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH
        SG+YKEG     MMKMES+PD VTFVLLLS FSQLADIN GRGI  DVIKFGFEAELIIGNSLLDMYAKCG MDDLLKVFSYM  HD ISWNT+IASSVH
Subjt:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH

Query:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------
        FDDCTVGF+ INEMRTEGLMPDEATVLGILP+CSLLA R+QGKEIHG IFK                                                 
Subjt:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA
                                                                                    TGDKSFEQY KVKDLLEYLVGLMA
Subjt:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA

Query:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGS LLVMKNLRVCGDCHTVTKYI+KI QREILVRDANRFH FK+G CSCGDHW
Subjt:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

XP_004134352.2 pentatricopeptide repeat-containing protein At3g03580 [Cucumis sativus]0.0e+0066.82Show/hide
Query:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG
        MKPPK CSNFNNTPE S+E LRSSLLK LSSAKNT QLRTVHSLIITSGL+LSVIFSGKLISKYAQ+KDP SSVSVFR++SPT+NVY WNSIIRALTHNG
Subjt:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG

Query:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN
        LFTQALGYYTEMREKKLQPDA+TFPSVINSCARILDL+LG IVHEH +EMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMS+RD+VSWNSLISGYCSN
Subjt:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN

Query:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ
        GF E+ALDMYHK RMTGM+PDCFTMSSVLLACGSLMAVKEGV VHG IEKIGIAGDV IGNGLLSMYFKFER REA  VFS+MAVKDSVTWNTMICGY+Q
Subjt:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ

Query:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ
        LGRHE SVKLFM+MID F PD+L+ITST RACG  GDLQVGKFVHKYLIGSG+ECDTVACNILIDMYAKCGDLLAAQEVF TTKCKDSVTWNSLINGYTQ
Subjt:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ

Query:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH
        SG+YKEG     MMKME +PD VTFVLLLS FSQLADIN GRGIH DVIKFGFEAELIIGNSLLD+YAKCG MDDLLKVFSYM  HDIISWNT+IASSVH
Subjt:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH

Query:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------
        FDDCTVGF+ INEMRTEGLMPDEATVLGILP+CSLLA R+QGKEIHG IFK                                                 
Subjt:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA
                                                                                    TGDKSFEQY KVKDLLEYLV LMA
Subjt:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA

Query:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKI QREILVRDANRFH FK+G CSCGDHW
Subjt:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

XP_016899076.1 PREDICTED: pentatricopeptide repeat-containing protein At3g03580 [Cucumis melo]0.0e+0066.37Show/hide
Query:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG
        MKPPK CSNFNNTPE S+ELLRSSLLK LSSAKNT QLRTVHSLIITSGL+LSVIFSGKLISKY+Q+KDP SSVSVFR++SPT+NVY WNSIIRALTHNG
Subjt:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG

Query:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN
        LFTQALGYY EMREKKLQPDA+TFPSVINSCAR+LDL+LG IVH+HV+EMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMS+RD+VSWNSLISGYCSN
Subjt:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN

Query:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ
        GF EEALDMYHK RMTGM+PD FTMSSVLLACGSLMAVKEGV VHG IEKIGI GDV IGNGLLSMYFKFER REA W+FSEMAVKDSVTWNTMICGY+Q
Subjt:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ

Query:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ
        LGRHEESVKLFMEMID F PD+L+ITST RACG  G+LQ+GKFVHKYLIGSG+ECDTVA NILIDMYAKCGDLLAAQEVF TTKCKDSVTWNSLINGYTQ
Subjt:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ

Query:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH
        SG+YKEG     MMKMES+PD VTFVLLLS FSQLADIN GRGI  DVIKFGFEAELIIGNSLLDMYAKCG MDDLLKVFSYM  HD ISWNT+IASSVH
Subjt:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH

Query:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------
        FDDCTVGF+ INEMRTEGLMPDEATVLGILP+CSLLA R+QGKEIHG IFK                                                 
Subjt:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA
                                                                                    TGDKSFEQY KVKDLLEYLVGLMA
Subjt:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA

Query:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGS LLVMKNLRVCGDCHTVTKYI+KI QREILVRDANRFH FK+G CSCGDHW
Subjt:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

XP_038897296.1 pentatricopeptide repeat-containing protein At3g03580 [Benincasa hispida]0.0e+0068.38Show/hide
Query:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG
        M PPK CSNFN++PETS+E +RSSLLKALSSAKNT+QLRTVHSLIITSGLALSVIFSGKLISKYAQLKDP SSVSVFRTVSPTHNVYQWNSIIRALTHNG
Subjt:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG

Query:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN
        LFTQALGYYT+MREKKLQPDA+TFPSVINSC R+LDLK GRIVHEH++EMGFESDLYIGNALIDMYSR VDLDNAR VFEEMSDRD VSWNSLISGYC N
Subjt:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN

Query:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ
        GF EEALDMYHKSRMTGM+PDCFTMSSVLL+CGSLMA+KEGVTVHGAIEKIGI GDV IGNGLLSMYFKFERPREAG VFSEMAVKDSVTWNTMICGYSQ
Subjt:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ

Query:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ
        LGRHEESVKLFMEMIDEFTPDVLTITST RACGHLGDLQVGKFVHKYLIGSGYECDT+ACNILIDMYAKCGDLLAAQEVF +TKCKDSVTWNS+INGYTQ
Subjt:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ

Query:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH
        SG+YKEG     MMKMES+PD VTFVLLLS FSQLA+IN GRGIH DVIKFGF+AELIIGNSLLDMYAKCG M+DLLK+FSYMR HDIISWNTLIASSVH
Subjt:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH

Query:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------
        FDDC VGFRAINEMRTEGL+PDEATVLGILP+ SLLA RQQGKEIHG IFK                                                 
Subjt:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA
                                                                                    TGDKSFEQY KVKDLLEYLVGLMA
Subjt:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA

Query:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLR CGDCHTVTKYITKI QREILVRDANRFHLFKNGTCSCGDHW
Subjt:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

TrEMBL top hitse value%identityAlignment
A0A0A0L4F4 DYW_deaminase domain-containing protein0.0e+0066.82Show/hide
Query:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG
        MKPPK CSNFNNTPE S+E LRSSLLK LSSAKNT QLRTVHSLIITSGL+LSVIFSGKLISKYAQ+KDP SSVSVFR++SPT+NVY WNSIIRALTHNG
Subjt:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG

Query:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN
        LFTQALGYYTEMREKKLQPDA+TFPSVINSCARILDL+LG IVHEH +EMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMS+RD+VSWNSLISGYCSN
Subjt:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN

Query:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ
        GF E+ALDMYHK RMTGM+PDCFTMSSVLLACGSLMAVKEGV VHG IEKIGIAGDV IGNGLLSMYFKFER REA  VFS+MAVKDSVTWNTMICGY+Q
Subjt:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ

Query:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ
        LGRHE SVKLFM+MID F PD+L+ITST RACG  GDLQVGKFVHKYLIGSG+ECDTVACNILIDMYAKCGDLLAAQEVF TTKCKDSVTWNSLINGYTQ
Subjt:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ

Query:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH
        SG+YKEG     MMKME +PD VTFVLLLS FSQLADIN GRGIH DVIKFGFEAELIIGNSLLD+YAKCG MDDLLKVFSYM  HDIISWNT+IASSVH
Subjt:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH

Query:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------
        FDDCTVGF+ INEMRTEGLMPDEATVLGILP+CSLLA R+QGKEIHG IFK                                                 
Subjt:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA
                                                                                    TGDKSFEQY KVKDLLEYLV LMA
Subjt:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA

Query:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKI QREILVRDANRFH FK+G CSCGDHW
Subjt:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

A0A1S4DSV8 pentatricopeptide repeat-containing protein At3g035800.0e+0066.37Show/hide
Query:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG
        MKPPK CSNFNNTPE S+ELLRSSLLK LSSAKNT QLRTVHSLIITSGL+LSVIFSGKLISKY+Q+KDP SSVSVFR++SPT+NVY WNSIIRALTHNG
Subjt:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG

Query:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN
        LFTQALGYY EMREKKLQPDA+TFPSVINSCAR+LDL+LG IVH+HV+EMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMS+RD+VSWNSLISGYCSN
Subjt:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN

Query:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ
        GF EEALDMYHK RMTGM+PD FTMSSVLLACGSLMAVKEGV VHG IEKIGI GDV IGNGLLSMYFKFER REA W+FSEMAVKDSVTWNTMICGY+Q
Subjt:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ

Query:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ
        LGRHEESVKLFMEMID F PD+L+ITST RACG  G+LQ+GKFVHKYLIGSG+ECDTVA NILIDMYAKCGDLLAAQEVF TTKCKDSVTWNSLINGYTQ
Subjt:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ

Query:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH
        SG+YKEG     MMKMES+PD VTFVLLLS FSQLADIN GRGI  DVIKFGFEAELIIGNSLLDMYAKCG MDDLLKVFSYM  HD ISWNT+IASSVH
Subjt:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH

Query:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------
        FDDCTVGF+ INEMRTEGLMPDEATVLGILP+CSLLA R+QGKEIHG IFK                                                 
Subjt:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA
                                                                                    TGDKSFEQY KVKDLLEYLVGLMA
Subjt:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA

Query:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGS LLVMKNLRVCGDCHTVTKYI+KI QREILVRDANRFH FK+G CSCGDHW
Subjt:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

A0A5A7TZN0 Pentatricopeptide repeat-containing protein0.0e+0066.37Show/hide
Query:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG
        MKPPK CSNFNNTPE S+ELLRSSLLK LSSAKNT QLRTVHSLIITSGL+LSVIFSGKLISKY+Q+KDP SSVSVFR++SPT+NVY WNSIIRALTHNG
Subjt:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG

Query:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN
        LFTQALGYY EMREKKLQPDA+TFPSVINSCAR+LDL+LG IVH+HV+EMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMS+RD+VSWNSLISGYCSN
Subjt:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN

Query:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ
        GF EEALDMYHK RMTGM+PD FTMSSVLLACGSLMAVKEGV VHG IEKIGI GDV IGNGLLSMYFKFER REA W+FSEMAVKDSVTWNTMICGY+Q
Subjt:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ

Query:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ
        LGRHEESVKLFMEMID F PD+L+ITST RACG  G+LQ+GKFVHKYLIGSG+ECDTVA NILIDMYAKCGDLLAAQEVF TTKCKDSVTWNSLINGYTQ
Subjt:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ

Query:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH
        SG+YKEG     MMKMES+PD VTFVLLLS FSQLADIN GRGI  DVIKFGFEAELIIGNSLLDMYAKCG MDDLLKVFSYM  HD ISWNT+IASSVH
Subjt:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH

Query:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------
        FDDCTVGF+ INEMRTEGLMPDEATVLGILP+CSLLA R+QGKEIHG IFK                                                 
Subjt:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA
                                                                                    TGDKSFEQY KVKDLLEYLVGLMA
Subjt:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA

Query:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGS LLVMKNLRVCGDCHTVTKYI+KI QREILVRDANRFH FK+G CSCGDHW
Subjt:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

A0A5D3D1L1 Pentatricopeptide repeat-containing protein0.0e+0066.15Show/hide
Query:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG
        MKPPK CSNFNNTPE S+E LRSSLLK LSSAKNT QLRTVHSLIITSGL+LSVIFSGKLISKY+Q+KDP SSVSVFR++SPT+NVY WNSIIRALTHNG
Subjt:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG

Query:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN
        LFTQALGYY EMREKKLQPDA+TFPSVINSCAR+LDL+LG IVH+HV+EMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMS+RD+VSWNSLISGYCSN
Subjt:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN

Query:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ
        GF E+ALDMYHK RMTGM+PD FTMSSVLLACGSLMAVKEGV VHG IEKIGI GDV IGNGLLSMYFKFER REA W+FSEMAVKDSVTWNTMICGY+Q
Subjt:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ

Query:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ
        LGRHEESVKLFMEMID F PD+L+ITST RACG  G+LQ+GKFVHKYLIGSG+ECDTVA NILIDMYAKCGDLLAAQEVF TTKCKDSVTWNSLINGYTQ
Subjt:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ

Query:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH
        SG+YKEG     MMKMES+PD VTFVLLLS FSQLADIN GRGI  DVIKFGFEAELIIGNSLLDMYAKCG MDDLLKVFSYM  HD ISWNT+IASSVH
Subjt:  SGFYKEG-----MMKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH

Query:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------
        FDDCTVGF+ INEMRTEGLMPDEATVLGILP+CSLLA R+QGKEIHG IFK                                                 
Subjt:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA
                                                                                    TGDKSFEQY KVKDLLEYLVGLMA
Subjt:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA

Query:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGS LLVMKNLRVCGDCHTVTKYI+KI QREILVRDANRFH FK+G CSCGDHW
Subjt:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

A0A6J1GDL7 pentatricopeptide repeat-containing protein At3g035800.0e+0064.25Show/hide
Query:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG
        MKPPK  SNFN++PET++ELLRSSLLKALSSAKNT+QLR VHS II SGL LSV+FSGKLISKYAQLKDP SSVSVFRTVSPT NVYQWNSIIRALT NG
Subjt:  MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNG

Query:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN
        LFTQALGYYTEMRE KLQPDAYTFPSVINSCAR+LDLK+GR+VHEHV EMGFESDLYIGNALIDMY RF DL+NARY+F+EMSDRD+VSWNSLISGYCSN
Subjt:  LFTQALGYYTEMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSN

Query:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ
        GF EEALDMYHKSR+ GM+PDCFTMSSVLLACGSL AV+EG+ +HG IEKIGI GD+  GNGLLSMYFKFERPRE G VF+EMA KDSVTWNTMICGYSQ
Subjt:  GFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQ

Query:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ
        LG HEESVKLFM MIDEF PDVL++TST RACGHLGDL++GK+VHKYLIG GYECDTVACNILIDMYAKCGDLLAAQEVF T   KDSVTWNSLINGYTQ
Subjt:  LGRHEESVKLFMEMIDEFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQ

Query:  SGFYKEGM-----MKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH
         G+YKEGM     MK ES+PD VTFVLLLS  SQLADI+ GRGIH DVIK GFE ELIIGN+LLDMYAKCG MDDLLK FSYMR  DIISWNTLIASSVH
Subjt:  SGFYKEGM-----MKMESRPDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVH

Query:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------
        FDDCTVG++AI+EMRTEGLMPDEAT+LGILP+CSLLAAR+QGKEIH  IFK                                                 
Subjt:  FDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFK-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA
                                                                                    T DKSFEQY KV+D LEYL GLMA
Subjt:  ----------------------------------------------------------------------------TGDKSFEQYGKVKDLLEYLVGLMA

Query:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKI QREILVRDANRFHLFK+GTCSCGDHW
Subjt:  KEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

SwissProt top hitse value%identityAlignment
O81767 Pentatricopeptide repeat-containing protein At4g339907.5e-9729.5Show/hide
Query:  NTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYT-EMREKKLQPDAYTFPSVINSCA
        N    + +H+ ++ S    +V  S KL++ Y  L +   +   F  +    +VY WN +I      G  ++ +  ++  M    L PD  TFPSV+ +C 
Subjt:  NTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYT-EMREKKLQPDAYTFPSVINSCA

Query:  RILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMSSVLLAC
         ++D   G  +H   ++ GF  D+Y+  +LI +YSR+  + NAR +F+EM  RD  SWN++ISGYC +G  +EAL + +  R      D  T+ S+L AC
Subjt:  RILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMSSVLLAC

Query:  GSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEM-IDEFTPDVLTITSTSRA
                GVT+H    K G+  ++F+ N L+ +Y +F R R+   VF  M V+D ++WN++I  Y    +   ++ LF EM +    PD LT+ S +  
Subjt:  GSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEM-IDEFTPDVLTITSTSRA

Query:  CGHLGDLQVGKFVHKYLIGSGYECDTVAC-NILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEGM-------MKMESRPDPVTFVLL
           LGD++  + V  + +  G+  + +   N ++ MYAK G + +A+ VF      D ++WN++I+GY Q+GF  E +        + E   +  T+V +
Subjt:  CGHLGDLQVGKFVHKYLIGSGYECDTVAC-NILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEGM-------MKMESRPDPVTFVLL

Query:  LSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDEATVLG
        L   SQ   +  G  +H  ++K G   ++ +  SL DMY KCG ++D L +F  +   + + WNTLIA                EM  EG+ PD  T + 
Subjt:  LSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDEATVLG

Query:  ILPICSLLAARQQGK----------------------------------------------------------EIHGS----------------------
        +L  CS      +G+                                                           +HG+                      
Subjt:  ILPICSLLAARQQGK----------------------------------------------------------EIHGS----------------------

Query:  --------------------------------------------IFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDMLCGHSE
                                                    +F TG+++   Y ++   L  L   +   GYV D +F L DVE+D+K  +L  HSE
Subjt:  --------------------------------------------IFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDMLCGHSE

Query:  RLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        RLAIAF L+ T   + + + KNLRVCGDCH+VTK+I+KIT+REI+VRD+NRFH FKNG CSCGD+W
Subjt:  RLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127703.3e-10031.25Show/hide
Query:  LSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAYTFPSVI
        + SA +  QL+ +H+ ++  GL  S     KLI   +   D   +  VF  + P   ++ WN+IIR  + N  F  AL  Y+ M+  ++ PD++TFP ++
Subjt:  LSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAYTFPSVI

Query:  NSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFE--EMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMS
         +C+ +  L++GR VH  V  +GF++D+++ N LI +Y++   L +AR VFE   + +R  VSW +++S Y  NG   EAL+++ + R   + PD   + 
Subjt:  NSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFE--EMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMS

Query:  SVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMID-EFTPDVLTI
        SVL A   L  +K+G ++H ++ K+G+  +  +   L +MY K  +   A  +F +M   + + WN MI GY++ G   E++ +F EMI+ +  PD ++I
Subjt:  SVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMID-EFTPDVLTI

Query:  TSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKE------GMMKMESRPDPVT
        TS   AC  +G L+  + +++Y+  S Y  D    + LIDM+AKCG +  A+ VF  T  +D V W+++I GY   G  +E       M +    P+ VT
Subjt:  TSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKE------GMMKMESRPDPVT

Query:  FVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVH-DIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDE
        F+ LL   +    +  G    + +       +      ++D+  + G +D   +V   M V   +  W  L+++        +G  A  ++       D 
Subjt:  FVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVH-DIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDE

Query:  ATVLGILPICSLLAA--------------RQQG---------KEIHGSI--FKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDM
        +     + + +L AA              +++G          E+ G +  F+ GDKS  +Y +++  +E++   + + G+VA+   +LHD+ +++  + 
Subjt:  ATVLGILPICSLLAA--------------RQQG---------KEIHGSI--FKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDM

Query:  LCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        LC HSER+AIA+GL++T  G+PL + KNLR C +CH  TK I+K+  REI+VRD NRFH FK+G CSCGD+W
Subjt:  LCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233301.8e-11432.78Show/hide
Query:  TSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREK
        +S + L  +L+K  +  K+ +Q + +H+  I +  +LS   +  +IS Y  LK    ++ +F+T+  +  V  W S+IR  T   LF++AL  + EMR  
Subjt:  TSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREK

Query:  KLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFV------------------------------------DLDNARYVF
           PD   FPSV+ SC  ++DL+ G  VH  ++ +G + DLY GNAL++MY++ +                                     +D+ R VF
Subjt:  KLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFV------------------------------------DLDNARYVF

Query:  EEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWV
        E M  +D VS+N++I+GY  +G  E+AL M  +   T + PD FT+SSVL      + V +G  +HG + + GI  DV+IG+ L+ MY K  R  ++  V
Subjt:  EEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWV

Query:  FSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMID-EFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQE
        FS +  +D ++WN+++ GY Q GR+ E+++LF +M+  +  P  +  +S   AC HL  L +GK +H Y++  G+  +    + L+DMY+KCG++ AA++
Subjt:  FSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMID-EFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQE

Query:  VFGTTKCKDSVTWNSLINGYTQSGFYKEGMMKMES------RPDPVTFVLLLSTFSQLADINLGRGIHSDVIK-FGFEAELIIGNSLLDMYAKCGAMDDL
        +F      D V+W ++I G+   G   E +   E       +P+ V FV +L+  S +  ++   G  + + K +G   EL    ++ D+  + G +++ 
Subjt:  VFGTTKCKDSVTWNSLINGYTQSGFYKEGMMKMES------RPDPVTFVLLLSTFSQLADINLGRGIHSDVIK-FGFEAELIIGNSLLDMYAKCGAMDDL

Query:  LKVFSYMRVHDIIS-WNTLIAS-SVHFDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEI---------------------------HG
            S M V    S W+TL++S SVH            ++  +    D   +   + +C++ A+  + KE+                           HG
Subjt:  LKVFSYMRVHDIIS-WNTLIAS-SVHFDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEI---------------------------HG

Query:  SIFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQRE
          F +GD+S     K+ + L+ ++  M KEGYVAD    LHDV+E+ KR++L GHSERLA+AFG++NT+PG+ + V KN+R+C DCH   K+I+KIT+RE
Subjt:  SIFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQRE

Query:  ILVRDANRFHLFKNGTCSCGDHW
        I+VRD +RFH F  G CSCGD+W
Subjt:  ILVRDANRFHLFKNGTCSCGDHW

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic7.5e-9733.19Show/hide
Query:  SSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAY
        S + K+ SS ++      +H  I+ SG          L++ Y + +   S+  VF  ++   +V  WNSII     NGL  + L  + +M    ++ D  
Subjt:  SSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAY

Query:  TFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDC
        T  SV   CA    + LGR VH   ++  F  +    N L+DMYS+  DLD+A+ VF EMSDR  VS+ S+I+GY   G   EA+ ++ +    G+ PD 
Subjt:  TFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDC

Query:  FTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMIDE--FTP
        +T+++VL  C     + EG  VH  I++  +  D+F+ N L+ MY K    +EA  VFSEM VKD ++WNT+I GYS+     E++ LF  +++E  F+P
Subjt:  FTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMIDE--FTP

Query:  DVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEG------MMKMESR
        D  T+     AC  L     G+ +H Y++ +GY  D    N L+DMYAKCG LL A  +F     KD V+W  +I GY   GF KE       M +    
Subjt:  DVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEG------MMKMESR

Query:  PDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGN--SLLDMYAKCGAMDDLLKVFSYMR----VHDIISWNTLIAS-SVHFDDCTVGFRAIN
         D ++FV LL   S    ++ G     ++++   + E  + +   ++DM A+ G   DL+K + ++       D   W  L+    +H D        + 
Subjt:  PDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGN--SLLDMYAKCGAMDDLLKVFSYMR----VHDIISWNTLIAS-SVHFDDCTVGFRAIN

Query:  EMRTEGLMPDEATVLGILPICSLLAAR---------------QQGK---------EIHG--SIFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFA
        E   E +   E    G   + + + A                Q+G          EI G  +IF  GD S  +   ++  L  +   M +EGY    ++A
Subjt:  EMRTEGLMPDEATVLGILPICSLLAAR---------------QQGK---------EIHG--SIFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFA

Query:  LHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        L D EE +K + LCGHSE+LA+A G++++  G  + V KNLRVCGDCH + K+++K+T+REI++RD+NRFH FK+G CSC   W
Subjt:  LHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

Q9SS60 Pentatricopeptide repeat-containing protein At3g035804.9e-18140.3Show/hide
Query:  KALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAYTFPS
        +ALSS+ N  +LR +H+L+I+ GL  S  FSGKLI KY+  ++P SS+SVFR VSP  NVY WNSIIRA + NGLF +AL +Y ++RE K+ PD YTFPS
Subjt:  KALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAYTFPS

Query:  VINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMS
        VI +CA + D ++G +V+E +++MGFESDL++GNAL+DMYSR   L  AR VF+EM  RD VSWNSLISGY S+G+ EEAL++YH+ + + ++PD FT+S
Subjt:  VINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMS

Query:  SVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMIDEFTPDVLTIT
        SVL A G+L+ VK+G  +HG   K G+   V + NGL++MY KF RP +A  VF EM V+DSV++NTMICGY +L   EESV++F+E +D+F PD+LT++
Subjt:  SVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMIDEFTPDVLTIT

Query:  STSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEG------MMKMESRPDPVTF
        S  RACGHL DL + K+++ Y++ +G+  ++   NILID+YAKCGD++ A++VF + +CKD+V+WNS+I+GY QSG   E       MM ME + D +T+
Subjt:  STSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEG------MMKMESRPDPVTF

Query:  VLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDEAT
        ++L+S  ++LAD+  G+G+HS+ IK G   +L + N+L+DMYAKCG + D LK+FS M   D ++WNT+I++ V F D   G +   +MR   ++PD AT
Subjt:  VLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDEAT

Query:  VLGILPICSLLAARQQGKEIH-----------------------------------------------GSIFKTG-------------------------
         L  LP+C+ LAA++ GKEIH                                               G I+  G                         
Subjt:  VLGILPICSLLAARQQGKEIH-----------------------------------------------GSIFKTG-------------------------

Query:  -------------------------------DKSFEQYGKVKDL--------------------------------------------------------
                                       D   E Y  V DL                                                        
Subjt:  -------------------------------DKSFEQYGKVKDL--------------------------------------------------------

Query:  ------------------------------------------------------------------LEYLVGLMAKEGYVADLQFALHDV-EEDDKRDML
                                                                          LE L  LMAKEGY+ D +    ++ EE++KR ++
Subjt:  ------------------------------------------------------------------LEYLVGLMAKEGYVADLQFALHDV-EEDDKRDML

Query:  CGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        CGHSERLAIAFGLLNT+PG+PL VMKNLRVCGDCH VTK I+KI  REILVRDANRFHLFK+GTCSC D W
Subjt:  CGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

Arabidopsis top hitse value%identityAlignment
AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.5e-18240.3Show/hide
Query:  KALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAYTFPS
        +ALSS+ N  +LR +H+L+I+ GL  S  FSGKLI KY+  ++P SS+SVFR VSP  NVY WNSIIRA + NGLF +AL +Y ++RE K+ PD YTFPS
Subjt:  KALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAYTFPS

Query:  VINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMS
        VI +CA + D ++G +V+E +++MGFESDL++GNAL+DMYSR   L  AR VF+EM  RD VSWNSLISGY S+G+ EEAL++YH+ + + ++PD FT+S
Subjt:  VINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMS

Query:  SVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMIDEFTPDVLTIT
        SVL A G+L+ VK+G  +HG   K G+   V + NGL++MY KF RP +A  VF EM V+DSV++NTMICGY +L   EESV++F+E +D+F PD+LT++
Subjt:  SVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMIDEFTPDVLTIT

Query:  STSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEG------MMKMESRPDPVTF
        S  RACGHL DL + K+++ Y++ +G+  ++   NILID+YAKCGD++ A++VF + +CKD+V+WNS+I+GY QSG   E       MM ME + D +T+
Subjt:  STSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEG------MMKMESRPDPVTF

Query:  VLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDEAT
        ++L+S  ++LAD+  G+G+HS+ IK G   +L + N+L+DMYAKCG + D LK+FS M   D ++WNT+I++ V F D   G +   +MR   ++PD AT
Subjt:  VLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDEAT

Query:  VLGILPICSLLAARQQGKEIH-----------------------------------------------GSIFKTG-------------------------
         L  LP+C+ LAA++ GKEIH                                               G I+  G                         
Subjt:  VLGILPICSLLAARQQGKEIH-----------------------------------------------GSIFKTG-------------------------

Query:  -------------------------------DKSFEQYGKVKDL--------------------------------------------------------
                                       D   E Y  V DL                                                        
Subjt:  -------------------------------DKSFEQYGKVKDL--------------------------------------------------------

Query:  ------------------------------------------------------------------LEYLVGLMAKEGYVADLQFALHDV-EEDDKRDML
                                                                          LE L  LMAKEGY+ D +    ++ EE++KR ++
Subjt:  ------------------------------------------------------------------LEYLVGLMAKEGYVADLQFALHDV-EEDDKRDML

Query:  CGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        CGHSERLAIAFGLLNT+PG+PL VMKNLRVCGDCH VTK I+KI  REILVRDANRFHLFK+GTCSC D W
Subjt:  CGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

AT3G12770.1 mitochondrial editing factor 222.3e-10131.25Show/hide
Query:  LSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAYTFPSVI
        + SA +  QL+ +H+ ++  GL  S     KLI   +   D   +  VF  + P   ++ WN+IIR  + N  F  AL  Y+ M+  ++ PD++TFP ++
Subjt:  LSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAYTFPSVI

Query:  NSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFE--EMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMS
         +C+ +  L++GR VH  V  +GF++D+++ N LI +Y++   L +AR VFE   + +R  VSW +++S Y  NG   EAL+++ + R   + PD   + 
Subjt:  NSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFE--EMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMS

Query:  SVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMID-EFTPDVLTI
        SVL A   L  +K+G ++H ++ K+G+  +  +   L +MY K  +   A  +F +M   + + WN MI GY++ G   E++ +F EMI+ +  PD ++I
Subjt:  SVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMID-EFTPDVLTI

Query:  TSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKE------GMMKMESRPDPVT
        TS   AC  +G L+  + +++Y+  S Y  D    + LIDM+AKCG +  A+ VF  T  +D V W+++I GY   G  +E       M +    P+ VT
Subjt:  TSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKE------GMMKMESRPDPVT

Query:  FVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVH-DIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDE
        F+ LL   +    +  G    + +       +      ++D+  + G +D   +V   M V   +  W  L+++        +G  A  ++       D 
Subjt:  FVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVH-DIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDE

Query:  ATVLGILPICSLLAA--------------RQQG---------KEIHGSI--FKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDM
        +     + + +L AA              +++G          E+ G +  F+ GDKS  +Y +++  +E++   + + G+VA+   +LHD+ +++  + 
Subjt:  ATVLGILPICSLLAA--------------RQQG---------KEIHGSI--FKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDM

Query:  LCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        LC HSER+AIA+GL++T  G+PL + KNLR C +CH  TK I+K+  REI+VRD NRFH FK+G CSCGD+W
Subjt:  LCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-11532.78Show/hide
Query:  TSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREK
        +S + L  +L+K  +  K+ +Q + +H+  I +  +LS   +  +IS Y  LK    ++ +F+T+  +  V  W S+IR  T   LF++AL  + EMR  
Subjt:  TSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREK

Query:  KLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFV------------------------------------DLDNARYVF
           PD   FPSV+ SC  ++DL+ G  VH  ++ +G + DLY GNAL++MY++ +                                     +D+ R VF
Subjt:  KLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFV------------------------------------DLDNARYVF

Query:  EEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWV
        E M  +D VS+N++I+GY  +G  E+AL M  +   T + PD FT+SSVL      + V +G  +HG + + GI  DV+IG+ L+ MY K  R  ++  V
Subjt:  EEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWV

Query:  FSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMID-EFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQE
        FS +  +D ++WN+++ GY Q GR+ E+++LF +M+  +  P  +  +S   AC HL  L +GK +H Y++  G+  +    + L+DMY+KCG++ AA++
Subjt:  FSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMID-EFTPDVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQE

Query:  VFGTTKCKDSVTWNSLINGYTQSGFYKEGMMKMES------RPDPVTFVLLLSTFSQLADINLGRGIHSDVIK-FGFEAELIIGNSLLDMYAKCGAMDDL
        +F      D V+W ++I G+   G   E +   E       +P+ V FV +L+  S +  ++   G  + + K +G   EL    ++ D+  + G +++ 
Subjt:  VFGTTKCKDSVTWNSLINGYTQSGFYKEGMMKMES------RPDPVTFVLLLSTFSQLADINLGRGIHSDVIK-FGFEAELIIGNSLLDMYAKCGAMDDL

Query:  LKVFSYMRVHDIIS-WNTLIAS-SVHFDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEI---------------------------HG
            S M V    S W+TL++S SVH            ++  +    D   +   + +C++ A+  + KE+                           HG
Subjt:  LKVFSYMRVHDIIS-WNTLIAS-SVHFDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEI---------------------------HG

Query:  SIFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQRE
          F +GD+S     K+ + L+ ++  M KEGYVAD    LHDV+E+ KR++L GHSERLA+AFG++NT+PG+ + V KN+R+C DCH   K+I+KIT+RE
Subjt:  SIFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQRE

Query:  ILVRDANRFHLFKNGTCSCGDHW
        I+VRD +RFH F  G CSCGD+W
Subjt:  ILVRDANRFHLFKNGTCSCGDHW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein5.3e-9833.19Show/hide
Query:  SSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAY
        S + K+ SS ++      +H  I+ SG          L++ Y + +   S+  VF  ++   +V  WNSII     NGL  + L  + +M    ++ D  
Subjt:  SSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYTEMREKKLQPDAY

Query:  TFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDC
        T  SV   CA    + LGR VH   ++  F  +    N L+DMYS+  DLD+A+ VF EMSDR  VS+ S+I+GY   G   EA+ ++ +    G+ PD 
Subjt:  TFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDC

Query:  FTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMIDE--FTP
        +T+++VL  C     + EG  VH  I++  +  D+F+ N L+ MY K    +EA  VFSEM VKD ++WNT+I GYS+     E++ LF  +++E  F+P
Subjt:  FTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMIDE--FTP

Query:  DVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEG------MMKMESR
        D  T+     AC  L     G+ +H Y++ +GY  D    N L+DMYAKCG LL A  +F     KD V+W  +I GY   GF KE       M +    
Subjt:  DVLTITSTSRACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEG------MMKMESR

Query:  PDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGN--SLLDMYAKCGAMDDLLKVFSYMR----VHDIISWNTLIAS-SVHFDDCTVGFRAIN
         D ++FV LL   S    ++ G     ++++   + E  + +   ++DM A+ G   DL+K + ++       D   W  L+    +H D        + 
Subjt:  PDPVTFVLLLSTFSQLADINLGRGIHSDVIKFGFEAELIIGN--SLLDMYAKCGAMDDLLKVFSYMR----VHDIISWNTLIAS-SVHFDDCTVGFRAIN

Query:  EMRTEGLMPDEATVLGILPICSLLAAR---------------QQGK---------EIHG--SIFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFA
        E   E +   E    G   + + + A                Q+G          EI G  +IF  GD S  +   ++  L  +   M +EGY    ++A
Subjt:  EMRTEGLMPDEATVLGILPICSLLAAR---------------QQGK---------EIHG--SIFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFA

Query:  LHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        L D EE +K + LCGHSE+LA+A G++++  G  + V KNLRVCGDCH + K+++K+T+REI++RD+NRFH FK+G CSC   W
Subjt:  LHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW

AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.3e-9829.5Show/hide
Query:  NTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYT-EMREKKLQPDAYTFPSVINSCA
        N    + +H+ ++ S    +V  S KL++ Y  L +   +   F  +    +VY WN +I      G  ++ +  ++  M    L PD  TFPSV+ +C 
Subjt:  NTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYT-EMREKKLQPDAYTFPSVINSCA

Query:  RILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMSSVLLAC
         ++D   G  +H   ++ GF  D+Y+  +LI +YSR+  + NAR +F+EM  RD  SWN++ISGYC +G  +EAL + +  R      D  T+ S+L AC
Subjt:  RILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLPDCFTMSSVLLAC

Query:  GSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEM-IDEFTPDVLTITSTSRA
                GVT+H    K G+  ++F+ N L+ +Y +F R R+   VF  M V+D ++WN++I  Y    +   ++ LF EM +    PD LT+ S +  
Subjt:  GSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEM-IDEFTPDVLTITSTSRA

Query:  CGHLGDLQVGKFVHKYLIGSGYECDTVAC-NILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEGM-------MKMESRPDPVTFVLL
           LGD++  + V  + +  G+  + +   N ++ MYAK G + +A+ VF      D ++WN++I+GY Q+GF  E +        + E   +  T+V +
Subjt:  CGHLGDLQVGKFVHKYLIGSGYECDTVAC-NILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEGM-------MKMESRPDPVTFVLL

Query:  LSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDEATVLG
        L   SQ   +  G  +H  ++K G   ++ +  SL DMY KCG ++D L +F  +   + + WNTLIA                EM  EG+ PD  T + 
Subjt:  LSTFSQLADINLGRGIHSDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDEATVLG

Query:  ILPICSLLAARQQGK----------------------------------------------------------EIHGS----------------------
        +L  CS      +G+                                                           +HG+                      
Subjt:  ILPICSLLAARQQGK----------------------------------------------------------EIHGS----------------------

Query:  --------------------------------------------IFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDMLCGHSE
                                                    +F TG+++   Y ++   L  L   +   GYV D +F L DVE+D+K  +L  HSE
Subjt:  --------------------------------------------IFKTGDKSFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDMLCGHSE

Query:  RLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW
        RLAIAF L+ T   + + + KNLRVCGDCH+VTK+I+KIT+REI+VRD+NRFH FKNG CSCGD+W
Subjt:  RLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCSCGDHW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACCGCCAAAATCCTGTTCCAATTTCAACAATACACCGGAAACTTCACGAGAACTTCTCCGTTCTTCACTACTGAAAGCTCTCTCTTCAGCCAAAAACACCACACA
GCTACGTACTGTTCATTCCTTGATCATCACTTCAGGTTTGGCCCTTTCCGTCATCTTTTCCGGCAAACTCATAAGCAAATACGCCCAGCTTAAAGACCCAAAGTCGTCTG
TTTCAGTTTTTCGCACTGTTTCTCCAACTCACAATGTCTATCAATGGAATTCAATTATCCGTGCTCTCACTCACAATGGTCTCTTCACACAAGCACTTGGATATTACACT
GAGATGCGTGAAAAAAAGCTCCAACCCGATGCTTATACCTTTCCTTCTGTTATCAATTCATGTGCCCGTATTTTGGACTTGAAATTGGGTCGCATTGTTCATGAACATGT
TATTGAAATGGGGTTTGAATCGGATTTATATATTGGCAACGCATTGATCGATATGTATTCTAGATTTGTGGATCTTGATAATGCGCGTTATGTGTTTGAGGAAATGTCTG
ACCGAGACAATGTATCGTGGAATAGTTTAATTTCTGGGTATTGTTCTAATGGATTTTTGGAGGAGGCTTTGGACATGTATCACAAATCCAGAATGACTGGGATGTTACCT
GATTGTTTCACTATGTCCAGTGTTTTACTCGCTTGTGGAAGCTTAATGGCTGTTAAAGAAGGTGTGACTGTTCATGGGGCGATTGAGAAGATTGGAATTGCTGGGGATGT
TTTTATAGGCAATGGACTTCTTTCCATGTACTTCAAGTTTGAGAGACCAAGAGAAGCAGGTTGGGTTTTTTCCGAGATGGCTGTGAAGGACTCAGTTACCTGGAATACCA
TGATTTGTGGGTACTCCCAACTGGGGCGGCACGAAGAATCTGTCAAGTTATTTATGGAGATGATAGATGAATTCACTCCAGATGTGTTGACAATTACATCGACCAGTCGC
GCTTGTGGACACTTGGGAGATCTGCAGGTTGGAAAGTTTGTTCATAAGTACTTAATTGGGAGTGGGTATGAATGTGATACTGTGGCATGTAATATCCTTATAGATATGTA
TGCTAAATGCGGGGATCTTTTGGCTGCACAGGAAGTCTTTGGCACTACAAAATGCAAGGATTCTGTGACATGGAACTCCCTAATTAACGGCTACACTCAAAGTGGCTTTT
ATAAAGAGGGGATGATGAAAATGGAAAGCAGACCAGATCCTGTCACTTTTGTTCTGCTCCTATCTACATTTTCTCAGTTAGCTGATATAAATCTGGGGAGAGGAATCCAT
AGTGATGTGATAAAGTTTGGATTTGAAGCTGAACTTATAATTGGCAATTCTCTTCTGGATATGTATGCTAAATGTGGTGCAATGGATGACTTATTGAAGGTGTTTTCTTA
CATGAGAGTTCATGACATTATATCATGGAATACCCTTATTGCTTCAAGTGTTCATTTTGATGATTGCACTGTAGGATTTCGGGCGATTAATGAAATGAGGACTGAAGGGT
TGATGCCAGATGAGGCCACGGTACTAGGTATTTTGCCCATCTGTTCTCTGCTTGCAGCTCGGCAACAAGGGAAAGAGATCCATGGTAGTATTTTCAAGACGGGTGATAAA
TCATTTGAACAGTATGGCAAGGTCAAAGATTTACTCGAATACCTTGTGGGGTTAATGGCCAAGGAAGGTTATGTTGCAGACCTGCAATTTGCTTTGCATGACGTCGAGGA
AGACGATAAGAGAGACATGCTATGTGGGCACAGTGAAAGACTCGCAATAGCCTTTGGACTGTTAAATACCAAACCAGGGAGCCCTTTGCTGGTAATGAAAAACCTTCGAG
TATGTGGAGATTGTCATACTGTAACTAAGTACATAACTAAGATAACGCAAAGAGAAATACTAGTGAGAGATGCCAATCGCTTTCACCTTTTCAAGAATGGAACCTGTAGT
TGTGGAGATCATTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAACCGCCAAAATCCTGTTCCAATTTCAACAATACACCGGAAACTTCACGAGAACTTCTCCGTTCTTCACTACTGAAAGCTCTCTCTTCAGCCAAAAACACCACACA
GCTACGTACTGTTCATTCCTTGATCATCACTTCAGGTTTGGCCCTTTCCGTCATCTTTTCCGGCAAACTCATAAGCAAATACGCCCAGCTTAAAGACCCAAAGTCGTCTG
TTTCAGTTTTTCGCACTGTTTCTCCAACTCACAATGTCTATCAATGGAATTCAATTATCCGTGCTCTCACTCACAATGGTCTCTTCACACAAGCACTTGGATATTACACT
GAGATGCGTGAAAAAAAGCTCCAACCCGATGCTTATACCTTTCCTTCTGTTATCAATTCATGTGCCCGTATTTTGGACTTGAAATTGGGTCGCATTGTTCATGAACATGT
TATTGAAATGGGGTTTGAATCGGATTTATATATTGGCAACGCATTGATCGATATGTATTCTAGATTTGTGGATCTTGATAATGCGCGTTATGTGTTTGAGGAAATGTCTG
ACCGAGACAATGTATCGTGGAATAGTTTAATTTCTGGGTATTGTTCTAATGGATTTTTGGAGGAGGCTTTGGACATGTATCACAAATCCAGAATGACTGGGATGTTACCT
GATTGTTTCACTATGTCCAGTGTTTTACTCGCTTGTGGAAGCTTAATGGCTGTTAAAGAAGGTGTGACTGTTCATGGGGCGATTGAGAAGATTGGAATTGCTGGGGATGT
TTTTATAGGCAATGGACTTCTTTCCATGTACTTCAAGTTTGAGAGACCAAGAGAAGCAGGTTGGGTTTTTTCCGAGATGGCTGTGAAGGACTCAGTTACCTGGAATACCA
TGATTTGTGGGTACTCCCAACTGGGGCGGCACGAAGAATCTGTCAAGTTATTTATGGAGATGATAGATGAATTCACTCCAGATGTGTTGACAATTACATCGACCAGTCGC
GCTTGTGGACACTTGGGAGATCTGCAGGTTGGAAAGTTTGTTCATAAGTACTTAATTGGGAGTGGGTATGAATGTGATACTGTGGCATGTAATATCCTTATAGATATGTA
TGCTAAATGCGGGGATCTTTTGGCTGCACAGGAAGTCTTTGGCACTACAAAATGCAAGGATTCTGTGACATGGAACTCCCTAATTAACGGCTACACTCAAAGTGGCTTTT
ATAAAGAGGGGATGATGAAAATGGAAAGCAGACCAGATCCTGTCACTTTTGTTCTGCTCCTATCTACATTTTCTCAGTTAGCTGATATAAATCTGGGGAGAGGAATCCAT
AGTGATGTGATAAAGTTTGGATTTGAAGCTGAACTTATAATTGGCAATTCTCTTCTGGATATGTATGCTAAATGTGGTGCAATGGATGACTTATTGAAGGTGTTTTCTTA
CATGAGAGTTCATGACATTATATCATGGAATACCCTTATTGCTTCAAGTGTTCATTTTGATGATTGCACTGTAGGATTTCGGGCGATTAATGAAATGAGGACTGAAGGGT
TGATGCCAGATGAGGCCACGGTACTAGGTATTTTGCCCATCTGTTCTCTGCTTGCAGCTCGGCAACAAGGGAAAGAGATCCATGGTAGTATTTTCAAGACGGGTGATAAA
TCATTTGAACAGTATGGCAAGGTCAAAGATTTACTCGAATACCTTGTGGGGTTAATGGCCAAGGAAGGTTATGTTGCAGACCTGCAATTTGCTTTGCATGACGTCGAGGA
AGACGATAAGAGAGACATGCTATGTGGGCACAGTGAAAGACTCGCAATAGCCTTTGGACTGTTAAATACCAAACCAGGGAGCCCTTTGCTGGTAATGAAAAACCTTCGAG
TATGTGGAGATTGTCATACTGTAACTAAGTACATAACTAAGATAACGCAAAGAGAAATACTAGTGAGAGATGCCAATCGCTTTCACCTTTTCAAGAATGGAACCTGTAGT
TGTGGAGATCATTGGTGA
Protein sequenceShow/hide protein sequence
MKPPKSCSNFNNTPETSRELLRSSLLKALSSAKNTTQLRTVHSLIITSGLALSVIFSGKLISKYAQLKDPKSSVSVFRTVSPTHNVYQWNSIIRALTHNGLFTQALGYYT
EMREKKLQPDAYTFPSVINSCARILDLKLGRIVHEHVIEMGFESDLYIGNALIDMYSRFVDLDNARYVFEEMSDRDNVSWNSLISGYCSNGFLEEALDMYHKSRMTGMLP
DCFTMSSVLLACGSLMAVKEGVTVHGAIEKIGIAGDVFIGNGLLSMYFKFERPREAGWVFSEMAVKDSVTWNTMICGYSQLGRHEESVKLFMEMIDEFTPDVLTITSTSR
ACGHLGDLQVGKFVHKYLIGSGYECDTVACNILIDMYAKCGDLLAAQEVFGTTKCKDSVTWNSLINGYTQSGFYKEGMMKMESRPDPVTFVLLLSTFSQLADINLGRGIH
SDVIKFGFEAELIIGNSLLDMYAKCGAMDDLLKVFSYMRVHDIISWNTLIASSVHFDDCTVGFRAINEMRTEGLMPDEATVLGILPICSLLAARQQGKEIHGSIFKTGDK
SFEQYGKVKDLLEYLVGLMAKEGYVADLQFALHDVEEDDKRDMLCGHSERLAIAFGLLNTKPGSPLLVMKNLRVCGDCHTVTKYITKITQREILVRDANRFHLFKNGTCS
CGDHW