; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012709 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012709
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr01:23546602..23549001
RNA-Seq ExpressionHG10012709
SyntenyHG10012709
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049341.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0077.72Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLD+ARK FELFHE+GEAN TLFMYNSLIRGYS AGLCDEAIS YVQMIE GFMPDNFTFPFV
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC
        LSACAKTAAFVEG+QLHGALMKIGLERDM                                           T+ SREAVALFFQMIEAGVRPNSVTMVC
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        VISACAKLKDLELAKR+ AYIEESEMELNTHMVNALVD++MKCGETGAAKRLYD CVDKNLVLCNTIMSN+ARHGMP+EVLAVLVDM +LDLRPDRVSLL
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
         AISACGQMGDYLLGK CHNYSLRNGYE WDNICNAMIDMYMKCGK EMAYRVFD   NKTIVSWNSLLVGYIRNKDLES RK FNEMPEKDIVSWNTM+
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
        NALVQESM DEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKW Y+YI KN +  DMLLET LVDMFARCGDP SAMEVFNNMDRKDVSAWTAA
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
        IGAMAV+GNG RAIELYNEMLRQGVKPDQVVFVNILTACSHGG VEQG+HIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDII+SMPMKPNGIIWG
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG

Query:  SLLAACRTHKDIDMATFAAERLG-----------------------------------------------------------------------------
        SLLAACRTHK+IDMATFAAERL                                                                              
Subjt:  SLLAACRTHKDIDMATFAAERLG-----------------------------------------------------------------------------

Query:  ----DAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
            D GYVPDVTNVLLDVNEQEK+YLLNRHSEKLAMAYGLIST+K+VPIRV+KNLRMCSDCH+FAKYISKVY REI VRDNNRFHFFRQGSCSCGDYW
Subjt:  ----DAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

KAG7028890.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0074.97Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        MDEL QLHCYA KQGLIRKQSTVTKLISTCVEMGT ESLDYARK FELF E+ EAN T+F+YNSLIRGYSA+GLCDEA+S Y+QMIE GF+PDNFTFPF+
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC
        LSACAKTAAF  GVQLHGALMKIGLE +M                                           TDSS EAVALFFQMIEAGVRPNSVTMVC
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        VISACAKLKDLELA +I AYI+ESE+ELNTHMVNALVD+YMKCGE GAA+ LY+ CVDKNLVLCNTIMSN ARHGMP EVLAVLVDMF++DLRPDRVSLL
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
        SAISACGQ+GDYLLG+CCHN++LRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFD  SNKTIVSWNSLLV Y+RN+DLEST+KIFNEMPEKDIVSWNTMV
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
        +ALVQESM +EAIELFREMQ K+I+ADRVTMVEVASACGYLGALELAKW YAYI KNN+ CDMLLETALVDMFARCGD RSAM+VF+NMDRKDVSAWTAA
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
        IGAMAVDGNGERAIELY+EMLRQGVKPDQVVFVNILTACSHGG VEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSM MKPNGIIWG
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG

Query:  SLLAACRTHKDIDMATFAAE--------------------------------------------------------------------------------
        SLLAACRTHK++++ATFAAE                                                                                
Subjt:  SLLAACRTHKDIDMATFAAE--------------------------------------------------------------------------------

Query:  -RLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
         RLGDAGYVPD+TNVLLDVNEQEKQYLLN+HSEKLAMAYGLIST+K++PIRVIKNLR CSDCHAFAKY+SKVYDREIT+RDNNRFHFFR GSCSCGDYW
Subjt:  -RLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

XP_008438644.1 PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At3g22690 [Cucumis melo]0.0e+0077.6Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLD+ARK FELFHE+GEAN TLFMYNSLIRGYS AGLCDEAIS YVQMIE GFMPDNFTFPFV
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC
        LSACAKTAAFVEG+QLHGALMKIGLERDM                                           T+ SREAVALFFQMIEAGVRPNSVTMVC
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        VISACAKLKDLELAKR+ AYIEESEMELNTHMVNALVD++MKCGETGAAKRLYD CVDKNLVLCNTIMSN+ARHGMP+EVLAVLVDM +LDLRPDRVSLL
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
         AISACGQMGDYLLGK CHNYSLRNGYE WDNICNAMIDMYMKCGK EMAYRVFD   NKTIVSWNSLLVGYIRNKDLES RK FNEMPEKDIVSWNTM+
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
        NALVQESM DEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKW Y+YI KN +  DMLLET LVDMFARCGDP SAMEVFNNMDRKDVSAWTAA
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
        IGAMAV+GNG RAIELYNEMLRQGVKPDQVVFVNILTACSHGG VEQG+HIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDII+SMPMKPNGIIWG
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG

Query:  SLLAACRTHKDIDMATFAAERLG-----------------------------------------------------------------------------
        SLLAACRTHK+IDMATFAAERL                                                                              
Subjt:  SLLAACRTHKDIDMATFAAERLG-----------------------------------------------------------------------------

Query:  ----DAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
            D GYVP+VTNVLLDVNEQEK YLLNRHSEKLAMAYGLIST+K+VPIRV+KNLRMCSDCH+FAKYISKVY REI VRDNNRFHFFRQGSCSCGDYW
Subjt:  ----DAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

XP_011650966.1 pentatricopeptide repeat-containing protein At3g22690 [Cucumis sativus]0.0e+0077.1Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        +DELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLD+ARKAFELFHE+GEAN TLFMYN LIRGYSAAGL DEAIS YVQMIE GFMPDNFTFPFV
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC
        LSACAKTAAFVEG+QLHGAL+KIGLE DM                                           TD   EAVALFFQMIEAGV+PNSVTMVC
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        VISACAKLKDLELAKR+ AYIEESEMELNTHMVNAL D++MKCGETGAAKRLYD CVDKNLVLCNTIMSN+ARHGMP+EVLAVLVDM +LDLRPDRVSLL
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
         AISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFD   NKTIVSWNSLLVGYIRNKDLES RKIFNEMPEKDIVSWNTM+
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
        NALVQESM DEAIELFREMQLKEIK DRVTMVEVASACG LGALELAKW Y++I KN +YCDMLLETALVDMFARCGDP SAM VFNNM RKDVSAWTAA
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
        IGAMAV+GNG+RAIELYNEMLRQGVKPDQVVFVNILTACSHGG VEQG+HIFESMKQHG+SPQIVHYGCMVDLLGRAGKLEEALDII+SMPM+PNGIIWG
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG

Query:  SLLAACRTHKDIDMATFAAE--------------------------------------------------------------------------------
        SLLAACRTHK+IDMATFAAE                                                                                
Subjt:  SLLAACRTHKDIDMATFAAE--------------------------------------------------------------------------------

Query:  -RLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
         RLGD GYVPDVTNVLLDVNEQEKQYLLNRHSEKLA+AYGLIST+K+VPIRV+KNLRMCSDCH+FAKYISKVY REITVRDNNRFH FRQGSCSCGDYW
Subjt:  -RLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

XP_038877935.1 pentatricopeptide repeat-containing protein At3g22690 [Benincasa hispida]0.0e+0077.72Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        MDEL QLHCYALK GLIRKQS V+KLISTCVEMGTSESLDYARKAFELF E+GEANAT+FMYNSLIRGYSAAGLCDEAIS YVQMIEVGFMPDNFTFPFV
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC
        LS CAKT AFVEGVQLHGALMKIGLERDM                                           TDSSREAVALFFQMIEAGVRPNSVTMVC
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        VISACAKLKDLELAKRI AYIEES MELNTHMVNALVD+YMKCGETG AKRLYDGCVDKNLVLCNTIMSN++ HGMP+EVL VLVDMFR+DL+PDR+SLL
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
        SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNI NA+IDMYMKCGKQE+AYRVFD+ SNK+IVSWNSLLVGYIRNKDLES RKIFNEMPEKDIVSWNTM+
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
        NALVQESM DEAIELFREMQLKE+KADRVTMVEVASACGYLG LELAKW YAYI KN++YCDMLLETA+VDMFARCGD  SAM+VFNNMD+KD+SAWT A
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
        IGAMAVDGNG+RAIELY+EML+QGVKPDQVVFVNILTACSHGG VEQGQ IFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG

Query:  SLLAACRTHKDIDMATFAAER-------------------------------------------------------------------------------
        SLLAACRTHK+IDMATFAAER                                                                               
Subjt:  SLLAACRTHKDIDMATFAAER-------------------------------------------------------------------------------

Query:  --LGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
          LG AGYVPD TNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYIS+VY+REITVRDNNRFHFFRQGSCSCGDYW
Subjt:  --LGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

TrEMBL top hitse value%identityAlignment
A0A0A0LA65 DYW_deaminase domain-containing protein0.0e+0077.1Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        +DELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLD+ARKAFELFHE+GEAN TLFMYN LIRGYSAAGL DEAIS YVQMIE GFMPDNFTFPFV
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC
        LSACAKTAAFVEG+QLHGAL+KIGLE DM                                           TD   EAVALFFQMIEAGV+PNSVTMVC
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        VISACAKLKDLELAKR+ AYIEESEMELNTHMVNAL D++MKCGETGAAKRLYD CVDKNLVLCNTIMSN+ARHGMP+EVLAVLVDM +LDLRPDRVSLL
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
         AISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFD   NKTIVSWNSLLVGYIRNKDLES RKIFNEMPEKDIVSWNTM+
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
        NALVQESM DEAIELFREMQLKEIK DRVTMVEVASACG LGALELAKW Y++I KN +YCDMLLETALVDMFARCGDP SAM VFNNM RKDVSAWTAA
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
        IGAMAV+GNG+RAIELYNEMLRQGVKPDQVVFVNILTACSHGG VEQG+HIFESMKQHG+SPQIVHYGCMVDLLGRAGKLEEALDII+SMPM+PNGIIWG
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG

Query:  SLLAACRTHKDIDMATFAAE--------------------------------------------------------------------------------
        SLLAACRTHK+IDMATFAAE                                                                                
Subjt:  SLLAACRTHKDIDMATFAAE--------------------------------------------------------------------------------

Query:  -RLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
         RLGD GYVPDVTNVLLDVNEQEKQYLLNRHSEKLA+AYGLIST+K+VPIRV+KNLRMCSDCH+FAKYISKVY REITVRDNNRFH FRQGSCSCGDYW
Subjt:  -RLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

A0A1S3AXK0 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At3g226900.0e+0077.6Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLD+ARK FELFHE+GEAN TLFMYNSLIRGYS AGLCDEAIS YVQMIE GFMPDNFTFPFV
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC
        LSACAKTAAFVEG+QLHGALMKIGLERDM                                           T+ SREAVALFFQMIEAGVRPNSVTMVC
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        VISACAKLKDLELAKR+ AYIEESEMELNTHMVNALVD++MKCGETGAAKRLYD CVDKNLVLCNTIMSN+ARHGMP+EVLAVLVDM +LDLRPDRVSLL
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
         AISACGQMGDYLLGK CHNYSLRNGYE WDNICNAMIDMYMKCGK EMAYRVFD   NKTIVSWNSLLVGYIRNKDLES RK FNEMPEKDIVSWNTM+
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
        NALVQESM DEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKW Y+YI KN +  DMLLET LVDMFARCGDP SAMEVFNNMDRKDVSAWTAA
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
        IGAMAV+GNG RAIELYNEMLRQGVKPDQVVFVNILTACSHGG VEQG+HIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDII+SMPMKPNGIIWG
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG

Query:  SLLAACRTHKDIDMATFAAERLG-----------------------------------------------------------------------------
        SLLAACRTHK+IDMATFAAERL                                                                              
Subjt:  SLLAACRTHKDIDMATFAAERLG-----------------------------------------------------------------------------

Query:  ----DAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
            D GYVP+VTNVLLDVNEQEK YLLNRHSEKLAMAYGLIST+K+VPIRV+KNLRMCSDCH+FAKYISKVY REI VRDNNRFHFFRQGSCSCGDYW
Subjt:  ----DAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

A0A5D3D302 Pentatricopeptide repeat-containing protein0.0e+0077.72Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLD+ARK FELFHE+GEAN TLFMYNSLIRGYS AGLCDEAIS YVQMIE GFMPDNFTFPFV
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC
        LSACAKTAAFVEG+QLHGALMKIGLERDM                                           T+ SREAVALFFQMIEAGVRPNSVTMVC
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        VISACAKLKDLELAKR+ AYIEESEMELNTHMVNALVD++MKCGETGAAKRLYD CVDKNLVLCNTIMSN+ARHGMP+EVLAVLVDM +LDLRPDRVSLL
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
         AISACGQMGDYLLGK CHNYSLRNGYE WDNICNAMIDMYMKCGK EMAYRVFD   NKTIVSWNSLLVGYIRNKDLES RK FNEMPEKDIVSWNTM+
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
        NALVQESM DEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKW Y+YI KN +  DMLLET LVDMFARCGDP SAMEVFNNMDRKDVSAWTAA
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
        IGAMAV+GNG RAIELYNEMLRQGVKPDQVVFVNILTACSHGG VEQG+HIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDII+SMPMKPNGIIWG
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG

Query:  SLLAACRTHKDIDMATFAAERLG-----------------------------------------------------------------------------
        SLLAACRTHK+IDMATFAAERL                                                                              
Subjt:  SLLAACRTHKDIDMATFAAERLG-----------------------------------------------------------------------------

Query:  ----DAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
            D GYVPDVTNVLLDVNEQEK+YLLNRHSEKLAMAYGLIST+K+VPIRV+KNLRMCSDCH+FAKYISKVY REI VRDNNRFHFFRQGSCSCGDYW
Subjt:  ----DAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

A0A6J1GGL0 pentatricopeptide repeat-containing protein At3g226900.0e+0074.34Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        MDEL QLHCYA KQGLIRKQSTVTKLISTCVEMGT ESLDYARK FELF E+ EAN T+F+YNSLIRGYSA+GLCDEA+S Y+QMIE GF+PDNFTFPF+
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC
        LSACAKTAAF  GVQLHGALMKIGLE +M                                           TDSS EAVALFFQMIEAGVRPNSVTMVC
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        VISACAKLKDLELA +I AYI+ESE+ELNTHMVNALVD+YMKCGE GAA+ LY+ CVDKNLVLCNTIMSN ARHGMP EVLAVLVDMF++DL+PDRVSLL
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
        SAISACGQ+GDYLLG+CCHN++LRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFD  SNKTIVSWNSLLV Y+RN+DLEST+KIFNEMPEKDIVSWNTMV
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
        ++LVQESM +EAIELFREMQ K+I+ADRVTMVEVASACGYLGALELAKW YAYI KNN+ CDMLLETALVDMFARCGD  SAM+VF+NMDRKDVSAWTAA
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
        IGAMAVDGNGERAIELY+EMLRQGVKPDQVVFVNILTACSHGG VEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAG LEEAL+IIKSM MKPNGIIWG
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG

Query:  SLLAACRTHKDIDMATFAAE--------------------------------------------------------------------------------
        SLLAACRTHK++++ATFAAE                                                                                
Subjt:  SLLAACRTHKDIDMATFAAE--------------------------------------------------------------------------------

Query:  -RLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
         RLGDAGYVPD+TNVLLDVNEQEKQYLLN+HSEKLAMAYGLIST+K++PIRVIKNLR CSDCHAFAKY+SKVYDREIT+RDNNRFHFFR GSCSCGDYW
Subjt:  -RLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

A0A6J1IBE6 pentatricopeptide repeat-containing protein At3g22690-like0.0e+0074.22Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        MDEL QLHCYA KQGLIRKQSTVTKLISTCVEMGT ESLDYARK FELF E+ EAN T+F+YNSLIRGYSA+GLCDEA+S Y+QMIE GF+PDNFTFPF+
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC
        LSACAKTAAF  GVQLHGAL KIGLE +M                                           TDSS EAVALFFQMIEAGVRPNSVTMVC
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        VISAC+KLKDLELA +I AYIEESE+ELNTHMVNALVD+YMKCGET AA+ LY+ CVDKNLVLCNTIMSNFAR GMP EVL+V+VDMF++DL+PDRVSLL
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
        SAISACGQMGDYLLG+CCHN++LRNGYEGWDNICNAMIDMYMKCGKQEMAYRVF   SNKTIVSWNSLLV Y+RN+DLEST+KIFNEM EKDIVSWNTMV
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
        +ALVQESM +EAIELFREMQ KEI+ADRVTMVEVASACGYLGALELAKW YAYI KNN+ CDMLLETALVDMFARCGD RSAM+VF+NMDRKDVSAWTAA
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG
        IGAMAVDGNGERAIELY+EMLRQGVKPDQVVFVNILTACSHGG VEQGQ IFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSM MKPNGIIWG
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWG

Query:  SLLAACRTHKDIDMATFAAE--------------------------------------------------------------------------------
        SLLAACRTHK++++ATFAAE                                                                                
Subjt:  SLLAACRTHKDIDMATFAAE--------------------------------------------------------------------------------

Query:  -RLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
         RLGDAGYVPD+TNVLLDVNEQEK YLLN+HSEKLAMAYGLIST+K++PIRVIKNLR CSDCHAFAKY+SKVYDREIT+RDNNRFHFFRQG CSCGDYW
Subjt:  -RLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148206.9e-10630.14Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        ++ + QLH + L+  +  K ++    +S      +S +L YA   F       E+     ++N  +R  S +      I  Y ++  VG   D F+F  +
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKC
        L A +K +A  EG++LHG   KI               L    +E G                                              +D+Y  C
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKC

Query:  GETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMK
        G    A+ ++D    +++V  NT++  + R G+  E   +  +M   ++ PD + L + +SACG+ G+    +  + + + N      ++  A++ MY  
Subjt:  GETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMK

Query:  CGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGA
         G  +MA   F   S + +    +++ GY +   L+  + IF++  +KD+V W TM++A V+     EA+ +F EM    IK D V+M  V SAC  LG 
Subjt:  CGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGA

Query:  LELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGG
        L+ AKW ++ I  N +  ++ +  AL++M+A+CG   +  +VF  M R++V +W++ I A+++ G    A+ L+  M ++ V+P++V FV +L  CSH G
Subjt:  LELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGG

Query:  LVEQGQHIFESM-KQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAAER---------------------
        LVE+G+ IF SM  ++ I+P++ HYGCMVDL GRA  L EAL++I+SMP+  N +IWGSL++ACR H ++++  FAA+R                     
Subjt:  LVEQGQHIFESM-KQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAAER---------------------

Query:  --------------------------------------LGD----------------------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLI
                                              +GD                      AGYVPD  +VL+DV E+EK+ L+  HSEKLA+ +GL+
Subjt:  --------------------------------------LGD----------------------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLI

Query:  STEKYVP------IRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        + EK         IR++KNLR+C DCH F K +SKVY+REI VRD  RFH ++ G CSC DYW
Subjt:  STEKYVP------IRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic1.1e-11631.53Show/hide
Query:  ELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMI-EVGFMPDNFTFPFVL
        +L Q H + ++ G      + +KL +    + +  SL+YARK F+   E  + N+  F +N+LIR Y++      +I  ++ M+ E    P+ +TFPF++
Subjt:  ELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMI-EVGFMPDNFTFPFVL

Query:  SACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCG
         A A+ ++   G  LHG  +K  +  D+                                                           + N+L+  Y  CG
Subjt:  SACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCG

Query:  ETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKC
        +  +A +++    +K++V  N++++ F + G P + L +   M   D++   V+++  +SAC ++ +   G+   +Y   N       + NAM+DMY KC
Subjt:  ETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKC

Query:  GKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQL-KEIKADRVTMVEVASACGYLGA
        G  E A R+FD    K  V+W ++L GY  ++D E+ R++ N MP+KDIV+WN +++A  Q    +EA+ +F E+QL K +K +++T+V   SAC  +GA
Subjt:  GKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQL-KEIKADRVTMVEVASACGYLGA

Query:  LELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGG
        LEL +W ++YI K+ +  +  + +AL+ M+++CGD   + EVFN+++++DV  W+A IG +A+ G G  A++++ +M    VKP+ V F N+  ACSH G
Subjt:  LELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGG

Query:  LVEQGQHIFESMK-QHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAA-----------------------
        LV++ + +F  M+  +GI P+  HY C+VD+LGR+G LE+A+  I++MP+ P+  +WG+LL AC+ H ++++A  A                        
Subjt:  LVEQGQHIFESMK-QHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAA-----------------------

Query:  ----------------------------------------------------------ERLGDAGYVPDVTNVLLDVNEQE-KQYLLNRHSEKLAMAYGL
                                                                  E+L   GY P+++ VL  + E+E K+  LN HSEKLA+ YGL
Subjt:  ----------------------------------------------------------ERLGDAGYVPDVTNVLLDVNEQE-KQYLLNRHSEKLAMAYGL

Query:  ISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        ISTE    IRVIKNLR+C DCH+ AK IS++YDREI VRD  RFH FR G CSC D+W
Subjt:  ISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic3.4e-10529.88Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        + EL Q+     K GL ++    TKL+S     G   S+D A + FE    + + N    +Y+++++G++     D+A+  +V+M      P  + F ++
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGL-------------------------------ERDMTD------------SSREAVALFFQMIEAGVRPNSVTMVC
        L  C   A    G ++HG L+K G                                ERD+               +R A+ +   M E  ++P+ +T+V 
Subjt:  LSACAKTAAFVEGVQLHGALMKIGL-------------------------------ERDMTD------------SSREAVALFFQMIEAGVRPNSVTMVC

Query:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL
        V+ A + L+ + + K I  Y   S  +   ++  ALVD+Y KCG    A++L+DG +++N+V  N+++  + ++  P E + +   M    ++P  VS++
Subjt:  VISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLL

Query:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV
         A+ AC  +GD   G+  H  S+  G +   ++ N++I MY KC + + A  +F    ++T+VSWN++++G+ +N      R I                
Subjt:  SAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMV

Query:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA
                  +A+  F +M+ + +K D  T V V +A   L     AKW +  + ++ +  ++ + TALVDM+A+CG    A  +F+ M  + V+ W A 
Subjt:  NALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAA

Query:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQ-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIW
        I      G G+ A+EL+ EM +  +KP+ V F+++++ACSH GLVE G   F  MK+ + I   + HYG MVDLLGRAG+L EA D I  MP+KP   ++
Subjt:  IGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQ-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIW

Query:  GSLLAACRTHKDIDMATFAAERL-----------------------------------------------------------------------------
        G++L AC+ HK+++ A  AAERL                                                                             
Subjt:  GSLLAACRTHKDIDMATFAAERL-----------------------------------------------------------------------------

Query:  ----GDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
             +AGYVPD TN++L V    K+ LL+ HSEKLA+++GL++T     I V KNLR+C+DCH   KYIS V  REI VRD  RFH F+ G+CSCGDYW
Subjt:  ----GDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic7.6e-11331.36Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        +  L  +H   +K GL      ++KLI  C+     E L YA   F+   E       L ++N++ RG++ +     A+  YV MI +G +P+++TFPFV
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKC
        L +CAK+ AF EG Q+HG ++K+G + D+                            +IS   +   LE A +    + +     +     AL+  Y   
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKC

Query:  GETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMK
        G    A++L+D    K++V  N ++S +A  G   E L +  DM + ++RPD  ++++ +SAC Q G   LG+  H +   +G+     I NA+ID+Y K
Subjt:  GETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMK

Query:  CGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGA
        CG                               +LE+   +F  +P KD++SWNT++      ++  EA+ LF+EM       + VTM+ +  AC +LGA
Subjt:  CGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGA

Query:  LELAKWTYAYIAK--NNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSH
        +++ +W + YI K    V     L T+L+DM+A+CGD  +A +VFN++  K +S+W A I   A+ G  + + +L++ M + G++PD + FV +L+ACSH
Subjt:  LELAKWTYAYIAK--NNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSH

Query:  GGLVEQGQHIFESMKQ-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAAERL------------------
         G+++ G+HIF +M Q + ++P++ HYGCM+DLLG +G  +EA ++I  M M+P+G+IW SLL AC+ H ++++    AE L                  
Subjt:  GGLVEQGQHIFESMKQ-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAAERL------------------

Query:  -----------------------------------------GD----------------------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYG
                                                 GD                      AG+VPD + VL ++ E+ K+  L  HSEKLA+A+G
Subjt:  -----------------------------------------GD----------------------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYG

Query:  LISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        LIST+    + ++KNLR+C +CH   K ISK+Y REI  RD  RFH FR G CSC DYW
Subjt:  LISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226902.0e-22249.25Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        +DEL   H    KQGL    ST+TKL++   E+GT ESL +A++ F    E  E+  T FMYNSLIRGY+++GLC+EAI  +++M+  G  PD +TFPF 
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMI-EAGVRPNSVTMV
        LSACAK+ A   G+Q+HG ++K+G  +D+                                            D +++AV LFF+M+ +  V PNSVTMV
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMI-EAGVRPNSVTMV

Query:  CVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSL
        CVISACAKL+DLE  +++ A+I  S +E+N  MV+ALVD+YMKC     AKRL+D     NL LCN + SN+ R G+  E L V   M    +RPDR+S+
Subjt:  CVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSL

Query:  LSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTM
        LSAIS+C Q+ + L GK CH Y LRNG+E WDNICNA+IDMYMKC +Q+ A+R+FD  SNKT+V+WNS++ GY+ N ++++  + F  MPEK+IVSWNT+
Subjt:  LSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTM

Query:  VNALVQESMCDEAIELFREMQLKE-IKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWT
        ++ LVQ S+ +EAIE+F  MQ +E + AD VTM+ +ASACG+LGAL+LAKW Y YI KN +  D+ L T LVDMF+RCGDP SAM +FN++  +DVSAWT
Subjt:  VNALVQESMCDEAIELFREMQLKE-IKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWT

Query:  AAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESM-KQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGI
        AAIGAMA+ GN ERAIEL+++M+ QG+KPD V FV  LTACSHGGLV+QG+ IF SM K HG+SP+ VHYGCMVDLLGRAG LEEA+ +I+ MPM+PN +
Subjt:  AAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESM-KQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGI

Query:  IWGSLLAACRTHKDIDMATFAAERL-----------------------------------------------------------GD--------------
        IW SLLAACR   +++MA +AAE++                                                           GD              
Subjt:  IWGSLLAACRTHKDIDMATFAAERL-----------------------------------------------------------GD--------------

Query:  --------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGD
                 G+VPD++NVL+DV+E+EK ++L+RHSEKLAMAYGLIS+ K   IR++KNLR+CSDCH+FAK+ SKVY+REI +RDNNRFH+ RQG CSCGD
Subjt:  --------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGD

Query:  YW
        +W
Subjt:  YW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.4e-11431.36Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        +  L  +H   +K GL      ++KLI  C+     E L YA   F+   E       L ++N++ RG++ +     A+  YV MI +G +P+++TFPFV
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKC
        L +CAK+ AF EG Q+HG ++K+G + D+                            +IS   +   LE A +    + +     +     AL+  Y   
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKC

Query:  GETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMK
        G    A++L+D    K++V  N ++S +A  G   E L +  DM + ++RPD  ++++ +SAC Q G   LG+  H +   +G+     I NA+ID+Y K
Subjt:  GETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMK

Query:  CGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGA
        CG                               +LE+   +F  +P KD++SWNT++      ++  EA+ LF+EM       + VTM+ +  AC +LGA
Subjt:  CGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGA

Query:  LELAKWTYAYIAK--NNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSH
        +++ +W + YI K    V     L T+L+DM+A+CGD  +A +VFN++  K +S+W A I   A+ G  + + +L++ M + G++PD + FV +L+ACSH
Subjt:  LELAKWTYAYIAK--NNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSH

Query:  GGLVEQGQHIFESMKQ-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAAERL------------------
         G+++ G+HIF +M Q + ++P++ HYGCM+DLLG +G  +EA ++I  M M+P+G+IW SLL AC+ H ++++    AE L                  
Subjt:  GGLVEQGQHIFESMKQ-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAAERL------------------

Query:  -----------------------------------------GD----------------------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYG
                                                 GD                      AG+VPD + VL ++ E+ K+  L  HSEKLA+A+G
Subjt:  -----------------------------------------GD----------------------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYG

Query:  LISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        LIST+    + ++KNLR+C +CH   K ISK+Y REI  RD  RFH FR G CSC DYW
Subjt:  LISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.1e-11831.53Show/hide
Query:  ELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMI-EVGFMPDNFTFPFVL
        +L Q H + ++ G      + +KL +    + +  SL+YARK F+   E  + N+  F +N+LIR Y++      +I  ++ M+ E    P+ +TFPF++
Subjt:  ELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMI-EVGFMPDNFTFPFVL

Query:  SACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCG
         A A+ ++   G  LHG  +K  +  D+                                                           + N+L+  Y  CG
Subjt:  SACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCG

Query:  ETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKC
        +  +A +++    +K++V  N++++ F + G P + L +   M   D++   V+++  +SAC ++ +   G+   +Y   N       + NAM+DMY KC
Subjt:  ETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKC

Query:  GKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQL-KEIKADRVTMVEVASACGYLGA
        G  E A R+FD    K  V+W ++L GY  ++D E+ R++ N MP+KDIV+WN +++A  Q    +EA+ +F E+QL K +K +++T+V   SAC  +GA
Subjt:  GKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQL-KEIKADRVTMVEVASACGYLGA

Query:  LELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGG
        LEL +W ++YI K+ +  +  + +AL+ M+++CGD   + EVFN+++++DV  W+A IG +A+ G G  A++++ +M    VKP+ V F N+  ACSH G
Subjt:  LELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGG

Query:  LVEQGQHIFESMK-QHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAA-----------------------
        LV++ + +F  M+  +GI P+  HY C+VD+LGR+G LE+A+  I++MP+ P+  +WG+LL AC+ H ++++A  A                        
Subjt:  LVEQGQHIFESMK-QHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAA-----------------------

Query:  ----------------------------------------------------------ERLGDAGYVPDVTNVLLDVNEQE-KQYLLNRHSEKLAMAYGL
                                                                  E+L   GY P+++ VL  + E+E K+  LN HSEKLA+ YGL
Subjt:  ----------------------------------------------------------ERLGDAGYVPDVTNVLLDVNEQE-KQYLLNRHSEKLAMAYGL

Query:  ISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        ISTE    IRVIKNLR+C DCH+ AK IS++YDREI VRD  RFH FR G CSC D+W
Subjt:  ISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)2.7e-22249.19Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        +DEL   H    KQGL    ST+TKL++   E+GT ESL +A++ F    E  E+  T FMYNSLIRGY+++GLC+EAI  +++M+  G  PD +TFPF 
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMI-EAGVRPNSVTMV
        LSACAK+ A   G+Q+HG ++K+G  +D+                                            D +++AV LFF+M+ +  V PNSVTMV
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMI-EAGVRPNSVTMV

Query:  CVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSL
        CVISACAKL+DLE  +++ A+I  S +E+N  MV+ALVD+YMKC     AKRL+D     NL LCN + SN+ R G+  E L V   M    +RPDR+S+
Subjt:  CVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSL

Query:  LSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTM
        LSAIS+C Q+ + L GK CH Y LRNG+E WDNICNA+IDMYMKC +Q+ A+R+FD  SNKT+V+WNS++ GY+ N ++++  + F  MPEK+IVSWNT+
Subjt:  LSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTM

Query:  VNALVQESMCDEAIELFREMQLKE-IKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWT
        ++ LVQ S+ +EAIE+F  MQ +E + AD VTM+ +ASACG+LGAL+LAKW Y YI KN +  D+ L T LVDMF+RCGDP SAM +FN++  +DVSAWT
Subjt:  VNALVQESMCDEAIELFREMQLKE-IKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWT

Query:  AAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESM-KQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGI
        AAIGAMA+ GN ERAIEL+++M+ QG+KPD V FV  LTACSHGGLV+QG+ IF SM K HG+SP+ VHYGCMVDLLGRAG LEEA+ +I+ MPM+PN +
Subjt:  AAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESM-KQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGI

Query:  IWGSLLAACRTHKDIDMATFAAERL-----------------------------------------------------------GD--------------
        IW SLLAACR   +++MA +AAE++                                                           GD              
Subjt:  IWGSLLAACRTHKDIDMATFAAERL-----------------------------------------------------------GD--------------

Query:  --------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGD
                 G+VPD++NVL+DV+E+EK ++L+RHSEKLAMAYGLIS+ K   IR++KNLR+CSDCH+FAK+ SKVY+REI +RDNNRFH+ RQG CSCGD
Subjt:  --------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGD

Query:  Y
        +
Subjt:  Y

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.4e-22349.25Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        +DEL   H    KQGL    ST+TKL++   E+GT ESL +A++ F    E  E+  T FMYNSLIRGY+++GLC+EAI  +++M+  G  PD +TFPF 
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMI-EAGVRPNSVTMV
        LSACAK+ A   G+Q+HG ++K+G  +D+                                            D +++AV LFF+M+ +  V PNSVTMV
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDM-------------------------------------------TDSSREAVALFFQMI-EAGVRPNSVTMV

Query:  CVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSL
        CVISACAKL+DLE  +++ A+I  S +E+N  MV+ALVD+YMKC     AKRL+D     NL LCN + SN+ R G+  E L V   M    +RPDR+S+
Subjt:  CVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSL

Query:  LSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTM
        LSAIS+C Q+ + L GK CH Y LRNG+E WDNICNA+IDMYMKC +Q+ A+R+FD  SNKT+V+WNS++ GY+ N ++++  + F  MPEK+IVSWNT+
Subjt:  LSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTM

Query:  VNALVQESMCDEAIELFREMQLKE-IKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWT
        ++ LVQ S+ +EAIE+F  MQ +E + AD VTM+ +ASACG+LGAL+LAKW Y YI KN +  D+ L T LVDMF+RCGDP SAM +FN++  +DVSAWT
Subjt:  VNALVQESMCDEAIELFREMQLKE-IKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWT

Query:  AAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESM-KQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGI
        AAIGAMA+ GN ERAIEL+++M+ QG+KPD V FV  LTACSHGGLV+QG+ IF SM K HG+SP+ VHYGCMVDLLGRAG LEEA+ +I+ MPM+PN +
Subjt:  AAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESM-KQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGI

Query:  IWGSLLAACRTHKDIDMATFAAERL-----------------------------------------------------------GD--------------
        IW SLLAACR   +++MA +AAE++                                                           GD              
Subjt:  IWGSLLAACRTHKDIDMATFAAERL-----------------------------------------------------------GD--------------

Query:  --------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGD
                 G+VPD++NVL+DV+E+EK ++L+RHSEKLAMAYGLIS+ K   IR++KNLR+CSDCH+FAK+ SKVY+REI +RDNNRFH+ RQG CSCGD
Subjt:  --------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGD

Query:  YW
        +W
Subjt:  YW

AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein4.9e-10730.14Show/hide
Query:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV
        ++ + QLH + L+  +  K ++    +S      +S +L YA   F       E+     ++N  +R  S +      I  Y ++  VG   D F+F  +
Subjt:  MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFV

Query:  LSACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKC
        L A +K +A  EG++LHG   KI               L    +E G                                              +D+Y  C
Subjt:  LSACAKTAAFVEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKC

Query:  GETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMK
        G    A+ ++D    +++V  NT++  + R G+  E   +  +M   ++ PD + L + +SACG+ G+    +  + + + N      ++  A++ MY  
Subjt:  GETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMK

Query:  CGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGA
         G  +MA   F   S + +    +++ GY +   L+  + IF++  +KD+V W TM++A V+     EA+ +F EM    IK D V+M  V SAC  LG 
Subjt:  CGKQEMAYRVFDNTSNKTIVSWNSLLVGYIRNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGA

Query:  LELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGG
        L+ AKW ++ I  N +  ++ +  AL++M+A+CG   +  +VF  M R++V +W++ I A+++ G    A+ L+  M ++ V+P++V FV +L  CSH G
Subjt:  LELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAMEVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGG

Query:  LVEQGQHIFESM-KQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAAER---------------------
        LVE+G+ IF SM  ++ I+P++ HYGCMVDL GRA  L EAL++I+SMP+  N +IWGSL++ACR H ++++  FAA+R                     
Subjt:  LVEQGQHIFESM-KQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKDIDMATFAAER---------------------

Query:  --------------------------------------LGD----------------------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLI
                                              +GD                      AGYVPD  +VL+DV E+EK+ L+  HSEKLA+ +GL+
Subjt:  --------------------------------------LGD----------------------AGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLI

Query:  STEKYVP------IRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW
        + EK         IR++KNLR+C DCH F K +SKVY+REI VRD  RFH ++ G CSC DYW
Subjt:  STEKYVP------IRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNRFHFFRQGSCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAACTTGGACAATTACACTGTTACGCGTTGAAGCAGGGTCTCATTCGCAAACAATCGACTGTAACTAAGCTTATTTCCACTTGCGTGGAAATGGGCACTTCAGA
AAGCTTGGATTATGCTCGAAAGGCGTTCGAGCTCTTCCATGAAGAAGGAGAAGCAAATGCCACACTTTTTATGTACAATTCGTTAATCAGAGGATACTCTGCTGCAGGGC
TTTGTGATGAAGCTATTTCGACGTATGTTCAGATGATCGAGGTTGGGTTTATGCCAGACAACTTCACGTTTCCATTTGTGTTAAGTGCGTGTGCTAAGACTGCTGCGTTT
GTAGAAGGTGTTCAGCTCCATGGAGCTCTTATGAAGATTGGTTTAGAGAGAGATATGACAGATTCTTCTAGAGAGGCTGTGGCTTTGTTTTTCCAAATGATCGAGGCAGG
TGTTAGACCCAATTCTGTCACAATGGTGTGTGTGATATCGGCTTGTGCCAAGTTGAAAGATCTTGAACTAGCCAAGAGAATTCGTGCTTACATTGAAGAGTCAGAAATGG
AGCTTAATACTCATATGGTGAATGCACTCGTGGATTTGTACATGAAATGTGGAGAAACTGGTGCTGCAAAGCGGCTATATGATGGATGTGTTGATAAGAATTTGGTTTTA
TGTAACACAATCATGTCAAATTTTGCACGCCATGGTATGCCAAGCGAGGTACTTGCTGTCTTGGTGGATATGTTCCGATTAGATCTTCGACCGGATAGAGTTTCGTTGTT
ATCGGCAATCTCGGCATGTGGGCAGATGGGTGACTATCTACTTGGGAAGTGTTGCCATAATTATTCTCTAAGAAATGGGTATGAAGGTTGGGATAACATTTGCAATGCAA
TGATTGACATGTATATGAAGTGTGGAAAACAAGAAATGGCCTACAGAGTTTTTGACAATACGTCAAATAAGACTATTGTGTCGTGGAACTCATTACTTGTTGGTTACATT
AGAAACAAAGATTTAGAGTCAACTAGGAAGATATTCAATGAGATGCCTGAAAAGGATATAGTGTCTTGGAACACAATGGTTAATGCTTTGGTTCAAGAGAGTATGTGTGA
TGAAGCAATTGAACTGTTCCGAGAGATGCAATTAAAGGAAATAAAAGCAGACAGGGTGACAATGGTAGAAGTTGCATCGGCATGTGGATATCTCGGAGCTCTTGAACTCG
CCAAGTGGACATATGCCTATATTGCAAAGAACAACGTCTACTGTGATATGTTGCTTGAGACAGCATTAGTTGATATGTTTGCTAGGTGTGGTGATCCTCGTAGTGCGATG
GAAGTGTTCAACAATATGGATAGAAAAGACGTCTCGGCATGGACAGCAGCCATTGGAGCAATGGCTGTGGATGGGAATGGCGAACGAGCTATAGAACTTTACAATGAGAT
GCTAAGGCAAGGGGTGAAACCAGATCAAGTAGTTTTTGTAAACATATTAACAGCTTGTAGCCATGGTGGTTTGGTGGAACAAGGGCAACATATATTTGAGTCAATGAAGC
AACATGGAATCTCTCCACAGATTGTTCATTATGGTTGCATGGTTGATCTATTAGGCCGTGCAGGTAAGTTAGAAGAAGCTCTAGATATTATAAAGAGCATGCCAATGAAA
CCCAATGGAATTATATGGGGATCTCTATTGGCTGCATGCCGTACCCATAAAGACATTGATATGGCGACATTTGCAGCTGAAAGGCTTGGGGATGCTGGCTATGTTCCCGA
TGTAACCAATGTTCTTCTTGACGTAAATGAGCAGGAAAAACAATATCTACTGAATCGGCATAGCGAGAAGCTGGCCATGGCTTACGGGCTTATAAGTACAGAAAAGTATG
TACCGATTCGTGTTATAAAGAATCTCCGAATGTGCTCAGATTGTCATGCATTCGCCAAATACATTTCAAAAGTGTATGATAGGGAAATAACAGTACGAGATAATAACAGG
TTTCACTTCTTTAGACAAGGGTCTTGTTCATGTGGTGATTATTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGAACTTGGACAATTACACTGTTACGCGTTGAAGCAGGGTCTCATTCGCAAACAATCGACTGTAACTAAGCTTATTTCCACTTGCGTGGAAATGGGCACTTCAGA
AAGCTTGGATTATGCTCGAAAGGCGTTCGAGCTCTTCCATGAAGAAGGAGAAGCAAATGCCACACTTTTTATGTACAATTCGTTAATCAGAGGATACTCTGCTGCAGGGC
TTTGTGATGAAGCTATTTCGACGTATGTTCAGATGATCGAGGTTGGGTTTATGCCAGACAACTTCACGTTTCCATTTGTGTTAAGTGCGTGTGCTAAGACTGCTGCGTTT
GTAGAAGGTGTTCAGCTCCATGGAGCTCTTATGAAGATTGGTTTAGAGAGAGATATGACAGATTCTTCTAGAGAGGCTGTGGCTTTGTTTTTCCAAATGATCGAGGCAGG
TGTTAGACCCAATTCTGTCACAATGGTGTGTGTGATATCGGCTTGTGCCAAGTTGAAAGATCTTGAACTAGCCAAGAGAATTCGTGCTTACATTGAAGAGTCAGAAATGG
AGCTTAATACTCATATGGTGAATGCACTCGTGGATTTGTACATGAAATGTGGAGAAACTGGTGCTGCAAAGCGGCTATATGATGGATGTGTTGATAAGAATTTGGTTTTA
TGTAACACAATCATGTCAAATTTTGCACGCCATGGTATGCCAAGCGAGGTACTTGCTGTCTTGGTGGATATGTTCCGATTAGATCTTCGACCGGATAGAGTTTCGTTGTT
ATCGGCAATCTCGGCATGTGGGCAGATGGGTGACTATCTACTTGGGAAGTGTTGCCATAATTATTCTCTAAGAAATGGGTATGAAGGTTGGGATAACATTTGCAATGCAA
TGATTGACATGTATATGAAGTGTGGAAAACAAGAAATGGCCTACAGAGTTTTTGACAATACGTCAAATAAGACTATTGTGTCGTGGAACTCATTACTTGTTGGTTACATT
AGAAACAAAGATTTAGAGTCAACTAGGAAGATATTCAATGAGATGCCTGAAAAGGATATAGTGTCTTGGAACACAATGGTTAATGCTTTGGTTCAAGAGAGTATGTGTGA
TGAAGCAATTGAACTGTTCCGAGAGATGCAATTAAAGGAAATAAAAGCAGACAGGGTGACAATGGTAGAAGTTGCATCGGCATGTGGATATCTCGGAGCTCTTGAACTCG
CCAAGTGGACATATGCCTATATTGCAAAGAACAACGTCTACTGTGATATGTTGCTTGAGACAGCATTAGTTGATATGTTTGCTAGGTGTGGTGATCCTCGTAGTGCGATG
GAAGTGTTCAACAATATGGATAGAAAAGACGTCTCGGCATGGACAGCAGCCATTGGAGCAATGGCTGTGGATGGGAATGGCGAACGAGCTATAGAACTTTACAATGAGAT
GCTAAGGCAAGGGGTGAAACCAGATCAAGTAGTTTTTGTAAACATATTAACAGCTTGTAGCCATGGTGGTTTGGTGGAACAAGGGCAACATATATTTGAGTCAATGAAGC
AACATGGAATCTCTCCACAGATTGTTCATTATGGTTGCATGGTTGATCTATTAGGCCGTGCAGGTAAGTTAGAAGAAGCTCTAGATATTATAAAGAGCATGCCAATGAAA
CCCAATGGAATTATATGGGGATCTCTATTGGCTGCATGCCGTACCCATAAAGACATTGATATGGCGACATTTGCAGCTGAAAGGCTTGGGGATGCTGGCTATGTTCCCGA
TGTAACCAATGTTCTTCTTGACGTAAATGAGCAGGAAAAACAATATCTACTGAATCGGCATAGCGAGAAGCTGGCCATGGCTTACGGGCTTATAAGTACAGAAAAGTATG
TACCGATTCGTGTTATAAAGAATCTCCGAATGTGCTCAGATTGTCATGCATTCGCCAAATACATTTCAAAAGTGTATGATAGGGAAATAACAGTACGAGATAATAACAGG
TTTCACTTCTTTAGACAAGGGTCTTGTTCATGTGGTGATTATTGGTAA
Protein sequenceShow/hide protein sequence
MDELGQLHCYALKQGLIRKQSTVTKLISTCVEMGTSESLDYARKAFELFHEEGEANATLFMYNSLIRGYSAAGLCDEAISTYVQMIEVGFMPDNFTFPFVLSACAKTAAF
VEGVQLHGALMKIGLERDMTDSSREAVALFFQMIEAGVRPNSVTMVCVISACAKLKDLELAKRIRAYIEESEMELNTHMVNALVDLYMKCGETGAAKRLYDGCVDKNLVL
CNTIMSNFARHGMPSEVLAVLVDMFRLDLRPDRVSLLSAISACGQMGDYLLGKCCHNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDNTSNKTIVSWNSLLVGYI
RNKDLESTRKIFNEMPEKDIVSWNTMVNALVQESMCDEAIELFREMQLKEIKADRVTMVEVASACGYLGALELAKWTYAYIAKNNVYCDMLLETALVDMFARCGDPRSAM
EVFNNMDRKDVSAWTAAIGAMAVDGNGERAIELYNEMLRQGVKPDQVVFVNILTACSHGGLVEQGQHIFESMKQHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMK
PNGIIWGSLLAACRTHKDIDMATFAAERLGDAGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKYVPIRVIKNLRMCSDCHAFAKYISKVYDREITVRDNNR
FHFFRQGSCSCGDYW