CuGenDBv2

Lag0015605 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015605
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr12:17492175..17503226
RNA-Seq Expression: Lag0015605
Synteny: Lag0015605
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
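
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string and derives the span length. The function name `parse_location` is hypothetical, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(loc: str):
    """Split 'chrom:start..end' into its parts and compute the span length.

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1. The record itself does not say this.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    # Location string copied from the gene record above.
    print(parse_location("chr12:17492175..17503226"))
```

Under the inclusive-coordinate assumption, the gene model spans 11,052 bp on chr12.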


Homology
GenBank top hits (e value, % identity, alignment)
KAG6592948.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.1e-249; % identity: 86.91)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPRLLTPT+LSQIIRKQNNP  AYQLF EA CRYP+Y+HNGPVYAAMINILGNSGR  EMREVI+QMK DSC+CKDS+FSFAIKTYASHGLLEEGISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SLG FNCT+RTQ+FNTLLEILLNESQLDAACQLFQQSS+GWEVKSRT SLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLILMKGLCQDG+L+E
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        AIHLLYSMFWRISRRGSGGDIV+YRTLLFALCDNGEI+QAVEILGKIL+KGLK PKRAHY IDL+ CR S LTV EIK  INEALIKGGIPSSDSYCAMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
        +DLYNENET+QGDKVVSHMLAKGF PPSS+YEAK  ALCKEGKVDDAVKVIE+ETVK SCVPT+ALYNI+L GLC  GKSTVAME+LKKM KQVGLVADK
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI
        ETYS+LV+GLC ENRY EACK+LEEMVIKS+WP SNTFNT+IRGLCSVGK Y+A M +EEMISQGQLPELSVWN+LVSSLCFNVA TD+
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI

XP_016902498.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis melo] (e value: 5.6e-254; % identity: 87.93)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPR+LTPTYLSQIIRKQNNPL AYQLFKEA CRYPDYRHNGPVYAAMINILGNSGR  EMREV++QM+DDSCECKDSVFSFAIKTYASHGLLE+GISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SLGRFNCTNRTQ+FNTLLEILLNESQL AACQLFQ+ SYGWEVKSRT SLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLI+MKGLCQDGRLNE
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        AIHLLYSMFWRISR+GSGGDIV+YRTLLFALCDNGEI+QAVEILGKILRKGLK PKRAHY+IDLDQCR++ LT+ EIKS INEALIKGGIPSSDSYCAMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
        VDLYNEN+T+QGDKVVSHM+AKGF PPSSIYEAKV +LCKEGKVDDAVKVIE++ V  SCVPTIALYNI+LKGLCD+GKSTVAMEYLKKMAK+VGLVA+K
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI
        ETYS+LV+GLC ENRY EACK+LEEMVIKS+WP SNTFNT+I+GLCSVGKQYEA MW+EEMISQGQLP + VWNSLVSSLC +VAG D+
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI

XP_023004266.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita maxima] (e value: 1.9e-249; % identity: 87.27)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPRLLTPT+LSQIIRKQNNP  AYQLF EA CRYP+Y+HNGPVYAAMINILGNSGR  EMREVI+QMK DSC+CKDS+FSFAIKTYASHGLLEEGISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SLG FNCT+RTQ+FNTLLEILLNESQLDAACQLFQQSS+GWEVKSRT SLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLILMKGLCQDG+L+E
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        AIHLLYSMFWRISR+GSGGDIV+YRTLLFALCDNGEI+QAVEILGKIL+KGLK+PKRAHY IDL+ CR S LTV EIK  INEALIKGGIPSSDSYCAMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
        +DLYNENET+QGDKVVSHMLAKGF PPSS+YEAK  ALCKEGKVDDAVKVIE+ETVK SCVPT+ALYNI+L GLC  GKSTVAME+LKKMAKQVGLVADK
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGT
        ETYS+LV+GLC ENRY EACK+LEEMVIKS+WP SNTFNT+IRGLCSVGK Y+A M +EEMISQGQLPELSVWN+LVSSLCFNVAGT
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGT

XP_023514107.1 pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 2.4e-249; % identity: 86.91)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPRLLTPT+LSQIIRKQNNP  AYQLF EA CRYP+Y+HNGPVYAAMINILGNSGR  EMREVI+QMK DSC+CKDS+FSFAIKTYASHGLLEEGISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SLG FNCT+RTQ+FNTLLEILLNESQLDAACQLFQQSS+GWEVKSRT SLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLILMKGLCQDG+L+E
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        AIHLLYSMFWRISRRGSGGDIV+YRTLLFALCDNGEI+QAVEILGKIL+KGLK+PKRAHY IDL+ CR S LTV EIK  INEALIKGGIPSSDSYCAMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
        +DLYNENET+QGDKVVSHMLAKGF PPSS+YEAK  ALCKEGKVDDAVKVIE+ETVK SCVPT+ALYNI+L GLC  G STVAME+LKKM KQVGLVADK
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI
        ETYS+LV+GLC ENRYIEACK+LEEMVIKS+WP SNTFNT+IRGLCSVGK Y+A M +EEMISQGQLPELSVWN+LVSSLCFNVA TD+
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI

XP_038874865.1 pentatricopeptide repeat-containing protein At1g05600 [Benincasa hispida] (e value: 4.6e-256; % identity: 88.55)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPR+LTPTYLSQIIRKQNNPL AYQLFKEA  RYPDYRHNGPVYAAMI+ILGNSGR  EMREVI+QM+DDSCECKDSVFSFAIKTYASHGLLE+GI+LFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SLG+FNCTNRTQ+FNTLLEILLNESQLDAACQLFQ+SSYGW VKSRT SLNLLMQSLCQRGQSELALHVFQEMDYQSCYP+RLSYLILMKGLCQDGRLNE
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        AIHLLYSMFWRIS+RG GGDIV+YRTLLFALCDNG+I+QAVEILGKILRKGLK PKRAHY+IDLDQCR+S LT+GEIK  INEALIKGGIPSSDS+CAMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
        VDLYNENET+QGDKVVSHM+AKGF PPSSI+EAKV ALCKEGKVDDAVKVIE+E VK SCVPT+ALYNI+LKGLCDEGKSTVAMEYLKKMAKQVGLVADK
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI
         TYS+LV+GLC ENRYIEACK+LEEMVIKS+WP SNTFNT+IRGLCSVGKQYEA MW+EEMISQGQLP +SVWNSLVSSLC +VAGT++
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI
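
In each alignment block above, the unlabeled middle line marks identical residues with the residue letter, conservative substitutions with `+`, and other mismatches with a space. As a rough sketch, the per-block identity can be tallied from that line alone. The function name `block_identity` is hypothetical, and the short strings in the example are illustrative rather than copied from a full block. The reported % identity covers the whole alignment, so a single block will not necessarily reproduce it.

```python
def block_identity(query: str, midline: str):
    """Count identities and positives in one alignment block.

    Assumption: `midline` is the BLAST-style match line, where a letter
    means an identical residue, '+' a conservative substitution, and a
    space any other mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and match line must have the same length")
    identical = sum(1 for c in midline if c.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, len(query)


if __name__ == "__main__":
    # Illustrative 10-residue fragment, not a full block from this page.
    ident, pos, length = block_identity("WPRLLTPTYL", "WPR+LTPT L")
    print(f"identities {ident}/{length}, positives {pos}/{length}")
```

Summing the identity counts and block lengths over all blocks of one hit, then dividing, gives an estimate comparable to the listed % identity.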

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E2N8 pentatricopeptide repeat-containing protein At1g05600 isoform X1 (e value: 2.7e-254; % identity: 87.93)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPR+LTPTYLSQIIRKQNNPL AYQLFKEA CRYPDYRHNGPVYAAMINILGNSGR  EMREV++QM+DDSCECKDSVFSFAIKTYASHGLLE+GISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SLGRFNCTNRTQ+FNTLLEILLNESQL AACQLFQ+ SYGWEVKSRT SLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLI+MKGLCQDGRLNE
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        AIHLLYSMFWRISR+GSGGDIV+YRTLLFALCDNGEI+QAVEILGKILRKGLK PKRAHY+IDLDQCR++ LT+ EIKS INEALIKGGIPSSDSYCAMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
        VDLYNEN+T+QGDKVVSHM+AKGF PPSSIYEAKV +LCKEGKVDDAVKVIE++ V  SCVPTIALYNI+LKGLCD+GKSTVAMEYLKKMAK+VGLVA+K
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI
        ETYS+LV+GLC ENRY EACK+LEEMVIKS+WP SNTFNT+I+GLCSVGKQYEA MW+EEMISQGQLP + VWNSLVSSLC +VAG D+
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI

A0A5D3DM62 Pentatricopeptide repeat-containing protein (e value: 2.7e-254; % identity: 87.93)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPR+LTPTYLSQIIRKQNNPL AYQLFKEA CRYPDYRHNGPVYAAMINILGNSGR  EMREV++QM+DDSCECKDSVFSFAIKTYASHGLLE+GISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SLGRFNCTNRTQ+FNTLLEILLNESQL AACQLFQ+ SYGWEVKSRT SLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLI+MKGLCQDGRLNE
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        AIHLLYSMFWRISR+GSGGDIV+YRTLLFALCDNGEI+QAVEILGKILRKGLK PKRAHY+IDLDQCR++ LT+ EIKS INEALIKGGIPSSDSYCAMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
        VDLYNEN+T+QGDKVVSHM+AKGF PPSSIYEAKV +LCKEGKVDDAVKVIE++ V  SCVPTIALYNI+LKGLCD+GKSTVAMEYLKKMAK+VGLVA+K
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI
        ETYS+LV+GLC ENRY EACK+LEEMVIKS+WP SNTFNT+I+GLCSVGKQYEA MW+EEMISQGQLP + VWNSLVSSLC +VAG D+
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI

A0A6J1DVG7 pentatricopeptide repeat-containing protein At1g05600 (e value: 5.8e-249; % identity: 85.69)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPR+LTPT+LSQIIRKQNNP  A+QLFKEA CRYP YRHNGPVYAAMI+ILGNSGR  EMREVI+QMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SLGRFNCTNRTQ+FNTLLEILLNES+LDAACQLFQQSSYGW VKSRT SLNLLMQSLCQR QSELALH+FQEMDYQ CYPNRLSYLILMKGLCQDGRLNE
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        A+HLLYSMFWRIS+RGSGGDIV+YRTLL+ALC NGE++QAVEILGKILRKGLK PKR HY+IDLDQC++S LT+ EIK   NEALIKGGIPS  +YCAMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
        VDLYNEN T+QGDKVVSHMLAKGF PPSSIYEAK  ALCKEGKVDDA++VI++ETVK SC+P++ALYNI+LKGL +EGKSTVA+EYLKKMAKQVGLVADK
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI
        ETYS LV+GLC+ENRYIEACK+LEEMVIKSY P S+TFNT+I GLCS+GKQYEA MW+EEMISQGQLPELSVWNSLVSS+CFNVAGTD+
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI

A0A6J1H9J7 pentatricopeptide repeat-containing protein At1g05600 (e value: 7.6e-249; % identity: 86.71)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPRLLTPT+LSQIIRKQNNP  AYQLF EA CRYP+Y+HNGPVYAAMINILGNSGR  EMREVI+QMK DSC+CKDS+FSFAIKTYASHGLLEEGISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SLG FNCT+RTQ+FNTLLEILLNESQLDAACQLFQQSS+GWEVKSRT SLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLILMKGLCQDG+L+E
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        AIHLLYSMFWRISRRGSGGDIV+YRTLLFALCDNGEI+QAVEILGKIL+KGLK PKRAHY IDL+ CR S LTV EIK  INEALIKGGIPSSDSYCAMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
        +DLYNENET+QGDKVVSHMLAKGF PPSS+YEAK  ALCKEGKVDDAVKVIE+ETVK SCVPT+ALYNI+L GLC  GKSTVAME+LKKM KQVGLVADK
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI
        ETYS+LV+GLC ENRY EACK+LEEMVIKS+WP SNTFNT+ RGLCSVGK Y+A M +EEMISQGQLPELSVWN+LVSSLCFNVA TD+
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDI

A0A6J1KU44 pentatricopeptide repeat-containing protein At1g05600 (e value: 9.0e-250; % identity: 87.27)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPRLLTPT+LSQIIRKQNNP  AYQLF EA CRYP+Y+HNGPVYAAMINILGNSGR  EMREVI+QMK DSC+CKDS+FSFAIKTYASHGLLEEGISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SLG FNCT+RTQ+FNTLLEILLNESQLDAACQLFQQSS+GWEVKSRT SLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLILMKGLCQDG+L+E
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        AIHLLYSMFWRISR+GSGGDIV+YRTLLFALCDNGEI+QAVEILGKIL+KGLK+PKRAHY IDL+ CR S LTV EIK  INEALIKGGIPSSDSYCAMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
        +DLYNENET+QGDKVVSHMLAKGF PPSS+YEAK  ALCKEGKVDDAVKVIE+ETVK SCVPT+ALYNI+L GLC  GKSTVAME+LKKMAKQVGLVADK
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGT
        ETYS+LV+GLC ENRY EACK+LEEMVIKS+WP SNTFNT+IRGLCSVGK Y+A M +EEMISQGQLPELSVWN+LVSSLCFNVAGT
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGT

SwissProt top hits (e value, % identity, alignment)
O49436 Pentatricopeptide repeat-containing protein At4g20090 (e value: 2.2e-43; % identity: 24.89)
Query:  NPLRA---YQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLF-KSLGRFNCTNRTQSF
        NPL A    ++FK A  +   ++      ++MI    NSG F  + +++++++ ++    +  F    + Y    L ++ + LF + +  F C    +SF
Subjt:  NPLRA---YQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLF-KSLGRFNCTNRTQSF

Query:  NTLLEILLNESQLDAACQLFQ---QSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWR
        N++L +++NE       + +     S+    +     S NL++++LC+    + A+ VF+ M  + C P+  +Y  LM GLC++ R++EA+ LL  M   
Subjt:  NTLLEILLNESQLDAACQLFQ---QSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWR

Query:  ISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMAVDLYNENETNQ
            G     V+Y  L+  LC  G++ +  +++  +  KG    +  +  +    C    L   +  S +   +    IP+  +Y  +   L  +     
Subjt:  ISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMAVDLYNENETNQ

Query:  GDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLC
          +++S M  +G+     IY   ++ L KEGK ++A+ +   +  ++ C P I +Y++++ GLC EGK   A E L +M    G + +  TYSSL+ G  
Subjt:  GDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLC

Query:  SENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLC
              EA ++ +EM       +   ++ +I GLC VG+  EA M   +M++ G  P+   ++S++  LC
Subjt:  SENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLC

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g74580 (e value: 8.9e-45; % identity: 23)
Query:  LTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDD-SCECKDSVFSFAIKTYASHGLLEEGISLFKSLG
        L P +++ +I+ Q +P++A ++F     +   ++H    Y ++I  LG  G+F  M EV+  M+++      + V+  A+K Y   G ++E +++F+ + 
Subjt:  LTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDD-SCECKDSVFSFAIKTYASHGLLEEGISLFKSLG

Query:  RFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKG------------
         ++C     S+N ++ +L++    D A +++ +      +    +S  + M+S C+  +   AL +   M  Q C  N ++Y  ++ G            
Subjt:  RFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKG------------

Query:  -----------------------LCQDGRLNEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCR
                               LC+ G + E   LL     ++ +RG   ++  Y   +  LC  GE+  AV ++G ++ +G K     +  +    C+
Subjt:  -----------------------LCQDGRLNEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCR

Query:  DSNLTVGEIKSSINEALIKGGIPSSDSYCAMAVDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYN
        +S     E+   + + + +G  P S +Y  +             +++V   +  GF P    Y + +  LC EG+ + A+ +  +E + +   P + LYN
Subjt:  DSNLTVGEIKSSINEALIKGGIPSSDSYCAMAVDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYN

Query:  IILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLP
         ++KGL ++G    A +   +M+++ GL+ + +T++ LVNGLC      +A  +++ M+ K Y+P   TFN +I G  +  K   A   ++ M+  G  P
Subjt:  IILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLP

Query:  ELSVWNSLVSSLC
        ++  +NSL++ LC
Subjt:  ELSVWNSLVSSLC

Q9FFE3 Pentatricopeptide repeat-containing protein At5g16420, mitochondrial (e value: 3.9e-40; % identity: 23.21)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDD--SCECKDSVFSFAIKTYASHGLLEEGISL
        WP+ L P  L  +I +Q N   A Q+F  A   +P + HN   Y +++  L  +  F  +  ++  +++     +C +++F   ++ Y   G  E  + +
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDD--SCECKDSVFSFAIKTYASHGLLEEGISL

Query:  FKSLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRL
        F  +  F      +S NTLL +L+   + D    +F+ S   + +     + NLL+++LC++   E A  V  E+      PN ++Y  ++ G    G +
Subjt:  FKSLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRL

Query:  NEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQ-CRDSNLTVGEIKSSINEALIKGGIPSSDSYC
          A  +L  M      RG   D   Y  L+   C  G   +A  ++  + +  ++ P    Y + +   C++     GE ++  +E L +  +P S   C
Subjt:  NEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQ-CRDSNLTVGEIKSSINEALIKGGIPSSDSYC

Query:  AMAVDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLV
         +   L  +++ ++   +   ML     P +++    +  LCKEG+V +A K+ ++   ++  +P++  YN ++ G+C++G+ T A      M ++    
Subjt:  AMAVDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLV

Query:  ADKETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELS
         +  TY+ L+ GL       E  ++LEEM+    +P+  TF  +  GL  +GK+ +A   +   +  G++ + S
Subjt:  ADKETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELS

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic (e value: 2.3e-45; % identity: 24.66)
Query:  TPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK-SLGR
        T   L   +R Q +   A +LF  AS + P++     +Y  ++  LG SG F +M++++  MK   CE   S F   I++YA   L +E +S+    +  
Subjt:  TPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK-SLGR

Query:  FNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLC-----------------------------------QRGQSELALHV
        F     T  +N +L +L++ + L    ++       W +K    + N+L+++LC                                   + G  + AL +
Subjt:  FNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLC-----------------------------------QRGQSELALHV

Query:  FQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRD
         ++M    C  + +S  +++ G C++GR+ +A++ +  M    ++ G   D   + TL+  LC  G +K A+EI+  +L++G       +  +    C+ 
Subjt:  FQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRD

Query:  SNLTVGEIKSSI---NEALIKGGIPSSDSYCAMAVDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIAL
            +GE+K ++   ++ + +   P++ +Y  +   L  EN+  +  ++   + +KG  P    + + +  LC       A+++ E E   + C P    
Subjt:  SNLTVGEIKSSI---NEALIKGGIPSSDSYCAMAVDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIAL

Query:  YNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQ
        YN+++  LC +GK   A+  LK+M +  G      TY++L++G C  N+  EA +I +EM +     +S T+NT+I GLC   +  +AA  M++MI +GQ
Subjt:  YNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQ

Query:  LPELSVWNSLVSSLC
         P+   +NSL++  C
Subjt:  LPELSVWNSLVSSLC

Q9SYK1 Pentatricopeptide repeat-containing protein At1g05600 (e value: 4.4e-161; % identity: 55.81)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPR+LTP+ LSQI++KQ NP+ A +LF+EA  R+P Y HNG VYA MI+ILG S R  EM+ VI +MK+DSCECKDSVF+  I+T++  G LE+ ISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SL  FNC N + SF+TLL+ ++ ES+L+AAC +F++  YGWEV SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY ILMKG C +G+L E
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        A HLLYSMFWRIS++GSG DIVVYR LL ALCD GE+  A+EILGKILRKGLK PKR ++ I+      S+  +  +K  + E LI+G IP  DSY AMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
         DL+ E +  +G++V+  M +KGF P   IY AKV ALC+ GK+ +AV VI  E ++  C+PT+ +YN+++KGLCD+GKS  A+ YLKKM+KQV  VA++
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCF
        ETY +LV+GLC + +++EA +++EEM+IKS++P   T++ +I+GLC + ++YEA MW+EEM+SQ  +PE SVW +L  S+CF
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCF
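
The SwissProt hits above are listed by accession, not by significance. As a small sketch, they can be ranked by e value, where smaller means a more significant match. The dictionary name `swissprot_hits` is hypothetical, and the values are transcribed from the hit lines above.

```python
# E values transcribed from the SwissProt hit lines above.
swissprot_hits = {
    "O49436": 2.2e-43,
    "Q9CA58": 8.9e-45,
    "Q9FFE3": 3.9e-40,
    "Q9LFF1": 2.3e-45,
    "Q9SYK1": 4.4e-161,
}

# Sort ascending: the smallest e value is the most significant hit.
ranked = sorted(swissprot_hits.items(), key=lambda item: item[1])

if __name__ == "__main__":
    for accession, e_value in ranked:
        print(f"{accession}\t{e_value:.1e}")
```

Ranked this way, Q9SYK1 (At1g05600) comes first by a wide margin, in line with its much higher % identity than the other four hits.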

Arabidopsis top hits (e value, % identity, alignment)
AT1G05600.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 3.1e-162; % identity: 55.81)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPR+LTP+ LSQI++KQ NP+ A +LF+EA  R+P Y HNG VYA MI+ILG S R  EM+ VI +MK+DSCECKDSVF+  I+T++  G LE+ ISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SL  FNC N + SF+TLL+ ++ ES+L+AAC +F++  YGWEV SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY ILMKG C +G+L E
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        A HLLYSMFWRIS++GSG DIVVYR LL ALCD GE+  A+EILGKILRKGLK PKR ++ I+      S+  +  +K  + E LI+G IP  DSY AMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
         DL+ E +  +G++V+  M +KGF P   IY AKV ALC+ GK+ +AV VI  E ++  C+PT+ +YN+++KGLCD+GKS  A+ YLKKM+KQV  VA++
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCF
        ETY +LV+GLC + +++EA +++EEM+IKS++P   T++ +I+GLC + ++YEA MW+EEM+SQ  +PE SVW +L  S+CF
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCF

AT1G05600.2 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 3.1e-162; % identity: 55.81)
Query:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK
        WPR+LTP+ LSQI++KQ NP+ A +LF+EA  R+P Y HNG VYA MI+ILG S R  EM+ VI +MK+DSCECKDSVF+  I+T++  G LE+ ISLFK
Subjt:  WPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK

Query:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE
        SL  FNC N + SF+TLL+ ++ ES+L+AAC +F++  YGWEV SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY ILMKG C +G+L E
Subjt:  SLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
        A HLLYSMFWRIS++GSG DIVVYR LL ALCD GE+  A+EILGKILRKGLK PKR ++ I+      S+  +  +K  + E LI+G IP  DSY AMA
Subjt:  AIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA

Query:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK
         DL+ E +  +G++V+  M +KGF P   IY AKV ALC+ GK+ +AV VI  E ++  C+PT+ +YN+++KGLCD+GKS  A+ YLKKM+KQV  VA++
Subjt:  VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADK

Query:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCF
        ETY +LV+GLC + +++EA +++EEM+IKS++P   T++ +I+GLC + ++YEA MW+EEM+SQ  +PE SVW +L  S+CF
Subjt:  ETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCF

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 6.3e-46; % identity: 23)
Query:  LTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDD-SCECKDSVFSFAIKTYASHGLLEEGISLFKSLG
        L P +++ +I+ Q +P++A ++F     +   ++H    Y ++I  LG  G+F  M EV+  M+++      + V+  A+K Y   G ++E +++F+ + 
Subjt:  LTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDD-SCECKDSVFSFAIKTYASHGLLEEGISLFKSLG

Query:  RFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKG------------
         ++C     S+N ++ +L++    D A +++ +      +    +S  + M+S C+  +   AL +   M  Q C  N ++Y  ++ G            
Subjt:  RFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKG------------

Query:  -----------------------LCQDGRLNEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCR
                               LC+ G + E   LL     ++ +RG   ++  Y   +  LC  GE+  AV ++G ++ +G K     +  +    C+
Subjt:  -----------------------LCQDGRLNEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCR

Query:  DSNLTVGEIKSSINEALIKGGIPSSDSYCAMAVDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYN
        +S     E+   + + + +G  P S +Y  +             +++V   +  GF P    Y + +  LC EG+ + A+ +  +E + +   P + LYN
Subjt:  DSNLTVGEIKSSINEALIKGGIPSSDSYCAMAVDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYN

Query:  IILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLP
         ++KGL ++G    A +   +M+++ GL+ + +T++ LVNGLC      +A  +++ M+ K Y+P   TFN +I G  +  K   A   ++ M+  G  P
Subjt:  IILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLP

Query:  ELSVWNSLVSSLC
        ++  +NSL++ LC
Subjt:  ELSVWNSLVSSLC

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.7e-46; % identity: 24.66)
Query:  TPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK-SLGR
        T   L   +R Q +   A +LF  AS + P++     +Y  ++  LG SG F +M++++  MK   CE   S F   I++YA   L +E +S+    +  
Subjt:  TPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLFK-SLGR

Query:  FNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLC-----------------------------------QRGQSELALHV
        F     T  +N +L +L++ + L    ++       W +K    + N+L+++LC                                   + G  + AL +
Subjt:  FNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLC-----------------------------------QRGQSELALHV

Query:  FQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRD
         ++M    C  + +S  +++ G C++GR+ +A++ +  M    ++ G   D   + TL+  LC  G +K A+EI+  +L++G       +  +    C+ 
Subjt:  FQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRD

Query:  SNLTVGEIKSSI---NEALIKGGIPSSDSYCAMAVDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIAL
            +GE+K ++   ++ + +   P++ +Y  +   L  EN+  +  ++   + +KG  P    + + +  LC       A+++ E E   + C P    
Subjt:  SNLTVGEIKSSI---NEALIKGGIPSSDSYCAMAVDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIAL

Query:  YNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQ
        YN+++  LC +GK   A+  LK+M +  G      TY++L++G C  N+  EA +I +EM +     +S T+NT+I GLC   +  +AA  M++MI +GQ
Subjt:  YNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQ

Query:  LPELSVWNSLVSSLC
         P+   +NSL++  C
Subjt:  LPELSVWNSLVSSLC

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.6e-44 | %identity: 24.89
Query:  NPLRA---YQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLF-KSLGRFNCTNRTQSF
        NPL A    ++FK A  +   ++      ++MI    NSG F  + +++++++ ++    +  F    + Y    L ++ + LF + +  F C    +SF
Subjt:  NPLRA---YQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHGLLEEGISLF-KSLGRFNCTNRTQSF

Query:  NTLLEILLNESQLDAACQLFQ---QSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWR
        N++L +++NE       + +     S+    +     S NL++++LC+    + A+ VF+ M  + C P+  +Y  LM GLC++ R++EA+ LL  M   
Subjt:  NTLLEILLNESQLDAACQLFQ---QSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWR

Query:  ISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMAVDLYNENETNQ
            G     V+Y  L+  LC  G++ +  +++  +  KG    +  +  +    C    L   +  S +   +    IP+  +Y  +   L  +     
Subjt:  ISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMAVDLYNENETNQ

Query:  GDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLC
          +++S M  +G+     IY   ++ L KEGK ++A+ +   +  ++ C P I +Y++++ GLC EGK   A E L +M    G + +  TYSSL+ G  
Subjt:  GDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSSLVNGLC

Query:  SENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLC
              EA ++ +EM       +   ++ +I GLC VG+  EA M   +M++ G  P+   ++S++  LC
Subjt:  SENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLC


Sequences
CDS sequence
ATGGTTGGCGGAAAAAACAGAGGGAAAAGAGTAAAGTTGTTCAAGACTGTGTTCTTTGGCAAAAAGAAGGAAAAAGAAACAAGAAAATGGAGAAATTGTCGAATG
ATTCCGGAGAAAAAAGCTGGGTTACGAGGATATACGCTTGTATGTCATCGGAGTGTGAAAATCATGCAAGAATGGGCGAGAAAGAAATCGGTCGTGGCGAAGTCT
GGATTTTCGAAGTACGATCGTTGGGTGTCGAAGCTTTCGTGGCAAGAGTTGAAAACTTGGAGAGGAGAACATAGCAGCGACCACAAAGTTAGGAAAAAGGCAGAA
GAAAACGAGGAGAAAAGGAGAATGGCCGACAGTGGTCGTCACTGGCAATGTCGACATGTGTGGTCGGAGACAAAATTTTGGGTAATGTGCCATCGCAAGATTAGT
AAAAATGCAGGGAAAGAAATGAAGGAGAAGGACCATGAAGAAAATCAAACTAAAAATGAAATCCTTCTTGACGGAGAATGGACCCAGCAACAAATGCAGATGGAG
GTGATAATTTTTCGATGGAGGTGCAAACTCATCTTTCGTAATGAACAAGACGAAATAAACCCTAGTCACGTGCAATGTCAGGGAGTGGGGCCATGGATTTGGAAA
AGGTTGGGCATTAACATTGGGCTTTTCAATGGGCTTATTTTAATTGAGTTAGTTGAAGTCCATCTTGGAGAAAAATCTGGAAAACAGGATTGGTGTGGTGGGATA
TTCAATGGGCCGATTCTAAAGGAATTGGTGGAGGCCCAATATTCTTTCGCAATACCCAATGTTTTCAGGTTTACCATTGGGGCACAATCCGGTATCTTCGTTGTT
TGGGACCCAGTGATGGATAAGGTGAGGAAAAGACTTGCAAGTTGGAAGAAGAGTTTTTTCTCCAAAGGAGGTAGACTGACCCTCATCCGTTTCGTGTTAAGTGGC
TTCAGAGCCCCTTGTTCGATGTGTAAAACTCTCGAGAAGCTGATGAAGGACTTTCTGTGGGAAGGAGTGTATGAGGGCAAGTCTATTCATTTGGTGAGTTGGGAC
TTAGTGGGGCGGCCTTTAAGTCGGGGGGGGGGGTTGAGGATTATTGTTAGCAAATACGACCCTCATCCTTCTGAATGGGTGATGGGAGGGATCAAAGGCACTTCT
AGAAACCCTTGGAAAGAAATTTCTCACGAGCTCCCTTCCTTGACCTCTTTTGTTTGTTATTCTGTGGGGAATGGGGAGGATACATATTTTTGGGAAGACAGGTGG
GTGGGAGATAGACCCCTTTGTGCTACTTTTCCTCGGCTTTATCACTTATCTTCTATGAAAAACCACCCTGTGGTTGAGGTTTTGAGTCCTTTAGGGAGTTCTCTA
TCCTATTCTTTTGGCTTCTTGCGTTCGTTATCTGATAGAGATACTACAGACCTCTTGTCTCTCTTGTCCTTGATTGGGGAGTTCTCTTTTAGTACTACGAGGAGG
GATTTCCATTTGTGGATTCCTAGCCCATCCATTGACTTCTCTTGTCGGTCCTTCCATTGCTTACTAGACCCCTCTCCTTTCGACTTGTTTATCTTTTCCATGTTA
TGGAAGATCCTTTGCTTGATCGGGCCTTTTTGTTGCATTCTCTGTCGGATGGCGGAGGAAGACCTCGACCATATTCTGTGGAGTTGTAGATTTGCTAAGGCAGTG
TGGGATGAGTTTTTTGCATCGTTTGGTTTGCAGTTTGCCAGACATAGGGGCCTTAGAGAGATGATCGAGGAGTTCCTTTCCCATCCTCCTTTTAGGGATCAAGGA
AATTTCTTGTGGCAAGCTGGGATCTGCACTATTATTTGGAGGCTTTGGGGTGAGAGGAACAATAGGACGTTTAGTGGGATGGAGAGGGATGCGACTGAGCTCAAA
GAACTTGGACAGTTTGGTGGAAGAACAATACTTGCTGGACACTGGTGGCCAAGGCTTTTAACGCCCACATACCTATCTCAGATTATTAGGAAGCAGAATAACCCC
TTAAGAGCTTACCAATTGTTCAAGGAAGCCAGTTGTAGGTACCCAGATTATCGGCACAATGGTCCGGTGTATGCTGCAATGATCAATATACTTGGAAACTCGGGT
CGATTTTTCGAGATGAGAGAAGTGATAAATCAGATGAAAGATGACTCCTGCGAGTGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGA
TTATTGGAAGAAGGTATATCCCTGTTTAAAAGCCTTGGGAGATTTAACTGTACAAATAGAACACAAAGTTTCAATACCCTTTTGGAGATTCTCTTGAATGAATCT
CAGCTTGATGCTGCTTGTCAGCTTTTTCAGCAGAGTTCTTATGGTTGGGAAGTGAAATCCAGGACTCATTCCTTGAATTTGCTTATGCAATCCCTTTGCCAAAGA
GGCCAATCTGAACTTGCTTTACATGTCTTTCAAGAGATGGATTACCAAAGTTGCTATCCAAATAGGCTGAGTTATTTGATTCTAATGAAAGGGCTGTGTCAAGAT
GGTAGGCTTAATGAGGCCATCCATTTGCTGTATTCCATGTTTTGGAGGATTTCTCGAAGGGGTAGTGGAGGGGACATAGTTGTTTACAGAACCCTTCTATTTGCT
TTATGTGATAACGGAGAGATTAAGCAGGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTGAAAACCCCTAAGCGAGCTCATTACCAGATTGACCTCGAT
CAATGCAGGGATAGCAACCTCACTGTTGGTGAAATCAAGAGTTCAATCAATGAAGCTTTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTACTGTGCAATGGCT
GTCGATCTATATAACGAAAACGAAACTAATCAGGGAGATAAAGTTGTTAGCCACATGCTAGCTAAAGGCTTCACGCCCCCATCCTCGATCTATGAAGCGAAAGTG
ACTGCATTATGCAAAGAAGGCAAAGTTGATGATGCAGTGAAAGTTATTGAAGATGAAACAGTGAAAAGAAGTTGTGTTCCAACTATTGCATTGTACAACATCATT
CTTAAGGGTCTGTGTGATGAGGGTAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTGGGCCTTGTTGCAGACAAGGAAACTTACAGCAGT
TTAGTAAATGGACTTTGTAGTGAAAATAGATACATTGAAGCATGTAAGATCTTAGAAGAGATGGTTATCAAATCGTATTGGCCTTCTTCTAACACATTCAATACA
GTTATCAGAGGTCTTTGTTCGGTTGGAAAACAGTATGAAGCTGCGATGTGGATGGAAGAAATGATTAGCCAAGGTCAATTGCCTGAACTTTCTGTCTGGAATTCT
TTGGTTTCATCCTTGTGTTTCAACGTGGCTGGAACTGATATATCGTTCGAGGGCGTAAGGAATTTTATCAAAACAGATCTGGTTTTTTCTCCGATCAAAACCCTC
GCCATCTCTCGTATCGCTCTTCGACCTTCTCATCTCCCCGACTGTGTCATCAGTTTTTTTCTCGAAACAGATCTGACGTTGCCGCTTCCTCTTTCAGTCACGCCG
CCGCCTCAGCCGAAGTCTCGCGTCGCCGCTGTTCGAGCCATTCGTCGCGCCACCTCCCTTTGCGTGCGATTCTTCTCTGTGAGCTCCCTCTCTCCGCGTCGTCAC
TCCCATCCGCCGTCGCCTTCCCTCGCGCCGCGCCCGGAAGTGTTGTTGCCGCCGTTTGTCGAGCTCTGCCGAGCCCAGATCGAAGCCGCCACATCTCTTCCCCTT
GTTTCGCGCGATTTCGGCCAAGCCCGATCCGGCGCGTCCAACAGCTTGAGGTTCGTTTTCGAGCATCCTCGCCTCTGTCCAGCGGCGTTTGGACCCCATTCTGGA
CCGTGGAAGCGTCGCCTAGCCTTGTTGATTCGGGCCATTTGCAGCCGCTGTACAGCACCTAATTGGGTCTGTTTGGTGCTGATTGACCTTGGACAGCACCTGTTT
AAGGAGTTTCAGTGCTGTGTTAGGTTGTTCCAACGAGGTTCAACTCCTGTTTGGGAGCCCTTGGTTGTGTTGTTGGTTTGCTTAGAGCTTGCTTTGTATTTTCCA
GCAGCGCTGGTATTTAAGAGTGCTATGCTTGGGATAAGCATGAAAATGACTTACCGTGGTTGTACTGATACCCCCTTCCTCACCTTCCCCAACATTTTAGATGTT
GCAGGTATCGAGGATGATCCAGACCTTGGTGGCGAGGAGGACTATGAGGAAAATCCTAGGCTACCCCATCTTCTCAGTCGATGGCTCGTGGTCGTGGTCGTGGAC
GGGTTGTGGCGTGGACCCAGGAATCCAGCTAACAGACAGGAGGACCATGTCGATGATGCCCCTGCTCCGCCGGAGATTAACCAGCCGATTCCTCCTAGTCAGCAC
GAAGTTGACCCTCCGCCCCCTCCAGTCCCCCGTGCTCCTCGTAGGCAGCAAGAGGTTGTTCCCCCAGCACCGCCTTCAGTGGTTCCCCACCACCACCTCCAGTCC
CGCACATGTTGA
mRNA sequence
ATGGTTGGCGGAAAAAACAGAGGGAAAAGAGTAAAGTTGTTCAAGACTGTGTTCTTTGGCAAAAAGAAGGAAAAAGAAACAAGAAAATGGAGAAATTGTCGAATG
ATTCCGGAGAAAAAAGCTGGGTTACGAGGATATACGCTTGTATGTCATCGGAGTGTGAAAATCATGCAAGAATGGGCGAGAAAGAAATCGGTCGTGGCGAAGTCT
GGATTTTCGAAGTACGATCGTTGGGTGTCGAAGCTTTCGTGGCAAGAGTTGAAAACTTGGAGAGGAGAACATAGCAGCGACCACAAAGTTAGGAAAAAGGCAGAA
GAAAACGAGGAGAAAAGGAGAATGGCCGACAGTGGTCGTCACTGGCAATGTCGACATGTGTGGTCGGAGACAAAATTTTGGGTAATGTGCCATCGCAAGATTAGT
AAAAATGCAGGGAAAGAAATGAAGGAGAAGGACCATGAAGAAAATCAAACTAAAAATGAAATCCTTCTTGACGGAGAATGGACCCAGCAACAAATGCAGATGGAG
GTGATAATTTTTCGATGGAGGTGCAAACTCATCTTTCGTAATGAACAAGACGAAATAAACCCTAGTCACGTGCAATGTCAGGGAGTGGGGCCATGGATTTGGAAA
AGGTTGGGCATTAACATTGGGCTTTTCAATGGGCTTATTTTAATTGAGTTAGTTGAAGTCCATCTTGGAGAAAAATCTGGAAAACAGGATTGGTGTGGTGGGATA
TTCAATGGGCCGATTCTAAAGGAATTGGTGGAGGCCCAATATTCTTTCGCAATACCCAATGTTTTCAGGTTTACCATTGGGGCACAATCCGGTATCTTCGTTGTT
TGGGACCCAGTGATGGATAAGGTGAGGAAAAGACTTGCAAGTTGGAAGAAGAGTTTTTTCTCCAAAGGAGGTAGACTGACCCTCATCCGTTTCGTGTTAAGTGGC
TTCAGAGCCCCTTGTTCGATGTGTAAAACTCTCGAGAAGCTGATGAAGGACTTTCTGTGGGAAGGAGTGTATGAGGGCAAGTCTATTCATTTGGTGAGTTGGGAC
TTAGTGGGGCGGCCTTTAAGTCGGGGGGGGGGGTTGAGGATTATTGTTAGCAAATACGACCCTCATCCTTCTGAATGGGTGATGGGAGGGATCAAAGGCACTTCT
AGAAACCCTTGGAAAGAAATTTCTCACGAGCTCCCTTCCTTGACCTCTTTTGTTTGTTATTCTGTGGGGAATGGGGAGGATACATATTTTTGGGAAGACAGGTGG
GTGGGAGATAGACCCCTTTGTGCTACTTTTCCTCGGCTTTATCACTTATCTTCTATGAAAAACCACCCTGTGGTTGAGGTTTTGAGTCCTTTAGGGAGTTCTCTA
TCCTATTCTTTTGGCTTCTTGCGTTCGTTATCTGATAGAGATACTACAGACCTCTTGTCTCTCTTGTCCTTGATTGGGGAGTTCTCTTTTAGTACTACGAGGAGG
GATTTCCATTTGTGGATTCCTAGCCCATCCATTGACTTCTCTTGTCGGTCCTTCCATTGCTTACTAGACCCCTCTCCTTTCGACTTGTTTATCTTTTCCATGTTA
TGGAAGATCCTTTGCTTGATCGGGCCTTTTTGTTGCATTCTCTGTCGGATGGCGGAGGAAGACCTCGACCATATTCTGTGGAGTTGTAGATTTGCTAAGGCAGTG
TGGGATGAGTTTTTTGCATCGTTTGGTTTGCAGTTTGCCAGACATAGGGGCCTTAGAGAGATGATCGAGGAGTTCCTTTCCCATCCTCCTTTTAGGGATCAAGGA
AATTTCTTGTGGCAAGCTGGGATCTGCACTATTATTTGGAGGCTTTGGGGTGAGAGGAACAATAGGACGTTTAGTGGGATGGAGAGGGATGCGACTGAGCTCAAA
GAACTTGGACAGTTTGGTGGAAGAACAATACTTGCTGGACACTGGTGGCCAAGGCTTTTAACGCCCACATACCTATCTCAGATTATTAGGAAGCAGAATAACCCC
TTAAGAGCTTACCAATTGTTCAAGGAAGCCAGTTGTAGGTACCCAGATTATCGGCACAATGGTCCGGTGTATGCTGCAATGATCAATATACTTGGAAACTCGGGT
CGATTTTTCGAGATGAGAGAAGTGATAAATCAGATGAAAGATGACTCCTGCGAGTGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGA
TTATTGGAAGAAGGTATATCCCTGTTTAAAAGCCTTGGGAGATTTAACTGTACAAATAGAACACAAAGTTTCAATACCCTTTTGGAGATTCTCTTGAATGAATCT
CAGCTTGATGCTGCTTGTCAGCTTTTTCAGCAGAGTTCTTATGGTTGGGAAGTGAAATCCAGGACTCATTCCTTGAATTTGCTTATGCAATCCCTTTGCCAAAGA
GGCCAATCTGAACTTGCTTTACATGTCTTTCAAGAGATGGATTACCAAAGTTGCTATCCAAATAGGCTGAGTTATTTGATTCTAATGAAAGGGCTGTGTCAAGAT
GGTAGGCTTAATGAGGCCATCCATTTGCTGTATTCCATGTTTTGGAGGATTTCTCGAAGGGGTAGTGGAGGGGACATAGTTGTTTACAGAACCCTTCTATTTGCT
TTATGTGATAACGGAGAGATTAAGCAGGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTGAAAACCCCTAAGCGAGCTCATTACCAGATTGACCTCGAT
CAATGCAGGGATAGCAACCTCACTGTTGGTGAAATCAAGAGTTCAATCAATGAAGCTTTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTACTGTGCAATGGCT
GTCGATCTATATAACGAAAACGAAACTAATCAGGGAGATAAAGTTGTTAGCCACATGCTAGCTAAAGGCTTCACGCCCCCATCCTCGATCTATGAAGCGAAAGTG
ACTGCATTATGCAAAGAAGGCAAAGTTGATGATGCAGTGAAAGTTATTGAAGATGAAACAGTGAAAAGAAGTTGTGTTCCAACTATTGCATTGTACAACATCATT
CTTAAGGGTCTGTGTGATGAGGGTAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTGGGCCTTGTTGCAGACAAGGAAACTTACAGCAGT
TTAGTAAATGGACTTTGTAGTGAAAATAGATACATTGAAGCATGTAAGATCTTAGAAGAGATGGTTATCAAATCGTATTGGCCTTCTTCTAACACATTCAATACA
GTTATCAGAGGTCTTTGTTCGGTTGGAAAACAGTATGAAGCTGCGATGTGGATGGAAGAAATGATTAGCCAAGGTCAATTGCCTGAACTTTCTGTCTGGAATTCT
TTGGTTTCATCCTTGTGTTTCAACGTGGCTGGAACTGATATATCGTTCGAGGGCGTAAGGAATTTTATCAAAACAGATCTGGTTTTTTCTCCGATCAAAACCCTC
GCCATCTCTCGTATCGCTCTTCGACCTTCTCATCTCCCCGACTGTGTCATCAGTTTTTTTCTCGAAACAGATCTGACGTTGCCGCTTCCTCTTTCAGTCACGCCG
CCGCCTCAGCCGAAGTCTCGCGTCGCCGCTGTTCGAGCCATTCGTCGCGCCACCTCCCTTTGCGTGCGATTCTTCTCTGTGAGCTCCCTCTCTCCGCGTCGTCAC
TCCCATCCGCCGTCGCCTTCCCTCGCGCCGCGCCCGGAAGTGTTGTTGCCGCCGTTTGTCGAGCTCTGCCGAGCCCAGATCGAAGCCGCCACATCTCTTCCCCTT
GTTTCGCGCGATTTCGGCCAAGCCCGATCCGGCGCGTCCAACAGCTTGAGGTTCGTTTTCGAGCATCCTCGCCTCTGTCCAGCGGCGTTTGGACCCCATTCTGGA
CCGTGGAAGCGTCGCCTAGCCTTGTTGATTCGGGCCATTTGCAGCCGCTGTACAGCACCTAATTGGGTCTGTTTGGTGCTGATTGACCTTGGACAGCACCTGTTT
AAGGAGTTTCAGTGCTGTGTTAGGTTGTTCCAACGAGGTTCAACTCCTGTTTGGGAGCCCTTGGTTGTGTTGTTGGTTTGCTTAGAGCTTGCTTTGTATTTTCCA
GCAGCGCTGGTATTTAAGAGTGCTATGCTTGGGATAAGCATGAAAATGACTTACCGTGGTTGTACTGATACCCCCTTCCTCACCTTCCCCAACATTTTAGATGTT
GCAGGTATCGAGGATGATCCAGACCTTGGTGGCGAGGAGGACTATGAGGAAAATCCTAGGCTACCCCATCTTCTCAGTCGATGGCTCGTGGTCGTGGTCGTGGAC
GGGTTGTGGCGTGGACCCAGGAATCCAGCTAACAGACAGGAGGACCATGTCGATGATGCCCCTGCTCCGCCGGAGATTAACCAGCCGATTCCTCCTAGTCAGCAC
GAAGTTGACCCTCCGCCCCCTCCAGTCCCCCGTGCTCCTCGTAGGCAGCAAGAGGTTGTTCCCCCAGCACCGCCTTCAGTGGTTCCCCACCACCACCTCCAGTCC
CGCACATGTTGA
Protein sequence
MVGGKNRGKRVKLFKTVFFGKKKEKETRKWRNCRMIPEKKAGLRGYTLVCHRSVKIMQEWARKKSVVAKSGFSKYDRWVSKLSWQELKTWRGEHSSDHKVRKKAE
ENEEKRRMADSGRHWQCRHVWSETKFWVMCHRKISKNAGKEMKEKDHEENQTKNEILLDGEWTQQQMQMEVIIFRWRCKLIFRNEQDEINPSHVQCQGVGPWIWK
RLGINIGLFNGLILIELVEVHLGEKSGKQDWCGGIFNGPILKELVEAQYSFAIPNVFRFTIGAQSGIFVVWDPVMDKVRKRLASWKKSFFSKGGRLTLIRFVLSG
FRAPCSMCKTLEKLMKDFLWEGVYEGKSIHLVSWDLVGRPLSRGGGLRIIVSKYDPHPSEWVMGGIKGTSRNPWKEISHELPSLTSFVCYSVGNGEDTYFWEDRW
VGDRPLCATFPRLYHLSSMKNHPVVEVLSPLGSSLSYSFGFLRSLSDRDTTDLLSLLSLIGEFSFSTTRRDFHLWIPSPSIDFSCRSFHCLLDPSPFDLFIFSML
WKILCLIGPFCCILCRMAEEDLDHILWSCRFAKAVWDEFFASFGLQFARHRGLREMIEEFLSHPPFRDQGNFLWQAGICTIIWRLWGERNNRTFSGMERDATELK
ELGQFGGRTILAGHWWPRLLTPTYLSQIIRKQNNPLRAYQLFKEASCRYPDYRHNGPVYAAMINILGNSGRFFEMREVINQMKDDSCECKDSVFSFAIKTYASHG
LLEEGISLFKSLGRFNCTNRTQSFNTLLEILLNESQLDAACQLFQQSSYGWEVKSRTHSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQD
GRLNEAIHLLYSMFWRISRRGSGGDIVVYRTLLFALCDNGEIKQAVEILGKILRKGLKTPKRAHYQIDLDQCRDSNLTVGEIKSSINEALIKGGIPSSDSYCAMA
VDLYNENETNQGDKVVSHMLAKGFTPPSSIYEAKVTALCKEGKVDDAVKVIEDETVKRSCVPTIALYNIILKGLCDEGKSTVAMEYLKKMAKQVGLVADKETYSS
LVNGLCSENRYIEACKILEEMVIKSYWPSSNTFNTVIRGLCSVGKQYEAAMWMEEMISQGQLPELSVWNSLVSSLCFNVAGTDISFEGVRNFIKTDLVFSPIKTL
AISRIALRPSHLPDCVISFFLETDLTLPLPLSVTPPPQPKSRVAAVRAIRRATSLCVRFFSVSSLSPRRHSHPPSPSLAPRPEVLLPPFVELCRAQIEAATSLPL
VSRDFGQARSGASNSLRFVFEHPRLCPAAFGPHSGPWKRRLALLIRAICSRCTAPNWVCLVLIDLGQHLFKEFQCCVRLFQRGSTPVWEPLVVLLVCLELALYFP
AALVFKSAMLGISMKMTYRGCTDTPFLTFPNILDVAGIEDDPDLGGEEDYEENPRLPHLLSRWLVVVVVDGLWRGPRNPANRQEDHVDDAPAPPEINQPIPPSQH
EVDPPPPPVPRAPRRQQEVVPPAPPSVVPHHHLQSRTC