; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G001590 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G001590
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmo_Chr08:920999..922513
RNA-Seq ExpressionCmoCh08G001590
SyntenyCmoCh08G001590
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592948.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]3.7e-29599.8Show/hide
Query:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
        MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
Subjt:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI

Query:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
        SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
Subjt:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG

Query:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
        KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
Subjt:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY

Query:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
        CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
Subjt:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL

Query:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ
        VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTL RGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ
Subjt:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ

Query:  IRSC
        IRSC
Subjt:  IRSC

KAG7025356.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.5e-29199.6Show/hide
Query:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
        MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
Subjt:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI

Query:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
        SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
Subjt:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG

Query:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
        KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
Subjt:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY

Query:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
        CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
Subjt:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL

Query:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVL
        VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTL RGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKV+
Subjt:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVL

XP_022959953.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita moschata]7.5e-296100Show/hide
Query:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
        MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
Subjt:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI

Query:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
        SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
Subjt:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG

Query:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
        KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
Subjt:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY

Query:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
        CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
Subjt:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL

Query:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ
        VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ
Subjt:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ

Query:  IRSC
        IRSC
Subjt:  IRSC

XP_023004266.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita maxima]7.2e-29198.21Show/hide
Query:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
        MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
Subjt:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI

Query:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
        SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
Subjt:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG

Query:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
        KLHEAIHLLYSMFWRISR+GSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLK+PKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
Subjt:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY

Query:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
        CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCR GKSTVAMEFLKKM KQVGL
Subjt:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL

Query:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ
        VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTL RGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVA T MWSKVL+Q
Subjt:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ

Query:  IRSC
        I+SC
Subjt:  IRSC

XP_023514107.1 pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucurbita pepo subsp. pepo]4.5e-29399.01Show/hide
Query:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
        MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
Subjt:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI

Query:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
        SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
Subjt:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG

Query:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
        KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLK+PKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
Subjt:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY

Query:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
        CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCR G STVAMEFLKKMVKQVGL
Subjt:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL

Query:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ
        VADKETYSTLVHGLCRENRY EACKLLEEMVIKSHWPCSNTFNTL RGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ
Subjt:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ

Query:  IRSC
        IRSC
Subjt:  IRSC

TrEMBL top hitse value%identityAlignment
A0A1S4E2N8 pentatricopeptide repeat-containing protein At1g05600 isoform X18.4e-25386.55Show/hide
Query:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
        MTVRWPR+LTPT+LSQIIRKQNNP TAYQLF EAKCRYP+Y+HNGPVYAAMINILGNSGR+SEMREV+DQM+ DSC+CKDS+FSFAIKTYASHGLLE+GI
Subjt:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI

Query:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
        SLFKSLG FNCT+RTQTFNTLLEILLNESQL AACQLFQ+ S+GWEVKSRTQSLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLI+MKGLCQDG
Subjt:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG

Query:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
        +L+EAIHLLYSMFWRISR+GSGGDIVIYRTLLFALCDNGEIEQAVEILGKIL+KGLKAPKRAHY IDL+ CR +KLT+ EIK LINEALIKGGIPSSDSY
Subjt:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY

Query:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
        CAMA+DLYNEN+TDQGDKVVSHM+AKGF PPSS+YEAK A+LCKEGKVDDAVKVIEE+ V GSCVPT+ALYNIVL GLC  GKSTVAME+LKKM K+VGL
Subjt:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL

Query:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVL
        VA+KETYSTLVHGLCRENRYTEACK+LEEMVIKS WPCSNTFNTL +GLCSVGK Y+AVM LEEMISQGQLP + VWN+LVSSLC +VA  DM SKVL
Subjt:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVL

A0A5D3DM62 Pentatricopeptide repeat-containing protein8.4e-25386.55Show/hide
Query:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
        MTVRWPR+LTPT+LSQIIRKQNNP TAYQLF EAKCRYP+Y+HNGPVYAAMINILGNSGR+SEMREV+DQM+ DSC+CKDS+FSFAIKTYASHGLLE+GI
Subjt:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI

Query:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
        SLFKSLG FNCT+RTQTFNTLLEILLNESQL AACQLFQ+ S+GWEVKSRTQSLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLI+MKGLCQDG
Subjt:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG

Query:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
        +L+EAIHLLYSMFWRISR+GSGGDIVIYRTLLFALCDNGEIEQAVEILGKIL+KGLKAPKRAHY IDL+ CR +KLT+ EIK LINEALIKGGIPSSDSY
Subjt:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY

Query:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
        CAMA+DLYNEN+TDQGDKVVSHM+AKGF PPSS+YEAK A+LCKEGKVDDAVKVIEE+ V GSCVPT+ALYNIVL GLC  GKSTVAME+LKKM K+VGL
Subjt:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL

Query:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVL
        VA+KETYSTLVHGLCRENRYTEACK+LEEMVIKS WPCSNTFNTL +GLCSVGK Y+AVM LEEMISQGQLP + VWN+LVSSLC +VA  DM SKVL
Subjt:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVL

A0A6J1DVG7 pentatricopeptide repeat-containing protein At1g056003.0e-25084.69Show/hide
Query:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
        MTVRWPR+LTPTHLSQIIRKQNNPFTA+QLF EAKCRYP Y+HNGPVYAAMI+ILGNSGR+ EMREVIDQMK DSC+CKDS+FSFAIKTYASHGLLEEGI
Subjt:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI

Query:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
        SLFKSLG FNCT+RTQTFNTLLEILLNES+LDAACQLFQQSS+GW VKSRTQSLNLLMQSLCQR QSELALH+F+EMDYQ CYPNRLSYLILMKGLCQDG
Subjt:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG

Query:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
        +L+EA+HLLYSMFWRIS+RGSGGDIVIYRTLL+ALC NGE+EQAVEILGKIL+KGLKAPKR HY IDL+ C+ SKLT+ EIK L NEALIKGGIPS  +Y
Subjt:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY

Query:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
        CAMA+DLYNEN TDQGDKVVSHMLAKGF PPSS+YEAKAAALCKEGKVDDA++VI+EETVKGSC+P+VALYNIVL GL   GKSTVA+E+LKKM KQVGL
Subjt:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL

Query:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMW-SKVLR
        VADKETYS LVHGLC ENRY EACK+LEEMVIKS+ PCS+TFNTL  GLCS+GK Y+AVM LEEMISQGQLPELSVWN+LVSS+CFNVA TD+W S+VL+
Subjt:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMW-SKVLR

Query:  QIR
        QIR
Subjt:  QIR

A0A6J1H9J7 pentatricopeptide repeat-containing protein At1g056003.6e-296100Show/hide
Query:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
        MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
Subjt:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI

Query:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
        SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
Subjt:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG

Query:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
        KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
Subjt:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY

Query:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
        CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
Subjt:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL

Query:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ
        VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ
Subjt:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ

Query:  IRSC
        IRSC
Subjt:  IRSC

A0A6J1KU44 pentatricopeptide repeat-containing protein At1g056003.5e-29198.21Show/hide
Query:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
        MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI
Subjt:  MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGI

Query:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
        SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG
Subjt:  SLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDG

Query:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
        KLHEAIHLLYSMFWRISR+GSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLK+PKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY
Subjt:  KLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY

Query:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL
        CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCR GKSTVAMEFLKKM KQVGL
Subjt:  CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGL

Query:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ
        VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTL RGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVA T MWSKVL+Q
Subjt:  VADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQ

Query:  IRSC
        I+SC
Subjt:  IRSC

SwissProt top hitse value%identityAlignment
O49436 Pentatricopeptide repeat-containing protein At4g200901.5e-4124.32Show/hide
Query:  AAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLF-KSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQ---QSSFG
        ++MI    NSG    + +++ ++++++    +  F    + Y    L ++ + LF + +  F C    ++FN++L +++NE       + +     S+  
Subjt:  AAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLF-KSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQ---QSSFG

Query:  WEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQA
          +     S NL++++LC+    + A+ VF+ M  + C P+  +Y  LM GLC++ ++ EA+ LL  M       G     VIY  L+  LC  G++ + 
Subjt:  WEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQA

Query:  VEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCK
         +++  +  KG    +  +  +    C   KL   +   L+   +    IP+  +Y  +   L  +       +++S M  +G+     +Y    + L K
Subjt:  VEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCK

Query:  EGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNT---
        EGK ++A+ +  +   KG C P + +Y+++++GLCR GK   A E L +M+   G + +  TYS+L+ G  +     EA ++ +EM       CS     
Subjt:  EGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNT---

Query:  FNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLC
        ++ L  GLC VG+  +A+M   +M++ G  P+   +++++  LC
Subjt:  FNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLC

Q3EDF8 Pentatricopeptide repeat-containing protein At1g099002.7e-3826.1Show/hide
Query:  IKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNR
        I+ +   G   +   + + L G        T+N ++       +++ A  +  + S    V     + N +++SLC  G+ + A+ V   M  + CYP+ 
Subjt:  IKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNR

Query:  LSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKL----------
        ++Y IL++  C+D  +  A+ LL  M      RG   D+V Y  L+  +C  G +++A++ L  +   G +     H +I  + C   +           
Subjt:  LSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKL----------

Query:  -------TVTEIKCLINEALIKGGI----------------PSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVK
               +V     LIN    KG +                P+S SY  +      E + D+  + +  M+++G +P    Y     ALCK+GKV+DAV+
Subjt:  -------TVTEIKCLINEALIKGGI----------------PSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVK

Query:  VIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVG
        ++ + + KG C P +  YN V++GL + GK+  A++ L +M +   L  D  TYS+LV GL RE +  EA K   E       P + TFN++  GLC   
Subjt:  VIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVG

Query:  KPYKAVMCLEEMISQGQLPELSVWNALVSSLCF
        +  +A+  L  MI++G  P  + +  L+  L +
Subjt:  KPYKAVMCLEEMISQGQLPELSVWNALVSSLCF

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g745801.1e-4422.26Show/hide
Query:  LTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVD-SCQCKDSIFSFAIKTYASHGLLEEGISLFKSLG
        L P H++ +I+ Q +P  A ++FN  + +   ++H    Y ++I  LG  G+   M EV+  M+ +      + ++  A+K Y   G ++E +++F+ + 
Subjt:  LTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVD-SCQCKDSIFSFAIKTYASHGLLEEGISLFKSLG

Query:  GFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSC-------------------------
         ++C     ++N ++ +L++    D A +++ +      +     S  + M+S C+  +   AL +   M  Q C                         
Subjt:  GFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSC-------------------------

Query:  ---------------------------------------------YPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCD
                                                      PN  +Y + ++GLCQ G+L  A+ ++  +      +G   D++ Y  L++ LC 
Subjt:  ---------------------------------------------YPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCD

Query:  NGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEA
        N + ++A   LGK++ +GL+     +  +   YC+   + + E   ++ +A+  G +P   +Y ++   L +E ET++   + +  L KG  P   +Y  
Subjt:  NGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEA

Query:  KAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWP
            L  +G + +A ++  E + KG  +P V  +NI++NGLC++G  + A   +K M+ + G   D  T++ L+HG   + +   A ++L+ M+     P
Subjt:  KAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWP

Query:  CSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLC
           T+N+L  GLC   K    +   + M+ +G  P L  +N L+ SLC
Subjt:  CSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLC

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic2.1e-4323.83Show/hide
Query:  TPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFK-SLGG
        T   L   +R Q +   A +LFN A  + PN+     +Y  ++  LG SG   +M+++++ MK   C+   S F   I++YA   L +E +S+    +  
Subjt:  TPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFK-SLGG

Query:  FNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV
        F     T  +N +L +L++ + L    ++       W +K    + N+L+++LC                                   + G  + AL +
Subjt:  FNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV

Query:  FKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRI
         ++M    C  + +S  +++ G C++G++ +A++ +  M    ++ G   D   + TL+  LC  G ++ A+EI+  +L++G       +  +    C++
Subjt:  FKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRI

Query:  SKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNI
         +  V E   ++++ + +   P++ +Y  +   L  EN+ ++  ++   + +KG  P    + +    LC       A+++ EE   KG C P    YN+
Subjt:  SKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNI

Query:  VLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPE
        +++ LC  GK   A+  LK+M +  G      TY+TL+ G C+ N+  EA ++ +EM +      S T+NTL  GLC   +   A   +++MI +GQ P+
Subjt:  VLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPE

Query:  LSVWNALVSSLC
           +N+L++  C
Subjt:  LSVWNALVSSLC

Q9SYK1 Pentatricopeptide repeat-containing protein At1g056005.8e-15855.12Show/hide
Query:  VRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISL
        VRWPR+LTP+ LSQI++KQ NP TA +LF EAK R+P+Y HNG VYA MI+ILG S R+ EM+ VI++MK DSC+CKDS+F+  I+T++  G LE+ ISL
Subjt:  VRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISL

Query:  FKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKL
        FKSL  FNC + + +F+TLL+ ++ ES+L+AAC +F++  +GWEV SR  +LNLLM+ LCQ  +S+LA  VF+EM+YQ CYP+R SY ILMKG C +GKL
Subjt:  FKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKL

Query:  HEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCA
         EA HLLYSMFWRIS++GSG DIV+YR LL ALCD GE++ A+EILGKIL+KGLKAPKR ++ I+  +   S   +  +K L+ E LI+G IP  DSY A
Subjt:  HEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCA

Query:  MAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVA
        MA DL+ E +  +G++V+  M +KGF P   +Y AK  ALC+ GK+ +AV VI +E ++G C+PTV +YN+++ GLC  GKS  A+ +LKKM KQV  VA
Subjt:  MAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVA

Query:  DKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVAD
        ++ETY TLV GLCR+ ++ EA +++EEM+IKSH+P   T++ + +GLC + + Y+AVM LEEM+SQ  +PE SVW AL  S+CF   D
Subjt:  DKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVAD

Arabidopsis top hitse value%identityAlignment
AT1G05600.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.1e-15955.12Show/hide
Query:  VRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISL
        VRWPR+LTP+ LSQI++KQ NP TA +LF EAK R+P+Y HNG VYA MI+ILG S R+ EM+ VI++MK DSC+CKDS+F+  I+T++  G LE+ ISL
Subjt:  VRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISL

Query:  FKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKL
        FKSL  FNC + + +F+TLL+ ++ ES+L+AAC +F++  +GWEV SR  +LNLLM+ LCQ  +S+LA  VF+EM+YQ CYP+R SY ILMKG C +GKL
Subjt:  FKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKL

Query:  HEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCA
         EA HLLYSMFWRIS++GSG DIV+YR LL ALCD GE++ A+EILGKIL+KGLKAPKR ++ I+  +   S   +  +K L+ E LI+G IP  DSY A
Subjt:  HEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCA

Query:  MAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVA
        MA DL+ E +  +G++V+  M +KGF P   +Y AK  ALC+ GK+ +AV VI +E ++G C+PTV +YN+++ GLC  GKS  A+ +LKKM KQV  VA
Subjt:  MAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVA

Query:  DKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVAD
        ++ETY TLV GLCR+ ++ EA +++EEM+IKSH+P   T++ + +GLC + + Y+AVM LEEM+SQ  +PE SVW AL  S+CF   D
Subjt:  DKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVAD

AT1G05600.2 Tetratricopeptide repeat (TPR)-like superfamily protein4.1e-15955.12Show/hide
Query:  VRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISL
        VRWPR+LTP+ LSQI++KQ NP TA +LF EAK R+P+Y HNG VYA MI+ILG S R+ EM+ VI++MK DSC+CKDS+F+  I+T++  G LE+ ISL
Subjt:  VRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISL

Query:  FKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKL
        FKSL  FNC + + +F+TLL+ ++ ES+L+AAC +F++  +GWEV SR  +LNLLM+ LCQ  +S+LA  VF+EM+YQ CYP+R SY ILMKG C +GKL
Subjt:  FKSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKL

Query:  HEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCA
         EA HLLYSMFWRIS++GSG DIV+YR LL ALCD GE++ A+EILGKIL+KGLKAPKR ++ I+  +   S   +  +K L+ E LI+G IP  DSY A
Subjt:  HEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCA

Query:  MAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVA
        MA DL+ E +  +G++V+  M +KGF P   +Y AK  ALC+ GK+ +AV VI +E ++G C+PTV +YN+++ GLC  GKS  A+ +LKKM KQV  VA
Subjt:  MAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVA

Query:  DKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVAD
        ++ETY TLV GLCR+ ++ EA +++EEM+IKSH+P   T++ + +GLC + + Y+AVM LEEM+SQ  +PE SVW AL  S+CF   D
Subjt:  DKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVAD

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein8.0e-4622.26Show/hide
Query:  LTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVD-SCQCKDSIFSFAIKTYASHGLLEEGISLFKSLG
        L P H++ +I+ Q +P  A ++FN  + +   ++H    Y ++I  LG  G+   M EV+  M+ +      + ++  A+K Y   G ++E +++F+ + 
Subjt:  LTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVD-SCQCKDSIFSFAIKTYASHGLLEEGISLFKSLG

Query:  GFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSC-------------------------
         ++C     ++N ++ +L++    D A +++ +      +     S  + M+S C+  +   AL +   M  Q C                         
Subjt:  GFNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSC-------------------------

Query:  ---------------------------------------------YPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCD
                                                      PN  +Y + ++GLCQ G+L  A+ ++  +      +G   D++ Y  L++ LC 
Subjt:  ---------------------------------------------YPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCD

Query:  NGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEA
        N + ++A   LGK++ +GL+     +  +   YC+   + + E   ++ +A+  G +P   +Y ++   L +E ET++   + +  L KG  P   +Y  
Subjt:  NGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEA

Query:  KAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWP
            L  +G + +A ++  E + KG  +P V  +NI++NGLC++G  + A   +K M+ + G   D  T++ L+HG   + +   A ++L+ M+     P
Subjt:  KAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWP

Query:  CSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLC
           T+N+L  GLC   K    +   + M+ +G  P L  +N L+ SLC
Subjt:  CSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLC

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-4423.83Show/hide
Query:  TPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFK-SLGG
        T   L   +R Q +   A +LFN A  + PN+     +Y  ++  LG SG   +M+++++ MK   C+   S F   I++YA   L +E +S+    +  
Subjt:  TPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFK-SLGG

Query:  FNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV
        F     T  +N +L +L++ + L    ++       W +K    + N+L+++LC                                   + G  + AL +
Subjt:  FNCTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV

Query:  FKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRI
         ++M    C  + +S  +++ G C++G++ +A++ +  M    ++ G   D   + TL+  LC  G ++ A+EI+  +L++G       +  +    C++
Subjt:  FKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRI

Query:  SKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNI
         +  V E   ++++ + +   P++ +Y  +   L  EN+ ++  ++   + +KG  P    + +    LC       A+++ EE   KG C P    YN+
Subjt:  SKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNI

Query:  VLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPE
        +++ LC  GK   A+  LK+M +  G      TY+TL+ G C+ N+  EA ++ +EM +      S T+NTL  GLC   +   A   +++MI +GQ P+
Subjt:  VLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPE

Query:  LSVWNALVSSLC
           +N+L++  C
Subjt:  LSVWNALVSSLC

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-4224.32Show/hide
Query:  AAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLF-KSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQ---QSSFG
        ++MI    NSG    + +++ ++++++    +  F    + Y    L ++ + LF + +  F C    ++FN++L +++NE       + +     S+  
Subjt:  AAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLF-KSLGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQ---QSSFG

Query:  WEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQA
          +     S NL++++LC+    + A+ VF+ M  + C P+  +Y  LM GLC++ ++ EA+ LL  M       G     VIY  L+  LC  G++ + 
Subjt:  WEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQA

Query:  VEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCK
         +++  +  KG    +  +  +    C   KL   +   L+   +    IP+  +Y  +   L  +       +++S M  +G+     +Y    + L K
Subjt:  VEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCK

Query:  EGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNT---
        EGK ++A+ +  +   KG C P + +Y+++++GLCR GK   A E L +M+   G + +  TYS+L+ G  +     EA ++ +EM       CS     
Subjt:  EGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNT---

Query:  FNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLC
        ++ L  GLC VG+  +A+M   +M++ G  P+   +++++  LC
Subjt:  FNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGTTAGGTGGCCAAGGCTTTTAACGCCCACACACCTGTCTCAGATTATTAGGAAGCAGAACAATCCTTTCACAGCTTACCAACTGTTCAATGAAGCCAAATGTAG
GTATCCAAATTATCAGCACAATGGTCCGGTGTACGCCGCAATGATCAATATACTTGGAAACTCGGGTAGAATTTCCGAGATGAGGGAAGTGATAGATCAGATGAAAGTTG
ACTCTTGTCAGTGCAAGGATTCTATATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGAAGGTATATCTCTGTTTAAAAGCCTTGGGGGATTTAAC
TGTACCGATAGAACACAAACTTTCAATACCCTTTTGGAAATACTCTTGAATGAATCTCAGCTCGATGCTGCTTGTCAGCTTTTTCAGCAGAGTTCATTTGGTTGGGAAGT
GAAATCCAGGACTCAGTCATTGAATTTGCTAATGCAATCTCTTTGTCAAAGAGGCCAATCTGAACTTGCTTTACATGTCTTTAAAGAAATGGATTACCAAAGTTGCTATC
CTAATAGGCTGAGTTATTTGATTCTAATGAAAGGACTGTGTCAAGATGGTAAGCTTCATGAGGCCATCCATTTATTGTATTCCATGTTCTGGAGGATTTCTCGAAGGGGT
AGCGGAGGGGACATAGTAATTTACAGAACCCTTCTGTTTGCTTTGTGTGATAATGGAGAGATTGAGCAAGCTGTGGAAATACTAGGCAAGATCTTGAAGAAAGGACTGAA
AGCCCCTAAGCGAGCTCATTACTTGATTGACCTCAACTACTGCAGGATTAGCAAGCTCACCGTCACGGAAATCAAGTGTTTAATCAATGAAGCTTTAATCAAAGGTGGAA
TTCCCAGTTCAGATAGTTATTGTGCCATGGCTATCGATCTATATAACGAAAACGAGACTGATCAGGGAGATAAAGTTGTTAGCCACATGCTAGCTAAAGGCTTTTGGCCA
CCATCCTCAGTCTATGAGGCGAAAGCAGCTGCATTATGCAAAGAAGGGAAAGTTGATGATGCAGTGAAAGTAATTGAAGAGGAAACGGTGAAGGGAAGTTGCGTTCCAAC
CGTTGCGTTGTATAACATCGTTCTGAATGGTCTTTGTAGGGTGGGCAAGTCAACAGTGGCTATGGAGTTCTTGAAGAAAATGGTAAAGCAGGTTGGTCTTGTTGCAGACA
AGGAGACTTATAGCACTTTAGTACATGGTCTTTGTCGTGAGAATAGATACACTGAAGCATGTAAGTTGTTGGAGGAGATGGTTATCAAATCACATTGGCCTTGTTCTAAC
ACATTCAATACACTTACCAGAGGTCTTTGCTCGGTGGGAAAACCATATAAAGCAGTGATGTGCTTGGAAGAAATGATTAGCCAAGGCCAATTGCCGGAACTTTCTGTTTG
GAATGCTTTGGTTTCATCTTTGTGTTTCAATGTGGCTGACACTGATATGTGGTCTAAGGTCTTACGACAGATACGAAGTTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGTTAGGTGGCCAAGGCTTTTAACGCCCACACACCTGTCTCAGATTATTAGGAAGCAGAACAATCCTTTCACAGCTTACCAACTGTTCAATGAAGCCAAATGTAG
GTATCCAAATTATCAGCACAATGGTCCGGTGTACGCCGCAATGATCAATATACTTGGAAACTCGGGTAGAATTTCCGAGATGAGGGAAGTGATAGATCAGATGAAAGTTG
ACTCTTGTCAGTGCAAGGATTCTATATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGAAGGTATATCTCTGTTTAAAAGCCTTGGGGGATTTAAC
TGTACCGATAGAACACAAACTTTCAATACCCTTTTGGAAATACTCTTGAATGAATCTCAGCTCGATGCTGCTTGTCAGCTTTTTCAGCAGAGTTCATTTGGTTGGGAAGT
GAAATCCAGGACTCAGTCATTGAATTTGCTAATGCAATCTCTTTGTCAAAGAGGCCAATCTGAACTTGCTTTACATGTCTTTAAAGAAATGGATTACCAAAGTTGCTATC
CTAATAGGCTGAGTTATTTGATTCTAATGAAAGGACTGTGTCAAGATGGTAAGCTTCATGAGGCCATCCATTTATTGTATTCCATGTTCTGGAGGATTTCTCGAAGGGGT
AGCGGAGGGGACATAGTAATTTACAGAACCCTTCTGTTTGCTTTGTGTGATAATGGAGAGATTGAGCAAGCTGTGGAAATACTAGGCAAGATCTTGAAGAAAGGACTGAA
AGCCCCTAAGCGAGCTCATTACTTGATTGACCTCAACTACTGCAGGATTAGCAAGCTCACCGTCACGGAAATCAAGTGTTTAATCAATGAAGCTTTAATCAAAGGTGGAA
TTCCCAGTTCAGATAGTTATTGTGCCATGGCTATCGATCTATATAACGAAAACGAGACTGATCAGGGAGATAAAGTTGTTAGCCACATGCTAGCTAAAGGCTTTTGGCCA
CCATCCTCAGTCTATGAGGCGAAAGCAGCTGCATTATGCAAAGAAGGGAAAGTTGATGATGCAGTGAAAGTAATTGAAGAGGAAACGGTGAAGGGAAGTTGCGTTCCAAC
CGTTGCGTTGTATAACATCGTTCTGAATGGTCTTTGTAGGGTGGGCAAGTCAACAGTGGCTATGGAGTTCTTGAAGAAAATGGTAAAGCAGGTTGGTCTTGTTGCAGACA
AGGAGACTTATAGCACTTTAGTACATGGTCTTTGTCGTGAGAATAGATACACTGAAGCATGTAAGTTGTTGGAGGAGATGGTTATCAAATCACATTGGCCTTGTTCTAAC
ACATTCAATACACTTACCAGAGGTCTTTGCTCGGTGGGAAAACCATATAAAGCAGTGATGTGCTTGGAAGAAATGATTAGCCAAGGCCAATTGCCGGAACTTTCTGTTTG
GAATGCTTTGGTTTCATCTTTGTGTTTCAATGTGGCTGACACTGATATGTGGTCTAAGGTCTTACGACAGATACGAAGTTGTTGA
Protein sequenceShow/hide protein sequence
MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFN
CTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRG
SGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWP
PSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSN
TFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQIRSC