CuGenDBv2

Clc06G02450 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G02450
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: ClcChr06:2391240..2395176
RNA-Seq Expression: Clc06G02450
Synteny: Clc06G02450
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6592948.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 7.4e-243; identity 87.68%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQNNP TAYQLF EAKCRYP+Y+HNGPVYAAMI+ILGNSGRISEMREV+DQM+ DSC+CKDS+FSFAIKTYAS GLLE+GISLFKSLG FNCTDRTQTFN
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLLEILLN+SQLDAACQLFQQ S+GWEVKSRTQSLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLILMKGLCQDG+L+EAIHLLYSMFWRIS+R
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        G GGDIVIYRTLLFALCDNGEIEQAVEILGKIL+KGLKAPKRAHY IDL+ CR SKLT+ EIK LINEALIKGGIPSSDSYCAMA+DLYNENE DQGDKV
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        VSHM+AKGF PP S+YEAKAAALCKEGKVDDAVKVIEEE VKGS VPTVALYNIVL GLC  GKSTVAME+LKKM KQVGLVA+K TYSTLVHGLCRENR
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL
        Y EACK+LEEMVIKS  PCSNTFNTLIRGLCSVGK Y+AVM LEEMISQGQLP +SVWN+LVSSLC N+  T MWSKVL
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL

XP_004140638.1 pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis sativus]; e value 1.9e-254; identity 91.02%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQNNP TAYQLFKEAKCRYPDYRHNGPVYA MI+ILGNSGR+SEMREV+DQMRDDSCECKDSVFSFAIKTYAS GLLEDGISLFKS GRFNCT+RTQTFN
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLLEILL +SQL AACQLFQ+CSYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLI+MKGLCQDGRLNEAIHLLYSMFWRIS++
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNS LTI EIKSLINEALIKGGIPSSDSYCAMAVDLYNEN+ DQGDKV
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        VSHM+AKGFRPP  IYEAKAA+LCKEGKVDDAVKVIEE+IV G  VPT+ALYNIVLKGLCD+GKSTVAMEYLKKMAKQVGLVANK TYSTLVHGLC ENR
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL
        Y+EACKVLEEMVIKSF PCSNTFNTLI+GLCSVGKHYEAVMWLEEMISQGQLPHV VWNSLVSSLCC++ G  M S+VL
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL

XP_016902498.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis melo]; e value 5.3e-257; identity 92.07%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMI+ILGNSGR+SEMREV+DQMRDDSCECKDSVFSFAIKTYAS GLLEDGISLFKSLGRFNCT+RTQTFN
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLLEILLN+SQL AACQLFQ+CSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLI+MKGLCQDGRLNEAIHLLYSMFWRIS++
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        G GGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+KLTI EIKSLINEALIKGGIPSSDSYCAMAVDLYNEN+ DQGDKV
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        VSHM+AKGFRPP SIYEAK A+LCKEGKVDDAVKVIEE+IV GS VPT+ALYNIVLKGLCD+GKSTVAMEYLKKMAK+VGLVANK TYSTLVHGLCRENR
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL
        Y EACKVLEEMVIKSF PCSNTFNTLI+GLCSVGK YEAVMWLEEMISQGQLPHV VWNSLVSSLCC++ G  M SKVL
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL

XP_023004266.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita maxima]; e value 1.5e-243; identity 87.68%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQNNP TAYQLF EAKCRYP+Y+HNGPVYAAMI+ILGNSGRISEMREV+DQM+ DSC+CKDS+FSFAIKTYAS GLLE+GISLFKSLG FNCTDRTQTFN
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLLEILLN+SQLDAACQLFQQ S+GWEVKSRTQSLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLILMKGLCQDG+L+EAIHLLYSMFWRIS++
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        G GGDIVIYRTLLFALCDNGEIEQAVEILGKIL+KGLK+PKRAHY IDL+ CR SKLT+ EIK LINEALIKGGIPSSDSYCAMA+DLYNENE DQGDKV
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        VSHM+AKGF PP S+YEAKAAALCKEGKVDDAVKVIEEE VKGS VPTVALYNIVL GLC  GKSTVAME+LKKMAKQVGLVA+K TYSTLVHGLCRENR
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL
        Y EACK+LEEMVIKS  PCSNTFNTLIRGLCSVGK Y+AVM LEEMISQGQLP +SVWN+LVSSLC N+ GT MWSKVL
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL

XP_038874865.1 pentatricopeptide repeat-containing protein At1g05600 [Benincasa hispida]; e value 3.2e-262; identity 93.93%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQNNPLTAYQLFKEAK RYPDYRHNGPVYAAMIDILGNSGR+SEMREV+DQMRDDSCECKDSVFSFAIKTYAS GLLEDGI+LFKSLG+FNCT+RTQTFN
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLLEILLN+SQLDAACQLFQ+ SYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYP+RLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        GGGGDIVIYRTLLFALCDNG+IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIK LINEALIKGGIPSSDS+CAMAVDLYNENE DQGDKV
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        VSHMVAKGFRPP SI+EAK AALCKEGKVDDAVKVIEEEIVKGS VPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVA+KGTYSTLVHGLCRENR
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKV
        Y+EACKVLEEMVIKSF PCSNTFNTLIRGLCSVGK YEAVMWLEEMISQGQLPHVSVWNSLVSSLCC++ GT MWS+V
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKV

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0K9Q4 Uncharacterized protein; e value 9.1e-255; identity 91.02%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQNNP TAYQLFKEAKCRYPDYRHNGPVYA MI+ILGNSGR+SEMREV+DQMRDDSCECKDSVFSFAIKTYAS GLLEDGISLFKS GRFNCT+RTQTFN
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLLEILL +SQL AACQLFQ+CSYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLI+MKGLCQDGRLNEAIHLLYSMFWRIS++
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNS LTI EIKSLINEALIKGGIPSSDSYCAMAVDLYNEN+ DQGDKV
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        VSHM+AKGFRPP  IYEAKAA+LCKEGKVDDAVKVIEE+IV G  VPT+ALYNIVLKGLCD+GKSTVAMEYLKKMAKQVGLVANK TYSTLVHGLC ENR
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL
        Y+EACKVLEEMVIKSF PCSNTFNTLI+GLCSVGKHYEAVMWLEEMISQGQLPHV VWNSLVSSLCC++ G  M S+VL
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL

A0A1S4E2N8 pentatricopeptide repeat-containing protein At1g05600 isoform X1; e value 2.5e-257; identity 92.07%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMI+ILGNSGR+SEMREV+DQMRDDSCECKDSVFSFAIKTYAS GLLEDGISLFKSLGRFNCT+RTQTFN
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLLEILLN+SQL AACQLFQ+CSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLI+MKGLCQDGRLNEAIHLLYSMFWRIS++
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        G GGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+KLTI EIKSLINEALIKGGIPSSDSYCAMAVDLYNEN+ DQGDKV
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        VSHM+AKGFRPP SIYEAK A+LCKEGKVDDAVKVIEE+IV GS VPT+ALYNIVLKGLCD+GKSTVAMEYLKKMAK+VGLVANK TYSTLVHGLCRENR
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL
        Y EACKVLEEMVIKSF PCSNTFNTLI+GLCSVGK YEAVMWLEEMISQGQLPHV VWNSLVSSLCC++ G  M SKVL
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL

A0A5D3DM62 Pentatricopeptide repeat-containing protein; e value 2.5e-257; identity 92.07%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMI+ILGNSGR+SEMREV+DQMRDDSCECKDSVFSFAIKTYAS GLLEDGISLFKSLGRFNCT+RTQTFN
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLLEILLN+SQL AACQLFQ+CSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLI+MKGLCQDGRLNEAIHLLYSMFWRIS++
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        G GGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+KLTI EIKSLINEALIKGGIPSSDSYCAMAVDLYNEN+ DQGDKV
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        VSHM+AKGFRPP SIYEAK A+LCKEGKVDDAVKVIEE+IV GS VPT+ALYNIVLKGLCD+GKSTVAMEYLKKMAK+VGLVANK TYSTLVHGLCRENR
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL
        Y EACKVLEEMVIKSF PCSNTFNTLI+GLCSVGK YEAVMWLEEMISQGQLPHV VWNSLVSSLCC++ G  M SKVL
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL

A0A6J1H9J7 pentatricopeptide repeat-containing protein At1g05600; e value 1.4e-242; identity 87.47%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQNNP TAYQLF EAKCRYP+Y+HNGPVYAAMI+ILGNSGRISEMREV+DQM+ DSC+CKDS+FSFAIKTYAS GLLE+GISLFKSLG FNCTDRTQTFN
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLLEILLN+SQLDAACQLFQQ S+GWEVKSRTQSLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLILMKGLCQDG+L+EAIHLLYSMFWRIS+R
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        G GGDIVIYRTLLFALCDNGEIEQAVEILGKIL+KGLKAPKRAHY IDL+ CR SKLT+ EIK LINEALIKGGIPSSDSYCAMA+DLYNENE DQGDKV
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        VSHM+AKGF PP S+YEAKAAALCKEGKVDDAVKVIEEE VKGS VPTVALYNIVL GLC  GKSTVAME+LKKM KQVGLVA+K TYSTLVHGLCRENR
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL
        Y EACK+LEEMVIKS  PCSNTFNTL RGLCSVGK Y+AVM LEEMISQGQLP +SVWN+LVSSLC N+  T MWSKVL
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL

A0A6J1KU44 pentatricopeptide repeat-containing protein At1g05600; e value 7.2e-244; identity 87.68%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQNNP TAYQLF EAKCRYP+Y+HNGPVYAAMI+ILGNSGRISEMREV+DQM+ DSC+CKDS+FSFAIKTYAS GLLE+GISLFKSLG FNCTDRTQTFN
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLLEILLN+SQLDAACQLFQQ S+GWEVKSRTQSLNLLMQSLCQRGQSELALHVF+EMDYQSCYPNRLSYLILMKGLCQDG+L+EAIHLLYSMFWRIS++
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        G GGDIVIYRTLLFALCDNGEIEQAVEILGKIL+KGLK+PKRAHY IDL+ CR SKLT+ EIK LINEALIKGGIPSSDSYCAMA+DLYNENE DQGDKV
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        VSHM+AKGF PP S+YEAKAAALCKEGKVDDAVKVIEEE VKGS VPTVALYNIVL GLC  GKSTVAME+LKKMAKQVGLVA+K TYSTLVHGLCRENR
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL
        Y EACK+LEEMVIKS  PCSNTFNTLIRGLCSVGK Y+AVM LEEMISQGQLP +SVWN+LVSSLC N+ GT MWSKVL
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL

SwissProt top hits (columns: hit, e value, % identity, alignment)
O49436 Pentatricopeptide repeat-containing protein At4g20090; e value 2.1e-43; identity 25.74%
Query:  NPLTA---YQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLF-KSLGRFNCTDRTQTF
        NPL A    ++FK A  +   ++      ++MI+   NSG    + ++L ++R ++    +  F    + Y    L +  + LF + +  F C    ++F
Subjt:  NPLTA---YQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLF-KSLGRFNCTDRTQTF

Query:  NTLLEILLNQSQLDAACQLFQ---QCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWR
        N++L +++N+       + +      +    +     S NL++++LC+    + A+ VF+ M  + C P+  +Y  LM GLC++ R++EA+ LL  M   
Subjt:  NTLLEILLNQSQLDAACQLFQ---QCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWR

Query:  ISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ
            G     VIY  L+  LC  G++ +  +++  +  KG    +  +  +    C   KL   +  SL+   +    IP+  +Y  +   L  +     
Subjt:  ISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ

Query:  GDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLC
          +++S M  +G+     IY    + L KEGK ++A+ +  +   KG   P + +Y++++ GLC EGK   A E L +M    G + N  TYS+L+ G  
Subjt:  GDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLC

Query:  RENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLC
        +     EA +V +EM     S     ++ LI GLC VG+  EA+M   +M++ G  P    ++S++  LC
Subjt:  RENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLC

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g74580; e value 1.8e-42; identity 24.45%
Query:  QNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDD-SCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        Q +P+ A ++F   + +   ++H    Y ++I+ LG  G+   M EVL  MR++      + V+  A+K Y  +G +++ +++F+ +  ++C     ++N
Subjt:  QNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDD-SCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKG------------------------
         ++ +L++    D A +++ +      +     S  + M+S C+  +   AL +   M  Q C  N ++Y  ++ G                        
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKG------------------------

Query:  -----------LCQDGRLNEAIHLLYSMFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEI--K
                   LC+ G + E   LL     ++ +RG   ++  Y   +  LC  GE++ AV ++G ++ +G K     +  +    C+NSK    E+   
Subjt:  -----------LCQDGRLNEAIHLLYSMFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEI--K

Query:  SLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEG
         ++NE L     P S +Y  +         +   +++V   V  GF P    Y +    LC EG+ + A+ +  E + KG   P V LYN ++KGL ++G
Subjt:  SLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEG

Query:  KSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVS
            A +   +M+++ GL+    T++ LV+GLC+     +A  +++ M+ K + P   TFN LI G  +  K   A+  L+ M+  G  P V  +NSL++
Subjt:  KSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVS

Query:  SLC
         LC
Subjt:  SLC

Q9FMF6 Pentatricopeptide repeat-containing protein At5g64320, mitochondrial; e value 7.9e-38; identity 25.05%
Query:  YRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGR-FNCTDRTQTFNTLLEILLNQSQLDAACQLFQ
        YRH+  VY  +I  LG +G    +  +L QM+D+    K+S+F   ++ Y   G       L   +   ++C    +++N +LEIL++ +    A  +F 
Subjt:  YRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGR-FNCTDRTQTFNTLLEILLNQSQLDAACQLFQ

Query:  QCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFW-------------------------
              ++     +  ++M++ C   + + AL + ++M    C PN + Y  L+  L +  R+NEA+ LL  MF                          
Subjt:  QCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFW-------------------------

Query:  ------RISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILR----------------------KGLKAPKRAHYRIDLDQCRNSKLTIGEIK----
              R+  RG   D + Y  L+  LC  G ++ A ++  +I +                      K + +     Y I  D C  + L  G  K    
Subjt:  ------RISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILR----------------------KGLKAPKRAHYRIDLDQCRNSKLTIGEIK----

Query:  ----SLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGL
             ++++   KG  P+  SY  +        +ID+   V++ M A G +P    +    +A CKE ++ +AV++  E   KG   P V  +N ++ GL
Subjt:  ----SLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGL

Query:  CDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWN
        C+  +   A+  L+ M  + G+VAN  TY+TL++   R     EA K++ EMV +       T+N+LI+GLC  G+  +A    E+M+  G  P     N
Subjt:  CDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWN

Query:  SLVSSLC
         L++ LC
Subjt:  SLVSSLC

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic; e value 3.1e-42; identity 24.21%
Query:  QNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFK-SLGRFNCTDRTQTFN
        Q +   A +LF  A  + P++     +Y  ++  LG SG   +M+++L+ M+   CE   S F   I++YA   L ++ +S+    +  F     T  +N
Subjt:  QNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFK-SLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHVFQEMDYQSCYP
         +L +L++ + L    ++       W +K    + N+L+++LC                                   + G  + AL + ++M    C  
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHVFQEMDYQSCYP

Query:  NRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIK--
        + +S  +++ G C++GR+ +A++ +  M    +Q G   D   + TL+  LC  G ++ A+EI+  +L++G       +  +    C+     +GE+K  
Subjt:  NRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIK--

Query:  -SLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDE
          ++++ + +   P++ +Y  +   L  EN++++  ++   + +KG  P    + +    LC       A+++ EE   KG   P    YN+++  LC +
Subjt:  -SLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDE

Query:  GKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLV
        GK   A+  LK+M +  G   +  TY+TL+ G C+ N+  EA ++ +EM +   S  S T+NTLI GLC   +  +A   +++MI +GQ P    +NSL+
Subjt:  GKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLV

Query:  SSLC
        +  C
Subjt:  SSLC

Q9SYK1 Pentatricopeptide repeat-containing protein At1g05600; e value 8.4e-157; identity 56.65%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQ NP+TA +LF+EAK R+P Y HNG VYA MIDILG S R+ EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISLFKSL  FNC + + +F+
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLL+ ++ +S+L+AAC +F++  YGWEV SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY ILMKG C +G+L EA HLLYSMFWRISQ+
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        G G DIV+YR LL ALCD GE++ A+EILGKILRKGLKAPKR ++ I+     +S   I  +K L+ E LI+G IP  DSY AMA DL+ E ++ +G++V
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        +  M +KGF P P IY AK  ALC+ GK+ +AV VI +E+++G  +PTV +YN+++KGLCD+GKS  A+ YLKKM+KQV  VAN+ TY TLV GLCR+ +
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLC
        ++EA +V+EEM+IKS  P   T++ +I+GLC + + YEAVMWLEEM+SQ  +P  SVW +L  S+C
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLC

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G05600.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 6.0e-158; identity 56.65%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQ NP+TA +LF+EAK R+P Y HNG VYA MIDILG S R+ EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISLFKSL  FNC + + +F+
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLL+ ++ +S+L+AAC +F++  YGWEV SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY ILMKG C +G+L EA HLLYSMFWRISQ+
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        G G DIV+YR LL ALCD GE++ A+EILGKILRKGLKAPKR ++ I+     +S   I  +K L+ E LI+G IP  DSY AMA DL+ E ++ +G++V
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        +  M +KGF P P IY AK  ALC+ GK+ +AV VI +E+++G  +PTV +YN+++KGLCD+GKS  A+ YLKKM+KQV  VAN+ TY TLV GLCR+ +
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLC
        ++EA +V+EEM+IKS  P   T++ +I+GLC + + YEAVMWLEEM+SQ  +P  SVW +L  S+C
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLC

AT1G05600.2 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 6.0e-158; identity 56.65%
Query:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        KQ NP+TA +LF+EAK R+P Y HNG VYA MIDILG S R+ EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISLFKSL  FNC + + +F+
Subjt:  KQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR
        TLL+ ++ +S+L+AAC +F++  YGWEV SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY ILMKG C +G+L EA HLLYSMFWRISQ+
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQR

Query:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV
        G G DIV+YR LL ALCD GE++ A+EILGKILRKGLKAPKR ++ I+     +S   I  +K L+ E LI+G IP  DSY AMA DL+ E ++ +G++V
Subjt:  GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKV

Query:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR
        +  M +KGF P P IY AK  ALC+ GK+ +AV VI +E+++G  +PTV +YN+++KGLCD+GKS  A+ YLKKM+KQV  VAN+ TY TLV GLCR+ +
Subjt:  VSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENR

Query:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLC
        ++EA +V+EEM+IKS  P   T++ +I+GLC + + YEAVMWLEEM+SQ  +P  SVW +L  S+C
Subjt:  YVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLC

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 1.3e-43; identity 24.45%
Query:  QNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDD-SCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN
        Q +P+ A ++F   + +   ++H    Y ++I+ LG  G+   M EVL  MR++      + V+  A+K Y  +G +++ +++F+ +  ++C     ++N
Subjt:  QNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDD-SCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKG------------------------
         ++ +L++    D A +++ +      +     S  + M+S C+  +   AL +   M  Q C  N ++Y  ++ G                        
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKG------------------------

Query:  -----------LCQDGRLNEAIHLLYSMFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEI--K
                   LC+ G + E   LL     ++ +RG   ++  Y   +  LC  GE++ AV ++G ++ +G K     +  +    C+NSK    E+   
Subjt:  -----------LCQDGRLNEAIHLLYSMFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEI--K

Query:  SLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEG
         ++NE L     P S +Y  +         +   +++V   V  GF P    Y +    LC EG+ + A+ +  E + KG   P V LYN ++KGL ++G
Subjt:  SLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEG

Query:  KSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVS
            A +   +M+++ GL+    T++ LV+GLC+     +A  +++ M+ K + P   TFN LI G  +  K   A+  L+ M+  G  P V  +NSL++
Subjt:  KSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVS

Query:  SLC
         LC
Subjt:  SLC

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 2.2e-43; identity 24.21%
Query:  QNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFK-SLGRFNCTDRTQTFN
        Q +   A +LF  A  + P++     +Y  ++  LG SG   +M+++L+ M+   CE   S F   I++YA   L ++ +S+    +  F     T  +N
Subjt:  QNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFK-SLGRFNCTDRTQTFN

Query:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHVFQEMDYQSCYP
         +L +L++ + L    ++       W +K    + N+L+++LC                                   + G  + AL + ++M    C  
Subjt:  TLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHVFQEMDYQSCYP

Query:  NRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIK--
        + +S  +++ G C++GR+ +A++ +  M    +Q G   D   + TL+  LC  G ++ A+EI+  +L++G       +  +    C+     +GE+K  
Subjt:  NRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIK--

Query:  -SLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDE
          ++++ + +   P++ +Y  +   L  EN++++  ++   + +KG  P    + +    LC       A+++ EE   KG   P    YN+++  LC +
Subjt:  -SLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDE

Query:  GKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLV
        GK   A+  LK+M +  G   +  TY+TL+ G C+ N+  EA ++ +EM +   S  S T+NTLI GLC   +  +A   +++MI +GQ P    +NSL+
Subjt:  GKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLV

Query:  SSLC
        +  C
Subjt:  SSLC

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein    e-value: 1.5e-44    %identity: 25.74
Query:  NPLTA---YQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLF-KSLGRFNCTDRTQTF
        NPL A    ++FK A  +   ++      ++MI+   NSG    + ++L ++R ++    +  F    + Y    L +  + LF + +  F C    ++F
Subjt:  NPLTA---YQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLF-KSLGRFNCTDRTQTF

Query:  NTLLEILLNQSQLDAACQLFQ---QCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWR
        N++L +++N+       + +      +    +     S NL++++LC+    + A+ VF+ M  + C P+  +Y  LM GLC++ R++EA+ LL  M   
Subjt:  NTLLEILLNQSQLDAACQLFQ---QCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWR

Query:  ISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ
            G     VIY  L+  LC  G++ +  +++  +  KG    +  +  +    C   KL   +  SL+   +    IP+  +Y  +   L  +     
Subjt:  ISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ

Query:  GDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLC
          +++S M  +G+     IY    + L KEGK ++A+ +  +   KG   P + +Y++++ GLC EGK   A E L +M    G + N  TYS+L+ G  
Subjt:  GDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLC

Query:  RENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLC
        +     EA +V +EM     S     ++ LI GLC VG+  EA+M   +M++ G  P    ++S++  LC
Subjt:  RENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLC


Sequences
CDS sequence
ATGGATAATATGGAGGGCTCACTTGTGGGCGATATCTATTCCATTTGCCATGGAATATTCAACACCCCTTACCAAGCTAATTGCTCGCACTGGTGGTTTTGCGGCAACAG
AAACTCACGGTGTGCGAGATTGGTCAAGCACAGTGGCTACAGAACACCCGATCGGACGCAAACCAGACGAGATCAATCGGACAGAAACTCTGGGCTGAGCGCCACAAAGT
CTGGATGGTCGAACCAGCGAAAGGACGACGTTGCGGCGTGCACGGCGCAAGCGACAGCAGCCGCGACGTGGGTTGAAGACCGGCGGCTGTTGCAGGGAGAGAGAAAAATG
GAAGGGACCAGCGGCCAGCCTTCACGAGAAGACAACCTTGAGGATTGGTTGAAGATTGTTTGGGCAAGAAAGGAATTCCACTGCACAGGATCGGCTGAACAGTTTCCGTC
TATGCAAAATAGAGGAGGCGATATGCTTAGCCAAATTCAATCCGAGATAACTTCTTCAAGCGCCGAAGTTGATTTTGAGGTGGGACAGAAAATCCCACTAAGATGGAAGC
AGAATAATCCCTTAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAGGTACCCAGATTATCGGCACAATGGTCCGGTGTACGCCGCGATGATTGATATACTCGGAAAT
TCGGGGAGGATTTCTGAGATGAGAGAAGTGTTGGATCAGATGAGAGATGACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCAGGG
ATTATTGGAAGATGGTATATCTCTTTTTAAAAGTCTTGGGAGATTTAACTGTACCGATAGAACACAAACTTTTAATACCCTTTTGGAAATCCTGTTGAATCAATCTCAGC
TTGATGCTGCCTGTCAGCTTTTTCAGCAGTGTTCTTATGGTTGGGAAGTGAAATCCAGGACTCAGTCCTTGAATTTGCTGATGCAATCTCTCTGCCAGAGAGGCCAGTCT
GAACTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGTTGCTATCCAAATAGACTGAGTTATTTGATTTTAATGAAAGGATTGTGTCAAGATGGTAGGCTTAATGA
GGCCATCCATTTGTTGTATTCCATGTTTTGGCGGATTTCTCAAAGGGGTGGTGGAGGGGACATAGTAATTTACAGAACCCTGCTGTTTGCTTTGTGTGATAATGGAGAGA
TAGAGCAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTGAAAGCCCCTAAGCGAGCTCATTACCGGATTGACTTAGATCAATGCAGGAATAGCAAGCTCACT
ATTGGGGAAATCAAGAGTTTAATCAATGAAGCTTTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAATGAGATTGA
TCAGGGAGATAAAGTGGTTAGCCACATGGTAGCTAAAGGCTTCAGGCCACCGCCGTCGATCTATGAAGCAAAAGCGGCTGCATTATGCAAAGAAGGCAAAGTCGATGATG
CAGTGAAAGTAATTGAAGAGGAAATAGTGAAGGGAAGTGGTGTTCCAACTGTTGCATTGTACAACATAGTTCTGAAGGGTCTGTGTGATGAGGGCAAATCAACAGTGGCT
ATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCAAACAAAGGAACTTACAGCACTTTAGTACATGGACTTTGTCGTGAAAATCGATACGTTGAAGCATG
CAAGGTTTTAGAGGAGATGGTTATCAAATCGTTTTCGCCTTGTTCTAACACATTCAATACACTAATTAGAGGCCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGT
GGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTCTGTCTGGAATTCTTTGGTTTCATCTCTGTGTTGCAACCTGACTGGCACCGCTATGTGGTCCAAGGTT
TTATGA
mRNA sequence
ATGGATAATATGGAGGGCTCACTTGTGGGCGATATCTATTCCATTTGCCATGGAATATTCAACACCCCTTACCAAGCTAATTGCTCGCACTGGTGGTTTTGCGGCAACAG
AAACTCACGGTGTGCGAGATTGGTCAAGCACAGTGGCTACAGAACACCCGATCGGACGCAAACCAGACGAGATCAATCGGACAGAAACTCTGGGCTGAGCGCCACAAAGT
CTGGATGGTCGAACCAGCGAAAGGACGACGTTGCGGCGTGCACGGCGCAAGCGACAGCAGCCGCGACGTGGGTTGAAGACCGGCGGCTGTTGCAGGGAGAGAGAAAAATG
GAAGGGACCAGCGGCCAGCCTTCACGAGAAGACAACCTTGAGGATTGGTTGAAGATTGTTTGGGCAAGAAAGGAATTCCACTGCACAGGATCGGCTGAACAGTTTCCGTC
TATGCAAAATAGAGGAGGCGATATGCTTAGCCAAATTCAATCCGAGATAACTTCTTCAAGCGCCGAAGTTGATTTTGAGGTGGGACAGAAAATCCCACTAAGATGGAAGC
AGAATAATCCCTTAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAGGTACCCAGATTATCGGCACAATGGTCCGGTGTACGCCGCGATGATTGATATACTCGGAAAT
TCGGGGAGGATTTCTGAGATGAGAGAAGTGTTGGATCAGATGAGAGATGACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCAGGG
ATTATTGGAAGATGGTATATCTCTTTTTAAAAGTCTTGGGAGATTTAACTGTACCGATAGAACACAAACTTTTAATACCCTTTTGGAAATCCTGTTGAATCAATCTCAGC
TTGATGCTGCCTGTCAGCTTTTTCAGCAGTGTTCTTATGGTTGGGAAGTGAAATCCAGGACTCAGTCCTTGAATTTGCTGATGCAATCTCTCTGCCAGAGAGGCCAGTCT
GAACTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGTTGCTATCCAAATAGACTGAGTTATTTGATTTTAATGAAAGGATTGTGTCAAGATGGTAGGCTTAATGA
GGCCATCCATTTGTTGTATTCCATGTTTTGGCGGATTTCTCAAAGGGGTGGTGGAGGGGACATAGTAATTTACAGAACCCTGCTGTTTGCTTTGTGTGATAATGGAGAGA
TAGAGCAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTGAAAGCCCCTAAGCGAGCTCATTACCGGATTGACTTAGATCAATGCAGGAATAGCAAGCTCACT
ATTGGGGAAATCAAGAGTTTAATCAATGAAGCTTTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAATGAGATTGA
TCAGGGAGATAAAGTGGTTAGCCACATGGTAGCTAAAGGCTTCAGGCCACCGCCGTCGATCTATGAAGCAAAAGCGGCTGCATTATGCAAAGAAGGCAAAGTCGATGATG
CAGTGAAAGTAATTGAAGAGGAAATAGTGAAGGGAAGTGGTGTTCCAACTGTTGCATTGTACAACATAGTTCTGAAGGGTCTGTGTGATGAGGGCAAATCAACAGTGGCT
ATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCAAACAAAGGAACTTACAGCACTTTAGTACATGGACTTTGTCGTGAAAATCGATACGTTGAAGCATG
CAAGGTTTTAGAGGAGATGGTTATCAAATCGTTTTCGCCTTGTTCTAACACATTCAATACACTAATTAGAGGCCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGT
GGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTCTGTCTGGAATTCTTTGGTTTCATCTCTGTGTTGCAACCTGACTGGCACCGCTATGTGGTCCAAGGTT
TTATGA
Protein sequence
MDNMEGSLVGDIYSICHGIFNTPYQANCSHWWFCGNRNSRCARLVKHSGYRTPDRTQTRRDQSDRNSGLSATKSGWSNQRKDDVAACTAQATAAATWVEDRRLLQGERKM
EGTSGQPSREDNLEDWLKIVWARKEFHCTGSAEQFPSMQNRGGDMLSQIQSEITSSSAEVDFEVGQKIPLRWKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGN
SGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQS
ELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLT
IGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVA
MEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKV
L