; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G07920 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G07920
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr6:6692105..6693598
RNA-Seq ExpressionCSPI06G07920
SyntenyCSPI06G07920
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140638.1 pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis sativus]2.2e-29299.6Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMI ILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIE+AVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
        CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV

Query:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
Subjt:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

XP_008459832.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cucumis melo]8.0e-25096.2Show/hide
Query:  MIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR
        MI ILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKS GRFNCTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VKSR
Subjt:  MIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR

Query:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGK
        TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKG GGDIVIYRTLLFALCDNGEIE+AVEILGK
Subjt:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGK

Query:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD
        ILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPS IYEAK ASLCKEGKVDD
Subjt:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD

Query:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS
        AVKVIEEQIVG CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAK+VGLVANKETYSTLVHGLC ENRY EACKVLEEMVIKSF PCSNTFNTLIKGLCS
Subjt:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS

Query:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        VGK YEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCS+VL
Subjt:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

XP_011656819.1 pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cucumis sativus]6.5e-26099.55Show/hide
Query:  MIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR
        MI ILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR
Subjt:  MIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR

Query:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGK
        TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIE+AVEILGK
Subjt:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGK

Query:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD
        ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD
Subjt:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD

Query:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS
        AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS
Subjt:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS

Query:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
Subjt:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

XP_016902498.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis melo]3.7e-27995.57Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        M++RWPRILTPT LSQIIRKQNNP TAYQLFKEAKCRYPDYRHNGPVYA MI ILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKS GRFNCTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRISRKG GGDIVIYRTLLFALCDNGEIE+AVEILGKILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSY
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
        CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPS IYEAK ASLCKEGKVDDAVKVIEEQIVG CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAK+VGLV
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV

Query:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        ANKETYSTLVHGLC ENRY EACKVLEEMVIKSF PCSNTFNTLIKGLCSVGK YEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCS+VL
Subjt:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

XP_038874865.1 pentatricopeptide repeat-containing protein At1g05600 [Benincasa hispida]6.3e-26390.54Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        M++RWPRILTPT LSQIIRKQNNP TAYQLFKEAK RYPDYRHNGPVYA MI ILGNSGRVSEMREV+DQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        +LFKS G+FNCTNRTQTFNTLLEILL ESQL AACQLFQE SYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYP+RLSYLI+MKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRIS++GGGGDIVIYRTLLFALCDNG+IE+AVEILGKILRKGLKAPKRAHYRIDLDQCRNS LTI EIK LINEALIKGGIPSSDS+
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIV-GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGL
        CAMAVDLYNEN+TDQGDKVVSHM+AKGFRPPS I+EAK A+LCKEGKVDDAVKVIEE+IV G CVPT+ALYNIVLKGLCD+GKSTVAMEYLKKMAKQVGL
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIV-GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGL

Query:  VANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRV
        VA+K TYSTLVHGLC ENRYIEACKVLEEMVIKSF PCSNTFNTLI+GLCSVGK YEAVMWLEEMISQGQLPHV VWNSLVSSLCCDVAG +M S V
Subjt:  VANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRV

TrEMBL top hitse value%identityAlignment
A0A0A0K9Q4 Uncharacterized protein1.1e-29299.6Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMI ILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIE+AVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
        CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV

Query:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
Subjt:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

A0A1S3CB38 pentatricopeptide repeat-containing protein At1g05600 isoform X23.9e-25096.2Show/hide
Query:  MIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR
        MI ILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKS GRFNCTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VKSR
Subjt:  MIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR

Query:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGK
        TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKG GGDIVIYRTLLFALCDNGEIE+AVEILGK
Subjt:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGK

Query:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD
        ILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPS IYEAK ASLCKEGKVDD
Subjt:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD

Query:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS
        AVKVIEEQIVG CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAK+VGLVANKETYSTLVHGLC ENRY EACKVLEEMVIKSF PCSNTFNTLIKGLCS
Subjt:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS

Query:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        VGK YEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCS+VL
Subjt:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

A0A1S4E2N8 pentatricopeptide repeat-containing protein At1g05600 isoform X11.8e-27995.57Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        M++RWPRILTPT LSQIIRKQNNP TAYQLFKEAKCRYPDYRHNGPVYA MI ILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKS GRFNCTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRISRKG GGDIVIYRTLLFALCDNGEIE+AVEILGKILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSY
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
        CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPS IYEAK ASLCKEGKVDDAVKVIEEQIVG CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAK+VGLV
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV

Query:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        ANKETYSTLVHGLC ENRY EACKVLEEMVIKSF PCSNTFNTLIKGLCSVGK YEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCS+VL
Subjt:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

A0A5D3DM62 Pentatricopeptide repeat-containing protein1.8e-27995.57Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        M++RWPRILTPT LSQIIRKQNNP TAYQLFKEAKCRYPDYRHNGPVYA MI ILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKS GRFNCTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRISRKG GGDIVIYRTLLFALCDNGEIE+AVEILGKILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSY
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
        CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPS IYEAK ASLCKEGKVDDAVKVIEEQIVG CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAK+VGLV
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV

Query:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        ANKETYSTLVHGLC ENRY EACKVLEEMVIKSF PCSNTFNTLIKGLCSVGK YEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCS+VL
Subjt:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

A0A6J1DVG7 pentatricopeptide repeat-containing protein At1g056002.4e-24784.58Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        M++RWPR+LTPT LSQIIRKQNNP TA+QLFKEAKCRYP YRHNGPVYA MI ILGNSGRV EMREV+DQM+DDSCECKDSVFSFAIKTYASHGLLE+GI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKS GRFNCTNRTQTFNTLLEILL ES+L AACQLFQ+ SYGWGVKSRTQSLNLLMQSLCQR QSELALH+FQEMDYQ CYPNRLSYLI+MKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEA+HLLYSMFWRIS++G GGDIVIYRTLL+ALC NGE+E+AVEILGKILRKGLKAPKR HYRIDLDQC+NS LTI+EIK L NEALIKGGIPS  +Y
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIV-GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGL
        CAMAVDLYNEN TDQGDKVVSHM+AKGFRPPS IYEAKAA+LCKEGKVDDA++VI+E+ V G C+P++ALYNIVLKGL ++GKSTVA+EYLKKMAKQVGL
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIV-GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGL

Query:  VANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDM
        VA+KETYS LVHGLC ENRYIEACK+LEEMVIKS+ PCS+TFNTLI GLCS+GK YEAVMWLEEMISQGQLP + VWNSLVSS+C +VAG D+
Subjt:  VANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDM

SwissProt top hitse value%identityAlignment
O49436 Pentatricopeptide repeat-containing protein At4g200901.9e-4423.96Show/hide
Query:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLF-KSFG
        ++P    +++        + ++FK A  +   ++      ++MI+   NSG    + +++ ++R ++    +  F    + Y    L +  + LF +   
Subjt:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLF-KSFG

Query:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQ---ECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNE
         F C    ++FN++L +++ E   H   + +      +    +     S NL++++LC+    + A+ VF+ M  + C P+  +Y  +M GLC++ R++E
Subjt:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQ---ECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMA
        A+ LL  M      +G     VIY  L+  LC  G++ +  +++  +  KG    +  +  +    C    L  ++  SL+   +    IP+  +Y  + 
Subjt:  AIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMA

Query:  VDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKE
          L  + +     +++S M  +G+     IY    + L KEGK ++A+ +  +    GC P I +Y++++ GLC +GK   A E L +M    G + N  
Subjt:  VDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKE

Query:  TYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC
        TYS+L+ G        EA +V +EM           ++ LI GLC VG+  EA+M   +M++ G  P    ++S++  LC
Subjt:  TYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC

O81908 Pentatricopeptide repeat-containing protein At1g02060, chloroplastic9.1e-3925.49Show/hide
Query:  YRHNGPVYATMIKILGNSGRVSEMREVM--DQMRDDSC-ECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQL
        + H    +  M++ LG +  ++  R  +   + R + C + +D  F+  I++Y + GL ++ + LF++  +   +    TFN+LL ILLK  +   A  L
Subjt:  YRHNGPVYATMIKILGNSGRVSEMREVM--DQMRDDSC-ECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQL

Query:  FQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCD
        F E    +GV   + + N L+   C+    + A  +F++M+   C P+ ++Y  ++ GLC+ G++  A ++L  M  + +      ++V Y TL+   C 
Subjt:  FQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCD

Query:  NGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNEN----------KTDQGDKVVSHMIAKG
          EI++AV +   +L +GLK P    Y   +     ++   +EIK ++        I  +D++   A D    N            D   KV   M+   
Subjt:  NGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNEN----------KTDQGDKVVSHMIAKG

Query:  FRPPSLIYEAKAASLCKEGKVDDAV----KVIEEQIVGG---CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYI
          P S  Y     +LC   + D A     ++ E++++ G   C P  A YN + + LC +GK+  A +  +++ K+   V +  +Y TL+ G C E ++ 
Subjt:  FRPPSLIYEAKAASLCKEGKVDDAV----KVIEEQIVGG---CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYI

Query:  EACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSL
         A ++L  M+ + F P   T+  LI GL  +G+   A   L+ M+    LP    ++S+++ L
Subjt:  EACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSL

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g745804.7e-4323.84Show/hide
Query:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDD-SCECKDSVFSFAIKTYASHGLLEDGISLFKSFG
        L P  ++ +I+ Q +P  A ++F   + +   ++H    Y ++I+ LG  G+   M EV+  MR++      + V+  A+K Y   G +++ +++F+   
Subjt:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDD-SCECKDSVFSFAIKTYASHGLLEDGISLFKSFG

Query:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIH
         ++C     ++N ++ +L+       A +++       G+     S  + M+S C+  +   AL +   M  Q C  N ++Y  V+ G  ++    E   
Subjt:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIH

Query:  LLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLD-QCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVD
          Y +F ++   G    +  +  LL  LC  G++++  ++L K++++G+  P    Y + +   C+   L  +    ++   + +G  P   +Y  +   
Subjt:  LLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLD-QCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVD

Query:  LYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETY
        L   +K  + +  +  M+ +G  P S  Y    A  CK G V  A +++ + +  G VP    Y  ++ GLC +G++  A+    + A   G+  N   Y
Subjt:  LYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETY

Query:  STLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLV
        +TL+ GL  +   +EA ++  EM  K   P   TFN L+ GLC +G   +A   ++ MIS+G  P +  +N L+
Subjt:  STLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLV

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic4.2e-4423.68Show/hide
Query:  TPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFK-SFGR
        T   L   +R Q +   A +LF  A  + P++     +Y  ++  LG SG   +M+++++ M+   CE   S F   I++YA   L ++ +S+       
Subjt:  TPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFK-SFGR

Query:  FNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV
        F     T  +N +L +L+  + L    ++       WG+K    + N+L+++LC                                   + G  + AL +
Subjt:  FNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV

Query:  FQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRN
         ++M    C  + +S  +++ G C++GR+ +A++ +  M    ++ G   D   + TL+  LC  G ++ A+EI+  +L++G       +  +    C+ 
Subjt:  FQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRN

Query:  SNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIV
            ++E   ++++ + +   P++ +Y  +   L  EN+ ++  ++   + +KG  P    + +    LC       A+++ EE    GC P    YN++
Subjt:  SNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIV

Query:  LKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHV
        +  LC  GK   A+  LK+M +  G   +  TY+TL+ G C  N+  EA ++ +EM +      S T+NTLI GLC   +  +A   +++MI +GQ P  
Subjt:  LKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHV

Query:  CVWNSLVSSLC
          +NSL++  C
Subjt:  CVWNSLVSSLC

Q9SYK1 Pentatricopeptide repeat-containing protein At1g056004.1e-16455.94Show/hide
Query:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL
        +RWPR+LTP+ LSQI++KQ NP TA +LF+EAK R+P Y HNG VYATMI ILG S RV EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISL
Subjt:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL

Query:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL
        FKS   FNC N + +F+TLL+ ++KES+L AAC +F++  YGW V SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY I+MKG C +G+L
Subjt:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL

Query:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA
         EA HLLYSMFWRIS+KG G DIV+YR LL ALCD GE++ A+EILGKILRKGLKAPKR ++ I+     +S+  IE +K L+ E LI+G IP  DSY A
Subjt:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA

Query:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA
        MA DL+ E K  +G++V+  M +KGF P   IY AK  +LC+ GK+ +AV VI +E + G C+PT+ +YN+++KGLCDDGKS  A+ YLKKM+KQV  VA
Subjt:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA

Query:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL
        N+ETY TLV GLC + +++EA +V+EEM+IKS  P   T++ +IKGLC + + YEAVMWLEEM+SQ  +P   VW +L  S+C C +  +++   ++
Subjt:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL

Arabidopsis top hitse value%identityAlignment
AT1G05600.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.9e-16555.94Show/hide
Query:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL
        +RWPR+LTP+ LSQI++KQ NP TA +LF+EAK R+P Y HNG VYATMI ILG S RV EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISL
Subjt:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL

Query:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL
        FKS   FNC N + +F+TLL+ ++KES+L AAC +F++  YGW V SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY I+MKG C +G+L
Subjt:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL

Query:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA
         EA HLLYSMFWRIS+KG G DIV+YR LL ALCD GE++ A+EILGKILRKGLKAPKR ++ I+     +S+  IE +K L+ E LI+G IP  DSY A
Subjt:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA

Query:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA
        MA DL+ E K  +G++V+  M +KGF P   IY AK  +LC+ GK+ +AV VI +E + G C+PT+ +YN+++KGLCDDGKS  A+ YLKKM+KQV  VA
Subjt:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA

Query:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL
        N+ETY TLV GLC + +++EA +V+EEM+IKS  P   T++ +IKGLC + + YEAVMWLEEM+SQ  +P   VW +L  S+C C +  +++   ++
Subjt:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL

AT1G05600.2 Tetratricopeptide repeat (TPR)-like superfamily protein2.9e-16555.94Show/hide
Query:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL
        +RWPR+LTP+ LSQI++KQ NP TA +LF+EAK R+P Y HNG VYATMI ILG S RV EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISL
Subjt:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL

Query:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL
        FKS   FNC N + +F+TLL+ ++KES+L AAC +F++  YGW V SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY I+MKG C +G+L
Subjt:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL

Query:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA
         EA HLLYSMFWRIS+KG G DIV+YR LL ALCD GE++ A+EILGKILRKGLKAPKR ++ I+     +S+  IE +K L+ E LI+G IP  DSY A
Subjt:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA

Query:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA
        MA DL+ E K  +G++V+  M +KGF P   IY AK  +LC+ GK+ +AV VI +E + G C+PT+ +YN+++KGLCDDGKS  A+ YLKKM+KQV  VA
Subjt:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA

Query:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL
        N+ETY TLV GLC + +++EA +V+EEM+IKS  P   T++ +IKGLC + + YEAVMWLEEM+SQ  +P   VW +L  S+C C +  +++   ++
Subjt:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein3.3e-4423.84Show/hide
Query:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDD-SCECKDSVFSFAIKTYASHGLLEDGISLFKSFG
        L P  ++ +I+ Q +P  A ++F   + +   ++H    Y ++I+ LG  G+   M EV+  MR++      + V+  A+K Y   G +++ +++F+   
Subjt:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDD-SCECKDSVFSFAIKTYASHGLLEDGISLFKSFG

Query:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIH
         ++C     ++N ++ +L+       A +++       G+     S  + M+S C+  +   AL +   M  Q C  N ++Y  V+ G  ++    E   
Subjt:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIH

Query:  LLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLD-QCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVD
          Y +F ++   G    +  +  LL  LC  G++++  ++L K++++G+  P    Y + +   C+   L  +    ++   + +G  P   +Y  +   
Subjt:  LLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLD-QCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVD

Query:  LYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETY
        L   +K  + +  +  M+ +G  P S  Y    A  CK G V  A +++ + +  G VP    Y  ++ GLC +G++  A+    + A   G+  N   Y
Subjt:  LYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETY

Query:  STLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLV
        +TL+ GL  +   +EA ++  EM  K   P   TFN L+ GLC +G   +A   ++ MIS+G  P +  +N L+
Subjt:  STLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLV

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-4523.68Show/hide
Query:  TPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFK-SFGR
        T   L   +R Q +   A +LF  A  + P++     +Y  ++  LG SG   +M+++++ M+   CE   S F   I++YA   L ++ +S+       
Subjt:  TPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFK-SFGR

Query:  FNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV
        F     T  +N +L +L+  + L    ++       WG+K    + N+L+++LC                                   + G  + AL +
Subjt:  FNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV

Query:  FQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRN
         ++M    C  + +S  +++ G C++GR+ +A++ +  M    ++ G   D   + TL+  LC  G ++ A+EI+  +L++G       +  +    C+ 
Subjt:  FQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRN

Query:  SNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIV
            ++E   ++++ + +   P++ +Y  +   L  EN+ ++  ++   + +KG  P    + +    LC       A+++ EE    GC P    YN++
Subjt:  SNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIV

Query:  LKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHV
        +  LC  GK   A+  LK+M +  G   +  TY+TL+ G C  N+  EA ++ +EM +      S T+NTLI GLC   +  +A   +++MI +GQ P  
Subjt:  LKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHV

Query:  CVWNSLVSSLC
          +NSL++  C
Subjt:  CVWNSLVSSLC

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-4523.96Show/hide
Query:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLF-KSFG
        ++P    +++        + ++FK A  +   ++      ++MI+   NSG    + +++ ++R ++    +  F    + Y    L +  + LF +   
Subjt:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLF-KSFG

Query:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQ---ECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNE
         F C    ++FN++L +++ E   H   + +      +    +     S NL++++LC+    + A+ VF+ M  + C P+  +Y  +M GLC++ R++E
Subjt:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQ---ECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMA
        A+ LL  M      +G     VIY  L+  LC  G++ +  +++  +  KG    +  +  +    C    L  ++  SL+   +    IP+  +Y  + 
Subjt:  AIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMA

Query:  VDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKE
          L  + +     +++S M  +G+     IY    + L KEGK ++A+ +  +    GC P I +Y++++ GLC +GK   A E L +M    G + N  
Subjt:  VDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKE

Query:  TYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC
        TYS+L+ G        EA +V +EM           ++ LI GLC VG+  EA+M   +M++ G  P    ++S++  LC
Subjt:  TYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATTAGGTGGCCAAGGATTTTAACGCCCACATGTTTATCTCAGATTATTAGGAAGCAGAATAATCCCCAAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAG
GTACCCAGATTATCGGCACAATGGTCCAGTGTATGCCACAATGATCAAGATACTCGGAAACTCGGGCAGAGTTTCCGAGATGAGAGAAGTGATGGATCAGATGAGAGATG
ACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGATGGTATATCTCTTTTTAAAAGTTTTGGGAGATTTAAC
TGTACCAATAGAACACAAACTTTCAATACCCTTTTAGAAATCCTGTTGAAGGAATCTCAGCTTCATGCTGCTTGTCAGCTTTTTCAGGAGTGTTCTTATGGTTGGGGAGT
GAAATCCAGGACTCAGTCCTTGAATTTGTTGATGCAATCTCTCTGCCAAAGAGGTCAGTCGGAGCTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGCTGCTATC
CAAATAGACTGAGTTATTTGATTGTAATGAAAGGACTGTGTCAAGATGGTAGGCTTAACGAGGCCATCCATTTGTTGTATTCCATGTTTTGGAGAATTTCCCGAAAGGGT
GGTGGCGGGGACATAGTAATTTATAGAACCCTTCTGTTTGCTTTGTGTGATAATGGAGAGATCGAGAAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTAAA
AGCCCCTAAACGGGCCCATTACCGAATCGACTTAGATCAATGCAGGAATAGCAACCTCACCATTGAGGAAATCAAGAGTTTAATCAATGAAGCTTTAATCAAAGGCGGAA
TTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAACAAGACTGATCAGGGAGATAAAGTGGTTAGCCACATGATAGCTAAAGGCTTCAGGCCA
CCATCCTTGATCTATGAAGCGAAAGCAGCTTCATTATGCAAAGAAGGCAAAGTTGACGATGCAGTCAAAGTAATTGAAGAGCAAATAGTGGGAGGCTGTGTTCCAACTAT
TGCATTGTACAACATCGTTCTGAAGGGTCTTTGTGATGATGGAAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCCAACAAAG
AAACTTACAGCACTTTAGTACATGGACTTTGTCTCGAAAATCGATATATTGAAGCATGTAAGGTTTTAGAGGAGATGGTAATCAAATCGTTTTGCCCTTGCTCTAACACA
TTCAATACACTTATCAAAGGTCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGTGGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTGTGTCTGGAA
TTCTTTGGTTTCATCTTTGTGTTGCGATGTGGCTGGCATCGATATGTGTTCCAGGGTTTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTATTAGGTGGCCAAGGATTTTAACGCCCACATGTTTATCTCAGATTATTAGGAAGCAGAATAATCCCCAAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAG
GTACCCAGATTATCGGCACAATGGTCCAGTGTATGCCACAATGATCAAGATACTCGGAAACTCGGGCAGAGTTTCCGAGATGAGAGAAGTGATGGATCAGATGAGAGATG
ACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGATGGTATATCTCTTTTTAAAAGTTTTGGGAGATTTAAC
TGTACCAATAGAACACAAACTTTCAATACCCTTTTAGAAATCCTGTTGAAGGAATCTCAGCTTCATGCTGCTTGTCAGCTTTTTCAGGAGTGTTCTTATGGTTGGGGAGT
GAAATCCAGGACTCAGTCCTTGAATTTGTTGATGCAATCTCTCTGCCAAAGAGGTCAGTCGGAGCTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGCTGCTATC
CAAATAGACTGAGTTATTTGATTGTAATGAAAGGACTGTGTCAAGATGGTAGGCTTAACGAGGCCATCCATTTGTTGTATTCCATGTTTTGGAGAATTTCCCGAAAGGGT
GGTGGCGGGGACATAGTAATTTATAGAACCCTTCTGTTTGCTTTGTGTGATAATGGAGAGATCGAGAAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTAAA
AGCCCCTAAACGGGCCCATTACCGAATCGACTTAGATCAATGCAGGAATAGCAACCTCACCATTGAGGAAATCAAGAGTTTAATCAATGAAGCTTTAATCAAAGGCGGAA
TTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAACAAGACTGATCAGGGAGATAAAGTGGTTAGCCACATGATAGCTAAAGGCTTCAGGCCA
CCATCCTTGATCTATGAAGCGAAAGCAGCTTCATTATGCAAAGAAGGCAAAGTTGACGATGCAGTCAAAGTAATTGAAGAGCAAATAGTGGGAGGCTGTGTTCCAACTAT
TGCATTGTACAACATCGTTCTGAAGGGTCTTTGTGATGATGGAAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCCAACAAAG
AAACTTACAGCACTTTAGTACATGGACTTTGTCTCGAAAATCGATATATTGAAGCATGTAAGGTTTTAGAGGAGATGGTAATCAAATCGTTTTGCCCTTGCTCTAACACA
TTCAATACACTTATCAAAGGTCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGTGGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTGTGTCTGGAA
TTCTTTGGTTTCATCTTTGTGTTGCGATGTGGCTGGCATCGATATGTGTTCCAGGGTTTTATGA
Protein sequenceShow/hide protein sequence
MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMIKILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFN
CTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKG
GGGDIVIYRTLLFALCDNGEIEKAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRP
PSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNT
FNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL