; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy6G007970 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy6G007970
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationGy14Chr6:6708063..6709952
RNA-Seq ExpressionCsGy6G007970
SyntenyCsGy6G007970
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140638.1 pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis sativus]0.0100Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
        CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV

Query:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
Subjt:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

XP_008459832.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cucumis melo]0.096.64Show/hide
Query:  MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR
        MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKS GRFNCTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VKSR
Subjt:  MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR

Query:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGK
        TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKG GGDIVIYRTLLFALCDNGEIEQAVEILGK
Subjt:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGK

Query:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD
        ILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPS IYEAK ASLCKEGKVDD
Subjt:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD

Query:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS
        AVKVIEEQIVG CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAK+VGLVANKETYSTLVHGLC ENRY EACKVLEEMVIKSF PCSNTFNTLIKGLCS
Subjt:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS

Query:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        VGK YEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCS+VL
Subjt:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

XP_011656819.1 pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cucumis sativus]0.0100Show/hide
Query:  MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR
        MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR
Subjt:  MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR

Query:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGK
        TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGK
Subjt:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGK

Query:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD
        ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD
Subjt:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD

Query:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS
        AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS
Subjt:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS

Query:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
Subjt:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

XP_016902498.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis melo]0.095.98Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        M++RWPRILTPT LSQIIRKQNNP TAYQLFKEAKCRYPDYRHNGPVYA MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKS GRFNCTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRISRKG GGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSY
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
        CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPS IYEAK ASLCKEGKVDDAVKVIEEQIVG CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAK+VGLV
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV

Query:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        ANKETYSTLVHGLC ENRY EACKVLEEMVIKSF PCSNTFNTLIKGLCSVGK YEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCS+VL
Subjt:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

XP_038874865.1 pentatricopeptide repeat-containing protein At1g05600 [Benincasa hispida]0.090.74Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        M++RWPRILTPT LSQIIRKQNNP TAYQLFKEAK RYPDYRHNGPVYA MI+ILGNSGRVSEMREV+DQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        +LFKS G+FNCTNRTQTFNTLLEILL ESQL AACQLFQE SYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYP+RLSYLI+MKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRIS++GGGGDIVIYRTLLFALCDNG+IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNS LTI EIK LINEALIKGGIPSSDS+
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIV-GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGL
        CAMAVDLYNEN+TDQGDKVVSHM+AKGFRPPS I+EAK A+LCKEGKVDDAVKVIEE+IV G CVPT+ALYNIVLKGLCD+GKSTVAMEYLKKMAKQVGL
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIV-GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGL

Query:  VANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRV
        VA+K TYSTLVHGLC ENRYIEACKVLEEMVIKSF PCSNTFNTLI+GLCSVGK YEAVMWLEEMISQGQLPHV VWNSLVSSLCCDVAG +M S V
Subjt:  VANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRV

TrEMBL top hitse value%identityAlignment
A0A0A0K9Q4 Uncharacterized protein0.0100Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
        CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV

Query:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
Subjt:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

A0A1S3CB38 pentatricopeptide repeat-containing protein At1g05600 isoform X20.096.64Show/hide
Query:  MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR
        MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKS GRFNCTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VKSR
Subjt:  MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSR

Query:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGK
        TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKG GGDIVIYRTLLFALCDNGEIEQAVEILGK
Subjt:  TQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGK

Query:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD
        ILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPS IYEAK ASLCKEGKVDD
Subjt:  ILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD

Query:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS
        AVKVIEEQIVG CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAK+VGLVANKETYSTLVHGLC ENRY EACKVLEEMVIKSF PCSNTFNTLIKGLCS
Subjt:  AVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCS

Query:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        VGK YEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCS+VL
Subjt:  VGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

A0A1S4E2N8 pentatricopeptide repeat-containing protein At1g05600 isoform X10.095.98Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        M++RWPRILTPT LSQIIRKQNNP TAYQLFKEAKCRYPDYRHNGPVYA MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKS GRFNCTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRISRKG GGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSY
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
        CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPS IYEAK ASLCKEGKVDDAVKVIEEQIVG CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAK+VGLV
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV

Query:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        ANKETYSTLVHGLC ENRY EACKVLEEMVIKSF PCSNTFNTLIKGLCSVGK YEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCS+VL
Subjt:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

A0A5D3DM62 Pentatricopeptide repeat-containing protein0.095.98Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        M++RWPRILTPT LSQIIRKQNNP TAYQLFKEAKCRYPDYRHNGPVYA MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKS GRFNCTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEAIHLLYSMFWRISRKG GGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSY
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV
        CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPS IYEAK ASLCKEGKVDDAVKVIEEQIVG CVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAK+VGLV
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLV

Query:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
        ANKETYSTLVHGLC ENRY EACKVLEEMVIKSF PCSNTFNTLIKGLCSVGK YEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCS+VL
Subjt:  ANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL

A0A6J1DVG7 pentatricopeptide repeat-containing protein At1g056006.64e-31584.79Show/hide
Query:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI
        M++RWPR+LTPT LSQIIRKQNNP TA+QLFKEAKCRYP YRHNGPVYA MI+ILGNSGRV EMREV+DQM+DDSCECKDSVFSFAIKTYASHGLLE+GI
Subjt:  MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI

Query:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG
        SLFKS GRFNCTNRTQTFNTLLEILL ES+L AACQLFQ+ SYGWGVKSRTQSLNLLMQSLCQR QSELALH+FQEMDYQ CYPNRLSYLI+MKGLCQDG
Subjt:  SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDG

Query:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
        RLNEA+HLLYSMFWRIS++G GGDIVIYRTLL+ALC NGE+EQAVEILGKILRKGLKAPKR HYRIDLDQC+NS LTI+EIK L NEALIKGGIPS  +Y
Subjt:  RLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY

Query:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIV-GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGL
        CAMAVDLYNEN TDQGDKVVSHM+AKGFRPPS IYEAKAA+LCKEGKVDDA++VI+E+ V G C+P++ALYNIVLKGL ++GKSTVA+EYLKKMAKQVGL
Subjt:  CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIV-GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGL

Query:  VANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDM
        VA+KETYS LVHGLC ENRYIEACK+LEEMVIKS+ PCS+TFNTLI GLCS+GK YEAVMWLEEMISQGQLP + VWNSLVSS+C +VAG D+
Subjt:  VANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDM

SwissProt top hitse value%identityAlignment
O49436 Pentatricopeptide repeat-containing protein At4g200902.5e-4423.96Show/hide
Query:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLF-KSFG
        ++P    +++        + ++FK A  +   ++      ++MI    NSG    + +++ ++R ++    +  F    + Y    L +  + LF +   
Subjt:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLF-KSFG

Query:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQ---ECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNE
         F C    ++FN++L +++ E   H   + +      +    +     S NL++++LC+    + A+ VF+ M  + C P+  +Y  +M GLC++ R++E
Subjt:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQ---ECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMA
        A+ LL  M      +G     VIY  L+  LC  G++ +  +++  +  KG    +  +  +    C    L  ++  SL+   +    IP+  +Y  + 
Subjt:  AIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMA

Query:  VDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKE
          L  + +     +++S M  +G+     IY    + L KEGK ++A+ +  +    GC P I +Y++++ GLC +GK   A E L +M    G + N  
Subjt:  VDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKE

Query:  TYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC
        TYS+L+ G        EA +V +EM           ++ LI GLC VG+  EA+M   +M++ G  P    ++S++  LC
Subjt:  TYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g745803.6e-4323.84Show/hide
Query:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDD-SCECKDSVFSFAIKTYASHGLLEDGISLFKSFG
        L P  ++ +I+ Q +P  A ++F   + +   ++H    Y ++I  LG  G+   M EV+  MR++      + V+  A+K Y   G +++ +++F+   
Subjt:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDD-SCECKDSVFSFAIKTYASHGLLEDGISLFKSFG

Query:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIH
         ++C     ++N ++ +L+       A +++       G+     S  + M+S C+  +   AL +   M  Q C  N ++Y  V+ G  ++    E   
Subjt:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIH

Query:  LLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLD-QCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVD
          Y +F ++   G    +  +  LL  LC  G++++  ++L K++++G+  P    Y + +   C+   L  +    ++   + +G  P   +Y  +   
Subjt:  LLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLD-QCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVD

Query:  LYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETY
        L   +K  + +  +  M+ +G  P S  Y    A  CK G V  A +++ + +  G VP    Y  ++ GLC +G++  A+    + A   G+  N   Y
Subjt:  LYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETY

Query:  STLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLV
        +TL+ GL  +   +EA ++  EM  K   P   TFN L+ GLC +G   +A   ++ MIS+G  P +  +N L+
Subjt:  STLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLV

Q9FMF6 Pentatricopeptide repeat-containing protein At5g64320, mitochondrial5.4e-3923.79Show/hide
Query:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGR
        +TP  L +++    N  T+ +LF     +   YRH+  VY  +I  LG +G    +  ++ QM+D+    K+S+F   ++ Y   G       L      
Subjt:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGR

Query:  -FNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIH
         ++C    +++N +LEIL+  +    A  +F +      +     +  ++M++ C   + + AL + ++M    C PN + Y  ++  L +  R+NEA+ 
Subjt:  -FNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIH

Query:  LLYSMFW-------------------------------RISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILR----------------------K
        LL  MF                                R+  +G   D + Y  L+  LC  G ++ A ++  +I +                      K
Subjt:  LLYSMFW-------------------------------RISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILR----------------------K

Query:  GLKAPKRAHYRIDLDQCRNSNLTIEEIK--------SLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEG
         + +     Y I  D C  ++L     K         ++++   KG  P+  SY  +        K D+   V++ M A G +P ++ +    ++ CKE 
Subjt:  GLKAPKRAHYRIDLDQCRNSNLTIEEIK--------SLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEG

Query:  KVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIK
        ++ +AV++  E    GC P +  +N ++ GLC+  +   A+  L+ M  + G+VAN  TY+TL++         EA K++ EMV +       T+N+LIK
Subjt:  KVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIK

Query:  GLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC
        GLC  G+  +A    E+M+  G  P     N L++ LC
Subjt:  GLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic3.2e-4423.68Show/hide
Query:  TPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFK-SFGR
        T   L   +R Q +   A +LF  A  + P++     +Y  ++  LG SG   +M+++++ M+   CE   S F   I++YA   L ++ +S+       
Subjt:  TPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFK-SFGR

Query:  FNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV
        F     T  +N +L +L+  + L    ++       WG+K    + N+L+++LC                                   + G  + AL +
Subjt:  FNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV

Query:  FQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN
         ++M    C  + +S  +++ G C++GR+ +A++ +  M    ++ G   D   + TL+  LC  G ++ A+EI+  +L++G       +  +    C+ 
Subjt:  FQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN

Query:  SNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIV
            ++E   ++++ + +   P++ +Y  +   L  EN+ ++  ++   + +KG  P    + +    LC       A+++ EE    GC P    YN++
Subjt:  SNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIV

Query:  LKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHV
        +  LC  GK   A+  LK+M +  G   +  TY+TL+ G C  N+  EA ++ +EM +      S T+NTLI GLC   +  +A   +++MI +GQ P  
Subjt:  LKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHV

Query:  CVWNSLVSSLC
          +NSL++  C
Subjt:  CVWNSLVSSLC

Q9SYK1 Pentatricopeptide repeat-containing protein At1g056001.8e-16455.94Show/hide
Query:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL
        +RWPR+LTP+ LSQI++KQ NP TA +LF+EAK R+P Y HNG VYATMI+ILG S RV EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISL
Subjt:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL

Query:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL
        FKS   FNC N + +F+TLL+ ++KES+L AAC +F++  YGW V SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY I+MKG C +G+L
Subjt:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL

Query:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA
         EA HLLYSMFWRIS+KG G DIV+YR LL ALCD GE++ A+EILGKILRKGLKAPKR ++ I+     +S+  IE +K L+ E LI+G IP  DSY A
Subjt:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA

Query:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA
        MA DL+ E K  +G++V+  M +KGF P   IY AK  +LC+ GK+ +AV VI +E + G C+PT+ +YN+++KGLCDDGKS  A+ YLKKM+KQV  VA
Subjt:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA

Query:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL
        N+ETY TLV GLC + +++EA +V+EEM+IKS  P   T++ +IKGLC + + YEAVMWLEEM+SQ  +P   VW +L  S+C C +  +++   ++
Subjt:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL

Arabidopsis top hitse value%identityAlignment
AT1G05600.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-16555.94Show/hide
Query:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL
        +RWPR+LTP+ LSQI++KQ NP TA +LF+EAK R+P Y HNG VYATMI+ILG S RV EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISL
Subjt:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL

Query:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL
        FKS   FNC N + +F+TLL+ ++KES+L AAC +F++  YGW V SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY I+MKG C +G+L
Subjt:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL

Query:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA
         EA HLLYSMFWRIS+KG G DIV+YR LL ALCD GE++ A+EILGKILRKGLKAPKR ++ I+     +S+  IE +K L+ E LI+G IP  DSY A
Subjt:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA

Query:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA
        MA DL+ E K  +G++V+  M +KGF P   IY AK  +LC+ GK+ +AV VI +E + G C+PT+ +YN+++KGLCDDGKS  A+ YLKKM+KQV  VA
Subjt:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA

Query:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL
        N+ETY TLV GLC + +++EA +V+EEM+IKS  P   T++ +IKGLC + + YEAVMWLEEM+SQ  +P   VW +L  S+C C +  +++   ++
Subjt:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL

AT1G05600.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-16555.94Show/hide
Query:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL
        +RWPR+LTP+ LSQI++KQ NP TA +LF+EAK R+P Y HNG VYATMI+ILG S RV EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISL
Subjt:  IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISL

Query:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL
        FKS   FNC N + +F+TLL+ ++KES+L AAC +F++  YGW V SR  +LNLLM+ LCQ  +S+LA  VFQEM+YQ CYP+R SY I+MKG C +G+L
Subjt:  FKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRL

Query:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA
         EA HLLYSMFWRIS+KG G DIV+YR LL ALCD GE++ A+EILGKILRKGLKAPKR ++ I+     +S+  IE +K L+ E LI+G IP  DSY A
Subjt:  NEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA

Query:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA
        MA DL+ E K  +G++V+  M +KGF P   IY AK  +LC+ GK+ +AV VI +E + G C+PT+ +YN+++KGLCDDGKS  A+ YLKKM+KQV  VA
Subjt:  MAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVI-EEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVA

Query:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL
        N+ETY TLV GLC + +++EA +V+EEM+IKS  P   T++ +IKGLC + + YEAVMWLEEM+SQ  +P   VW +L  S+C C +  +++   ++
Subjt:  NKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC-CDVAGIDMCSRVL

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein2.5e-4423.84Show/hide
Query:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDD-SCECKDSVFSFAIKTYASHGLLEDGISLFKSFG
        L P  ++ +I+ Q +P  A ++F   + +   ++H    Y ++I  LG  G+   M EV+  MR++      + V+  A+K Y   G +++ +++F+   
Subjt:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDD-SCECKDSVFSFAIKTYASHGLLEDGISLFKSFG

Query:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIH
         ++C     ++N ++ +L+       A +++       G+     S  + M+S C+  +   AL +   M  Q C  N ++Y  V+ G  ++    E   
Subjt:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIH

Query:  LLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLD-QCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVD
          Y +F ++   G    +  +  LL  LC  G++++  ++L K++++G+  P    Y + +   C+   L  +    ++   + +G  P   +Y  +   
Subjt:  LLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLD-QCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVD

Query:  LYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETY
        L   +K  + +  +  M+ +G  P S  Y    A  CK G V  A +++ + +  G VP    Y  ++ GLC +G++  A+    + A   G+  N   Y
Subjt:  LYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETY

Query:  STLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLV
        +TL+ GL  +   +EA ++  EM  K   P   TFN L+ GLC +G   +A   ++ MIS+G  P +  +N L+
Subjt:  STLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLV

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-4523.68Show/hide
Query:  TPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFK-SFGR
        T   L   +R Q +   A +LF  A  + P++     +Y  ++  LG SG   +M+++++ M+   CE   S F   I++YA   L ++ +S+       
Subjt:  TPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFK-SFGR

Query:  FNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV
        F     T  +N +L +L+  + L    ++       WG+K    + N+L+++LC                                   + G  + AL +
Subjt:  FNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLC-----------------------------------QRGQSELALHV

Query:  FQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN
         ++M    C  + +S  +++ G C++GR+ +A++ +  M    ++ G   D   + TL+  LC  G ++ A+EI+  +L++G       +  +    C+ 
Subjt:  FQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN

Query:  SNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIV
            ++E   ++++ + +   P++ +Y  +   L  EN+ ++  ++   + +KG  P    + +    LC       A+++ EE    GC P    YN++
Subjt:  SNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIV

Query:  LKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHV
        +  LC  GK   A+  LK+M +  G   +  TY+TL+ G C  N+  EA ++ +EM +      S T+NTLI GLC   +  +A   +++MI +GQ P  
Subjt:  LKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHV

Query:  CVWNSLVSSLC
          +NSL++  C
Subjt:  CVWNSLVSSLC

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-4523.96Show/hide
Query:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLF-KSFG
        ++P    +++        + ++FK A  +   ++      ++MI    NSG    + +++ ++R ++    +  F    + Y    L +  + LF +   
Subjt:  LTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLF-KSFG

Query:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQ---ECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNE
         F C    ++FN++L +++ E   H   + +      +    +     S NL++++LC+    + A+ VF+ M  + C P+  +Y  +M GLC++ R++E
Subjt:  RFNCTNRTQTFNTLLEILLKESQLHAACQLFQ---ECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNE

Query:  AIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMA
        A+ LL  M      +G     VIY  L+  LC  G++ +  +++  +  KG    +  +  +    C    L  ++  SL+   +    IP+  +Y  + 
Subjt:  AIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMA

Query:  VDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKE
          L  + +     +++S M  +G+     IY    + L KEGK ++A+ +  +    GC P I +Y++++ GLC +GK   A E L +M    G + N  
Subjt:  VDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKE

Query:  TYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC
        TYS+L+ G        EA +V +EM           ++ LI GLC VG+  EA+M   +M++ G  P    ++S++  LC
Subjt:  TYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATTAGGTGGCCAAGGATTTTAACGCCCACATGTTTATCTCAGATTATTAGGAAGCAGAATAATCCCCAAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAG
GTACCCAGATTATCGGCACAATGGTCCAGTGTATGCCACAATGATCAATATACTCGGAAACTCGGGCAGAGTTTCCGAGATGAGAGAAGTGATGGATCAGATGAGAGATG
ACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGATGGTATATCTCTTTTTAAAAGTTTTGGGAGATTTAAC
TGTACCAATAGAACACAAACTTTCAATACTCTTTTAGAAATCCTGTTGAAGGAATCTCAGCTTCATGCTGCTTGTCAGCTTTTTCAGGAGTGTTCTTATGGTTGGGGAGT
GAAATCCAGGACTCAGTCCTTGAATTTGTTGATGCAATCTCTCTGCCAAAGAGGTCAGTCGGAGCTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGTTGCTATC
CAAATAGACTGAGTTATTTGATTGTAATGAAAGGACTGTGTCAAGATGGTAGGCTTAACGAGGCCATCCATTTGTTGTATTCCATGTTTTGGAGAATTTCCCGAAAGGGT
GGTGGCGGGGACATAGTAATTTATAGAACCCTTCTGTTTGCTTTATGTGATAATGGAGAGATCGAGCAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTAAA
AGCCCCTAAACGGGCTCATTACCGAATCGACTTAGATCAATGCAGGAATAGCAACCTCACCATTGAGGAAATCAAGAGTTTAATCAATGAAGCTCTAATCAAAGGCGGAA
TTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAACAAGACTGATCAGGGAGATAAAGTGGTTAGCCACATGATAGCTAAAGGCTTCAGGCCA
CCATCCTTGATCTATGAAGCGAAAGCAGCTTCATTATGCAAAGAAGGCAAAGTTGACGATGCAGTCAAAGTAATTGAAGAGCAAATAGTGGGAGGCTGTGTTCCAACTAT
TGCATTGTACAACATCGTTCTGAAGGGTCTTTGTGATGATGGAAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCCAACAAAG
AAACTTACAGCACTTTAGTACATGGACTTTGTCTCGAAAATCGATATATTGAAGCATGTAAGGTTTTAGAGGAGATGGTAATCAAATCGTTTTGCCCTTGCTCTAACACA
TTCAATACACTTATCAAAGGTCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGTGGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTGTGTCTGGAA
TTCTTTGGTTTCATCTTTGTGTTGCGATGTGGCTGGCATCGATATGTGTTCCAGGGTTTTATGA
mRNA sequenceShow/hide mRNA sequence
AGCAAGTTGAAATTGAGTATTTAGGAGGCGCGCTCTCACCCCCCGAATTCCATTGAAGGGAATCAGCTGAAGAGTTTCCGTCTCCGGGAAAAGGAGAAGGCCATATTCTT
AGCCAAATTCGATCCAAGATACCTTTTTTTTCAAGCGCTGAAGTTGATTATAAGATAGGACAGATAATCCCACCTAGCTGGTAAATTTCAAAGCTCTCAACTTCCTGTCT
GCCCATTAGGTGTTTGTCAAAAAGTCTTGCCTAAAATTCGTCAAATTTCTCTTCATTTTTTATGAAGTTCGATAAGATTTTGGTCACTTGTTTCAAATATAAGTATGTTA
TTGATTAAACATACCACCAATTAAGTAGGTATGAGTATTAGGTGGCCAAGGATTTTAACGCCCACATGTTTATCTCAGATTATTAGGAAGCAGAATAATCCCCAAACAGC
TTACCAACTGTTTAAGGAAGCCAAATGTAGGTACCCAGATTATCGGCACAATGGTCCAGTGTATGCCACAATGATCAATATACTCGGAAACTCGGGCAGAGTTTCCGAGA
TGAGAGAAGTGATGGATCAGATGAGAGATGACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGATGGTATA
TCTCTTTTTAAAAGTTTTGGGAGATTTAACTGTACCAATAGAACACAAACTTTCAATACTCTTTTAGAAATCCTGTTGAAGGAATCTCAGCTTCATGCTGCTTGTCAGCT
TTTTCAGGAGTGTTCTTATGGTTGGGGAGTGAAATCCAGGACTCAGTCCTTGAATTTGTTGATGCAATCTCTCTGCCAAAGAGGTCAGTCGGAGCTTGCTTTACATGTCT
TTCAAGAAATGGATTACCAAAGTTGCTATCCAAATAGACTGAGTTATTTGATTGTAATGAAAGGACTGTGTCAAGATGGTAGGCTTAACGAGGCCATCCATTTGTTGTAT
TCCATGTTTTGGAGAATTTCCCGAAAGGGTGGTGGCGGGGACATAGTAATTTATAGAACCCTTCTGTTTGCTTTATGTGATAATGGAGAGATCGAGCAAGCTGTGGAAAT
ACTTGGCAAGATCTTGAGGAAAGGACTAAAAGCCCCTAAACGGGCTCATTACCGAATCGACTTAGATCAATGCAGGAATAGCAACCTCACCATTGAGGAAATCAAGAGTT
TAATCAATGAAGCTCTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAACAAGACTGATCAGGGAGATAAAGTGGTT
AGCCACATGATAGCTAAAGGCTTCAGGCCACCATCCTTGATCTATGAAGCGAAAGCAGCTTCATTATGCAAAGAAGGCAAAGTTGACGATGCAGTCAAAGTAATTGAAGA
GCAAATAGTGGGAGGCTGTGTTCCAACTATTGCATTGTACAACATCGTTCTGAAGGGTCTTTGTGATGATGGAAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGG
CAAAGCAGGTCGGTCTTGTTGCCAACAAAGAAACTTACAGCACTTTAGTACATGGACTTTGTCTCGAAAATCGATATATTGAAGCATGTAAGGTTTTAGAGGAGATGGTA
ATCAAATCGTTTTGCCCTTGCTCTAACACATTCAATACACTTATCAAAGGTCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGTGGTTGGAAGAAATGATTAGCCA
AGGTCAATTGCCTCATGTTTGTGTCTGGAATTCTTTGGTTTCATCTTTGTGTTGCGATGTGGCTGGCATCGATATGTGTTCCAGGGTTTTATGATACTGGAGTATTTACT
ATAAAAATCTCATTCTGGTT
Protein sequenceShow/hide protein sequence
MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFN
CTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKG
GGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRP
PSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNT
FNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL