CuGenDBv2

Lag0030375 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030375
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr8:46736297..46738105
RNA-Seq Expression: Lag0030375
Synteny: Lag0030375
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily
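
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names `GenomeInterval` and `parse_location` are hypothetical helpers written for this illustration, not part of CuGenDB, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeInterval(NamedTuple):
    """Hypothetical container for a parsed location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


# Matches strings such as "chr8:46736297..46738105".
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeInterval:
    """Split a 'chrom:start..end' string into chromosome, start, and end."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start exceeds end in {text!r}")
    return GenomeInterval(match["chrom"], start, end)


interval = parse_location("chr8:46736297..46738105")
print(interval.chrom, interval.start, interval.end, interval.length)
```

Under the inclusive-coordinate assumption, the span reported for Lag0030375 covers 1,809 bases on chr8.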


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6578360.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 3.1e-296 | identity: 91.16%
Query:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG
        MKPHL ELATRVSR ILSISNHTRPAGSWTPSLEQNLHRLGFRE LNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGF+HNSESYKS+LKSLSLSRQFG
Subjt:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG

Query:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR
         IHSLLKQVKT +IGLDLSVYHSVIDSLI+GKKTHDAFLVF E+TSVT VIG EPCN LLAALASDGFFEHAQKVFDEMSLK IPFNTLGFGVFIWR+CR
Subjt:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR

Query:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF
        NADVVKVLN+LDDA TNN EINGSV+ATLI+HGLCGASRLSEAS+ILDELKNRGCKPDFLTYWIL EA+QS GSVVDREK LKKKRKLGVAPRL+DYKEF
Subjt:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF

Query:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR
        LFALIA RRICEAKELGEVI+RGNFPMDEDVSNVLIGSV+A+DPSSAIMF K MVEKERFPTLLTLRNLSRNLCKHGK+DELLEV+Q+LS HNYF D+DR
Subjt:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR

Query:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE
        YHLRISFLCKAGMVKEAY VLQEMKKNGF+PDV FYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNIL+QKFSKSNQ+EEAL LY HMLGKKVE
Subjt:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE

Query:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF
        PDITIYTSLLQGLCQESQLEAAFEVFSK VEQDV+LAGTLLSTFILCLCKAG F
Subjt:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF

KAG7015944.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 1.4e-296 | identity: 91.19%
Query:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG
        MKPHL ELATRVSR ILSISNHTRPAGSWTPSLEQNLHRLGFRE LNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGF+HNSESYKS+LKSLSLSRQFG
Subjt:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG

Query:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR
         IHSLLKQVKT +IGLDLSVYHSVIDSLI+GKKTHDAFLVF E+TSVT VIG EPCN LLAALASDGFFEHAQKVFDEMSLK IPFNTLGFGVFIWR+CR
Subjt:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR

Query:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF
        NADVVKVLN+LDDA TNN EINGSV+ATLI+HGLCGASRLSEAS+ILDELKNRGCKPDFLTYWIL EA+QSAGSVVDREK LKKKRKLGVAPRL+DYKEF
Subjt:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF

Query:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR
        LFALIA RRICEAKELGEVI+RGNFPMDEDVSNVLIGSV+A+DPSSAIMF K MVEKERFPTLLTLRNLSRNLCKHGK+DELLEV+Q+LS HNYF D+DR
Subjt:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR

Query:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE
        YHLRISFLCKAGMVKEAY VLQEMKKNGF+PDV FYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNIL+QKFSKSNQ+EEAL LY HMLGKKVE
Subjt:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE

Query:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKFFL
        PDITIYTSLLQGLCQESQLEAAFEVFSK VEQDV+LAGTLLSTFILCLCKAGK  L
Subjt:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKFFL

XP_022939514.1 pentatricopeptide repeat-containing protein At5g14080 isoform X1 [Cucurbita moschata] | e-value: 5.2e-296 | identity: 91.16%
Query:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG
        MKPHL ELATRVSR ILSISNHTRPAGSWTPSLEQNLHRLGFRE LNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGF+HNSESYKS+LKSLSLSRQFG
Subjt:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG

Query:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR
         IHSLLKQVKT +IGLDLSVYHSVIDSLI+GKKTHDAFLVF E+TSVT VIG EPCN LLAALASDGFFEHAQKVFDEMSLK IPFNTLGFGVFIWR+CR
Subjt:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR

Query:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF
        NADVVKVLN+LDDA TNN EINGSV+ATLI+HGLCGASRL EAS+ILDELKNRGCKPDFLTYWIL EA+QSAGSVVDREK LKKKRKLGVAPRL+DYKEF
Subjt:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF

Query:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR
        LFALIA RRICEAKELGEVI+RGNFPMDEDVSNVLIGSV+A+DPSSAIMF K MVEKERFPTLLTLRNLSRNLCKHGK+DELLEV+Q+LS HNYF D+DR
Subjt:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR

Query:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE
        YHLRISFLCKAGMVKEAY VLQEMKKNGF+PDV FYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNIL+QKFSKSNQ+EEAL LY HMLGKKVE
Subjt:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE

Query:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF
        PDITIYTSLLQGLCQESQLEAAFEVFSK VEQDV+LAGTLLSTFILCLCKAG F
Subjt:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF

XP_022992821.1 pentatricopeptide repeat-containing protein At5g14080 isoform X1 [Cucurbita maxima] | e-value: 6.4e-294 | identity: 90.79%
Query:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG
        MKPHL ELATRVSR +LSISNHT PAGSWTPSLEQNLHRLGFRE LNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGF+HNSESYKS+LKSLSLSRQFG
Subjt:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG

Query:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR
         IH LLKQVKT +IGLDLSVYHSVIDSLI+GKKTHDAFLVF EVTSVT VIG EPCN LLAALASDGFFEHAQKVFDEMSLK IPFNTLGFGVFIWR+CR
Subjt:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR

Query:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF
        NADVVKVLN+LDDA TNN EINGSV+ATLI+HGLCGASRLSEAS+ILDELKNRGCKPDFLTYWIL EA++ AGSVVDREK LKKKRKLGVAPRL+DYKEF
Subjt:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF

Query:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR
        LFALIA RRICEAKELGEVI+R NFPMDEDVSNVLIGSV+A+DPSSAIMF  FMVEKERFPTLLTLRNLSRNLCKHGK+DELLEV+QVLS HNYF D+DR
Subjt:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR

Query:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE
        Y LRISFLCKAGMVKEAY VLQEMKKNGF+PDV FYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQ+EEAL LY HMLGKKVE
Subjt:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE

Query:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF
        PDITIYTSLLQGLCQESQLEAAFEVFSK VEQDVNLAGTLLSTFILCLCKAG F
Subjt:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF

XP_023550678.1 pentatricopeptide repeat-containing protein At5g14080 isoform X1 [Cucurbita pepo subsp. pepo] | e-value: 4.7e-297 | identity: 91.88%
Query:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG
        MKPHL ELATRVSR ILSISNHTRPAGSWTPSLEQNLHRLGFRE LNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGF+HNSESYKS+LKSLSLS QFG
Subjt:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG

Query:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR
         IHSLLKQVKT +IGLDLSVYHSVIDSLI+GKKTHDAFLVF EVTSVT VIG EPCN LLAALASDGFF+HAQKVFDEMSLK IPFNTLGFGVFIWR+CR
Subjt:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR

Query:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF
        NADVVKVLN+LDDA TNN EINGSV+ATLI+HGLCGASRLSEAS+ILDELKNRGCKPDFLTYWIL EA+QSAGSVVDREK LKKKRKLGVAPRL+DYKEF
Subjt:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF

Query:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR
        LFALIA RRICEAKELGEVI+RGNFPMDEDVSNVLIGSV+A+DPSSAIMF K MVEKERFPTLLTLRNLSRNLCKHGKIDELLEV+QVLS HNYF D+DR
Subjt:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR

Query:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE
        YHLRISFLCKAGMVKEAY VLQEMKKNGF+PDV FYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQ+EEAL LY HMLGKKVE
Subjt:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE

Query:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF
        PDITIYTSLLQGLCQESQLEAAFEVFSK VEQDVNLAGTLLSTFILCLCKAG F
Subjt:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LMX0 Uncharacterized protein | e-value: 2.1e-282 | identity: 87.73%
Query:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG
        M+PH PELATR+SRAILSISN T PAGSWTPSLEQNLHRLGFR+ LNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGF+HNS+SY SILKSLSLSR FG
Subjt:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG

Query:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR
         IHSLLKQVKT KIGLDLSVY +VIDSLI+ KKTHDAFLVFNEVTS+T +IG E CN LLAALASDGFFEHAQKVFDEMSLK IPFNTLGFGVFIWRICR
Subjt:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR

Query:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF
        N DVVKVLN++D ARTNN +INGSVIATLI+HGLC ASRL EAS+ILDELKNRGCKPDFLTYWIL EAFQSA +VVDREKILKKKRKLGVAPRLNDYKE+
Subjt:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF

Query:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR
        LF LIA RRI EAKELGEVI++GNFPMDE+VSNVLIGSV++VDP SAIMFFKFMVEK RFPTLLTLRNLSRNLCKHGK DELLEVFQVL I+NYF+D DR
Subjt:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR

Query:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE
        YHLRISFLCKAG VKEAY VLQEMKKNGF PDVSFYNSVLEACCREDLLRPARKLWDEMFA GC GNLKTY+IL+QKFSKSNQIEEAL LY HMLGK VE
Subjt:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE

Query:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF
        PDI IYTSLLQGLCQ+SQLEAAFEVFSKSVEQDVNLA TLLSTFILCLCK G F
Subjt:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF

A0A1S3B2P5 pentatricopeptide repeat-containing protein At5g14080 isoform X1 | e-value: 1.8e-281 | identity: 86.64%
Query:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG
        M+PHLPELATR+SRAILSISN T PAGSWTPSLEQNLHRLGFR+ LNPSLVSQVIDPHLLSH+SLALGFFNWASQQPGF+HNS+SY SILKSLSLSR FG
Subjt:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG

Query:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR
         IHSLLKQVKT KIGLDLSVY SVIDSLI+ KKTHDAFLVFNEVTS+T +IG E CN LLAAL+SDGF+E A KVFDEMSLKCIPFNTLG GVFIW++CR
Subjt:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR

Query:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF
        N DVVKVLN++DD RTNN ++NGS+IATLI+HGLCGASRL EAS+ILDELKNRGCKPDFLTYWIL EAFQSAG+VVDREKILKKKRKLGVAPRLNDYKE+
Subjt:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF

Query:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR
        LFALIA +RI EAKELGEVI++GNFPMDE+VSNVLIGSV++VDP SAIMFFKFMVEK RFPTLLTLRNLSRNLCKHGK DELLEVFQVL I NYF+D DR
Subjt:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR

Query:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE
        YHLRISFLCKAG VKEAY VLQEMKKNGF+PD SFYNSVLEACCREDLLRPARKLWDEMFASGC GNLKTY+IL+QKFSKSNQIEEAL LY HMLGK VE
Subjt:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE

Query:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF
        PDI IYTSLLQGLCQ SQLE AFEVFSKSVEQDVNLA TLLSTFILCLCK G F
Subjt:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF

A0A6J1BXA4 pentatricopeptide repeat-containing protein At5g14080 | e-value: 1.5e-285 | identity: 87.36%
Query:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG
        M+PH+P+LATRVSRA+LS       AG+WTPSLEQNLHRLGFR+ LNPSLVSQVIDPHLL+HHSLALGFFNWASQQPGF+HNSESYKS+LK LS SRQ+G
Subjt:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG

Query:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR
         IHSLLKQV+T KIGL LSVY SVIDSLIVGKKTHDAFLVFNEVTS+T+ IG EPCN LLAALASDGFFEHAQKVFDEMS+KCIPF+TLGFGVFIWR+CR
Subjt:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR

Query:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF
        NAD+VKVL +LDDARTNN EINGS+IATLI+HGLCGASR+SEAS+++DELKNR CKPDFL YWI+AEAFQSAGSV +REKILKKKRKLGVAPRLNDYKEF
Subjt:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF

Query:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR
        LFALIA RRICEAKELGEVIIRGNFP+DEDVSNVLIGSV+A+DP  AIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVL +HNYF+DFDR
Subjt:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR

Query:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE
        YHLR SFLCKAGMVKEAY VLQEMKK GF+ DVSFYN VLEACCREDLLRPARKLWDEMFASGCGGNLKTYNIL+QKFSKSNQI EALTLYHHMLGKKVE
Subjt:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE

Query:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF
        PDIT YTSLLQGLCQESQLEAAFEVFSKSVEQD NLAGTLL+TFILCLC+AG F
Subjt:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF

A0A6J1FHE2 pentatricopeptide repeat-containing protein At5g14080 isoform X1 | e-value: 2.5e-296 | identity: 91.16%
Query:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG
        MKPHL ELATRVSR ILSISNHTRPAGSWTPSLEQNLHRLGFRE LNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGF+HNSESYKS+LKSLSLSRQFG
Subjt:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG

Query:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR
         IHSLLKQVKT +IGLDLSVYHSVIDSLI+GKKTHDAFLVF E+TSVT VIG EPCN LLAALASDGFFEHAQKVFDEMSLK IPFNTLGFGVFIWR+CR
Subjt:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR

Query:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF
        NADVVKVLN+LDDA TNN EINGSV+ATLI+HGLCGASRL EAS+ILDELKNRGCKPDFLTYWIL EA+QSAGSVVDREK LKKKRKLGVAPRL+DYKEF
Subjt:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF

Query:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR
        LFALIA RRICEAKELGEVI+RGNFPMDEDVSNVLIGSV+A+DPSSAIMF K MVEKERFPTLLTLRNLSRNLCKHGK+DELLEV+Q+LS HNYF D+DR
Subjt:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR

Query:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE
        YHLRISFLCKAGMVKEAY VLQEMKKNGF+PDV FYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNIL+QKFSKSNQ+EEAL LY HMLGKKVE
Subjt:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE

Query:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF
        PDITIYTSLLQGLCQESQLEAAFEVFSK VEQDV+LAGTLLSTFILCLCKAG F
Subjt:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF

A0A6J1JUL2 pentatricopeptide repeat-containing protein At5g14080 isoform X1 | e-value: 3.1e-294 | identity: 90.79%
Query:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG
        MKPHL ELATRVSR +LSISNHT PAGSWTPSLEQNLHRLGFRE LNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGF+HNSESYKS+LKSLSLSRQFG
Subjt:  MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFG

Query:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR
         IH LLKQVKT +IGLDLSVYHSVIDSLI+GKKTHDAFLVF EVTSVT VIG EPCN LLAALASDGFFEHAQKVFDEMSLK IPFNTLGFGVFIWR+CR
Subjt:  VIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICR

Query:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF
        NADVVKVLN+LDDA TNN EINGSV+ATLI+HGLCGASRLSEAS+ILDELKNRGCKPDFLTYWIL EA++ AGSVVDREK LKKKRKLGVAPRL+DYKEF
Subjt:  NADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEF

Query:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR
        LFALIA RRICEAKELGEVI+R NFPMDEDVSNVLIGSV+A+DPSSAIMF  FMVEKERFPTLLTLRNLSRNLCKHGK+DELLEV+QVLS HNYF D+DR
Subjt:  LFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDR

Query:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE
        Y LRISFLCKAGMVKEAY VLQEMKKNGF+PDV FYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQ+EEAL LY HMLGKKVE
Subjt:  YHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVE

Query:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF
        PDITIYTSLLQGLCQESQLEAAFEVFSK VEQDVNLAGTLLSTFILCLCKAG F
Subjt:  PDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCKAGKF

SwissProt top hits (e-value, % identity, alignment)
Q9FIT7 Pentatricopeptide repeat-containing protein At5g61990, mitochondrial | e-value: 1.2e-40 | identity: 25.11%
Query:  SYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCI
        +Y  ++  L   ++     SLL ++ +  + LD   Y  +ID L+ G+    A  + +E+ S    I P   +  +  ++ +G  E A+ +FD M    +
Subjt:  SYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCI

Query:  PFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKK
              +   I   CR  +V +   LL + +  N+ I+     T +V G+C +  L  A +I+ E+   GC+P+ + Y  L + F       D  ++LK+
Subjt:  PFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKK

Query:  KRKLGVAPRLNDYKEFLFALIAARRICEAKE-LGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELL
         ++ G+AP +  Y   +  L  A+R+ EA+  L E++  G  P        + G + A + +SA  + K M E    P  +    L    CK GK+ E  
Subjt:  KRKLGVAPRLNDYKEFLFALIAARRICEAKE-LGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELL

Query:  EVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQ
          ++ +       D   Y + ++ L K   V +A  + +EM+  G +PDV  Y  ++    +   ++ A  ++DEM   G   N+  YN+L+  F +S +
Subjt:  EVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQ

Query:  IEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFI
        IE+A  L   M  K + P+   Y +++ G C+   L  AF +F      ++ L G +  +F+
Subjt:  IEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFI

Q9FMU2 Pentatricopeptide repeat-containing protein At5g14080 | e-value: 5.3e-174 | identity: 53.93%
Query:  ELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFGVIHSLL
        ELA R+ R +L +S  +R A  W+P +EQ+LH LGFR +++PSLV++VIDP LL+HHSLALGFFNWA+QQPG+SH+S SY SI KSLSLSRQF  + +L 
Subjt:  ELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFGVIHSLL

Query:  KQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVK
        KQVK++KI LD SVY S+ID+L++G+K   AF V  E  S    I P+ CN LLA L SDG +++AQK+F +M  K +  NTLGFGV+I   CR+++  +
Subjt:  KQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVK

Query:  VLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIA
        +L L+D+ +  NL INGS+IA LI+H LC  SR  +A  IL+EL+N  CKPDF+ Y ++AEAF   G++ +R+ +LKKKRKLGVAPR +DY+ F+  LI+
Subjt:  VLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIA

Query:  ARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRIS
        A+R+ EAKE+ EVI+ G FPMD D+ + LIGSVSAVDP SA+ F  +MV   + P + TL  LS+NLC+H K D L++ +++LS   YFS+   Y L IS
Subjt:  ARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRIS

Query:  FLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIY
        FLCKAG V+E+Y+ LQEMKK G +PDVS YN+++EACC+ +++RPA+KLWDEMF  GC  NL TYN+L++K S+  + EE+L L+  ML + +EPD TIY
Subjt:  FLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIY

Query:  TSLLQGLCQESQLEAAFEVFSKSVEQD-VNLAGTLLSTFILCLCKAG
         SL++GLC+E+++EAA EVF K +E+D   +   +LS F+L LC  G
Subjt:  TSLLQGLCQESQLEAAFEVFSKSVEQD-VNLAGTLLSTFILCLCKAG

Q9M9X9 Pentatricopeptide repeat-containing protein At1g06710, mitochondrial | e-value: 4.2e-38 | identity: 24.25%
Query:  FREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAF-LV
        FRE L+ SLV +V+   L++  S  + FF WA +Q G+ H +  Y +++  +       V    L+Q++      D  V+   ++ L+     + +F + 
Subjt:  FREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAF-LV

Query:  FNEVTSVTDV---IGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGA
          E+  + D          N L+ A       + A  +  EMSL  +  +      F + +C+     + L L++   T N  +  +V  T ++ GLC A
Subjt:  FNEVTSVTDV---IGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGA

Query:  SRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIG
        S   EA D L+ ++   C P+ +TY  L     +   +   +++L      G  P    +   + A   +     A +L + +++        V N+LIG
Subjt:  SRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIG

Query:  SVSAVDPSSAIMFFKFMVEKERFPTL--------LTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGF
        S+   D  S       + EK     L        + + + +R LC  GK ++   V + +    +  D   Y   +++LC A  ++ A+ + +EMK+ G 
Subjt:  SVSAVDPSSAIMFFKFMVEKERFPTL--------LTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGF

Query:  SPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSK
          DV  Y  ++++ C+  L+  ARK ++EM   GC  N+ TY  L+  + K+ ++  A  L+  ML +   P+I  Y++L+ G C+  Q+E A ++F +
Subjt:  SPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSK

Q9SH26 Pentatricopeptide repeat-containing protein At1g63400 | e-value: 4.6e-37 | identity: 24.09%
Query:  QQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSV--YHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEH-
        Q+ G SHN  +Y  ++       Q  +  +LL   K  K+G + S+    S+++    GK+  DA  + +++      +G  P  +    L   G F H 
Subjt:  QQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSV--YHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEH-

Query:  ----AQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAE
            A  + D M  +    N + +GV +  +C+  D+    NLL+      +E N  VI + ++  LC      +A ++  E++N+G +P+ +TY  L  
Subjt:  ----AQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAE

Query:  AFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDP-SSAIMFFKFMVEKERFPTLLTL
           +     D  ++L    +  + P +  +   + A +   ++ EA++L + +I+ +   D    + LI      D    A   F+ M+ K+ FP ++T 
Subjt:  AFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDP-SSAIMFFKFMVEKERFPTLLTL

Query:  RNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGG
          L    CK  +IDE +E+F+ +S      +   Y   I    +A     A  V ++M  +G  P++  YN++L+  C+   L  A  +++ +  S    
Subjt:  RNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGG

Query:  NLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVE
         + TYNI+++   K+ ++E+   L+  +  K V+PD+ IY +++ G C++   E A  +F K  E
Subjt:  NLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVE

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial | e-value: 1.4e-38 | identity: 23.88%
Query:  QQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSV--YHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEP----CNLLLAALASDGF
        Q  G  HN  +Y  ++       Q  +  ++L   K  K+G + ++    S+++     K+  +A  + +++     V G +P     N L+  L     
Subjt:  QQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSV--YHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEP----CNLLLAALASDGF

Query:  FEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEA
           A  + D M  K    + + +GV +  +C+  D     NLL+      LE  G +I   I+ GLC    + +A ++  E++ +G +P+ +TY  L   
Subjt:  FEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEA

Query:  FQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKEL-GEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLR
          + G   D  ++L    +  + P +  +   + A +   ++ EA++L  E++ R   P     S+++ G         A   F+FMV K  FP ++T  
Subjt:  FQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKEL-GEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLR

Query:  NLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGN
         L +  CK+ +++E +EVF+ +S      +   Y++ I  L +AG    A  + +EM  +G  P++  YN++L+  C+   L  A  +++ +  S     
Subjt:  NLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGN

Query:  LKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNL
        + TYNI+++   K+ ++E+   L+ ++  K V+PD+  Y +++ G C++   E A  +F K +++D  L
Subjt:  LKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNL

Arabidopsis top hits (e-value, % identity, alignment)
AT1G06710.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 3.0e-39 | identity: 24.25%
Query:  FREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAF-LV
        FRE L+ SLV +V+   L++  S  + FF WA +Q G+ H +  Y +++  +       V    L+Q++      D  V+   ++ L+     + +F + 
Subjt:  FREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAF-LV

Query:  FNEVTSVTDV---IGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGA
          E+  + D          N L+ A       + A  +  EMSL  +  +      F + +C+     + L L++   T N  +  +V  T ++ GLC A
Subjt:  FNEVTSVTDV---IGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGA

Query:  SRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIG
        S   EA D L+ ++   C P+ +TY  L     +   +   +++L      G  P    +   + A   +     A +L + +++        V N+LIG
Subjt:  SRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIG

Query:  SVSAVDPSSAIMFFKFMVEKERFPTL--------LTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGF
        S+   D  S       + EK     L        + + + +R LC  GK ++   V + +    +  D   Y   +++LC A  ++ A+ + +EMK+ G 
Subjt:  SVSAVDPSSAIMFFKFMVEKERFPTL--------LTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGF

Query:  SPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSK
          DV  Y  ++++ C+  L+  ARK ++EM   GC  N+ TY  L+  + K+ ++  A  L+  ML +   P+I  Y++L+ G C+  Q+E A ++F +
Subjt:  SPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSK

AT1G62670.1 rna processing factor 2 | e-value: 1.0e-39 | identity: 23.88%
Query:  QQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSV--YHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEP----CNLLLAALASDGF
        Q  G  HN  +Y  ++       Q  +  ++L   K  K+G + ++    S+++     K+  +A  + +++     V G +P     N L+  L     
Subjt:  QQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSV--YHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEP----CNLLLAALASDGF

Query:  FEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEA
           A  + D M  K    + + +GV +  +C+  D     NLL+      LE  G +I   I+ GLC    + +A ++  E++ +G +P+ +TY  L   
Subjt:  FEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEA

Query:  FQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKEL-GEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLR
          + G   D  ++L    +  + P +  +   + A +   ++ EA++L  E++ R   P     S+++ G         A   F+FMV K  FP ++T  
Subjt:  FQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKEL-GEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLR

Query:  NLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGN
         L +  CK+ +++E +EVF+ +S      +   Y++ I  L +AG    A  + +EM  +G  P++  YN++L+  C+   L  A  +++ +  S     
Subjt:  NLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGN

Query:  LKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNL
        + TYNI+++   K+ ++E+   L+ ++  K V+PD+  Y +++ G C++   E A  +F K +++D  L
Subjt:  LKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNL

AT1G63400.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 3.3e-38 | identity: 24.09%
Query:  QQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSV--YHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEH-
        Q+ G SHN  +Y  ++       Q  +  +LL   K  K+G + S+    S+++    GK+  DA  + +++      +G  P  +    L   G F H 
Subjt:  QQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSV--YHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEH-

Query:  ----AQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAE
            A  + D M  +    N + +GV +  +C+  D+    NLL+      +E N  VI + ++  LC      +A ++  E++N+G +P+ +TY  L  
Subjt:  ----AQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAE

Query:  AFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDP-SSAIMFFKFMVEKERFPTLLTL
           +     D  ++L    +  + P +  +   + A +   ++ EA++L + +I+ +   D    + LI      D    A   F+ M+ K+ FP ++T 
Subjt:  AFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDP-SSAIMFFKFMVEKERFPTLLTL

Query:  RNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGG
          L    CK  +IDE +E+F+ +S      +   Y   I    +A     A  V ++M  +G  P++  YN++L+  C+   L  A  +++ +  S    
Subjt:  RNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGG

Query:  NLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVE
         + TYNI+++   K+ ++E+   L+  +  K V+PD+ IY +++ G C++   E A  +F K  E
Subjt:  NLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVE

AT5G14080.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value: 3.7e-175; % identity: 53.93)
Query:  ELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFGVIHSLL
        ELA R+ R +L +S  +R A  W+P +EQ+LH LGFR +++PSLV++VIDP LL+HHSLALGFFNWA+QQPG+SH+S SY SI KSLSLSRQF  + +L 
Subjt:  ELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFGVIHSLL

Query:  KQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVK
        KQVK++KI LD SVY S+ID+L++G+K   AF V  E  S    I P+ CN LLA L SDG +++AQK+F +M  K +  NTLGFGV+I   CR+++  +
Subjt:  KQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVK

Query:  VLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIA
        +L L+D+ +  NL INGS+IA LI+H LC  SR  +A  IL+EL+N  CKPDF+ Y ++AEAF   G++ +R+ +LKKKRKLGVAPR +DY+ F+  LI+
Subjt:  VLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIA

Query:  ARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRIS
        A+R+ EAKE+ EVI+ G FPMD D+ + LIGSVSAVDP SA+ F  +MV   + P + TL  LS+NLC+H K D L++ +++LS   YFS+   Y L IS
Subjt:  ARRICEAKELGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRIS

Query:  FLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIY
        FLCKAG V+E+Y+ LQEMKK G +PDVS YN+++EACC+ +++RPA+KLWDEMF  GC  NL TYN+L++K S+  + EE+L L+  ML + +EPD TIY
Subjt:  FLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIY

Query:  TSLLQGLCQESQLEAAFEVFSKSVEQD-VNLAGTLLSTFILCLCKAG
         SL++GLC+E+++EAA EVF K +E+D   +   +LS F+L LC  G
Subjt:  TSLLQGLCQESQLEAAFEVFSKSVEQD-VNLAGTLLSTFILCLCKAG

AT5G61990.1 Pentatricopeptide repeat (PPR) superfamily protein (e-value: 8.4e-42; % identity: 25.11)
Query:  SYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCI
        +Y  ++  L   ++     SLL ++ +  + LD   Y  +ID L+ G+    A  + +E+ S    I P   +  +  ++ +G  E A+ +FD M    +
Subjt:  SYKSILKSLSLSRQFGVIHSLLKQVKTHKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCI

Query:  PFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKK
              +   I   CR  +V +   LL + +  N+ I+     T +V G+C +  L  A +I+ E+   GC+P+ + Y  L + F       D  ++LK+
Subjt:  PFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLEINGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKK

Query:  KRKLGVAPRLNDYKEFLFALIAARRICEAKE-LGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELL
         ++ G+AP +  Y   +  L  A+R+ EA+  L E++  G  P        + G + A + +SA  + K M E    P  +    L    CK GK+ E  
Subjt:  KRKLGVAPRLNDYKEFLFALIAARRICEAKE-LGEVIIRGNFPMDEDVSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELL

Query:  EVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQ
          ++ +       D   Y + ++ L K   V +A  + +EM+  G +PDV  Y  ++    +   ++ A  ++DEM   G   N+  YN+L+  F +S +
Subjt:  EVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQ

Query:  IEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFI
        IE+A  L   M  K + P+   Y +++ G C+   L  AF +F      ++ L G +  +F+
Subjt:  IEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFI


Sequences
CDS sequence
ATGAAACCCCATTTACCAGAATTAGCTACTCGAGTGAGCAGAGCCATACTTTCAATATCAAATCACACAAGACCAGCTGGATCATGGACGCCTTCATTGGAGCAGAATTT
GCATCGACTCGGTTTCCGCGAAGCACTAAATCCATCTCTCGTCTCTCAAGTTATCGACCCACATCTTCTCAGCCACCACTCTCTCGCTCTTGGTTTCTTCAATTGGGCTT
CTCAGCAACCTGGTTTCTCCCACAATTCCGAATCCTACAAGTCGATTCTCAAGTCTCTCTCTCTTTCGCGCCAGTTTGGGGTCATTCATAGTCTCTTGAAACAGGTAAAA
ACTCATAAAATTGGCCTCGATTTATCAGTTTATCACTCTGTTATTGATTCGTTGATCGTTGGCAAGAAGACGCATGATGCTTTTTTGGTTTTTAATGAGGTTACTTCAGT
TACTGACGTTATTGGACCAGAACCATGTAATTTGCTTTTGGCTGCTCTTGCTTCTGATGGGTTTTTTGAGCATGCCCAGAAGGTTTTTGATGAAATGTCTCTTAAATGCA
TTCCTTTTAACACCCTTGGATTTGGTGTGTTTATTTGGAGGATTTGTAGAAATGCTGATGTAGTTAAAGTTTTGAACTTGCTAGATGATGCCAGGACCAATAATTTAGAG
ATCAATGGTTCTGTTATTGCCACGTTGATCGTTCATGGGCTCTGTGGGGCATCTAGACTTTCAGAAGCTTCAGATATTTTGGATGAACTTAAGAATAGGGGTTGCAAGCC
TGACTTTTTGACGTATTGGATTCTTGCAGAAGCATTTCAGTCAGCAGGGAGTGTGGTTGATAGGGAGAAAATTCTGAAGAAGAAGAGAAAGCTAGGGGTAGCTCCAAGGC
TTAATGATTATAAGGAGTTCTTATTTGCTTTAATAGCTGCGAGACGAATATGTGAAGCTAAAGAGTTGGGTGAAGTTATTATTAGAGGAAATTTTCCAATGGATGAAGAT
GTTTCTAATGTGCTGATAGGGTCGGTTTCTGCCGTTGATCCTTCATCTGCTATTATGTTCTTTAAGTTTATGGTCGAGAAAGAGAGATTTCCTACTCTCTTGACTTTAAG
AAATCTGAGTAGGAATTTATGTAAGCATGGAAAGATTGATGAACTGCTGGAAGTTTTCCAAGTTCTGAGTATACATAACTACTTCAGTGATTTTGATAGGTACCATTTGA
GAATTTCGTTCTTATGCAAGGCCGGAATGGTGAAAGAAGCCTATAGTGTTCTGCAGGAGATGAAGAAAAATGGATTTTCCCCTGATGTATCTTTCTACAATTCTGTCCTA
GAAGCATGTTGTAGAGAAGATCTACTTCGTCCTGCTAGAAAGCTGTGGGATGAGATGTTTGCTAGTGGCTGTGGTGGTAATTTAAAGACATATAACATCCTTGTTCAAAA
GTTTTCGAAATCTAATCAAATCGAGGAAGCTTTGACACTTTACCATCATATGCTTGGAAAAAAGGTGGAACCCGACATTACAATCTACACGTCCCTGCTTCAAGGGCTCT
GTCAGGAATCACAACTTGAAGCTGCTTTTGAAGTCTTTAGCAAGTCGGTTGAACAGGATGTAAATCTTGCGGGAACCTTGCTGAGCACTTTTATCCTGTGTCTGTGTAAA
GCAGGCAAGTTCTTTTTACCTGGTTATACGTTTATGCTTCTTGTTTGGAGTCATTTGTTTCTCATAGAACAAGTCCTTATCAACCCTCTCTTCAATGAGCCAAATCACTT
CTTTTGGGGTCATCTCTTTGACTACACCCTCCTCAGCTTTGTTACCTAG
mRNA sequence
ATGAAACCCCATTTACCAGAATTAGCTACTCGAGTGAGCAGAGCCATACTTTCAATATCAAATCACACAAGACCAGCTGGATCATGGACGCCTTCATTGGAGCAGAATTT
GCATCGACTCGGTTTCCGCGAAGCACTAAATCCATCTCTCGTCTCTCAAGTTATCGACCCACATCTTCTCAGCCACCACTCTCTCGCTCTTGGTTTCTTCAATTGGGCTT
CTCAGCAACCTGGTTTCTCCCACAATTCCGAATCCTACAAGTCGATTCTCAAGTCTCTCTCTCTTTCGCGCCAGTTTGGGGTCATTCATAGTCTCTTGAAACAGGTAAAA
ACTCATAAAATTGGCCTCGATTTATCAGTTTATCACTCTGTTATTGATTCGTTGATCGTTGGCAAGAAGACGCATGATGCTTTTTTGGTTTTTAATGAGGTTACTTCAGT
TACTGACGTTATTGGACCAGAACCATGTAATTTGCTTTTGGCTGCTCTTGCTTCTGATGGGTTTTTTGAGCATGCCCAGAAGGTTTTTGATGAAATGTCTCTTAAATGCA
TTCCTTTTAACACCCTTGGATTTGGTGTGTTTATTTGGAGGATTTGTAGAAATGCTGATGTAGTTAAAGTTTTGAACTTGCTAGATGATGCCAGGACCAATAATTTAGAG
ATCAATGGTTCTGTTATTGCCACGTTGATCGTTCATGGGCTCTGTGGGGCATCTAGACTTTCAGAAGCTTCAGATATTTTGGATGAACTTAAGAATAGGGGTTGCAAGCC
TGACTTTTTGACGTATTGGATTCTTGCAGAAGCATTTCAGTCAGCAGGGAGTGTGGTTGATAGGGAGAAAATTCTGAAGAAGAAGAGAAAGCTAGGGGTAGCTCCAAGGC
TTAATGATTATAAGGAGTTCTTATTTGCTTTAATAGCTGCGAGACGAATATGTGAAGCTAAAGAGTTGGGTGAAGTTATTATTAGAGGAAATTTTCCAATGGATGAAGAT
GTTTCTAATGTGCTGATAGGGTCGGTTTCTGCCGTTGATCCTTCATCTGCTATTATGTTCTTTAAGTTTATGGTCGAGAAAGAGAGATTTCCTACTCTCTTGACTTTAAG
AAATCTGAGTAGGAATTTATGTAAGCATGGAAAGATTGATGAACTGCTGGAAGTTTTCCAAGTTCTGAGTATACATAACTACTTCAGTGATTTTGATAGGTACCATTTGA
GAATTTCGTTCTTATGCAAGGCCGGAATGGTGAAAGAAGCCTATAGTGTTCTGCAGGAGATGAAGAAAAATGGATTTTCCCCTGATGTATCTTTCTACAATTCTGTCCTA
GAAGCATGTTGTAGAGAAGATCTACTTCGTCCTGCTAGAAAGCTGTGGGATGAGATGTTTGCTAGTGGCTGTGGTGGTAATTTAAAGACATATAACATCCTTGTTCAAAA
GTTTTCGAAATCTAATCAAATCGAGGAAGCTTTGACACTTTACCATCATATGCTTGGAAAAAAGGTGGAACCCGACATTACAATCTACACGTCCCTGCTTCAAGGGCTCT
GTCAGGAATCACAACTTGAAGCTGCTTTTGAAGTCTTTAGCAAGTCGGTTGAACAGGATGTAAATCTTGCGGGAACCTTGCTGAGCACTTTTATCCTGTGTCTGTGTAAA
GCAGGCAAGTTCTTTTTACCTGGTTATACGTTTATGCTTCTTGTTTGGAGTCATTTGTTTCTCATAGAACAAGTCCTTATCAACCCTCTCTTCAATGAGCCAAATCACTT
CTTTTGGGGTCATCTCTTTGACTACACCCTCCTCAGCTTTGTTACCTAG
Protein sequence
MKPHLPELATRVSRAILSISNHTRPAGSWTPSLEQNLHRLGFREALNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFSHNSESYKSILKSLSLSRQFGVIHSLLKQVK
THKIGLDLSVYHSVIDSLIVGKKTHDAFLVFNEVTSVTDVIGPEPCNLLLAALASDGFFEHAQKVFDEMSLKCIPFNTLGFGVFIWRICRNADVVKVLNLLDDARTNNLE
INGSVIATLIVHGLCGASRLSEASDILDELKNRGCKPDFLTYWILAEAFQSAGSVVDREKILKKKRKLGVAPRLNDYKEFLFALIAARRICEAKELGEVIIRGNFPMDED
VSNVLIGSVSAVDPSSAIMFFKFMVEKERFPTLLTLRNLSRNLCKHGKIDELLEVFQVLSIHNYFSDFDRYHLRISFLCKAGMVKEAYSVLQEMKKNGFSPDVSFYNSVL
EACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSKSNQIEEALTLYHHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCK
AGKFFLPGYTFMLLVWSHLFLIEQVLINPLFNEPNHFFWGHLFDYTLLSFVT
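
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch rather than the database's own pipeline: the function name `translate` and the variables `cds_start` and `cds_end` are illustrative assumptions, and the two fragments are copied from the first 21 and last 12 nucleotides of the CDS shown in this record.

```python
from itertools import product

# Standard genetic code, built in TCAG order (a common compact encoding).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame CDS fragment, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical variable names; the fragments come from the CDS above.
cds_start = "ATGAAACCCCATTTACCAGAA"  # first 21 nt of the CDS
cds_end = "TTTGTTACCTAG"             # last 12 nt of the CDS, ending in a TAG stop

print(translate(cds_start))  # MKPHLPE
print(translate(cds_end))    # FVT
```

The two results match the first seven and last three residues of the protein sequence above. Passing the full CDS (with line breaks removed) to the same function would allow the whole protein to be compared.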