; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010478 (gene) of Snake gourd v1 genome

Gene IDTan0010478
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG11:3493929..3496492
RNA-Seq ExpressionTan0010478
SyntenyTan0010478
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008459413.1 PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Cucumis melo]3.7e-28982Show/hide
Query:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL
        M+GVL N SPT IL NE NYQ DS Y      KH   CINVNP LKSC+RC I Y GN VSML MS PRLNLVVQS R ++FRT VGTLL C EDEAI L
Subjt:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL

Query:  VADEGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSL
        V DE   E S+EWK PPW DM +QDE  FQSEDVN  K+LEGEAL N++KVHFLEETD+V+LSKRILILSRKNKV+S LELFRSMQLAG+LP+LHALNSL
Subjt:  VADEGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSL

Query:  LACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATH
        LACLLRNG F DGLRIFEFMK N+LSTGHTYSL+LKAVA+ HGFLSALEMFKAWEH++   QFDAIVYNTMIS+CGK+NNW EAERTWRLME NGC ATH
Subjt:  LACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATH

Query:  ITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMS
        ITYSLLVSTFVRCNQNELAID YVKMVQ+ FKPGNDTMQAIIGASSKEGKWDFAL VFQDMLKCGLQPNS++FNALI+ALGKAKE+TLAF+IY++MKSM 
Subjt:  ITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMS

Query:  HSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYE
        HSPDVYTW+ALLGALY+ANRYNDAIHLF FVKREEK QLNIHIYN IL+ CSKLGLW+RALQILWEME SGLLIST+SYNIV++ACE ARKPEIALQVYE
Subjt:  HSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYE

Query:  RMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGSR
        RM+HQKHTPDTFTHLSLIR CIWGSLWDEVELLLNK  PDVSVYN VIQGMCLRGKTDLAKKLYTKMRENSIQ DGKTRALMLQNLPKDPARLKNRW S 
Subjt:  RMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGSR

Query:  FRKRHRHYHHR
        F+KR R YHHR
Subjt:  FRKRHRHYHHR

XP_022134520.1 pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Momordica charantia]4.1e-28881.34Show/hide
Query:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL
        M+GVLINPSPT +LSNELNYQ DS Y V+ A +  H C NVN  LKS +RC I Y GNV+SMLSMS+PRLNLV++STR +EF T VGTLL CEEDEAI L
Subjt:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL

Query:  VAD-EGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNS
        V D EG+E  S EWKSPPW D+   DE  FQSED N+ +MLE EA +N +KVHFLEETDEVMLSKRILILSRKNKV+S +ELFRSMQLAGLLPSLHALNS
Subjt:  VAD-EGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNS

Query:  LLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIAT
        LLACLLRN  FDDGLRIF  MK+NKLSTGHTYSLILKAVAD  GFLSALEMFKAWE E+D K FD IVYNTMISVCGKENNW EAERTWRLMEANGC AT
Subjt:  LLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIAT

Query:  HITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSM
         ITY LLVSTFVRC QNELAID YVKMVQN+FKPGNDTMQ IIGAS KEGKWDFALRVF+DMLKCGLQPNS+AFNALIHALGKAK++TLAFN+Y++MKSM
Subjt:  HITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSM

Query:  SHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVY
         H PD YTW+ALLG+LY+ANRYNDAI LFEFVKRE+K QLNIH+YN IL+SCSKLGLWDR +QILWEME SGLLIST+SYNIVISACEMARKP+IALQVY
Subjt:  SHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVY

Query:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS
        ERMIHQKHTPDTFTHLSL+R CIWGSLWDEVE+LLNK AP+VSVYNAVIQGMCLRGKTDLAK+LY+KMREN IQ DGKTRALMLQNLPKDPAR  NRW S
Subjt:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS

Query:  RFRKRHRHYHH
        RFRKR+RHYHH
Subjt:  RFRKRHRHYHH

XP_022134523.1 pentatricopeptide repeat-containing protein At3g29290 isoform X2 [Momordica charantia]4.1e-28881.34Show/hide
Query:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL
        M+GVLINPSPT +LSNELNYQ DS Y V+ A +  H C NVN  LKS +RC I Y GNV+SMLSMS+PRLNLV++STR +EF T VGTLL CEEDEAI L
Subjt:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL

Query:  VAD-EGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNS
        V D EG+E  S EWKSPPW D+   DE  FQSED N+ +MLE EA +N +KVHFLEETDEVMLSKRILILSRKNKV+S +ELFRSMQLAGLLPSLHALNS
Subjt:  VAD-EGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNS

Query:  LLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIAT
        LLACLLRN  FDDGLRIF  MK+NKLSTGHTYSLILKAVAD  GFLSALEMFKAWE E+D K FD IVYNTMISVCGKENNW EAERTWRLMEANGC AT
Subjt:  LLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIAT

Query:  HITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSM
         ITY LLVSTFVRC QNELAID YVKMVQN+FKPGNDTMQ IIGAS KEGKWDFALRVF+DMLKCGLQPNS+AFNALIHALGKAK++TLAFN+Y++MKSM
Subjt:  HITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSM

Query:  SHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVY
         H PD YTW+ALLG+LY+ANRYNDAI LFEFVKRE+K QLNIH+YN IL+SCSKLGLWDR +QILWEME SGLLIST+SYNIVISACEMARKP+IALQVY
Subjt:  SHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVY

Query:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS
        ERMIHQKHTPDTFTHLSL+R CIWGSLWDEVE+LLNK AP+VSVYNAVIQGMCLRGKTDLAK+LY+KMREN IQ DGKTRALMLQNLPKDPAR  NRW S
Subjt:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS

Query:  RFRKRHRHYHH
        RFRKR+RHYHH
Subjt:  RFRKRHRHYHH

XP_022990659.1 pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Cucurbita maxima]1.2e-28782.84Show/hide
Query:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL
        M+G+LIN  PT ILSNELNYQL SCY VSCA KHFH    V PELKSCIR RIT+GGNV SM SMSIPRLN VV+ST+ LEFRT       CEEDEAI L
Subjt:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL

Query:  VADEGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSL
        V DEGVEE S+EWKSPPW +++NQDEPIFQSEDVN+S++LEGE LV+D KV+FLEETDEVMLSKRILILSRKNKV+S +ELFRSM LAGLLPS HA NSL
Subjt:  VADEGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSL

Query:  LACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATH
        LACLLRNG FDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMF+ WEHE+D KQFDAIVYN MISVCGKENNW EAER WRLME NGC ATH
Subjt:  LACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATH

Query:  ITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMS
        +TYSLLVST+VRCNQNELAID+YVKMVQN +KP NDTMQAIIGASS+EG+WDFALRVFQ+MLKCGL+PNS+AFNALI+ALGKA E+TLAF+IY+ MKSM 
Subjt:  ITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMS

Query:  HSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEME-TSGLLISTSSYNIVISACEMARKPEIALQVY
        HSPDVYTW ALLGALY+ANRYNDAI LFEFVKREEK QLNIHIYN +LLSCSKLGLWDRALQILWEME  SG L+S SSYNIVISACEMARKPEIAL+VY
Subjt:  HSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEME-TSGLLISTSSYNIVISACEMARKPEIALQVY

Query:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS
        ERMIHQK TPDTFT LSLIRSCIWGSLWDEVELLL+K A D SVYNAVIQGMCLRGKTDLAKK YTKM E  IQPDGKTRALMLQ LPKD A LKNR  S
Subjt:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS

Query:  RFRKRHRHYHHR
        RF+KRHRHYHHR
Subjt:  RFRKRHRHYHHR

XP_038891011.1 pentatricopeptide repeat-containing protein At3g29290 [Benincasa hispida]3.1e-29683.82Show/hide
Query:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL
        M+G + NPSPT IL NE +YQ DS Y  S   K   PCINVNP  KSCIRC ITY GN VSML MS PRLNL+VQSTR +EFRT VGTLL C EDEAI L
Subjt:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL

Query:  VAD-EGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNS
        V D EGV E S+EWK PPW DM +QDEP FQSED N+SK+LEGE L ND+KVHFLEETD+VMLSKR+LILSRKNKV+S LELFRS+QLA LLPSLHALNS
Subjt:  VAD-EGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNS

Query:  LLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIAT
        LLACLLR G FDDGLRIFEFMKSN+L TGHTYSL+LKAVA+ HGFLSALEMFKAWEH++D  QFDA+VYNTMIS+CGK+NNW EAERTWRLME NGC AT
Subjt:  LLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIAT

Query:  HITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSM
        HITY+LLVSTFVRCNQNELAID YVKMVQN FKPGNDTMQAIIGASSKEGKWDFALRVF DMLKCGLQPNS+AFNALI+ALG AKE+TLAF+IY++MKSM
Subjt:  HITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSM

Query:  SHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVY
         HSPDVYTW+ALLGALY+ANRYNDAIHLFEFVKR EK QLNIHIYN IL+SCSKLGLWDRALQILWEME SGL IS SSYNIVI+ACEMARKPEIALQVY
Subjt:  SHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVY

Query:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS
        ERMIHQKHTPDTFTHLSLIR CIWGSLWDEVELLLNK APDVSVYNAVIQGMCLRGKTDLAKKLYTKMREN IQPDGKTRALMLQNLPKDPARLKNRW S
Subjt:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS

Query:  RFRKRHRHYHHR
         F+KRHRHYHHR
Subjt:  RFRKRHRHYHHR

TrEMBL top hitse value%identityAlignment
A0A1S3C9M1 pentatricopeptide repeat-containing protein At3g29290 isoform X11.8e-28982Show/hide
Query:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL
        M+GVL N SPT IL NE NYQ DS Y      KH   CINVNP LKSC+RC I Y GN VSML MS PRLNLVVQS R ++FRT VGTLL C EDEAI L
Subjt:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL

Query:  VADEGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSL
        V DE   E S+EWK PPW DM +QDE  FQSEDVN  K+LEGEAL N++KVHFLEETD+V+LSKRILILSRKNKV+S LELFRSMQLAG+LP+LHALNSL
Subjt:  VADEGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSL

Query:  LACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATH
        LACLLRNG F DGLRIFEFMK N+LSTGHTYSL+LKAVA+ HGFLSALEMFKAWEH++   QFDAIVYNTMIS+CGK+NNW EAERTWRLME NGC ATH
Subjt:  LACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATH

Query:  ITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMS
        ITYSLLVSTFVRCNQNELAID YVKMVQ+ FKPGNDTMQAIIGASSKEGKWDFAL VFQDMLKCGLQPNS++FNALI+ALGKAKE+TLAF+IY++MKSM 
Subjt:  ITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMS

Query:  HSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYE
        HSPDVYTW+ALLGALY+ANRYNDAIHLF FVKREEK QLNIHIYN IL+ CSKLGLW+RALQILWEME SGLLIST+SYNIV++ACE ARKPEIALQVYE
Subjt:  HSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYE

Query:  RMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGSR
        RM+HQKHTPDTFTHLSLIR CIWGSLWDEVELLLNK  PDVSVYN VIQGMCLRGKTDLAKKLYTKMRENSIQ DGKTRALMLQNLPKDPARLKNRW S 
Subjt:  RMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGSR

Query:  FRKRHRHYHHR
        F+KR R YHHR
Subjt:  FRKRHRHYHHR

A0A6J1BY07 pentatricopeptide repeat-containing protein At3g29290 isoform X12.0e-28881.34Show/hide
Query:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL
        M+GVLINPSPT +LSNELNYQ DS Y V+ A +  H C NVN  LKS +RC I Y GNV+SMLSMS+PRLNLV++STR +EF T VGTLL CEEDEAI L
Subjt:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL

Query:  VAD-EGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNS
        V D EG+E  S EWKSPPW D+   DE  FQSED N+ +MLE EA +N +KVHFLEETDEVMLSKRILILSRKNKV+S +ELFRSMQLAGLLPSLHALNS
Subjt:  VAD-EGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNS

Query:  LLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIAT
        LLACLLRN  FDDGLRIF  MK+NKLSTGHTYSLILKAVAD  GFLSALEMFKAWE E+D K FD IVYNTMISVCGKENNW EAERTWRLMEANGC AT
Subjt:  LLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIAT

Query:  HITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSM
         ITY LLVSTFVRC QNELAID YVKMVQN+FKPGNDTMQ IIGAS KEGKWDFALRVF+DMLKCGLQPNS+AFNALIHALGKAK++TLAFN+Y++MKSM
Subjt:  HITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSM

Query:  SHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVY
         H PD YTW+ALLG+LY+ANRYNDAI LFEFVKRE+K QLNIH+YN IL+SCSKLGLWDR +QILWEME SGLLIST+SYNIVISACEMARKP+IALQVY
Subjt:  SHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVY

Query:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS
        ERMIHQKHTPDTFTHLSL+R CIWGSLWDEVE+LLNK AP+VSVYNAVIQGMCLRGKTDLAK+LY+KMREN IQ DGKTRALMLQNLPKDPAR  NRW S
Subjt:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS

Query:  RFRKRHRHYHH
        RFRKR+RHYHH
Subjt:  RFRKRHRHYHH

A0A6J1C280 pentatricopeptide repeat-containing protein At3g29290 isoform X22.0e-28881.34Show/hide
Query:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL
        M+GVLINPSPT +LSNELNYQ DS Y V+ A +  H C NVN  LKS +RC I Y GNV+SMLSMS+PRLNLV++STR +EF T VGTLL CEEDEAI L
Subjt:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL

Query:  VAD-EGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNS
        V D EG+E  S EWKSPPW D+   DE  FQSED N+ +MLE EA +N +KVHFLEETDEVMLSKRILILSRKNKV+S +ELFRSMQLAGLLPSLHALNS
Subjt:  VAD-EGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNS

Query:  LLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIAT
        LLACLLRN  FDDGLRIF  MK+NKLSTGHTYSLILKAVAD  GFLSALEMFKAWE E+D K FD IVYNTMISVCGKENNW EAERTWRLMEANGC AT
Subjt:  LLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIAT

Query:  HITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSM
         ITY LLVSTFVRC QNELAID YVKMVQN+FKPGNDTMQ IIGAS KEGKWDFALRVF+DMLKCGLQPNS+AFNALIHALGKAK++TLAFN+Y++MKSM
Subjt:  HITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSM

Query:  SHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVY
         H PD YTW+ALLG+LY+ANRYNDAI LFEFVKRE+K QLNIH+YN IL+SCSKLGLWDR +QILWEME SGLLIST+SYNIVISACEMARKP+IALQVY
Subjt:  SHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVY

Query:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS
        ERMIHQKHTPDTFTHLSL+R CIWGSLWDEVE+LLNK AP+VSVYNAVIQGMCLRGKTDLAK+LY+KMREN IQ DGKTRALMLQNLPKDPAR  NRW S
Subjt:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS

Query:  RFRKRHRHYHH
        RFRKR+RHYHH
Subjt:  RFRKRHRHYHH

A0A6J1JQP6 pentatricopeptide repeat-containing protein At3g29290 isoform X23.2e-28682.68Show/hide
Query:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL
        M+G+LIN  PT ILSNELNYQL SCY VSCA KHFH    V PELKSCIR RIT+GGNV SM SMSIPRLN VV+ST+ LEFRT       CEEDEAI L
Subjt:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL

Query:  VADEGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSL
        V DEGVEE S+EWKSPPW +++NQDEPIFQSEDVN+S++LEGE LV+D KV+FLEETDEVMLSKRILILSRKNKV+S +ELFRSM LAGLLPS HA NSL
Subjt:  VADEGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSL

Query:  LACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATH
        LACLLRNG FDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMF+ WEHE+D KQFDAIVYN MISVCGKENNW EAER WRLME NGC ATH
Subjt:  LACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATH

Query:  ITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMS
        +TYSLLVST+VRCNQNELAID+YVKMVQN +KP NDTMQAIIGASS+EG+WDFALRVFQ+MLKCGL+PNS+AFNALI+ALGKA E+TLAF+IY+ MKSM 
Subjt:  ITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMS

Query:  HSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEME-TSGLLISTSSYNIVISACEMARKPEIALQVY
        HSPDVYTW ALLGALY+ANRYNDAI LFEFVKREEK QLNIHIYN +LLSCSKLGLWDRALQILWEME  SG L+S SSYNIVISACEMARKPEIAL+VY
Subjt:  HSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEME-TSGLLISTSSYNIVISACEMARKPEIALQVY

Query:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS
        ERMIHQK TPDTFT LSLIRSCIWGSLWDEVELLL+  A D SVYNAVIQGMCLRGKTDLAKK YTKM E  IQPDGKTRALMLQ LPKD A LKNR  S
Subjt:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS

Query:  RFRKRHRHYHHR
        RF+KRHRHYHHR
Subjt:  RFRKRHRHYHHR

A0A6J1JSM4 pentatricopeptide repeat-containing protein At3g29290 isoform X15.7e-28882.84Show/hide
Query:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL
        M+G+LIN  PT ILSNELNYQL SCY VSCA KHFH    V PELKSCIR RIT+GGNV SM SMSIPRLN VV+ST+ LEFRT       CEEDEAI L
Subjt:  MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGL

Query:  VADEGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSL
        V DEGVEE S+EWKSPPW +++NQDEPIFQSEDVN+S++LEGE LV+D KV+FLEETDEVMLSKRILILSRKNKV+S +ELFRSM LAGLLPS HA NSL
Subjt:  VADEGVEEPSQEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSL

Query:  LACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATH
        LACLLRNG FDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMF+ WEHE+D KQFDAIVYN MISVCGKENNW EAER WRLME NGC ATH
Subjt:  LACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATH

Query:  ITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMS
        +TYSLLVST+VRCNQNELAID+YVKMVQN +KP NDTMQAIIGASS+EG+WDFALRVFQ+MLKCGL+PNS+AFNALI+ALGKA E+TLAF+IY+ MKSM 
Subjt:  ITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMS

Query:  HSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEME-TSGLLISTSSYNIVISACEMARKPEIALQVY
        HSPDVYTW ALLGALY+ANRYNDAI LFEFVKREEK QLNIHIYN +LLSCSKLGLWDRALQILWEME  SG L+S SSYNIVISACEMARKPEIAL+VY
Subjt:  HSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEME-TSGLLISTSSYNIVISACEMARKPEIALQVY

Query:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS
        ERMIHQK TPDTFT LSLIRSCIWGSLWDEVELLL+K A D SVYNAVIQGMCLRGKTDLAKK YTKM E  IQPDGKTRALMLQ LPKD A LKNR  S
Subjt:  ERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGS

Query:  RFRKRHRHYHHR
        RF+KRHRHYHHR
Subjt:  RFRKRHRHYHHR

SwissProt top hitse value%identityAlignment
Q84J46 Pentatricopeptide repeat-containing protein At3g292902.9e-14355.31Show/hide
Query:  FQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTG
        F  E+V     LE +   + N++HFLEE +E  LSKR+  LSR +KV+S LELF SM+  GL P+ HA NS L+CLLRNG       +FEFM+  +  TG
Subjt:  FQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTG

Query:  HTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQ-FDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMV
        HTYSL+LKAVA+  G  SAL MF+  E E   +  FD ++YNT IS+CG+ NN  E ER WR+M+ +G I T ITYSLLVS FVRC ++ELA+DVY +MV
Subjt:  HTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQ-FDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMV

Query:  QNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHL
         N      D M A+I A +KE KWD AL++FQ MLK G++PN +A N LI++LGKA ++ L F +YS++KS+ H PD YTW+ALL ALY+ANRY D + L
Subjt:  QNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHL

Query:  FEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLW
        F+ ++ E    LN ++YN  ++SC KLG W++A+++L+EME SGL +STSSYN+VISACE +RK ++AL VYE M  +   P+TFT+LSL+RSCIWGSLW
Subjt:  FEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLW

Query:  DEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPK
        DEVE +L K  PDVS+YNA I GMCLR +   AK+LY KMRE  ++PDGKTRA+MLQNL K
Subjt:  DEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPK

Q9ASZ8 Pentatricopeptide repeat-containing protein At1g126201.9e-3025.75Show/hide
Query:  ETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFKAW
        E D V  S  I  L  + +V   LEL   M   G  P+L  LN+L+  L  NG   D + + + M          TY  +LK +  +     A+E+ +  
Subjt:  ETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFKAW

Query:  EHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFA
        E E   K  DA+ Y+ +I    K+ +   A   +  ME  G  A  I Y+ L+  F    + +    +   M++    P      A+I    KEGK   A
Subjt:  EHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFA

Query:  LRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKL
          + ++M++ G+ P+++ + +LI    K  +L  A ++  LM S    P++ T++ L+    +AN  +D + LF  +         +  YN ++    +L
Subjt:  LRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKL

Query:  GLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLWDEV-----ELLLNKFAPDVSVYNAVIQ
        G  + A ++  EM +  +     SY I++       +PE AL+++E++   K   D   +  +I      S  D+       L L    PDV  YN +I 
Subjt:  GLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLWDEV-----ELLLNKFAPDVSVYNAVIQ

Query:  GMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQ
        G+C +G    A  L+ KM E+   P+G T  ++++
Subjt:  GMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQ

Q9CAN0 Pentatricopeptide repeat-containing protein At1g63130, mitochondrial3.3e-3024.41Show/hide
Query:  VMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIF-EFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEH
        V  SK +  +++ NK   V+ L   MQ  G+  +L+  + L+ C  R       L +  + MK        T + +L      +    A+ +    +   
Subjt:  VMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIF-EFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEH

Query:  DFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPG-----------------ND-----
           Q D+  +NT+I    + N  +EA      M   GC    +TY ++V+   +    +LA+ +  KM Q   +PG                 ND     
Subjt:  DFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPG-----------------ND-----

Query:  -------------TMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYND
                     T  ++I      G+W  A R+  DM++  + PN + F+ALI A  K  +L  A  +Y  M   S  PD++T+S+L+      +R ++
Subjt:  -------------TMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYND

Query:  AIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRS-CI
        A H+FE +  ++    N+  YN ++    K    D  +++  EM   GL+ +T +Y  +I     AR+ + A  V+++M+     PD  T+  L+   C 
Subjt:  AIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRS-CI

Query:  WGSLWDEVELLL------NKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALML
         G +  E  L++      +K  PD+  YN +I+GMC  GK +    L+  +    ++P+  T   M+
Subjt:  WGSLWDEVELLL------NKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALML

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial2.1e-2924.49Show/hide
Query:  ETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFKAW
        E D V+ +  +  L  + +V   LEL   M   G  P+L  LN+L+  L  NG   D + + + M          TY  +L  +  +     A+E+ +  
Subjt:  ETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFKAW

Query:  EHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFA
        E E + K  DA+ Y+ +I    K+ +   A   +  ME  G  A  ITY+ L+  F    + +    +   M++    P   T   +I +  KEGK   A
Subjt:  EHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFA

Query:  LRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKL
         ++ ++M++ G+ PN+I +N+LI    K   L  A  +  LM S    PD+ T++ L+    +ANR +D + LF                          
Subjt:  LRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKL

Query:  GLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRS-CIWGSLWDEVELL----LNKFAPDVSVYNAVIQ
                   EM   G++ +T +YN ++     + K E+A ++++ M+ ++  PD  ++  L+   C  G L   +E+      +K   D+ +Y  +I 
Subjt:  GLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRS-CIWGSLWDEVELL----LNKFAPDVSVYNAVIQ

Query:  GMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNL
        GMC   K D A  L+  +    ++ D +   +M+  L
Subjt:  GMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNL

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028604.0e-2822.32Show/hide
Query:  DEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLS-TGHTYSLIL----KAVADTHGFLSALEMFK
        D  +++  I +L ++ +V S   +F  +Q  G    +++  SL++    +G + + + +F+ M+ +    T  TY++IL    K     +   S +E  K
Subjt:  DEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLS-TGHTYSLIL----KAVADTHGFLSALEMFK

Query:  AWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWD
        +     D    DA  YNT+I+ C + +   EA + +  M+A G     +TY+ L+  + + ++ + A+ V  +MV N F P   T  ++I A +++G  D
Subjt:  AWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWD

Query:  FALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCS
         A+ +   M + G +P+   +  L+    +A ++  A +I+  M++    P++ T++A +       ++ + + +F+ +     +  +I  +N +L    
Subjt:  FALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCS

Query:  KLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLL-----NKFAPDVSVYNAV
        + G+      +  EM+ +G +    ++N +ISA       E A+ VY RM+    TPD  T+ +++ +   G +W++ E +L      +  P+   Y ++
Subjt:  KLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLL-----NKFAPDVSVYNAV

Query:  IQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNL
        +       +  L   L  ++    I+P    RA++L+ L
Subjt:  IQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNL

Arabidopsis top hitse value%identityAlignment
AT1G12620.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-3125.75Show/hide
Query:  ETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFKAW
        E D V  S  I  L  + +V   LEL   M   G  P+L  LN+L+  L  NG   D + + + M          TY  +LK +  +     A+E+ +  
Subjt:  ETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFKAW

Query:  EHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFA
        E E   K  DA+ Y+ +I    K+ +   A   +  ME  G  A  I Y+ L+  F    + +    +   M++    P      A+I    KEGK   A
Subjt:  EHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFA

Query:  LRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKL
          + ++M++ G+ P+++ + +LI    K  +L  A ++  LM S    P++ T++ L+    +AN  +D + LF  +         +  YN ++    +L
Subjt:  LRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKL

Query:  GLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLWDEV-----ELLLNKFAPDVSVYNAVIQ
        G  + A ++  EM +  +     SY I++       +PE AL+++E++   K   D   +  +I      S  D+       L L    PDV  YN +I 
Subjt:  GLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLWDEV-----ELLLNKFAPDVSVYNAVIQ

Query:  GMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQ
        G+C +G    A  L+ KM E+   P+G T  ++++
Subjt:  GMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQ

AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-3024.49Show/hide
Query:  ETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFKAW
        E D V+ +  +  L  + +V   LEL   M   G  P+L  LN+L+  L  NG   D + + + M          TY  +L  +  +     A+E+ +  
Subjt:  ETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFKAW

Query:  EHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFA
        E E + K  DA+ Y+ +I    K+ +   A   +  ME  G  A  ITY+ L+  F    + +    +   M++    P   T   +I +  KEGK   A
Subjt:  EHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWDFA

Query:  LRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKL
         ++ ++M++ G+ PN+I +N+LI    K   L  A  +  LM S    PD+ T++ L+    +ANR +D + LF                          
Subjt:  LRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCSKL

Query:  GLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRS-CIWGSLWDEVELL----LNKFAPDVSVYNAVIQ
                   EM   G++ +T +YN ++     + K E+A ++++ M+ ++  PD  ++  L+   C  G L   +E+      +K   D+ +Y  +I 
Subjt:  GLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRS-CIWGSLWDEVELL----LNKFAPDVSVYNAVIQ

Query:  GMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNL
        GMC   K D A  L+  +    ++ D +   +M+  L
Subjt:  GMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNL

AT1G63130.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.3e-3124.41Show/hide
Query:  VMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIF-EFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEH
        V  SK +  +++ NK   V+ L   MQ  G+  +L+  + L+ C  R       L +  + MK        T + +L      +    A+ +    +   
Subjt:  VMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIF-EFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEH

Query:  DFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPG-----------------ND-----
           Q D+  +NT+I    + N  +EA      M   GC    +TY ++V+   +    +LA+ +  KM Q   +PG                 ND     
Subjt:  DFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPG-----------------ND-----

Query:  -------------TMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYND
                     T  ++I      G+W  A R+  DM++  + PN + F+ALI A  K  +L  A  +Y  M   S  PD++T+S+L+      +R ++
Subjt:  -------------TMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYND

Query:  AIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRS-CI
        A H+FE +  ++    N+  YN ++    K    D  +++  EM   GL+ +T +Y  +I     AR+ + A  V+++M+     PD  T+  L+   C 
Subjt:  AIHLFEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRS-CI

Query:  WGSLWDEVELLL------NKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALML
         G +  E  L++      +K  PD+  YN +I+GMC  GK +    L+  +    ++P+  T   M+
Subjt:  WGSLWDEVELLL------NKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALML

AT3G29290.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-14455.31Show/hide
Query:  FQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTG
        F  E+V     LE +   + N++HFLEE +E  LSKR+  LSR +KV+S LELF SM+  GL P+ HA NS L+CLLRNG       +FEFM+  +  TG
Subjt:  FQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLSTG

Query:  HTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQ-FDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMV
        HTYSL+LKAVA+  G  SAL MF+  E E   +  FD ++YNT IS+CG+ NN  E ER WR+M+ +G I T ITYSLLVS FVRC ++ELA+DVY +MV
Subjt:  HTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQ-FDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMV

Query:  QNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHL
         N      D M A+I A +KE KWD AL++FQ MLK G++PN +A N LI++LGKA ++ L F +YS++KS+ H PD YTW+ALL ALY+ANRY D + L
Subjt:  QNYFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHL

Query:  FEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLW
        F+ ++ E    LN ++YN  ++SC KLG W++A+++L+EME SGL +STSSYN+VISACE +RK ++AL VYE M  +   P+TFT+LSL+RSCIWGSLW
Subjt:  FEFVKREEKTQLNIHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLW

Query:  DEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPK
        DEVE +L K  PDVS+YNA I GMCLR +   AK+LY KMRE  ++PDGKTRA+MLQNL K
Subjt:  DEVELLLNKFAPDVSVYNAVIQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPK

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein2.8e-2922.32Show/hide
Query:  DEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLS-TGHTYSLIL----KAVADTHGFLSALEMFK
        D  +++  I +L ++ +V S   +F  +Q  G    +++  SL++    +G + + + +F+ M+ +    T  TY++IL    K     +   S +E  K
Subjt:  DEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFMKSNKLS-TGHTYSLIL----KAVADTHGFLSALEMFK

Query:  AWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWD
        +     D    DA  YNT+I+ C + +   EA + +  M+A G     +TY+ L+  + + ++ + A+ V  +MV N F P   T  ++I A +++G  D
Subjt:  AWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNYFKPGNDTMQAIIGASSKEGKWD

Query:  FALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCS
         A+ +   M + G +P+   +  L+    +A ++  A +I+  M++    P++ T++A +       ++ + + +F+ +     +  +I  +N +L    
Subjt:  FALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLNIHIYNNILLSCS

Query:  KLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLL-----NKFAPDVSVYNAV
        + G+      +  EM+ +G +    ++N +ISA       E A+ VY RM+    TPD  T+ +++ +   G +W++ E +L      +  P+   Y ++
Subjt:  KLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLL-----NKFAPDVSVYNAV

Query:  IQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNL
        +       +  L   L  ++    I+P    RA++L+ L
Subjt:  IQGMCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGAGTGCTTATAAATCCATCTCCCACTCCAATTTTATCAAATGAGTTGAATTACCAACTTGACTCGTGTTACCTGGTTAGTTGTGCACCCAAGCATTTCCATCC
GTGCATAAATGTTAATCCAGAATTGAAATCATGTATAAGGTGTAGGATAACGTATGGGGGTAATGTAGTTTCAATGTTATCGATGAGTATTCCACGATTGAATTTGGTGG
TTCAGTCCACAAGGGTCCTGGAGTTTAGGACTAGTGTTGGGACCCTTTTGAAATGTGAAGAGGATGAGGCGATCGGATTGGTCGCTGATGAAGGAGTTGAAGAACCCTCT
CAGGAGTGGAAATCACCTCCCTGGGTAGACATGGAAAATCAGGATGAGCCAATCTTTCAATCTGAAGATGTAAACCGGTCCAAAATGTTAGAAGGGGAGGCTTTGGTAAA
CGACAACAAGGTGCATTTTCTTGAGGAAACTGACGAAGTTATGCTATCAAAGCGTATTTTAATTCTCAGTAGAAAAAATAAGGTCAAAAGTGTGCTGGAATTATTCAGGT
CCATGCAATTAGCAGGTCTTCTGCCAAGTTTGCATGCTTTAAATTCACTTTTAGCTTGTCTTTTGAGGAATGGGCCATTTGATGACGGTTTAAGAATCTTTGAGTTTATG
AAGTCAAACAAGTTATCAACAGGGCACACTTATAGCCTCATACTCAAAGCAGTTGCAGATACTCATGGATTTCTTTCTGCTCTTGAGATGTTCAAGGCATGGGAGCACGA
ACACGACTTCAAACAGTTTGATGCAATTGTTTATAACACAATGATATCAGTGTGTGGAAAAGAGAATAACTGGGCTGAAGCTGAGAGAACATGGAGACTAATGGAGGCAA
ATGGTTGTATCGCAACACACATAACTTATTCTCTATTGGTGAGCACTTTTGTCCGCTGCAATCAGAACGAACTTGCAATTGACGTTTATGTAAAGATGGTTCAAAATTAT
TTTAAACCAGGTAATGATACAATGCAAGCTATTATTGGGGCATCTTCAAAGGAAGGGAAGTGGGATTTTGCTTTAAGAGTCTTTCAAGACATGTTGAAATGCGGACTCCA
ACCTAATTCCATTGCATTCAATGCCTTGATCCATGCTCTAGGAAAAGCTAAGGAGCTCACTTTAGCATTCAACATATACAGTTTGATGAAATCTATGAGTCATTCACCTG
ATGTTTATACATGGAGTGCTCTACTTGGTGCTCTTTATCAGGCTAATCGCTACAATGATGCCATTCATCTCTTTGAGTTTGTGAAAAGAGAGGAGAAGACACAACTGAAT
ATACATATTTACAATAATATTCTATTGTCTTGTTCAAAGCTTGGGTTATGGGACAGGGCTCTCCAAATTTTATGGGAAATGGAGACTTCTGGTCTCTTAATTTCAACATC
ATCATATAATATTGTTATTAGTGCATGTGAGATGGCTAGGAAGCCAGAAATTGCATTACAAGTTTATGAACGCATGATTCATCAGAAGCACACTCCTGATACCTTCACTC
ATTTGTCACTTATCAGAAGCTGCATTTGGGGATCTTTATGGGATGAAGTGGAACTACTTCTAAATAAGTTTGCACCCGACGTATCTGTGTACAATGCTGTCATACAAGGA
ATGTGCTTAAGAGGCAAGACCGATTTAGCGAAAAAGCTTTACACGAAGATGCGCGAAAACAGTATCCAACCTGATGGAAAAACACGAGCTTTGATGCTTCAGAACTTGCC
AAAGGATCCTGCTAGACTGAAGAACAGGTGGGGTTCTCGTTTCAGGAAAAGACACAGACATTATCACCACAGGTAA
mRNA sequenceShow/hide mRNA sequence
GGCTCACTGATTTGGAATGACTACCCCTGGATTGTAAACTTTCTGGTTCTTTTTATGTTTGGTAAATTATGAAAGGAGTGCTTATAAATCCATCTCCCACTCCAATTTTA
TCAAATGAGTTGAATTACCAACTTGACTCGTGTTACCTGGTTAGTTGTGCACCCAAGCATTTCCATCCGTGCATAAATGTTAATCCAGAATTGAAATCATGTATAAGGTG
TAGGATAACGTATGGGGGTAATGTAGTTTCAATGTTATCGATGAGTATTCCACGATTGAATTTGGTGGTTCAGTCCACAAGGGTCCTGGAGTTTAGGACTAGTGTTGGGA
CCCTTTTGAAATGTGAAGAGGATGAGGCGATCGGATTGGTCGCTGATGAAGGAGTTGAAGAACCCTCTCAGGAGTGGAAATCACCTCCCTGGGTAGACATGGAAAATCAG
GATGAGCCAATCTTTCAATCTGAAGATGTAAACCGGTCCAAAATGTTAGAAGGGGAGGCTTTGGTAAACGACAACAAGGTGCATTTTCTTGAGGAAACTGACGAAGTTAT
GCTATCAAAGCGTATTTTAATTCTCAGTAGAAAAAATAAGGTCAAAAGTGTGCTGGAATTATTCAGGTCCATGCAATTAGCAGGTCTTCTGCCAAGTTTGCATGCTTTAA
ATTCACTTTTAGCTTGTCTTTTGAGGAATGGGCCATTTGATGACGGTTTAAGAATCTTTGAGTTTATGAAGTCAAACAAGTTATCAACAGGGCACACTTATAGCCTCATA
CTCAAAGCAGTTGCAGATACTCATGGATTTCTTTCTGCTCTTGAGATGTTCAAGGCATGGGAGCACGAACACGACTTCAAACAGTTTGATGCAATTGTTTATAACACAAT
GATATCAGTGTGTGGAAAAGAGAATAACTGGGCTGAAGCTGAGAGAACATGGAGACTAATGGAGGCAAATGGTTGTATCGCAACACACATAACTTATTCTCTATTGGTGA
GCACTTTTGTCCGCTGCAATCAGAACGAACTTGCAATTGACGTTTATGTAAAGATGGTTCAAAATTATTTTAAACCAGGTAATGATACAATGCAAGCTATTATTGGGGCA
TCTTCAAAGGAAGGGAAGTGGGATTTTGCTTTAAGAGTCTTTCAAGACATGTTGAAATGCGGACTCCAACCTAATTCCATTGCATTCAATGCCTTGATCCATGCTCTAGG
AAAAGCTAAGGAGCTCACTTTAGCATTCAACATATACAGTTTGATGAAATCTATGAGTCATTCACCTGATGTTTATACATGGAGTGCTCTACTTGGTGCTCTTTATCAGG
CTAATCGCTACAATGATGCCATTCATCTCTTTGAGTTTGTGAAAAGAGAGGAGAAGACACAACTGAATATACATATTTACAATAATATTCTATTGTCTTGTTCAAAGCTT
GGGTTATGGGACAGGGCTCTCCAAATTTTATGGGAAATGGAGACTTCTGGTCTCTTAATTTCAACATCATCATATAATATTGTTATTAGTGCATGTGAGATGGCTAGGAA
GCCAGAAATTGCATTACAAGTTTATGAACGCATGATTCATCAGAAGCACACTCCTGATACCTTCACTCATTTGTCACTTATCAGAAGCTGCATTTGGGGATCTTTATGGG
ATGAAGTGGAACTACTTCTAAATAAGTTTGCACCCGACGTATCTGTGTACAATGCTGTCATACAAGGAATGTGCTTAAGAGGCAAGACCGATTTAGCGAAAAAGCTTTAC
ACGAAGATGCGCGAAAACAGTATCCAACCTGATGGAAAAACACGAGCTTTGATGCTTCAGAACTTGCCAAAGGATCCTGCTAGACTGAAGAACAGGTGGGGTTCTCGTTT
CAGGAAAAGACACAGACATTATCACCACAGGTAACAGAAATGTAAAAGAAGTTGACAAAAGATTAAGTTACTGTAAATGAAGTATATATTGTACATGCATAATGCTAGCT
CAAAGTCATTGTTGTATAGCTGAGAAGAATGGCTGGTTAATTCAATCAGTTGGATTATATTCCATCCAGAACCTGTTTTCTTCTGTCTCTAGTCCTCACCTTAATATGAT
CCCAGA
Protein sequenceShow/hide protein sequence
MKGVLINPSPTPILSNELNYQLDSCYLVSCAPKHFHPCINVNPELKSCIRCRITYGGNVVSMLSMSIPRLNLVVQSTRVLEFRTSVGTLLKCEEDEAIGLVADEGVEEPS
QEWKSPPWVDMENQDEPIFQSEDVNRSKMLEGEALVNDNKVHFLEETDEVMLSKRILILSRKNKVKSVLELFRSMQLAGLLPSLHALNSLLACLLRNGPFDDGLRIFEFM
KSNKLSTGHTYSLILKAVADTHGFLSALEMFKAWEHEHDFKQFDAIVYNTMISVCGKENNWAEAERTWRLMEANGCIATHITYSLLVSTFVRCNQNELAIDVYVKMVQNY
FKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQPNSIAFNALIHALGKAKELTLAFNIYSLMKSMSHSPDVYTWSALLGALYQANRYNDAIHLFEFVKREEKTQLN
IHIYNNILLSCSKLGLWDRALQILWEMETSGLLISTSSYNIVISACEMARKPEIALQVYERMIHQKHTPDTFTHLSLIRSCIWGSLWDEVELLLNKFAPDVSVYNAVIQG
MCLRGKTDLAKKLYTKMRENSIQPDGKTRALMLQNLPKDPARLKNRWGSRFRKRHRHYHHR