; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001529 (gene) of Snake gourd v1 genome

Gene IDTan0001529
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG05:85084666..85088366
RNA-Seq ExpressionTan0001529
SyntenyTan0001529
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0080156 - mitochondrial mRNA modification (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005739 - mitochondrion (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137435.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Momordica charantia]0.0e+0089.8Show/hide
Query:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MN I IQT V KTL SPLRLISSVAT D ASNFSF++++T SL DPLQLLNDYVKSRKCSLKNTKV+HAKLLRATLLHS+IYV+NSLLDCYSKS AMD+A
Subjt:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD
        LKLFD MLH NVISWNI ISGFNQNFLFL+SWRTFCRMHFLGFEPSEITYGSVLSACAA+QAPMFGKQ+YSL VRNG FVNGYVRAGMIDLFAK SSF D
Subjt:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD

Query:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE
        ALRVF+DV+CENVVCWNAIVSAAVRNGEN +ALDLF+TMCSGFLEPNSFTFSSVLTACAA+EDLEFGKRVQGRVIKCGG+DVFVETALIDLYAKCGD+DE
Subjt:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE

Query:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID
        AVKTFLQMPIRNVVSWTAIISGFVQKND  MALK F+DMR LGEEINSYTVTSVLTACANPAM KEAIQLHSWILKAGF S+AVVV+ALINMYSK+G ID
Subjt:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID

Query:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE
        LSMMVFRE+D+QRNLSSW AMITSFAQN +KE+A ELF+KMLQE+IGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSL TMYSKCGYLEE
Subjt:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE

Query:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE
        AFQ F NMPKKD+VSWASMISCFSEH YAKEAI LFR+MLFEEYVPDH TLSAVLT CSVLHSIQIGREIHGYSVRVGLGKDVAIGG LVTMYSKC NLE
Subjt:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE

Query:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI
         ARRVFETLPQKD IACSSLVSGYAQHK I+EAL LF DLLV GLAIDPFS+SSILGAIA+L+RPGIG QLHALIMKVGLEKDVSVGSSLVMVYSK GSI
Subjt:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI

Query:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
        EDCCKAFEQ GKPDLIGWTAMIVSYAQHGKG EAL VYELMK+EGIKPDPVTFVGVLSACSH+GLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
Subjt:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG

Query:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL
        +LKEAEELIN+MPIEPDALIWGTLLAACKVHGDIE GKLAA+KVMELKP DTGAYVSLSNICADMGLWEEVLN+RSLMKGAGVTKEPGWS L
Subjt:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL

XP_022923751.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita moschata]0.0e+0089.69Show/hide
Query:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MNF  I TFV KTL S  RLISSVATVDNAS+FSFT+I+TY LFDP QLL+DYVKSRKCSL+NTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN++DHA
Subjt:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD
        LKLFDTMLHPNVISWNI IS FN NFL+LDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAK SSFLD
Subjt:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD

Query:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE
        ALRVFHD+ CENVVCWNAIVSAAVRNGEN MALDL++TMC G LEPNSFTFSSVLTACAALE  EFGKRVQG+VIKCGG+DVFVETALIDLY+KCG+MDE
Subjt:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE

Query:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID
        AVK FL+MPIRNVVSWTAIISGFVQKND+LMALKFF+DMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWIL+AG+SSHAVV AALINMYSK+GAID
Subjt:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID

Query:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE
        LSM VF EMDNQRNLSSWTAMITSFAQNN+KE+A ELF+KML+E++GPDTFCTSSVLSVTDCITFGRQIHC+T KTGLIFD+SVGS+L TMYSKCGYLEE
Subjt:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE

Query:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE
        AF VF NM KKDN+SWASM+SCFSEH YAKE IQLFREMLFEEYVPD+  LS VL ACSVLHSIQIGREIH YSVR+GL KDVAIGGSLVTMYSKC NLE
Subjt:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE

Query:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI
         ARRVFETLP+KD+IACSSLVSGYAQHKCIKE +LLF+DLL AGLAIDPFSISSILGAIALLNRPGIGTQLHA+I KVGLEKDVSVGSSLVMVYSK GSI
Subjt:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI

Query:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
        EDCCKAFEQ GKPDLIGWTAMIVSYAQHGKG EAL VYELMK+EGIKPDPVTFVGVLSACSH+GLVDEAYFHLNSMVKDYGIQPG+RHYACMVDLLGRCG
Subjt:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG

Query:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL
        +LK AEELIN+MPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS+L
Subjt:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL

XP_023001341.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita maxima]0.0e+0089.24Show/hide
Query:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MNF  I TFV KTL S  RLISSVATVDNAS+FSFT+I+TY LFDP QLL+DYVKSRKCSL++TKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN++DHA
Subjt:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD
        LKLFDTMLHPNVISWNI IS FN NFL+LDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAK SSFLD
Subjt:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD

Query:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE
        ALRVF DV+CENVVCWNAIVSAAVRNGEN MALDL++TMC GFLEPNSFTFSSVLTACAALE  EFGKRVQG+VIKCGG+DVFVETALIDLY+KCG+MDE
Subjt:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE

Query:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID
        AVK FL+MPIRNVVSWTAIISGFVQKND+LMALKFF+DMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWIL+AGFSSHAVV AALINMYSK+GAID
Subjt:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID

Query:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE
        LSM VF EMDNQRNLSSWTAMITSFAQNN+KE+A ELF+KML+E++GPDTFCTSSVLSVTDCITFGRQIHC+T KTGL+F +SVGS+L TMYSKCGYLEE
Subjt:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE

Query:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE
        AF VF NMPKKD++SWASM+SCFSEH YAKE IQLFREMLFEEYVPD   L+ VL ACSVLHSIQIGREIH YSVR+GL KDVAIGGSLVTMYSKC NLE
Subjt:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE

Query:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI
         ARRVFETLP+KD+IACSSLVSGYAQHKCIKE +LLF+DLL AGLAIDPFSISSILGAIALLNRPGIGTQLHA+I KVGLEKDVS+GSSLVMVYSK GSI
Subjt:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI

Query:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
        EDCCKAFEQ GKPDLIGWTAMIVSYAQHGKG EAL VYELMK+EGIKPDPVTFVGVLSACSH+GLVDEAYFHLNSMVKDYGIQPG+RHYACMVDLLGRCG
Subjt:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG

Query:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL
        +LK AEELIN+MPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRS MKGAGVTKEPGWS+L
Subjt:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL

XP_023519257.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0089.91Show/hide
Query:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MNF  I TFV KTL S  RLISSVATVDNAS+FSFT+I+TY LFDP QLL+DYVKSRKCSL+NTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN++DHA
Subjt:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD
        LKLFDTMLHPNVISWNI IS FN NF++LDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAK SSFLD
Subjt:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD

Query:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE
        ALRVFHDV+CENVVCWNAIVSAAVRNGEN MALDL++TMC GFLEPNSFTFSSVLTACAALE  EFGKRVQG+VIKCGG+DVFVETALIDLY+KCG+MDE
Subjt:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE

Query:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID
        AVK FL+MPIRNVVSWTAIISGFVQKND+LMALKFF+DMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWIL+AGFSSHAVV AALINMYSK+GAID
Subjt:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID

Query:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE
        LSM VF EMDN+RNLSSWTAMITSFAQNN+KE+A ELF+KML+E++GPDTFCTSSVLSVTDCITFGRQIHC+T KTGLIFD+SVGS+L TMYSKCGYLEE
Subjt:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE

Query:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE
        AF VF NMPKKDN+SWASM+SCFSEH YAKE IQLFREMLFEEYVPD+  LS VL ACSVLHSIQIGREIH YSVR+GL KDVAIGGSLVTMYSKC NLE
Subjt:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE

Query:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI
         ARRVFETLP+KD+IACSSLVSGYAQHKCIKE +LLF+DLL AGLAIDPFSISSILGAIALLNRPGIGTQLHA+I KVGLEKDVSVGSSLVMVYSK GSI
Subjt:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI

Query:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
        EDCCKAFEQ GKPDLIGWTAMIVSYAQHGKG EAL VYELMK+EGIKPDPVTFVGVLSACSH+GLVDEAYFHLNSMVKDYGIQPG+RHYACMVDLLGRCG
Subjt:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG

Query:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL
        +LK AEELIN+MPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS+L
Subjt:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL

XP_038893557.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Benincasa hispida]0.0e+0090.13Show/hide
Query:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MNFIAIQTFV KTL SP  L+SSVATVD+ SNFSFT+I+T+   DPLQ LND+VKSRKCSL+NTKVLHAKLLRA LLHSNIYVSNSLLDCYSKSNAMDHA
Subjt:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD
        LKLFDTMLHPNVISWNI ISGFN  FL LD+ RTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAK SSFLD
Subjt:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD

Query:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE
        ALRVFHDV+CENVVCWNAIVSAAVRNGENLMALDLF+TMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGG+DVFVETALID YAKCGD DE
Subjt:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE

Query:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID
        AVK FL+MPIRNVVSWTAIISGFVQ ND+LMALKFFEDMRK GEEINSYTVTSVLTACANPAMTKEA QLHSWILKAGFSSHAVV AALINMYSK+GAID
Subjt:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID

Query:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE
        LS+MVFREMDNQRNLSSWTAMITSFAQNN+KE A ELF KML+E++GPDTFCTSSVLSVTDCITFGR+IHCYTLKTGLIFDVSVGSSL TMYSKCG+L+E
Subjt:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE

Query:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE
        AFQVF NM +KDNVSWASMISCF EH YA EAIQLFREMLFEEYVPDH TLSAVLTACSVLHSIQIGREIHGYSVR GLGKDVA+G SLV MYSKC NL 
Subjt:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE

Query:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI
         ARR+FE LPQKDHIACSSL+SGYAQ KC ++A LLFRDLLVAGLAIDPFSISSILGAIALLNRP IGTQ+HA+IMKVGLEKDVSVGSSLVMVYSK GS+
Subjt:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI

Query:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
        EDCCKAF Q GKPDLIGWTAMIVSYAQHGKG EAL VYELMK+EG KPDPVTFVGVLSACSH+GLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
Subjt:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG

Query:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL
        KLKEAEELIN MPIEPDAL+WGTLLAACKVHGDIELGKLAARKVMELKPSDTGA+VSLSNICADMGLWEEVL VRSLMKGAGVTKEPGWS L
Subjt:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL

TrEMBL top hitse value%identityAlignment
A0A1S3B4I2 pentatricopeptide repeat-containing protein At1g74600, chloroplastic0.0e+0081.95Show/hide
Query:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        M+FIAIQT V KTL SP RL+SSVATVDN SNFSFT+I+T++ F+P++LLND+VK    SL+NTKVLHAK LR T    +IYVSNSLL CYSKSNAMDHA
Subjt:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD
        LKLFDT+L+PNVISWN  I+GFN NFL LDS R FC MH+LGF+P+E+T GSVLSACAAIQA MFGKQVYSLAVRNGFF NGYVR  MIDLFAK S FLD
Subjt:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD

Query:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE
        ALRVFHDV+C NVVCWNAIVSAAV NGE LMALDLF+ MCS FLEPNSFTFSSVLTAC+AL+DLEFGK VQGRVIKCGG DVFVETAL+ LYAKCGDMDE
Subjt:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE

Query:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID
        AVK F QMPIRNVVSWT I+SGFVQ ND+LM +K FED+RK+GEEINSYTVT++L ACANP M KEA QLHSWILKAGFSS A VVAALI MYSK+GAID
Subjt:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID

Query:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE
        LS+MVFREMDN RNLSSWTAMI S A+NN+KE A +LF KML+E + PD+ CTS++LS+TDCITFGRQIHCYTLKT LIF+VSVGSSL TMYSKCG+L+E
Subjt:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE

Query:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE
        AFQVF NMP+KDNVSW  MISCFSEH +A+EAIQLFREMLFEE VPD T+LSAVLTAC  L SIQ+GREIHGYS+RVGL ++V+ G SLVTMYSKC NL 
Subjt:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE

Query:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI
         ARRVFETLPQKD I CSSLVSGYAQ KCIKEALLLFR LLVAGLAIDPFSISSILG IALL RP IGTQ+HALI+KVGLEKDVSVGSSLVMVYS+ GSI
Subjt:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI

Query:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
        EDCCKAF Q GKPDLIGWT+MIVSYAQHGKG EAL  YELMK+EG KPDPVTFVGVLSACSH+GLVDEAYFHLNSMV+DYGIQPG RHYAC+VDLLGRCG
Subjt:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG

Query:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL
        KLKEAEELIN MPIEPDALIWGTLLAACKVHGDIELGKLAARKVMEL P DTGAYVSLSNICADMGLWEEVLNVRSLMK AGVTKE GWS L
Subjt:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL

A0A5D3BIJ5 Pentatricopeptide repeat-containing protein0.0e+0081.84Show/hide
Query:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        M+FIAIQT V KTL SP RL+SSVATVDN SNFSFT+I+T++ F+P++LLND+VK    SL+NTKVLHAK LR T    +IYVSNSLL CYSKSNAMDHA
Subjt:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD
        LKLFDT+L+PNVISWN  I+GFN NFL LDS R FC MH+LGF+P+E+T GSVLSACAAIQA MFGKQVYSLAVRNGFF NGYVR  MIDLFAK S FLD
Subjt:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD

Query:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE
        ALRVFHDV+C NVVCWNAIVSAAV NGE LMALDLF+ MCS FLEPNSFTFSSVLTAC+AL+DLEFGK VQGRVIKCGG DVFVETAL+ LYAKCGDMDE
Subjt:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE

Query:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID
        AVK F QMPIRNVVSWT I+SGFVQ ND+LM +K FED+RK+GEEINSYTVT++L ACANP M KEA QLHSWILKAGFSS A VVAALI MYSK+GAID
Subjt:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID

Query:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE
        LS+MVFREMDN RNLSSWTAMI S A+NN+KE A +LF KML+E + PD+ CTS++LS+TDCITFGRQIHCYTLKT LIF+VSVGSSL TMYSKCG+L+E
Subjt:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE

Query:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE
        AFQVF NMP+KDNVSW  MISCFSEH +A+EAIQLFREMLFEE VPD T+LSAVLTAC  L SIQ+GREIHGYS+RVGL ++V+ G SLVTMYSKC NL 
Subjt:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE

Query:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI
         ARRVFETLPQKD I CSSLVSGYAQ KCIKEALLLFR LLVAGLAIDPFSISSILG IALL RP IGTQ+HALI+KVGLEKDVSVGSSLVMVYS+ GSI
Subjt:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI

Query:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
        EDCCKAF Q GKPDLIGWT+MIVSYAQHGKG EAL  YELMK+EG KPDPVTFVGVLSACSH+GLVDEAYFHLNSMV+DYGIQPG RHYAC+VDLLGRCG
Subjt:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG

Query:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEE
        KLKEAEELIN MPIEPDALIWGTLLAACKVHGDIELGKLAARKVMEL P DTGAYVSLSNICADMGLWEE
Subjt:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEE

A0A6J1C6M8 pentatricopeptide repeat-containing protein At1g74600, chloroplastic0.0e+0089.8Show/hide
Query:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MN I IQT V KTL SPLRLISSVAT D ASNFSF++++T SL DPLQLLNDYVKSRKCSLKNTKV+HAKLLRATLLHS+IYV+NSLLDCYSKS AMD+A
Subjt:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD
        LKLFD MLH NVISWNI ISGFNQNFLFL+SWRTFCRMHFLGFEPSEITYGSVLSACAA+QAPMFGKQ+YSL VRNG FVNGYVRAGMIDLFAK SSF D
Subjt:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD

Query:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE
        ALRVF+DV+CENVVCWNAIVSAAVRNGEN +ALDLF+TMCSGFLEPNSFTFSSVLTACAA+EDLEFGKRVQGRVIKCGG+DVFVETALIDLYAKCGD+DE
Subjt:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE

Query:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID
        AVKTFLQMPIRNVVSWTAIISGFVQKND  MALK F+DMR LGEEINSYTVTSVLTACANPAM KEAIQLHSWILKAGF S+AVVV+ALINMYSK+G ID
Subjt:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID

Query:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE
        LSMMVFRE+D+QRNLSSW AMITSFAQN +KE+A ELF+KMLQE+IGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSL TMYSKCGYLEE
Subjt:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE

Query:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE
        AFQ F NMPKKD+VSWASMISCFSEH YAKEAI LFR+MLFEEYVPDH TLSAVLT CSVLHSIQIGREIHGYSVRVGLGKDVAIGG LVTMYSKC NLE
Subjt:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE

Query:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI
         ARRVFETLPQKD IACSSLVSGYAQHK I+EAL LF DLLV GLAIDPFS+SSILGAIA+L+RPGIG QLHALIMKVGLEKDVSVGSSLVMVYSK GSI
Subjt:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI

Query:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
        EDCCKAFEQ GKPDLIGWTAMIVSYAQHGKG EAL VYELMK+EGIKPDPVTFVGVLSACSH+GLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
Subjt:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG

Query:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL
        +LKEAEELIN+MPIEPDALIWGTLLAACKVHGDIE GKLAA+KVMELKP DTGAYVSLSNICADMGLWEEVLN+RSLMKGAGVTKEPGWS L
Subjt:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL

A0A6J1E7L2 pentatricopeptide repeat-containing protein At1g74600, chloroplastic0.0e+0089.69Show/hide
Query:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MNF  I TFV KTL S  RLISSVATVDNAS+FSFT+I+TY LFDP QLL+DYVKSRKCSL+NTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN++DHA
Subjt:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD
        LKLFDTMLHPNVISWNI IS FN NFL+LDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAK SSFLD
Subjt:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD

Query:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE
        ALRVFHD+ CENVVCWNAIVSAAVRNGEN MALDL++TMC G LEPNSFTFSSVLTACAALE  EFGKRVQG+VIKCGG+DVFVETALIDLY+KCG+MDE
Subjt:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE

Query:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID
        AVK FL+MPIRNVVSWTAIISGFVQKND+LMALKFF+DMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWIL+AG+SSHAVV AALINMYSK+GAID
Subjt:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID

Query:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE
        LSM VF EMDNQRNLSSWTAMITSFAQNN+KE+A ELF+KML+E++GPDTFCTSSVLSVTDCITFGRQIHC+T KTGLIFD+SVGS+L TMYSKCGYLEE
Subjt:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE

Query:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE
        AF VF NM KKDN+SWASM+SCFSEH YAKE IQLFREMLFEEYVPD+  LS VL ACSVLHSIQIGREIH YSVR+GL KDVAIGGSLVTMYSKC NLE
Subjt:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE

Query:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI
         ARRVFETLP+KD+IACSSLVSGYAQHKCIKE +LLF+DLL AGLAIDPFSISSILGAIALLNRPGIGTQLHA+I KVGLEKDVSVGSSLVMVYSK GSI
Subjt:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI

Query:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
        EDCCKAFEQ GKPDLIGWTAMIVSYAQHGKG EAL VYELMK+EGIKPDPVTFVGVLSACSH+GLVDEAYFHLNSMVKDYGIQPG+RHYACMVDLLGRCG
Subjt:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG

Query:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL
        +LK AEELIN+MPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS+L
Subjt:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL

A0A6J1KIC5 pentatricopeptide repeat-containing protein At1g74600, chloroplastic0.0e+0089.24Show/hide
Query:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MNF  I TFV KTL S  RLISSVATVDNAS+FSFT+I+TY LFDP QLL+DYVKSRKCSL++TKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN++DHA
Subjt:  MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD
        LKLFDTMLHPNVISWNI IS FN NFL+LDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAK SSFLD
Subjt:  LKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD

Query:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE
        ALRVF DV+CENVVCWNAIVSAAVRNGEN MALDL++TMC GFLEPNSFTFSSVLTACAALE  EFGKRVQG+VIKCGG+DVFVETALIDLY+KCG+MDE
Subjt:  ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDE

Query:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID
        AVK FL+MPIRNVVSWTAIISGFVQKND+LMALKFF+DMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWIL+AGFSSHAVV AALINMYSK+GAID
Subjt:  AVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAID

Query:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE
        LSM VF EMDNQRNLSSWTAMITSFAQNN+KE+A ELF+KML+E++GPDTFCTSSVLSVTDCITFGRQIHC+T KTGL+F +SVGS+L TMYSKCGYLEE
Subjt:  LSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEE

Query:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE
        AF VF NMPKKD++SWASM+SCFSEH YAKE IQLFREMLFEEYVPD   L+ VL ACSVLHSIQIGREIH YSVR+GL KDVAIGGSLVTMYSKC NLE
Subjt:  AFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLE

Query:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI
         ARRVFETLP+KD+IACSSLVSGYAQHKCIKE +LLF+DLL AGLAIDPFSISSILGAIALLNRPGIGTQLHA+I KVGLEKDVS+GSSLVMVYSK GSI
Subjt:  FARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSI

Query:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG
        EDCCKAFEQ GKPDLIGWTAMIVSYAQHGKG EAL VYELMK+EGIKPDPVTFVGVLSACSH+GLVDEAYFHLNSMVKDYGIQPG+RHYACMVDLLGRCG
Subjt:  EDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG

Query:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL
        +LK AEELIN+MPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRS MKGAGVTKEPGWS+L
Subjt:  KLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWSVL

SwissProt top hitse value%identityAlignment
Q9CA56 Pentatricopeptide repeat-containing protein At1g74600, chloroplastic2.9e-28555.3Show/hide
Query:  FSPLRLISSVATVDNASNFSFTRIQTYSL-FDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVI
        FS  RL+SSV    N  +FS     + S  F+P +  ND   SR C+L+ TK+L A LLR  LL  +++++ SLL  YS S +M  A KLFDT+  P+V+
Subjt:  FSPLRLISSVATVDNASNFSFTRIQTYSL-FDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVI

Query:  SWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLDALRVFHDVECENV
        S NI ISG+ Q+ LF +S R F +MHFLGFE +EI+YGSV+SAC+A+QAP+F + V    ++ G+F    V + +ID+F+K   F DA +VF D    NV
Subjt:  SWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLDALRVFHDVECENV

Query:  VCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNV
         CWN I++ A+RN       DLF  MC GF +P+S+T+SSVL ACA+LE L FGK VQ RVIKCG +DVFV TA++DLYAKCG M EA++ F ++P  +V
Subjt:  VCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNV

Query:  VSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQR
        VSWT ++SG+ + ND   AL+ F++MR  G EIN+ TVTSV++AC  P+M  EA Q+H+W+ K+GF   + V AALI+MYSK G IDLS  VF ++D+ +
Subjt:  VSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQR

Query:  NLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDN
          +    MITSF+Q+    +A  LF +MLQE +  D F   S+LSV DC+  G+Q+H YTLK+GL+ D++VGSSL T+YSKCG LEE++++F  +P KDN
Subjt:  NLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDN

Query:  VSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQKD
          WASMIS F+E+ Y +EAI LF EML +   PD +TL+AVLT CS   S+  G+EIHGY++R G+ K + +G +LV MYSKC +L+ AR+V++ LP+ D
Subjt:  VSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQKD

Query:  HIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKP
         ++CSSL+SGY+QH  I++  LLFRD++++G  +D F+ISSIL A AL +   +G Q+HA I K+GL  + SVGSSL+ +YSK+GSI+DCCKAF Q   P
Subjt:  HIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKP

Query:  DLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMP
        DLI WTA+I SYAQHGK  EAL VY LMKE+G KPD VTFVGVLSACSH GLV+E+YFHLNSMVKDYGI+P  RHY CMVD LGR G+L+EAE  IN+M 
Subjt:  DLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMP

Query:  IEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS
        I+PDAL+WGTLLAACK+HG++ELGK+AA+K +EL+PSD GAY+SLSNI A++G W+EV   R LMKG GV KEPGWS
Subjt:  IEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS

Q9FIB2 Putative pentatricopeptide repeat-containing protein At5g099505.3e-12233.18Show/hide
Query:  KVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAP-
        +  H++L +   L  ++Y+ N+L++ Y ++     A K+FD M   N +SW   +SG+++N    ++      M   G   ++  + SVL AC  I +  
Subjt:  KVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAP-

Query:  -MFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKG-SSFLDALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSS-VLTACAA
         +FG+Q++ L  +  + V+  V   +I ++ K   S   AL  F D+E +N V WN+I+S   + G+   A  +F +M      P  +TF S V TAC+ 
Subjt:  -MFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKG-SSFLDALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSS-VLTACAA

Query:  LE-DLEFGKRVQGRVIKCG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLT--
         E D+   +++   + K G   D+FV + L+  +AK G +  A K F QM  RN V+   ++ G V++     A K F DM  +  +++  +   +L+  
Subjt:  LE-DLEFGKRVQGRVIKCG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLT--

Query:  ---ACANPAMTKEAIQLHSWILKAGFSSHAVVVA-ALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFC
           + A     K+  ++H  ++  G     V +   L+NMY+K G+I  +  VF  M ++ ++ SW +MIT   QN     A E ++ M + +I P +F 
Subjt:  ---ACANPAMTKEAIQLHSWILKAGFSSHAVVVA-ALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFC

Query:  TSSVLSVTDCITF---GRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCF--SEHSYAKEAIQLFREMLFEEYVPD
          S LS    + +   G+QIH  +LK G+  +VSV ++L T+Y++ GYL E  ++F +MP+ D VSW S+I     SE S   EA+  F          +
Subjt:  TSSVLSVTDCITF---GRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCF--SEHSYAKEAIQLFREMLFEEYVPD

Query:  HTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQ-KDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLA
          T S+VL+A S L   ++G++IHG +++  +  +     +L+  Y KC  ++   ++F  + + +D++  +S++SGY  ++ + +AL L   +L  G  
Subjt:  HTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQ-KDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLA

Query:  IDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEG-
        +D F  +++L A A +     G ++HA  ++  LE DV VGS+LV +YSK G ++   + F      +   W +MI  YA+HG+G EAL ++E MK +G 
Subjt:  IDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEG-

Query:  IKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLL-AACKVHG-DIELGKLAARK
          PD VTFVGVLSACSH+GL++E + H  SM   YG+ P   H++CM D+LGR G+L + E+ I  MP++P+ LIW T+L A C+ +G   ELGK AA  
Subjt:  IKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLL-AACKVHG-DIELGKLAARK

Query:  VMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS
        + +L+P +   YV L N+ A  G WE+++  R  MK A V KE G+S
Subjt:  VMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS

Q9FWA6 Pentatricopeptide repeat-containing protein At3g02330, mitochondrial1.4e-13035.62Show/hide
Query:  MIDLFAKGSSFLDALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGD-DVFVET
        MI+ ++K +    A   F+ +   +VV WN+++S  ++NGE+L ++++F  M    +E +  TF+ +L  C+ LED   G ++ G V++ G D DV   +
Subjt:  MIDLFAKGSSFLDALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGD-DVFVET

Query:  ALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVV
        AL+D+YAK     E+++ F  +P +N VSW+AII+G VQ N   +ALKFF++M+K+   ++     SVL +CA  +  +   QLH+  LK+ F++  +V 
Subjt:  ALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVV

Query:  AALINMYSKVGAIDLSMMVFREMDNQRNLS--SWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVL---SVTDCITFGRQIHCYTLKTGLIFD
         A ++MY+K   +  + ++F   DN  NL+  S+ AMIT ++Q  +  +A  LF +++   +G D    S V    ++   ++ G QI+   +K+ L  D
Subjt:  AALINMYSKVGAIDLSMMVFREMDNQRNLS--SWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVL---SVTDCITFGRQIHCYTLKTGLIFD

Query:  VSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGK
        V V ++   MY KC  L EAF+VF  M ++D VSW ++I+   ++    E + LF  ML     PD  T  ++L AC+   S+  G EIH   V+ G+  
Subjt:  VSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGK

Query:  DVAIGGSLVTMYSKCSNLEFARRVFETLPQKDH--------------------IACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIAL
        + ++G SL+ MYSKC  +E A ++     Q+ +                    ++ +S++SGY   +  ++A +LF  ++  G+  D F+ +++L   A 
Subjt:  DVAIGGSLVTMYSKCSNLEFARRVFETLPQKDH--------------------IACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIAL

Query:  LNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACS
        L   G+G Q+HA ++K  L+ DV + S+LV +YSK G + D    FE++ + D + W AMI  YA HGKG EA+ ++E M  E IKP+ VTF+ +L AC+
Subjt:  LNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACS

Query:  HSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVH-GDIELGKLAARKVMELKPSDTGAYVSLSN
        H GL+D+   +   M +DYG+ P   HY+ MVD+LG+ GK+K A ELI  MP E D +IW TLL  C +H  ++E+ + A   ++ L P D+ AY  LSN
Subjt:  HSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVH-GDIELGKLAARKVMELKPSDTGAYVSLSN

Query:  ICADMGLWEEVLNVRSLMKGAGVTKEPGWS
        + AD G+WE+V ++R  M+G  + KEPG S
Subjt:  ICADMGLWEEVLNVRSLMKGAGVTKEPGWS

Q9SS83 Pentatricopeptide repeat-containing protein At3g09040, mitochondrial1.4e-12229.32Show/hide
Query:  LFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLG
        LF   ++ ++  +    +L+  K +H+K L    + S   + N+++D Y+K   + +A K FD  L  +V +WN  +S ++         R+F  +    
Subjt:  LFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLG

Query:  FEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLDALRVFHDVECENVVC---------------------------
          P++ T+  VLS CA      FG+Q++   ++ G   N Y    ++D++AK     DA RVF  +   N VC                           
Subjt:  FEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLDALRVFHDVECENVVC---------------------------

Query:  ---------------------------------------WNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIK
                                               WN ++S   + G   +A++ F  M    ++    T  SVL+A   + +L+ G  V    IK
Subjt:  ---------------------------------------WNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIK

Query:  CG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWIL
         G   +++V ++L+ +Y+KC  M+ A K F  +  +N V W A+I G+    +    ++ F DM+  G  I+ +T TS+L+ CA     +   Q HS I+
Subjt:  CG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWIL

Query:  KAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSV---TDCITFGRQIHCY
        K   + +  V  AL++MY+K GA++ +  +F  M ++ N+ +W  +I S+ Q+ N+  A +LF++M    I  D  C +S L        +  G+Q+HC 
Subjt:  KAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSV---TDCITFGRQIHCY

Query:  TLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHG
        ++K GL  D+  GSSL  MYSKCG +++A +VF ++P+   VS  ++I+ +S+++  +EA+ LF+EML     P   T + ++ AC    S+ +G + HG
Subjt:  TLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHG

Query:  YSVRVGLGKD-VAIGGSLVTMYSKCSNLEFARRVFETLPQ-KDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQ
           + G   +   +G SL+ MY     +  A  +F  L   K  +  + ++SG++Q+   +EAL  ++++   G+  D  +  ++L   ++L+    G  
Subjt:  YSVRVGLGKD-VAIGGSLVTMYSKCSNLEFARRVFETLPQ-KDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQ

Query:  LHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAF-EQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEA
        +H+LI  +  + D    ++L+ +Y+K G ++   + F E   + +++ W ++I  YA++G   +AL +++ M++  I PD +TF+GVL+ACSH+G V + 
Subjt:  LHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAF-EQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEA

Query:  YFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWE
              M+  YGI+    H ACMVDLLGR G L+EA++ I +  ++PDA +W +LL AC++HGD   G+++A K++EL+P ++ AYV LSNI A  G WE
Subjt:  YFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWE

Query:  EVLNVRSLMKGAGVTKEPGWS
        +   +R +M+  GV K PG+S
Subjt:  EVLNVRSLMKGAGVTKEPGWS

Q9SVP7 Pentatricopeptide repeat-containing protein At4g136506.5e-12833.61Show/hide
Query:  SLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAA
        SL   + LH+++L+   L SN  +S  L D Y     +  A K+FD M    + +WN  I       L  + +  F RM      P+E T+  VL AC  
Subjt:  SLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAA

Query:  IQAPM-FGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD-ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTA
                +Q+++  +  G   +  V   +IDL+++ + F+D A RVF  +  ++   W A++S   +N     A+ LF  M    + P  + FSSVL+A
Subjt:  IQAPM-FGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD-ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTA

Query:  CAALEDLEFGKRVQGRVIKCG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLT
        C  +E LE G+++ G V+K G   D +V  AL+ LY   G++  A   F  M  R+ V++  +I+G  Q      A++ F+ M   G E +S T+ S++ 
Subjt:  CAALEDLEFGKRVQGRVIKCG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLT

Query:  ACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSV
        AC+         QLH++  K GF+S+  +  AL+N+Y+K   I+ ++  F E + + N+  W  M+ ++   ++   +  +F +M  E I P+ +   S+
Subjt:  ACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSV

Query:  LSVTDCI-----TFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTL
        L    CI       G QIH   +KT    +  V S L  MY+K G L+ A+ +      KD VSW +MI+ ++++++  +A+  FR+ML      D   L
Subjt:  LSVTDCI-----TFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTL

Query:  SAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFS
        +  ++AC+ L +++ G++IH  +   G   D+    +LVT+YS+C  +E +   FE     D+IA ++LVSG+ Q    +EAL +F  +   G+  + F+
Subjt:  SAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFS

Query:  ISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPV
          S + A +       G Q+HA+I K G + +  V ++L+ +Y+K GSI D  K F +    + + W A+I +Y++HG G+EAL  ++ M    ++P+ V
Subjt:  ISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPV

Query:  TFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSD
        T VGVLSACSH GLVD+   +  SM  +YG+ P   HY C+VD+L R G L  A+E I  MPI+PDAL+W TLL+AC VH ++E+G+ AA  ++EL+P D
Subjt:  TFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSD

Query:  TGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS
        +  YV LSN+ A    W+     R  MK  GV KEPG S
Subjt:  TGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS

Arabidopsis top hitse value%identityAlignment
AT1G74600.1 pentatricopeptide (PPR) repeat-containing protein2.0e-28655.3Show/hide
Query:  FSPLRLISSVATVDNASNFSFTRIQTYSL-FDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVI
        FS  RL+SSV    N  +FS     + S  F+P +  ND   SR C+L+ TK+L A LLR  LL  +++++ SLL  YS S +M  A KLFDT+  P+V+
Subjt:  FSPLRLISSVATVDNASNFSFTRIQTYSL-FDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVI

Query:  SWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLDALRVFHDVECENV
        S NI ISG+ Q+ LF +S R F +MHFLGFE +EI+YGSV+SAC+A+QAP+F + V    ++ G+F    V + +ID+F+K   F DA +VF D    NV
Subjt:  SWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLDALRVFHDVECENV

Query:  VCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNV
         CWN I++ A+RN       DLF  MC GF +P+S+T+SSVL ACA+LE L FGK VQ RVIKCG +DVFV TA++DLYAKCG M EA++ F ++P  +V
Subjt:  VCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNV

Query:  VSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQR
        VSWT ++SG+ + ND   AL+ F++MR  G EIN+ TVTSV++AC  P+M  EA Q+H+W+ K+GF   + V AALI+MYSK G IDLS  VF ++D+ +
Subjt:  VSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQR

Query:  NLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDN
          +    MITSF+Q+    +A  LF +MLQE +  D F   S+LSV DC+  G+Q+H YTLK+GL+ D++VGSSL T+YSKCG LEE++++F  +P KDN
Subjt:  NLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDN

Query:  VSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQKD
          WASMIS F+E+ Y +EAI LF EML +   PD +TL+AVLT CS   S+  G+EIHGY++R G+ K + +G +LV MYSKC +L+ AR+V++ LP+ D
Subjt:  VSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQKD

Query:  HIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKP
         ++CSSL+SGY+QH  I++  LLFRD++++G  +D F+ISSIL A AL +   +G Q+HA I K+GL  + SVGSSL+ +YSK+GSI+DCCKAF Q   P
Subjt:  HIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKP

Query:  DLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMP
        DLI WTA+I SYAQHGK  EAL VY LMKE+G KPD VTFVGVLSACSH GLV+E+YFHLNSMVKDYGI+P  RHY CMVD LGR G+L+EAE  IN+M 
Subjt:  DLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMP

Query:  IEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS
        I+PDAL+WGTLLAACK+HG++ELGK+AA+K +EL+PSD GAY+SLSNI A++G W+EV   R LMKG GV KEPGWS
Subjt:  IEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS

AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein9.9e-13235.62Show/hide
Query:  MIDLFAKGSSFLDALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGD-DVFVET
        MI+ ++K +    A   F+ +   +VV WN+++S  ++NGE+L ++++F  M    +E +  TF+ +L  C+ LED   G ++ G V++ G D DV   +
Subjt:  MIDLFAKGSSFLDALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGD-DVFVET

Query:  ALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVV
        AL+D+YAK     E+++ F  +P +N VSW+AII+G VQ N   +ALKFF++M+K+   ++     SVL +CA  +  +   QLH+  LK+ F++  +V 
Subjt:  ALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVV

Query:  AALINMYSKVGAIDLSMMVFREMDNQRNLS--SWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVL---SVTDCITFGRQIHCYTLKTGLIFD
         A ++MY+K   +  + ++F   DN  NL+  S+ AMIT ++Q  +  +A  LF +++   +G D    S V    ++   ++ G QI+   +K+ L  D
Subjt:  AALINMYSKVGAIDLSMMVFREMDNQRNLS--SWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVL---SVTDCITFGRQIHCYTLKTGLIFD

Query:  VSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGK
        V V ++   MY KC  L EAF+VF  M ++D VSW ++I+   ++    E + LF  ML     PD  T  ++L AC+   S+  G EIH   V+ G+  
Subjt:  VSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGK

Query:  DVAIGGSLVTMYSKCSNLEFARRVFETLPQKDH--------------------IACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIAL
        + ++G SL+ MYSKC  +E A ++     Q+ +                    ++ +S++SGY   +  ++A +LF  ++  G+  D F+ +++L   A 
Subjt:  DVAIGGSLVTMYSKCSNLEFARRVFETLPQKDH--------------------IACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIAL

Query:  LNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACS
        L   G+G Q+HA ++K  L+ DV + S+LV +YSK G + D    FE++ + D + W AMI  YA HGKG EA+ ++E M  E IKP+ VTF+ +L AC+
Subjt:  LNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACS

Query:  HSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVH-GDIELGKLAARKVMELKPSDTGAYVSLSN
        H GL+D+   +   M +DYG+ P   HY+ MVD+LG+ GK+K A ELI  MP E D +IW TLL  C +H  ++E+ + A   ++ L P D+ AY  LSN
Subjt:  HSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVH-GDIELGKLAARKVMELKPSDTGAYVSLSN

Query:  ICADMGLWEEVLNVRSLMKGAGVTKEPGWS
        + AD G+WE+V ++R  M+G  + KEPG S
Subjt:  ICADMGLWEEVLNVRSLMKGAGVTKEPGWS

AT3G09040.1 Pentatricopeptide repeat (PPR) superfamily protein9.9e-12429.32Show/hide
Query:  LFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLG
        LF   ++ ++  +    +L+  K +H+K L    + S   + N+++D Y+K   + +A K FD  L  +V +WN  +S ++         R+F  +    
Subjt:  LFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLG

Query:  FEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLDALRVFHDVECENVVC---------------------------
          P++ T+  VLS CA      FG+Q++   ++ G   N Y    ++D++AK     DA RVF  +   N VC                           
Subjt:  FEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLDALRVFHDVECENVVC---------------------------

Query:  ---------------------------------------WNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIK
                                               WN ++S   + G   +A++ F  M    ++    T  SVL+A   + +L+ G  V    IK
Subjt:  ---------------------------------------WNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIK

Query:  CG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWIL
         G   +++V ++L+ +Y+KC  M+ A K F  +  +N V W A+I G+    +    ++ F DM+  G  I+ +T TS+L+ CA     +   Q HS I+
Subjt:  CG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWIL

Query:  KAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSV---TDCITFGRQIHCY
        K   + +  V  AL++MY+K GA++ +  +F  M ++ N+ +W  +I S+ Q+ N+  A +LF++M    I  D  C +S L        +  G+Q+HC 
Subjt:  KAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSVLSV---TDCITFGRQIHCY

Query:  TLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHG
        ++K GL  D+  GSSL  MYSKCG +++A +VF ++P+   VS  ++I+ +S+++  +EA+ LF+EML     P   T + ++ AC    S+ +G + HG
Subjt:  TLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTLSAVLTACSVLHSIQIGREIHG

Query:  YSVRVGLGKD-VAIGGSLVTMYSKCSNLEFARRVFETLPQ-KDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQ
           + G   +   +G SL+ MY     +  A  +F  L   K  +  + ++SG++Q+   +EAL  ++++   G+  D  +  ++L   ++L+    G  
Subjt:  YSVRVGLGKD-VAIGGSLVTMYSKCSNLEFARRVFETLPQ-KDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIALLNRPGIGTQ

Query:  LHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAF-EQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEA
        +H+LI  +  + D    ++L+ +Y+K G ++   + F E   + +++ W ++I  YA++G   +AL +++ M++  I PD +TF+GVL+ACSH+G V + 
Subjt:  LHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAF-EQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEA

Query:  YFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWE
              M+  YGI+    H ACMVDLLGR G L+EA++ I +  ++PDA +W +LL AC++HGD   G+++A K++EL+P ++ AYV LSNI A  G WE
Subjt:  YFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWE

Query:  EVLNVRSLMKGAGVTKEPGWS
        +   +R +M+  GV K PG+S
Subjt:  EVLNVRSLMKGAGVTKEPGWS

AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein4.6e-12933.61Show/hide
Query:  SLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAA
        SL   + LH+++L+   L SN  +S  L D Y     +  A K+FD M    + +WN  I       L  + +  F RM      P+E T+  VL AC  
Subjt:  SLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAA

Query:  IQAPM-FGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD-ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTA
                +Q+++  +  G   +  V   +IDL+++ + F+D A RVF  +  ++   W A++S   +N     A+ LF  M    + P  + FSSVL+A
Subjt:  IQAPM-FGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLD-ALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTA

Query:  CAALEDLEFGKRVQGRVIKCG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLT
        C  +E LE G+++ G V+K G   D +V  AL+ LY   G++  A   F  M  R+ V++  +I+G  Q      A++ F+ M   G E +S T+ S++ 
Subjt:  CAALEDLEFGKRVQGRVIKCG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLT

Query:  ACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSV
        AC+         QLH++  K GF+S+  +  AL+N+Y+K   I+ ++  F E + + N+  W  M+ ++   ++   +  +F +M  E I P+ +   S+
Subjt:  ACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFCTSSV

Query:  LSVTDCI-----TFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTL
        L    CI       G QIH   +KT    +  V S L  MY+K G L+ A+ +      KD VSW +MI+ ++++++  +A+  FR+ML      D   L
Subjt:  LSVTDCI-----TFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTTL

Query:  SAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFS
        +  ++AC+ L +++ G++IH  +   G   D+    +LVT+YS+C  +E +   FE     D+IA ++LVSG+ Q    +EAL +F  +   G+  + F+
Subjt:  SAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFS

Query:  ISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPV
          S + A +       G Q+HA+I K G + +  V ++L+ +Y+K GSI D  K F +    + + W A+I +Y++HG G+EAL  ++ M    ++P+ V
Subjt:  ISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPV

Query:  TFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSD
        T VGVLSACSH GLVD+   +  SM  +YG+ P   HY C+VD+L R G L  A+E I  MPI+PDAL+W TLL+AC VH ++E+G+ AA  ++EL+P D
Subjt:  TFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSD

Query:  TGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS
        +  YV LSN+ A    W+     R  MK  GV KEPG S
Subjt:  TGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS

AT5G09950.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-12333.18Show/hide
Query:  KVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAP-
        +  H++L +   L  ++Y+ N+L++ Y ++     A K+FD M   N +SW   +SG+++N    ++      M   G   ++  + SVL AC  I +  
Subjt:  KVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHPNVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAP-

Query:  -MFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKG-SSFLDALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSS-VLTACAA
         +FG+Q++ L  +  + V+  V   +I ++ K   S   AL  F D+E +N V WN+I+S   + G+   A  +F +M      P  +TF S V TAC+ 
Subjt:  -MFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKG-SSFLDALRVFHDVECENVVCWNAIVSAAVRNGENLMALDLFDTMCSGFLEPNSFTFSS-VLTACAA

Query:  LE-DLEFGKRVQGRVIKCG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLT--
         E D+   +++   + K G   D+FV + L+  +AK G +  A K F QM  RN V+   ++ G V++     A K F DM  +  +++  +   +L+  
Subjt:  LE-DLEFGKRVQGRVIKCG-GDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFLMALKFFEDMRKLGEEINSYTVTSVLT--

Query:  ---ACANPAMTKEAIQLHSWILKAGFSSHAVVVA-ALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFC
           + A     K+  ++H  ++  G     V +   L+NMY+K G+I  +  VF  M ++ ++ SW +MIT   QN     A E ++ M + +I P +F 
Subjt:  ---ACANPAMTKEAIQLHSWILKAGFSSHAVVVA-ALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEKMLQENIGPDTFC

Query:  TSSVLSVTDCITF---GRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCF--SEHSYAKEAIQLFREMLFEEYVPD
          S LS    + +   G+QIH  +LK G+  +VSV ++L T+Y++ GYL E  ++F +MP+ D VSW S+I     SE S   EA+  F          +
Subjt:  TSSVLSVTDCITF---GRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCF--SEHSYAKEAIQLFREMLFEEYVPD

Query:  HTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQ-KDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLA
          T S+VL+A S L   ++G++IHG +++  +  +     +L+  Y KC  ++   ++F  + + +D++  +S++SGY  ++ + +AL L   +L  G  
Subjt:  HTTLSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQ-KDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLA

Query:  IDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEG-
        +D F  +++L A A +     G ++HA  ++  LE DV VGS+LV +YSK G ++   + F      +   W +MI  YA+HG+G EAL ++E MK +G 
Subjt:  IDPFSISSILGAIALLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEG-

Query:  IKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLL-AACKVHG-DIELGKLAARK
          PD VTFVGVLSACSH+GL++E + H  SM   YG+ P   H++CM D+LGR G+L + E+ I  MP++P+ LIW T+L A C+ +G   ELGK AA  
Subjt:  IKPDPVTFVGVLSACSHSGLVDEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLL-AACKVHG-DIELGKLAARK

Query:  VMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS
        + +L+P +   YV L N+ A  G WE+++  R  MK A V KE G+S
Subjt:  VMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKGAGVTKEPGWS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTTATTGCAATTCAAACCTTCGTATATAAAACATTATTCTCCCCCCTTAGATTGATTTCCTCTGTCGCGACTGTAGACAATGCGTCGAATTTTTCCTTCACCAG
AATTCAAACTTACTCTCTTTTCGACCCTTTACAGCTGCTCAATGATTATGTAAAATCGAGAAAATGCTCTCTCAAAAACACCAAAGTTCTACACGCGAAGTTGCTCAGGG
CAACTCTTCTTCATTCCAATATCTATGTTTCAAATTCTTTGCTCGATTGCTACTCAAAGTCTAACGCTATGGACCATGCACTCAAGTTGTTCGATACAATGCTCCACCCA
AATGTCATTTCTTGGAATATCACGATCTCGGGTTTCAACCAGAACTTTTTGTTTTTGGATTCGTGGAGAACATTTTGCAGGATGCATTTCCTGGGTTTTGAACCCAGCGA
GATAACTTATGGGAGTGTTTTATCCGCTTGTGCTGCCATTCAAGCTCCAATGTTTGGTAAGCAGGTTTATTCACTTGCGGTGAGAAATGGGTTTTTTGTGAATGGTTATG
TTCGAGCTGGGATGATTGATTTATTTGCAAAAGGTTCTAGCTTTCTGGATGCTCTAAGGGTGTTTCATGATGTTGAGTGTGAGAATGTGGTGTGTTGGAATGCTATTGTC
TCTGCAGCTGTAAGAAATGGGGAGAATTTGATGGCTTTGGATCTTTTCGACACAATGTGTAGTGGGTTTCTGGAGCCAAATAGTTTTACATTTTCTAGTGTTCTAACTGC
GTGTGCTGCACTTGAAGACCTTGAATTTGGGAAAAGAGTTCAAGGGAGAGTGATTAAATGTGGCGGAGATGATGTTTTTGTAGAGACGGCCCTTATTGATTTGTACGCCA
AGTGTGGAGATATGGATGAAGCTGTTAAGACATTCTTGCAGATGCCCATCCGCAATGTGGTCTCCTGGACTGCCATAATATCTGGTTTTGTGCAGAAGAATGATTTCTTA
ATGGCCCTCAAGTTTTTTGAAGATATGAGAAAACTGGGAGAGGAAATTAATAGTTATACAGTTACTAGCGTGTTAACTGCATGTGCTAATCCAGCCATGACAAAAGAAGC
AATCCAACTCCACTCCTGGATTTTAAAAGCTGGTTTTTCTTCACATGCAGTGGTGGTAGCTGCTTTAATTAACATGTATTCAAAAGTTGGAGCTATTGATCTTTCTATGA
TGGTTTTCAGAGAGATGGACAATCAAAGGAATCTCAGTTCTTGGACAGCTATGATAACTTCATTTGCACAGAATAATAATAAAGAGAGAGCATGTGAATTGTTCGAAAAA
ATGTTACAGGAAAATATCGGACCAGATACATTTTGTACTTCCAGTGTCTTGAGTGTTACAGACTGTATTACTTTTGGGAGACAGATACACTGCTACACACTTAAAACTGG
ATTAATATTTGATGTTTCTGTCGGCAGTTCTCTTTCCACAATGTATTCCAAATGCGGCTATCTAGAGGAAGCTTTTCAAGTTTTTGGAAACATGCCAAAGAAGGACAATG
TTTCATGGGCGTCAATGATTTCCTGTTTCTCAGAACATAGCTATGCGAAAGAGGCCATTCAATTATTTAGAGAAATGTTGTTTGAAGAATATGTACCTGATCATACGACT
TTAAGTGCAGTCCTAACTGCGTGTTCTGTTCTTCATTCTATTCAAATAGGCAGAGAAATTCATGGTTATTCTGTTCGTGTGGGACTTGGCAAAGACGTAGCTATTGGAGG
TTCGCTTGTGACTATGTACTCGAAATGCAGCAACCTGGAATTCGCTAGGAGGGTGTTCGAAACATTGCCCCAGAAAGATCATATTGCATGCTCTTCATTGGTTTCAGGAT
ATGCTCAACACAAGTGCATCAAAGAGGCACTTTTGCTATTCCGTGATCTGCTGGTGGCTGGTTTAGCTATCGATCCCTTCTCAATCTCATCCATACTGGGAGCCATTGCG
CTTTTAAATAGGCCTGGCATTGGGACTCAGCTCCATGCACTCATTATGAAAGTAGGCTTGGAGAAAGATGTTTCTGTTGGGAGTTCGCTAGTAATGGTATACTCCAAATA
TGGAAGTATAGAAGACTGCTGCAAAGCATTCGAGCAGACTGGAAAGCCTGATTTGATAGGCTGGACAGCCATGATTGTCAGTTATGCTCAGCATGGGAAAGGCACAGAAG
CTTTATACGTCTATGAACTTATGAAAGAAGAAGGAATTAAGCCTGATCCAGTGACCTTTGTTGGGGTTTTGTCTGCTTGTAGCCATAGTGGTTTGGTCGATGAAGCTTAT
TTCCACCTTAATTCAATGGTGAAAGACTATGGTATACAGCCAGGATATCGACATTATGCTTGTATGGTAGATCTTCTTGGTCGGTGTGGGAAGTTGAAAGAGGCAGAAGA
ACTGATTAACAGTATGCCTATCGAACCTGATGCTCTCATTTGGGGAACGCTTCTTGCTGCTTGTAAAGTACATGGAGATATTGAACTTGGAAAACTTGCAGCACGAAAGG
TGATGGAGTTGAAGCCAAGTGATACTGGTGCTTATGTCTCTCTTTCAAACATCTGTGCTGATATGGGTTTGTGGGAAGAGGTCCTGAACGTTAGAAGCCTTATGAAGGGA
GCTGGAGTAACAAAGGAACCTGGTTGGAGCGTTCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTTTATTGCAATTCAAACCTTCGTATATAAAACATTATTCTCCCCCCTTAGATTGATTTCCTCTGTCGCGACTGTAGACAATGCGTCGAATTTTTCCTTCACCAG
AATTCAAACTTACTCTCTTTTCGACCCTTTACAGCTGCTCAATGATTATGTAAAATCGAGAAAATGCTCTCTCAAAAACACCAAAGTTCTACACGCGAAGTTGCTCAGGG
CAACTCTTCTTCATTCCAATATCTATGTTTCAAATTCTTTGCTCGATTGCTACTCAAAGTCTAACGCTATGGACCATGCACTCAAGTTGTTCGATACAATGCTCCACCCA
AATGTCATTTCTTGGAATATCACGATCTCGGGTTTCAACCAGAACTTTTTGTTTTTGGATTCGTGGAGAACATTTTGCAGGATGCATTTCCTGGGTTTTGAACCCAGCGA
GATAACTTATGGGAGTGTTTTATCCGCTTGTGCTGCCATTCAAGCTCCAATGTTTGGTAAGCAGGTTTATTCACTTGCGGTGAGAAATGGGTTTTTTGTGAATGGTTATG
TTCGAGCTGGGATGATTGATTTATTTGCAAAAGGTTCTAGCTTTCTGGATGCTCTAAGGGTGTTTCATGATGTTGAGTGTGAGAATGTGGTGTGTTGGAATGCTATTGTC
TCTGCAGCTGTAAGAAATGGGGAGAATTTGATGGCTTTGGATCTTTTCGACACAATGTGTAGTGGGTTTCTGGAGCCAAATAGTTTTACATTTTCTAGTGTTCTAACTGC
GTGTGCTGCACTTGAAGACCTTGAATTTGGGAAAAGAGTTCAAGGGAGAGTGATTAAATGTGGCGGAGATGATGTTTTTGTAGAGACGGCCCTTATTGATTTGTACGCCA
AGTGTGGAGATATGGATGAAGCTGTTAAGACATTCTTGCAGATGCCCATCCGCAATGTGGTCTCCTGGACTGCCATAATATCTGGTTTTGTGCAGAAGAATGATTTCTTA
ATGGCCCTCAAGTTTTTTGAAGATATGAGAAAACTGGGAGAGGAAATTAATAGTTATACAGTTACTAGCGTGTTAACTGCATGTGCTAATCCAGCCATGACAAAAGAAGC
AATCCAACTCCACTCCTGGATTTTAAAAGCTGGTTTTTCTTCACATGCAGTGGTGGTAGCTGCTTTAATTAACATGTATTCAAAAGTTGGAGCTATTGATCTTTCTATGA
TGGTTTTCAGAGAGATGGACAATCAAAGGAATCTCAGTTCTTGGACAGCTATGATAACTTCATTTGCACAGAATAATAATAAAGAGAGAGCATGTGAATTGTTCGAAAAA
ATGTTACAGGAAAATATCGGACCAGATACATTTTGTACTTCCAGTGTCTTGAGTGTTACAGACTGTATTACTTTTGGGAGACAGATACACTGCTACACACTTAAAACTGG
ATTAATATTTGATGTTTCTGTCGGCAGTTCTCTTTCCACAATGTATTCCAAATGCGGCTATCTAGAGGAAGCTTTTCAAGTTTTTGGAAACATGCCAAAGAAGGACAATG
TTTCATGGGCGTCAATGATTTCCTGTTTCTCAGAACATAGCTATGCGAAAGAGGCCATTCAATTATTTAGAGAAATGTTGTTTGAAGAATATGTACCTGATCATACGACT
TTAAGTGCAGTCCTAACTGCGTGTTCTGTTCTTCATTCTATTCAAATAGGCAGAGAAATTCATGGTTATTCTGTTCGTGTGGGACTTGGCAAAGACGTAGCTATTGGAGG
TTCGCTTGTGACTATGTACTCGAAATGCAGCAACCTGGAATTCGCTAGGAGGGTGTTCGAAACATTGCCCCAGAAAGATCATATTGCATGCTCTTCATTGGTTTCAGGAT
ATGCTCAACACAAGTGCATCAAAGAGGCACTTTTGCTATTCCGTGATCTGCTGGTGGCTGGTTTAGCTATCGATCCCTTCTCAATCTCATCCATACTGGGAGCCATTGCG
CTTTTAAATAGGCCTGGCATTGGGACTCAGCTCCATGCACTCATTATGAAAGTAGGCTTGGAGAAAGATGTTTCTGTTGGGAGTTCGCTAGTAATGGTATACTCCAAATA
TGGAAGTATAGAAGACTGCTGCAAAGCATTCGAGCAGACTGGAAAGCCTGATTTGATAGGCTGGACAGCCATGATTGTCAGTTATGCTCAGCATGGGAAAGGCACAGAAG
CTTTATACGTCTATGAACTTATGAAAGAAGAAGGAATTAAGCCTGATCCAGTGACCTTTGTTGGGGTTTTGTCTGCTTGTAGCCATAGTGGTTTGGTCGATGAAGCTTAT
TTCCACCTTAATTCAATGGTGAAAGACTATGGTATACAGCCAGGATATCGACATTATGCTTGTATGGTAGATCTTCTTGGTCGGTGTGGGAAGTTGAAAGAGGCAGAAGA
ACTGATTAACAGTATGCCTATCGAACCTGATGCTCTCATTTGGGGAACGCTTCTTGCTGCTTGTAAAGTACATGGAGATATTGAACTTGGAAAACTTGCAGCACGAAAGG
TGATGGAGTTGAAGCCAAGTGATACTGGTGCTTATGTCTCTCTTTCAAACATCTGTGCTGATATGGGTTTGTGGGAAGAGGTCCTGAACGTTAGAAGCCTTATGAAGGGA
GCTGGAGTAACAAAGGAACCTGGTTGGAGCGTTCTGTAAGATGCTATTATCCATAGTTGACCTCAGAAAAACTTGGAAGTTTTGATCATTATGACAGCCTTAGAAAAACT
AGTGAGACAAGCTGAGTGAAGACAGCAGGCTCCTGCCATTTTTAAATTTCTCAGAAAAGAGATTCTCAGTGATAAAGTAAAAGGGAGGAAGTTTGTTTGCTTCTGTCTAC
TTCAATACCTCATCTTTGGTCCTTAAATCAGGAAGACAGGAATGGAAGCAACAGAAAGTTATTCAAATAGGGGGTTGACGATCAACAATCACCTTATCCATAATTTATCG
ATCGGGCTGAAAGTTATTCGAATTGCCTAAAATATGTTTTGTTCTTTTGAGAACCAAATATGAGTGAAAATGGAGGATGTTGCGAGTCCAACGATATTCAGAAGGGGAAA
TATCTGTAAATCTTCAAGTTCAAGTACATCAGCTAGCCTTTCTATTTTGTTGTGGATTGGCAGATGCAATGCAAAGGATGAGAATTTCTTCAGGGCGGCCTACGAATAAT
TGCATTCTAGGCTTTTTGGGGAATGTGGA
Protein sequenceShow/hide protein sequence
MNFIAIQTFVYKTLFSPLRLISSVATVDNASNFSFTRIQTYSLFDPLQLLNDYVKSRKCSLKNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLHP
NVISWNITISGFNQNFLFLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKGSSFLDALRVFHDVECENVVCWNAIV
SAAVRNGENLMALDLFDTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGDDVFVETALIDLYAKCGDMDEAVKTFLQMPIRNVVSWTAIISGFVQKNDFL
MALKFFEDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILKAGFSSHAVVVAALINMYSKVGAIDLSMMVFREMDNQRNLSSWTAMITSFAQNNNKERACELFEK
MLQENIGPDTFCTSSVLSVTDCITFGRQIHCYTLKTGLIFDVSVGSSLSTMYSKCGYLEEAFQVFGNMPKKDNVSWASMISCFSEHSYAKEAIQLFREMLFEEYVPDHTT
LSAVLTACSVLHSIQIGREIHGYSVRVGLGKDVAIGGSLVTMYSKCSNLEFARRVFETLPQKDHIACSSLVSGYAQHKCIKEALLLFRDLLVAGLAIDPFSISSILGAIA
LLNRPGIGTQLHALIMKVGLEKDVSVGSSLVMVYSKYGSIEDCCKAFEQTGKPDLIGWTAMIVSYAQHGKGTEALYVYELMKEEGIKPDPVTFVGVLSACSHSGLVDEAY
FHLNSMVKDYGIQPGYRHYACMVDLLGRCGKLKEAEELINSMPIEPDALIWGTLLAACKVHGDIELGKLAARKVMELKPSDTGAYVSLSNICADMGLWEEVLNVRSLMKG
AGVTKEPGWSVL