CuGenDBv2

Spg022163 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg022163
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold2:12533033..12536121
RNA-Seq Expression: Spg022163
Synteny: Spg022163
Gene Ontology terms:
GO:1900865 - chloroplast RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e-value | % identity | Alignment
KAG6590206.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | 8.5e-257 | 85.27
Query:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MP +PPK  IS AQL+QIHAQL+TNPKP +FN L GA VDS APENGLFL+NQMLRHPSSHNHYTFTYALKAC LLH TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYIVD DVPSASR+FDS+PDPDVVSWTSIISGL+KLGF+EEALGKFLSMNV PNSATLVSALSACSSL C+K+GKAIHGL+LRSL+E SV+LDN
Subjt:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS
        ALLDFYVRCGSLR A+NLFDEMP+RDVVSWTT+IGGYA  GLCE+AVR+FQNMVH  E  PNEATLINVLSACSSMSALHLGQWVHSYINSR DV +DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        +GNALINMYVKCGSM+ AI IFKT+EHKDIISWSTIISGLAMNG G QAF LFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNVAP+
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE +YE+VRQWLLSSK +TVGT+ALLSNTYASCDRWNDANEVRD MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSW E+
Subjt:  VAGCSWIEI

XP_011660133.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Cucumis sativus] | 1.4e-256 | 84.28
Query:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MPLKP KPSISIAQ TQIHA+L+TNPKPHIFN L G+ V+S  PENGLFL+NQMLR+PSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYI+D DV SAS IFDS+PDPDVVSWTSIISGL+KLGFE+EAL KFLSMNVRPNS TLV+ALSACSSL CLKLGKAIHGLR+R+L+E +V L+N
Subjt:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS
        ALLDFYVRC  LRSAENLF++MPKRDVVSWTTMIGGYAQ GLCE+AVR+FQNMVHVGE  PNEATL+NVLSACSS+SALHLGQWVHSYINSR DV +DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        VGNALINMYVKCG+MEMAI+IFK +EHKDI+SWSTIISGLAMNGLG QAFVLFSLMLVHG+SPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN++PQ
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMP+EAEGPVWGALLHACQ+HGNEKKYE+VR+WLL SKGVTVGTFALLSNTYA CDRWNDAN+VR  MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AG SWIE+
Subjt:  VAGCSWIEI

XP_022960642.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita moschata] | 2.9e-257 | 85.46
Query:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MP +PPK  IS AQL+QIHAQL+TNPKP +FN L GA VDS APENGLFL+NQMLRHPSSHNHYTFTYALKAC LLH TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYIVD DVPSASR+FDS+PDPDVVSWTSIISGL+KLGF+EEALGKFLSMNV PNSATLVSALSACSSL C+K+GKAIHGL+LRSL+E SV+LDN
Subjt:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS
        ALLDFYVRCGSLR A+NLFDEMP+RDVVSWTT+IGGYA  GLCE+AVR+FQNMVH  E  PNEATLINVLSACSSMSALHLGQWVHSYINSR DV +DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        +GNALINMYVKCGSM+ AI IFKT+EHKDIISWSTIISGLAMNG G QAF LFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNVAP+
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE +YE+VRQWLLSSK +TVGT+ALLSNTYASCDRWNDANEVRD MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSWIE+
Subjt:  VAGCSWIEI

XP_023515586.1 pentatricopeptide repeat-containing protein At4g38010-like isoform X1 [Cucurbita pepo subsp. pepo] | 4.5e-258 | 85.66
Query:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MPL+PPK SIS AQL+QIHAQL+TNPKPH+FN L GA VDS APENGLFL+NQMLRHPSSHNHYTFTYALKAC LLH TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYIVD DVPSASR+FDS+PDPDVVSWTSIISGL+KLGF+EEALGKFLSMNV PNSATLVSALSACSSL C+K+GKAIHGL+LRSL+  SVSLDN
Subjt:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS
        ALLDFYVRCGSLR A+NLFDEMP+RDVVSWTT+IGGYA  GLCE+AVR+FQNMVH  E  PNEATLINVLSACSSMSALHLG+WVHSYINSR DV +DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        +GNALINMYVKCGSM+ AI IFKT+EHKD+ISWSTIISGLAMNG G QAF LFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNVAP+
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE  YE+VRQWLLSSK +TVGT+ALLSNTYASCDRWNDANEVRD MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSWIE+
Subjt:  VAGCSWIEI

XP_038878297.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Benincasa hispida] | 4.8e-260 | 85.66
Query:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        M LKPPKPSI IAQL QIH  L+ NPKPHI N L G+ V+S +PENGLFL+NQMLR+PSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDIF+
Subjt:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYI+D DVPSASRIFDS+PDPDV+SWTSIISGL+KLGFE+EALGKFLSMNVRPNS TLV+ALSACSSL CLKLGKAIHGLRLRSL+E +VSLDN
Subjt:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS
        ALLDFYVRCG LRSAE LFDEMPKRDVVSWTTMIGGYAQ GLCE+AVR+FQNMVHVGE  PNEATLINVLSACSS+SALHLGQWVHSYINSR DV +DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        VGNALINMYVKCG+MEMAI+IFK +EHKDIISWSTIISGLAMNGLG+QAF LFSLMLVHGISPDDITFL LLSACSHGGLINQGLMV++AMKDVYN++PQ
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVD+YG+AGL DEAEAFIKEMP+EAEG VWGALLHACQIHGNEKKYE+V++WLL SKGVTVGTFALLSNTYASCDRWNDANEVRDTMRS+GLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSWIE+
Subjt:  VAGCSWIEI

TrEMBL top hits | e-value | % identity | Alignment
A0A0A0LXJ1 Uncharacterized protein | 7.0e-257 | 84.28
Query:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MPLKP KPSISIAQ TQIHA+L+TNPKPHIFN L G+ V+S  PENGLFL+NQMLR+PSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYI+D DV SAS IFDS+PDPDVVSWTSIISGL+KLGFE+EAL KFLSMNVRPNS TLV+ALSACSSL CLKLGKAIHGLR+R+L+E +V L+N
Subjt:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS
        ALLDFYVRC  LRSAENLF++MPKRDVVSWTTMIGGYAQ GLCE+AVR+FQNMVHVGE  PNEATL+NVLSACSS+SALHLGQWVHSYINSR DV +DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        VGNALINMYVKCG+MEMAI+IFK +EHKDI+SWSTIISGLAMNGLG QAFVLFSLMLVHG+SPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN++PQ
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMP+EAEGPVWGALLHACQ+HGNEKKYE+VR+WLL SKGVTVGTFALLSNTYA CDRWNDAN+VR  MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AG SWIE+
Subjt:  VAGCSWIEI

A0A1S4DY27 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 | 6.1e-253 | 83.5
Query:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MPLKP KPSIS AQ TQIHA+L+TNPKPHIFN L G+ V+S +PENGLFL+NQML +PSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYI+  DV SAS IFDS+P+PDVVSWTSIISG +KLGFE+EALGKFLSMNVRPNS TLV+ALSACSSL  LKLGKAIHGLRLR+L+E +VSL+N
Subjt:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS
        ALLDFYVRC  LRSAENLF++M KRDVVSWTTMIGGYAQ GLCE+AVR+FQNMVH GE  PNEATL+NVLSACSS+SALHLGQWVHSYINSR DV +DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        VGNALINMYVKCG+MEMAI+IFK +EHKDIISWST+ISGLAMNGLG QAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN++PQ
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        +RHYACMVDMYG+AGL DEAEAFIKEMP+EAEGPVWGALLHACQIHGNEKKYE+VR+ LL SKGVTVG FALLSNTYASCDRWNDAN+VR  MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSWIE+
Subjt:  VAGCSWIEI

A0A5D3D5L6 Pentatricopeptide repeat-containing protein | 2.0e-256 | 84.15
Query:  NSMPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDI
        NSMPLKP KPSISIAQ TQIHA+L+TNPKPHIFN L G+ V+S +PENGLFL+NQML +PSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDI
Subjt:  NSMPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDI

Query:  FIQNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSL
        FIQNSLLHFYI++ DV SAS IFDS+P+PDVVSWTSIISGL+KLGFE+EALGKFLSMNVRPNS TLV+ALSACSSL CLKLGKAIHGLRLR+L+E +VSL
Subjt:  FIQNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSL

Query:  DNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVD
        +NALLDFYVRC  LRSAENLF++M KRDVVSWTTMIGGYAQ GLCE+AVR+FQNMVHVGE  PNEATL+NVLSACSS+SALHLGQWVHSYINSR DV +D
Subjt:  DNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVD

Query:  GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVA
        G+VGNALINMYVKCG+MEMAI+IF  +EHKDIISWST+ISGLAMNGLG QAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN++
Subjt:  GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVA

Query:  PQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGL
        PQ+RHYACMVDMYG+AGL DEAEAFIKEMP+EAEGPVWGALLHACQIHGNEKKYE+VR+ LL SKGVTVG FALLSNTYASCDRWNDAN+VR  MRSRGL
Subjt:  PQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGL

Query:  KKVAGCSWIEI
        KK+AGCSWIE+
Subjt:  KKVAGCSWIEI

A0A6J1H7Z9 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 | 1.4e-257 | 85.46
Query:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MP +PPK  IS AQL+QIHAQL+TNPKP +FN L GA VDS APENGLFL+NQMLRHPSSHNHYTFTYALKAC LLH TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYIVD DVPSASR+FDS+PDPDVVSWTSIISGL+KLGF+EEALGKFLSMNV PNSATLVSALSACSSL C+K+GKAIHGL+LRSL+E SV+LDN
Subjt:  QNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS
        ALLDFYVRCGSLR A+NLFDEMP+RDVVSWTT+IGGYA  GLCE+AVR+FQNMVH  E  PNEATLINVLSACSSMSALHLGQWVHSYINSR DV +DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        +GNALINMYVKCGSM+ AI IFKT+EHKDIISWSTIISGLAMNG G QAF LFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNVAP+
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE +YE+VRQWLLSSK +TVGT+ALLSNTYASCDRWNDANEVRD MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSWIE+
Subjt:  VAGCSWIEI

A0A6J1JC26 pentatricopeptide repeat-containing protein At4g38010-like isoform X1 | 1.7e-255 | 84.54
Query:  NSMPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDI
        NSMP KPPK  IS AQL+QIHAQL+TNPKP++FN L GA +DS APENGLFL+NQMLRHPSSHNHYTFTYALKAC LLH TH GL+IHARL+KSGHLSDI
Subjt:  NSMPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDI

Query:  FIQNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSL
        FIQNSLLHFYIVD DV SASR+FDS+PDPDVVSWTSIISGL+KLGF+EEALGKFLSMNV PNSATLVSALSACS+L CLK+GKAIHGL+LRSL+E SV+L
Subjt:  FIQNSLLHFYIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSL

Query:  DNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVD
        DNALLDFYVRCGSLR A+NLFDEMP+RDVVSWTT+IGGYA  GLCE+AVR+FQNMVH  E  PNEATLINVLSACSSMSALH GQWVHSY+NSR D+ +D
Subjt:  DNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVD

Query:  GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVA
        G++GNALINMYVKCGSME AI IFKT+EHKDIISWSTIISGLAMNG G QAF LFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNVA
Subjt:  GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVA

Query:  PQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGL
        P+MR YACMVDMYG+AGL DEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE +YE+VRQWLLSSK +TVGT+ALLSNTYASC RWNDANEVRD MRSRGL
Subjt:  PQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGL

Query:  KKVAGCSWIEI
        KK+AGCSWIE+
Subjt:  KKVAGCSWIEI

SwissProt top hits | e-value | % identity | Alignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic | 1.4e-92 | 35.89
Query:  PKPHIF--NSLFGAFVDSGAPENGLFLFNQMLRHPSSH-NHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFD
        PKP+ F  N+L  A+     P   ++ F  M+     + N YTF + +KA   + S   G  +H   VKS   SD+F+ NSL+H Y    D+ SA ++F 
Subjt:  PKPHIF--NSLFGAFVDSGAPENGLFLFNQMLRHPSSH-NHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFD

Query:  SMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLF
        ++ + DVVSW S+I+G  + G  ++AL  F  M   +V+ +  T+V  LSAC+ +  L+ G+ +      +    +++L NA+LD Y +CGS+  A+ LF
Subjt:  SMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLF

Query:  DEMPKRDVVSWTTMIGGYA-------------------------------QGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSY
        D M ++D V+WTTM+ GYA                               Q G   +A+ +F  +     +K N+ TL++ LSAC+ + AL LG+W+HSY
Subjt:  DEMPKRDVVSWTTMIGGYA-------------------------------QGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSY

Query:  INSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVY
        I  +  + ++  V +ALI+MY KCG +E +  +F ++E +D+  WS +I GLAM+G G++A  +F  M    + P+ +TF  +  ACSH GL+++   ++
Subjt:  INSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVY

Query:  KAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANE
          M+  Y + P+ +HYAC+VD+ GR+G  ++A  FI+ MP+     VWGALL AC+IH N    E     LL  +    G   LLSN YA   +W + +E
Subjt:  KAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANE

Query:  VRDTMRSRGLKKVAGCSWIEI
        +R  MR  GLKK  GCS IEI
Subjt:  VRDTMRSRGLKKVAGCSWIEI

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | 1.7e-103 | 38.62
Query:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYI----------VDD
        I  P   I+N++F     S  P + L L+  M+      N YTF + LK+C    +   G QIH  ++K G   D+++  SL+  Y+          V D
Subjt:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYI----------VDD

Query:  DVP---------------------SASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRL
          P                     +A ++FD +P  DVVSW ++ISG A+ G  +EAL  F  M   NVRP+ +T+V+ +SAC+  G ++LG+ +H L +
Subjt:  DVP---------------------SASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRL

Query:  RSLSEGS-VSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHS
             GS + + NAL+D Y +CG L +A  LF+ +P +DV+SW T+IGGY    L ++A+ +FQ M+  GE  PN+ T++++L AC+ + A+ +G+W+H 
Subjt:  RSLSEGS-VSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHS

Query:  YINSR-PDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM
        YI+ R   VT   S+  +LI+MY KCG +E A  +F ++ HK + SW+ +I G AM+G    +F LFS M   GI PDDITF+GLLSACSH G+++ G  
Subjt:  YINSR-PDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM

Query:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA
        +++ M   Y + P++ HY CM+D+ G +GLF EAE  I  M +E +G +W +LL AC++HGN +  E   + L+  +    G++ LLSN YAS  RWN+ 
Subjt:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKVAGCSWIEI
         + R  +  +G+KKV GCS IEI
Subjt:  NEVRDTMRSRGLKKVAGCSWIEI

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 | 8.1e-93 | 35.84
Query:  FNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFDSMPDPDVV
        +NS+   F      E  L  F  M +     N Y+F   L AC  L+  + G+Q+H+ + KS  LSD++I ++L+  Y    +V  A R+FD M D +VV
Subjt:  FNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFDSMPDPDVV

Query:  SWTSIISGLAKLGFEEEALGKF---LSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRS-LSEGSVSLDNALLDFYVRCGSLRSAENLFDEMP---
        SW S+I+   + G   EAL  F   L   V P+  TL S +SAC+SL  +K+G+ +HG  +++      + L NA +D Y +C  ++ A  +FD MP   
Subjt:  SWTSIISGLAKLGFEEEALGKF---LSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRS-LSEGSVSLDNALLDFYVRCGSLRSAENLFDEMP---

Query:  ----------------------------KRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDV
                                    +R+VVSW  +I GY Q G  E+A+ +F  ++    V P   +  N+L AC+ ++ LHLG   H ++      
Subjt:  ----------------------------KRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDV

Query:  TVDGS-----VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKA
           G      VGN+LI+MYVKCG +E   ++F+ +  +D +SW+ +I G A NG G++A  LF  ML  G  PD IT +G+LSAC H G + +G   + +
Subjt:  TVDGS-----VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKA

Query:  MKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVR
        M   + VAP   HY CMVD+ GRAG  +EA++ I+EMP++ +  +WG+LL AC++H N    + V + LL  +    G + LLSN YA   +W D   VR
Subjt:  MKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVR

Query:  DTMRSRGLKKVAGCSWIEI
         +MR  G+ K  GCSWI+I
Subjt:  DTMRSRGLKKVAGCSWIEI

Q9SJZ3 Pentatricopeptide repeat-containing protein At2g22410, mitochondrial | 4.3e-94 | 34.8
Query:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRH---PSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASR
        I NP    +N     F +S  P+    L+ QMLRH    S  +H+T+    K C  L  +  G  I   ++K        + N+ +H +    D+ +A +
Subjt:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRH---PSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASR

Query:  IFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAE
        +FD  P  D+VSW  +I+G  K+G  E+A+  +  M    V+P+  T++  +S+CS LG L  GK  +     +    ++ L NAL+D + +CG +  A 
Subjt:  IFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAE

Query:  NLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNM------------------------------VHVGEVKPNEATLINVLSACSSMSALHLGQWVH
         +FD + KR +VSWTTMI GYA+ GL + + ++F +M                              +     KP+E T+I+ LSACS + AL +G W+H
Subjt:  NLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNM------------------------------VHVGEVKPNEATLINVLSACSSMSALHLGQWVH

Query:  SYINSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM
         YI  +  ++++ ++G +L++MY KCG++  A+ +F  ++ ++ ++++ II GLA++G  S A   F+ M+  GI+PD+ITF+GLLSAC HGG+I  G  
Subjt:  SYINSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM

Query:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA
         +  MK  +N+ PQ++HY+ MVD+ GRAGL +EA+  ++ MP+EA+  VWGALL  C++HGN +  E+  + LL       G + LL   Y   + W DA
Subjt:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKVAGCSWIEI
           R  M  RG++K+ GCS IE+
Subjt:  NEVRDTMRSRGLKKVAGCSWIEI

Q9SZK1 Pentatricopeptide repeat-containing protein At4g38010 | 6.6e-95 | 38.12
Query:  FNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFDSMPDPDVV
        +N+L  ++     P   +F +   + +  S + +TF    KAC        G QIH  + K G   DI++QNSL+HFY V  +  +A ++F  MP  DVV
Subjt:  FNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFDSMPDPDVV

Query:  SWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSW
        SWT II+G  + G  +EAL  F  M+V PN AT V  L +   +GCL LGK IHGL L+  S  S+   NAL+D YV+C  L  A  +F E+ K+D VSW
Subjt:  SWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSW

Query:  TTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDI
         +MI G       ++A+ +F  M     +KP+   L +VLSAC+S+ A+  G+WVH YI +   +  D  +G A+++MY KCG +E A+ IF  +  K++
Subjt:  TTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDI

Query:  ISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKD-VYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPV
         +W+ ++ GLA++G G ++   F  M+  G  P+ +TFL  L+AC H GL+++G   +  MK   YN+ P++ HY CM+D+  RAGL DEA   +K MPV
Subjt:  ISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKD-VYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPV

Query:  EAEGPVWGALLHACQIHGNEKKY-EEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKVAGCSWIE
        + +  + GA+L AC+  G   +  +E+    L  +    G + LLSN +A+  RW+D   +R  M+ +G+ KV G S+IE
Subjt:  EAEGPVWGALLHACQIHGNEKKY-EEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKVAGCSWIE

Arabidopsis top hits | e-value | % identity | Alignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.2e-104 | 38.62
Query:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYI----------VDD
        I  P   I+N++F     S  P + L L+  M+      N YTF + LK+C    +   G QIH  ++K G   D+++  SL+  Y+          V D
Subjt:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYI----------VDD

Query:  DVP---------------------SASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRL
          P                     +A ++FD +P  DVVSW ++ISG A+ G  +EAL  F  M   NVRP+ +T+V+ +SAC+  G ++LG+ +H L +
Subjt:  DVP---------------------SASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRL

Query:  RSLSEGS-VSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHS
             GS + + NAL+D Y +CG L +A  LF+ +P +DV+SW T+IGGY    L ++A+ +FQ M+  GE  PN+ T++++L AC+ + A+ +G+W+H 
Subjt:  RSLSEGS-VSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHS

Query:  YINSR-PDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM
        YI+ R   VT   S+  +LI+MY KCG +E A  +F ++ HK + SW+ +I G AM+G    +F LFS M   GI PDDITF+GLLSACSH G+++ G  
Subjt:  YINSR-PDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM

Query:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA
        +++ M   Y + P++ HY CM+D+ G +GLF EAE  I  M +E +G +W +LL AC++HGN +  E   + L+  +    G++ LLSN YAS  RWN+ 
Subjt:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKVAGCSWIEI
         + R  +  +G+KKV GCS IEI
Subjt:  NEVRDTMRSRGLKKVAGCSWIEI

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein | 5.7e-94 | 35.84
Query:  FNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFDSMPDPDVV
        +NS+   F      E  L  F  M +     N Y+F   L AC  L+  + G+Q+H+ + KS  LSD++I ++L+  Y    +V  A R+FD M D +VV
Subjt:  FNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFDSMPDPDVV

Query:  SWTSIISGLAKLGFEEEALGKF---LSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRS-LSEGSVSLDNALLDFYVRCGSLRSAENLFDEMP---
        SW S+I+   + G   EAL  F   L   V P+  TL S +SAC+SL  +K+G+ +HG  +++      + L NA +D Y +C  ++ A  +FD MP   
Subjt:  SWTSIISGLAKLGFEEEALGKF---LSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRS-LSEGSVSLDNALLDFYVRCGSLRSAENLFDEMP---

Query:  ----------------------------KRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDV
                                    +R+VVSW  +I GY Q G  E+A+ +F  ++    V P   +  N+L AC+ ++ LHLG   H ++      
Subjt:  ----------------------------KRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDV

Query:  TVDGS-----VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKA
           G      VGN+LI+MYVKCG +E   ++F+ +  +D +SW+ +I G A NG G++A  LF  ML  G  PD IT +G+LSAC H G + +G   + +
Subjt:  TVDGS-----VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKA

Query:  MKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVR
        M   + VAP   HY CMVD+ GRAG  +EA++ I+EMP++ +  +WG+LL AC++H N    + V + LL  +    G + LLSN YA   +W D   VR
Subjt:  MKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVR

Query:  DTMRSRGLKKVAGCSWIEI
         +MR  G+ K  GCSWI+I
Subjt:  DTMRSRGLKKVAGCSWIEI

AT2G22410.1 SLOW GROWTH 1 | 3.0e-95 | 34.8
Query:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRH---PSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASR
        I NP    +N     F +S  P+    L+ QMLRH    S  +H+T+    K C  L  +  G  I   ++K        + N+ +H +    D+ +A +
Subjt:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRH---PSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASR

Query:  IFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAE
        +FD  P  D+VSW  +I+G  K+G  E+A+  +  M    V+P+  T++  +S+CS LG L  GK  +     +    ++ L NAL+D + +CG +  A 
Subjt:  IFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAE

Query:  NLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNM------------------------------VHVGEVKPNEATLINVLSACSSMSALHLGQWVH
         +FD + KR +VSWTTMI GYA+ GL + + ++F +M                              +     KP+E T+I+ LSACS + AL +G W+H
Subjt:  NLFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNM------------------------------VHVGEVKPNEATLINVLSACSSMSALHLGQWVH

Query:  SYINSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM
         YI  +  ++++ ++G +L++MY KCG++  A+ +F  ++ ++ ++++ II GLA++G  S A   F+ M+  GI+PD+ITF+GLLSAC HGG+I  G  
Subjt:  SYINSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM

Query:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA
         +  MK  +N+ PQ++HY+ MVD+ GRAGL +EA+  ++ MP+EA+  VWGALL  C++HGN +  E+  + LL       G + LL   Y   + W DA
Subjt:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKVAGCSWIEI
           R  M  RG++K+ GCS IE+
Subjt:  NEVRDTMRSRGLKKVAGCSWIEI

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 9.8e-94; %identity: 35.89)
Query:  PKPHIF--NSLFGAFVDSGAPENGLFLFNQMLRHPSSH-NHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFD
        PKP+ F  N+L  A+     P   ++ F  M+     + N YTF + +KA   + S   G  +H   VKS   SD+F+ NSL+H Y    D+ SA ++F 
Subjt:  PKPHIF--NSLFGAFVDSGAPENGLFLFNQMLRHPSSH-NHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFD

Query:  SMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLF
        ++ + DVVSW S+I+G  + G  ++AL  F  M   +V+ +  T+V  LSAC+ +  L+ G+ +      +    +++L NA+LD Y +CGS+  A+ LF
Subjt:  SMPDPDVVSWTSIISGLAKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLF

Query:  DEMPKRDVVSWTTMIGGYA-------------------------------QGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSY
        D M ++D V+WTTM+ GYA                               Q G   +A+ +F  +     +K N+ TL++ LSAC+ + AL LG+W+HSY
Subjt:  DEMPKRDVVSWTTMIGGYA-------------------------------QGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSY

Query:  INSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVY
        I  +  + ++  V +ALI+MY KCG +E +  +F ++E +D+  WS +I GLAM+G G++A  +F  M    + P+ +TF  +  ACSH GL+++   ++
Subjt:  INSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVY

Query:  KAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANE
          M+  Y + P+ +HYAC+VD+ GR+G  ++A  FI+ MP+     VWGALL AC+IH N    E     LL  +    G   LLSN YA   +W + +E
Subjt:  KAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANE

Query:  VRDTMRSRGLKKVAGCSWIEI
        +R  MR  GLKK  GCS IEI
Subjt:  VRDTMRSRGLKKVAGCSWIEI

AT4G38010.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 4.7e-96; %identity: 38.12)
Query:  FNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFDSMPDPDVV
        +N+L  ++     P   +F +   + +  S + +TF    KAC        G QIH  + K G   DI++QNSL+HFY V  +  +A ++F  MP  DVV
Subjt:  FNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDDDVPSASRIFDSMPDPDVV

Query:  SWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSW
        SWT II+G  + G  +EAL  F  M+V PN AT V  L +   +GCL LGK IHGL L+  S  S+   NAL+D YV+C  L  A  +F E+ K+D VSW
Subjt:  SWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSW

Query:  TTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDI
         +MI G       ++A+ +F  M     +KP+   L +VLSAC+S+ A+  G+WVH YI +   +  D  +G A+++MY KCG +E A+ IF  +  K++
Subjt:  TTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDI

Query:  ISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKD-VYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPV
         +W+ ++ GLA++G G ++   F  M+  G  P+ +TFL  L+AC H GL+++G   +  MK   YN+ P++ HY CM+D+  RAGL DEA   +K MPV
Subjt:  ISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKD-VYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPV

Query:  EAEGPVWGALLHACQIHGNEKKY-EEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKVAGCSWIE
        + +  + GA+L AC+  G   +  +E+    L  +    G + LLSN +A+  RW+D   +R  M+ +G+ KV G S+IE
Subjt:  EAEGPVWGALLHACQIHGNEKKY-EEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKVAGCSWIE


Sequences
CDS sequence
ATGTGTTTCCGTTGCAAAACCTGCCTCAAAATTTTCGAAAGCCTCAGATCTGAAATGGATGGTCTAGGTCACGAACCGGTTGCAGATTTGAAATGGATGGTGGTGCGACA
TCTCACGTCATCTCTCATCTCCGATCTCTCCCTCTCACAAGTCGTCGACCGTGCATCTGAAACGAGGACCAGCAACCCAACACTCTGGCGCTACTACCAGAGGACAAGCA
AACCTCGAACAGTGCCGACGTTCAAACTAGATTCAAACCAGATTTGTGTTTGTCTTTCAGAAGGATCAACGTCCAATTCGGACAAAACTTCTAAGCTCTCCACGCATGGT
GGCAATAGCATGCCATTAAAACCTCCAAAACCCTCAATTTCAATCGCTCAGCTAACCCAAATCCACGCCCAACTCATCACAAATCCAAAACCCCACATTTTCAACTCTTT
GTTCGGTGCGTTCGTCGATTCCGGCGCCCCTGAAAATGGCCTCTTCCTCTTCAACCAAATGCTTCGCCACCCATCTTCTCACAACCATTACACTTTCACCTACGCCCTCA
AAGCCTGTTGCCTTCTCCATTCAACCCACAACGGCCTCCAAATCCATGCCCGTCTCGTCAAATCTGGGCACCTTTCTGACATCTTCATCCAAAATTCCTTGCTCCATTTC
TACATTGTCGATGACGACGTTCCTTCTGCTTCTCGAATCTTTGATTCCATGCCTGACCCAGATGTGGTTTCGTGGACTTCGATCATTTCGGGGCTTGCCAAGCTGGGTTT
TGAAGAGGAGGCTCTGGGTAAGTTCTTGTCTATGAATGTGAGGCCTAATTCGGCTACTCTTGTTAGTGCTTTATCTGCTTGTTCTAGTCTTGGATGTCTTAAGCTTGGGA
AAGCTATACATGGGCTGAGATTGCGGAGTTTGAGTGAGGGAAGTGTTAGTTTGGACAATGCCCTTTTGGATTTTTATGTTAGATGTGGGTCTTTGAGGAGTGCAGAGAAC
CTGTTTGATGAAATGCCTAAGAGAGATGTAGTGTCTTGGACTACAATGATTGGGGGTTATGCACAGGGTGGATTGTGTGAAAAGGCTGTGAGGATATTTCAAAACATGGT
TCATGTGGGAGAGGTTAAGCCCAATGAAGCCACTCTAATTAACGTATTATCTGCATGTTCTTCCATGTCTGCTCTGCATTTGGGTCAATGGGTGCATTCCTATATCAACT
CTAGGCCTGATGTGACAGTTGATGGAAGTGTTGGAAACGCTTTGATTAATATGTATGTCAAATGTGGTAGCATGGAAATGGCAATTATGATTTTTAAAACCCTGGAACAC
AAGGATATCATATCATGGAGCACAATTATAAGTGGGTTAGCCATGAATGGCCTAGGCAGCCAAGCTTTTGTTCTCTTCTCCCTCATGCTAGTTCATGGCATTTCTCCAGA
TGACATAACATTTCTTGGCCTGTTATCTGCATGCAGCCATGGAGGGTTGATCAATCAAGGCTTGATGGTTTATAAAGCCATGAAAGATGTTTATAATGTTGCACCTCAGA
TGAGGCATTATGCTTGCATGGTGGACATGTATGGAAGGGCAGGGCTTTTTGATGAAGCAGAGGCATTCATAAAGGAGATGCCTGTGGAAGCAGAAGGCCCAGTTTGGGGA
GCTCTGTTGCATGCCTGTCAAATTCATGGGAATGAGAAGAAGTATGAGGAAGTTAGGCAATGGCTGCTTAGCAGCAAAGGGGTTACAGTGGGAACTTTTGCTTTGTTATC
AAATACTTATGCTAGTTGTGATAGATGGAACGATGCGAATGAAGTTCGAGATACAATGAGAAGTAGAGGGTTGAAGAAAGTGGCTGGATGTAGTTGGATTGAAATTGGTT
GA
mRNA sequence
ATGTGTTTCCGTTGCAAAACCTGCCTCAAAATTTTCGAAAGCCTCAGATCTGAAATGGATGGTCTAGGTCACGAACCGGTTGCAGATTTGAAATGGATGGTGGTGCGACA
TCTCACGTCATCTCTCATCTCCGATCTCTCCCTCTCACAAGTCGTCGACCGTGCATCTGAAACGAGGACCAGCAACCCAACACTCTGGCGCTACTACCAGAGGACAAGCA
AACCTCGAACAGTGCCGACGTTCAAACTAGATTCAAACCAGATTTGTGTTTGTCTTTCAGAAGGATCAACGTCCAATTCGGACAAAACTTCTAAGCTCTCCACGCATGGT
GGCAATAGCATGCCATTAAAACCTCCAAAACCCTCAATTTCAATCGCTCAGCTAACCCAAATCCACGCCCAACTCATCACAAATCCAAAACCCCACATTTTCAACTCTTT
GTTCGGTGCGTTCGTCGATTCCGGCGCCCCTGAAAATGGCCTCTTCCTCTTCAACCAAATGCTTCGCCACCCATCTTCTCACAACCATTACACTTTCACCTACGCCCTCA
AAGCCTGTTGCCTTCTCCATTCAACCCACAACGGCCTCCAAATCCATGCCCGTCTCGTCAAATCTGGGCACCTTTCTGACATCTTCATCCAAAATTCCTTGCTCCATTTC
TACATTGTCGATGACGACGTTCCTTCTGCTTCTCGAATCTTTGATTCCATGCCTGACCCAGATGTGGTTTCGTGGACTTCGATCATTTCGGGGCTTGCCAAGCTGGGTTT
TGAAGAGGAGGCTCTGGGTAAGTTCTTGTCTATGAATGTGAGGCCTAATTCGGCTACTCTTGTTAGTGCTTTATCTGCTTGTTCTAGTCTTGGATGTCTTAAGCTTGGGA
AAGCTATACATGGGCTGAGATTGCGGAGTTTGAGTGAGGGAAGTGTTAGTTTGGACAATGCCCTTTTGGATTTTTATGTTAGATGTGGGTCTTTGAGGAGTGCAGAGAAC
CTGTTTGATGAAATGCCTAAGAGAGATGTAGTGTCTTGGACTACAATGATTGGGGGTTATGCACAGGGTGGATTGTGTGAAAAGGCTGTGAGGATATTTCAAAACATGGT
TCATGTGGGAGAGGTTAAGCCCAATGAAGCCACTCTAATTAACGTATTATCTGCATGTTCTTCCATGTCTGCTCTGCATTTGGGTCAATGGGTGCATTCCTATATCAACT
CTAGGCCTGATGTGACAGTTGATGGAAGTGTTGGAAACGCTTTGATTAATATGTATGTCAAATGTGGTAGCATGGAAATGGCAATTATGATTTTTAAAACCCTGGAACAC
AAGGATATCATATCATGGAGCACAATTATAAGTGGGTTAGCCATGAATGGCCTAGGCAGCCAAGCTTTTGTTCTCTTCTCCCTCATGCTAGTTCATGGCATTTCTCCAGA
TGACATAACATTTCTTGGCCTGTTATCTGCATGCAGCCATGGAGGGTTGATCAATCAAGGCTTGATGGTTTATAAAGCCATGAAAGATGTTTATAATGTTGCACCTCAGA
TGAGGCATTATGCTTGCATGGTGGACATGTATGGAAGGGCAGGGCTTTTTGATGAAGCAGAGGCATTCATAAAGGAGATGCCTGTGGAAGCAGAAGGCCCAGTTTGGGGA
GCTCTGTTGCATGCCTGTCAAATTCATGGGAATGAGAAGAAGTATGAGGAAGTTAGGCAATGGCTGCTTAGCAGCAAAGGGGTTACAGTGGGAACTTTTGCTTTGTTATC
AAATACTTATGCTAGTTGTGATAGATGGAACGATGCGAATGAAGTTCGAGATACAATGAGAAGTAGAGGGTTGAAGAAAGTGGCTGGATGTAGTTGGATTGAAATTGGTT
GA
Protein sequence
MCFRCKTCLKIFESLRSEMDGLGHEPVADLKWMVVRHLTSSLISDLSLSQVVDRASETRTSNPTLWRYYQRTSKPRTVPTFKLDSNQICVCLSEGSTSNSDKTSKLSTHG
GNSMPLKPPKPSISIAQLTQIHAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRHPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHF
YIVDDDVPSASRIFDSMPDPDVVSWTSIISGLAKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAEN
LFDEMPKRDVVSWTTMIGGYAQGGLCEKAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVTVDGSVGNALINMYVKCGSMEMAIMIFKTLEH
KDIISWSTIISGLAMNGLGSQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWG
ALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKVAGCSWIEIG
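
The CDS and protein sequences listed for Spg022163 can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene model; the function name `translate` and the two short fragments in the usage note are illustrative and are not part of the database record.

```python
# Minimal sketch: translate a CDS with the standard genetic code.
# Assumption: the standard (nuclear) codon table applies to this gene model.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG x TCAG x TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    from the record as wrapped lines can be passed in directly.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)
```

For example, `translate("ATGTGTTTCCGTTGCAAAACCTGC")` applied to the first 24 bases of the CDS gives `MCFRCKTC`, the start of the protein sequence, and `translate("ATTGAAATTGGTTGA")` applied to the last 15 bases gives `IEIG`, its end.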