CuGenDBv2

Lag0034878 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034878
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr3:11934288..11935907
RNA-Seq Expression: Lag0034878
Synteny: Lag0034878
Gene Ontology terms:
  GO:1900865 - chloroplast RNA modification (biological process)
  GO:0009507 - chloroplast (cellular component)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
TYK18848.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | 3.3e-257 | 84.74
Query:  NSMPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDI
        NSMPLKP KPSISIAQ TQI A+L+TNPKPHIFN L G+ V+S +PENGLFL+NQML YPSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDI
Subjt:  NSMPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDI

Query:  FIQNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSL
        FIQNSLLHFYI++GDV SAS IFDS+P+PDVVSWTSIISGLSKLGFE+EALGKFLSMNVRPNS TLV+ALSACSSL CLKLGKAIHGLRLR+L+E +VSL
Subjt:  FIQNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSL

Query:  DNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVD
        +NALLDFYVRC  LRSAENLF++M KRDVVSWTTMIGGYAQ GLCEEAVR+FQNMVHVGE  PNEATL+NVLSACSS+SALHLGQWVHSYINSR DVI+D
Subjt:  DNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVD

Query:  GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVA
        G+VGNALINMYVKCG+MEMAI+IF  +EHKDIISWST+ISGLAMNGL +QAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN++
Subjt:  GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVA

Query:  PQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGL
        PQ+RHYACMVDMYG+AGL DEAEAFIKEMP+EAEGPVWGALLHACQIHGNEKKYE+VR+ LL SKGVTVG FALLSNTYASCDRWNDAN+VR  MRSRGL
Subjt:  PQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGL

Query:  KKVAGCSWIEI
        KK+AGCSWIE+
Subjt:  KKVAGCSWIEI
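
The % identity column can be related to the alignment text itself: in a BLAST-style pairwise view, the unlabeled middle line shows a letter where query and subject residues are identical, `+` where the substitution is conservative, and a space otherwise. The following is a minimal sketch, assuming the alignment has already been split into (query segment, midline) pairs; the function name `percent_identity` is illustrative and not part of CuGenDB or BLAST. Values computed from the text shown here may differ from the reported column if the page displays only part of the alignment.

```python
def percent_identity(blocks):
    """Estimate % identity from BLAST-style alignment blocks.

    blocks: list of (query_segment, midline) string pairs.
    A letter in the midline marks an identical residue; '+' marks a
    conservative substitution; a space marks a mismatch or gap.
    """
    identical = 0
    aligned = 0
    for query_segment, midline in blocks:
        # Use the query segment for the length, since trailing spaces
        # in the midline are often trimmed during extraction.
        aligned += len(query_segment)
        identical += sum(1 for ch in midline if ch.isalpha())
    if aligned == 0:
        raise ValueError("no aligned positions")
    return 100.0 * identical / aligned


# Last block of the TYK18848.1 alignment above: 9 identical positions of 11.
last_block = [("KKVAGCSWIEI", "KK+AGCSWIE+")]
print(round(percent_identity(last_block), 2))
```

Summing over every block of a hit, rather than one block, gives the figure comparable to the table column.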

XP_011660133.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Cucumis sativus] | 1.1e-257 | 84.87
Query:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MPLKP KPSISIAQ TQI A+L+TNPKPHIFN L G+ V+S  PENGLFL+NQMLRYPSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYI+DGDV SAS IFDS+PDPDVVSWTSIISGLSKLGFE+EAL KFLSMNVRPNS TLV+ALSACSSL CLKLGKAIHGLR+R+L+E +V L+N
Subjt:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS
        ALLDFYVRC  LRSAENLF++MPKRDVVSWTTMIGGYAQ GLCEEAVR+FQNMVHVGE  PNEATL+NVLSACSS+SALHLGQWVHSYINSR DVI+DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        VGNALINMYVKCG+MEMAI+IFK +EHKDI+SWSTIISGLAMNGL +QAFVLFSLMLVHG+SPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN++PQ
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMP+EAEGPVWGALLHACQ+HGNEKKYE+VR+WLL SKGVTVGTFALLSNTYA CDRWNDAN+VR  MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AG SWIE+
Subjt:  VAGCSWIEI

XP_022960642.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita moschata] | 4.3e-257 | 85.66
Query:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MP +PPK  IS AQL+QI AQL+TNPKP +FN L GA VDS APENGLFL+NQMLR+PSSHNHYTFTYALKAC LLH TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYIVDGDVPSASR+FDS+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLVSALSACSSL C+K+GKAIHGL+LRSL+E SV+LDN
Subjt:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS
        ALLDFYVRCGSLR A+NLFDEMP+RDVVSWTT+IGGYA  GLCEEAVR+FQNMVH  E  PNEATLINVLSACSSMSALHLGQWVHSYINSR DVI+DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        +GNALINMYVKCGSM+ AI IFKT+EHKDIISWSTIISGLAMNG  +QAF LFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNVAP+
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE +YE+VRQWLLSSK +TVGT+ALLSNTYASCDRWNDANEVRD MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSWIE+
Subjt:  VAGCSWIEI

XP_023515586.1 pentatricopeptide repeat-containing protein At4g38010-like isoform X1 [Cucurbita pepo subsp. pepo] | 6.6e-258 | 85.85
Query:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MPL+PPK SIS AQL+QI AQL+TNPKPH+FN L GA VDS APENGLFL+NQMLR+PSSHNHYTFTYALKAC LLH TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYIVDGDVPSASR+FDS+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLVSALSACSSL C+K+GKAIHGL+LRSL+  SVSLDN
Subjt:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS
        ALLDFYVRCGSLR A+NLFDEMP+RDVVSWTT+IGGYA  GLCEEAVR+FQNMVH  E  PNEATLINVLSACSSMSALHLG+WVHSYINSR DVI+DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        +GNALINMYVKCGSM+ AI IFKT+EHKD+ISWSTIISGLAMNG  +QAF LFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNVAP+
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE  YE+VRQWLLSSK +TVGT+ALLSNTYASCDRWNDANEVRD MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSWIE+
Subjt:  VAGCSWIEI

XP_038878297.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Benincasa hispida] | 8.4e-261 | 86.25
Query:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        M LKPPKPSI IAQL QI   L+ NPKPHI N L G+ V+S +PENGLFL+NQMLRYPSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDIF+
Subjt:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYI+DGDVPSASRIFDS+PDPDV+SWTSIISGLSKLGFE+EALGKFLSMNVRPNS TLV+ALSACSSL CLKLGKAIHGLRLRSL+E +VSLDN
Subjt:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS
        ALLDFYVRCG LRSAE LFDEMPKRDVVSWTTMIGGYAQ GLCEEAVR+FQNMVHVGE  PNEATLINVLSACSS+SALHLGQWVHSYINSR DVI+DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        VGNALINMYVKCG+MEMAI+IFK +EHKDIISWSTIISGLAMNGL  QAF LFSLMLVHGISPDDITFL LLSACSHGGLINQGLMV++AMKDVYN++PQ
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVD+YG+AGL DEAEAFIKEMP+EAEG VWGALLHACQIHGNEKKYE+V++WLL SKGVTVGTFALLSNTYASCDRWNDANEVRDTMRS+GLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSWIE+
Subjt:  VAGCSWIEI

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LXJ1 Uncharacterized protein | 5.5e-258 | 84.87
Query:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MPLKP KPSISIAQ TQI A+L+TNPKPHIFN L G+ V+S  PENGLFL+NQMLRYPSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYI+DGDV SAS IFDS+PDPDVVSWTSIISGLSKLGFE+EAL KFLSMNVRPNS TLV+ALSACSSL CLKLGKAIHGLR+R+L+E +V L+N
Subjt:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS
        ALLDFYVRC  LRSAENLF++MPKRDVVSWTTMIGGYAQ GLCEEAVR+FQNMVHVGE  PNEATL+NVLSACSS+SALHLGQWVHSYINSR DVI+DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        VGNALINMYVKCG+MEMAI+IFK +EHKDI+SWSTIISGLAMNGL +QAFVLFSLMLVHG+SPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN++PQ
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMP+EAEGPVWGALLHACQ+HGNEKKYE+VR+WLL SKGVTVGTFALLSNTYA CDRWNDAN+VR  MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AG SWIE+
Subjt:  VAGCSWIEI

A0A1S4DY27 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 | 4.8e-254 | 84.09
Query:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MPLKP KPSIS AQ TQI A+L+TNPKPHIFN L G+ V+S +PENGLFL+NQML YPSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYI+ GDV SAS IFDS+P+PDVVSWTSIISG SKLGFE+EALGKFLSMNVRPNS TLV+ALSACSSL  LKLGKAIHGLRLR+L+E +VSL+N
Subjt:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS
        ALLDFYVRC  LRSAENLF++M KRDVVSWTTMIGGYAQ GLCEEAVR+FQNMVH GE  PNEATL+NVLSACSS+SALHLGQWVHSYINSR DVI+DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        VGNALINMYVKCG+MEMAI+IFK +EHKDIISWST+ISGLAMNGL +QAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN++PQ
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        +RHYACMVDMYG+AGL DEAEAFIKEMP+EAEGPVWGALLHACQIHGNEKKYE+VR+ LL SKGVTVG FALLSNTYASCDRWNDAN+VR  MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSWIE+
Subjt:  VAGCSWIEI

A0A5D3D5L6 Pentatricopeptide repeat-containing protein | 1.6e-257 | 84.74
Query:  NSMPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDI
        NSMPLKP KPSISIAQ TQI A+L+TNPKPHIFN L G+ V+S +PENGLFL+NQML YPSSHNH+TFTYALKACC LH T  GL+IHA L+KSGHLSDI
Subjt:  NSMPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDI

Query:  FIQNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSL
        FIQNSLLHFYI++GDV SAS IFDS+P+PDVVSWTSIISGLSKLGFE+EALGKFLSMNVRPNS TLV+ALSACSSL CLKLGKAIHGLRLR+L+E +VSL
Subjt:  FIQNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSL

Query:  DNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVD
        +NALLDFYVRC  LRSAENLF++M KRDVVSWTTMIGGYAQ GLCEEAVR+FQNMVHVGE  PNEATL+NVLSACSS+SALHLGQWVHSYINSR DVI+D
Subjt:  DNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVD

Query:  GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVA
        G+VGNALINMYVKCG+MEMAI+IF  +EHKDIISWST+ISGLAMNGL +QAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN++
Subjt:  GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVA

Query:  PQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGL
        PQ+RHYACMVDMYG+AGL DEAEAFIKEMP+EAEGPVWGALLHACQIHGNEKKYE+VR+ LL SKGVTVG FALLSNTYASCDRWNDAN+VR  MRSRGL
Subjt:  PQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGL

Query:  KKVAGCSWIEI
        KK+AGCSWIE+
Subjt:  KKVAGCSWIEI

A0A6J1H7Z9 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 | 2.1e-257 | 85.66
Query:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI
        MP +PPK  IS AQL+QI AQL+TNPKP +FN L GA VDS APENGLFL+NQMLR+PSSHNHYTFTYALKAC LLH TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN
        QNSLLHFYIVDGDVPSASR+FDS+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLVSALSACSSL C+K+GKAIHGL+LRSL+E SV+LDN
Subjt:  QNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDN

Query:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS
        ALLDFYVRCGSLR A+NLFDEMP+RDVVSWTT+IGGYA  GLCEEAVR+FQNMVH  E  PNEATLINVLSACSSMSALHLGQWVHSYINSR DVI+DG+
Subjt:  ALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGS

Query:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ
        +GNALINMYVKCGSM+ AI IFKT+EHKDIISWSTIISGLAMNG  +QAF LFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNVAP+
Subjt:  VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQ

Query:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
        MRHYACMVDMYG+AGL DEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE +YE+VRQWLLSSK +TVGT+ALLSNTYASCDRWNDANEVRD MRSRGLKK
Subjt:  MRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  VAGCSWIEI
        +AGCSWIE+
Subjt:  VAGCSWIEI

A0A6J1JC26 pentatricopeptide repeat-containing protein At4g38010-like isoform X1 | 2.5e-255 | 84.74
Query:  NSMPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDI
        NSMP KPPK  IS AQL+QI AQL+TNPKP++FN L GA +DS APENGLFL+NQMLR+PSSHNHYTFTYALKAC LLH TH GL+IHARL+KSGHLSDI
Subjt:  NSMPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDI

Query:  FIQNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSL
        FIQNSLLHFYIVDGDV SASR+FDS+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLVSALSACS+L CLK+GKAIHGL+LRSL+E SV+L
Subjt:  FIQNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSL

Query:  DNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVD
        DNALLDFYVRCGSLR A+NLFDEMP+RDVVSWTT+IGGYA  GLCEEAVR+FQNMVH  E  PNEATLINVLSACSSMSALH GQWVHSY+NSR D+I+D
Subjt:  DNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVD

Query:  GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVA
        G++GNALINMYVKCGSME AI IFKT+EHKDIISWSTIISGLAMNG  +QAF LFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNVA
Subjt:  GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVA

Query:  PQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGL
        P+MR YACMVDMYG+AGL DEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE +YE+VRQWLLSSK +TVGT+ALLSNTYASC RWNDANEVRD MRSRGL
Subjt:  PQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGL

Query:  KKVAGCSWIEI
        KK+AGCSWIE+
Subjt:  KKVAGCSWIEI

SwissProt top hits | e value | % identity | Alignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic | 1.2e-92 | 36.08
Query:  PKPHIF--NSLFGAFVDSGAPENGLFLFNQMLRYPSSH-NHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFD
        PKP+ F  N+L  A+     P   ++ F  M+     + N YTF + +KA   + S   G  +H   VKS   SD+F+ NSL+H Y   GD+ SA ++F 
Subjt:  PKPHIF--NSLFGAFVDSGAPENGLFLFNQMLRYPSSH-NHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFD

Query:  SMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLF
        ++ + DVVSW S+I+G  + G  ++AL  F  M   +V+ +  T+V  LSAC+ +  L+ G+ +      +    +++L NA+LD Y +CGS+  A+ LF
Subjt:  SMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLF

Query:  DEMPKRDVVSWTTMIGGYA-------------------------------QGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSY
        D M ++D V+WTTM+ GYA                               Q G   EA+ +F  +     +K N+ TL++ LSAC+ + AL LG+W+HSY
Subjt:  DEMPKRDVVSWTTMIGGYA-------------------------------QGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSY

Query:  INSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVY
        I  +  + ++  V +ALI+MY KCG +E +  +F ++E +D+  WS +I GLAM+G   +A  +F  M    + P+ +TF  +  ACSH GL+++   ++
Subjt:  INSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVY

Query:  KAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANE
          M+  Y + P+ +HYAC+VD+ GR+G  ++A  FI+ MP+     VWGALL AC+IH N    E     LL  +    G   LLSN YA   +W + +E
Subjt:  KAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANE

Query:  VRDTMRSRGLKKVAGCSWIEI
        +R  MR  GLKK  GCS IEI
Subjt:  VRDTMRSRGLKKVAGCSWIEI

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | 6.7e-104 | 37.86
Query:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGD---------
        I  P   I+N++F     S  P + L L+  M+      N YTF + LK+C    +   G QIH  ++K G   D+++  SL+  Y+ +G          
Subjt:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGD---------

Query:  ----------------------VPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRL
                              + +A ++FD +P  DVVSW ++ISG ++ G  +EAL  F  M   NVRP+ +T+V+ +SAC+  G ++LG+ +H L +
Subjt:  ----------------------VPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRL

Query:  RSLSEGS-VSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHS
             GS + + NAL+D Y +CG L +A  LF+ +P +DV+SW T+IGGY    L +EA+ +FQ M+  GE  PN+ T++++L AC+ + A+ +G+W+H 
Subjt:  RSLSEGS-VSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHS

Query:  YINSRPDVIVD-GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM
        YI+ R   + +  S+  +LI+MY KCG +E A  +F ++ HK + SW+ +I G AM+G +  +F LFS M   GI PDDITF+GLLSACSH G+++ G  
Subjt:  YINSRPDVIVD-GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM

Query:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA
        +++ M   Y + P++ HY CM+D+ G +GLF EAE  I  M +E +G +W +LL AC++HGN +  E   + L+  +    G++ LLSN YAS  RWN+ 
Subjt:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKVAGCSWIEI
         + R  +  +G+KKV GCS IEI
Subjt:  NEVRDTMRSRGLKKVAGCSWIEI

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 | 1.4e-93 | 36.03
Query:  FNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFDSMPDPDVV
        +NS+   F      E  L  F  M +     N Y+F   L AC  L+  + G+Q+H+ + KS  LSD++I ++L+  Y   G+V  A R+FD M D +VV
Subjt:  FNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFDSMPDPDVV

Query:  SWTSIISGLSKLGFEEEALGKF---LSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRS-LSEGSVSLDNALLDFYVRCGSLRSAENLFDEMP---
        SW S+I+   + G   EAL  F   L   V P+  TL S +SAC+SL  +K+G+ +HG  +++      + L NA +D Y +C  ++ A  +FD MP   
Subjt:  SWTSIISGLSKLGFEEEALGKF---LSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRS-LSEGSVSLDNALLDFYVRCGSLRSAENLFDEMP---

Query:  ----------------------------KRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDV
                                    +R+VVSW  +I GY Q G  EEA+ +F  ++    V P   +  N+L AC+ ++ LHLG   H ++      
Subjt:  ----------------------------KRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDV

Query:  IVDGS-----VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKA
           G      VGN+LI+MYVKCG +E   ++F+ +  +D +SW+ +I G A NG   +A  LF  ML  G  PD IT +G+LSAC H G + +G   + +
Subjt:  IVDGS-----VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKA

Query:  MKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVR
        M   + VAP   HY CMVD+ GRAG  +EA++ I+EMP++ +  +WG+LL AC++H N    + V + LL  +    G + LLSN YA   +W D   VR
Subjt:  MKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVR

Query:  DTMRSRGLKKVAGCSWIEI
         +MR  G+ K  GCSWI+I
Subjt:  DTMRSRGLKKVAGCSWIEI

Q9SJZ3 Pentatricopeptide repeat-containing protein At2g22410, mitochondrial | 1.1e-93 | 34.61
Query:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRY---PSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASR
        I NP    +N     F +S  P+    L+ QMLR+    S  +H+T+    K C  L  +  G  I   ++K        + N+ +H +   GD+ +A +
Subjt:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRY---PSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASR

Query:  IFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAE
        +FD  P  D+VSW  +I+G  K+G  E+A+  +  M    V+P+  T++  +S+CS LG L  GK  +     +    ++ L NAL+D + +CG +  A 
Subjt:  IFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAE

Query:  NLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNM------------------------------VHVGEVKPNEATLINVLSACSSMSALHLGQWVH
         +FD + KR +VSWTTMI GYA+ GL + + ++F +M                              +     KP+E T+I+ LSACS + AL +G W+H
Subjt:  NLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNM------------------------------VHVGEVKPNEATLINVLSACSSMSALHLGQWVH

Query:  SYINSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM
         YI  +  + ++ ++G +L++MY KCG++  A+ +F  ++ ++ ++++ II GLA++G +  A   F+ M+  GI+PD+ITF+GLLSAC HGG+I  G  
Subjt:  SYINSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM

Query:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA
         +  MK  +N+ PQ++HY+ MVD+ GRAGL +EA+  ++ MP+EA+  VWGALL  C++HGN +  E+  + LL       G + LL   Y   + W DA
Subjt:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKVAGCSWIEI
           R  M  RG++K+ GCS IE+
Subjt:  NEVRDTMRSRGLKKVAGCSWIEI

Q9SZK1 Pentatricopeptide repeat-containing protein At4g38010 | 3.3e-95 | 38.33
Query:  FNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFDSMPDPDVV
        +N+L  ++     P   +F +   +    S + +TF    KAC        G QIH  + K G   DI++QNSL+HFY V G+  +A ++F  MP  DVV
Subjt:  FNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFDSMPDPDVV

Query:  SWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSW
        SWT II+G ++ G  +EAL  F  M+V PN AT V  L +   +GCL LGK IHGL L+  S  S+   NAL+D YV+C  L  A  +F E+ K+D VSW
Subjt:  SWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSW

Query:  TTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDI
         +MI G       +EA+ +F  M     +KP+   L +VLSAC+S+ A+  G+WVH YI +   +  D  +G A+++MY KCG +E A+ IF  +  K++
Subjt:  TTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDI

Query:  ISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKD-VYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPV
         +W+ ++ GLA++G   ++   F  M+  G  P+ +TFL  L+AC H GL+++G   +  MK   YN+ P++ HY CM+D+  RAGL DEA   +K MPV
Subjt:  ISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKD-VYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPV

Query:  EAEGPVWGALLHACQIHGNEKKY-EEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKVAGCSWIE
        + +  + GA+L AC+  G   +  +E+    L  +    G + LLSN +A+  RW+D   +R  M+ +G+ KV G S+IE
Subjt:  EAEGPVWGALLHACQIHGNEKKY-EEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKVAGCSWIE

Arabidopsis top hits | e value | % identity | Alignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 4.8e-105 | 37.86
Query:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGD---------
        I  P   I+N++F     S  P + L L+  M+      N YTF + LK+C    +   G QIH  ++K G   D+++  SL+  Y+ +G          
Subjt:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGD---------

Query:  ----------------------VPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRL
                              + +A ++FD +P  DVVSW ++ISG ++ G  +EAL  F  M   NVRP+ +T+V+ +SAC+  G ++LG+ +H L +
Subjt:  ----------------------VPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRL

Query:  RSLSEGS-VSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHS
             GS + + NAL+D Y +CG L +A  LF+ +P +DV+SW T+IGGY    L +EA+ +FQ M+  GE  PN+ T++++L AC+ + A+ +G+W+H 
Subjt:  RSLSEGS-VSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHS

Query:  YINSRPDVIVD-GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM
        YI+ R   + +  S+  +LI+MY KCG +E A  +F ++ HK + SW+ +I G AM+G +  +F LFS M   GI PDDITF+GLLSACSH G+++ G  
Subjt:  YINSRPDVIVD-GSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM

Query:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA
        +++ M   Y + P++ HY CM+D+ G +GLF EAE  I  M +E +G +W +LL AC++HGN +  E   + L+  +    G++ LLSN YAS  RWN+ 
Subjt:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKVAGCSWIEI
         + R  +  +G+KKV GCS IEI
Subjt:  NEVRDTMRSRGLKKVAGCSWIEI

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.0e-94 | 36.03
Query:  FNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFDSMPDPDVV
        +NS+   F      E  L  F  M +     N Y+F   L AC  L+  + G+Q+H+ + KS  LSD++I ++L+  Y   G+V  A R+FD M D +VV
Subjt:  FNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFDSMPDPDVV

Query:  SWTSIISGLSKLGFEEEALGKF---LSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRS-LSEGSVSLDNALLDFYVRCGSLRSAENLFDEMP---
        SW S+I+   + G   EAL  F   L   V P+  TL S +SAC+SL  +K+G+ +HG  +++      + L NA +D Y +C  ++ A  +FD MP   
Subjt:  SWTSIISGLSKLGFEEEALGKF---LSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRS-LSEGSVSLDNALLDFYVRCGSLRSAENLFDEMP---

Query:  ----------------------------KRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDV
                                    +R+VVSW  +I GY Q G  EEA+ +F  ++    V P   +  N+L AC+ ++ LHLG   H ++      
Subjt:  ----------------------------KRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDV

Query:  IVDGS-----VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKA
           G      VGN+LI+MYVKCG +E   ++F+ +  +D +SW+ +I G A NG   +A  LF  ML  G  PD IT +G+LSAC H G + +G   + +
Subjt:  IVDGS-----VGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKA

Query:  MKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVR
        M   + VAP   HY CMVD+ GRAG  +EA++ I+EMP++ +  +WG+LL AC++H N    + V + LL  +    G + LLSN YA   +W D   VR
Subjt:  MKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVR

Query:  DTMRSRGLKKVAGCSWIEI
         +MR  G+ K  GCSWI+I
Subjt:  DTMRSRGLKKVAGCSWIEI

AT2G22410.1 SLOW GROWTH 1 | 7.6e-95 | 34.61
Query:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRY---PSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASR
        I NP    +N     F +S  P+    L+ QMLR+    S  +H+T+    K C  L  +  G  I   ++K        + N+ +H +   GD+ +A +
Subjt:  ITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRY---PSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASR

Query:  IFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAE
        +FD  P  D+VSW  +I+G  K+G  E+A+  +  M    V+P+  T++  +S+CS LG L  GK  +     +    ++ L NAL+D + +CG +  A 
Subjt:  IFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAE

Query:  NLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNM------------------------------VHVGEVKPNEATLINVLSACSSMSALHLGQWVH
         +FD + KR +VSWTTMI GYA+ GL + + ++F +M                              +     KP+E T+I+ LSACS + AL +G W+H
Subjt:  NLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNM------------------------------VHVGEVKPNEATLINVLSACSSMSALHLGQWVH

Query:  SYINSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM
         YI  +  + ++ ++G +L++MY KCG++  A+ +F  ++ ++ ++++ II GLA++G +  A   F+ M+  GI+PD+ITF+GLLSAC HGG+I  G  
Subjt:  SYINSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM

Query:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA
         +  MK  +N+ PQ++HY+ MVD+ GRAGL +EA+  ++ MP+EA+  VWGALL  C++HGN +  E+  + LL       G + LL   Y   + W DA
Subjt:  VYKAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKVAGCSWIEI
           R  M  RG++K+ GCS IE+
Subjt:  NEVRDTMRSRGLKKVAGCSWIEI

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 8.5e-94 | %identity: 36.08
Query:  PKPHIF--NSLFGAFVDSGAPENGLFLFNQMLRYPSSH-NHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFD
        PKP+ F  N+L  A+     P   ++ F  M+     + N YTF + +KA   + S   G  +H   VKS   SD+F+ NSL+H Y   GD+ SA ++F 
Subjt:  PKPHIF--NSLFGAFVDSGAPENGLFLFNQMLRYPSSH-NHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFD

Query:  SMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLF
        ++ + DVVSW S+I+G  + G  ++AL  F  M   +V+ +  T+V  LSAC+ +  L+ G+ +      +    +++L NA+LD Y +CGS+  A+ LF
Subjt:  SMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLF

Query:  DEMPKRDVVSWTTMIGGYA-------------------------------QGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSY
        D M ++D V+WTTM+ GYA                               Q G   EA+ +F  +     +K N+ TL++ LSAC+ + AL LG+W+HSY
Subjt:  DEMPKRDVVSWTTMIGGYA-------------------------------QGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSY

Query:  INSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVY
        I  +  + ++  V +ALI+MY KCG +E +  +F ++E +D+  WS +I GLAM+G   +A  +F  M    + P+ +TF  +  ACSH GL+++   ++
Subjt:  INSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVY

Query:  KAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANE
          M+  Y + P+ +HYAC+VD+ GR+G  ++A  FI+ MP+     VWGALL AC+IH N    E     LL  +    G   LLSN YA   +W + +E
Subjt:  KAMKDVYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANE

Query:  VRDTMRSRGLKKVAGCSWIEI
        +R  MR  GLKK  GCS IEI
Subjt:  VRDTMRSRGLKKVAGCSWIEI

AT4G38010.1 Pentatricopeptide repeat (PPR-like) superfamily protein | e value: 2.4e-96 | %identity: 38.33
Query:  FNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFDSMPDPDVV
        +N+L  ++     P   +F +   +    S + +TF    KAC        G QIH  + K G   DI++QNSL+HFY V G+  +A ++F  MP  DVV
Subjt:  FNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHNGLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFDSMPDPDVV

Query:  SWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSW
        SWT II+G ++ G  +EAL  F  M+V PN AT V  L +   +GCL LGK IHGL L+  S  S+   NAL+D YV+C  L  A  +F E+ K+D VSW
Subjt:  SWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSLSEGSVSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSW

Query:  TTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDI
         +MI G       +EA+ +F  M     +KP+   L +VLSAC+S+ A+  G+WVH YI +   +  D  +G A+++MY KCG +E A+ IF  +  K++
Subjt:  TTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGSVGNALINMYVKCGSMEMAIMIFKTLEHKDI

Query:  ISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKD-VYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPV
         +W+ ++ GLA++G   ++   F  M+  G  P+ +TFL  L+AC H GL+++G   +  MK   YN+ P++ HY CM+D+  RAGL DEA   +K MPV
Subjt:  ISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKD-VYNVAPQMRHYACMVDMYGRAGLFDEAEAFIKEMPV

Query:  EAEGPVWGALLHACQIHGNEKKY-EEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKVAGCSWIE
        + +  + GA+L AC+  G   +  +E+    L  +    G + LLSN +A+  RW+D   +R  M+ +G+ KV G S+IE
Subjt:  EAEGPVWGALLHACQIHGNEKKY-EEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKVAGCSWIE


Sequences
CDS sequence
ATGAGGTTATTTTTGACAATTTCCCGAGAAGGATCAACGTCCAATTCGGACAAAACTTCCAAGCTCTCCACGCATGGTGGCAATAGCATGCCATTAAAACCTCCAAAACC
CTCAATTTCAATCGCTCAGCTAACCCAAATCCTCGCCCAACTCATCACAAATCCAAAACCCCACATTTTCAACTCTTTGTTCGGTGCGTTCGTCGATTCCGGCGCCCCTG
AAAATGGCCTCTTCCTCTTCAACCAAATGCTTCGCTACCCATCTTCCCACAACCATTACACTTTCACTTACGCCCTCAAAGCCTGTTGCCTTCTCCATTCAACCCACAAC
GGCCTCCAAATCCATGCCCGTCTCGTCAAATCTGGGCACCTTTCTGACATCTTCATCCAAAATTCCTTGCTCCATTTCTACATTGTCGATGGCGACGTTCCTTCTGCTTC
TCGAATCTTTGATTCCATGCCTGACCCAGATGTGGTTTCGTGGACTTCGATCATTTCGGGGCTTTCCAAGCTGGGTTTTGAAGAGGAGGCTCTGGGTAAGTTCTTGTCTA
TGAATGTGAGGCCTAATTCGGCTACTCTTGTTAGTGCTTTATCTGCTTGTTCTAGTCTTGGATGTCTTAAGCTTGGGAAAGCTATACATGGGCTGAGATTGCGGAGTTTG
AGTGAGGGAAGTGTTAGTTTGGACAATGCCCTTTTGGATTTTTATGTTAGATGTGGGTCTTTGAGGAGTGCAGAGAACCTGTTTGATGAAATGCCTAAGAGAGATGTAGT
GTCTTGGACTACAATGATTGGGGGTTATGCACAGGGTGGATTGTGTGAAGAGGCTGTGAGGATATTTCAAAACATGGTTCATGTGGGAGAGGTTAAGCCCAACGAGGCCA
CTCTGATTAATGTATTATCTGCATGTTCTTCCATGTCTGCTCTGCATTTGGGTCAATGGGTGCATTCCTATATCAACTCTAGGCCTGATGTGATAGTTGATGGAAGCGTT
GGAAACGCTTTGATTAATATGTATGTCAAATGTGGTAGCATGGAAATGGCAATTATGATCTTTAAAACCCTTGAGCACAAGGATATCATATCATGGAGCACAATCATAAG
TGGGTTAGCCATGAATGGCCTAAGCAGGCAAGCTTTTGTTCTCTTCTCCCTCATGCTGGTTCATGGCATTTCTCCAGATGACATAACATTTCTTGGCCTGTTATCTGCAT
GCAGCCATGGTGGGTTGATCAATCAAGGCTTGATGGTTTATAAAGCCATGAAAGATGTTTATAATGTTGCACCTCAGATGAGGCATTATGCTTGCATGGTGGACATGTAT
GGAAGGGCAGGGCTTTTTGATGAAGCAGAGGCATTCATAAAGGAGATGCCTGTGGAAGCAGAAGGCCCAGTTTGGGGAGCTCTGTTGCATGCCTGTCAAATTCATGGGAA
TGAGAAGAAGTATGAGGAAGTTAGGCAATGGCTGCTTAGCAGCAAAGGGGTTACAGTAGGAACTTTTGCTTTGTTATCAAATACTTATGCTAGTTGTGATAGATGGAATG
ATGCTAATGAAGTTCGAGATACCATGAGAAGTAGAGGGTTGAAGAAAGTGGCTGGATGTAGTTGGATTGAAATTGGTTGA
mRNA sequence
ATGAGGTTATTTTTGACAATTTCCCGAGAAGGATCAACGTCCAATTCGGACAAAACTTCCAAGCTCTCCACGCATGGTGGCAATAGCATGCCATTAAAACCTCCAAAACC
CTCAATTTCAATCGCTCAGCTAACCCAAATCCTCGCCCAACTCATCACAAATCCAAAACCCCACATTTTCAACTCTTTGTTCGGTGCGTTCGTCGATTCCGGCGCCCCTG
AAAATGGCCTCTTCCTCTTCAACCAAATGCTTCGCTACCCATCTTCCCACAACCATTACACTTTCACTTACGCCCTCAAAGCCTGTTGCCTTCTCCATTCAACCCACAAC
GGCCTCCAAATCCATGCCCGTCTCGTCAAATCTGGGCACCTTTCTGACATCTTCATCCAAAATTCCTTGCTCCATTTCTACATTGTCGATGGCGACGTTCCTTCTGCTTC
TCGAATCTTTGATTCCATGCCTGACCCAGATGTGGTTTCGTGGACTTCGATCATTTCGGGGCTTTCCAAGCTGGGTTTTGAAGAGGAGGCTCTGGGTAAGTTCTTGTCTA
TGAATGTGAGGCCTAATTCGGCTACTCTTGTTAGTGCTTTATCTGCTTGTTCTAGTCTTGGATGTCTTAAGCTTGGGAAAGCTATACATGGGCTGAGATTGCGGAGTTTG
AGTGAGGGAAGTGTTAGTTTGGACAATGCCCTTTTGGATTTTTATGTTAGATGTGGGTCTTTGAGGAGTGCAGAGAACCTGTTTGATGAAATGCCTAAGAGAGATGTAGT
GTCTTGGACTACAATGATTGGGGGTTATGCACAGGGTGGATTGTGTGAAGAGGCTGTGAGGATATTTCAAAACATGGTTCATGTGGGAGAGGTTAAGCCCAACGAGGCCA
CTCTGATTAATGTATTATCTGCATGTTCTTCCATGTCTGCTCTGCATTTGGGTCAATGGGTGCATTCCTATATCAACTCTAGGCCTGATGTGATAGTTGATGGAAGCGTT
GGAAACGCTTTGATTAATATGTATGTCAAATGTGGTAGCATGGAAATGGCAATTATGATCTTTAAAACCCTTGAGCACAAGGATATCATATCATGGAGCACAATCATAAG
TGGGTTAGCCATGAATGGCCTAAGCAGGCAAGCTTTTGTTCTCTTCTCCCTCATGCTGGTTCATGGCATTTCTCCAGATGACATAACATTTCTTGGCCTGTTATCTGCAT
GCAGCCATGGTGGGTTGATCAATCAAGGCTTGATGGTTTATAAAGCCATGAAAGATGTTTATAATGTTGCACCTCAGATGAGGCATTATGCTTGCATGGTGGACATGTAT
GGAAGGGCAGGGCTTTTTGATGAAGCAGAGGCATTCATAAAGGAGATGCCTGTGGAAGCAGAAGGCCCAGTTTGGGGAGCTCTGTTGCATGCCTGTCAAATTCATGGGAA
TGAGAAGAAGTATGAGGAAGTTAGGCAATGGCTGCTTAGCAGCAAAGGGGTTACAGTAGGAACTTTTGCTTTGTTATCAAATACTTATGCTAGTTGTGATAGATGGAATG
ATGCTAATGAAGTTCGAGATACCATGAGAAGTAGAGGGTTGAAGAAAGTGGCTGGATGTAGTTGGATTGAAATTGGTTGA
Protein sequence
MRLFLTISREGSTSNSDKTSKLSTHGGNSMPLKPPKPSISIAQLTQILAQLITNPKPHIFNSLFGAFVDSGAPENGLFLFNQMLRYPSSHNHYTFTYALKACCLLHSTHN
GLQIHARLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFDSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVSALSACSSLGCLKLGKAIHGLRLRSL
SEGSVSLDNALLDFYVRCGSLRSAENLFDEMPKRDVVSWTTMIGGYAQGGLCEEAVRIFQNMVHVGEVKPNEATLINVLSACSSMSALHLGQWVHSYINSRPDVIVDGSV
GNALINMYVKCGSMEMAIMIFKTLEHKDIISWSTIISGLAMNGLSRQAFVLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVAPQMRHYACMVDMY
GRAGLFDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEEVRQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKVAGCSWIEIG
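
The protein sequence above corresponds to a translation of the CDS sequence. As a minimal sketch of how that correspondence can be checked, the Python snippet below translates a nucleotide string codon by codon, assuming the standard nuclear genetic code. The function name `translate_cds` is hypothetical and is not part of this database; only the first and last codons of the CDS are used here for brevity.

```python
# Minimal sketch: translate a CDS with the standard genetic code (assumed).
# `translate_cds` is an illustrative name, not a CuGenDB function.
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
cds_start = "ATGAGGTTATTTTTGACAATTTCCCGAGAA"
# Last four codons of the CDS listed above, including the stop codon.
cds_end = "GAAATTGGTTGA"

print(translate_cds(cds_start))
print(translate_cds(cds_end))
```

Passing the full CDS string (with line breaks removed) to the same function gives a result that can be compared directly against the protein sequence listed above.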