; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010804 (gene) of Snake gourd v1 genome

Gene IDTan0010804
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG07:12299521..12301083
RNA-Seq ExpressionTan0010804
SyntenyTan0010804
Gene Ontology termsGO:1900865 - chloroplast RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK18848.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]7.1e-25784.01Show/hide
Query:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI
        MPLKP KPSISIAQ TQIHA L+TNP PHI NPLLG+LV+SI+PENGLFL+NQML + SSHNH++FTYALKACC LHQT  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN
        QNSLLHFY L+GD  SAS IF S+P+PDVVSWTSIISGLSKLGFE+EALGKFLSMNV PNS TLV+ALSACSSLRCLK GKA+HGLRLR+L EE V+L+N
Subjt:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN

Query:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV
        ALLDFYVRC  LRSAENLF+KM KRDVVSWTT+IGGYAQSGLCEEAVR+FQNMVH+   GEA PNEATL+NVLSACS +SALHLGQWVHSYINSR DV++
Subjt:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV

Query:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV
        DGNVGNALINMYVKCG+MEMAI+IF A+EHKDIISWST+ISGLAMNGLGKQAF LFSLMLVHGISPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN+
Subjt:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV

Query:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG
         PQ+RHYAC+VDMYG+AGLLDEAEAFIKEMP+EAEGPVWGALLHACQIHGNEKKYEKV + LL SKGVTVG FALLSNTYASCDRWNDAN+VR  MRSRG
Subjt:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG

Query:  LKKMAGCSWIELVDLSNPL
        LKKMAGCSWIELV+ SNP+
Subjt:  LKKMAGCSWIELVDLSNPL

XP_011660133.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Cucumis sativus]1.9e-25784.01Show/hide
Query:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI
        MPLKP KPSISIAQ TQIHA L+TNP PHI NPLLG+LV+SI PENGLFL+NQMLR+ SSHNH++FTYALKACC LHQT  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN
        QNSLLHFY L GD  SAS IF S+PDPDVVSWTSIISGLSKLGFE+EAL KFLSMNV PNS TLV+ALSACSSLRCLK GKA+HGLR+R+L EE V L+N
Subjt:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN

Query:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV
        ALLDFYVRC  LRSAENLF+KMPKRDVVSWTT+IGGYAQSGLCEEAVR+FQNMVH+   GEA PNEATL+NVLSACS +SALHLGQWVHSYINSR DV++
Subjt:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV

Query:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV
        DGNVGNALINMYVKCG+MEMAI+IFKA+EHKDI+SWSTIISGLAMNGLGKQAF LFSLMLVHG+SPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN+
Subjt:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV

Query:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG
         PQMRHYAC+VDMYG+AGLLDEAEAFIKEMP+EAEGPVWGALLHACQ+HGNEKKYEKV +WLL SKGVTVGTFALLSNTYA CDRWNDAN+VR  MRSRG
Subjt:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG

Query:  LKKMAGCSWIELVDLSNPL
        LKKMAG SWIE+VD + PL
Subjt:  LKKMAGCSWIELVDLSNPL

XP_022960642.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita moschata]3.2e-25784.04Show/hide
Query:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI
        MP +P K  IS AQL+QIHA L+TNP P + NPLLG LV S+APENGLFL+NQMLRH SSHNHY+FTYALKAC LLH+TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN
        QNSLLHFY + GD PSASR+F S+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLVSALSACSSLRC+K GKA+HGL+LRSL EE VNLDN
Subjt:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN

Query:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV
        ALLDFYVRCGSLR A+NLFD+MP+RDVVSWTT+IGGYA +GLCEEAVR+FQNMVH     EA PNEATLINVLSACS MSALHLGQWVHSYINSR DV++
Subjt:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV

Query:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV
        DGN+GNALINMYVKCGSM+ AI IFK VEHKDIISWSTIISGLAMNG GKQAFGLFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNV
Subjt:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV

Query:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG
         P+MRHYAC+VDMYG+AGLLDEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE +YEKV QWLLSSK +TVGT+ALLSNTYASCDRWNDANEVRD MRSRG
Subjt:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG

Query:  LKKMAGCSWIELVDLSNPLG
        LKKMAGCSWIEL D  NPLG
Subjt:  LKKMAGCSWIELVDLSNPLG

XP_023515586.1 pentatricopeptide repeat-containing protein At4g38010-like isoform X1 [Cucurbita pepo subsp. pepo]4.1e-25783.85Show/hide
Query:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI
        MPL+P K SIS AQL+QIHA L+TNP PH+ NPLLG LV S+APENGLFL+NQMLRH SSHNHY+FTYALKAC LLH+TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN
        QNSLLHFY + GD PSASR+F S+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLVSALSACSSLRC+K GKA+HGL+LRSL  E V+LDN
Subjt:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN

Query:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV
        ALLDFYVRCGSLR A+NLFD+MP+RDVVSWTT+IGGYA +GLCEEAVR+FQNMVH     EA PNEATLINVLSACS MSALHLG+WVHSYINSR DV++
Subjt:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV

Query:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV
        DGN+GNALINMYVKCGSM+ AI IFK VEHKD+ISWSTIISGLAMNG GKQAFGLFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNV
Subjt:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV

Query:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG
         P+MRHYAC+VDMYG+AGLLDEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE  YEKV QWLLSSK +TVGT+ALLSNTYASCDRWNDANEVRD MRSRG
Subjt:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG

Query:  LKKMAGCSWIELVDLSNPLG
        LKKMAGCSWIEL D  NPLG
Subjt:  LKKMAGCSWIELVDLSNPLG

XP_038878297.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Benincasa hispida]5.6e-26285.19Show/hide
Query:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI
        M LKP KPSI IAQL QIH +L+ NP PHILNPLLG+LV+S++PENGLFL+NQMLR+ SSHNH++FTYALKACC LH+T  GL+IHA L+KSGHLSDIF+
Subjt:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN
        QNSLLHFY L GD PSASRIF S+PDPDV+SWTSIISGLSKLGFE+EALGKFLSMNV PNS TLV+ALSACSSLRCLK GKA+HGLRLRSL EE V+LDN
Subjt:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN

Query:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV
        ALLDFYVRCG LRSAE LFD+MPKRDVVSWTT+IGGYAQ GLCEEAVR+FQNMVH+   GEA PNEATLINVLSACS +SALHLGQWVHSYINSR DV++
Subjt:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV

Query:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV
        DGNVGNALINMYVKCG+MEMAI+IFKA+EHKDIISWSTIISGLAMNGLG QAFGLFSLMLVHGISPDDITFL LLSACSHGGLINQGLMV++AMKDVYN+
Subjt:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV

Query:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG
         PQMRHYAC+VD+YG+AGLLDEAEAFIKEMP+EAEG VWGALLHACQIHGNEKKYEKV +WLL SKGVTVGTFALLSNTYASCDRWNDANEVRDTMRS+G
Subjt:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG

Query:  LKKMAGCSWIELVDLSNPLG
        LKKMAGCSWIELVD SN LG
Subjt:  LKKMAGCSWIELVDLSNPLG

TrEMBL top hitse value%identityAlignment
A0A0A0LXJ1 Uncharacterized protein9.0e-25884.01Show/hide
Query:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI
        MPLKP KPSISIAQ TQIHA L+TNP PHI NPLLG+LV+SI PENGLFL+NQMLR+ SSHNH++FTYALKACC LHQT  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN
        QNSLLHFY L GD  SAS IF S+PDPDVVSWTSIISGLSKLGFE+EAL KFLSMNV PNS TLV+ALSACSSLRCLK GKA+HGLR+R+L EE V L+N
Subjt:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN

Query:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV
        ALLDFYVRC  LRSAENLF+KMPKRDVVSWTT+IGGYAQSGLCEEAVR+FQNMVH+   GEA PNEATL+NVLSACS +SALHLGQWVHSYINSR DV++
Subjt:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV

Query:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV
        DGNVGNALINMYVKCG+MEMAI+IFKA+EHKDI+SWSTIISGLAMNGLGKQAF LFSLMLVHG+SPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN+
Subjt:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV

Query:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG
         PQMRHYAC+VDMYG+AGLLDEAEAFIKEMP+EAEGPVWGALLHACQ+HGNEKKYEKV +WLL SKGVTVGTFALLSNTYA CDRWNDAN+VR  MRSRG
Subjt:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG

Query:  LKKMAGCSWIELVDLSNPL
        LKKMAG SWIE+VD + PL
Subjt:  LKKMAGCSWIELVDLSNPL

A0A1S4DY27 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X19.9e-25783.85Show/hide
Query:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI
        MPLKP KPSIS AQ TQIHA L+TNP PHI NPLLG+LV+SI+PENGLFL+NQML + SSHNH++FTYALKACC LHQT  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN
        QNSLLHFY LHGD  SAS IF S+P+PDVVSWTSIISG SKLGFE+EALGKFLSMNV PNS TLV+ALSACSSLR LK GKA+HGLRLR+L EE V+L+N
Subjt:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN

Query:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV
        ALLDFYVRC  LRSAENLF+KM KRDVVSWTT+IGGYAQSGLCEEAVR+FQNMVH    GEA PNEATL+NVLSACS +SALHLGQWVHSYINSR DV++
Subjt:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV

Query:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV
        DGNVGNALINMYVKCG+MEMAI+IFKA+EHKDIISWST+ISGLAMNGLGKQAF LFSLMLVHGISPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN+
Subjt:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV

Query:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG
         PQ+RHYAC+VDMYG+AGLLDEAEAFIKEMP+EAEGPVWGALLHACQIHGNEKKYEKV + LL SKGVTVG FALLSNTYASCDRWNDAN+VR  MRSRG
Subjt:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG

Query:  LKKMAGCSWIELVDLSNPLG
        LKKMAGCSWIELV+ SNP+G
Subjt:  LKKMAGCSWIELVDLSNPLG

A0A5D3D5L6 Pentatricopeptide repeat-containing protein3.4e-25784.01Show/hide
Query:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI
        MPLKP KPSISIAQ TQIHA L+TNP PHI NPLLG+LV+SI+PENGLFL+NQML + SSHNH++FTYALKACC LHQT  GL+IHA L+KSGHLSDIFI
Subjt:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN
        QNSLLHFY L+GD  SAS IF S+P+PDVVSWTSIISGLSKLGFE+EALGKFLSMNV PNS TLV+ALSACSSLRCLK GKA+HGLRLR+L EE V+L+N
Subjt:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN

Query:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV
        ALLDFYVRC  LRSAENLF+KM KRDVVSWTT+IGGYAQSGLCEEAVR+FQNMVH+   GEA PNEATL+NVLSACS +SALHLGQWVHSYINSR DV++
Subjt:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV

Query:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV
        DGNVGNALINMYVKCG+MEMAI+IF A+EHKDIISWST+ISGLAMNGLGKQAF LFSLMLVHGISPDDITFLGLLSACSHGGLINQG+MV++AMKDVYN+
Subjt:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV

Query:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG
         PQ+RHYAC+VDMYG+AGLLDEAEAFIKEMP+EAEGPVWGALLHACQIHGNEKKYEKV + LL SKGVTVG FALLSNTYASCDRWNDAN+VR  MRSRG
Subjt:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG

Query:  LKKMAGCSWIELVDLSNPL
        LKKMAGCSWIELV+ SNP+
Subjt:  LKKMAGCSWIELVDLSNPL

A0A6J1H7Z9 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X11.5e-25784.04Show/hide
Query:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI
        MP +P K  IS AQL+QIHA L+TNP P + NPLLG LV S+APENGLFL+NQMLRH SSHNHY+FTYALKAC LLH+TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN
        QNSLLHFY + GD PSASR+F S+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLVSALSACSSLRC+K GKA+HGL+LRSL EE VNLDN
Subjt:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN

Query:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV
        ALLDFYVRCGSLR A+NLFD+MP+RDVVSWTT+IGGYA +GLCEEAVR+FQNMVH     EA PNEATLINVLSACS MSALHLGQWVHSYINSR DV++
Subjt:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV

Query:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV
        DGN+GNALINMYVKCGSM+ AI IFK VEHKDIISWSTIISGLAMNG GKQAFGLFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNV
Subjt:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV

Query:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG
         P+MRHYAC+VDMYG+AGLLDEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE +YEKV QWLLSSK +TVGT+ALLSNTYASCDRWNDANEVRD MRSRG
Subjt:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG

Query:  LKKMAGCSWIELVDLSNPLG
        LKKMAGCSWIEL D  NPLG
Subjt:  LKKMAGCSWIELVDLSNPLG

A0A6J1JC26 pentatricopeptide repeat-containing protein At4g38010-like isoform X12.1e-25483.08Show/hide
Query:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI
        MP KP K  IS AQL+QIHA L+TNP P++ NPLLG L+ S+APENGLFL+NQMLRH SSHNHY+FTYALKAC LLH+TH GL+IHARL+KSGHLSDIFI
Subjt:  MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFI

Query:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN
        QNSLLHFY + GD  SASR+F S+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLVSALSACS+LRCLK GKA+HGL+LRSL EE VNLDN
Subjt:  QNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDN

Query:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV
        ALLDFYVRCGSLR A+NLFD+MP+RDVVSWTT+IGGYA +GLCEEAVR+FQNMVH     EA PNEATLINVLSACS MSALH GQWVHSY+NSR D+++
Subjt:  ALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMV

Query:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV
        DGN+GNALINMYVKCGSME AI IFK VEHKDIISWSTIISGLAMNG GKQAFGLFSLMLVHGI+PD ITFL LLSACSHGGLINQGLMV++AMKDVYNV
Subjt:  DGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNV

Query:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG
         P+MR YAC+VDMYG+AGLLDEAEAFIKEMPVEAEGPVWGALLHACQ+HGNE +YEKV QWLLSSK +TVGT+ALLSNTYASC RWNDANEVRD MRSRG
Subjt:  VPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRG

Query:  LKKMAGCSWIELVDLSNPLG
        LKKMAGCSWIEL D  NPLG
Subjt:  LKKMAGCSWIELVDLSNPLG

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic9.4e-9537.16Show/hide
Query:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSH-NHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIF
        I  PN    N L+        P   ++ F  M+  S  + N Y+F + +KA   +    +G  +H   VKS   SD+F+ NSL+H YF  GD  SA ++F
Subjt:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSH-NHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIF

Query:  HSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDNALLDFYVRCGSLRSAENL
         ++ + DVVSW S+I+G  + G  ++AL  F  M   +V  +  T+V  LSAC+ +R L+FG+ V      +     + L NA+LD Y +CGS+  A+ L
Subjt:  HSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDNALLDFYVRCGSLRSAENL

Query:  FDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHV----------RGEAKPNEA------------------TLINVLSACSFMSALHLGQWVHS
        FD M ++D V+WTT++ GYA S   E A  +  +M    +              KPNEA                  TL++ LSAC+ + AL LG+W+HS
Subjt:  FDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHV----------RGEAKPNEA------------------TLINVLSACSFMSALHLGQWVHS

Query:  YINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMV
        YI  +  + ++ +V +ALI+MY KCG +E +  +F +VE +D+  WS +I GLAM+G G +A  +F  M    + P+ +TF  +  ACSH GL+++   +
Subjt:  YINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMV

Query:  YKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDAN
        +  M+  Y +VP+ +HYACIVD+ GR+G L++A  FI+ MP+     VWGALL AC+IH N    E     LL  +    G   LLSN YA   +W + +
Subjt:  YKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDAN

Query:  EVRDTMRSRGLKKMAGCSWIEL
        E+R  MR  GLKK  GCS IE+
Subjt:  EVRDTMRSRGLKKMAGCSWIEL

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic7.2e-9536Show/hide
Query:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHG---DA-----
        I  PN  I N +      S  P + L L+  M+      N Y+F + LK+C        G QIH  ++K G   D+++  SL+  Y  +G   DA     
Subjt:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHG---DA-----

Query:  -----------------------PSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRL
                                +A ++F  +P  DVVSW ++ISG ++ G  +EAL  F  M   NV P+ +T+V+ +SAC+    ++ G+ VH    
Subjt:  -----------------------PSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRL

Query:  RSLREECVNLDNALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWV
               + + NAL+D Y +CG L +A  LF+++P +DV+SW T+IGGY    L +EA+ LFQ M    +R    PN+ T++++L AC+ + A+ +G+W+
Subjt:  RSLREECVNLDNALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWV

Query:  HSYINSR-PDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQG
        H YI+ R   V    ++  +LI+MY KCG +E A  +F ++ HK + SW+ +I G AM+G    +F LFS M   GI PDDITF+GLLSACSH G+++ G
Subjt:  HSYINSR-PDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQG

Query:  LMVYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWN
          +++ M   Y + P++ HY C++D+ G +GL  EAE  I  M +E +G +W +LL AC++HGN +  E   + L+  +    G++ LLSN YAS  RWN
Subjt:  LMVYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWN

Query:  DANEVRDTMRSRGLKKMAGCSWIEL
        +  + R  +  +G+KK+ GCS IE+
Subjt:  DANEVRDTMRSRGLKKMAGCSWIEL

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136009.7e-9236.74Show/hide
Query:  ENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGF
        E  L  F  M +     N YSF   L AC  L+  + G+Q+H+ + KS  LSD++I ++L+  Y   G+   A R+F  M D +VVSW S+I+   + G 
Subjt:  ENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGF

Query:  EEEALGKF---LSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRS--LREECVNLDNALLDFYVRCGSLRSAENLFDKMP----------------
          EAL  F   L   V P+  TL S +SAC+SL  +K G+ VHG  +++  LR + + L NA +D Y +C  ++ A  +FD MP                
Subjt:  EEEALGKF---LSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRS--LREECVNLDNALLDFYVRCGSLRSAENLFDKMP----------------

Query:  ---------------KRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMVDGN-----
                       +R+VVSW  +I GY Q+G  EEA+ LF     L  R    P   +  N+L AC+ ++ LHLG   H ++         G      
Subjt:  ---------------KRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMVDGN-----

Query:  VGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVVPQ
        VGN+LI+MYVKCG +E   ++F+ +  +D +SW+ +I G A NG G +A  LF  ML  G  PD IT +G+LSAC H G + +G   + +M   + V P 
Subjt:  VGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVVPQ

Query:  MRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
          HY C+VD+ GRAG L+EA++ I+EMP++ +  +WG+LL AC++H N    + V + LL  +    G + LLSN YA   +W D   VR +MR  G+ K
Subjt:  MRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  MAGCSWIEL
          GCSWI++
Subjt:  MAGCSWIEL

Q9SJZ3 Pentatricopeptide repeat-containing protein At2g22410, mitochondrial2.2e-9134.99Show/hide
Query:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRH---SSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASR
        I NPN    N  +     S  P+    L+ QMLRH    S  +H+++    K C  L  + +G  I   ++K        + N+ +H +   GD  +A +
Subjt:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRH---SSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASR

Query:  IFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDNALLDFYVRCGSLRSAE
        +F   P  D+VSW  +I+G  K+G  E+A+  +  M    V P+  T++  +S+CS L  L  GK  +     +     + L NAL+D + +CG +  A 
Subjt:  IFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDNALLDFYVRCGSLRSAE

Query:  NLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHV--------------RGE-------------AKPNEATLINVLSACSFMSALHLGQWVH
         +FD + KR +VSWTT+I GYA+ GL + + +LF +M    V              RG+              KP+E T+I+ LSACS + AL +G W+H
Subjt:  NLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHV--------------RGE-------------AKPNEATLINVLSACSFMSALHLGQWVH

Query:  SYINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM
         YI  +  + ++  +G +L++MY KCG++  A+ +F  ++ ++ ++++ II GLA++G    A   F+ M+  GI+PD+ITF+GLLSAC HGG+I  G  
Subjt:  SYINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM

Query:  VYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDA
         +  MK  +N+ PQ++HY+ +VD+ GRAGLL+EA+  ++ MP+EA+  VWGALL  C++HGN +  EK  + LL       G + LL   Y   + W DA
Subjt:  VYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKMAGCSWIEL
           R  M  RG++K+ GCS IE+
Subjt:  NEVRDTMRSRGLKKMAGCSWIEL

Q9SX45 Pentatricopeptide repeat-containing protein At1g502705.7e-9237.34Show/hide
Query:  FNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALG
        +  M R+    + ++F   LKA   L  ++   Q HA +VK G  SD F++NSL+  Y   G    ASR+F    D DVV+WT++I G  + G   EA+ 
Subjt:  FNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALG

Query:  KFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREEC-VNLDNALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEA
         F+ M    V  N  T+VS L A   +  ++FG++VHGL L + R +C V + ++L+D Y +C     A+ +FD+MP R+VV+WT +I GY QS   ++ 
Subjt:  KFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREEC-VNLDNALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEA

Query:  VRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMN
        + +F+ M    ++ +  PNE TL +VLSAC+ + ALH G+ VH Y+  +  + ++   G  LI++YVKCG +E AI++F+ +  K++ +W+ +I+G A +
Subjt:  VRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMN

Query:  GLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHAC
        G  + AF LF  ML   +SP+++TF+ +LSAC+HGGL+ +G  ++ +MK  +N+ P+  HYAC+VD++GR GLL+EA+A I+ MP+E    VWGAL  +C
Subjt:  GLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHAC

Query:  QIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKMAGCSWIEL
         +H + +  +     ++  +    G + LL+N Y+    W++   VR  M+ + + K  G SWIE+
Subjt:  QIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKMAGCSWIEL

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.1e-9636Show/hide
Query:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHG---DA-----
        I  PN  I N +      S  P + L L+  M+      N Y+F + LK+C        G QIH  ++K G   D+++  SL+  Y  +G   DA     
Subjt:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHG---DA-----

Query:  -----------------------PSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRL
                                +A ++F  +P  DVVSW ++ISG ++ G  +EAL  F  M   NV P+ +T+V+ +SAC+    ++ G+ VH    
Subjt:  -----------------------PSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRL

Query:  RSLREECVNLDNALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWV
               + + NAL+D Y +CG L +A  LF+++P +DV+SW T+IGGY    L +EA+ LFQ M    +R    PN+ T++++L AC+ + A+ +G+W+
Subjt:  RSLREECVNLDNALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWV

Query:  HSYINSR-PDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQG
        H YI+ R   V    ++  +LI+MY KCG +E A  +F ++ HK + SW+ +I G AM+G    +F LFS M   GI PDDITF+GLLSACSH G+++ G
Subjt:  HSYINSR-PDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQG

Query:  LMVYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWN
          +++ M   Y + P++ HY C++D+ G +GL  EAE  I  M +E +G +W +LL AC++HGN +  E   + L+  +    G++ LLSN YAS  RWN
Subjt:  LMVYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWN

Query:  DANEVRDTMRSRGLKKMAGCSWIEL
        +  + R  +  +G+KK+ GCS IE+
Subjt:  DANEVRDTMRSRGLKKMAGCSWIEL

AT1G50270.1 Pentatricopeptide repeat (PPR) superfamily protein4.0e-9337.34Show/hide
Query:  FNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALG
        +  M R+    + ++F   LKA   L  ++   Q HA +VK G  SD F++NSL+  Y   G    ASR+F    D DVV+WT++I G  + G   EA+ 
Subjt:  FNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALG

Query:  KFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREEC-VNLDNALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEA
         F+ M    V  N  T+VS L A   +  ++FG++VHGL L + R +C V + ++L+D Y +C     A+ +FD+MP R+VV+WT +I GY QS   ++ 
Subjt:  KFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREEC-VNLDNALLDFYVRCGSLRSAENLFDKMPKRDVVSWTTIIGGYAQSGLCEEA

Query:  VRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMN
        + +F+ M    ++ +  PNE TL +VLSAC+ + ALH G+ VH Y+  +  + ++   G  LI++YVKCG +E AI++F+ +  K++ +W+ +I+G A +
Subjt:  VRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMN

Query:  GLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHAC
        G  + AF LF  ML   +SP+++TF+ +LSAC+HGGL+ +G  ++ +MK  +N+ P+  HYAC+VD++GR GLL+EA+A I+ MP+E    VWGAL  +C
Subjt:  GLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHAC

Query:  QIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKMAGCSWIEL
         +H + +  +     ++  +    G + LL+N Y+    W++   VR  M+ + + K  G SWIE+
Subjt:  QIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKMAGCSWIEL

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein6.9e-9336.74Show/hide
Query:  ENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGF
        E  L  F  M +     N YSF   L AC  L+  + G+Q+H+ + KS  LSD++I ++L+  Y   G+   A R+F  M D +VVSW S+I+   + G 
Subjt:  ENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGF

Query:  EEEALGKF---LSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRS--LREECVNLDNALLDFYVRCGSLRSAENLFDKMP----------------
          EAL  F   L   V P+  TL S +SAC+SL  +K G+ VHG  +++  LR + + L NA +D Y +C  ++ A  +FD MP                
Subjt:  EEEALGKF---LSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRS--LREECVNLDNALLDFYVRCGSLRSAENLFDKMP----------------

Query:  ---------------KRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMVDGN-----
                       +R+VVSW  +I GY Q+G  EEA+ LF     L  R    P   +  N+L AC+ ++ LHLG   H ++         G      
Subjt:  ---------------KRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMVDGN-----

Query:  VGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVVPQ
        VGN+LI+MYVKCG +E   ++F+ +  +D +SW+ +I G A NG G +A  LF  ML  G  PD IT +G+LSAC H G + +G   + +M   + V P 
Subjt:  VGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVVPQ

Query:  MRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK
          HY C+VD+ GRAG L+EA++ I+EMP++ +  +WG+LL AC++H N    + V + LL  +    G + LLSN YA   +W D   VR +MR  G+ K
Subjt:  MRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKK

Query:  MAGCSWIEL
          GCSWI++
Subjt:  MAGCSWIEL

AT2G22410.1 SLOW GROWTH 11.5e-9234.99Show/hide
Query:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRH---SSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASR
        I NPN    N  +     S  P+    L+ QMLRH    S  +H+++    K C  L  + +G  I   ++K        + N+ +H +   GD  +A +
Subjt:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRH---SSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASR

Query:  IFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDNALLDFYVRCGSLRSAE
        +F   P  D+VSW  +I+G  K+G  E+A+  +  M    V P+  T++  +S+CS L  L  GK  +     +     + L NAL+D + +CG +  A 
Subjt:  IFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDNALLDFYVRCGSLRSAE

Query:  NLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHV--------------RGE-------------AKPNEATLINVLSACSFMSALHLGQWVH
         +FD + KR +VSWTT+I GYA+ GL + + +LF +M    V              RG+              KP+E T+I+ LSACS + AL +G W+H
Subjt:  NLFDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHV--------------RGE-------------AKPNEATLINVLSACSFMSALHLGQWVH

Query:  SYINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM
         YI  +  + ++  +G +L++MY KCG++  A+ +F  ++ ++ ++++ II GLA++G    A   F+ M+  GI+PD+ITF+GLLSAC HGG+I  G  
Subjt:  SYINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM

Query:  VYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDA
         +  MK  +N+ PQ++HY+ +VD+ GRAGLL+EA+  ++ MP+EA+  VWGALL  C++HGN +  EK  + LL       G + LL   Y   + W DA
Subjt:  VYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDA

Query:  NEVRDTMRSRGLKKMAGCSWIEL
           R  M  RG++K+ GCS IE+
Subjt:  NEVRDTMRSRGLKKMAGCSWIEL

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.7e-9637.16Show/hide
Query:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSH-NHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIF
        I  PN    N L+        P   ++ F  M+  S  + N Y+F + +KA   +    +G  +H   VKS   SD+F+ NSL+H YF  GD  SA ++F
Subjt:  ITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSH-NHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFLHGDAPSASRIF

Query:  HSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDNALLDFYVRCGSLRSAENL
         ++ + DVVSW S+I+G  + G  ++AL  F  M   +V  +  T+V  LSAC+ +R L+FG+ V      +     + L NA+LD Y +CGS+  A+ L
Subjt:  HSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDNALLDFYVRCGSLRSAENL

Query:  FDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHV----------RGEAKPNEA------------------TLINVLSACSFMSALHLGQWVHS
        FD M ++D V+WTT++ GYA S   E A  +  +M    +              KPNEA                  TL++ LSAC+ + AL LG+W+HS
Subjt:  FDKMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHV----------RGEAKPNEA------------------TLINVLSACSFMSALHLGQWVHS

Query:  YINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMV
        YI  +  + ++ +V +ALI+MY KCG +E +  +F +VE +D+  WS +I GLAM+G G +A  +F  M    + P+ +TF  +  ACSH GL+++   +
Subjt:  YINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEHKDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMV

Query:  YKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDAN
        +  M+  Y +VP+ +HYACIVD+ GR+G L++A  FI+ MP+     VWGALL AC+IH N    E     LL  +    G   LLSN YA   +W + +
Subjt:  YKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWGALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDAN

Query:  EVRDTMRSRGLKKMAGCSWIEL
        E+R  MR  GLKK  GCS IE+
Subjt:  EVRDTMRSRGLKKMAGCSWIEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTAAAACCTTCAAAACCCTCGATTTCAATCGCTCAGCTAACTCAAATCCACGCCCATCTCATCACAAATCCAAACCCCCACATTTTAAACCCTTTACTGGGCAC
TTTGGTTCATTCTATCGCCCCTGAAAATGGCCTCTTCCTCTTCAACCAAATGCTTCGTCACTCATCTTCCCACAACCACTACTCTTTCACCTACGCCCTCAAAGCCTGTT
GCCTCCTCCATCAAACCCACATCGGCCTCCAAATCCATGCCCGTCTTGTCAAATCTGGACACCTTTCTGACATCTTCATCCAAAATTCTTTGCTCCATTTCTACTTTCTC
CATGGCGACGCTCCTTCTGCTTCTCGAATCTTTCATTCCATGCCTGACCCAGATGTGGTTTCCTGGACTTCGATCATTTCCGGCCTTTCCAAGCTAGGTTTTGAAGAGGA
GGCTCTGGGTAAGTTCTTGTCCATGAATGTGTGGCCTAACTCTGCTACTCTTGTTAGTGCTTTATCTGCTTGTTCTAGCCTTAGATGTCTCAAGTTTGGGAAAGCTGTAC
ATGGGCTGAGATTGCGGAGTTTGCGGGAGGAATGTGTTAATTTGGACAATGCCCTTTTGGATTTTTATGTTAGATGTGGGTCTTTGAGAAGTGCAGAGAACCTGTTCGAT
AAAATGCCCAAGAGAGACGTTGTGTCTTGGACTACAATAATTGGGGGTTATGCACAGAGTGGATTGTGTGAAGAGGCTGTGAGGCTGTTTCAAAACATGGTTCATCTTCA
TGTGAGAGGAGAGGCCAAGCCCAATGAGGCCACTCTGATTAATGTATTATCTGCATGTTCTTTCATGTCTGCTCTGCATTTGGGTCAGTGGGTGCATTCCTATATCAACT
CTAGGCCTGATGTCATGGTTGATGGAAACGTCGGAAATGCTTTGATTAACATGTATGTGAAATGTGGTAGCATGGAAATGGCAATTATGATCTTCAAAGCTGTTGAACAC
AAGGATATCATATCATGGAGCACAATTATAAGTGGGTTAGCCATGAATGGCCTAGGCAAGCAAGCTTTTGGTCTCTTCTCACTCATGCTAGTTCATGGCATTTCTCCAGA
TGACATAACATTTCTTGGCCTGTTATCTGCATGCAGCCATGGTGGGCTGATAAATCAAGGTTTGATGGTTTATAAAGCTATGAAAGATGTTTATAATGTTGTACCTCAGA
TGAGGCATTATGCTTGCATCGTAGACATGTATGGAAGGGCTGGGCTTTTAGATGAAGCAGAGGCATTCATAAAGGAGATGCCTGTGGAAGCAGAAGGCCCAGTTTGGGGA
GCTCTGCTTCATGCTTGTCAAATTCATGGGAATGAAAAGAAGTATGAGAAAGTTACGCAATGGCTGCTTAGCAGCAAGGGGGTTACAGTGGGAACTTTTGCTTTGTTATC
AAATACTTATGCTAGTTGTGATAGATGGAATGATGCTAATGAAGTTCGAGATACCATGAGAAGTAGAGGGTTGAAGAAAATGGCTGGATGTAGTTGGATTGAATTGGTTG
ATCTCTCGAATCCATTGGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCATTAAAACCTTCAAAACCCTCGATTTCAATCGCTCAGCTAACTCAAATCCACGCCCATCTCATCACAAATCCAAACCCCCACATTTTAAACCCTTTACTGGGCAC
TTTGGTTCATTCTATCGCCCCTGAAAATGGCCTCTTCCTCTTCAACCAAATGCTTCGTCACTCATCTTCCCACAACCACTACTCTTTCACCTACGCCCTCAAAGCCTGTT
GCCTCCTCCATCAAACCCACATCGGCCTCCAAATCCATGCCCGTCTTGTCAAATCTGGACACCTTTCTGACATCTTCATCCAAAATTCTTTGCTCCATTTCTACTTTCTC
CATGGCGACGCTCCTTCTGCTTCTCGAATCTTTCATTCCATGCCTGACCCAGATGTGGTTTCCTGGACTTCGATCATTTCCGGCCTTTCCAAGCTAGGTTTTGAAGAGGA
GGCTCTGGGTAAGTTCTTGTCCATGAATGTGTGGCCTAACTCTGCTACTCTTGTTAGTGCTTTATCTGCTTGTTCTAGCCTTAGATGTCTCAAGTTTGGGAAAGCTGTAC
ATGGGCTGAGATTGCGGAGTTTGCGGGAGGAATGTGTTAATTTGGACAATGCCCTTTTGGATTTTTATGTTAGATGTGGGTCTTTGAGAAGTGCAGAGAACCTGTTCGAT
AAAATGCCCAAGAGAGACGTTGTGTCTTGGACTACAATAATTGGGGGTTATGCACAGAGTGGATTGTGTGAAGAGGCTGTGAGGCTGTTTCAAAACATGGTTCATCTTCA
TGTGAGAGGAGAGGCCAAGCCCAATGAGGCCACTCTGATTAATGTATTATCTGCATGTTCTTTCATGTCTGCTCTGCATTTGGGTCAGTGGGTGCATTCCTATATCAACT
CTAGGCCTGATGTCATGGTTGATGGAAACGTCGGAAATGCTTTGATTAACATGTATGTGAAATGTGGTAGCATGGAAATGGCAATTATGATCTTCAAAGCTGTTGAACAC
AAGGATATCATATCATGGAGCACAATTATAAGTGGGTTAGCCATGAATGGCCTAGGCAAGCAAGCTTTTGGTCTCTTCTCACTCATGCTAGTTCATGGCATTTCTCCAGA
TGACATAACATTTCTTGGCCTGTTATCTGCATGCAGCCATGGTGGGCTGATAAATCAAGGTTTGATGGTTTATAAAGCTATGAAAGATGTTTATAATGTTGTACCTCAGA
TGAGGCATTATGCTTGCATCGTAGACATGTATGGAAGGGCTGGGCTTTTAGATGAAGCAGAGGCATTCATAAAGGAGATGCCTGTGGAAGCAGAAGGCCCAGTTTGGGGA
GCTCTGCTTCATGCTTGTCAAATTCATGGGAATGAAAAGAAGTATGAGAAAGTTACGCAATGGCTGCTTAGCAGCAAGGGGGTTACAGTGGGAACTTTTGCTTTGTTATC
AAATACTTATGCTAGTTGTGATAGATGGAATGATGCTAATGAAGTTCGAGATACCATGAGAAGTAGAGGGTTGAAGAAAATGGCTGGATGTAGTTGGATTGAATTGGTTG
ATCTCTCGAATCCATTGGGTTAA
Protein sequenceShow/hide protein sequence
MPLKPSKPSISIAQLTQIHAHLITNPNPHILNPLLGTLVHSIAPENGLFLFNQMLRHSSSHNHYSFTYALKACCLLHQTHIGLQIHARLVKSGHLSDIFIQNSLLHFYFL
HGDAPSASRIFHSMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVWPNSATLVSALSACSSLRCLKFGKAVHGLRLRSLREECVNLDNALLDFYVRCGSLRSAENLFD
KMPKRDVVSWTTIIGGYAQSGLCEEAVRLFQNMVHLHVRGEAKPNEATLINVLSACSFMSALHLGQWVHSYINSRPDVMVDGNVGNALINMYVKCGSMEMAIMIFKAVEH
KDIISWSTIISGLAMNGLGKQAFGLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVYKAMKDVYNVVPQMRHYACIVDMYGRAGLLDEAEAFIKEMPVEAEGPVWG
ALLHACQIHGNEKKYEKVTQWLLSSKGVTVGTFALLSNTYASCDRWNDANEVRDTMRSRGLKKMAGCSWIELVDLSNPLG