; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg16943 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg16943
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCarg_Chr10:4842470..4844020
RNA-Seq ExpressionCarg16943
SyntenyCarg16943
Gene Ontology termsGO:1900865 - chloroplast RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6590206.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.6e-30199.81Show/hide
Query:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
        MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
Subjt:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
        QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
Subjt:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN

Query:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
        ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
Subjt:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN

Query:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
        IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
Subjt:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE

Query:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
        MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
Subjt:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM

Query:  AGCSWIELADPLNPLG
        AGCSW ELADPLNPLG
Subjt:  AGCSWIELADPLNPLG

XP_022960642.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita moschata]5.5e-302100Show/hide
Query:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
        MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
Subjt:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
        QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
Subjt:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN

Query:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
        ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
Subjt:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN

Query:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
        IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
Subjt:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE

Query:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
        MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
Subjt:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM

Query:  AGCSWIELADPLNPLG
        AGCSWIELADPLNPLG
Subjt:  AGCSWIELADPLNPLG

XP_022960643.1 pentatricopeptide repeat-containing protein At4g38010-like isoform X2 [Cucurbita moschata]7.0e-28194.57Show/hide
Query:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
        MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
Subjt:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
        QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
Subjt:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN

Query:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
        ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
Subjt:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN

Query:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
        IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
Subjt:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE

Query:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
        MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNE                            SCDRWNDANEVRDAMRSRGLKKM
Subjt:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM

Query:  AGCSWIELADPLNPLG
        AGCSWIELADPLNPLG
Subjt:  AGCSWIELADPLNPLG

XP_022988072.1 pentatricopeptide repeat-containing protein At4g38010-like isoform X1 [Cucurbita maxima]2.2e-29597.48Show/hide
Query:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
        MPF+PPKLWISSAQLSQIHAQLLTNPKP VFNPLLGAL+DSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
Subjt:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
        QNSLLHFYIVDGDV SASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACS+LRC+KIGKAIHGLKLRSLNEESVNLDN
Subjt:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN

Query:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
        ALLDFYVRCGSLRGAQNLFDEMP+RDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALH GQWVHSY+NSRHD+IIDGN
Subjt:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN

Query:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
        IGNALINMYVKCGSM+KAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
Subjt:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE

Query:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
        MR YACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASC RWNDANEVRDAMRSRGLKKM
Subjt:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM

Query:  AGCSWIELADPLNPLG
        AGCSWIELADPLNPLG
Subjt:  AGCSWIELADPLNPLG

XP_023515586.1 pentatricopeptide repeat-containing protein At4g38010-like isoform X1 [Cucurbita pepo subsp. pepo]2.0e-29698.45Show/hide
Query:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
        MP RPPKL ISSAQLSQIHAQLLTNPKP VFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
Subjt:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
        QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLN ESV+LDN
Subjt:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN

Query:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
        ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLG+WVHSYINSRHDVIIDGN
Subjt:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN

Query:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
        IGNALINMYVKCGSMDKAISIFKTVEHKD+ISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
Subjt:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE

Query:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
        MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNE YEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
Subjt:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM

Query:  AGCSWIELADPLNPLG
        AGCSWIELADPLNPLG
Subjt:  AGCSWIELADPLNPLG

TrEMBL top hitse value%identityAlignment
A0A0A0LXJ1 Uncharacterized protein2.7e-25482.75Show/hide
Query:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
        MP +P K  IS AQ +QIHA+LLTNPKP +FNPLLG+LV+S+ PENGLFLYNQMLR+PSSHNH+TFTYALKAC  LH+T KGLEIHA LIKSGHLSDIFI
Subjt:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
        QNSLLHFYI+DGDV SAS +FDSIPDPDVVSWTSIISGLSKLGF++EAL KFLSMNV PNS TLV+ALSACSSLRC+K+GKAIHGL++R+LNEE+V L+N
Subjt:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN

Query:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
        ALLDFYVRC  LR A+NLF++MP+RDVVSWTT+IGGYA +GLCEEAVRVFQNMVH  EAIPNEATL+NVLSACSS+SALHLGQWVHSYINSRHDVIIDGN
Subjt:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN

Query:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
        +GNALINMYVKCG+M+ AI IFK +EHKDI+SWSTIISGLAMNG GKQAF LFSLMLVHG++PD ITFL LLSACSHGGLINQG+MVFEAMKDVYN++P+
Subjt:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE

Query:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNE-RYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKK
        MRHYACMVDMYGKAGLLDEAEAFIKEMP+EAEGPVWGALLHACQLHGNE +YEKVR+WLL SK +TVGT+ALLSNTYA CDRWNDAN+VR AMRSRGLKK
Subjt:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNE-RYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKK

Query:  MAGCSWIELADPLNPL
        MAG SWIE+ D   PL
Subjt:  MAGCSWIELADPLNPL

A0A6J1H7Z9 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X12.7e-302100Show/hide
Query:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
        MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
Subjt:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
        QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
Subjt:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN

Query:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
        ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
Subjt:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN

Query:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
        IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
Subjt:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE

Query:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
        MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
Subjt:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM

Query:  AGCSWIELADPLNPLG
        AGCSWIELADPLNPLG
Subjt:  AGCSWIELADPLNPLG

A0A6J1H9K3 pentatricopeptide repeat-containing protein At4g38010-like isoform X23.4e-28194.57Show/hide
Query:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
        MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
Subjt:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
        QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
Subjt:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN

Query:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
        ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
Subjt:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN

Query:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
        IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
Subjt:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE

Query:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
        MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNE                            SCDRWNDANEVRDAMRSRGLKKM
Subjt:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM

Query:  AGCSWIELADPLNPLG
        AGCSWIELADPLNPLG
Subjt:  AGCSWIELADPLNPLG

A0A6J1JC26 pentatricopeptide repeat-containing protein At4g38010-like isoform X11.1e-29597.48Show/hide
Query:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
        MPF+PPKLWISSAQLSQIHAQLLTNPKP VFNPLLGAL+DSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
Subjt:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
        QNSLLHFYIVDGDV SASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACS+LRC+KIGKAIHGLKLRSLNEESVNLDN
Subjt:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN

Query:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
        ALLDFYVRCGSLRGAQNLFDEMP+RDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALH GQWVHSY+NSRHD+IIDGN
Subjt:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN

Query:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
        IGNALINMYVKCGSM+KAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
Subjt:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE

Query:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
        MR YACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASC RWNDANEVRDAMRSRGLKKM
Subjt:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM

Query:  AGCSWIELADPLNPLG
        AGCSWIELADPLNPLG
Subjt:  AGCSWIELADPLNPLG

A0A6J1JL75 pentatricopeptide repeat-containing protein At4g38010-like isoform X21.4e-27492.05Show/hide
Query:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
        MPF+PPKLWISSAQLSQIHAQLLTNPKP VFNPLLGAL+DSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI
Subjt:  MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFI

Query:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN
        QNSLLHFYIVDGDV SASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACS+LRC+KIGKAIHGLKLRSLNEESVNLDN
Subjt:  QNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDN

Query:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN
        ALLDFYVRCGSLRGAQNLFDEMP+RDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALH GQWVHSY+NSRHD+IIDGN
Subjt:  ALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGN

Query:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
        IGNALINMYVKCGSM+KAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE
Subjt:  IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPE

Query:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM
        MR YACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNE                            SC RWNDANEVRDAMRSRGLKKM
Subjt:  MRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKM

Query:  AGCSWIELADPLNPLG
        AGCSWIELADPLNPLG
Subjt:  AGCSWIELADPLNPLG

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic1.0e-9336.28Show/hide
Query:  PKPRVF--NPLLGALVDSVAPENGLFLYNQMLRHPSSH-NHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFD
        PKP  F  N L+ A      P   ++ +  M+     + N YTF + +KA   +     G  +H   +KS   SD+F+ NSL+H Y   GD+ SA +VF 
Subjt:  PKPRVF--NPLLGALVDSVAPENGLFLYNQMLRHPSSH-NHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFD

Query:  SIPDPDVVSWTSIISGLSKLGFKEEALGKFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDNALLDFYVRCGSLRGAQNLF
        +I + DVVSW S+I+G  + G  ++AL  F  M   +V  +  T+V  LSAC+ +R ++ G+ +      +    ++ L NA+LD Y +CGS+  A+ LF
Subjt:  SIPDPDVVSWTSIISGLSKLGFKEEALGKFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDNALLDFYVRCGSLRGAQNLF

Query:  DEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNM-------------VHAREAIPNEA------------------TLINVLSACSSMSALHLGQWVHSY
        D M ++D V+WTT++ GYA++   E A  V  +M              + +   PNEA                  TL++ LSAC+ + AL LG+W+HSY
Subjt:  DEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNM-------------VHAREAIPNEA------------------TLINVLSACSSMSALHLGQWVHSY

Query:  INSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVF
        I  +H + ++ ++ +ALI+MY KCG ++K+  +F +VE +D+  WS +I GLAM+G G +A  +F  M    + P+ +TF ++  ACSH GL+++   +F
Subjt:  INSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVF

Query:  EAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGN-ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANE
          M+  Y + PE +HYAC+VD+ G++G L++A  FI+ MP+     VWGALL AC++H N    E     LL  +    G + LLSN YA   +W + +E
Subjt:  EAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGN-ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANE

Query:  VRDAMRSRGLKKMAGCSWIEL
        +R  MR  GLKK  GCS IE+
Subjt:  VRDAMRSRGLKKMAGCSWIEL

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic8.4e-9635.82Show/hide
Query:  LTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGD---------
        +  P   ++N +      S  P + L LY  M+      N YTF + LK+C       +G +IH  ++K G   D+++  SL+  Y+ +G          
Subjt:  LTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGD---------

Query:  ----------------------VPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKL
                              + +A ++FD IP  DVVSW ++ISG ++ G  +EAL  F  M   NV P+ +T+V+ +SAC+    +++G+ +H    
Subjt:  ----------------------VPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKL

Query:  RSLNEESVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSY
              ++ + NAL+D Y +CG L  A  LF+ +P +DV+SW TLIGGY    L +EA+ +FQ M+ + E  PN+ T++++L AC+ + A+ +G+W+H Y
Subjt:  RSLNEESVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSY

Query:  INSR-HDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMV
        I+ R   V    ++  +LI+MY KCG ++ A  +F ++ HK + SW+ +I G AM+G+   +F LFS M   GI PD ITF+ LLSACSH G+++ G  +
Subjt:  INSR-HDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMV

Query:  FEAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGN-ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDAN
        F  M   Y + P++ HY CM+D+ G +GL  EAE  I  M +E +G +W +LL AC++HGN E  E   + L+  +    G+Y LLSN YAS  RWN+  
Subjt:  FEAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGN-ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDAN

Query:  EVRDAMRSRGLKKMAGCSWIEL
        + R  +  +G+KK+ GCS IE+
Subjt:  EVRDAMRSRGLKKMAGCSWIEL

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136008.2e-9135.26Show/hide
Query:  FNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVV
        +N ++         E  L  +  M +     N Y+F   L AC  L++ +KG+++H+ + KS  LSD++I ++L+  Y   G+V  A RVFD + D +VV
Subjt:  FNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVV

Query:  SWTSIISGLSKLGFKEEALGKF---LSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLN-EESVNLDNALLDFYVRCGSLRGAQNLFDEMP---
        SW S+I+   + G   EAL  F   L   V P+  TL S +SAC+SL  +K+G+ +HG  +++      + L NA +D Y +C  ++ A+ +FD MP   
Subjt:  SWTSIISGLSKLGFKEEALGKF---LSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLN-EESVNLDNALLDFYVRCGSLRGAQNLFDEMP---

Query:  ----------------------------QRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDV
                                    +R+VVSW  LI GY   G  EEA+ +F  ++      P   +  N+L AC+ ++ LHLG   H ++      
Subjt:  ----------------------------QRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDV

Query:  IIDGN-----IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEA
           G      +GN+LI+MYVKCG +++   +F+ +  +D +SW+ +I G A NG G +A  LF  ML  G  PD IT + +LSAC H G + +G   F +
Subjt:  IIDGN-----IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEA

Query:  MKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEK-VRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVR
        M   + VAP   HY CMVD+ G+AG L+EA++ I+EMP++ +  +WG+LL AC++H N    K V + LL  +    G Y LLSN YA   +W D   VR
Subjt:  MKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEK-VRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVR

Query:  DAMRSRGLKKMAGCSWIEL
         +MR  G+ K  GCSWI++
Subjt:  DAMRSRGLKKMAGCSWIEL

Q9SX45 Pentatricopeptide repeat-containing protein At1g502707.4e-9238.33Show/hide
Query:  YNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALG
        Y  M R+    + +TF   LKA F L +++   + HA ++K G  SD F++NSL+  Y   G    ASR+FD   D DVV+WT++I G  + G   EA+ 
Subjt:  YNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALG

Query:  KFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEE-SVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEA
         F+ M    V+ N  T+VS L A   +  V+ G+++HGL L +   +  V + ++L+D Y +C     AQ +FDEMP R+VV+WT LI GY  +   ++ 
Subjt:  KFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEE-SVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEA

Query:  VRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQG
        + VF+ M+ + +  PNE TL +VLSAC+ + ALH G+ VH Y+  ++ + I+   G  LI++YVKCG +++AI +F+ +  K++ +W+ +I+G A +G  
Subjt:  VRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQG

Query:  KQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLH
        + AF LF  ML   ++P+ +TF+++LSAC+HGGL+ +G  +F +MK  +N+ P+  HYACMVD++G+ GLL+EA+A I+ MP+E    VWGAL  +C LH
Subjt:  KQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLH

Query:  GNERYEK-VRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKMAGCSWIELADPL
         +    K     ++  +    G Y LL+N Y+    W++   VR  M+ + + K  G SWIE+   L
Subjt:  GNERYEK-VRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKMAGCSWIELADPL

Q9SZK1 Pentatricopeptide repeat-containing protein At4g380103.9e-9337.5Show/hide
Query:  FNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVV
        +N LL +      P   +F Y   + +  S + +TF    KAC       +G +IH  + K G   DI++QNSL+HFY V G+  +A +VF  +P  DVV
Subjt:  FNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVV

Query:  SWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSW
        SWT II+G ++ G  +EAL  F  M+V PN AT V  L +   + C+ +GK IHGL L+  +  S+   NAL+D YV+C  L  A  +F E+ ++D VSW
Subjt:  SWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSW

Query:  TTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDI
         ++I G       +EA+ +F  M  +    P+   L +VLSAC+S+ A+  G+WVH YI +   +  D +IG A+++MY KCG ++ A+ IF  +  K++
Subjt:  TTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDI

Query:  ISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKD-VYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPV
         +W+ ++ GLA++G G ++   F  M+  G  P+ +TFL+ L+AC H GL+++G   F  MK   YN+ P++ HY CM+D+  +AGLLDEA   +K MPV
Subjt:  ISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKD-VYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPV

Query:  EAEGPVWGALLHACQLHGN--ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKMAGCSWIE
        + +  + GA+L AC+  G   E  +++    L  +    G Y LLSN +A+  RW+D   +R  M+ +G+ K+ G S+IE
Subjt:  EAEGPVWGALLHACQLHGN--ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKMAGCSWIE

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.0e-9735.82Show/hide
Query:  LTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGD---------
        +  P   ++N +      S  P + L LY  M+      N YTF + LK+C       +G +IH  ++K G   D+++  SL+  Y+ +G          
Subjt:  LTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGD---------

Query:  ----------------------VPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKL
                              + +A ++FD IP  DVVSW ++ISG ++ G  +EAL  F  M   NV P+ +T+V+ +SAC+    +++G+ +H    
Subjt:  ----------------------VPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKL

Query:  RSLNEESVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSY
              ++ + NAL+D Y +CG L  A  LF+ +P +DV+SW TLIGGY    L +EA+ +FQ M+ + E  PN+ T++++L AC+ + A+ +G+W+H Y
Subjt:  RSLNEESVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSY

Query:  INSR-HDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMV
        I+ R   V    ++  +LI+MY KCG ++ A  +F ++ HK + SW+ +I G AM+G+   +F LFS M   GI PD ITF+ LLSACSH G+++ G  +
Subjt:  INSR-HDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMV

Query:  FEAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGN-ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDAN
        F  M   Y + P++ HY CM+D+ G +GL  EAE  I  M +E +G +W +LL AC++HGN E  E   + L+  +    G+Y LLSN YAS  RWN+  
Subjt:  FEAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGN-ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDAN

Query:  EVRDAMRSRGLKKMAGCSWIEL
        + R  +  +G+KK+ GCS IE+
Subjt:  EVRDAMRSRGLKKMAGCSWIEL

AT1G50270.1 Pentatricopeptide repeat (PPR) superfamily protein5.2e-9338.33Show/hide
Query:  YNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALG
        Y  M R+    + +TF   LKA F L +++   + HA ++K G  SD F++NSL+  Y   G    ASR+FD   D DVV+WT++I G  + G   EA+ 
Subjt:  YNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALG

Query:  KFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEE-SVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEA
         F+ M    V+ N  T+VS L A   +  V+ G+++HGL L +   +  V + ++L+D Y +C     AQ +FDEMP R+VV+WT LI GY  +   ++ 
Subjt:  KFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEE-SVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTLIGGYALTGLCEEA

Query:  VRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQG
        + VF+ M+ + +  PNE TL +VLSAC+ + ALH G+ VH Y+  ++ + I+   G  LI++YVKCG +++AI +F+ +  K++ +W+ +I+G A +G  
Subjt:  VRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQG

Query:  KQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLH
        + AF LF  ML   ++P+ +TF+++LSAC+HGGL+ +G  +F +MK  +N+ P+  HYACMVD++G+ GLL+EA+A I+ MP+E    VWGAL  +C LH
Subjt:  KQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLH

Query:  GNERYEK-VRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKMAGCSWIELADPL
         +    K     ++  +    G Y LL+N Y+    W++   VR  M+ + + K  G SWIE+   L
Subjt:  GNERYEK-VRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKMAGCSWIELADPL

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein5.8e-9235.26Show/hide
Query:  FNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVV
        +N ++         E  L  +  M +     N Y+F   L AC  L++ +KG+++H+ + KS  LSD++I ++L+  Y   G+V  A RVFD + D +VV
Subjt:  FNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVV

Query:  SWTSIISGLSKLGFKEEALGKF---LSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLN-EESVNLDNALLDFYVRCGSLRGAQNLFDEMP---
        SW S+I+   + G   EAL  F   L   V P+  TL S +SAC+SL  +K+G+ +HG  +++      + L NA +D Y +C  ++ A+ +FD MP   
Subjt:  SWTSIISGLSKLGFKEEALGKF---LSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLN-EESVNLDNALLDFYVRCGSLRGAQNLFDEMP---

Query:  ----------------------------QRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDV
                                    +R+VVSW  LI GY   G  EEA+ +F  ++      P   +  N+L AC+ ++ LHLG   H ++      
Subjt:  ----------------------------QRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDV

Query:  IIDGN-----IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEA
           G      +GN+LI+MYVKCG +++   +F+ +  +D +SW+ +I G A NG G +A  LF  ML  G  PD IT + +LSAC H G + +G   F +
Subjt:  IIDGN-----IGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEA

Query:  MKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEK-VRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVR
        M   + VAP   HY CMVD+ G+AG L+EA++ I+EMP++ +  +WG+LL AC++H N    K V + LL  +    G Y LLSN YA   +W D   VR
Subjt:  MKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNERYEK-VRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVR

Query:  DAMRSRGLKKMAGCSWIEL
         +MR  G+ K  GCSWI++
Subjt:  DAMRSRGLKKMAGCSWIEL

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.3e-9536.28Show/hide
Query:  PKPRVF--NPLLGALVDSVAPENGLFLYNQMLRHPSSH-NHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFD
        PKP  F  N L+ A      P   ++ +  M+     + N YTF + +KA   +     G  +H   +KS   SD+F+ NSL+H Y   GD+ SA +VF 
Subjt:  PKPRVF--NPLLGALVDSVAPENGLFLYNQMLRHPSSH-NHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFD

Query:  SIPDPDVVSWTSIISGLSKLGFKEEALGKFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDNALLDFYVRCGSLRGAQNLF
        +I + DVVSW S+I+G  + G  ++AL  F  M   +V  +  T+V  LSAC+ +R ++ G+ +      +    ++ L NA+LD Y +CGS+  A+ LF
Subjt:  SIPDPDVVSWTSIISGLSKLGFKEEALGKFLSM---NVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDNALLDFYVRCGSLRGAQNLF

Query:  DEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNM-------------VHAREAIPNEA------------------TLINVLSACSSMSALHLGQWVHSY
        D M ++D V+WTT++ GYA++   E A  V  +M              + +   PNEA                  TL++ LSAC+ + AL LG+W+HSY
Subjt:  DEMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNM-------------VHAREAIPNEA------------------TLINVLSACSSMSALHLGQWVHSY

Query:  INSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVF
        I  +H + ++ ++ +ALI+MY KCG ++K+  +F +VE +D+  WS +I GLAM+G G +A  +F  M    + P+ +TF ++  ACSH GL+++   +F
Subjt:  INSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVF

Query:  EAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGN-ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANE
          M+  Y + PE +HYAC+VD+ G++G L++A  FI+ MP+     VWGALL AC++H N    E     LL  +    G + LLSN YA   +W + +E
Subjt:  EAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGN-ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANE

Query:  VRDAMRSRGLKKMAGCSWIEL
        +R  MR  GLKK  GCS IE+
Subjt:  VRDAMRSRGLKKMAGCSWIEL

AT4G38010.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.8e-9437.5Show/hide
Query:  FNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVV
        +N LL +      P   +F Y   + +  S + +TF    KAC       +G +IH  + K G   DI++QNSL+HFY V G+  +A +VF  +P  DVV
Subjt:  FNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGDVPSASRVFDSIPDPDVV

Query:  SWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSW
        SWT II+G ++ G  +EAL  F  M+V PN AT V  L +   + C+ +GK IHGL L+  +  S+   NAL+D YV+C  L  A  +F E+ ++D VSW
Subjt:  SWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSW

Query:  TTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDI
         ++I G       +EA+ +F  M  +    P+   L +VLSAC+S+ A+  G+WVH YI +   +  D +IG A+++MY KCG ++ A+ IF  +  K++
Subjt:  TTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDI

Query:  ISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKD-VYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPV
         +W+ ++ GLA++G G ++   F  M+  G  P+ +TFL+ L+AC H GL+++G   F  MK   YN+ P++ HY CM+D+  +AGLLDEA   +K MPV
Subjt:  ISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKD-VYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPV

Query:  EAEGPVWGALLHACQLHGN--ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKMAGCSWIE
        + +  + GA+L AC+  G   E  +++    L  +    G Y LLSN +A+  RW+D   +R  M+ +G+ K+ G S+IE
Subjt:  EAEGPVWGALLHACQLHGN--ERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKMAGCSWIE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTTAGACCTCCAAAACTCTGGATTTCAAGCGCTCAACTATCCCAAATCCACGCCCAACTCCTCACGAATCCAAAGCCCCGCGTTTTCAACCCTTTGCTCGGTGC
TTTGGTGGACTCCGTCGCCCCTGAAAATGGCCTCTTCCTCTACAACCAAATGCTTCGACACCCATCTTCCCACAACCACTATACCTTCACTTACGCCCTCAAAGCCTGTT
TTCTTCTCCATGAAACCCACAAGGGCCTCGAAATCCATGCCCGTCTCATCAAATCAGGTCACCTTTCTGACATCTTCATCCAAAATTCTTTGCTCCATTTCTACATTGTC
GATGGCGATGTTCCTTCTGCTTCTCGAGTCTTTGATTCCATCCCTGACCCAGATGTGGTTTCGTGGACTTCGATCATTTCGGGGCTTTCCAAGTTGGGTTTTAAAGAGGA
AGCTCTGGGTAAGTTCTTGTCCATGAATGTGAGCCCCAATTCTGCTACTCTTGTTAGTGCTTTATCTGCTTGTTCTAGTCTAAGGTGTGTTAAGATTGGAAAAGCCATAC
ATGGGCTAAAATTGCGGAGTTTGAATGAGGAAAGTGTTAATTTGGACAATGCCCTTCTGGATTTTTACGTTAGATGTGGGTCTTTGAGGGGTGCGCAGAACCTGTTCGAC
GAAATGCCTCAAAGAGATGTAGTGTCTTGGACTACGTTAATCGGAGGTTATGCACTGACAGGATTATGTGAAGAGGCTGTGAGGGTATTCCAAAACATGGTTCATGCGAG
AGAGGCCATACCCAATGAGGCCACTCTAATCAATGTACTATCTGCATGTTCTTCCATGTCTGCTCTGCATTTGGGTCAATGGGTACATTCATATATCAACTCTAGGCACG
ACGTGATAATTGATGGAAACATTGGAAATGCTTTGATTAACATGTATGTAAAATGTGGCAGCATGGATAAGGCAATTTCAATCTTCAAAACTGTTGAACACAAGGATATC
ATATCATGGAGCACAATCATAAGTGGGTTAGCCATGAATGGCCAAGGCAAGCAAGCTTTTGGTCTCTTTTCACTCATGCTAGTTCATGGCATTACTCCAGATGCCATAAC
ATTTCTCAGCTTGTTATCTGCATGCAGCCATGGTGGGTTGATCAATCAAGGCTTGATGGTGTTTGAAGCCATGAAAGACGTTTACAATGTTGCACCTGAGATGAGGCATT
ATGCTTGCATGGTGGACATGTATGGGAAGGCTGGGCTTTTAGATGAAGCAGAGGCGTTCATAAAGGAGATGCCTGTGGAAGCAGAAGGGCCAGTATGGGGAGCGCTGCTT
CATGCTTGTCAACTCCATGGGAATGAGAGGTATGAGAAAGTTAGGCAATGGCTGCTTAGCAGCAAGAGCATTACAGTGGGAACTTATGCTTTGTTATCGAATACTTATGC
TAGTTGTGATAGATGGAATGATGCTAATGAAGTTCGAGACGCCATGAGAAGTAGAGGACTGAAGAAAATGGCTGGTTGTAGCTGGATTGAATTGGCTGATCCCTTGAATC
CCTTGGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCATTTAGACCTCCAAAACTCTGGATTTCAAGCGCTCAACTATCCCAAATCCACGCCCAACTCCTCACGAATCCAAAGCCCCGCGTTTTCAACCCTTTGCTCGGTGC
TTTGGTGGACTCCGTCGCCCCTGAAAATGGCCTCTTCCTCTACAACCAAATGCTTCGACACCCATCTTCCCACAACCACTATACCTTCACTTACGCCCTCAAAGCCTGTT
TTCTTCTCCATGAAACCCACAAGGGCCTCGAAATCCATGCCCGTCTCATCAAATCAGGTCACCTTTCTGACATCTTCATCCAAAATTCTTTGCTCCATTTCTACATTGTC
GATGGCGATGTTCCTTCTGCTTCTCGAGTCTTTGATTCCATCCCTGACCCAGATGTGGTTTCGTGGACTTCGATCATTTCGGGGCTTTCCAAGTTGGGTTTTAAAGAGGA
AGCTCTGGGTAAGTTCTTGTCCATGAATGTGAGCCCCAATTCTGCTACTCTTGTTAGTGCTTTATCTGCTTGTTCTAGTCTAAGGTGTGTTAAGATTGGAAAAGCCATAC
ATGGGCTAAAATTGCGGAGTTTGAATGAGGAAAGTGTTAATTTGGACAATGCCCTTCTGGATTTTTACGTTAGATGTGGGTCTTTGAGGGGTGCGCAGAACCTGTTCGAC
GAAATGCCTCAAAGAGATGTAGTGTCTTGGACTACGTTAATCGGAGGTTATGCACTGACAGGATTATGTGAAGAGGCTGTGAGGGTATTCCAAAACATGGTTCATGCGAG
AGAGGCCATACCCAATGAGGCCACTCTAATCAATGTACTATCTGCATGTTCTTCCATGTCTGCTCTGCATTTGGGTCAATGGGTACATTCATATATCAACTCTAGGCACG
ACGTGATAATTGATGGAAACATTGGAAATGCTTTGATTAACATGTATGTAAAATGTGGCAGCATGGATAAGGCAATTTCAATCTTCAAAACTGTTGAACACAAGGATATC
ATATCATGGAGCACAATCATAAGTGGGTTAGCCATGAATGGCCAAGGCAAGCAAGCTTTTGGTCTCTTTTCACTCATGCTAGTTCATGGCATTACTCCAGATGCCATAAC
ATTTCTCAGCTTGTTATCTGCATGCAGCCATGGTGGGTTGATCAATCAAGGCTTGATGGTGTTTGAAGCCATGAAAGACGTTTACAATGTTGCACCTGAGATGAGGCATT
ATGCTTGCATGGTGGACATGTATGGGAAGGCTGGGCTTTTAGATGAAGCAGAGGCGTTCATAAAGGAGATGCCTGTGGAAGCAGAAGGGCCAGTATGGGGAGCGCTGCTT
CATGCTTGTCAACTCCATGGGAATGAGAGGTATGAGAAAGTTAGGCAATGGCTGCTTAGCAGCAAGAGCATTACAGTGGGAACTTATGCTTTGTTATCGAATACTTATGC
TAGTTGTGATAGATGGAATGATGCTAATGAAGTTCGAGACGCCATGAGAAGTAGAGGACTGAAGAAAATGGCTGGTTGTAGCTGGATTGAATTGGCTGATCCCTTGAATC
CCTTGGGTTGA
Protein sequenceShow/hide protein sequence
MPFRPPKLWISSAQLSQIHAQLLTNPKPRVFNPLLGALVDSVAPENGLFLYNQMLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIV
DGDVPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSSLRCVKIGKAIHGLKLRSLNEESVNLDNALLDFYVRCGSLRGAQNLFD
EMPQRDVVSWTTLIGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRHDVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDI
ISWSTIISGLAMNGQGKQAFGLFSLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGPVWGALL
HACQLHGNERYEKVRQWLLSSKSITVGTYALLSNTYASCDRWNDANEVRDAMRSRGLKKMAGCSWIELADPLNPLG