; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G00460 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G00460
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr3:355096..357148
RNA-Seq ExpressionCSPI03G00460
SyntenyCSPI03G00460
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596261.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.2e-26483.84Show/hide
Query:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
        MNGL RR+ +AL  RN+PFFR LRS F++WS  +YHRDSY + KLL HCRSIRSVQELHAQI+VEG DQNGF+A KLIGKY E+  GE KM  ARKVFD 
Subjt:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT

Query:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
        L+ +DVFVWNVVIQGYA+ GPF EALNL+DEMRV GEPTNRYTFPFVLKACGAMKN DKG+IVHGHV+KCGLDLDLFVGNALIAFYSKCQDVETARKVFD
Subjt:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD

Query:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
        +MSLRDIVSWNSMI GYTLNGK DEAIM FHAMLHNQ  C+PD+ATLV ILPAC TKSASQVGFWVHSYVIKTG+EVGAPLGSCLI MY NCGHVN+ARD
Subjt:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
        VFDRI DKNVIVWSAIIR YGMHGFADEA NMF  LEEAG+KPDG+IFLNLLS CSHAGLV KGREIYEKMEAYG ERK+ HYACMVDLLGRAGFLEQAV
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
        EFIEGMPVQAGKDVYGALLGACRIHNN+ELAK+VG+KLF+LDPE A RYV LA+MYEDAGQWEDAAKLRKLLRDRNIRKP GCSSIE+DRI+HVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE

Query:  THPLTEEIFDTLEKLERIMEEDFEPI
        +HP TE+IFDTLEKLER+M+E+FEPI
Subjt:  THPLTEEIFDTLEKLERIMEEDFEPI

XP_008449159.1 PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Cucumis melo]3.1e-27688.21Show/hide
Query:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
        MNGL RRTWQALTFR+QP       FFSE   GKYHRDSY F+KLLGHCR+IRSVQELHAQILVEGLDQNGF+A KLIGKYVE  EGE KMGTARKVFD 
Subjt:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT

Query:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
        L++RDVF+WNVVIQGYAS GPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGA+KNSDKGEIVHG+VVKCGLDLDLFVGNALI+ YSKCQDVETARKVFD
Subjt:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD

Query:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
        DMSLRD VSWNSMIVGYTLN +ED+AIMFFHAMLHNQADC PD ATLVAILPACATKSASQVGFWVHSY+IKTG+EVGA LGSCLI MYGNCGH+NIARD
Subjt:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
        VF+RID+KNVIVWSAIIRCYGMHG ADEA NMFRRLEE GVKPDGLIFLNLLSACSHAGLVAKGREIY+KMEAYGLER D HYACMVDLL RAGFLEQA 
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
        EFIE MPVQAGKDVYGAL GACRIHNNLELAKEVGEKLFILDPE A RY+ LA+MYEDAGQWEDAAK+RKLLRDRNI+KPAGCSSIEVDRIHHVFGKKDE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE

Query:  THPLTEEIFDTLEKLERIMEEDFEPI
        THP TEEIFDT+EKLER+MEEDFEPI
Subjt:  THPLTEEIFDTLEKLERIMEEDFEPI

XP_011650274.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Cucumis sativus]1.2e-30798.48Show/hide
Query:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
        MNGLCRRTWQALTFRNQP       FFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
Subjt:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT

Query:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
        LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
Subjt:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD

Query:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
        DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
Subjt:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
        VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKG EIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
        EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE

Query:  THPLTEEIFDTLEKLERIMEEDFEPI
        THPLTEEIFDTLEKLERIMEEDFEPI
Subjt:  THPLTEEIFDTLEKLERIMEEDFEPI

XP_022971604.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita maxima]4.2e-26584.22Show/hide
Query:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
        MNGL RR  +AL  R QPFFR LRS F++WS  +YHRDSY + KLL HCRSIRSVQELHAQI+VEG DQNGF+A KLIGKY E+  GE KMG ARKVFD 
Subjt:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT

Query:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
        L+ +DVFVWNVVIQGYA+ GPF EALNL+DEMRV GEPTNRYTFPFVLKACGAMKNS+KG+IVHGHV+KCGLDLDLFVGNALIAFYSKCQDVETARKVFD
Subjt:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD

Query:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
        +MSLRDIVSWNSMI GYTLNGK DEAIM FHAMLHNQ  C+PD+ATLV ILPAC TKSASQVGFWVHSYVIKTG+EVGAPLGSCLI MY NCGHVNIARD
Subjt:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
        VFDRI DKNVIVWSAIIR YGMHGFADEA NMF  LEE G+KPDG+IFLNLLS CSHAGLV KGREIYEKMEAYG ERK+ HYACMVDLLGRAGFLEQAV
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
        EFIEGMPVQAGKDVYGALLGACRIHNN+ELAKEVG+KLF+LDPE A RYV LA+MYEDAGQWEDAAKLRKLLRDRNIRKP GCSSIE+DRI+HVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE

Query:  THPLTEEIFDTLEKLERIMEEDFEPI
        +HP TE+IFDTLEKLER+M+E+FEPI
Subjt:  THPLTEEIFDTLEKLERIMEEDFEPI

XP_038905146.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Benincasa hispida]3.4e-26784.98Show/hide
Query:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
        MN LCRRT QAL  RNQP FRLLRSFFS+ S GKYHRDSY + KLL HCR+IRSVQ LHAQI+VEG DQNGF+A KLIGKYVEH  GE KMG ARKVFD 
Subjt:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT

Query:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
        L++RDVFVWNVVIQGYA+LGPFVEALNLFDEMRVSG PTNRYTFPFVLKACGAMKNSDKG+IVHGHV+KCGLDLDLFV NALIAFY+KCQDVETARKVFD
Subjt:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD

Query:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
        +MSLRDIVSWNSMI GYTLN K ++AIM FHAMLHNQ+ C+PDSATL+ ILPAC TKSASQVGFWVHSYVIKTG+EVGA LGSCLI +Y NCGHVNIARD
Subjt:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
        VF+RIDDKNVIVWSAIIRCYGMHGFADEA NMF RLEEAG+KPDG++FLNLLS CSHAGL+AKG EIYEKME YG+ERK+ HYACMVDLLGRAGFLEQAV
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
        EFIEGMPVQAGKDVYGALLGACRIH N+ELAKE+GEKLFILD + A RY+ LA+MYEDAGQWEDAAKLRKLLRDRN+RKP GCSSIEVDRIHHVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE

Query:  THPLTEEIFDTLEKLERIMEEDFEPI
        THP TE+IFDTLEKLE IME DFEPI
Subjt:  THPLTEEIFDTLEKLERIMEEDFEPI

TrEMBL top hitse value%identityAlignment
A0A0A0L101 Uncharacterized protein5.6e-30898.48Show/hide
Query:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
        MNGLCRRTWQALTFRNQP       FFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
Subjt:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT

Query:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
        LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
Subjt:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD

Query:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
        DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
Subjt:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
        VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKG EIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
        EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE

Query:  THPLTEEIFDTLEKLERIMEEDFEPI
        THPLTEEIFDTLEKLERIMEEDFEPI
Subjt:  THPLTEEIFDTLEKLERIMEEDFEPI

A0A1S3BMB7 pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like1.5e-27688.21Show/hide
Query:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
        MNGL RRTWQALTFR+QP       FFSE   GKYHRDSY F+KLLGHCR+IRSVQELHAQILVEGLDQNGF+A KLIGKYVE  EGE KMGTARKVFD 
Subjt:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT

Query:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
        L++RDVF+WNVVIQGYAS GPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGA+KNSDKGEIVHG+VVKCGLDLDLFVGNALI+ YSKCQDVETARKVFD
Subjt:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD

Query:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
        DMSLRD VSWNSMIVGYTLN +ED+AIMFFHAMLHNQADC PD ATLVAILPACATKSASQVGFWVHSY+IKTG+EVGA LGSCLI MYGNCGH+NIARD
Subjt:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
        VF+RID+KNVIVWSAIIRCYGMHG ADEA NMFRRLEE GVKPDGLIFLNLLSACSHAGLVAKGREIY+KMEAYGLER D HYACMVDLL RAGFLEQA 
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
        EFIE MPVQAGKDVYGAL GACRIHNNLELAKEVGEKLFILDPE A RY+ LA+MYEDAGQWEDAAK+RKLLRDRNI+KPAGCSSIEVDRIHHVFGKKDE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE

Query:  THPLTEEIFDTLEKLERIMEEDFEPI
        THP TEEIFDT+EKLER+MEEDFEPI
Subjt:  THPLTEEIFDTLEKLERIMEEDFEPI

A0A5A7UCB1 Pentatricopeptide repeat-containing protein1.5e-27688.21Show/hide
Query:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
        MNGL RRTWQALTFR+QP       FFSE   GKYHRDSY F+KLLGHCR+IRSVQELHAQILVEGLDQNGF+A KLIGKYVE  EGE KMGTARKVFD 
Subjt:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT

Query:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
        L++RDVF+WNVVIQGYAS GPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGA+KNSDKGEIVHG+VVKCGLDLDLFVGNALI+ YSKCQDVETARKVFD
Subjt:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD

Query:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
        DMSLRD VSWNSMIVGYTLN +ED+AIMFFHAMLHNQADC PD ATLVAILPACATKSASQVGFWVHSY+IKTG+EVGA LGSCLI MYGNCGH+NIARD
Subjt:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
        VF+RID+KNVIVWSAIIRCYGMHG ADEA NMFRRLEE GVKPDGLIFLNLLSACSHAGLVAKGREIY+KMEAYGLER D HYACMVDLL RAGFLEQA 
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
        EFIE MPVQAGKDVYGAL GACRIHNNLELAKEVGEKLFILDPE A RY+ LA+MYEDAGQWEDAAK+RKLLRDRNI+KPAGCSSIEVDRIHHVFGKKDE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE

Query:  THPLTEEIFDTLEKLERIMEEDFEPI
        THP TEEIFDT+EKLER+MEEDFEPI
Subjt:  THPLTEEIFDTLEKLERIMEEDFEPI

A0A6J1FMZ9 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like1.3e-26483.65Show/hide
Query:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
        MNGL RR+ +AL  RN+PFFR LRS F++WS  +YHRDSY + KLL HCRSIRSV+ELHAQI+VEG DQNGF+A KLIGKY E+  GE KM  ARKVFD 
Subjt:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT

Query:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
        L+ +DVFVWNVVIQGYA+ GPF EALNL+DEMRV GEPTNRYTFPFVLKACGAMKN DKG+IVHGHV+KCGLDLDLFVGNALIAFYSKCQDVETARKVFD
Subjt:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD

Query:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
        +MSLRDIVSWNSMI GYTLNGK DEAIM FHAMLHNQ  C+PD+ATLV ILPAC TKSASQVGFWVHSYVIKTG+EVGAPLGSCLI MY NCGHVN+ARD
Subjt:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
        VFDRI DKNVIVWSAIIR YGMHGFADEA NMF  LEEAG+KPDG+IFLNLLS CSHAGLV KGREIYEKMEAYG ERK+ HYACMVDLLGRAGFLEQAV
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
        EFIEGMPVQAGKDVYGALLGACRIHNN+ELAK+VG+KLF+LDPE A RYV LA+MYEDAGQWEDAAKLRKLLRDRNIRKP GCSSIE+DRI+HVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE

Query:  THPLTEEIFDTLEKLERIMEEDFEPI
        +HP TE+IFDTLEKLER+M+E+FEPI
Subjt:  THPLTEEIFDTLEKLERIMEEDFEPI

A0A6J1I919 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like2.0e-26584.22Show/hide
Query:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT
        MNGL RR  +AL  R QPFFR LRS F++WS  +YHRDSY + KLL HCRSIRSVQELHAQI+VEG DQNGF+A KLIGKY E+  GE KMG ARKVFD 
Subjt:  MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDT

Query:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD
        L+ +DVFVWNVVIQGYA+ GPF EALNL+DEMRV GEPTNRYTFPFVLKACGAMKNS+KG+IVHGHV+KCGLDLDLFVGNALIAFYSKCQDVETARKVFD
Subjt:  LVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFD

Query:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD
        +MSLRDIVSWNSMI GYTLNGK DEAIM FHAMLHNQ  C+PD+ATLV ILPAC TKSASQVGFWVHSYVIKTG+EVGAPLGSCLI MY NCGHVNIARD
Subjt:  DMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV
        VFDRI DKNVIVWSAIIR YGMHGFADEA NMF  LEE G+KPDG+IFLNLLS CSHAGLV KGREIYEKMEAYG ERK+ HYACMVDLLGRAGFLEQAV
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE
        EFIEGMPVQAGKDVYGALLGACRIHNN+ELAKEVG+KLF+LDPE A RYV LA+MYEDAGQWEDAAKLRKLLRDRNIRKP GCSSIE+DRI+HVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDE

Query:  THPLTEEIFDTLEKLERIMEEDFEPI
        +HP TE+IFDTLEKLER+M+E+FEPI
Subjt:  THPLTEEIFDTLEKLERIMEEDFEPI

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic3.5e-9736.43Show/hide
Query:  IKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRY
        IK      S+   Q LH   +   +  + FVA  LI  Y    +    + +A KVF T+  +DV  WN +I G+   G   +AL LF +M       +  
Subjt:  IKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRY

Query:  TFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKC-------------------------------QDVETARKVFDDMSLRDIVSWN
        T   VL AC  ++N + G  V  ++ +  ++++L + NA++  Y+KC                               +D E AR+V + M  +DIV+WN
Subjt:  TFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKC-------------------------------QDVETARKVFDDMSLRDIVSWN

Query:  SMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVI
        ++I  Y  NGK +EA++ FH  L  Q +   +  TLV+ L ACA   A ++G W+HSY+ K GI +   + S LI MY  CG +  +R+VF+ ++ ++V 
Subjt:  SMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVI

Query:  VWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEA-YGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQA
        VWSA+I    MHG  +EA +MF +++EA VKP+G+ F N+  ACSH GLV +   ++ +ME+ YG+  ++ HYAC+VD+LGR+G+LE+AV+FIE MP+  
Subjt:  VWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEA-YGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQA

Query:  GKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFD
           V+GALLGAC+IH NL LA+    +L  L+P     +V L+ +Y   G+WE+ ++LRK +R   ++K  GCSSIE+D + H F   D  HP++E+++ 
Subjt:  GKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFD

Query:  TL-EKLERIMEEDFEP
         L E +E++    +EP
Subjt:  TL-EKLERIMEEDFEP

Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic1.9e-9538.82Show/hide
Query:  HCRSIRSVQELHAQILVEG-LDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVS-GEPTNRYTFP
        H   +R+ +ELHA  L  G LD+N FV + L+  Y    +    + + R+VFD +  R + +WN +I GY+      EAL LF  M  S G   N  T  
Subjt:  HCRSIRSVQELHAQILVEG-LDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVS-GEPTNRYTFP

Query:  FVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLH---------N
         V+ AC       + E +HG VVK GLD D FV N L+  YS+   ++ A ++F  M  RD+V+WN+MI GY  +   ++A++  H M +         +
Subjt:  FVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLH---------N

Query:  QADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRL
        +    P+S TL+ ILP+CA  SA   G  +H+Y IK  +     +GS L+ MY  CG + ++R VFD+I  KNVI W+ II  YGMHG   EA ++ R +
Subjt:  QADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRL

Query:  EEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEA-YGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQAGK-DVYGALLGACRIHNNLELAKEV
           GVKP+ + F+++ +ACSH+G+V +G  I+  M+  YG+E   +HYAC+VDLLGRAG +++A + +  MP    K   + +LLGA RIHNNLE+ +  
Subjt:  EEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEA-YGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQAGK-DVYGALLGACRIHNNLELAKEV

Query:  GEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFDTLEKL-ERIMEEDFEP
         + L  L+P  AS YV LA +Y  AG W+ A ++R+ ++++ +RK  GCS IE     H F   D +HP +E++   LE L ER+ +E + P
Subjt:  GEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFDTLEKL-ERIMEEDFEP

Q9FI80 Pentatricopeptide repeat-containing protein At5g489107.5e-9234.59Show/hide
Query:  LGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASL--GPFVEALNLFDEMRVSGE--PTNR
        + +CR+IR + ++HA  +  G  ++   AA+++      D     +  A K+F+ + +R+ F WN +I+G++       + A+ LF EM +S E    NR
Subjt:  LGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASL--GPFVEALNLFDEMRVSGE--PTNR

Query:  YTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKC---------------------------------------------QDVETAR
        +TFP VLKAC       +G+ +HG  +K G   D FV + L+  Y  C                                              D + AR
Subjt:  YTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKC---------------------------------------------QDVETAR

Query:  KVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVN
         +FD M  R +VSWN+MI GY+LNG   +A+  F  M   + D  P+  TLV++LPA +   + ++G W+H Y   +GI +   LGS LI MY  CG + 
Subjt:  KVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVN

Query:  IARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAY-GLERKDNHYACMVDLLGRAGF
         A  VF+R+  +NVI WSA+I  + +HG A +A + F ++ +AGV+P  + ++NLL+ACSH GLV +GR  + +M +  GLE +  HY CMVDLLGR+G 
Subjt:  IARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAY-GLERKDNHYACMVDLLGRAGF

Query:  LEQAVEFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVF
        L++A EFI  MP++    ++ ALLGACR+  N+E+ K V   L  + P  +  YV L+ MY   G W + +++R  +++++IRK  GCS I++D + H F
Subjt:  LEQAVEFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVF

Query:  GKKDETHPLTEEIFDTLEKL-ERIMEEDFEPI
          +D++HP  +EI   L ++ +++    + PI
Subjt:  GKKDETHPLTEEIFDTLEKL-ERIMEEDFEPI

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.1e-9335.59Show/hide
Query:  DSYGFIKLLGHC---RSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHD------------------------EGEVKMG---TARKVFDTLVRRDVF
        +SY F  +L  C   ++ +  Q++H  +L  G D + +V   LI  YV++                         +G    G    A+K+FD +  +DV 
Subjt:  DSYGFIKLLGHC---RSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHD------------------------EGEVKMG---TARKVFDTLVRRDVF

Query:  VWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDI
         WN +I GYA  G + EAL LF +M  +    +  T   V+ AC    + + G  VH  +   G   +L + NALI  YSKC ++ETA  +F+ +  +D+
Subjt:  VWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDI

Query:  VSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIK--TGIEVGAPLGSCLICMYGNCGHVNIARDVFDRI
        +SWN++I GYT      EA++ F  ML +    TP+  T+++ILPACA   A  +G W+H Y+ K   G+   + L + LI MY  CG +  A  VF+ I
Subjt:  VSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIK--TGIEVGAPLGSCLICMYGNCGHVNIARDVFDRI

Query:  DDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKM-EAYGLERKDNHYACMVDLLGRAGFLEQAVEFIE
          K++  W+A+I  + MHG AD +F++F R+ + G++PD + F+ LLSACSH+G++  GR I+  M + Y +  K  HY CM+DLLG +G  ++A E I 
Subjt:  DDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKM-EAYGLERKDNHYACMVDLLGRAGFLEQAVEFIE

Query:  GMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPL
         M ++    ++ +LL AC++H N+EL +   E L  ++PE    YV L+ +Y  AG+W + AK R LL D+ ++K  GCSSIE+D + H F   D+ HP 
Subjt:  GMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPL

Query:  TEEIFDTLEKLERIMEE
          EI+  LE++E ++E+
Subjt:  TEEIFDTLEKLERIMEE

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic1.7e-9937.92Show/hide
Query:  GHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPF
        GH  S+     +H  IL  G DQ+ F+A KLIG Y   D G V    ARKVFD   +R ++VWN + +     G   E L L+ +M   G  ++R+T+ +
Subjt:  GHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPF

Query:  VLKACGA----MKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTP
        VLKAC A    + +  KG+ +H H+ + G    +++   L+  Y++   V+ A  VF  M +R++VSW++MI  Y  NGK  EA+  F  M+    D +P
Subjt:  VLKACGA----MKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTP

Query:  DSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVK
        +S T+V++L ACA+ +A + G  +H Y+++ G++   P+ S L+ MYG CG + + + VFDR+ D++V+ W+++I  YG+HG+  +A  +F  +   G  
Subjt:  DSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVK

Query:  PDGLIFLNLLSACSHAGLVAKGREIYEKM-EAYGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFIL
        P  + F+++L ACSH GLV +G+ ++E M   +G++ +  HYACMVDLLGRA  L++A + ++ M  + G  V+G+LLG+CRIH N+ELA+    +LF L
Subjt:  PDGLIFLNLLSACSHAGLVAKGREIYEKM-EAYGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFIL

Query:  DPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFDTLEKLERIMEE
        +P+ A  YV LA +Y +A  W++  +++KLL  R ++K  G   +EV R  + F   DE +PL E+I   L KL   M+E
Subjt:  DPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFDTLEKLERIMEE

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.5e-9535.59Show/hide
Query:  DSYGFIKLLGHC---RSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHD------------------------EGEVKMG---TARKVFDTLVRRDVF
        +SY F  +L  C   ++ +  Q++H  +L  G D + +V   LI  YV++                         +G    G    A+K+FD +  +DV 
Subjt:  DSYGFIKLLGHC---RSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHD------------------------EGEVKMG---TARKVFDTLVRRDVF

Query:  VWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDI
         WN +I GYA  G + EAL LF +M  +    +  T   V+ AC    + + G  VH  +   G   +L + NALI  YSKC ++ETA  +F+ +  +D+
Subjt:  VWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDI

Query:  VSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIK--TGIEVGAPLGSCLICMYGNCGHVNIARDVFDRI
        +SWN++I GYT      EA++ F  ML +    TP+  T+++ILPACA   A  +G W+H Y+ K   G+   + L + LI MY  CG +  A  VF+ I
Subjt:  VSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIK--TGIEVGAPLGSCLICMYGNCGHVNIARDVFDRI

Query:  DDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKM-EAYGLERKDNHYACMVDLLGRAGFLEQAVEFIE
          K++  W+A+I  + MHG AD +F++F R+ + G++PD + F+ LLSACSH+G++  GR I+  M + Y +  K  HY CM+DLLG +G  ++A E I 
Subjt:  DDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKM-EAYGLERKDNHYACMVDLLGRAGFLEQAVEFIE

Query:  GMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPL
         M ++    ++ +LL AC++H N+EL +   E L  ++PE    YV L+ +Y  AG+W + AK R LL D+ ++K  GCSSIE+D + H F   D+ HP 
Subjt:  GMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPL

Query:  TEEIFDTLEKLERIMEE
          EI+  LE++E ++E+
Subjt:  TEEIFDTLEKLERIMEE

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.5e-9836.43Show/hide
Query:  IKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRY
        IK      S+   Q LH   +   +  + FVA  LI  Y    +    + +A KVF T+  +DV  WN +I G+   G   +AL LF +M       +  
Subjt:  IKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRY

Query:  TFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKC-------------------------------QDVETARKVFDDMSLRDIVSWN
        T   VL AC  ++N + G  V  ++ +  ++++L + NA++  Y+KC                               +D E AR+V + M  +DIV+WN
Subjt:  TFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKC-------------------------------QDVETARKVFDDMSLRDIVSWN

Query:  SMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVI
        ++I  Y  NGK +EA++ FH  L  Q +   +  TLV+ L ACA   A ++G W+HSY+ K GI +   + S LI MY  CG +  +R+VF+ ++ ++V 
Subjt:  SMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVI

Query:  VWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEA-YGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQA
        VWSA+I    MHG  +EA +MF +++EA VKP+G+ F N+  ACSH GLV +   ++ +ME+ YG+  ++ HYAC+VD+LGR+G+LE+AV+FIE MP+  
Subjt:  VWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEA-YGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQA

Query:  GKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFD
           V+GALLGAC+IH NL LA+    +L  L+P     +V L+ +Y   G+WE+ ++LRK +R   ++K  GCSSIE+D + H F   D  HP++E+++ 
Subjt:  GKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFD

Query:  TL-EKLERIMEEDFEP
         L E +E++    +EP
Subjt:  TL-EKLERIMEEDFEP

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-10037.92Show/hide
Query:  GHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPF
        GH  S+     +H  IL  G DQ+ F+A KLIG Y   D G V    ARKVFD   +R ++VWN + +     G   E L L+ +M   G  ++R+T+ +
Subjt:  GHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPF

Query:  VLKACGA----MKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTP
        VLKAC A    + +  KG+ +H H+ + G    +++   L+  Y++   V+ A  VF  M +R++VSW++MI  Y  NGK  EA+  F  M+    D +P
Subjt:  VLKACGA----MKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTP

Query:  DSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVK
        +S T+V++L ACA+ +A + G  +H Y+++ G++   P+ S L+ MYG CG + + + VFDR+ D++V+ W+++I  YG+HG+  +A  +F  +   G  
Subjt:  DSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVK

Query:  PDGLIFLNLLSACSHAGLVAKGREIYEKM-EAYGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFIL
        P  + F+++L ACSH GLV +G+ ++E M   +G++ +  HYACMVDLLGRA  L++A + ++ M  + G  V+G+LLG+CRIH N+ELA+    +LF L
Subjt:  PDGLIFLNLLSACSHAGLVAKGREIYEKM-EAYGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFIL

Query:  DPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFDTLEKLERIMEE
        +P+ A  YV LA +Y +A  W++  +++KLL  R ++K  G   +EV R  + F   DE +PL E+I   L KL   M+E
Subjt:  DPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFDTLEKLERIMEE

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-9638.82Show/hide
Query:  HCRSIRSVQELHAQILVEG-LDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVS-GEPTNRYTFP
        H   +R+ +ELHA  L  G LD+N FV + L+  Y    +    + + R+VFD +  R + +WN +I GY+      EAL LF  M  S G   N  T  
Subjt:  HCRSIRSVQELHAQILVEG-LDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLGPFVEALNLFDEMRVS-GEPTNRYTFP

Query:  FVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLH---------N
         V+ AC       + E +HG VVK GLD D FV N L+  YS+   ++ A ++F  M  RD+V+WN+MI GY  +   ++A++  H M +         +
Subjt:  FVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLH---------N

Query:  QADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRL
        +    P+S TL+ ILP+CA  SA   G  +H+Y IK  +     +GS L+ MY  CG + ++R VFD+I  KNVI W+ II  YGMHG   EA ++ R +
Subjt:  QADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRL

Query:  EEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEA-YGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQAGK-DVYGALLGACRIHNNLELAKEV
           GVKP+ + F+++ +ACSH+G+V +G  I+  M+  YG+E   +HYAC+VDLLGRAG +++A + +  MP    K   + +LLGA RIHNNLE+ +  
Subjt:  EEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEA-YGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQAGK-DVYGALLGACRIHNNLELAKEV

Query:  GEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFDTLEKL-ERIMEEDFEP
         + L  L+P  AS YV LA +Y  AG W+ A ++R+ ++++ +RK  GCS IE     H F   D +HP +E++   LE L ER+ +E + P
Subjt:  GEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFDTLEKL-ERIMEEDFEP

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein5.4e-9334.59Show/hide
Query:  LGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASL--GPFVEALNLFDEMRVSGE--PTNR
        + +CR+IR + ++HA  +  G  ++   AA+++      D     +  A K+F+ + +R+ F WN +I+G++       + A+ LF EM +S E    NR
Subjt:  LGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASL--GPFVEALNLFDEMRVSGE--PTNR

Query:  YTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKC---------------------------------------------QDVETAR
        +TFP VLKAC       +G+ +HG  +K G   D FV + L+  Y  C                                              D + AR
Subjt:  YTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKC---------------------------------------------QDVETAR

Query:  KVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVN
         +FD M  R +VSWN+MI GY+LNG   +A+  F  M   + D  P+  TLV++LPA +   + ++G W+H Y   +GI +   LGS LI MY  CG + 
Subjt:  KVFDDMSLRDIVSWNSMIVGYTLNGKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVN

Query:  IARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAY-GLERKDNHYACMVDLLGRAGF
         A  VF+R+  +NVI WSA+I  + +HG A +A + F ++ +AGV+P  + ++NLL+ACSH GLV +GR  + +M +  GLE +  HY CMVDLLGR+G 
Subjt:  IARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAY-GLERKDNHYACMVDLLGRAGF

Query:  LEQAVEFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVF
        L++A EFI  MP++    ++ ALLGACR+  N+E+ K V   L  + P  +  YV L+ MY   G W + +++R  +++++IRK  GCS I++D + H F
Subjt:  LEQAVEFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFILDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVF

Query:  GKKDETHPLTEEIFDTLEKL-ERIMEEDFEPI
          +D++HP  +EI   L ++ +++    + PI
Subjt:  GKKDETHPLTEEIFDTLEKL-ERIMEEDFEPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGGGCTTTGCCGGCGAACTTGGCAAGCTCTGACGTTCAGAAATCAACCCTTCTTCAGATTGCTAAGATCCTTCTTCAGCGAATGGAGTTTTGGGAAGTATCATCG
CGATTCGTACGGTTTTATCAAGTTATTGGGGCATTGCAGAAGCATTAGAAGCGTTCAGGAACTACATGCCCAGATCCTCGTTGAGGGTCTTGATCAAAATGGATTTGTTG
CCGCGAAACTGATCGGGAAATACGTGGAGCATGATGAGGGTGAAGTGAAAATGGGAACTGCACGGAAGGTGTTTGATACATTGGTTCGAAGAGATGTGTTTGTGTGGAAT
GTGGTTATACAAGGGTATGCGAGTTTGGGTCCGTTTGTTGAAGCTCTTAATCTGTTTGATGAAATGCGAGTCAGTGGGGAACCCACAAATCGCTATACGTTTCCTTTTGT
GTTGAAGGCATGTGGCGCAATGAAGAATAGTGATAAGGGTGAGATTGTTCATGGACATGTTGTGAAATGTGGGTTGGACTTGGATTTGTTTGTGGGTAATGCTCTGATCG
CGTTTTATTCCAAATGTCAGGACGTTGAAACTGCTCGTAAAGTGTTTGATGATATGTCTCTGAGAGACATTGTGAGTTGGAACTCCATGATTGTTGGTTATACTTTGAAT
GGGAAAGAGGACGAGGCTATAATGTTTTTTCATGCCATGCTGCATAACCAAGCTGATTGCACACCTGATTCTGCAACTCTTGTTGCGATTCTACCTGCTTGTGCTACAAA
ATCTGCTTCCCAAGTTGGCTTTTGGGTTCATTCCTATGTTATAAAGACAGGAATAGAAGTTGGGGCTCCCCTGGGCAGTTGCCTTATTTGTATGTATGGTAACTGTGGTC
ATGTGAACATTGCAAGAGATGTTTTCGACCGAATCGACGACAAAAATGTTATCGTATGGAGTGCAATCATAAGGTGTTATGGAATGCATGGTTTTGCAGATGAAGCATTT
AACATGTTCAGAAGATTGGAAGAAGCTGGTGTTAAACCAGATGGTCTGATCTTCCTGAACTTGTTGTCGGCTTGTAGTCACGCAGGGCTCGTTGCTAAAGGTCGCGAGAT
ATACGAAAAAATGGAGGCTTACGGTTTGGAGAGGAAAGATAACCACTACGCGTGCATGGTGGATCTCTTAGGGAGGGCTGGTTTCTTAGAACAAGCGGTAGAATTCATCG
AAGGGATGCCCGTGCAGGCAGGAAAAGATGTATATGGTGCATTGTTGGGTGCTTGTAGGATACATAACAACCTAGAGCTAGCTAAAGAAGTGGGGGAGAAGTTGTTTATC
TTAGATCCCGAAAAAGCCAGTCGGTATGTGACCTTAGCAACTATGTATGAAGATGCAGGGCAGTGGGAAGATGCTGCTAAACTAAGGAAGTTGCTGAGGGATAGGAATAT
TAGGAAGCCAGCTGGTTGCAGTTCGATAGAGGTAGATAGGATCCATCATGTGTTTGGGAAGAAGGATGAAACTCATCCCCTCACGGAAGAGATTTTTGACACATTAGAGA
AGCTAGAAAGAATAATGGAGGAAGATTTTGAACCTATTTAA
mRNA sequenceShow/hide mRNA sequence
TGTCAACATCCATACATCATCATCATCACCACCATCATCATCAATCATCACGTATTATATCCACAACGGAGACTACATTCGGTACCGGGAAACTGATCGGCGTATTTTCC
ACTGCCGTGAGAGAACAGTAATAAATGAATGGGCTTTGCCGGCGAACTTGGCAAGCTCTGACGTTCAGAAATCAACCCTTCTTCAGATTGCTAAGATCCTTCTTCAGCGA
ATGGAGTTTTGGGAAGTATCATCGCGATTCGTACGGTTTTATCAAGTTATTGGGGCATTGCAGAAGCATTAGAAGCGTTCAGGAACTACATGCCCAGATCCTCGTTGAGG
GTCTTGATCAAAATGGATTTGTTGCCGCGAAACTGATCGGGAAATACGTGGAGCATGATGAGGGTGAAGTGAAAATGGGAACTGCACGGAAGGTGTTTGATACATTGGTT
CGAAGAGATGTGTTTGTGTGGAATGTGGTTATACAAGGGTATGCGAGTTTGGGTCCGTTTGTTGAAGCTCTTAATCTGTTTGATGAAATGCGAGTCAGTGGGGAACCCAC
AAATCGCTATACGTTTCCTTTTGTGTTGAAGGCATGTGGCGCAATGAAGAATAGTGATAAGGGTGAGATTGTTCATGGACATGTTGTGAAATGTGGGTTGGACTTGGATT
TGTTTGTGGGTAATGCTCTGATCGCGTTTTATTCCAAATGTCAGGACGTTGAAACTGCTCGTAAAGTGTTTGATGATATGTCTCTGAGAGACATTGTGAGTTGGAACTCC
ATGATTGTTGGTTATACTTTGAATGGGAAAGAGGACGAGGCTATAATGTTTTTTCATGCCATGCTGCATAACCAAGCTGATTGCACACCTGATTCTGCAACTCTTGTTGC
GATTCTACCTGCTTGTGCTACAAAATCTGCTTCCCAAGTTGGCTTTTGGGTTCATTCCTATGTTATAAAGACAGGAATAGAAGTTGGGGCTCCCCTGGGCAGTTGCCTTA
TTTGTATGTATGGTAACTGTGGTCATGTGAACATTGCAAGAGATGTTTTCGACCGAATCGACGACAAAAATGTTATCGTATGGAGTGCAATCATAAGGTGTTATGGAATG
CATGGTTTTGCAGATGAAGCATTTAACATGTTCAGAAGATTGGAAGAAGCTGGTGTTAAACCAGATGGTCTGATCTTCCTGAACTTGTTGTCGGCTTGTAGTCACGCAGG
GCTCGTTGCTAAAGGTCGCGAGATATACGAAAAAATGGAGGCTTACGGTTTGGAGAGGAAAGATAACCACTACGCGTGCATGGTGGATCTCTTAGGGAGGGCTGGTTTCT
TAGAACAAGCGGTAGAATTCATCGAAGGGATGCCCGTGCAGGCAGGAAAAGATGTATATGGTGCATTGTTGGGTGCTTGTAGGATACATAACAACCTAGAGCTAGCTAAA
GAAGTGGGGGAGAAGTTGTTTATCTTAGATCCCGAAAAAGCCAGTCGGTATGTGACCTTAGCAACTATGTATGAAGATGCAGGGCAGTGGGAAGATGCTGCTAAACTAAG
GAAGTTGCTGAGGGATAGGAATATTAGGAAGCCAGCTGGTTGCAGTTCGATAGAGGTAGATAGGATCCATCATGTGTTTGGGAAGAAGGATGAAACTCATCCCCTCACGG
AAGAGATTTTTGACACATTAGAGAAGCTAGAAAGAATAATGGAGGAAGATTTTGAACCTATTTAATGGATTTTTTGTTCACACACGTTTCAATATACGTAGTTTAAGGAC
AATTTATCCTTGACTATGCTTTTATTTTTCTCCTCTTTCTAGGAAGTTTCAATCCAAATTCAAATGGATCAAACATAAATTGAAGACAAGCATACTTTAACCATCATTTT
GTTTTCCTCTCCTCGTGTTAATGACTTCAATTCGAACATTAGTGATAGCAAACGGTTTGTTGGATGCTGGTGAAATTTACAAAGGCAGTTTTAGTTTACATTTAGTTTAA
TTTCATCTTTTTTTAGTTATGTTATTCTAAGGGAGGAAAAGGCGGAGTTAAAGTAGTTTTGAATGTTAGAATC
Protein sequenceShow/hide protein sequence
MNGLCRRTWQALTFRNQPFFRLLRSFFSEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHAQILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWN
VVIQGYASLGPFVEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGEIVHGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDIVSWNSMIVGYTLN
GKEDEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFADEAF
NMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGREIYEKMEAYGLERKDNHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNLELAKEVGEKLFI
LDPEKASRYVTLATMYEDAGQWEDAAKLRKLLRDRNIRKPAGCSSIEVDRIHHVFGKKDETHPLTEEIFDTLEKLERIMEEDFEPI