; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020997 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020997
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold290:436432..437823
RNA-Seq ExpressionMS020997
SyntenyMS020997
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596261.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.5e-24587.93Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC
        R+IRSVQELHAQI+VEG DQNGFLATKLIGKYAE+G+ KM +ARKVFDRL+E+DVFVWNVVI+GYANWGPF EALNL+DEMRV G PTNRYTFPFVLKAC
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC

Query:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL
        GAMKN DKGK+VHGHVLK GLDLDLFVGNALIAFY+KC D+ETARKVFDEM L+DIVSWNSMIAG+T NGKVDEAIMLFHAM+HNQ AC+PDNATLVGIL
Subjt:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL

Query:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL
        PACV+KSA+QVGFWVHSYVIKTGM+VGAPLGSCLISMYANCGHVN+ARDVF++I DKNVIVWSA+IR YGMHG ADEAL MFTSLEE GLKPDGVIFLNL
Subjt:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL

Query:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI
        LSTCSHAGLV KGREIYEKME YG ERKEEHYACMVDLLGRAGF++QAV+FIEGMP+QAGKDVYGALLGACRIHNNIE+AK+  +KLFVLDPENAGRYVI
Subjt:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI

Query:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        LASM+EDAGQWEDAAKLRKLLRDR I+KPVGCSSIEIDRI+HVFGKEDESHPFTEQIFDTLEKL
Subjt:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

XP_022145276.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Momordica charantia]4.5e-27198.71Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC
        RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRV GAPTNRYTFPFVLKAC
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC

Query:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL
        GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIET RKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGAC+PDNATLVGIL
Subjt:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL

Query:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL
        PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFN+IDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL
Subjt:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL

Query:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI
        LSTCSHAGLVAKGR+IYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI
Subjt:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI

Query:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPF+EQIFDTLEKL
Subjt:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

XP_022942131.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita moschata]3.2e-24587.72Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC
        R+IRSV+ELHAQI+VEG DQNGFLATKLIGKYAE+G+ KM +ARKVFDRL+E+DVFVWNVVI+GYANWGPF EALNL+DEMRV G PTNRYTFPFVLKAC
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC

Query:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL
        GAMKN DKGK+VHGHVLK GLDLDLFVGNALIAFY+KC D+ETARKVFDEM L+DIVSWNSMIAG+T NGKVDEAIMLFHAM+HNQ AC+PDNATLVGIL
Subjt:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL

Query:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL
        PACV+KSA+QVGFWVHSYVIKTGM+VGAPLGSCLISMYANCGHVN+ARDVF++I DKNVIVWSA+IR YGMHG ADEAL MFTSLEE GLKPDGVIFLNL
Subjt:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL

Query:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI
        LSTCSHAGLV KGREIYEKME YG ERKEEHYACMVDLLGRAGF++QAV+FIEGMP+QAGKDVYGALLGACRIHNNIE+AK+  +KLFVLDPENAGRYVI
Subjt:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI

Query:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        LASM+EDAGQWEDAAKLRKLLRDR I+KPVGCSSIEIDRI+HVFGKEDESHPFTEQIFDTLEKL
Subjt:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

XP_022971604.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita maxima]3.5e-24788.58Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC
        R+IRSVQELHAQI+VEG DQNGFLATKLIGKYAE+G+ KMG+ARKVFDRL+E+DVFVWNVVI+GYANWGPF EALNL+DEMRV G PTNRYTFPFVLKAC
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC

Query:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL
        GAMKN +KGK+VHGHVLK GLDLDLFVGNALIAFY+KC D+ETARKVFDEM L+DIVSWNSMIAG+T NGKVDEAIMLFHAM+HNQ AC+PDNATLVGIL
Subjt:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL

Query:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL
        PACV+KSA+QVGFWVHSYVIKTGM+VGAPLGSCLISMYANCGHVNIARDVF++I DKNVIVWSA+IR YGMHG ADEAL MFTSLEEVGLKPDGVIFLNL
Subjt:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL

Query:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI
        LSTCSHAGLV KGREIYEKME YG ERKEEHYACMVDLLGRAGF++QAV+FIEGMP+QAGKDVYGALLGACRIHNNIE+AKE  +KLFVLDPENAGRYVI
Subjt:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI

Query:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        LASM+EDAGQWEDAAKLRKLLRDR I+KPVGCSSIEIDRI+HVFGKEDESHPFTEQIFDTLEKL
Subjt:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

XP_023540099.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita pepo subsp. pepo]5.0e-24687.72Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC
        R+IRSVQELHAQI+VEG DQNGFLATKLIGKYAE+G+ KMG+ARKVFDRL+E+DVF+WNVVI+GYANWGPF EALNL+DEMRV G PTNRYTFPFVLKAC
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC

Query:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL
        GAMKN DKGK+VHGHVLK GLDLDLFVGNALIAFY+KC D+ETARKVFDEM L+DIVSWNSMIAG+T NGKVDEAIM+FHAM+HNQ AC+PDNATLVGIL
Subjt:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL

Query:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL
        PACV+KSA+QVGFWVHSY+IKTGM+VGAPLGSCLISMYANCGHVNIARDVF++I DKNVIVWSA+IR YGMHG ADEAL MFTSLEE GLKPDGVIFLNL
Subjt:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL

Query:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI
        LSTCSHAGLV KGREIYE+ME YG ERKEEHYACMVDLLGRAGF++QAV+FIEGMP+QAGKDVYGALLGACRIHNNIE+AKE  +KLFVLDPENAGRYVI
Subjt:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI

Query:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        LASM+EDAGQWEDAAKLRKLLRDR I+KPVGCSSIEIDRI+HVFGKEDESHPFTEQIFDTLEKL
Subjt:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

TrEMBL top hitse value%identityAlignment
A0A0A0L101 Uncharacterized protein8.4e-23181.55Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEH--GDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLK
        R+IRSVQELHAQI+VEGLDQNGF+A KLIGKY EH  G+ KMG ARKVFD LV RDVFVWNVVI+GYA+ GPF EALNLFDEMRV G PTNRYTFPFVLK
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEH--GDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLK

Query:  ACGAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVG
        ACGAMKN DKG++VHGHV+K GLDLDLFVGNALIAFY+KC D+ETARKVFD+M L+DIVSWNSMI G+T NGK DEAIM FHAM+HNQ  C PD+ATLV 
Subjt:  ACGAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVG

Query:  ILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFL
        ILPAC +KSA+QVGFWVHSYVIKTG++VGAPLGSCLI MY NCGHVNIARDVF++IDDKNVIVWSA+IRCYGMHG ADEA  MF  LEE G+KPDG+IFL
Subjt:  ILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFL

Query:  NLLSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRY
        NLLS CSHAGLVAKG EIYEKME YG+ERK+ HYACMVDLLGRAGF++QAV+FIEGMP+QAGKDVYGALLGACRIHNN+E+AKE  EKLF+LDPE A RY
Subjt:  NLLSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRY

Query:  VILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        V LA+M+EDAGQWEDAAKLRKLLRDR I+KP GCSSIE+DRIHHVFGK+DE+HP TE+IFDTLEKL
Subjt:  VILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

A0A5A7UCB1 Pentatricopeptide repeat-containing protein6.0e-22980.26Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAE--HGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLK
        RTIRSVQELHAQI+VEGLDQNGFLATKLIGKY E   G+SKMG ARKVFDRL++RDVF+WNVVI+GYA++GPF EALNLFDEMRV G PTNRYTFPFVLK
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAE--HGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLK

Query:  ACGAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVG
        ACGA+KN DKG++VHG+V+K GLDLDLFVGNALI+ Y+KC D+ETARKVFD+M L+D VSWNSMI G+T N + D+AIM FHAM+HNQ  C PD+ATLV 
Subjt:  ACGAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVG

Query:  ILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFL
        ILPAC +KSA+QVGFWVHSY+IKTGM+VGA LGSCLISMY NCGH+NIARDVFN+ID+KNVIVWSA+IRCYGMHGLADEAL MF  LEEVG+KPDG+IFL
Subjt:  ILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFL

Query:  NLLSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRY
        NLLS CSHAGLVAKGREIY+KME YG+ER ++HYACMVDLL RAGF++QA +FIE MP+QAGKDVYGAL GACRIHNN+E+AKE  EKLF+LDPENAGRY
Subjt:  NLLSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRY

Query:  VILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        +ILASM+EDAGQWEDAAK+RKLLRDR IKKP GCSSIE+DRIHHVFGK+DE+HPFTE+IFDT+EKL
Subjt:  VILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

A0A6J1CVV9 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like2.2e-27198.71Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC
        RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRV GAPTNRYTFPFVLKAC
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC

Query:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL
        GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIET RKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGAC+PDNATLVGIL
Subjt:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL

Query:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL
        PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFN+IDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL
Subjt:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL

Query:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI
        LSTCSHAGLVAKGR+IYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI
Subjt:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI

Query:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPF+EQIFDTLEKL
Subjt:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

A0A6J1FMZ9 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like1.6e-24587.72Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC
        R+IRSV+ELHAQI+VEG DQNGFLATKLIGKYAE+G+ KM +ARKVFDRL+E+DVFVWNVVI+GYANWGPF EALNL+DEMRV G PTNRYTFPFVLKAC
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC

Query:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL
        GAMKN DKGK+VHGHVLK GLDLDLFVGNALIAFY+KC D+ETARKVFDEM L+DIVSWNSMIAG+T NGKVDEAIMLFHAM+HNQ AC+PDNATLVGIL
Subjt:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL

Query:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL
        PACV+KSA+QVGFWVHSYVIKTGM+VGAPLGSCLISMYANCGHVN+ARDVF++I DKNVIVWSA+IR YGMHG ADEAL MFTSLEE GLKPDGVIFLNL
Subjt:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL

Query:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI
        LSTCSHAGLV KGREIYEKME YG ERKEEHYACMVDLLGRAGF++QAV+FIEGMP+QAGKDVYGALLGACRIHNNIE+AK+  +KLFVLDPENAGRYVI
Subjt:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI

Query:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        LASM+EDAGQWEDAAKLRKLLRDR I+KPVGCSSIEIDRI+HVFGKEDESHPFTEQIFDTLEKL
Subjt:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

A0A6J1I919 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like1.7e-24788.58Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC
        R+IRSVQELHAQI+VEG DQNGFLATKLIGKYAE+G+ KMG+ARKVFDRL+E+DVFVWNVVI+GYANWGPF EALNL+DEMRV G PTNRYTFPFVLKAC
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKAC

Query:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL
        GAMKN +KGK+VHGHVLK GLDLDLFVGNALIAFY+KC D+ETARKVFDEM L+DIVSWNSMIAG+T NGKVDEAIMLFHAM+HNQ AC+PDNATLVGIL
Subjt:  GAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGIL

Query:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL
        PACV+KSA+QVGFWVHSYVIKTGM+VGAPLGSCLISMYANCGHVNIARDVF++I DKNVIVWSA+IR YGMHG ADEAL MFTSLEEVGLKPDGVIFLNL
Subjt:  PACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNL

Query:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI
        LSTCSHAGLV KGREIYEKME YG ERKEEHYACMVDLLGRAGF++QAV+FIEGMP+QAGKDVYGALLGACRIHNNIE+AKE  +KLFVLDPENAGRYVI
Subjt:  LSTCSHAGLVAKGREIYEKMETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVI

Query:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        LASM+EDAGQWEDAAKLRKLLRDR I+KPVGCSSIEIDRI+HVFGKEDESHPFTEQIFDTLEKL
Subjt:  LASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic1.9e-9937.14Show/hide
Query:  QELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGAMKNR
        Q LH   +   +  + F+A  LI  Y   GD  +  A KVF  + E+DV  WN +I G+   G  D+AL LF +M  +    +  T   VL AC  ++N 
Subjt:  QELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGAMKNR

Query:  DKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFD-------------------------------EMPLKDIVSWNSMIAGFTSNGKVDEA
        + G+ V  ++ ++ ++++L + NA++  Y KC  IE A+++FD                                MP KDIV+WN++I+ +  NGK +EA
Subjt:  DKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFD-------------------------------EMPLKDIVSWNSMIAGFTSNGKVDEA

Query:  IMLFHAMVHNQGACAPDNATLVGILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLA
        +++FH +   Q     +  TLV  L AC    A ++G W+HSY+ K G+++   + S LI MY+ CG +  +R+VFN ++ ++V VWSA+I    MHG  
Subjt:  IMLFHAMVHNQGACAPDNATLVGILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLA

Query:  DEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKMET-YGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIH
        +EA+ MF  ++E  +KP+GV F N+   CSH GLV +   ++ +ME+ YG+  +E+HYAC+VD+LGR+G++++AVKFIE MP+     V+GALLGAC+IH
Subjt:  DEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKMET-YGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIH

Query:  NNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
         N+ +A+ A  +L  L+P N G +V+L++++   G+WE+ ++LRK +R   +KK  GCSSIEID + H F   D +HP +E+++  L ++
Subjt:  NNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

P0C899 Putative pentatricopeptide repeat-containing protein At3g491427.6e-9635.73Show/hide
Query:  IRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA
        IR+++ +H++II+E L  N  L  KL+  YA   D  +  ARKVFD + ER+V + NV+IR Y N G + E + +F  M       + YTFP VLKAC  
Subjt:  IRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA

Query:  MKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAM-----VHNQGACA-------
              G+ +HG   K GL   LFVGN L++ Y KC  +  AR V DEM  +D+VSWNS++ G+  N + D+A+ +   M      H+ G  A       
Subjt:  MKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAM-----VHNQGACA-------

Query:  ------------------------------------------------------PDNATLVGILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLIS
                                                              PD  ++  +LPAC   SA  +G  +H Y+ +  +     L + LI 
Subjt:  ------------------------------------------------------PDNATLVGILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLIS

Query:  MYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKM-ETYGVERKEEHYACM
        MYA CG +  ARDVF  +  ++V+ W+A+I  YG  G   +A+ +F+ L++ GL PD + F+  L+ CSHAGL+ +GR  ++ M + Y +  + EH ACM
Subjt:  MYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKM-ETYGVERKEEHYACM

Query:  VDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSI
        VDLLGRAG V +A +FI+ M M+  + V+GALLGACR+H++ ++   AA+KLF L PE +G YV+L++++  AG+WE+   +R +++ + +KK  G S++
Subjt:  VDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSI

Query:  EIDRIHHVFGKEDESHPFTEQIFDTLEKL
        E++RI H F   D SHP +++I+  L+ L
Subjt:  EIDRIHHVFGKEDESHPFTEQIFDTLEKL

P93011 Pentatricopeptide repeat-containing protein At2g337601.1e-9137.04Show/hide
Query:  IRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA
        ++ +Q++HA +IV G  ++  L TKLI          +     +F  +   D F++N VI+  +        +  +  M       + YTF  V+K+C  
Subjt:  IRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA

Query:  MKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGILPA
        +     GK VH H + SG  LD +V  AL+ FY+KC D+E AR+VFD MP K IV+WNS+++GF  NG  DEAI +F+ M   +    PD+AT V +L A
Subjt:  MKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGILPA

Query:  CVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLE-EVGLKPDGVIFLNLL
        C    A  +G WVH Y+I  G+ +   LG+ LI++Y+ CG V  AR+VF+++ + NV  W+A+I  YG HG   +A+ +F  +E + G  P+ V F+ +L
Subjt:  CVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLE-EVGLKPDGVIFLNLL

Query:  STCSHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFI---EGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGR
        S C+HAGLV +GR +Y++M ++Y +    EH+ CMVD+LGRAGF+D+A KFI   +         ++ A+LGAC++H N ++  E A++L  L+P+N G 
Subjt:  STCSHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFI---EGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGR

Query:  YVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        +V+L++++  +G+ ++ + +R  +    ++K VG S IE++   ++F   DESH  T +I+  LE L
Subjt:  YVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic6.0e-9335.08Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSK-----------------------------MGVARKVFDRLVERDVFVWNVVIRGYANWGPF
        +  +  Q++H  ++  G D + ++ T LI  Y ++G  +                             +  A+K+FD +  +DV  WN +I GYA  G +
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSK-----------------------------MGVARKVFDRLVERDVFVWNVVIRGYANWGPF

Query:  DEALNLFDEMRVKGAPTNRYTFPFVLKACGAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGK
         EAL LF +M       +  T   V+ AC    + + G+ VH  +   G   +L + NALI  Y+KC ++ETA  +F+ +P KD++SWN++I G+T    
Subjt:  DEALNLFDEMRVKGAPTNRYTFPFVLKACGAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGK

Query:  VDEAIMLFHAMVHNQGACAPDNATLVGILPACVSKSAAQVGFWVHSYVIK--TGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCY
          EA++LF  M+  +    P++ T++ ILPAC    A  +G W+H Y+ K   G+   + L + LI MYA CG +  A  VFN I  K++  W+A+I  +
Subjt:  VDEAIMLFHAMVHNQGACAPDNATLVGILPACVSKSAAQVGFWVHSYVIK--TGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCY

Query:  GMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALL
         MHG AD +  +F+ + ++G++PD + F+ LLS CSH+G++  GR I+  M + Y +  K EHY CM+DLLG +G   +A + I  M M+    ++ +LL
Subjt:  GMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALL

Query:  GACRIHNNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
         AC++H N+E+ +  AE L  ++PEN G YV+L++++  AG+W + AK R LL D+ +KK  GCSSIEID + H F   D+ HP   +I+  LE++
Subjt:  GACRIHNNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic3.3e-9937.96Show/hide
Query:  LHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA----MK
        +H  I+  G DQ+ FLATKLIG Y++ G   +  ARKVFD+  +R ++VWN + R     G  +E L L+ +M   G  ++R+T+ +VLKAC A    + 
Subjt:  LHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA----MK

Query:  NRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGILPACV
        +  KGK +H H+ + G    +++   L+  YA+   ++ A  VF  MP++++VSW++MIA +  NGK  EA+  F  M+      +P++ T+V +L AC 
Subjt:  NRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGILPACV

Query:  SKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTC
        S +A + G  +H Y+++ G+    P+ S L++MY  CG + + + VF+++ D++V+ W+++I  YG+HG   +A+ +F  +   G  P  V F+++L  C
Subjt:  SKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTC

Query:  SHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVILAS
        SH GLV +G+ ++E M   +G++ + EHYACMVDLLGRA  +D+A K ++ M  + G  V+G+LLG+CRIH N+E+A+ A+ +LF L+P+NAG YV+LA 
Subjt:  SHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVILAS

Query:  MHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        ++ +A  W++  +++KLL  R ++K  G   +E+ R  + F   DE +P  EQI   L KL
Subjt:  MHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.3e-9435.08Show/hide
Query:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSK-----------------------------MGVARKVFDRLVERDVFVWNVVIRGYANWGPF
        +  +  Q++H  ++  G D + ++ T LI  Y ++G  +                             +  A+K+FD +  +DV  WN +I GYA  G +
Subjt:  RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSK-----------------------------MGVARKVFDRLVERDVFVWNVVIRGYANWGPF

Query:  DEALNLFDEMRVKGAPTNRYTFPFVLKACGAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGK
         EAL LF +M       +  T   V+ AC    + + G+ VH  +   G   +L + NALI  Y+KC ++ETA  +F+ +P KD++SWN++I G+T    
Subjt:  DEALNLFDEMRVKGAPTNRYTFPFVLKACGAMKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGK

Query:  VDEAIMLFHAMVHNQGACAPDNATLVGILPACVSKSAAQVGFWVHSYVIK--TGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCY
          EA++LF  M+  +    P++ T++ ILPAC    A  +G W+H Y+ K   G+   + L + LI MYA CG +  A  VFN I  K++  W+A+I  +
Subjt:  VDEAIMLFHAMVHNQGACAPDNATLVGILPACVSKSAAQVGFWVHSYVIK--TGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCY

Query:  GMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALL
         MHG AD +  +F+ + ++G++PD + F+ LLS CSH+G++  GR I+  M + Y +  K EHY CM+DLLG +G   +A + I  M M+    ++ +LL
Subjt:  GMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALL

Query:  GACRIHNNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
         AC++H N+E+ +  AE L  ++PEN G YV+L++++  AG+W + AK R LL D+ +KK  GCSSIEID + H F   D+ HP   +I+  LE++
Subjt:  GACRIHNNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-10037.14Show/hide
Query:  QELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGAMKNR
        Q LH   +   +  + F+A  LI  Y   GD  +  A KVF  + E+DV  WN +I G+   G  D+AL LF +M  +    +  T   VL AC  ++N 
Subjt:  QELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGAMKNR

Query:  DKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFD-------------------------------EMPLKDIVSWNSMIAGFTSNGKVDEA
        + G+ V  ++ ++ ++++L + NA++  Y KC  IE A+++FD                                MP KDIV+WN++I+ +  NGK +EA
Subjt:  DKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFD-------------------------------EMPLKDIVSWNSMIAGFTSNGKVDEA

Query:  IMLFHAMVHNQGACAPDNATLVGILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLA
        +++FH +   Q     +  TLV  L AC    A ++G W+HSY+ K G+++   + S LI MY+ CG +  +R+VFN ++ ++V VWSA+I    MHG  
Subjt:  IMLFHAMVHNQGACAPDNATLVGILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLA

Query:  DEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKMET-YGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIH
        +EA+ MF  ++E  +KP+GV F N+   CSH GLV +   ++ +ME+ YG+  +E+HYAC+VD+LGR+G++++AVKFIE MP+     V+GALLGAC+IH
Subjt:  DEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKMET-YGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIH

Query:  NNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
         N+ +A+ A  +L  L+P N G +V+L++++   G+WE+ ++LRK +R   +KK  GCSSIEID + H F   D +HP +E+++  L ++
Subjt:  NNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

AT2G33760.1 Pentatricopeptide repeat (PPR) superfamily protein8.0e-9337.04Show/hide
Query:  IRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA
        ++ +Q++HA +IV G  ++  L TKLI          +     +F  +   D F++N VI+  +        +  +  M       + YTF  V+K+C  
Subjt:  IRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA

Query:  MKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGILPA
        +     GK VH H + SG  LD +V  AL+ FY+KC D+E AR+VFD MP K IV+WNS+++GF  NG  DEAI +F+ M   +    PD+AT V +L A
Subjt:  MKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGILPA

Query:  CVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLE-EVGLKPDGVIFLNLL
        C    A  +G WVH Y+I  G+ +   LG+ LI++Y+ CG V  AR+VF+++ + NV  W+A+I  YG HG   +A+ +F  +E + G  P+ V F+ +L
Subjt:  CVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLE-EVGLKPDGVIFLNLL

Query:  STCSHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFI---EGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGR
        S C+HAGLV +GR +Y++M ++Y +    EH+ CMVD+LGRAGF+D+A KFI   +         ++ A+LGAC++H N ++  E A++L  L+P+N G 
Subjt:  STCSHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFI---EGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGR

Query:  YVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        +V+L++++  +G+ ++ + +R  +    ++K VG S IE++   ++F   DESH  T +I+  LE L
Subjt:  YVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.3e-10037.96Show/hide
Query:  LHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA----MK
        +H  I+  G DQ+ FLATKLIG Y++ G   +  ARKVFD+  +R ++VWN + R     G  +E L L+ +M   G  ++R+T+ +VLKAC A    + 
Subjt:  LHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA----MK

Query:  NRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGILPACV
        +  KGK +H H+ + G    +++   L+  YA+   ++ A  VF  MP++++VSW++MIA +  NGK  EA+  F  M+      +P++ T+V +L AC 
Subjt:  NRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGILPACV

Query:  SKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTC
        S +A + G  +H Y+++ G+    P+ S L++MY  CG + + + VF+++ D++V+ W+++I  YG+HG   +A+ +F  +   G  P  V F+++L  C
Subjt:  SKSAAQVGFWVHSYVIKTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTC

Query:  SHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVILAS
        SH GLV +G+ ++E M   +G++ + EHYACMVDLLGRA  +D+A K ++ M  + G  V+G+LLG+CRIH N+E+A+ A+ +LF L+P+NAG YV+LA 
Subjt:  SHAGLVAKGREIYEKM-ETYGVERKEEHYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVILAS

Query:  MHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL
        ++ +A  W++  +++KLL  R ++K  G   +E+ R  + F   DE +P  EQI   L KL
Subjt:  MHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLEKL

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.4e-9735.73Show/hide
Query:  IRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA
        IR+++ +H++II+E L  N  L  KL+  YA   D  +  ARKVFD + ER+V + NV+IR Y N G + E + +F  M       + YTFP VLKAC  
Subjt:  IRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGA

Query:  MKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAM-----VHNQGACA-------
              G+ +HG   K GL   LFVGN L++ Y KC  +  AR V DEM  +D+VSWNS++ G+  N + D+A+ +   M      H+ G  A       
Subjt:  MKNRDKGKVVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAM-----VHNQGACA-------

Query:  ------------------------------------------------------PDNATLVGILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLIS
                                                              PD  ++  +LPAC   SA  +G  +H Y+ +  +     L + LI 
Subjt:  ------------------------------------------------------PDNATLVGILPACVSKSAAQVGFWVHSYVIKTGMQVGAPLGSCLIS

Query:  MYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKM-ETYGVERKEEHYACM
        MYA CG +  ARDVF  +  ++V+ W+A+I  YG  G   +A+ +F+ L++ GL PD + F+  L+ CSHAGL+ +GR  ++ M + Y +  + EH ACM
Subjt:  MYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKM-ETYGVERKEEHYACM

Query:  VDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSI
        VDLLGRAG V +A +FI+ M M+  + V+GALLGACR+H++ ++   AA+KLF L PE +G YV+L++++  AG+WE+   +R +++ + +KK  G S++
Subjt:  VDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSI

Query:  EIDRIHHVFGKEDESHPFTEQIFDTLEKL
        E++RI H F   D SHP +++I+  L+ L
Subjt:  EIDRIHHVFGKEDESHPFTEQIFDTLEKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGAACCATCAGGAGCGTTCAAGAACTACACGCCCAAATCATCGTTGAAGGTCTCGACCAAAACGGATTCCTAGCCACGAAATTAATCGGCAAATACGCCGAGCATGGCGA
CTCGAAAATGGGAGTTGCACGGAAGGTGTTCGACAGATTGGTTGAGAGAGATGTGTTCGTGTGGAACGTGGTGATTCGAGGGTACGCCAATTGGGGACCGTTTGATGAAG
CCCTCAACCTGTTTGATGAAATGCGAGTCAAGGGCGCACCCACCAATCGTTACACATTCCCTTTTGTGTTGAAGGCATGTGGCGCTATGAAGAACAGGGACAAGGGGAAG
GTTGTTCATGGACACGTTTTGAAATCGGGGTTGGACTTGGATTTGTTCGTGGGCAATGCTCTGATCGCGTTTTATGCCAAGTGCCTGGACATTGAAACAGCTAGGAAGGT
GTTTGATGAAATGCCTCTGAAAGATATTGTTAGTTGGAACTCTATGATTGCTGGGTTTACTTCAAATGGGAAAGTGGATGAAGCCATCATGCTTTTTCATGCCATGGTGC
ATAATCAAGGTGCTTGCGCACCTGATAATGCTACTCTTGTTGGGATTCTGCCTGCTTGTGTTTCAAAATCTGCCGCCCAGGTCGGCTTCTGGGTCCATTCCTATGTTATA
AAGACAGGAATGCAAGTTGGTGCTCCTTTGGGTAGTTGTCTCATTTCAATGTATGCCAACTGTGGTCATGTGAACATTGCTAGAGATGTTTTCAACCAAATCGACGACAA
AAACGTCATCGTATGGAGCGCGGTTATCAGGTGTTACGGAATGCATGGTCTTGCTGATGAGGCACTAACCATGTTCACTAGTTTGGAAGAAGTTGGCCTGAAACCAGATG
GTGTGATCTTCTTGAATCTGTTATCGACGTGTAGTCATGCAGGGCTCGTCGCTAAAGGCCGCGAGATATACGAAAAAATGGAGACTTACGGTGTGGAAAGGAAAGAGGAA
CATTATGCGTGCATGGTGGATCTCTTAGGTAGGGCTGGTTTTGTTGATCAAGCAGTCAAGTTCATTGAGGGCATGCCAATGCAGGCGGGGAAAGATGTGTATGGCGCGTT
GCTTGGCGCTTGTCGAATACATAACAACATAGAAGTAGCCAAAGAAGCTGCGGAGAAGCTGTTCGTTCTGGATCCCGAAAACGCGGGGCGATACGTGATCTTGGCTAGTA
TGCATGAAGATGCAGGACAGTGGGAAGATGCTGCAAAACTAAGGAAGTTGCTGAGAGATAGGAAGATTAAGAAGCCAGTTGGTTGCAGTTCAATTGAGATAGATAGGATC
CATCATGTGTTTGGGAAGGAGGATGAATCTCACCCCTTCACAGAACAAATTTTTGACACATTGGAGAAACTA
mRNA sequenceShow/hide mRNA sequence
AGAACCATCAGGAGCGTTCAAGAACTACACGCCCAAATCATCGTTGAAGGTCTCGACCAAAACGGATTCCTAGCCACGAAATTAATCGGCAAATACGCCGAGCATGGCGA
CTCGAAAATGGGAGTTGCACGGAAGGTGTTCGACAGATTGGTTGAGAGAGATGTGTTCGTGTGGAACGTGGTGATTCGAGGGTACGCCAATTGGGGACCGTTTGATGAAG
CCCTCAACCTGTTTGATGAAATGCGAGTCAAGGGCGCACCCACCAATCGTTACACATTCCCTTTTGTGTTGAAGGCATGTGGCGCTATGAAGAACAGGGACAAGGGGAAG
GTTGTTCATGGACACGTTTTGAAATCGGGGTTGGACTTGGATTTGTTCGTGGGCAATGCTCTGATCGCGTTTTATGCCAAGTGCCTGGACATTGAAACAGCTAGGAAGGT
GTTTGATGAAATGCCTCTGAAAGATATTGTTAGTTGGAACTCTATGATTGCTGGGTTTACTTCAAATGGGAAAGTGGATGAAGCCATCATGCTTTTTCATGCCATGGTGC
ATAATCAAGGTGCTTGCGCACCTGATAATGCTACTCTTGTTGGGATTCTGCCTGCTTGTGTTTCAAAATCTGCCGCCCAGGTCGGCTTCTGGGTCCATTCCTATGTTATA
AAGACAGGAATGCAAGTTGGTGCTCCTTTGGGTAGTTGTCTCATTTCAATGTATGCCAACTGTGGTCATGTGAACATTGCTAGAGATGTTTTCAACCAAATCGACGACAA
AAACGTCATCGTATGGAGCGCGGTTATCAGGTGTTACGGAATGCATGGTCTTGCTGATGAGGCACTAACCATGTTCACTAGTTTGGAAGAAGTTGGCCTGAAACCAGATG
GTGTGATCTTCTTGAATCTGTTATCGACGTGTAGTCATGCAGGGCTCGTCGCTAAAGGCCGCGAGATATACGAAAAAATGGAGACTTACGGTGTGGAAAGGAAAGAGGAA
CATTATGCGTGCATGGTGGATCTCTTAGGTAGGGCTGGTTTTGTTGATCAAGCAGTCAAGTTCATTGAGGGCATGCCAATGCAGGCGGGGAAAGATGTGTATGGCGCGTT
GCTTGGCGCTTGTCGAATACATAACAACATAGAAGTAGCCAAAGAAGCTGCGGAGAAGCTGTTCGTTCTGGATCCCGAAAACGCGGGGCGATACGTGATCTTGGCTAGTA
TGCATGAAGATGCAGGACAGTGGGAAGATGCTGCAAAACTAAGGAAGTTGCTGAGAGATAGGAAGATTAAGAAGCCAGTTGGTTGCAGTTCAATTGAGATAGATAGGATC
CATCATGTGTTTGGGAAGGAGGATGAATCTCACCCCTTCACAGAACAAATTTTTGACACATTGGAGAAACTA
Protein sequenceShow/hide protein sequence
RTIRSVQELHAQIIVEGLDQNGFLATKLIGKYAEHGDSKMGVARKVFDRLVERDVFVWNVVIRGYANWGPFDEALNLFDEMRVKGAPTNRYTFPFVLKACGAMKNRDKGK
VVHGHVLKSGLDLDLFVGNALIAFYAKCLDIETARKVFDEMPLKDIVSWNSMIAGFTSNGKVDEAIMLFHAMVHNQGACAPDNATLVGILPACVSKSAAQVGFWVHSYVI
KTGMQVGAPLGSCLISMYANCGHVNIARDVFNQIDDKNVIVWSAVIRCYGMHGLADEALTMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGREIYEKMETYGVERKEE
HYACMVDLLGRAGFVDQAVKFIEGMPMQAGKDVYGALLGACRIHNNIEVAKEAAEKLFVLDPENAGRYVILASMHEDAGQWEDAAKLRKLLRDRKIKKPVGCSSIEIDRI
HHVFGKEDESHPFTEQIFDTLEKL