CuGenDBv2

Spg007406 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg007406
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold7:20234133..20237938
RNA-Seq Expression: Spg007406
Synteny: Spg007406
Gene Ontology terms:
GO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
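
The genome location in this record follows a `sequence:start..end` pattern. As a minimal sketch, the Python below splits such a string into its parts and derives the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `GeneLocation` and `parse_location` are hypothetical names introduced only for this illustration and are not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GeneLocation(NamedTuple):
    """A parsed location of the form 'seqid:start..end' (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> GeneLocation:
    """Split a 'seqid:start..end' string into its three components."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GeneLocation(seqid, int(start), int(end))


loc = parse_location("scaffold7:20234133..20237938")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span reported for Spg007406 covers 3806 bases of scaffold7.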


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601646.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]; e value: 0.0e+00; % identity: 89.25
Query:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV
        MLQRFFS PR FSRLAGPLL +GHH+  ST+A  P LS+LD+TMASV FISL+SCFY MGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGAL DV
Subjt:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV

Query:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI
         SARMVFDQM DPDFYAWKVMIRWYFLNDMFA+IIPFYNRMRMSFK+CDNIIFSIILKACSE REIDEGRKVHCQIVKVGGPDSFVLTGLIDMY KCGQI
Subjt:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI

Query:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNV-VELSSFLATAFLDMYVKCG
        ECSS+VFEGIIDKNVVSWT+MIAGYVQNDCAEEGLVLFNRMRE+LVESNQFTLGSIITAC RLRALHQGKWVHGYAIKNV +EL+SFLATAFLDMYVKCG
Subjt:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNV-VELSSFLATAFLDMYVKCG

Query:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH
        QTRDARMIFDELP+IDLVSWTAMIVGY+QAGQP++ALRLF D+IRS LLPNSVTAAS+LS+CSVSGNLS+GM VHGLGIKLGLEECAVKNALIDMYAKCH
Subjt:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH

Query:  MIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK
        MIDDAYV+F G+LEKDVITWNSMISGYAQ+GSAY+ALRLFNQM+S+S APDAITLVS LSASA LGAVQVGSSLHAYSIKEGLFSSNLYIGTALLN YAK
Subjt:  MIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK

Query:  CGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLL
        CGDAK ARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKED KPN+VIFTTVLSACSYSGMVEEGWRYFKSMSQ+YN+ PSMKHYACMVDLL
Subjt:  CGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLL

Query:  ARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNA
        +RSGRL+EA +FIKKMPVQPDISLYGAFLHGCGLYSRFDLGEV+VRE+L LHPNEAC+YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPG+SLVETNA
Subjt:  ARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNA

Query:  GCLFEVF
        G + + +
Subjt:  GCLFEVF

KAG7032406.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 0.0e+00; % identity: 90.06
Query:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV
        MLQRFFS PR FSRLAGPLL +GHH+S ST+A  P LS+LD+TMASV FISL+SCFY MGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGAL DV
Subjt:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV

Query:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI
         SARMVFDQM DPDFYAWKVMIRWYFLNDMFA+IIPFYNRMRMSFK+CDNIIFSIILKACSE REIDEGRKVHCQIVKVGGPDSFVLTGLIDMY KCGQI
Subjt:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI

Query:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNV-VELSSFLATAFLDMYVKCG
        ECSS+VFEGIIDKNVVSWT+MIAGYVQNDCAEEGLVLFNRMRE+LVESNQFTLGSIITAC RLRALHQGKWVHGYAIKNV +EL+SFLATAFLDMYVKCG
Subjt:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNV-VELSSFLATAFLDMYVKCG

Query:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH
        QTRDARMIFDELP+IDLVSWTAMIVGY+QAGQP++ALRLF D+IRS LLPNSVTAAS+LS+CSVSGNLS+GM VHGLGIKLGLEECAVKNALIDMYAKCH
Subjt:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH

Query:  MIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK
        MIDDAYV+F G+LEKDVITWNSMISGYAQ+GSAY+ALRLFNQM+S+S APDAITLVS LSASA LGAVQVGSSLHAYSIKEGLFSSNLYIGTALLN YAK
Subjt:  MIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK

Query:  CGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLL
        CGDAK ARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKED KPN+VIFTTVLSACSYSGMVEEGWRYFKSMSQ+YN+APSMKHYACMVDLL
Subjt:  CGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLL

Query:  ARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNA
        +RSGRL+EA +FIKKMPVQPDISLYGAFLHGCGLYSRFDLGEV+VRE+L LHPNEAC+YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPG+SLVETNA
Subjt:  ARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNA

Query:  GCLF
        G  F
Subjt:  GCLF

XP_022957571.1 pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Cucurbita moschata]; e value: 0.0e+00; % identity: 90.2
Query:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV
        MLQRFFS PR FSRLAGPLL +GHH+S ST+A  P LS+LD+TMASV FISL+SCFY MGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGAL DV
Subjt:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV

Query:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI
         SARMVFDQM DPDFYAWKVMIRWYFLNDMFA+IIPFYNRMRMSFK+CDNIIFSIILKACSE REIDEGRKVHCQIVKVGGPDSFVLTGLIDMY KCGQI
Subjt:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI

Query:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVV-ELSSFLATAFLDMYVKCG
        ECSS+VFEGIIDKNVVSWT+MIAGYVQNDCAEEGLVLFNRMRE+LVESNQFTLGSIITAC RLRALHQGKWVHGYAIKNV+ EL+SFLATAFLDMYVKCG
Subjt:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVV-ELSSFLATAFLDMYVKCG

Query:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH
        QTRDARMIFDELP+IDLVSWTAMIVGY+QAGQP+EALRLF D+IRS LLPNSVTAAS+LS+CSVSGNLS+GM VHGLGIKLGLEECAVKNALIDMYAKCH
Subjt:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH

Query:  MIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK
        MIDDAYV+F G+LEKDVITWNSMISGYAQ+GSAY+ALRLFNQMRS+S APDAITLVS LSASA LGAVQVGSSLH YSIKEGLFSSNLYIGTALLN YAK
Subjt:  MIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK

Query:  CGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLL
        CGDAK ARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKED KPN+VIFTTVLSACSYSGMVEEGWRYFKSMSQ+YN+APSMKHYACMVDLL
Subjt:  CGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLL

Query:  ARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNA
        +RSGRL+EA DFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEV+VRE+L LHPNEA  YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPG+SLVETNA
Subjt:  ARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNA

Query:  GCLF
        G  F
Subjt:  GCLF

XP_022997487.1 pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Cucurbita maxima]; e value: 0.0e+00; % identity: 90.47
Query:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV
        MLQRFFS PR FSRLAGPLL +GHH+S ST+A  P LS+LD+TMASV FISL+SCFY MGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGAL DV
Subjt:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV

Query:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI
         SARMVFDQM DPDFYAWKVMIRWYFLNDMFA+IIPFYNRMRMSF++CDNIIFSIILKAC E REIDEGRKVHCQIVKVGGPDSFVLTGLIDMY KCGQI
Subjt:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI

Query:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ
        ECSS+VFEGIIDKNVVSWT+MIAGYVQNDCAEEGLVLFNRMRE+LVESNQFTLGSIITAC RLRALHQGKWVHGYAIKNV+ELSSFLATAFLDMYVKCGQ
Subjt:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        TRDARMIFDELPTIDLVSWTAMIVGYTQAGQP+EAL LF D+IRS LLPNSVTAAS+LS+CSVSGNLS+GMSVHGLGIKLGL ECAVKNALIDMYAKCHM
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        IDDAYV+F G+LEKDVITWNSMISGY QNGSAY+ALRLFNQMRS+S APDAITLVSTLSASA LGAVQ+GSSLHAYSIKEGLFSSNLYIGTALLN YAKC
Subjt:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA
        GDAK ARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKED KPN+VIFTTVLSACSYSGMVEEGW+YFKSMSQ+YN+APSMKHYACMVDLL+
Subjt:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA

Query:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAG
        RSGRL+EA DFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEV+VRE+L LHPNEAC YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPG+SLVETNAG
Subjt:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAG

Query:  CLF
          F
Subjt:  CLF

XP_038893238.1 pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Benincasa hispida]; e value: 0.0e+00; % identity: 89.9
Query:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV
        MLQRFF+ PR FSRLAGPLL +GHHMS   +A  P LS+L +TMAS   ISL+SCFYL+GLC+NIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGAL DV
Subjt:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV

Query:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI
         SARMVFDQMPDPDFYAWKVMIRWYFLND+F DIIPFYNRMRMSF++CD IIFSIILKACSE REIDEGRKVHCQIVKVGGPDSFVLTGLIDMY KCGQI
Subjt:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI

Query:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ
        E SS VFE I+DKNVVSWTSMIAGYVQN+CAEEGLVLFNRMREALVESN FTLGSIITAC +LRALHQGKWVHGYAIKN++ELSSFLATAFLDMYVKCGQ
Subjt:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        T+DARMIFDELPTIDLVSWTAM+VGYTQ GQP+EALRLFADEIRS LLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMY+KCHM
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        IDDAYVIF G+LEKDVITWNSMISGYAQNGSAYE+LRLFN MRS+  APDAITLVSTLSASAT+GAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
Subjt:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA
        GDAK ARMVFD MGDKNIITWSAMIGGYGVQGDGSGSL++FSDMLKED KPNEVIFTTVLSACSYSGMVEEGWRYFKSM Q+YNF PSMKHYACMVDLLA
Subjt:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA

Query:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAG
        RSGRLDEA DFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVRE+L+LHPNEACYYVLLSNLYASDGRWGQVNEVR+LMLQRGLNKVPG+S+VE+N G
Subjt:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAG

Query:  CLF
         LF
Subjt:  CLF

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E1F1 pentatricopeptide repeat-containing protein At2g03380, mitochondrial; e value: 0.0e+00; % identity: 89.05
Query:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV
        MLQRFFS PR FS L G LL +GH MS ST+A  P LS+L +TM SV FISL+SC YLMGL RNIDTLIKFHGLLIVHGL+GNLLCDTKLVGVYGAL DV
Subjt:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV

Query:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI
         SARMVFDQMPDPDFYAWKVMIRWYFLND+F D+IPFYN MRMSF++CDNIIFSIILKACSE REIDEGRKVHCQIVKVGGPDSFV+TGLIDMY KC Q+
Subjt:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI

Query:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ
        ECSS+VFE I+DKNVVSWTSMIAGYVQN+CAEEGLVLFNRMR+ALVESN FTLGSII AC +LRALHQGKWVHGYAIKN+VE SSFLAT FLDMYVKCGQ
Subjt:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        TRDARMIFDELPTIDLVSWTAMIVGYTQA QP++ LRLFADEIRSDLLPNSVTAASVLS+CSVSGNL+LGMSVHGLGIKLGLEECAVKNALIDMYAKCH 
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        I DAYVIFHG+LEKDVITWNSMISGYAQNGSAY+ALRLFNQMRS S APDAITLVSTLSASATLGAVQVGSSLHAYS+KEGLFSSNLYIGTALLNFYAKC
Subjt:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA
        GDAK AR VFDSMGDKNIITWSAMIGGYGVQGDGSGSL+IFSDMLKED KPNEVIFTT+LSACS SGMVEEGWRYFKSM Q+YNF PSMKHYACMVDLLA
Subjt:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA

Query:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAG
        RSGRLDEA DFIKKMPVQPD+SLYGAFLHGCGLYSRFDLGEVVVRE+L+LHPNEACYYVLLSNLYASDG+WGQVNEVR+LMLQRGLNKVPG+SLVETNAG
Subjt:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAG

Query:  CLF
         LF
Subjt:  CLF

A0A5D3DQT2 Pentatricopeptide repeat-containing protein; e value: 0.0e+00; % identity: 89.13
Query:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV
        MLQRFFS PR FS L G LL +GH MS ST+A  P LS+L +TM SV FISL+SC YLMGL RNIDTLIKFHGLLIVHGL+GNLLCDTKLVGVYGAL DV
Subjt:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV

Query:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI
         SARMVFDQMPDPDFYAWKVMIRWYFLND+F D+IPFYN MRMSF++CDNIIFSIILKACSE REIDEGRKVHCQIVKVGGPDSFV+TGLIDMY KC Q+
Subjt:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI

Query:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ
        ECSS+VFE I+DKNVVSWTSMIAGYVQN+CAEEGLVLFNRMR+ALVESN FTLGSII AC +LRALHQGKWVHGYAIKN+VE SSFLAT FLDMYVKCGQ
Subjt:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        TRDARMIFDELPTIDLVSWTAMIVGYTQA QP++ LRLFADEIRSDLLPNSVTAASVLS+CSVSGNL+LGMSVHGLGIKLGLEECAVKNALIDMYAKCH 
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        I DAYVIFHG+LEKDVITWNSMISGYAQNGSAY+ALRLFNQMRS S APDAITLVSTLSASATLGAVQVGSSLHAYS+KEGLFSSNLYIGTALLNFYAKC
Subjt:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA
        GDAK AR VFDSMGDKNIITWSAMIGGYGVQGDGSGSL+IFSDMLKED KPNEVIFTT+LSACS SGMVEEGWRYFKSM Q+YNF PSMKHYACMVDLLA
Subjt:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA

Query:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNA
        RSGRLDEA DFIKKMPVQPD+SLYGAFLHGCGLYSRFDLGEVVVRE+L+LHPNEACYYVLLSNLYASDG+WGQVNEVR+LMLQRGLNKVPG+SLVETNA
Subjt:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNA

A0A6J1DCB9 pentatricopeptide repeat-containing protein At2g03380, mitochondrial isoform X2; e value: 0.0e+00; % identity: 89.47
Query:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV
        MLQRF S  R+FSRLAG L+ +G +M CST+ P PQLSELD+TMASV FISLNS FYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGAL DV
Subjt:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV

Query:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI
         SAR+VFDQMPD DFYAWKVMIRWYFLNDMFADIIPFYN MRMSF + DNIIFSIILKACSE REI EGRKVH QIVKVGGPDSFVLTGL+DMYAKCGQI
Subjt:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI

Query:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ
        ECS +VFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMR ALVESNQFTLGSIITAC +LRALHQGKWVHGYAIKNVV+LS+FL T FLDMYVKCGQ
Subjt:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        TRDAR++FDELPTID VSWTAMIVGYTQAGQP+EALRLFA EIRSDLLPNSVTAA+VLSSCSVSGNL+LGMSVHGLGIKLGLEEC VKNALIDMYAKCHM
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        ID+AYVIFHG+LEKDVITWNSMISGYAQNG+A EAL LFNQMRS+S+APDAITLVS LSASATLGAV VGSSLHAYSIK GLFS NLYIGT LLNFYAKC
Subjt:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA
        GDAK ARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKED KPNEVIFTTVLSACSYSGMV EGWRYFKSMSQ+YNF PSMKHYACMVDLLA
Subjt:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA

Query:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAG
        RSGRLDEA DFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVV+E+L+LHP+EACYYVLLSNLYASDGRWGQVN+VRELM QRGLNK PG+SLVET+AG
Subjt:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAG

Query:  CLF
         LF
Subjt:  CLF

A0A6J1GZK8 pentatricopeptide repeat-containing protein At2g03380, mitochondrial; e value: 0.0e+00; % identity: 90.2
Query:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV
        MLQRFFS PR FSRLAGPLL +GHH+S ST+A  P LS+LD+TMASV FISL+SCFY MGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGAL DV
Subjt:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV

Query:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI
         SARMVFDQM DPDFYAWKVMIRWYFLNDMFA+IIPFYNRMRMSFK+CDNIIFSIILKACSE REIDEGRKVHCQIVKVGGPDSFVLTGLIDMY KCGQI
Subjt:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI

Query:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVV-ELSSFLATAFLDMYVKCG
        ECSS+VFEGIIDKNVVSWT+MIAGYVQNDCAEEGLVLFNRMRE+LVESNQFTLGSIITAC RLRALHQGKWVHGYAIKNV+ EL+SFLATAFLDMYVKCG
Subjt:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVV-ELSSFLATAFLDMYVKCG

Query:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH
        QTRDARMIFDELP+IDLVSWTAMIVGY+QAGQP+EALRLF D+IRS LLPNSVTAAS+LS+CSVSGNLS+GM VHGLGIKLGLEECAVKNALIDMYAKCH
Subjt:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH

Query:  MIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK
        MIDDAYV+F G+LEKDVITWNSMISGYAQ+GSAY+ALRLFNQMRS+S APDAITLVS LSASA LGAVQVGSSLH YSIKEGLFSSNLYIGTALLN YAK
Subjt:  MIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK

Query:  CGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLL
        CGDAK ARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKED KPN+VIFTTVLSACSYSGMVEEGWRYFKSMSQ+YN+APSMKHYACMVDLL
Subjt:  CGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLL

Query:  ARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNA
        +RSGRL+EA DFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEV+VRE+L LHPNEA  YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPG+SLVETNA
Subjt:  ARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNA

Query:  GCLF
        G  F
Subjt:  GCLF

A0A6J1KBI2 pentatricopeptide repeat-containing protein At2g03380, mitochondrial; e value: 0.0e+00; % identity: 90.47
Query:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV
        MLQRFFS PR FSRLAGPLL +GHH+S ST+A  P LS+LD+TMASV FISL+SCFY MGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGAL DV
Subjt:  MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDV

Query:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI
         SARMVFDQM DPDFYAWKVMIRWYFLNDMFA+IIPFYNRMRMSF++CDNIIFSIILKAC E REIDEGRKVHCQIVKVGGPDSFVLTGLIDMY KCGQI
Subjt:  ASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQI

Query:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ
        ECSS+VFEGIIDKNVVSWT+MIAGYVQNDCAEEGLVLFNRMRE+LVESNQFTLGSIITAC RLRALHQGKWVHGYAIKNV+ELSSFLATAFLDMYVKCGQ
Subjt:  ECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        TRDARMIFDELPTIDLVSWTAMIVGYTQAGQP+EAL LF D+IRS LLPNSVTAAS+LS+CSVSGNLS+GMSVHGLGIKLGL ECAVKNALIDMYAKCHM
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        IDDAYV+F G+LEKDVITWNSMISGY QNGSAY+ALRLFNQMRS+S APDAITLVSTLSASA LGAVQ+GSSLHAYSIKEGLFSSNLYIGTALLN YAKC
Subjt:  IDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA
        GDAK ARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKED KPN+VIFTTVLSACSYSGMVEEGW+YFKSMSQ+YN+APSMKHYACMVDLL+
Subjt:  GDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLA

Query:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAG
        RSGRL+EA DFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEV+VRE+L LHPNEAC YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPG+SLVETNAG
Subjt:  RSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAG

Query:  CLF
          F
Subjt:  CLF

SwissProt top hits (e value, % identity, alignment)
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic; e value: 6.9e-124; % identity: 36.39
Query:  LMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNII--FSI
        L+  C ++  L +   L+  +GL       TKLV ++     V  A  VF+ +       +  M++ +         + F+ RMR  + D + ++  F+ 
Subjt:  LMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNII--FSI

Query:  ILKACSEFREIDEGRKVHCQIVKVG-GPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLG
        +LK C +  E+  G+++H  +VK G   D F +TGL +MYAKC Q+  +  VF+ + ++++VSW +++AGY QN  A   L +   M E  ++ +  T+ 
Subjt:  ILKACSEFREIDEGRKVHCQIVKVG-GPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLG

Query:  SIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTA
        S++ A + LR +  GK +HGYA+++  +    ++TA +DMY KCG    AR +FD +   ++VSW +MI  Y Q   P EA+ +F   +   + P  V+ 
Subjt:  SIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTA

Query:  ASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAIT
           L +C+  G+L  G  +H L ++LGL+   +V N+LI MY KC  +D A  +F  +  + +++WN+MI G+AQNG   +AL  F+QMRS +  PD  T
Subjt:  ASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAIT

Query:  LVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNE
         VS ++A A L        +H   ++  L   N+++ TAL++ YAKCG    AR++FD M ++++ TW+AMI GYG  G G  +L +F +M K   KPN 
Subjt:  LVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNE

Query:  VIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPN
        V F +V+SACS+SG+VE G + F  M + Y+   SM HY  MVDLL R+GRL+EA+DFI +MPV+P +++YGA L  C ++   +  E     +  L+P+
Subjt:  VIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPN

Query:  EACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVE
        +  Y+VLL+N+Y +   W +V +VR  ML++GL K PG S+VE
Subjt:  EACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVE

Q9C507 Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial; e value: 1.1e-116; % identity: 33.55
Query:  KFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEG
        K HG +I  G+  + + +T L+ +YG   +++ A  VFD MP  D  AW  ++     N      +  +  M     + D +    +++ C+E   +   
Subjt:  KFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEG

Query:  RKVHCQIV-KVGGPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQ
        R VH QI  K+   D  +   L+ MY+KCG +  S  +FE I  KN VSWT+MI+ Y + + +E+ L  F+ M ++ +E N  TL S++++C  +  + +
Subjt:  RKVHCQIV-KVGGPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQ

Query:  GKWVHGYAIKNVVELS-SFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNL
        GK VHG+A++  ++ +   L+ A +++Y +CG+  D   +   +   ++V+W ++I  Y   G   +AL LF   +   + P++ T AS +S+C  +G +
Subjt:  GKWVHGYAIKNVVELS-SFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNL

Query:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAV
         LG  +HG  I+  + +  V+N+LIDMY+K   +D A  +F+ I  + V+TWNSM+ G++QNG++ EA+ LF+ M  +    + +T ++ + A +++G++
Subjt:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAV

Query:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSG
        + G  +H   I  GL   +L+  TAL++ YAKCGD   A  VF +M  ++I++WS+MI  YG+ G    +++ F+ M++   KPNEV+F  VLSAC +SG
Subjt:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSG

Query:  MVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYAS
         VEEG +Y+ ++ + +  +P+ +H+AC +DLL+RSG L EA+  IK+MP   D S++G+ ++GC ++ + D+ + +  ++  +  ++  YY LLSN+YA 
Subjt:  MVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYAS

Query:  DGRWGQVNEVRELMLQRGLNKVPGFSLVE
        +G W +   +R  M    L KVPG+S +E
Subjt:  DGRWGQVNEVRELMLQRGLNKVPGFSLVE

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic; e value: 1.3e-111; % identity: 33.09
Query:  DVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMR-MSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGP-DSFVLTGLIDMYAK
        ++  A  VF +M + + ++W V++  Y     F + +  Y+RM  +     D   F  +L+ C    ++  G++VH  +V+ G   D  V+  LI MY K
Subjt:  DVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMR-MSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGP-DSFVLTGLIDMYAK

Query:  CGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYV
        CG ++ +  +F+ +  ++++SW +MI+GY +N    EGL LF  MR   V+ +  TL S+I+AC  L     G+ +H Y I     +   +  +   MY+
Subjt:  CGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYV

Query:  KCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGL-EECAVKNALIDMY
          G  R+A  +F  +   D+VSWT MI GY     P +A+  +    +  + P+ +T A+VLS+C+  G+L  G+ +H L IK  L     V N LI+MY
Subjt:  KCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGL-EECAVKNALIDMY

Query:  AKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLN
        +KC  ID A  IFH I  K+VI+W S+I+G   N   +EAL    QM+  +  P+AITL + L+A A +GA+  G  +HA+ ++ G+   + ++  ALL+
Subjt:  AKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLN

Query:  FYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACM
         Y +CG    A   F+S   K++ +W+ ++ GY  +G GS  + +F  M+K   +P+E+ F ++L  CS S MV +G  YF  M ++Y   P++KHYAC+
Subjt:  FYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACM

Query:  VDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLV
        VDLL R+G L EA  FI+KMPV PD +++GA L+ C ++ + DLGE+  + I  L      YY+LL NLYA  G+W +V +VR +M + GL    G S V
Subjt:  VDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLV

Query:  ETNAGCLFEVFGVSLARNRDSCHMLEEMKSHFSEDRSAGRRAKEDTKPIDPCELQ----FCGHMNDKS----LVNHHEATGSKKSRDSEICEDLH
        E         F      +  +  +   ++  + +    G     ++  +D  E+     FCGH   K+    L+N         +++  +CE+ H
Subjt:  ETNAGCLFEVFGVSLARNRDSCHMLEEMKSHFSEDRSAGRRAKEDTKPIDPCELQ----FCGHMNDKS----LVNHHEATGSKKSRDSEICEDLH

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic | e value: 1.3e-114 | % identity: 32.92
Query:  RNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSE
        + +D  I+ +G +I   L       +KL  +Y    D+  A  VFD++       W +++     +  F+  I  + +M  S  + D+  FS + K+ S 
Subjt:  RNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSE

Query:  FREIDEGRKVHCQIVKVG-GPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACA
         R +  G ++H  I+K G G  + V   L+  Y K  +++ +  VF+ + +++V+SW S+I GYV N  AE+GL +F +M  + +E +  T+ S+   CA
Subjt:  FREIDEGRKVHCQIVKVG-GPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACA

Query:  RLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSC
          R +  G+ VH   +K             LDMY KCG    A+ +F E+    +VS+T+MI GY + G   EA++LF +     + P+  T  +VL+ C
Subjt:  RLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSC

Query:  SVSGNLSLGMSVHGLGIK---LGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFN-QMRSNSSAPDAITLVST
        +    L  G  VH   IK   LG  +  V NAL+DMYAKC  + +A ++F  +  KD+I+WN++I GY++N  A EAL LFN  +     +PD  T+   
Subjt:  SVSGNLSLGMSVHGLGIK---LGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFN-QMRSNSSAPDAITLVST

Query:  LSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFT
        L A A+L A   G  +H Y ++ G FS   ++  +L++ YAKCG    A M+FD +  K++++W+ MI GYG+ G G  ++A+F+ M +   + +E+ F 
Subjt:  LSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFT

Query:  TVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACY
        ++L ACS+SG+V+EGWR+F  M  E    P+++HYAC+VD+LAR+G L +A+ FI+ MP+ PD +++GA L GC ++    L E V  ++  L P    Y
Subjt:  TVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACY

Query:  YVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAGCLFEVFGVSLARNRDSCH-MLEEMKSHFSED--RSAGRRAKEDTKPIDPCELQFCGH
        YVL++N+YA   +W QV  +R+ + QRGL K PG S +E        V G S     ++    L ++++   E+      + A  D + ++  E   CGH
Subjt:  YVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAGCLFEVFGVSLARNRDSCH-MLEEMKSHFSED--RSAGRRAKEDTKPIDPCELQFCGH

Query:  MNDKSLVNHHEATGSKK----SRDSEICEDLH
            ++     ++G  K    +++  +C D H
Subjt:  MNDKSLVNHHEATGSKK----SRDSEICEDLH

Q9ZQ74 Pentatricopeptide repeat-containing protein At2g03380, mitochondrial | e value: 3.0e-220 | % identity: 54.63
Query:  PLLGIGHHMSCSTHAPLPQLSELDET-MASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFY
        P    G    C +   + +L   +E   +S+H+ + + CF L+  C NID+L + HG+L  +GL+G++   TKLV +YG       AR+VFDQ+P+PDFY
Subjt:  PLLGIGHHMSCSTHAPLPQLSELDET-MASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFY

Query:  AWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVV
         WKVM+R Y LN    +++  Y+ +       D+I+FS  LKAC+E +++D G+K+HCQ+VKV   D+ VLTGL+DMYAKCG+I+ +  VF  I  +NVV
Subjt:  AWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVV

Query:  SWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDL
         WTSMIAGYV+ND  EEGLVLFNRMRE  V  N++T G++I AC +L ALHQGKW HG  +K+ +ELSS L T+ LDMYVKCG   +AR +F+E   +DL
Subjt:  SWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDL

Query:  VSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDV
        V WTAMIVGYT  G  +EAL LF      ++ PN VT ASVLS C +  NL LG SVHGL IK+G+ +  V NAL+ MYAKC+   DA  +F    EKD+
Subjt:  VSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDV

Query:  ITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGD
        + WNS+ISG++QNGS +EAL LF++M S S  P+ +T+ S  SA A+LG++ VGSSLHAYS+K G L SS++++GTALL+FYAKCGD + AR++FD++ +
Subjt:  ITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGD

Query:  KNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKM
        KN ITWSAMIGGYG QGD  GSL +F +MLK+  KPNE  FT++LSAC ++GMV EG +YF SM ++YNF PS KHY CMVD+LAR+G L++A D I+KM
Subjt:  KNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKM

Query:  PVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVE
        P+QPD+  +GAFLHGCG++SRFDLGE+V++++L LHP++A YYVL+SNLYASDGRW Q  EVR LM QRGL+K+ G S +E
Subjt:  PVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVE

Arabidopsis top hits (e value, % identity, alignment)
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 4.9e-125 | % identity: 36.39
Query:  LMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNII--FSI
        L+  C ++  L +   L+  +GL       TKLV ++     V  A  VF+ +       +  M++ +         + F+ RMR  + D + ++  F+ 
Subjt:  LMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNII--FSI

Query:  ILKACSEFREIDEGRKVHCQIVKVG-GPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLG
        +LK C +  E+  G+++H  +VK G   D F +TGL +MYAKC Q+  +  VF+ + ++++VSW +++AGY QN  A   L +   M E  ++ +  T+ 
Subjt:  ILKACSEFREIDEGRKVHCQIVKVG-GPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLG

Query:  SIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTA
        S++ A + LR +  GK +HGYA+++  +    ++TA +DMY KCG    AR +FD +   ++VSW +MI  Y Q   P EA+ +F   +   + P  V+ 
Subjt:  SIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTA

Query:  ASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAIT
           L +C+  G+L  G  +H L ++LGL+   +V N+LI MY KC  +D A  +F  +  + +++WN+MI G+AQNG   +AL  F+QMRS +  PD  T
Subjt:  ASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAIT

Query:  LVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNE
         VS ++A A L        +H   ++  L   N+++ TAL++ YAKCG    AR++FD M ++++ TW+AMI GYG  G G  +L +F +M K   KPN 
Subjt:  LVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNE

Query:  VIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPN
        V F +V+SACS+SG+VE G + F  M + Y+   SM HY  MVDLL R+GRL+EA+DFI +MPV+P +++YGA L  C ++   +  E     +  L+P+
Subjt:  VIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPN

Query:  EACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVE
        +  Y+VLL+N+Y +   W +V +VR  ML++GL K PG S+VE
Subjt:  EACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVE

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 9.5e-113 | % identity: 33.09
Query:  DVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMR-MSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGP-DSFVLTGLIDMYAK
        ++  A  VF +M + + ++W V++  Y     F + +  Y+RM  +     D   F  +L+ C    ++  G++VH  +V+ G   D  V+  LI MY K
Subjt:  DVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMR-MSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGP-DSFVLTGLIDMYAK

Query:  CGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYV
        CG ++ +  +F+ +  ++++SW +MI+GY +N    EGL LF  MR   V+ +  TL S+I+AC  L     G+ +H Y I     +   +  +   MY+
Subjt:  CGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYV

Query:  KCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGL-EECAVKNALIDMY
          G  R+A  +F  +   D+VSWT MI GY     P +A+  +    +  + P+ +T A+VLS+C+  G+L  G+ +H L IK  L     V N LI+MY
Subjt:  KCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGL-EECAVKNALIDMY

Query:  AKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLN
        +KC  ID A  IFH I  K+VI+W S+I+G   N   +EAL    QM+  +  P+AITL + L+A A +GA+  G  +HA+ ++ G+   + ++  ALL+
Subjt:  AKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLN

Query:  FYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACM
         Y +CG    A   F+S   K++ +W+ ++ GY  +G GS  + +F  M+K   +P+E+ F ++L  CS S MV +G  YF  M ++Y   P++KHYAC+
Subjt:  FYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACM

Query:  VDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLV
        VDLL R+G L EA  FI+KMPV PD +++GA L+ C ++ + DLGE+  + I  L      YY+LL NLYA  G+W +V +VR +M + GL    G S V
Subjt:  VDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLV

Query:  ETNAGCLFEVFGVSLARNRDSCHMLEEMKSHFSEDRSAGRRAKEDTKPIDPCELQ----FCGHMNDKS----LVNHHEATGSKKSRDSEICEDLH
        E         F      +  +  +   ++  + +    G     ++  +D  E+     FCGH   K+    L+N         +++  +CE+ H
Subjt:  ETNAGCLFEVFGVSLARNRDSCHMLEEMKSHFSEDRSAGRRAKEDTKPIDPCELQ----FCGHMNDKS----LVNHHEATGSKKSRDSEICEDLH

AT1G69350.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 7.5e-118 | % identity: 33.55
Query:  KFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEG
        K HG +I  G+  + + +T L+ +YG   +++ A  VFD MP  D  AW  ++     N      +  +  M     + D +    +++ C+E   +   
Subjt:  KFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEG

Query:  RKVHCQIV-KVGGPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQ
        R VH QI  K+   D  +   L+ MY+KCG +  S  +FE I  KN VSWT+MI+ Y + + +E+ L  F+ M ++ +E N  TL S++++C  +  + +
Subjt:  RKVHCQIV-KVGGPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQ

Query:  GKWVHGYAIKNVVELS-SFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNL
        GK VHG+A++  ++ +   L+ A +++Y +CG+  D   +   +   ++V+W ++I  Y   G   +AL LF   +   + P++ T AS +S+C  +G +
Subjt:  GKWVHGYAIKNVVELS-SFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNL

Query:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAV
         LG  +HG  I+  + +  V+N+LIDMY+K   +D A  +F+ I  + V+TWNSM+ G++QNG++ EA+ LF+ M  +    + +T ++ + A +++G++
Subjt:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAV

Query:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSG
        + G  +H   I  GL   +L+  TAL++ YAKCGD   A  VF +M  ++I++WS+MI  YG+ G    +++ F+ M++   KPNEV+F  VLSAC +SG
Subjt:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSG

Query:  MVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYAS
         VEEG +Y+ ++ + +  +P+ +H+AC +DLL+RSG L EA+  IK+MP   D S++G+ ++GC ++ + D+ + +  ++  +  ++  YY LLSN+YA 
Subjt:  MVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYAS

Query:  DGRWGQVNEVRELMLQRGLNKVPGFSLVE
        +G W +   +R  M    L KVPG+S +E
Subjt:  DGRWGQVNEVRELMLQRGLNKVPGFSLVE

AT2G03380.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 2.1e-221 | % identity: 54.63
Query:  PLLGIGHHMSCSTHAPLPQLSELDET-MASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFY
        P    G    C +   + +L   +E   +S+H+ + + CF L+  C NID+L + HG+L  +GL+G++   TKLV +YG       AR+VFDQ+P+PDFY
Subjt:  PLLGIGHHMSCSTHAPLPQLSELDET-MASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFY

Query:  AWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVV
         WKVM+R Y LN    +++  Y+ +       D+I+FS  LKAC+E +++D G+K+HCQ+VKV   D+ VLTGL+DMYAKCG+I+ +  VF  I  +NVV
Subjt:  AWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVV

Query:  SWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDL
         WTSMIAGYV+ND  EEGLVLFNRMRE  V  N++T G++I AC +L ALHQGKW HG  +K+ +ELSS L T+ LDMYVKCG   +AR +F+E   +DL
Subjt:  SWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDL

Query:  VSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDV
        V WTAMIVGYT  G  +EAL LF      ++ PN VT ASVLS C +  NL LG SVHGL IK+G+ +  V NAL+ MYAKC+   DA  +F    EKD+
Subjt:  VSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDV

Query:  ITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGD
        + WNS+ISG++QNGS +EAL LF++M S S  P+ +T+ S  SA A+LG++ VGSSLHAYS+K G L SS++++GTALL+FYAKCGD + AR++FD++ +
Subjt:  ITWNSMISGYAQNGSAYEALRLFNQMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGD

Query:  KNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKM
        KN ITWSAMIGGYG QGD  GSL +F +MLK+  KPNE  FT++LSAC ++GMV EG +YF SM ++YNF PS KHY CMVD+LAR+G L++A D I+KM
Subjt:  KNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKM

Query:  PVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVE
        P+QPD+  +GAFLHGCG++SRFDLGE+V++++L LHP++A YYVL+SNLYASDGRW Q  EVR LM QRGL+K+ G S +E
Subjt:  PVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVE

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 9.2e-116 | % identity: 32.92
Query:  RNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSE
        + +D  I+ +G +I   L       +KL  +Y    D+  A  VFD++       W +++     +  F+  I  + +M  S  + D+  FS + K+ S 
Subjt:  RNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQMPDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSE

Query:  FREIDEGRKVHCQIVKVG-GPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACA
         R +  G ++H  I+K G G  + V   L+  Y K  +++ +  VF+ + +++V+SW S+I GYV N  AE+GL +F +M  + +E +  T+ S+   CA
Subjt:  FREIDEGRKVHCQIVKVG-GPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACA

Query:  RLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSC
          R +  G+ VH   +K             LDMY KCG    A+ +F E+    +VS+T+MI GY + G   EA++LF +     + P+  T  +VL+ C
Subjt:  RLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPSEALRLFADEIRSDLLPNSVTAASVLSSC

Query:  SVSGNLSLGMSVHGLGIK---LGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFN-QMRSNSSAPDAITLVST
        +    L  G  VH   IK   LG  +  V NAL+DMYAKC  + +A ++F  +  KD+I+WN++I GY++N  A EAL LFN  +     +PD  T+   
Subjt:  SVSGNLSLGMSVHGLGIK---LGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFN-QMRSNSSAPDAITLVST

Query:  LSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFT
        L A A+L A   G  +H Y ++ G FS   ++  +L++ YAKCG    A M+FD +  K++++W+ MI GYG+ G G  ++A+F+ M +   + +E+ F 
Subjt:  LSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFKPNEVIFT

Query:  TVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACY
        ++L ACS+SG+V+EGWR+F  M  E    P+++HYAC+VD+LAR+G L +A+ FI+ MP+ PD +++GA L GC ++    L E V  ++  L P    Y
Subjt:  TVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACY

Query:  YVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAGCLFEVFGVSLARNRDSCH-MLEEMKSHFSED--RSAGRRAKEDTKPIDPCELQFCGH
        YVL++N+YA   +W QV  +R+ + QRGL K PG S +E        V G S     ++    L ++++   E+      + A  D + ++  E   CGH
Subjt:  YVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAGCLFEVFGVSLARNRDSCH-MLEEMKSHFSED--RSAGRRAKEDTKPIDPCELQFCGH

Query:  MNDKSLVNHHEATGSKK----SRDSEICEDLH
            ++     ++G  K    +++  +C D H
Subjt:  MNDKSLVNHHEATGSKK----SRDSEICEDLH


Sequences
CDS sequence
ATGTTGCAACGCTTCTTCAGCTTCCCTCGAACCTTCTCCCGTCTCGCTGGTCCTTTACTTGGAATTGGGCACCACATGTCTTGTTCGACACATGCTCCACTGCCCCAACT
CTCCGAGCTCGATGAAACCATGGCTTCAGTGCACTTCATTTCGTTAAACTCGTGCTTTTATCTCATGGGACTTTGCAGGAACATTGATACCCTCATCAAGTTCCATGGAT
TGCTCATAGTACACGGCCTTGTCGGCAACCTTCTTTGTGATACAAAGTTAGTCGGTGTTTATGGCGCACTTCGGGACGTGGCGTCTGCTCGCATGGTGTTCGATCAAATG
CCCGACCCAGATTTTTATGCGTGGAAAGTGATGATTAGGTGGTATTTTTTGAACGACATGTTTGCAGATATTATTCCCTTCTATAATCGCATGAGAATGTCGTTTAAGGA
CTGTGATAATATTATTTTTTCGATTATCCTTAAAGCGTGTAGTGAATTTCGTGAAATTGATGAAGGGAGAAAGGTCCATTGCCAGATTGTAAAGGTGGGGGGTCCGGATA
GTTTTGTATTGACTGGTTTGATAGATATGTATGCGAAATGTGGGCAGATTGAGTGCTCAAGCTCTGTGTTTGAAGGAATTATTGATAAGAATGTGGTTTCTTGGACTTCA
ATGATTGCGGGATATGTACAAAATGATTGTGCAGAGGAGGGTTTGGTTTTATTCAATCGGATGAGAGAGGCATTGGTTGAAAGCAACCAATTTACTTTAGGGAGCATAAT
AACTGCGTGTGCAAGATTAAGAGCTTTGCATCAGGGGAAATGGGTTCATGGCTATGCCATAAAGAACGTTGTTGAGCTTAGCTCTTTCTTAGCGACTGCTTTTTTAGACA
TGTATGTTAAATGTGGGCAAACTAGAGATGCTCGCATGATATTTGACGAGCTACCTACTATTGACCTCGTTTCATGGACTGCAATGATTGTTGGATATACACAAGCTGGC
CAACCCAGCGAGGCGTTGAGGCTTTTCGCTGATGAAATAAGGTCTGATCTCTTACCTAATTCTGTCACTGCTGCAAGTGTTCTCTCGTCGTGTTCAGTTTCTGGTAATTT
AAGTTTAGGAATGTCAGTTCATGGACTTGGAATTAAACTTGGGCTCGAAGAGTGTGCAGTGAAGAATGCTCTTATTGATATGTATGCTAAATGCCATATGATTGATGATG
CTTATGTTATATTTCATGGGATTTTGGAAAAAGATGTGATTACTTGGAACTCAATGATATCTGGGTATGCTCAGAATGGGTCTGCATATGAAGCCCTCCGGCTCTTTAAT
CAAATGAGATCAAACTCCTCTGCACCTGATGCAATAACACTTGTGAGCACCCTTTCAGCATCTGCCACCCTAGGTGCTGTACAGGTCGGTTCATCACTTCATGCTTACTC
GATAAAAGAAGGCTTATTTTCATCTAATCTTTACATTGGCACTGCACTTTTGAACTTTTATGCTAAATGTGGCGATGCTAAATGTGCACGGATGGTGTTCGATAGTATGG
GAGATAAGAACATTATCACGTGGAGTGCAATGATAGGTGGTTATGGAGTGCAGGGTGATGGAAGTGGATCCCTTGCCATTTTCTCTGACATGTTGAAGGAGGATTTCAAA
CCCAATGAAGTAATTTTCACGACAGTATTATCTGCTTGTAGCTATTCAGGAATGGTTGAAGAGGGATGGAGATATTTCAAATCTATGAGTCAGGAATATAACTTTGCACC
TTCCATGAAACACTATGCCTGTATGGTTGATCTTTTAGCTCGATCTGGTAGACTGGATGAAGCATTCGACTTTATTAAGAAAATGCCAGTTCAACCAGACATTAGTTTGT
ATGGAGCTTTTCTTCATGGATGTGGATTATACTCGAGGTTTGATCTTGGAGAAGTGGTAGTCAGGGAAATACTACGACTTCATCCAAACGAAGCTTGCTATTATGTGCTT
TTATCTAACCTGTATGCTTCAGATGGGAGATGGGGTCAAGTTAATGAGGTGAGAGAGTTGATGCTACAGAGAGGATTGAACAAAGTTCCAGGGTTTAGCCTAGTAGAAAC
TAATGCAGGTTGTCTGTTTGAGGTTTTTGGGGTGAGCCTGGCACGTAATAGAGACAGTTGTCATATGTTGGAGGAGATGAAATCTCACTTTAGTGAAGATAGGAGCGCAG
GGAGGAGAGCTAAGGAGGATACGAAACCTATTGATCCTTGTGAACTCCAATTCTGTGGTCACATGAATGATAAATCGTTGGTAAATCATCATGAAGCCACTGGAAGTAAG
AAGTCTCGTGACAGCGAAATTTGTGAAGATTTGCATGGAGCGTGTCAGGAGGATAATTTGTATGTTTCACTTTCAGAATGTCCTCCCAGTTGCACATCTTCTGGTAGAGA
ACTTTTTGATGAAACTATGCATAATAATAACAGGACGTGGACGAATGCCCGTTAA
mRNA sequence
ATGTTGCAACGCTTCTTCAGCTTCCCTCGAACCTTCTCCCGTCTCGCTGGTCCTTTACTTGGAATTGGGCACCACATGTCTTGTTCGACACATGCTCCACTGCCCCAACT
CTCCGAGCTCGATGAAACCATGGCTTCAGTGCACTTCATTTCGTTAAACTCGTGCTTTTATCTCATGGGACTTTGCAGGAACATTGATACCCTCATCAAGTTCCATGGAT
TGCTCATAGTACACGGCCTTGTCGGCAACCTTCTTTGTGATACAAAGTTAGTCGGTGTTTATGGCGCACTTCGGGACGTGGCGTCTGCTCGCATGGTGTTCGATCAAATG
CCCGACCCAGATTTTTATGCGTGGAAAGTGATGATTAGGTGGTATTTTTTGAACGACATGTTTGCAGATATTATTCCCTTCTATAATCGCATGAGAATGTCGTTTAAGGA
CTGTGATAATATTATTTTTTCGATTATCCTTAAAGCGTGTAGTGAATTTCGTGAAATTGATGAAGGGAGAAAGGTCCATTGCCAGATTGTAAAGGTGGGGGGTCCGGATA
GTTTTGTATTGACTGGTTTGATAGATATGTATGCGAAATGTGGGCAGATTGAGTGCTCAAGCTCTGTGTTTGAAGGAATTATTGATAAGAATGTGGTTTCTTGGACTTCA
ATGATTGCGGGATATGTACAAAATGATTGTGCAGAGGAGGGTTTGGTTTTATTCAATCGGATGAGAGAGGCATTGGTTGAAAGCAACCAATTTACTTTAGGGAGCATAAT
AACTGCGTGTGCAAGATTAAGAGCTTTGCATCAGGGGAAATGGGTTCATGGCTATGCCATAAAGAACGTTGTTGAGCTTAGCTCTTTCTTAGCGACTGCTTTTTTAGACA
TGTATGTTAAATGTGGGCAAACTAGAGATGCTCGCATGATATTTGACGAGCTACCTACTATTGACCTCGTTTCATGGACTGCAATGATTGTTGGATATACACAAGCTGGC
CAACCCAGCGAGGCGTTGAGGCTTTTCGCTGATGAAATAAGGTCTGATCTCTTACCTAATTCTGTCACTGCTGCAAGTGTTCTCTCGTCGTGTTCAGTTTCTGGTAATTT
AAGTTTAGGAATGTCAGTTCATGGACTTGGAATTAAACTTGGGCTCGAAGAGTGTGCAGTGAAGAATGCTCTTATTGATATGTATGCTAAATGCCATATGATTGATGATG
CTTATGTTATATTTCATGGGATTTTGGAAAAAGATGTGATTACTTGGAACTCAATGATATCTGGGTATGCTCAGAATGGGTCTGCATATGAAGCCCTCCGGCTCTTTAAT
CAAATGAGATCAAACTCCTCTGCACCTGATGCAATAACACTTGTGAGCACCCTTTCAGCATCTGCCACCCTAGGTGCTGTACAGGTCGGTTCATCACTTCATGCTTACTC
GATAAAAGAAGGCTTATTTTCATCTAATCTTTACATTGGCACTGCACTTTTGAACTTTTATGCTAAATGTGGCGATGCTAAATGTGCACGGATGGTGTTCGATAGTATGG
GAGATAAGAACATTATCACGTGGAGTGCAATGATAGGTGGTTATGGAGTGCAGGGTGATGGAAGTGGATCCCTTGCCATTTTCTCTGACATGTTGAAGGAGGATTTCAAA
CCCAATGAAGTAATTTTCACGACAGTATTATCTGCTTGTAGCTATTCAGGAATGGTTGAAGAGGGATGGAGATATTTCAAATCTATGAGTCAGGAATATAACTTTGCACC
TTCCATGAAACACTATGCCTGTATGGTTGATCTTTTAGCTCGATCTGGTAGACTGGATGAAGCATTCGACTTTATTAAGAAAATGCCAGTTCAACCAGACATTAGTTTGT
ATGGAGCTTTTCTTCATGGATGTGGATTATACTCGAGGTTTGATCTTGGAGAAGTGGTAGTCAGGGAAATACTACGACTTCATCCAAACGAAGCTTGCTATTATGTGCTT
TTATCTAACCTGTATGCTTCAGATGGGAGATGGGGTCAAGTTAATGAGGTGAGAGAGTTGATGCTACAGAGAGGATTGAACAAAGTTCCAGGGTTTAGCCTAGTAGAAAC
TAATGCAGGTTGTCTGTTTGAGGTTTTTGGGGTGAGCCTGGCACGTAATAGAGACAGTTGTCATATGTTGGAGGAGATGAAATCTCACTTTAGTGAAGATAGGAGCGCAG
GGAGGAGAGCTAAGGAGGATACGAAACCTATTGATCCTTGTGAACTCCAATTCTGTGGTCACATGAATGATAAATCGTTGGTAAATCATCATGAAGCCACTGGAAGTAAG
AAGTCTCGTGACAGCGAAATTTGTGAAGATTTGCATGGAGCGTGTCAGGAGGATAATTTGTATGTTTCACTTTCAGAATGTCCTCCCAGTTGCACATCTTCTGGTAGAGA
ACTTTTTGATGAAACTATGCATAATAATAACAGGACGTGGACGAATGCCCGTTAA
Protein sequence
MLQRFFSFPRTFSRLAGPLLGIGHHMSCSTHAPLPQLSELDETMASVHFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALRDVASARMVFDQM
PDPDFYAWKVMIRWYFLNDMFADIIPFYNRMRMSFKDCDNIIFSIILKACSEFREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYAKCGQIECSSSVFEGIIDKNVVSWTS
MIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACARLRALHQGKWVHGYAIKNVVELSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAG
QPSEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVIFHGILEKDVITWNSMISGYAQNGSAYEALRLFN
QMRSNSSAPDAITLVSTLSASATLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKCARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDFK
PNEVIFTTVLSACSYSGMVEEGWRYFKSMSQEYNFAPSMKHYACMVDLLARSGRLDEAFDFIKKMPVQPDISLYGAFLHGCGLYSRFDLGEVVVREILRLHPNEACYYVL
LSNLYASDGRWGQVNEVRELMLQRGLNKVPGFSLVETNAGCLFEVFGVSLARNRDSCHMLEEMKSHFSEDRSAGRRAKEDTKPIDPCELQFCGHMNDKSLVNHHEATGSK
KSRDSEICEDLHGACQEDNLYVSLSECPPSCTSSGRELFDETMHNNNRTWTNAR