; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0018624 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0018624
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr12:19607164..19609436
RNA-Seq ExpressionIVF0018624
SyntenyIVF0018624
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573373.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.081.61Show/hide
Query:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
        M HLQRSKPI       NFPATQSRLLNTLS LF+RC S Q LQQIHARF+LHGFHQNPTLS KLIDCYAN GLL  S  VF SIIDPN  L+NAILRNL
Subjt:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL

Query:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
        TR+GE ER LLVY++MVAKSMHPDE+TYPF+ RSC   SNV FG+ IHG L+KLG DS+D V T L EMYEK I FENAHQLFDK SVKDL   SSL TE
Subjt:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE

Query:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
          QNGNG+ I R+F RM++E LV DSLTF+NLLR ++GL+SIQLAKIVHCIAIVS L GDLLV TAVLSLYSKL SLVDAR+LF+K+PEKDRVVWNIMIA
Subjt:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
        AYAREG+P ECLELF+SMARSGIR+DLFTALPVISSI+QLK  DWGKQTHA+ILRNGSDSQVSVHNSLIDMYCEC  LDSAC IFN +T+K+VISWSAMI
Subjt:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
        KG VK+G  L A SLF +MKSDGIQADF+T+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCG I+MAQRLFEEER+DDKDLIMWN
Subjt:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCF LYN+MKCSNS PDQVTFLGLLTACVNSGL+EKGKEFFKEM ESY C PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE
        VWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR+KGLKKTPGCS LEINGRV EFRVAD+THPRAEDIY IL 
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE

Query:  TLNLKSK
         L L  K
Subjt:  TLNLKSK

XP_008444579.1 PREDICTED: pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucumis melo]0.098.6Show/hide
Query:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
        MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
Subjt:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL

Query:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
        TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
Subjt:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE

Query:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
        GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
Subjt:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
        AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
Subjt:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
        KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
Subjt:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE
        VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTIL 
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE

Query:  TLNLKSKRLEKR
         L L+ K + ++
Subjt:  TLNLKSKRLEKR

XP_022139869.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Momordica charantia]0.080.76Show/hide
Query:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
        MLHLQRSKPI       NFPATQSR LNTLS LF+RC+S Q L+QIHARFILHG HQNP LS +LID YANLGLL  S QVF SIIDP  TL++AILRNL
Subjt:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL

Query:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
        + +GE ER LLVY++M AKSMHPDEETYP + RSC   SNV +GR IHG+LVKLG D +D  ATALAEMY K I FEN H LFDK  +KD    +SL +E
Subjt:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE

Query:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
         SQNGNG+ IF++F RMR EQLV DSLTF+NLLR I GLNSIQLAKIVHC+AI S L GDLLV TAVLSLYSKL  LV+AR+LFDKMPEKDRVVWNIMIA
Subjt:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
        AY REG P ECLELFKSMARSGIR+DLFTALPVISSI+QLKCVDWGKQTHAH LRNGSD+QVSVHNSLIDMYCE  +LDSAC IF+WMT+K+VISWSAMI
Subjt:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
        KG VK+GQSL A SLFS+MKSDGIQADF+T+INILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEER+DDKDLIMWN
Subjt:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFK+YN+MKCSNS+PDQVTFLGLLTACVNSGL+EKGKE FKEM E+YGC PSQEH+ACMVNLLGRAGLI++AG LVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE
        VWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR+KGLKKTPGCS LEING VTEFRVAD+THPRAEDIYTIL 
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE

Query:  TLNLKSKRLEKR
         L L+ K   ++
Subjt:  TLNLKSKRLEKR

XP_022994744.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucurbita maxima]0.081.15Show/hide
Query:  MLHLQRSKPIIHTPILL----NFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAI
        M HLQRSK I  +PI      NFPATQSRLLNTLS LF+RC S Q L+QIHARF+LHGFHQNPTLS KLIDCYAN GLL  S  VF SIIDPN TL+NAI
Subjt:  MLHLQRSKPIIHTPILL----NFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAI

Query:  LRNLTRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSS
        LRNLTR+GE ER LLVY++MVAKSMHPDE+TYPF+ +SC   SNV FG+ IHG L+KLG DS+D V T LAEMY K I FENAHQLFDK SVKDL   SS
Subjt:  LRNLTRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSS

Query:  LTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWN
        L +E  QNGNG+ I  +  RM++E LV DSLTF+NLLR I+GL+SIQLAKIVHCIAIVS L GDLLV TAVLSLYSKL SLVDAR+LF+KMPEKDRVVWN
Subjt:  LTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWN

Query:  IMIAAYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISW
        IMIAAYAREG+P ECLELF+SMARSGIR+DLFTALPVISSI+QLKC DWGKQTHA+ILRNGSDSQVSVHNSLIDMYCEC  L+SAC IFN +T+K+VISW
Subjt:  IMIAAYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISW

Query:  SAMIKGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDL
        SAMIKG VK+G  L A SLF  MKSDGIQADF+T+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCG IEMAQRLFEEER++DKDL
Subjt:  SAMIKGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIK
        IMWNSMISAHANHGDWSQCFKLYN+MKCSNS PDQVTFLGLLTACVNSGL+EKGKEFFKEM ESY C PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIK

Query:  PDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR+KGLKKTPGCS LEINGRV EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIY

Query:  TILETLNLKSK
         IL  L L  K
Subjt:  TILETLNLKSK

XP_038894029.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benincasa hispida]0.086.1Show/hide
Query:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
        MLHLQRSKP+IH+ I  NFPATQSRLLNTLS LF+RC+S QHL+QIHARF+LHGFHQNPTLSSKLIDCYANLGLL  SLQVF SI +PN T++NAILRNL
Subjt:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL

Query:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
        TRYGE ER LLVY+QMVAKSMHPDEETYP + RSC SFSNVG GR IHGYLVKLGFDSFD+VATAL EMYE+ I FE+AHQLFDKRSVKDL   SS TTE
Subjt:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE

Query:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
          QNGNGEGIF VF RMR EQLV DSLTF+NLLRFIAG NSIQLAKIVHCIAIVSKL GDLLV TAVLSLYSKL SLVDAR+LFDKMPE DRVVWNIMIA
Subjt:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
        AYAREGKP ECL LFKSMARSGIRSD+FTALPVISSI+QLK  DWGKQTHA+ILRNGSDSQVSV+NSLIDMYCEC +LDSAC IFNWM DK+VISWSAMI
Subjt:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
        KGYVK+G SL A SLFS MKSDGIQ+DF+T+INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQR+FEEERIDDKDLIMWN
Subjt:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFKLYN+MKCSN+KPDQVTFLGLLTACVNSGL+EKGKEF KEMTE+YGC PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE
        VWGPLLSACK+HPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKW+EVAKMRSFLR+KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTIL 
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE

Query:  TLNLKSKRLEKR
         L L+ K   ++
Subjt:  TLNLKSKRLEKR

TrEMBL top hitse value%identityAlignment
A0A0A0M0Z6 Uncharacterized protein0.0e+0092.84Show/hide
Query:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
        MLHL RSKPIIH+PI LNFPATQSRLLNTLSLLF+RCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLL HSLQVFCS+IDPNLTLFNAILRNL
Subjt:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL

Query:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
        TRYGESER LLVYQQMVAKSMHPDEETYPF+ RSCSSFSNVGFGRTIHGYLVKLGFD FDVVATALAEMYE+ I FENAHQLFDKRSVKDLGW SSLTTE
Subjt:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE

Query:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
        G QN NGEGIFRVF RM AEQLVPDS TF NLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLV TAVLSLYSKLRSLVDAR+LFDKMPEKDRVVWNIMIA
Subjt:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
        AYAREGKP ECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECK+LDSAC IFNWMTDKSVISWSAMI
Subjt:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
        KGYVKNGQSLTA SLFSKMKSDGIQADFV MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEE+IDDKDLIMWN
Subjt:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGL+EKGKEFFKEMTESYGC PSQEH+ACMVNLLGRAGLISEAGELV+NMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE
        VWGPLLSACKMHPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKW+ VAKMRSFLRNKGLKK PGCS LEING VTEFRVADQTHPRA DIYTIL 
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE

Query:  TLNLKSKRLEKR
         L L+ K + ++
Subjt:  TLNLKSKRLEKR

A0A1S3BBG7 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0098.6Show/hide
Query:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
        MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
Subjt:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL

Query:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
        TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
Subjt:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE

Query:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
        GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
Subjt:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
        AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
Subjt:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
        KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
Subjt:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE
        VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTIL 
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE

Query:  TLNLKSKRLEKR
         L L+ K + ++
Subjt:  TLNLKSKRLEKR

A0A5D3DB69 Pentatricopeptide repeat-containing protein0.0e+0098.6Show/hide
Query:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
        MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
Subjt:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL

Query:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
        TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
Subjt:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE

Query:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
        GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
Subjt:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
        AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
Subjt:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
        KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
Subjt:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE
        VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTIL 
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE

Query:  TLNLKSKRLEKR
         L L+ K + ++
Subjt:  TLNLKSKRLEKR

A0A6J1CE61 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0080.76Show/hide
Query:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL
        MLHLQRSKPI       NFPATQSR LNTLS LF+RC+S Q L+QIHARFILHG HQNP LS +LID YANLGLL  S QVF SIIDP  TL++AILRNL
Subjt:  MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNL

Query:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE
        + +GE ER LLVY++M AKSMHPDEETYP + RSC   SNV +GR IHG+LVKLG D +D  ATALAEMY K I FEN H LFDK  +KD    +SL +E
Subjt:  TRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTE

Query:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA
         SQNGNG+ IF++F RMR EQLV DSLTF+NLLR I GLNSIQLAKIVHC+AI S L GDLLV TAVLSLYSKL  LV+AR+LFDKMPEKDRVVWNIMIA
Subjt:  GSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA

Query:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI
        AY REG P ECLELFKSMARSGIR+DLFTALPVISSI+QLKCVDWGKQTHAH LRNGSD+QVSVHNSLIDMYCE  +LDSAC IF+WMT+K+VISWSAMI
Subjt:  AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMI

Query:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN
        KG VK+GQSL A SLFS+MKSDGIQADF+T+INILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEER+DDKDLIMWN
Subjt:  KGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWN

Query:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR
        SMISAHANHGDWSQCFK+YN+MKCSNS+PDQVTFLGLLTACVNSGL+EKGKE FKEM E+YGC PSQEH+ACMVNLLGRAGLI++AG LVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR

Query:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE
        VWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR+KGLKKTPGCS LEING VTEFRVAD+THPRAEDIYTIL 
Subjt:  VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILE

Query:  TLNLKSKRLEKR
         L L+ K   ++
Subjt:  TLNLKSKRLEKR

A0A6J1K3Q8 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0080.7Show/hide
Query:  MLHLQRSKPIIHTPILL----NFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAI
        M HLQRSK I  +PI      NFPATQSRLLNTLS LF+RC S Q L+QIHARF+LHGFHQNPTLS KLIDCYAN GLL  S  VF SIIDPN TL+NAI
Subjt:  MLHLQRSKPIIHTPILL----NFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAI

Query:  LRNLTRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSS
        LRNLTR+GE ER LLVY++MVAKSMHPDE+TYPF+ +SC   SNV FG+ IHG L+KLG DS+D V T LAEMY K I FENAHQLFDK SVKDL   SS
Subjt:  LRNLTRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSS

Query:  LTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWN
        L +E  QNGNG+ I  +  RM++E LV DSLTF+NLLR I+GL+SIQLAKIVHCIAIVS L GDLLV TAVLSLYSKL SLVDAR+LF+KMPEKDRVVWN
Subjt:  LTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWN

Query:  IMIAAYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISW
        IMIAAYAREG+P ECLELF+SMARSGIR+DLFTALPVISSI+QLKC DWGKQTHA+ILRNGSDSQVSVHNSLIDMYCEC  L+SAC IFN +T+K+VISW
Subjt:  IMIAAYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISW

Query:  SAMIKGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDL
        SAMIKG VK+G  L A SLF  MKSDGIQADF+T+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCG IEMAQRLFEEER++DKDL
Subjt:  SAMIKGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIK
        IMWNSMISAHANHGDWSQCFKLYN+MKCSNS PDQVTFLGLLTACVNSGL+EKGKEFFKEM ESY C PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIK

Query:  PDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR+KGLKKTPGCS LEINGRV EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIY

Query:  TILETLNLKSKRLEK
         IL  L L  K  ++
Subjt:  TILETLNLKSKRLEK

SwissProt top hitse value%identityAlignment
O81767 Pentatricopeptide repeat-containing protein At4g339902.0e-11133.04Show/hide
Query:  QSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQ-MVAKSM
        +S+ ++ +  LF  C ++Q  + +HAR ++    QN  +S+KL++ Y  LG +  +   F  I + ++  +N ++    R G S   +  +   M++  +
Subjt:  QSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQ-MVAKSM

Query:  HPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQ
         PD  T+P + ++C +  +   G  IH   +K GF     VA +L  +Y ++ A  NA  LFD+  V+D+G  +++ +   Q+GN +    +   +RA  
Subjt:  HPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQ

Query:  LVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARS
           DS+T V+LL              +H  +I   L  +L V   ++ LY++   L D +++FD+M  +D + WN +I AY    +P   + LF+ M  S
Subjt:  LVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARS

Query:  GIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNG-SDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMK
         I+ D  T + + S ++QL  +   +      LR G     +++ N+++ MY +  ++DSA  +FNW+ +  VISW+ +I GY +NG +  A  +++ M+
Subjt:  GIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNG-SDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMK

Query:  SDG-IQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLY
         +G I A+  T +++LPA    GAL     LHG  +K GL     + T+L   Y KCG +E A  LF +  I   + + WN++I+ H  HG   +   L+
Subjt:  SDG-IQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLY

Query:  NRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAE
          M     KPD +TF+ LL+AC +SGL+++G+  F+ M   YG  PS +H+ CMV++ GRAG +  A + +++M ++PDA +WG LLSAC++H    L +
Subjt:  NRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAE

Query:  FAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETLNLKSKRL
         A+E L ++EP++ G ++LLSN+YA+AGKWE V ++RS    KGL+KTPG SS+E++ +V  F   +QTHP  E++Y  L  L  K K +
Subjt:  FAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETLNLKSKRL

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.3e-11533.43Show/hide
Query:  SLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKSMHPDEETYPF
        +LL  RC+S++ L+QI      +G +Q     +KL+  +   G +  + +VF  I      L++ +L+   +  + ++AL  + +M    + P    + +
Subjt:  SLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKSMHPDEETYPF

Query:  IFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVA-TALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTF
        + + C   + +  G+ IHG LVK GF S D+ A T L  MY K      A ++FD+   +DL   +++    SQNG       +   M  E L P  +T 
Subjt:  IFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVA-TALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTF

Query:  VNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARSGIRSDLFT
        V++L  ++ L  I + K +H  A+ S     + + TA++ +Y+K  SL  AR+LFD M E++ V WN MI AY +   P+E + +F+ M   G++    +
Subjt:  VNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARSGIRSDLFT

Query:  ALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMKSDGIQADFV
         +  + + A L  ++ G+  H   +  G D  VSV NSLI MYC+CK +D+A ++F  +  ++++SW+AMI G+ +NG+ + A + FS+M+S  ++ D  
Subjt:  ALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMKSDGIQADFV

Query:  TMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKP
        T ++++ A   +    + K++HG  M+  L     + TAL+  YAKCG I +A+ +F  + + ++ +  WN+MI  +  HG      +L+  M+    KP
Subjt:  TMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKP

Query:  DQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDME
        + VTFL +++AC +SGL+E G + F  M E+Y    S +H+  MV+LLGRAG ++EA + +  MP+KP   V+G +L AC++H     AE AAE+L ++ 
Subjt:  DQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDME

Query:  PKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETL
        P + G ++LL+NIY AA  WE+V ++R  +  +GL+KTPGCS +EI   V  F      HP ++ IY  LE L
Subjt:  PKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETL

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic5.5e-12236.42Show/hide
Query:  SKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLID-CYANLGL--LKHSLQVFCSIIDPNLTLFNAILRNLTRY
        S P    P   + P    R   +LSLL N C ++Q L+ IHA+ I  G H      SKLI+ C  +     L +++ VF +I +PNL ++N + R     
Subjt:  SKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLID-CYANLGL--LKHSLQVFCSIIDPNLTLFNAILRNLTRY

Query:  GESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQ
         +   AL +Y  M++  + P+  T+PF+ +SC+       G+ IHG+++KLG D    V T+L  MY +    E+AH++FDK   +              
Subjt:  GESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQ

Query:  NGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYA
                                                                D++ YTA++  Y+    + +A++LFD++P KD V WN MI+ YA
Subjt:  NGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYA

Query:  REGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGY
          G  +E LELFK M ++ +R D  T + V+S+ AQ   ++ G+Q H  I  +G  S + + N+LID+Y +C  L++AC +F  +  K VISW+ +I GY
Subjt:  REGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGY

Query:  VKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNS
                A  LF +M   G   + VTM++ILPA  H+GA++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    I  K L  WN+
Subjt:  VKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNS

Query:  MISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARV
        MI   A HG     F L++RM+    +PD +TF+GLL+AC +SG+++ G+  F+ MT+ Y   P  EH+ CM++LLG +GL  EA E++  M ++PD  +
Subjt:  MISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARV

Query:  WGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILET
        W  LL ACKMH   +L E  AE LI +EP+N G+Y+LLSNIYA+AG+W EVAK R+ L +KG+KK PGCSS+EI+  V EF + D+ HPR  +IY +LE 
Subjt:  WGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILET

Query:  LNL
        + +
Subjt:  LNL

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226901.8e-11231.47Show/hide
Query:  PILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLT-----LFNAILRNLTRYGESER
        P LLN    QS+           C +I  L+  H      G   + +  +KL+     LG  + SL     + + + +     ++N+++R     G    
Subjt:  PILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLT-----LFNAILRNLTRYGESER

Query:  ALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKD-LGWSSSLTTEGSQNGNG
        A+L++ +M+   + PD+ T+PF   +C+     G G  IHG +VK+G+     V  +L   Y +    ++A ++FD+ S ++ + W+S +     ++   
Subjt:  ALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKD-LGWSSSLTTEGSQNGNG

Query:  EGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGK
        + +   F  +R E++ P+S+T V ++   A L  ++  + V+     S +  + L+ +A++ +Y K  ++  A+RLFD+    +  + N M + Y R+G 
Subjt:  EGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGK

Query:  PRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNG
         RE L +F  M  SG+R D  + L  ISS +QL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C   D+A  IF+ M++K+V++W++++ GYV+NG
Subjt:  PRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNG

Query:  Q------------------------SLTASSLF--------SKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAK
        +                         L   SLF        S    +G+ AD VTM++I  A  H+GAL+  K+++ Y  K G+     L T L+  +++
Subjt:  Q------------------------SLTASSLF--------SKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAK

Query:  CGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVN
        CG  E A  +F    + ++D+  W + I A A  G+  +  +L++ M     KPD V F+G LTAC + GL+++GKE F  M + +G  P   H+ CMV+
Subjt:  CGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVN

Query:  LLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEI
        LLGRAGL+ EA +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA+AG+W ++AK+R  ++ KGL+K PG SS++I
Subjt:  LLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEI

Query:  NGRVTEFRVADQTHPRAEDIYTILETLNLKSKRL
         G+  EF   D++HP   +I  +L+ ++ ++  L
Subjt:  NGRVTEFRVADQTHPRAEDIYTILETLNLKSKRL

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic4.9e-11034.05Show/hide
Query:  HGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLV
        +GF  +  L SKL   Y N G LK + +VF  +       +N ++  L + G+   ++ ++++M++  +  D  T+  + +S SS  +V  G  +HG+++
Subjt:  HGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLV

Query:  KLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKD-LGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCI
        K GF   + V  +L   Y K    ++A ++FD+ + +D + W+S +    S NG  E    VFV+M    +  D  T V++    A    I L + VH I
Subjt:  KLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKD-LGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCI

Query:  AIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHA
         + +  S +      +L +YSK   L  A+ +F +M ++  V +  MIA YAREG   E ++LF+ M   GI  D++T   V++  A+ + +D GK+ H 
Subjt:  AIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHA

Query:  HILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFS-KMKSDGIQADFVTMINILPAFVHIGALENVKYL
         I  N     + V N+L+DMY +C  +  A  +F+ M  K +ISW+ +I GY KN  +  A SLF+  ++      D  T+  +LPA   + A +  + +
Subjt:  HILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFS-KMKSDGIQADFVTMINILPAFVHIGALENVKYL

Query:  HGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKG
        HGY M+ G  S   +  +L+  YAKCG + +A  LF++  I  KDL+ W  MI+ +  HG   +   L+N+M+ +  + D+++F+ LL AC +SGL+++G
Subjt:  HGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKG

Query:  KEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWE
          FF  M       P+ EH+AC+V++L R G + +A   + NMPI PDA +WG LL  C++H   KLAE  AEK+ ++EP+N G Y+L++NIYA A KWE
Subjt:  KEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWE

Query:  EVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETLNLK
        +V ++R  +  +GL+K PGCS +EI GRV  F   D ++P  E+I   L  +  +
Subjt:  EVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETLNLK

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.9e-12336.42Show/hide
Query:  SKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLID-CYANLGL--LKHSLQVFCSIIDPNLTLFNAILRNLTRY
        S P    P   + P    R   +LSLL N C ++Q L+ IHA+ I  G H      SKLI+ C  +     L +++ VF +I +PNL ++N + R     
Subjt:  SKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLID-CYANLGL--LKHSLQVFCSIIDPNLTLFNAILRNLTRY

Query:  GESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQ
         +   AL +Y  M++  + P+  T+PF+ +SC+       G+ IHG+++KLG D    V T+L  MY +    E+AH++FDK   +              
Subjt:  GESERALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQ

Query:  NGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYA
                                                                D++ YTA++  Y+    + +A++LFD++P KD V WN MI+ YA
Subjt:  NGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYA

Query:  REGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGY
          G  +E LELFK M ++ +R D  T + V+S+ AQ   ++ G+Q H  I  +G  S + + N+LID+Y +C  L++AC +F  +  K VISW+ +I GY
Subjt:  REGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGY

Query:  VKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNS
                A  LF +M   G   + VTM++ILPA  H+GA++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    I  K L  WN+
Subjt:  VKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNS

Query:  MISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARV
        MI   A HG     F L++RM+    +PD +TF+GLL+AC +SG+++ G+  F+ MT+ Y   P  EH+ CM++LLG +GL  EA E++  M ++PD  +
Subjt:  MISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARV

Query:  WGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILET
        W  LL ACKMH   +L E  AE LI +EP+N G+Y+LLSNIYA+AG+W EVAK R+ L +KG+KK PGCSS+EI+  V EF + D+ HPR  +IY +LE 
Subjt:  WGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILET

Query:  LNL
        + +
Subjt:  LNL

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-11633.43Show/hide
Query:  SLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKSMHPDEETYPF
        +LL  RC+S++ L+QI      +G +Q     +KL+  +   G +  + +VF  I      L++ +L+   +  + ++AL  + +M    + P    + +
Subjt:  SLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKSMHPDEETYPF

Query:  IFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVA-TALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTF
        + + C   + +  G+ IHG LVK GF S D+ A T L  MY K      A ++FD+   +DL   +++    SQNG       +   M  E L P  +T 
Subjt:  IFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVA-TALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTF

Query:  VNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARSGIRSDLFT
        V++L  ++ L  I + K +H  A+ S     + + TA++ +Y+K  SL  AR+LFD M E++ V WN MI AY +   P+E + +F+ M   G++    +
Subjt:  VNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARSGIRSDLFT

Query:  ALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMKSDGIQADFV
         +  + + A L  ++ G+  H   +  G D  VSV NSLI MYC+CK +D+A ++F  +  ++++SW+AMI G+ +NG+ + A + FS+M+S  ++ D  
Subjt:  ALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMKSDGIQADFV

Query:  TMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKP
        T ++++ A   +    + K++HG  M+  L     + TAL+  YAKCG I +A+ +F  + + ++ +  WN+MI  +  HG      +L+  M+    KP
Subjt:  TMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKP

Query:  DQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDME
        + VTFL +++AC +SGL+E G + F  M E+Y    S +H+  MV+LLGRAG ++EA + +  MP+KP   V+G +L AC++H     AE AAE+L ++ 
Subjt:  DQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDME

Query:  PKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETL
        P + G ++LL+NIY AA  WE+V ++R  +  +GL+KTPGCS +EI   V  F      HP ++ IY  LE L
Subjt:  PKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETL

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)1.3e-11331.47Show/hide
Query:  PILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLT-----LFNAILRNLTRYGESER
        P LLN    QS+           C +I  L+  H      G   + +  +KL+     LG  + SL     + + + +     ++N+++R     G    
Subjt:  PILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLT-----LFNAILRNLTRYGESER

Query:  ALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKD-LGWSSSLTTEGSQNGNG
        A+L++ +M+   + PD+ T+PF   +C+     G G  IHG +VK+G+     V  +L   Y +    ++A ++FD+ S ++ + W+S +     ++   
Subjt:  ALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKD-LGWSSSLTTEGSQNGNG

Query:  EGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGK
        + +   F  +R E++ P+S+T V ++   A L  ++  + V+     S +  + L+ +A++ +Y K  ++  A+RLFD+    +  + N M + Y R+G 
Subjt:  EGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGK

Query:  PRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNG
         RE L +F  M  SG+R D  + L  ISS +QL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C   D+A  IF+ M++K+V++W++++ GYV+NG
Subjt:  PRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNG

Query:  Q------------------------SLTASSLF--------SKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAK
        +                         L   SLF        S    +G+ AD VTM++I  A  H+GAL+  K+++ Y  K G+     L T L+  +++
Subjt:  Q------------------------SLTASSLF--------SKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAK

Query:  CGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVN
        CG  E A  +F    + ++D+  W + I A A  G+  +  +L++ M     KPD V F+G LTAC + GL+++GKE F  M + +G  P   H+ CMV+
Subjt:  CGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVN

Query:  LLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEI
        LLGRAGL+ EA +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA+AG+W ++AK+R  ++ KGL+K PG SS++I
Subjt:  LLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEI

Query:  NGRVTEFRVADQTHPRAEDIYTILETLNLKSKRL
         G+  EF   D++HP   +I  +L+ ++ ++  L
Subjt:  NGRVTEFRVADQTHPRAEDIYTILETLNLKSKRL

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.3e-11331.47Show/hide
Query:  PILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLT-----LFNAILRNLTRYGESER
        P LLN    QS+           C +I  L+  H      G   + +  +KL+     LG  + SL     + + + +     ++N+++R     G    
Subjt:  PILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLT-----LFNAILRNLTRYGESER

Query:  ALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKD-LGWSSSLTTEGSQNGNG
        A+L++ +M+   + PD+ T+PF   +C+     G G  IHG +VK+G+     V  +L   Y +    ++A ++FD+ S ++ + W+S +     ++   
Subjt:  ALLVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKD-LGWSSSLTTEGSQNGNG

Query:  EGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGK
        + +   F  +R E++ P+S+T V ++   A L  ++  + V+     S +  + L+ +A++ +Y K  ++  A+RLFD+    +  + N M + Y R+G 
Subjt:  EGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGK

Query:  PRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNG
         RE L +F  M  SG+R D  + L  ISS +QL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C   D+A  IF+ M++K+V++W++++ GYV+NG
Subjt:  PRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNG

Query:  Q------------------------SLTASSLF--------SKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAK
        +                         L   SLF        S    +G+ AD VTM++I  A  H+GAL+  K+++ Y  K G+     L T L+  +++
Subjt:  Q------------------------SLTASSLF--------SKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAK

Query:  CGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVN
        CG  E A  +F    + ++D+  W + I A A  G+  +  +L++ M     KPD V F+G LTAC + GL+++GKE F  M + +G  P   H+ CMV+
Subjt:  CGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVN

Query:  LLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEI
        LLGRAGL+ EA +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA+AG+W ++AK+R  ++ KGL+K PG SS++I
Subjt:  LLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEI

Query:  NGRVTEFRVADQTHPRAEDIYTILETLNLKSKRL
         G+  EF   D++HP   +I  +L+ ++ ++  L
Subjt:  NGRVTEFRVADQTHPRAEDIYTILETLNLKSKRL

AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-11233.04Show/hide
Query:  QSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQ-MVAKSM
        +S+ ++ +  LF  C ++Q  + +HAR ++    QN  +S+KL++ Y  LG +  +   F  I + ++  +N ++    R G S   +  +   M++  +
Subjt:  QSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQ-MVAKSM

Query:  HPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQ
         PD  T+P + ++C +  +   G  IH   +K GF     VA +L  +Y ++ A  NA  LFD+  V+D+G  +++ +   Q+GN +    +   +RA  
Subjt:  HPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQ

Query:  LVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARS
           DS+T V+LL              +H  +I   L  +L V   ++ LY++   L D +++FD+M  +D + WN +I AY    +P   + LF+ M  S
Subjt:  LVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARS

Query:  GIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNG-SDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMK
         I+ D  T + + S ++QL  +   +      LR G     +++ N+++ MY +  ++DSA  +FNW+ +  VISW+ +I GY +NG +  A  +++ M+
Subjt:  GIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNG-SDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMK

Query:  SDG-IQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLY
         +G I A+  T +++LPA    GAL     LHG  +K GL     + T+L   Y KCG +E A  LF +  I   + + WN++I+ H  HG   +   L+
Subjt:  SDG-IQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLY

Query:  NRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAE
          M     KPD +TF+ LL+AC +SGL+++G+  F+ M   YG  PS +H+ CMV++ GRAG +  A + +++M ++PDA +WG LLSAC++H    L +
Subjt:  NRMKCSNSKPDQVTFLGLLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAE

Query:  FAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETLNLKSKRL
         A+E L ++EP++ G ++LLSN+YA+AGKWE V ++RS    KGL+KTPG SS+E++ +V  F   +QTHP  E++Y  L  L  K K +
Subjt:  FAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETLNLKSKRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCACCTTCAACGATCAAAACCCATTATTCATACTCCCATTCTCCTCAACTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCCTACTCTTCAATCGATG
CAACTCCATTCAACACCTCCAGCAAATTCATGCCAGGTTCATTCTCCATGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATTGATTGCTATGCCAATCTTGGTC
TCCTCAAACACTCTCTCCAAGTTTTCTGCTCCATAATCGACCCCAATTTGACTCTTTTCAACGCCATACTGAGAAATTTGACAAGATATGGAGAATCCGAGCGAGCCCTG
TTGGTGTATCAACAAATGGTCGCCAAATCTATGCACCCAGATGAAGAAACTTACCCTTTTATTTTTCGATCATGTTCTTCATTTTCAAATGTTGGATTTGGGAGGACGAT
TCATGGTTATTTGGTTAAGCTGGGTTTTGATTCGTTTGATGTTGTAGCTACTGCTCTGGCTGAGATGTATGAGAAATGGATTGCATTTGAGAATGCTCATCAACTGTTTG
ATAAAAGGTCTGTGAAAGATTTGGGATGGTCGAGTTCCTTGACCACGGAGGGTTCTCAAAATGGTAACGGGGAGGGAATTTTTCGGGTTTTTGTGAGAATGAGAGCAGAA
CAATTAGTACCAGACTCACTCACATTCGTCAATCTCTTGAGGTTCATTGCAGGTTTGAACTCAATTCAACTTGCAAAAATTGTTCATTGTATTGCAATTGTGAGCAAATT
GAGTGGAGATTTGTTAGTATATACTGCCGTGTTGTCTCTTTACTCTAAGTTACGTAGCTTAGTAGATGCTAGACGGTTATTTGACAAAATGCCAGAGAAAGATCGTGTTG
TATGGAATATAATGATAGCAGCTTATGCTCGGGAAGGTAAACCGAGGGAATGCCTTGAGCTCTTCAAGTCCATGGCAAGATCAGGGATTAGATCTGATCTGTTTACTGCA
CTGCCTGTTATCTCTTCAATTGCACAGTTGAAATGTGTTGATTGGGGGAAACAAACACATGCCCATATATTGAGGAATGGTTCCGACAGTCAAGTTTCAGTTCATAATTC
TCTCATTGACATGTACTGTGAATGTAAAATGTTAGATTCAGCTTGTAATATCTTCAACTGGATGACAGACAAGTCTGTAATTTCATGGAGTGCTATGATCAAGGGGTATG
TCAAAAATGGTCAGTCCCTCACTGCATCGTCTCTCTTCTCCAAGATGAAATCTGATGGGATTCAAGCTGATTTTGTTACAATGATCAATATCTTGCCTGCATTTGTTCAC
ATTGGAGCACTTGAAAATGTCAAATATTTACATGGGTACTCAATGAAGCTAGGCCTGACTTCCCTTCCATCACTTAACACAGCCCTTCTGATAACATATGCAAAATGTGG
GTATATAGAGATGGCTCAGAGGCTATTTGAGGAAGAGAGAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAATCATGGAGACTGGTCCC
AGTGTTTTAAGCTATACAATCGAATGAAGTGCTCAAATTCAAAGCCAGACCAAGTAACATTTTTGGGACTACTAACAGCTTGTGTCAATTCTGGTCTTATCGAAAAGGGG
AAAGAATTTTTCAAGGAGATGACTGAAAGTTATGGGTGCCTACCAAGCCAAGAGCATTTTGCTTGCATGGTTAACCTCTTGGGGAGAGCTGGGCTTATCAGTGAAGCAGG
AGAACTTGTAAGAAACATGCCTATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGATGCATCCTGGGTCCAAGCTTGCAGAGTTCGCGGCCGAGA
AGCTCATTGATATGGAGCCTAAAAATGCCGGAAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGTAAATGGGAAGAAGTGGCAAAGATGAGAAGTTTCTTAAGG
AATAAAGGGCTGAAGAAAACCCCTGGTTGTAGTTCGCTGGAGATAAATGGCCGTGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATAC
CATCCTAGAAACCTTGAACTTGAAATCAAAGAGGTTAGAGAAAAGAGTCTAG
mRNA sequenceShow/hide mRNA sequence
GTCCGACATGCTTCACCTTCAACGATCAAAACCCATTATTCATACTCCCATTCTCCTCAACTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCCTACTCTTCA
ATCGATGCAACTCCATTCAACACCTCCAGCAAATTCATGCCAGGTTCATTCTCCATGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATTGATTGCTATGCCAAT
CTTGGTCTCCTCAAACACTCTCTCCAAGTTTTCTGCTCCATAATCGACCCCAATTTGACTCTTTTCAACGCCATACTGAGAAATTTGACAAGATATGGAGAATCCGAGCG
AGCCCTGTTGGTGTATCAACAAATGGTCGCCAAATCTATGCACCCAGATGAAGAAACTTACCCTTTTATTTTTCGATCATGTTCTTCATTTTCAAATGTTGGATTTGGGA
GGACGATTCATGGTTATTTGGTTAAGCTGGGTTTTGATTCGTTTGATGTTGTAGCTACTGCTCTGGCTGAGATGTATGAGAAATGGATTGCATTTGAGAATGCTCATCAA
CTGTTTGATAAAAGGTCTGTGAAAGATTTGGGATGGTCGAGTTCCTTGACCACGGAGGGTTCTCAAAATGGTAACGGGGAGGGAATTTTTCGGGTTTTTGTGAGAATGAG
AGCAGAACAATTAGTACCAGACTCACTCACATTCGTCAATCTCTTGAGGTTCATTGCAGGTTTGAACTCAATTCAACTTGCAAAAATTGTTCATTGTATTGCAATTGTGA
GCAAATTGAGTGGAGATTTGTTAGTATATACTGCCGTGTTGTCTCTTTACTCTAAGTTACGTAGCTTAGTAGATGCTAGACGGTTATTTGACAAAATGCCAGAGAAAGAT
CGTGTTGTATGGAATATAATGATAGCAGCTTATGCTCGGGAAGGTAAACCGAGGGAATGCCTTGAGCTCTTCAAGTCCATGGCAAGATCAGGGATTAGATCTGATCTGTT
TACTGCACTGCCTGTTATCTCTTCAATTGCACAGTTGAAATGTGTTGATTGGGGGAAACAAACACATGCCCATATATTGAGGAATGGTTCCGACAGTCAAGTTTCAGTTC
ATAATTCTCTCATTGACATGTACTGTGAATGTAAAATGTTAGATTCAGCTTGTAATATCTTCAACTGGATGACAGACAAGTCTGTAATTTCATGGAGTGCTATGATCAAG
GGGTATGTCAAAAATGGTCAGTCCCTCACTGCATCGTCTCTCTTCTCCAAGATGAAATCTGATGGGATTCAAGCTGATTTTGTTACAATGATCAATATCTTGCCTGCATT
TGTTCACATTGGAGCACTTGAAAATGTCAAATATTTACATGGGTACTCAATGAAGCTAGGCCTGACTTCCCTTCCATCACTTAACACAGCCCTTCTGATAACATATGCAA
AATGTGGGTATATAGAGATGGCTCAGAGGCTATTTGAGGAAGAGAGAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAATCATGGAGAC
TGGTCCCAGTGTTTTAAGCTATACAATCGAATGAAGTGCTCAAATTCAAAGCCAGACCAAGTAACATTTTTGGGACTACTAACAGCTTGTGTCAATTCTGGTCTTATCGA
AAAGGGGAAAGAATTTTTCAAGGAGATGACTGAAAGTTATGGGTGCCTACCAAGCCAAGAGCATTTTGCTTGCATGGTTAACCTCTTGGGGAGAGCTGGGCTTATCAGTG
AAGCAGGAGAACTTGTAAGAAACATGCCTATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGATGCATCCTGGGTCCAAGCTTGCAGAGTTCGCG
GCCGAGAAGCTCATTGATATGGAGCCTAAAAATGCCGGAAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGTAAATGGGAAGAAGTGGCAAAGATGAGAAGTTT
CTTAAGGAATAAAGGGCTGAAGAAAACCCCTGGTTGTAGTTCGCTGGAGATAAATGGCCGTGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATA
TATATACCATCCTAGAAACCTTGAACTTGAAATCAAAGAGGTTAGAGAAAAGAGTCTAGATACCTTGGTAAATCCTCTTCTATAACACTTGGCATTCATTTTTTCTATTG
ATAGATCTCCTCATGTGACAGGTTTTACTCATTCATTTATTTACTTATTCTTCACACATTACATTGTTTAGTT
Protein sequenceShow/hide protein sequence
MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERAL
LVYQQMVAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAHQLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAE
QLVPDSLTFVNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIAAYAREGKPRECLELFKSMARSGIRSDLFTA
LPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMKSDGIQADFVTMINILPAFVH
IGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLIEKG
KEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLR
NKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILETLNLKSKRLEKRV