; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0098491 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0098491
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCMiso1.1chr04:14210601..14216588
RNA-Seq ExpressionCmc04g0098491
SyntenyCmc04g0098491
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447444.1 PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis melo]1.5e-29988.15Show/hide
Query:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
        MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
Subjt:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN

Query:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
        VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
Subjt:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE

Query:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
        RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
Subjt:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ

Query:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN
        +                                                                      VIEGLAVHGYAEKALRMFAIMEREKILPN
Subjt:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN

Query:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
        GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
Subjt:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP

Query:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI
        MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI
Subjt:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI

XP_011651448.1 pentatricopeptide repeat-containing protein At1g06143 [Cucumis sativus]2.4e-27682.33Show/hide
Query:  MFSFVTTIALKQLTRSIGNFVS-PSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENP
        MFSFVTT ALKQLTRSIGNFVS PSISMPLQ PS PSFKQTLLNRIKNCS INELH + ASMIK+NAIQDCFLVHQFISASFA NSVHYPVFAFTQMENP
Subjt:  MFSFVTTIALKQLTRSIGNFVS-PSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENP

Query:  NVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMC
        NVFVYNAMIKGFVY GYPFR LQCYVHMLE SNVLP SYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLE LSEARKVFDEMC
Subjt:  NVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMC

Query:  ERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPD
        ERDAFAWT MVSALARVGDMD+ARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSE RLNGIIPD
Subjt:  ERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPD

Query:  QI----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILP
        ++                                                                      VIEGLAVHGYAEKALRMFAIMEREKI+P
Subjt:  QI----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILP

Query:  NGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILE
        NGVTFISILSACTHAGLV+EGRSRFLSMTRDY I P+IRHYGCMVDMLSK+G L EALELIKSMEFEPNSIIWGALLNGCKLHGN  IA+DAVEQLMILE
Subjt:  NGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILE

Query:  PMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI
        PMNSGHYNLLVSM AEEKDWMEVAHIR MMKE+GVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTAL+F EEI
Subjt:  PMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI

XP_022967388.1 pentatricopeptide repeat-containing protein At1g06143 [Cucurbita maxima]1.4e-23672.11Show/hide
Query:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
        MFS   T ALKQ+TRSI NFVS S S  LQ P  P+FKQTLL+RIKNCS INEL  +YASMIK+NA QDCFLV+QFISAS  FNSV YPV AFTQMENPN
Subjt:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN

Query:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
        VFVYNAMI+GFVY GYPFR +QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HC IW  G E  +FVQT+L+D YS LE+  +ARKVFDEM E
Subjt:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE

Query:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
        RD FAWTTMVSALAR GDMD+ARKLFEEMPE NTATWNTMIDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY + RLNGIIPD+
Subjt:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ

Query:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN
        +                                                                      VIEGLAVHGYAEKALRMF IMEREKI+PN
Subjt:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN

Query:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
        GVTFISILSACTHAGLV EGRSRFLSM RDYGI PE+ HYGCMVDMLSKAGLL EALELI  MEFEPNSIIWGALLNGCKLHGNS IAKDAV +L ILEP
Subjt:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP

Query:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSV
         NSGHYNLLVSM AEEK W+EVAHIR MMKE GVEKKYPGSSWIELEG IHQFSASAD HPDSDKIYF+LTELDGQLKLAG +LEPSV
Subjt:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSV

XP_023554768.1 pentatricopeptide repeat-containing protein At1g06143 [Cucurbita pepo subsp. pepo]3.1e-23672.11Show/hide
Query:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
        MFS   T ALKQ+TRSI NF S S    LQ     +FKQTLL+RIKNCS INEL  +YASMIK+NA QDCFLV+QFISAS  FNSV YPV AFTQMENPN
Subjt:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN

Query:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
        VFVYNAMI+GFVY GYPFR +QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HCHIWK G E  +FVQT+L+D YS LE+  +ARKVFDEM E
Subjt:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE

Query:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
        RD FAWTTMVSALAR GDMDTARKLFEEMPE NTATWNTMIDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY + RLNGIIPD+
Subjt:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ

Query:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN
        +                                                                      VIEGLAVHGYAEKALRMF IMEREKI+PN
Subjt:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN

Query:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
        GVTFISILSACTHAGLV EGRSRF SM RDYGI PE+ HYGCMVDMLSKAGLL EALELI  MEFEPNSIIWGALLNGCKLHGNS IAKDAV QL ILEP
Subjt:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP

Query:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSV
         NSGHYNLLVSM AEEK WMEVAHIR MMKE GVEKKYPGSSWIELEG IHQFSASAD HPDSDKIYF+LTELDGQLKLAG +LEPSV
Subjt:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSV

XP_038888390.1 pentatricopeptide repeat-containing protein At1g06143 [Benincasa hispida]2.1e-25677.1Show/hide
Query:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
        MFSFV T ALKQLTRSI NFVS SISMP Q PS PSFKQTLLNRIKNCS INEL  +YASMIK+NA QDCFLV+QFIS S AFNSV YPV AFTQMENPN
Subjt:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN

Query:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
        VFVYNAMI+GFVY GYPF  LQCYVHMLE + V P SYTFSSLVKACTFMCAVELG+M+HCHIWK GFESHLFVQTAL+DFYS LE+LSEARKVFDEM E
Subjt:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE

Query:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
        RD+FAWTTMVSALAR GDMD+ARKLFEEMPE NTATWNTMIDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQYQ+AL IY + RLNGIIPD+
Subjt:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ

Query:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN
        +                                                                      VIEGLAVHGYAEKALRMF IMEREKI PN
Subjt:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN

Query:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
        GVTFISILSACTHAGLVEEGRSRFLSMTRDYGI PEI HYGCMVDMLSKAG L EALELIKSMEFEPNSIIWGALLNGCKLHGNS IAKDAV+QLMILEP
Subjt:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP

Query:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALV
        M+SGHYNLLVSM AEEKDWMEVAHIR MMKEQGVEKKYPGSSWIEL+G IHQFSASADSHPDSD+IYF+LTELDGQLKLAGYILEP VC  ALV
Subjt:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALV

TrEMBL top hitse value%identityAlignment
A0A0A0LB99 Uncharacterized protein1.1e-27682.33Show/hide
Query:  MFSFVTTIALKQLTRSIGNFVS-PSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENP
        MFSFVTT ALKQLTRSIGNFVS PSISMPLQ PS PSFKQTLLNRIKNCS INELH + ASMIK+NAIQDCFLVHQFISASFA NSVHYPVFAFTQMENP
Subjt:  MFSFVTTIALKQLTRSIGNFVS-PSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENP

Query:  NVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMC
        NVFVYNAMIKGFVY GYPFR LQCYVHMLE SNVLP SYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLE LSEARKVFDEMC
Subjt:  NVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMC

Query:  ERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPD
        ERDAFAWT MVSALARVGDMD+ARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSE RLNGIIPD
Subjt:  ERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPD

Query:  QI----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILP
        ++                                                                      VIEGLAVHGYAEKALRMFAIMEREKI+P
Subjt:  QI----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILP

Query:  NGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILE
        NGVTFISILSACTHAGLV+EGRSRFLSMTRDY I P+IRHYGCMVDMLSK+G L EALELIKSMEFEPNSIIWGALLNGCKLHGN  IA+DAVEQLMILE
Subjt:  NGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILE

Query:  PMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI
        PMNSGHYNLLVSM AEEKDWMEVAHIR MMKE+GVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTAL+F EEI
Subjt:  PMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI

A0A1S3BHH1 pentatricopeptide repeat-containing protein At1g06145-like7.4e-30088.15Show/hide
Query:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
        MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
Subjt:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN

Query:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
        VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
Subjt:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE

Query:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
        RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
Subjt:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ

Query:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN
        +                                                                      VIEGLAVHGYAEKALRMFAIMEREKILPN
Subjt:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN

Query:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
        GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
Subjt:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP

Query:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI
        MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI
Subjt:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI

A0A5A7T9J0 Pentatricopeptide repeat-containing protein7.4e-30088.15Show/hide
Query:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
        MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
Subjt:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN

Query:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
        VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
Subjt:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE

Query:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
        RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
Subjt:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ

Query:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN
        +                                                                      VIEGLAVHGYAEKALRMFAIMEREKILPN
Subjt:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN

Query:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
        GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
Subjt:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP

Query:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI
        MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI
Subjt:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI

A0A6J1HIB3 pentatricopeptide repeat-containing protein At1g061432.2e-23571.6Show/hide
Query:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
        MFS   T ALKQ+TRSI NFVS S    LQ     +FKQTLL+RIKNCS INEL  +YASMIK+NA QDCFLV+QFISAS  FNSV YPV AFTQMENPN
Subjt:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN

Query:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
        VFVYNAMI+GFVY GYPFR +QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HCHIWK G E  +FVQT+L+D YS LE+  +ARKVFDEM E
Subjt:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE

Query:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
        RD FAWTTMVSALAR GDMD+ARKLFEEMPE NTATWNTMIDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY   RLNGIIPD+
Subjt:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ

Query:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN
        +                                                                      VIEGLAVHGYAEKALRMF IMEREKI+PN
Subjt:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN

Query:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
        GVTFISILSACTHAGLV EGRSRF SM RDYGI PE+ HYGCMVDMLSKAGLL EALELI  MEFEPNSIIWGALLNGCKLHGNS IAKDAV+QL +LEP
Subjt:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP

Query:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSV
         NSGHYNLLVSM AEEK WM+VAHIR MMKE GVEKKYPGSSWIELEG IHQFSASA+ HPDSDKIYF+LTELDGQLKLAG +LEPSV
Subjt:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSV

A0A6J1HUY6 pentatricopeptide repeat-containing protein At1g061436.8e-23772.11Show/hide
Query:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN
        MFS   T ALKQ+TRSI NFVS S S  LQ P  P+FKQTLL+RIKNCS INEL  +YASMIK+NA QDCFLV+QFISAS  FNSV YPV AFTQMENPN
Subjt:  MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPN

Query:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE
        VFVYNAMI+GFVY GYPFR +QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HC IW  G E  +FVQT+L+D YS LE+  +ARKVFDEM E
Subjt:  VFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCE

Query:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
        RD FAWTTMVSALAR GDMD+ARKLFEEMPE NTATWNTMIDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY + RLNGIIPD+
Subjt:  RDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ

Query:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN
        +                                                                      VIEGLAVHGYAEKALRMF IMEREKI+PN
Subjt:  I----------------------------------------------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPN

Query:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP
        GVTFISILSACTHAGLV EGRSRFLSM RDYGI PE+ HYGCMVDMLSKAGLL EALELI  MEFEPNSIIWGALLNGCKLHGNS IAKDAV +L ILEP
Subjt:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEP

Query:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSV
         NSGHYNLLVSM AEEK W+EVAHIR MMKE GVEKKYPGSSWIELEG IHQFSASAD HPDSDKIYF+LTELDGQLKLAG +LEPSV
Subjt:  MNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSV

SwissProt top hitse value%identityAlignment
Q56X05 Pentatricopeptide repeat-containing protein At1g061431.9e-13548.24Show/hide
Query:  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLV
        IK CS    L    A+MIK++  QDC L++QFI+A  +F  +   V   TQM+ PNVFVYNA+ KGFV   +P R L+ YV ML  S V P+SYT+SSLV
Subjt:  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLV

Query:  KACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGY
        KA +F  A   G+ +  HIWK GF  H+ +QT L+DFYS   ++ EARKVFDEM ERD  AWTTMVSA  RV DMD+A  L  +M E+N AT N +I+GY
Subjt:  KACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGY

Query:  ARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQI-------------------------------------------
          LGN+E AE LFNQMP KDIISWTTMI  YSQNK+Y++A+A++ +    GIIPD++                                           
Subjt:  ARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQI-------------------------------------------

Query:  ---------------------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMV
                                   +IEGLA HG+A++AL+MFA ME E + PN VTF+S+ +ACTHAGLV+EGR  + SM  DY I   + HYG MV
Subjt:  ---------------------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMV

Query:  DMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWI
         + SKAGL+ EALELI +MEFEPN++IWGALL+GC++H N VIA+ A  +LM+LEPMNSG+Y LLVSM AE+  W +VA IR  M+E G+EK  PG+S I
Subjt:  DMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWI

Query:  ELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILE
         ++   H F+A+  SH  SD++  +L E+  Q+ LAGY+ E
Subjt:  ELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILE

Q9FG16 Pentatricopeptide repeat-containing protein At5g065409.6e-7932.89Show/hide
Query:  FKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISA-------SFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLE
        FK   L  +++CS+ ++L +++  +++++ I D F+  + ++        +   N + Y    F+Q++NPN+FV+N +I+ F     P +    Y  ML+
Subjt:  FKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISA-------SFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLE

Query:  GSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEM
         S + P++ TF  L+KA + M  V +G+  H  I + GF++ ++V+ +LV  Y+    ++ A ++F +M  RD  +WT+MV+   + G ++ AR++F+EM
Subjt:  GSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEM

Query:  PERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDAL---------AIYSETRLNGIIPDQIV------------------
        P RN  TW+ MI+GYA+    E A  LF  M  + +++  T++     +  +  AL          + S   +N I+   +V                  
Subjt:  PERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDAL---------AIYSETRLNGIIPDQIV------------------

Query:  ------------IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELI
                    I+GLAVHG+A KA+  F+ M     +P  VTF ++LSAC+H GLVE+G   + +M +D+GI P + HYGC+VDML +AG L EA   I
Subjt:  ------------IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELI

Query:  KSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASAD-S
          M  +PN+ I GALL  CK++ N+ +A+     L+ ++P +SG+Y LL ++ A    W ++  +R MMKE+ V KK PG S IE++G I++F+   D  
Subjt:  KSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASAD-S

Query:  HPDSDKIYFVLTELDGQLKLAGY
        HP+  KI     E+ G+++L GY
Subjt:  HPDSDKIYFVLTELDGQLKLAGY

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665202.4e-8232.94Show/hide
Query:  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFIS---ASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFS
        ++ CS   EL  ++A M+K+  +QD + + +F+S   +S + + + Y    F   + P+ F++N MI+GF     P R L  Y  ML  S+   N+YTF 
Subjt:  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFIS---ASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFS

Query:  SLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMI
        SL+KAC+ + A E    +H  I K G+E+ ++   +L++ Y+       A  +FD + E D  +W +++    + G MD A  LF +M E+N  +W TMI
Subjt:  SLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMI

Query:  DGYARLGNVESAELLFNQMPTKDI---------------------------------------ISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQIV
         GY +    + A  LF++M   D+                                       +    +I  Y++  + ++AL ++   +   +     +
Subjt:  DGYARLGNVESAELLFNQMPTKDI---------------------------------------ISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQIV

Query:  IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIW
        I G A HG+  +A+  F  M++  I PN +TF ++L+AC++ GLVEEG+  F SM RDY + P I HYGC+VD+L +AGLL EA   I+ M  +PN++IW
Subjt:  IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIW

Query:  GALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTE
        GALL  C++H N  + ++  E L+ ++P + G Y    ++ A +K W + A  R +MKEQGV  K PG S I LEGT H+F A   SHP+ +KI      
Subjt:  GALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTE

Query:  LDGQLKLAGYILE
        +  +L+  GY+ E
Subjt:  LDGQLKLAGYILE

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.7e-8028.75Show/hide
Query:  PSISMP---LQSPSRPSF----KQTLLNRIKNCSNINELHVVYASMIK---SNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFV
        PS S P   L S S P +        L+ + NC  +  L +++A MIK    N       + +F   S  F  + Y +  F  ++ PN+ ++N M +G  
Subjt:  PSISMP---LQSPSRPSF----KQTLLNRIKNCSNINELHVVYASMIK---SNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFV

Query:  YRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSA
            P   L+ YV M+    +LPNSYTF  ++K+C    A + GQ +H H+ K G +  L+V T+L+  Y +  +L +A KVFD+   RD  ++T ++  
Subjt:  YRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSA

Query:  LARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGN---------------------------------------------------------------
         A  G ++ A+KLF+E+P ++  +WN MI GYA  GN                                                               
Subjt:  LARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGN---------------------------------------------------------------

Query:  -------VESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQI-----------------------------------------
               +E+A  LF ++P KD+ISW T+I  Y+    Y++AL ++ E   +G  P+ +                                         
Subjt:  -------VESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQI-----------------------------------------

Query:  -------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHY
                                       +I G A+HG A+ +  +F+ M +  I P+ +TF+ +LSAC+H+G+++ GR  F +MT+DY ++P++ HY
Subjt:  -------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHY

Query:  GCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG
        GCM+D+L  +GL KEA E+I  ME EP+ +IW +LL  CK+HGN  + +   E L+ +EP N G Y LL ++ A    W EVA  R ++ ++G+ KK PG
Subjt:  GCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG

Query:  SSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPS
         S IE++  +H+F      HP + +IY +L E++  L+ AG++ + S
Subjt:  SSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPS

Q9LS72 Pentatricopeptide repeat-containing protein At3g292308.7e-8831.74Show/hide
Query:  SMPLQSPSRPSFKQTLLNRIKN---CSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQ
        S+P+++PS  S ++    R+++   C+N+N++  ++A +I+ N  +D  +  + ISA       +  V  F Q++ PNV + N++I+       P++   
Subjt:  SMPLQSPSRPSFKQTLLNRIKN---CSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQ

Query:  CYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSK------------LEKLSE--------------------
         +  M +   +  +++T+  L+KAC+    + + +M+H HI K G  S ++V  AL+D YS+             EK+SE                    
Subjt:  CYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSK------------LEKLSE--------------------

Query:  -ARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQM--PTKDIISWTTMITCYSQNKQYQDA---
         AR++FDEM +RD  +W TM+   AR  +M  A +LFE+MPERNT +W+TM+ GY++ G++E A ++F++M  P K++++WT +I  Y++    ++A   
Subjt:  -ARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQM--PTKDIISWTTMITCYSQNKQYQDA---

Query:  -------------------LAIYSETRL------------------------------------------------NGIIPDQIVIEGLAVHGYAEKALR
                           LA  +E+ L                                                  ++    ++ GL VHG+ ++A+ 
Subjt:  -------------------LAIYSETRL------------------------------------------------NGIIPDQIVIEGLAVHGYAEKALR

Query:  MFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVI
        +F+ M RE I P+ VTFI++L +C HAGL++EG   F SM + Y + P++ HYGC+VD+L + G LKEA++++++M  EPN +IWGALL  C++H    I
Subjt:  MFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVI

Query:  AKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTEL
        AK+ ++ L+ L+P + G+Y+LL ++ A  +DW  VA IR  MK  GVEK   G+S +ELE  IH+F+    SHP SD+IY +L  L
Subjt:  AKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTEL

Arabidopsis top hitse value%identityAlignment
AT1G06150.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.4e-13648.24Show/hide
Query:  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLV
        IK CS    L    A+MIK++  QDC L++QFI+A  +F  +   V   TQM+ PNVFVYNA+ KGFV   +P R L+ YV ML  S V P+SYT+SSLV
Subjt:  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLV

Query:  KACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGY
        KA +F  A   G+ +  HIWK GF  H+ +QT L+DFYS   ++ EARKVFDEM ERD  AWTTMVSA  RV DMD+A  L  +M E+N AT N +I+GY
Subjt:  KACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGY

Query:  ARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQI-------------------------------------------
          LGN+E AE LFNQMP KDIISWTTMI  YSQNK+Y++A+A++ +    GIIPD++                                           
Subjt:  ARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQI-------------------------------------------

Query:  ---------------------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMV
                                   +IEGLA HG+A++AL+MFA ME E + PN VTF+S+ +ACTHAGLV+EGR  + SM  DY I   + HYG MV
Subjt:  ---------------------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMV

Query:  DMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWI
         + SKAGL+ EALELI +MEFEPN++IWGALL+GC++H N VIA+ A  +LM+LEPMNSG+Y LLVSM AE+  W +VA IR  M+E G+EK  PG+S I
Subjt:  DMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWI

Query:  ELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILE
         ++   H F+A+  SH  SD++  +L E+  Q+ LAGY+ E
Subjt:  ELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILE

AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-8128.75Show/hide
Query:  PSISMP---LQSPSRPSF----KQTLLNRIKNCSNINELHVVYASMIK---SNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFV
        PS S P   L S S P +        L+ + NC  +  L +++A MIK    N       + +F   S  F  + Y +  F  ++ PN+ ++N M +G  
Subjt:  PSISMP---LQSPSRPSF----KQTLLNRIKNCSNINELHVVYASMIK---SNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFV

Query:  YRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSA
            P   L+ YV M+    +LPNSYTF  ++K+C    A + GQ +H H+ K G +  L+V T+L+  Y +  +L +A KVFD+   RD  ++T ++  
Subjt:  YRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSA

Query:  LARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGN---------------------------------------------------------------
         A  G ++ A+KLF+E+P ++  +WN MI GYA  GN                                                               
Subjt:  LARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGN---------------------------------------------------------------

Query:  -------VESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQI-----------------------------------------
               +E+A  LF ++P KD+ISW T+I  Y+    Y++AL ++ E   +G  P+ +                                         
Subjt:  -------VESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQI-----------------------------------------

Query:  -------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHY
                                       +I G A+HG A+ +  +F+ M +  I P+ +TF+ +LSAC+H+G+++ GR  F +MT+DY ++P++ HY
Subjt:  -------------------------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHY

Query:  GCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG
        GCM+D+L  +GL KEA E+I  ME EP+ +IW +LL  CK+HGN  + +   E L+ +EP N G Y LL ++ A    W EVA  R ++ ++G+ KK PG
Subjt:  GCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG

Query:  SSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPS
         S IE++  +H+F      HP + +IY +L E++  L+ AG++ + S
Subjt:  SSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPS

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.2e-8931.74Show/hide
Query:  SMPLQSPSRPSFKQTLLNRIKN---CSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQ
        S+P+++PS  S ++    R+++   C+N+N++  ++A +I+ N  +D  +  + ISA       +  V  F Q++ PNV + N++I+       P++   
Subjt:  SMPLQSPSRPSFKQTLLNRIKN---CSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQ

Query:  CYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSK------------LEKLSE--------------------
         +  M +   +  +++T+  L+KAC+    + + +M+H HI K G  S ++V  AL+D YS+             EK+SE                    
Subjt:  CYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSK------------LEKLSE--------------------

Query:  -ARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQM--PTKDIISWTTMITCYSQNKQYQDA---
         AR++FDEM +RD  +W TM+   AR  +M  A +LFE+MPERNT +W+TM+ GY++ G++E A ++F++M  P K++++WT +I  Y++    ++A   
Subjt:  -ARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQM--PTKDIISWTTMITCYSQNKQYQDA---

Query:  -------------------LAIYSETRL------------------------------------------------NGIIPDQIVIEGLAVHGYAEKALR
                           LA  +E+ L                                                  ++    ++ GL VHG+ ++A+ 
Subjt:  -------------------LAIYSETRL------------------------------------------------NGIIPDQIVIEGLAVHGYAEKALR

Query:  MFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVI
        +F+ M RE I P+ VTFI++L +C HAGL++EG   F SM + Y + P++ HYGC+VD+L + G LKEA++++++M  EPN +IWGALL  C++H    I
Subjt:  MFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVI

Query:  AKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTEL
        AK+ ++ L+ L+P + G+Y+LL ++ A  +DW  VA IR  MK  GVEK   G+S +ELE  IH+F+    SHP SD+IY +L  L
Subjt:  AKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTEL

AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein6.8e-8032.89Show/hide
Query:  FKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISA-------SFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLE
        FK   L  +++CS+ ++L +++  +++++ I D F+  + ++        +   N + Y    F+Q++NPN+FV+N +I+ F     P +    Y  ML+
Subjt:  FKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISA-------SFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLE

Query:  GSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEM
         S + P++ TF  L+KA + M  V +G+  H  I + GF++ ++V+ +LV  Y+    ++ A ++F +M  RD  +WT+MV+   + G ++ AR++F+EM
Subjt:  GSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEM

Query:  PERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDAL---------AIYSETRLNGIIPDQIV------------------
        P RN  TW+ MI+GYA+    E A  LF  M  + +++  T++     +  +  AL          + S   +N I+   +V                  
Subjt:  PERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDAL---------AIYSETRLNGIIPDQIV------------------

Query:  ------------IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELI
                    I+GLAVHG+A KA+  F+ M     +P  VTF ++LSAC+H GLVE+G   + +M +D+GI P + HYGC+VDML +AG L EA   I
Subjt:  ------------IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELI

Query:  KSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASAD-S
          M  +PN+ I GALL  CK++ N+ +A+     L+ ++P +SG+Y LL ++ A    W ++  +R MMKE+ V KK PG S IE++G I++F+   D  
Subjt:  KSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASAD-S

Query:  HPDSDKIYFVLTELDGQLKLAGY
        HP+  KI     E+ G+++L GY
Subjt:  HPDSDKIYFVLTELDGQLKLAGY

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-8332.94Show/hide
Query:  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFIS---ASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFS
        ++ CS   EL  ++A M+K+  +QD + + +F+S   +S + + + Y    F   + P+ F++N MI+GF     P R L  Y  ML  S+   N+YTF 
Subjt:  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFIS---ASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFS

Query:  SLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMI
        SL+KAC+ + A E    +H  I K G+E+ ++   +L++ Y+       A  +FD + E D  +W +++    + G MD A  LF +M E+N  +W TMI
Subjt:  SLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMI

Query:  DGYARLGNVESAELLFNQMPTKDI---------------------------------------ISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQIV
         GY +    + A  LF++M   D+                                       +    +I  Y++  + ++AL ++   +   +     +
Subjt:  DGYARLGNVESAELLFNQMPTKDI---------------------------------------ISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQIV

Query:  IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIW
        I G A HG+  +A+  F  M++  I PN +TF ++L+AC++ GLVEEG+  F SM RDY + P I HYGC+VD+L +AGLL EA   I+ M  +PN++IW
Subjt:  IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIW

Query:  GALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTE
        GALL  C++H N  + ++  E L+ ++P + G Y    ++ A +K W + A  R +MKEQGV  K PG S I LEGT H+F A   SHP+ +KI      
Subjt:  GALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTE

Query:  LDGQLKLAGYILE
        +  +L+  GY+ E
Subjt:  LDGQLKLAGYILE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTCATTTGTGACTACGATCGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTGTAAGTCCTTCAATCTCAATGCCTCTTCAATCACCATCTCGTCCTTCTTT
CAAGCAAACTCTGCTTAATCGAATAAAAAACTGTTCCAACATAAACGAACTGCATGTTGTATATGCTTCCATGATCAAAAGTAATGCAATCCAAGATTGTTTTCTGGTGC
ATCAGTTTATTAGCGCGTCTTTTGCTTTTAACTCTGTACATTACCCAGTTTTCGCCTTTACCCAGATGGAAAACCCTAATGTTTTTGTGTATAATGCGATGATTAAGGGA
TTTGTATACCGTGGGTACCCATTTCGTGGTTTACAATGTTATGTACATATGTTGGAAGGATCGAACGTTTTGCCAAATAGTTATACGTTTTCTTCGTTGGTTAAAGCTTG
CACCTTTATGTGTGCTGTTGAGTTGGGACAGATGGTGCATTGTCACATTTGGAAGAAGGGGTTTGAATCCCATTTGTTTGTTCAAACTGCTTTGGTTGATTTTTACTCGA
AGTTGGAGAAACTTAGTGAGGCAAGAAAGGTGTTTGATGAAATGTGTGAAAGAGATGCTTTTGCATGGACTACTATGGTTTCTGCTCTAGCTCGTGTTGGAGATATGGAT
ACGGCTAGGAAGTTGTTTGAGGAGATGCCTGAAAGGAATACTGCAACTTGGAATACCATGATTGACGGCTATGCAAGATTGGGAAATGTGGAGTCTGCAGAGCTTCTGTT
CAATCAGATGCCAACTAAGGACATAATCTCCTGGACAACCATGATCACTTGTTATTCACAGAACAAGCAATATCAAGATGCGTTGGCAATTTATAGTGAGACGAGATTGA
ATGGGATTATTCCTGATCAGATTGTAATTGAAGGGCTTGCAGTTCATGGTTATGCGGAGAAGGCTTTGAGGATGTTCGCTATCATGGAGAGGGAGAAGATCCTGCCCAAT
GGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGAAGAAGGCAGGAGTAGATTTTTGAGCATGACTCGTGATTATGGCATTTCTCCTGAAAT
CAGACACTACGGTTGCATGGTTGATATGTTAAGTAAAGCAGGATTGCTCAAAGAAGCATTAGAATTGATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAG
CCTTGTTGAATGGGTGCAAACTTCACGGGAACTCTGTGATTGCAAAAGATGCTGTTGAACAGTTGATGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTT
AGCATGTGTGCTGAAGAGAAGGATTGGATGGAGGTTGCGCATATTCGATTAATGATGAAAGAACAAGGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGA
AGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCGTACTGACAGAACTAGATGGACAACTGAAGCTAGCTGGTTACATAC
TTGAGCCTTCAGTATGCAGTACTGCTTTGGTTTTTCCAGAGGAAATTTGA
mRNA sequenceShow/hide mRNA sequence
GTTGGAAATGGGAGAGGGCGGGAGATGAGAGAAATGAAGAACAACCAAAATGGGTGCATTTCGATTCTTCTTAAGTGAAAAACTCGACTTCTCAACTTCTATAATCTCCA
GAATGTTTCAATTAAATGCGAAAGAATAGCTGCACCCTGCAGGGATGTTCTCATTTGTGACTACGATCGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTGTAAGT
CCTTCAATCTCAATGCCTCTTCAATCACCATCTCGTCCTTCTTTCAAGCAAACTCTGCTTAATCGAATAAAAAACTGTTCCAACATAAACGAACTGCATGTTGTATATGC
TTCCATGATCAAAAGTAATGCAATCCAAGATTGTTTTCTGGTGCATCAGTTTATTAGCGCGTCTTTTGCTTTTAACTCTGTACATTACCCAGTTTTCGCCTTTACCCAGA
TGGAAAACCCTAATGTTTTTGTGTATAATGCGATGATTAAGGGATTTGTATACCGTGGGTACCCATTTCGTGGTTTACAATGTTATGTACATATGTTGGAAGGATCGAAC
GTTTTGCCAAATAGTTATACGTTTTCTTCGTTGGTTAAAGCTTGCACCTTTATGTGTGCTGTTGAGTTGGGACAGATGGTGCATTGTCACATTTGGAAGAAGGGGTTTGA
ATCCCATTTGTTTGTTCAAACTGCTTTGGTTGATTTTTACTCGAAGTTGGAGAAACTTAGTGAGGCAAGAAAGGTGTTTGATGAAATGTGTGAAAGAGATGCTTTTGCAT
GGACTACTATGGTTTCTGCTCTAGCTCGTGTTGGAGATATGGATACGGCTAGGAAGTTGTTTGAGGAGATGCCTGAAAGGAATACTGCAACTTGGAATACCATGATTGAC
GGCTATGCAAGATTGGGAAATGTGGAGTCTGCAGAGCTTCTGTTCAATCAGATGCCAACTAAGGACATAATCTCCTGGACAACCATGATCACTTGTTATTCACAGAACAA
GCAATATCAAGATGCGTTGGCAATTTATAGTGAGACGAGATTGAATGGGATTATTCCTGATCAGATTGTAATTGAAGGGCTTGCAGTTCATGGTTATGCGGAGAAGGCTT
TGAGGATGTTCGCTATCATGGAGAGGGAGAAGATCCTGCCCAATGGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGAAGAAGGCAGGAGT
AGATTTTTGAGCATGACTCGTGATTATGGCATTTCTCCTGAAATCAGACACTACGGTTGCATGGTTGATATGTTAAGTAAAGCAGGATTGCTCAAAGAAGCATTAGAATT
GATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAGCCTTGTTGAATGGGTGCAAACTTCACGGGAACTCTGTGATTGCAAAAGATGCTGTTGAACAGTTGA
TGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTTAGCATGTGTGCTGAAGAGAAGGATTGGATGGAGGTTGCGCATATTCGATTAATGATGAAAGAACAA
GGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGAAGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCGT
ACTGACAGAACTAGATGGACAACTGAAGCTAGCTGGTTACATACTTGAGCCTTCAGTATGCAGTACTGCTTTGGTTTTTCCAGAGGAAATTTGATCAACATTAATTGAGG
TCATAGTGAGATCGAATATTATTTGCATATCAATCATTTCAGCTTCATTGAATATGGTACATTGAACTGAAGGGAAAATTCTTGAGGTCAAGTGCTAAATGTCAAAGCAG
GGCTACTATAAGAGTTCATAATTATTCAGATCAAGGCTCAAGTTAGCCTCATTAAGAGCCATGCTATCAAAAGAAGGGGCAGGGACTAAATTTATAACGAGATAACGTAC
CACTATTGTACAACCGAAGCTGACTTAGTGAAGTGATTTGATCGAAAAACCCAAGCTAACAGGATATCCTGTAACAGGTCGGTCCTTTTTCCTCCTTTGTACAGCACTTC
CGTATATTTCAAATACATACATGATCATATTGTCTGCTTATGCACTTCCTTAGAAACAAGCTAGGTTTTGAGGAAAATTGAAACCAAAACTTATCAGAATCTTCACGTAT
AATTTCATTCATGACGATAAATTGCCAATACCGTTTGAACCGTGTTATAATATTTCTAAAACATCCAACTTGTCCCACAGCTTTTTTTTGAACTATGTTATAATATTTCC
AAAACATCCAACTCGTCCGACAGAATTTTTTTAGGTTGGAATGGTGTGGAGAAAATAAAACTCCTAGCTTGATCTTATATTGTTTTAATTTTCTTTTTCTAAACGAAGAA
CATGCCAAATCTTCTCTTTCCTACTTCAAGAGTGTTCTTAATCCATCAATTTAGCATTGGAGAATGCTAAAACACTGGCTAAAATTGAATGGAAGTGCGACTCTCGTCTT
CAAGCACCAGCCATTTTTATTATTATGGGAGTAGCTTGCTTTCTTCTGGCAAGATACATGGTTCAAGGGATAGGGGAAATTCTTCAAACCTTGTAGGCTATGCATCTGCT
GTAAGATATGGTTCTTAGGGAACATTATACACGTTTCTATCTTAGTCTCTTTTCATTTTCTTAATGAAAAGTTTCGTATCATTTCAAGGTTCAACGATAGTGATTGCTGT
TATTCCACTAGTTTGACTTTACCTTACCAAGTTGGTGGGGGAAAATCCTGCCCAAGTATCGCATAGTCTTTCTCATACTCTGCCAAAGTAGAAGTGTGTGCTTTCTGACC
TTGCTATAAGCTTACATCCACAAAGATCATGAAAATTTGTAAAAAGGAGGTGGATTAATCAGGTTGATTATAGAGACAGAAGTGGCCTCTGTTTACTAACAAGTTAGTAT
TGTATTTTTTTTTTTTTTTTTTTTTTTTTTTGAGCATTTTATCACAATAAGAAAGGGAAAAAAATTGTTTCACTAGACACTTATCGGCTTTAGAGACGCCCAAATTCATA
GTTTCTTCTAACAGGATTTCGTGCACTTGGCAATGGACCAGGATTGGGTTAATATTTAATCACATGATATTTTTACGTTCCAAATTTGTTTAAAAAATGATCTGAAGTAC
TTACTGCTTCCGGTTCAATTCACTAGACCCTTTTATAGTTCTCTCGTTTTTGAGAGTAACAAAAAAGTCTGTTGTTGTTCTTTAGTGGTGTAAACCTTTTCTCCTTCCAA
CAATTATAGTCTAAATAGTCTTTTCCTTTTAGTTTAGAACTTGAGGCCCTCGTTATTCTCATATCTCATACCATCTATGAACTTGTTTTTTATTTTAAAAGAAGGTCCGT
GCTCGAGATGAAAACTTCTTGAAAAATTCAGAACAAAAGGCCATGATTTGGCTAATAATTTAACTGAGATTAAATAAAAGTACAGTAAGTCTAACTTATCTAAGCATATA
ATCGATTTGGATGAAACTAATGTACGGATATAGAGCTAACGATCTAAACGTTGTTGCACAAACTAACAATTAGAAGACGGTATATAGCTTGAGACTTAAGATGAAAAGTA
TACACCTCCCCTCCCCAAAGCTTAGAGATAAAAATTGATCCCCACACCAAAAGCCCTCTCGTCCTTTGAATTTACTTTTTAATTGACCATCCGTTTAGGAATCATCTTCA
AAACCTACTCTTGGCTTATATAGAATGAGCGGAATCATCGTATATCTAAAGATAAAGAATCAGATTTCACACATTTTTTTATCACTTTTGTATACATCTCTATCATGGTG
TAAATTTACTTCTAACTTTTGTGACTATAATCTCACCTCTTAAATCTCAATGGAATAGCATGATGTAATCTTTCTCGCCACTTGATGCCCTTTTGTAATTTCATTTTATC
AATGAAATTTGATCGACTATTTCTAAAATCCCCCACACCAAAAGAGGAAAAGCTATTGCCAGTACAAGGGAAACAAAACAGAGGAAAATGAACGAAGAGACTAGAATTAT
GTAAAGTGACAACTGACCGTGATAGGAAAAAGAAGAAAGTTTCATAAAACTCTTAAATAAATGATGATTAAAAAATAGTCTACAACCATTAGGGAGGGGCTTGACCGATC
AAAGCTCGAGTTCAACATCAACTGTATTCATGAAAAAGTCCAAAAAAAAAAAAAAAGAAAAGAAAAGAAAAATCTACGAGCCGAATAAGTTGATAAGGATGTCGCTATGG
CAACAATAGGTTTACCGACATAGAGGAGACATTGCTCAAACTGGAAGCCATTATTACTTTTTCTTTAAAACATTAAAAACCCCATCATCTGACCAAAAATTTAGACTTTT
TTTTTGGAGGGGGTGAAGAAAGAAACACCAAACAAAAACAAACAAGACTGCAAAACCTGAAGCCAAACTCAGTTGAAAGATGGAACTTTTTTTAAAAGGTTAAATCTTCT
CTCTGTGTCTTTTTAAATGGGATACAACCAACTTTTGATGGCCTGCACTGTACAGACTTTCTCGGCCTCATTCATTATGGAACATCTAATTAATTTTCTAAGTTGATTGG
GATGAGTAGTTTGTGTGGTTCAAAAGTAAAAGTACTGATAAGAAATTTAGAATAATGCTTCCAATTGTGTCAGTTTTTGTGCTGAAGCCTGAGGATAGAAAGAGTTTTTG
TAGTGTTGTGGGTTTGTGGGGCTTTCTTGCATCCTCGAAAACCTAGAAAGGAATTCCATCTTCTTAAAATTATAAAACAATGGTTCAATTATATATTTATTGAACAAATA
CTTGTACTTTGCCATATTACCCTTTAGCTCTTCGTGAACATTTCAAAACTATTGGAATAGAAATAGTTTAGAATTTTAGATGAAAGTTGAGACATCAAATTTTGTGGCGA
TTCTATTCTAACACCCTAGTTTTCCAGACTAGGATTCAAAATCCGAACTTAGCACCTTATAATCTTACATTACTCACAAATCAATATGGATCTTTTTAGCATTGTTTGTT
TTTACTCAGACGATTCATAGAAAATTTTTAAGAAGGCAGTCAACATAGAATTGCTTCCAAAGCT
Protein sequenceShow/hide protein sequence
MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKG
FVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMD
TARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQIVIEGLAVHGYAEKALRMFAIMEREKILPN
GVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLV
SMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI