CuGenDBv2

Sed0024338 (gene) of Chayote v1 genome

Gene ID: Sed0024338
Organism: Sechium edule (Chayote v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG06:3300890..3303802
RNA-Seq Expression: Sed0024338
Synteny: Sed0024338
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022147934.1 putative pentatricopeptide repeat-containing protein At4g17915 [Momordica charantia] (e value: 5.4e-147; % identity: 77.49)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYNTLMHCFFRLGKP EA RVF DII   L P P TFNT+INGLCKYGY   AIM  R LQ HGFVPQL+TYNIL++ LCK+     AR MLNEA DSGL
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
        EPDAITYTTLMKSC R RQYK G EIFFEMKNKGYA DGFAYCTV+GAFLKL RFEEA VC+ QM+ N +  DLVFYNTF++L+CKEG L+AAYKLLDE+
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS
        ESRGLE D+YTH+IITDGLCR GNI+ A R LNYMYTTG +SNLV LNCLIDRL KAGQID A+KLFESME RDS T+T LVHNLCKARRFRCASKLLLS
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS

Query:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY
        CIRGGMKVLKS Q AVIDGLC SGF+S+ARKLK KL+LARL+
Subjt:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY

XP_022932936.1 putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 [Cucurbita moschata] (e value: 7.3e-144; % identity: 74.85)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYNTLMHC FRLGKP EA R+F DII   L P P TFNT+INGLCKYGY   AIM  R LQ HGFVPQL+TYNIL++ LCK+     AR MLNEAMDSGL
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
        EP+A+TYTTLMKSCFRCRQYK G EIFFEMKN+GYA DGFAYCTV+GAFLKL RFEEA VC+ QM+ NG+  DLVFYNTF+NL+CKEG LEAAYK+LDE+
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS
        ES+GLE DDYTHSIITDGLCR GNIE A R LNYMYTTG  SN VALNCLI+RLGKAGQID A+KLFESMEIRDS  +T LVHNLCKARRFRCAS+LL+S
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS

Query:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY
        CI+GGMKVLKS +  VIDGL  SG++S+A K++ KL +ARL+
Subjt:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY

XP_022973869.1 putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 [Cucurbita maxima] (e value: 6.0e-146; % identity: 76.9)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYNTLMHC FRLGKP EA R+F DII   L P P TFNT+INGLCKYGY   AIM  R LQ HGFVPQL+TYNIL++ LCK+     A  MLNEAMDSGL
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
        EP A+TYTTLMKSCFRCRQYK G EIFFEMKNKGYA DGFAYCTV+GAFLKL RFEEA VC+ QM+ NG+  DLVFYNT +NL+CKEG LEAAYKLLDE+
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS
        ES GLE DDYTHSIITDGLCR GNIE A R LNYMYTTG +SNLVALNCLIDRLGKAGQID A+KLFESMEIRDS T+T LVHNLCKARRFRCASKLL+S
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS

Query:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY
        CI+GGMKVLKS Q  VIDGL  SGF+S+ARK++ KL +ARL+
Subjt:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY

XP_023554660.1 putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 3.2e-147; % identity: 77.19)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYNTLMHC FRLGKP EA RVF DII   L P P TFNT+INGLCKYGY   AIM  R LQ HGFVPQL+TYNIL++ LCK+     AR MLNEAMDSGL
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
        EPDA+TYTTLMKSCFRCRQYK G EIFFEMKNKGYA DGFAYCTV+GAFLKL RFEEA VC+ QM+ NG+  DLVFYNTF+NL+CKEG LEAAYKLLDE+
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS
        ES+GLE DDYTHSIIT+GLC  GNIE A R LNYMYTTG +SNLVALNCLIDRLGKAGQID A+KLFESMEIRDS T+T LVHNLCKARRFRCASKLL+S
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS

Query:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY
        CI+GGMKVLKS Q  VIDGL  SGF+S+ARK++ KL +A+L+
Subjt:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY

XP_023554661.1 putative pentatricopeptide repeat-containing protein At4g17915 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 3.2e-147; % identity: 77.19)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYNTLMHC FRLGKP EA RVF DII   L P P TFNT+INGLCKYGY   AIM  R LQ HGFVPQL+TYNIL++ LCK+     AR MLNEAMDSGL
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
        EPDA+TYTTLMKSCFRCRQYK G EIFFEMKNKGYA DGFAYCTV+GAFLKL RFEEA VC+ QM+ NG+  DLVFYNTF+NL+CKEG LEAAYKLLDE+
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS
        ES+GLE DDYTHSIIT+GLC  GNIE A R LNYMYTTG +SNLVALNCLIDRLGKAGQID A+KLFESMEIRDS T+T LVHNLCKARRFRCASKLL+S
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS

Query:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY
        CI+GGMKVLKS Q  VIDGL  SGF+S+ARK++ KL +A+L+
Subjt:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B4M5 putative pentatricopeptide repeat-containing protein At4g17915 (e value: 6.2e-141; % identity: 72.59)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYNTLMHCFF LGKP EAYRVF DII   L P P TFNT+INGLCK+GY   A+M  R LQ HGF+PQL+TYNIL++ LCK+     A  MLNEA+DSGL
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
        EP+A+TYTTLMKSCFR RQY+ G EIF +MK+KGYA DGFAYCTV+GAFLKL RFEEA  C  QM+ N V  D+ FYNT +NL+CKEG LEAAYKLLD++
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS
        ESRGLECDDYTHSIIT+GLCRVGNIE A + LN +YTTG +SNLVALNCLIDRL KAGQID A++LFESME RDS T+T LVHNLCKARRFRCASKLL+S
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS

Query:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLYP
        C RGG+K+L++ + AVIDGL  SGF+S+ARKLKFKL+LARL+P
Subjt:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLYP

A0A5A7TEA6 Putative pentatricopeptide repeat-containing protein (e value: 6.2e-141; % identity: 72.59)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYNTLMHCFF LGKP EAYRVF DII   L P P TFNT+INGLCK+GY   A+M  R LQ HGF+PQL+TYNIL++ LCK+     A  MLNEA+DSGL
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
        EP+A+TYTTLMKSCFR RQY+ G EIF +MK+KGYA DGFAYCTV+GAFLKL RFEEA  C  QM+ N V  D+ FYNT +NL+CKEG LEAAYKLLD++
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS
        ESRGLECDDYTHSIIT+GLCRVGNIE A + LN +YTTG +SNLVALNCLIDRL KAGQID A++LFESME RDS T+T LVHNLCKARRFRCASKLL+S
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS

Query:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLYP
        C RGG+K+L++ + AVIDGL  SGF+S+ARKLKFKL+LARL+P
Subjt:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLYP

A0A6J1D2P0 putative pentatricopeptide repeat-containing protein At4g17915 (e value: 2.6e-147; % identity: 77.49)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYNTLMHCFFRLGKP EA RVF DII   L P P TFNT+INGLCKYGY   AIM  R LQ HGFVPQL+TYNIL++ LCK+     AR MLNEA DSGL
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
        EPDAITYTTLMKSC R RQYK G EIFFEMKNKGYA DGFAYCTV+GAFLKL RFEEA VC+ QM+ N +  DLVFYNTF++L+CKEG L+AAYKLLDE+
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS
        ESRGLE D+YTH+IITDGLCR GNI+ A R LNYMYTTG +SNLV LNCLIDRL KAGQID A+KLFESME RDS T+T LVHNLCKARRFRCASKLLLS
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS

Query:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY
        CIRGGMKVLKS Q AVIDGLC SGF+S+ARKLK KL+LARL+
Subjt:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY

A0A6J1EXR2 putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 (e value: 3.5e-144; % identity: 74.85)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYNTLMHC FRLGKP EA R+F DII   L P P TFNT+INGLCKYGY   AIM  R LQ HGFVPQL+TYNIL++ LCK+     AR MLNEAMDSGL
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
        EP+A+TYTTLMKSCFRCRQYK G EIFFEMKN+GYA DGFAYCTV+GAFLKL RFEEA VC+ QM+ NG+  DLVFYNTF+NL+CKEG LEAAYK+LDE+
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS
        ES+GLE DDYTHSIITDGLCR GNIE A R LNYMYTTG  SN VALNCLI+RLGKAGQID A+KLFESMEIRDS  +T LVHNLCKARRFRCAS+LL+S
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS

Query:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY
        CI+GGMKVLKS +  VIDGL  SG++S+A K++ KL +ARL+
Subjt:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY

A0A6J1IFW8 putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 (e value: 2.9e-146; % identity: 76.9)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYNTLMHC FRLGKP EA R+F DII   L P P TFNT+INGLCKYGY   AIM  R LQ HGFVPQL+TYNIL++ LCK+     A  MLNEAMDSGL
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIIC--LRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
        EP A+TYTTLMKSCFRCRQYK G EIFFEMKNKGYA DGFAYCTV+GAFLKL RFEEA VC+ QM+ NG+  DLVFYNT +NL+CKEG LEAAYKLLDE+
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS
        ES GLE DDYTHSIITDGLCR GNIE A R LNYMYTTG +SNLVALNCLIDRLGKAGQID A+KLFESMEIRDS T+T LVHNLCKARRFRCASKLL+S
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLS

Query:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY
        CI+GGMKVLKS Q  VIDGL  SGF+S+ARK++ KL +ARL+
Subjt:  CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY

SwissProt top hits (e value, % identity, alignment)
P0C043 Putative pentatricopeptide repeat-containing protein At4g17915 (e value: 3.4e-83; % identity: 46.9)
Query:  SYNTLMHCFFRLGKPGEAYRVF---MDIICLRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSG
        SYNTLM C+F+LGK  EA+RV    + +  L P P T+N L++ LCK GY   A+   + +Q   F P+L+TYNIL++ LCK      A+ ML E   SG
Subjt:  SYNTLMHCFFRLGKPGEAYRVF---MDIICLRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSG

Query:  LEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDE
          P+A+TYTT++K  F+ R+ + G+++F EMK +GY  DG+AY  VV A +K  R +EA   + ++V  G  +D+V YNT +NL+ K+GNL+A   LL E
Subjt:  LEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDE

Query:  VESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLL
        +E RG++ D+YTH+II +GL R G    AE     M   G   NLV  NCL+D L KAG +D A++ FESME++D +T+T +VHNLCK  RF CASKLLL
Subjt:  VESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLL

Query:  SCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNL
        SC   G+K+  SA+ AV+ GL  SG   +ARK K ++ L
Subjt:  SCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNL

Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial (e value: 8.1e-37; % identity: 25)
Query:  SYNTLMHCFFRLGKPGEAYR--VFMDIICLRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYN ++H   +LG+  EA+   + M++    P   +++T++NG C++G        + +++  G  P    Y  ++  LC++C   +A    +E +  G+
Subjt:  SYNTLMHCFFRLGKPGEAYR--VFMDIICLRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
         PD + YTTL+    +    +   + F+EM ++    D   Y  ++  F ++    EA     +M   G+E D V +   +N +CK G+++ A+++ + +
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEI----RDSHTFTYLVHNLCKARRFRCASK
           G   +  T++ + DGLC+ G++++A   L+ M+  G   N+   N +++ L K+G I+ AVKL    E      D+ T+T L+   CK+     A +
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEI----RDSHTFTYLVHNLCKARRFRCASK

Query:  LLLSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKL
        +L   +  G++      + +++G C  G      KL
Subjt:  LLLSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKL

Q3EDF8 Pentatricopeptide repeat-containing protein At1g09900 (e value: 6.9e-36; % identity: 30.22)
Query:  PLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGLEPDAITYTTLMKSCFRCRQYKCG--IEIFFE
        P   T+N L+NG+CK G    AI FL  +   G  P +IT+NI++ S+C    ++DA  +L + +  G  P  +T+  L+   F CR+   G  I+I  +
Subjt:  PLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGLEPDAITYTTLMKSCFRCRQYKCG--IEIFFE

Query:  MKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEVESRGLECDDYTHSIITDGLCRVGNIEAAE
        M   G   +  +Y  ++  F K  + + A   + +MV  G   D+V YNT +   CK+G +E A ++L+++ S+G      T++ + DGL + G    A 
Subjt:  MKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEVESRGLECDDYTHSIITDGLCRVGNIEAAE

Query:  RRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKL---FESMEIR-DSHTFTYLVHNLCKARRFRCASKLLLSCIRGGMKVLKSAQHAVIDGLCYSGF
        + L+ M       + +  + L+  L + G++D A+K    FE M IR ++ TF  ++  LCK+R+   A   L+  I  G K  +++   +I+GL Y G 
Subjt:  RRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKL---FESMEIR-DSHTFTYLVHNLCKARRFRCASKLLLSCIRGGMKVLKSAQHAVIDGLCYSGF

Query:  SSKARKLKFKLNLARLYPRSS
        + +A +L  +L    L  +SS
Subjt:  SSKARKLKFKLNLARLYPRSS

Q56XR6 Pentatricopeptide repeat-containing protein At5g46680 (e value: 2.5e-78; % identity: 44.87)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIICLRPLPA---TFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSG
        SYNTLM C+F+LG+ GEA+++  + I L  L     T+N L++ LCK G+   AI   + L+     P+L+TYNIL++ LCK         M+ E   SG
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIICLRPLPA---TFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSG

Query:  LEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVEN-DLVFYNTFMNLHCKEGNLEAAYKLLD
          P+A+TYTT++K  F+ ++ + G+++F +MK +GY  DGFA C VV A +K  R EEA  C+ ++V +G  + D+V YNT +NL+ K+GNL+A   LL+
Subjt:  LEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVEN-DLVFYNTFMNLHCKEGNLEAAYKLLD

Query:  EVESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLL
        E+E +GL+ DDYTH+II +GL  +GN   AE+ L  +   G   ++V  NCLID L KAG +D A++LF SME+RD  T+T +VHNLCK  R  CASKLL
Subjt:  EVESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLL

Query:  LSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLA
        LSC   GMK+  SA+ AV+ G+  +     ARK   K+  A
Subjt:  LSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLA

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic (e value: 1.2e-35; % identity: 28.12)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIICLR---PLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSG
        ++NTL++   + G    A  + MD++      P   T+N++I+GLCK G  + A+  L  +      P  +TYN L+ +LCK     +A  +       G
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIICLR---PLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSG

Query:  LEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDE
        + PD  T+ +L++     R ++  +E+F EM++KG   D F Y  ++ +     + +EA   + QM ++G    ++ YNT ++  CK      A ++ DE
Subjt:  LEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDE

Query:  VESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIR----DSHTFTYLVHNLCKARRFRCAS
        +E  G+  +  T++ + DGLC+   +E A + ++ M   G   +    N L+    + G I  A  + ++M       D  T+  L+  LCKA R   AS
Subjt:  VESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIR----DSHTFTYLVHNLCKARRFRCAS

Query:  KLLLSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLAR
        KLL S    G+ +   A + VI GL       + RK    +NL R
Subjt:  KLLLSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLAR

Arabidopsis top hits (e value, % identity, alignment)
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 5.8e-38; % identity: 25)
Query:  SYNTLMHCFFRLGKPGEAYR--VFMDIICLRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYN ++H   +LG+  EA+   + M++    P   +++T++NG C++G        + +++  G  P    Y  ++  LC++C   +A    +E +  G+
Subjt:  SYNTLMHCFFRLGKPGEAYR--VFMDIICLRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
         PD + YTTL+    +    +   + F+EM ++    D   Y  ++  F ++    EA     +M   G+E D V +   +N +CK G+++ A+++ + +
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEI----RDSHTFTYLVHNLCKARRFRCASK
           G   +  T++ + DGLC+ G++++A   L+ M+  G   N+   N +++ L K+G I+ AVKL    E      D+ T+T L+   CK+     A +
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEI----RDSHTFTYLVHNLCKARRFRCASK

Query:  LLLSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKL
        +L   +  G++      + +++G C  G      KL
Subjt:  LLLSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKL

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 5.8e-38; % identity: 25)
Query:  SYNTLMHCFFRLGKPGEAYR--VFMDIICLRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL
        SYN ++H   +LG+  EA+   + M++    P   +++T++NG C++G        + +++  G  P    Y  ++  LC++C   +A    +E +  G+
Subjt:  SYNTLMHCFFRLGKPGEAYR--VFMDIICLRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGL

Query:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV
         PD + YTTL+    +    +   + F+EM ++    D   Y  ++  F ++    EA     +M   G+E D V +   +N +CK G+++ A+++ + +
Subjt:  EPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEV

Query:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEI----RDSHTFTYLVHNLCKARRFRCASK
           G   +  T++ + DGLC+ G++++A   L+ M+  G   N+   N +++ L K+G I+ AVKL    E      D+ T+T L+   CK+     A +
Subjt:  ESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEI----RDSHTFTYLVHNLCKARRFRCASK

Query:  LLLSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKL
        +L   +  G++      + +++G C  G      KL
Subjt:  LLLSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKL

AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 4.9e-37; % identity: 30.22)
Query:  PLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGLEPDAITYTTLMKSCFRCRQYKCG--IEIFFE
        P   T+N L+NG+CK G    AI FL  +   G  P +IT+NI++ S+C    ++DA  +L + +  G  P  +T+  L+   F CR+   G  I+I  +
Subjt:  PLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSGLEPDAITYTTLMKSCFRCRQYKCG--IEIFFE

Query:  MKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEVESRGLECDDYTHSIITDGLCRVGNIEAAE
        M   G   +  +Y  ++  F K  + + A   + +MV  G   D+V YNT +   CK+G +E A ++L+++ S+G      T++ + DGL + G    A 
Subjt:  MKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEVESRGLECDDYTHSIITDGLCRVGNIEAAE

Query:  RRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKL---FESMEIR-DSHTFTYLVHNLCKARRFRCASKLLLSCIRGGMKVLKSAQHAVIDGLCYSGF
        + L+ M       + +  + L+  L + G++D A+K    FE M IR ++ TF  ++  LCK+R+   A   L+  I  G K  +++   +I+GL Y G 
Subjt:  RRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKL---FESMEIR-DSHTFTYLVHNLCKARRFRCASKLLLSCIRGGMKVLKSAQHAVIDGLCYSGF

Query:  SSKARKLKFKLNLARLYPRSS
        + +A +L  +L    L  +SS
Subjt:  SSKARKLKFKLNLARLYPRSS

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 8.3e-37; % identity: 28.12)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIICLR---PLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSG
        ++NTL++   + G    A  + MD++      P   T+N++I+GLCK G  + A+  L  +      P  +TYN L+ +LCK     +A  +       G
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIICLR---PLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSG

Query:  LEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDE
        + PD  T+ +L++     R ++  +E+F EM++KG   D F Y  ++ +     + +EA   + QM ++G    ++ YNT ++  CK      A ++ DE
Subjt:  LEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDE

Query:  VESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIR----DSHTFTYLVHNLCKARRFRCAS
        +E  G+  +  T++ + DGLC+   +E A + ++ M   G   +    N L+    + G I  A  + ++M       D  T+  L+  LCKA R   AS
Subjt:  VESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIR----DSHTFTYLVHNLCKARRFRCAS

Query:  KLLLSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLAR
        KLL S    G+ +   A + VI GL       + RK    +NL R
Subjt:  KLLLSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLAR

AT5G46680.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 1.8e-79; % identity: 44.87)
Query:  SYNTLMHCFFRLGKPGEAYRVFMDIICLRPLPA---TFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSG
        SYNTLM C+F+LG+ GEA+++  + I L  L     T+N L++ LCK G+   AI   + L+     P+L+TYNIL++ LCK         M+ E   SG
Subjt:  SYNTLMHCFFRLGKPGEAYRVFMDIICLRPLPA---TFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEAMDSG

Query:  LEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVEN-DLVFYNTFMNLHCKEGNLEAAYKLLD
          P+A+TYTT++K  F+ ++ + G+++F +MK +GY  DGFA C VV A +K  R EEA  C+ ++V +G  + D+V YNT +NL+ K+GNL+A   LL+
Subjt:  LEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVEN-DLVFYNTFMNLHCKEGNLEAAYKLLD

Query:  EVESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLL
        E+E +GL+ DDYTH+II +GL  +GN   AE+ L  +   G   ++V  NCLID L KAG +D A++LF SME+RD  T+T +VHNLCK  R  CASKLL
Subjt:  EVESRGLECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLL

Query:  LSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLA
        LSC   GMK+  SA+ AV+ G+  +     ARK   K+  A
Subjt:  LSCIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLA
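
Note on reading the alignments: in each block above, the unlabeled middle row marks identical residues with the residue letter, conservative substitutions with "+", and mismatches or gaps with a space. As a rough illustration, the sketch below counts identities and positives for one block from its Query row and middle row. The function name midline_stats is hypothetical and not part of CuGenDB or BLAST, and the % identity values listed with each hit presumably cover the full alignment, so a single block will not necessarily reproduce them.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    `query` is the residue string of a Query row (gaps included) and
    `midline` is the unlabeled row beneath it. Hypothetical helper for
    illustration only.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment row")
    # Trailing spaces in the midline are often lost in plain-text dumps,
    # so pad it back to the length of the query row.
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length,
    }


if __name__ == "__main__":
    # Last block of the first GenBank hit (XP_022147934.1) on this page.
    query = "CIRGGMKVLKSAQHAVIDGLCYSGFSSKARKLKFKLNLARLY"
    midline = "CIRGGMKVLKS Q AVIDGLC SGF+S+ARKLK KL+LARL+"
    print(midline_stats(query, midline))
```

Summing identities and lengths over all blocks of a hit, rather than averaging per-block percentages, would give a figure closer in spirit to the listed % identity.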


Sequences
CDS sequence
ATGTTGCCCAAACCCTCTAAAAACCCCTTTTTCTCTTCCTTCATGTATGGAAGTTACAACACATTGATGCACTGTTTCTTCAGATTAGGAAAGCCAGGTGAAGCTTATAG
AGTTTTCATGGATATTATATGCCTCCGTCCTCTTCCAGCTACATTTAATACATTGATTAACGGCCTTTGTAAATATGGATATGCACGTACTGCCATTATGTTTTTGAGAA
TTTTACAATGCCATGGATTTGTTCCTCAATTAATTACATATAATATTCTTGTTGATAGTTTATGCAAGTTGTGTTGGTTTGTGGATGCTAGGTTGATGCTCAATGAGGCC
ATGGATTCAGGACTTGAGCCTGATGCCATAACATACACTACATTGATGAAAAGCTGCTTTAGATGCAGGCAATATAAATGTGGAATTGAGATTTTCTTTGAGATGAAAAA
CAAAGGATATGCTATTGATGGCTTTGCTTACTGCACAGTTGTTGGTGCTTTTCTTAAGTTAGATAGGTTTGAAGAGGCAACTGTTTGCATTGGACAGATGGTAATGAATG
GAGTGGAAAATGATTTAGTTTTTTATAACACATTTATGAACTTGCATTGTAAAGAAGGTAATTTGGAGGCTGCATATAAGTTGTTGGATGAAGTAGAGTCACGGGGACTA
GAATGCGACGATTACACGCATTCAATAATAACTGATGGATTGTGCAGGGTTGGAAATATTGAGGCGGCGGAGCGACGTTTGAATTACATGTATACAACAGGCTGTTCTTC
AAATCTGGTAGCCTTAAATTGTCTAATTGACAGGTTGGGTAAGGCTGGTCAGATTGATCTTGCAGTGAAATTGTTCGAATCAATGGAAATAAGGGATTCTCACACTTTTA
CCTACTTGGTGCACAATCTTTGCAAGGCAAGGCGGTTTCGTTGTGCGTCGAAGTTACTGCTTTCCTGTATAAGGGGTGGCATGAAGGTTCTTAAGTCCGCACAACATGCA
GTTATTGATGGCCTTTGTTATTCAGGATTTTCAAGTAAAGCAAGGAAGCTCAAATTCAAATTAAATTTGGCTCGGCTCTATCCTCGATCATCTTGA
mRNA sequence
ATGTTGCCCAAACCCTCTAAAAACCCCTTTTTCTCTTCCTTCATGTATGGAAGTTACAACACATTGATGCACTGTTTCTTCAGATTAGGAAAGCCAGGTGAAGCTTATAG
AGTTTTCATGGATATTATATGCCTCCGTCCTCTTCCAGCTACATTTAATACATTGATTAACGGCCTTTGTAAATATGGATATGCACGTACTGCCATTATGTTTTTGAGAA
TTTTACAATGCCATGGATTTGTTCCTCAATTAATTACATATAATATTCTTGTTGATAGTTTATGCAAGTTGTGTTGGTTTGTGGATGCTAGGTTGATGCTCAATGAGGCC
ATGGATTCAGGACTTGAGCCTGATGCCATAACATACACTACATTGATGAAAAGCTGCTTTAGATGCAGGCAATATAAATGTGGAATTGAGATTTTCTTTGAGATGAAAAA
CAAAGGATATGCTATTGATGGCTTTGCTTACTGCACAGTTGTTGGTGCTTTTCTTAAGTTAGATAGGTTTGAAGAGGCAACTGTTTGCATTGGACAGATGGTAATGAATG
GAGTGGAAAATGATTTAGTTTTTTATAACACATTTATGAACTTGCATTGTAAAGAAGGTAATTTGGAGGCTGCATATAAGTTGTTGGATGAAGTAGAGTCACGGGGACTA
GAATGCGACGATTACACGCATTCAATAATAACTGATGGATTGTGCAGGGTTGGAAATATTGAGGCGGCGGAGCGACGTTTGAATTACATGTATACAACAGGCTGTTCTTC
AAATCTGGTAGCCTTAAATTGTCTAATTGACAGGTTGGGTAAGGCTGGTCAGATTGATCTTGCAGTGAAATTGTTCGAATCAATGGAAATAAGGGATTCTCACACTTTTA
CCTACTTGGTGCACAATCTTTGCAAGGCAAGGCGGTTTCGTTGTGCGTCGAAGTTACTGCTTTCCTGTATAAGGGGTGGCATGAAGGTTCTTAAGTCCGCACAACATGCA
GTTATTGATGGCCTTTGTTATTCAGGATTTTCAAGTAAAGCAAGGAAGCTCAAATTCAAATTAAATTTGGCTCGGCTCTATCCTCGATCATCTTGA
Protein sequence
MLPKPSKNPFFSSFMYGSYNTLMHCFFRLGKPGEAYRVFMDIICLRPLPATFNTLINGLCKYGYARTAIMFLRILQCHGFVPQLITYNILVDSLCKLCWFVDARLMLNEA
MDSGLEPDAITYTTLMKSCFRCRQYKCGIEIFFEMKNKGYAIDGFAYCTVVGAFLKLDRFEEATVCIGQMVMNGVENDLVFYNTFMNLHCKEGNLEAAYKLLDEVESRGL
ECDDYTHSIITDGLCRVGNIEAAERRLNYMYTTGCSSNLVALNCLIDRLGKAGQIDLAVKLFESMEIRDSHTFTYLVHNLCKARRFRCASKLLLSCIRGGMKVLKSAQHA
VIDGLCYSGFSSKARKLKFKLNLARLYPRSS
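
The protein sequence above appears to be the direct translation of the CDS under the standard genetic code. A minimal sketch of that check follows; the table is the standard code (NCBI translation table 1) written in the usual TCAG order, the function name translate is hypothetical, and only the first and last few codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in as-is. Unknown codons (for example
    those containing N) are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First eight and last five codons of the CDS shown on this page.
    print(translate("ATGTTGCCCAAACCCTCTAAAAAC"))  # start of the protein
    print(translate("CCTCGATCATCTTGA"))           # end of the protein, then stop
```

Pasting the full CDS lines into translate and comparing the result with the protein sequence lines joined together is a quick way to confirm the two records are consistent.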