; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0016884 (gene) of Chayote v1 genome

Gene IDSed0016884
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG04:47880461..47885896
RNA-Seq ExpressionSed0016884
SyntenySed0016884
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011009 - Protein kinase-like domain superfamily
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6590519.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]3.4e-9244.12Show/hide
Query:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY
        F F  LLN     KSASLDQTLL FSS  H SKN++SWTSLI  F+RS RPFH L+FFNH+ R SR+YPNHYT SAVLS       S H   M SL+  +
Subjt:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY

Query:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL
                                                         NK Y  A LFF+ LLL  LT  DEVSFSSA  L+ACAN  N +FGKQVHG+
Subjt:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL

Query:  SLKLVI----------------------------------------------------------------------------------------------
        +LKL +                                                                                              
Subjt:  SLKLVI----------------------------------------------------------------------------------------------

Query:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF
                   CVASSLV MYAKC SLVDAFQIF E+ D NVV WT IIAACQQHGHAN+V          G+KPDYITFVS+LS CSHTGRVEEGFFYF
Subjt:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF

Query:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE
        +SM KVHGI   YEHYACI+DLLGR G   + +                  L AC++H+NLEM KEVA + F+LEPDNP NYVLLCNILTRNG+L EADE
Subjt:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE

Query:  IRRKMESIEVKGNPNISYI
        +RRKME+I V+  P  S+I
Subjt:  IRRKMESIEVKGNPNISYI

KAG7024054.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-9143.93Show/hide
Query:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY
        F F  LLN     KSASLD TLLLFSSA H SKN++SWTSLI  F+RS RPFH L+FFNH+ R S +YPNHYT SAVLS       S H   M SL+  +
Subjt:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY

Query:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL
                                                         NK Y  A LFF+ LLL  LT  DEVSFSSA  L+ACAN  N +FGKQVHG+
Subjt:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL

Query:  SLKLVI----------------------------------------------------------------------------------------------
        +LKL +                                                                                              
Subjt:  SLKLVI----------------------------------------------------------------------------------------------

Query:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF
                   CVASSLV MYAKC SLVDAFQIF E+ D NVV WT IIAACQQHGHAN+V          G+KPDYITFVS+LS CSHTGRVEEGFFYF
Subjt:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF

Query:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE
        +SM KVHGI   YEHYACI+DLLGR G   + +                  L AC++H+NLEM KEVA + F+LEPDNP NYVLLCNILTRNG+L EADE
Subjt:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE

Query:  IRRKMESIEVKGNPNISYI
        +RRKME++ V+  P  S+I
Subjt:  IRRKMESIEVKGNPNISYI

XP_022961446.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isoform X1 [Cucurbita moschata]9.8e-9244.12Show/hide
Query:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY
        F F  LLN     KSASLD TLLLFSSA H SKN++SWTSLI  F+RS RPFH L+FFNH+ R S +YPNHYT SAVLS       S H   M SL+  +
Subjt:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY

Query:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL
                                                         NK Y  A LFF+ LLL  LT  DEVSFSSA  L+ACAN  N +FGKQVHG+
Subjt:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL

Query:  SLKLVI----------------------------------------------------------------------------------------------
        +LKL +                                                                                              
Subjt:  SLKLVI----------------------------------------------------------------------------------------------

Query:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF
                   CVASSLV MYAKC SLVDAFQIF E+ D NVV WT IIAACQQHGHAN+V          G+KPDYITFVS+LS CSHTGRVEEGFFYF
Subjt:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF

Query:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE
        +SM KVHGI   YEHYACI+DLLGR G   + +                  L AC++H+NLEM KEVA + F+LEPDNP NYVLLCNILTRNG+L EADE
Subjt:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE

Query:  IRRKMESIEVKGNPNISYI
        +RRKME+I V+  P  S+I
Subjt:  IRRKMESIEVKGNPNISYI

XP_023538639.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isoform X1 [Cucurbita pepo subsp. pepo]6.3e-9143.74Show/hide
Query:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY
        F F  LLN     KS SLDQTLL FSS  H SKN++SWTSLI  F+RS RPFH L+FFNH+ R S +YPNHYT SAVLS       S H   M SL+  +
Subjt:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY

Query:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL
                                                         NK Y  A LFF+ LLL  LT  DEVSFSSA  L+ACAN  N +FGKQVHG+
Subjt:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL

Query:  SLKLVI----------------------------------------------------------------------------------------------
        +LKL +                                                                                              
Subjt:  SLKLVI----------------------------------------------------------------------------------------------

Query:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF
                   CVASSLV MYAKC SLVDAFQIF E+ D NVV WT IIAACQQHGHAN+V          G+KPDYITFVS+LS CSHTGRVEEGFFYF
Subjt:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF

Query:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE
        +SM KVHGI   YEHYACI+DLLGR G   + +                  L AC++H+NLEM KEVA + F+LEPDNP NYVLLCNILTRNG+L EADE
Subjt:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE

Query:  IRRKMESIEVKGNPNISYI
        +RRKME+I V+  P  S+I
Subjt:  IRRKMESIEVKGNPNISYI

XP_038878947.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like [Benincasa hispida]1.1e-9043.55Show/hide
Query:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY
        F F  LLN     K  S+DQTLLLFSSA   SKN++SWTSLI  F RS RPF  L+FFNH+RRS  +YPNHYT SAVLS       S H   M SL+  +
Subjt:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY

Query:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL
                                                         N+ Y  A LFF+ LLL   T  DEVSFSSA  L+ACAN  NL FGKQVHG+
Subjt:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL

Query:  SLKLVI----------------------------------------------------------------------------------------------
        SLKL +                                                                                              
Subjt:  SLKLVI----------------------------------------------------------------------------------------------

Query:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF
                   CVASSL+ MYAKC SLVDAFQIF E+ + NVV WT IIAACQQHGH NRV          G+KPDYITFVS+LS CSHTGRV+EGFFYF
Subjt:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF

Query:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE
        +SM+KVHGI   YEHYAC++DLLGR G   + +                  L AC++H+NLEM KEVAL+ FDLEPDNP NYVLLCNILTRNG+LNEADE
Subjt:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE

Query:  IRRKMESIEVKGNPNISYI
        IRRKME+I V+  P  S+I
Subjt:  IRRKMESIEVKGNPNISYI

TrEMBL top hitse value%identityAlignment
A0A0A0LVV0 DYW_deaminase domain-containing protein6.6e-9447.58Show/hide
Query:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLS-----TFSTHTTTMASLM---
        F F  LLN     K  S+DQTLLLFSSA   SKN++SWTSLI    R  RPF  L+FFNH+RRS  +YPNHYT SAVLS     T S H   M SL+   
Subjt:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLS-----TFSTHTTTMASLM---

Query:  --LSYNKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIF-------------------------------------
          L+ NK Y  A  FF+ LLL  LT  DEVSFSS    +ACANA NL+FGKQVHG++LKL ++                                     
Subjt:  --LSYNKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIF-------------------------------------

Query:  -----------------------------------------C--------------------------VASSLVIMYAKCDSLVDAFQIFGESNDPNVVF
                                                 C                          VASSL+ MYAKC SLVDAFQIF E+ D NVV 
Subjt:  -----------------------------------------C--------------------------VASSLVIMYAKCDSLVDAFQIFGESNDPNVVF

Query:  WTTIIAACQQHGHAN----------RVGVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------
        WT IIAACQQHGHAN          R G+KPDYITFVS+LS CSHTGRVEEGFFYF+SM+KVHGI   +EHYACI+DLL R G   + +           
Subjt:  WTTIIAACQQHGHAN----------RVGVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------

Query:  ------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI
               L AC++H+NL M KEVALK FDLEPDNP NYVLLCNILTRNG+LNEADE+RRKMESI V+  P  S+I
Subjt:  ------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI

A0A5A7UBQ7 Pentatricopeptide repeat-containing protein3.3e-8540.39Show/hide
Query:  FKLHYH----HQTPTCFKFKRLLNAARTPKSAS------------------------------LDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHT
        FK H+H    H  PT   F  LLN+ RT K A+                              +DQTLLLFSSA   SK+++SWTSLI    RS RPF  
Subjt:  FKLHYH----HQTPTCFKFKRLLNAARTPKSAS------------------------------LDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHT

Query:  LSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY-------------------------------------------------NKGYT
        L+FFN +R S  +YPNHYTLSAVLS       S H   M SL+  +                                                 NK Y 
Subjt:  LSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY-------------------------------------------------NKGYT

Query:  SANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVI------------------------------------------------
         A  FF+ LLL  LT  DEVSFSS    +ACANA NL FGKQVHG++LKL +                                                
Subjt:  SANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVI------------------------------------------------

Query:  --------------------------------------------------------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQ
                                                                 CVASSL+ MYAKC SLVDAFQIF E  D NVV WT II ACQQ
Subjt:  --------------------------------------------------------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQ

Query:  HGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCLG-----------------L
        HGHANRV          G+KPDYITFVS+LS CSHTGRVEEGFFYF+SM+K+HGI   +EHYACI+DLL R G    NR                    L
Subjt:  HGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCLG-----------------L

Query:  GACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI
         AC++H+NL M KEVALK F+LEPDNP NYVLLCNILTRNG+LNEAD++RRKME I V+  P  S+I
Subjt:  GACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI

A0A5D3CIX2 Sister-chromatid cohesion protein 3 isoform X23.3e-8540.39Show/hide
Query:  FKLHYH----HQTPTCFKFKRLLNAARTPKSAS------------------------------LDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHT
        FK H+H    H  PT   F  LLN+ RT K A+                              +DQTLLLFSSA   SK+++SWTSLI    RS RPF  
Subjt:  FKLHYH----HQTPTCFKFKRLLNAARTPKSAS------------------------------LDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHT

Query:  LSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY-------------------------------------------------NKGYT
        L+FFN +R S  +YPNHYTLSAVLS       S H   M SL+  +                                                 NK Y 
Subjt:  LSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY-------------------------------------------------NKGYT

Query:  SANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVI------------------------------------------------
         A  FF+ LLL  LT  DEVSFSS    +ACANA NL FGKQVHG++LKL +                                                
Subjt:  SANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVI------------------------------------------------

Query:  --------------------------------------------------------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQ
                                                                 CVASSL+ MYAKC SLVDAFQIF E  D NVV WT II ACQQ
Subjt:  --------------------------------------------------------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQ

Query:  HGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCLG-----------------L
        HGHANRV          G+KPDYITFVS+LS CSHTGRVEEGFFYF+SM+K+HGI   +EHYACI+DLL R G    NR                    L
Subjt:  HGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCLG-----------------L

Query:  GACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI
         AC++H+NL M KEVALK F+LEPDNP NYVLLCNILTRNG+LNEAD++RRKME I V+  P  S+I
Subjt:  GACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI

A0A6J1HAE0 pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isoform X14.7e-9244.12Show/hide
Query:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY
        F F  LLN     KSASLD TLLLFSSA H SKN++SWTSLI  F+RS RPFH L+FFNH+ R S +YPNHYT SAVLS       S H   M SL+  +
Subjt:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY

Query:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL
                                                         NK Y  A LFF+ LLL  LT  DEVSFSSA  L+ACAN  N +FGKQVHG+
Subjt:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL

Query:  SLKLVI----------------------------------------------------------------------------------------------
        +LKL +                                                                                              
Subjt:  SLKLVI----------------------------------------------------------------------------------------------

Query:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF
                   CVASSLV MYAKC SLVDAFQIF E+ D NVV WT IIAACQQHGHAN+V          G+KPDYITFVS+LS CSHTGRVEEGFFYF
Subjt:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF

Query:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE
        +SM KVHGI   YEHYACI+DLLGR G   + +                  L AC++H+NLEM KEVA + F+LEPDNP NYVLLCNILTRNG+L EADE
Subjt:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE

Query:  IRRKMESIEVKGNPNISYI
        +RRKME+I V+  P  S+I
Subjt:  IRRKMESIEVKGNPNISYI

A0A6J1HVZ6 pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isoform X11.4e-8842.77Show/hide
Query:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY
        F F  LLN     KS S D TLLLFSS  H SKN++SWTSLI  F+RS RPFH L+FFN + R S ++PNHYT SAVLS       S H   M SL+  +
Subjt:  FKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLST-----FSTHTTTMASLMLSY

Query:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL
                                                         NK Y  A LFF+ LLL  LT  DEVSFSSA  L+ACAN  N +FGKQVHG+
Subjt:  -------------------------------------------------NKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGL

Query:  SLKLVI----------------------------------------------------------------------------------------------
        +LKL +                                                                                              
Subjt:  SLKLVI----------------------------------------------------------------------------------------------

Query:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF
                   CVASSLV MYAKC SLVDAFQ F E+ D NVV WT IIAACQQHGHAN+V          G+KPDYITFVS+LS CSHTGR+EEGFFYF
Subjt:  ----------FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYF

Query:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE
        +SM+KVHGI   YEHYACI+DLLGR G   + +                  L AC++H+NLEM KEVAL+ F+LEPDNP NYVLLCNILTRNG+L EAD 
Subjt:  DSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADE

Query:  IRRKMESIEVKGNPNISYI
        +RRKME+I V+  P  S+I
Subjt:  IRRKMESIEVKGNPNISYI

SwissProt top hitse value%identityAlignment
O23169 Pentatricopeptide repeat-containing protein At4g371705.5e-3733.2Show/hide
Query:  PDEVSFSSALTLNACANASNLDFGKQVHGLSLKL---VIFCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHAN----------RV
        P+E +F+    LNACA+ +  + GKQVHG   ++        +SSLV MY KC ++  A  +      P++V WT++I  C Q+G  +          + 
Subjt:  PDEVSFSSALTLNACANASNLDFGKQVHGLSLKL---VIFCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHAN----------RV

Query:  GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKF
        G KPD++TFV++LS C+H G VE+G  +F S+ + H ++H  +HY C++DLL R G   Q + +                LG C ++ N+++++E A + 
Subjt:  GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKF

Query:  FDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISY
        F +EP+NP  YV + NI    G   E  ++R++M+ I V   P  S+
Subjt:  FDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISY

Q5G1T1 Pentatricopeptide repeat-containing protein At3g49170, chloroplastic1.5e-3428.5Show/hide
Query:  ARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARS-NRPFHTLSFFNHIRRSSRIYPNHYTLSAV------------------------LSTFSTH
        A+     S+D    +F     HS  ++SWT+LI  + ++ N     ++ F+ +     + PNH+T S+                         L++ S+ 
Subjt:  ARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARS-NRPFHTLSFFNHIRRSSRIYPNHYTLSAV------------------------LSTFSTH

Query:  TTTMASLMLSYN--------------KGYTSANLF-------------FRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIFC---
          ++ S+ +  +              K   S N F             F++L  +        +F+ A  L+  AN  ++  G+Q+H   +KL + C   
Subjt:  TTTMASLMLSYN--------------KGYTSANLF-------------FRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIFC---

Query:  VASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHR
        V ++L+ MY+KC S+  A ++F    + NV+ WT++I    +HG A RV          GVKP+ +T+V+ILS CSH G V EG+ +F+SM + H I  +
Subjt:  VASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHR

Query:  YEHYACIIDLLGRVG-----------HACQNRCL----GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKG
         EHYAC++DLL R G              Q   L     LGAC+ H+N E+ K  A K  +L+P+ P  Y+ L NI    G   E+ E+RRKM+   +  
Subjt:  YEHYACIIDLLGRVG-----------HACQNRCL----GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKG

Query:  NPNISYI
            S+I
Subjt:  NPNISYI

Q9CA56 Pentatricopeptide repeat-containing protein At1g74600, chloroplastic1.5e-3435.63Show/hide
Query:  LFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIF---CVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHA
        L FR +++   T+ D  + SS L   A ++ S+L  G QVH    K+ +     V SSL+ MY+K  S+ D  + F + N P+++ WT +IA+  QHG A
Subjt:  LFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIF---CVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHA

Query:  NRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSH
        N            G KPD +TFV +LS CSH G VEE +F+ +SMVK +GI     HY C++D LGR G   +                    L ACK H
Subjt:  NRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSH

Query:  NNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI
          +E+ K  A K  +LEP +   Y+ L NIL   G  +E +E R+ M+   V+  P  S +
Subjt:  NNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI

Q9LNU6 Pentatricopeptide repeat-containing protein At1g202301.8e-3528.33Show/hide
Query:  KSASLDQTLLLFSSASHHSK--NIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLSTFSTHTTTMASLMLSYNKGYTSANLFFRMLLL
        ++  +D+ L +F      +   N++SWTS+I   A++ +    L  F  ++ +  + PNH T+ ++L                                 
Subjt:  KSASLDQTLLLFSSASHHSK--NIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLSTFSTHTTTMASLMLSYNKGYTSANLFFRMLLL

Query:  VPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIF---CVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHAN-------
                          AC N + L  G+  HG ++++ +     V S+L+ MYAKC  +  +  +F      N+V W +++     HG A        
Subjt:  VPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIF---CVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHAN-------

Query:  ---RVGVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQ------------NRCLG---LGACKSHNNLEMSKE
           R  +KPD+I+F S+LS C   G  +EG+ YF  M + +GI  R EHY+C+++LLGR G   +            + C+    L +C+  NN+++++ 
Subjt:  ---RVGVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQ------------NRCLG---LGACKSHNNLEMSKE

Query:  VALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI
         A K F LEP+NP  YVLL NI    G+  E D IR KMES+ +K NP  S+I
Subjt:  VALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI

Q9STE1 Pentatricopeptide repeat-containing protein At4g213009.3e-3734.02Show/hide
Query:  DEVSFSSALTLNACANASNLDFGKQVHGLSLKLVI---FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGH-----------ANRV
        D VS S+A  L+ACAN  +  FGK +HG  +K  +       S+L+ MYAKC +L  A  +F    + N+V W +IIAAC  HG              + 
Subjt:  DEVSFSSALTLNACANASNLDFGKQVHGLSLKLVI---FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGH-----------ANRV

Query:  GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNR---------------CLGLGACKSHNNLEMSKEVALKF
        G++PD ITF+ I+S+C H G V+EG  +F SM + +GI  + EHYAC++DL GR G   +                    LGAC+ H N+E+++  + K 
Subjt:  GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNR---------------CLGLGACKSHNNLEMSKEVALKF

Query:  FDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYICSRYHRTPDLIFGAIEYTTAIGIWS-AGCVLDEL-LVGQLENP
         DL+P N   YVL+ N            ++R  M+  EV+  P  S+I     RT   + G + +  +  I+S    +L EL L G +  P
Subjt:  FDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYICSRYHRTPDLIFGAIEYTTAIGIWS-AGCVLDEL-LVGQLENP

Arabidopsis top hitse value%identityAlignment
AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-3628.33Show/hide
Query:  KSASLDQTLLLFSSASHHSK--NIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLSTFSTHTTTMASLMLSYNKGYTSANLFFRMLLL
        ++  +D+ L +F      +   N++SWTS+I   A++ +    L  F  ++ +  + PNH T+ ++L                                 
Subjt:  KSASLDQTLLLFSSASHHSK--NIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLSTFSTHTTTMASLMLSYNKGYTSANLFFRMLLL

Query:  VPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIF---CVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHAN-------
                          AC N + L  G+  HG ++++ +     V S+L+ MYAKC  +  +  +F      N+V W +++     HG A        
Subjt:  VPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIF---CVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHAN-------

Query:  ---RVGVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQ------------NRCLG---LGACKSHNNLEMSKE
           R  +KPD+I+F S+LS C   G  +EG+ YF  M + +GI  R EHY+C+++LLGR G   +            + C+    L +C+  NN+++++ 
Subjt:  ---RVGVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQ------------NRCLG---LGACKSHNNLEMSKE

Query:  VALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI
         A K F LEP+NP  YVLL NI    G+  E D IR KMES+ +K NP  S+I
Subjt:  VALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI

AT1G74600.1 pentatricopeptide (PPR) repeat-containing protein1.1e-3535.63Show/hide
Query:  LFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIF---CVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHA
        L FR +++   T+ D  + SS L   A ++ S+L  G QVH    K+ +     V SSL+ MY+K  S+ D  + F + N P+++ WT +IA+  QHG A
Subjt:  LFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIF---CVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHA

Query:  NRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSH
        N            G KPD +TFV +LS CSH G VEE +F+ +SMVK +GI     HY C++D LGR G   +                    L ACK H
Subjt:  NRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSH

Query:  NNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI
          +E+ K  A K  +LEP +   Y+ L NIL   G  +E +E R+ M+   V+  P  S +
Subjt:  NNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYI

AT3G49170.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-3528.5Show/hide
Query:  ARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARS-NRPFHTLSFFNHIRRSSRIYPNHYTLSAV------------------------LSTFSTH
        A+     S+D    +F     HS  ++SWT+LI  + ++ N     ++ F+ +     + PNH+T S+                         L++ S+ 
Subjt:  ARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARS-NRPFHTLSFFNHIRRSSRIYPNHYTLSAV------------------------LSTFSTH

Query:  TTTMASLMLSYN--------------KGYTSANLF-------------FRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIFC---
          ++ S+ +  +              K   S N F             F++L  +        +F+ A  L+  AN  ++  G+Q+H   +KL + C   
Subjt:  TTTMASLMLSYN--------------KGYTSANLF-------------FRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIFC---

Query:  VASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHR
        V ++L+ MY+KC S+  A ++F    + NV+ WT++I    +HG A RV          GVKP+ +T+V+ILS CSH G V EG+ +F+SM + H I  +
Subjt:  VASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHANRV----------GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHR

Query:  YEHYACIIDLLGRVG-----------HACQNRCL----GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKG
         EHYAC++DLL R G              Q   L     LGAC+ H+N E+ K  A K  +L+P+ P  Y+ L NI    G   E+ E+RRKM+   +  
Subjt:  YEHYACIIDLLGRVG-----------HACQNRCL----GLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKG

Query:  NPNISYI
            S+I
Subjt:  NPNISYI

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.6e-3834.02Show/hide
Query:  DEVSFSSALTLNACANASNLDFGKQVHGLSLKLVI---FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGH-----------ANRV
        D VS S+A  L+ACAN  +  FGK +HG  +K  +       S+L+ MYAKC +L  A  +F    + N+V W +IIAAC  HG              + 
Subjt:  DEVSFSSALTLNACANASNLDFGKQVHGLSLKLVI---FCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGH-----------ANRV

Query:  GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNR---------------CLGLGACKSHNNLEMSKEVALKF
        G++PD ITF+ I+S+C H G V+EG  +F SM + +GI  + EHYAC++DL GR G   +                    LGAC+ H N+E+++  + K 
Subjt:  GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNR---------------CLGLGACKSHNNLEMSKEVALKF

Query:  FDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYICSRYHRTPDLIFGAIEYTTAIGIWS-AGCVLDEL-LVGQLENP
         DL+P N   YVL+ N            ++R  M+  EV+  P  S+I     RT   + G + +  +  I+S    +L EL L G +  P
Subjt:  FDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISYICSRYHRTPDLIFGAIEYTTAIGIWS-AGCVLDEL-LVGQLENP

AT4G37170.1 Pentatricopeptide repeat (PPR) superfamily protein3.9e-3833.2Show/hide
Query:  PDEVSFSSALTLNACANASNLDFGKQVHGLSLKL---VIFCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHAN----------RV
        P+E +F+    LNACA+ +  + GKQVHG   ++        +SSLV MY KC ++  A  +      P++V WT++I  C Q+G  +          + 
Subjt:  PDEVSFSSALTLNACANASNLDFGKQVHGLSLKL---VIFCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIAACQQHGHAN----------RV

Query:  GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKF
        G KPD++TFV++LS C+H G VE+G  +F S+ + H ++H  +HY C++DLL R G   Q + +                LG C ++ N+++++E A + 
Subjt:  GVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCL---------------GLGACKSHNNLEMSKEVALKF

Query:  FDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISY
        F +EP+NP  YV + NI    G   E  ++R++M+ I V   P  S+
Subjt:  FDLEPDNPRNYVLLCNILTRNGLLNEADEIRRKMESIEVKGNPNISY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACTTATGTTCCTTGTAAATTTGAAGCCTCCATTTTCTTGTAAATTCAAATTACATTACCACCACCAAACTCCCACCTGTTTTAAGTTTAAGAGGCTTCTGAATGC
TGCCCGCACCCCCAAATCTGCCTCTCTCGACCAGACCTTGCTGCTTTTCTCCTCTGCCTCACACCACTCCAAGAATATCATCTCCTGGACTTCTCTTATTATGCTATTTG
CTCGCTCTAACAGACCCTTTCACACCTTGTCTTTCTTCAACCATATAAGGAGGTCTTCTAGGATTTATCCCAACCATTACACCTTATCTGCTGTCTTATCTACTTTTAGT
ACTCACACTACTACTATGGCTTCTCTGATGCTCTCATATAACAAAGGTTACACTTCAGCCAATCTCTTCTTTAGGATGCTTTTACTCGTCCCTCTAACTGTTCCTGATGA
GGTAAGCTTTTCCAGTGCCTTGACCTTGAATGCTTGCGCCAATGCTAGTAACCTGGATTTTGGGAAACAAGTTCATGGACTTTCTCTCAAGCTTGTTATTTTTTGTGTTG
CAAGTTCTTTGGTTATAATGTATGCAAAATGCGACAGCTTGGTAGATGCTTTTCAAATATTTGGTGAGAGCAATGACCCTAACGTGGTTTTTTGGACAACAATAATTGCA
GCTTGTCAACAACATGGCCACGCTAACCGGGTCGGGGTTAAACCTGACTACATAACTTTTGTTTCTATTCTTTCTACATGCAGCCACACTGGTAGAGTTGAAGAAGGATT
CTTCTACTTTGATTCAATGGTTAAAGTGCATGGTATTAATCATAGATATGAACATTATGCATGCATAATTGATTTGCTTGGTCGTGTTGGCCATGCCTGTCAAAACAGAT
GCCTCGGTTTGGGTGCATGTAAGAGTCATAATAACCTTGAAATGAGTAAAGAAGTGGCTCTAAAATTTTTTGATTTGGAACCAGATAATCCTAGAAATTATGTGCTGCTT
TGTAACATCTTGACACGTAATGGGTTATTAAATGAGGCTGATGAGATTAGAAGAAAGATGGAATCCATCGAGGTAAAAGGGAATCCAAATATATCCTACATCTGTTCAAG
ATACCATAGAACTCCTGATCTCATTTTTGGAGCAATAGAGTATACCACCGCAATTGGCATATGGTCTGCTGGTTGTGTTCTTGATGAGCTATTGGTCGGTCAGCTAGAAA
ATCCAAAGACTCGTCAAGATTTTAATGACCGAGCATTATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAACTTATGTTCCTTGTAAATTTGAAGCCTCCATTTTCTTGTAAATTCAAATTACATTACCACCACCAAACTCCCACCTGTTTTAAGTTTAAGAGGCTTCTGAATGC
TGCCCGCACCCCCAAATCTGCCTCTCTCGACCAGACCTTGCTGCTTTTCTCCTCTGCCTCACACCACTCCAAGAATATCATCTCCTGGACTTCTCTTATTATGCTATTTG
CTCGCTCTAACAGACCCTTTCACACCTTGTCTTTCTTCAACCATATAAGGAGGTCTTCTAGGATTTATCCCAACCATTACACCTTATCTGCTGTCTTATCTACTTTTAGT
ACTCACACTACTACTATGGCTTCTCTGATGCTCTCATATAACAAAGGTTACACTTCAGCCAATCTCTTCTTTAGGATGCTTTTACTCGTCCCTCTAACTGTTCCTGATGA
GGTAAGCTTTTCCAGTGCCTTGACCTTGAATGCTTGCGCCAATGCTAGTAACCTGGATTTTGGGAAACAAGTTCATGGACTTTCTCTCAAGCTTGTTATTTTTTGTGTTG
CAAGTTCTTTGGTTATAATGTATGCAAAATGCGACAGCTTGGTAGATGCTTTTCAAATATTTGGTGAGAGCAATGACCCTAACGTGGTTTTTTGGACAACAATAATTGCA
GCTTGTCAACAACATGGCCACGCTAACCGGGTCGGGGTTAAACCTGACTACATAACTTTTGTTTCTATTCTTTCTACATGCAGCCACACTGGTAGAGTTGAAGAAGGATT
CTTCTACTTTGATTCAATGGTTAAAGTGCATGGTATTAATCATAGATATGAACATTATGCATGCATAATTGATTTGCTTGGTCGTGTTGGCCATGCCTGTCAAAACAGAT
GCCTCGGTTTGGGTGCATGTAAGAGTCATAATAACCTTGAAATGAGTAAAGAAGTGGCTCTAAAATTTTTTGATTTGGAACCAGATAATCCTAGAAATTATGTGCTGCTT
TGTAACATCTTGACACGTAATGGGTTATTAAATGAGGCTGATGAGATTAGAAGAAAGATGGAATCCATCGAGGTAAAAGGGAATCCAAATATATCCTACATCTGTTCAAG
ATACCATAGAACTCCTGATCTCATTTTTGGAGCAATAGAGTATACCACCGCAATTGGCATATGGTCTGCTGGTTGTGTTCTTGATGAGCTATTGGTCGGTCAGCTAGAAA
ATCCAAAGACTCGTCAAGATTTTAATGACCGAGCATTATAA
Protein sequenceShow/hide protein sequence
MKLMFLVNLKPPFSCKFKLHYHHQTPTCFKFKRLLNAARTPKSASLDQTLLLFSSASHHSKNIISWTSLIMLFARSNRPFHTLSFFNHIRRSSRIYPNHYTLSAVLSTFS
THTTTMASLMLSYNKGYTSANLFFRMLLLVPLTVPDEVSFSSALTLNACANASNLDFGKQVHGLSLKLVIFCVASSLVIMYAKCDSLVDAFQIFGESNDPNVVFWTTIIA
ACQQHGHANRVGVKPDYITFVSILSTCSHTGRVEEGFFYFDSMVKVHGINHRYEHYACIIDLLGRVGHACQNRCLGLGACKSHNNLEMSKEVALKFFDLEPDNPRNYVLL
CNILTRNGLLNEADEIRRKMESIEVKGNPNISYICSRYHRTPDLIFGAIEYTTAIGIWSAGCVLDELLVGQLENPKTRQDFNDRAL