; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G006470 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G006470
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionpentatricopeptide repeat-containing protein At2g21090-like
Genome locationCmo_Chr15:3139226..3141784
RNA-Seq ExpressionCmoCh15G006470
SyntenyCmoCh15G006470
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578928.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.8e-19391.36Show/hide
Query:  HYRIKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKY
        H RIKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRP+KY
Subjt:  HYRIKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKY

Query:  TFSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN
        TFSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLT+NAIIVFIDIPVRGLISWNTMIMVLVNN
Subjt:  TFSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN

Query:  VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAI
        VYFEALRT NKLVREGVLPDRITLAG SLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFEAV IIKTPKCQ TST C SLLGVCAI
Subjt:  VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAI

Query:  HGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRSFAGANFRAMEDGVWKKAEAVWRDM
        HGD KVVERVAERV+KLELQSSLLYS+    ++ + +  +       VGYQLKAFPRSFAG NFRAMEDGVWKKAEAVWRDM
Subjt:  HGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRSFAGANFRAMEDGVWKKAEAVWRDM

KAG7016451.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]3.1e-19391.34Show/hide
Query:  YRIKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYT
        Y+IKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRP+KYT
Subjt:  YRIKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYT

Query:  FSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNV
        FSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLT+NAIIVFIDIPVRGLISWNTMIMVLVNNV
Subjt:  FSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNV

Query:  YFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAIH
        YFEALRT NKLVREGVLPDRITLAG SLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFEAV IIKTPKCQ TST C SLLGVCAIH
Subjt:  YFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAIH

Query:  GDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRSFAGANFRAMEDGVWKKAEAVWRDM
        GD KVVERVAERV+KLELQSSLLYS+    ++ + +  +       VGYQLKAFPRSFAG NFRAMEDGVWKKAEAVWRDM
Subjt:  GDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRSFAGANFRAMEDGVWKKAEAVWRDM

TYK23780.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]2.6e-7837.22Show/hide
Query:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP-VSYACHGKQFLGSMVRSGVDVSSVVLRNSL
        CLKG+FRFGDV   RH   VMPE+DI+SWN+ ISG VSSGFA +AM + LEM+NAGFRPS+YTFSILL  VS A HGKQ  GSM+RSGVDVSS+VL NSL
Subjt:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP-VSYACHGKQFLGSMVRSGVDVSSVVLRNSL

Query:  IDM---------------------------------QPGLRVLALYQFSL--------------------------------------------------
        IDM                                 + G RVLAL QF L                                                  
Subjt:  IDM---------------------------------QPGLRVLALYQFSL--------------------------------------------------

Query:  ----------------------------------------------------------------------------VALSRSVHCI--------------
                                                                                    V L   +H +              
Subjt:  ----------------------------------------------------------------------------VALSRSVHCI--------------

Query:  ----NHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH
             +AK+G  ++A+ VFID+P R LISWNTMIM LV+N  YFEAL T NKLV EGVLPDRITLAGV LACS+AGFVEEG+  F +M YEHG VP NEH
Subjt:  ----NHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH

Query:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM------------------------------
         S VVNLLS AGKF EAV IIKT   Q TST   SLLGVCAIHGDLK++E+VAE V+KLE QSSL YS++                              
Subjt:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM------------------------------

Query:  --------------------------------------------------------------------------------------QLCSEVYCLFSLHL
                                                                                              +LC +V  LFSLHL
Subjt:  --------------------------------------------------------------------------------------QLCSEVYCLFSLHL

Query:  CGFAIVGYQLKAFPRSFAGANFRAMEDGVWKKAE
        CGFA+VGYQLKAF  +FAG N  AMED VWKK +
Subjt:  CGFAIVGYQLKAFPRSFAGANFRAMEDGVWKKAE

XP_011657317.1 pentatricopeptide repeat-containing protein At1g43980, mitochondrial [Cucumis sativus]1.7e-7443.1Show/hide
Query:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP-VSYACHGKQFLGSMVRSGVDVSSVVLRNSL
        CLKG+FRFGDV GARH   VMPE+DI+SWN+ ISG VSSGF N+AM V LEM+NAGFRPS+YTFSILL  VS A HGKQ  GSM+RSGVDVSS+VL NSL
Subjt:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP-VSYACHGKQFLGSMVRSGVDVSSVVLRNSL

Query:  IDM---------------------------------QPGLRVLALYQFSL--------------------------------------------------
        IDM                                 + G RVLAL QF L                                                  
Subjt:  IDM---------------------------------QPGLRVLALYQFSL--------------------------------------------------

Query:  ----------------------------------------------------------------------------VALSRSVHCI--------------
                                                                                    V L   +H +              
Subjt:  ----------------------------------------------------------------------------VALSRSVHCI--------------

Query:  ----NHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH
             +AK+G  ++A+ VFID+P R LISWNTMIM LV+N  YFEAL T NKLV EGVLPDRITLAGV LACS+AG VEEG+  F  M YEHG VPRNEH
Subjt:  ----NHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH

Query:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYC
         S VVNLLS AGKF EAV IIKT   Q TST   SLLGVCAIHGDLK++E+VAE ++KLE QSSL YS++     + C
Subjt:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYC

XP_022939150.1 pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita moschata]4.0e-180100Show/hide
Query:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALS
        MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALS
Subjt:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALS

Query:  RSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNE
        RSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNE
Subjt:  RSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNE

Query:  HCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRS
        HCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRS
Subjt:  HCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRS

Query:  FAGANFRAMEDGVWKKAEAVWRDM
        FAGANFRAMEDGVWKKAEAVWRDM
Subjt:  FAGANFRAMEDGVWKKAEAVWRDM

TrEMBL top hitse value%identityAlignment
A0A1S3B3Q8 pentatricopeptide repeat-containing protein At1g43980, mitochondrial isoform X19.2e-7442.68Show/hide
Query:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP-VSYACHGKQFLGSMVRSGVDVSSVVLRNSL
        CLKG+FRFGDV   RH   VMPE+DI+SWN+ ISG VSSGFA +AM + LEM+NAGFRPS+YTFSILL  VS A HGKQ  GSM+RSGVDVSS+VL NSL
Subjt:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP-VSYACHGKQFLGSMVRSGVDVSSVVLRNSL

Query:  IDM---------------------------------QPGLRVLALYQFSL--------------------------------------------------
        IDM                                 + G RVLAL QF L                                                  
Subjt:  IDM---------------------------------QPGLRVLALYQFSL--------------------------------------------------

Query:  ----------------------------------------------------------------------------VALSRSVHCI--------------
                                                                                    V L   +H +              
Subjt:  ----------------------------------------------------------------------------VALSRSVHCI--------------

Query:  ----NHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH
             +AK+G  ++A+ VFID+P R LISWNTMIM LV+N  YFEAL T NKLV EGVLPDRITLAGV LACS+AGFVEEG+  F +M YEHG VP NEH
Subjt:  ----NHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH

Query:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYC
         S VVNLLS AGKF EAV IIKT   Q TST   SLLGVCAIHGDLK++E+VAE V+KLE QSSL YS++     + C
Subjt:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYC

A0A5D3DK49 Pentatricopeptide repeat-containing protein1.2e-7837.22Show/hide
Query:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP-VSYACHGKQFLGSMVRSGVDVSSVVLRNSL
        CLKG+FRFGDV   RH   VMPE+DI+SWN+ ISG VSSGFA +AM + LEM+NAGFRPS+YTFSILL  VS A HGKQ  GSM+RSGVDVSS+VL NSL
Subjt:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP-VSYACHGKQFLGSMVRSGVDVSSVVLRNSL

Query:  IDM---------------------------------QPGLRVLALYQFSL--------------------------------------------------
        IDM                                 + G RVLAL QF L                                                  
Subjt:  IDM---------------------------------QPGLRVLALYQFSL--------------------------------------------------

Query:  ----------------------------------------------------------------------------VALSRSVHCI--------------
                                                                                    V L   +H +              
Subjt:  ----------------------------------------------------------------------------VALSRSVHCI--------------

Query:  ----NHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH
             +AK+G  ++A+ VFID+P R LISWNTMIM LV+N  YFEAL T NKLV EGVLPDRITLAGV LACS+AGFVEEG+  F +M YEHG VP NEH
Subjt:  ----NHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH

Query:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM------------------------------
         S VVNLLS AGKF EAV IIKT   Q TST   SLLGVCAIHGDLK++E+VAE V+KLE QSSL YS++                              
Subjt:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM------------------------------

Query:  --------------------------------------------------------------------------------------QLCSEVYCLFSLHL
                                                                                              +LC +V  LFSLHL
Subjt:  --------------------------------------------------------------------------------------QLCSEVYCLFSLHL

Query:  CGFAIVGYQLKAFPRSFAGANFRAMEDGVWKKAE
        CGFA+VGYQLKAF  +FAG N  AMED VWKK +
Subjt:  CGFAIVGYQLKAFPRSFAGANFRAMEDGVWKKAE

A0A6J1FLV2 pentatricopeptide repeat-containing protein At2g21090-like1.9e-180100Show/hide
Query:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALS
        MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALS
Subjt:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALS

Query:  RSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNE
        RSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNE
Subjt:  RSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNE

Query:  HCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRS
        HCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRS
Subjt:  HCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRS

Query:  FAGANFRAMEDGVWKKAEAVWRDM
        FAGANFRAMEDGVWKKAEAVWRDM
Subjt:  FAGANFRAMEDGVWKKAEAVWRDM

A0A6J1GNQ4 pentatricopeptide repeat-containing protein At1g43980, mitochondrial1.8e-6941.7Show/hide
Query:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPV-SYACHGKQFLGSMVRSGVDVSSVVLRNSL
        CLKG+FRFG V GAR+    MPE+DI+SWN  +SG VSSGFAN AMDVFLEM++AGFRPS+YTFSILL V S   HGKQ  GSM+RSGVDVS+VVL NSL
Subjt:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPV-SYACHGKQFLGSMVRSGVDVSSVVLRNSL

Query:  IDM---------------------------------QPGLRVLALYQFSL--------------------------------------------------
        IDM                                 + G  VLAL QF L                                                  
Subjt:  IDM---------------------------------QPGLRVLALYQFSL--------------------------------------------------

Query:  -------------------------------------------------------------VALSRSVHCIN----------------------------
                                                                     + LS  +  I+                            
Subjt:  -------------------------------------------------------------VALSRSVHCIN----------------------------

Query:  -----HAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH
             +AK+GL +NA+ VF D+P R LISWNTMIM LVNN  YFEAL TL  LVREGV+ DRITLAGV LACSHAGFV+EG+  F +M  +HG VP NEH
Subjt:  -----HAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH

Query:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM
         + VV+LLS AGKF EAV II+T  CQ TST   SLL  CAIHGD+  +ERVAERV+KLE QSSL YS++
Subjt:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM

A0A6J1JL44 pentatricopeptide repeat-containing protein At1g43980, mitochondrial8.9e-6941.28Show/hide
Query:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP-VSYACHGKQFLGSMVRSGVDVSSVVLRNSL
        CLKG+FRFG V GAR+    MPE+DI+SWN  +SG VSSGFAN AMDVFLEM++AGFRPS+YTFSILL  VS   HGKQ  GSM+RSG+DVS+VVL NSL
Subjt:  CLKGIFRFGDVKGARH--QVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP-VSYACHGKQFLGSMVRSGVDVSSVVLRNSL

Query:  IDM---------------------------------QPGLRVLALYQFSL---------------------------------------------VALSR
        IDM                                 + G RVLAL QF L                                             + LS 
Subjt:  IDM---------------------------------QPGLRVLALYQFSL---------------------------------------------VALSR

Query:  SVHCIN----------------------------------------------------------------------------------------------
        ++   +                                                                                              
Subjt:  SVHCIN----------------------------------------------------------------------------------------------

Query:  -----HAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH
             +AK+GL +NA+ VF D+P R LISWNTMIM LVNN  YFEAL TL  LV+EGV+ DRITLAGV LAC HAGFV+EG+  F +M  EHG VP NEH
Subjt:  -----HAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNN-VYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEH

Query:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM
         + VV+LLS AGKF EAV II+   CQ TST   SLL  CAIHGDL V+ERVAERV+KLE Q SL YS++
Subjt:  CSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM

SwissProt top hitse value%identityAlignment
Q9FRI5 Pentatricopeptide repeat-containing protein At1g253601.2e-2530.94Show/hide
Query:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP----VSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSL
        M EK+I+SW   ISG   +GF    + +F  MK  GF P  Y FS  +     +   C+G+Q+   +++ G D SS+   N+LI M              
Subjt:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP----VSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSL

Query:  VALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVY-FEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGG
                   +AK G+ E A  VF  +P    +SWN +I  L  + +  EA+    +++++G+ PDRITL  V  ACSHAG V++G   F SM   +  
Subjt:  VALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVY-FEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGG

Query:  VPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKL
         P  +H + +++LL  +GKF +A  +I++   + T+ + ++LL  C +HG++++    A+++  L
Subjt:  VPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKL

Q9S7F4 Putative pentatricopeptide repeat-containing protein At2g015101.5e-2528.57Show/hide
Query:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYA----CHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSL
        +P++  +SW + ISG V  G     + +F +M+ +  R  + TF+ +L  S +      GKQ    ++RSG ++ +V   + L+DM              
Subjt:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYA----CHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSL

Query:  VALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFE-ALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGG
                   +AK G  ++A+ VF ++P R  +SWN +I    +N   E A+    K++  G+ PD +++ GV  ACSH GFVE+G   F +M+  +G 
Subjt:  VALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFE-ALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGG

Query:  VPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLE
         P+ +H + +++LL   G+F EA  ++     +    +  S+L  C IH +  + ER AE++  +E
Subjt:  VPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLE

Q9SI53 Pentatricopeptide repeat-containing protein At2g03880, mitochondrial5.3e-2630.51Show/hide
Query:  DIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYACHGKQFLGSMVRSGVDV----SSVVLRNSLIDMQPGLRVLALYQFSLVALS
        D I WNS I G   +  ++ A+++F  MK AGF   + T   L  V  AC G   L   +++ V +      ++L N+L+DM                  
Subjt:  DIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYACHGKQFLGSMVRSGVDV----SSVVLRNSLIDMQPGLRVLALYQFSLVALS

Query:  RSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYF-EALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRN
               + K G  E+A+ VF  +  R +I+W+TMI  L  N Y  EAL+   ++   G  P+ IT+ GV  ACSHAG +E+G   F SM   +G  P  
Subjt:  RSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYF-EALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRN

Query:  EHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM
        EH   +++LL  AGK  +AV ++   +C+  +   ++LLG C +  ++ + E  A++V+ L+ + +  Y+++
Subjt:  EHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM

Q9SKQ4 Pentatricopeptide repeat-containing protein At2g210901.5e-3331.18Show/hide
Query:  FIAHYRIKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQV--MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGF
        F+++  + CS+     +C    S +R  D     +  + I    + G  + GD++ A      MPEK+ +SW + I+G V  G  N A+D+F +M   G 
Subjt:  FIAHYRIKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQV--MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGF

Query:  RPSKYTFSILLPVSYAC----HGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVF-IDIPVRGLISW
        +P ++TFS  L  S +     HGK+  G M+R+ V  +++V+ +SLIDM                         ++K G  E +  VF I       + W
Subjt:  RPSKYTFSILLPVSYAC----HGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVF-IDIPVRGLISW

Query:  NTMIMVLV-NNVYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTS
        NTMI  L  + +  +ALR L+ +++  V P+R TL  +  ACSH+G VEEG+  F SM  +HG VP  EH + +++LL  AG F E +  I+    +   
Subjt:  NTMIMVLV-NNVYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTS

Query:  TLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM
         +  ++LGVC IHG+ ++ ++ A+ ++KL+ +SS  Y ++
Subjt:  TLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic4.0e-2631.03Show/hide
Query:  GDVKGAR--HQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSY-----ACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQ
        G+V+ AR     M E+DI+SWN  I G    GF N+A+ +F ++   G +P     +++  +S      A    +++   V+S     +V +   LIDM 
Subjt:  GDVKGAR--HQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSY-----ACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQ

Query:  PGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYF-EALRTLNKLVR-EGVLPDRITLAGVSLACSHAGFVEE
                                ++K G  E A++VF D P + +++WN MI     + Y  +ALR  N++    G+ P  IT  G   AC+HAG V E
Subjt:  PGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYF-EALRTLNKLVR-EGVLPDRITLAGVSLACSHAGFVEE

Query:  GIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFE-AVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM
        GI  F SM  E+G  P+ EH   +V+LL  AG+ + A   IK     + S L  S+LG C +HGD  + + +AE ++ L +++S +Y ++
Subjt:  GIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFE-AVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM

Arabidopsis top hitse value%identityAlignment
AT1G25360.1 Pentatricopeptide repeat (PPR) superfamily protein8.3e-2730.94Show/hide
Query:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP----VSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSL
        M EK+I+SW   ISG   +GF    + +F  MK  GF P  Y FS  +     +   C+G+Q+   +++ G D SS+   N+LI M              
Subjt:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLP----VSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSL

Query:  VALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVY-FEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGG
                   +AK G+ E A  VF  +P    +SWN +I  L  + +  EA+    +++++G+ PDRITL  V  ACSHAG V++G   F SM   +  
Subjt:  VALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVY-FEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGG

Query:  VPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKL
         P  +H + +++LL  +GKF +A  +I++   + T+ + ++LL  C +HG++++    A+++  L
Subjt:  VPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKL

AT2G03880.1 Pentatricopeptide repeat (PPR) superfamily protein3.7e-2730.51Show/hide
Query:  DIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYACHGKQFLGSMVRSGVDV----SSVVLRNSLIDMQPGLRVLALYQFSLVALS
        D I WNS I G   +  ++ A+++F  MK AGF   + T   L  V  AC G   L   +++ V +      ++L N+L+DM                  
Subjt:  DIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYACHGKQFLGSMVRSGVDV----SSVVLRNSLIDMQPGLRVLALYQFSLVALS

Query:  RSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYF-EALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRN
               + K G  E+A+ VF  +  R +I+W+TMI  L  N Y  EAL+   ++   G  P+ IT+ GV  ACSHAG +E+G   F SM   +G  P  
Subjt:  RSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYF-EALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRN

Query:  EHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM
        EH   +++LL  AGK  +AV ++   +C+  +   ++LLG C +  ++ + E  A++V+ L+ + +  Y+++
Subjt:  EHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM

AT2G21090.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.1e-3431.18Show/hide
Query:  FIAHYRIKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQV--MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGF
        F+++  + CS+     +C    S +R  D     +  + I    + G  + GD++ A      MPEK+ +SW + I+G V  G  N A+D+F +M   G 
Subjt:  FIAHYRIKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQV--MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGF

Query:  RPSKYTFSILLPVSYAC----HGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVF-IDIPVRGLISW
        +P ++TFS  L  S +     HGK+  G M+R+ V  +++V+ +SLIDM                         ++K G  E +  VF I       + W
Subjt:  RPSKYTFSILLPVSYAC----HGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVF-IDIPVRGLISW

Query:  NTMIMVLV-NNVYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTS
        NTMI  L  + +  +ALR L+ +++  V P+R TL  +  ACSH+G VEEG+  F SM  +HG VP  EH + +++LL  AG F E +  I+    +   
Subjt:  NTMIMVLV-NNVYFEALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTS

Query:  TLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM
         +  ++LGVC IHG+ ++ ++ A+ ++KL+ +SS  Y ++
Subjt:  TLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM

AT3G02010.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-2628.57Show/hide
Query:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYA----CHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSL
        +P++  +SW + ISG V  G     + +F +M+ +  R  + TF+ +L  S +      GKQ    ++RSG ++ +V   + L+DM              
Subjt:  MPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSYA----CHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSL

Query:  VALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFE-ALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGG
                   +AK G  ++A+ VF ++P R  +SWN +I    +N   E A+    K++  G+ PD +++ GV  ACSH GFVE+G   F +M+  +G 
Subjt:  VALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFE-ALRTLNKLVREGVLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGG

Query:  VPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLE
         P+ +H + +++LL   G+F EA  ++     +    +  S+L  C IH +  + ER AE++  +E
Subjt:  VPRNEHCSFVVNLLSWAGKF-EAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLE

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.9e-2731.03Show/hide
Query:  GDVKGAR--HQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSY-----ACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQ
        G+V+ AR     M E+DI+SWN  I G    GF N+A+ +F ++   G +P     +++  +S      A    +++   V+S     +V +   LIDM 
Subjt:  GDVKGAR--HQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILLPVSY-----ACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQ

Query:  PGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYF-EALRTLNKLVR-EGVLPDRITLAGVSLACSHAGFVEE
                                ++K G  E A++VF D P + +++WN MI     + Y  +ALR  N++    G+ P  IT  G   AC+HAG V E
Subjt:  PGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYF-EALRTLNKLVR-EGVLPDRITLAGVSLACSHAGFVEE

Query:  GIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFE-AVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM
        GI  F SM  E+G  P+ EH   +V+LL  AG+ + A   IK     + S L  S+LG C +HGD  + + +AE ++ L +++S +Y ++
Subjt:  GIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFE-AVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYSIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTATTGCTCATTACAGGATCAAGTGTTCGCTCCAGCCTATTGAAAAGCAATGCATGTTCAAGAGCTCTGTGAGAAGAATCCAGGATCTCAATCTACCCAACATGTT
GACTGTCACAATCCCCATCAAATGCTTGAAAGGTATATTTAGATTTGGTGATGTCAAAGGTGCACGCCATCAGGTAATGCCTGAGAAGGACATTATTTCTTGGAACTCTA
CGATTTCTGGTTCTGTTTCATCTGGATTTGCTAATAATGCTATGGACGTTTTTCTGGAAATGAAAAACGCTGGTTTTAGACCAAGTAAATATACCTTCTCCATTTTGCTT
CCAGTGTCGTATGCTTGTCATGGTAAGCAATTTCTTGGCAGTATGGTTCGAAGTGGTGTGGATGTGTCAAGTGTGGTGCTCAGAAATTCATTGATTGATATGCAACCAGG
CTTAAGAGTATTGGCACTATATCAGTTCTCTCTAGTAGCACTCTCCCGATCAGTTCACTGTATTAATCATGCTAAACTTGGATTAACTGAGAATGCCATTATAGTCTTCA
TAGATATACCTGTTAGAGGTTTGATATCATGGAACACTATGATTATGGTTCTGGTTAACAATGTGTACTTTGAAGCCTTACGCACTTTGAATAAGTTGGTCAGGGAAGGT
GTACTGCCAGATAGGATAACTCTAGCTGGAGTCTCATTAGCTTGCAGTCATGCTGGTTTTGTTGAGGAAGGGATAGCCACCTTCTTTTCAATGGCATATGAACATGGAGG
CGTGCCGAGGAATGAACATTGTTCTTTTGTAGTGAACTTGCTGAGTTGGGCTGGTAAATTTGAAGCAGTTATTATTATTAAAACACCAAAATGCCAATCTACTTCTACAC
TTTGCAAGTCACTTCTTGGTGTCTGTGCAATTCATGGAGACCTAAAAGTTGTTGAAAGAGTTGCAGAGAGGGTGGTGAAGCTGGAACTGCAATCATCCTTACTGTATTCG
ATTATGCAGCTTTGTTCTGAAGTTTACTGTCTATTCTCTCTGCATTTGTGTGGGTTTGCTATAGTGGGATACCAATTGAAAGCATTCCCAAGGTCGTTTGCTGGAGCTAA
CTTTCGTGCAATGGAGGATGGAGTTTGGAAGAAAGCTGAAGCAGTGTGGAGAGATATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTATTGCTCATTACAGGATCAAGTGTTCGCTCCAGCCTATTGAAAAGCAATGCATGTTCAAGAGCTCTGTGAGAAGAATCCAGGATCTCAATCTACCCAACATGTT
GACTGTCACAATCCCCATCAAATGCTTGAAAGGTATATTTAGATTTGGTGATGTCAAAGGTGCACGCCATCAGGTAATGCCTGAGAAGGACATTATTTCTTGGAACTCTA
CGATTTCTGGTTCTGTTTCATCTGGATTTGCTAATAATGCTATGGACGTTTTTCTGGAAATGAAAAACGCTGGTTTTAGACCAAGTAAATATACCTTCTCCATTTTGCTT
CCAGTGTCGTATGCTTGTCATGGTAAGCAATTTCTTGGCAGTATGGTTCGAAGTGGTGTGGATGTGTCAAGTGTGGTGCTCAGAAATTCATTGATTGATATGCAACCAGG
CTTAAGAGTATTGGCACTATATCAGTTCTCTCTAGTAGCACTCTCCCGATCAGTTCACTGTATTAATCATGCTAAACTTGGATTAACTGAGAATGCCATTATAGTCTTCA
TAGATATACCTGTTAGAGGTTTGATATCATGGAACACTATGATTATGGTTCTGGTTAACAATGTGTACTTTGAAGCCTTACGCACTTTGAATAAGTTGGTCAGGGAAGGT
GTACTGCCAGATAGGATAACTCTAGCTGGAGTCTCATTAGCTTGCAGTCATGCTGGTTTTGTTGAGGAAGGGATAGCCACCTTCTTTTCAATGGCATATGAACATGGAGG
CGTGCCGAGGAATGAACATTGTTCTTTTGTAGTGAACTTGCTGAGTTGGGCTGGTAAATTTGAAGCAGTTATTATTATTAAAACACCAAAATGCCAATCTACTTCTACAC
TTTGCAAGTCACTTCTTGGTGTCTGTGCAATTCATGGAGACCTAAAAGTTGTTGAAAGAGTTGCAGAGAGGGTGGTGAAGCTGGAACTGCAATCATCCTTACTGTATTCG
ATTATGCAGCTTTGTTCTGAAGTTTACTGTCTATTCTCTCTGCATTTGTGTGGGTTTGCTATAGTGGGATACCAATTGAAAGCATTCCCAAGGTCGTTTGCTGGAGCTAA
CTTTCGTGCAATGGAGGATGGAGTTTGGAAGAAAGCTGAAGCAGTGTGGAGAGATATGTGA
Protein sequenceShow/hide protein sequence
MFIAHYRIKCSLQPIEKQCMFKSSVRRIQDLNLPNMLTVTIPIKCLKGIFRFGDVKGARHQVMPEKDIISWNSTISGSVSSGFANNAMDVFLEMKNAGFRPSKYTFSILL
PVSYACHGKQFLGSMVRSGVDVSSVVLRNSLIDMQPGLRVLALYQFSLVALSRSVHCINHAKLGLTENAIIVFIDIPVRGLISWNTMIMVLVNNVYFEALRTLNKLVREG
VLPDRITLAGVSLACSHAGFVEEGIATFFSMAYEHGGVPRNEHCSFVVNLLSWAGKFEAVIIIKTPKCQSTSTLCKSLLGVCAIHGDLKVVERVAERVVKLELQSSLLYS
IMQLCSEVYCLFSLHLCGFAIVGYQLKAFPRSFAGANFRAMEDGVWKKAEAVWRDM