; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0007893 (gene) of Chayote v1 genome

Gene IDSed0007893
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein DOT4
Genome locationLG09:3378076..3382893
RNA-Seq ExpressionSed0007893
SyntenySed0007893
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7023956.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]9.0e-17558.05Show/hide
Query:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV
        LG LENA K+FD+  QPK VL N MVNGYLQN +YNE+IELFK+MGR   E DSY CNFALKAC FL+  + G                    SILNF+V
Subjt:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV

Query:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCI--YWLCY----------------
        + GDIM A+ FF +MVEKD VCWNVMIG FMQEGLFSEG+K+F+ ML N I PS + MTSL+QSCG +R LEFGKCI  Y L +                
Subjt:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCI--YWLCY----------------

Query:  ----------WI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV
                  WI           W+                        ++GGFDS T VSL+Q+CS TADL+ GK++H C+YRR L +    S  ++ +
Subjt:  ----------WI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV

Query:  ------LPVHYSVV-RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID
              L   YSV  RMK KN++SWTAMLVGLAQNG AR A +LF+QMQNER+  NALTLVSLVHCC LL SL EG  VHA+LI   FAS+ V  T LID
Subjt:  ------LPVHYSVV-RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID

Query:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLG------------RLQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACF
        MYAKCS+I S EKV  +GFTPK+ ILYNS+ISGYG HGLG             LQPNESTFVSLLSACSHSGLVE GISLF +++  H++TPT+KL ACF
Subjt:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLG------------RLQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACF

Query:  VDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG
        VDLLSRAGRL+QAE  INQM F+P SGILETLLNGCL+H DIELGVKIADRLLS  SRNPS+YV+LSNIY EA +WD+V Y+RGLMTEQELKKI G
Subjt:  VDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG

XP_008450740.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucumis melo]3.4e-17458.12Show/hide
Query:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV
        LG LE A K+FDE  QPK VL N MVNGYLQN ++N+ IEL +MM R  LE DSY CNFALKACTFL+  + G                    SILNF+V
Subjt:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV

Query:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWL--------------------
        +TGDIMCAQ FF QM EKD VCWNVMIG FMQEGLF EG+ +F  ML N I PS + M SLIQSCG  R L+FGKC++                      
Subjt:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWL--------------------

Query:  --------CYWI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV
                  WI           W+                        DD GFDSGT VSLIQ+CS TADL+ GK+LH CIYRRGL +    S  ++ +
Subjt:  --------CYWI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV

Query:  LPVHYSVV-------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID
             S+        R+KNKN+ISWTAMLVGLAQNGHAR A +LF+QMQNER+  N LTLVSLV+CC LL  L EG  VHA L    FASE VV T LID
Subjt:  LPVHYSVV-------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID

Query:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCAC
        MYAKCSKI SAE V KYG TPK+ ILYNS+ISGYG HGLG              LQPNESTFVSLLSACSHSGLVE GI+LF N+  DHN TPT+KL AC
Subjt:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCAC

Query:  FVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG
         VDLLSRAGRLQQAE  INQM F P SGILETLLNGCL+H DIELGVK+ADRLLSL SRNPSIY+TLSNIY +A +WDSVK++RGLM EQE+KKI G
Subjt:  FVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG

XP_011659934.2 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus]2.4e-17558.63Show/hide
Query:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV
        LG LENA K+FDE  QPK VL N MVNGYLQN  YN+ IEL KMM R  LE DSY CNFALKAC FL+  + G                    SILNF+V
Subjt:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV

Query:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWL--------------------
        +TGDIMCAQ FF QMVEKD VCWNVMIG FMQEGLF EG+ +F+ ML N I PS + M SLIQSCG +R L FGKC++                      
Subjt:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWL--------------------

Query:  --------CYWI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV
                  WI           W+                        DD GFDSGT VSLIQ+CS TADL+ GK+LH  IYRRGL +       ++ +
Subjt:  --------CYWI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV

Query:  LPVHYSVV-------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID
             S+        RMKNKN+ISWTAMLVGLAQNGHAR A +LF+QMQNER+  NALTLVSLV+CC LL  L EG  VHA L    FASE VV T LID
Subjt:  LPVHYSVV-------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID

Query:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCAC
        MYAKCSKI SAE V KYG TPK+ ILYNS+ISGYG HGLG              LQPNESTFVSLLSACSHSGLVE GI+LF N+  DHN TPT+KL AC
Subjt:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCAC

Query:  FVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG
         VDLLSRAGRL+QAE  INQM F P SGILETLLNGCL+H DIELGVK+ADRLLSL SRNPSIY+TLSNIY +A +WDSVKY+RGLM EQE+KKI G
Subjt:  FVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG

XP_022961380.1 pentatricopeptide repeat-containing protein At5g39350-like [Cucurbita moschata]1.5e-17457.72Show/hide
Query:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV
        LG LENA K+FD+  QPK +L N MVNGYLQN +YNE+IELFK+MGR   E DSY CNFALKAC FL+  + G                    SILNF+V
Subjt:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV

Query:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCI--YWLCY----------------
        + GDIM A+ FF +MVEKD VCWNVMIG FMQEGLFSEG+K+F+ ML N I PS + MTSL+QSCG +R LEFGKCI  Y L +                
Subjt:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCI--YWLCY----------------

Query:  ----------WI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV
                  WI           W+                        ++GGFDS T VSL+Q+CS TADL+ GK++H C+YRR L +    S  ++ +
Subjt:  ----------WI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV

Query:  ------LPVHYSVV-RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID
              L   YSV  RMK KN++SWTAMLVGLAQNG AR A +LF+QMQNER+  NALTLVSLVHCC LL SL EG  VHA+LI   FAS+ V  T LID
Subjt:  ------LPVHYSVV-RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID

Query:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLG------------RLQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACF
        MYAKCS+I S EKV  +GFTPK+ ILYNS+ISGYG HGLG             LQPNESTFVSLLSACSHSGLVE GISLF +++  H++TPT+KL ACF
Subjt:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLG------------RLQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACF

Query:  VDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG
        VDLLSRAGRL+QAE  INQM F+P SG+LETLLNGCL+H DIELGVKIADRLLS  SRNPS+YV+LSNIY EA +WD+V Y+RGLMTEQELKKI G
Subjt:  VDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG

XP_038877556.1 pentatricopeptide repeat-containing protein At5g39350-like [Benincasa hispida]2.3e-17859.13Show/hide
Query:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWI------------------SILNFMV
        LG LE A KLFD+  QPK VL N MVNGYLQN +YNES+EL KMM R  LE DSY CNFALKAC FLM  + G                    SILNF+V
Subjt:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWI------------------SILNFMV

Query:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCY------------------
        +TGDIMCAQ FF QMVEKD VCWNVMIG  +QEGLF+EG+ +F+ ML N I PS + MTSLIQSCG +R L+FGKC++   +                  
Subjt:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCY------------------

Query:  ----------WI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV
                  WI           W+                        +DGGFDSGT VSLIQ+CS TADL+ GK++H CIYRRGL +    S  ++ +
Subjt:  ----------WI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV

Query:  LPVHYSVV-------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID
             SV        RMKNKN+ISWTAMLVGLAQNGHAR A +LF  MQNE++  NALTLVSLVHCC LL SL EG  +HA+L    FA E VV T LID
Subjt:  LPVHYSVV-------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID

Query:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCAC
        MYAKCSKI SAEKV KYGFTPK+ ILYN++ISGYG HGLGR             LQ NESTFVSLLSACSHSGL E GISLF N++ DHNITPT+KL AC
Subjt:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCAC

Query:  FVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG
         VDLL RAGRL+QAE  INQM F P SGILETLL+GCL+H DIELGVKIADRLLSL SRNPS Y+TLSNIY EA +WDSVKY+RGLMTEQELKKI G
Subjt:  FVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG

TrEMBL top hitse value%identityAlignment
A0A1S3BQY0 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like1.7e-17458.12Show/hide
Query:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV
        LG LE A K+FDE  QPK VL N MVNGYLQN ++N+ IEL +MM R  LE DSY CNFALKACTFL+  + G                    SILNF+V
Subjt:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV

Query:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWL--------------------
        +TGDIMCAQ FF QM EKD VCWNVMIG FMQEGLF EG+ +F  ML N I PS + M SLIQSCG  R L+FGKC++                      
Subjt:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWL--------------------

Query:  --------CYWI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV
                  WI           W+                        DD GFDSGT VSLIQ+CS TADL+ GK+LH CIYRRGL +    S  ++ +
Subjt:  --------CYWI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV

Query:  LPVHYSVV-------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID
             S+        R+KNKN+ISWTAMLVGLAQNGHAR A +LF+QMQNER+  N LTLVSLV+CC LL  L EG  VHA L    FASE VV T LID
Subjt:  LPVHYSVV-------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID

Query:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCAC
        MYAKCSKI SAE V KYG TPK+ ILYNS+ISGYG HGLG              LQPNESTFVSLLSACSHSGLVE GI+LF N+  DHN TPT+KL AC
Subjt:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCAC

Query:  FVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG
         VDLLSRAGRLQQAE  INQM F P SGILETLLNGCL+H DIELGVK+ADRLLSL SRNPSIY+TLSNIY +A +WDSVK++RGLM EQE+KKI G
Subjt:  FVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG

A0A5D3CG12 Pentatricopeptide repeat-containing protein DOT41.7e-17458.12Show/hide
Query:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV
        LG LE A K+FDE  QPK VL N MVNGYLQN ++N+ IEL +MM R  LE DSY CNFALKACTFL+  + G                    SILNF+V
Subjt:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV

Query:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWL--------------------
        +TGDIMCAQ FF QM EKD VCWNVMIG FMQEGLF EG+ +F  ML N I PS + M SLIQSCG  R L+FGKC++                      
Subjt:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWL--------------------

Query:  --------CYWI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV
                  WI           W+                        DD GFDSGT VSLIQ+CS TADL+ GK+LH CIYRRGL +    S  ++ +
Subjt:  --------CYWI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV

Query:  LPVHYSVV-------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID
             S+        R+KNKN+ISWTAMLVGLAQNGHAR A +LF+QMQNER+  N LTLVSLV+CC LL  L EG  VHA L    FASE VV T LID
Subjt:  LPVHYSVV-------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID

Query:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCAC
        MYAKCSKI SAE V KYG TPK+ ILYNS+ISGYG HGLG              LQPNESTFVSLLSACSHSGLVE GI+LF N+  DHN TPT+KL AC
Subjt:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCAC

Query:  FVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG
         VDLLSRAGRLQQAE  INQM F P SGILETLLNGCL+H DIELGVK+ADRLLSL SRNPSIY+TLSNIY +A +WDSVK++RGLM EQE+KKI G
Subjt:  FVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG

A0A6J1DRK8 pentatricopeptide repeat-containing protein At4g21300-like1.4e-17058.36Show/hide
Query:  MVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFL-----------MILKWGW-------ISILNFMVRTGDIMCAQRFFDQMVEKDAVCWN
        MVNGYLQN +Y ESIELFK+MGR DLE DSY CNFALKACTFL           + +  GW        SILNF+V+TGDI  AQ+FF QM+ KD VCWN
Subjt:  MVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFL-----------MILKWGW-------ISILNFMVRTGDIMCAQRFFDQMVEKDAVCWN

Query:  VMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYW----------------------------LCYWIWH----------
        VMIG FM+EGL+SEGF VF+ ML + I P+ + MTSLIQ+CG    +EFGKCI+                                WI++          
Subjt:  VMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYW----------------------------LCYWIWH----------

Query:  -------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVV-------RMKNKNLIS
                                  DGGFDS T VSL+QVCSH  DL+ GK+LH C+YR GL +    S  ++ +     S+        RMK+KN+IS
Subjt:  -------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVV-------RMKNKNLIS

Query:  WTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVLKYG-FTPKE
        WTAMLVGL QNGHAR A RLFNQMQNE++  NALTL+SL+HCC LL SL++G  VHAILI  GF+S+ +  T LIDMYAKCSKI SAEKV KYG FTPK+
Subjt:  WTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVLKYG-FTPKE

Query:  AILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLF
         ILYNS+ISGYGTHGLG              LQPNESTFVSLL ACSHSGLVE G+SLF ++K DHNITPT+KL ACFVDLLSRAGRL+QA+A INQM F
Subjt:  AILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLF

Query:  KPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG
         P SGILETLLNGCLMH DIELGVKIADRLLSL+S+N SIY+TLSNIY EA+QWDSVKY+RGLM EQELKKI+G
Subjt:  KPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG

A0A6J1HC27 pentatricopeptide repeat-containing protein At5g39350-like7.4e-17557.72Show/hide
Query:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV
        LG LENA K+FD+  QPK +L N MVNGYLQN +YNE+IELFK+MGR   E DSY CNFALKAC FL+  + G                    SILNF+V
Subjt:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV

Query:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCI--YWLCY----------------
        + GDIM A+ FF +MVEKD VCWNVMIG FMQEGLFSEG+K+F+ ML N I PS + MTSL+QSCG +R LEFGKCI  Y L +                
Subjt:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCI--YWLCY----------------

Query:  ----------WI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV
                  WI           W+                        ++GGFDS T VSL+Q+CS TADL+ GK++H C+YRR L +    S  ++ +
Subjt:  ----------WI-----------WH------------------------DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV

Query:  ------LPVHYSVV-RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID
              L   YSV  RMK KN++SWTAMLVGLAQNG AR A +LF+QMQNER+  NALTLVSLVHCC LL SL EG  VHA+LI   FAS+ V  T LID
Subjt:  ------LPVHYSVV-RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID

Query:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLG------------RLQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACF
        MYAKCS+I S EKV  +GFTPK+ ILYNS+ISGYG HGLG             LQPNESTFVSLLSACSHSGLVE GISLF +++  H++TPT+KL ACF
Subjt:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLG------------RLQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACF

Query:  VDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG
        VDLLSRAGRL+QAE  INQM F+P SG+LETLLNGCL+H DIELGVKIADRLLS  SRNPS+YV+LSNIY EA +WD+V Y+RGLMTEQELKKI G
Subjt:  VDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG

A0A6J1HU79 pentatricopeptide repeat-containing protein At1g06140, mitochondrial-like2.6e-17257.38Show/hide
Query:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV
        LG LENA K+FD+  QPK VL N MVNGYLQN  YNE+IELFK+MGR   E DSY CNFALKAC FL+  + G                    SILNF+V
Subjt:  LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGW------------------ISILNFMV

Query:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIY----------------------
        + GDIM A+ FF +MVEKD VCWNVMIG FMQEGLFSEG+K+F+ ML N I PS + MTSL+QSCG +R LEFGKCI+                      
Subjt:  RTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIY----------------------

Query:  ----------W---------LCYW------------------IWH----DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV
                  W         L  W                  ++H    ++GGFDS T VSL+Q+CS TADL+ GK++H C+YRR L +    S  ++ +
Subjt:  ----------W---------LCYW------------------IWH----DDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQV

Query:  ------LPVHYSVV-RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID
              L   YSV  RMK KN++SWTAMLVGLAQNG AR A +LF+QMQNER+  NALTLVSLVHCC LL SL EG  VHA+LI   FA + V  T LID
Subjt:  ------LPVHYSVV-RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLID

Query:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACF
        MYAKCS+I S EKV  +GFTPK+ ILYNS+IS YG HG GR            LQPNESTFVSLLSACSHSGLVE GISLF N++  HN+TPT+KL ACF
Subjt:  MYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACF

Query:  VDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG
        VDLLSRAGRL QAE  IN M F+P SGILETLLNGCL+H +IELGVKIADRLLSL SRNPS+YV+LSNIY EA +WD+V  +R LMTEQELKKI G
Subjt:  VDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISG

SwissProt top hitse value%identityAlignment
P0C898 Putative pentatricopeptide repeat-containing protein At3g151305.1e-5629.45Show/hide
Query:  AHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWG------------------WISILNFMVRTGDIM
        A+K+FD   +   V ++ +++G++ NG+   S+ LF  MGR  +  + +  +  LKAC  L  L+ G                    S+++   + G I 
Subjt:  AHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWG------------------WISILNFMVRTGDIM

Query:  CAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIV--PSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVCS
         A++ F ++V++  + WN MI  F+  G  S+    F  M   NI   P    +TSL+++C +   +  GK I+               G  V     C 
Subjt:  CAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIV--PSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVCS

Query:  HTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLH
         +A +    V         L VK  +      +     +  ++K K +ISW+++++G AQ G   +A  LF ++Q      ++  L S++   A  + L 
Subjt:  HTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLH

Query:  EGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGL
        +G ++ A+ +      E  V   ++DMY KC  +  AEK        K+ I +  +I+GYG HGLG+             ++P+E  ++++LSACSHSG+
Subjt:  EGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGL

Query:  VEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEA
        ++ G  LF  +   H I P  +  AC VDLL RAGRL++A+  I+ M  KP  GI +TLL+ C +H DIELG ++   LL ++++NP+ YV +SN+Y +A
Subjt:  VEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEA

Query:  KQWDSVKYIRGLMTEQELKKISG
          W+     R L   + LKK +G
Subjt:  KQWDSVKYIRGLMTEQELKKISG

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.3e-5626.45Show/hide
Query:  MYC-LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKAC-----------TFLMILKWGW-------ISIL
        ++C  G ++ A ++F+      NVLY+TM+ G+ +  + +++++ F  M   D+E   Y   + LK C              +++K G+         + 
Subjt:  MYC-LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKAC-----------TFLMILKWGW-------ISIL

Query:  NFMVRTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIY------------------
        N   +   +  A++ FD+M E+D V WN ++  + Q G+     ++   M   N+ PSF+ + S++ +  A+R +  GK I+                  
Subjt:  NFMVRTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIY------------------

Query:  ---------------------------W-----------------LCYWIWHDDGGFDSGTFV-SLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVP
                                   W                 L +    D+G   +   V   +  C+   DL  G+ +H      GL      +V 
Subjt:  ---------------------------W-----------------LCYWIWHDDGGFDSGTFV-SLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVP

Query:  VLQVLPVHYSVV-----------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASED
        V+  L   Y              +++++ L+SW AM++G AQNG    A   F+QM++  +  +  T VS++   A LS  H    +H +++        
Subjt:  VLQVLPVHYSVV-----------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASED

Query:  VVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNIT
         V T L+DMYAKC  I+ A  +     + +    +N++I GYGTHG G+             ++PN  TF+S++SACSHSGLVE G+  F+ +K +++I 
Subjt:  VVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNIT

Query:  PTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQEL
         +       VDLL RAGRL +A  FI QM  KP   +   +L  C +H ++    K A+RL  LN  +   +V L+NIY  A  W+ V  +R  M  Q L
Subjt:  PTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQEL

Query:  KKISG
        +K  G
Subjt:  KKISG

Q9LND4 Pentatricopeptide repeat-containing protein At1g06140, mitochondrial1.3e-5428.82Show/hide
Query:  YNTMVNGYLQNGN--YNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWI------------------SILNFMVRTGDIMCAQRFFDQMVEKD
        +NT+++GY ++    Y++ + L+  M R+   +DS+   FA+KAC  L +L+ G +                  S++    + G +  AQ+ FD++  ++
Subjt:  YNTMVNGYLQNGN--YNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWI------------------SILNFMVRTGDIMCAQRFFDQMVEKD

Query:  AVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCI
        +V W V++  +++     E F++F  M    +    + +  L+++CG +   + GKC++                  VS+ +     +D     ++   +
Subjt:  AVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCI

Query:  YRRGL-GVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGF
          R L   +  F   V              ++N++ WT ++ G A+   A +AF LF QM  E I+ N  TL +++  C+ L SL  G  VH  +I  G 
Subjt:  YRRGL-GVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGF

Query:  ASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGL-------------GRLQPNESTFVSLLSACSHSGLVEVGISLFHNIKND
          + V  T  IDMYA+C  I  A  V       +  I ++S+I+ +G +GL               + PN  TFVSLLSACSHSG V+ G   F ++  D
Subjt:  ASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGL-------------GRLQPNESTFVSLLSACSHSGLVEVGISLFHNIKND

Query:  HNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMT
        + + P  +  AC VDLL RAG + +A++FI+ M  KP++     LL+ C +H +++L  +IA++LLS+     S+YV LSNIY +A  W+ V  +R  M 
Subjt:  HNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMT

Query:  EQELKKISGK
         +  +K  G+
Subjt:  EQELKKISGK

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.7e-5928.15Show/hide
Query:  LENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWI------------------SILNFMVRTG
        +++A K+FDE  +   + +N+++NGY+ NG   + + +F  M    +E+D          C    ++  G                    ++L+   + G
Subjt:  LENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWI------------------SILNFMVRTG

Query:  DIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVC
        D+  A+  F +M ++  V +  MI  + +EGL  E  K+F  M    I P    +T+++  C   R L+ GK ++    WI  +D GFD     +L+ + 
Subjt:  DIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVC

Query:  SHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFN-QMQNERIVCNALTLVSLVHCCALLSS
        +    + + +++              FS               M+ K++ISW  ++ G ++N +A +A  LFN  ++ +R   +  T+  ++  CA LS+
Subjt:  SHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFN-QMQNERIVCNALTLVSLVHCCALLSS

Query:  LHEGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHS
          +G  +H  ++  G+ S+  VA  L+DMYAKC  ++ A  +L      K+ + +  +I+GYG HG G+             ++ +E +FVSLL ACSHS
Subjt:  LHEGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHS

Query:  GLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYT
        GLV+ G   F+ ++++  I PT +  AC VD+L+R G L +A  FI  M   P + I   LL GC +H+D++L  K+A+++  L   N   YV ++NIY 
Subjt:  GLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYT

Query:  EAKQWDSVKYIRGLMTEQELKKISGKGENLQPKVNFLELK
        EA++W+ VK +R        K+I  +G    P  +++E+K
Subjt:  EAKQWDSVKYIRGLMTEQELKKISGKGENLQPKVNFLELK

Q9ZQ74 Pentatricopeptide repeat-containing protein At2g03380, mitochondrial3.7e-6230.49Show/hide
Query:  GYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWG-WI-----------------SILNFMVR
        G +++AHK+F++      V + +M+ GY++N    E + LF  M   ++  + Y     + ACT L  L  G W                  S+L+  V+
Subjt:  GYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWG-WI-----------------SILNFMVR

Query:  TGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQ
         GDI  A+R F++    D V W  MI  +   G  +E   +F  M    I P+ + + S++  CG I  LE G+ ++ L   +    G +D+    +L+ 
Subjt:  TGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQ

Query:  VCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLS
        + +      D K                            Y       K++++W +++ G +QNG   +A  LF++M +E +  N +T+ SL   CA L 
Subjt:  VCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLS

Query:  SLHEGLRVHAILICCGF-ASEDV-VATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHG--LGRLQ-----------PNESTFVSLLSAC
        SL  G  +HA  +  GF AS  V V T L+D YAKC    SA  +       K  I ++++I GYG  G  +G L+           PNESTF S+LSAC
Subjt:  SLHEGLRVHAILICCGF-ASEDV-VATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHG--LGRLQ-----------PNESTFVSLLSAC

Query:  SHSGLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSN
         H+G+V  G   F ++  D+N TP+ K   C VD+L+RAG L+QA   I +M  +P        L+GC MH+  +LG  +  ++L L+  + S YV +SN
Subjt:  SHSGLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSN

Query:  IYTEAKQWDSVKYIRGLMTEQELKKISG
        +Y    +W+  K +R LM ++ L KI+G
Subjt:  IYTEAKQWDSVKYIRGLMTEQELKKISG

Arabidopsis top hitse value%identityAlignment
AT1G06140.1 Pentatricopeptide repeat (PPR) superfamily protein9.0e-5628.82Show/hide
Query:  YNTMVNGYLQNGN--YNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWI------------------SILNFMVRTGDIMCAQRFFDQMVEKD
        +NT+++GY ++    Y++ + L+  M R+   +DS+   FA+KAC  L +L+ G +                  S++    + G +  AQ+ FD++  ++
Subjt:  YNTMVNGYLQNGN--YNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWI------------------SILNFMVRTGDIMCAQRFFDQMVEKD

Query:  AVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCI
        +V W V++  +++     E F++F  M    +    + +  L+++CG +   + GKC++                  VS+ +     +D     ++   +
Subjt:  AVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCI

Query:  YRRGL-GVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGF
          R L   +  F   V              ++N++ WT ++ G A+   A +AF LF QM  E I+ N  TL +++  C+ L SL  G  VH  +I  G 
Subjt:  YRRGL-GVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGF

Query:  ASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGL-------------GRLQPNESTFVSLLSACSHSGLVEVGISLFHNIKND
          + V  T  IDMYA+C  I  A  V       +  I ++S+I+ +G +GL               + PN  TFVSLLSACSHSG V+ G   F ++  D
Subjt:  ASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGL-------------GRLQPNESTFVSLLSACSHSGLVEVGISLFHNIKND

Query:  HNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMT
        + + P  +  AC VDLL RAG + +A++FI+ M  KP++     LL+ C +H +++L  +IA++LLS+     S+YV LSNIY +A  W+ V  +R  M 
Subjt:  HNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMT

Query:  EQELKKISGK
         +  +K  G+
Subjt:  EQELKKISGK

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-5726.45Show/hide
Query:  MYC-LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKAC-----------TFLMILKWGW-------ISIL
        ++C  G ++ A ++F+      NVLY+TM+ G+ +  + +++++ F  M   D+E   Y   + LK C              +++K G+         + 
Subjt:  MYC-LGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKAC-----------TFLMILKWGW-------ISIL

Query:  NFMVRTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIY------------------
        N   +   +  A++ FD+M E+D V WN ++  + Q G+     ++   M   N+ PSF+ + S++ +  A+R +  GK I+                  
Subjt:  NFMVRTGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIY------------------

Query:  ---------------------------W-----------------LCYWIWHDDGGFDSGTFV-SLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVP
                                   W                 L +    D+G   +   V   +  C+   DL  G+ +H      GL      +V 
Subjt:  ---------------------------W-----------------LCYWIWHDDGGFDSGTFV-SLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVP

Query:  VLQVLPVHYSVV-----------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASED
        V+  L   Y              +++++ L+SW AM++G AQNG    A   F+QM++  +  +  T VS++   A LS  H    +H +++        
Subjt:  VLQVLPVHYSVV-----------RMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASED

Query:  VVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNIT
         V T L+DMYAKC  I+ A  +     + +    +N++I GYGTHG G+             ++PN  TF+S++SACSHSGLVE G+  F+ +K +++I 
Subjt:  VVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNIT

Query:  PTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQEL
         +       VDLL RAGRL +A  FI QM  KP   +   +L  C +H ++    K A+RL  LN  +   +V L+NIY  A  W+ V  +R  M  Q L
Subjt:  PTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQEL

Query:  KKISG
        +K  G
Subjt:  KKISG

AT2G03380.1 Pentatricopeptide repeat (PPR) superfamily protein2.6e-6330.49Show/hide
Query:  GYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWG-WI-----------------SILNFMVR
        G +++AHK+F++      V + +M+ GY++N    E + LF  M   ++  + Y     + ACT L  L  G W                  S+L+  V+
Subjt:  GYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWG-WI-----------------SILNFMVR

Query:  TGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQ
         GDI  A+R F++    D V W  MI  +   G  +E   +F  M    I P+ + + S++  CG I  LE G+ ++ L   +    G +D+    +L+ 
Subjt:  TGDIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQ

Query:  VCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLS
        + +      D K                            Y       K++++W +++ G +QNG   +A  LF++M +E +  N +T+ SL   CA L 
Subjt:  VCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLS

Query:  SLHEGLRVHAILICCGF-ASEDV-VATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHG--LGRLQ-----------PNESTFVSLLSAC
        SL  G  +HA  +  GF AS  V V T L+D YAKC    SA  +       K  I ++++I GYG  G  +G L+           PNESTF S+LSAC
Subjt:  SLHEGLRVHAILICCGF-ASEDV-VATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHG--LGRLQ-----------PNESTFVSLLSAC

Query:  SHSGLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSN
         H+G+V  G   F ++  D+N TP+ K   C VD+L+RAG L+QA   I +M  +P        L+GC MH+  +LG  +  ++L L+  + S YV +SN
Subjt:  SHSGLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSN

Query:  IYTEAKQWDSVKYIRGLMTEQELKKISG
        +Y    +W+  K +R LM ++ L KI+G
Subjt:  IYTEAKQWDSVKYIRGLMTEQELKKISG

AT3G15130.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.6e-5729.45Show/hide
Query:  AHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWG------------------WISILNFMVRTGDIM
        A+K+FD   +   V ++ +++G++ NG+   S+ LF  MGR  +  + +  +  LKAC  L  L+ G                    S+++   + G I 
Subjt:  AHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWG------------------WISILNFMVRTGDIM

Query:  CAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIV--PSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVCS
         A++ F ++V++  + WN MI  F+  G  S+    F  M   NI   P    +TSL+++C +   +  GK I+               G  V     C 
Subjt:  CAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIV--PSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVCS

Query:  HTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLH
         +A +    V         L VK  +      +     +  ++K K +ISW+++++G AQ G   +A  LF ++Q      ++  L S++   A  + L 
Subjt:  HTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLH

Query:  EGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGL
        +G ++ A+ +      E  V   ++DMY KC  +  AEK        K+ I +  +I+GYG HGLG+             ++P+E  ++++LSACSHSG+
Subjt:  EGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHSGL

Query:  VEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEA
        ++ G  LF  +   H I P  +  AC VDLL RAGRL++A+  I+ M  KP  GI +TLL+ C +H DIELG ++   LL ++++NP+ YV +SN+Y +A
Subjt:  VEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEA

Query:  KQWDSVKYIRGLMTEQELKKISG
          W+     R L   + LKK +G
Subjt:  KQWDSVKYIRGLMTEQELKKISG

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-6028.15Show/hide
Query:  LENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWI------------------SILNFMVRTG
        +++A K+FDE  +   + +N+++NGY+ NG   + + +F  M    +E+D          C    ++  G                    ++L+   + G
Subjt:  LENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWI------------------SILNFMVRTG

Query:  DIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVC
        D+  A+  F +M ++  V +  MI  + +EGL  E  K+F  M    I P    +T+++  C   R L+ GK ++    WI  +D GFD     +L+ + 
Subjt:  DIMCAQRFFDQMVEKDAVCWNVMIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVC

Query:  SHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFN-QMQNERIVCNALTLVSLVHCCALLSS
        +    + + +++              FS               M+ K++ISW  ++ G ++N +A +A  LFN  ++ +R   +  T+  ++  CA LS+
Subjt:  SHTADLNDGKVLHDCIYRRGLGVKAQFSVPVLQVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFN-QMQNERIVCNALTLVSLVHCCALLSS

Query:  LHEGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHS
          +G  +H  ++  G+ S+  VA  L+DMYAKC  ++ A  +L      K+ + +  +I+GYG HG G+             ++ +E +FVSLL ACSHS
Subjt:  LHEGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVLKYGFTPKEAILYNSIISGYGTHGLGR-------------LQPNESTFVSLLSACSHS

Query:  GLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYT
        GLV+ G   F+ ++++  I PT +  AC VD+L+R G L +A  FI  M   P + I   LL GC +H+D++L  K+A+++  L   N   YV ++NIY 
Subjt:  GLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCLMHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYT

Query:  EAKQWDSVKYIRGLMTEQELKKISGKGENLQPKVNFLELK
        EA++W+ VK +R        K+I  +G    P  +++E+K
Subjt:  EAKQWDSVKYIRGLMTEQELKKISGKGENLQPKVNFLELK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACTGTTTGGGTTATTTGGAAAATGCACACAAACTATTTGATGAATTTCTTCAACCAAAAAATGTTCTTTACAATACCATGGTTAATGGGTATCTGCAGAATGGGAA
TTATAATGAGAGTATTGAGCTGTTTAAGATGATGGGTAGATATGATTTGGAGTTAGATAGTTATATGTGTAATTTTGCTCTTAAGGCATGTACTTTCTTAATGATTTTGA
AATGGGGATGGATTTCGATTTTGAATTTTATGGTAAGAACTGGTGATATTATGTGTGCACAAAGATTTTTTGATCAAATGGTTGAGAAAGATGCTGTTTGTTGGAATGTA
ATGATTGGTGCCTTTATGCAAGAAGGGTTGTTTAGTGAAGGTTTTAAGGTGTTTATTGGTATGCTTTGTAATAATATTGTGCCTAGTTTTATGATCATGACAAGCTTGAT
TCAATCATGTGGGGCGATTAGGTATTTGGAGTTTGGAAAATGTATTTATTGGCTATGTTATTGGATTTGGCATGATGATGGAGGTTTTGATTCGGGTACCTTTGTTAGCC
TCATCCAGGTTTGTTCTCACACGGCTGATCTGAACGACGGGAAAGTTCTTCACGACTGTATTTATCGAAGGGGGCTTGGTGTAAAGGCCCAGTTTTCCGTTCCAGTTCTT
CAAGTGCTTCCGGTTCATTATTCGGTTGTTCGAATGAAAAATAAGAATTTGATTTCATGGACTGCTATGCTTGTGGGATTGGCACAAAATGGGCATGCTAGAAAAGCTTT
TAGGTTATTTAATCAGATGCAAAATGAGAGGATTGTGTGCAACGCTCTCACCTTAGTCAGTCTAGTTCATTGTTGTGCGCTCCTCAGCTCGTTGCATGAAGGTTTACGTG
TGCATGCTATCTTAATTTGTTGTGGTTTTGCTTCCGAAGATGTCGTTGCGACCGTGCTCATTGATATGTATGCAAAATGCAGCAAAATAGTATCAGCTGAGAAGGTACTC
AAGTATGGTTTCACACCCAAGGAAGCGATTTTGTATAATTCTATAATTTCAGGCTATGGAACACACGGTCTCGGGCGACTTCAGCCAAATGAAAGCACCTTTGTTTCTCT
GCTATCTGCTTGTAGCCATTCAGGCCTCGTAGAAGTAGGAATCTCTCTGTTTCATAATATAAAGAATGACCATAACATAACACCTACTAATAAACTTTGTGCTTGTTTTG
TTGATCTTCTAAGTCGAGCGGGACGCCTGCAGCAAGCCGAGGCATTTATAAACCAAATGCTTTTCAAACCAATAAGTGGCATACTCGAAACTCTGCTGAATGGATGTCTG
ATGCACAATGACATTGAATTGGGTGTAAAAATTGCTGACAGATTACTGTCCTTGAATTCTAGAAATCCCAGCATCTATGTTACCTTGTCGAATATATACACCGAAGCGAA
ACAATGGGATTCAGTTAAGTATATACGAGGGCTCATGACTGAGCAGGAACTTAAAAAGATTTCGGGCAAGGGGGAGAATTTGCAGCCTAAAGTAAATTTTTTGGAGTTAA
AAGATATCCATTTGTTGGGTCATATTTTGTTCAAGGAAATTCTTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTACTGTTTGGGTTATTTGGAAAATGCACACAAACTATTTGATGAATTTCTTCAACCAAAAAATGTTCTTTACAATACCATGGTTAATGGGTATCTGCAGAATGGGAA
TTATAATGAGAGTATTGAGCTGTTTAAGATGATGGGTAGATATGATTTGGAGTTAGATAGTTATATGTGTAATTTTGCTCTTAAGGCATGTACTTTCTTAATGATTTTGA
AATGGGGATGGATTTCGATTTTGAATTTTATGGTAAGAACTGGTGATATTATGTGTGCACAAAGATTTTTTGATCAAATGGTTGAGAAAGATGCTGTTTGTTGGAATGTA
ATGATTGGTGCCTTTATGCAAGAAGGGTTGTTTAGTGAAGGTTTTAAGGTGTTTATTGGTATGCTTTGTAATAATATTGTGCCTAGTTTTATGATCATGACAAGCTTGAT
TCAATCATGTGGGGCGATTAGGTATTTGGAGTTTGGAAAATGTATTTATTGGCTATGTTATTGGATTTGGCATGATGATGGAGGTTTTGATTCGGGTACCTTTGTTAGCC
TCATCCAGGTTTGTTCTCACACGGCTGATCTGAACGACGGGAAAGTTCTTCACGACTGTATTTATCGAAGGGGGCTTGGTGTAAAGGCCCAGTTTTCCGTTCCAGTTCTT
CAAGTGCTTCCGGTTCATTATTCGGTTGTTCGAATGAAAAATAAGAATTTGATTTCATGGACTGCTATGCTTGTGGGATTGGCACAAAATGGGCATGCTAGAAAAGCTTT
TAGGTTATTTAATCAGATGCAAAATGAGAGGATTGTGTGCAACGCTCTCACCTTAGTCAGTCTAGTTCATTGTTGTGCGCTCCTCAGCTCGTTGCATGAAGGTTTACGTG
TGCATGCTATCTTAATTTGTTGTGGTTTTGCTTCCGAAGATGTCGTTGCGACCGTGCTCATTGATATGTATGCAAAATGCAGCAAAATAGTATCAGCTGAGAAGGTACTC
AAGTATGGTTTCACACCCAAGGAAGCGATTTTGTATAATTCTATAATTTCAGGCTATGGAACACACGGTCTCGGGCGACTTCAGCCAAATGAAAGCACCTTTGTTTCTCT
GCTATCTGCTTGTAGCCATTCAGGCCTCGTAGAAGTAGGAATCTCTCTGTTTCATAATATAAAGAATGACCATAACATAACACCTACTAATAAACTTTGTGCTTGTTTTG
TTGATCTTCTAAGTCGAGCGGGACGCCTGCAGCAAGCCGAGGCATTTATAAACCAAATGCTTTTCAAACCAATAAGTGGCATACTCGAAACTCTGCTGAATGGATGTCTG
ATGCACAATGACATTGAATTGGGTGTAAAAATTGCTGACAGATTACTGTCCTTGAATTCTAGAAATCCCAGCATCTATGTTACCTTGTCGAATATATACACCGAAGCGAA
ACAATGGGATTCAGTTAAGTATATACGAGGGCTCATGACTGAGCAGGAACTTAAAAAGATTTCGGGCAAGGGGGAGAATTTGCAGCCTAAAGTAAATTTTTTGGAGTTAA
AAGATATCCATTTGTTGGGTCATATTTTGTTCAAGGAAATTCTTTTTTAA
Protein sequenceShow/hide protein sequence
MYCLGYLENAHKLFDEFLQPKNVLYNTMVNGYLQNGNYNESIELFKMMGRYDLELDSYMCNFALKACTFLMILKWGWISILNFMVRTGDIMCAQRFFDQMVEKDAVCWNV
MIGAFMQEGLFSEGFKVFIGMLCNNIVPSFMIMTSLIQSCGAIRYLEFGKCIYWLCYWIWHDDGGFDSGTFVSLIQVCSHTADLNDGKVLHDCIYRRGLGVKAQFSVPVL
QVLPVHYSVVRMKNKNLISWTAMLVGLAQNGHARKAFRLFNQMQNERIVCNALTLVSLVHCCALLSSLHEGLRVHAILICCGFASEDVVATVLIDMYAKCSKIVSAEKVL
KYGFTPKEAILYNSIISGYGTHGLGRLQPNESTFVSLLSACSHSGLVEVGISLFHNIKNDHNITPTNKLCACFVDLLSRAGRLQQAEAFINQMLFKPISGILETLLNGCL
MHNDIELGVKIADRLLSLNSRNPSIYVTLSNIYTEAKQWDSVKYIRGLMTEQELKKISGKGENLQPKVNFLELKDIHLLGHILFKEILF