; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016813 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016813
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat (PPR-like) superfamily protein
Genome locationchr12:41501932..41507296
RNA-Seq ExpressionLag0016813
SyntenyLag0016813
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606718.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]9.8e-18466.55Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL
        RASSNLG FNLCK+LHCHVVQFGF NH+HVVNEL+GMYVKL RM   RK                               MFLQMELEGVE NPVTWTSL
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +SSHARCGHLEET+ALFSKMRMKGVGATAEML VVLS+C D+A L+RGQM+HGYIVKGG+E YLFAKNALI VYG+GGD+RD EKLFHEMKV++LVSWN+
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LISSYAESGLYDKAFE FS+  +M+  PEMKP+VITWSAVICGFAS G GEESLEVFRQMQLANV ANSVTISSVLSI A LAALNLGREMHGHVIRA M
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR
        +DNILVGNGLINMYTKCGSFKP CLVF KLENRDLISWNSMIA YGMHGLGKDAL TFD +I SG      P  + F+  ++    + +V  G ++    
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR

Query:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI
             +K    H                 S++V  +  K NA  +     +C               I NLNSEITGSHMLLS+IF+ SCR  DSARVRI
Subjt:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI

Query:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE
        SARM+GLKKVPGCSWIEVKK+VYMFK+ NS+QEGLE+VDEILHDLALQIE
Subjt:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE

XP_022148095.1 putative pentatricopeptide repeat-containing protein At1g17630 [Momordica charantia]7.0e-18266.67Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL
        RASSNLGSFNLCKNLHCHVVQFGFQNH+HVVNELIGMY KLGRM   +K                               MFL+ME EGVE NPVTWTSL
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +SSHARCGHLE TMALFS+MRMKG+GATAEML VVLS+C D+A L+RGQMIHGYI+KGG+E YLFAKNALI VYG+GGDIRD EKLFHEM+V++LVSWNA
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LISSYAESGL DKAFEVFSQ EKMDV+PEMKP+VITWSAVICGFASKGLGEESLEVFRQMQLA+V ANSVTISSV S+ A LAALNLGREMHGHVIRALM
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR
        DDNILVGNGLINMYTKCG+FKP CLVFEKLENRDLISWNSMIA YG HGLGKDALT FD +I SG     +P  + F+  ++    + +V  G ++    
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR

Query:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNCP-------------GIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI
             +K    H                 S++V G+  + NA  +     +C               I NLNSEI GSHMLLS+IFA   R  DSARVRI
Subjt:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNCP-------------GIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI

Query:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIEGN
         AR +GL  VPG SWIEVKKKVYMFKA NSI EGLEKVDEILHDLALQIEG+
Subjt:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIEGN

XP_022949499.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita moschata]5.2e-18567.09Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL
        RASSNLG FNLCK+LHCHVVQFGF NH+HVVNEL+GMYVKL RM   RK                               MFLQMELEGVE NPVTWTSL
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +SSHARCGHLEET+ALFSKMRMKGVGATAEML VVLS+C D+A L+RGQM+HGYIVKGG+E YLFAKNALI VYG+GGDIRD EKLFHEMKV++LVSWN+
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LISSYAESGLYDKAFE FS+ EKM   PEMKP+VITWSAVICGFAS G GEESLEVFRQMQLANV ANSVTISSVLSI A LAALNLGREMHGHVIRA M
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR
        +DNILVGNGLINMYTKCGSFKP CLVF KLENRDLISWNSMIA YGMHGLGKDAL TFD +I SG      P  + F+  ++    + +V  G ++    
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR

Query:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI
             +K    H                 S++V  +  K NA  +     +C               I NLNSEITGSHMLLS+IF+ SCR  DSARVRI
Subjt:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI

Query:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE
        SARM+GLKKVPGCSWIEVKK+VYMFK+ NS+QEGLE+VDEILHDLALQIE
Subjt:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE

XP_022997825.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita maxima]3.6e-18667.64Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL
        RASSNLG FNLCK LHCHVVQFGFQNH+HVVNELIGMYVKL RM   RK                               MFLQMELEGVE NPVTWTSL
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +SSHARCGHLEET+ALFSKMRMKGVGATAEML VVLS+C D+A L+RGQM+HGYIVKGG+E YLFAKNALI VYG+GGDIRD EKLFHEMKV++LVSWN+
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LISSYAESGLYDKAFE FS+ EKM+  PEMKP+VITWSAVICGFAS G GEESLEVFRQMQLANV ANSVTISSVLSI A LAALNLGREMHGHVIRA M
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR
        +DNILVGNGLINMYTKCGSFKP CLVFEKLENRDLISWNSMIA YGMHGLGKDAL TFD +I SG      P  + F+  ++    + +V  G ++    
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR

Query:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI
             +K    H                 S++V  +  K N   +     +C               I NLNSEI GSHMLLS+IF+ SCR  DSARVRI
Subjt:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI

Query:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE
        SARM+GLKKVPGCSWIEVKKKVYMFKA NS+QEGLE+VDEILHDLALQIE
Subjt:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE

XP_023523771.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita pepo subsp. pepo]9.8e-18466.73Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL
        RASSNLG FNLCK+LHCHVVQFGFQNH+HVVNEL+GMYVKL RM   RK                               MFLQMELEGVE NPVTWTSL
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +SSHARCGHLEET+ALFSKMRMKGVGATAEML VVLS+C D+   +RGQM+HGYIVKGG+E YLFAKNALI VYG+GGDIRD EKLFHEMKV++LVSWN+
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LISSYAESGLYDKAFE FS+ EKM+  PEMKPSVITWSAVICGFAS G GEESLEVFRQMQLANV ANSVTISSVLSI A LAALNLGREMHGHVIRA M
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR
        +DNILVGNGLINMYTKCGSFKP CLVF KLENRDLISWNS+IA YGMHGLGKDAL TFD +I SG      P  + F+  ++    + +V  G ++    
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR

Query:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI
             +K    H                 S++V  +  K NA  +     +C               I +L+SEITGSHMLLS+I++ SCR  DSARVRI
Subjt:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI

Query:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE
        SARM+GLKKVPGCSWIEVKKKVYMFKA NS+QEGLE+VDEILHDLALQIE
Subjt:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE

TrEMBL top hitse value%identityAlignment
A0A0A0LFT1 Uncharacterized protein4.8e-17664Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL
        RASSNLG+FN+CKNLHCHVVQFGFQNH+HV NELIGMY KL RM   RK                               MF QMELEGVE NPVTWTSL
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +SSHARCGHLEETM LF KMRMKGVG TAEML VVLS+C D+A LN GQMIHGY+VKGG+  YLFAKNALI +YG+GG + D EKLFHEMKV++LVSWNA
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LISS+AESG+YDKA E+ SQ EKM+ +PEMKP+VITWSA+ICGFASKGLGEESLEVFR+MQLANV ANSVTI+SVLSI A LAALNLGREMHGHVIRA M
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTS
        DDN+LVGNGLINMYTKCGSFKP  +VFEKLENRD ISWNSMIA YG HGLGKDAL TF+++I SG      P  + F+  ++    + +V      +   
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTS

Query:  RSNLK-------WSTM--------HAWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI
        R N K       ++ M             S+++ G+  + NA  + +   +C               I NLNS+ITGSHMLLS+IFA SCR  DSARVRI
Subjt:  RSNLK-------WSTM--------HAWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI

Query:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE
        SAR +GLKKVPG SWIEVKKKVYMFKA  +I EGLEKVDEILHDLA QIE
Subjt:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE

A0A5A7U7B1 Putative pentatricopeptide repeat-containing protein2.1e-17162.55Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL
        RASSNLG+ ++CKNLHCHVVQFGFQNH+HV NELIGMY KL RM   RK                               MF QMELEGVE NPVTWTSL
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +SSHARCGHL ETM LF KMRMKGVGATAEML VVLS+C D+A LN GQMIHGY+VKGG+  YLFAKNALI +YG+GGD+ D EKLFHEMKV++LVSWNA
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LISS+AESG+YDKA E+ SQ EKM+ +PEMKP+VITWS++ICGF+SKGLGEESLEVFR+MQLANV ANSVTI+SVLSI A LAALNLGRE+HGHVIRA M
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTS
        D+N+LVGNGLINMYTKCGSFKP  LVFEKLENRD ISWNSMIA YG HGLGKDAL T +++I SG      P  + F+  ++    + +V      +   
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTS

Query:  RSNLK-------WSTM--------HAWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI
        R N K       ++ M             S+++  +  + NA  + +   +C               I NLNS+ITGSHMLLS+IFA SCR  DSARVRI
Subjt:  RSNLK-------WSTM--------HAWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI

Query:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE
        SAR++GLKKVPG SWIEVKKKVY+FKA  +  EGLEKVDEILHDLA QIE
Subjt:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE

A0A6J1D1Z6 putative pentatricopeptide repeat-containing protein At1g176303.4e-18266.67Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL
        RASSNLGSFNLCKNLHCHVVQFGFQNH+HVVNELIGMY KLGRM   +K                               MFL+ME EGVE NPVTWTSL
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +SSHARCGHLE TMALFS+MRMKG+GATAEML VVLS+C D+A L+RGQMIHGYI+KGG+E YLFAKNALI VYG+GGDIRD EKLFHEM+V++LVSWNA
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LISSYAESGL DKAFEVFSQ EKMDV+PEMKP+VITWSAVICGFASKGLGEESLEVFRQMQLA+V ANSVTISSV S+ A LAALNLGREMHGHVIRALM
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR
        DDNILVGNGLINMYTKCG+FKP CLVFEKLENRDLISWNSMIA YG HGLGKDALT FD +I SG     +P  + F+  ++    + +V  G ++    
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR

Query:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNCP-------------GIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI
             +K    H                 S++V G+  + NA  +     +C               I NLNSEI GSHMLLS+IFA   R  DSARVRI
Subjt:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNCP-------------GIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI

Query:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIEGN
         AR +GL  VPG SWIEVKKKVYMFKA NSI EGLEKVDEILHDLALQIEG+
Subjt:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIEGN

A0A6J1GD03 putative pentatricopeptide repeat-containing protein At1g176302.5e-18567.09Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL
        RASSNLG FNLCK+LHCHVVQFGF NH+HVVNEL+GMYVKL RM   RK                               MFLQMELEGVE NPVTWTSL
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +SSHARCGHLEET+ALFSKMRMKGVGATAEML VVLS+C D+A L+RGQM+HGYIVKGG+E YLFAKNALI VYG+GGDIRD EKLFHEMKV++LVSWN+
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LISSYAESGLYDKAFE FS+ EKM   PEMKP+VITWSAVICGFAS G GEESLEVFRQMQLANV ANSVTISSVLSI A LAALNLGREMHGHVIRA M
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR
        +DNILVGNGLINMYTKCGSFKP CLVF KLENRDLISWNSMIA YGMHGLGKDAL TFD +I SG      P  + F+  ++    + +V  G ++    
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR

Query:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI
             +K    H                 S++V  +  K NA  +     +C               I NLNSEITGSHMLLS+IF+ SCR  DSARVRI
Subjt:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI

Query:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE
        SARM+GLKKVPGCSWIEVKK+VYMFK+ NS+QEGLE+VDEILHDLALQIE
Subjt:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE

A0A6J1KAZ0 putative pentatricopeptide repeat-containing protein At1g176301.7e-18667.64Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL
        RASSNLG FNLCK LHCHVVQFGFQNH+HVVNELIGMYVKL RM   RK                               MFLQMELEGVE NPVTWTSL
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRK-------------------------------MFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +SSHARCGHLEET+ALFSKMRMKGVGATAEML VVLS+C D+A L+RGQM+HGYIVKGG+E YLFAKNALI VYG+GGDIRD EKLFHEMKV++LVSWN+
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LISSYAESGLYDKAFE FS+ EKM+  PEMKP+VITWSAVICGFAS G GEESLEVFRQMQLANV ANSVTISSVLSI A LAALNLGREMHGHVIRA M
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR
        +DNILVGNGLINMYTKCGSFKP CLVFEKLENRDLISWNSMIA YGMHGLGKDAL TFD +I SG      P  + F+  ++    + +V  G ++    
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV--GFFIRCYR

Query:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI
             +K    H                 S++V  +  K N   +     +C               I NLNSEI GSHMLLS+IF+ SCR  DSARVRI
Subjt:  TSRSNLKWSTMH-------------AWSISSVVLGLWNKRNAQRYRTCRRNC-------------PGIFNLNSEITGSHMLLSSIFATSCRL-DSARVRI

Query:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE
        SARM+GLKKVPGCSWIEVKKKVYMFKA NS+QEGLE+VDEILHDLALQIE
Subjt:  SARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIE

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic3.0e-4226.13Show/hide
Query:  ASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAEML
        A S L   ++ K +H + ++ GF + +++   L+ MY K G +   R++F  M    +E N V+W S++ ++ +  + +E M +F KM  +GV  T   +
Subjt:  ASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAEML

Query:  VVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALIVYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEMKPS
        +  L  C D+  L RG+ IH   V+ G +     +N  +V                         N+LIS Y +    D A  +F + +          +
Subjt:  VVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALIVYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEMKPS

Query:  VITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENR
        +++W+A+I GFA  G   ++L  F QM+   V  ++ T  SV++  A L+  +  + +HG V+R+ +D N+ V   L++MY KCG+     L+F+ +  R
Subjt:  VITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENR

Query:  DLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCY------------------------RTSRSNLKWS--
         + +WN+MI  YG HG GK AL  F+ +        + P  + FL V++    S +V   ++C+                        R  R N  W   
Subjt:  DLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCY------------------------RTSRSNLKWS--

Query:  ----TMHAWSISSVVLGLWNKRNAQRYRTCRRNCPGIFNLNSEITGSHMLLSSIF-ATSCRLDSARVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSI
               A ++   +LG         +    +    +F LN +  G H+LL++I+ A S      +VR+S   +GL+K PGCS +E+K +V+ F + ++ 
Subjt:  ----TMHAWSISSVVLGLWNKRNAQRYRTCRRNCPGIFNLNSEITGSHMLLSSIF-ATSCRLDSARVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSI

Query:  QEGLEKVDEILHDLALQIEGNA---------GVRLDVKSHSMELH-DSVKSSFGI
            +K+   L  L   I+            GV  DVK   +  H + +  SFG+
Subjt:  QEGLEKVDEILHDLALQIEGNA---------GVRLDVKSHSMELH-DSVKSSFGI

Q9LNP2 Putative pentatricopeptide repeat-containing protein At1g176306.3e-9339.04Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRM-------------------VMLR------------KMFLQMELEGVESNPVTWTSL
        RA   LG F LC+  H  V+Q G + ++HVVNEL+ +Y K GRM                   VM++            K+F  M+ E  + + VTWTS+
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRM-------------------VMLR------------KMFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +S H++CG  E+ +  F  MRM G   + E L V  S+C ++  L+  + +HGY++KGG+E YL ++NALI VYG+ G ++D E LF +++ + + SWN+
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LI+S+ ++G  D+A  +FS+ E+M+    +K +V+TW++VI G   +G G++SLE FRQMQ + V+ANSVTI  +LSI A L ALNLGRE+HGHVIR  M
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTS
         +NILV N L+NMY KCG      LVFE + ++DLISWNS+I  YGMHG  + AL+ FD +I SG     + L+ +        L+ K  G  I    + 
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTS

Query:  RSNLKWSTMHAWSISSVVLGLWNKRNAQR-----------------YRTCRRN-----CPGIFN----LNSEITGSHMLLSSIFATSCRL-DSARVRISA
        R  L+    H   I  ++  +   + A                     +CR +       GI +    L  E TGS+MLLS+I++   R  +SA VR  A
Subjt:  RSNLKWSTMHAWSISSVVLGLWNKRNAQR-----------------YRTCRRN-----CPGIFN----LNSEITGSHMLLSSIFATSCRL-DSARVRISA

Query:  RMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDL
        + + LKKV G SWIEVKKK Y F + + +Q   E +  +L DL
Subjt:  RMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDL

Q9LNU6 Pentatricopeptide repeat-containing protein At1g202301.2e-5127.02Show/hide
Query:  TTFSISSIVFFGNAADSTLQTSSFRHRCHRHGFVTARLVSVYARSRASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMF----
        T +S SS+++    A    Q+     R   HG +    V +    +  + L +F + K +HC     G      V   +  MY++ GRM   RK+F    
Subjt:  TTFSISSIVFFGNAADSTLQTSSFRHRCHRHGFVTARLVSVYARSRASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMF----

Query:  ---------------------------LQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYI
                                    +ME  G+E+N V+W  ++S   R G+ +E + +F K+   G       +  VL    D  +LN G++IHGY+
Subjt:  ---------------------------LQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYI

Query:  VKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLE
        +K G        +A+I +YG+ G +  +  LF++ ++      NA I+  + +GL DKA E+F    ++     M+ +V++W+++I G A  G   E+LE
Subjt:  VKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLE

Query:  VFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDAL
        +FR+MQ+A V  N VTI S+L     +AAL  GR  HG  +R  + DN+ VG+ LI+MY KCG      +VF  +  ++L+ WNS++  + MHG  K+ +
Subjt:  VFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDAL

Query:  TTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTSRSNLKWSTMHAWSISSVVLGLWNKRN------------------AQRYRTCRRN-
        + F+ ++ + L    +    L        L  +   +F     +    +K    H +S    +LG   K                         +CR   
Subjt:  TTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTSRSNLKWSTMHAWSISSVVLGLWNKRN------------------AQRYRTCRRN-

Query:  --------CPGIFNLNSEITGSHMLLSSIFATS---CRLDSARVRISARMEGLKKVPGCSWIEVKKKVYMF----KARNSIQEGLEKVDEILHDLALQIE
                   +F+L  E  G+++LLS+I+A       +DS R ++ +   GLKK PGCSWI+VK +VY      K+   I +  EK+DEI  ++    +
Subjt:  --------CPGIFNLNSEITGSHMLLSSIFATS---CRLDSARVRISARMEGLKKVPGCSWIEVKKKVYMF----KARNSIQEGLEKVDEILHDLALQIE

Query:  GNAGVRLDVKSHSMELHDSVKSSFGIPSRLLALLKVVIQIIDLP
              LD   H +E  +  +  +G   +    L VV  +++ P
Subjt:  GNAGVRLDVKSHSMELHDSVKSSFGIPSRLLALLKVVIQIIDLP

Q9SJK9 Pentatricopeptide repeat-containing protein At2g36980, mitochondrial9.6e-4127.09Show/hide
Query:  SNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLM-------------------------------SS
        ++LG+    + +   V++ GF   + V N LI MY K    +   K+F  M  +    N VTW SL+                               S 
Subjt:  SNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLM-------------------------------SS

Query:  HARCGHLEETMALFSKMRMKGVGATAEMLVVVLSIC-VDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNALI
        HA CG LE  ++LF +M              +++ C  D + +  G+M+H  ++K G+   + AKN+++  Y + G   D  +    ++V + VSWN++I
Subjt:  HARCGHLEETMALFSKMRMKGVGATAEMLVVVLSIC-VDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNALI

Query:  SSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDD
         +  + G  +KA EVF        H   + +++TW+ +I G+   G GE++L  F +M  + V ++     +VL   + LA L  G+ +HG +I      
Subjt:  SSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDD

Query:  NILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV-------GFFIR
           VGN L+N+Y KCG  K     F  + N+DL+SWN+M+  +G+HGL   AL  +D +I SG+     P  + F+ ++TT   S +V          ++
Subjt:  NILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV-------GFFIR

Query:  CYRT---------------SRSNLKWSTMHAWSISSVVLGLWNKRNAQRYRTCRRNCPGIFN--LNSEITG-----------SHMLLSSIFATSCR-LDS
         YR                   +L  +   A + SS+V    +  N   + T    C   ++  L  E++            S +LLS+++ ++ R  + 
Subjt:  CYRT---------------SRSNLKWSTMHAWSISSVVLGLWNKRNAQRYRTCRRNCPGIFN--LNSEITG-----------SHMLLSSIFATSCR-LDS

Query:  ARVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDL
          VR      G+KK PGCSWIEV  +V  F   +S    LE++ E L+ L
Subjt:  ARVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDL

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.9e-4123.56Show/hide
Query:  SRASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAE
        S++ S+L S +  + LH  +++ GF     V N L+  Y+K  R+   RK+F +M     E + ++W S+++ +   G  E+ +++F +M + G+     
Subjt:  SRASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAE

Query:  MLVVVLSICVDIAILNRGQMIHGYIVKGGY-EVYLFAKNALIVYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEM
         +V V + C D  +++ G+ +H   VK  +     F    L +Y + GD+   + +F EM  RS+VS+ ++I+ YA  GL  +A ++F + E+  + P++
Subjt:  MLVVVLSICVDIAILNRGQMIHGYIVKGGY-EVYLFAKNALIVYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEM

Query:  --------------------------------------------------------------KPSVITWSAVICGFASKGLGEESLEVFR-QMQLANVMA
                                                                         +I+W+ +I G++      E+L +F   ++      
Subjt:  --------------------------------------------------------------KPSVITWSAVICGFASKGLGEESLEVFR-QMQLANVMA

Query:  NSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLD
        +  T++ VL   A+L+A + GRE+HG+++R     +  V N L++MY KCG+     ++F+ + ++DL+SW  MIA YGMHG GK+A+  F+ +  +G++
Subjt:  NSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLD

Query:  QMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTSRSNLKWSTMHAWSISSVVLGLWNKRNAQRY--------------------------RTCRRNCPGIF
           +  + L      + L+ +   FF          ++ +  H   I  ++    +   A R+                          +   +    +F
Subjt:  QMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTSRSNLKWSTMHAWSISSVVLGLWNKRNAQRY--------------------------RTCRRNCPGIF

Query:  NLNSEITGSHMLLSSIFATSCRLDSA-RVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEIL
         L  E TG ++L+++I+A + + +   R+R      GL+K PGCSWIE+K +V +F A +S     E ++  L
Subjt:  NLNSEITGSHMLLSSIFATSCRLDSA-RVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEIL

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein2.1e-4326.13Show/hide
Query:  ASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAEML
        A S L   ++ K +H + ++ GF + +++   L+ MY K G +   R++F  M    +E N V+W S++ ++ +  + +E M +F KM  +GV  T   +
Subjt:  ASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAEML

Query:  VVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALIVYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEMKPS
        +  L  C D+  L RG+ IH   V+ G +     +N  +V                         N+LIS Y +    D A  +F + +          +
Subjt:  VVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALIVYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEMKPS

Query:  VITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENR
        +++W+A+I GFA  G   ++L  F QM+   V  ++ T  SV++  A L+  +  + +HG V+R+ +D N+ V   L++MY KCG+     L+F+ +  R
Subjt:  VITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENR

Query:  DLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCY------------------------RTSRSNLKWS--
         + +WN+MI  YG HG GK AL  F+ +        + P  + FL V++    S +V   ++C+                        R  R N  W   
Subjt:  DLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCY------------------------RTSRSNLKWS--

Query:  ----TMHAWSISSVVLGLWNKRNAQRYRTCRRNCPGIFNLNSEITGSHMLLSSIF-ATSCRLDSARVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSI
               A ++   +LG         +    +    +F LN +  G H+LL++I+ A S      +VR+S   +GL+K PGCS +E+K +V+ F + ++ 
Subjt:  ----TMHAWSISSVVLGLWNKRNAQRYRTCRRNCPGIFNLNSEITGSHMLLSSIF-ATSCRLDSARVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSI

Query:  QEGLEKVDEILHDLALQIEGNA---------GVRLDVKSHSMELH-DSVKSSFGI
            +K+   L  L   I+            GV  DVK   +  H + +  SFG+
Subjt:  QEGLEKVDEILHDLALQIEGNA---------GVRLDVKSHSMELH-DSVKSSFGI

AT1G17630.1 Pentatricopeptide repeat (PPR-like) superfamily protein4.5e-9439.04Show/hide
Query:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRM-------------------VMLR------------KMFLQMELEGVESNPVTWTSL
        RA   LG F LC+  H  V+Q G + ++HVVNEL+ +Y K GRM                   VM++            K+F  M+ E  + + VTWTS+
Subjt:  RASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRM-------------------VMLR------------KMFLQMELEGVESNPVTWTSL

Query:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA
        +S H++CG  E+ +  F  MRM G   + E L V  S+C ++  L+  + +HGY++KGG+E YL ++NALI VYG+ G ++D E LF +++ + + SWN+
Subjt:  MSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNA

Query:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM
        LI+S+ ++G  D+A  +FS+ E+M+    +K +V+TW++VI G   +G G++SLE FRQMQ + V+ANSVTI  +LSI A L ALNLGRE+HGHVIR  M
Subjt:  LISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALM

Query:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTS
         +NILV N L+NMY KCG      LVFE + ++DLISWNS+I  YGMHG  + AL+ FD +I SG     + L+ +        L+ K  G  I    + 
Subjt:  DDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTS

Query:  RSNLKWSTMHAWSISSVVLGLWNKRNAQR-----------------YRTCRRN-----CPGIFN----LNSEITGSHMLLSSIFATSCRL-DSARVRISA
        R  L+    H   I  ++  +   + A                     +CR +       GI +    L  E TGS+MLLS+I++   R  +SA VR  A
Subjt:  RSNLKWSTMHAWSISSVVLGLWNKRNAQR-----------------YRTCRRN-----CPGIFN----LNSEITGSHMLLSSIFATSCRL-DSARVRISA

Query:  RMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDL
        + + LKKV G SWIEVKKK Y F + + +Q   E +  +L DL
Subjt:  RMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDL

AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein8.6e-5327.02Show/hide
Query:  TTFSISSIVFFGNAADSTLQTSSFRHRCHRHGFVTARLVSVYARSRASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMF----
        T +S SS+++    A    Q+     R   HG +    V +    +  + L +F + K +HC     G      V   +  MY++ GRM   RK+F    
Subjt:  TTFSISSIVFFGNAADSTLQTSSFRHRCHRHGFVTARLVSVYARSRASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMF----

Query:  ---------------------------LQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYI
                                    +ME  G+E+N V+W  ++S   R G+ +E + +F K+   G       +  VL    D  +LN G++IHGY+
Subjt:  ---------------------------LQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYI

Query:  VKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLE
        +K G        +A+I +YG+ G +  +  LF++ ++      NA I+  + +GL DKA E+F    ++     M+ +V++W+++I G A  G   E+LE
Subjt:  VKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLE

Query:  VFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDAL
        +FR+MQ+A V  N VTI S+L     +AAL  GR  HG  +R  + DN+ VG+ LI+MY KCG      +VF  +  ++L+ WNS++  + MHG  K+ +
Subjt:  VFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDAL

Query:  TTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTSRSNLKWSTMHAWSISSVVLGLWNKRN------------------AQRYRTCRRN-
        + F+ ++ + L    +    L        L  +   +F     +    +K    H +S    +LG   K                         +CR   
Subjt:  TTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTSRSNLKWSTMHAWSISSVVLGLWNKRN------------------AQRYRTCRRN-

Query:  --------CPGIFNLNSEITGSHMLLSSIFATS---CRLDSARVRISARMEGLKKVPGCSWIEVKKKVYMF----KARNSIQEGLEKVDEILHDLALQIE
                   +F+L  E  G+++LLS+I+A       +DS R ++ +   GLKK PGCSWI+VK +VY      K+   I +  EK+DEI  ++    +
Subjt:  --------CPGIFNLNSEITGSHMLLSSIFATS---CRLDSARVRISARMEGLKKVPGCSWIEVKKKVYMF----KARNSIQEGLEKVDEILHDLALQIE

Query:  GNAGVRLDVKSHSMELHDSVKSSFGIPSRLLALLKVVIQIIDLP
              LD   H +E  +  +  +G   +    L VV  +++ P
Subjt:  GNAGVRLDVKSHSMELHDSVKSSFGIPSRLLALLKVVIQIIDLP

AT2G36980.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.8e-4227.09Show/hide
Query:  SNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLM-------------------------------SS
        ++LG+    + +   V++ GF   + V N LI MY K    +   K+F  M  +    N VTW SL+                               S 
Subjt:  SNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLM-------------------------------SS

Query:  HARCGHLEETMALFSKMRMKGVGATAEMLVVVLSIC-VDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNALI
        HA CG LE  ++LF +M              +++ C  D + +  G+M+H  ++K G+   + AKN+++  Y + G   D  +    ++V + VSWN++I
Subjt:  HARCGHLEETMALFSKMRMKGVGATAEMLVVVLSIC-VDIAILNRGQMIHGYIVKGGYEVYLFAKNALI-VYGRGGDIRDVEKLFHEMKVRSLVSWNALI

Query:  SSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDD
         +  + G  +KA EVF        H   + +++TW+ +I G+   G GE++L  F +M  + V ++     +VL   + LA L  G+ +HG +I      
Subjt:  SSYAESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDD

Query:  NILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV-------GFFIR
           VGN L+N+Y KCG  K     F  + N+DL+SWN+M+  +G+HGL   AL  +D +I SG+     P  + F+ ++TT   S +V          ++
Subjt:  NILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVV-------GFFIR

Query:  CYRT---------------SRSNLKWSTMHAWSISSVVLGLWNKRNAQRYRTCRRNCPGIFN--LNSEITG-----------SHMLLSSIFATSCR-LDS
         YR                   +L  +   A + SS+V    +  N   + T    C   ++  L  E++            S +LLS+++ ++ R  + 
Subjt:  CYRT---------------SRSNLKWSTMHAWSISSVVLGLWNKRNAQRYRTCRRNCPGIFN--LNSEITG-----------SHMLLSSIFATSCR-LDS

Query:  ARVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDL
          VR      G+KK PGCSWIEV  +V  F   +S    LE++ E L+ L
Subjt:  ARVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDL

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-4223.56Show/hide
Query:  SRASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAE
        S++ S+L S +  + LH  +++ GF     V N L+  Y+K  R+   RK+F +M     E + ++W S+++ +   G  E+ +++F +M + G+     
Subjt:  SRASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNPVTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAE

Query:  MLVVVLSICVDIAILNRGQMIHGYIVKGGY-EVYLFAKNALIVYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEM
         +V V + C D  +++ G+ +H   VK  +     F    L +Y + GD+   + +F EM  RS+VS+ ++I+ YA  GL  +A ++F + E+  + P++
Subjt:  MLVVVLSICVDIAILNRGQMIHGYIVKGGY-EVYLFAKNALIVYGRGGDIRDVEKLFHEMKVRSLVSWNALISSYAESGLYDKAFEVFSQPEKMDVHPEM

Query:  --------------------------------------------------------------KPSVITWSAVICGFASKGLGEESLEVFR-QMQLANVMA
                                                                         +I+W+ +I G++      E+L +F   ++      
Subjt:  --------------------------------------------------------------KPSVITWSAVICGFASKGLGEESLEVFR-QMQLANVMA

Query:  NSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLD
        +  T++ VL   A+L+A + GRE+HG+++R     +  V N L++MY KCG+     ++F+ + ++DL+SW  MIA YGMHG GK+A+  F+ +  +G++
Subjt:  NSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYTKCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLD

Query:  QMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTSRSNLKWSTMHAWSISSVVLGLWNKRNAQRY--------------------------RTCRRNCPGIF
           +  + L      + L+ +   FF          ++ +  H   I  ++    +   A R+                          +   +    +F
Subjt:  QMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTSRSNLKWSTMHAWSISSVVLGLWNKRNAQRY--------------------------RTCRRNCPGIF

Query:  NLNSEITGSHMLLSSIFATSCRLDSA-RVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEIL
         L  E TG ++L+++I+A + + +   R+R      GL+K PGCSWIE+K +V +F A +S     E ++  L
Subjt:  NLNSEITGSHMLLSSIFATSCRLDSA-RVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGACGACCTTCTCGATTTCTTCGATCGTCTTCTTCGGCAATGCAGCGGATTCAACACTGCAAACAAGTTCATTTCGCCACCGTTGTCACCGGCATGGGTTCGTCAC
CGCTCGGCTTGTGTCCGTCTATGCCCGTTCTAGGGCTTCTTCCAATTTGGGTAGTTTCAACTTGTGCAAGAATCTTCATTGTCATGTTGTGCAATTTGGATTCCAAAATC
ATATGCATGTTGTGAATGAATTGATAGGAATGTATGTGAAGCTCGGTCGAATGGTGATGCTTAGAAAGATGTTCCTTCAAATGGAGTTGGAAGGGGTCGAGTCAAACCCT
GTAACTTGGACTTCATTAATGTCGAGTCACGCCCGGTGTGGTCATCTTGAAGAAACTATGGCCTTGTTTAGCAAGATGAGGATGAAAGGTGTTGGTGCCACTGCTGAAAT
GCTTGTTGTAGTGTTATCTATTTGTGTTGATATAGCTATATTGAACAGAGGTCAGATGATTCATGGATATATTGTAAAGGGAGGTTATGAAGTTTACTTGTTTGCTAAAA
ACGCGCTTATTGTTTATGGAAGAGGAGGAGATATAAGAGATGTTGAGAAGTTATTTCATGAGATGAAAGTGAGGAGTCTTGTGAGTTGGAATGCTCTTATATCCTCCTAT
GCTGAATCTGGATTATATGACAAAGCTTTTGAAGTGTTTTCTCAGCCGGAGAAAATGGATGTCCATCCAGAGATGAAACCTAGTGTAATAACTTGGAGTGCAGTCATTTG
TGGATTTGCTTCCAAGGGACTAGGAGAAGAATCATTAGAAGTTTTTCGTCAAATGCAGCTTGCAAATGTAATGGCAAACTCGGTGACGATATCTAGTGTTTTATCGATTT
TTGCTACGCTAGCAGCTCTAAATCTTGGTAGGGAAATGCATGGCCATGTCATCAGAGCTCTGATGGATGATAACATATTGGTAGGAAATGGATTGATTAACATGTATACA
AAGTGTGGAAGTTTCAAGCCATGCTGTTTGGTGTTTGAAAAACTTGAAAATCGAGATTTGATCTCATGGAACTCAATGATTGCAAGATATGGAATGCATGGACTTGGTAA
AGATGCTCTTACAACTTTTGATTATGTGATCATATCAGGATTAGACCAGATGATGTTACCTTTATTGCTGCTCTTTCTGCTTGTAGTCACGACGGTCTTGTTGTCGAAGG
TCGTTGGCTTTTTTATCAGATGCTACAGAACTTCAAGATCAAACCTCAAATGGAGCACTATGCATGCATGGTCGATCTCCTCGGTCGTGTTGGGCTTGTGGAATAAGCGA
AATGCACAAAGATACAGAACTTGCAGAAGAAACTGCCCTGGGATCTTTAATCTGAATTCCGAGATAACGGGGAGTCATATGTTGCTCTCGAGTATTTTTGCTACAAGCTG
TAGATTGGATTCTGCAAGGGTGAGGATCTCAGCAAGGATGGAGGGCTTAAAGAAAGTTCCTGGGTGCAGCTGGATTGAGGTGAAGAAGAAGGTTTACATGTTCAAAGCAA
GAAACTCAATACAAGAAGGTTTAGAGAAAGTTGATGAAATTCTTCATGATTTGGCTCTTCAGATTGAAGGAAATGCAGGAGTCAGATTAGATGTAAAAAGCCACTCCATG
GAATTGCACGACTCAGTGAAAAGCTCTTTTGGCATCCCTTCAAGGTTGCTGGCACTTCTCAAAGTGGTTATTCAGATCATTGACCTTCCAAAACGTTACAAGAACCAATT
ATTCGTGCAAACAACGCATCCGCCAGTTTCTCCACTAGCTACAAAACACGAAACGTCCAAGAATGTCAACATACACAAAACACGGACATTAGAGAACACTCGAAATCTAG
GGATTCTGTGCGAATTACATGAAATAGATGAAAATGGGAAGCCAGAGATTGGTGTTGGTGATATTGCTGCTGTTGCTGTTGCTGCTGTTGCTGCTGTTGCTGTTGTTGCT
GTTGCTGTTGCTGTTGAAGAAGTCGCTGCTGCTGTTGTTGCTGTTGTTGCTGTTGTTGCTGTTGTTGTTGTTGTTGAAAATGATATTGCTGCTGTTGTTGTTGTTGTTGC
TGCTGCTGCTGCTGCTGCTGCTGGGGGAAATGTTGGGGTTGCAATTGCTGCTGGAATTGAGGGAGGGTTGCTGTTGGGGCTGGATTTGGGGCGAGGGTTGCTGCTGTTGA
AATTGGGGAGAGGGTTGTTGCTGCTGAAATTGCTGTTGCTGCTGAATTTGAGGCTGGAATTGCTGTTGTTGGTGGTGAAAGGGTTGTTGTTGTTGAAGGGAAAATTGTTG
CTGTTGAAGATAGAGGGATTCTTGATTGGAAAGATTAGAGGAGTTGCTGAGATTAGGGATGTTGGGGATCATGCTCCAGTTTCCTCCTCCAGAGAAATCCATATCGCCGA
TTTGCTTTCTTCTTCGGAATTTGAAGATTTTGGAATTTGGAGCTTGCGATTGGTTAGGGTTTTAGGGTTTTTTCTGCAGCTCCGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGACGACCTTCTCGATTTCTTCGATCGTCTTCTTCGGCAATGCAGCGGATTCAACACTGCAAACAAGTTCATTTCGCCACCGTTGTCACCGGCATGGGTTCGTCAC
CGCTCGGCTTGTGTCCGTCTATGCCCGTTCTAGGGCTTCTTCCAATTTGGGTAGTTTCAACTTGTGCAAGAATCTTCATTGTCATGTTGTGCAATTTGGATTCCAAAATC
ATATGCATGTTGTGAATGAATTGATAGGAATGTATGTGAAGCTCGGTCGAATGGTGATGCTTAGAAAGATGTTCCTTCAAATGGAGTTGGAAGGGGTCGAGTCAAACCCT
GTAACTTGGACTTCATTAATGTCGAGTCACGCCCGGTGTGGTCATCTTGAAGAAACTATGGCCTTGTTTAGCAAGATGAGGATGAAAGGTGTTGGTGCCACTGCTGAAAT
GCTTGTTGTAGTGTTATCTATTTGTGTTGATATAGCTATATTGAACAGAGGTCAGATGATTCATGGATATATTGTAAAGGGAGGTTATGAAGTTTACTTGTTTGCTAAAA
ACGCGCTTATTGTTTATGGAAGAGGAGGAGATATAAGAGATGTTGAGAAGTTATTTCATGAGATGAAAGTGAGGAGTCTTGTGAGTTGGAATGCTCTTATATCCTCCTAT
GCTGAATCTGGATTATATGACAAAGCTTTTGAAGTGTTTTCTCAGCCGGAGAAAATGGATGTCCATCCAGAGATGAAACCTAGTGTAATAACTTGGAGTGCAGTCATTTG
TGGATTTGCTTCCAAGGGACTAGGAGAAGAATCATTAGAAGTTTTTCGTCAAATGCAGCTTGCAAATGTAATGGCAAACTCGGTGACGATATCTAGTGTTTTATCGATTT
TTGCTACGCTAGCAGCTCTAAATCTTGGTAGGGAAATGCATGGCCATGTCATCAGAGCTCTGATGGATGATAACATATTGGTAGGAAATGGATTGATTAACATGTATACA
AAGTGTGGAAGTTTCAAGCCATGCTGTTTGGTGTTTGAAAAACTTGAAAATCGAGATTTGATCTCATGGAACTCAATGATTGCAAGATATGGAATGCATGGACTTGGTAA
AGATGCTCTTACAACTTTTGATTATGTGATCATATCAGGATTAGACCAGATGATGTTACCTTTATTGCTGCTCTTTCTGCTTGTAGTCACGACGGTCTTGTTGTCGAAGG
TCGTTGGCTTTTTTATCAGATGCTACAGAACTTCAAGATCAAACCTCAAATGGAGCACTATGCATGCATGGTCGATCTCCTCGGTCGTGTTGGGCTTGTGGAATAAGCGA
AATGCACAAAGATACAGAACTTGCAGAAGAAACTGCCCTGGGATCTTTAATCTGAATTCCGAGATAACGGGGAGTCATATGTTGCTCTCGAGTATTTTTGCTACAAGCTG
TAGATTGGATTCTGCAAGGGTGAGGATCTCAGCAAGGATGGAGGGCTTAAAGAAAGTTCCTGGGTGCAGCTGGATTGAGGTGAAGAAGAAGGTTTACATGTTCAAAGCAA
GAAACTCAATACAAGAAGGTTTAGAGAAAGTTGATGAAATTCTTCATGATTTGGCTCTTCAGATTGAAGGAAATGCAGGAGTCAGATTAGATGTAAAAAGCCACTCCATG
GAATTGCACGACTCAGTGAAAAGCTCTTTTGGCATCCCTTCAAGGTTGCTGGCACTTCTCAAAGTGGTTATTCAGATCATTGACCTTCCAAAACGTTACAAGAACCAATT
ATTCGTGCAAACAACGCATCCGCCAGTTTCTCCACTAGCTACAAAACACGAAACGTCCAAGAATGTCAACATACACAAAACACGGACATTAGAGAACACTCGAAATCTAG
GGATTCTGTGCGAATTACATGAAATAGATGAAAATGGGAAGCCAGAGATTGGTGTTGGTGATATTGCTGCTGTTGCTGTTGCTGCTGTTGCTGCTGTTGCTGTTGTTGCT
GTTGCTGTTGCTGTTGAAGAAGTCGCTGCTGCTGTTGTTGCTGTTGTTGCTGTTGTTGCTGTTGTTGTTGTTGTTGAAAATGATATTGCTGCTGTTGTTGTTGTTGTTGC
TGCTGCTGCTGCTGCTGCTGCTGGGGGAAATGTTGGGGTTGCAATTGCTGCTGGAATTGAGGGAGGGTTGCTGTTGGGGCTGGATTTGGGGCGAGGGTTGCTGCTGTTGA
AATTGGGGAGAGGGTTGTTGCTGCTGAAATTGCTGTTGCTGCTGAATTTGAGGCTGGAATTGCTGTTGTTGGTGGTGAAAGGGTTGTTGTTGTTGAAGGGAAAATTGTTG
CTGTTGAAGATAGAGGGATTCTTGATTGGAAAGATTAGAGGAGTTGCTGAGATTAGGGATGTTGGGGATCATGCTCCAGTTTCCTCCTCCAGAGAAATCCATATCGCCGA
TTTGCTTTCTTCTTCGGAATTTGAAGATTTTGGAATTTGGAGCTTGCGATTGGTTAGGGTTTTAGGGTTTTTTCTGCAGCTCCGGTGA
Protein sequenceShow/hide protein sequence
MMTTFSISSIVFFGNAADSTLQTSSFRHRCHRHGFVTARLVSVYARSRASSNLGSFNLCKNLHCHVVQFGFQNHMHVVNELIGMYVKLGRMVMLRKMFLQMELEGVESNP
VTWTSLMSSHARCGHLEETMALFSKMRMKGVGATAEMLVVVLSICVDIAILNRGQMIHGYIVKGGYEVYLFAKNALIVYGRGGDIRDVEKLFHEMKVRSLVSWNALISSY
AESGLYDKAFEVFSQPEKMDVHPEMKPSVITWSAVICGFASKGLGEESLEVFRQMQLANVMANSVTISSVLSIFATLAALNLGREMHGHVIRALMDDNILVGNGLINMYT
KCGSFKPCCLVFEKLENRDLISWNSMIARYGMHGLGKDALTTFDYVIISGLDQMMLPLLLLFLLVVTTVLLSKVVGFFIRCYRTSRSNLKWSTMHAWSISSVVLGLWNKR
NAQRYRTCRRNCPGIFNLNSEITGSHMLLSSIFATSCRLDSARVRISARMEGLKKVPGCSWIEVKKKVYMFKARNSIQEGLEKVDEILHDLALQIEGNAGVRLDVKSHSM
ELHDSVKSSFGIPSRLLALLKVVIQIIDLPKRYKNQLFVQTTHPPVSPLATKHETSKNVNIHKTRTLENTRNLGILCELHEIDENGKPEIGVGDIAAVAVAAVAAVAVVA
VAVAVEEVAAAVVAVVAVVAVVVVVENDIAAVVVVVAAAAAAAAGGNVGVAIAAGIEGGLLLGLDLGRGLLLLKLGRGLLLLKLLLLLNLRLELLLLVVKGLLLLKGKLL
LLKIEGFLIGKIRGVAEIRDVGDHAPVSSSREIHIADLLSSSEFEDFGIWSLRLVRVLGFFLQLR