; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg10500 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg10500
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCarg_Chr03:8471868..8473105
RNA-Seq ExpressionCarg10500
SyntenyCarg10500
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595166.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]8.9e-8955.33Show/hide
Query:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV
        LRC VDEDLY T  +                    SEK VVSWTAMIVGYSS GNLVEAK+  FD+  +  V   N        + +   AEKVFDEMP 
Subjt:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV

Query:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV
         N+VSFT M+ G  K           + APERDIVA SALISGY QN QPNEAVKTFLEMGSRNVK DEF+L SLMSACS                 CLV
Subjt:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV

Query:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH
        DL GAHVRAAL DMNAKCGNM+RAM LFEEMPKRDLI +      LS+HG G Q VSLFERML E LTPD VA TVIL ACSRAGL    W     +R+ 
Subjt:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH

Query:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN
             S   + C+   +S+S  LK               G    LL A KLYSDS   EVVA +L+ELEPQNA NYVLLSNIY +AERWLDVSVVRN+M+
Subjt:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN

Query:  ERG
        ERG
Subjt:  ERG

KAG7027181.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]8.9e-8955.33Show/hide
Query:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV
        LRC VDEDLY T  +                    SEK VVSWTAMIVGYSS GNLVEAK+  FD+  +  V   N        + +   AEKVFDEMP 
Subjt:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV

Query:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV
         N+VSFT M+ G  K           + APERDIVA SALISGY QN QPNEAVKTFLEMGSRNVK DEF+L SLMSACS                 CLV
Subjt:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV

Query:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH
        DL GAHVRAAL DMNAKCGNM+RAM LFEEMPKRDLI +      LS+HG G Q VSLFERML E LTPD VA TVIL ACSRAGL    W     +R+ 
Subjt:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH

Query:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN
             S   + C+   +S+S  LK               G    LL A KLYSDS   EVVA +L+ELEPQNA NYVLLSNIY +AERWLDVSVVRN+M+
Subjt:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN

Query:  ERG
        ERG
Subjt:  ERG

XP_008462604.1 PREDICTED: putative pentatricopeptide repeat-containing protein At5g37570 [Cucumis melo]8.1e-9054.34Show/hide
Query:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV
        LRC VDED+Y T  +                    SEK VVSWTAMIVGYSS+GNLVEAK+  FD   +  V   N        + +   AEKVFDEMP 
Subjt:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV

Query:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV
         N+VSFT M+ G  K           + APERDI+A SALISGYTQN QPNEAV+TFLEM SRNVK D+F+L SLM ACS                 CLV
Subjt:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV

Query:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH
        DL GAHVRAAL DMNAKCGNM+RAMYLFEEMPKRDLI +      LS+HG G Q VSLFERMLDEDLTPD VA TVIL ACSRAGL    W     +R  
Subjt:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH

Query:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN
             S+  + C+   +S+S  LK               G    LL A KLY DS   EVVA +LIELEP+NA NYVLLSNIY AA+RWLDVS VR++MN
Subjt:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN

Query:  ERG
        ERG
Subjt:  ERG

XP_022132921.1 putative pentatricopeptide repeat-containing protein At5g37570 [Momordica charantia]8.9e-8954.84Show/hide
Query:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECN-----CWCVHENGVAEKVFDEMPV
        LRC VD DLY T  +                    S+K VVSWTAMIVGYS+IGNLVEAKK  FD+  +  V   N     C  + +   AEKVFDEMP 
Subjt:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECN-----CWCVHENGVAEKVFDEMPV

Query:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV
         N+VSFT M+ G  K           + APERDIVA SALISGYTQN  PNEAV+TFLEMG RNVK DEF+LASLMSACS                  L+
Subjt:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV

Query:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH
        DL GAHVRAAL DMNAKCGNM+RAM LF+EMPKRDLI +      LS+HGRG Q VSLFERMLDE LTPD VA TVIL ACSRAG     W     +R+ 
Subjt:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH

Query:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN
             S   + C+   +S+S  LK               G    LL A KLY +S   EVVA +LIELEPQNA NYVLLSNIY AAERWLDVSVVRN+MN
Subjt:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN

Query:  ERG
        ERG
Subjt:  ERG

XP_038882068.1 putative pentatricopeptide repeat-containing protein At5g37570 [Benincasa hispida]5.2e-8955.58Show/hide
Query:  LRCDVDEDLY-ETCLI-------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV
        LRC VDED+Y  T L+                   S+K VVSWTAMIVGYSSIGNLVEAK+  FD+  +  V   N        + +   AEKVFDEMP 
Subjt:  LRCDVDEDLY-ETCLI-------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV

Query:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACS----------------HCLV
         N+VS+T M+ G  K           + APERDIVA SALISGYTQN Q NEAVKTFLEMGS NVK DEF+LASLMSACS                 CL 
Subjt:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACS----------------HCLV

Query:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH
        DL GAHVRAAL DMNAKCGNM+RAM LFEEMPKRDLI +      LS+HG G QVV LFERMLDEDLTPD VA TVIL ACSRAGL    W     +R  
Subjt:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH

Query:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN
             S   + C+   +S+S  LK               G    LL A KLY DS   EVVA +LIELEPQNA NYVLLSNIY AAERWLDV+VVRN+MN
Subjt:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN

Query:  ERG
        ERG
Subjt:  ERG

TrEMBL top hitse value%identityAlignment
A0A1S3CHC2 putative pentatricopeptide repeat-containing protein At5g375703.9e-9054.34Show/hide
Query:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV
        LRC VDED+Y T  +                    SEK VVSWTAMIVGYSS+GNLVEAK+  FD   +  V   N        + +   AEKVFDEMP 
Subjt:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV

Query:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV
         N+VSFT M+ G  K           + APERDI+A SALISGYTQN QPNEAV+TFLEM SRNVK D+F+L SLM ACS                 CLV
Subjt:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV

Query:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH
        DL GAHVRAAL DMNAKCGNM+RAMYLFEEMPKRDLI +      LS+HG G Q VSLFERMLDEDLTPD VA TVIL ACSRAGL    W     +R  
Subjt:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH

Query:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN
             S+  + C+   +S+S  LK               G    LL A KLY DS   EVVA +LIELEP+NA NYVLLSNIY AA+RWLDVS VR++MN
Subjt:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN

Query:  ERG
        ERG
Subjt:  ERG

A0A5A7SLV1 Putative pentatricopeptide repeat-containing protein3.9e-9054.34Show/hide
Query:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV
        LRC VDED+Y T  +                    SEK VVSWTAMIVGYSS+GNLVEAK+  FD   +  V   N        + +   AEKVFDEMP 
Subjt:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV

Query:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV
         N+VSFT M+ G  K           + APERDI+A SALISGYTQN QPNEAV+TFLEM SRNVK D+F+L SLM ACS                 CLV
Subjt:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV

Query:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH
        DL GAHVRAAL DMNAKCGNM+RAMYLFEEMPKRDLI +      LS+HG G Q VSLFERMLDEDLTPD VA TVIL ACSRAGL    W     +R  
Subjt:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH

Query:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN
             S+  + C+   +S+S  LK               G    LL A KLY DS   EVVA +LIELEP+NA NYVLLSNIY AA+RWLDVS VR++MN
Subjt:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN

Query:  ERG
        ERG
Subjt:  ERG

A0A6J1BTM3 putative pentatricopeptide repeat-containing protein At5g375704.3e-8954.84Show/hide
Query:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECN-----CWCVHENGVAEKVFDEMPV
        LRC VD DLY T  +                    S+K VVSWTAMIVGYS+IGNLVEAKK  FD+  +  V   N     C  + +   AEKVFDEMP 
Subjt:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECN-----CWCVHENGVAEKVFDEMPV

Query:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV
         N+VSFT M+ G  K           + APERDIVA SALISGYTQN  PNEAV+TFLEMG RNVK DEF+LASLMSACS                  L+
Subjt:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV

Query:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH
        DL GAHVRAAL DMNAKCGNM+RAM LF+EMPKRDLI +      LS+HGRG Q VSLFERMLDE LTPD VA TVIL ACSRAG     W     +R+ 
Subjt:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH

Query:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN
             S   + C+   +S+S  LK               G    LL A KLY +S   EVVA +LIELEPQNA NYVLLSNIY AAERWLDVSVVRN+MN
Subjt:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN

Query:  ERG
        ERG
Subjt:  ERG

A0A6J1HI92 putative pentatricopeptide repeat-containing protein At5g375709.6e-8955.09Show/hide
Query:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV
        LRC VDEDLY T  +                    SEK VVSWT+MIVGYSS GNLVEAK+  FD+  +  V   N        + +   AEKVFDEMP 
Subjt:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNC-----WCVHENGVAEKVFDEMPV

Query:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV
         N+VSFT M+ G  K           + APERDIVA SALISGY QN QPNEAVKTFLEMGSRNVK DEF+L SLMSACS                 CLV
Subjt:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV

Query:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH
        DL GAHVRAAL DMNAKCGNM+RAM LFEEMPKRDLI +      LS+HG G Q VSLFERML E LTPD VA TVIL ACSRAGL    W     +R+ 
Subjt:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH

Query:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN
             S   + C+   +S+S  LK               G    LL A KLYSDS   EVVA +L+ELEPQNA NYVLLSNIY +AERWLDVSVVRN+M+
Subjt:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN

Query:  ERG
        ERG
Subjt:  ERG

A0A6J1IAM0 putative pentatricopeptide repeat-containing protein At5g375705.6e-8955.33Show/hide
Query:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWC-----VHENGVAEKVFDEMPV
        LRC VDEDLY T  +                    SEK VVSWTAMIVGYSS GNLVEAK+  FD+  +  V   N        + +   AEKVFDEMP 
Subjt:  LRCDVDEDLYETCLI--------------------SEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWC-----VHENGVAEKVFDEMPV

Query:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV
         N+VSFT M+ G  K           + AP RDIVA SALISGY QN QPNEAVKTFLEMGSRNVK DEF+L SLMSACS                 CLV
Subjt:  NNLVSFTKMMGGMQKRWHAF----CHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSH----------------CLV

Query:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH
        DL GAHVRAAL DMNAKCGNM+RAM LFEEMPKRDLI +      LS+HGRG Q VSLFERML E LTPD VA TVIL ACSRAGL    W     + + 
Subjt:  DLCGAHVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL--SIWETMSSLRAH

Query:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN
             S   + C+   +S+S  LK               G    LL A KLYS+S   EVVA +LIELEPQNA NYVLLSNIY AAERWLDVSVVRN+M+
Subjt:  KIRACSISCW-CMGCTVSQSCLLK---------------GDCDTLLAAGKLYSDS---EVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMN

Query:  ERG
        ERG
Subjt:  ERG

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic2.6e-2729.53Show/hide
Query:  DAGEKCGVVECNCWCVHENGVAEKVFDEMPVNNLVSFTKMMGGMQKRWHAFCHEF---------APERDIVALSALISGYTQNRQPNEAVKTFLEMG-SR
        D   KCG +E           A+++FD M   + V++T M+ G     +A   ++          P++DIVA +ALIS Y QN +PNEA+  F E+   +
Subjt:  DAGEKCGVVECNCWCVHENGVAEKVFDEMPVNNLVSFTKMMGGMQKRWHAFCHEF---------APERDIVALSALISGYTQNRQPNEAVKTFLEMG-SR

Query:  NVKSDEFLLASLMSACS-----------HCLVDLCGA----HVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLD
        N+K ++  L S +SAC+           H  +   G     HV +AL  M +KCG+++++  +F  + KRD+  W      L++HG G + V +F +M +
Subjt:  NVKSDEFLLASLMSACS-----------HCLVDLCGA----HVRAALKDMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLD

Query:  EDLTPDVVALTVILAACSRAGL-----SIWETMSS----LRAHKIRACSISC------------WCMGCTVSQSCLLKGDCDTLLAAGKLYSD---SEVV
         ++ P+ V  T +  ACS  GL     S++  M S    +   K  AC +              +     +  S  + G    LL A K++++   +E+ 
Subjt:  EDLTPDVVALTVILAACSRAGL-----SIWETMSS----LRAHKIRACSISC------------WCMGCTVSQSCLLKGDCDTLLAAGKLYSD---SEVV

Query:  AYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERG
          +L+ELEP+N   +VLLSNIY    +W +VS +R  M   G
Subjt:  AYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERG

Q9FHR3 Putative pentatricopeptide repeat-containing protein At5g375703.7e-5339.25Show/hide
Query:  EKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWCVHENGV--------AEKVFDEMPVNNLVSFTKMMGGMQKRWHAF----CHEFAPERD
        E+  VSWTA++V Y   G L EAK   FD   +  +     W    +G+        A+K+FDEMP  +++S+T M+ G  K           E A   D
Subjt:  EKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWCVHENGV--------AEKVFDEMPVNNLVSFTKMMGGMQKRWHAF----CHEFAPERD

Query:  IVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSHC-LVDLC---------------GAHVRAALKDMNAKCGNMKRAMYLFEEMPK
        + A SALI GY QN QPNEA K F EM ++NVK DEF++  LMSACS     +LC                 +V  AL DMNAKCG+M RA  LFEEMP+
Subjt:  IVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSHC-LVDLC---------------GAHVRAALKDMNAKCGNMKRAMYLFEEMPK

Query:  RDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRA-----GLSIWETMSSLRAHKIRACSISCWCMGCTVSQSCLLKGDCD---
        RDL+ +      +++HG G++ + LFE+M+DE + PD VA TVIL  C ++     GL  +E M   + + I A      C+   +S++  LK   +   
Subjt:  RDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRA-----GLSIWETMSSLRAHKIRACSISCWCMGCTVSQSCLLKGDCD---

Query:  ------------TLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERG
                    +LL    L+ +   +EVVA  L ELEPQ+A +YVLLSNIY A +RW DV+ +R++MNE G
Subjt:  ------------TLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERG

Q9LIC3 Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial1.6e-2730.36Show/hide
Query:  RCDVDEDLYETC-LISEKVVVSWTAMIVGYSSIGNLVE-----AKKFRFDAGEK---CGVVECNCWCVHENGVAEKV--------FDE--MPVNNLVSFT
        +CD  ED  +    + EK VVSWTAMI  YS  G+  E     A+  R D          V  +C      G+ +++        +D      ++L+   
Subjt:  RCDVDEDLYETC-LISEKVVVSWTAMIVGYSSIGNLVE-----AKKFRFDAGEK---CGVVECNCWCVHENGVAEKV--------FDE--MPVNNLVSFT

Query:  KMMGGMQKRWHAFCHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACS-----------HCLV---DL-CGAHVRAALK
           G +++    F  E  PERD+V+ +A+I+GY Q     EA++ F  + S  +  +    ASL++A S           HC V   +L   A ++ +L 
Subjt:  KMMGGMQKRWHAFCHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACS-----------HCLV---DL-CGAHVRAALK

Query:  DMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDED-LTPDVVALTVILAACSR-----AGLSIWETM-----SSLRAHKIR
        DM +KCGN+  A  LF+ MP+R  I W       S HG G +V+ LF  M DE  + PD V L  +L+ CS       GL+I++ M      +    +  
Subjt:  DMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDED-LTPDVVALTVILAACSR-----AGLSIWETM-----SSLRAHKIR

Query:  ACSISCWCMGCTVSQ---------SCLLKGDCDTLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNER
         C +        + +         S    G   +LL A +++      E V  +LIE+EP+NA NYV+LSN+Y +A RW DV+ VR  M ++
Subjt:  ACSISCWCMGCTVSQ---------SCLLKGDCDTLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNER

Q9LS72 Pentatricopeptide repeat-containing protein At3g292302.0e-3028.38Show/hide
Query:  ISEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWC-----VHENGVAEKVFDEMPVNNLVSFTKMM------GGMQKRWHAFCHEFAPER
        +SE+  VSW +M+ G    G L +A++  FD   +  ++  N          E   A ++F++MP  N VS++ M+      G M+     F     P +
Subjt:  ISEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWC-----VHENGVAEKVFDEMPVNNLVSFTKMM------GGMQKRWHAFCHEFAPER

Query:  DIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSHC-LVDL--------------CGAHVRAALKDMNAKCGNMKRAMYLFEEMPK
        ++V  + +I+GY +     EA +   +M +  +K D   + S+++AC+   L+ L                A+V  AL DM AKCGN+K+A  +F ++PK
Subjt:  DIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSHC-LVDL--------------CGAHVRAALKDMNAKCGNMKRAMYLFEEMPK

Query:  RDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGLSIWETMSSLRAHKIRACSISCWCMGCTVS---------------QSC
        +DL+ W      L VHG G + + LF RM  E + PD V    +L +C+ AGL            K+          GC V                Q+ 
Subjt:  RDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGLSIWETMSSLRAHKIRACSISCWCMGCTVS---------------QSC

Query:  LLKGDC---DTLLAAGKLYSDSEV---VAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERG
         ++ +      LL A +++++ ++   V   L++L+P +  NY LLSNIY AAE W  V+ +R++M   G
Subjt:  LLKGDC---DTLLAAGKLYSDSEV---VAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERG

Q9M4P3 Pentatricopeptide repeat-containing protein At4g16835, mitochondrial1.1e-2827.84Show/hide
Query:  KVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEK---------CGVVECNCWCVHENGVAEKVFDEMPVNNLVSFTKMMGGMQKR-----WHAFCHEFAPER
        K   SW  MI GY+  G + +A++  +   EK          G +EC      +   A   F   PV  +V++T M+ G  K        A   +    +
Subjt:  KVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEK---------CGVVECNCWCVHENGVAEKVFDEMPVNNLVSFTKMMGGMQKR-----WHAFCHEFAPER

Query:  DIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACS-----------HCLVD---LCG-AHVRAALKDMNAKCGNMKRAMYLFEEMPK
        ++V  +A+ISGY +N +P + +K F  M    ++ +   L+S +  CS           H +V    LC       +L  M  KCG +  A  LFE M K
Subjt:  DIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACS-----------HCLVD---LCG-AHVRAALKDMNAKCGNMKRAMYLFEEMPK

Query:  RDLIPWLSV------HGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL-----SIWETMSSLRAHKIRACSISCWCMGCTVSQSCLLK-------
        +D++ W ++      HG   + + LF  M+D  + PD +    +L AC+ AGL     + +E+M  +R +K+        CM   + ++  L+       
Subjt:  RDLIPWLSV------HGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL-----SIWETMSSLRAHKIRACSISCWCMGCTVSQSCLLK-------

Query:  --------GDCDTLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNE
                    TLL A +++ +   +E  A +L++L  QNA  YV L+NIY +  RW DV+ VR RM E
Subjt:  --------GDCDTLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNE

Arabidopsis top hitse value%identityAlignment
AT1G13410.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-3029.1Show/hide
Query:  CLISEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWCVHENGV--------AEKVFDEMPVNNLVSFTKMM------GGMQKRWHAFCHE
        C + EK VV WT+MI GY    +LV A+++ FD   +  +V    W    +G         A  +FD+MP  +++S+  ++      G M+     F  +
Subjt:  CLISEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWCVHENGV--------AEKVFDEMPVNNLVSFTKMM------GGMQKRWHAFCHE

Query:  FAPERDIVALSALISGYTQNRQPNEAVKTFLEM-GSRNVKSDEFLLASLMSACS-----------HCLVDLCG-----AHVRAALKDMNAKCGNMKRAMY
          PER++ + + LI GY QN + +E + +F  M    +V  ++  +  ++SAC+           H   +  G      +V+ AL DM  KCG ++ AM 
Subjt:  FAPERDIVALSALISGYTQNRQPNEAVKTFLEM-GSRNVKSDEFLLASLMSACS-----------HCLVDLCG-----AHVRAALKDMNAKCGNMKRAMY

Query:  LFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL---------SIWETMSSLRAHKIRACSISCWCMGCTVSQS
        +F+ + +RDLI W      L+ HG G + ++LF  M +  ++PD V    +L AC   GL         S++   S +   +   C +        ++Q+
Subjt:  LFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL---------SIWETMSSLRAHKIRACSISCWCMGCTVSQS

Query:  ------CLLKGDC---DTLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERGF
                +K D     TLL A K+Y      EV   +LI+LEP+N  N+V+LSNIY  A R+ D + ++  M + GF
Subjt:  ------CLLKGDC---DTLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERGF

AT3G13770.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-2830.36Show/hide
Query:  RCDVDEDLYETC-LISEKVVVSWTAMIVGYSSIGNLVE-----AKKFRFDAGEK---CGVVECNCWCVHENGVAEKV--------FDE--MPVNNLVSFT
        +CD  ED  +    + EK VVSWTAMI  YS  G+  E     A+  R D          V  +C      G+ +++        +D      ++L+   
Subjt:  RCDVDEDLYETC-LISEKVVVSWTAMIVGYSSIGNLVE-----AKKFRFDAGEK---CGVVECNCWCVHENGVAEKV--------FDE--MPVNNLVSFT

Query:  KMMGGMQKRWHAFCHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACS-----------HCLV---DL-CGAHVRAALK
           G +++    F  E  PERD+V+ +A+I+GY Q     EA++ F  + S  +  +    ASL++A S           HC V   +L   A ++ +L 
Subjt:  KMMGGMQKRWHAFCHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACS-----------HCLV---DL-CGAHVRAALK

Query:  DMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDED-LTPDVVALTVILAACSR-----AGLSIWETM-----SSLRAHKIR
        DM +KCGN+  A  LF+ MP+R  I W       S HG G +V+ LF  M DE  + PD V L  +L+ CS       GL+I++ M      +    +  
Subjt:  DMNAKCGNMKRAMYLFEEMPKRDLIPW------LSVHGRGAQVVSLFERMLDED-LTPDVVALTVILAACSR-----AGLSIWETM-----SSLRAHKIR

Query:  ACSISCWCMGCTVSQ---------SCLLKGDCDTLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNER
         C +        + +         S    G   +LL A +++      E V  +LIE+EP+NA NYV+LSN+Y +A RW DV+ VR  M ++
Subjt:  ACSISCWCMGCTVSQ---------SCLLKGDCDTLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNER

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-3128.38Show/hide
Query:  ISEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWC-----VHENGVAEKVFDEMPVNNLVSFTKMM------GGMQKRWHAFCHEFAPER
        +SE+  VSW +M+ G    G L +A++  FD   +  ++  N          E   A ++F++MP  N VS++ M+      G M+     F     P +
Subjt:  ISEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWC-----VHENGVAEKVFDEMPVNNLVSFTKMM------GGMQKRWHAFCHEFAPER

Query:  DIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSHC-LVDL--------------CGAHVRAALKDMNAKCGNMKRAMYLFEEMPK
        ++V  + +I+GY +     EA +   +M +  +K D   + S+++AC+   L+ L                A+V  AL DM AKCGN+K+A  +F ++PK
Subjt:  DIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSHC-LVDL--------------CGAHVRAALKDMNAKCGNMKRAMYLFEEMPK

Query:  RDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGLSIWETMSSLRAHKIRACSISCWCMGCTVS---------------QSC
        +DL+ W      L VHG G + + LF RM  E + PD V    +L +C+ AGL            K+          GC V                Q+ 
Subjt:  RDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGLSIWETMSSLRAHKIRACSISCWCMGCTVS---------------QSC

Query:  LLKGDC---DTLLAAGKLYSDSEV---VAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERG
         ++ +      LL A +++++ ++   V   L++L+P +  NY LLSNIY AAE W  V+ +R++M   G
Subjt:  LLKGDC---DTLLAAGKLYSDSEV---VAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERG

AT4G16835.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.6e-3027.84Show/hide
Query:  KVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEK---------CGVVECNCWCVHENGVAEKVFDEMPVNNLVSFTKMMGGMQKR-----WHAFCHEFAPER
        K   SW  MI GY+  G + +A++  +   EK          G +EC      +   A   F   PV  +V++T M+ G  K        A   +    +
Subjt:  KVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEK---------CGVVECNCWCVHENGVAEKVFDEMPVNNLVSFTKMMGGMQKR-----WHAFCHEFAPER

Query:  DIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACS-----------HCLVD---LCG-AHVRAALKDMNAKCGNMKRAMYLFEEMPK
        ++V  +A+ISGY +N +P + +K F  M    ++ +   L+S +  CS           H +V    LC       +L  M  KCG +  A  LFE M K
Subjt:  DIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACS-----------HCLVD---LCG-AHVRAALKDMNAKCGNMKRAMYLFEEMPK

Query:  RDLIPWLSV------HGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL-----SIWETMSSLRAHKIRACSISCWCMGCTVSQSCLLK-------
        +D++ W ++      HG   + + LF  M+D  + PD +    +L AC+ AGL     + +E+M  +R +K+        CM   + ++  L+       
Subjt:  RDLIPWLSV------HGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGL-----SIWETMSSLRAHKIRACSISCWCMGCTVSQSCLLK-------

Query:  --------GDCDTLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNE
                    TLL A +++ +   +E  A +L++L  QNA  YV L+NIY +  RW DV+ VR RM E
Subjt:  --------GDCDTLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNE

AT5G37570.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.6e-5439.25Show/hide
Query:  EKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWCVHENGV--------AEKVFDEMPVNNLVSFTKMMGGMQKRWHAF----CHEFAPERD
        E+  VSWTA++V Y   G L EAK   FD   +  +     W    +G+        A+K+FDEMP  +++S+T M+ G  K           E A   D
Subjt:  EKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWCVHENGV--------AEKVFDEMPVNNLVSFTKMMGGMQKRWHAF----CHEFAPERD

Query:  IVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSHC-LVDLC---------------GAHVRAALKDMNAKCGNMKRAMYLFEEMPK
        + A SALI GY QN QPNEA K F EM ++NVK DEF++  LMSACS     +LC                 +V  AL DMNAKCG+M RA  LFEEMP+
Subjt:  IVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSHC-LVDLC---------------GAHVRAALKDMNAKCGNMKRAMYLFEEMPK

Query:  RDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRA-----GLSIWETMSSLRAHKIRACSISCWCMGCTVSQSCLLKGDCD---
        RDL+ +      +++HG G++ + LFE+M+DE + PD VA TVIL  C ++     GL  +E M   + + I A      C+   +S++  LK   +   
Subjt:  RDLIPW------LSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRA-----GLSIWETMSSLRAHKIRACSISCWCMGCTVSQSCLLKGDCD---

Query:  ------------TLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERG
                    +LL    L+ +   +EVVA  L ELEPQ+A +YVLLSNIY A +RW DV+ +R++MNE G
Subjt:  ------------TLLAAGKLYSD---SEVVAYQLIELEPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GACCGGCATAATCAAAAGCTATCAAATATCATTTCCGTAGACATTTCCTTCGTTTCTCAAGGCGCGCGCAACCGAGGGGAAGGTGATGGAAGGGATGACCCTTTGAGATG
TGATGTTGATGAGGATTTGTATGAGACATGTTTGATATCAGAGAAGGTTGTGGTTTCATGGACGGCTATGATTGTTGGGTATTCGAGCATTGGGAATTTGGTGGAGGCAA
AGAAGTTTCGATTCGATGCCGGAGAGAAATGTGGCGTCGTGGAATGCAATTGTTGGTGCGTACATGAAAATGGTGTTGCTGAGAAGGTGTTCGATGAAATGCCGGTGAAT
AATCTTGTATCTTTCACAAAAATGATGGGTGGGATGCAAAAGCGGTGGCATGCTTTCTGCCATGAATTTGCACCTGAAAGAGATATTGTGGCATTGTCGGCTTTGATATC
TGGGTATACACAAAATCGCCAGCCAAATGAGGCTGTCAAAACTTTTCTTGAAATGGGTTCTAGGAATGTGAAATCTGATGAGTTTCTATTGGCAAGCTTAATGTCAGCTT
GTTCTCACTGCTTAGTTGATCTTTGTGGAGCTCATGTTAGGGCAGCTCTTAAAGATATGAATGCCAAGTGTGGAAACATGAAGCGAGCCATGTATCTGTTTGAAGAAATG
CCTAAAAGAGATCTAATTCCCTGGTTGTCAGTCCATGGGCGTGGAGCTCAGGTTGTGTCACTCTTCGAGAGGATGTTAGATGAAGATCTAACTCCCGATGTAGTAGCCTT
AACAGTCATCTTAGCAGCTTGTAGCCGTGCTGGACTTTCGATCTGGGAGACTATGAGCAGTTTACGAGCTCATAAAATCCGTGCTTGTTCAATCTCATGTTGGTGCATGG
GGTGCACTGTTTCACAGTCATGCTTGTTAAAGGGCGATTGTGACACACTGCTTGCGGCCGGTAAATTATATTCTGATTCAGAGGTGGTAGCTTATCAACTTATTGAGCTC
GAGCCTCAAAATGCAGACAATTATGTTCTATTATCCAATATCTATACTGCAGCAGAAAGGTGGCTGGATGTGTCTGTTGTGCGGAACCGGATGAATGAAAGAGGGTTC
mRNA sequenceShow/hide mRNA sequence
GACCGGCATAATCAAAAGCTATCAAATATCATTTCCGTAGACATTTCCTTCGTTTCTCAAGGCGCGCGCAACCGAGGGGAAGGTGATGGAAGGGATGACCCTTTGAGATG
TGATGTTGATGAGGATTTGTATGAGACATGTTTGATATCAGAGAAGGTTGTGGTTTCATGGACGGCTATGATTGTTGGGTATTCGAGCATTGGGAATTTGGTGGAGGCAA
AGAAGTTTCGATTCGATGCCGGAGAGAAATGTGGCGTCGTGGAATGCAATTGTTGGTGCGTACATGAAAATGGTGTTGCTGAGAAGGTGTTCGATGAAATGCCGGTGAAT
AATCTTGTATCTTTCACAAAAATGATGGGTGGGATGCAAAAGCGGTGGCATGCTTTCTGCCATGAATTTGCACCTGAAAGAGATATTGTGGCATTGTCGGCTTTGATATC
TGGGTATACACAAAATCGCCAGCCAAATGAGGCTGTCAAAACTTTTCTTGAAATGGGTTCTAGGAATGTGAAATCTGATGAGTTTCTATTGGCAAGCTTAATGTCAGCTT
GTTCTCACTGCTTAGTTGATCTTTGTGGAGCTCATGTTAGGGCAGCTCTTAAAGATATGAATGCCAAGTGTGGAAACATGAAGCGAGCCATGTATCTGTTTGAAGAAATG
CCTAAAAGAGATCTAATTCCCTGGTTGTCAGTCCATGGGCGTGGAGCTCAGGTTGTGTCACTCTTCGAGAGGATGTTAGATGAAGATCTAACTCCCGATGTAGTAGCCTT
AACAGTCATCTTAGCAGCTTGTAGCCGTGCTGGACTTTCGATCTGGGAGACTATGAGCAGTTTACGAGCTCATAAAATCCGTGCTTGTTCAATCTCATGTTGGTGCATGG
GGTGCACTGTTTCACAGTCATGCTTGTTAAAGGGCGATTGTGACACACTGCTTGCGGCCGGTAAATTATATTCTGATTCAGAGGTGGTAGCTTATCAACTTATTGAGCTC
GAGCCTCAAAATGCAGACAATTATGTTCTATTATCCAATATCTATACTGCAGCAGAAAGGTGGCTGGATGTGTCTGTTGTGCGGAACCGGATGAATGAAAGAGGGTTC
Protein sequenceShow/hide protein sequence
DRHNQKLSNIISVDISFVSQGARNRGEGDGRDDPLRCDVDEDLYETCLISEKVVVSWTAMIVGYSSIGNLVEAKKFRFDAGEKCGVVECNCWCVHENGVAEKVFDEMPVN
NLVSFTKMMGGMQKRWHAFCHEFAPERDIVALSALISGYTQNRQPNEAVKTFLEMGSRNVKSDEFLLASLMSACSHCLVDLCGAHVRAALKDMNAKCGNMKRAMYLFEEM
PKRDLIPWLSVHGRGAQVVSLFERMLDEDLTPDVVALTVILAACSRAGLSIWETMSSLRAHKIRACSISCWCMGCTVSQSCLLKGDCDTLLAAGKLYSDSEVVAYQLIEL
EPQNADNYVLLSNIYTAAERWLDVSVVRNRMNERGF