; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G21567 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G21567
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionPentatricopeptide repeat-containing protein
Genome locationctg944:321140..322590
RNA-Seq ExpressionCucsat.G21567
SyntenyCucsat.G21567
Gene Ontology termsGO:0006749 - glutathione metabolic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004364 - glutathione transferase activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043201.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]1.74e-24493.93Show/hide
Query:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS
        MQFLAS RILR HGFLQKLCS Q GSS SAS+AFFSSTHFDSISSPHHDFSSSS LQSP++K CSLVL+ YLRQPHLRFSPSKLNLDMDA SLTHEQAIS
Subjt:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS

Query:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR
        AVA LASEEGSMVALSFFYWA+GFPKFRYFMRLYIVCTMSL+GKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAA M 
Subjt:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR

Query:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
        LVEYAGNVFDEMSARGVYPDSCTYK IIVGYCRNG+VLEADRWICEMMERGFVVDNATLTLII AFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
Subjt:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG

Query:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS
        LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKL+RSDNYKPNVHTYTAMISGYCKE+KLS
Subjt:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS

XP_004145475.3 pentatricopeptide repeat-containing protein At4g19890 [Cucumis sativus]8.54e-286100Show/hide
Query:  MNSTNPRQLALNGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHL
        MNSTNPRQLALNGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHL
Subjt:  MNSTNPRQLALNGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHL

Query:  RFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDM
        RFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDM
Subjt:  RFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDM

Query:  RNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW
        RNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW
Subjt:  RNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW

Query:  FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEK
        FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEK
Subjt:  FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEK

Query:  LS
        LS
Subjt:  LS

XP_008459042.1 PREDICTED: pentatricopeptide repeat-containing protein At4g19890 [Cucumis melo]1.07e-24993.93Show/hide
Query:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS
        MQFLAS RILR HGFLQKLCS Q GSS SAS+AFFSSTHFDSISSPHHDFSSSS LQSP++K CSLVL+ YLRQPHLRFSPSKLNLDMDA SLTHEQAIS
Subjt:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS

Query:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR
        AVA LASEEGSMVALSFFYWA+GFPKFRYFMRLYIVCTMSL+GKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAA M 
Subjt:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR

Query:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
        LVEYAGNVFDEMSARGVYPDSCTYK IIVGYCRNG+VLEADRWICEMMERGFVVDNATLTLII AFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
Subjt:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG

Query:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS
        LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKL+RSDNYKPNVHTYTAMISGYCKE+KLS
Subjt:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS

XP_038895547.1 pentatricopeptide repeat-containing protein At4g19890 isoform X2 [Benincasa hispida]1.60e-23689.18Show/hide
Query:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS
        MQFLASLRILRPHGFLQKLC FQQ SS SAS+ FFSSTH  SISS H+D SSSSSLQSP++ ICSLVL TY RQPHLRFSPSKLNLDMDA SLTHEQAIS
Subjt:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS

Query:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR
         VA LASEEGSMVALSFFYWA+GFPKFR+FMRLYIVCTMSL+GKCNLERA EVVECMVG FAEIGKLKEAVDMI DMRNQGLVLTTRVMNR I+VAAEM 
Subjt:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR

Query:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
        L+EYAGNVFDEMSARGV PDSCTYK II GYCR G+VLEADRWICEMMERGFVVDNATLTLII AFCEKS VNRA+WFFHKV+KMGLSPNLINYSSMI+G
Subjt:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG

Query:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS
        LCKRGSVKQAFELLEEMV+NGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKL+RSDNYKPNVHTYTAMISGYCKEEKL+
Subjt:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS

XP_038895548.1 pentatricopeptide repeat-containing protein At4g19890 isoform X3 [Benincasa hispida]1.26e-23689.18Show/hide
Query:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS
        MQFLASLRILRPHGFLQKLC FQQ SS SAS+ FFSSTH  SISS H+D SSSSSLQSP++ ICSLVL TY RQPHLRFSPSKLNLDMDA SLTHEQAIS
Subjt:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS

Query:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR
         VA LASEEGSMVALSFFYWA+GFPKFR+FMRLYIVCTMSL+GKCNLERA EVVECMVG FAEIGKLKEAVDMI DMRNQGLVLTTRVMNR I+VAAEM 
Subjt:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR

Query:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
        L+EYAGNVFDEMSARGV PDSCTYK II GYCR G+VLEADRWICEMMERGFVVDNATLTLII AFCEKS VNRA+WFFHKV+KMGLSPNLINYSSMI+G
Subjt:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG

Query:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS
        LCKRGSVKQAFELLEEMV+NGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKL+RSDNYKPNVHTYTAMISGYCKEEKL+
Subjt:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS

TrEMBL top hitse value%identityAlignment
A0A0A0LYL9 Uncharacterized protein4.14e-286100Show/hide
Query:  MNSTNPRQLALNGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHL
        MNSTNPRQLALNGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHL
Subjt:  MNSTNPRQLALNGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHL

Query:  RFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDM
        RFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDM
Subjt:  RFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDM

Query:  RNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW
        RNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW
Subjt:  RNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW

Query:  FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEK
        FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEK
Subjt:  FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEK

Query:  LS
        LS
Subjt:  LS

A0A1S3CAH0 pentatricopeptide repeat-containing protein At4g198905.20e-25093.93Show/hide
Query:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS
        MQFLAS RILR HGFLQKLCS Q GSS SAS+AFFSSTHFDSISSPHHDFSSSS LQSP++K CSLVL+ YLRQPHLRFSPSKLNLDMDA SLTHEQAIS
Subjt:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS

Query:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR
        AVA LASEEGSMVALSFFYWA+GFPKFRYFMRLYIVCTMSL+GKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAA M 
Subjt:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR

Query:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
        LVEYAGNVFDEMSARGVYPDSCTYK IIVGYCRNG+VLEADRWICEMMERGFVVDNATLTLII AFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
Subjt:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG

Query:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS
        LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKL+RSDNYKPNVHTYTAMISGYCKE+KLS
Subjt:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS

A0A5A7TNN8 Pentatricopeptide repeat-containing protein8.44e-24593.93Show/hide
Query:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS
        MQFLAS RILR HGFLQKLCS Q GSS SAS+AFFSSTHFDSISSPHHDFSSSS LQSP++K CSLVL+ YLRQPHLRFSPSKLNLDMDA SLTHEQAIS
Subjt:  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAIS

Query:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR
        AVA LASEEGSMVALSFFYWA+GFPKFRYFMRLYIVCTMSL+GKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAA M 
Subjt:  AVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR

Query:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
        LVEYAGNVFDEMSARGVYPDSCTYK IIVGYCRNG+VLEADRWICEMMERGFVVDNATLTLII AFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
Subjt:  LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG

Query:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS
        LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKL+RSDNYKPNVHTYTAMISGYCKE+KLS
Subjt:  LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS

A0A6J1GR66 pentatricopeptide repeat-containing protein At4g198905.16e-21581.87Show/hide
Query:  IPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSS----LQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASL
        +P MQFLAS+RILR HGFL K  SF Q  S SAS   FSS+HFDSISSP+ D SSSSS    LQSP++ ICSLV+++Y RQPHLRFSP KLNLDMDA  L
Subjt:  IPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSS----LQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASL

Query:  THEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRII
        THEQAIS VA LASEEGSM+ALSFFYWA+GFPKFRYFMRLYIVCTM LVGKC  ERA EVVECM+GVFAEIGKLKEAVDMI+DMRNQGLVLTTRVMNRII
Subjt:  THEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRII

Query:  LVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLIN
        +VAAEM L+EYAGN+FDEMSARG+ P+SCTYK IIVGYCR GNVL+ DRW+ EMMERGFVVDNATLTLII AFC+K  V+RA W FHKV KMGLSPNLIN
Subjt:  LVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLIN

Query:  YSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS
        YSSMISGLCKRGSVKQAFELLEEMV+NGWKPNVYTHTSLI GLCKKGWT+RAFRLFLKL+RSD YKPNV+TYTAMISGYCKEEKL+
Subjt:  YSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS

A0A6J1K602 pentatricopeptide repeat-containing protein At4g198901.53e-21582.98Show/hide
Query:  IPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQ
        +P MQFLASLRILR HGFL K  SF Q  S SAS  FFSS+HFDSISSPH D SSSSSLQSP++ ICSLV+++Y RQPHLRFSP KLNLD+DA  LTHEQ
Subjt:  IPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQ

Query:  AISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAA
        AIS VA LASEEGSM+ALSFFYWA+ FPKFRYFMRLYIVCTM LV KC  ERA EVVECM+GVFAEIGKLKEAVDMI+DMRNQGLVLTTRVMNRII+VAA
Subjt:  AISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAA

Query:  EMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSM
        EM L+EYAGN+FDEMSARG+ P+SCTYK IIVGYCR GNVL+ DRWI EMMERGFVVDNATLTLII AFC+K  V+RA W FHKV KMGLSPNLINYSSM
Subjt:  EMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSM

Query:  ISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS
        ISGLCKRGSVKQAFELLEEMV+NGWKPNVYTHTSLI GLCKKGWT+RAFRLFLKL+RSD YKPNV+TYTAMISGYCKEEKL+
Subjt:  ISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS

SwissProt top hitse value%identityAlignment
P0C8Q3 Pentatricopeptide repeat-containing protein At4g198904.5e-12462.61Show/hide
Query:  SLAFF---SSTHFDS-ISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPK
        SL FF   SS H  S +S P     SSS  Q  +K +CSLV  +YLRQ H+  SP ++NLD DA SLTHEQAI+ VA LASE GSMVAL FFYWAVGF K
Subjt:  SLAFF---SSTHFDS-ISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPK

Query:  FRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKY
        FR+FMRLY+V   SL+   NL++AHEV+ CM+  F+EIG+L EAV M++DM+NQGL  ++  MN ++ +A E+ L+EYA NVFDEMS RGV PDS +YK 
Subjt:  FRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKY

Query:  IIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNV
        +++G  R+G + EADRW+  M++RGF+ DNAT TLI+TA CE  LVNRA+W+F K+  +G  PNLIN++S+I GLCK+GS+KQAFE+LEEMV+NGWKPNV
Subjt:  IIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNV

Query:  YTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS
        YTHT+LI GLCK+GWTE+AFRLFLKL+RSD YKPNVHTYT+MI GYCKE+KL+
Subjt:  YTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS

Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial1.8e-3233.33Show/hide
Query:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDN
        ++G+   I KL EA +   +M  QG++  T V   +I    +   +  A   F EM +R + PD  TY  II G+C+ G+++EA +   EM  +G   D+
Subjt:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDN

Query:  ATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSD
         T T +I  +C+   +  A    + + + G SPN++ Y+++I GLCK G +  A ELL EM K G +PN++T+ S+++GLCK G  E A +L +    + 
Subjt:  ATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSD

Query:  NYKPNVHTYTAMISGYCKEEKLSELKCCLKE
            +  TYT ++  YCK  ++ + +  LKE
Subjt:  NYKPNVHTYTAMISGYCKEEKLSELKCCLKE

Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial3.5e-2829.91Show/hide
Query:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDN
        ++ VF + GKL EA ++  +M  +G+   T   N +I    +   +  A  +FD M ++G  PD  TY  +I  YC+   V +  R   E+  +G + + 
Subjt:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDN

Query:  ATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSD
         T   ++  FC+   +N A   F ++   G+ P+++ Y  ++ GLC  G + +A E+ E+M K+     +  +  +IHG+C     + A+ LF  L    
Subjt:  ATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSD

Query:  NYKPNVHTYTAMISGYCKEEKLSE
          KP+V TY  MI G CK+  LSE
Subjt:  NYKPNVHTYTAMISGYCKEEKLSE

Q9FNL2 Pentatricopeptide repeat-containing protein At5g461002.7e-2834.62Show/hide
Query:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR---LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFV
        ++ +  E  +L  A     +MR  GL  T   +N  +L+ A  R    V+    +F EM  RG  PDS TY  +I G CR G + EA +   EM+E+   
Subjt:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR---LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFV

Query:  VDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLI
            T T +I   C    V+ A+ +  ++   G+ PN+  YSS++ GLCK G   QA EL E M+  G +PN+ T+T+LI GLCK+   + A  L L  +
Subjt:  VDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLI

Query:  RSDNYKPNVHTYTAMISGYCKEEKLSELKCCLKE
             KP+   Y  +ISG+C   K  E    L E
Subjt:  RSDNYKPNVHTYTAMISGYCKEEKLSELKCCLKE

Q9LSL9 Pentatricopeptide repeat-containing protein At5g655601.1e-2929.59Show/hide
Query:  YIVCTMSLVGKCNLERAHEVV---------------ECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVY
        Y V   SL  +C  E+A E++                 ++  + + G +++AVD++  M ++ L   TR  N +I    +   V  A  V ++M  R V 
Subjt:  YIVCTMSLVGKCNLERAHEVV---------------ECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVY

Query:  PDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMV
        PD  TY  +I G CR+GN   A R +  M +RG V D  T T +I + C+   V  A   F  + + G++PN++ Y+++I G CK G V +A  +LE+M+
Subjt:  PDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMV

Query:  KNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIR----------------------------------SDNYKPNVHTYTAMISGYCKEEKL
             PN  T  +LIHGLC  G  + A  L  K+++                                  S   KP+ HTYT  I  YC+E +L
Subjt:  KNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIR----------------------------------SDNYKPNVHTYTAMISGYCKEEKL

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.3e-3333.33Show/hide
Query:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDN
        ++G+   I KL EA +   +M  QG++  T V   +I    +   +  A   F EM +R + PD  TY  II G+C+ G+++EA +   EM  +G   D+
Subjt:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDN

Query:  ATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSD
         T T +I  +C+   +  A    + + + G SPN++ Y+++I GLCK G +  A ELL EM K G +PN++T+ S+++GLCK G  E A +L +    + 
Subjt:  ATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSD

Query:  NYKPNVHTYTAMISGYCKEEKLSELKCCLKE
            +  TYT ++  YCK  ++ + +  LKE
Subjt:  NYKPNVHTYTAMISGYCKEEKLSELKCCLKE

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein1.3e-3333.33Show/hide
Query:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDN
        ++G+   I KL EA +   +M  QG++  T V   +I    +   +  A   F EM +R + PD  TY  II G+C+ G+++EA +   EM  +G   D+
Subjt:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDN

Query:  ATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSD
         T T +I  +C+   +  A    + + + G SPN++ Y+++I GLCK G +  A ELL EM K G +PN++T+ S+++GLCK G  E A +L +    + 
Subjt:  ATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSD

Query:  NYKPNVHTYTAMISGYCKEEKLSELKCCLKE
            +  TYT ++  YCK  ++ + +  LKE
Subjt:  NYKPNVHTYTAMISGYCKEEKLSELKCCLKE

AT4G19890.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.2e-12562.61Show/hide
Query:  SLAFF---SSTHFDS-ISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPK
        SL FF   SS H  S +S P     SSS  Q  +K +CSLV  +YLRQ H+  SP ++NLD DA SLTHEQAI+ VA LASE GSMVAL FFYWAVGF K
Subjt:  SLAFF---SSTHFDS-ISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPK

Query:  FRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKY
        FR+FMRLY+V   SL+   NL++AHEV+ CM+  F+EIG+L EAV M++DM+NQGL  ++  MN ++ +A E+ L+EYA NVFDEMS RGV PDS +YK 
Subjt:  FRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKY

Query:  IIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNV
        +++G  R+G + EADRW+  M++RGF+ DNAT TLI+TA CE  LVNRA+W+F K+  +G  PNLIN++S+I GLCK+GS+KQAFE+LEEMV+NGWKPNV
Subjt:  IIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNV

Query:  YTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS
        YTHT+LI GLCK+GWTE+AFRLFLKL+RSD YKPNVHTYT+MI GYCKE+KL+
Subjt:  YTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLS

AT5G46100.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-2934.62Show/hide
Query:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR---LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFV
        ++ +  E  +L  A     +MR  GL  T   +N  +L+ A  R    V+    +F EM  RG  PDS TY  +I G CR G + EA +   EM+E+   
Subjt:  MVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMR---LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFV

Query:  VDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLI
            T T +I   C    V+ A+ +  ++   G+ PN+  YSS++ GLCK G   QA EL E M+  G +PN+ T+T+LI GLCK+   + A  L L  +
Subjt:  VDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLI

Query:  RSDNYKPNVHTYTAMISGYCKEEKLSELKCCLKE
             KP+   Y  +ISG+C   K  E    L E
Subjt:  RSDNYKPNVHTYTAMISGYCKEEKLSELKCCLKE

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein7.7e-3129.59Show/hide
Query:  YIVCTMSLVGKCNLERAHEVV---------------ECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVY
        Y V   SL  +C  E+A E++                 ++  + + G +++AVD++  M ++ L   TR  N +I    +   V  A  V ++M  R V 
Subjt:  YIVCTMSLVGKCNLERAHEVV---------------ECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVY

Query:  PDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMV
        PD  TY  +I G CR+GN   A R +  M +RG V D  T T +I + C+   V  A   F  + + G++PN++ Y+++I G CK G V +A  +LE+M+
Subjt:  PDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMV

Query:  KNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIR----------------------------------SDNYKPNVHTYTAMISGYCKEEKL
             PN  T  +LIHGLC  G  + A  L  K+++                                  S   KP+ HTYT  I  YC+E +L
Subjt:  KNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIR----------------------------------SDNYKPNVHTYTAMISGYCKEEKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTCAACAAATCCTCGACAATTAGCATTGAATGGAGGGAGAGGTCCTGCAGTTTTTATACCTTTGATGCAGTTCTTGGCGTCTCTCCGAATTCTGAGGCCCCATGG
ATTTCTCCAGAAATTATGTTCCTTTCAACAGGGATCTTCAGCTTCTGCCTCCCTCGCATTTTTCTCCTCAACTCATTTTGATTCCATCTCTTCGCCGCACCATGATTTTT
CTTCTTCTTCTTCGTTGCAGTCCCCTCTGAAAAAGATTTGTTCATTAGTTCTCGACACTTATTTACGTCAACCCCATCTGAGATTCTCTCCATCTAAGCTGAATCTTGAT
ATGGATGCTGCCTCTTTGACTCATGAGCAAGCCATTTCTGCCGTTGCTTTGCTTGCTAGCGAGGAGGGTTCAATGGTCGCGCTGAGTTTCTTTTACTGGGCAGTTGGGTT
CCCCAAATTCCGGTATTTCATGCGGCTCTACATAGTTTGTACGATGTCATTGGTTGGCAAATGTAATCTAGAGCGAGCTCATGAAGTTGTGGAGTGTATGGTAGGTGTTT
TTGCAGAAATTGGGAAGTTGAAGGAGGCGGTGGATATGATCCTTGACATGAGAAATCAGGGACTTGTGTTGACCACCAGGGTAATGAATCGTATTATCCTGGTGGCTGCT
GAAATGAGGCTGGTTGAATATGCAGGCAATGTGTTCGACGAAATGTCTGCAAGAGGTGTGTATCCTGATTCTTGCACTTATAAGTATATAATTGTAGGTTACTGTAGAAA
TGGTAATGTTTTGGAAGCAGATAGGTGGATATGTGAGATGATGGAGAGAGGTTTTGTGGTTGATAATGCCACACTGACTTTGATTATTACAGCATTTTGTGAAAAAAGTT
TAGTAAACAGGGCAGTTTGGTTTTTCCATAAGGTTACTAAGATGGGTTTATCACCAAATTTGATAAACTATTCATCTATGATTAGTGGGTTGTGCAAGAGGGGTAGTGTT
AAGCAAGCATTTGAGTTATTGGAAGAGATGGTTAAAAATGGATGGAAACCCAATGTGTATACCCACACATCATTAATTCATGGCCTTTGCAAGAAGGGATGGACAGAGAG
AGCTTTTAGACTGTTTCTTAAACTTATTAGAAGTGATAATTACAAGCCAAATGTGCACACTTACACAGCCATGATAAGTGGGTACTGCAAAGAGGAGAAGTTGAGTGAGC
TGAAATGTTGTTTGAAAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAACTCAACAAATCCTCGACAATTAGCATTGAATGGAGGGAGAGGTCCTGCAGTTTTTATACCTTTGATGCAGTTCTTGGCGTCTCTCCGAATTCTGAGGCCCCATGG
ATTTCTCCAGAAATTATGTTCCTTTCAACAGGGATCTTCAGCTTCTGCCTCCCTCGCATTTTTCTCCTCAACTCATTTTGATTCCATCTCTTCGCCGCACCATGATTTTT
CTTCTTCTTCTTCGTTGCAGTCCCCTCTGAAAAAGATTTGTTCATTAGTTCTCGACACTTATTTACGTCAACCCCATCTGAGATTCTCTCCATCTAAGCTGAATCTTGAT
ATGGATGCTGCCTCTTTGACTCATGAGCAAGCCATTTCTGCCGTTGCTTTGCTTGCTAGCGAGGAGGGTTCAATGGTCGCGCTGAGTTTCTTTTACTGGGCAGTTGGGTT
CCCCAAATTCCGGTATTTCATGCGGCTCTACATAGTTTGTACGATGTCATTGGTTGGCAAATGTAATCTAGAGCGAGCTCATGAAGTTGTGGAGTGTATGGTAGGTGTTT
TTGCAGAAATTGGGAAGTTGAAGGAGGCGGTGGATATGATCCTTGACATGAGAAATCAGGGACTTGTGTTGACCACCAGGGTAATGAATCGTATTATCCTGGTGGCTGCT
GAAATGAGGCTGGTTGAATATGCAGGCAATGTGTTCGACGAAATGTCTGCAAGAGGTGTGTATCCTGATTCTTGCACTTATAAGTATATAATTGTAGGTTACTGTAGAAA
TGGTAATGTTTTGGAAGCAGATAGGTGGATATGTGAGATGATGGAGAGAGGTTTTGTGGTTGATAATGCCACACTGACTTTGATTATTACAGCATTTTGTGAAAAAAGTT
TAGTAAACAGGGCAGTTTGGTTTTTCCATAAGGTTACTAAGATGGGTTTATCACCAAATTTGATAAACTATTCATCTATGATTAGTGGGTTGTGCAAGAGGGGTAGTGTT
AAGCAAGCATTTGAGTTATTGGAAGAGATGGTTAAAAATGGATGGAAACCCAATGTGTATACCCACACATCATTAATTCATGGCCTTTGCAAGAAGGGATGGACAGAGAG
AGCTTTTAGACTGTTTCTTAAACTTATTAGAAGTGATAATTACAAGCCAAATGTGCACACTTACACAGCCATGATAAGTGGGTACTGCAAAGAGGAGAAGTTGAGTGAGC
TGAAATGTTGTTTGAAAGAATGA
Protein sequenceShow/hide protein sequence
MNSTNPRQLALNGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLD
MDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAA
EMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSV
KQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSELKCCLKE