; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G006340 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G006340
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein isoform 2
Genome locationGy14Chr4:4584998..4588782
RNA-Seq ExpressionCsGy4G006340
SyntenyCsGy4G006340
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044898.1 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa]7.64e-11594.15Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGRQQ RPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPF
        GKRPF
Subjt:  GKRPF

KAE8649209.1 hypothetical protein Csa_014401 [Cucumis sativus]2.71e-145100Show/hide
Query:  LSASGRFLGSVRMAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF
        LSASGRFLGSVRMAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF
Subjt:  LSASGRFLGSVRMAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF

Query:  PLATEVARKAAVAPIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNG
        PLATEVARKAAVAPIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNG
Subjt:  PLATEVARKAAVAPIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNG

Query:  GRQQQRPPWGKRPFW
        GRQQQRPPWGKRPFW
Subjt:  GRQQQRPPWGKRPFW

XP_004148996.1 uncharacterized protein LOC101210049 [Cucumis sativus]5.52e-137100Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

XP_008451954.1 PREDICTED: uncharacterized protein LOC103493102 [Cucumis melo]6.28e-12594.17Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGRQQ RPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPFW
        GKRPFW
Subjt:  GKRPFW

XP_038888349.1 uncharacterized protein LOC120078194 isoform X1 [Benincasa hispida]1.06e-11791.13Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRKLQRFMDSRSSLRQGALANRRS+FQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAF RRA NWNKTRV+  PPVPRKPF NG FVPKVSAPAQPQTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN GGAQ RNGG  QQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

TrEMBL top hitse value%identityAlignment
A0A0A0KUX2 Uncharacterized protein2.67e-137100Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

A0A1S3BTH9 uncharacterized protein LOC1034931023.04e-12594.17Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGRQQ RPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPFW
        GKRPFW
Subjt:  GKRPFW

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 23.70e-11594.15Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGRQQ RPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPF
        GKRPF
Subjt:  GKRPF

A0A5D3CYD8 Uncharacterized protein3.04e-12594.17Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGRQQ RPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPFW
        GKRPFW
Subjt:  GKRPFW

A0A6J1C7V3 uncharacterized protein LOC1110082181.60e-11187.25Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQFPLA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVS--APAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWG
        APIRPR F RRAPNW+KTR +A PPV RKPFTNG F+PKV+  A AQPQTNT PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNGG +QQRPPWG
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVS--APAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWG

Query:  KRPF
        +  F
Subjt:  KRPF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10970.1 unknown protein5.7e-3149.29Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.2 unknown protein5.7e-3149.29Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.3 unknown protein5.7e-3149.29Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.4 unknown protein5.7e-3149.29Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.5 unknown protein2.2e-2244.98Show/hide
Query:  MDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFT-RRAPN
        MDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A  R R +   R  N
Subjt:  MDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFT-RRAPN

Query:  WNKTR---------VEAH---PPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQR
         N++          V+A    PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  QQ+
Subjt:  WNKTR---------VEAH---PPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQR

Query:  ---PPWGKR
            PW +R
Subjt:  ---PPWGKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGCTGTCATCTTTACCCCCACAACAAATTCAGGCTGCCCATCGATAAGCTAATATCTTTCGTCTACTCTTCCTTCGGCGGCAAGGAATTTGACTCTCCCGCGTA
CCTCTTACAACTTTTAATCTTTCCTTTGGGATCTCCCAAATTTCTTTGCCCTAATTCTTCCCGGTATTTCTCCGATCAGTTTTCCACCCGGCGAATTATCTCCTCTTCAA
GAGGTGTGTTCTTTCTTCTCCGTTTCGGTCCTACGATGTTGAGTGCGAGCGGAAGATTTCTTGGTTCTGTTAGGATGGCCGCTAAACCACTTACAACTGAGGCAATTGCC
ATAACTGAGAAGAAGATGGACATGGCTTTAGACGACATCATTAAAATGTCCAAGAATACGGGAAATAAAGGCAGGAAACAGAGAAGGTTACCGAACAAAATGCAGAAGTT
TCCAAATAATGCTACTCAAGATAGACCTAGGAAGTTGCAGCGATTCATGGACTCTAGATCTTCTCTGAGACAGGGGGCTTTGGCCAACAGAAGGTCAAACTTTCAAGGGA
ATCAGTTTCCTTTGGCAACGGAGGTTGCAAGAAAGGCTGCAGTTGCTCCTATTCGTCCTAGAGCTTTTACTCGTAGGGCACCCAATTGGAATAAAACAAGGGTTGAGGCT
CATCCACCGGTTCCGAGGAAGCCTTTCACCAATGGAAACTTTGTTCCCAAGGTATCTGCACCGGCCCAGCCACAAACAAATACCACGCCAAGACAGAGGCCACAGACTCT
GGACTCACTGTTTGCCAACATGAAGGAACAGAGGCTAAGAGTGTTGTCCCAGCGACAAAATGGGGGTGGGGCACAACAACGGAATGGTGGTCGCCAGCAACAAAGACCTC
CTTGGGGGAAGAGGCCGTTTTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGCTGTCATCTTTACCCCCACAACAAATTCAGGCTGCCCATCGATAAGCTAATATCTTTCGTCTACTCTTCCTTCGGCGGCAAGGAATTTGACTCTCCCGCGTA
CCTCTTACAACTTTTAATCTTTCCTTTGGGATCTCCCAAATTTCTTTGCCCTAATTCTTCCCGGTATTTCTCCGATCAGTTTTCCACCCGGCGAATTATCTCCTCTTCAA
GAGGTGTGTTCTTTCTTCTCCGTTTCGGTCCTACGATGTTGAGTGCGAGCGGAAGATTTCTTGGTTCTGTTAGGATGGCCGCTAAACCACTTACAACTGAGGCAATTGCC
ATAACTGAGAAGAAGATGGACATGGCTTTAGACGACATCATTAAAATGTCCAAGAATACGGGAAATAAAGGCAGGAAACAGAGAAGGTTACCGAACAAAATGCAGAAGTT
TCCAAATAATGCTACTCAAGATAGACCTAGGAAGTTGCAGCGATTCATGGACTCTAGATCTTCTCTGAGACAGGGGGCTTTGGCCAACAGAAGGTCAAACTTTCAAGGGA
ATCAGTTTCCTTTGGCAACGGAGGTTGCAAGAAAGGCTGCAGTTGCTCCTATTCGTCCTAGAGCTTTTACTCGTAGGGCACCCAATTGGAATAAAACAAGGGTTGAGGCT
CATCCACCGGTTCCGAGGAAGCCTTTCACCAATGGAAACTTTGTTCCCAAGGTATCTGCACCGGCCCAGCCACAAACAAATACCACGCCAAGACAGAGGCCACAGACTCT
GGACTCACTGTTTGCCAACATGAAGGAACAGAGGCTAAGAGTGTTGTCCCAGCGACAAAATGGGGGTGGGGCACAACAACGGAATGGTGGTCGCCAGCAACAAAGACCTC
CTTGGGGGAAGAGGCCGTTTTGGTAA
Protein sequenceShow/hide protein sequence
MASCHLYPHNKFRLPIDKLISFVYSSFGGKEFDSPAYLLQLLIFPLGSPKFLCPNSSRYFSDQFSTRRIISSSRGVFFLLRFGPTMLSASGRFLGSVRMAAKPLTTEAIA
ITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFTRRAPNWNKTRVEA
HPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKRPFW