CuGenDBv2

Cucsat.G11970 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G11970
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Pentatricopeptide repeat (PPR) superfamily protein isoform 2
Genome location: ctg1820:4099202..4103279
RNA-Seq Expression: Cucsat.G11970
Synteny: Cucsat.G11970
Gene Ontology terms: NA
InterPro domains: NA
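
The genome location field packs a contig name and a coordinate range into one string. A minimal sketch of splitting it apart is shown below; the names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'ctg1820:4099202..4103279'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomeLocation(contig, start, end)


loc = parse_location("ctg1820:4099202..4103279")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene region for this record spans 4078 bp on contig ctg1820.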


Homology
GenBank top hits | e value | % identity
KAA0044898.1 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa] | 1.60e-116 | 94.15
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGRQQ RPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPF
        GKRPF
Subjt:  GKRPF

KAE8649209.1 hypothetical protein Csa_014401 [Cucumis sativus] | 1.81e-138 | 100
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

XP_004148996.1 uncharacterized protein LOC101210049 [Cucumis sativus] | 1.10e-138 | 100
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

XP_008451954.1 PREDICTED: uncharacterized protein LOC103493102 [Cucumis melo] | 1.31e-126 | 94.17
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGRQQ RPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPFW
        GKRPFW
Subjt:  GKRPFW

XP_038888349.1 uncharacterized protein LOC120078194 isoform X1 [Benincasa hispida] | 2.26e-119 | 91.13
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRKLQRFMDSRSSLRQGALANRRS+FQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAF RRA NWNKTRV+  PPVPRKPF NG FVPKVSAPAQPQTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN GGAQ RNGG  QQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

TrEMBL top hits | e value | % identity
A0A0A0KUX2 Uncharacterized protein | 5.32e-139 | 100
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

A0A1S3BTH9 uncharacterized protein LOC103493102 | 6.32e-127 | 94.17
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGRQQ RPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPFW
        GKRPFW
Subjt:  GKRPFW

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 | 7.73e-117 | 94.15
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGRQQ RPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPF
        GKRPF
Subjt:  GKRPF

A0A5D3CYD8 Uncharacterized protein | 6.32e-127 | 94.17
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGRQQ RPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPFW
        GKRPFW
Subjt:  GKRPFW

A0A6J1C7V3 uncharacterized protein LOC111008218 | 3.52e-113 | 87.25
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQFPLA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVS--APAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWG
        APIRPR F RRAPNW+KTR +A PPV RKPFTNG F+PKV+  A AQPQTNT PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNGG +QQRPPWG
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVS--APAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWG

Query:  KRPF
        +  F
Subjt:  KRPF

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT4G10970.1 unknown protein | 8.5e-31 | 49.29
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.2 unknown protein | 8.5e-31 | 49.29
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.3 unknown protein | 8.5e-31 | 49.29
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.4 unknown protein | 8.5e-31 | 49.29
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.5 unknown protein | 2.5e-22 | 44.98
Query:  MDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFT-RRAPN
        MDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A  R R +   R  N
Subjt:  MDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFT-RRAPN

Query:  WNKTR---------VEAH---PPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQR
         N++          V+A    PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  QQ+
Subjt:  WNKTR---------VEAH---PPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQR

Query:  ---PPWGKR
            PW +R
Subjt:  ---PPWGKR


Sequences
CDS sequence
ATGGCCGCTAAACCACTTACAACAGAGGCAATTGCCATAACTGAGAAGAAGATGGACATGGCTTTAGACGACATCATTAAAATGTCCAAAAATACGGGAAATAAA
GGCAGGAAACAGAGAAGGTTACCGAACAAAATGCAGAAGTTTCCAAATAATGCTACTCAAGATAGACCTAGGAAGTTGCAGCGATTCATGGACTCTAGATCTTCT
CTGAGACAGGGGGCTTTGGCCAACAGAAGGTCAAACTTTCAAGGGAATCAGTTTCCTTTGGCAACGGAGGTTGCAAGAAAGGCTGCAGTTGCTCCTATTCGTCCT
AGAGCTTTTACTCGTAGGGCACCCAATTGGAATAAAACAAGGGTTGAGGCTCATCCACCGGTTCCGAGGAAGCCTTTCACCAATGGAAATTTTGTTCCCAAGGTA
TCTGCACCGGCCCAGCCACAAACAAATACCACGCCAAGACAGAGGCCACAGACTCTGGACTCACTGTTTGCCAACATGAAGGAACAGAGGCTAAGAGTGTTGTCC
CAGCGACAAAATGGGGGTGGGGCACAACAACGGAATGGTGGTCGCCAGCAACAAAGACCTCCTTGGGGGAAGAGGCCGTTTTGGTAA
mRNA sequence
ATGGCCGCTAAACCACTTACAACAGAGGCAATTGCCATAACTGAGAAGAAGATGGACATGGCTTTAGACGACATCATTAAAATGTCCAAAAATACGGGAAATAAA
GGCAGGAAACAGAGAAGGTTACCGAACAAAATGCAGAAGTTTCCAAATAATGCTACTCAAGATAGACCTAGGAAGTTGCAGCGATTCATGGACTCTAGATCTTCT
CTGAGACAGGGGGCTTTGGCCAACAGAAGGTCAAACTTTCAAGGGAATCAGTTTCCTTTGGCAACGGAGGTTGCAAGAAAGGCTGCAGTTGCTCCTATTCGTCCT
AGAGCTTTTACTCGTAGGGCACCCAATTGGAATAAAACAAGGGTTGAGGCTCATCCACCGGTTCCGAGGAAGCCTTTCACCAATGGAAATTTTGTTCCCAAGGTA
TCTGCACCGGCCCAGCCACAAACAAATACCACGCCAAGACAGAGGCCACAGACTCTGGACTCACTGTTTGCCAACATGAAGGAACAGAGGCTAAGAGTGTTGTCC
CAGCGACAAAATGGGGGTGGGGCACAACAACGGAATGGTGGTCGCCAGCAACAAAGACCTCCTTGGGGGAAGAGGCCGTTTTGGTAA
Protein sequence
MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRP
RAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKRPFW
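
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code and a CDS that starts in frame at its first base; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First five and last four codons of the CDS listed above.
print(translate("ATGGCCGCTAAACCA"))
print(translate("CCGTTTTGGTAA"))
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence above (after removing its line breaks) gives a quick consistency check on the record.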