; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G06420 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G06420
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein isoform 2
Genome locationChr4:4458943..4463049
RNA-Seq ExpressionCSPI04G06420
SyntenyCSPI04G06420
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044898.1 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa]9.3e-9694.15Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGR QQRPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPF
        GKRPF
Subjt:  GKRPF

KAE8649209.1 hypothetical protein Csa_014401 [Cucumis sativus]3.4e-106100Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

XP_004148996.1 uncharacterized protein LOC101210049 [Cucumis sativus]3.4e-106100Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

XP_008451954.1 PREDICTED: uncharacterized protein LOC103493102 [Cucumis melo]4.9e-9794.17Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGR QQRPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPFW
        GKRPFW
Subjt:  GKRPFW

XP_038888349.1 uncharacterized protein LOC120078194 isoform X1 [Benincasa hispida]1.8e-9191.13Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKG KQRR+PNKMQKFPNNA+Q+RPRKLQRFMDSRSSLRQGALANRRS+FQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAF RRA NWNKTRV+  PPVPRKPF NG FVPKVSAPAQPQTN TPRQRPQTLDSLFANMKEQRLRVL+QRQN GGA QRNGG  QQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

TrEMBL top hitse value%identityAlignment
A0A0A0KUX2 Uncharacterized protein1.7e-106100Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
        APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKR

Query:  PFW
        PFW
Subjt:  PFW

A0A1S3BTH9 uncharacterized protein LOC1034931022.4e-9794.17Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGR QQRPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPFW
        GKRPFW
Subjt:  GKRPFW

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 24.5e-9694.15Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGR QQRPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPF
        GKRPF
Subjt:  GKRPF

A0A5D3CYD8 Uncharacterized protein2.4e-9794.17Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW
        APIRPRAFTRRAPNWNKTRV+A PPVP+K FTNGNFVPKVSAPAQ QTN TPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG    QQRNGGR QQRPPW
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGA---QQRNGGRQQQRPPW

Query:  GKRPFW
        GKRPFW
Subjt:  GKRPFW

A0A6J1C7V3 uncharacterized protein LOC1110082186.5e-8787.25Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQFPLA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAV

Query:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKV--SAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWG
        APIRPR F RRAPNW+KTR +A PPV RKPFTNG F+PKV  +A AQPQTNT PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNGG +QQRPPWG
Subjt:  APIRPRAFTRRAPNWNKTRVEAHPPVPRKPFTNGNFVPKV--SAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWG

Query:  KRPF
        +  F
Subjt:  KRPF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10970.1 unknown protein5.0e-3149.29Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.2 unknown protein5.0e-3149.29Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.3 unknown protein5.0e-3149.29Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.4 unknown protein5.0e-3149.29Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA
        KP+TTE +A+TEKKMDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVA

Query:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ
          R R +   R  N N++R  A PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  Q
Subjt:  PIRPRAFT-RRAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQ

Query:  QR---PPWGKR
        Q+    PW +R
Subjt:  QR---PPWGKR

AT4G10970.5 unknown protein1.5e-2244.98Show/hide
Query:  MDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFT-RRAPN
        MDM+LD+IIKM K NT  NKG+KQR L NK +KF + A ++   K QR+MDSRS +RQGA A +RSNFQGNQFP+ T VARKAA A  R R +   R  N
Subjt:  MDMALDDIIKMSK-NTG-NKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFT-RRAPN

Query:  WNKTR---------VEAH---PPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQR
         N++          V+A    PP   +    G FV K          Q Q N      RQ PQTLDS FANMKE+R+R+     N         G  QQ+
Subjt:  WNKTR---------VEAH---PPVPRKPFTNGNFVPKVSAP-----AQPQTN---TTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQR

Query:  ---PPWGKR
            PW +R
Subjt:  ---PPWGKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTAAACCACTTACAACTGAGGCAATTGCCATAACTGAGAAGAAGATGGACATGGCTTTAGACGACATCATTAAAATGTCCAAAAATACGGGAAATAAAGGCAG
GAAACAGAGAAGGTTACCGAACAAAATGCAGAAGTTTCCAAATAATGCTACTCAAGATAGACCTAGGAAGTTGCAGCGATTCATGGACTCTAGATCTTCTCTGAGACAGG
GGGCTTTGGCCAACAGAAGGTCAAACTTTCAAGGGAATCAGTTTCCTTTGGCAACGGAGGTTGCAAGAAAGGCTGCAGTTGCTCCTATTCGTCCTAGAGCTTTTACTCGT
AGGGCACCCAATTGGAATAAAACAAGGGTTGAGGCTCATCCACCGGTTCCGAGGAAGCCTTTCACCAATGGAAATTTTGTTCCCAAGGTATCTGCACCGGCCCAGCCACA
AACAAATACCACGCCAAGACAGAGGCCACAGACTCTGGACTCACTGTTTGCCAACATGAAGGAACAGAGGCTAAGAGTGTTGTCCCAGCGACAAAATGGGGGTGGGGCAC
AACAACGGAATGGTGGTCGCCAGCAACAAAGACCTCCTTGGGGGAAGAGGCCGTTTTGGTAA
mRNA sequenceShow/hide mRNA sequence
CAGCTATCACGGACTCCAAATCCCTATCGAACATCCTTTAATATTATTTCTACACCAAGTTTCCTCTCAAACGACGTCGTTATCTTCTTCTATATATTACCCCTCCCTTG
CCCTATTTTTCCACCCAAGTCTACTCTTCCTTCGGCGGCAAGGAATTTGACTCTCCCGCGTACCTCTTACAACTTTTAATCTTTCCTTTGGGATCTCCCAAATTTCTTTG
CCCTAATTCTTCCCGGTATTTCTCCGATCAGCTTTCCACCCGGCGAATTATCTCCTCTTCAAGAGATGGCCGCTAAACCACTTACAACTGAGGCAATTGCCATAACTGAG
AAGAAGATGGACATGGCTTTAGACGACATCATTAAAATGTCCAAAAATACGGGAAATAAAGGCAGGAAACAGAGAAGGTTACCGAACAAAATGCAGAAGTTTCCAAATAA
TGCTACTCAAGATAGACCTAGGAAGTTGCAGCGATTCATGGACTCTAGATCTTCTCTGAGACAGGGGGCTTTGGCCAACAGAAGGTCAAACTTTCAAGGGAATCAGTTTC
CTTTGGCAACGGAGGTTGCAAGAAAGGCTGCAGTTGCTCCTATTCGTCCTAGAGCTTTTACTCGTAGGGCACCCAATTGGAATAAAACAAGGGTTGAGGCTCATCCACCG
GTTCCGAGGAAGCCTTTCACCAATGGAAATTTTGTTCCCAAGGTATCTGCACCGGCCCAGCCACAAACAAATACCACGCCAAGACAGAGGCCACAGACTCTGGACTCACT
GTTTGCCAACATGAAGGAACAGAGGCTAAGAGTGTTGTCCCAGCGACAAAATGGGGGTGGGGCACAACAACGGAATGGTGGTCGCCAGCAACAAAGACCTCCTTGGGGGA
AGAGGCCGTTTTGGTAACTGATGATTACACAACTCAAACCTGGTGGTGCTATGAATAAGCATATTCGTGTGGGGAATGTAGATGATGCTTGCTTGTCCATGGCACCCGAT
AGCTGATAGGGGATCAAAAGGAATGTGATTGTTTCTCGTCTTTGTTTTTTTTTTCCTTAAAAAATCCGAACTTTGCTGGCTTGCTATTTTTAGCGTTTTCTAGCTTCGCT
TTGAACCGATTATTATAGAAATTTGTTGCAATGTATGTAAAGTGTTTTTTCTTTTGTTCTTGTGGCCTATTTCCTCTGGCTTGTTCTGTTTAAATCAATCTCATTACACC
TCCGCTCCCTTCATCTTATTTCTAACACTACCCTGTATTGTTGCGAAGATTGTTATTAATTAAATTAGTCAGCTGCTGAAGGCCAA
Protein sequenceShow/hide protein sequence
MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRLPNKMQKFPNNATQDRPRKLQRFMDSRSSLRQGALANRRSNFQGNQFPLATEVARKAAVAPIRPRAFTR
RAPNWNKTRVEAHPPVPRKPFTNGNFVPKVSAPAQPQTNTTPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGAQQRNGGRQQQRPPWGKRPFW