; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G03230 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G03230
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBEST Arabidopsis thaliana protein match is: embryo defective 2170 .
Genome locationClcChr08:7669455..7676287
RNA-Seq ExpressionClc08G03230
SyntenyClc08G03230
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059829.1 putative serine/threonine-protein kinase ndrD [Cucumis melo var. makuwa]8.2e-10284.65Show/hide
Query:  ASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALG
        +S S TRSSRPALNHDIFRSWNGKQIHLRDD P EYGFRL+SPQRSPQFYRSNY +LSP SKALAIATGQKELME+VNNMPESCYELSLRDLVEQPM +G
Subjt:  ASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALG

Query:  EREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVS
        +RE+TGV+ RDSNLGG RE+FSRENRKSRKETRALVGRS  ++EN GLYLKMG PKSIG TTT+KKKKKNDS  NMSAKVSPKPPQLVEKDWWKRRLSVS
Subjt:  EREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVS

Query:  SESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR
        SESESV+YGS+VNNGS+KS SSSSSNGS   NK+RTK+TGR
Subjt:  SESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR

KAG6592161.1 hypothetical protein SDJN03_14507, partial [Cucurbita argyrosperma subsp. sororia]6.5e-9978.88Show/hide
Query:  ALDAIEFASASASATR-SSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLR
        A DA++F S +++ +R SSRPALNHDIFRSWNGKQIHL+DD+ +EYGFRLSSPQRSPQFYRSNYQSLSP SKALAIATGQKELME+VNNMPESCYELSLR
Subjt:  ALDAIEFASASASATR-SSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLR

Query:  DLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEK
        DLVEQPM LG++E TG N RD   GGDRE+F+ ENRKS+KET ALVGRS+ N+ENGGLYLKMG P SIGT T KKKKK NDSG N SAKVSPKP   VEK
Subjt:  DLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEK

Query:  DWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR
        DWWKRRLSVSSE+ SVAY SSVNNGS+KS SSSSSNGSN  NKNRTK++GR
Subjt:  DWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR

XP_004146783.1 uncharacterized protein LOC101215856 [Cucumis sativus]9.1e-10184.1Show/hide
Query:  ASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALGER
        +S TRSSRPALNHDIFRSWNGKQIHLRDD P EYGFRL+SPQRSPQFYRSNY +LSP SKALAIATGQKELME+VNNMPESCYELSLRDLVEQPM LG+R
Subjt:  ASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALGER

Query:  EETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVSSE
        E+TGV+ +DS LGGDRE+FSRENRKSRKETRALVGR+  ++EN GLYLKMG PKSIG TTT+KKKKKNDS  NMSAKVSPKPPQLVEKDWWKRRLSVSSE
Subjt:  EETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVSSE

Query:  SESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR
        SES++YGS+VNNGS+KS SSSSS+GS   NKNRTK+TGR
Subjt:  SESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR

XP_008466690.1 PREDICTED: uncharacterized protein LOC103504041 [Cucumis melo]8.2e-10284.65Show/hide
Query:  ASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALG
        +S S TRSSRPALNHDIFRSWNGKQIHLRDD P EYGFRL+SPQRSPQFYRSNY +LSP SKALAIATGQKELME+VNNMPESCYELSLRDLVEQPM +G
Subjt:  ASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALG

Query:  EREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVS
        +RE+TGV+ RDSNLGG RE+FSRENRKSRKETRALVGRS  ++EN GLYLKMG PKSIG TTT+KKKKKNDS  NMSAKVSPKPPQLVEKDWWKRRLSVS
Subjt:  EREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVS

Query:  SESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR
        SESESV+YGS+VNNGS+KS SSSSSNGS   NK+RTK+TGR
Subjt:  SESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR

XP_038883973.1 uncharacterized protein LOC120074937 [Benincasa hispida]1.2e-10886.9Show/hide
Query:  ALDAIEFASASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRD
        A +A+EFAS S S TRSSRPALNHDIFRSWNGKQIHLRDD PLEYGFRL+SPQRSPQFYRSNYQSLSP SKALAIATGQ+ELMEMVNNMPESCYELSLRD
Subjt:  ALDAIEFASASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRD

Query:  LVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKK--KNDSGSNMSAKVSPKPPQLVE
        LVEQPM LGEREE GVN RD NLGGDRE+FSRENRKS+KETRALVG+S   +ENGGLYLKMGLPKSI TTTTKKKKK  KNDSG NMSAKVSPKPPQLVE
Subjt:  LVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKK--KNDSGSNMSAKVSPKPPQLVE

Query:  KDWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR
        KDWWKRRLSVSSESES AYGS +NNGS+KS SSSSSNGSN  N NRTKATGR
Subjt:  KDWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR

TrEMBL top hitse value%identityAlignment
A0A0A0KH64 Uncharacterized protein4.4e-10184.1Show/hide
Query:  ASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALGER
        +S TRSSRPALNHDIFRSWNGKQIHLRDD P EYGFRL+SPQRSPQFYRSNY +LSP SKALAIATGQKELME+VNNMPESCYELSLRDLVEQPM LG+R
Subjt:  ASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALGER

Query:  EETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVSSE
        E+TGV+ +DS LGGDRE+FSRENRKSRKETRALVGR+  ++EN GLYLKMG PKSIG TTT+KKKKKNDS  NMSAKVSPKPPQLVEKDWWKRRLSVSSE
Subjt:  EETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVSSE

Query:  SESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR
        SES++YGS+VNNGS+KS SSSSS+GS   NKNRTK+TGR
Subjt:  SESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR

A0A1S4E6L8 uncharacterized protein LOC1035040414.0e-10284.65Show/hide
Query:  ASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALG
        +S S TRSSRPALNHDIFRSWNGKQIHLRDD P EYGFRL+SPQRSPQFYRSNY +LSP SKALAIATGQKELME+VNNMPESCYELSLRDLVEQPM +G
Subjt:  ASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALG

Query:  EREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVS
        +RE+TGV+ RDSNLGG RE+FSRENRKSRKETRALVGRS  ++EN GLYLKMG PKSIG TTT+KKKKKNDS  NMSAKVSPKPPQLVEKDWWKRRLSVS
Subjt:  EREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVS

Query:  SESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR
        SESESV+YGS+VNNGS+KS SSSSSNGS   NK+RTK+TGR
Subjt:  SESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR

A0A5A7V1Z3 Putative serine/threonine-protein kinase ndrD4.0e-10284.65Show/hide
Query:  ASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALG
        +S S TRSSRPALNHDIFRSWNGKQIHLRDD P EYGFRL+SPQRSPQFYRSNY +LSP SKALAIATGQKELME+VNNMPESCYELSLRDLVEQPM +G
Subjt:  ASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALG

Query:  EREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVS
        +RE+TGV+ RDSNLGG RE+FSRENRKSRKETRALVGRS  ++EN GLYLKMG PKSIG TTT+KKKKKNDS  NMSAKVSPKPPQLVEKDWWKRRLSVS
Subjt:  EREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVS

Query:  SESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR
        SESESV+YGS+VNNGS+KS SSSSSNGS   NK+RTK+TGR
Subjt:  SESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR

A0A6J1F9W6 uncharacterized protein LOC1114436072.0e-9878.97Show/hide
Query:  ALDAIEFASASASATR-SSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLR
        A DA++F S +++ +R SSRPALNHDIFRSWNGKQIHL+DD+ +EYGFRLSSPQRSPQFYRSNYQSLSP SKALAIATGQKELME+VNNMPESCYELSLR
Subjt:  ALDAIEFASASASATR-SSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLR

Query:  DLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKK-NDSGSNMSAKVSPKPPQLVE
        DLVEQPM LG++E TG N RD   GGDRE+F+ ENRKS+KET ALVGRS+ N+ENGGLYLKMG P SIGT T KKKKKK NDSG N SAKVSPKP   VE
Subjt:  DLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKK-NDSGSNMSAKVSPKPPQLVE

Query:  KDWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR
        KDWWKRRLSVSSE+ SVAY SSVNNGS+KS SSSSSNGSN  NKNRTK++GR
Subjt:  KDWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR

A0A6J1IJ72 uncharacterized protein LOC1114739111.0e-9777.69Show/hide
Query:  ALDAIEFASASASATR-SSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLR
        A DA++F S +++ +R SSRPALNHDIFRSWNGKQIHL+DD+ +EYGFRLSSPQRSPQFYRSNYQSLSP SK+LAIATGQKELME+VNNMPESCYELSLR
Subjt:  ALDAIEFASASASATR-SSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLR

Query:  DLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEK
        DLVEQPM LG++E TG N RD   GGDRE+F+ ENRKS+KET  LVGRS+ N+ENGGLYLKMG P SIGT T KKKKK NDSG N SAKVSPKP   VEK
Subjt:  DLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEK

Query:  DWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR
        DWWKRRLSVSSE+ SVAY S VNNGS+KS SSSSSNGSN  NKNRTK++GR
Subjt:  DWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21390.1 embryo defective 21701.5e-2142Show/hide
Query:  FRLSSPQRSPQFYR-SNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSR-----KE
        +R S P+  P F+R  +Y SLSP SKA AIA GQ+ELMEMV+ MPESCYELSL+DLV          E  VN  +     D E+  R NR+S+     K 
Subjt:  FRLSSPQRSPQFYR-SNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSR-----KE

Query:  TRALVGRSTGNLENGGLYLKMGLPKSIGT---TTTKKKKKKNDSGSNMSAKVSPKPP------QLVEKDWWKRRLSVSSESESVAYGSSVNNGSVKSCSS
         + +    +G   N G  LK+    S+G    TT KKKKKK D     + KVSP+P       ++ +K+WW R     SES +   GSS +N S++S SS
Subjt:  TRALVGRSTGNLENGGLYLKMGLPKSIGT---TTTKKKKKKNDSGSNMSAKVSPKPP------QLVEKDWWKRRLSVSSESESVAYGSSVNNGSVKSCSS

AT1G76980.1 BEST Arabidopsis thaliana protein match is: embryo defective 2170 (TAIR:AT1G21390.1)6.9e-2237.98Show/hide
Query:  SSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRA---LVG
        +SP +SP    +NYQ+LSP +KA  IA GQ+ELM+MV+ MPESCYELSL+DLVE            VN  +  +  +     ++ RK  ++T++   +  
Subjt:  SSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRA---LVG

Query:  RSTGNLENGGLYLKMGLPKSIGT-TTTKKKKKKNDSGSNMSAK----VSPKPP------QLVEKDWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSN
           G + N G  LK+  P S+G    T KKK  +D  S++++K     SP+P       +  +KDWWK  LS S  S+SV   S +N+GS KS   SSS 
Subjt:  RSTGNLENGGLYLKMGLPKSIGT-TTTKKKKKKNDSGSNMSAK----VSPKPP------QLVEKDWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSN

Query:  GSNRDNKN
         ++  ++N
Subjt:  GSNRDNKN

AT1G76980.2 FUNCTIONS IN: molecular_function unknown9.0e-2237.21Show/hide
Query:  SSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRA---LVG
        +SP +SP    +NYQ+LSP +KA  IA GQ+ELM+MV+ MPESCYELSL+DLVE            VN  +  +  +     ++ RK  ++T++   +  
Subjt:  SSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDLVEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRA---LVG

Query:  RSTGNLENGGLYLKMGLPKSIGT-TTTKKKKKKNDSGSNMSAK----VSPKPP------QLVEKDWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSN
           G + N G  LK+  P S+G    T KKK  +D  S++++K     SP+P       +  +KDWWK  LS S  S+SV   S +N+GS KS   SSS 
Subjt:  RSTGNLENGGLYLKMGLPKSIGT-TTTKKKKKKNDSGSNMSAK----VSPKPP------QLVEKDWWKRRLSVSSESESVAYGSSVNNGSVKSCSSSSSN

Query:  GSNRDNKNRTKATGR
         ++  ++N  +   R
Subjt:  GSNRDNKNRTKATGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAAGCTTGGAAATCAGCAATTGGGCTTTGGACGCCATCGAATTCGCTTCTGCTTCTGCTTCTGCAACAAGGTCCTCCAGACCGGCTTTGAATCACGATATCTTCAG
GAGTTGGAATGGGAAGCAGATTCATCTTAGGGATGATCAACCGTTGGAATATGGATTTCGTTTAAGTAGTCCTCAACGGAGCCCTCAGTTTTACCGATCGAATTACCAGA
GCCTGTCGCCGACGTCCAAAGCTCTCGCTATAGCTACTGGACAGAAGGAGCTTATGGAAATGGTGAACAATATGCCGGAGTCGTGTTACGAGTTGTCGTTGAGAGACTTG
GTGGAGCAGCCGATGGCTTTGGGGGAACGAGAAGAGACTGGAGTGAATGCGAGGGACTCTAATCTGGGCGGCGATCGGGAGATTTTTTCGAGAGAGAATCGGAAATCGAG
GAAGGAAACTAGGGCTTTGGTTGGTAGAAGTACAGGGAATCTGGAGAATGGAGGTTTGTATCTGAAAATGGGACTTCCAAAATCTATTGGAACGACGACGACAAAGAAGA
AGAAGAAGAAGAATGATTCTGGTTCGAATATGAGTGCTAAAGTTTCGCCTAAACCTCCTCAGTTGGTGGAAAAGGATTGGTGGAAGAGAAGACTCTCAGTTTCATCTGAG
AGTGAAAGCGTTGCTTATGGTTCTAGTGTTAACAATGGAAGCGTCAAGAGCTGTAGCAGTAGCAGCAGCAATGGCAGCAATCGCGACAACAAGAATAGAACAAAAGCCAC
TGGCAGGCTTACTCCACCATCAACTCTACCACCAAGATTATCACTAACGGTTACTCCACCATCTATTACCCCTCCAATTACCCAACGGCCCTTTATTAAACCGATTTGGA
ACCCTAATATCCCAAGTTCATCTCAAATTCCCTCACCTTACACTTACACCCCCGCTCAACCTAGTTTTCCATATCCGCCAGCATTCAATGGCTCTGATACCATGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAAGCTTGGAAATCAGCAATTGGGCTTTGGACGCCATCGAATTCGCTTCTGCTTCTGCTTCTGCAACAAGGTCCTCCAGACCGGCTTTGAATCACGATATCTTCAG
GAGTTGGAATGGGAAGCAGATTCATCTTAGGGATGATCAACCGTTGGAATATGGATTTCGTTTAAGTAGTCCTCAACGGAGCCCTCAGTTTTACCGATCGAATTACCAGA
GCCTGTCGCCGACGTCCAAAGCTCTCGCTATAGCTACTGGACAGAAGGAGCTTATGGAAATGGTGAACAATATGCCGGAGTCGTGTTACGAGTTGTCGTTGAGAGACTTG
GTGGAGCAGCCGATGGCTTTGGGGGAACGAGAAGAGACTGGAGTGAATGCGAGGGACTCTAATCTGGGCGGCGATCGGGAGATTTTTTCGAGAGAGAATCGGAAATCGAG
GAAGGAAACTAGGGCTTTGGTTGGTAGAAGTACAGGGAATCTGGAGAATGGAGGTTTGTATCTGAAAATGGGACTTCCAAAATCTATTGGAACGACGACGACAAAGAAGA
AGAAGAAGAAGAATGATTCTGGTTCGAATATGAGTGCTAAAGTTTCGCCTAAACCTCCTCAGTTGGTGGAAAAGGATTGGTGGAAGAGAAGACTCTCAGTTTCATCTGAG
AGTGAAAGCGTTGCTTATGGTTCTAGTGTTAACAATGGAAGCGTCAAGAGCTGTAGCAGTAGCAGCAGCAATGGCAGCAATCGCGACAACAAGAATAGAACAAAAGCCAC
TGGCAGGCTTACTCCACCATCAACTCTACCACCAAGATTATCACTAACGGTTACTCCACCATCTATTACCCCTCCAATTACCCAACGGCCCTTTATTAAACCGATTTGGA
ACCCTAATATCCCAAGTTCATCTCAAATTCCCTCACCTTACACTTACACCCCCGCTCAACCTAGTTTTCCATATCCGCCAGCATTCAATGGCTCTGATACCATGAAATGA
Protein sequenceShow/hide protein sequence
MLSLEISNWALDAIEFASASASATRSSRPALNHDIFRSWNGKQIHLRDDQPLEYGFRLSSPQRSPQFYRSNYQSLSPTSKALAIATGQKELMEMVNNMPESCYELSLRDL
VEQPMALGEREETGVNARDSNLGGDREIFSRENRKSRKETRALVGRSTGNLENGGLYLKMGLPKSIGTTTTKKKKKKNDSGSNMSAKVSPKPPQLVEKDWWKRRLSVSSE
SESVAYGSSVNNGSVKSCSSSSSNGSNRDNKNRTKATGRLTPPSTLPPRLSLTVTPPSITPPITQRPFIKPIWNPNIPSSSQIPSPYTYTPAQPSFPYPPAFNGSDTMK