CuGenDBv2

Cmc04g0104591 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0104591
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr04:22459629..22460276
RNA-Seq Expression: Cmc04g0104591
Synteny: Cmc04g0104591
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
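The genome location string can be split into a sequence ID and a coordinate pair. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of any CuGenDB tool, and it assumes the coordinates are 1-based and inclusive. Under that assumption the span is 648 bp, which matches a 215-residue protein plus a stop codon.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

seqid, start, end = parse_location("CMiso1.1chr04:22459629..22460276")
# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
span = end - start + 1
print(seqid, span)  # CMiso1.1chr04 648
```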


Homology
GenBank top hits    e value    % identity    Alignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]    6.4e-103    84.65
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        MSQ SYIDK+L RYKMQNSKKG LP+R+GIHL KEQCPKTPQEVEDM NIPY+S +GS+MYAMLCTR DICYS+GIVSRYQ+NPG DHWT VKNILKYLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
        RT++YMLVYG+KDLILTGYTDSDFQ+DK+ARK TS SVFTLN GAVVWRS+K++CIADSTMEAEYVA CEAAKEAVWL+KFLTDLEVVPNMHLPITLYCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAVANLREPRSYK
        NSGAVAN +EPRS+K
Subjt:  NSGAVANLREPRSYK

KAA0052272.1 gag/pol protein [Cucumis melo var. makuwa]    1.7e-103    86.05
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        MSQTSYIDK+L RYKMQNSKK LLPYRYGIHL KEQCPKTPQEV+DMSNIPYAS +GS+MYAMLCTR DICYS+GIVSRYQ+NPG DHWT VKNILKYLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
        RTKDYMLVYGSKDLILT YTDSDFQ+DK+ RK TS SVFTLN  AVVW+ IK+SCIADSTMEAEYVA+CEAAKEAVWLKKFLTDLE+VPN+HLPITLYCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAVANLREPRSYK
        NSGAVAN +EPRS+K
Subjt:  NSGAVANLREPRSYK

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]    4.9e-103    86.98
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        MSQTSYIDK+L RYKM NSKKGLLPYRYGIHL KEQCPKTPQEVEDMSNIPYAS +GS+MY MLCTR +ICYS+GIVSR Q+ PG DHWTTVKNILKYLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
        RTKDYMLVYGSKDLILTGYTD  FQTDK+ARK TS  VFT+N GAVVWRSIK+SCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLP TLYCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAVANLREPRSYK
        NSGAV N REPRS+K
Subjt:  NSGAVANLREPRSYK

TYK04838.1 gag/pol protein [Cucumis melo var. makuwa]    1.2e-101    85.58
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        MSQTSYIDK+L RYKMQNSKKGLLPY+YGIHL KEQC KTPQEVEDM NIPYAS +GS+MYAMLCTRLDICYS+GIVSRYQ+N   DHWT VKNILKYLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
        RTKDYML+YGSKDLILTGYTDSDFQTDKNARK TS SVFTLN GAVVWRSIK+SC ADSTMEAEYVA CE AKE VWLKKFLTDL+V PNMH PITLYCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAVANLREPRSYK
        NSG VAN +EPRS+K
Subjt:  NSGAVANLREPRSYK

TYK11050.1 gag/pol protein [Cucumis melo var. makuwa]    3.7e-103    86.98
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        MSQTSYIDK+L RYKMQNSKKGLLPYRYGIHL KEQCPKTPQEVEDMSNIPYAS IGS+MYAMLCTR+DICYS+GIV+RYQ+NPG DHWT VKNILKYLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
        RTKDYMLVYGSKDLILTGYTDSDFQTDK+ARK TS S+FTLN GAVVW+SIK+SCIA STMEA+YVA  EAAKEAV  KKFLTDLEVVPNMHLPITLYCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAVANLREPRSYK
        NSGAV N R PRS+K
Subjt:  NSGAVANLREPRSYK

TrEMBL top hits    e value    % identity    Alignment
A0A5A7U945 Gag/pol protein    8.1e-104    86.05
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        MSQTSYIDK+L RYKMQNSKK LLPYRYGIHL KEQCPKTPQEV+DMSNIPYAS +GS+MYAMLCTR DICYS+GIVSRYQ+NPG DHWT VKNILKYLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
        RTKDYMLVYGSKDLILT YTDSDFQ+DK+ RK TS SVFTLN  AVVW+ IK+SCIADSTMEAEYVA+CEAAKEAVWLKKFLTDLE+VPN+HLPITLYCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAVANLREPRSYK
        NSGAVAN +EPRS+K
Subjt:  NSGAVANLREPRSYK

A0A5D3BX45 Gag/pol protein    2.4e-103    86.98
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        MSQTSYIDK+L RYKM NSKKGLLPYRYGIHL KEQCPKTPQEVEDMSNIPYAS +GS+MY MLCTR +ICYS+GIVSR Q+ PG DHWTTVKNILKYLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
        RTKDYMLVYGSKDLILTGYTD  FQTDK+ARK TS  VFT+N GAVVWRSIK+SCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLP TLYCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAVANLREPRSYK
        NSGAV N REPRS+K
Subjt:  NSGAVANLREPRSYK

A0A5D3C0P2 Gag/pol protein    5.8e-102    85.58
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        MSQTSYIDK+L RYKMQNSKKGLLPY+YGIHL KEQC KTPQEVEDM NIPYAS +GS+MYAMLCTRLDICYS+GIVSRYQ+N   DHWT VKNILKYLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
        RTKDYML+YGSKDLILTGYTDSDFQTDKNARK TS SVFTLN GAVVWRSIK+SC ADSTMEAEYVA CE AKE VWLKKFLTDL+V PNMH PITLYCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAVANLREPRSYK
        NSG VAN +EPRS+K
Subjt:  NSGAVANLREPRSYK

A0A5D3CI71 Gag/pol protein    1.8e-103    86.98
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        MSQTSYIDK+L RYKMQNSKKGLLPYRYGIHL KEQCPKTPQEVEDMSNIPYAS IGS+MYAMLCTR+DICYS+GIV+RYQ+NPG DHWT VKNILKYLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
        RTKDYMLVYGSKDLILTGYTDSDFQTDK+ARK TS S+FTLN GAVVW+SIK+SCIA STMEA+YVA  EAAKEAV  KKFLTDLEVVPNMHLPITLYCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAVANLREPRSYK
        NSGAV N R PRS+K
Subjt:  NSGAVANLREPRSYK

E2GK51 Gag/pol protein (Fragment)    3.1e-103    84.65
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        MSQ SYIDK+L RYKMQNSKKG LP+R+GIHL KEQCPKTPQEVEDM NIPY+S +GS+MYAMLCTR DICYS+GIVSRYQ+NPG DHWT VKNILKYLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
        RT++YMLVYG+KDLILTGYTDSDFQ+DK+ARK TS SVFTLN GAVVWRS+K++CIADSTMEAEYVA CEAAKEAVWL+KFLTDLEVVPNMHLPITLYCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAVANLREPRSYK
        NSGAVAN +EPRS+K
Subjt:  NSGAVANLREPRSYK

SwissProt top hits    e value    % identity    Alignment
P04146 Copia protein    4.5e-27    32.88
Query:  MSQTSYIDKILLRYKMQ--NSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKY
        +SQ++Y+ KIL ++ M+  N+    LP +    LL           ++  N P  S+IG +MY MLCTR D+  ++ I+SRY +    + W  +K +L+Y
Subjt:  MSQTSYIDKILLRYKMQ--NSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKY

Query:  LRRTKDYMLVYGSKDLI----LTGYTDSDFQTDKNARKFTSRSVFTL-NRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHL
        L+ T D  L++  K+L     + GY DSD+   +  RK T+  +F + +   + W + +++ +A S+ EAEY+A  EA +EA+WLK  LT + +   +  
Subjt:  LRRTKDYMLVYGSKDLI----LTGYTDSDFQTDKNARKFTSRSVFTL-NRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHL

Query:  PITLYCDNSGAVANLREPRSYK
        PI +Y DN G ++    P  +K
Subjt:  PITLYCDNSGAVANLREPRSYK

P0CV72 Secreted RxLR effector protein 161    6.1e-24    41.35
Query:  MSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLRRTKDYMLVYGSKDLI-LTGYTDSDFQTDKNARKFTSRSVFTLNRGA
        M N+PY S +G++MY M+ TR D+  ++G++S++ ++P   HW  +K +L+YL+ T+ Y L +       L GY+D+D+  D  +R+ TS  +F LN G 
Subjt:  MSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLRRTKDYMLVYGSKDLI-LTGYTDSDFQTDKNARKFTSRSVFTLNRGA

Query:  VVWRSIKKSCIADSTMEAEYVATCEAAKEAVWL
        V WRS K+  +A S+ E EY+A  EA +EAVWL
Subjt:  VVWRSIKKSCIADSTMEAEYVATCEAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    9.7e-46    43.41
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        +SQ  YI+++L R+ M+N+K    P    + L K+ CP T +E  +M+ +PY+S +GS+MYAM+CTR DI +++G+VSR+  NPG +HW  VK IL+YLR
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD
         T    L +G  D IL GYTD+D   D + RK ++  +FT + GA+ W+S  + C+A ST EAEY+A  E  KE +WLK+FL +L +    ++   +YCD
Subjt:  RTKDYMLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCD

Query:  NSGAV
        +  A+
Subjt:  NSGAV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    3.8e-18    29.27
Query:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR
        +SQ  Y   +L R  M  +K    P      L      K P   E      Y  ++GS+ Y +  TR D+ Y++  +S+Y   P  DHW  +K +L+YL 
Subjt:  MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLR

Query:  RTKDY-MLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYC
         T D+ + +     L L  Y+D+D+  D +    T+  +  L    + W S K+  +  S+ EAEY +    + E  W+   LT+L +   +  P  +YC
Subjt:  RTKDY-MLVYGSKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYC

Query:  DNSGA
        DN GA
Subjt:  DNSGA

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    8.8e-18    32.26
Query:  YASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLRRTKDYMLVYGSK-DLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRS
        Y  +IG +MY  + TRLDI +++  +S++   P   H   V  IL Y++ T    L Y S+ ++ L  ++D+ FQ+ K+ R+ T+     L    + W+S
Subjt:  YASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLRRTKDYMLVYGSK-DLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRS

Query:  IKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAV
         K+  ++ S+ EAEY A   A  E +WL +F  +L++   +  P  L+CDN+ A+
Subjt:  IKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAV
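In the alignments above, the middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identities and positives can be counted directly from that line. The helper name `midline_stats` is hypothetical, and the input is the final 15-column block of the first GenBank hit.

```python
def midline_stats(midline):
    """Count identities and positives in a BLAST-style alignment midline.

    Letters are identities; '+' marks a conservative substitution.
    Returns (identities, positives, alignment_length).
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

ident, pos, length = midline_stats("NSGAVAN +EPRS+K")
print(ident, pos, length)  # 12 14 15
print(f"{100 * ident / length:.2f}% identity over this block")
```

The percent identity reported for a hit is the same ratio taken over the full alignment, so the per-block counts have to be summed before dividing.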


Sequences
CDS sequence
ATGTCTCAAACATCTTATATAGACAAAATTTTGTTAAGATATAAGATGCAGAATTCCAAAAAGGGTCTGCTTCCGTACAGATATGGAATTCATTTATTAAAAGAACAATG
TCCAAAGACACCTCAAGAAGTTGAGGATATGAGTAACATTCCATATGCTTCTGTTATTGGAAGCATGATGTATGCAATGTTATGTACTAGACTTGACATTTGCTATTCAA
TAGGGATAGTTAGTAGATATCAGACTAATCCTGGATGTGATCATTGGACAACCGTTAAGAATATTTTAAAATACCTTAGAAGAACAAAAGACTACATGCTTGTGTATGGT
TCTAAAGATCTGATCCTTACTGGATACACTGACTCCGATTTTCAAACTGATAAAAATGCTAGAAAGTTTACATCAAGATCAGTTTTCACTCTGAATAGAGGAGCAGTAGT
GTGGAGAAGCATAAAAAAATCATGTATTGCCGACTCCACTATGGAAGCTGAATATGTAGCTACCTGTGAAGCAGCAAAGGAAGCAGTATGGCTTAAAAAGTTCTTAACAG
ATTTGGAAGTTGTTCCAAATATGCATTTGCCAATCACCTTATACTGTGACAATAGTGGTGCAGTTGCAAATTTACGAGAACCTAGAAGTTATAAATGA
mRNA sequence
ATGTCTCAAACATCTTATATAGACAAAATTTTGTTAAGATATAAGATGCAGAATTCCAAAAAGGGTCTGCTTCCGTACAGATATGGAATTCATTTATTAAAAGAACAATG
TCCAAAGACACCTCAAGAAGTTGAGGATATGAGTAACATTCCATATGCTTCTGTTATTGGAAGCATGATGTATGCAATGTTATGTACTAGACTTGACATTTGCTATTCAA
TAGGGATAGTTAGTAGATATCAGACTAATCCTGGATGTGATCATTGGACAACCGTTAAGAATATTTTAAAATACCTTAGAAGAACAAAAGACTACATGCTTGTGTATGGT
TCTAAAGATCTGATCCTTACTGGATACACTGACTCCGATTTTCAAACTGATAAAAATGCTAGAAAGTTTACATCAAGATCAGTTTTCACTCTGAATAGAGGAGCAGTAGT
GTGGAGAAGCATAAAAAAATCATGTATTGCCGACTCCACTATGGAAGCTGAATATGTAGCTACCTGTGAAGCAGCAAAGGAAGCAGTATGGCTTAAAAAGTTCTTAACAG
ATTTGGAAGTTGTTCCAAATATGCATTTGCCAATCACCTTATACTGTGACAATAGTGGTGCAGTTGCAAATTTACGAGAACCTAGAAGTTATAAATGA
Protein sequence
MSQTSYIDKILLRYKMQNSKKGLLPYRYGIHLLKEQCPKTPQEVEDMSNIPYASVIGSMMYAMLCTRLDICYSIGIVSRYQTNPGCDHWTTVKNILKYLRRTKDYMLVYG
SKDLILTGYTDSDFQTDKNARKFTSRSVFTLNRGAVVWRSIKKSCIADSTMEAEYVATCEAAKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANLREPRSYK
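The protein sequence corresponds to a translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code; `translate` is a hypothetical helper written for illustration, and only the first 48 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 48 nt of the CDS sequence listed above.
prefix = "ATGTCTCAAACATCTTATATAGACAAAATTTTGTTAAGATATAAGATG"
print(translate(prefix))  # MSQTSYIDKILLRYKM
```

Passing the full CDS (with line breaks removed) to the same function should give the full protein sequence, under the same standard-code assumption.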