; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0018743 (gene) of Chayote v1 genome

Gene IDSed0018743
OrganismSechium edule (Chayote v1)
DescriptionWRC domain-containing protein
Genome locationLG09:345291..346911
RNA-Seq ExpressionSed0018743
SyntenySed0018743
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575189.1 hypothetical protein SDJN03_25828, partial [Cucurbita argyrosperma subsp. sororia]2.2e-8971.72Show/hide
Query:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------E
        MRIRKNA KLSPLLFSAVE VP+VL THVCQLNQSPWDVI LEQHA +QLE+ EDSF ENASLG SIGAVESVAS+ME SA LS N+ V          +
Subjt:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------E

Query:  MADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD---HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP
          DY EDLDEN D FEKLVD +L   KQF +EE+YSSFS D   HHRR+ LRSSENNYYST NN SI+KKS AG ++SRR RP +ASKKAV +AA GSNP
Subjt:  MADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD---HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP

Query:  YEFYYYSGFGPLWGKKRRDRGG-------ENTTGVRSNAT--TPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM
        YEFYYYSGFGPLWGKKRR+RGG       EN  G+RSNAT  +PSPSE+DG+ELDYVE+DDDD+ +E+D DGGKKRMRKPVKARSLKSLM
Subjt:  YEFYYYSGFGPLWGKKRRDRGG-------ENTTGVRSNAT--TPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM

XP_008458199.1 PREDICTED: uncharacterized protein LOC103497701 [Cucumis melo]8.5e-9773.45Show/hide
Query:  MRIRKNAKKLSPLLFSAVE--SVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV---------
        MRIRKNA KLSPLLFSA+E  SVPEVL TH+CQLNQSPWDVI L QH TNQLE+ EDSF  NAS GDS+GAVESV+S+MEESA LS N+ V         
Subjt:  MRIRKNAKKLSPLLFSAVE--SVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV---------

Query:  -EMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD-HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP
         +  DY ED DEN +GFEKLVD NLK KKQF +EE+YSSFSSD HHRR+ LRSSENNYYST +NCSI+KKS AGG++SRR RPAR+SKKAV  AAGGSNP
Subjt:  -EMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD-HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP

Query:  YEFYYYSGFGPLWGKKRRDRGG--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEE-DDDGGKKRMRKPVKARSLKSLM
        YEFYYYSGFGPLWGKKRRDRGG        ENTTG+RSNATTPSPS++  +ELDYVEDD+DD+++E+ D DGGKKRMRKPVKARSLKSLM
Subjt:  YEFYYYSGFGPLWGKKRRDRGG--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEE-DDDGGKKRMRKPVKARSLKSLM

XP_011656350.1 uncharacterized protein LOC105435722 [Cucumis sativus]6.7e-9471.82Show/hide
Query:  MRIRKNAKKLSPLLFSAVE--SVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV---------
        MRIRKNA KLSPLLFSA+E  SVPE L TH+CQLNQSPWDVI L QH TNQLE+ EDSF ENAS GDS+GAVESV+S+MEES  LS N+ V         
Subjt:  MRIRKNAKKLSPLLFSAVE--SVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV---------

Query:  -EMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD-HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP
         +  DY ED DEN +GFEKLVD +LKCKKQF +EE+YSSFSSD HHRR+ LRSSENNYYST +NCSI+KKS  GG++SRR RPAR SKKAV  AAGGSNP
Subjt:  -EMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD-HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP

Query:  YEFYYYSGFGPLWGKKRRDRGG--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEED--DDGGKKRMRKPVKARSLKSLM
        YEFYYYSGFGPLWGKKRRDRGG        ENTTG+RSNATTPSPS  D +ELDYVE D+DD++++ D   +GGKKRMRKPVKARSLKSLM
Subjt:  YEFYYYSGFGPLWGKKRRDRGG--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEED--DDGGKKRMRKPVKARSLKSLM

XP_022958971.1 uncharacterized protein LOC111460101 [Cucurbita moschata]1.7e-8971.38Show/hide
Query:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------E
        MRIRKNA KLSPLLFSAVE VP+VL THVCQLNQSPWDVI LEQHA +QLE+ EDSF ENASLG SIGAVESVAS+ME SA LS N+ V          +
Subjt:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------E

Query:  MADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD---HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP
          DY ED DEN D FEKLVD +L   KQF +EE+YSSFS D   HHRR+ LRSSENNYYST NN SI+KKS AG ++SRR RP +ASKKAV +AA GSNP
Subjt:  MADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD---HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP

Query:  YEFYYYSGFGPLWGKKRRDRGG-------ENTTGVRSNAT--TPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM
        YEFYYYSGFGPLWGKKRR+RGG       EN  G+RSNAT  +PSPSE+DG+ELDYVE+DDDD+++E+D DGGKKRMRKPVKARSLKSLM
Subjt:  YEFYYYSGFGPLWGKKRRDRGG-------ENTTGVRSNAT--TPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM

XP_038875417.1 uncharacterized protein LOC120067877 [Benincasa hispida]1.8e-9974.22Show/hide
Query:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------E
        MRIRKNA KLSPLLFSAVESVPEVL TH+CQLNQSPWDVI L QH TNQ+E+ EDSF ENASL DSIGAVESVAS+MEESA LS N+            +
Subjt:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------E

Query:  MADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSDHH-RRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNPYE
          DY EDL+EN DGFEKLVD +LKCKKQF +EE+YSSFSSDHH RR+ LRSSENNYYST NNCSI KKS AGG++SRR RPAR+SKK V  AAGGSNPYE
Subjt:  MADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSDHH-RRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNPYE

Query:  FYYYSGFGPLWGKKRRDRGG--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM
        FYYYSGFGPLWGKKRR+RGG        ENTTG+RSNATTPSPS++  +ELDY+EDDDD++++  D DGGKKRMRKPVKARSLKSLM
Subjt:  FYYYSGFGPLWGKKRRDRGG--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM

TrEMBL top hitse value%identityAlignment
A0A0A0K7G9 Uncharacterized protein1.3e-8771.11Show/hide
Query:  SVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------EMADYFEDLDENEDGFEKLV
        SVPE L TH+CQLNQSPWDVI L QH TNQLE+ EDSF ENAS GDS+GAVESV+S+MEES  LS N+ V          +  DY ED DEN +GFEKLV
Subjt:  SVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------EMADYFEDLDENEDGFEKLV

Query:  DRNLKCKKQFNEEENYSSFSSD-HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNPYEFYYYSGFGPLWGKKRRDRG
        D +LKCKKQF +EE+YSSFSSD HHRR+ LRSSENNYYST +NCSI+KKS  GG++SRR RPAR SKKAV  AAGGSNPYEFYYYSGFGPLWGKKRRDRG
Subjt:  DRNLKCKKQFNEEENYSSFSSD-HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNPYEFYYYSGFGPLWGKKRRDRG

Query:  G--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEED--DDGGKKRMRKPVKARSLKSLM
        G        ENTTG+RSNATTPSPS  D +ELDYVE D+DD++++ D   +GGKKRMRKPVKARSLKSLM
Subjt:  G--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEED--DDGGKKRMRKPVKARSLKSLM

A0A1S3C7F3 uncharacterized protein LOC1034977014.1e-9773.45Show/hide
Query:  MRIRKNAKKLSPLLFSAVE--SVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV---------
        MRIRKNA KLSPLLFSA+E  SVPEVL TH+CQLNQSPWDVI L QH TNQLE+ EDSF  NAS GDS+GAVESV+S+MEESA LS N+ V         
Subjt:  MRIRKNAKKLSPLLFSAVE--SVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV---------

Query:  -EMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD-HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP
         +  DY ED DEN +GFEKLVD NLK KKQF +EE+YSSFSSD HHRR+ LRSSENNYYST +NCSI+KKS AGG++SRR RPAR+SKKAV  AAGGSNP
Subjt:  -EMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD-HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP

Query:  YEFYYYSGFGPLWGKKRRDRGG--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEE-DDDGGKKRMRKPVKARSLKSLM
        YEFYYYSGFGPLWGKKRRDRGG        ENTTG+RSNATTPSPS++  +ELDYVEDD+DD+++E+ D DGGKKRMRKPVKARSLKSLM
Subjt:  YEFYYYSGFGPLWGKKRRDRGG--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEE-DDDGGKKRMRKPVKARSLKSLM

A0A5D3BVK5 Protein ecdysoneless-like protein4.1e-9773.45Show/hide
Query:  MRIRKNAKKLSPLLFSAVE--SVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV---------
        MRIRKNA KLSPLLFSA+E  SVPEVL TH+CQLNQSPWDVI L QH TNQLE+ EDSF  NAS GDS+GAVESV+S+MEESA LS N+ V         
Subjt:  MRIRKNAKKLSPLLFSAVE--SVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV---------

Query:  -EMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD-HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP
         +  DY ED DEN +GFEKLVD NLK KKQF +EE+YSSFSSD HHRR+ LRSSENNYYST +NCSI+KKS AGG++SRR RPAR+SKKAV  AAGGSNP
Subjt:  -EMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD-HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP

Query:  YEFYYYSGFGPLWGKKRRDRGG--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEE-DDDGGKKRMRKPVKARSLKSLM
        YEFYYYSGFGPLWGKKRRDRGG        ENTTG+RSNATTPSPS++  +ELDYVEDD+DD+++E+ D DGGKKRMRKPVKARSLKSLM
Subjt:  YEFYYYSGFGPLWGKKRRDRGG--------ENTTGVRSNATTPSPSEIDGDELDYVEDDDDDDDDEE-DDDGGKKRMRKPVKARSLKSLM

A0A6J1H3L8 uncharacterized protein LOC1114601018.3e-9071.38Show/hide
Query:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------E
        MRIRKNA KLSPLLFSAVE VP+VL THVCQLNQSPWDVI LEQHA +QLE+ EDSF ENASLG SIGAVESVAS+ME SA LS N+ V          +
Subjt:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------E

Query:  MADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD---HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP
          DY ED DEN D FEKLVD +L   KQF +EE+YSSFS D   HHRR+ LRSSENNYYST NN SI+KKS AG ++SRR RP +ASKKAV +AA GSNP
Subjt:  MADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD---HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP

Query:  YEFYYYSGFGPLWGKKRRDRGG-------ENTTGVRSNAT--TPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM
        YEFYYYSGFGPLWGKKRR+RGG       EN  G+RSNAT  +PSPSE+DG+ELDYVE+DDDD+++E+D DGGKKRMRKPVKARSLKSLM
Subjt:  YEFYYYSGFGPLWGKKRRDRGG-------ENTTGVRSNAT--TPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM

A0A6J1L0N0 uncharacterized protein LOC1114992986.0e-8869.86Show/hide
Query:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------E
        MRIRKNA KLSPLLFSAVE VP++L THVCQLNQSPWDVI LEQHA +QLE+ EDSF ENASLG SIGAVESVAS+ME SA LS N+ V          +
Subjt:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRV----------E

Query:  MADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD---HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP
          DY EDLDEN D FEKLVD +L   KQF +EE+YSSFS D   +HRR+ LRSSENNYYST NN SI+KKS AG ++SRR R  +ASKKAV +A+ GSNP
Subjt:  MADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSD---HHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNP

Query:  YEFYYYSGFGPLWGKKRRDRGG-------ENTTGVRSNAT----TPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM
        YEFYYYSGFGPLWGKKRR+RGG       EN  G+RSNAT    +PSPSE+DG+ELDYVE+DDDD+++E+D DGGKKRMRKPVKARSLKSLM
Subjt:  YEFYYYSGFGPLWGKKRRDRGG-------ENTTGVRSNAT----TPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00310.1 Putative membrane lipoprotein8.7e-1529.25Show/hide
Query:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQ----HATNQLEDG-----------------------EDSFAENASLGD--------
        MRIRKN  KLS +L S      E   T+VC LNQSPWDVI +        TN ++                         EDSF  N SLGD        
Subjt:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQ----HATNQLEDG-----------------------EDSFAENASLGD--------

Query:  --SIGAVESVASIMEESANLSKNSRVEMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSDHHRRTVLRSSENNYYSTSNNCSITKKSTAGGS
          S+ + +S+ S+ + + N ++  +  ++   ED  +  D ++K        K +  E  +    S D  +  V                         +
Subjt:  --SIGAVESVASIMEESANLSKNSRVEMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSDHHRRTVLRSSENNYYSTSNNCSITKKSTAGGS

Query:  ISRRPRPARASKKAVGTAAGGSNPYEFYYYSGFGPLWGKKRRDRGGEN---TTGVRSNATTPSPSEIDG-------------DELDYVEDDDDDDDDEED
          +R RP  + KK   +A    NPYEFYYYSGFGP WG+KR     E        +S ++  S S+ DG             DE D+V++D D  + ++ 
Subjt:  ISRRPRPARASKKAVGTAAGGSNPYEFYYYSGFGPLWGKKRRDRGGEN---TTGVRSNATTPSPSEIDG-------------DELDYVEDDDDDDDDEED

Query:  DDGGKKRM--------------RKPVKARSLKSLM
         D G+K+               RKPVK RSLKSLM
Subjt:  DDGGKKRM--------------RKPVKARSLKSLM

AT4G00310.2 Putative membrane lipoprotein8.7e-1529.25Show/hide
Query:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQ----HATNQLEDG-----------------------EDSFAENASLGD--------
        MRIRKN  KLS +L S      E   T+VC LNQSPWDVI +        TN ++                         EDSF  N SLGD        
Subjt:  MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQ----HATNQLEDG-----------------------EDSFAENASLGD--------

Query:  --SIGAVESVASIMEESANLSKNSRVEMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSDHHRRTVLRSSENNYYSTSNNCSITKKSTAGGS
          S+ + +S+ S+ + + N ++  +  ++   ED  +  D ++K        K +  E  +    S D  +  V                         +
Subjt:  --SIGAVESVASIMEESANLSKNSRVEMADYFEDLDENEDGFEKLVDRNLKCKKQFNEEENYSSFSSDHHRRTVLRSSENNYYSTSNNCSITKKSTAGGS

Query:  ISRRPRPARASKKAVGTAAGGSNPYEFYYYSGFGPLWGKKRRDRGGEN---TTGVRSNATTPSPSEIDG-------------DELDYVEDDDDDDDDEED
          +R RP  + KK   +A    NPYEFYYYSGFGP WG+KR     E        +S ++  S S+ DG             DE D+V++D D  + ++ 
Subjt:  ISRRPRPARASKKAVGTAAGGSNPYEFYYYSGFGPLWGKKRRDRGGEN---TTGVRSNATTPSPSEIDG-------------DELDYVEDDDDDDDDEED

Query:  DDGGKKRM--------------RKPVKARSLKSLM
         D G+K+               RKPVK RSLKSLM
Subjt:  DDGGKKRM--------------RKPVKARSLKSLM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGATCCGGAAGAACGCGAAGAAGTTATCGCCGTTGTTGTTTTCGGCCGTGGAGTCCGTTCCTGAGGTTCTTCCGACGCACGTTTGTCAGTTGAACCAGTCGCCATG
GGATGTGATTTCTCTCGAGCAGCACGCCACGAACCAGCTCGAAGACGGTGAAGATAGCTTCGCTGAGAATGCTAGCTTAGGGGATTCTATCGGAGCTGTCGAGAGCGTTG
CGTCGATTATGGAGGAATCAGCCAATTTGTCGAAGAATAGTAGGGTAGAAATGGCGGATTATTTTGAGGATTTGGATGAAAATGAGGACGGATTCGAGAAATTAGTTGAT
AGAAATTTGAAATGTAAGAAACAATTTAACGAGGAAGAGAATTATTCTTCGTTTAGCTCCGACCACCACCGCCGTACAGTTCTTAGGTCTAGTGAGAATAATTATTATTC
TACTTCCAATAACTGCTCGATTACGAAGAAATCCACCGCCGGAGGTTCGATCTCGCGGCGGCCGCGTCCGGCGAGGGCCTCGAAGAAGGCGGTTGGTACGGCGGCCGGCG
GATCGAATCCGTACGAGTTTTATTATTATTCTGGATTTGGGCCACTGTGGGGGAAGAAACGACGGGATAGAGGAGGGGAAAATACGACGGGAGTCCGAAGCAATGCAACG
ACGCCGTCGCCGTCGGAAATCGACGGCGACGAATTGGATTATGTGGAGGACGACGACGACGATGACGACGATGAAGAGGATGACGACGGCGGGAAGAAGCGGATGAGGAA
GCCGGTGAAAGCTCGGTCGTTGAAATCGCTGATGTAA
mRNA sequenceShow/hide mRNA sequence
GAAAGAAGAACTTCTTCACCCTCCCAAAAAATTTCTCTCTCGACAAACTTTTCAGACGCAAGAGAAGAAAAGCAGCGGCGGAGACGAATCCGCCGCCGCCGTCGCCGCCG
TAGACTGTGTATTCGTGTATCCCCGCGGCGGTTCCGAAAGCGATTTCTCGAATCTGCTCTTTGCTGTTCTTCATTTTTTTGGCCATGAGGATCCGGAAGAACGCGAAGAA
GTTATCGCCGTTGTTGTTTTCGGCCGTGGAGTCCGTTCCTGAGGTTCTTCCGACGCACGTTTGTCAGTTGAACCAGTCGCCATGGGATGTGATTTCTCTCGAGCAGCACG
CCACGAACCAGCTCGAAGACGGTGAAGATAGCTTCGCTGAGAATGCTAGCTTAGGGGATTCTATCGGAGCTGTCGAGAGCGTTGCGTCGATTATGGAGGAATCAGCCAAT
TTGTCGAAGAATAGTAGGGTAGAAATGGCGGATTATTTTGAGGATTTGGATGAAAATGAGGACGGATTCGAGAAATTAGTTGATAGAAATTTGAAATGTAAGAAACAATT
TAACGAGGAAGAGAATTATTCTTCGTTTAGCTCCGACCACCACCGCCGTACAGTTCTTAGGTCTAGTGAGAATAATTATTATTCTACTTCCAATAACTGCTCGATTACGA
AGAAATCCACCGCCGGAGGTTCGATCTCGCGGCGGCCGCGTCCGGCGAGGGCCTCGAAGAAGGCGGTTGGTACGGCGGCCGGCGGATCGAATCCGTACGAGTTTTATTAT
TATTCTGGATTTGGGCCACTGTGGGGGAAGAAACGACGGGATAGAGGAGGGGAAAATACGACGGGAGTCCGAAGCAATGCAACGACGCCGTCGCCGTCGGAAATCGACGG
CGACGAATTGGATTATGTGGAGGACGACGACGACGATGACGACGATGAAGAGGATGACGACGGCGGGAAGAAGCGGATGAGGAAGCCGGTGAAAGCTCGGTCGTTGAAAT
CGCTGATGTAAAATACGAAATGAAGTTTGGATTATGGGGTTTTGGCAGTGGGATTGTGAAAAAGATATCGGGGTTCTTCTTCGTCCTTTGTTTAACTATTTATTTTTAGG
G
Protein sequenceShow/hide protein sequence
MRIRKNAKKLSPLLFSAVESVPEVLPTHVCQLNQSPWDVISLEQHATNQLEDGEDSFAENASLGDSIGAVESVASIMEESANLSKNSRVEMADYFEDLDENEDGFEKLVD
RNLKCKKQFNEEENYSSFSSDHHRRTVLRSSENNYYSTSNNCSITKKSTAGGSISRRPRPARASKKAVGTAAGGSNPYEFYYYSGFGPLWGKKRRDRGGENTTGVRSNAT
TPSPSEIDGDELDYVEDDDDDDDDEEDDDGGKKRMRKPVKARSLKSLM