; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C04G073530 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C04G073530
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Descriptionlate embryogenesis abundant protein B19.4
Genome locationCla97Chr04:21226693..21230421
RNA-Seq ExpressionCla97C04G073530
SyntenyCla97C04G073530
Gene Ontology termsGO:0009737 - response to abscisic acid (biological process)
InterPro domainsIPR000389 - Small hydrophilic plant seed protein
IPR022377 - Small hydrophilic plant seed protein, conserved site
IPR038956 - Late embryogenesis abundant protein, LEA_5 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4401425.1 hypothetical protein G4B88_001619 [Cannabis sativa]1.9e-6759.04Show/hide
Query:  SQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH-----------------------------------------
        SQ++R ELDAKAKQGETVVPGGTGG+S EAQE LAEGRSRGGQTR EQLGHEGYQE+G                                          
Subjt:  SQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH-----------------------------------------

Query:  -------------------QGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETV
                           +GG+ R EQ+GHEGYQEMGRKGGLST DKSGG+RA EEGI+IDES                 +S+++R ELDA+A+QGETV
Subjt:  -------------------QGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETV

Query:  VPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK
        VPGGTGG+SLEAQEHLAEGRSRGGQTR EQLGHEGYQEMGRKGGLS T   GGERA EEG++IDESKFRTK
Subjt:  VPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK

KAG7033764.1 Em-like protein GEA1, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-8584.98Show/hide
Query:  MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEG
        MA+QQERSEL+AKAKQGETVVPGGTGGKS EAQER    RSRGGQTRKEQLGHEGYQE+GH+GGE RREQMG EGYQEMG+KGGLSTMDKS  ER  EEG
Subjt:  MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEG

Query:  IEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAE
        IEIDESK            ++EMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSN+GMPGGERAAE
Subjt:  IEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAE

Query:  EGVEIDESKFRTK
        EGVEIDESKFR K
Subjt:  EGVEIDESKFRTK

XP_007206180.2 late embryogenesis abundant protein B19.3 [Prunus persica]3.6e-5860.85Show/hide
Query:  QERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEID
        Q+R ELD KA++GE V+PGGTGGKS EAQE LAEGRSRGGQTRK ++                    GHEGY EMG+KGGLST DKSGGERAAEEGI +D
Subjt:  QERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEID

Query:  ESKFRTKDPKYLEELKKEMSSEQERC------ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQ--------------------TRKEQLGHEGY
        ESK++T          KEM+SEQER       ELD +ARQGE VVPGGTGGKSLEAQEHLAEGRSRGGQ                    TRKEQ+GHEGY
Subjt:  ESKFRTKDPKYLEELKKEMSSEQERC------ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQ--------------------TRKEQLGHEGY

Query:  QEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK
        +EMG+KGGLS     GGERAAEEG+ IDESK++TK
Subjt:  QEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK

XP_016183975.2 LOW QUALITY PROTEIN: late embryogenesis abundant protein B19.4 [Arachis ipaensis]1.4e-6265.67Show/hide
Query:  MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEG
        MAS+Q++ ELD +AKQGETVVPGGTGGKS EAQE LAEGRS+GGQT                    RREQ+G EGYQEMGRKGG STM+KSGGERA EEG
Subjt:  MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEG

Query:  IEIDESKFRTKD----PKYLE-ELKKEM---------------SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE
        +EIDESKF TK+    P+Y   E K  M               S +Q R ELD RA+QGETVVPGGTGGKSLEAQEHLAEGRS+GGQTR+EQLG EGYQE
Subjt:  IEIDESKFRTKD----PKYLE-ELKKEM---------------SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE

Query:  MGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK
        MGRKGG S     GGERA EEGVEIDESKF TK
Subjt:  MGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK

XP_021296037.1 late embryogenesis abundant protein B19.4 [Herrania umbratica]1.7e-5260Show/hide
Query:  MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEG
        M+ QQ+R ELD +A++ E V+PGGTGGKS EA+E LAEGRSRGGQTRKEQ+  EGYQE+GHQ                    GGLST DKSGGERA EEG
Subjt:  MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEG

Query:  IEIDESKFR-TKDPKYLEELKK------EMSSEQ-------ERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGG
        ++I++SK+R ++  K    +KK       M+SEQ       ER ELDARARQGE VVPGGT GKSLEAQE LAEGR  GG+  K+Q+G EGYQEMGRKGG
Subjt:  IEIDESKFR-TKDPKYLEELKK------EMSSEQ-------ERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGG

Query:  LSNTGMPGGERAAEEGVEIDESKFR
        LS T    GERAAEEG+ IDESK R
Subjt:  LSNTGMPGGERAAEEGVEIDESKFR

TrEMBL top hitse value%identityAlignment
A0A446J0E8 Uncharacterized protein2.6e-4654.72Show/hide
Query:  ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGI
        + QQERSELD  A++GETVVPGGTGGKS EAQE LA+GRSRGG+TRKEQLG EGY+E+GH+GGE R+EQ+G EGY+EMGRKGGLSTM++SGGERAA    
Subjt:  ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGI

Query:  EIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEE
                                                                  EGRSRGGQTR+EQ+G EGY EMGRKGGLS     GGERAA E
Subjt:  EIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEE

Query:  GVEIDESKFRTK
        G++IDESKF+TK
Subjt:  GVEIDESKFRTK

A0A498K5A7 Uncharacterized protein5.3e-4748.75Show/hide
Query:  MASQQE------RSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGE
        MAS+QE      R+ELD KA++GET+VPGGTGG S EAQE LAEGRSRGGQTRK Q+                    G EGY EMG+KGGLST DK GGE
Subjt:  MASQQE------RSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGE

Query:  RAAEEGIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQL---------------------
        RAAEEGI+IDESK   +DP+              R ELD +ARQGE VVPGGTG K+L AQEHLAEGR RGG+ RKEQL                     
Subjt:  RAAEEGIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQL---------------------

Query:  ---------------------------------------GHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK
                                               G EGYQEMG+KGGLS     GGERAAEEG+EI+ESK++TK
Subjt:  ---------------------------------------GHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK

A0A6J1B9A4 late embryogenesis abundant protein B19.48.4e-5360Show/hide
Query:  MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEG
        M+ QQ+R ELD +A++ E V+PGGTGGKS EA+E LAEGRSRGGQTRKEQ+  EGYQE+GHQ                    GGLST DKSGGERA EEG
Subjt:  MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEG

Query:  IEIDESKFR-TKDPKYLEELKK------EMSSEQ-------ERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGG
        ++I++SK+R ++  K    +KK       M+SEQ       ER ELDARARQGE VVPGGT GKSLEAQE LAEGR  GG+  K+Q+G EGYQEMGRKGG
Subjt:  IEIDESKFR-TKDPKYLEELKK------EMSSEQ-------ERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGG

Query:  LSNTGMPGGERAAEEGVEIDESKFR
        LS T    GERAAEEG+ IDESK R
Subjt:  LSNTGMPGGERAAEEGVEIDESKFR

A0A7J6I2G2 Uncharacterized protein9.3e-6859.04Show/hide
Query:  SQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH-----------------------------------------
        SQ++R ELDAKAKQGETVVPGGTGG+S EAQE LAEGRSRGGQTR EQLGHEGYQE+G                                          
Subjt:  SQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH-----------------------------------------

Query:  -------------------QGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETV
                           +GG+ R EQ+GHEGYQEMGRKGGLST DKSGG+RA EEGI+IDES                 +S+++R ELDA+A+QGETV
Subjt:  -------------------QGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETV

Query:  VPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK
        VPGGTGG+SLEAQEHLAEGRSRGGQTR EQLGHEGYQEMGRKGGLS T   GGERA EEG++IDESKFRTK
Subjt:  VPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK

R0HE60 Uncharacterized protein1.7e-4557.67Show/hide
Query:  MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEE
        MAS+Q  R ELD KAKQGETVV GGTGGKS EAQE LAEGRS+GGQTRKEQLGHEGYQE+G +GGE R+EQ+GHEGYQEMGRKGG +  ++ G E   E 
Subjt:  MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEE

Query:  GIEIDES-KFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERA
        G +  E+ K +     Y E  +K   + +E+   +     G+    GG   K     E   E   +GG+ RKEQLGHEGYQEMGRKGGLS     GGERA
Subjt:  GIEIDES-KFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERA

Query:  AEEGVEIDESKFRTK
         EEG+EIDESKF  K
Subjt:  AEEGVEIDESKFRTK

SwissProt top hitse value%identityAlignment
I1N2Z5 Protein SLE12.3e-3981.82Show/hide
Query:  MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEE
        M SQQ  R ELD KA+QGETVVPGGTGGKS EAQE LAEGRSRGGQTRK+QLG EGY E+G +GG+ R+EQMG EGYQEMGRKGGLSTMDKSGGERA EE
Subjt:  MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEE

Query:  GIEIDESKFR
        GIEIDESKF+
Subjt:  GIEIDESKFR

Q02400 Late embryogenesis abundant protein B19.32.8e-3767.69Show/hide
Query:  ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQ--------------------TRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGR
        + QQERSELD  A++GETVVPGGTGGK+ EAQE LAEGRSRGGQ                    TRKEQLG EGY+E+GH+GGE R+EQMG EGY EMGR
Subjt:  ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQ--------------------TRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGR

Query:  KGGLSTMDKSGGERAAEEGIEIDESKFRTK
        KGGLSTM++SGGERAA EGI+IDESKF+TK
Subjt:  KGGLSTMDKSGGERAAEEGIEIDESKFRTK

Q05191 Late embryogenesis abundant protein B19.49.3e-4149.06Show/hide
Query:  ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGI
        + QQERSELD  A++GETVVPGGTGGK+ EAQE LAEGRSRGGQTRKEQLG EGY+E+GH+GGE R+EQ+G EGY+EMG KGG +  ++ G E   E G 
Subjt:  ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGI

Query:  EIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEE
                                                                      +GG+TRKEQ+G EGY+EMGRKGGLS     GGERAA E
Subjt:  EIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEE

Query:  GVEIDESKFRTK
        G++IDESKF+TK
Subjt:  GVEIDESKFRTK

Q07187 Em-like protein GEA14.5e-4352.34Show/hide
Query:  MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEE
        MAS+Q  R ELD KAKQGETVVPGGTGG S EAQE LAEGRS+GGQTRKEQLGHEGYQE+GH+GGEAR+EQ+GHEGYQEMG KGG +  ++ G E   E 
Subjt:  MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEE

Query:  GIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAA
        G                                                               +GG+ RKEQLGHEGY+EMGRKGGLS     GGERA 
Subjt:  GIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAA

Query:  EEGVEIDESKFRTK
        EEG+EIDESKF  K
Subjt:  EEGVEIDESKFRTK

Q5KTS7 Carrot ABA-induced in somatic embryos 38.7e-3978.18Show/hide
Query:  ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGI
        + Q++RSELDA+AKQGETVVPGGTGGKS EAQE LAEGRS+GG TRKEQLG EGYQE+G +GGE RREQMG EGY++MGR GGL+T DKSG ERA EEGI
Subjt:  ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGI

Query:  EIDESKFRTK
        +ID+SKFRTK
Subjt:  EIDESKFRTK

Arabidopsis top hitse value%identityAlignment
AT2G40170.1 Stress induced protein1.1e-3380.22Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK
        M+S+QE+ +LD RA++GETVVPGGTGGKS EAQ+HLAEGRSRGGQTRKEQLG EGYQ+MGRKGGLS    PGGE A EEGVEIDESKFRTK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK

AT3G51810.1 Stress induced protein3.2e-4452.34Show/hide
Query:  MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEE
        MAS+Q  R ELD KAKQGETVVPGGTGG S EAQE LAEGRS+GGQTRKEQLGHEGYQE+GH+GGEAR+EQ+GHEGYQEMG KGG +  ++ G E   E 
Subjt:  MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEE

Query:  GIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAA
        G                                                               +GG+ RKEQLGHEGY+EMGRKGGLS     GGERA 
Subjt:  GIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAA

Query:  EEGVEIDESKFRTK
        EEG+EIDESKF  K
Subjt:  EEGVEIDESKFRTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCGCAACAGGAAAGATCAGAGCTGGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGAACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGC
TGAAGGTCGGAGTCGGGGTGGGCAGACGAGGAAGGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCACCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATG
AAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCTGGTGGTGAGCGGGCAGCGGAGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACC
AAGGACCCTAAGTATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGATGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTCGTCCCCGGTGGAAC
TGGGGGCAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGAGCAACTAGGACACGAAGGGTACCAAGAGATGGGCCGTA
AAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAATCCAAGTTCAGGACTAAGTAG
mRNA sequenceShow/hide mRNA sequence
AAGTTTCTTTAGAAGAATTAGAAGGTCGGGCAACACAAATGGCATCGCAACAGGAAAGATCAGAGCTGGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGA
ACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGCTGAAGGTCGGAGTCGGGGTGGGCAGACGAGGAAGGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCA
CCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATGAAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCTGGTGGTGAGCGGGCAGCGG
AGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACCAAGGACCCTAAGTATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGATGTGAACTCGACGCC
AGGGCCAGGCAAGGGGAGACTGTCGTCCCCGGTGGAACTGGGGGCAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGA
GCAACTAGGACACGAAGGGTACCAAGAGATGGGCCGTAAAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAAT
CCAAGTTCAGGACTAAGTAGAAGAAAGCCTTTCACAATGTCGTTGAGTTTCAAGTTCTAAGTTCCATTTTCAGCTTT
Protein sequenceShow/hide protein sequence
MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRT
KDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK