; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C08G150920 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C08G150920
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionXS domain-containing protein
Genome locationCla97Chr08:19327783..19328695
RNA-Seq ExpressionCla97C08G150920
SyntenyCla97C08G150920
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
InterPro domainsIPR005380 - XS domain
IPR038588 - XS domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK11753.1 uncharacterized protein E5676_scaffold304G00720 [Cucumis melo var. makuwa]8.7e-9487.8Show/hide
Query:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK
        M ++RLVKHAYMSHKVGL+AQHLGL KAICVLMGWNSV PQDTVTWVPEVLSKE  V++KEDLIIWPPVIIVRNVSLSH+SPDKWRVVTIEALE+FLRSK
Subjt:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK

Query:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNG-VEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK
        NLLKGRVKM+LGCPADQSVM LKFLPTFSGLT+AERLNKFFSENRRGREDFE+AKC NG V+MEG KIEEEVLYGYLGTAEDL DVELN+RK  MIKSKK
Subjt:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNG-VEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK

Query:  EILEL
        EILE+
Subjt:  EILEL

XP_008456586.1 PREDICTED: uncharacterized protein LOC103496499 [Cucumis melo]8.7e-9487.8Show/hide
Query:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK
        M ++RLVKHAYMSHKVGL+AQHLGL KAICVLMGWNSV PQDTVTWVPEVLSKE  V++KEDLIIWPPVIIVRNVSLSH+SPDKWRVVTIEALE+FLRSK
Subjt:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK

Query:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNG-VEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK
        NLLKGRVKM+LGCPADQSVM LKFLPTFSGLT+AERLNKFFSENRRGREDFE+AKC NG V+MEG KIEEEVLYGYLGTAEDL DVELN+RK  MIKSKK
Subjt:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNG-VEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK

Query:  EILEL
        EILE+
Subjt:  EILEL

XP_011656567.1 uncharacterized protein LOC101208223 [Cucumis sativus]5.7e-9387.32Show/hide
Query:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK
        M ++RLVKHAYMSHKVGL+AQHLGLAKAICVLMGWNSV PQDTVTWVPEVLSKE  VV+KEDLIIWPPVII+RN+SLSH+SPDKWRVVTIEALE+FLRSK
Subjt:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK

Query:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNG-VEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK
        NLLKGRVKM+LGCPADQSVMVLKFLPTFSGLT+AERL+KFFSENRRGREDFE+AKC  G V+MEG KIEEEVLYGYLGTAEDL DVELN+RK  MIKSKK
Subjt:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNG-VEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK

Query:  EILEL
        EILE+
Subjt:  EILEL

XP_022133809.1 uncharacterized protein LOC111006280 [Momordica charantia]1.0e-8681.28Show/hide
Query:  TERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNL
        T+RLVKHAYMSH+ GLRAQHLGLAKAICVLMGWNS +PQDTVTWVPEVL KE  VV+KEDLIIWPPVII+RN+SLSHS+PD+WRVVTIEALE FLRSKNL
Subjt:  TERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNL

Query:  LKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKN-GVEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKKEI
        LKGRVK+ LG PADQSVMVLKFL  FSGLT+AERL+KFFSE R GR +FE+AKC+N G EMEG K EE +LYGYLG +EDLDDVE N+RKLS IKSKKEI
Subjt:  LKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKN-GVEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKKEI

Query:  LEL
        LEL
Subjt:  LEL

XP_038884675.1 uncharacterized protein LOC120075393 [Benincasa hispida]4.9e-9791.22Show/hide
Query:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK
        M T+RLVKHAYMSHKVGLRA HLGLAKAICVLMGWNSV PQDTVTWVPEVLSKE  VV+KEDLIIWPPVIIVRN+SLS+SSPDKWRVVTI+ALE+FLRSK
Subjt:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK

Query:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKC-KNGVEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK
        NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLT+AERL+KFFSENRRGREDFELAKC K GV MEG KIEEEVLYGYLGTAEDLDDVELN+RKLSMIKSKK
Subjt:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKC-KNGVEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK

Query:  EILEL
        EILEL
Subjt:  EILEL

TrEMBL top hitse value%identityAlignment
A0A1S3C369 uncharacterized protein LOC1034964994.2e-9487.8Show/hide
Query:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK
        M ++RLVKHAYMSHKVGL+AQHLGL KAICVLMGWNSV PQDTVTWVPEVLSKE  V++KEDLIIWPPVIIVRNVSLSH+SPDKWRVVTIEALE+FLRSK
Subjt:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK

Query:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNG-VEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK
        NLLKGRVKM+LGCPADQSVM LKFLPTFSGLT+AERLNKFFSENRRGREDFE+AKC NG V+MEG KIEEEVLYGYLGTAEDL DVELN+RK  MIKSKK
Subjt:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNG-VEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK

Query:  EILEL
        EILE+
Subjt:  EILEL

A0A5D3CIK8 XS domain-containing protein4.2e-9487.8Show/hide
Query:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK
        M ++RLVKHAYMSHKVGL+AQHLGL KAICVLMGWNSV PQDTVTWVPEVLSKE  V++KEDLIIWPPVIIVRNVSLSH+SPDKWRVVTIEALE+FLRSK
Subjt:  MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSK

Query:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNG-VEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK
        NLLKGRVKM+LGCPADQSVM LKFLPTFSGLT+AERLNKFFSENRRGREDFE+AKC NG V+MEG KIEEEVLYGYLGTAEDL DVELN+RK  MIKSKK
Subjt:  NLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNG-VEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKK

Query:  EILEL
        EILE+
Subjt:  EILEL

A0A6J1BX13 uncharacterized protein LOC1110062805.0e-8781.28Show/hide
Query:  TERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNL
        T+RLVKHAYMSH+ GLRAQHLGLAKAICVLMGWNS +PQDTVTWVPEVL KE  VV+KEDLIIWPPVII+RN+SLSHS+PD+WRVVTIEALE FLRSKNL
Subjt:  TERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNL

Query:  LKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKN-GVEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKKEI
        LKGRVK+ LG PADQSVMVLKFL  FSGLT+AERL+KFFSE R GR +FE+AKC+N G EMEG K EE +LYGYLG +EDLDDVE N+RKLS IKSKKEI
Subjt:  LKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKN-GVEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKKEI

Query:  LEL
        LEL
Subjt:  LEL

A0A6J1HC30 uncharacterized protein LOC1114614702.8e-8278.47Show/hide
Query:  TERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNL
        T+RLVKHAYMSHK+GLRA+HLGLAKAICVLMGWNS LPQDTVTWVPE L KE  VV+KEDLIIWPPV+IVRN+S+S S+P KW+V+TIEALEAFLRSKNL
Subjt:  TERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNL

Query:  LKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKN------GVEMEGGKI-EEEVLYGYLGTAEDLDDVELNIRKLSMI
        LKGRVKM+LGCPADQSVMVLKFLPTFSGLT+AERLNKFF E R GR +FE +K  N      G   +G KI EEEVLYGYLG AEDLD VE NIRK S I
Subjt:  LKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKN------GVEMEGGKI-EEEVLYGYLGTAEDLDDVELNIRKLSMI

Query:  KSKKEILEL
        KSKKEILEL
Subjt:  KSKKEILEL

A0A6J1JSP3 uncharacterized protein LOC1114871811.4e-8177.51Show/hide
Query:  TERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNL
        T+RLVKHAYMSHK+GLRAQHLGLAKAICVLMGWNS LPQDTV WVPE L KE  VV+KEDLIIWPPVIIVRN+S+S S+P KW+V+TIEALEAFLRSKNL
Subjt:  TERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNL

Query:  LKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNGVEMEGGKI-------EEEVLYGYLGTAEDLDDVELNIRKLSMI
        LKGRVKM+LGCPADQSVMVLKFLPTFSGLT+AERL+KFF E R GR +FE +K  NG   + G         EEEVLYGYLG AEDLD VE NIRK S I
Subjt:  LKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNGVEMEGGKI-------EEEVLYGYLGTAEDLDDVELNIRKLSMI

Query:  KSKKEILEL
        KSKKEILEL
Subjt:  KSKKEILEL

SwissProt top hitse value%identityAlignment
A1Y2B7 Protein SUPPRESSOR OF GENE SILENCING 3 homolog1.2e-0529.32Show/hide
Query:  IIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNGVEME
        I+WPP++IV N  L     DKW+ +  + L  +       K R     G    + + VL F  +  G   AERL+K F      R  + L K +    + 
Subjt:  IIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNGVEME

Query:  GGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIK
        GGK +   LYG+L   ED++    +    S +K
Subjt:  GGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIK

A2ZIW7 Protein SUPPRESSOR OF GENE SILENCING 3 homolog1.3e-0427.82Show/hide
Query:  IIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNGVEME
        I+WPP+++V N  L     DKW+ +  + L  +       K R     G    + + VL F  +  G   AERL+  F   R  R  +  A       + 
Subjt:  IIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNGVEME

Query:  GGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIK
        GGK +   LYG+L T +D++    +    S +K
Subjt:  GGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIK

A5YVF1 Protein SUPPRESSOR OF GENE SILENCING 37.6e-0830Show/hide
Query:  EVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGR
        EV  +   +  K+  I+WPP++I+ N  L     DKW  +  + L  +  S   +K R   + G    + + +L F  +  G   A+RL++ FSEN R R
Subjt:  EVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGR

Query:  EDFELAKCKNGVEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSK
        + +E    +      GGK    +LYGY+   +D+D    N  + S  KSK
Subjt:  EDFELAKCKNGVEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSK

Q2QWE9 Protein SUPPRESSOR OF GENE SILENCING 3 homolog1.3e-0427.82Show/hide
Query:  IIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNGVEME
        I+WPP+++V N  L     DKW+ +  + L  +       K R     G    + + VL F  +  G   AERL+  F   R  R  +  A       + 
Subjt:  IIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNGVEME

Query:  GGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIK
        GGK +   LYG+L T +D++    +    S +K
Subjt:  GGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIK

Q9LDX1 Protein SUPPRESSOR OF GENE SILENCING 33.5e-0530.95Show/hide
Query:  EKEDLIIWPPVIIVRNVSLSHSSPDKW-RVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCK
        EK+  I+WPP++I+ N  L     DKW  +   E LE F + + L   R + + G    + + VL F  + +G   AERL++  +E    R  +   +  
Subjt:  EKEDLIIWPPVIIVRNVSLSHSSPDKW-RVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCK

Query:  NGVEMEGGKIEEEVLYGYLGTAEDLD
            M  G + +  LYG+L T +DLD
Subjt:  NGVEMEGGKIEEEVLYGYLGTAEDLD

Arabidopsis top hitse value%identityAlignment
AT3G22430.1 CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380)2.8e-2132.23Show/hide
Query:  TERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWN-SVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKN
        T  LV H Y S     R  HLGL KA+CVLMGWN S  P ++  +  + L  +   + +  LIIWPP +IV+N S       +      + ++  +R   
Subjt:  TERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWN-SVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKN

Query:  LLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDF----ELAKCKNG------VEMEGGKIEEE-VLYGYLGTAEDLDDVELNIR
        L  G+ K   G      + + KF    SGL +A R+ ++F +  RGR+ +     L   K+       VE++G   E++ + YGYL T  DLD V++  +
Subjt:  LLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDF----ELAKCKNG------VEMEGGKIEEE-VLYGYLGTAEDLDDVELNIR

Query:  KLSMIKSKKEI
        K + I+S +E+
Subjt:  KLSMIKSKKEI

AT5G23570.1 XS domain-containing protein / XS zinc finger domain-containing protein-related2.5e-0630.95Show/hide
Query:  EKEDLIIWPPVIIVRNVSLSHSSPDKW-RVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCK
        EK+  I+WPP++I+ N  L     DKW  +   E LE F + + L   R + + G    + + VL F  + +G   AERL++  +E    R  +   +  
Subjt:  EKEDLIIWPPVIIVRNVSLSHSSPDKW-RVVTIEALEAFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCK

Query:  NGVEMEGGKIEEEVLYGYLGTAEDLD
            M  G + +  LYG+L T +DLD
Subjt:  NGVEMEGGKIEEEVLYGYLGTAEDLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTACTGAACGCCTAGTGAAGCATGCTTATATGTCCCACAAGGTTGGGTTGAGGGCTCAGCATTTAGGTCTTGCCAAAGCCATATGCGTTTTGATGGGGTGGAATAG
TGTCCTTCCTCAAGACACTGTAACATGGGTTCCTGAGGTTTTGTCCAAGGAAGCAACTGTCGTTGAGAAGGAAGATCTTATCATCTGGCCTCCTGTTATTATTGTCCGCA
ACGTTTCTCTGTCACACAGCAGTCCTGATAAGTGGAGAGTTGTAACAATTGAAGCACTTGAGGCTTTCTTGAGAAGTAAAAATCTGCTGAAGGGAAGAGTGAAAATGAAT
TTGGGTTGTCCTGCAGATCAAAGTGTAATGGTGTTGAAGTTTCTGCCTACCTTTTCTGGTTTAACAAACGCAGAAAGACTCAACAAATTCTTCTCTGAAAACAGACGTGG
AAGAGAGGATTTTGAGTTGGCAAAGTGCAAAAATGGAGTTGAAATGGAAGGAGGCAAAATAGAAGAGGAAGTGCTTTATGGATACTTGGGAACTGCAGAGGATTTGGATG
ATGTTGAACTCAATATAAGGAAGTTGAGTATGATAAAGAGCAAAAAGGAAATTTTGGAGTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGATTACTGAACGCCTAGTGAAGCATGCTTATATGTCCCACAAGGTTGGGTTGAGGGCTCAGCATTTAGGTCTTGCCAAAGCCATATGCGTTTTGATGGGGTGGAATAG
TGTCCTTCCTCAAGACACTGTAACATGGGTTCCTGAGGTTTTGTCCAAGGAAGCAACTGTCGTTGAGAAGGAAGATCTTATCATCTGGCCTCCTGTTATTATTGTCCGCA
ACGTTTCTCTGTCACACAGCAGTCCTGATAAGTGGAGAGTTGTAACAATTGAAGCACTTGAGGCTTTCTTGAGAAGTAAAAATCTGCTGAAGGGAAGAGTGAAAATGAAT
TTGGGTTGTCCTGCAGATCAAAGTGTAATGGTGTTGAAGTTTCTGCCTACCTTTTCTGGTTTAACAAACGCAGAAAGACTCAACAAATTCTTCTCTGAAAACAGACGTGG
AAGAGAGGATTTTGAGTTGGCAAAGTGCAAAAATGGAGTTGAAATGGAAGGAGGCAAAATAGAAGAGGAAGTGCTTTATGGATACTTGGGAACTGCAGAGGATTTGGATG
ATGTTGAACTCAATATAAGGAAGTTGAGTATGATAAAGAGCAAAAAGGAAATTTTGGAGTTGTAA
Protein sequenceShow/hide protein sequence
MITERLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPQDTVTWVPEVLSKEATVVEKEDLIIWPPVIIVRNVSLSHSSPDKWRVVTIEALEAFLRSKNLLKGRVKMN
LGCPADQSVMVLKFLPTFSGLTNAERLNKFFSENRRGREDFELAKCKNGVEMEGGKIEEEVLYGYLGTAEDLDDVELNIRKLSMIKSKKEILEL