; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001714 (gene) of Snake gourd v1 genome

Gene IDTan0001714
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionU6 snRNA-associated Sm-like protein LSm4
Genome locationLG01:79202760..79215367
RNA-Seq ExpressionTan0001714
SyntenyTan0001714
Gene Ontology termsGO:0000956 - nuclear-transcribed mRNA catabolic process (biological process)
GO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0006772 - thiamine metabolic process (biological process)
GO:0009229 - thiamine diphosphate biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0033962 - cytoplasmic mRNA processing body assembly (biological process)
GO:0000932 - P-body (cellular component)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005688 - U6 snRNP (cellular component)
GO:0097526 - spliceosomal tri-snRNP complex (cellular component)
GO:0030975 - thiamine binding (molecular function)
GO:0017070 - U6 snRNA binding (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0004788 - thiamine diphosphokinase activity (molecular function)
InterPro domainsIPR034101 - Sm-like protein Lsm4
IPR027141 - Like-Sm (LSM) domain containing protein, LSm4/SmD1/SmD3
IPR010920 - LSM domain superfamily
IPR001163 - LSM domain, eukaryotic/archaea-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592199.1 hypothetical protein SDJN03_14545, partial [Cucurbita argyrosperma subsp. sororia]3.3e-7488.89Show/hide
Query:  FSSDRGFQWLFLLLGIFSVSLKMLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDK
        + SDR  + L        VSLKMLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDK
Subjt:  FSSDRGFQWLFLLLGIFSVSLKMLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDK

Query:  VQEETKSRADRKPPGVGRGRGRGREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        VQEETKSR DRKPPGVGRGRGRGREDGPGGRP+KGMGRGFDDGAKAASGGRGKG  GGK GANRVGGRGRG
Subjt:  VQEETKSRADRKPPGVGRGRGRGREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

XP_008454536.1 PREDICTED: probable U6 snRNA-associated Sm-like protein LSm4 [Cucumis melo]1.1e-7499.33Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKP GVGRGRGR
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
Subjt:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

XP_022926288.1 sm-like protein LSM4 isoform X1 [Cucurbita moschata]1.7e-7599.33Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETK+RADRKPPGVGRGRGR
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
Subjt:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

XP_022975088.1 sm-like protein LSM4 isoform X1 [Cucurbita maxima]4.2e-7497.99Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSR DRKPPGVGRGRGR
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        GREDGPGGRP+KGMGRGFDDGAKAASGGRGKGG GGKPGANRVGGRGRG
Subjt:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

XP_038896461.1 uncharacterized protein LOC120084721 [Benincasa hispida]2.4e-7792.73Show/hide
Query:  FQWLFLLLGIFSVSLKMLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETK
        + W+     +  VSLKMLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETK
Subjt:  FQWLFLLLGIFSVSLKMLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETK

Query:  SRADRKPPGVGRGRGRGREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        SRADRKPPGVGRGRGRGREDGPGGRPAKGMGRGFDD AKAASGGRGKGGPGGKPGANRVGGRGRG
Subjt:  SRADRKPPGVGRGRGRGREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

TrEMBL top hitse value%identityAlignment
A0A0A0L3Q6 U6 snRNA-associated Sm-like protein LSm48.3e-7698.04Show/hide
Query:  VSLKMLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGR
        +SLKMLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKP GVGR
Subjt:  VSLKMLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGR

Query:  GRGRGREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        GRGRGREDGPG RPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
Subjt:  GRGRGREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

A0A1S3BYD6 U6 snRNA-associated Sm-like protein LSm45.4e-7599.33Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKP GVGRGRGR
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
Subjt:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

A0A6J1EHL8 U6 snRNA-associated Sm-like protein LSm48.3e-7699.33Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETK+RADRKPPGVGRGRGR
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
Subjt:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

A0A6J1ID83 U6 snRNA-associated Sm-like protein LSm42.1e-7497.99Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSR DRKPPGVGRGRGR
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        GREDGPGGRP+KGMGRGFDDGAKAASGGRGKGG GGKPGANRVGGRGRG
Subjt:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

A0A6J1J2N4 U6 snRNA-associated Sm-like protein LSm48.3e-7699.33Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETK+RADRKPPGVGRGRGR
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
Subjt:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

SwissProt top hitse value%identityAlignment
F4K4E3 Sm-like protein LSM43.3e-5377.85Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE K+R DRKPPGVGRGRGR
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        G +DG               GA+    GRG+G   GK G NR  GRGRG
Subjt:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

Q43582 Probable U6 snRNA-associated Sm-like protein LSm41.2e-5882.67Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECY+RGNTIKYLRVPDEVIDKVQEE KSR DRKPPGVGR R R
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  -GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
         GR+D   GR  KG+GRG DDG    + GRGKGGP  K G  R GGRGRG
Subjt:  -GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

Q9LGE6 Probable U6 snRNA-associated Sm-like protein LSm49.9e-5882.78Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEET-KSRADRKPPGVGRGRG
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGD+FWRMPECYIRGNTIKYLRVPDEVIDKVQEET KSR+DR+PPGVGRGRG
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEET-KSRADRKPPGVGRGRG

Query:  RGR-EDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        RG     PGGR   G+GRG DDG     GGRG+GG GGK G  + GGRGRG
Subjt:  RGR-EDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

Q9QXA5 U6 snRNA-associated Sm-like protein LSm43.5e-3964.29Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQ HPMLVELKNGETYNGHLV+CD WMNI+LREVICTS+DGD+FWRMPECYIRG+TIKYLR+PDE+ID V+EE            GRGRG 
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRG-FDDGAKAASGGRGKGGPGGKPG
         ++    GR   G GRG F    +    G G+G P  KPG
Subjt:  GREDGPGGRPAKGMGRG-FDDGAKAASGGRGKGGPGGKPG

Q9ZRU9 Probable U6 snRNA-associated Sm-like protein LSm41.1e-5681.33Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELK+GETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMP+CYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRG-FDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        GRE+G G R  +G GR      AKA    RG+G   GK G    GGRGRG
Subjt:  GREDGPGGRPAKGMGRG-FDDGAKAASGGRGKGGPGGKPGANRVGGRGRG

Arabidopsis top hitse value%identityAlignment
AT1G20580.1 Small nuclear ribonucleoprotein family protein9.3e-1134.06Show/hide
Query:  LPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPD-----EVIDKVQEETKSRADRKPPGVGR
        +P+ LL  A GH + VELK+GE Y G ++ C+   N  L ++  T+KDG +  ++   +IRG+ ++++ +PD      +  ++    K ++     GVGR
Subjt:  LPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPD-----EVIDKVQEETKSRADRKPPGVGR

Query:  GRGRGREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGP
        GRG  R     G+PA G GRG        +GGRG   P
Subjt:  GRGRGREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGP

AT1G76300.1 snRNP core protein SMD31.8e-0932.09Show/hide
Query:  LPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVID-KVQEETKSRADRKPPGVGRGRGR
        +P+ LL  + GH + VE+K+GE Y G ++ C+   N  L  +  T+KDG +  ++   +IRG+ +++L +PD + +  + ++ + +      GVGRGRG 
Subjt:  LPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVID-KVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGP
                  AKG GRG         GGRG   P
Subjt:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGP

AT4G02840.1 Small nuclear ribonucleoprotein family protein9.6e-0839.6Show/hide
Query:  VELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPD--EVIDKVQEET------KSRADRKPPGVGRGRGRGREDGPGG
        +ELKNG   +G +   D  MN HL+ V  T K G     +    +RGN I+Y  +PD   +   + E+T      K  A + P G GRGRGRGR  G GG
Subjt:  VELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPD--EVIDKVQEET------KSRADRKPPGVGRGRGRGREDGPGG

Query:  R
        R
Subjt:  R

AT4G02840.2 Small nuclear ribonucleoprotein family protein2.0e-0536.04Show/hide
Query:  VELKNGETYNGHLVN----------CDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPD--EVIDKVQEET------KSRADRKPPGVGRGR
        +ELKNG   +G + +           D  MN HL+ V  T K G     +    +RGN I+Y  +PD   +   + E+T      K  A + P G GRGR
Subjt:  VELKNGETYNGHLVN----------CDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPD--EVIDKVQEET------KSRADRKPPGVGRGR

Query:  GRGREDGPGGR
        GRGR  G GGR
Subjt:  GRGREDGPGGR

AT5G27720.1 Small nuclear ribonucleoprotein family protein2.3e-5477.85Show/hide
Query:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR
        MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE K+R DRKPPGVGRGRGR
Subjt:  MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGR

Query:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG
        G +DG               GA+    GRG+G   GK G NR  GRGRG
Subjt:  GREDGPGGRPAKGMGRGFDDGAKAASGGRGKGGPGGKPGANRVGGRGRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCAATCAAAAGCAAAAGCCCTAGAATCCGCCGCTCCTTTCCATCTCGGTGCCGCTCCCTCCGGCCATTCTTGTCTACGCCGCCGCAATCGCTCCACCGTGGGCTC
AGTTAAAGCCCAAATCGGAAGACCGAACCGATCGAATTGGTTGAAAATTACTTCTTCGAACCTGCCCAAAGACACATTGCGTTCACTTTTGAAACCGTTCGAAAATTTTC
TACTGTACTCCAAGAAGCGATTCTCTTCGGATCGAGGGTTCCAGTGGTTGTTTCTGCTTCTAGGAATCTTCTCCGTATCGTTAAAGATGCTTCCCCTTTCGCTCCTAAAG
ACTGCTCAAGGGCATCCCATGTTGGTGGAACTGAAAAATGGTGAGACTTACAACGGCCATTTGGTTAATTGTGATACATGGATGAACATTCATCTCCGGGAAGTCATCTG
CACTTCTAAAGATGGTGACCGGTTCTGGCGAATGCCTGAATGTTATATCCGTGGTAATACAATCAAGTATTTGCGAGTTCCCGATGAGGTTATTGATAAAGTACAGGAAG
AAACCAAAAGCCGAGCAGATAGGAAACCTCCGGGTGTTGGGCGTGGAAGAGGAAGAGGGCGTGAGGATGGCCCTGGTGGAAGACCAGCTAAAGGAATGGGGCGAGGCTTT
GATGATGGTGCTAAAGCTGCTTCTGGAGGACGTGGAAAAGGTGGCCCTGGTGGAAAACCTGGTGCCAACAGAGTTGGAGGCCGAGGCCGAGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCAATCAAAAGCAAAAGCCCTAGAATCCGCCGCTCCTTTCCATCTCGGTGCCGCTCCCTCCGGCCATTCTTGTCTACGCCGCCGCAATCGCTCCACCGTGGGCTC
AGTTAAAGCCCAAATCGGAAGACCGAACCGATCGAATTGGTTGAAAATTACTTCTTCGAACCTGCCCAAAGACACATTGCGTTCACTTTTGAAACCGTTCGAAAATTTTC
TACTGTACTCCAAGAAGCGATTCTCTTCGGATCGAGGGTTCCAGTGGTTGTTTCTGCTTCTAGGAATCTTCTCCGTATCGTTAAAGATGCTTCCCCTTTCGCTCCTAAAG
ACTGCTCAAGGGCATCCCATGTTGGTGGAACTGAAAAATGGTGAGACTTACAACGGCCATTTGGTTAATTGTGATACATGGATGAACATTCATCTCCGGGAAGTCATCTG
CACTTCTAAAGATGGTGACCGGTTCTGGCGAATGCCTGAATGTTATATCCGTGGTAATACAATCAAGTATTTGCGAGTTCCCGATGAGGTTATTGATAAAGTACAGGAAG
AAACCAAAAGCCGAGCAGATAGGAAACCTCCGGGTGTTGGGCGTGGAAGAGGAAGAGGGCGTGAGGATGGCCCTGGTGGAAGACCAGCTAAAGGAATGGGGCGAGGCTTT
GATGATGGTGCTAAAGCTGCTTCTGGAGGACGTGGAAAAGGTGGCCCTGGTGGAAAACCTGGTGCCAACAGAGTTGGAGGCCGAGGCCGAGGGTGA
Protein sequenceShow/hide protein sequence
MKQSKAKALESAAPFHLGAAPSGHSCLRRRNRSTVGSVKAQIGRPNRSNWLKITSSNLPKDTLRSLLKPFENFLLYSKKRFSSDRGFQWLFLLLGIFSVSLKMLPLSLLK
TAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEETKSRADRKPPGVGRGRGRGREDGPGGRPAKGMGRGF
DDGAKAASGGRGKGGPGGKPGANRVGGRGRG