CuGenDBv2

Sgr023468 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023468
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: 30S ribosomal protein S18
Genome location: tig00000892:3563578..3566140
RNA-Seq Expression: Sgr023468
Synteny: Sgr023468
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0005763 - mitochondrial small ribosomal subunit (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0070181 - small ribosomal subunit rRNA binding (molecular function)
InterPro domains:
IPR001648 - Ribosomal protein S18
IPR036870 - Ribosomal protein S18 superfamily
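
The genome location in this record uses a contig:start..end notation. The following is a minimal Python sketch for splitting such a string into its parts. The function name parse_location is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the page does not state.

```python
import re


def parse_location(loc):
    """Split 'contig:start..end' into (contig, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The record itself does not confirm this.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return contig, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("tig00000892:3563578..3566140"))
```

Under the inclusive-coordinate assumption, the gene spans 2563 positions on contig tig00000892.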


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6570652.1 hypothetical protein SDJN03_29567, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 3.5e-108; identity: 89.22%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR
        DDKGKPDSFESADDFERRIFGGVS GD GNDAFFEKLDR+GKPRERIGSRLSG NNFQ+LYGLDDNLNTLSDGMDGKLKKA+TYFEFD EEIAKDDYTFR
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR

Query:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
        ADMSFKPGSTYE+K    +  GVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFIT++GIIIKRSKT ISAKAQRKVAREIKTARAFGLMPFTTMGTK+F
Subjt:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF

Query:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS
        VFGKTME LDKDYEYEVFDN    DGERPL S
Subjt:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS

XP_022148322.1 uncharacterized protein LOC111017003 [Momordica charantia] (e-value: 7.2e-109; identity: 89.66%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR
        D+K KPDSFESADDFERRIFGGVSL DSGN+AFFEKLDRLGKPRER+GSRLSG NNFQALYGLDD LNTLSDGMDGKLKKAATYFEFDPEEI+KDDYTFR
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR

Query:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
        ADMSFKPGSTYEIK       GVRKPPKRVEFQVTTEEVL+KADFRNVRFLANFIT++GII KRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
Subjt:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF

Query:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS
        VFGKTME LDKDYEYEVFDNTADAD   PL S
Subjt:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS

XP_022943865.1 uncharacterized protein LOC111448465 [Cucurbita moschata] (e-value: 2.7e-108; identity: 89.22%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR
        DDKGKPDSFESADDFERRIFGGVS GDSGNDAFFEKLDR+GKPRERIGSRLSG NNFQ+LYGLDDNLNTLSDGMDGKLKKA+TYFEFD EEIAKDDYTFR
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR

Query:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
        ADMSFKPGSTYE+K    +  GVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFIT++GIIIKRSKT ISAKAQRKVAREIKTARAFGLMPFTTMGTK+F
Subjt:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF

Query:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS
        VFGKTME LDKDYEYEVFDN    DGE PL S
Subjt:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS

XP_022947026.1 uncharacterized protein LOC111451027 isoform X1 [Cucurbita moschata] (e-value: 1.9e-109; identity: 89.96%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR
        DD+GK +SFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSG NNFQALYG++DNLNTLSDGMDGKLKKA+TYFEFDPEEIAKDDYTFR
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR

Query:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
        ADMSFKPGSTYEIK    +  GVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFIT++GII KRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTK+F
Subjt:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF

Query:  VFGKTMEYLDKDYEYEVFDNTADADGERP
        VFGKTMEYLDKDYEYEVF+N  DAD  RP
Subjt:  VFGKTMEYLDKDYEYEVFDNTADADGERP

XP_022986919.1 uncharacterized protein LOC111484513 [Cucurbita maxima] (e-value: 3.5e-108; identity: 90.09%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR
        DDKGKPDSFESADDFERRIFGGVS GDSGNDAFFEKLDRLGKPRERIGSRLSG NNFQALYGLDDNLNTLSDGMDGKLKKA+TYFEFD EEIAKDDYTFR
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR

Query:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
        ADMSFKPGSTYEIK    +  GVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFIT++GIIIKRSKT ISAKAQRKVAREIKTARAFGLMPFTTMGTK+F
Subjt:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF

Query:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS
        VFGKTME LDKDYEYEVFDN    DG  PL S
Subjt:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1D4R9 uncharacterized protein LOC111017003 (e-value: 3.5e-109; identity: 89.66%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR
        D+K KPDSFESADDFERRIFGGVSL DSGN+AFFEKLDRLGKPRER+GSRLSG NNFQALYGLDD LNTLSDGMDGKLKKAATYFEFDPEEI+KDDYTFR
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR

Query:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
        ADMSFKPGSTYEIK       GVRKPPKRVEFQVTTEEVL+KADFRNVRFLANFIT++GII KRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
Subjt:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF

Query:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS
        VFGKTME LDKDYEYEVFDNTADAD   PL S
Subjt:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS

A0A6J1FYC1 uncharacterized protein LOC111448465 (e-value: 1.3e-108; identity: 89.22%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR
        DDKGKPDSFESADDFERRIFGGVS GDSGNDAFFEKLDR+GKPRERIGSRLSG NNFQ+LYGLDDNLNTLSDGMDGKLKKA+TYFEFD EEIAKDDYTFR
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR

Query:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
        ADMSFKPGSTYE+K    +  GVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFIT++GIIIKRSKT ISAKAQRKVAREIKTARAFGLMPFTTMGTK+F
Subjt:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF

Query:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS
        VFGKTME LDKDYEYEVFDN    DGE PL S
Subjt:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS

A0A6J1G586 uncharacterized protein LOC111451027 isoform X1 (e-value: 9.1e-110; identity: 89.96%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR
        DD+GK +SFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSG NNFQALYG++DNLNTLSDGMDGKLKKA+TYFEFDPEEIAKDDYTFR
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR

Query:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
        ADMSFKPGSTYEIK    +  GVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFIT++GII KRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTK+F
Subjt:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF

Query:  VFGKTMEYLDKDYEYEVFDNTADADGERP
        VFGKTMEYLDKDYEYEVF+N  DAD  RP
Subjt:  VFGKTMEYLDKDYEYEVFDNTADADGERP

A0A6J1JHF0 uncharacterized protein LOC111484513 (e-value: 1.7e-108; identity: 90.09%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR
        DDKGKPDSFESADDFERRIFGGVS GDSGNDAFFEKLDRLGKPRERIGSRLSG NNFQALYGLDDNLNTLSDGMDGKLKKA+TYFEFD EEIAKDDYTFR
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR

Query:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
        ADMSFKPGSTYEIK    +  GVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFIT++GIIIKRSKT ISAKAQRKVAREIKTARAFGLMPFTTMGTK+F
Subjt:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF

Query:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS
        VFGKTME LDKDYEYEVFDN    DG  PL S
Subjt:  VFGKTMEYLDKDYEYEVFDNTADADGERPLRS

A0A6J1KYL3 LOW QUALITY PROTEIN: uncharacterized protein LOC111499912 (e-value: 8.0e-106; identity: 87.77%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR
        DD+GK +SFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSG NNFQALYG++DNLNTLSDGMDGKLKKA+TYFEFDPEEIAKDDYTFR
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFR

Query:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF
        ADMSFKPGSTYEIK    +  GVRK PKRVEF V TEEVLR ADFRNVRFLANFIT++GII KRSKTGISAKAQRKVAREIKTARAFGLMP TTMGTK+F
Subjt:  ADMSFKPGSTYEIKARS-SATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTF

Query:  VFGKTMEYLDKDYEYEVFDNTADADGERP
        VFGKTMEYLDKDYEYEVF+N  DAD  RP
Subjt:  VFGKTMEYLDKDYEYEVFDNTADADGERP

SwissProt top hits (e-value, % identity, alignment)
P59502 30S ribosomal protein S18 (e-value: 4.1e-06; identity: 42.42%)
Query:  KRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT
        +R +F   T E +++ D++++  L N+ITESG I+    TG SA+ QR++AR IK AR   L+P+T
Subjt:  KRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT

P62659 30S ribosomal protein S18 (e-value: 9.7e-08; identity: 41.67%)
Query:  STYEIKARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT
        ST   K +  A   R+P ++ + + T  E     D+RNV  L  F++E+G I+ R +TG+SAK QR +A+ IK AR  GL+PFT
Subjt:  STYEIKARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT

P80382 30S ribosomal protein S18 (e-value: 2.8e-07; identity: 40.48%)
Query:  STYEIKARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT
        ST   K +  A   R+P ++ + + T  E     D+RNV  L  F++E+G I+ R +TG+S K QR +A+ IK AR  GL+PFT
Subjt:  STYEIKARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT

Q5SLQ0 30S ribosomal protein S18 (e-value: 9.7e-08; identity: 41.67%)
Query:  STYEIKARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT
        ST   K +  A   R+P ++ + + T  E     D+RNV  L  F++E+G I+ R +TG+SAK QR +A+ IK AR  GL+PFT
Subjt:  STYEIKARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT

Q9RY50 30S ribosomal protein S18 (e-value: 1.1e-06; identity: 35.29%)
Query:  GSTYEIKARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT
        G++ E K R       + PK   F +   E+    D+++V+ L  F++++G I+ R +TG+SAK QR++A+ IK AR   L+P+T
Subjt:  GSTYEIKARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT
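
The % identity values in these tables can be related to the alignment blocks: in BLAST-style output, the middle line shows a letter where query and subject residues are identical, a "+" for a conservative substitution, and a space for a mismatch or gap. The following is a minimal Python sketch, assuming % identity is identical positions divided by the aligned length (gaps included). The function name percent_identity is hypothetical; the example strings are copied from the P59502 hit above.

```python
def percent_identity(query, midline):
    """Percent identity from one BLAST-style alignment block.

    Letters on the midline mark identical residues; '+' and spaces
    are not identities. Assumption: the denominator is the aligned
    length, taken here as the length of the query line.
    """
    aligned_length = len(query)
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / aligned_length


if __name__ == "__main__":
    # Query and midline of the SwissProt hit P59502 in this record.
    query = "KRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFT"
    midline = "+R +F   T E +++ D++++  L N+ITESG I+    TG SA+ QR++AR IK AR   L+P+T"
    print(f"{percent_identity(query, midline):.2f}")
```

For this hit the midline carries 28 letters over 66 aligned positions, which gives 42.42 and agrees with the value listed for P59502.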

Arabidopsis top hits (e-value, % identity, alignment)
AT1G07210.1 Ribosomal protein S18 (e-value: 9.2e-46; identity: 48.05%)
Query:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERI-------GSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIA
        D      SF+S+D+ +  +FG  +  D  ++ FF+ L +  K +          GSR SG  +       D+  +  SDG+DGKLK+AA  +  D  +  
Subjt:  DDKGKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERI-------GSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIA

Query:  KDDYTFRADMSFKPGSTYEIKARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTT
        K +Y+FR D +   G     + +      R+   + E  VTTEEVL+ ADFRNVRFLANFITE+GIIIKR +TGISAKAQRK+AREIKTARAFGLMPFTT
Subjt:  KDDYTFRADMSFKPGSTYEIKARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTT

Query:  MGTKTFVFGKTMEYLDKDYEYEVFDNTADAD
        MGTK F FGKTME  D+D+EYEV D+  + D
Subjt:  MGTKTFVFGKTMEYLDKDYEYEVFDNTADAD


Sequences
CDS sequence
ATGCCGTTTTTCAGTTTAAGATTTCGGGCCTATAGGCCCAAAAAATTTGGAGCTTTTTCGGATGGTTCACATCCGCGGCTCAAGCCCAGCGAGCGAACGTTCTGCTTCAC
TTGCGTTGCCGCCCTGCGAGGCCCCTTCCCTAACCGGCGGCAGCTGAAGCAGCAACCCTACGGGACTACGTTCCAACTTCATTCAAGCTTTTGCATCAGTGAAGTACTTC
TCAGGTTGCTTTGCGATCCCTTAGTGGTGTCTTATCGCAGAGATTTAAGCAAACCTCTGCGCTCAAAAACCTTTCCACAAACTCCGCTCGTGATAGTGGTTGATGATAAG
GGAAAACCCGATTCATTTGAGTCTGCTGATGACTTTGAACGTCGGATATTTGGTGGTGTTTCTTTGGGCGATTCCGGAAACGATGCTTTCTTTGAGAAGCTTGATAGACT
TGGCAAGCCTCGTGAAAGAATAGGTTCAAGACTGAGTGGAGCAAACAATTTTCAGGCGTTGTATGGTCTTGATGATAATCTTAACACGTTGTCGGATGGGATGGATGGCA
AGTTGAAGAAAGCTGCCACTTATTTTGAGTTTGATCCTGAGGAAATAGCAAAAGATGATTATACTTTCAGAGCAGATATGTCTTTTAAGCCTGGATCGACGTACGAAATC
AAGGCACGTAGCAGTGCAACTGGCGTACGTAAACCTCCCAAAAGGGTTGAGTTTCAGGTGACAACGGAGGAGGTCTTAAGAAAAGCTGATTTCAGGAATGTTAGATTCCT
GGCAAACTTTATAACCGAGTCTGGGATCATTATCAAGAGGAGCAAGACAGGTATCAGTGCCAAGGCACAGAGGAAAGTTGCCCGAGAGATCAAAACTGCACGAGCTTTTG
GCTTAATGCCTTTCACAACAATGGGAACCAAAACATTTGTTTTTGGGAAAACCATGGAGTATCTTGATAAGGACTATGAGTATGAAGTGTTTGATAATACTGCTGATGCT
GATGGAGAACGCCCGCTCAGATCCTAG
mRNA sequence
ATGCCGTTTTTCAGTTTAAGATTTCGGGCCTATAGGCCCAAAAAATTTGGAGCTTTTTCGGATGGTTCACATCCGCGGCTCAAGCCCAGCGAGCGAACGTTCTGCTTCAC
TTGCGTTGCCGCCCTGCGAGGCCCCTTCCCTAACCGGCGGCAGCTGAAGCAGCAACCCTACGGGACTACGTTCCAACTTCATTCAAGCTTTTGCATCAGTGAAGTACTTC
TCAGGTTGCTTTGCGATCCCTTAGTGGTGTCTTATCGCAGAGATTTAAGCAAACCTCTGCGCTCAAAAACCTTTCCACAAACTCCGCTCGTGATAGTGGTTGATGATAAG
GGAAAACCCGATTCATTTGAGTCTGCTGATGACTTTGAACGTCGGATATTTGGTGGTGTTTCTTTGGGCGATTCCGGAAACGATGCTTTCTTTGAGAAGCTTGATAGACT
TGGCAAGCCTCGTGAAAGAATAGGTTCAAGACTGAGTGGAGCAAACAATTTTCAGGCGTTGTATGGTCTTGATGATAATCTTAACACGTTGTCGGATGGGATGGATGGCA
AGTTGAAGAAAGCTGCCACTTATTTTGAGTTTGATCCTGAGGAAATAGCAAAAGATGATTATACTTTCAGAGCAGATATGTCTTTTAAGCCTGGATCGACGTACGAAATC
AAGGCACGTAGCAGTGCAACTGGCGTACGTAAACCTCCCAAAAGGGTTGAGTTTCAGGTGACAACGGAGGAGGTCTTAAGAAAAGCTGATTTCAGGAATGTTAGATTCCT
GGCAAACTTTATAACCGAGTCTGGGATCATTATCAAGAGGAGCAAGACAGGTATCAGTGCCAAGGCACAGAGGAAAGTTGCCCGAGAGATCAAAACTGCACGAGCTTTTG
GCTTAATGCCTTTCACAACAATGGGAACCAAAACATTTGTTTTTGGGAAAACCATGGAGTATCTTGATAAGGACTATGAGTATGAAGTGTTTGATAATACTGCTGATGCT
GATGGAGAACGCCCGCTCAGATCCTAG
Protein sequence
MPFFSLRFRAYRPKKFGAFSDGSHPRLKPSERTFCFTCVAALRGPFPNRRQLKQQPYGTTFQLHSSFCISEVLLRLLCDPLVVSYRRDLSKPLRSKTFPQTPLVIVVDDK
GKPDSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGANNFQALYGLDDNLNTLSDGMDGKLKKAATYFEFDPEEIAKDDYTFRADMSFKPGSTYEI
KARSSATGVRKPPKRVEFQVTTEEVLRKADFRNVRFLANFITESGIIIKRSKTGISAKAQRKVAREIKTARAFGLMPFTTMGTKTFVFGKTMEYLDKDYEYEVFDNTADA
DGERPLRS
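
The protein sequence above appears to be the conceptual translation of the CDS. The following is a minimal Python sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the page does not state which table was used. The function name translate is hypothetical, and the two fragments are the first 30 and last 27 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGCCGTTTTTCAGTTTAAGATTTCGGGCC"))  # start of the CDS
    print(translate("GATGGAGAACGCCCGCTCAGATCCTAG"))     # end of the CDS
```

The two fragments translate to MPFFSLRFRA and DGERPLRS, which match the first ten and last eight residues of the protein sequence in this record.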