CuGenDBv2

Sgr024739 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024739
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Ribosomal protein
Genome location: tig00002486:2361150..2366404
RNA-Seq Expression: Sgr024739
Synteny: Sgr024739
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0008097 - 5S rRNA binding (molecular function)
InterPro domains:
IPR005484 - Ribosomal protein L18
IPR036967 - Ribosomal protein S11 superfamily
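
The genome location string above can be split into a contig name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc: str):
    """Split 'contig:start..end' into (contig, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    contig, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return contig, start, end, end - start + 1

contig, start, end, span = parse_location("tig00002486:2361150..2366404")
print(contig, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 5255 bp on contig tig00002486.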


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597446.1 hypothetical protein SDJN03_10626, partial [Cucurbita argyrosperma subsp. sororia] | e value: 9.0e-100 | % identity: 66.67
Query:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN
        K++A+G L SENERVAHSDSI NGSS PQ SPLFP+                   VS GLARAWEGREKG SG +NRYFLED DHDP+DYCDS+FDD+DN
Subjt:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN

Query:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVER-ENDDVERR------------------
        MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKED QQ++SK+NDKP+N L +GHVKLPEH+KNKYVI+ER EN+D E++                  
Subjt:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVER-ENDDVERR------------------

Query:  ---------------SSGHPRLISLL-----DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQ
                         G  +++++      DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKL+IVLQSVIDNGINVKVKIKQQ
Subjt:  ---------------SSGHPRLISLL-----DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQ

Query:  KRPGKL
        KRP K+
Subjt:  KRPGKL

XP_008459355.1 PREDICTED: uncharacterized protein LOC103498514 [Cucumis melo] | e value: 6.0e-104 | % identity: 68.61
Query:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN
        K+   G + SENERVAHS+SIANGSSRPQL+PLFPD                   VSCGLARAWEGREKGWSGLENRY LED  HDPVDYCDSDFDD+DN
Subjt:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN

Query:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVER--RSSGHPRLISLL------
        MRIRGNLFYKLDRGSKEF+EYS DFHRKKKSIKEKE+ ++SRSK+NDK D  LP+G VKLPEH+KNKYVIVERENDDVE+  R+    +L  L       
Subjt:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVER--RSSGHPRLISLL------

Query:  ------------------------------DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK
                                      DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNL+YTPRKGERIEGKL++VLQSVIDNGINVKVKIKQQK
Subjt:  ------------------------------DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK

Query:  RPGKLEVHL
        RP KLEVHL
Subjt:  RPGKLEVHL

XP_022934433.1 uncharacterized protein LOC111441610 isoform X1 [Cucurbita moschata] | e value: 6.2e-101 | % identity: 67.32
Query:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN
        K++A+G LRSENERVAHSDSI NGSS PQ SPLFP+                   VS GLARAWEGREKG SG +NRYFLED DHDP+DYCDS+FDD+DN
Subjt:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN

Query:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVER-ENDDVERR------------------
        MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKED QQ++SK+NDKP+N L +GHVKLPEHVKNKYVI+ER EN+D E++                  
Subjt:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVER-ENDDVERR------------------

Query:  ---------------SSGHPRLISLL-----DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQ
                         G  +++++      DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKL+IVLQSVIDNGINVKVKIKQQ
Subjt:  ---------------SSGHPRLISLL-----DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQ

Query:  KRPGKL
        KRP K+
Subjt:  KRPGKL

XP_022973751.1 uncharacterized protein LOC111472322 [Cucurbita maxima] | e value: 3.4e-99 | % identity: 66.89
Query:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN
        K++A+G L SENERVAHSDSI NGSS  Q SPLFP+                   VS  LARAWEGREKG SG +NRYFLED DHDP+DYCDS+FDD+DN
Subjt:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN

Query:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVERR-------------------
        MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKED QQ++SK+NDKP+N L +GHVKLPEHVKNKYVI+EREN+D E++                   
Subjt:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVERR-------------------

Query:  ----SSGHPRLISL---------------LDMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK
            S    R   +                DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKL+IVLQSVIDNGINVKVKIKQQK
Subjt:  ----SSGHPRLISL---------------LDMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK

Query:  RPGKL
        RP K+
Subjt:  RPGKL

XP_038896573.1 uncharacterized protein LOC120084825 [Benincasa hispida] | e value: 1.3e-101 | % identity: 67.86
Query:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN
        K++  G L SENERVAHSDSIAN SSRPQ SPLF D                   VSCGLARAWEGRE+GW+G ENR FLED  HDP+DYCD DFD++DN
Subjt:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN

Query:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVERR-------------------
        MRIRGNLFYKLDRGSKEFEEYS DFHRKKKS+KEKED+Q+S+SKINDK DNRL NGHVKLPEHVKNKYVIVERE+DDVE++                   
Subjt:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVERR-------------------

Query:  ----SSGHPRLISL---------------LDMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK
            S    R   +                DMKFDLTSRKDSSAC AVGAVLAQRALADDIHNLVYTPRKGERIEGKL+IVLQSVIDNGINVKVKIKQQK
Subjt:  ----SSGHPRLISL---------------LDMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK

Query:  RPGKLEVH
        RP K EVH
Subjt:  RPGKLEVH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KSG6 Uncharacterized protein | e value: 6.5e-96 | % identity: 65.9
Query:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN
        K+   G + SENERVAHS+SIANGS RPQL+PLFPD                   VSCGLARAWE REKGWSG ENRY LED  HDPVDYCDSDFDD+DN
Subjt:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN

Query:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVERR----------SSGHP----
        MRIRGNLFYKLD+ SKEFEE S DFHRKKKS+KEKED+ + RSK+NDK D  L +G VKLPEH+KNKYVIVERENDDVE++          S  H     
Subjt:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVERR----------SSGHP----

Query:  ---------------RLISLL---------DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK
                       R+ S +         DMKFDLTSRKDSSACAAVGAVLAQRAL DDIHNL+YTPRKGERIEGKL+IVLQSVIDNGINV VKIKQQK
Subjt:  ---------------RLISLL---------DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK

Query:  RPGKL
        RP KL
Subjt:  RPGKL

A0A1S3CAH6 uncharacterized protein LOC103498514 | e value: 2.9e-104 | % identity: 68.61
Query:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN
        K+   G + SENERVAHS+SIANGSSRPQL+PLFPD                   VSCGLARAWEGREKGWSGLENRY LED  HDPVDYCDSDFDD+DN
Subjt:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN

Query:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVER--RSSGHPRLISLL------
        MRIRGNLFYKLDRGSKEF+EYS DFHRKKKSIKEKE+ ++SRSK+NDK D  LP+G VKLPEH+KNKYVIVERENDDVE+  R+    +L  L       
Subjt:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVER--RSSGHPRLISLL------

Query:  ------------------------------DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK
                                      DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNL+YTPRKGERIEGKL++VLQSVIDNGINVKVKIKQQK
Subjt:  ------------------------------DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK

Query:  RPGKLEVHL
        RP KLEVHL
Subjt:  RPGKLEVHL

A0A6J1C8X9 uncharacterized protein LOC111009286 | e value: 2.6e-97 | % identity: 66.89
Query:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN
        K++ +G L SENERV+ SD IANGS R QLSPLFPD                   VSCGLA AWEGREK WSGLEN+YFLE+ DHDPVDYCDSD DD+DN
Subjt:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN

Query:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVERR-------------------
        MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSI  KED +QSRS+INDKPDN L +G+VKLPEHVKNKYVIVER+NDDVE++                   
Subjt:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVERR-------------------

Query:  ----SSGHPRLISL---------------LDMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK
            S    R   +                DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRK ERIEGKL+IVLQSVIDNG+NVKVKIKQQK
Subjt:  ----SSGHPRLISL---------------LDMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK

Query:  RPGKL
        RP K+
Subjt:  RPGKL

A0A6J1F7N1 uncharacterized protein LOC111441610 isoform X1 | e value: 3.0e-101 | % identity: 67.32
Query:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN
        K++A+G LRSENERVAHSDSI NGSS PQ SPLFP+                   VS GLARAWEGREKG SG +NRYFLED DHDP+DYCDS+FDD+DN
Subjt:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN

Query:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVER-ENDDVERR------------------
        MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKED QQ++SK+NDKP+N L +GHVKLPEHVKNKYVI+ER EN+D E++                  
Subjt:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVER-ENDDVERR------------------

Query:  ---------------SSGHPRLISLL-----DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQ
                         G  +++++      DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKL+IVLQSVIDNGINVKVKIKQQ
Subjt:  ---------------SSGHPRLISLL-----DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQ

Query:  KRPGKL
        KRP K+
Subjt:  KRPGKL

A0A6J1I8E0 uncharacterized protein LOC111472322 | e value: 1.7e-99 | % identity: 66.89
Query:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN
        K++A+G L SENERVAHSDSI NGSS  Q SPLFP+                   VS  LARAWEGREKG SG +NRYFLED DHDP+DYCDS+FDD+DN
Subjt:  KSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPD-------------------VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDN

Query:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVERR-------------------
        MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKED QQ++SK+NDKP+N L +GHVKLPEHVKNKYVI+EREN+D E++                   
Subjt:  MRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKNKYVIVERENDDVERR-------------------

Query:  ----SSGHPRLISL---------------LDMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK
            S    R   +                DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKL+IVLQSVIDNGINVKVKIKQQK
Subjt:  ----SSGHPRLISL---------------LDMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK

Query:  RPGKL
        RP K+
Subjt:  RPGKL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G08845.1 Ribosomal L18p/L5e family protein | e value: 1.1e-10 | % identity: 46.88
Query:  DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKV
        D+K  L SR D  AC ++G +L++RA   D++   YTPR  ++ EGK++ V+QS+IDNGI+VK+
Subjt:  DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKV

AT1G08845.2 Ribosomal L18p/L5e family protein | e value: 1.1e-10 | % identity: 46.88
Query:  DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKV
        D+K  L SR D  AC ++G +L++RA   D++   YTPR  ++ EGK++ V+QS+IDNGI+VK+
Subjt:  DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKV

AT3G20230.1 Ribosomal L18p/L5e family protein | e value: 1.0e-08 | % identity: 41.67
Query:  DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGI
        D++ ++ S  D+ AC  +G ++A+R++  D++ + Y PRKGERIEGKL IV+ ++ ++GI
Subjt:  DMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGI

AT3G22450.1 Ribosomal L18p/L5e family protein | e value: 6.1e-38 | % identity: 41.98
Query:  VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDNMRIRGNLFYKLDRGSKEFEEYSFDFHRK---KKSIKEKEDLQQSRSKINDKPDNR
        VS  L++AW  + +  +   +   ++ V    +D  D D D++DNMRIRG+LF+KLDRGSKEFEEY++DFHRK   K + KE+   ++++ K ++K + R
Subjt:  VSCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDNMRIRGNLFYKLDRGSKEFEEYSFDFHRK---KKSIKEKEDLQQSRSKINDKPDNR

Query:  ---------------LPNGHVKLPEHV---------------KNKYVIVERENDDVERRSSGHPRLIS---LLDMKFDLTSRKDSSACAAVGAVLAQRAL
                        PN  VK  E                  + Y+  E     V  R +     ++     DMKFDL SR++ +ACAAVGAVLAQR+L
Subjt:  ---------------LPNGHVKLPEHV---------------KNKYVIVERENDDVERRSSGHPRLIS---LLDMKFDLTSRKDSSACAAVGAVLAQRAL

Query:  ADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK
         DDIH+++YTPRKG++IEGKL++VLQ++IDNG+NVKVK+KQ+K
Subjt:  ADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQK


Sequences
CDS sequence
ATGCCCCGCTATCTCCATTGTTCCTTGATGGTAAACCCCATTCGGTTGCTTAATTATGATGTTGAACTTGTGGATGATGATATTTGGGCAGTTTTCTGTGGCATTGATGA
TATTGACAACATGAGAATACGTGGACATCTCTTCTATAAACTTGACCGAAGTTCCAAAGAGTTTGAAGAAAATAGCTCGGATTTTCATCAGAAGAAAAAATCTCCAAAAC
CCGATAATAGTTTGACGAATGGCCATGTAAAGCCTCCTGAACCTGTGAAGAATATACATGTAATAGTTGAAAGGGAGAACGATGGTGATGTTGAGAAGAAGAAGAAGCTC
AGGACACCCATATTTAATCAGCTTACTGGTCCTTACCATGAGCCGTTTTGTTTGATATATACATTTCAAAGGCCTCAGTTCGCGCCTGCATTATTCACCGAGTTACTAGT
AAGGTTGTTGCTGTGGCTCATTCAATTTCTAAGGATTCCTCTGCCGGTGCTGCTGTGGGGGCTGTTTTGGGTCAGGGAGCACTGGGTCGTGATATCCACAATTTGCCACA
CACAGAAGCGCATAGCCACGCTCCACAACGTCATCGCTCAGCATGCCTTCACTCTGATCAACGGCGCCGCTGACCAGGCGAGCAGGGCAAGTCATGCACACTCCAAGCTT
GCAATCATGAGGCACAGACAAGCCAGAGTCCAATGCACTGCTCAATATGCTCTCGTCGTCTTCAACCTCCAGCTGGGTGGTCTGCCCCTCGTGCTCAATCACCACTTTAG
GAAAGCTTGGTGGGTTGCTTTTGTTTTAGCGGAGAGAAAGCGGATGGAAGAAGATGAAGAGTTGCCATGGCTTTTCTCTGCTTGGGGACGAAATGAAAAGAGGTGTTTTG
TGAGGTTGTGGCCGTTCGAGGGCGTTGGTGAAGGGATAAGTACGCTTCGTTCTCTCTGCTCATCTCTTTCTGCCTTCCTTCGACCACTATGTTCAAGTTTACGATTGAAT
GCTTTTAATTTCTTTGCGGATCATGTACCTGTTTTGCTTGTGGCGTCGCTGAAATTAGAAATTAGGGATTTATTCTATCGTACTGAATCGTTTGTCCCCAAGTATAGTAA
TTGGACTGCCGAGGCATTAGCTGTGGCCATTTCTTATCCTTGGTTGACTTTCGCCACTAGGTTTCAAGTTCTGACCACTGTTGATGAATGGGAGGAGAAACTTTTGCTCA
AAAGCATCGCCCAAGGGCATCTTAGAAGCGAGAATGAAAGGGTAGCTCATTCAGATAGCATTGCAAATGGTTCATCCAGGCCTCAGTTATCTCCATTGTTCCCTGATGTT
TCATGTGGTTTAGCACGAGCATGGGAAGGGAGAGAGAAGGGCTGGTCTGGCTTGGAAAATAGATACTTTTTAGAGGACGTGGATCATGACCCAGTTGATTATTGTGATTC
AGATTTTGATGATGTTGACAATATGAGAATACGTGGAAATCTCTTTTATAAACTTGACCGAGGTTCCAAAGAGTTTGAAGAATATAGCTTTGATTTTCATCGTAAGAAAA
AATCTATTAAGGAAAAGGAAGATCTACAGCAAAGCAGAAGCAAAATCAATGATAAACCCGATAATCGTTTGCCAAATGGCCATGTAAAACTTCCTGAACATGTGAAGAAC
AAATATGTAATAGTTGAAAGGGAGAATGATGATGTTGAGAGAAGAAGCTCAGGACACCCACGTTTAATCAGCTTACTGGACATGAAATTTGACCTTACTTCAAGGAAGGA
TTCCTCTGCCTGTGCTGCTGTGGGGGCTGTTTTGGCCCAGAGAGCATTGGCTGATGATATTCACAATTTGGTCTACACCCCAAGAAAAGGCGAAAGGATAGAAGGGAAGC
TTAAAATTGTGCTTCAGTCTGTCATTGATAATGGGATTAATGTGAAGGTAAAGATTAAGCAGCAAAAGAGACCAGGAAAGCTGGAAGTCCACCTTTAG
mRNA sequence
ATGCCCCGCTATCTCCATTGTTCCTTGATGGTAAACCCCATTCGGTTGCTTAATTATGATGTTGAACTTGTGGATGATGATATTTGGGCAGTTTTCTGTGGCATTGATGA
TATTGACAACATGAGAATACGTGGACATCTCTTCTATAAACTTGACCGAAGTTCCAAAGAGTTTGAAGAAAATAGCTCGGATTTTCATCAGAAGAAAAAATCTCCAAAAC
CCGATAATAGTTTGACGAATGGCCATGTAAAGCCTCCTGAACCTGTGAAGAATATACATGTAATAGTTGAAAGGGAGAACGATGGTGATGTTGAGAAGAAGAAGAAGCTC
AGGACACCCATATTTAATCAGCTTACTGGTCCTTACCATGAGCCGTTTTGTTTGATATATACATTTCAAAGGCCTCAGTTCGCGCCTGCATTATTCACCGAGTTACTAGT
AAGGTTGTTGCTGTGGCTCATTCAATTTCTAAGGATTCCTCTGCCGGTGCTGCTGTGGGGGCTGTTTTGGGTCAGGGAGCACTGGGTCGTGATATCCACAATTTGCCACA
CACAGAAGCGCATAGCCACGCTCCACAACGTCATCGCTCAGCATGCCTTCACTCTGATCAACGGCGCCGCTGACCAGGCGAGCAGGGCAAGTCATGCACACTCCAAGCTT
GCAATCATGAGGCACAGACAAGCCAGAGTCCAATGCACTGCTCAATATGCTCTCGTCGTCTTCAACCTCCAGCTGGGTGGTCTGCCCCTCGTGCTCAATCACCACTTTAG
GAAAGCTTGGTGGGTTGCTTTTGTTTTAGCGGAGAGAAAGCGGATGGAAGAAGATGAAGAGTTGCCATGGCTTTTCTCTGCTTGGGGACGAAATGAAAAGAGGTGTTTTG
TGAGGTTGTGGCCGTTCGAGGGCGTTGGTGAAGGGATAAGTACGCTTCGTTCTCTCTGCTCATCTCTTTCTGCCTTCCTTCGACCACTATGTTCAAGTTTACGATTGAAT
GCTTTTAATTTCTTTGCGGATCATGTACCTGTTTTGCTTGTGGCGTCGCTGAAATTAGAAATTAGGGATTTATTCTATCGTACTGAATCGTTTGTCCCCAAGTATAGTAA
TTGGACTGCCGAGGCATTAGCTGTGGCCATTTCTTATCCTTGGTTGACTTTCGCCACTAGGTTTCAAGTTCTGACCACTGTTGATGAATGGGAGGAGAAACTTTTGCTCA
AAAGCATCGCCCAAGGGCATCTTAGAAGCGAGAATGAAAGGGTAGCTCATTCAGATAGCATTGCAAATGGTTCATCCAGGCCTCAGTTATCTCCATTGTTCCCTGATGTT
TCATGTGGTTTAGCACGAGCATGGGAAGGGAGAGAGAAGGGCTGGTCTGGCTTGGAAAATAGATACTTTTTAGAGGACGTGGATCATGACCCAGTTGATTATTGTGATTC
AGATTTTGATGATGTTGACAATATGAGAATACGTGGAAATCTCTTTTATAAACTTGACCGAGGTTCCAAAGAGTTTGAAGAATATAGCTTTGATTTTCATCGTAAGAAAA
AATCTATTAAGGAAAAGGAAGATCTACAGCAAAGCAGAAGCAAAATCAATGATAAACCCGATAATCGTTTGCCAAATGGCCATGTAAAACTTCCTGAACATGTGAAGAAC
AAATATGTAATAGTTGAAAGGGAGAATGATGATGTTGAGAGAAGAAGCTCAGGACACCCACGTTTAATCAGCTTACTGGACATGAAATTTGACCTTACTTCAAGGAAGGA
TTCCTCTGCCTGTGCTGCTGTGGGGGCTGTTTTGGCCCAGAGAGCATTGGCTGATGATATTCACAATTTGGTCTACACCCCAAGAAAAGGCGAAAGGATAGAAGGGAAGC
TTAAAATTGTGCTTCAGTCTGTCATTGATAATGGGATTAATGTGAAGGTAAAGATTAAGCAGCAAAAGAGACCAGGAAAGCTGGAAGTCCACCTTTAG
Protein sequence
MPRYLHCSLMVNPIRLLNYDVELVDDDIWAVFCGIDDIDNMRIRGHLFYKLDRSSKEFEENSSDFHQKKKSPKPDNSLTNGHVKPPEPVKNIHVIVERENDGDVEKKKKL
RTPIFNQLTGPYHEPFCLIYTFQRPQFAPALFTELLVRLLLWLIQFLRIPLPVLLWGLFWVREHWVVISTICHTQKRIATLHNVIAQHAFTLINGAADQASRASHAHSKL
AIMRHRQARVQCTAQYALVVFNLQLGGLPLVLNHHFRKAWWVAFVLAERKRMEEDEELPWLFSAWGRNEKRCFVRLWPFEGVGEGISTLRSLCSSLSAFLRPLCSSLRLN
AFNFFADHVPVLLVASLKLEIRDLFYRTESFVPKYSNWTAEALAVAISYPWLTFATRFQVLTTVDEWEEKLLLKSIAQGHLRSENERVAHSDSIANGSSRPQLSPLFPDV
SCGLARAWEGREKGWSGLENRYFLEDVDHDPVDYCDSDFDDVDNMRIRGNLFYKLDRGSKEFEEYSFDFHRKKKSIKEKEDLQQSRSKINDKPDNRLPNGHVKLPEHVKN
KYVIVERENDDVERRSSGHPRLISLLDMKFDLTSRKDSSACAAVGAVLAQRALADDIHNLVYTPRKGERIEGKLKIVLQSVIDNGINVKVKIKQQKRPGKLEVHL
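
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, and only short excerpts from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 18 nt of the CDS listed in this record.
print(translate("ATGCCCCGCTATCTCCATTGTTCCTTGATG"))
print(translate("CTGGAAGTCCACCTTTAG"))
```

The two excerpts translate to MPRYLHCSLM and LEVHL, which match the first ten and last five residues of the listed protein sequence. Passing the full CDS (with line breaks) to the same function would allow a complete comparison.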