; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037366 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037366
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Description60S ribosomal protein L36
Genome locationscaffold8:953264..966673
RNA-Seq ExpressionSpg037366
SyntenySpg037366
Gene Ontology termsGO:0002181 - cytoplasmic translation (biological process)
GO:0022625 - cytosolic large ribosomal subunit (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR000509 - Ribosomal protein L36e
IPR038097 - Ribosomal protein L36e domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574185.1 60S ribosomal protein L36-2, partial [Cucurbita argyrosperma subsp. sororia]8.1e-1283.33Show/hide
Query:  PSEKAMAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        PSEKAMAPKQ NTGLFVGLNKGHIVTKKELAPRPSDRKG    R L +
Subjt:  PSEKAMAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

KAG6574185.1 60S ribosomal protein L36-2, partial [Cucurbita argyrosperma subsp. sororia]9.5e-29100Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
        GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK

KAG7013242.1 60S ribosomal protein L36-2, partial [Cucurbita argyrosperma subsp. argyrosperma]9.5e-29100Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
        GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK

KAG7013242.1 60S ribosomal protein L36-2, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-0981.82Show/hide
Query:  AMAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        AMAPKQ NTGLFVGLNKGHIVTKKELAPRPSDRKG    R L +
Subjt:  AMAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

KAG7013242.1 60S ribosomal protein L36-2, partial [Cucurbita argyrosperma subsp. argyrosperma]9.5e-29100Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
        GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK

TYK14183.1 hypothetical protein E5676_scaffold8046G00070 [Cucumis melo var. makuwa]4.9e-3341.43Show/hide
Query:  RKLVIEERRGKRVHKVELEIGASISVRDCLEVAVKTGNPRGFWRRGRLDYAVLFFQVLENERRRFGLLSMDNFRRKRMAIYVPERFKGKGWGLLAGEISN
        RK+ IEER G+   K+EL+ G S  VRDCL  A    N   FW+R RL++A++FFQVLENE+ RF +LS+++F+ ++  I++PE  +G GW  LAGEIS 
Subjt:  RKLVIEERRGKRVHKVELEIGASISVRDCLEVAVKTGNPRGFWRRGRLDYAVLFFQVLENERRRFGLLSMDNFRRKRMAIYVPERFKGKGWGLLAGEISN

Query:  VLYGSNKIGKSTTGPIGNSDRREEVKILPNPNDQKGESGTTVVDEVFNKSEEFDFAFECSATVIIKRFSIKEPWFSIKDSIQRESIIGWMFKSFCANLAV
        +L  S  I + T      +  RE  K L          G  V        ++ +FAFE  +TVIIK+      W ++K  +++  +I   FK+FCANLAV
Subjt:  VLYGSNKIGKSTTGPIGNSDRREEVKILPNPNDQKGESGTTVVDEVFNKSEEFDFAFECSATVIIKRFSIKEPWFSIKDSIQRESIIGWMFKSFCANLAV

Query:  GIYESESAAK
        GI  S+S AK
Subjt:  GIYESESAAK

XP_022945858.1 60S ribosomal protein L36-2-like [Cucurbita moschata]6.4e-0981.4Show/hide
Query:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        MAPKQ NTGLFVGLNKGHIVTKKELAPRPSDRKG    R L +
Subjt:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

XP_022968526.1 60S ribosomal protein L36-2-like [Cucurbita maxima]2.4e-0879.07Show/hide
Query:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        MAPK  NTGLFVGLNKGHIVTKKELAPRPSDRKG    R L +
Subjt:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

XP_022968526.1 60S ribosomal protein L36-2-like [Cucurbita maxima]9.5e-29100Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
        GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK

TrEMBL top hitse value%identityAlignment
A0A5D3CQQ0 Uncharacterized protein2.4e-3341.43Show/hide
Query:  RKLVIEERRGKRVHKVELEIGASISVRDCLEVAVKTGNPRGFWRRGRLDYAVLFFQVLENERRRFGLLSMDNFRRKRMAIYVPERFKGKGWGLLAGEISN
        RK+ IEER G+   K+EL+ G S  VRDCL  A    N   FW+R RL++A++FFQVLENE+ RF +LS+++F+ ++  I++PE  +G GW  LAGEIS 
Subjt:  RKLVIEERRGKRVHKVELEIGASISVRDCLEVAVKTGNPRGFWRRGRLDYAVLFFQVLENERRRFGLLSMDNFRRKRMAIYVPERFKGKGWGLLAGEISN

Query:  VLYGSNKIGKSTTGPIGNSDRREEVKILPNPNDQKGESGTTVVDEVFNKSEEFDFAFECSATVIIKRFSIKEPWFSIKDSIQRESIIGWMFKSFCANLAV
        +L  S  I + T      +  RE  K L          G  V        ++ +FAFE  +TVIIK+      W ++K  +++  +I   FK+FCANLAV
Subjt:  VLYGSNKIGKSTTGPIGNSDRREEVKILPNPNDQKGESGTTVVDEVFNKSEEFDFAFECSATVIIKRFSIKEPWFSIKDSIQRESIIGWMFKSFCANLAV

Query:  GIYESESAAK
        GI  S+S AK
Subjt:  GIYESESAAK

A0A6J1H2C5 60S ribosomal protein L364.6e-29100Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
        GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK

A0A6J1H2C5 60S ribosomal protein L363.1e-0981.4Show/hide
Query:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        MAPKQ NTGLFVGLNKGHIVTKKELAPRPSDRKG    R L +
Subjt:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

A0A6J1H2C5 60S ribosomal protein L364.6e-29100Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
        GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK

A0A6J1HV53 60S ribosomal protein L361.2e-0879.07Show/hide
Query:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        MAPK  NTGLFVGLNKGHIVTKKELAPRPSDRKG    R L +
Subjt:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

A0A6J1HV53 60S ribosomal protein L364.6e-29100Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
        GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK

A0A6J1JC29 60S ribosomal protein L363.1e-0981.4Show/hide
Query:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        MAPKQ NTGLFVGLNKGHIVTKKELAPRPSDRKG    R L +
Subjt:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

A0A6J1K6N3 60S ribosomal protein L363.1e-0981.4Show/hide
Query:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        MAPKQ NTGLFVGLNKGHIVTKKELAPRPSDRKG    R L +
Subjt:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

A0A6J1K6N3 60S ribosomal protein L361.0e-2898.7Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
        GKSSKRVLFVR+LIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK

SwissProt top hitse value%identityAlignment
O80929 60S ribosomal protein L36-11.1e-2483.54Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGG---GEKK
        GK+SKR +F+R LIREVAG APYEKRITELLKVGKDKRALKVAKRKLGTHKRAK+KREEMSSVLRKMR+ GG    EKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGG---GEKK

Q66KU4 60S ribosomal protein L368.1e-1564.71Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMR
        G+ +K   FVR +IREV GFAPYE+R  ELLKV KDKRALK  K+++GTH RAK+KREE+S+VL  MR
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMR

Q9LRB8 60S ribosomal protein L362.1e-1870.27Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGE
        G  S+RV  VR+++REVAG+APYE+R+ ELLKVGKDKRALK+ KRKLGTH R KKKREEM+ VLRKM+A   GE
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGE

Q9LZ57 60S ribosomal protein L36-37.8e-2684.81Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK
        GK+SKR +F+R+LI+EVAG APYEKRITELLKVGKDKRALKVAKRKLGTHKRAK+KREEMSSVLRKMR+GG G  EKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK

Q9M352 60S ribosomal protein L36-21.6e-2686.08Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK
        GK+SKR +F+R+LI+EVAG APYEKRITELLKVGKDKRALKVAKRKLGTHKRAK+KREEMSSVLRKMR+GGGG  EKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK

Q9M352 60S ribosomal protein L36-21.4e-0660.47Show/hide
Query:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        M   Q  TGLFVGLNKGH+VT++ELAPRP  RKG    R + I
Subjt:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

Arabidopsis top hitse value%identityAlignment
AT2G37600.1 Ribosomal protein L36e family protein8.0e-2683.54Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGG---GEKK
        GK+SKR +F+R LIREVAG APYEKRITELLKVGKDKRALKVAKRKLGTHKRAK+KREEMSSVLRKMR+ GG    EKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGG---GEKK

AT3G53740.2 Ribosomal protein L36e family protein1.1e-2786.08Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK
        GK+SKR +F+R+LI+EVAG APYEKRITELLKVGKDKRALKVAKRKLGTHKRAK+KREEMSSVLRKMR+GGGG  EKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK

AT3G53740.2 Ribosomal protein L36e family protein9.8e-0860.47Show/hide
Query:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        M   Q  TGLFVGLNKGH+VT++ELAPRP  RKG    R + I
Subjt:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

AT3G53740.3 Ribosomal protein L36e family protein1.1e-2786.08Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK
        GK+SKR +F+R+LI+EVAG APYEKRITELLKVGKDKRALKVAKRKLGTHKRAK+KREEMSSVLRKMR+GGGG  EKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK

AT3G53740.3 Ribosomal protein L36e family protein9.8e-0860.47Show/hide
Query:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        M   Q  TGLFVGLNKGH+VT++ELAPRP  RKG    R + I
Subjt:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

AT3G53740.4 Ribosomal protein L36e family protein1.1e-2786.08Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK
        GK+SKR +F+R+LI+EVAG APYEKRITELLKVGKDKRALKVAKRKLGTHKRAK+KREEMSSVLRKMR+GGGG  EKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK

AT3G53740.4 Ribosomal protein L36e family protein9.8e-0860.47Show/hide
Query:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI
        M   Q  TGLFVGLNKGH+VT++ELAPRP  RKG    R + I
Subjt:  MAPKQSNTGLFVGLNKGHIVTKKELAPRPSDRKGSGCGRKLVI

AT5G02450.1 Ribosomal protein L36e family protein5.6e-2784.81Show/hide
Query:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK
        GK+SKR +F+R+LI+EVAG APYEKRITELLKVGKDKRALKVAKRKLGTHKRAK+KREEMSSVLRKMR+GG G  EKKK
Subjt:  GKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGG--EKKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCAATCGTCCTTGGTACACCCATCAACGAAGGTCCAGATGTTGAGCATGGGTACAATGAATGGTATGCTAGGATTAACAGACGATTCATCACACAAGCAGGAGCT
GCATATCATTTCGTGGAAAGAAAATGCTTGTGCCTGTCCCCGAGACGACGTGGAGCCAAAGGCATCGATCGGAGCACTTAGACACCGTCATTGGAGCCAAAAGTATCGAT
CAAGAAGTTGCAAAAATTGGGCAAACCTGAGATCAAGTGTCTCGATGCTATCCCGATTACAACCCTCGGACGGAAGGGCGATAATTATAAATACGATTCTTTTCAATCAA
GTTTTGTCCAGATCTCTGATTCTTGACGAGCCGCCGCCAGTTCACTCCTTCGGTGCTACAACTCATATCTCCATCACTCCTTCGGTGGACGGGTGGAAGATGATGATTGA
AGCAGGTACTTATGATAATCGGGCATATCTTACATATATTACCTACGCGAGTTCGACAAGGATTCTCATCGAGTTCGTGTCTTCCCAGTTGAATCTTGGGGGCGTTACGG
TTTACCTCCGATCATTTGGTTCTGGTCCATCCGAGAAGGCAATGGCTCCGAAGCAGTCGAATACTGGTCTCTTTGTAGGACTTAACAAAGGGCACATTGTTACAAAGAAG
GAGTTGGCCCCGCGCCCTTCAGATCGTAAAGGAAGTGGGTGTGGTAGGAAGTTGGTGATAGAAGAGAGAAGAGGGAAGAGAGTTCACAAAGTGGAGCTTGAGATCGGGGC
ATCTATATCGGTTCGTGATTGTTTAGAGGTAGCGGTGAAAACAGGGAACCCTCGTGGATTTTGGAGGCGGGGGAGGTTGGATTATGCGGTGTTATTTTTTCAGGTATTAG
AAAATGAGAGAAGGAGGTTTGGGCTTCTTTCTATGGATAACTTTAGGCGGAAAAGAATGGCGATTTATGTTCCAGAGAGGTTCAAAGGCAAGGGATGGGGATTGTTAGCA
GGGGAAATTTCAAATGTGTTATACGGATCCAATAAAATAGGGAAGTCGACAACGGGGCCCATTGGAAACTCTGATCGAAGGGAAGAAGTTAAGATCCTCCCGAATCCAAA
TGATCAGAAAGGGGAGTCGGGAACTACTGTTGTGGATGAAGTGTTCAACAAGTCTGAAGAGTTTGATTTTGCCTTCGAGTGTTCGGCGACGGTCATTATTAAAAGGTTTA
GTATCAAGGAACCGTGGTTCTCTATAAAGGATTCTATTCAGAGAGAAAGTATTATAGGGTGGATGTTCAAGTCGTTTTGTGCAAACCTTGCTGTCGGAATCTACGAGAGT
GAATCAGCAGCCAAGAAATTGGTTTCCGTGGTTAAGACCAAAGGAGAGTCAGGGTATTCAGTTCACGAATGGAACTGTAAGGTTCTGGCTGAGGGTCGCCGACAAGCGTT
TTCTGGTGGAAGGATTACGACCCTGGATATTCCACCATTCCTTCATACAAAAAGTTTGATCTCAGTAGTGGCGAATCTCTGTGGAGGAATTAAAGAAGACGGAGAATTGG
AAGGGGACGACGTTTCGATGAAGGAAGTCAGTTTCGAGGCCAAAGGAAATCTGAATGGTCTTATTCCAGTGGTTAGCTACCTCCGTCATAAGGGGTTCTTTCCGGTCAGA
TTGGTAGTCGGTCGTCGGCTGGAGAAGGAGGAGATCGAGGCAGATGGAGACAAAGTCTGTGAAGGAGGGACATCCGATCAAAACACAGCCCATGAGAAGATGGAAGGAGA
GGGAGGGGAAGGGCATAAAGAGGAGAACTCTGTGTCTGAAGGGTTCAGGGCTATTGAAGCAGCGGTTAATCACGCAAAGCAGTCGACTGAAGTCCATCGAAACAGCCAAT
CAACAGAGTCTCTTTCTCAGTCCATTATCAAGGCAAGGGGGGTTCTCTCATCCGTGGGCATGTCGGCCAGCCCCTGGGCAATTGGGTTGCCTAAAGGAAGTTGTAACTCC
AGGATTTTGCCTAGTCCTTTCGTTGTTAGCTCTCCTTCGGGTCTTGGGGGTACTCTGTCTCAACCGGTCAGTGTGTCGTGGGGGGTTAAGGGTTTGCCTAATTCCTCGAA
TGTGGGTGCCAAAGGATGTATTTTGGAGGGGGGAAGCTCGCCTTTACTTGATAGGGATGCAGGAGGAGAATTGTTAGCCTTTTCTAGTCCTATCTGTCCTTCAGGTTTGA
ATCCCAGTGGGGGATACTTGGGTACCTGTCCGAAAGGTTGGGTTGCCTTAGCTTCAGTTGGTGTCCACGCTACTCCTACTCCTGAATCCCAATCAGCTCTGTCGCCCAAT
GGTGGCGAGGGAGAGCCAGGTTTCAAGATGAAAGTTATTTCCTGGAATGTTAGAGGGCTTGGGGGTAGGGTTAAGCATAGTTTGGTGAAAGACGTCATCTCCCAAGAAAG
TCCAAAGAGTCTTTGGGGGGTTAGAAGAGTGGAGTGGGCTGGTCTCAAGTTTGAAGGAGCTTTCAGAGGCATCTTAGTTATGTGGGATAGCCGTAGGCTTTCGACGGAGG
AAGATCTGGACCATATTTTATGGAGTTGCGAGTTTGCTTGTTCGATATGGGGTTTGTTTCACAACGCTTTTGGGCTACAGGCGAGACCCTTTAGATACTACAGGGAGATG
ATCCAGGAGTTCCTCCTCCATCCGTCGTTTCGCGATAAGGGGAGGTTTTTATGGTTAGCTGGGAAAAGTAGCAAAAGAGTTCTCTTTGTGAGGAGTTTGATCCGGGAAGT
TGCTGGTTTTGCACCATATGAGAAGAGAATCACTGAGCTTCTTAAAGTTGGAAAGGACAAGAGAGCACTAAAAGTGGCTAAGAGAAAGTTGGGAACTCACAAGAGAGCCA
AGAAGAAGAGAGAGGAGATGTCCAGCGTTCTCCGCAAGATGAGAGCTGGTGGGGGCGGTGAGAAGAAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCAATCGTCCTTGGTACACCCATCAACGAAGGTCCAGATGTTGAGCATGGGTACAATGAATGGTATGCTAGGATTAACAGACGATTCATCACACAAGCAGGAGCT
GCATATCATTTCGTGGAAAGAAAATGCTTGTGCCTGTCCCCGAGACGACGTGGAGCCAAAGGCATCGATCGGAGCACTTAGACACCGTCATTGGAGCCAAAAGTATCGAT
CAAGAAGTTGCAAAAATTGGGCAAACCTGAGATCAAGTGTCTCGATGCTATCCCGATTACAACCCTCGGACGGAAGGGCGATAATTATAAATACGATTCTTTTCAATCAA
GTTTTGTCCAGATCTCTGATTCTTGACGAGCCGCCGCCAGTTCACTCCTTCGGTGCTACAACTCATATCTCCATCACTCCTTCGGTGGACGGGTGGAAGATGATGATTGA
AGCAGGTACTTATGATAATCGGGCATATCTTACATATATTACCTACGCGAGTTCGACAAGGATTCTCATCGAGTTCGTGTCTTCCCAGTTGAATCTTGGGGGCGTTACGG
TTTACCTCCGATCATTTGGTTCTGGTCCATCCGAGAAGGCAATGGCTCCGAAGCAGTCGAATACTGGTCTCTTTGTAGGACTTAACAAAGGGCACATTGTTACAAAGAAG
GAGTTGGCCCCGCGCCCTTCAGATCGTAAAGGAAGTGGGTGTGGTAGGAAGTTGGTGATAGAAGAGAGAAGAGGGAAGAGAGTTCACAAAGTGGAGCTTGAGATCGGGGC
ATCTATATCGGTTCGTGATTGTTTAGAGGTAGCGGTGAAAACAGGGAACCCTCGTGGATTTTGGAGGCGGGGGAGGTTGGATTATGCGGTGTTATTTTTTCAGGTATTAG
AAAATGAGAGAAGGAGGTTTGGGCTTCTTTCTATGGATAACTTTAGGCGGAAAAGAATGGCGATTTATGTTCCAGAGAGGTTCAAAGGCAAGGGATGGGGATTGTTAGCA
GGGGAAATTTCAAATGTGTTATACGGATCCAATAAAATAGGGAAGTCGACAACGGGGCCCATTGGAAACTCTGATCGAAGGGAAGAAGTTAAGATCCTCCCGAATCCAAA
TGATCAGAAAGGGGAGTCGGGAACTACTGTTGTGGATGAAGTGTTCAACAAGTCTGAAGAGTTTGATTTTGCCTTCGAGTGTTCGGCGACGGTCATTATTAAAAGGTTTA
GTATCAAGGAACCGTGGTTCTCTATAAAGGATTCTATTCAGAGAGAAAGTATTATAGGGTGGATGTTCAAGTCGTTTTGTGCAAACCTTGCTGTCGGAATCTACGAGAGT
GAATCAGCAGCCAAGAAATTGGTTTCCGTGGTTAAGACCAAAGGAGAGTCAGGGTATTCAGTTCACGAATGGAACTGTAAGGTTCTGGCTGAGGGTCGCCGACAAGCGTT
TTCTGGTGGAAGGATTACGACCCTGGATATTCCACCATTCCTTCATACAAAAAGTTTGATCTCAGTAGTGGCGAATCTCTGTGGAGGAATTAAAGAAGACGGAGAATTGG
AAGGGGACGACGTTTCGATGAAGGAAGTCAGTTTCGAGGCCAAAGGAAATCTGAATGGTCTTATTCCAGTGGTTAGCTACCTCCGTCATAAGGGGTTCTTTCCGGTCAGA
TTGGTAGTCGGTCGTCGGCTGGAGAAGGAGGAGATCGAGGCAGATGGAGACAAAGTCTGTGAAGGAGGGACATCCGATCAAAACACAGCCCATGAGAAGATGGAAGGAGA
GGGAGGGGAAGGGCATAAAGAGGAGAACTCTGTGTCTGAAGGGTTCAGGGCTATTGAAGCAGCGGTTAATCACGCAAAGCAGTCGACTGAAGTCCATCGAAACAGCCAAT
CAACAGAGTCTCTTTCTCAGTCCATTATCAAGGCAAGGGGGGTTCTCTCATCCGTGGGCATGTCGGCCAGCCCCTGGGCAATTGGGTTGCCTAAAGGAAGTTGTAACTCC
AGGATTTTGCCTAGTCCTTTCGTTGTTAGCTCTCCTTCGGGTCTTGGGGGTACTCTGTCTCAACCGGTCAGTGTGTCGTGGGGGGTTAAGGGTTTGCCTAATTCCTCGAA
TGTGGGTGCCAAAGGATGTATTTTGGAGGGGGGAAGCTCGCCTTTACTTGATAGGGATGCAGGAGGAGAATTGTTAGCCTTTTCTAGTCCTATCTGTCCTTCAGGTTTGA
ATCCCAGTGGGGGATACTTGGGTACCTGTCCGAAAGGTTGGGTTGCCTTAGCTTCAGTTGGTGTCCACGCTACTCCTACTCCTGAATCCCAATCAGCTCTGTCGCCCAAT
GGTGGCGAGGGAGAGCCAGGTTTCAAGATGAAAGTTATTTCCTGGAATGTTAGAGGGCTTGGGGGTAGGGTTAAGCATAGTTTGGTGAAAGACGTCATCTCCCAAGAAAG
TCCAAAGAGTCTTTGGGGGGTTAGAAGAGTGGAGTGGGCTGGTCTCAAGTTTGAAGGAGCTTTCAGAGGCATCTTAGTTATGTGGGATAGCCGTAGGCTTTCGACGGAGG
AAGATCTGGACCATATTTTATGGAGTTGCGAGTTTGCTTGTTCGATATGGGGTTTGTTTCACAACGCTTTTGGGCTACAGGCGAGACCCTTTAGATACTACAGGGAGATG
ATCCAGGAGTTCCTCCTCCATCCGTCGTTTCGCGATAAGGGGAGGTTTTTATGGTTAGCTGGGAAAAGTAGCAAAAGAGTTCTCTTTGTGAGGAGTTTGATCCGGGAAGT
TGCTGGTTTTGCACCATATGAGAAGAGAATCACTGAGCTTCTTAAAGTTGGAAAGGACAAGAGAGCACTAAAAGTGGCTAAGAGAAAGTTGGGAACTCACAAGAGAGCCA
AGAAGAAGAGAGAGGAGATGTCCAGCGTTCTCCGCAAGATGAGAGCTGGTGGGGGCGGTGAGAAGAAGAAATGA
Protein sequenceShow/hide protein sequence
MLQSSLVHPSTKVQMLSMGTMNGMLGLTDDSSHKQELHIISWKENACACPRDDVEPKASIGALRHRHWSQKYRSRSCKNWANLRSSVSMLSRLQPSDGRAIIINTILFNQ
VLSRSLILDEPPPVHSFGATTHISITPSVDGWKMMIEAGTYDNRAYLTYITYASSTRILIEFVSSQLNLGGVTVYLRSFGSGPSEKAMAPKQSNTGLFVGLNKGHIVTKK
ELAPRPSDRKGSGCGRKLVIEERRGKRVHKVELEIGASISVRDCLEVAVKTGNPRGFWRRGRLDYAVLFFQVLENERRRFGLLSMDNFRRKRMAIYVPERFKGKGWGLLA
GEISNVLYGSNKIGKSTTGPIGNSDRREEVKILPNPNDQKGESGTTVVDEVFNKSEEFDFAFECSATVIIKRFSIKEPWFSIKDSIQRESIIGWMFKSFCANLAVGIYES
ESAAKKLVSVVKTKGESGYSVHEWNCKVLAEGRRQAFSGGRITTLDIPPFLHTKSLISVVANLCGGIKEDGELEGDDVSMKEVSFEAKGNLNGLIPVVSYLRHKGFFPVR
LVVGRRLEKEEIEADGDKVCEGGTSDQNTAHEKMEGEGGEGHKEENSVSEGFRAIEAAVNHAKQSTEVHRNSQSTESLSQSIIKARGVLSSVGMSASPWAIGLPKGSCNS
RILPSPFVVSSPSGLGGTLSQPVSVSWGVKGLPNSSNVGAKGCILEGGSSPLLDRDAGGELLAFSSPICPSGLNPSGGYLGTCPKGWVALASVGVHATPTPESQSALSPN
GGEGEPGFKMKVISWNVRGLGGRVKHSLVKDVISQESPKSLWGVRRVEWAGLKFEGAFRGILVMWDSRRLSTEEDLDHILWSCEFACSIWGLFHNAFGLQARPFRYYREM
IQEFLLHPSFRDKGRFLWLAGKSSKRVLFVRSLIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMRAGGGGEKKK