; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023846 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023846
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationtig00000892:7493849..7500099
RNA-Seq ExpressionSgr023846
SyntenySgr023846
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148842.1 uncharacterized protein LOC101210790 isoform X1 [Cucumis sativus]5.3e-8086.96Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        M+TL E+EEEY+WREVRLPSLIPVVPEPELERETAERRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVS+             VKNELVYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
         SQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+ FHNCKSAPV+IVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

XP_022146303.1 uncharacterized protein LOC111015542 isoform X1 [Momordica charantia]1.8e-8389.13Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDIL+AVDHGPNSKHA DWAL+HFCRLADTIHL+HAVS+             VKNELVYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
        ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKV+CKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEY FHNCKSAPVIIVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

XP_022943557.1 uncharacterized protein LOC111448293 [Cucurbita moschata]4.5e-7986.41Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        M+TL E+EEEY+WREVRLPSLIPVVPEPELERETA RRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVS+             VKNELVYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
         SQGLMEKLAVEAFEVAMVRTVARIVQGDAGK+ICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+ FHNCKSAPVIIVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

XP_031737219.1 uncharacterized protein LOC101210790 isoform X2 [Cucumis sativus]1.5e-7986.89Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        M+TL E+EEEY+WREVRLPSLIPVVPEPELERETAERRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVS+             VKNELVYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGK
         SQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+ FHNCKSAPV+IVPGK
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGK

XP_038901616.1 uncharacterized protein LOC120088411 [Benincasa hispida]4.1e-8087.5Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        M+TL E+EEEY+WREVRLPSLIPVVPEPELERETAERRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVS+             VKNELVYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
         SQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+ FHNCKSAPVIIVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

TrEMBL top hitse value%identityAlignment
A0A0A0LL61 Usp domain-containing protein2.6e-8086.96Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        M+TL E+EEEY+WREVRLPSLIPVVPEPELERETAERRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVS+             VKNELVYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
         SQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+ FHNCKSAPV+IVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

A0A1S3CET4 uncharacterized protein LOC1034996762.6e-8086.96Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        M+TL E+EEEY+WREVRLPSLIPVVPEPELERETAERRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVS+             VKNELVYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
         SQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+ FHNCKSAPV+IVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

A0A5D3DYQ0 Universal stress protein PHOS322.6e-8086.96Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        M+TL E+EEEY+WREVRLPSLIPVVPEPELERETAERRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVS+             VKNELVYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
         SQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+ FHNCKSAPV+IVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

A0A6J1CY89 uncharacterized protein LOC111015542 isoform X18.6e-8489.13Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDIL+AVDHGPNSKHA DWAL+HFCRLADTIHL+HAVS+             VKNELVYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
        ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKV+CKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEY FHNCKSAPVIIVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

A0A6J1FTC2 uncharacterized protein LOC1114482932.2e-7986.41Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        M+TL E+EEEY+WREVRLPSLIPVVPEPELERETA RRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVS+             VKNELVYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
         SQGLMEKLAVEAFEVAMVRTVARIVQGDAGK+ICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+ FHNCKSAPVIIVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

SwissProt top hitse value%identityAlignment
P87132 Uncharacterized protein C167.051.2e-0524.18Show/hide
Query:  ELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKN-ELVYEASQGLMEKLAVEAFEVAMVRTVARIVQ
        + +   +  +R     L +D    S HA +WA+    R  DT+ ++  +    P+ +   +    +  E + + ++ +++ L+    EV +   +  I  
Subjt:  ELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKN-ELVYEASQGLMEKLAVEAFEVAMVRTVARIVQ

Query:  GDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKEEAKNLKQKEKKSEAGNYSKRSRVDQ
          A  +I +  + ++P+ VVMG+RGRS ++ VL GS S Y  +  KS+  ++V  K+  KN ++   +S   N    + VD+
Subjt:  GDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKEEAKNLKQKEKKSEAGNYSKRSRVDQ

Q8L4N1 Universal stress protein PHOS342.2e-0426.25Show/hide
Query:  RDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSS--------------YKPTPQIFTE--SHYVKNELVYEA-SQGLMEKLAVEAFEVAMVRTVA
        R I +AVD    S  A  WA+ H+ R  D + ++H   +                P P   T+  +    ++  ++A +   +  LA    E      + 
Subjt:  RDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSS--------------YKPTPQIFTE--SHYVKNELVYEA-SQGLMEKLAVEAFEVAMVRTVA

Query:  RIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQ---GSVSEYCFHNCKSAPVIIV
         +   D  + +C E E+L  +AV+MG+RG    +       GSVS+YC H+C   PV++V
Subjt:  RIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQ---GSVSEYCFHNCKSAPVIIV

Q8VYN9 Universal stress protein PHOS322.6e-0528.12Show/hide
Query:  RDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESH---YVKNELVYEASQGLMEKLAVEAFEVAMVRTVAR-------------
        R I +AVD    S  A  WA+ H+ R  D + L+H      PT  +F        +K ++    +Q    +   +AF    V  +A+             
Subjt:  RDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESH---YVKNELVYEASQGLMEKLAVEAFEVAMVRTVAR-------------

Query:  IVQGDAGKVICKEAEKLKPAAVVMGTRG----RSLIQSVLQGSVSEYCFHNCKSAPVIIV
        +   D  + +C E E+L  +AV+MG+RG    +        GSVS+YC H+C   PV++V
Subjt:  IVQGDAGKVICKEAEKLKPAAVVMGTRG----RSLIQSVLQGSVSEYCFHNCKSAPVIIV

Arabidopsis top hitse value%identityAlignment
AT2G21620.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.2e-7777.72Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        ME L ED EEYS+REV LPSLIPVVPEPELERE+ ERRRGRD+++AVDHGPNSKHAFDWAL+HFCRLADT+HL+HAVSS             VKN++VYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
         SQ LMEKLAVEA++VAMV++VAR+V+GDAGKVICKEAEK+KPAAV++GTRGRSL++SVLQGSVSEYCFHNCKSAPVIIVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

AT2G21620.2 Adenine nucleotide alpha hydrolases-like superfamily protein2.6e-7777.72Show/hide
Query:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE
        ME L ED EEYS+REV LPSLIPVVPEPELERE+ ERRRGRD+++AVDHGPNSKHAFDWAL+HFCRLADT+HL+HAVSS             VKN++VYE
Subjt:  METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYE

Query:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE
         SQ LMEKLAVEA++VAMV++VAR+V+GDAGKVICKEAEK+KPAAV++GTRGRSL++SVLQGSVSEYCFHNCKSAPVIIVPGKE
Subjt:  ASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKE

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.2e-1025.4Show/hide
Query:  SLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADT-------------IHLIHAVSSYKPTP------QIFTESHYVKN--ELV
        S +   PE   E E       R +++A+D   +S +A  W + HF  L  T             IH+    + +   P       ++  S  +++  +  
Subjt:  SLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADT-------------IHLIHAVSSYKPTP------QIFTESHYVKN--ELV

Query:  YEASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKEEAK
         E S  L+ + A++      +RT   +++G+A ++IC+  EK+    +V+G+RG   I+    GSVS+YC H+     +I+ P KE  K
Subjt:  YEASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKEEAK

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein1.2e-1025.26Show/hide
Query:  SLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADT----------IHLIHAVSSYK----------PTPQIFTESHYVKN--EL
        S +   PE   E E       R +++A+D   +S +A  W + HF  L  T          + +IH  S +               ++  S  +++  + 
Subjt:  SLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADT----------IHLIHAVSSYK----------PTPQIFTESHYVKN--EL

Query:  VYEASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKEEAK
          E S  L+ + A++      +RT   +++G+A ++IC+  EK+    +V+G+RG   I+    GSVS+YC H+     +I+ P KE  K
Subjt:  VYEASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKEEAK

AT3G11930.4 Adenine nucleotide alpha hydrolases-like superfamily protein3.3e-1125.65Show/hide
Query:  SLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADT----------IHLIHAVSSYK-----------PTPQIFTESHYVKN--E
        S +   PE   E E       R +++A+D   +S +A  W + HF  L  T          + +IH  S +             T  ++  S  +++  +
Subjt:  SLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADT----------IHLIHAVSSYK-----------PTPQIFTESHYVKN--E

Query:  LVYEASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKEEAK
           E S  L+ + A++      +RT   +++G+A ++IC+  EK+    +V+G+RG   I+    GSVS+YC H+     +I+ P KE  K
Subjt:  LVYEASQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKEEAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACATTGATGGAGGACGAGGAAGAGTACAGCTGGAGAGAAGTGCGGCTGCCGTCTCTGATCCCGGTGGTGCCGGAGCCGGAGCTGGAGAGGGAAACGGCGGAGAG
ACGGCGAGGCAGAGACATCCTCCTCGCCGTCGACCATGGACCCAACAGCAAGCACGCCTTCGATTGGGCTCTCATCCATTTCTGCCGCCTCGCCGACACCATCCATCTCA
TCCACGCCGTTTCCAGTTACAAACCAACCCCTCAAATCTTCACAGAATCACACTATGTGAAGAATGAACTGGTTTACGAGGCGAGCCAGGGGCTGATGGAGAAGCTTGCG
GTGGAGGCCTTTGAGGTGGCCATGGTGAGGACTGTGGCTCGAATTGTGCAGGGAGATGCAGGGAAGGTTATTTGCAAGGAAGCAGAGAAGTTGAAGCCTGCTGCTGTTGT
CATGGGCACCAGGGGAAGAAGCTTGATACAGAGTGTTCTGCAGGGAAGTGTGAGTGAGTATTGCTTCCACAACTGCAAATCAGCACCTGTTATAATAGTCCCTGGGAAAG
AAGAAGCGAAGAATTTGAAGCAGAAAGAAAAGAAATCGGAAGCAGGAAATTACAGTAAAAGAAGCCGAGTTGATCAATGGAGTTTCAGGGACTGTTACGGAGCTCGTGCC
GATTCCGAGTTGATCAAAGTAGGAAGAAGAATTAGCCGTTTCCGTCGAGTCTGGGATTCAGAGCAGCCAACTTCATTGATAAGAACTGCTTCATATGGAAGAGAAAACTG
TGGCTGTCGGTGGCTTGGCCTCGACGTGCTCGGACGTGAATGTAACCTGATTTTTGAACTTCTTGATCAGAAGCTTTCGAGTTCTGCTTCGACTTCGCAGCTTCTTCAGA
TTGAACATTTCCCACCTTCGTTTTCATCTTTCTCTTCTTGGGATTATTCTCTCCGGCAGCCGGAGAGGTGGTGTGAACCTGAACCGTCAGCTCCGGCGGCGGCGAAGATG
TCGCAGGGCGTCCTCCAGACGCCACCAGATCCTGAAAATCCCGAACCCGACTGGCCTGAGTTAACACAGAAGACAAGCTGCCGCAGTCAGCGCCGTCGTATTGCTGGTGC
CGCCGCCATGTCATCTCCGTCTTCAAGTACTGCAACATCTCCGGCAACCCCCAAAAAGTTGGTTTCTGCTGAAGAAACGCAGGCTGCAGACTTTAAATCTGATGGAAGGA
TATTGAAAACAAACAATTTGACAATGGGTAAATCGGAGGAGACGTCAGTGAATGGCTCTCATTCTGCTGATTCATCTCAGTTTTTTTTTTTTTTCTTTTTCTTTTGGGTG
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGACATTGATGGAGGACGAGGAAGAGTACAGCTGGAGAGAAGTGCGGCTGCCGTCTCTGATCCCGGTGGTGCCGGAGCCGGAGCTGGAGAGGGAAACGGCGGAGAG
ACGGCGAGGCAGAGACATCCTCCTCGCCGTCGACCATGGACCCAACAGCAAGCACGCCTTCGATTGGGCTCTCATCCATTTCTGCCGCCTCGCCGACACCATCCATCTCA
TCCACGCCGTTTCCAGTTACAAACCAACCCCTCAAATCTTCACAGAATCACACTATGTGAAGAATGAACTGGTTTACGAGGCGAGCCAGGGGCTGATGGAGAAGCTTGCG
GTGGAGGCCTTTGAGGTGGCCATGGTGAGGACTGTGGCTCGAATTGTGCAGGGAGATGCAGGGAAGGTTATTTGCAAGGAAGCAGAGAAGTTGAAGCCTGCTGCTGTTGT
CATGGGCACCAGGGGAAGAAGCTTGATACAGAGTGTTCTGCAGGGAAGTGTGAGTGAGTATTGCTTCCACAACTGCAAATCAGCACCTGTTATAATAGTCCCTGGGAAAG
AAGAAGCGAAGAATTTGAAGCAGAAAGAAAAGAAATCGGAAGCAGGAAATTACAGTAAAAGAAGCCGAGTTGATCAATGGAGTTTCAGGGACTGTTACGGAGCTCGTGCC
GATTCCGAGTTGATCAAAGTAGGAAGAAGAATTAGCCGTTTCCGTCGAGTCTGGGATTCAGAGCAGCCAACTTCATTGATAAGAACTGCTTCATATGGAAGAGAAAACTG
TGGCTGTCGGTGGCTTGGCCTCGACGTGCTCGGACGTGAATGTAACCTGATTTTTGAACTTCTTGATCAGAAGCTTTCGAGTTCTGCTTCGACTTCGCAGCTTCTTCAGA
TTGAACATTTCCCACCTTCGTTTTCATCTTTCTCTTCTTGGGATTATTCTCTCCGGCAGCCGGAGAGGTGGTGTGAACCTGAACCGTCAGCTCCGGCGGCGGCGAAGATG
TCGCAGGGCGTCCTCCAGACGCCACCAGATCCTGAAAATCCCGAACCCGACTGGCCTGAGTTAACACAGAAGACAAGCTGCCGCAGTCAGCGCCGTCGTATTGCTGGTGC
CGCCGCCATGTCATCTCCGTCTTCAAGTACTGCAACATCTCCGGCAACCCCCAAAAAGTTGGTTTCTGCTGAAGAAACGCAGGCTGCAGACTTTAAATCTGATGGAAGGA
TATTGAAAACAAACAATTTGACAATGGGTAAATCGGAGGAGACGTCAGTGAATGGCTCTCATTCTGCTGATTCATCTCAGTTTTTTTTTTTTTTCTTTTTCTTTTGGGTG
TAA
Protein sequenceShow/hide protein sequence
METLMEDEEEYSWREVRLPSLIPVVPEPELERETAERRRGRDILLAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSSYKPTPQIFTESHYVKNELVYEASQGLMEKLA
VEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYCFHNCKSAPVIIVPGKEEAKNLKQKEKKSEAGNYSKRSRVDQWSFRDCYGARA
DSELIKVGRRISRFRRVWDSEQPTSLIRTASYGRENCGCRWLGLDVLGRECNLIFELLDQKLSSSASTSQLLQIEHFPPSFSSFSSWDYSLRQPERWCEPEPSAPAAAKM
SQGVLQTPPDPENPEPDWPELTQKTSCRSQRRRIAGAAAMSSPSSSTATSPATPKKLVSAEETQAADFKSDGRILKTNNLTMGKSEETSVNGSHSADSSQFFFFFFFFWV