CuGenDBv2

Sgr018772 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018772
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: WPP domain-associated protein
Genome location: tig00153210:508265..509368
RNA-Seq Expression: Sgr018772
Synteny: Sgr018772
Gene Ontology terms: NA
InterPro domains: IPR037490 - WPP domain-associated protein
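
The Genome location field above follows a `contig:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span length computed. This assumes the coordinates are 1-based and inclusive, which the record itself does not state. The names `Location` and `parse_location` are hypothetical helpers for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval on a single contig (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "tig00153210:508265..509368".
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["contig"], start, end)


loc = parse_location("tig00153210:508265..509368")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene is 1104 bp.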


Homology

GenBank top hits
KAG6601161.1 hypothetical protein SDJN03_06394, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 5.7e-128, identity: 66.94%)
Query:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND
        +LNA++NKILGQN D +EEDIP +  EQ+FTE  RQKSDV TLA++W K+H+LR++E+ GIQNQICML  +RED +FQNIMMEEI+ TLF+G+ EKF ND
Subjt:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND

Query:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI
        LSRWELEI ISDG CR FIR+MFNQ +ETMESYKI+A IKDDIYH  F EAMKGY         R QD   VKDENLYLEGLTSDN+PS+CL C+ +QEI
Subjt:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI

Query:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY
        YGIP  VML+EW ++I EHT+E LLREE+SWF+  E IKSI Y+AN CP TKFFNDFLP  QITI+EDVC VF REMV EWE+ IE  NLETLIREEIY+
Subjt:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY

Query:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN
        T+  EA+ EVC+R     VP QDSDVTE    RK LGEG + G  SL QKL+L SEGIEV +N
Subjt:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN

KAG7031963.1 WPP domain-associated protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 5.7e-128, identity: 66.94%)
Query:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND
        +LNA++NKILGQN D +EEDIP +  EQ+FTE  RQKSDV TLA++W K+H+LR++E+ GIQNQICML  +RED +FQNIMMEEI+ TLF+G+ EKF ND
Subjt:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND

Query:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI
        LSRWELEI ISDG CR FIR+MFNQ +ETMESYKI+A IKDDIYH  F EAMKGY         R QD   VKDENLYLEGLTSDN+PS+CL C+ +QEI
Subjt:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI

Query:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY
        YGIP  VML+EW ++I EHT+E LLREE+SWF+  E IKSI Y+AN CP TKFFNDFLP  QITI+EDVC VF REMV EWE+ IE  NLETLIREEIY+
Subjt:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY

Query:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN
        T+  EA+ EVC+R     VP QDSDVTE    RK LGEG + G  SL QKL+L SEGIEV +N
Subjt:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN

XP_022956940.1 uncharacterized protein LOC111458475 [Cucurbita moschata] (e-value: 8.3e-127, identity: 66.67%)
Query:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND
        +LNA++NKILGQN D +EEDIP +  EQ+  E  RQKSDV TLA++W K+H+LR++E+ GIQNQICML  +RED +FQNI+MEEIY TLF+GL EKF ND
Subjt:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND

Query:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI
        LSRWELE  ISDG CR FIR+MFNQ +ETMESYKI+A IKDDIYH  F EAMKGY         R QD   VKDENLYLEGLTSDN+PS+CL C+ +QEI
Subjt:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI

Query:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY
        YGIP  VML+EW ++I EHT+E LLREE+SWF+  ETIKSI Y+AN CP TKFFNDFLP  QITI+EDVC VF REMV EWE+ IE  NLETLIREEIY+
Subjt:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY

Query:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN
        T+  EA+ EVC+R     VP QDSDVTE    RK LGEG + G  SL QKL+L SEGIEV +N
Subjt:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN

XP_022985013.1 uncharacterized protein LOC111483104 [Cucurbita maxima] (e-value: 7.7e-125, identity: 65.84%)
Query:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND
        +LNA++NKILGQN D +EEDIP +  +Q+FTE  RQKSDV TLA++W K+H+LR++E+ GIQNQICM   +RED +FQNIM EEIY TLF+GL EKF ND
Subjt:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND

Query:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI
        LSRWELEI ISDG CR FIR+MF+Q +ETMESY I+A IKDDIYH  F EAMKGY         R QD   VKDENLYLEGLTSDN+PS+CL  + RQEI
Subjt:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI

Query:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY
        YGIP  VMLKEW ++I EHT+E LLREE+SWF+  ETIKSI Y+ N CP TKFFNDFLP  QITI+EDVC +F REMV EWE+ IE  NLETLIREEIY+
Subjt:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY

Query:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN
        T+  EA+ EVC+R     VP QDSDVTE    RK LGEG + G  S  QKL+L SEGIEV +N
Subjt:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN

XP_023542201.1 uncharacterized protein LOC111802165 [Cucurbita pepo subsp. pepo] (e-value: 3.7e-127, identity: 67.22%)
Query:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND
        +LNA++NKILGQN D +EEDIP +  EQ+FTE  RQKSDV TLA++W K+H+LR++E+ GIQNQICML  +RED +FQNIMMEEIY TLF+G+ EKF ND
Subjt:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND

Query:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI
        LSR ELE+ ISDG CR FIR+MFNQ +ETM SYKI+A IKDDIYH  F EAMKGY         R QD   VKDENLYLEGLTSDN+PS+CL C+ RQEI
Subjt:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI

Query:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY
        YGIP  VMLKEW K+I EHT+E LLREE+SWF+  ETIKSI Y+AN CP TKFFNDFLP  QITI+EDVC VF REMV EWE+ IE  NLETLIREEIY+
Subjt:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY

Query:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN
        T+  EA+ EVC+R     VP QDSDVTE    RK LGEG + G  S  QKL+L SEGIEV +N
Subjt:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN

TrEMBL top hits
A0A0A0KWT8 Uncharacterized protein (e-value: 1.1e-60, identity: 64.06%)
Query:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND
        +LNA VNKI+ QN D NEEDIP +K EQ+F E  RQKSDVDTLA+VW K+H+L+D+E+ GIQNQIC LRQERE++EFQNIM EE YI LF+GL EKF +D
Subjt:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND

Query:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCL
        LS WELEI ISDG CRD IRNMFNQ +ETM+S  I+A IKDDIYH VF E M+ YCSI DLGL R Q+  I K   L L  +  +   S+ L
Subjt:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCL

A0A5D3CG51 WPP domain-associated protein isoform X2 (e-value: 5.7e-57, identity: 61.46%)
Query:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND
        +LNA VNK + QN D +EEDIPL+K EQ+F E  +QKSDVDTLA+VW K+H+L+D+E+ GIQNQIC LRQERED+EFQNIM EE YITL +GL EKF +D
Subjt:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND

Query:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCL
        LS WELEI ISDG  RD IR+MFNQ +ETM+S   +A IKDDIYH VF E M+ YCSI D GL R Q+  I K   L L  +  +   S+ L
Subjt:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCL

A0A6J1CF63 uncharacterized protein LOC111010182 (e-value: 4.3e-121, identity: 47.69%)
Query:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIG-IQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYN
        +LNAKVNKI GQ+GDVNEEDIPL++ EQ+FTET+RQKSDVDTL +VW K+HKL+D+E  G I+NQI ML QERE+KEFQNIMMEEIYIT+FKGLIE+F N
Subjt:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIG-IQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYN

Query:  DLSRWELEIQISDGTCRDFIRNMFNQQ-------------------------------------------------------------------------
        +L  WELEIQISDG CRDFIRNMFNQQ                                                                         
Subjt:  DLSRWELEIQISDGTCRDFIRNMFNQQ-------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------NETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEIY
                                 NETMESYKI+AH+KDDIY+ V NEAMKGYCS YDL +AR  +   VKDE+LYLEGLTSDND SQC  C+IR EIY
Subjt:  -------------------------NETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEIY

Query:  GIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYYT
        GIP AVML EWQKSIGEHTTESLL+EEVSWF+FGETIKSITY+AN+C          PDS+ITIEEDVC VF+REMVREWEEKIE CNLE  IREEI Y 
Subjt:  GIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYYT

Query:  VFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN
        V I+AEREV NRY    VPIQDSD  EKPP RKR  +G  + +ESL+QKL+L SEGI+V++N
Subjt:  VFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN

A0A6J1GZ55 uncharacterized protein LOC111458475 (e-value: 4.0e-127, identity: 66.67%)
Query:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND
        +LNA++NKILGQN D +EEDIP +  EQ+  E  RQKSDV TLA++W K+H+LR++E+ GIQNQICML  +RED +FQNI+MEEIY TLF+GL EKF ND
Subjt:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND

Query:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI
        LSRWELE  ISDG CR FIR+MFNQ +ETMESYKI+A IKDDIYH  F EAMKGY         R QD   VKDENLYLEGLTSDN+PS+CL C+ +QEI
Subjt:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI

Query:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY
        YGIP  VML+EW ++I EHT+E LLREE+SWF+  ETIKSI Y+AN CP TKFFNDFLP  QITI+EDVC VF REMV EWE+ IE  NLETLIREEIY+
Subjt:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY

Query:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN
        T+  EA+ EVC+R     VP QDSDVTE    RK LGEG + G  SL QKL+L SEGIEV +N
Subjt:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN

A0A6J1JCB6 uncharacterized protein LOC111483104 (e-value: 3.7e-125, identity: 65.84%)
Query:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND
        +LNA++NKILGQN D +EEDIP +  +Q+FTE  RQKSDV TLA++W K+H+LR++E+ GIQNQICM   +RED +FQNIM EEIY TLF+GL EKF ND
Subjt:  MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYND

Query:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI
        LSRWELEI ISDG CR FIR+MF+Q +ETMESY I+A IKDDIYH  F EAMKGY         R QD   VKDENLYLEGLTSDN+PS+CL  + RQEI
Subjt:  LSRWELEIQISDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEI

Query:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY
        YGIP  VMLKEW ++I EHT+E LLREE+SWF+  ETIKSI Y+ N CP TKFFNDFLP  QITI+EDVC +F REMV EWE+ IE  NLETLIREEIY+
Subjt:  YGIPSAVMLKEWQKSIGEHTTESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYY

Query:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN
        T+  EA+ EVC+R     VP QDSDVTE    RK LGEG + G  S  QKL+L SEGIEV +N
Subjt:  TVFIEAEREVCNRY----VPIQDSDVTEKPPPRKRLGEGIDTGMESLIQKLNLRSEGIEVEKN

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTTGAATGCTAAGGTTAACAAAATTTTAGGCCAAAATGGAGATGTTAATGAAGAAGACATTCCTCTACAGAAATGGGAGCAAGTATTTACAGAAACTGAGAGACAGAA
ATCAGATGTCGATACTTTGGCAGAAGTCTGGGCCAAGATACATAAACTGCGAGATAAAGAAGACATAGGAATACAAAATCAAATATGCATGCTAAGGCAAGAAAGAGAGG
ACAAAGAATTTCAAAACATAATGATGGAAGAAATTTACATAACTTTATTCAAAGGCTTGATAGAAAAGTTTTATAATGATTTGAGTCGTTGGGAACTGGAGATCCAGATT
TCAGATGGTACATGCAGAGACTTCATTAGGAATATGTTCAATCAGCAGAATGAGACCATGGAAAGTTACAAAATTGATGCCCACATAAAAGATGATATATATCATGCTGT
CTTCAACGAGGCAATGAAAGGTTATTGCTCCATATATGACTTGGGATTAGCCAGATCACAGGATGTGAACATTGTGAAGGACGAAAACCTATATTTGGAAGGTTTGACAT
CTGACAATGACCCTTCTCAATGTTTAGGATGCAAAATAAGGCAGGAAATTTATGGAATCCCTTCTGCAGTAATGCTAAAGGAATGGCAGAAAAGCATAGGAGAACATACA
ACTGAAAGCCTTCTTAGAGAAGAGGTATCTTGGTTCATCTTTGGTGAGACAATCAAAAGTATCACCTACGAAGCCAACCGTTGTCCAGATACCAAATTCTTCAACGATTT
TCTTCCAGATTCTCAAATTACAATTGAAGAAGATGTCTGCTTAGTTTTCTTTAGGGAAATGGTTAGGGAATGGGAGGAGAAGATAGAGACGTGTAACTTGGAAACTTTAA
TTAGGGAAGAAATTTATTACACTGTTTTCATTGAGGCAGAAAGAGAAGTCTGTAACAGATATGTCCCGATTCAGGACAGTGACGTGACAGAAAAACCTCCACCTAGGAAA
AGATTAGGTGAAGGCATAGATACTGGCATGGAAAGTTTGATTCAAAAACTAAATTTGCGTTCAGAAGGCATTGAAGTAGAGAAAAATTGGAGCTCAGTGCAAGTTTTGAG
ATAA
mRNA sequence
ATGTTGAATGCTAAGGTTAACAAAATTTTAGGCCAAAATGGAGATGTTAATGAAGAAGACATTCCTCTACAGAAATGGGAGCAAGTATTTACAGAAACTGAGAGACAGAA
ATCAGATGTCGATACTTTGGCAGAAGTCTGGGCCAAGATACATAAACTGCGAGATAAAGAAGACATAGGAATACAAAATCAAATATGCATGCTAAGGCAAGAAAGAGAGG
ACAAAGAATTTCAAAACATAATGATGGAAGAAATTTACATAACTTTATTCAAAGGCTTGATAGAAAAGTTTTATAATGATTTGAGTCGTTGGGAACTGGAGATCCAGATT
TCAGATGGTACATGCAGAGACTTCATTAGGAATATGTTCAATCAGCAGAATGAGACCATGGAAAGTTACAAAATTGATGCCCACATAAAAGATGATATATATCATGCTGT
CTTCAACGAGGCAATGAAAGGTTATTGCTCCATATATGACTTGGGATTAGCCAGATCACAGGATGTGAACATTGTGAAGGACGAAAACCTATATTTGGAAGGTTTGACAT
CTGACAATGACCCTTCTCAATGTTTAGGATGCAAAATAAGGCAGGAAATTTATGGAATCCCTTCTGCAGTAATGCTAAAGGAATGGCAGAAAAGCATAGGAGAACATACA
ACTGAAAGCCTTCTTAGAGAAGAGGTATCTTGGTTCATCTTTGGTGAGACAATCAAAAGTATCACCTACGAAGCCAACCGTTGTCCAGATACCAAATTCTTCAACGATTT
TCTTCCAGATTCTCAAATTACAATTGAAGAAGATGTCTGCTTAGTTTTCTTTAGGGAAATGGTTAGGGAATGGGAGGAGAAGATAGAGACGTGTAACTTGGAAACTTTAA
TTAGGGAAGAAATTTATTACACTGTTTTCATTGAGGCAGAAAGAGAAGTCTGTAACAGATATGTCCCGATTCAGGACAGTGACGTGACAGAAAAACCTCCACCTAGGAAA
AGATTAGGTGAAGGCATAGATACTGGCATGGAAAGTTTGATTCAAAAACTAAATTTGCGTTCAGAAGGCATTGAAGTAGAGAAAAATTGGAGCTCAGTGCAAGTTTTGAG
ATAA
Protein sequence
MLNAKVNKILGQNGDVNEEDIPLQKWEQVFTETERQKSDVDTLAEVWAKIHKLRDKEDIGIQNQICMLRQEREDKEFQNIMMEEIYITLFKGLIEKFYNDLSRWELEIQI
SDGTCRDFIRNMFNQQNETMESYKIDAHIKDDIYHAVFNEAMKGYCSIYDLGLARSQDVNIVKDENLYLEGLTSDNDPSQCLGCKIRQEIYGIPSAVMLKEWQKSIGEHT
TESLLREEVSWFIFGETIKSITYEANRCPDTKFFNDFLPDSQITIEEDVCLVFFREMVREWEEKIETCNLETLIREEIYYTVFIEAEREVCNRYVPIQDSDVTEKPPPRK
RLGEGIDTGMESLIQKLNLRSEGIEVEKNWSSVQVLR
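
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code. The record does not say which translation table or tool the database used, and the `translate` function is a hypothetical helper written for illustration.

```python
# Standard genetic code, with codons ordered by base in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence can be
    pasted as-is. Codons with ambiguous bases are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends the translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGTTGAATGCTAAGGTTAAC"))
```

For this record, the first seven codons (ATG TTG AAT GCT AAG GTT AAC) translate to MLNAKVN, which matches the start of the listed protein sequence. The final codons (TTG AGA TAA) give LR followed by a stop, which matches its end.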