; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023550 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023550
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionX8 domain-containing protein
Genome locationtig00000892:4395518..4399378
RNA-Seq ExpressionSgr023550
SyntenySgr023550
Gene Ontology termsGO:0046658 - anchored component of plasma membrane (cellular component)
InterPro domainsIPR012946 - X8 domain
IPR044788 - Carbohydrate-binding X8 domain-containing protein, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456350.1 PREDICTED: PLASMODESMATA CALLOSE-BINDING PROTEIN 5-like [Cucumis melo]1.8e-4765.08Show/hide
Query:  TNKPDPTQKPG-QFRAASR-TSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMG
        +++ DP QK G QFRAA R +ST+QKDITTPITTVPTINIPT+PI +    NP +    +TTPSF+P TT  GGSSWCIAS SASQ ALQLALDYACGMG
Subjt:  TNKPDPTQKPG-QFRAASR-TSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMG

Query:  GADCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS
        GADC++IQ+                       + P  NSCNFGGTAVITSTNPSSGTCEYPSTSTSS++LNTTNSSGSTVFGAVP+  S
Subjt:  GADCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS

XP_011656991.1 PLASMODESMATA CALLOSE-BINDING PROTEIN 5 isoform X1 [Cucumis sativus]7.4e-4966.14Show/hide
Query:  TNKPDPTQKPG-QFRAASR-TSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMG
        +++ DPTQK G QFRAASR +STTQKDITTPITTVPTINIPT+PI +    NP +    +TTPSF+P+TT  GGSSWCIAS SASQ ALQLALDYACG+G
Subjt:  TNKPDPTQKPG-QFRAASR-TSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMG

Query:  GADCAAIQ-----------------------SEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS
        GADC++IQ                        + P  NSCNFGGTAVITSTNPS+GTCEYPSTSTSS+VLNTTNSSGSTVFGAVP+  S
Subjt:  GADCAAIQ-----------------------SEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS

XP_022943478.1 major pollen allergen Ole e 10-like [Cucurbita moschata]5.3e-4765.78Show/hide
Query:  TNKPDPTQKPGQFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGA
        +++ DPTQK G++RAAS  ST QKDITTPITTVPTINIPT   TS    NP S   T+TTPSF+P TTA GGSSWCIASL ASQ  LQLALDYACGMGGA
Subjt:  TNKPDPTQKPGQFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGA

Query:  DCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS
        DC+AIQ+                       + P  NSCNFGGTAVITSTNPSSG+CEYPSTSTSS++LNTTNSSGSTVFGA P+  S
Subjt:  DCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS

XP_022986028.1 major pollen allergen Ole e 10-like [Cucurbita maxima]3.1e-4766.31Show/hide
Query:  TNKPDPTQKPGQFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGA
        +++ DPTQK G++RAAS  ST QKDITTPITTVPTINIPT   TS    NP S   T+TTPSF+P TT  GGSSWCIA+L ASQ ALQLALDYACGMGGA
Subjt:  TNKPDPTQKPGQFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGA

Query:  DCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS
        DC+AIQ+                       + P  NSCNFGGTAVITSTNPSSG+CEYPSTSTSS+VLNTTNSSGSTVFGAVP+  S
Subjt:  DCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS

XP_038900831.1 PLASMODESMATA CALLOSE-BINDING PROTEIN 3-like isoform X1 [Benincasa hispida]5.3e-4755.74Show/hide
Query:  VHLHHSNSTASYSLSLPRISLCLFLSLSFDKLFNISMMRSSASIRQTNKPDPTQKPG-QFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTS
        +H   +N + +   +  RIS+     L  D L  + +  S      +++ DPTQK G QFR   ++S+ QKDITTPITTVPT+NIPTMPI +    NP +
Subjt:  VHLHHSNSTASYSLSLPRISLCLFLSLSFDKLFNISMMRSSASIRQTNKPDPTQKPG-QFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTS

Query:  TPDTVTTPSFSP-TTTAGGGSSWCIASLSASQTALQLALDYACGMGGADCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPS
            +TTPSF+P TTTA GGSSWCIAS+SASQ ALQLALDYACGMGGADC++IQ+                       + P  NSCNFGGTA+ITSTNPS
Subjt:  TPDTVTTPSFSP-TTTAGGGSSWCIASLSASQTALQLALDYACGMGGADCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPS

Query:  SGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS
        SGTCEYPSTSTSS+VLNTTNSSGSTVFGAVP+  S
Subjt:  SGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS

TrEMBL top hitse value%identityAlignment
A0A1S4E1R5 PLASMODESMATA CALLOSE-BINDING PROTEIN 5-like8.9e-4865.08Show/hide
Query:  TNKPDPTQKPG-QFRAASR-TSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMG
        +++ DP QK G QFRAA R +ST+QKDITTPITTVPTINIPT+PI +    NP +    +TTPSF+P TT  GGSSWCIAS SASQ ALQLALDYACGMG
Subjt:  TNKPDPTQKPG-QFRAASR-TSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMG

Query:  GADCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS
        GADC++IQ+                       + P  NSCNFGGTAVITSTNPSSGTCEYPSTSTSS++LNTTNSSGSTVFGAVP+  S
Subjt:  GADCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS

A0A2I4HV56 glucan endo-1,3-beta-glucosidase 12-like isoform X33.0e-4054.76Show/hide
Query:  ISMMRSSASIRQTNKPDPTQKPGQFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTV----TTPSFSPTTTA------GGGSSWCIA
        +SM R  A +++  +P  T +P         STTQKDITTPITTVPTI  PT P ++TPIINP+STPD+V     TP  +P  +        GG+SWC+A
Subjt:  ISMMRSSASIRQTNKPDPTQKPGQFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTV----TTPSFSPTTTA------GGGSSWCIA

Query:  SLSASQTALQLALDYACGMGGADCAAIQ-----------------------SEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTV
        S SAS+TALQ+ALDYACG GGADC+AIQ                        + P  NSCNFGGTAV TS++PS+GTC+YPSTSTSS+VLNTTNSSGSTV
Subjt:  SLSASQTALQLALDYACGMGGADCAAIQ-----------------------SEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTV

Query:  FGAVPADQSP
        FGAVP+  SP
Subjt:  FGAVPADQSP

A0A6J1D8X8 glucan endo-1,3-beta-glucosidase 3-like2.4e-4566.49Show/hide
Query:  PGQFRAASRTS--TTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVT----TPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGADCA
        PG   ++ R S    QKDITTPITTVPTI+IPTM     PI+NPTSTPDTV+    TPSF P TTA GGSSWCIASL ASQ ALQLALDYACG+GGADC 
Subjt:  PGQFRAASRTS--TTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVT----TPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGADCA

Query:  AIQ-----------------------SEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQSP
         IQ                        + P  NSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPA + P
Subjt:  AIQ-----------------------SEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQSP

A0A6J1FWZ7 major pollen allergen Ole e 10-like2.6e-4765.78Show/hide
Query:  TNKPDPTQKPGQFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGA
        +++ DPTQK G++RAAS  ST QKDITTPITTVPTINIPT   TS    NP S   T+TTPSF+P TTA GGSSWCIASL ASQ  LQLALDYACGMGGA
Subjt:  TNKPDPTQKPGQFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGA

Query:  DCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS
        DC+AIQ+                       + P  NSCNFGGTAVITSTNPSSG+CEYPSTSTSS++LNTTNSSGSTVFGA P+  S
Subjt:  DCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS

A0A6J1JFA6 major pollen allergen Ole e 10-like1.5e-4766.31Show/hide
Query:  TNKPDPTQKPGQFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGA
        +++ DPTQK G++RAAS  ST QKDITTPITTVPTINIPT   TS    NP S   T+TTPSF+P TT  GGSSWCIA+L ASQ ALQLALDYACGMGGA
Subjt:  TNKPDPTQKPGQFRAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGA

Query:  DCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS
        DC+AIQ+                       + P  NSCNFGGTAVITSTNPSSG+CEYPSTSTSS+VLNTTNSSGSTVFGAVP+  S
Subjt:  DCAAIQS-----------------------EEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQS

SwissProt top hitse value%identityAlignment
Q8VYE5 Glucan endo-1,3-beta-glucosidase 121.4e-0533.97Show/hide
Query:  STPIINPTSTPDTVTTPSFSP------TTTAGGG-----SSWCIASLSASQTALQLALDYACGMGGADCAAIQSEEP-----------------------
        +TP+    ST  T  +PS SP      T T GGG       WCIAS  AS T LQ ALD+ACG G  DC+A+Q ++P                       
Subjt:  STPIINPTSTPDTVTTPSFSP------TTTAGGG-----SSWCIASLSASQTALQLALDYACGMGGADCAAIQSEEP-----------------------

Query:  -TSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQSP
         +S  C+F G +V    +PS G C Y   + ++   N T +   T  G + A  SP
Subjt:  -TSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQSP

Q9FJU9 Glucan endo-1,3-beta-glucosidase 134.0e-0533.06Show/hide
Query:  SPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGADCAAIQSEEP------------------------TSNSCNFGGTAVITSTNPSSGTCEYPSTS
        S T ++G  +SWCIAS  AS+  L+ ALD+ACG G  DC AIQ  +P                        T  +C+FGG  V  + +PS   C Y +  
Subjt:  SPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGADCAAIQSEEP------------------------TSNSCNFGGTAVITSTNPSSGTCEYPSTS

Query:  TSSTVLNTTNSSGSTVFGAVP
         + T    TN++  T   + P
Subjt:  TSSTVLNTTNSSGSTVFGAVP

Q9M2K6 PLASMODESMATA CALLOSE-BINDING PROTEIN 53.0e-0835.09Show/hide
Query:  WCIASLSASQTALQLALDYACGMGGADCAAIQSEEPTSN------------------------SCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNS
        WC+A  +A  ++LQ A+++ACG GGADC  IQ   P ++                        +CNF   A +TS NPS GTC+YPS+  +    N    
Subjt:  WCIASLSASQTALQLALDYACGMGGADCAAIQSEEPTSN------------------------SCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNS

Query:  SGSTVFGAVPADQS
        +  T  GA  AD S
Subjt:  SGSTVFGAVPADQS

Q9SD84 PLASMODESMATA CALLOSE-BINDING PROTEIN 21.4e-0534.75Show/hide
Query:  SSWCIASLSASQTALQLALDYACGMGGADC------------------------AAIQSEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTT
        +SWC+     S + LQ  LDYACG  GADC                        +  Q +   S SCNF GTA +T+T+PS   C +PS+++ S      
Subjt:  SSWCIASLSASQTALQLALDYACGMGGADC------------------------AAIQSEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTT

Query:  NSSGSTVFGAVPADQSPK
        + SGST     P   SPK
Subjt:  NSSGSTVFGAVPADQSPK

Q9ZU91 Glucan endo-1,3-beta-glucosidase 31.4e-0533.62Show/hide
Query:  SWCIASLSASQTALQLALDYACGMGGADCAAIQSEE------------------------PTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVL-NTT
        ++CIA     +  LQ ALD+ACG G  DC+A+   E                          S SC+F G A +T+T+PS GTC +P ++ S+  L N T
Subjt:  SWCIASLSASQTALQLALDYACGMGGADCAAIQSEE------------------------PTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVL-NTT

Query:  N----SSGSTVFGAVP
        +    S+ ST  G +P
Subjt:  N----SSGSTVFGAVP

Arabidopsis top hitse value%identityAlignment
AT1G09460.1 Carbohydrate-binding X8 domain superfamily protein1.9e-1034.32Show/hide
Query:  ITTPITTVPTINIPTMPITSTPIINPTST---PDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGADCAAIQ----------------
        I  P  T P  N  T P+T  P   P+ T   P  V  P     + +  G SWC+A   ASQ +LQ ALDYACG+  ADC+ +Q                
Subjt:  ITTPITTVPTINIPTMPITSTPIINPTST---PDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGADCAAIQ----------------

Query:  -------SEEPTSNSCNFGGTAVITSTNPSSGTCEY---PSTSTSSTVLNTTNSSGSTVFGAVPADQSP
                + P+  SC+FGG A + +TNPS+G+C Y    STST  T   TT +  +      P   +P
Subjt:  -------SEEPTSNSCNFGGTAVITSTNPSSGTCEY---PSTSTSSTVLNTTNSSGSTVFGAVPADQSP

AT1G29380.1 Carbohydrate-binding X8 domain superfamily protein9.1e-1342.06Show/hide
Query:  TTAGGG---SSWCIASLSASQTALQLALDYACGMGGADCAAIQ-----------------------SEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTST
        T AGGG     WCIA  +AS T+LQ+ALDYACG GGADC  IQ                        + P S+SCNFGG A +TST+PS G+C +  +S+
Subjt:  TTAGGG---SSWCIASLSASQTALQLALDYACGMGGADCAAIQ-----------------------SEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTST

Query:  SSTVLNTTNSSGSTV-FGAVPADQSP
        S TV  +  S  S   F + P+   P
Subjt:  SSTVLNTTNSSGSTV-FGAVPADQSP

AT2G30933.1 Carbohydrate-binding X8 domain superfamily protein1.3e-1939.78Show/hide
Query:  RAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTV-TTPSFSPTTTAG----GGSSWCIASLSASQTALQLALDYACGMGGADCAAIQ--
        +A +     +KDITTP+ T PT    T P T  P  N  S    V TTP   P++  G    G  SWC+A  + ++ ALQ ALDYACG+GGADC+ IQ  
Subjt:  RAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTV-TTPSFSPTTTAG----GGSSWCIASLSASQTALQLALDYACGMGGADCAAIQ--

Query:  ---------------------SEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQSPK
                              + P  +SCNF GTA+  S +PS G+C +PSTSTS ++LN T+  G  +FG +P+  +PK
Subjt:  ---------------------SEEPTSNSCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQSPK

AT2G30933.2 Carbohydrate-binding X8 domain superfamily protein1.2e-0938.19Show/hide
Query:  RAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTV-TTPSFSPTTTAG----GGSSWCIASLSASQTALQLALDYACGMGGADCAAIQ--
        +A +     +KDITTP+ T PT    T P T  P  N  S    V TTP   P++  G    G  SWC+A  + ++ ALQ ALDYACG+GGADC+ IQ  
Subjt:  RAASRTSTTQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTV-TTPSFSPTTTAG----GGSSWCIASLSASQTALQLALDYACGMGGADCAAIQ--

Query:  ---------------------SEEPTSNSCNFGGTAVITSTNPS
                              + P  +SCNF GTA+  S +PS
Subjt:  ---------------------SEEPTSNSCNFGGTAVITSTNPS

AT3G58100.1 plasmodesmata callose-binding protein 52.1e-0935.09Show/hide
Query:  WCIASLSASQTALQLALDYACGMGGADCAAIQSEEPTSN------------------------SCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNS
        WC+A  +A  ++LQ A+++ACG GGADC  IQ   P ++                        +CNF   A +TS NPS GTC+YPS+  +    N    
Subjt:  WCIASLSASQTALQLALDYACGMGGADCAAIQSEEPTSN------------------------SCNFGGTAVITSTNPSSGTCEYPSTSTSSTVLNTTNS

Query:  SGSTVFGAVPADQS
        +  T  GA  AD S
Subjt:  SGSTVFGAVPADQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTCGCAGGAAACCAAAGACACCACTAGAAATCCGACGATAACACACCGGATAAGTTCCGACGAGGAGAACGCAGAGCAAAAGATTTTGGAGGAAGAACAGAGAGG
GTCTGCGAATGGATGGGAGGTTCACTTGCACCACTCTAATTCTACTGCTTCTTACTCACTGTCTCTACCCAGAATCTCTCTCTGTCTCTTTCTCTCCTTGTCTTTTGACA
AATTATTCAACATTTCAATGATGCGTTCAAGTGCTTCAATTAGGCAAACCAACAAACCAGATCCCACACAAAAACCAGGGCAATTCAGAGCAGCTTCCAGAACTTCGACC
ACCCAAAAGGACATCACCACCCCAATAACGACGGTCCCAACAATCAACATCCCCACCATGCCGATCACATCGACGCCCATCATAAACCCAACTTCGACCCCCGACACCGT
CACAACCCCAAGTTTCAGCCCGACAACCACCGCCGGCGGCGGCTCCAGCTGGTGCATTGCGAGCCTCAGCGCGTCGCAGACGGCGCTGCAATTAGCTCTGGACTACGCTT
GCGGCATGGGCGGCGCCGATTGTGCGGCGATTCAGTCCGAAGAACCCACTTCCAATAGCTGTAATTTCGGAGGCACTGCTGTGATCACCAGCACAAATCCCAGCAGCGGC
ACATGCGAGTACCCATCAACAAGCACAAGTTCAACCGTCCTTAACACTACAAATTCAAGCGGCTCCACCGTGTTCGGCGCTGTTCCTGCTGATCAAAGTCCCAAAGTAAA
GCAAAACTGGGCCCAAGGGAAGTTAACGGGGCCCAAATCTGAAGGGCGTGGAGAGTCCGACGCTAAGAAAGAAGAGAAGTGGTGCAACCGAACGAACCGACACCCCACCC
CCCCCTCCGGTGGGCCCGTCCTCTTCGACTGCCAGTGCCAGCCACGGTTCACCAAAAACCCTTCACGTGGCCACGTGGAGCTACAGTGGCCGCACGGATGGGGAGAGCCA
AGTATATTCGTTGGCCCTTTTTTCGGTTTTGGCCAGCGGGGAGGGGCAAATGCGTCAATGGTCTCCCTTGACTTTCCAGAGAAGAGAATCACGGATCCGTGCGACGGAGA
AACGTCGCAATTGGAGGTTGGGAGGCGGAGCGGAGGCGGAGGCGTCGCTTGCTTCCCCCTCACGAGTCACGAGTGGAACAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCTCGCAGGAAACCAAAGACACCACTAGAAATCCGACGATAACACACCGGATAAGTTCCGACGAGGAGAACGCAGAGCAAAAGATTTTGGAGGAAGAACAGAGAGG
GTCTGCGAATGGATGGGAGGTTCACTTGCACCACTCTAATTCTACTGCTTCTTACTCACTGTCTCTACCCAGAATCTCTCTCTGTCTCTTTCTCTCCTTGTCTTTTGACA
AATTATTCAACATTTCAATGATGCGTTCAAGTGCTTCAATTAGGCAAACCAACAAACCAGATCCCACACAAAAACCAGGGCAATTCAGAGCAGCTTCCAGAACTTCGACC
ACCCAAAAGGACATCACCACCCCAATAACGACGGTCCCAACAATCAACATCCCCACCATGCCGATCACATCGACGCCCATCATAAACCCAACTTCGACCCCCGACACCGT
CACAACCCCAAGTTTCAGCCCGACAACCACCGCCGGCGGCGGCTCCAGCTGGTGCATTGCGAGCCTCAGCGCGTCGCAGACGGCGCTGCAATTAGCTCTGGACTACGCTT
GCGGCATGGGCGGCGCCGATTGTGCGGCGATTCAGTCCGAAGAACCCACTTCCAATAGCTGTAATTTCGGAGGCACTGCTGTGATCACCAGCACAAATCCCAGCAGCGGC
ACATGCGAGTACCCATCAACAAGCACAAGTTCAACCGTCCTTAACACTACAAATTCAAGCGGCTCCACCGTGTTCGGCGCTGTTCCTGCTGATCAAAGTCCCAAAGTAAA
GCAAAACTGGGCCCAAGGGAAGTTAACGGGGCCCAAATCTGAAGGGCGTGGAGAGTCCGACGCTAAGAAAGAAGAGAAGTGGTGCAACCGAACGAACCGACACCCCACCC
CCCCCTCCGGTGGGCCCGTCCTCTTCGACTGCCAGTGCCAGCCACGGTTCACCAAAAACCCTTCACGTGGCCACGTGGAGCTACAGTGGCCGCACGGATGGGGAGAGCCA
AGTATATTCGTTGGCCCTTTTTTCGGTTTTGGCCAGCGGGGAGGGGCAAATGCGTCAATGGTCTCCCTTGACTTTCCAGAGAAGAGAATCACGGATCCGTGCGACGGAGA
AACGTCGCAATTGGAGGTTGGGAGGCGGAGCGGAGGCGGAGGCGTCGCTTGCTTCCCCCTCACGAGTCACGAGTGGAACAAGTGA
Protein sequenceShow/hide protein sequence
MFSQETKDTTRNPTITHRISSDEENAEQKILEEEQRGSANGWEVHLHHSNSTASYSLSLPRISLCLFLSLSFDKLFNISMMRSSASIRQTNKPDPTQKPGQFRAASRTST
TQKDITTPITTVPTINIPTMPITSTPIINPTSTPDTVTTPSFSPTTTAGGGSSWCIASLSASQTALQLALDYACGMGGADCAAIQSEEPTSNSCNFGGTAVITSTNPSSG
TCEYPSTSTSSTVLNTTNSSGSTVFGAVPADQSPKVKQNWAQGKLTGPKSEGRGESDAKKEEKWCNRTNRHPTPPSGGPVLFDCQCQPRFTKNPSRGHVELQWPHGWGEP
SIFVGPFFGFGQRGGANASMVSLDFPEKRITDPCDGETSQLEVGRRSGGGGVACFPLTSHEWNK