; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029178 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029178
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNucleotide-diphospho-sugar transferase family protein
Genome locationtig00153210:3893903..3898206
RNA-Seq ExpressionSgr029178
SyntenySgr029178
Gene Ontology termsGO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR005069 - Nucleotide-diphospho-sugar transferase
IPR044575 - Beta-arabinofuranosyltransferase RAY1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573710.1 Beta-arabinofuranosyltransferase RAY1, partial [Cucurbita argyrosperma subsp. sororia]6.0e-8978.95Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFGTECFQRVTKVKSR+VLRILKLGYNVLLSDVD+YWF+NPLPFLY+F SGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TSGQSEQPSFYDTLCG+G                                 ELWKKKNIK ACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYD STRM
Subjt:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  CKHNLQSEL
        CKHNLQ ++
Subjt:  CKHNLQSEL

XP_022945744.1 uncharacterized protein LOC111449891 [Cucurbita moschata]1.3e-8878.95Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFGTECFQRVTKVKSR+VLRILKLGYNVLLSDVD+YWFKNPLPFLY+F SGVLAAQSDEYKKTGPINLPRRLNSGFYFARSD STIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TSGQSEQPSFYDTLCG+G                                 ELWKKKNIK ACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYD STRM
Subjt:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  CKHNLQSEL
        CKHNLQ ++
Subjt:  CKHNLQSEL

XP_022966691.1 uncharacterized protein LOC111466319 [Cucurbita maxima]2.1e-8979.43Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFGTECFQRVTKVKSR+VLRILKLGYNVLLSDVD+YWFKNPLPFLY+F SGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TSGQSEQPSFYDTLCG+G                                 ELWKKKNIK ACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYD STRM
Subjt:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  CKHNLQSEL
        CKHNLQ ++
Subjt:  CKHNLQSEL

XP_023542074.1 uncharacterized protein LOC111802050 [Cucurbita pepo subsp. pepo]2.7e-8978.95Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFGTECFQRVTKVKSR+VLRILKLGYNVLLSDVD+YWFKNPLPFLY+F SGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TSGQSEQPSFYDTLCG+G                                 ELWKKKN+K ACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYD STRM
Subjt:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  CKHNLQSEL
        CKHNLQ ++
Subjt:  CKHNLQSEL

XP_038891724.1 uncharacterized protein LOC120081123 [Benincasa hispida]2.3e-8878.47Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFGTECFQRVTKVKSR+VLRILKLGYNVLLSDVDVYWF NPLPFLY+F SGVL AQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TSGQSEQPSFYDTLCG+G                                 ELWKKKNIK  CRKKGCFVLHNNWI+GRLKKLERQMFSGLWEYDMSTRM
Subjt:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  CKHNLQSEL
        CKHNLQ ++
Subjt:  CKHNLQSEL

TrEMBL top hitse value%identityAlignment
A0A5A7SYJ9 UDP-galactose:fucoside alpha-3-galactosyltransferase1.3e-8676.74Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFGTECFQRVTKVKSR+VLRILKLGYNVLLSDVDVYWF NPLPFLYTF SGVL AQSDEYKKTGPINLPRRLNSGFYFARSDE TIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TS QSEQPSFYDTLCG+G                                  LW KKNIK+ACRKKGCFVLHNNWISGRLKKLERQMFSGLW+YDMSTRM
Subjt:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  CKHNLQSELAKPLCL
        C HNLQ   AKP+ L
Subjt:  CKHNLQSELAKPLCL

A0A5D3CW00 Beta-arabinofuranosyltransferase RAY1 isoform X21.3e-8676.74Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFGTECFQRVTKVKSR+VLRILKLGYNVLLSDVDVYWF NPLPFLYTF SGVL AQSDEYKKTGPINLPRRLNSGFYFARSDE TIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TS QSEQPSFYDTLCG+G                                  LW KKNIK+ACRKKGCFVLHNNWISGRLKKLERQMFSGLW+YDMSTRM
Subjt:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  CKHNLQSELAKPLCL
        C HNLQ   AKP+ L
Subjt:  CKHNLQSELAKPLCL

A0A6J1DD34 uncharacterized protein LOC1110196731.2e-8778.47Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFGTECFQRVTKVKSR+VLRILKLGYNVLLSDVDVYWFKNPLPFLY F SGVLAAQSDEYKKTGPINLPRRLNSGFYFARSD+STIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TSGQSEQPSFYDTLCGD                                  ELWKKKNIKAACRKKGC+VLHNNWISGRLKKLERQMFSGLWEYDMST+M
Subjt:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  CKHNLQSEL
        CK +L  E+
Subjt:  CKHNLQSEL

A0A6J1G1U1 uncharacterized protein LOC1114498916.5e-8978.95Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFGTECFQRVTKVKSR+VLRILKLGYNVLLSDVD+YWFKNPLPFLY+F SGVLAAQSDEYKKTGPINLPRRLNSGFYFARSD STIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TSGQSEQPSFYDTLCG+G                                 ELWKKKNIK ACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYD STRM
Subjt:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  CKHNLQSEL
        CKHNLQ ++
Subjt:  CKHNLQSEL

A0A6J1HQ10 uncharacterized protein LOC1114663191.0e-8979.43Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFGTECFQRVTKVKSR+VLRILKLGYNVLLSDVD+YWFKNPLPFLY+F SGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TSGQSEQPSFYDTLCG+G                                 ELWKKKNIK ACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYD STRM
Subjt:  TSGQSEQPSFYDTLCGDG---------------------------------ELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  CKHNLQSEL
        CKHNLQ ++
Subjt:  CKHNLQSEL

SwissProt top hitse value%identityAlignment
F4I6V0 Beta-arabinofuranosyltransferase RAY13.7e-7367.66Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFG++CFQRVTKVKSR VL+ILKLGYNVLLSDVDVYWF+NPLP L +F   VLAAQSDEY  T PIN PRRLNSGFYFARSD  TIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGD---------------------------------GELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TSG SEQPSFYDTLCG+                                 G+LW K++++A C KK CFVLHNNWISGRLKKLERQM  GLWEYD S RM
Subjt:  TSGQSEQPSFYDTLCGD---------------------------------GELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  C
        C
Subjt:  C

Q3E6Y3 Uncharacterized protein At1g286957.3e-0535.44Show/hide
Query:  KSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRL-NSGFYFARSDESTIAAMEK
        ++RL+L +L+ GYNV+ +D DV W ++PL          L    D       IN+  +L N+GFY  RS+  TI+  +K
Subjt:  KSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRL-NSGFYFARSDESTIAAMEK

Q54RP0 UDP-galactose:fucoside alpha-3-galactosyltransferase3.3e-0524.12Show/hide
Query:  YQLNDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINL-----PRRLNSGFYFARSDESTIAAM
        Y  N   +G   F+ +   K  +VL +LK GYNVL +D D+ W ++  PF++ +       Q +++     I+L        + +GFYF RS++ TI  +
Subjt:  YQLNDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINL-----PRRLNSGFYFARSDESTIAAM

Query:  ------------EKVVKHAATSGQSEQPSFYDTLCGDGELWKKKNIKAACRKKGC--------------------FVLHNNWISGRLKKLERQMFSGLW
                    +++        Q       + L    E  KK  I+     K                      F++HNN I G   K +R +  GLW
Subjt:  ------------EKVVKHAATSGQSEQPSFYDTLCGDGELWKKKNIKAACRKKGC--------------------FVLHNNWISGRLKKLERQMFSGLW

Q8VXZ5 Arabinosyltransferase XEG1131.2e-0432.31Show/hide
Query:  DCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKT
        D  +G+  F ++ + K  L+  +L  GY +L+ D D+ W KNP+P+L  F    +   SD+   T
Subjt:  DCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKT

Arabidopsis top hitse value%identityAlignment
AT1G14590.1 Nucleotide-diphospho-sugar transferase family protein4.7e-0732.95Show/hide
Query:  DCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTI
        + +F T  + ++   +  L+  +L++GYN + +D DV WF+NP P  Y +    +A   D Y      +L  R N GF F RS+  TI
Subjt:  DCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTI

AT1G28695.1 Nucleotide-diphospho-sugar transferase family protein5.2e-0635.44Show/hide
Query:  KSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRL-NSGFYFARSDESTIAAMEK
        ++RL+L +L+ GYNV+ +D DV W ++PL          L    D       IN+  +L N+GFY  RS+  TI+  +K
Subjt:  KSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRL-NSGFYFARSDESTIAAMEK

AT1G70630.1 Nucleotide-diphospho-sugar transferase family protein2.6e-7467.66Show/hide
Query:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA
        NDCHFG++CFQRVTKVKSR VL+ILKLGYNVLLSDVDVYWF+NPLP L +F   VLAAQSDEY  T PIN PRRLNSGFYFARSD  TIAAMEKVVKHAA
Subjt:  NDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAMEKVVKHAA

Query:  TSGQSEQPSFYDTLCGD---------------------------------GELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM
        TSG SEQPSFYDTLCG+                                 G+LW K++++A C KK CFVLHNNWISGRLKKLERQM  GLWEYD S RM
Subjt:  TSGQSEQPSFYDTLCGD---------------------------------GELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRM

Query:  C
        C
Subjt:  C

AT2G35610.1 xyloglucanase 1138.8e-0632.31Show/hide
Query:  DCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKT
        D  +G+  F ++ + K  L+  +L  GY +L+ D D+ W KNP+P+L  F    +   SD+   T
Subjt:  DCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKT

AT4G19970.1 CONTAINS InterPro DOMAIN/s: Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR005069)2.0e-0523.64Show/hide
Query:  FGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAME-----------
        F T  + ++   +  L+ ++L++GYN + +D D+ W ++P P LY    G      D +    P +    +N GF + +S+  +I   +           
Subjt:  FGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAME-----------

Query:  -------KVVKH-AATSGQSEQPSFYDTLCGDGELWKKKNIKAACRKKGCFVLHNNWISGRLKKL
                 +KH A  S    Q  F+DT+   G     ++I   C       +H N   G  KKL
Subjt:  -------KVVKH-AATSGQSEQPSFYDTLCGDGELWKKKNIKAACRKKGCFVLHNNWISGRLKKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAGGGTCTGCCGGTCTACAGAGATCCGTTGGCTCCAACCAATATCAGCTTAATGACTGTCACTTTGGAACAGAGTGCTTTCAGAGGGTGACAAAAGTGAAGTCCAG
ATTGGTTTTAAGGATATTAAAGCTGGGTTACAACGTACTTCTTAGTGACGTTGATGTATATTGGTTTAAAAATCCTCTTCCTTTTCTTTACACTTTTGATTCTGGTGTTC
TTGCAGCACAATCTGATGAGTACAAGAAGACAGGACCAATAAACTTACCTAGACGCTTGAACTCTGGTTTTTATTTTGCTCGTTCTGATGAATCAACAATAGCTGCCATG
GAGAAGGTGGTGAAGCATGCAGCAACTTCGGGACAGTCGGAGCAGCCAAGCTTCTATGATACCCTTTGCGGGGACGGAGAACTTTGGAAAAAGAAAAATATCAAAGCAGC
CTGCAGGAAGAAGGGATGTTTTGTTCTCCACAACAACTGGATTAGTGGAAGACTAAAGAAACTGGAACGTCAGATGTTTTCAGGCCTTTGGGAATATGACATGAGCACAA
GAATGTGCAAGCATAACTTGCAAAGTGAACTTGCTAAACCTTTATGTTTGCACGAATCAATGGATGTTGAAAAACCAGGAGGAAGAGCTCCCATGGGTATGAATCTGGAT
TGGAAGGTACTTTGTGCTGTAAACTTGAGCTTGTGGCTTGATCATAGCATTCCAAGTGAATATGAATATTGGTGTAACCAGTTGAGAAAATCTATTGCCCTGTTAGCCAC
ATATTTTGTCATTCCGAAACTGAAGGTAAGCAGCTCTACATCTCTGGAGAAGATAACTGAAAGGAAAGCACCAACAGGAATTGTCACTGCCCAAGAAACAACTATCTCTC
TCACAGTTTCTGCTCTAACACTGTTAAGTCCCCTTGCGAAGCCTACACCCATGACTGCACCAACCAATGTATGGGTTGCTGATATTGGTAGGCCCAGTTTCGATGCAACA
AGAACCACAGAAGCAGCAGCAAATTCCGCTGCAAATCCTCTCGTTGGTAATGCACCTGCTGCTCCACAGGCAACGGCCCGTGCTAGAGCCAATTTAAGATTCTTGCTTAA
GGGCAATGCTGCAAAGGATATCCCAGTTACGCCCAGGAATACGAGTAGCGGTGCAGCTGCAGCTGCAGCCTTGAGCAGGAACATACCCTACGGATGCACTTGTAGACAAG
GAATGAGACCAGAGCTCCCACTAGTGGAGAAATCACCCATGAAGAAGCAACTCTCGCTAGGGAACCCCAGAAGACAGCGCTGGCTCCTCCATAGACAAGACCAAATCCAA
CCATTGATCCCACTATACAATGCGTGCCTACTGGGTGTTGATGTTGGTACCTGCAACCAAGAACCAGCTGCAGCCAAAGACGAAAGCAGCCCAGCAAATAATGCCCCAGA
GAATTCCAGAACCGCCGCCGTGACCACTGCCTGTCGAAGCGTCAATGCACCGGAACCCACGGAAGTCCCCATGGCATTAGCCACATCATTGGCTCCTATATTCCAAGCCA
TGGCTAATCCCTGCCCCAAAGACTTCATGAAGAGTGGAAAACTGAGTGCTGATAATGCTATACATGTTATAATAGCAGTGGCAGTTCTGGAAGATATATGGAAGGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAGGGTCTGCCGGTCTACAGAGATCCGTTGGCTCCAACCAATATCAGCTTAATGACTGTCACTTTGGAACAGAGTGCTTTCAGAGGGTGACAAAAGTGAAGTCCAG
ATTGGTTTTAAGGATATTAAAGCTGGGTTACAACGTACTTCTTAGTGACGTTGATGTATATTGGTTTAAAAATCCTCTTCCTTTTCTTTACACTTTTGATTCTGGTGTTC
TTGCAGCACAATCTGATGAGTACAAGAAGACAGGACCAATAAACTTACCTAGACGCTTGAACTCTGGTTTTTATTTTGCTCGTTCTGATGAATCAACAATAGCTGCCATG
GAGAAGGTGGTGAAGCATGCAGCAACTTCGGGACAGTCGGAGCAGCCAAGCTTCTATGATACCCTTTGCGGGGACGGAGAACTTTGGAAAAAGAAAAATATCAAAGCAGC
CTGCAGGAAGAAGGGATGTTTTGTTCTCCACAACAACTGGATTAGTGGAAGACTAAAGAAACTGGAACGTCAGATGTTTTCAGGCCTTTGGGAATATGACATGAGCACAA
GAATGTGCAAGCATAACTTGCAAAGTGAACTTGCTAAACCTTTATGTTTGCACGAATCAATGGATGTTGAAAAACCAGGAGGAAGAGCTCCCATGGGTATGAATCTGGAT
TGGAAGGTACTTTGTGCTGTAAACTTGAGCTTGTGGCTTGATCATAGCATTCCAAGTGAATATGAATATTGGTGTAACCAGTTGAGAAAATCTATTGCCCTGTTAGCCAC
ATATTTTGTCATTCCGAAACTGAAGGTAAGCAGCTCTACATCTCTGGAGAAGATAACTGAAAGGAAAGCACCAACAGGAATTGTCACTGCCCAAGAAACAACTATCTCTC
TCACAGTTTCTGCTCTAACACTGTTAAGTCCCCTTGCGAAGCCTACACCCATGACTGCACCAACCAATGTATGGGTTGCTGATATTGGTAGGCCCAGTTTCGATGCAACA
AGAACCACAGAAGCAGCAGCAAATTCCGCTGCAAATCCTCTCGTTGGTAATGCACCTGCTGCTCCACAGGCAACGGCCCGTGCTAGAGCCAATTTAAGATTCTTGCTTAA
GGGCAATGCTGCAAAGGATATCCCAGTTACGCCCAGGAATACGAGTAGCGGTGCAGCTGCAGCTGCAGCCTTGAGCAGGAACATACCCTACGGATGCACTTGTAGACAAG
GAATGAGACCAGAGCTCCCACTAGTGGAGAAATCACCCATGAAGAAGCAACTCTCGCTAGGGAACCCCAGAAGACAGCGCTGGCTCCTCCATAGACAAGACCAAATCCAA
CCATTGATCCCACTATACAATGCGTGCCTACTGGGTGTTGATGTTGGTACCTGCAACCAAGAACCAGCTGCAGCCAAAGACGAAAGCAGCCCAGCAAATAATGCCCCAGA
GAATTCCAGAACCGCCGCCGTGACCACTGCCTGTCGAAGCGTCAATGCACCGGAACCCACGGAAGTCCCCATGGCATTAGCCACATCATTGGCTCCTATATTCCAAGCCA
TGGCTAATCCCTGCCCCAAAGACTTCATGAAGAGTGGAAAACTGAGTGCTGATAATGCTATACATGTTATAATAGCAGTGGCAGTTCTGGAAGATATATGGAAGGCCTGA
Protein sequenceShow/hide protein sequence
MLGSAGLQRSVGSNQYQLNDCHFGTECFQRVTKVKSRLVLRILKLGYNVLLSDVDVYWFKNPLPFLYTFDSGVLAAQSDEYKKTGPINLPRRLNSGFYFARSDESTIAAM
EKVVKHAATSGQSEQPSFYDTLCGDGELWKKKNIKAACRKKGCFVLHNNWISGRLKKLERQMFSGLWEYDMSTRMCKHNLQSELAKPLCLHESMDVEKPGGRAPMGMNLD
WKVLCAVNLSLWLDHSIPSEYEYWCNQLRKSIALLATYFVIPKLKVSSSTSLEKITERKAPTGIVTAQETTISLTVSALTLLSPLAKPTPMTAPTNVWVADIGRPSFDAT
RTTEAAANSAANPLVGNAPAAPQATARARANLRFLLKGNAAKDIPVTPRNTSSGAAAAAALSRNIPYGCTCRQGMRPELPLVEKSPMKKQLSLGNPRRQRWLLHRQDQIQ
PLIPLYNACLLGVDVGTCNQEPAAAKDESSPANNAPENSRTAAVTTACRSVNAPEPTEVPMALATSLAPIFQAMANPCPKDFMKSGKLSADNAIHVIIAVAVLEDIWKA