; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032489 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032489
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionAlpha-L-fucosidase
Genome locationchr11:33427077..33431523
RNA-Seq ExpressionLag0032489
SyntenyLag0032489
Gene Ontology termsGO:0006004 - fucose metabolic process (biological process)
GO:0016139 - glycoside catabolic process (biological process)
GO:0005764 - lysosome (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0004560 - alpha-L-fucosidase activity (molecular function)
InterPro domainsIPR000933 - Glycoside hydrolase, family 29
IPR002156 - Ribonuclease H domain
IPR008979 - Galactose-binding-like domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR017853 - Glycoside hydrolase superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579606.1 Alpha-L-fucosidase 1, partial [Cucurbita argyrosperma subsp. sororia]5.4e-11688.21Show/hide
Query:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR
        YLNKGDPKGTHW+P ECDVSIRKGWFWHKSESPKSL +LLKIYYNSVGRNCVLL NVPPNSTGLIAQ+DA+TLKQFKAAIDTIF+TNLA NCS+KASSQR
Subjt:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR

Query:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV
        GGK   FGPENVLD+DHLWTYWAP EADADA   HWIEIRSQ+NQ LRFNV+RIQEAIGLGQRI+RHEIYLDGK+IV+ES+VGYKRLHRIKTGVVSGYVV
Subjt:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV

Query:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL
        RVR+ EFRAVPLISSLGLHLDPFWHPTGL
Subjt:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL

KAG7017065.1 Alpha-L-fucosidase 1, partial [Cucurbita argyrosperma subsp. argyrosperma]2.7e-11587.77Show/hide
Query:  GRYLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASS
        G YLNKGDPKGTHW+P ECDVSIRKGWFWHKSESPKSL +LLKIYYNSVGRNCVLL NVPPNSTGLIAQ+DA+TLKQFKAAIDTIF+TNLA NCS+KASS
Subjt:  GRYLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASS

Query:  QRGGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV
        QRGGK   FGPENVLD+DHLWTYWAP EADA   HWIEIRSQ+NQ LRFNV+RIQEAIGLGQRI+RHEIYLDGK+IV+ES+VGYKRLHRIKTGVVSGYVV
Subjt:  QRGGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV

Query:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL
        RVR+ EFRAVPLISSLGLHLDPFWHPTGL
Subjt:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL

XP_022929122.1 alpha-L-fucosidase 1-like [Cucurbita moschata]5.4e-11688.21Show/hide
Query:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR
        YLNKGDPKGTHW+P ECDVSIRKGWFWHKSESPKSL +LLKIYYNSVGRNCVLL NVPPNSTGLIAQ+DA+TLKQFKAAIDTIF+TNLA NCS+KASSQR
Subjt:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR

Query:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV
        GGK   FGPENVLD+DHLWTYWAP EADADA   HWIEIRSQ+NQ LRFNV+RIQEAIGLGQRI+RHEIYLDGK+IV+ES+VGYKRLHRIKTGVVSGYVV
Subjt:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV

Query:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL
        RVR+ EFRAVPLISSLGLHLDPFWHPTGL
Subjt:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL

XP_022970037.1 putative alpha-L-fucosidase 1 [Cucurbita maxima]4.1e-11688.21Show/hide
Query:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR
        YLNKGDPKGTHW+P ECDVSIRKGWFWHKSESPKSL +LL IYYNSVGRNCVLL NVPPNSTGLIAQ+DA+TLKQFKAAIDTIF+TNLA NCS+KASSQR
Subjt:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR

Query:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV
        GGK   FGPENVLD+DHLWTYWAP EADADAG  HWIEIRSQ+NQ LRFNV+RIQEAIGLGQRI+RHEIYLDGK+IV+ES+VGYKRLHRIKTGVVSGYVV
Subjt:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV

Query:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL
        RVR+ EFRAVPLISSLGLHLDPFWHPTGL
Subjt:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL

XP_023520506.1 alpha-L-fucosidase 1-like [Cucurbita pepo subsp. pepo]2.2e-11487.28Show/hide
Query:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR
        YLNKGDPKGTHW+P ECDVSIR+GWFWHKSESPKSL +LLKIYYNSVGRNCVLL NVPPNSTGLIAQ+DA TLKQFKAAIDTIF+ NLA NCS+KASSQR
Subjt:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR

Query:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV
        GGK   FGPENVLD+DHLWTYWAP EADADA   HWIEIRSQ+NQ LRFNV+RIQEAIGLGQRI+RHEIYLDGK+IV+ES+VGYKRLHRIKTGVVSGYVV
Subjt:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV

Query:  RVRVNEFRAVPLISSLGLHLDPFWHPTG
        RVR+ EFRAVPLISSLGLHLDPFWHPTG
Subjt:  RVRVNEFRAVPLISSLGLHLDPFWHPTG

TrEMBL top hitse value%identityAlignment
A0A1S3ATS4 Alpha-L-fucosidase4.6e-9776.29Show/hide
Query:  RYLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQ
        +YLNKGDPKG  W+P ECDVSIR+GWFWHK++SPK+LK LLKIYYNSVGRNCVLL NVPPNSTGLI Q+DA  LKQFK  ID IF+TNLA NCS+KASSQ
Subjt:  RYLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQ

Query:  RGGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEE---STVGYKRLHRIKTGVVSGY
        R   G  FGP+NV+D+DHLWTYWAPKE D D  HWIEIRSQ N+RLRFNVVRIQEAIGLGQRI RHEIYLDGKRIV+E    ++GYKRL+RIK+GVV GY
Subjt:  RGGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEE---STVGYKRLHRIKTGVVSGY

Query:  VVRVRVNEFRAVPLISSLGLHLDPFWHPTGLS
         ++VR  EF+ VPLISSLGLHLDPFW+PT LS
Subjt:  VVRVRVNEFRAVPLISSLGLHLDPFWHPTGLS

A0A6J1DPV7 Alpha-L-fucosidase4.1e-10683.04Show/hide
Query:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR
        YL+KGDPKGTHW+  ECDVSIR+GWFWHKSESPKS+KKLLKIYYNSVGRNCVLL NVPPNSTGLI QQDA+TLK FK+AIDTIFSTNLA +CSLKASSQR
Subjt:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR

Query:  GGKGSAFGPENVLDNDHLWTYWAPK-EADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVVR
        GGK  AFGP+NVLD+DHLWTYWAP+ E D D+ HWIEIRSQ ++RLRFNVVRIQEAIGLGQRIKRHEIYLDGK IV + TVGYKRLHRI +GVVSG VVR
Subjt:  GGKGSAFGPENVLDNDHLWTYWAPK-EADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVVR

Query:  VRVNE--FRAVPLISSLGLHLDPFWHPTGL
        VR  E   RAVPLISSLGLHLDPFW PTGL
Subjt:  VRVNE--FRAVPLISSLGLHLDPFWHPTGL

A0A6J1ELW0 Alpha-L-fucosidase2.6e-11688.21Show/hide
Query:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR
        YLNKGDPKGTHW+P ECDVSIRKGWFWHKSESPKSL +LLKIYYNSVGRNCVLL NVPPNSTGLIAQ+DA+TLKQFKAAIDTIF+TNLA NCS+KASSQR
Subjt:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR

Query:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV
        GGK   FGPENVLD+DHLWTYWAP EADADA   HWIEIRSQ+NQ LRFNV+RIQEAIGLGQRI+RHEIYLDGK+IV+ES+VGYKRLHRIKTGVVSGYVV
Subjt:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV

Query:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL
        RVR+ EFRAVPLISSLGLHLDPFWHPTGL
Subjt:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL

A0A6J1H387 Alpha-L-fucosidase3.0e-11286.03Show/hide
Query:  RYLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQ
        +YLN+GDP+GTHW+P ECDVSIR GWFWHKSESPKSLK LL+IYYNSVGRNCVLLFNVPPNSTGLIAQ+DA+TL QFKAAI TIFS+NLA NCSL+ASSQ
Subjt:  RYLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQ

Query:  RGGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVVR
        RGGK SAFGPENVLDNDHLWTYWAP+ ADA   HWIEIRSQ+ Q LRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEES+VGYKRLHRIK+GVV G VVR
Subjt:  RGGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVVR

Query:  VRVNEFRAVPLISSLGLHLDPFWHPTGLS
        VR+ E RAVPLISSLGLHLDPFW PTGLS
Subjt:  VRVNEFRAVPLISSLGLHLDPFWHPTGLS

A0A6J1HY04 Alpha-L-fucosidase2.0e-11688.21Show/hide
Query:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR
        YLNKGDPKGTHW+P ECDVSIRKGWFWHKSESPKSL +LL IYYNSVGRNCVLL NVPPNSTGLIAQ+DA+TLKQFKAAIDTIF+TNLA NCS+KASSQR
Subjt:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR

Query:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV
        GGK   FGPENVLD+DHLWTYWAP EADADAG  HWIEIRSQ+NQ LRFNV+RIQEAIGLGQRI+RHEIYLDGK+IV+ES+VGYKRLHRIKTGVVSGYVV
Subjt:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAG--HWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVV

Query:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL
        RVR+ EFRAVPLISSLGLHLDPFWHPTGL
Subjt:  RVRVNEFRAVPLISSLGLHLDPFWHPTGL

SwissProt top hitse value%identityAlignment
Q7XUR3 Putative alpha-L-fucosidase 13.4e-4947.79Show/hide
Query:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR
        Y   GDP G  W+PAECDVSIR GWFWH SE PK+   LL IYY SVGRNC+L+ NVPPNS+GLI+ +D   L++F     TIFS N A N ++ AS+ R
Subjt:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR

Query:  GGKGS-AFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRI--KRHEIYLD--GKRIVEESTVGYKRLHRIKTGVVSG
        GG G+  F P NVL  + +++YWAP+E  +    W E+     Q   FNV+++QE I +GQR+   R EI +D   + IVE +T+GYKRL +    VV G
Subjt:  GGKGS-AFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRI--KRHEIYLD--GKRIVEESTVGYKRLHRIKTGVVSG

Query:  YVVRVRVNEFRAVPLISSLGLHLDPF
          +++ ++  RA PLIS  G+  D F
Subjt:  YVVRVRVNEFRAVPLISSLGLHLDPF

Q8GW72 Alpha-L-fucosidase 12.9e-4846.7Show/hide
Query:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR
        Y  +GD  G  W+PAECDVSIR GWFWH SESPK   +LL IYYNSVGRNC+ L NVPPNS+GLI++QD   L++F    ++IFS NLA    + +SS R
Subjt:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR

Query:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYL------DGKRIVEESTVGYKRLHRIKTGVVS
        G + S FGP+NVL+ + L  YWAP+E   +   W+ +  +    + FNV+ I+E I +GQRI    +        + +R+V  +TVG KRL R    VV 
Subjt:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYL------DGKRIVEESTVGYKRLHRIKTGVVS

Query:  GYVVRVRVNEFRAVPLISSLGLHLDPF
           +++ V++ R  PLIS LGL++D F
Subjt:  GYVVRVRVNEFRAVPLISSLGLHLDPF

Arabidopsis top hitse value%identityAlignment
AT2G28100.1 alpha-L-fucosidase 12.1e-4946.7Show/hide
Query:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR
        Y  +GD  G  W+PAECDVSIR GWFWH SESPK   +LL IYYNSVGRNC+ L NVPPNS+GLI++QD   L++F    ++IFS NLA    + +SS R
Subjt:  YLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSVGRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQR

Query:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYL------DGKRIVEESTVGYKRLHRIKTGVVS
        G + S FGP+NVL+ + L  YWAP+E   +   W+ +  +    + FNV+ I+E I +GQRI    +        + +R+V  +TVG KRL R    VV 
Subjt:  GGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIGLGQRIKRHEIYL------DGKRIVEESTVGYKRLHRIKTGVVS

Query:  GYVVRVRVNEFRAVPLISSLGLHLDPF
           +++ V++ R  PLIS LGL++D F
Subjt:  GYVVRVRVNEFRAVPLISSLGLHLDPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGAAAATGAACCCCAACACGCTGCAAAAGCCGTCGCACGCCCTTACCGCTGTACAGTTGTGATACTCGTTGGAAACAACAAAAAATGTAGAGAAGGAACGCACGT
CGATCTCCTTGCTGTCACCGCCGCTCGAGCCGCTGCACTGCCGAAAACGCTGGTCGGAGAAAGACTCGCGAGGGAGGTCGCCGGAGAACCATCTTTTGAAGAAGAAGAGG
TGGAGGAGGAGGAAGCAGTAGAGTCAATAATTGCGAACAACTTTGCCGATCGTTTAGTTATTTTGGCAACTCACTTATCACATGAGGACTTTGAACGGGCTTGCATAGCC
TTTTGGGCTATATGGAATGATAGGAACAACCATTCTAGAGGTATGAAGATAATGGACTGGGACCAACGCTGTCAATGGATATGTGGATATTGGGAGGAAACACGTCTGCC
GCGAAAATCTTTAGTCAAGGAGATAGCGTCTCAAGATGTTGTTCGAAACTCCCATGAGCAGGGTTACACGTTGTTCACTGATGCGGCGGTAAACCCACATAATACTGGTG
CAGGGTATGGTATGGTGATTTTGGGTCGTAATGGTACACTAATAGCGGCAATGGAGATGTTTGACTCAACGTGTTTTACTCCGTTAGCAGCTGAGGTTCAAGCAATTCTA
CATGGAATGCGACTAGTGCATCGGTTGCAATATACGACGGTAAAAATTGTTTCAGATTCTCTGATTGCGATACAAATGATATCAGGTGAAGTGCCAATTTCATCTGAGGT
CTTTGTTGTGGTTGTCAGATTTCCCTCCTTGGCTTTTATCCATGGGAATGGCAACTTCAGGCAAATGTATTTTGGACGGTACCTAAACAAAGGAGACCCAAAAGGGACAC
ATTGGATACCAGCAGAATGTGATGTATCCATAAGAAAGGGATGGTTTTGGCACAAATCAGAGTCCCCAAAAAGCCTAAAGAAGCTGCTGAAAATCTACTACAACTCAGTG
GGAAGAAACTGTGTCCTTCTCTTCAATGTCCCTCCCAATTCCACAGGCCTAATTGCCCAGCAAGATGCCAACACTCTCAAGCAATTCAAAGCAGCCATTGACACAATTTT
CTCCACAAATTTGGCTCTAAATTGCTCACTGAAAGCCAGCAGCCAAAGGGGTGGCAAAGGAAGTGCTTTTGGGCCTGAAAATGTGTTGGACAATGACCATTTGTGGACTT
ATTGGGCACCCAAGGAAGCTGATGCTGATGCTGGCCATTGGATTGAGATCAGAAGTCAGAAGAACCAACGGCTGAGATTCAACGTGGTGAGGATCCAAGAGGCCATTGGG
CTTGGTCAGAGGATCAAACGGCATGAGATTTATTTGGATGGGAAGAGGATTGTGGAGGAGAGTACTGTTGGGTACAAGCGGCTGCATAGGATTAAAACTGGAGTGGTCTC
TGGATATGTTGTGAGGGTTAGGGTCAATGAGTTTAGGGCTGTTCCTTTGATCTCTTCTTTGGGTCTCCATTTGGATCCTTTTTGGCACCCAACTGGGCTGTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGAAAATGAACCCCAACACGCTGCAAAAGCCGTCGCACGCCCTTACCGCTGTACAGTTGTGATACTCGTTGGAAACAACAAAAAATGTAGAGAAGGAACGCACGT
CGATCTCCTTGCTGTCACCGCCGCTCGAGCCGCTGCACTGCCGAAAACGCTGGTCGGAGAAAGACTCGCGAGGGAGGTCGCCGGAGAACCATCTTTTGAAGAAGAAGAGG
TGGAGGAGGAGGAAGCAGTAGAGTCAATAATTGCGAACAACTTTGCCGATCGTTTAGTTATTTTGGCAACTCACTTATCACATGAGGACTTTGAACGGGCTTGCATAGCC
TTTTGGGCTATATGGAATGATAGGAACAACCATTCTAGAGGTATGAAGATAATGGACTGGGACCAACGCTGTCAATGGATATGTGGATATTGGGAGGAAACACGTCTGCC
GCGAAAATCTTTAGTCAAGGAGATAGCGTCTCAAGATGTTGTTCGAAACTCCCATGAGCAGGGTTACACGTTGTTCACTGATGCGGCGGTAAACCCACATAATACTGGTG
CAGGGTATGGTATGGTGATTTTGGGTCGTAATGGTACACTAATAGCGGCAATGGAGATGTTTGACTCAACGTGTTTTACTCCGTTAGCAGCTGAGGTTCAAGCAATTCTA
CATGGAATGCGACTAGTGCATCGGTTGCAATATACGACGGTAAAAATTGTTTCAGATTCTCTGATTGCGATACAAATGATATCAGGTGAAGTGCCAATTTCATCTGAGGT
CTTTGTTGTGGTTGTCAGATTTCCCTCCTTGGCTTTTATCCATGGGAATGGCAACTTCAGGCAAATGTATTTTGGACGGTACCTAAACAAAGGAGACCCAAAAGGGACAC
ATTGGATACCAGCAGAATGTGATGTATCCATAAGAAAGGGATGGTTTTGGCACAAATCAGAGTCCCCAAAAAGCCTAAAGAAGCTGCTGAAAATCTACTACAACTCAGTG
GGAAGAAACTGTGTCCTTCTCTTCAATGTCCCTCCCAATTCCACAGGCCTAATTGCCCAGCAAGATGCCAACACTCTCAAGCAATTCAAAGCAGCCATTGACACAATTTT
CTCCACAAATTTGGCTCTAAATTGCTCACTGAAAGCCAGCAGCCAAAGGGGTGGCAAAGGAAGTGCTTTTGGGCCTGAAAATGTGTTGGACAATGACCATTTGTGGACTT
ATTGGGCACCCAAGGAAGCTGATGCTGATGCTGGCCATTGGATTGAGATCAGAAGTCAGAAGAACCAACGGCTGAGATTCAACGTGGTGAGGATCCAAGAGGCCATTGGG
CTTGGTCAGAGGATCAAACGGCATGAGATTTATTTGGATGGGAAGAGGATTGTGGAGGAGAGTACTGTTGGGTACAAGCGGCTGCATAGGATTAAAACTGGAGTGGTCTC
TGGATATGTTGTGAGGGTTAGGGTCAATGAGTTTAGGGCTGTTCCTTTGATCTCTTCTTTGGGTCTCCATTTGGATCCTTTTTGGCACCCAACTGGGCTGTCATGA
Protein sequenceShow/hide protein sequence
MSENEPQHAAKAVARPYRCTVVILVGNNKKCREGTHVDLLAVTAARAAALPKTLVGERLAREVAGEPSFEEEEVEEEEAVESIIANNFADRLVILATHLSHEDFERACIA
FWAIWNDRNNHSRGMKIMDWDQRCQWICGYWEETRLPRKSLVKEIASQDVVRNSHEQGYTLFTDAAVNPHNTGAGYGMVILGRNGTLIAAMEMFDSTCFTPLAAEVQAIL
HGMRLVHRLQYTTVKIVSDSLIAIQMISGEVPISSEVFVVVVRFPSLAFIHGNGNFRQMYFGRYLNKGDPKGTHWIPAECDVSIRKGWFWHKSESPKSLKKLLKIYYNSV
GRNCVLLFNVPPNSTGLIAQQDANTLKQFKAAIDTIFSTNLALNCSLKASSQRGGKGSAFGPENVLDNDHLWTYWAPKEADADAGHWIEIRSQKNQRLRFNVVRIQEAIG
LGQRIKRHEIYLDGKRIVEESTVGYKRLHRIKTGVVSGYVVRVRVNEFRAVPLISSLGLHLDPFWHPTGLS