CuGenDBv2

Lsi01G001810 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G001810
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: glucan endo-1,3-beta-glucosidase 12-like isoform X8
Genome location: chr01:1758361..1761742
RNA-Seq Expression: Lsi01G001810
Synteny: Lsi01G001810
Gene Ontology terms:
GO:0005975 - carbohydrate metabolic process (biological process)
GO:0009506 - plasmodesma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
InterPro domains:
IPR012946 - X8 domain
IPR044965 - Glycoside hydrolase family 17, plant
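The genome location field above can be read programmatically. The following is a minimal sketch, assuming the string follows a `chromosome:start..end` pattern with 1-based, inclusive coordinates (an assumption; the record itself does not state the coordinate convention). The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr01:1758361..1761742")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 3382 bp on chr01.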


Homology
GenBank top hits | e value | % identity | Alignment
KAA0052570.1 PLASMODESMATA CALLOSE-BINDING PROTEIN 3-like [Cucumis melo var. makuwa] | 2.7e-83 | 93.6
Query:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL
        SSIAQRPFFEANQED VLQN+EEQIPYFSTSTSS+QLDTIPIVNPTTPGGTTPTQTPIVNP   PAQT PTGPTITPTTPTTTGGGSWCIASPNASP AL
Subjt:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL

Query:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHFK
        QVALDYACGYGGADCSAIQSGGSCFEPNTM+DHASYAFNDYYQKNPAPTSCVFGGTAQLT+TDPSK DHHFK
Subjt:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHFK

KAG6581403.1 Glucan endo-1,3-beta-glucosidase 7, partial [Cucurbita argyrosperma subsp. sororia] | 1.6e-83 | 83.58
Query:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTP-GGTTPTQTPIVNPPSLPAQTIPTG--------------PTITPTTPTTTGG
        SSIA+ P+FEANQED  LQNQEEQIPYFSTS S+ QLDTIPIVNPTTP GGTTPTQTPIV+P S PAQTIPTG              PTITPTTPTTTGG
Subjt:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTP-GGTTPTQTPIVNPPSLPAQTIPTG--------------PTITPTTPTTTGG

Query:  GSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHFKQWKLSLRSIKINA
        GSWCIAS +ASPTALQVALDYACGYGGADCSAIQ+GGSCFEPNTMRDHASYAFN YYQKNPAPTSCVFGGTAQLTSTDPSK DHHF+QWKLSLRSIKINA
Subjt:  GSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHFKQWKLSLRSIKINA

Query:  K
        +
Subjt:  K

XP_004134626.1 PLASMODESMATA CALLOSE-BINDING PROTEIN 3 [Cucumis sativus] | 2.6e-78 | 89.66
Query:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTI---TPTTPTTTGGGSWCIASPNASP
        SSIAQRPFFEANQ+D VLQNQEEQIPYFS STSS+QLDTIPIVNPTTPGGTTPTQTPIVNP   PAQT PTGPTI   TPTTPTTTGGGSWCIASPNASP
Subjt:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTI---TPTTPTTTGGGSWCIASPNASP

Query:  TALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF
        TALQVA+DYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLT+TDPS  + H+
Subjt:  TALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF

XP_008439674.1 PREDICTED: PLASMODESMATA CALLOSE-BINDING PROTEIN 3-like [Cucumis melo] | 6.1e-80 | 91.23
Query:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL
        SSIAQRPFFEANQED VLQN+EEQIPYFSTSTSS+QLDTIPIVNPTTPGGTTPTQTPIVNP   PAQT PTGPTITPTTPTTTGGGSWCIASPNASP AL
Subjt:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL

Query:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF
        QVALDYACGYGGADCSAIQSGGSCFEPNTM+DHASYAFNDYYQKNPAPTSCVFGGTAQLT+TDPS  + H+
Subjt:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF

XP_038882928.1 PLASMODESMATA CALLOSE-BINDING PROTEIN 3-like [Benincasa hispida] | 8.2e-85 | 95.32
Query:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL
        SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSS+QLDTIPIVNPTTPGGTTPTQTPIVNPPS PAQTIPTGP ITPTTPTTTGGGSWCIASPNASPTAL
Subjt:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL

Query:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF
        QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPS  + H+
Subjt:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KI12 X8 domain-containing protein | 1.2e-78 | 89.66
Query:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTI---TPTTPTTTGGGSWCIASPNASP
        SSIAQRPFFEANQ+D VLQNQEEQIPYFS STSS+QLDTIPIVNPTTPGGTTPTQTPIVNP   PAQT PTGPTI   TPTTPTTTGGGSWCIASPNASP
Subjt:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTI---TPTTPTTTGGGSWCIASPNASP

Query:  TALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF
        TALQVA+DYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLT+TDPS  + H+
Subjt:  TALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF

A0A1S3AZA1 PLASMODESMATA CALLOSE-BINDING PROTEIN 3-like | 3.0e-80 | 91.23
Query:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL
        SSIAQRPFFEANQED VLQN+EEQIPYFSTSTSS+QLDTIPIVNPTTPGGTTPTQTPIVNP   PAQT PTGPTITPTTPTTTGGGSWCIASPNASP AL
Subjt:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL

Query:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF
        QVALDYACGYGGADCSAIQSGGSCFEPNTM+DHASYAFNDYYQKNPAPTSCVFGGTAQLT+TDPS  + H+
Subjt:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF

A0A5A7UBN9 PLASMODESMATA CALLOSE-BINDING PROTEIN 3-like | 1.3e-83 | 93.6
Query:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL
        SSIAQRPFFEANQED VLQN+EEQIPYFSTSTSS+QLDTIPIVNPTTPGGTTPTQTPIVNP   PAQT PTGPTITPTTPTTTGGGSWCIASPNASP AL
Subjt:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL

Query:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHFK
        QVALDYACGYGGADCSAIQSGGSCFEPNTM+DHASYAFNDYYQKNPAPTSCVFGGTAQLT+TDPSK DHHFK
Subjt:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHFK

A0A5D3CNT9 PLASMODESMATA CALLOSE-BINDING PROTEIN 3-like | 3.0e-80 | 91.23
Query:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL
        SSIAQRPFFEANQED VLQN+EEQIPYFSTSTSS+QLDTIPIVNPTTPGGTTPTQTPIVNP   PAQT PTGPTITPTTPTTTGGGSWCIASPNASP AL
Subjt:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTAL

Query:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF
        QVALDYACGYGGADCSAIQSGGSCFEPNTM+DHASYAFNDYYQKNPAPTSCVFGGTAQLT+TDPS  + H+
Subjt:  QVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF

A0A6J1IUV4 PLASMODESMATA CALLOSE-BINDING PROTEIN 2-like isoform X1 | 4.6e-73 | 82.51
Query:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTP-GGTTPTQTPIVNPPSLPAQTIPTG--------------PTITPTTPTTTGG
        SSIA+ P+FEANQED VLQNQEEQIPYFSTS S+ QLDTIPIVNPTTP GGTTPTQTPIV+P S PAQTIPTG              PTITPTTPTTTGG
Subjt:  SSIAQRPFFEANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTP-GGTTPTQTPIVNPPSLPAQTIPTG--------------PTITPTTPTTTGG

Query:  GSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKND
        GSWCIAS +ASPTALQVALDYACGYGGADCSAIQ+GGSCFEPNTMRDHASYAFN YYQKNPAPTSCVFGGTAQLTSTDPS  +
Subjt:  GSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKND

SwissProt top hits | e value | % identity | Alignment
O65399 Glucan endo-1,3-beta-glucosidase 1 | 5.0e-16 | 51.19
Query:  TGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQK-NPAPTSCVFGGTAQLTSTDPS
        T   ++CIA        LQ ALD+ACG G ++CS IQ G SC++PN ++ HAS+AFN YYQK   A  SC F G A +T+TDPS
Subjt:  TGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQK-NPAPTSCVFGGTAQLTSTDPS

P52409 Glucan endo-1,3-beta-glucosidase | 8.5e-16 | 42.74
Query:  GTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKN-PAP
        G +   TP  NP   P+             P  +GGG WC+A   A+ T LQ  ++YACG+   DC  IQSGG+CF PN+++ HASY  N YYQ N    
Subjt:  GTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKN-PAP

Query:  TSCVFGGTAQLTSTDPS
         +C F GT  +TS+DPS
Subjt:  TSCVFGGTAQLTSTDPS

Q84V39 Major pollen allergen Ole e 10 | 3.8e-16 | 41.96
Query:  PPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQ-KNPAPTSCVFGGTAQL
        P   P  ++PT  +  P  P T G   WC+    A+   LQ  +DY C   G DC  IQ+ G+CF PNT+R HASYA N +YQ K      C F GT  +
Subjt:  PPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQ-KNPAPTSCVFGGTAQL

Query:  TSTDPSKNDHHF
        TS+DPS     F
Subjt:  TSTDPSKNDHHF

Q8VYE5 Glucan endo-1,3-beta-glucosidase 12 | 2.2e-16 | 44.36
Query:  DTIPI--VNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGS--WCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDH
        +T P+   N TT    +P+ +PI+N  S            T T     GGG+  WCIAS  AS T LQ ALD+ACG G  DCSA+Q    CFEP+T+  H
Subjt:  DTIPI--VNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGS--WCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDH

Query:  ASYAFNDYYQKNPAPT-SCVFGGTAQLTSTDPS
        ASYAFN YYQ++ A +  C F G +     DPS
Subjt:  ASYAFNDYYQKNPAPT-SCVFGGTAQLTSTDPS

Q9M069 Glucan endo-1,3-beta-glucosidase 7 | 7.6e-17 | 50
Query:  TPTTPTTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNP-APTSCVFGGTAQLTSTDPSKND
        TP+   T+ G  WC+    A+   LQ +LD+ACG+ G DC AIQ GG+CFEPN +  HA+YA N Y+QK+P  PT C F  TA +TS +PS N+
Subjt:  TPTTPTTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNP-APTSCVFGGTAQLTSTDPSKND

Arabidopsis top hits | e value | % identity | Alignment
AT1G09460.1 Carbohydrate-binding X8 domain superfamily protein | 2.7e-25 | 48.87
Query:  IPIVNPT-TPGGTTPTQTPIVN-PPSLPAQTIP-----TGPTITPTTPTTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRD
        I I  PT TP  T P   P+   PP+ P+ T+P       P +   +P+ + G SWC+A P AS  +LQ ALDYACG   ADCS +Q GG+C+ P +++ 
Subjt:  IPIVNPT-TPGGTTPTQTPIVN-PPSLPAQTIP-----TGPTITPTTPTTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRD

Query:  HASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPS
        HAS+AFN YYQKNP+P SC FGG A L +T+PS
Subjt:  HASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPS

AT1G29380.1 Carbohydrate-binding X8 domain superfamily protein | 8.3e-35 | 47.69
Query:  IPIVNPTTPGGTTPTQT-PIVNPPSL--PAQTIPTG--PTITPTTPT--------TT-------------------------------------------
        IP+VNPT PGG+T T T    +PPSL  P  T PTG  P +  TTPT        TT                                           
Subjt:  IPIVNPTTPGGTTPTQT-PIVNPPSL--PAQTIPTG--PTITPTTPT--------TT-------------------------------------------

Query:  -------GGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF
               G G WCIA  NASPT+LQVALDYACGYGGADC  IQ G +C+EPNT+RDHAS+AFN YYQK+P   SC FGG AQLTSTDPSK   HF
Subjt:  -------GGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHF

AT2G30933.1 Carbohydrate-binding X8 domain superfamily protein | 1.5e-28 | 47.95
Query:  TTPGGTTPTQTP--IVNPPSLPAQTIPTGPTITPTTP---TTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFND
        TTP  T PT TP  +V      A  + T P   P++P      G  SWC+A  N +  ALQ ALDYACG GGADCS IQ GG+C+ PN++R HAS+AFN 
Subjt:  TTPGGTTPTQTP--IVNPPSLPAQTIPTGPTITPTTP---TTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFND

Query:  YYQKNPAPTSCVFGGTAQLTSTDPSKNDHHFKQWKLSLRSIKINAK
        YYQKNP P+SC F GTA   S DPS    HF     S   + + ++
Subjt:  YYQKNPAPTSCVFGGTAQLTSTDPSKNDHHFKQWKLSLRSIKINAK

AT2G30933.2 Carbohydrate-binding X8 domain superfamily protein | 1.7e-27 | 53.6
Query:  TTPGGTTPTQTP--IVNPPSLPAQTIPTGPTITPTTP---TTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFND
        TTP  T PT TP  +V      A  + T P   P++P      G  SWC+A  N +  ALQ ALDYACG GGADCS IQ GG+C+ PN++R HAS+AFN 
Subjt:  TTPGGTTPTQTP--IVNPPSLPAQTIPTGPTITPTTP---TTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDHASYAFND

Query:  YYQKNPAPTSCVFGGTAQLTSTDPS
        YYQKNP P+SC F GTA   S DPS
Subjt:  YYQKNPAPTSCVFGGTAQLTSTDPS

AT4G29360.2 O-Glycosyl hydrolases family 17 protein | 1.9e-18 | 42.21
Query:  DTIPI--VNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGS--WCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDH
        +T P+   N TT    +P+ +PI+N  S            T T     GGG+  WCIAS  AS T LQ ALD+ACG G  DCSA+Q    CFEP+T+  H
Subjt:  DTIPI--VNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGS--WCIASPNASPTALQVALDYACGYGGADCSAIQSGGSCFEPNTMRDH

Query:  ASYAFNDYYQKNPAPT-SCVFGGTAQLTSTDPSKNDHHFKQWKLSLRSIKINAK
        ASYAFN YYQ++ A +  C F G +     DPSK   +   +     SIK N K
Subjt:  ASYAFNDYYQKNPAPT-SCVFGGTAQLTSTDPSKNDHHFKQWKLSLRSIKINAK


Sequences
CDS sequence
ATGGTAAAGTTGCTTCAACATCATTATTCGACAAAAAAAAGTTTTAAGTTCCCAACTATTCAAATATACAATGCCTCCTCACTCAAGTTTCCTCATATTCTCCTCCCCTT
CTTTCTGACTCTTGATGATCCAAGAAAGCTTAGATTCTGTCGTCCAGCTCCTTGTATATATTGCTTGCCGCCAAAAAGGCTAATTCTTCTTTACAGAAACACATTTGACA
TACTTCGGATTAACACAAAATCTTTGGTTCTCTATCATTGTTCAAAAATCTTTGAAGATCCAGAACCAAACATTGGAAGTTCAAGTATTGCACAGAGACCGTTCTTTGAA
GCAAACCAAGAAGACTCTGTTCTGCAAAATCAAGAAGAACAAATACCCTACTTTTCAACTTCAACATCTAGCGATCAACTGGACACAATCCCCATTGTGAATCCAACCAC
CCCAGGTGGCACAACTCCAACCCAGACACCGATTGTTAATCCACCGTCACTGCCAGCACAAACAATCCCCACCGGACCGACCATAACGCCGACAACCCCAACAACCACAG
GTGGTGGAAGTTGGTGCATAGCCAGCCCAAATGCTTCCCCAACCGCTTTGCAGGTAGCTCTTGACTATGCTTGTGGCTATGGAGGTGCGGATTGCTCGGCGATTCAATCG
GGAGGTAGTTGCTTTGAGCCCAACACCATGCGAGACCATGCTTCTTATGCCTTCAACGACTACTACCAGAAGAATCCAGCACCCACAAGCTGCGTGTTTGGAGGAACGGC
GCAGCTCACCAGCACAGACCCCAGTAAGAATGATCATCACTTCAAGCAATGGAAATTGTCACTTCGCAGCATCAAGATCAACGCCAAGCACGACAACGCCCATCAATCCA
CCGATGAATCCAACTCCGATGCCGCCGCCAACCCCAACCATAACGACGCCAACACCAACCGACACAATGCCTACCGATACACCGCCGACCGATACCACACCCGCAGACGC
CGGCGGTTACGGATCAGAACCATCAGACACAGCAAGCTCGGCGATTCCGGTCAAGAGCTTCTTCTCTTTCGTCTTCTTGGGGTCACTTCTTATGGCAAATTGCATGTAAG
AGGAAGCAGAAAGCGACTCGGATCTTGTCGACTATTACAATCTGCAGACCGAGAAGGCGATTAG
mRNA sequence
ATGGTAAAGTTGCTTCAACATCATTATTCGACAAAAAAAAGTTTTAAGTTCCCAACTATTCAAATATACAATGCCTCCTCACTCAAGTTTCCTCATATTCTCCTCCCCTT
CTTTCTGACTCTTGATGATCCAAGAAAGCTTAGATTCTGTCGTCCAGCTCCTTGTATATATTGCTTGCCGCCAAAAAGGCTAATTCTTCTTTACAGAAACACATTTGACA
TACTTCGGATTAACACAAAATCTTTGGTTCTCTATCATTGTTCAAAAATCTTTGAAGATCCAGAACCAAACATTGGAAGTTCAAGTATTGCACAGAGACCGTTCTTTGAA
GCAAACCAAGAAGACTCTGTTCTGCAAAATCAAGAAGAACAAATACCCTACTTTTCAACTTCAACATCTAGCGATCAACTGGACACAATCCCCATTGTGAATCCAACCAC
CCCAGGTGGCACAACTCCAACCCAGACACCGATTGTTAATCCACCGTCACTGCCAGCACAAACAATCCCCACCGGACCGACCATAACGCCGACAACCCCAACAACCACAG
GTGGTGGAAGTTGGTGCATAGCCAGCCCAAATGCTTCCCCAACCGCTTTGCAGGTAGCTCTTGACTATGCTTGTGGCTATGGAGGTGCGGATTGCTCGGCGATTCAATCG
GGAGGTAGTTGCTTTGAGCCCAACACCATGCGAGACCATGCTTCTTATGCCTTCAACGACTACTACCAGAAGAATCCAGCACCCACAAGCTGCGTGTTTGGAGGAACGGC
GCAGCTCACCAGCACAGACCCCAGTAAGAATGATCATCACTTCAAGCAATGGAAATTGTCACTTCGCAGCATCAAGATCAACGCCAAGCACGACAACGCCCATCAATCCA
CCGATGAATCCAACTCCGATGCCGCCGCCAACCCCAACCATAACGACGCCAACACCAACCGACACAATGCCTACCGATACACCGCCGACCGATACCACACCCGCAGACGC
CGGCGGTTACGGATCAGAACCATCAGACACAGCAAGCTCGGCGATTCCGGTCAAGAGCTTCTTCTCTTTCGTCTTCTTGGGGTCACTTCTTATGGCAAATTGCATGTAAG
AGGAAGCAGAAAGCGACTCGGATCTTGTCGACTATTACAATCTGCAGACCGAGAAGGCGATTAGATTTTTTTTTTTTTTTAAATTATTTTTTTCTTTTATTTTCTGCCAT
GTAACGTTTATACAAAGGATACGGGGAACTCGGGGGAAGAAATTAGTGTTTAATTATACTAAAAAATGAGGGGAAAATGGCAGAAAATTATCCTAGGGTTTCTGCTATAA
TAGTGATAGAATAAATTTCTAGCTAATATCTACGTTTAGGATTATATTCCCTCTGGAGCTATGTGTTCTCCATTTTAGTTGATGTTCATCCCTATATTCAAAAAAATAAA
ATAGAAAAAAGTTCATGTT
Protein sequence
MVKLLQHHYSTKKSFKFPTIQIYNASSLKFPHILLPFFLTLDDPRKLRFCRPAPCIYCLPPKRLILLYRNTFDILRINTKSLVLYHCSKIFEDPEPNIGSSSIAQRPFFE
ANQEDSVLQNQEEQIPYFSTSTSSDQLDTIPIVNPTTPGGTTPTQTPIVNPPSLPAQTIPTGPTITPTTPTTTGGGSWCIASPNASPTALQVALDYACGYGGADCSAIQS
GGSCFEPNTMRDHASYAFNDYYQKNPAPTSCVFGGTAQLTSTDPSKNDHHFKQWKLSLRSIKINAKHDNAHQSTDESNSDAAANPNHNDANTNRHNAYRYTADRYHTRRR
RRLRIRTIRHSKLGDSGQELLLFRLLGVTSYGKLHVRGSRKRLGSCRLLQSADREGD
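The relationship between the CDS and protein sequences above can be spot-checked with a small translation routine. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a reading frame starting at the first base; the function name `translate` is illustrative and is not the pipeline CuGenDB uses. Stop codons are emitted as `*` rather than ending translation, and codons containing unexpected characters become `X`.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 1; '*' marks stop codons."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    residues = []
    for i in range(0, len(cds) - 2, 3):
        # Unknown codons (e.g. containing N) are reported as 'X'.
        residues.append(CODON_TABLE.get(cds[i:i + 3], "X"))
    return "".join(residues)


# First 30 bases of the CDS sequence listed above.
cds_start = "ATGGTAAAGTTGCTTCAACATCATTATTCG"
print(translate(cds_start))
```

For the first 30 bases of the CDS this yields MVKLLQHHYS, which matches the first ten residues of the protein sequence listed above. Passing the full CDS (with its line breaks) works the same way, since whitespace is removed before translation.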