; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0001036 (gene) of Chayote v1 genome

Gene IDSed0001036
OrganismSechium edule (Chayote v1)
Descriptionjacalin lectin family protein
Genome locationLG01:19408681..19412247
RNA-Seq ExpressionSed0001036
SyntenySed0001036
Gene Ontology termsGO:0030246 - carbohydrate binding (molecular function)
InterPro domainsIPR001229 - Jacalin-like lectin domain
IPR033734 - Jacalin-like lectin domain, plant
IPR036404 - Jacalin-like lectin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041599.1 putative Cysteine/Histidine-rich C1 domain family protein [Cucumis melo var. makuwa]4.7e-2947.24Show/hide
Query:  KGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGA
        K +D+ +  +AM LG++GG  G+ WDD  F SI++V++T   +  I SI  +Y   D+N   S+RHG   N       V+L+YP+EYL+SI G +G  G 
Subjt:  KGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGA

Query:  GHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK
         H VI SL+LESNK  YGP+G+E  +RF FPT+G KIV FHG SG  L+SIGI  + + NN+K
Subjt:  GHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK

XP_008466555.1 PREDICTED: uncharacterized protein LOC103503941 [Cucumis melo]4.7e-2947.24Show/hide
Query:  KGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGA
        K +D+ +  +AM LG++GG  G+ WDD  F SI++V++T   +  I SI  +Y   D+N   S+RHG   N       V+L+YP+EYL+SI G +G  G 
Subjt:  KGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGA

Query:  GHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK
         H VI SL+LESNK  YGP+G+E  +RF FPT+G KIV FHG SG  L+SIGI  + + NN+K
Subjt:  GHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK

XP_031738573.1 uncharacterized protein LOC101222978 isoform X1 [Cucumis sativus]1.4e-2848.75Show/hide
Query:  DVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGAGHC
        DV +  +AM LG++GG  G+ WDD  F SI++V++T   +  I SI  +Y   D+N   S+RHG  + GR     V+L+YP+EYL+SI G +G  G  H 
Subjt:  DVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGAGHC

Query:  VIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK
        VI SL+LESNK  YGP+G+E   RF FPT+G KIV FHG SG  L+SIGI  + + NN+K
Subjt:  VIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK

XP_038877874.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120070094 [Benincasa hispida]4.0e-2847.83Show/hide
Query:  NDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGAGH
        +D+ +  +AM LG++GG  G+ WDD     I+  KL    +  I SI+T+Y   D+N  WS+RHG     R     V+L+YP+EYL+SI G +G  G  H
Subjt:  NDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGAGH

Query:  CVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK
         VI SL+LESNK  YGP+G E    F FPT+G KIVGFHGRSG  L+SIGI    I N++K
Subjt:  CVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK

XP_038898849.1 jacalin-related lectin 3-like isoform X1 [Benincasa hispida]5.2e-3655.06Show/hide
Query:  MPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGA--GHCVIHSLT
        + LG+HGG+GGD WDDGVF SIK V +    +  I SI+TEY  D  NS +SD+HG +SN     +VV L+YPNEYLVSIHG +GD+G    H VI SLT
Subjt:  MPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGA--GHCVIHSLT

Query:  LESNKNCYGPYGKEGTKRFWFPTTG-CKIVGFHGRSGHSLDSIGIKAVSILNNIKINN
         ESNK  +GP+G+     FWFPT+G  K+VGFHG++G  LDSIGIK V + NN  I++
Subjt:  LESNKNCYGPYGKEGTKRFWFPTTG-CKIVGFHGRSGHSLDSIGIKAVSILNNIKINN

TrEMBL top hitse value%identityAlignment
A0A0A0LDX5 Uncharacterized protein6.7e-2948.75Show/hide
Query:  DVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGAGHC
        DV +  +AM LG++GG  G+ WDD  F SI++V++T   +  I SI  +Y   D+N   S+RHG  + GR     V+L+YP+EYL+SI G +G  G  H 
Subjt:  DVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGAGHC

Query:  VIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK
        VI SL+LESNK  YGP+G+E   RF FPT+G KIV FHG SG  L+SIGI  + + NN+K
Subjt:  VIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK

A0A1S3CSU6 uncharacterized protein LOC1035039412.3e-2947.24Show/hide
Query:  KGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGA
        K +D+ +  +AM LG++GG  G+ WDD  F SI++V++T   +  I SI  +Y   D+N   S+RHG   N       V+L+YP+EYL+SI G +G  G 
Subjt:  KGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGA

Query:  GHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK
         H VI SL+LESNK  YGP+G+E  +RF FPT+G KIV FHG SG  L+SIGI  + + NN+K
Subjt:  GHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK

A0A2Z6MY79 Uncharacterized protein (Fragment)9.9e-2542.86Show/hide
Query:  MKGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENST-WSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-
        M+ ++  +KK A  +G  GG GG  WDDG++  ++  +L  +    I+S+Q EY  D + S+ WS++HG S   +  +  V LDYP+E+L SIHG  G  
Subjt:  MKGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENST-WSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-

Query:  DGAGHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI--KAVSILNNIK
        +  GH ++ SL+ ESNK  YGP+G E    F  P TG KIVGFHGR G  LD+IG+  K++  LN  K
Subjt:  DGAGHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI--KAVSILNNIK

A0A5A7TJR2 Putative Cysteine/Histidine-rich C1 domain family protein2.3e-2947.24Show/hide
Query:  KGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGA
        K +D+ +  +AM LG++GG  G+ WDD  F SI++V++T   +  I SI  +Y   D+N   S+RHG   N       V+L+YP+EYL+SI G +G  G 
Subjt:  KGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGA

Query:  GHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK
         H VI SL+LESNK  YGP+G+E  +RF FPT+G KIV FHG SG  L+SIGI  + + NN+K
Subjt:  GHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK

M4EAF2 Uncharacterized protein9.9e-2545.45Show/hide
Query:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDEN--STWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTL
        +G  GG GG  WDDG+F +++ + +       I+SIQ EY   D+N  S WS++HG   NG  + E V LDYP+EYL S+HG+ G  D  GH  + SLTL
Subjt:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDEN--STWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTL

Query:  ESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI
        ESN+  YGP+G E    F  P +  K+ GFHG++G  LD+IG+
Subjt:  ESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI

SwissProt top hitse value%identityAlignment
F4HQX1 Jacalin-related lectin 34.0e-2341.26Show/hide
Query:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDEN--STWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTL
        LG  GG  G  WDDG++ ++K + +       I+SIQ EY   D+N  S WS++ G    G  + + V  DYP+EYL+S++G+ G  D  G   + SLT 
Subjt:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDEN--STWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTL

Query:  ESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI
        ESN+  YGP+G +    F  P +G KI+GFHG++G  LD+IG+
Subjt:  ESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI

P82859 Agglutinin1.1e-1737.75Show/hide
Query:  KMAMPL----GQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHC
        KMA+P+    G  GG GG  WDDGVFP+I+ + L  +    I +I+  Y   D     S +HG    G    + + L+   E+L+ I G  G  +G+G  
Subjt:  KMAMPL----GQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHC

Query:  -VIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI
          + S+T  +NK  YGPYG E  + F       ++VGFHGRSG  LD+IG+
Subjt:  -VIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI

Q9M5W9 Myrosinase-binding protein 28.1e-1634.42Show/hide
Query:  AMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLT
        A  L   GG GG  WDDGVF  ++ + L     D +  +  EY    + +   D HG+ +   +  E   LDYP+EY+ S+ G      G    V+ SLT
Subjt:  AMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLT

Query:  LESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK
         ++NK    P+G    + F     G KIVGFHG++G  +  IG+ AV I  N +
Subjt:  LESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK

Q9SAV1 Myrosinase-binding protein 23.6e-1635.1Show/hide
Query:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTLES
        L   GG GG  WDDGVF  ++ + L     D +  +  EY    + +   DRHG+ +   +  E   LDYP+EY+ S+ G      G    V+ SLT ++
Subjt:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTLES

Query:  NKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK
        NK    P+G    + F     G KIVGFHG++G  +  IG+ AV I  N +
Subjt:  NKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK

Q9SSM3 Jacalin-related lectin 192.7e-1936.14Show/hide
Query:  GLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENS--TWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGH
        G K + + +G  GG GG  WDDG++  ++ ++L  +    I+SI    V+ D+N     S++HG     +  E  + L YP EYL  + G       +G 
Subjt:  GLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENS--TWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGH

Query:  CVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI-----KAVSILNNIK
         VI S+T +SNK  YGPYG E    F F   G +IVG +GRSG  LDSIG      K+  ++N ++
Subjt:  CVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI-----KAVSILNNIK

Arabidopsis top hitse value%identityAlignment
AT1G19715.1 Mannose-binding lectin superfamily protein2.9e-2441.26Show/hide
Query:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDEN--STWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTL
        LG  GG  G  WDDG++ ++K + +       I+SIQ EY   D+N  S WS++ G    G  + + V  DYP+EYL+S++G+ G  D  G   + SLT 
Subjt:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDEN--STWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTL

Query:  ESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI
        ESN+  YGP+G +    F  P +G KI+GFHG++G  LD+IG+
Subjt:  ESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI

AT1G19715.2 Mannose-binding lectin superfamily protein8.1e-1944.14Show/hide
Query:  IESIQTEYVVDDEN--STWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHG
        I+SIQ EY   D+N  S WS++ G    G  + + V  DYP+EYL+S++G+ G  D  G   + SLT ESN+  YGP+G +    F  P +G KI+GFHG
Subjt:  IESIQTEYVVDDEN--STWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHG

Query:  RSGHSLDSIGI
        ++G  LD+IG+
Subjt:  RSGHSLDSIGI

AT1G19715.3 Mannose-binding lectin superfamily protein2.9e-2441.26Show/hide
Query:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDEN--STWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTL
        LG  GG  G  WDDG++ ++K + +       I+SIQ EY   D+N  S WS++ G    G  + + V  DYP+EYL+S++G+ G  D  G   + SLT 
Subjt:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDEN--STWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTL

Query:  ESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI
        ESN+  YGP+G +    F  P +G KI+GFHG++G  LD+IG+
Subjt:  ESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI

AT1G52030.2 myrosinase-binding protein 22.6e-1735.1Show/hide
Query:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTLES
        L   GG GG  WDDGVF  ++ + L     D +  +  EY    + +   DRHG+ +   +  E   LDYP+EY+ S+ G      G    V+ SLT ++
Subjt:  LGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGHCVIHSLTLES

Query:  NKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK
        NK    P+G    + F     G KIVGFHG++G  +  IG+ AV I  N +
Subjt:  NKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIK

AT1G73040.1 Mannose-binding lectin superfamily protein1.9e-2036.14Show/hide
Query:  GLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENS--TWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGH
        G K + + +G  GG GG  WDDG++  ++ ++L  +    I+SI    V+ D+N     S++HG     +  E  + L YP EYL  + G       +G 
Subjt:  GLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENS--TWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGD-DGAGH

Query:  CVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI-----KAVSILNNIK
         VI S+T +SNK  YGPYG E    F F   G +IVG +GRSG  LDSIG      K+  ++N ++
Subjt:  CVIHSLTLESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGI-----KAVSILNNIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGAAATGACGTGGGGTTGAAAAAGATGGCGATGCCTCTCGGACAACACGGTGGCCTGGGCGGAGACGTATGGGATGATGGAGTCTTCCCATCGATTAAAACAGT
GAAGCTTACTTGTATTCCAAAGGATCGTATTGAGTCCATTCAGACTGAATATGTTGTTGACGATGAAAACTCAACTTGGTCAGATAGGCACGGCCAAAGTTCAAATGGTA
GAGTACAAGAAGAGGTGGTTAATCTTGACTATCCGAATGAGTATCTAGTTTCAATTCATGGCAGCCTGGGTGACGATGGTGCTGGCCACTGTGTCATTCACTCATTGACT
TTGGAAAGCAACAAAAATTGTTATGGGCCATATGGGAAGGAAGGAACAAAAAGATTTTGGTTTCCTACCACTGGTTGTAAGATTGTTGGTTTTCATGGTAGATCGGGTCA
TTCCCTTGATTCCATCGGGATTAAAGCTGTCTCAATTCTCAACAATATCAAGATCAACAATTAA
mRNA sequenceShow/hide mRNA sequence
GTGATATCTAGATGTATTCAGAAGAACAACTCCAAGGTCTTGTGATTTCGGTCCTTCAATACTAAATGTTGAGAATATTCTAATCTAAATTATAAGTTTCCCAGGTCCAC
ACTTGTTCGCATCGAGTTCTATTGTGTCGATGGTATTTATAATCTATGAGCGGCAGGTAGCGTTGAGCGTGTATAAGATCAGAGCAGGAGGAGTAGAGAAATATGAAAGG
AAATGACGTGGGGTTGAAAAAGATGGCGATGCCTCTCGGACAACACGGTGGCCTGGGCGGAGACGTATGGGATGATGGAGTCTTCCCATCGATTAAAACAGTGAAGCTTA
CTTGTATTCCAAAGGATCGTATTGAGTCCATTCAGACTGAATATGTTGTTGACGATGAAAACTCAACTTGGTCAGATAGGCACGGCCAAAGTTCAAATGGTAGAGTACAA
GAAGAGGTGGTTAATCTTGACTATCCGAATGAGTATCTAGTTTCAATTCATGGCAGCCTGGGTGACGATGGTGCTGGCCACTGTGTCATTCACTCATTGACTTTGGAAAG
CAACAAAAATTGTTATGGGCCATATGGGAAGGAAGGAACAAAAAGATTTTGGTTTCCTACCACTGGTTGTAAGATTGTTGGTTTTCATGGTAGATCGGGTCATTCCCTTG
ATTCCATCGGGATTAAAGCTGTCTCAATTCTCAACAATATCAAGATCAACAATTAAACACTTTCTCATTTTTGTACTATGCTAGTTAAGCTATTCTTTGTTGGAATAAAC
TTAAAATATATTTTAATGTATTTAGTGGGTAGATAATTTAGGATTTAATGGATTGTCAATTTAGGATACAAATTAAAGAATTAGTAGATTTTATTGATTTAGCAATTAGT
GGGAAATACTTATGGAAATAACTTGGAGACCAAGTTTGTAATAGGCTATAAATAGAGTGTTCCTTCATTGTGAATTGTAGAGAGAAAATGTGAGAGCAAAAAGTATAGAG
TTTTAGAGAGAAATAAAGTAAATTATTTTGAGAATTTGTGGTTTACTATTTTGTTGTTTCTAATTTAGTTTTTAAAAATTAGTATTAGAGGCGATGGCGAAGGTGGCGGG
TCAAATATCGCTACCACGATTGACGAAGGAGAATTAAGAAAATTGGACCATTCAGATGAAAGCGATTCTTGAATCTCATGATGCATGGGAAATGGTCGAAAAAGTTTTTG
TAGAACCAGAAGATACTTCAGATTATACGGCGATTCAAACCAAAGAATCGAAATAGACGCATTCGAAGGATAAAACAGCATTGTACATGTTGTTCAGTGTAGTTGACGAA
TCGAGCTTTGAAAAGATTGCGAGTTCAACTACTTCAAAAGAAGCGTGGAACACTTTAGAGAAAGTGTTCAAAGGAACTGAACGAGTGAAACAAGTGCGTCTCCAGACACT
TCGCTGCGAGTTAGAGAGCATAAAGATGAAGGAGTCAGAAAGTGTTCTGACTATATCACGCGTGTGCAGACAGTGTAAATCAACTAAATCACAATGGTGAAAAGCTAACG
GATGCGCGAGTCGTTGAAACGATTTTGAGATCATTGACGAAATGTTTTGAGAATGTGGTATGTGCAATAGAGGAGTCAAAAGACCTTAAGACACTCAAGTTTGACGAGCT
TGCTGGATCTCTTGAGGCACATGAGCAACGCAAGAACAAGAAGAAAGAAGAAACAATAGAGCAAGCGCTTCAAACTAAGGCCTTAATCGAAGATGAAGAGGTACTCTATT
CTAAACAATTTTAGAGTAGAGGTCGCGGTCGTGGATATGGTCGCAGTAGTCAAAGCGACGATTATGATGCGGAGAAGGGGCATACGAGTCCCACTGGGCATGCCAATTTG
TTCAAACGAGATTTCCGGAGAAATTTAGTTTCGTGGAGTTGAACTTGATTTCTGAGATGATTTCGCGAAAGTGGTTCAATCTCTTTGACTTGATCCTCTAATTCGTTCTC
TCTGAATGAATGTTCTTGATATGAAGAACAATTTAAAGAATAAAATGTGTGTTCTTCGTACTCAACTGCTCGGTTATGGAATTATTTGTCTTACGAAGTAATTTTTGCCT
TCGAAACGATGTTTGATCTTCAGAACCCTGGAGAGACCTCAACAAATGTGAATTCCCTCTCAAGTTCCAAAAAATTTCCACACCCTTTGAAATGAAAAGAACTTCTCTAT
TTATAGAGTTCTCATGAATCGTGGGCTCGGGTCCTGATGGGCTTTTGGGCCTGGCCCTTGGGCTTGGATTTCGTGGTTTGGTTCACTAAATGGACTTGGTCTGGTTTAAT
TTTGAACCAAATTTAGAATAAACCAATTCTTGGCCCAAATTAATTTTGAGAAATATTGTCCCAAAATTTAATCTAAAATTGTCATTAAATTTGAGATAATGACGTTGCAT
GCAACTTGTAATTGACCGAAATTTCAGAGTCAACACTATGATCAGGCAATTGGTCATTTCTTCGATGACCTAAAAGTAGACTACCGCCTCTTATTCAAAAGGAATTTCTA
GACCAATGTGGGAATACCATTACATAGTTTCTCTAAGAGTAAGAATCAATAGTTCGTTGTTACCTTAAATCCTAATTGACATTGAATTCCCTTTAACCAATTATCACAAC
TTTTCTATCGCCTTAACTTTTATTTAATTAAATTTTATTTTTTATTTTTTAAAATTTCGTATAGGCTTCAGACTTCGCACTACCCGTAATTAATTTTCATGAATCCTCAC
TATATTCTTTGTATTTCATTTCTTAGAATTCCAAACATTTTAATCTTCAAATTAAACTTGATAAACCTTCAACCCACTCCCTCATTTAAAATAGAAAACTATTTACTTCC
TTTCTTCTTTTACTTTCTTTGTGCTCTCTCCCCTGTTTCTTCTTGTCAACTAAAGATGAAGTCCAATCATTATTAATTACATTTCCTCATTGGAACTATTTTTTTACTTT
GATTAAATCTCACACAAAGTTGCATCAATACCAACGATTTTTATCTACTTGTTAATGTGCTAGAAATTTTCCACATTTTCTATATTTTTATTTCCATGGTTTCATCCAAA
ATCAAGCCAAAACAAGCTAGTCATGAAATACAAAAGTGTAATGGTTAAATCTCTACATTCCTATGATTTTGTTTTAGAAAAGAGTTTATGAAAGCTTACCGGCCACAATC
TCGAAATTTCTTCCCGAAACATCTATGGTCAGCCTCCAATTACTCCTTTGCCAAGTTCTCTCTCTCTTTAAGTGTAATGTCGAAGAAATGAGGAAGGTTTTCCCTTCTTT
CCATTTTCCTTCTTAAAGC
Protein sequenceShow/hide protein sequence
MKGNDVGLKKMAMPLGQHGGLGGDVWDDGVFPSIKTVKLTCIPKDRIESIQTEYVVDDENSTWSDRHGQSSNGRVQEEVVNLDYPNEYLVSIHGSLGDDGAGHCVIHSLT
LESNKNCYGPYGKEGTKRFWFPTTGCKIVGFHGRSGHSLDSIGIKAVSILNNIKINN