CuGenDBv2

Clc11G20230 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G20230
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: protein canopy-1-like isoform X1
Genome location: ClcChr11:30608794..30612469
RNA-Seq Expression: Clc11G20230
Synteny: Clc11G20230
Gene Ontology terms: NA
InterPro domains: IPR021852 - Domain of unknown function DUF3456


Homology
GenBank top hits (e value, % identity, alignment)
XP_008452994.1 PREDICTED: protein canopy-1 isoform X1 [Cucumis melo] | e value: 1.6e-91 | % identity: 94.02
Query:  MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGK
        MTMK NAWVLLLLVIYSGVV+CIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT STG+
Subjt:  MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGK

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV+ GDVSKVLCHDLSRHC+ ASSVQLND+DD  DGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

XP_022936575.1 protein canopy-1-like isoform X1 [Cucurbita moschata] | e value: 9.6e-92 | % identity: 94.51
Query:  MKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW
        MKSNAWVLLL+ IY GVVNCIDDKCAACNAVAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW
Subjt:  MKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        IKVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  G VSKVLCHDLSRHCSKASSVQLND+DD ADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

XP_022975788.1 protein canopy-1-like isoform X1 [Cucurbita maxima] | e value: 1.1e-92 | % identity: 95.05
Query:  MKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW
        MKSNAWVLLL+VIY GVVNCIDDKCAACNAVAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK+DSTGKQW
Subjt:  MKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        IKVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  GDVSKVLCHDLSRHCSKASSVQLND+DD ADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

XP_038896985.1 protein canopy-1 isoform X1 [Benincasa hispida] | e value: 1.4e-95 | % identity: 96.74
Query:  MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGK
        MTMKSNAWVLLLLVIYSGVV+CIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTG+
Subjt:  MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGK

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  GDVSKVLCHDLSRHCSKASSV LNDEDD ADGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

XP_038896987.1 protein canopy-1 isoform X2 [Benincasa hispida] | e value: 1.0e-93 | % identity: 96.2
Query:  MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGK
        MTMKSNAWVLLLLVIYSGVV+CIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTG+
Subjt:  MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGK

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  GDVSKVLCHDLSRHCSKA SV LNDEDD ADGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BUL2 protein canopy-1 isoform X2 | e value: 6.7e-91 | % identity: 92.93
Query:  MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGK
        MTMK NAWVLLLLVIYSGVV+CIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT STG+
Subjt:  MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGK

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV+ GDVSKVLCHDLSRHC+  +SVQLND+DD  DGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

A0A1S3BWB0 protein canopy-1 isoform X1 | e value: 8.0e-92 | % identity: 94.02
Query:  MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGK
        MTMK NAWVLLLLVIYSGVV+CIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT STG+
Subjt:  MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGK

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV+ GDVSKVLCHDLSRHC+ ASSVQLND+DD  DGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

A0A6J1FE32 protein canopy-1-like isoform X1 | e value: 4.7e-92 | % identity: 94.51
Query:  MKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW
        MKSNAWVLLL+ IY GVVNCIDDKCAACNAVAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW
Subjt:  MKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        IKVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  G VSKVLCHDLSRHCSKASSVQLND+DD ADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

A0A6J1IHQ2 protein canopy-1-like isoform X2 | e value: 3.9e-91 | % identity: 94.51
Query:  MKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW
        MKSNAWVLLL+VIY GVVNCIDDKCAACNAVAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK+DSTGKQW
Subjt:  MKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        IKVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  GDVSKVLCHDLSRHCSKA SVQLND+DD ADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

A0A6J1IKA3 protein canopy-1-like isoform X1 | e value: 5.5e-93 | % identity: 95.05
Query:  MKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW
        MKSNAWVLLL+VIY GVVNCIDDKCAACNAVAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK+DSTGKQW
Subjt:  MKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        IKVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  GDVSKVLCHDLSRHCSKASSVQLND+DD ADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

SwissProt top hits (e value, % identity, alignment)
Q5HZV5 Protein canopy homolog 3 | e value: 5.4e-05 | % identity: 25.44
Query:  WVLLLLVIYSGVVNCID-------DKCAACNAVAEEIEHGL-SNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTG
        W+LLLL ++ G     D        KC  C  VA E++       + R  +D R+    +  +K K I Y  S++R++E+ +GLC ++ +Y + K  S  
Subjt:  WVLLLLVIYSGVVNCID-------DKCAACNAVAEEIEHGL-SNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTG

Query:  KQWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKA
         ++ K  +    T         K +      L  ET  ++A++ K+  V +     V+  D  RH  +A
Subjt:  KQWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKA

Q5M7D4 Protein canopy homolog 1 | e value: 4.9e-06 | % identity: 25.93
Query:  CAACNAVAEEIEHGLSNEKPRNHLDM-RHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTG----KQWIKVDNWD----NLTNKQEA
        C AC A+ +E+ + +    P+  +D+   R+   G+++   + +  SEL + ++L+ +CEKM DY +    +T     K++   DN      +  N Q  
Subjt:  CAACNAVAEEIEHGLSNEKPRNHLDM-RHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTG----KQWIKVDNWD----NLTNKQEA

Query:  RAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGD
           S  +   C R++EE ED++  +I K + ++ D
Subjt:  RAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGD

Q7JXF7 Protein seele | e value: 1.5e-07 | % identity: 25
Query:  KCAACNAVAEEIEHGLSNEKPRNHLDMR-HRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQWI-------KVDNWDNLTNKQEA
        KC  C AV  E+E  ++ E P    D+   RLD++G    K +    SE+ + EL++ +CEKM DY      S GK  +       +++   +L +  + 
Subjt:  KCAACNAVAEEIEHGLSNEKPRNHLDMR-HRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQWI-------KVDNWDNLTNKQEA

Query:  RAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGAD
           +K +  +C  +LE+ ++   +  +   +   D+   +C + + +C + S VQ   + DG +
Subjt:  RAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGAD

Q9QXT0 Protein canopy homolog 2 | e value: 3.4e-07 | % identity: 25.42
Query:  MKSNAWVLLLLVIYSGVV---NCIDDKCAACNAVAEEIEHGLSNEKPRNHLDM-RHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDST
        MK   W+ LLL +  G        D  C AC A+ +E+E  ++   P+  + M   R++  G +    + Y  SE  + ELL+ +C++M++Y  +   ST
Subjt:  MKSNAWVLLLLVIYSGVV---NCIDDKCAACNAVAEEIEHGLSNEKPRNHLDM-RHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDST

Query:  -GKQWIKVDNWDNLTNKQEARAYSKD------ISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKA
          K +++V + +  +++ + +    D      +   C  ++EE ED+L E   + + +V D    LC   +  C  A
Subjt:  -GKQWIKVDNWDNLTNKQEARAYSKD------ISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKA

Q9Y2B0 Protein canopy homolog 2 | e value: 9.9e-07 | % identity: 25.42
Query:  MKSNAWVLLLLVIYSGVV---NCIDDKCAACNAVAEEIEHGLSNEKPRNHLDM-RHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDST
        MK   W+ LLL    G        D  C AC A+ +E+E  ++   P+  + M   R++  G +    + Y  SE  + ELL+ +C++M++Y  +   ST
Subjt:  MKSNAWVLLLLVIYSGVV---NCIDDKCAACNAVAEEIEHGLSNEKPRNHLDM-RHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDST

Query:  -GKQWIKVDNWDNLTNKQEARAYSKD------ISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKA
          K +++V   +  +++ + +    D      +   C  ++EE ED+L E   + + +V D    LC   +  C  A
Subjt:  -GKQWIKVDNWDNLTNKQEARAYSKD------ISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKA

Arabidopsis top hits (e value, % identity, alignment)
AT1G42480.1 unknown protein | e value: 7.2e-61 | % identity: 62.01
Query:  LVIYSGVV-------NCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQWIKV
        LV+++ V+       + +DDKCAACNAVAEE+E  L  EKPRNHLDMR+RL+SKGQR+GKVIDYR+S+LRVV+LLDGLC++MQDYT++K +   +QW+KV
Subjt:  LVIYSGVV-------NCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQWIKV

Query:  DNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
         N+DNLTNKQEA+A++ DISTYCGRLLEETED+L E+IK GS+  G+V KVLC  LS HC ++S     DE+D    EL
Subjt:  DNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL

AT1G42480.2 unknown protein | e value: 6.8e-35 | % identity: 62.96
Query:  LVIYSGVV-------NCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQWIKV
        LV+++ V+       + +DDKCAACNAVAEE+E  L  EKPRNHLDMR+RL+SKGQR+GKVIDYR+S+LRVV+LLDGLC++MQDYT++K +   +QW+KV
Subjt:  LVIYSGVV-------NCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQWIKV

Query:  DNWDNLTN
         N+DNLT+
Subjt:  DNWDNLTN

AT1G42480.3 unknown protein | e value: 8.0e-44 | % identity: 50.56
Query:  LVIYSGVV-------NCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRV-SELRVVELLDGLCEKMQDYTIEKTDSTGKQWIK
        LV+++ V+       + +DDKCAACNAVAEE+E  L     +    +   + +KG+   ++   RV S+LRVV+LLDGLC++MQDYT++K +   +QW+K
Subjt:  LVIYSGVV-------NCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRV-SELRVVELLDGLCEKMQDYTIEKTDSTGKQWIK

Query:  VDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
        V N+DNLTNKQEA+A++ DISTYCGRLLEETED+L E+IK GS+  G+V KVLC  LS HC ++S     DE+D    EL
Subjt:  VDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
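
The middle line of each alignment block above marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a minimal sketch of how such a midline can be tallied, the Python below counts those three classes for the final block of the AT1G42480.2 alignment. The function name `midline_stats` is hypothetical, and the sketch assumes the 8-character `Query:  ` prefix has already been stripped from the line. It covers one block only; a hit's reported % identity is computed over its full alignment.

```python
def midline_stats(midline: str) -> dict:
    """Tally one BLAST-style midline (label prefix already removed).

    Letters mark identical residues, '+' marks conservative
    substitutions, and spaces mark mismatches or gaps.
    """
    identical = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return {
        "length": len(midline),
        "identical": identical,
        "positives": identical + conservative,
    }


# Final block of the AT1G42480.2 alignment: query "DNWDNLTN".
stats = midline_stats(" N+DNLT+")
print(stats)  # {'length': 8, 'identical': 5, 'positives': 7}
```

Summing `identical` and `length` over every block of a hit and dividing the totals gives a percentage that can be compared with the % identity listed for that hit.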


Sequences
CDS sequence
ATGACGATGAAATCCAATGCGTGGGTTCTATTGTTATTAGTGATCTACTCTGGCGTCGTCAATTGCATTGATGACAAATGCGCCGCTTGCAATGCTGTTGCCGAGGAGAT
AGAACATGGACTTTCCAATGAAAAACCGAGGAATCATTTAGATATGAGACATCGGTTGGATTCTAAAGGTCAGCGTAAGGGGAAGGTGATTGATTACAGGGTCAGTGAGC
TAAGAGTTGTCGAACTTCTGGATGGGCTTTGTGAAAAGATGCAAGATTACACCATTGAGAAGACAGATTCAACTGGAAAACAGTGGATCAAGGTGGATAACTGGGACAAC
TTGACAAATAAACAAGAAGCCCGGGCCTATTCTAAAGATATATCAACCTATTGTGGGAGGTTATTAGAGGAAACAGAAGATGATTTAGCAGAGTTGATTAAGAAAGGATC
TGTCAGCGTAGGTGATGTTAGCAAAGTCCTATGCCATGATTTGAGCAGGCATTGCAGCAAGGCCAGCAGTGTTCAGCTGAATGACGAGGACGACGGAGCAGATGGAGAAC
TATGA
mRNA sequence
ATGACGATGAAATCCAATGCGTGGGTTCTATTGTTATTAGTGATCTACTCTGGCGTCGTCAATTGCATTGATGACAAATGCGCCGCTTGCAATGCTGTTGCCGAGGAGAT
AGAACATGGACTTTCCAATGAAAAACCGAGGAATCATTTAGATATGAGACATCGGTTGGATTCTAAAGGTCAGCGTAAGGGGAAGGTGATTGATTACAGGGTCAGTGAGC
TAAGAGTTGTCGAACTTCTGGATGGGCTTTGTGAAAAGATGCAAGATTACACCATTGAGAAGACAGATTCAACTGGAAAACAGTGGATCAAGGTGGATAACTGGGACAAC
TTGACAAATAAACAAGAAGCCCGGGCCTATTCTAAAGATATATCAACCTATTGTGGGAGGTTATTAGAGGAAACAGAAGATGATTTAGCAGAGTTGATTAAGAAAGGATC
TGTCAGCGTAGGTGATGTTAGCAAAGTCCTATGCCATGATTTGAGCAGGCATTGCAGCAAGGCCAGCAGTGTTCAGCTGAATGACGAGGACGACGGAGCAGATGGAGAAC
TATGA
Protein sequence
MTMKSNAWVLLLLVIYSGVVNCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTDSTGKQWIKVDNWDN
LTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSVGDVSKVLCHDLSRHCSKASSVQLNDEDDGADGEL
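
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following Python is a minimal sketch under that assumption (it is not the pipeline the database used); `translate` is a hypothetical helper, and only the first 30 and last 15 nucleotides of the CDS are included to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon.

    Whitespace and line breaks are dropped, trailing bases that do not
    fill a codon are ignored, and unrecognized codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 and last 15 nucleotides of the CDS listed above.
cds_start = "ATGACGATGAAATCCAATGCGTGGGTTCTA"
cds_end = "GATGGAGAACTATGA"

print(translate(cds_start))  # MTMKSNAWVL
print(translate(cds_end))    # DGEL*
```

Both fragments agree with the listed protein, which begins MTMKSNAWVL and ends DGEL. Passing the full CDS to `translate` and dropping the trailing `*` allows the same comparison over the whole sequence.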