CuGenDBv2

Cucsat.G11337 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G11337
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: protein canopy-1 isoform X1
Genome location: ctg1798:1262500..1264417
RNA-Seq Expression: Cucsat.G11337
Synteny: Cucsat.G11337
Gene Ontology terms: NA
InterPro domains: IPR021852 - Domain of unknown function DUF3456
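The genome location string can be split into its contig and coordinates with a few lines of Python. This is a minimal sketch: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("ctg1798:1262500..1264417")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene region spans 1918 bp, which is longer than the CDS listed under Sequences.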


Homology
GenBank top hits | e value | % identity
XP_004145555.1 protein canopy-1 isoform X2 [Cucumis sativus] | 2.45e-128 | 100
Query:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
        MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
Subjt:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL

XP_008452994.1 PREDICTED: protein canopy-1 isoform X1 [Cucumis melo] | 1.42e-120 | 94.02
Query:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
        MT+KFNAWVLLLL+IYS VVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGS GQ
Subjt:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV+PGDVSKVLCHDLS+HC+ +SVQLNDDDD E+DGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL

XP_008452995.1 PREDICTED: protein canopy-1 isoform X2 [Cucumis melo] | 2.27e-119 | 94.57
Query:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
        MT+KFNAWVLLLL+IYS VVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGS GQ
Subjt:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV+PGDVSKVLCHDLS+HC+ ASVQLNDDDD E+DGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL

XP_031739828.1 protein canopy-1 isoform X1 [Cucumis sativus] | 1.72e-126 | 99.46
Query:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
        MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
Subjt:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKAS-VQLNDDDDDESDGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKAS VQLNDDDDDESDGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKAS-VQLNDDDDDESDGEL

XP_038896987.1 protein canopy-1 isoform X2 [Benincasa hispida] | 2.63e-116 | 92.39
Query:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
        MT+K NAWVLLLL+IYS VV CIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT S GQ
Subjt:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  GDVSKVLCHDLS+HCSKASV LND+DD E+DGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL

TrEMBL top hits | e value | % identity
A0A0A0L1D8 DUF3456 domain-containing protein | 1.19e-128 | 100
Query:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
        MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
Subjt:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL

A0A1S3BUL2 protein canopy-1 isoform X2 | 1.10e-119 | 94.57
Query:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
        MT+KFNAWVLLLL+IYS VVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGS GQ
Subjt:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV+PGDVSKVLCHDLS+HC+ ASVQLNDDDD E+DGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL

A0A1S3BWB0 protein canopy-1 isoform X1 | 6.86e-121 | 94.02
Query:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ
        MT+KFNAWVLLLL+IYS VVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGS GQ
Subjt:  MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQ

Query:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
        QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV+PGDVSKVLCHDLS+HC+ +SVQLNDDDD E+DGEL
Subjt:  QWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL

A0A6J1CJN9 protein seele isoform X2 | 5.42e-113 | 90.61
Query:  KFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWI
        KFNAW LLLL+IYS  V+CIDDKCAACNAVAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT S+GQQWI
Subjt:  KFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWI

Query:  KVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
        KVD+WDNLTN+QEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  GDVSKVLCHDLS+HCS ASVQ+NDDDD E+DGEL
Subjt:  KVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL

A0A6J1IHQ2 protein canopy-1-like isoform X2 | 9.36e-114 | 91.16
Query:  KFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWI
        K NAWVLLL++IY  VV+CIDDKCAACNAVAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK+ S G+QWI
Subjt:  KFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWI

Query:  KVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
        KVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV PGDVSKVLCHDLS+HCSKASVQLNDDDD E+DGEL
Subjt:  KVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL

SwissProt top hits | e value | % identity
Q5HZV5 Protein canopy homolog 3 | 6.0e-04 | 27.62
Query:  WVLLLLLIYSAVVDCID-------DKCAACNAVAEEIEHGL-SNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAG
        W+LLLL ++    +  D        KC  C  VA E++       + R  +D R+    +  +K K I Y  S++R++E+ +GLC ++ +Y + K  S  
Subjt:  WVLLLLLIYSAVVDCID-------DKCAACNAVAEEIEHGL-SNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAG

Query:  QQWIK
         ++ K
Subjt:  QQWIK

Q5M7D4 Protein canopy homolog 1 | 7.1e-05 | 25.78
Query:  CAACNAVAEEIEHGLSNEKPRNHLDM-RHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWIK----VDNWD----NLTNKQEA
        C AC A+ +E+ + +    P+  +D+   R+   G+++   + +  SEL + ++L+ +CEKM DY +    +  ++  K     DN      +  N Q  
Subjt:  CAACNAVAEEIEHGLSNEKPRNHLDM-RHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWIK----VDNWD----NLTNKQEA

Query:  RAYSKDISTYCGRLLEETEDDLAELIKK
           S  +   C R++EE ED++  +I K
Subjt:  RAYSKDISTYCGRLLEETEDDLAELIKK

Q7JXF7 Protein seele | 2.0e-07 | 23.78
Query:  KCAACNAVAEEIEHGLSNEKPRNHLDMR-HRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWI-------KVDNWDNLTNKQEA
        KC  C AV  E+E  ++ E P    D+   RLD++G    K +    SE+ + EL++ +CEKM DY      S G+  +       +++   +L +  + 
Subjt:  KCAACNAVAEEIEHGLSNEKPRNHLDMR-HRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWI-------KVDNWDNLTNKQEA

Query:  RAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESD
           +K +  +C  +LE+ ++   +  +   +   D+   +C + + +C ++ VQ   D D + +
Subjt:  RAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESD

Q8BQ47 Protein canopy homolog 4 | 6.0e-04 | 29.1
Query:  IDDKCAACNAVAEEIEHGLSNE-KPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTI--EKTGSAG----------------QQWI
        +  KC  C  ++ E++  LS   + R  L++   LD+ G+RK + + Y +SE R+ E L+ LCE++ DY +  E+ GS                  Q+ +
Subjt:  IDDKCAACNAVAEEIEHGLSNE-KPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTI--EKTGSAG----------------QQWI

Query:  KVDNWDNLTNKQEARAYSKDISTYCGRLLEETED
        KVD    L    E       +   C  +LEE ED
Subjt:  KVDNWDNLTNKQEARAYSKDISTYCGRLLEETED

Arabidopsis top hits | e value | % identity
AT1G42480.1 unknown protein | 1.0e-59 | 61.45
Query:  LLIYSAVV-------DCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWIKV
        L++++AV+         +DDKCAACNAVAEE+E  L  EKPRNHLDMR+RL+SKGQR+GKVIDYR+S+LRVV+LLDGLC++MQDYT++K     +QW+KV
Subjt:  LLIYSAVV-------DCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWIKV

Query:  DNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
         N+DNLTNKQEA+A++ DISTYCGRLLEETED+L E+IK GS+  G+V KVLC  LS HC ++S   ++D++D+   EL
Subjt:  DNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL

AT1G42480.2 unknown protein | 2.0e-34 | 62.96
Query:  LLIYSAVV-------DCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWIKV
        L++++AV+         +DDKCAACNAVAEE+E  L  EKPRNHLDMR+RL+SKGQR+GKVIDYR+S+LRVV+LLDGLC++MQDYT++K     +QW+KV
Subjt:  LLIYSAVV-------DCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWIKV

Query:  DNWDNLTN
         N+DNLT+
Subjt:  DNWDNLTN

AT1G42480.3 unknown protein | 1.2e-42 | 50
Query:  LLIYSAVV-------DCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRV-SELRVVELLDGLCEKMQDYTIEKTGSAGQQWIK
        L++++AV+         +DDKCAACNAVAEE+E  L     +    +   + +KG+   ++   RV S+LRVV+LLDGLC++MQDYT++K     +QW+K
Subjt:  LLIYSAVV-------DCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRV-SELRVVELLDGLCEKMQDYTIEKTGSAGQQWIK

Query:  VDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
        V N+DNLTNKQEA+A++ DISTYCGRLLEETED+L E+IK GS+  G+V KVLC  LS HC ++S   ++D++D+   EL
Subjt:  VDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL


Sequences
CDS sequence
ATGACGATCAAATTCAACGCCTGGGTTCTATTGTTATTGCTGATCTACTCCGCCGTCGTCGATTGCATTGATGACAAATGCGCCGCTTGCAATGCTGTTGCTGAGGAGAT
AGAACATGGACTTTCCAATGAAAAACCGAGGAATCATTTAGATATGAGACATCGGTTGGATTCTAAAGGTCAGCGTAAGGGGAAGGTGATTGATTACAGGGTTAGTGAGC
TAAGAGTTGTCGAACTCCTGGATGGGCTTTGTGAAAAGATGCAAGATTACACTATTGAGAAGACAGGTTCAGCTGGACAACAGTGGATCAAGGTGGATAACTGGGACAAC
TTGACAAATAAACAAGAAGCTCGGGCTTATTCTAAAGATATATCAACGTATTGTGGGAGATTATTAGAGGAAACAGAAGATGATTTGGCAGAGTTGATTAAGAAAGGATC
TGTCAGCCCAGGTGACGTTAGCAAAGTCCTATGCCATGATTTGAGCAAGCATTGCAGCAAGGCGAGCGTTCAGCTGAATGACGATGACGACGATGAATCAGATGGAGAAC
TATGA
mRNA sequence
ATGACGATCAAATTCAACGCCTGGGTTCTATTGTTATTGCTGATCTACTCCGCCGTCGTCGATTGCATTGATGACAAATGCGCCGCTTGCAATGCTGTTGCTGAGGAGAT
AGAACATGGACTTTCCAATGAAAAACCGAGGAATCATTTAGATATGAGACATCGGTTGGATTCTAAAGGTCAGCGTAAGGGGAAGGTGATTGATTACAGGGTTAGTGAGC
TAAGAGTTGTCGAACTCCTGGATGGGCTTTGTGAAAAGATGCAAGATTACACTATTGAGAAGACAGGTTCAGCTGGACAACAGTGGATCAAGGTGGATAACTGGGACAAC
TTGACAAATAAACAAGAAGCTCGGGCTTATTCTAAAGATATATCAACGTATTGTGGGAGATTATTAGAGGAAACAGAAGATGATTTGGCAGAGTTGATTAAGAAAGGATC
TGTCAGCCCAGGTGACGTTAGCAAAGTCCTATGCCATGATTTGAGCAAGCATTGCAGCAAGGCGAGCGTTCAGCTGAATGACGATGACGACGATGAATCAGATGGAGAAC
TATGA
Protein sequence
MTIKFNAWVLLLLLIYSAVVDCIDDKCAACNAVAEEIEHGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTGSAGQQWIKVDNWDN
LTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVSPGDVSKVLCHDLSKHCSKASVQLNDDDDDESDGEL
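
The relationship between the CDS and the protein sequence can be checked by translating the CDS codon by codon. This is a minimal sketch that assumes the standard genetic code; the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGACGATCAAATTCAACGCCTGG"))
print(translate("GGAGAACTATGA"))
```

The two fragments translate to the start (`MTIKFNAW`) and end (`GEL`) of the protein sequence shown above; passing the full CDS, with line breaks removed, would be expected to reproduce the whole protein.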