CuGenDBv2

Sed0013089 (gene) of Chayote v1 genome

Gene ID: Sed0013089
Organism: Sechium edule (Chayote v1)
Description: Transmembrane protein
Genome location: LG11:5565239..5566484
RNA-Seq Expression: Sed0013089
Synteny: Sed0013089
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
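
The genome location field can be split into its parts for downstream use. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'LG11:5565239..5566484' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"Unrecognized location format: {location!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)


name, start, end = parse_location("LG11:5565239..5566484")
# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span = end - start + 1
print(name, start, end, span)
```

Under that coordinate assumption the gene spans 1246 bases on LG11.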


Homology
GenBank top hits (e value, % identity, alignment)
KAA0055589.1 hypothetical protein E6C27_scaffold222G00710 [Cucumis melo var. makuwa]; e value: 1.5e-80; % identity: 64.31
Query:  MSEREDSHDHHH-HHHLQAHEKPA-NLFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRS
        MSEREDS  HHH HHHL    + +   FFKIL +SLQI  +NKRHFL IFLLL+LPLS L+FTLSL SHPLKSHILHLES+LRHSPTRFEFRHVFSESR+
Subjt:  MSEREDSHDHHH-HHHLQAHEKPA-NLFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRS

Query:  DAFTLLRLRAAFSVPIAVFSLLAAVSAVSAAL-----------NRHSPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIY
        DAF+LLRLRAAF +PI  FSL  AVS VS+ L           +  S  K+++ RPL+T++  Y  L+A+S++PNTLAS+SPSP LRF +LVF  + E+Y
Subjt:  DAFTLLRLRAAFSVPIAVFSLLAAVSAVSAAL-----------NRHSPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIY

Query:  LISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK
        LISI+ L +VVSIAE+RFGFDAIR AA L+ADRR+    LTA+F+  S+ IS+ MEG MDGVDHWMRSTAAVT +V V VGDK
Subjt:  LISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK

KAE8652471.1 hypothetical protein Csa_014189 [Cucumis sativus]; e value: 2.2e-79; % identity: 62.59
Query:  MSERED--------SHDHHHHHHLQAHEKPAN-----LFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRF
        MSERED        SH HH HHHL     P+       FFKIL +SLQI  +NKRHFL IFLLL+LPLS L+FTLSL SHPLKSHILHLES+LRHSPTRF
Subjt:  MSERED--------SHDHHHHHHLQAHEKPAN-----LFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRF

Query:  EFRHVFSESRSDAFTLLRLRAAFSVPIAVFSLLAAVSAVSAAL-----------NRHSPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFA
        EFRHVFSESR+DAF+LLRLRAAF +PI  FSL  AVS VS+ L           +  S  K+++ RPL+T++  YA L+A+S++PNTLAS+SPSP  RF 
Subjt:  EFRHVFSESRSDAFTLLRLRAAFSVPIAVFSLLAAVSAVSAAL-----------NRHSPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFA

Query:  ILVFSALLEIYLISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK
        +LVF  + E+YLISI  L +VVSIAE+RFGFDAIR AAGL+ADRR+    LTA+F+  S+ IS+ MEG MDGVDHWMRSTAAVT +V V VGDK
Subjt:  ILVFSALLEIYLISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK

KAG6598865.1 hypothetical protein SDJN03_08643, partial [Cucurbita argyrosperma subsp. sororia]; e value: 9.7e-72; % identity: 60.56
Query:  MSEREDSHDHHHHHHLQAHEKPANLF---FKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESR
        MSER+DS  H        HE     F   FKIL NSLQI  +NKR FL IF    LPLS L+F LSL SHPLKSHI+HLES+LRHSPTRFEFRHVFSESR
Subjt:  MSEREDSHDHHHHHHLQAHEKPANLF---FKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESR

Query:  SDAFTLLRLRAAFSVPIAVFSLLAAVSAVS-AALNRHSP----------LKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEI
         DAF+LLRLRAAF +PI VFSLL A S VS   L+ H+           LK+++ RPL+T++  YA L+A+S++PNTLAS+S SP +RFAILV   + E+
Subjt:  SDAFTLLRLRAAFSVPIAVFSLLAAVSAVS-AALNRHSP----------LKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEI

Query:  YLISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK
        YLI+IL L +VVSIAE+RFGFDAIR AA L+ADRR+    LTA+F+  S+ IS  MEG MDGVDHWMR+TAAVT++V +GV DK
Subjt:  YLISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK

XP_022997441.1 uncharacterized protein LOC111492360 [Cucurbita maxima]; e value: 3.7e-71; % identity: 60.14
Query:  MSEREDSHDHHHHHHLQAHEKPANLF---FKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESR
        MSER+DS  H        HE     F   FKIL NSLQI  +NKR FL IF    LPLS L+F LSL SHPLKSHILHLES+LRHSPTRFEFRHVFSESR
Subjt:  MSEREDSHDHHHHHHLQAHEKPANLF---FKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESR

Query:  SDAFTLLRLRAAFSVPIAVFSLLAAVSAVSAALNRHS-------------PLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALL
         DAF+LLRLRAAF +PI  FSLL A S VS     HS              LK+++ RPL+T++  YA L+A+S++PNTLAS+S SP LRFAILV   + 
Subjt:  SDAFTLLRLRAAFSVPIAVFSLLAAVSAVSAALNRHS-------------PLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALL

Query:  EIYLISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK
        E+YLI++L L +VVSIAE+RFGFDAIR AA L+ADRR+    LTA+F+  S+ IS  MEG MDGVDHWMR+TAAVT++V +GV DK
Subjt:  EIYLISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK

XP_038889309.1 uncharacterized protein LOC120079221 [Benincasa hispida]; e value: 2.8e-79; % identity: 64.54
Query:  MSEREDSHDHHHHHHLQAHEKPA-NLFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSD
        MSEREDS     HH L   E      FFKIL +SLQI  +NKRHF  +FL L+LPLS L+FTLSL SHPLKSHILHLES+LRHSPTRFEFRHVFSESR+D
Subjt:  MSEREDSHDHHHHHHLQAHEKPA-NLFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSD

Query:  AFTLLRLRAAFSVPIAVFSLLAAVSAVSAAL----NRHSPLKS-------TFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIYL
        AF+LLRLRAAF  PI   SL  A++ VS+ L    ++   LKS       T+ RPL+T++  YA L+A+S+LPNTLAS+SPS PLRF +LVF  + E+YL
Subjt:  AFTLLRLRAAFSVPIAVFSLLAAVSAVSAAL----NRHSPLKS-------TFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIYL

Query:  ISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK
        ISIL L++VVSIAEDRFGFDAIRVAAGL+ADR++C   LTA+F+  S+ IS+ MEG MDGVDHWMRSTAAVT++VV+GVGDK
Subjt:  ISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK

TrEMBL top hits (e value, % identity, alignment)
A0A251QV24 Uncharacterized protein; e value: 1.6e-43; % identity: 44.84
Query:  MSEREDSHDHHHHHHLQAHEKPANLFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSDA
        MSER+    H             +   K L +SL+I  +NK  F+ IF L TLPLS L+F+LSL SHPLKSHI HLESL R +PTRFE R V+ ESR DA
Subjt:  MSEREDSHDHHHHHHLQAHEKPANLFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSDA

Query:  FTLLRLRAAFSVPIAVFSLLAAVSAVSAALNRH-----------SPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIYLI
         +LLR++A F +P    SLLA+V+AV+A  +             + +K T+ RPL+TS+  YA  +A++++P TL+ +  S   RF ILV  + LEIYL+
Subjt:  FTLLRLRAAFSVPIAVFSLLAAVSAVSAALNRH-----------SPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIYLI

Query:  SILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK
        ++LGL +V SI E+RFG+DAIRV   L+A +R+C W+L+ +FV ++  ++  +E  MDG D    S+ A    VVVG+ DK
Subjt:  SILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK

A0A2P6Q3G0 Uncharacterized protein; e value: 3.2e-44; % identity: 45.63
Query:  PANLFFK-ILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSDAFTLLRLRAAFSVPIAVFSLL
        P  L  K +L +S++I  +NK+ FL IF L TLPLS L+F+LSL SHPLKSH+ HLESL   +PTRFE R V+ ESR DA +L R++  + +P  +FSL 
Subjt:  PANLFFK-ILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSDAFTLLRLRAAFSVPIAVFSLL

Query:  AAVSAVSA---ALNRHSP---------LKSTFPRPLLTSLTSYAALLAFSLLPNTL-ASLSPSPPLRFAILVFSALLEIYLISILGLAIVVSIAEDRFGF
        A+V+AV+A   A +  SP         +KS++ RP  TS+  YA L+A++L+P TL A++  S   RF I +  + LEIYL++++GL +VVSI E+R+G+
Subjt:  AAVSAVSA---ALNRHSP---------LKSTFPRPLLTSLTSYAALLAFSLLPNTL-ASLSPSPPLRFAILVFSALLEIYLISILGLAIVVSIAEDRFGF

Query:  DAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK
        +AIRV +GL+A +R+C W L+  FV V+  I+  +E  MDG D   +S+      VVVG+ +K
Subjt:  DAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK

A0A5A7UQ45 Uncharacterized protein; e value: 7.2e-81; % identity: 64.31
Query:  MSEREDSHDHHH-HHHLQAHEKPA-NLFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRS
        MSEREDS  HHH HHHL    + +   FFKIL +SLQI  +NKRHFL IFLLL+LPLS L+FTLSL SHPLKSHILHLES+LRHSPTRFEFRHVFSESR+
Subjt:  MSEREDSHDHHH-HHHLQAHEKPA-NLFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRS

Query:  DAFTLLRLRAAFSVPIAVFSLLAAVSAVSAAL-----------NRHSPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIY
        DAF+LLRLRAAF +PI  FSL  AVS VS+ L           +  S  K+++ RPL+T++  Y  L+A+S++PNTLAS+SPSP LRF +LVF  + E+Y
Subjt:  DAFTLLRLRAAFSVPIAVFSLLAAVSAVSAAL-----------NRHSPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIY

Query:  LISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK
        LISI+ L +VVSIAE+RFGFDAIR AA L+ADRR+    LTA+F+  S+ IS+ MEG MDGVDHWMRSTAAVT +V V VGDK
Subjt:  LISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK

A0A6J1K518 uncharacterized protein LOC111492360; e value: 1.8e-71; % identity: 60.14
Query:  MSEREDSHDHHHHHHLQAHEKPANLF---FKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESR
        MSER+DS  H        HE     F   FKIL NSLQI  +NKR FL IF    LPLS L+F LSL SHPLKSHILHLES+LRHSPTRFEFRHVFSESR
Subjt:  MSEREDSHDHHHHHHLQAHEKPANLF---FKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESR

Query:  SDAFTLLRLRAAFSVPIAVFSLLAAVSAVSAALNRHS-------------PLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALL
         DAF+LLRLRAAF +PI  FSLL A S VS     HS              LK+++ RPL+T++  YA L+A+S++PNTLAS+S SP LRFAILV   + 
Subjt:  SDAFTLLRLRAAFSVPIAVFSLLAAVSAVSAALNRHS-------------PLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALL

Query:  EIYLISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK
        E+YLI++L L +VVSIAE+RFGFDAIR AA L+ADRR+    LTA+F+  S+ IS  MEG MDGVDHWMR+TAAVT++V +GV DK
Subjt:  EIYLISILGLAIVVSIAEDRFGFDAIRVAAGLVADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDK

W9RKJ7 Uncharacterized protein; e value: 6.8e-47; % identity: 48.65
Query:  KILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSDAFTLLRLRAAFSVPIAVFSLLAAVSAVS
        KIL +SL+I  +NKR FL IF L TLPLS L+F+LSL SH L+SH+L LESL   SPTRFE R V+ ESR DA ++L  +A F +P    SLLAAVSAV+
Subjt:  KILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSDAFTLLRLRAAFSVPIAVFSLLAAVSAVS

Query:  A---ALNRHSP--------LKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIYLISILGLAIVVSIAEDRFGFDAIRVAAGL
        +   A+N   P        +KST+ RPL TS+  YA  LA+  +P TL++  PSP LR  IL   +++E+YL++++ LA+VVSIAE+RFG DAIRV +GL
Subjt:  A---ALNRHSP--------LKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIYLISILGLAIVVSIAEDRFGFDAIRVAAGL

Query:  VADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTA-----AVTASVVVGVGDK
        +  RR+C W L+ + V+ + +IS  +E AMDG D   ++T+     A    V  GV D+
Subjt:  VADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTA-----AVTASVVVGVGDK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G16850.1 unknown protein; e value: 6.6e-26; % identity: 34.52
Query:  LQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSDAFTLLRLRAAFSVPIAVFSLLAAVSAVSAA
        L +S  IL +NK   L IF  + +PL+AL  +L+L S  LK+H+  LE+L     TRFE R ++ ESR DA +LL L+  + VP  + S +A+++ +++ 
Subjt:  LQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSDAFTLLRLRAAFSVPIAVFSLLAAVSAVSAA

Query:  LNRH-----------SPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASL-SPSPPLRFAILVFSALLEIYLISILGLAIVVSIAEDRFGFDAIRVAAGLV
           H           + +KS++ R   TS+  Y  L  +S +   L++L   +P LR+ I +    +E+Y+++I GL  VVS+ E+R+GFDAI+    L+
Subjt:  LNRH-----------SPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASL-SPSPPLRFAILVFSALLEIYLISILGLAIVVSIAEDRFGFDAIRVAAGLV

Query:  ADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGD
          RRI   +L  VFV +S++I   ME     +D  + S +   + VV G  D
Subjt:  ADRRICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGD


Sequences
CDS sequence
ATGTCTGAGAGAGAAGACTCTCACGATCATCATCATCATCATCATCTTCAAGCTCATGAAAAACCTGCAAATCTGTTCTTCAAAATCCTCCAAAACTCCCTCCAAATCCT
CTGCAAGAACAAGCGCCATTTCCTCCAAATCTTCCTCCTCTTAACCCTCCCTTTATCCGCCCTGATTTTCACGCTTTCCCTTTGTTCCCACCCCCTCAAATCCCACATCC
TCCATCTCGAATCCCTCCTCCGCCACTCCCCCACCCGATTCGAGTTCCGCCACGTCTTCTCCGAGTCCCGGAGCGACGCTTTCACCCTCCTCCGCCTCCGCGCCGCCTTC
TCCGTTCCGATCGCTGTGTTCTCCCTCCTCGCCGCCGTCTCCGCCGTCTCCGCCGCCCTCAACCGTCACTCCCCCCTCAAATCCACCTTCCCCCGCCCTCTCCTCACCTC
CTTGACCTCCTACGCCGCCCTCCTCGCCTTCTCCCTCCTCCCAAACACCCTCGCCTCTCTCTCCCCCTCTCCGCCTCTCCGATTCGCGATCCTCGTATTCTCCGCTCTCC
TCGAGATCTACCTCATCTCGATCCTCGGCCTTGCGATCGTCGTCTCCATCGCCGAGGATCGATTCGGCTTCGATGCCATTCGTGTTGCGGCGGGGCTGGTTGCGGACCGG
CGGATCTGCGCGTGGAGTTTGACGGCGGTGTTTGTGGCGGTGTCGACGTGGATCTCGGCGGCAATGGAGGGGGCTATGGACGGCGTTGATCACTGGATGCGGTCCACGGC
GGCGGTGACGGCGAGTGTGGTCGTCGGCGTTGGCGATAAGGAAGAGGGATTTTCTTAG
mRNA sequence
AAAAATTGAAGAACGAAAAAGAGATAAGAGAAAAAGAAAGAAGGAAAATCGGAGAAAATGTCTGAGAGAGAAGACTCTCACGATCATCATCATCATCATCATCTTCAAGC
TCATGAAAAACCTGCAAATCTGTTCTTCAAAATCCTCCAAAACTCCCTCCAAATCCTCTGCAAGAACAAGCGCCATTTCCTCCAAATCTTCCTCCTCTTAACCCTCCCTT
TATCCGCCCTGATTTTCACGCTTTCCCTTTGTTCCCACCCCCTCAAATCCCACATCCTCCATCTCGAATCCCTCCTCCGCCACTCCCCCACCCGATTCGAGTTCCGCCAC
GTCTTCTCCGAGTCCCGGAGCGACGCTTTCACCCTCCTCCGCCTCCGCGCCGCCTTCTCCGTTCCGATCGCTGTGTTCTCCCTCCTCGCCGCCGTCTCCGCCGTCTCCGC
CGCCCTCAACCGTCACTCCCCCCTCAAATCCACCTTCCCCCGCCCTCTCCTCACCTCCTTGACCTCCTACGCCGCCCTCCTCGCCTTCTCCCTCCTCCCAAACACCCTCG
CCTCTCTCTCCCCCTCTCCGCCTCTCCGATTCGCGATCCTCGTATTCTCCGCTCTCCTCGAGATCTACCTCATCTCGATCCTCGGCCTTGCGATCGTCGTCTCCATCGCC
GAGGATCGATTCGGCTTCGATGCCATTCGTGTTGCGGCGGGGCTGGTTGCGGACCGGCGGATCTGCGCGTGGAGTTTGACGGCGGTGTTTGTGGCGGTGTCGACGTGGAT
CTCGGCGGCAATGGAGGGGGCTATGGACGGCGTTGATCACTGGATGCGGTCCACGGCGGCGGTGACGGCGAGTGTGGTCGTCGGCGTTGGCGATAAGGAAGAGGGATTTT
CTTAGGGTTGTTGATGGTGAGGAGGATCGTGATCATATCCTCATGGTTTGATTTTTTTACTGTTTTTCTCAATACTGTTTCCACTGGTGATTGATTCCAATTGTTCATCA
TATTTTTTGTTTTTCTATGTGCATAATTTTTTTTTTCTTTTTTCTGGGGATGGTTGAAGATATGTGGATTTGAATTCAGATTATTGAACACACTCATATGATTGAGCATG
ACTTTAGTTGAAGGTTATATGTTATATTTGTTGTAATTTTTTCTATGAAGATGACAAGTATGTATTACA
Protein sequence
MSEREDSHDHHHHHHLQAHEKPANLFFKILQNSLQILCKNKRHFLQIFLLLTLPLSALIFTLSLCSHPLKSHILHLESLLRHSPTRFEFRHVFSESRSDAFTLLRLRAAF
SVPIAVFSLLAAVSAVSAALNRHSPLKSTFPRPLLTSLTSYAALLAFSLLPNTLASLSPSPPLRFAILVFSALLEIYLISILGLAIVVSIAEDRFGFDAIRVAAGLVADR
RICAWSLTAVFVAVSTWISAAMEGAMDGVDHWMRSTAAVTASVVVGVGDKEEGFS
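
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The sketch below is an illustration only, assuming the standard genetic code; the function name `translate` and the variables `cds_start` and `cds_end` are hypothetical, and the two test fragments are the first 30 and last 18 bases of the CDS as printed in this record.

```python
# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGTCTGAGAGAGAAGACTCTCACGATCAT"  # first 30 bases of the CDS
cds_end = "GAAGAGGGATTTTCTTAG"                # last 18 bases, including the stop
print(translate(cds_start))  # MSEREDSHDH
print(translate(cds_end))    # EEGFS
```

Passing the full CDS (with its line breaks) to `translate` gives a string that can be compared directly with the protein sequence in this record.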