; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G05000 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G05000
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionSASA domain-containing protein
Genome locationChr6:4565767..4566761
RNA-Seq ExpressionCSPI06G05000
SyntenyCSPI06G05000
Gene Ontology termsNA
InterPro domainsIPR005181 - Sialate O-acetylesterase domain
IPR036514 - SGNH hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8637379.1 hypothetical protein CSA_004472 [Cucumis sativus]2.1e-11598.53Show/hide
Query:  MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDCSRISEWIKGTGRYTSLIRR
        MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNG GPGMAFADHLLAKASENLDCSRISEWIKGTGRYTSLIRR
Subjt:  MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDCSRISEWIKGTGRYTSLIRR

Query:  INASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQ
        INASLESGGRLQGFVWFQGESDAALEVESQVYH+NLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFI+YVEDVRKAEEAVDHELLDVTTVDAKKAVQ
Subjt:  INASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQ

Query:  HVLN
        HVLN
Subjt:  HVLN

KAE8646674.1 hypothetical protein Csa_005158 [Cucumis sativus]1.3e-13698.74Show/hide
Query:  MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDCSRISEWIKGTGRYTSLIRR
        MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNG GPGMAFADHLLAKASENLDCSRISEWIKGTGRYTSLIRR
Subjt:  MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDCSRISEWIKGTGRYTSLIRR

Query:  INASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQ
        INASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFI+YVEDVRKAEEAVDHELLDVTTVDAKKAVQ
Subjt:  INASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQ

Query:  HVLNLGQEPYNHDGHLSVHTEVKIGIMLAKAYLKFSLK
        HVLNLGQEPYNHDGHLSVHTEVKIGIMLAKAY+KFSLK
Subjt:  HVLNLGQEPYNHDGHLSVHTEVKIGIMLAKAYLKFSLK

KAG6578894.1 putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. sororia]3.9e-5349.56Show/hide
Query:  VWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDC----------SRISEWIKGTGRYTSLIRRINASLES
        VWD +IPP S P  S  RF  +  WEQ REPLHWDID  KTNG GPGM FA+ LLAKA  ++            S + EW+KGT RYT L+ R+  S E 
Subjt:  VWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDC----------SRISEWIKGTGRYTSLIRRINASLES

Query:  GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQ
        GG+++GF W+QGESDAA+E E++ Y + L  FF DLR D+N P LPI+LVKI  HD  ISP   + E+V  A+EAV  +L +V  VD + AV +      
Subjt:  GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQ

Query:  EPYNHD-GHLSVHTEVKIGIMLAKAY
        E  N D GHL+V +EV +G M A +Y
Subjt:  EPYNHD-GHLSVHTEVKIGIMLAKAY

XP_022134349.1 probable carbohydrate esterase At4g34215 [Momordica charantia]1.9e-6049.8Show/hide
Query:  MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDC----------SRISEWIKG
        M   GGV       + C WD +IP  S  + S +RF     WE   EPLHWDID  KTNG GPGMAFA+ LLAKA++++            + + EW+KG
Subjt:  MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDC----------SRISEWIKG

Query:  TGRYTSLIRRINASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDV
        T  YT L+ RINAS   GG++Q F WFQGESDA++ V+++ Y QNL  F  DLR DLN+P LPI+LVKI  +D  ISP +NY + +R+A+EAV H+L  +
Subjt:  TGRYTSLIRRINASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDV

Query:  TTVDAKKAVQHVLNLGQEPYNHD-GHLSVHTEVKIGIMLAKAYLK
        +TVDAKKA+Q V++  +   N D GHLSV++EV++G MLA AYL+
Subjt:  TTVDAKKAVQHVLNLGQEPYNHD-GHLSVHTEVKIGIMLAKAYLK

XP_022141846.1 probable carbohydrate esterase At4g34215 [Momordica charantia]7.9e-5448.89Show/hide
Query:  VWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASEN----------LDCSRISEWIKGTGRYTSLIRRINASLES
        VWD  +PP   P  S LRF+ N  WE+  EPLHWDID  KTNG GPGM FA  +LAKA             +  + + EW+KGT  YT L+ RI AS   
Subjt:  VWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASEN----------LDCSRISEWIKGTGRYTSLIRRINASLES

Query:  GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQ
        GG++QG +W+QGESDAA+E ES+ Y  NL  F+ DLR D N P LPI+LVKI  HD  ISP IN+++DV KA+E +  +L++V  VD K+AV    N   
Subjt:  GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQ

Query:  EPYNHDGHLSVHTEVKIGIMLAKAY
              GHLS  +EVK+G MLA ++
Subjt:  EPYNHDGHLSVHTEVKIGIMLAKAY

TrEMBL top hitse value%identityAlignment
A0A0A0KCN1 SASA domain-containing protein2.1e-9298.85Show/hide
Query:  MAFADHLLAKASENLDCSRISEWIKGTGRYTSLIRRINASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDL
        MAFADHLLAKASENLDCSRISEWIKGTGRYTSLIRRINASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDL
Subjt:  MAFADHLLAKASENLDCSRISEWIKGTGRYTSLIRRINASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDL

Query:  NISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQEPYNHDGHLSVHTEVKIGIMLAKAYLKFSLK
        NISPFI+YVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQEPYNHDGHLSVHTEVKIGIMLAKAY+KFSLK
Subjt:  NISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQEPYNHDGHLSVHTEVKIGIMLAKAYLKFSLK

A0A6J1BYJ2 probable carbohydrate esterase At4g342159.4e-6149.8Show/hide
Query:  MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDC----------SRISEWIKG
        M   GGV       + C WD +IP  S  + S +RF     WE   EPLHWDID  KTNG GPGMAFA+ LLAKA++++            + + EW+KG
Subjt:  MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDC----------SRISEWIKG

Query:  TGRYTSLIRRINASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDV
        T  YT L+ RINAS   GG++Q F WFQGESDA++ V+++ Y QNL  F  DLR DLN+P LPI+LVKI  +D  ISP +NY + +R+A+EAV H+L  +
Subjt:  TGRYTSLIRRINASLESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDV

Query:  TTVDAKKAVQHVLNLGQEPYNHD-GHLSVHTEVKIGIMLAKAYLK
        +TVDAKKA+Q V++  +   N D GHLSV++EV++G MLA AYL+
Subjt:  TTVDAKKAVQHVLNLGQEPYNHD-GHLSVHTEVKIGIMLAKAYLK

A0A6J1CKF9 probable carbohydrate esterase At4g342153.8e-5448.89Show/hide
Query:  VWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASEN----------LDCSRISEWIKGTGRYTSLIRRINASLES
        VWD  +PP   P  S LRF+ N  WE+  EPLHWDID  KTNG GPGM FA  +LAKA             +  + + EW+KGT  YT L+ RI AS   
Subjt:  VWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASEN----------LDCSRISEWIKGTGRYTSLIRRINASLES

Query:  GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQ
        GG++QG +W+QGESDAA+E ES+ Y  NL  F+ DLR D N P LPI+LVKI  HD  ISP IN+++DV KA+E +  +L++V  VD K+AV    N   
Subjt:  GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQ

Query:  EPYNHDGHLSVHTEVKIGIMLAKAY
              GHLS  +EVK+G MLA ++
Subjt:  EPYNHDGHLSVHTEVKIGIMLAKAY

A0A6J1FFF9 probable carbohydrate esterase At4g342152.5e-5349.12Show/hide
Query:  VWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDC----------SRISEWIKGTGRYTSLIRRINASLES
        VWD +IPP S P  S  RF  +  WEQ REPLHWDID  KTNG GPGM FA+ LLAKA  ++            S + EW+KGT RYT L+ R+  S E 
Subjt:  VWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDC----------SRISEWIKGTGRYTSLIRRINASLES

Query:  GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQ
        GG+++GF W+QGESDAA+E E++ Y + L  FF DLR D+N P LPI+LVKI  HD  ISP   + E+V  A+EAV  +L ++  VD + AV +      
Subjt:  GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQ

Query:  EPYNHD-GHLSVHTEVKIGIMLAKAY
        E  N D GHL+V +EV +G M A +Y
Subjt:  EPYNHD-GHLSVHTEVKIGIMLAKAY

A0A6J1K1G7 probable carbohydrate esterase At4g342157.2e-5349.12Show/hide
Query:  VWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDC----------SRISEWIKGTGRYTSLIRRINASLES
        VWD +IPP S P  S  RF  +  WEQ REPLHWDID  KTNG GPGM FA+ LLAKA  ++            S + EW+KGT RYT L+ R+  S E 
Subjt:  VWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDC----------SRISEWIKGTGRYTSLIRRINASLES

Query:  GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQ
        GG+++GF W+QGESDAA+E E++ Y + L  FF DLR D+N P LPI+LVKI  HD  ISP   + ++V  A+EAV  +L +V  VD + AV +      
Subjt:  GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQ

Query:  EPYNHD-GHLSVHTEVKIGIMLAKAY
        E  N D GHL+V +EV +G M A +Y
Subjt:  EPYNHD-GHLSVHTEVKIGIMLAKAY

SwissProt top hitse value%identityAlignment
Q8L9J9 Probable carbohydrate esterase At4g342154.4e-3135.37Show/hide
Query:  NICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASEN------LDC----SRISEWIKGTGRYTSLIRRINAS
        N  VWDK +PP   P  S LR + +  WE+  EPLH DID  K  G GPGMAFA+ +  +   +      + C    + I EW +G+  Y  +++R   S
Subjt:  NICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASEN------LDC----SRISEWIKGTGRYTSLIRRINAS

Query:  LESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLN
         + GG ++  +W+QGESD     +++ Y  N+    K+LR DLN P+LPI+ V IA+          Y++ VR+A+  +  +L +V  VDAK       N
Subjt:  LESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLN

Query:  LGQEPYNHDGHLSVHTEVKIGIMLAKAYL
        L         HL+   +V++G+ LA+AYL
Subjt:  LGQEPYNHDGHLSVHTEVKIGIMLAKAYL

Arabidopsis top hitse value%identityAlignment
AT3G53010.1 Domain of unknown function (DUF303)1.4e-3536.78Show/hide
Query:  MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASE------NLDCSRISEWIKGTGRY
        M   GGV Y     N  VWD  IPP     PS LR      W++ +EPLH DID  KTNG GPGM FA+ ++ +  +      ++  +++S+W KG   Y
Subjt:  MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASE------NLDCSRISEWIKGTGRY

Query:  TSLIRRINASLES--GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTT
           ++R  A++ S  GG  +  +W+QGESD    V++ VY + L+ FF DLR+DL  P LPI+ V +A       P   Y++ VRKA+   D E  +V  
Subjt:  TSLIRRINASLES--GGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTT

Query:  VDAKKAVQHVLNLGQEPYNHDG-HLSVHTEVKIGIMLAKAYL
        VDA+            P   DG HL+  ++V++G M+A+++L
Subjt:  VDAKKAVQHVLNLGQEPYNHDG-HLSVHTEVKIGIMLAKAYL

AT4G34215.1 Domain of unknown function (DUF303)3.1e-3235.37Show/hide
Query:  NICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASEN------LDC----SRISEWIKGTGRYTSLIRRINAS
        N  VWDK +PP   P  S LR + +  WE+  EPLH DID  K  G GPGMAFA+ +  +   +      + C    + I EW +G+  Y  +++R   S
Subjt:  NICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASEN------LDC----SRISEWIKGTGRYTSLIRRINAS

Query:  LESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLN
         + GG ++  +W+QGESD     +++ Y  N+    K+LR DLN P+LPI+ V IA+          Y++ VR+A+  +  +L +V  VDAK       N
Subjt:  LESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLN

Query:  LGQEPYNHDGHLSVHTEVKIGIMLAKAYL
        L         HL+   +V++G+ LA+AYL
Subjt:  LGQEPYNHDGHLSVHTEVKIGIMLAKAYL

AT4G34215.2 Domain of unknown function (DUF303)3.1e-3235.37Show/hide
Query:  NICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASEN------LDC----SRISEWIKGTGRYTSLIRRINAS
        N  VWDK +PP   P  S LR + +  WE+  EPLH DID  K  G GPGMAFA+ +  +   +      + C    + I EW +G+  Y  +++R   S
Subjt:  NICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASEN------LDC----SRISEWIKGTGRYTSLIRRINAS

Query:  LESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLN
         + GG ++  +W+QGESD     +++ Y  N+    K+LR DLN P+LPI+ V IA+          Y++ VR+A+  +  +L +V  VDAK       N
Subjt:  LESGGRLQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLN

Query:  LGQEPYNHDGHLSVHTEVKIGIMLAKAYL
        L         HL+   +V++G+ LA+AYL
Subjt:  LGQEPYNHDGHLSVHTEVKIGIMLAKAYL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGATCCTGGTGGAGTTAATTATCCACTCTTCTGTGCCAACATTTGTGTTTGGGATAAACATATTCCTCCAGGATCGATACCCCAACCCTCCACTCTTCGATTTGC
TTTAAACTACACCTGGGAGCAAGGTCGGGAGCCCCTTCATTGGGACATTGACCCTACCAAGACCAACGGTTTCGGTCCAGGAATGGCTTTTGCAGATCATCTCCTTGCAA
AAGCTAGCGAAAATCTAGATTGTTCTCGTATAAGTGAATGGATTAAAGGGACTGGTAGGTATACAAGCTTGATCCGACGGATCAATGCTTCTTTGGAGTCTGGTGGCCGC
CTACAAGGGTTTGTATGGTTTCAAGGAGAATCTGATGCTGCATTAGAGGTAGAATCACAAGTCTACCACCAAAATTTGATAAACTTTTTTAAGGATCTTCGTGACGACCT
GAACCAACCGACTTTGCCCATCCTACTGGTGAAGATAGCTAACCACGATCTCAACATTAGCCCATTTATAAATTATGTAGAGGACGTAAGAAAGGCGGAAGAGGCAGTCG
ATCATGAGTTGCTTGACGTAACAACTGTGGATGCTAAAAAAGCAGTCCAACATGTTCTGAATCTCGGCCAAGAACCTTATAATCACGATGGACATCTCAGTGTACACACT
GAGGTAAAAATAGGAATAATGCTGGCCAAGGCTTATCTAAAATTTAGTTTGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTGATCCTGGTGGAGTTAATTATCCACTCTTCTGTGCCAACATTTGTGTTTGGGATAAACATATTCCTCCAGGATCGATACCCCAACCCTCCACTCTTCGATTTGC
TTTAAACTACACCTGGGAGCAAGGTCGGGAGCCCCTTCATTGGGACATTGACCCTACCAAGACCAACGGTTTCGGTCCAGGAATGGCTTTTGCAGATCATCTCCTTGCAA
AAGCTAGCGAAAATCTAGATTGTTCTCGTATAAGTGAATGGATTAAAGGGACTGGTAGGTATACAAGCTTGATCCGACGGATCAATGCTTCTTTGGAGTCTGGTGGCCGC
CTACAAGGGTTTGTATGGTTTCAAGGAGAATCTGATGCTGCATTAGAGGTAGAATCACAAGTCTACCACCAAAATTTGATAAACTTTTTTAAGGATCTTCGTGACGACCT
GAACCAACCGACTTTGCCCATCCTACTGGTGAAGATAGCTAACCACGATCTCAACATTAGCCCATTTATAAATTATGTAGAGGACGTAAGAAAGGCGGAAGAGGCAGTCG
ATCATGAGTTGCTTGACGTAACAACTGTGGATGCTAAAAAAGCAGTCCAACATGTTCTGAATCTCGGCCAAGAACCTTATAATCACGATGGACATCTCAGTGTACACACT
GAGGTAAAAATAGGAATAATGCTGGCCAAGGCTTATCTAAAATTTAGTTTGAAGTAG
Protein sequenceShow/hide protein sequence
MGDPGGVNYPLFCANICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGFGPGMAFADHLLAKASENLDCSRISEWIKGTGRYTSLIRRINASLESGGR
LQGFVWFQGESDAALEVESQVYHQNLINFFKDLRDDLNQPTLPILLVKIANHDLNISPFINYVEDVRKAEEAVDHELLDVTTVDAKKAVQHVLNLGQEPYNHDGHLSVHT
EVKIGIMLAKAYLKFSLK