; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C002685 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C002685
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionEndoglucanase
Genome locationchr12:20696692..20700764
RNA-Seq ExpressionMELO3C002685
SyntenyMELO3C002685
Gene Ontology termsGO:0030245 - cellulose catabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008810 - cellulase activity (molecular function)
InterPro domainsIPR001701 - Glycoside hydrolase family 9
IPR008928 - Six-hairpin glycosidase superfamily
IPR012341 - Six-hairpin glycosidase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057069.1 endoglucanase 25 [Cucumis melo var. makuwa]3.0e-7195.89Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLF
        TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAV    +L+
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLF

KAE8653204.1 hypothetical protein Csa_019838 [Cucumis sativus]3.3e-7094.52Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        MMYPRP+STCDARASDLAGEIVAALSASSLVFREDTNYS ELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLF
        TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAV    +L+
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLF

XP_008446758.2 PREDICTED: endoglucanase 9-like [Cucumis melo]3.1e-92100Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLFEFQIKCKRSKTDLTYGLPHYIFHRYY
        TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLFEFQIKCKRSKTDLTYGLPHYIFHRYY
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLFEFQIKCKRSKTDLTYGLPHYIFHRYY

XP_022140170.1 endoglucanase 25-like [Momordica charantia]1.3e-6175.15Show/hide
Query:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS
        YPRPVS CD RASDLAGEIVAALSA+SLVF+ED NYSGELAKAAEKLF++VTKLDP EQGTY+ VDSCGGEAR FYNSSSY DELIWAGTWLF+ATGNTS
Subjt:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS

Query:  YLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVL--------FEFQIKCKRSKTDL
        YL+YATDAVRFQLAQS+E+SI RGIF+WNNKFSATAV    +L        +E  +    +KTD+
Subjt:  YLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVL--------FEFQIKCKRSKTDL

XP_031745535.1 endoglucanase 9-like [Cucumis sativus]3.3e-7094.52Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        MMYPRP+STCDARASDLAGEIVAALSASSLVFREDTNYS ELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLF
        TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAV    +L+
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLF

TrEMBL top hitse value%identityAlignment
A0A1S3BFV0 Cellulase1.5e-92100Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLFEFQIKCKRSKTDLTYGLPHYIFHRYY
        TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLFEFQIKCKRSKTDLTYGLPHYIFHRYY
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLFEFQIKCKRSKTDLTYGLPHYIFHRYY

A0A5A7UU46 Endoglucanase1.5e-7195.89Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLF
        TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAV    +L+
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLF

A0A5J5BY20 Cellulase1.3e-4365.47Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        M YPRPVS CD  ASDLAGEIVAALSA+SLVF+E+T+YS +LA+AAEKLF+  T++DP +QGTY+  D+CGG+AR+FYNSS Y DEL+W GTWLFFATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAV
         SYL  ATD +  + A+ EE    +GIF WNNK +A AV
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAV

A0A6J1CED2 Endoglucanase6.1e-6275.15Show/hide
Query:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS
        YPRPVS CD RASDLAGEIVAALSA+SLVF+ED NYSGELAKAAEKLF++VTKLDP EQGTY+ VDSCGGEAR FYNSSSY DELIWAGTWLF+ATGNTS
Subjt:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS

Query:  YLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVL--------FEFQIKCKRSKTDL
        YL+YATDAVRFQLAQS+E+SI RGIF+WNNKFSATAV    +L        +E  +    +KTD+
Subjt:  YLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVL--------FEFQIKCKRSKTDL

W9SKR8 Endoglucanase1.1e-4266.43Show/hide
Query:  MMYPRPVSTCDAR-ASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATG
        M YPRPVS+CD+  AS+LAGEIVAALSA+SLVF ED +YSG+L KAAEKLF+  +K DP +QGTY+S++ CG EAR FYNSS Y DEL+W GTWLFFATG
Subjt:  MMYPRPVSTCDAR-ASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATG

Query:  NTSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAV
        N S+L YAT    F +AQS E SI +GI  WNNK  A AV
Subjt:  NTSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAV

SwissProt top hitse value%identityAlignment
P0C1U4 Endoglucanase 95.1e-2142.34Show/hide
Query:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS
        YPRPV  C A  SDLA E+ A+L+A+S+VF+++  YS +L   A  LF    K     +G YS   + G +A KFYNS+SY DE +W G+W++ ATGN+S
Subjt:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS

Query:  YLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAV
        YL  AT     + A +       G+F+W+NK +   V
Subjt:  YLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAV

Q38890 Endoglucanase 251.5e-2039.26Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        M Y RPV+TC+   SDLA E+ AAL+++S+VF+++  YS +L   A+ ++Q         +G YS+  +   E+ KFYNSS Y DE IW G W+++ATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFS
         +YL+  T     + A +       G+F+W+NK +
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFS

Q7XUK4 Endoglucanase 127.6e-1735.34Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        M YPRPV T  + A DL GE+ AAL+A+S+VFR++  YS +L   A  +++          G  +           +YNS+SY DE +W+  W+++ATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNK
         +Y+++ATD    + A++  + +   +F+W+NK
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNK

Q84R49 Endoglucanase 101.2e-1940.58Show/hide
Query:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS
        YPRPV+ C +  SDLA E+ AAL+A+S+VF++   YS +L + A+ L+    K   +++G YS     G +   FYNS+SY DE +W G W++FATGN +
Subjt:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS

Query:  YLSYATDAVRFQLAQSEEA-SIGRGIFNWNNKFSATAV
        YLS AT     + A +    S   G+F W++K     V
Subjt:  YLSYATDAVRFQLAQSEEA-SIGRGIFNWNNKFSATAV

Q9STW8 Endoglucanase 211.4e-1840.46Show/hide
Query:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS
        Y R VS C +  SDLA E+ AAL+++S+VF+++  YS  L   A+ L++  T      +  YS     G E+ KFYNSS + DEL+W G WL++ATGN +
Subjt:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS

Query:  YLSYATDAVRFQLAQSEEASIGRGIFNWNNK
        YL   T     + A +   S   G+F+W+NK
Subjt:  YLSYATDAVRFQLAQSEEASIGRGIFNWNNK

Arabidopsis top hitse value%identityAlignment
AT1G65610.1 Six-hairpin glycosidases superfamily protein2.5e-1537.59Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        M Y RPV +    A+DL  E+ AAL+A+S+VF +  +Y+ +L K AE L+          +  YS        A+ FYNS+S  DE +WAG WL++ATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNK
         +Y+ +AT     Q A++        + +WNNK
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNK

AT1G75680.1 glycosyl hydrolase 9B75.6e-1536.96Show/hide
Query:  DARASDLAGEIVAALSASSLVFRE-DTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLSYATD
        D   +++A E  AA++++SLVF++ D  YS  L K A++LF         ++G+YS       E +KFYNS+ Y DEL+WA +WL+ AT + +YL Y ++
Subjt:  DARASDLAGEIVAALSASSLVFRE-DTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLSYATD

Query:  AVRFQLAQSEEASIGRGI-FNWNNKFSATAVTPINVLF
          +      E AS G    F+W+NK + T V    +LF
Subjt:  AVRFQLAQSEEASIGRGI-FNWNNKFSATAVTPINVLF

AT2G44570.1 glycosyl hydrolase 9B122.5e-1534.42Show/hide
Query:  MMYPRPVSTCDAR--ASDLAGEIVAALSASSLVFR-EDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFA
        M  PRP    D +   +DLAGE  AA++A+SL F   D+ Y+  L   A++LF+       +    ++S+ + GG    FY SS Y DEL+WA  WL  A
Subjt:  MMYPRPVSTCDAR--ASDLAGEIVAALSASSLVFR-EDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFA

Query:  TGNTSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLFEFQIK
        TG+ +YL +        L Q+  +   R +F W++KF    V    ++FE ++K
Subjt:  TGNTSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLFEFQIK

AT4G24260.1 glycosyl hydrolase 9A39.8e-2040.46Show/hide
Query:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS
        Y R VS C +  SDLA E+ AAL+++S+VF+++  YS  L   A+ L++  T      +  YS     G E+ KFYNSS + DEL+W G WL++ATGN +
Subjt:  YPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTS

Query:  YLSYATDAVRFQLAQSEEASIGRGIFNWNNK
        YL   T     + A +   S   G+F+W+NK
Subjt:  YLSYATDAVRFQLAQSEEASIGRGIFNWNNK

AT5G49720.1 glycosyl hydrolase 9A11.0e-2139.26Show/hide
Query:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN
        M Y RPV+TC+   SDLA E+ AAL+++S+VF+++  YS +L   A+ ++Q         +G YS+  +   E+ KFYNSS Y DE IW G W+++ATGN
Subjt:  MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGN

Query:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFS
         +YL+  T     + A +       G+F+W+NK +
Subjt:  TSYLSYATDAVRFQLAQSEEASIGRGIFNWNNKFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTACCCCAGACCTGTTTCAACATGTGATGCCCGGGCTTCAGATCTTGCTGGAGAAATCGTTGCAGCATTATCAGCTTCATCATTAGTATTCAGAGAAGAT
ACCAACTATTCAGGAGAATTAGCAAAAGCTGCAGAGAAATTGTTTCAGCAAGTGACTAAATTAGACCCAATTGAACAAGGAACTTACAGCTCAGTTGATTCTTGT
GGAGGAGAAGCAAGAAAGTTCTACAACTCATCAAGTTACACGGATGAATTGATATGGGCAGGAACTTGGCTGTTCTTTGCAACTGGAAACACTTCATATCTTTCA
TATGCCACTGATGCTGTCCGATTTCAGTTAGCACAAAGTGAGGAAGCAAGTATTGGCAGAGGAATTTTCAATTGGAACAATAAGTTCAGTGCAACTGCGGTAACT
CCAATCAATGTCTTATTTGAATTTCAGATAAAATGTAAAAGATCTAAGACAGATTTGACATATGGCTTACCTCACTACATTTTCCATAGGTATTATTGA
mRNA sequenceShow/hide mRNA sequence
CATACAATAAAAAATGATTTAAATTAAATAAAAATGTTTGAATTACAAAATCAATGGCTTAGAAGCAATTGATAGCTACAAAGCAAGAAGCAGATGGAGCTTGCT
GGCTTAGCTCCTTGAACTCTCATCTCCAATGGGCTACACCAGACAAGAAGAAATGTACAAGTACAAGACACCAGCAAGACATTGGAAGAACCGGAGGAACTTTCT
GAAGTGTATTTAGTATGTCCTAATAACATCATATGTAAGAATAAAAAAACCTATAAACAAACAGTCTGCAATTGTTTTTATCCTTTGTGTCTTTCAATGGAACTC
ATTGTTACTTCCACAGAAATTCATATCCAGTTGATTTCTGAGGTTTTCTTACCATTTCAAACTCGAATCGGCCATGTATTTCATAGACACAGTTTCATGGAAATC
AGTGACAGTTTAAGCTTCCAAAGTCCATCTTCAACGGAATTTTCTTCCACCCCTCCTATGATACAAGAACCTCTATCAACCATAACACCTTGATTCTTTCGAAGT
TCTCAAAACAGACGGTTTCTCTTCAAAGTTAATGATGCAACCAGAGAGATCTGTCCACACAGAACATTAAGCAGATCGTTTTCTTAGCATGACACGACATAACTT
CAATGTCTTTTCTAAATCTTCTACTGAATATAATTCAATTCCTTCTCCATACTCCAAGTCCTTTGACTTCAAGATCGTTATATCAAATCAAAGACGTTTCAAATG
CTGTTCTTACATTTCAGCTTTACTGCTACTGCTAATCATAGCACTTACACTTCTCCTTCAATTTTTGCCTCACAAACATAATCTTCATGAAGCATCAAACAATTA
CACAGTTGCAGTAAATCAAGCACTGAAGTTTTTTGATGCTCAAAAATGTAGAAATCCTTCTTAACTAAAAATGTTATGATGAACTTTTCATGGCCGACTGATAAA
AGCTTTTCTCAGCTGGTAGGTACCCGAAAAGTAGTCCAGTCAAGTTTCGGGGAGATTCAGGCTTGAAAGATGGGGTTTCGAGCAATAAACCAGATGGTCTCATCG
GTGGTTTCTATGATTCTGGAAACAACATGAAGTTCACTTTTCCTACAGCTTATACCATTACTCTTTTAAGCTGGAGTGTGATTGAGTATCATCCAAAGTATGCAG
ACATGAATGAGCTTGATCATGTAAAGGATATCATCAGATGGGGAACTGAATATTTGCTCAAAGTTTTTGTAGCCCCAAATGCAACTTCTGATCAAACCATAATAT
ATTCTCAGGTAAGTCATATAAGCTCCCGATCAAGATACTTTTAGTGCAATTACCTCAACATTGAATCCATGTCATCTTTATTACTTAACAGTTAACACTATCTCC
AATGTGTGATCCTAGGTTGGCAGTTCCAGTAATGAGAGTAAAGCTCAAACTAATGACAACTGCTGGCAAAGACCAGAAGACATGATGTACCCCAGACCTGTTTCA
ACATGTGATGCCCGGGCTTCAGATCTTGCTGGAGAAATCGTTGCAGCATTATCAGCTTCATCATTAGTATTCAGAGAAGATACCAACTATTCAGGAGAATTAGCA
AAAGCTGCAGAGAAATTGTTTCAGCAAGTGACTAAATTAGACCCAATTGAACAAGGAACTTACAGCTCAGTTGATTCTTGTGGAGGAGAAGCAAGAAAGTTCTAC
AACTCATCAAGTTACACGGATGAATTGATATGGGCAGGAACTTGGCTGTTCTTTGCAACTGGAAACACTTCATATCTTTCATATGCCACTGATGCTGTCCGATTT
CAGTTAGCACAAAGTGAGGAAGCAAGTATTGGCAGAGGAATTTTCAATTGGAACAATAAGTTCAGTGCAACTGCGGTAACTCCAATCAATGTCTTATTTGAATTT
CAGATAAAATGTAAAAGATCTAAGACAGATTTGACATATGGCTTACCTCACTACATTTTCCATAGGTATTATTGACACGCCTTCTCTACTTTCATGATACTGGCT
ATCCATATGAATATGCCTTGGGAGTATCATCAAACATGACAGAAATCCTCATGTGTTCTTATCTCATTGATCAACACTTCAACAGGACACCTAGTAAACCATCCA
TTCAGATCCCATGCATCACAGACTTACTATCCCCACTGATTTTTATTGTCAACAACAAGAATATGTGTAAGTTTGTTTCCAGTAATTTGCTAATCTTCCAATTGT
TAATGTAGGTGGATTGATCCTCCTAAGCCCTGATGATAAAGCACCACTCCAATTTGCTGCAACAGCATCATTTCTCAGTAAATTGTACAGTGATTACCTTGATCT
TTTGGGAGCATCTTACATGAGTTGCATTTTTGCCAATCCTCGCTTTTCTTTGGAAAAGTTGCGGAGCTTCTCCAAATCTCAGGCAAGTGCTGTATGAAGAATCAT
TTCTTTGTCTTCCATTTCGAAAAACAGAAAAATGTAGAACATTATAACATTCCCCAAAAATCACCTCTAGTTGAATGATCCAGGGAGCAATAAGACAACGTTTTT
AATTACCTTTTTGTTTGGTAAAGGAAAAACATAATGCTCTACATTCTAAGAAAAATAAAACTAATCATTGAACTGTCTTTTGAGAATTATTATCTTTATGTTTTC
AAAAGAAGTTTTCACACTTGTTAATTGGAAAACAAAACCAAACAATTGTTAGAAACATTGACATAAAGATTTTCAACCGAAGTTTAACAATCCTTAAGTAAGATA
TTTAATAGTATTCAAAATTTTCAGCTTAACTACATACTCGGGGATAATCCATTGAAAATGAGCTACGTAGTTGGCTATGGCAACAATTTTCCCACCCACGTCCAC
CACAGAGCTGCCTCAATTCCTTGGGATGGTCAATTCTATTCATGTGCTGAAGGAGATAGATGGCTGTTATCAAAGGCTTCGAATCCAAATATTCTTTCCGGAGCC
ATGGTGGCTGGTCCAGACAAGTTTGACCATTTCTCAGATGATAGGGAAAAACCTTGGTTTACTGAACCAAGCATAGCAAGCAATGCAGGTTTAGTCGCAGCACTT
GTTGCTCTAAATGATTATCCAGGTGACACCCCGGATTTTAATGGAAAAAATTTAGGCATAGATCAGATGTCAATCTTTAATAGAATCCCTAAGGCTTCCAGAGCT
CCTTGATAGTCCTAAGATATATATTTTTTCTCACAAGATCAGAAATGTGAATTCAGAAATCTTCACTTTGTAATATCCCACCACCCAGAAATGTTTCTTACAAGC
AAGAATCTGAAGGAAAAAAAAAAACTTTTCTTCCCGAACTCCATGCCTTTCTTAAGCAAATAACTATGCAATCCCATTATCAATTCTTTGCATTTGCATTTTTTA
TCACTGCTTTCTTTCTTTCTATCTCCACAAGCTAATATATAGCATTCCCAGTTTTTGATATTTGCCAATGAACCATCCAACCAAATATTAGCCTCTTCAAACTAG
AGATTGTTTTTAAATGTGAAAGATGAGAGTTAAAGACTCGAGAAGGTGTATTTTACTTCCACAATTCAGTAGTACATGAGAAAAGCATTGTTTCACTTCAACAGG
ATTTAGTTGTTCGTTTGAAATTCAAACACTAAAACATATCTTGAATTAACAATAAACCTAAAAGCTAAGCTTATAGAAAAGGCAAATTTAATATTATATCATCCA
AGGCAACACTCCCCTCACTTGTGGACATATGGAAAAACTCAACAAGTGGAAATCAATTTTGATTAAGAAAGAAATAACACTGCAGGAGCTTGAACACAAAATCTC
CTCAATCACCTGCAAAGAAAACTCGTTTGAAGACTAACATAACATCATGAACAAATATAATAGCATGCGATCCAAATTTTCACATACATTCTGTAAGGAAATCGG
TGATACATATTACTCAACCAATCCGCAAGTTCGATTAAAATAAACTCAAACAAATACATTGTATTGATCTTTCAAAATTCTAGTGTCCCAACCAACGCTAGAAGA
AAAGACCTAACCAGTAACCACTCAGCAGCCATGTACATGGAGATCAAAGTATATGAAGTAAACACATTACCCACAACCGGAAA
Protein sequenceShow/hide protein sequence
MMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLS
YATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVTPINVLFEFQIKCKRSKTDLTYGLPHYIFHRYY