; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G24330 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G24330
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionbeta-glucosidase BoGH3B isoform X1
Genome locationClcChr05:32325490..32332086
RNA-Seq ExpressionClc05G24330
SyntenyClc05G24330
Gene Ontology termsGO:0009251 - glucan catabolic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domainsIPR001764 - Glycoside hydrolase, family 3, N-terminal
IPR017853 - Glycoside hydrolase superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011655822.1 uncharacterized protein LOC101209588 isoform X1 [Cucumis sativus]5.4e-6678.48Show/hide
Query:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT
        L LKL WK KE G+ +Q  KMAKIFVQVV+ LCLGW  WA+MVD +NLKYKDPK PV VRVKDLLGRMTLEEKIGQM QIDRSVANATVMK+YFIGSVL+
Subjt:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT

Query:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        GGG+  LPDARA+DWVNMIN+ QKGSL+SRLGIPM YG+DAVHGHNN +NAT+FPHN+
Subjt:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

XP_022150694.1 uncharacterized protein LOC111018764 isoform X1 [Momordica charantia]2.0e-6577.85Show/hide
Query:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT
        LKLKL WK ++ G TSQ+ KMA+IFVQVV  LCLGWW WA+ VD + LKYKDPK PVAVRV DLLGRMTLEEKIGQM QIDRSVAN TVMK+Y IGSVL+
Subjt:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT

Query:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        GGG+  LPDARA+DWVNMINE QKGSL+SRLGIPMMYG+DAVHGHNN +NAT+FPHN+
Subjt:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

XP_022150698.1 uncharacterized protein LOC111018764 isoform X3 [Momordica charantia]2.0e-6577.85Show/hide
Query:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT
        LKLKL WK ++ G TSQ+ KMA+IFVQVV  LCLGWW WA+ VD + LKYKDPK PVAVRV DLLGRMTLEEKIGQM QIDRSVAN TVMK+Y IGSVL+
Subjt:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT

Query:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        GGG+  LPDARA+DWVNMINE QKGSL+SRLGIPMMYG+DAVHGHNN +NAT+FPHN+
Subjt:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

XP_022945501.1 uncharacterized protein LOC111449719 isoform X1 [Cucurbita moschata]1.1e-6679.11Show/hide
Query:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT
        L LKLPWK KE G  S   KMAK FVQVV+ LCLGWW WA MV  +NLKYKDPK PV+VRVKDLLGRMTLEEKIGQM QIDRSVANATVMKNYFIGSVL+
Subjt:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT

Query:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        GGG+  LPDARAQDWV+MIN+ QKGSL+SRLGIPM+YG+DAVHGHNN +NAT+FPHN+
Subjt:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

XP_022968400.1 uncharacterized protein LOC111467651 isoform X2 [Cucurbita maxima]1.5e-6880.38Show/hide
Query:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT
        L LKLPWK KE G+ S   KMAKIFVQVV+ LCLGWW WA MVD +NLKYKDPK PV+VRVKDLLGRMTLEEKIGQM QIDRSVANATVMKNYFIGSVL+
Subjt:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT

Query:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        GGG+  LPDARAQDWV+MIN+ QKGSL+SRLGIPM+YG+DAVHGHNN +NAT+FPHN+
Subjt:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

TrEMBL top hitse value%identityAlignment
A0A1S3BGE4 beta-glucosidase BoGH3B isoform X11.3e-6573.96Show/hide
Query:  LLTKILHQLGTLKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATV
        L T++L  +  L LKL WK KE G+ +Q  KMAKIFVQVV+ LCLGW  WA+MVD +NLKYKDPK PV VRVKDLLGRMTLEEKIGQM QIDRSVANATV
Subjt:  LLTKILHQLGTLKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATV

Query:  MKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        MK+YFIGS+L+GGG+  LPDARA+DWV+MIN+ QKGSL+SRLGIPM YG+DAVHGHNN +NAT+FPHN+
Subjt:  MKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

A0A6J1DA47 uncharacterized protein LOC111018764 isoform X39.9e-6677.85Show/hide
Query:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT
        LKLKL WK ++ G TSQ+ KMA+IFVQVV  LCLGWW WA+ VD + LKYKDPK PVAVRV DLLGRMTLEEKIGQM QIDRSVAN TVMK+Y IGSVL+
Subjt:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT

Query:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        GGG+  LPDARA+DWVNMINE QKGSL+SRLGIPMMYG+DAVHGHNN +NAT+FPHN+
Subjt:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

A0A6J1DCA3 uncharacterized protein LOC111018764 isoform X19.9e-6677.85Show/hide
Query:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT
        LKLKL WK ++ G TSQ+ KMA+IFVQVV  LCLGWW WA+ VD + LKYKDPK PVAVRV DLLGRMTLEEKIGQM QIDRSVAN TVMK+Y IGSVL+
Subjt:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT

Query:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        GGG+  LPDARA+DWVNMINE QKGSL+SRLGIPMMYG+DAVHGHNN +NAT+FPHN+
Subjt:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

A0A6J1G118 uncharacterized protein LOC111449719 isoform X15.2e-6779.11Show/hide
Query:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT
        L LKLPWK KE G  S   KMAK FVQVV+ LCLGWW WA MV  +NLKYKDPK PV+VRVKDLLGRMTLEEKIGQM QIDRSVANATVMKNYFIGSVL+
Subjt:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT

Query:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        GGG+  LPDARAQDWV+MIN+ QKGSL+SRLGIPM+YG+DAVHGHNN +NAT+FPHN+
Subjt:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

A0A6J1HX36 uncharacterized protein LOC111467651 isoform X27.3e-6980.38Show/hide
Query:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT
        L LKLPWK KE G+ S   KMAKIFVQVV+ LCLGWW WA MVD +NLKYKDPK PV+VRVKDLLGRMTLEEKIGQM QIDRSVANATVMKNYFIGSVL+
Subjt:  LKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLT

Query:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        GGG+  LPDARAQDWV+MIN+ QKGSL+SRLGIPM+YG+DAVHGHNN +NAT+FPHN+
Subjt:  GGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

SwissProt top hitse value%identityAlignment
A7LXU3 Beta-glucosidase BoGH3B2.4e-0830.25Show/hide
Query:  VAVRVKDLLGRMTLEEKIGQMAQIDRSVAN-----------------ATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGV
        +   +++ L +MTLE+KIGQM +I   V +                  TV+  Y +GS+L      L    + + W   I +IQ+ S+   +GIP +YGV
Subjt:  VAVRVKDLLGRMTLEEKIGQMAQIDRSVAN-----------------ATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGV

Query:  DAVHGHNNAFNATIFPHNM
        D +HG     + T+FP  +
Subjt:  DAVHGHNNAFNATIFPHNM

B8NGU6 Probable beta-glucosidase C2.7e-0428.89Show/hide
Query:  YKDPKHPVAVRVKDLLGRMTLEEKIGQMAQI---------DRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVD
        Y+D  + +  RV DLL RMT+EEK GQ+            + S  NA    +  IG               A +    IN IQ+ +L +RLGIP+    D
Subjt:  YKDPKHPVAVRVKDLLGRMTLEEKIGQMAQI---------DRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVD

Query:  AVHGH----NNAFNATIF---PHNMLIQSLYFTYI
          H         F A +F   P ++ + +L   Y+
Subjt:  AVHGH----NNAFNATIF---PHNMLIQSLYFTYI

Q23892 Lysosomal beta glucosidase5.1e-1138.6Show/hide
Query:  VKDLLGRMTLEEKIGQMAQIDRS--------VANATVM----KNYFIGSVL----TGGGTELLPDARAQDWVNMINEIQKGSLT-SRLGIPMMYGVDAVH
        V +L+ +M++ EKIGQM Q+D +          N T +    K Y+IGS L    +GG    +    +  W++MIN IQ   +  S   IPM+YG+D+VH
Subjt:  VKDLLGRMTLEEKIGQMAQIDRS--------VANATVM----KNYFIGSVL----TGGGTELLPDARAQDWVNMINEIQKGSLT-SRLGIPMMYGVDAVH

Query:  GHNNAFNATIFPHN
        G N    AT+FPHN
Subjt:  GHNNAFNATIFPHN

Q2UFP8 Probable beta-glucosidase C1.2e-0429.63Show/hide
Query:  YKDPKHPVAVRVKDLLGRMTLEEKIGQMAQI---------DRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVD
        YKD  + +  RV DLL RMT+EEK GQ+            + S  NA    +  IG               A +    IN IQ+ +L +RLGIP+    D
Subjt:  YKDPKHPVAVRVKDLLGRMTLEEKIGQMAQI---------DRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVD

Query:  AVHGH----NNAFNATIF---PHNMLIQSLYFTYI
          H         F A +F   P ++ + +L   Y+
Subjt:  AVHGH----NNAFNATIF---PHNMLIQSLYFTYI

Q5BCC6 Beta-glucosidase C4.2e-0533.06Show/hide
Query:  YKDPKHPVAVRVKDLLGRMTLEEKIGQMAQ--------IDRSVANATV-------MKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIP
        YK+  + V  RV+DLL RMTLEEK GQ+           D S  N+T        M ++ + S +T           A      IN IQK +L +RLGIP
Subjt:  YKDPKHPVAVRVKDLLGRMTLEEKIGQMAQ--------IDRSVANATV-------MKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIP

Query:  MMYGVDAVHGH----NNAFNATIF
        +    D  H         F A +F
Subjt:  MMYGVDAVHGH----NNAFNATIF

Arabidopsis top hitse value%identityAlignment
AT3G47010.1 Glycosyl hydrolase family protein2.9e-3359.63Show/hide
Query:  YKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAF
        YK+   PV  RVKDLL RMTL EKIGQM QI+RSVA+  V+ N FIGSV +G G+  L DA++ DW +MI+  Q+ +L SRLGIP++YG DAVHG+NN +
Subjt:  YKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAF

Query:  NATIFPHNM
         AT+FPHN+
Subjt:  NATIFPHNM

AT5G04885.1 Glycosyl hydrolase family protein5.0e-4665.41Show/hide
Query:  VQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKG
        V V++ +C+  W+     DG+ L YKDPK  V+ RV DL GRMTLEEKIGQM QIDRSVA   +M++YFIGSVL+GGG+  LP+A AQ+WV+MINE QKG
Subjt:  VQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKG

Query:  SLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        +L SRLGIPM+YG+DAVHGHNN +NATIFPHN+
Subjt:  SLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

AT5G20940.1 Glycosyl hydrolase family protein5.1e-3854.14Show/hide
Query:  VQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKG
        +Q +  L L   + A+ V   N KYKDPK P+ VR+K+L+  MTLEEKIGQM Q++R  A   VM+ YF+GSV +GGG+   P    + WVNM+NE+QK 
Subjt:  VQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKG

Query:  SLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM
        +L++RLGIP++YG+DAVHGHN  +NATIFPHN+
Subjt:  SLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNM

AT5G20950.1 Glycosyl hydrolase family protein1.7e-4167.57Show/hide
Query:  LKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNN
        LKYKDPK P+  R++DL+ RMTL+EKIGQM QI+RSVA   VMK YFIGSVL+GGG+     A  + WVNM+NEIQK SL++RLGIPM+YG+DAVHGHNN
Subjt:  LKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNN

Query:  AFNATIFPHNM
         + ATIFPHN+
Subjt:  AFNATIFPHNM

AT5G20950.2 Glycosyl hydrolase family protein1.7e-4167.57Show/hide
Query:  LKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNN
        LKYKDPK P+  R++DL+ RMTL+EKIGQM QI+RSVA   VMK YFIGSVL+GGG+     A  + WVNM+NEIQK SL++RLGIPM+YG+DAVHGHNN
Subjt:  LKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNN

Query:  AFNATIFPHNM
         + ATIFPHN+
Subjt:  AFNATIFPHNM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAAATTATTGACTAAAATTTTACATCAACTCGGAACGCTTAAATTAAAACTACCATGGAAGTGTAAAGAGAGGGGAGTAACCTCTCAGAGATGGAAGATGGCCAA
GATTTTTGTTCAGGTGGTTATGACTCTGTGCCTGGGATGGTGGTTGTGGGCATCAATGGTGGACGGGGATAACTTGAAATACAAAGACCCTAAGCACCCAGTTGCTGTTC
GAGTTAAGGACCTTCTTGGCCGAATGACTCTGGAAGAGAAAATTGGTCAGATGGCTCAGATTGACAGGAGCGTTGCCAATGCTACAGTTATGAAAAATTATTTCATCGGA
AGTGTGCTAACTGGTGGTGGAACTGAGCTACTTCCAGATGCTCGTGCTCAAGACTGGGTTAACATGATAAATGAAATCCAGAAAGGTTCTCTTACTAGCCGATTGGGTAT
ACCAATGATGTATGGTGTTGATGCTGTTCATGGCCATAACAATGCTTTTAATGCTACAATATTTCCTCATAATATGCTTATACAAAGCCTTTACTTCACATACATCTTTG
ATGTGGACTGA
mRNA sequenceShow/hide mRNA sequence
GTAAGTAATTTGTGTCTTGGATTCCTTTCTCTCACTTTCTTTCTTCTTAATCTTTCTTTGGCTTTCCTAATCTGATTTTCCCATTAGCTGCCTTTTAAAAGAAACTCTGT
CGGTTTCAGGATGCTTAAATTATTGACTAAAATTTTACATCAACTCGGAACGCTTAAATTAAAACTACCATGGAAGTGTAAAGAGAGGGGAGTAACCTCTCAGAGATGGA
AGATGGCCAAGATTTTTGTTCAGGTGGTTATGACTCTGTGCCTGGGATGGTGGTTGTGGGCATCAATGGTGGACGGGGATAACTTGAAATACAAAGACCCTAAGCACCCA
GTTGCTGTTCGAGTTAAGGACCTTCTTGGCCGAATGACTCTGGAAGAGAAAATTGGTCAGATGGCTCAGATTGACAGGAGCGTTGCCAATGCTACAGTTATGAAAAATTA
TTTCATCGGAAGTGTGCTAACTGGTGGTGGAACTGAGCTACTTCCAGATGCTCGTGCTCAAGACTGGGTTAACATGATAAATGAAATCCAGAAAGGTTCTCTTACTAGCC
GATTGGGTATACCAATGATGTATGGTGTTGATGCTGTTCATGGCCATAACAATGCTTTTAATGCTACAATATTTCCTCATAATATGCTTATACAAAGCCTTTACTTCACA
TACATCTTTGATGTGGACTGA
Protein sequenceShow/hide protein sequence
MLKLLTKILHQLGTLKLKLPWKCKERGVTSQRWKMAKIFVQVVMTLCLGWWLWASMVDGDNLKYKDPKHPVAVRVKDLLGRMTLEEKIGQMAQIDRSVANATVMKNYFIG
SVLTGGGTELLPDARAQDWVNMINEIQKGSLTSRLGIPMMYGVDAVHGHNNAFNATIFPHNMLIQSLYFTYIFDVD