; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G014510 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G014510
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionMannan endo-1,4-beta-mannosidase
Genome locationCmo_Chr02:8537724..8540966
RNA-Seq ExpressionCmoCh02G014510
SyntenyCmoCh02G014510
Gene Ontology termsGO:0071704 - organic substance metabolic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016985 - mannan endo-1,4-beta-mannosidase activity (molecular function)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR017853 - Glycoside hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606064.1 hypothetical protein SDJN03_03381, partial [Cucurbita argyrosperma subsp. sororia]1.0e-13278.77Show/hide
Query:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV
        MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIV SVFVRNPNV
Subjt:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV

Query:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGG----------------LNISSYTE--------------
        ASFKY KATTVIYYH++MIGEGET GGEAKAKDTMTMNVT+EIKAEEMDEGLSLMEDLKSGG                +N  S+T+              
Subjt:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGG----------------LNISSYTE--------------

Query:  --------------IPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGD
                      +    ++      RTVEENFDLLDLE AKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGD
Subjt:  --------------IPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGD

Query:  RYFDPQEWLQGLSLVAQRFSKKSTV
        RYFDPQEWLQGLSLVAQRFSKKSTV
Subjt:  RYFDPQEWLQGLSLVAQRFSKKSTV

XP_022930165.1 uncharacterized protein LOC111436674 [Cucurbita moschata]3.8e-7193.2Show/hide
Query:  MTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIA
        MTMNVT+EIKA EMD+GLSLMEDLKSGGLNISS+TEIPGRVKIIGFI+KRTVEENFDLLDLE AKAGLAQ NPFVLNKTIAEAYEAVVDVLGESGLMVIA
Subjt:  MTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIA

Query:  DNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV
        DNH+SQP WCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKK TV
Subjt:  DNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV

XP_022958536.1 uncharacterized protein LOC111459739 [Cucurbita moschata]1.3e-95100Show/hide
Query:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV
        MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV
Subjt:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV

Query:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR
        ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR
Subjt:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR

XP_022995815.1 uncharacterized protein LOC111491239 [Cucurbita maxima]1.2e-8893.44Show/hide
Query:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV
        M EKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSN TSNGG+IIV SVFVRNPNV
Subjt:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV

Query:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRK
        ASFKY KAT +IYYH  MIGEGETPGGEAKAKDTMTMNVT+EIKAEEMDEGLSLMEDLKSGGLNISSY EIPGRVKIIGFI+K
Subjt:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRK

XP_023532981.1 uncharacterized protein LOC111794996 [Cucurbita pepo subsp. pepo]2.8e-9094.02Show/hide
Query:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV
        MAEKDQVKPLASPATHLRSDDD FLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGG+IIV SVFVRNPN 
Subjt:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV

Query:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR
        ASFKY KATTVIYYH ++IGEGETPGGEAKAKDTMTMNVT+EIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFI+K+
Subjt:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR

TrEMBL top hitse value%identityAlignment
A0A6J1EW80 Mannan endo-1,4-beta-mannosidase1.8e-7193.2Show/hide
Query:  MTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIA
        MTMNVT+EIKA EMD+GLSLMEDLKSGGLNISS+TEIPGRVKIIGFI+KRTVEENFDLLDLE AKAGLAQ NPFVLNKTIAEAYEAVVDVLGESGLMVIA
Subjt:  MTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIA

Query:  DNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV
        DNH+SQP WCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKK TV
Subjt:  DNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV

A0A6J1H2C0 uncharacterized protein LOC1114597396.3e-96100Show/hide
Query:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV
        MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV
Subjt:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV

Query:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR
        ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR
Subjt:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR

A0A6J1IDD0 Mannan endo-1,4-beta-mannosidase2.0e-7090.48Show/hide
Query:  MTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIA
        M  NVT+EI+AEEMDEGLSLMEDLKSGGLNIS +T++PGRVKI+GFI+KRTVEENFDLLDLE AKAGLAQ NPFVLNKTIAEAYEAVVDVLGESGLMVIA
Subjt:  MTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIA

Query:  DNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV
        DNH+SQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKK TV
Subjt:  DNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV

A0A6J1IFW7 Mannan endo-1,4-beta-mannosidase2.0e-7090.48Show/hide
Query:  MTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIA
        M  NVT+EI+AEEMDEGLSLMEDLKSGGLNIS +T++PGRVKI+GFI+KRTVEENFDLLDLE AKAGLAQ NPFVLNKTIAEAYEAVVDVLGESGLMVIA
Subjt:  MTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIA

Query:  DNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV
        DNH+SQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKK TV
Subjt:  DNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV

A0A6J1K4Z4 uncharacterized protein LOC1114912395.7e-8993.44Show/hide
Query:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV
        M EKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSN TSNGG+IIV SVFVRNPNV
Subjt:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNV

Query:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRK
        ASFKY KAT +IYYH  MIGEGETPGGEAKAKDTMTMNVT+EIKAEEMDEGLSLMEDLKSGGLNISSY EIPGRVKIIGFI+K
Subjt:  ASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRK

SwissProt top hitse value%identityAlignment
C0HLA0 Glycosyl hydrolase 5 family protein2.8e-2450.52Show/hide
Query:  TVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV
        TV + F  L+L  A +G+  NNP +L+     AY  VV  L E+G+MVI DNH+S+P+WCC++DDGNGFFGDRYF+P  W++GL L+A  F+    V
Subjt:  TVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV

Q6DST1 Late embryogenesis abundant protein At1g640655.3e-0727.41Show/hide
Query:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSG-CFAALLLILAVI---GIVLGFTVLHIKTPDLKIDKLSFSNATSNG-------GVII
        M ++D++  LA    + RSD++Q  P     R+ R K     G C    L I+ +I    ++L    L I  P+++   +S  +  S G          +
Subjt:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSG-CFAALLLILAVI---GIVLGFTVLHIKTPDLKIDKLSFSNATSNG-------GVII

Query:  VVSVFVRNPNVASFKYLKATTVIYYHDK-MIGEGETPGGEAKAKDTMTM-NVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR
        V  + +RN N  +F++  +T  + Y D  ++GE +  G   +A  T+ +  V +EI +  + +   L +DL+ G L + S  E+ GR+K++G  RKR
Subjt:  VVSVFVRNPNVASFKYLKATTVIYYHDK-MIGEGETPGGEAKAKDTMTM-NVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR

Arabidopsis top hitse value%identityAlignment
AT1G13130.1 Cellulase (glycosyl hydrolase family 5) protein3.5e-2245.36Show/hide
Query:  TVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV
        TV ++F  L L     G   NNP +++  + EAY+ VV  LG + +MVI DNH+++P WCC+ DDGNGFFGD++FDP  W+  L  +A  F+  S V
Subjt:  TVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.6e-2237.44Show/hide
Query:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNK-YIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGG----------VII
        MA+ + V+PLA PAT L   D+           HR++  I CS C  A  LIL  I + L FTV  +K P +K++ +  +   S  G          + +
Subjt:  MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNK-YIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGG----------VII

Query:  VVSVFVRNPNVASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEM--DEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR-TV
        +V V V+NPN ASFKY   TT IYY   ++GE     G+A+   T  MNVT++I  + +  D GL   E  +SG +N+ SYT + G+VKI+G ++K  TV
Subjt:  VVSVFVRNPNVASFKYLKATTVIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEM--DEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKR-TV

Query:  EEN
        + N
Subjt:  EEN

AT3G26130.1 Cellulase (glycosyl hydrolase family 5) protein1.2e-1943.75Show/hide
Query:  TVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKST
        TV ++     L  A +G   +NP +L+  + +A++ VV  L +  +MVI DNHISQP WCCS +DGNGFFGD++ +PQ W++GL  +A  F+  S+
Subjt:  TVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKST

AT3G26140.1 Cellulase (glycosyl hydrolase family 5) protein1.7e-2143.3Show/hide
Query:  TVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV
        TV ++F  L L    +G    NP +++  + EAY+ VV  LG + +MVI DNH+++P WCC  +DGNGFFGD +FDP  W+ GL+ +A  F   + V
Subjt:  TVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTV

AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.8e-1934.36Show/hide
Query:  KDQVKPLASPATHLRSD----DDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFS--------NATSNGGVIIVV
        +DQ KPLA      RSD    +DQ+     K    + K I+C G  A+L +++AV  IVL  TV H+ +P+L +D +SF+           +N    + V
Subjt:  KDQVKPLASPATHLRSD----DDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFS--------NATSNGGVIIVV

Query:  SVFVRNPNVASFKYLKATTVIYYHDKM--IGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGL-SLMEDLKSGGLNISSYTEIPGRVKIIGFIRK
         + + NPN A F  +K   V +YH ++  +GE         AK T+ MN+T EI   ++   L  LMEDL   G+++ S  E+ GRVK +   RK
Subjt:  SVFVRNPNVASFKYLKATTVIYYHDKM--IGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGL-SLMEDLKSGGLNISSYTEIPGRVKIIGFIRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAGAAAGACCAGGTCAAACCCTTGGCTTCACCCGCCACCCATCTCCGGAGCGACGACGATCAGTTTCTTCCCCCTCCGGCCAAGCTCCGCCTCCACAGGAACAA
ATACATCATGTGCTCCGGCTGCTTTGCCGCACTCCTCTTGATCCTCGCCGTTATCGGCATCGTCCTCGGCTTCACCGTCCTCCATATCAAAACCCCAGATCTCAAAATCG
ATAAGCTCTCGTTTTCAAATGCTACTTCAAATGGCGGCGTAATCATTGTGGTCAGTGTCTTCGTGCGAAATCCTAATGTCGCGTCGTTCAAATACTTAAAAGCCACGACG
GTGATTTACTACCATGACAAGATGATCGGAGAGGGAGAGACGCCGGGGGGAGAGGCGAAGGCAAAGGACACGATGACGATGAATGTGACGATGGAGATCAAGGCGGAGGA
AATGGATGAGGGTTTGAGTTTGATGGAGGATTTGAAGTCAGGAGGTTTGAATATCAGTAGCTACACGGAAATCCCAGGAAGGGTCAAAATAATTGGATTCATCAGAAAAA
GGACAGTTGAAGAGAATTTTGACCTTCTTGATTTAGAGACAGCAAAAGCTGGATTGGCTCAAAATAATCCTTTTGTGTTGAACAAGACTATTGCTGAAGCTTATGAAGCC
GTTGTTGATGTGCTTGGAGAGAGTGGTTTAATGGTGATTGCTGACAACCATATTAGCCAACCAAGATGGTGTTGCTCTCTTGACGATGGCAATGGTTTCTTTGGAGACCG
CTATTTTGACCCTCAAGAATGGCTACAAGGTCTTAGCTTAGTTGCTCAACGCTTTTCCAAGAAATCAACGGTTCGTAAGCATTAA
mRNA sequenceShow/hide mRNA sequence
TTTACGATCATTTCCCCCATAGCCATGGCCGAGAAAGACCAGGTCAAACCCTTGGCTTCACCCGCCACCCATCTCCGGAGCGACGACGATCAGTTTCTTCCCCCTCCGGC
CAAGCTCCGCCTCCACAGGAACAAATACATCATGTGCTCCGGCTGCTTTGCCGCACTCCTCTTGATCCTCGCCGTTATCGGCATCGTCCTCGGCTTCACCGTCCTCCATA
TCAAAACCCCAGATCTCAAAATCGATAAGCTCTCGTTTTCAAATGCTACTTCAAATGGCGGCGTAATCATTGTGGTCAGTGTCTTCGTGCGAAATCCTAATGTCGCGTCG
TTCAAATACTTAAAAGCCACGACGGTGATTTACTACCATGACAAGATGATCGGAGAGGGAGAGACGCCGGGGGGAGAGGCGAAGGCAAAGGACACGATGACGATGAATGT
GACGATGGAGATCAAGGCGGAGGAAATGGATGAGGGTTTGAGTTTGATGGAGGATTTGAAGTCAGGAGGTTTGAATATCAGTAGCTACACGGAAATCCCAGGAAGGGTCA
AAATAATTGGATTCATCAGAAAAAGGACAGTTGAAGAGAATTTTGACCTTCTTGATTTAGAGACAGCAAAAGCTGGATTGGCTCAAAATAATCCTTTTGTGTTGAACAAG
ACTATTGCTGAAGCTTATGAAGCCGTTGTTGATGTGCTTGGAGAGAGTGGTTTAATGGTGATTGCTGACAACCATATTAGCCAACCAAGATGGTGTTGCTCTCTTGACGA
TGGCAATGGTTTCTTTGGAGACCGCTATTTTGACCCTCAAGAATGGCTACAAGGTCTTAGCTTAGTTGCTCAACGCTTTTCCAAGAAATCAACGGTTCGTAAGCATTAA
Protein sequenceShow/hide protein sequence
MAEKDQVKPLASPATHLRSDDDQFLPPPAKLRLHRNKYIMCSGCFAALLLILAVIGIVLGFTVLHIKTPDLKIDKLSFSNATSNGGVIIVVSVFVRNPNVASFKYLKATT
VIYYHDKMIGEGETPGGEAKAKDTMTMNVTMEIKAEEMDEGLSLMEDLKSGGLNISSYTEIPGRVKIIGFIRKRTVEENFDLLDLETAKAGLAQNNPFVLNKTIAEAYEA
VVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQGLSLVAQRFSKKSTVRKH