; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G13097 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G13097
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionAlpha/beta-Hydrolases superfamily protein
Genome locationctg1838:136022..141748
RNA-Seq ExpressionCucsat.G13097
SyntenyCucsat.G13097
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149211.1 uncharacterized protein LOC101206168 [Cucumis sativus]4.23e-153100Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
        EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
        NK
Subjt:  NK

XP_008442837.1 PREDICTED: uncharacterized protein LOC103486604 [Cucumis melo]1.33e-14291.09Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGGQAVWSCL YIPNRLAGAALLAPVVNYWWPG PANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQ+WFPSSS++A NPE+LSRQDKELLSK+VGR
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
         ECEL+FSQQGEYESIHKD NVGFG+WEFSPLDLENPFPGNEGSVHLWHGDEDK+VPVTL RYIAKQL WIHYHEIAGAGH FPYADGMSESIIK LLLN
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
        +K
Subjt:  NK

XP_038904917.1 uncharacterized protein LOC120091130 isoform X1 [Benincasa hispida]3.93e-13890.59Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGGQAVWSCL YIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWT+RVAHYTPWLTYWWNTQRWFPSSSIIA +P +LSRQDKELLSKQVGR
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
        +ECEL+F QQGEYESIH+D NVGFG+WEFSPLDLENPFP NEGSVHLWHGDEDK+VPVTLQRYIAKQL WIHYHEIAGAGH   YADGMSESIIKALLLN
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
         K
Subjt:  NK

XP_038904919.1 uncharacterized protein LOC120091130 isoform X2 [Benincasa hispida]3.42e-13890.59Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGGQAVWSCL YIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWT+RVAHYTPWLTYWWNTQRWFPSSSIIA +P +LSRQDKELLSKQVGR
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
        +ECEL+F QQGEYESIH+D NVGFG+WEFSPLDLENPFP NEGSVHLWHGDEDK+VPVTLQRYIAKQL WIHYHEIAGAGH   YADGMSESIIKALLLN
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
         K
Subjt:  NK

XP_038904920.1 uncharacterized protein LOC120091130 isoform X3 [Benincasa hispida]3.31e-13890.59Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGGQAVWSCL YIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWT+RVAHYTPWLTYWWNTQRWFPSSSIIA +P +LSRQDKELLSKQVGR
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
        +ECEL+F QQGEYESIH+D NVGFG+WEFSPLDLENPFP NEGSVHLWHGDEDK+VPVTLQRYIAKQL WIHYHEIAGAGH   YADGMSESIIKALLLN
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
         K
Subjt:  NK

TrEMBL top hitse value%identityAlignment
A0A1S3B7F3 uncharacterized protein LOC1034866046.45e-14391.09Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGGQAVWSCL YIPNRLAGAALLAPVVNYWWPG PANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQ+WFPSSS++A NPE+LSRQDKELLSK+VGR
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
         ECEL+FSQQGEYESIHKD NVGFG+WEFSPLDLENPFPGNEGSVHLWHGDEDK+VPVTL RYIAKQL WIHYHEIAGAGH FPYADGMSESIIK LLLN
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
        +K
Subjt:  NK

A0A6J1CU12 uncharacterized protein LOC1110142842.37e-13385.15Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGGQAVWSCL YIPNRLAGAALLAPV+NYWWPGLPAN+TN AFYQQF++DQW VRVAHYTPWLTYWWNTQRWFPSSS+IA +P+ LSRQDKEL SKQVG 
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
        +ECE +FSQQGE+ESIH+D NVGFG+WEFSP+DLENPFP NEGSVHLWHGDED+LVPVTLQRYIAKQL WIHYHE+ G GHRFPYADG+SESIIKALLLN
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
         K
Subjt:  NK

A0A6J1FA44 uncharacterized protein LOC1114421926.85e-12683.17Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGGQ VWSCL YIPNRLAGAALLAP +NYWW GLPANLTNEAFYQQ  QDQW VRVAHYTPWLTYWWNTQR FPSSSIIA +   LS QDKEL SK VGR
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
        +EC+ +FSQQGE+ESIH+D NVGFG+WEFSPLDLENPFPGNEGSVHLW GDEDK+VP  LQR+IAKQL WIHYHE+AGAGHRFP ADGMSESIIKALLLN
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
         K
Subjt:  NK

A0A6J1J625 uncharacterized protein LOC1114816681.03e-12784.16Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGGQ VWSCL YIPNRLAGAALLAP +NYWW GLPANLTNEAFYQQ  QDQW VRVAHYTPWLTYWWNTQRWFPSSSIIA +   LS QDKEL SK VGR
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
        +EC+ +FSQQGE+ESIH+D NVGFGRWEFSPLDLENPFPGNEGSVHLW GDEDK+VPV LQR+IAKQL WIHYHE+ GAGHRFP ADGMSESIIKALLLN
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
         K
Subjt:  NK

A0A6J1KW38 uncharacterized protein LOC1114987179.71e-12683.17Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MG QAVWSCL YIPNRLAGAALLAPV+NYWWPGLPANLT EAFYQQF++DQW VRVAHYTPWLTYWW TQ+WFPSSSI+  NP +LSRQDKEL SK+V R
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
        E C  V SQQGE ESIH+D  VGFGRWEFSPL+LENPFP  EGSVHLWHGDEDK+VPVTLQRYIAKQL WIHYHE+AGAGH FP ADGMSESIIKALLLN
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
        +K
Subjt:  NK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G74280.1 alpha/beta-Hydrolases superfamily protein1.0e-6857.43Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSK-QVG
        MGGQA W CL YIP+RLAG  L+APVVNY+W  LP N++ E F  Q ++DQ  VRVAHYTPWL YWWNTQ+WFP SSI   +  +L++ DK+++SK    
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSK-QVG

Query:  REECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLL
        R+       QQG +ESI++D  VGFG WEF PLDLENPF   EGSVHLW GDED LVP  LQRY+A QL W+HYHE+  +GH F Y  G+ + I+K+LL 
Subjt:  REECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLL

Query:  NN
        ++
Subjt:  NN

AT1G74290.1 alpha/beta-Hydrolases superfamily protein1.5e-6757.71Show/hide
Query:  MGGQAVWSCLN--YIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQ-
        MGGQA W CLN  YIP+RLAG  L+APVVNY+W  LP N++ E F  Q ++DQW VRVAHY PWL YWWNTQ+WFP SS IA    +LS+ D++++SK+ 
Subjt:  MGGQAVWSCLN--YIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQ-

Query:  VGREECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKAL
          R+       QQG +ESI++D  VGFG WEF PLDL+NPF  NEG VHLW GDED LVPV LQRY+A QL W+HYHE+  +GH F +  G+ ++I+  L
Subjt:  VGREECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKAL

Query:  L
        L
Subjt:  L

AT1G74300.1 alpha/beta-Hydrolases superfamily protein4.9e-7159.3Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGGQA W CL Y P+RLAG  L+APVVNY+W  LP N++ E F  Q ++DQW VRVAHY PWL YWWNTQ WFP SS++  +  VLS+ DK+++ K    
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELV-FSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALL
         +  L    QQG +ESI++D  VGFG WEF PL+LENPF   EGSVHLW GDED LVPVTLQRYIA +L W+HYHE+AG GH FP A G+ + I+K  L
Subjt:  EECELV-FSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALL

AT2G36290.1 alpha/beta-Hydrolases superfamily protein8.4e-7962.87Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSK-QVG
        MGGQA W+CL YIP+RLAG  L+APVVNYWW   P+ ++ EAF QQ R DQW VRVAHY PWLT+WWN+Q WFP SS++A N  +LS+ DKE++ K    
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSK-QVG

Query:  REECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLL
        R + E    QQG +E++H+D  VGFG WEF P++LEN FP NEGSVHLW GD+D LVPVTLQRYIAK+L WIHYHEI GAGH FP+A GM  +I+K LL 
Subjt:  REECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLL

Query:  NN
        N+
Subjt:  NN

AT3G48410.1 alpha/beta-Hydrolases superfamily protein1.4e-7860.89Show/hide
Query:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR
        MGG+  W+CLNYIP+RLAGAAL+AP +NYWW  LP +LT EAF      DQW++RVAHY PWLTYWWNTQ+WFP S++IAGNP + SRQD E+LSK    
Subjt:  MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGR

Query:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN
                QQGEY S+H+D NV F  WEF PLDL++PFP N GSVH+W+GDEDK VPV LQRY+A +L WI YHEI+G+GH  P+ +GM++ IIK+LL+ 
Subjt:  EECELVFSQQGEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLN

Query:  NK
         +
Subjt:  NK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGCCAAGCAGTTTGGAGCTGTCTAAATTATATTCCTAACAGGCTAGCAGGAGCAGCACTATTGGCGCCAGTTGTTAACTACTGGTGGCCTGGGCTCCCTGCAAA
CTTAACGAATGAAGCTTTCTACCAACAGTTTCGGCAGGATCAGTGGACAGTTCGTGTAGCTCATTACACCCCTTGGCTTACCTACTGGTGGAATACACAAAGATGGTTTC
CTTCTTCTAGTATTATTGCTGGTAATCCTGAAGTTCTATCTCGTCAAGACAAAGAACTCTTGTCCAAGCAAGTGGGAAGGGAAGAGTGTGAGCTTGTATTTAGCCAACAA
GGAGAATACGAGTCCATTCACAAGGATACGAACGTTGGATTTGGGAGGTGGGAATTTAGTCCTCTGGATCTTGAAAACCCTTTCCCAGGTAATGAAGGTTCGGTCCATTT
ATGGCATGGAGATGAAGACAAGCTCGTGCCTGTCACTCTCCAACGTTACATTGCCAAACAGCTTTCATGGATTCATTATCACGAGATAGCAGGTGCTGGTCATCGCTTTC
CCTATGCTGATGGCATGTCTGAATCCATCATTAAAGCTCTTCTCCTTAACAACAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGGCCAAGCAGTTTGGAGCTGTCTAAATTATATTCCTAACAGGCTAGCAGGAGCAGCACTATTGGCGCCAGTTGTTAACTACTGGTGGCCTGGGCTCCCTGCAAA
CTTAACGAATGAAGCTTTCTACCAACAGTTTCGGCAGGATCAGTGGACAGTTCGTGTAGCTCATTACACCCCTTGGCTTACCTACTGGTGGAATACACAAAGATGGTTTC
CTTCTTCTAGTATTATTGCTGGTAATCCTGAAGTTCTATCTCGTCAAGACAAAGAACTCTTGTCCAAGCAAGTGGGAAGGGAAGAGTGTGAGCTTGTATTTAGCCAACAA
GGAGAATACGAGTCCATTCACAAGGATACGAACGTTGGATTTGGGAGGTGGGAATTTAGTCCTCTGGATCTTGAAAACCCTTTCCCAGGTAATGAAGGTTCGGTCCATTT
ATGGCATGGAGATGAAGACAAGCTCGTGCCTGTCACTCTCCAACGTTACATTGCCAAACAGCTTTCATGGATTCATTATCACGAGATAGCAGGTGCTGGTCATCGCTTTC
CCTATGCTGATGGCATGTCTGAATCCATCATTAAAGCTCTTCTCCTTAACAACAAATAA
Protein sequenceShow/hide protein sequence
MGGQAVWSCLNYIPNRLAGAALLAPVVNYWWPGLPANLTNEAFYQQFRQDQWTVRVAHYTPWLTYWWNTQRWFPSSSIIAGNPEVLSRQDKELLSKQVGREECELVFSQQ
GEYESIHKDTNVGFGRWEFSPLDLENPFPGNEGSVHLWHGDEDKLVPVTLQRYIAKQLSWIHYHEIAGAGHRFPYADGMSESIIKALLLNNK