; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC10G192190 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC10G192190
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionfibroin heavy chain-like
Genome locationCicolChr10:9044295..9045320
RNA-Seq ExpressionCcUC10G192190
SyntenyCcUC10G192190
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060661.1 hypothetical protein E6C27_scaffold22G005540 [Cucumis melo var. makuwa]1.3e-5442.2Show/hide
Query:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN
        MASLKYFLL PF+FLC+SYTFA+GVFN + G   G  + P PDPSAGPGVD GV N+G+GPKAGPRAG G+ GG+++V  +P                  
Subjt:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN

Query:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG
         GPKAGP+A  G                                                                                        
Subjt:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG

Query:  VSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVGYKPGFGPP--GFGPRPGYWPRPG---FEPYDDCILGYVCPANEARECSKFEYGT
                          K+GVSG  AGPRAGPKGVN    GVGV   P FG P  G  P PG W RPG    EPY +C+LGYVCP N    CSKF YG 
Subjt:  VSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVGYKPGFGPP--GFGPRPGYWPRPG---FEPYDDCILGYVCPANEARECSKFEYGT

Query:  CHSYNFHPLTASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH
        C SYNFHPL+ASTDLHEV INWA+SKP ATAQ+G SGP   +DSAH
Subjt:  CHSYNFHPLTASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH

KAG6577377.1 hypothetical protein SDJN03_24951, partial [Cucurbita argyrosperma subsp. sororia]2.6e-5848.25Show/hide
Query:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN
        M SLKYFLL PFVFLC+S TFAN V NS+DGS                G D     VG GP A P AGPGVE GV+NV A P AE          V +V 
Subjt:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN

Query:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG
        AG KAGP+AGPG EG  S+V AG RAG                  +AGPKA  G E  VS++ AGPR GPKAG G EG VS++ A  R GPKAGP  E  
Subjt:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG

Query:  VSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVG------VGYKPGF------GPPGFGPRPGYWPRPGFEPYDDCILGYVCPANEAREC
        VSNV AGP VGP + P  + GVS +  G R   + V+ ++NG+G      +GY+ GF      G   FGP  G     G    ++C LGYVCP    R C
Subjt:  VSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVG------VGYKPGF------GPPGFGPRPGYWPRPGFEPYDDCILGYVCPANEAREC

Query:  SKFEYGTCHSYNFHPLTASTDLHEVDINWAR-SKPFATAQNG
         KF YG C +Y FHPL AS  LHEV++ WA+ SKP AT QNG
Subjt:  SKFEYGTCHSYNFHPLTASTDLHEVDINWAR-SKPFATAQNG

KGN56231.1 hypothetical protein Csa_011503 [Cucumis sativus]2.2e-5744Show/hide
Query:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN
        MASLKYFLL PF+FLC+SYTFANGVFN +DG   G  + P PDPSAGP VDRGV N G+GPKAGPRAG GV                             
Subjt:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN

Query:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG
                                                                                     GG+SN+   S  GPKAGP V+E 
Subjt:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG

Query:  VSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVG----YKPGFGPP--GFGPRPGYWPRPG---FEPYDDCILGYVCPANEARECSKF
        +SNVGAGPRV       PK+GVS   AGPRAGPKGV+ IV G+GVG      P FG P  G  P PG W  PG    EPY++C+LGYVCP N    C K 
Subjt:  VSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVG----YKPGFGPP--GFGPRPGYWPRPG---FEPYDDCILGYVCPANEARECSKF

Query:  EYGTCHSYNFHPLTASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH
         YG C SYNF PL+AST+LH+V INWA+SK   TAQ+G SGP I IDSAH
Subjt:  EYGTCHSYNFHPLTASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH

XP_022929340.1 fibroin heavy chain-like [Cucurbita moschata]5.3e-4344.41Show/hide
Query:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN
        M SLKYFLL PFVFLC+S TFAN V NS+DGS                G D     VG GP A P AGPGVE GV+NV A P AE          V +V 
Subjt:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN

Query:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG
        AG KAGP+AGPG EG                       S+V AGLRAGPKA  G EG VSN+ AGP VGP+A  G EGGVS    SS GG +    V+  
Subjt:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG

Query:  VSNVGAGPRVGPNSGPEPKVGVS-GTRAGPRAGPKGVNSIVNGVGVGYKPGFGPPGFGPRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSY
        ++ +G G            +GV  G R+G RAG       + G    + PG G  G G              ++C LGYVCP    R C KF YG C +Y
Subjt:  VSNVGAGPRVGPNSGPEPKVGVS-GTRAGPRAGPKGVNSIVNGVGVGYKPGFGPPGFGPRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSY

Query:  NFHPLTASTDLHEVDINWAR-SKPFATAQNG
         FHPL AS  LHEV++ WA+ SKP AT QNG
Subjt:  NFHPLTASTDLHEVDINWAR-SKPFATAQNG

XP_023551823.1 fibroin heavy chain-like isoform X1 [Cucurbita pepo subsp. pepo]3.5e-4747.87Show/hide
Query:  PKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVNAGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVS
        P A P AGPGVE GV+NV A P AE          V +V AG KAGP+AGPG EG  S+V AGPRAG +AG G E   S+V AG RAGPKA  G EG VS
Subjt:  PKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVNAGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVS

Query:  NIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEGVSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVG------VGYKPGF---
        ++ AGPR GPKAG G EG                      V+NV AGP VGP + P  + GVS +  G R   + V+ ++NG+G      +GY+ GF   
Subjt:  NIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEGVSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVG------VGYKPGF---

Query:  ---GPPGFGPRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLTASTDLHEVDINWAR-SKPFATAQNG
           G   FGP  G     G    ++C LGYVCP    R C KF YG C SY FHPL AS  LHEV++ WA+ SKP AT QNG
Subjt:  ---GPPGFGPRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLTASTDLHEVDINWAR-SKPFATAQNG

TrEMBL top hitse value%identityAlignment
A0A0A0L7X7 Uncharacterized protein1.1e-5744Show/hide
Query:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN
        MASLKYFLL PF+FLC+SYTFANGVFN +DG   G  + P PDPSAGP VDRGV N G+GPKAGPRAG GV                             
Subjt:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN

Query:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG
                                                                                     GG+SN+   S  GPKAGP V+E 
Subjt:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG

Query:  VSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVG----YKPGFGPP--GFGPRPGYWPRPG---FEPYDDCILGYVCPANEARECSKF
        +SNVGAGPRV       PK+GVS   AGPRAGPKGV+ IV G+GVG      P FG P  G  P PG W  PG    EPY++C+LGYVCP N    C K 
Subjt:  VSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVG----YKPGFGPP--GFGPRPGYWPRPG---FEPYDDCILGYVCPANEARECSKF

Query:  EYGTCHSYNFHPLTASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH
         YG C SYNF PL+AST+LH+V INWA+SK   TAQ+G SGP I IDSAH
Subjt:  EYGTCHSYNFHPLTASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH

A0A0M9A612 Uncharacterized protein2.5e-1438.89Show/hide
Query:  GVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVNAGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAG
        GV+ G   VG G   G     GVE G   V A         L VE G   V AG   G     GVE G+  V AG   G    LGVE G+  V AG   G
Subjt:  GVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVNAGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAG

Query:  PKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEGVSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVG
           SLGVE G   +GAG   G    LGVE G   +GA S GG      VE G   VGAG       G    +GV     G  AG  GV +   GVG G
Subjt:  PKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEGVSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVG

A0A5A7V4J6 Uncharacterized protein6.5e-5542.2Show/hide
Query:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN
        MASLKYFLL PF+FLC+SYTFA+GVFN + G   G  + P PDPSAGPGVD GV N+G+GPKAGPRAG G+ GG+++V  +P                  
Subjt:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN

Query:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG
         GPKAGP+A  G                                                                                        
Subjt:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG

Query:  VSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVGYKPGFGPP--GFGPRPGYWPRPG---FEPYDDCILGYVCPANEARECSKFEYGT
                          K+GVSG  AGPRAGPKGVN    GVGV   P FG P  G  P PG W RPG    EPY +C+LGYVCP N    CSKF YG 
Subjt:  VSNVGAGPRVGPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVGYKPGFGPP--GFGPRPGYWPRPG---FEPYDDCILGYVCPANEARECSKFEYGT

Query:  CHSYNFHPLTASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH
        C SYNFHPL+ASTDLHEV INWA+SKP ATAQ+G SGP   +DSAH
Subjt:  CHSYNFHPLTASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH

A0A6I8PB04 Bassoon presynaptic cytomatrix protein8.8e-1241.15Show/hide
Query:  AGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVNAGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGL
        AGP V       GVGP+AGP AGP    G     A PRA                 GP AGPRAGPG   G     AGPRAG   G G   G      G 
Subjt:  AGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVNAGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGL

Query:  RAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEGVSNVGAGPRVGPNSGPEPKVGV-SGTRAGPRAGPKGVNSIVNGVG
          GP    G   G    G GPR GP  G G   G    G   R GP AGP    G    G GP  GP +GP P  G  +G RAGP AGPK       G G
Subjt:  RAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEGVSNVGAGPRVGPNSGPEPKVGV-SGTRAGPRAGPKGVNSIVNGVG

Query:  VGYKPGFGP---PGFGPRPGYWPRPG
        VG + G GP   PG GP  G   R G
Subjt:  VGYKPGFGP---PGFGPRPGYWPRPG

A0A6J1EU53 fibroin heavy chain-like2.6e-4344.41Show/hide
Query:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN
        M SLKYFLL PFVFLC+S TFAN V NS+DGS                G D     VG GP A P AGPGVE GV+NV A P AE          V +V 
Subjt:  MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVN

Query:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG
        AG KAGP+AGPG EG                       S+V AGLRAGPKA  G EG VSN+ AGP VGP+A  G EGGVS    SS GG +    V+  
Subjt:  AGPKAGPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEG

Query:  VSNVGAGPRVGPNSGPEPKVGVS-GTRAGPRAGPKGVNSIVNGVGVGYKPGFGPPGFGPRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSY
        ++ +G G            +GV  G R+G RAG       + G    + PG G  G G              ++C LGYVCP    R C KF YG C +Y
Subjt:  VSNVGAGPRVGPNSGPEPKVGVS-GTRAGPRAGPKGVNSIVNGVGVGYKPGFGPPGFGPRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSY

Query:  NFHPLTASTDLHEVDINWAR-SKPFATAQNG
         FHPL AS  LHEV++ WA+ SKP AT QNG
Subjt:  NFHPLTASTDLHEVDINWAR-SKPFATAQNG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCTCAAATATTTCCTTCTCTGTCCCTTTGTTTTCCTCTGTGTAAGCTACACCTTTGCCAATGGAGTCTTCAACTCCAATGATGGATCTCATTCTGGT
CCTAGGGCTTGGCCATTACCCGACCCGAGTGCTGGTCCAGGAGTTGATAGAGGGGTAAAGAATGTTGGGGTTGGCCCAAAAGCCGGACCGAGAGCTGGTCCAGGA
GTCGAGGGAGGAGTAACTAATGTCATTGCCGATCCAAGAGCCGAATCAAAAGCTGACTTGAAAGTCGAGGAAGGGGTAACTAATGTCAATGCTGGCCCAAAAGCC
GGACCGAGAGCTGGCCCAGGAGTTGAGGGAGGGGCAAGTAATGTCAATGCTGGTCCAAGAGCTGGAACAAGAGCTGGCCTAGGAGTTGAGAGAGGGGCAAGTAAT
GTCAATGCTGGTCTAAGAGCCGGACCTAAAGCTAGCCTGGGAGTCGAGGGAGGGGTAAGCAATATCGGTGCTGGTCCGAGAGTTGGACCAAAAGCTGGCCTAGGA
GTTGAGGGAGGGGTAAGCAATATTGGTGCTAGTTCGAGAGGTGGACCGAAAGCTGGCCCGAGAGTCGAGGAAGGGGTAAGCAATGTTGGTGCTGGTCCAAGAGTC
GGCCCCAATTCTGGCCCAGAACCTAAGGTAGGGGTAAGTGGTACTAGGGCTGGTCCGAGAGCGGGGCCAAAAGGTGTTAATTCAATTGTTAACGGAGTCGGAGTT
GGGTACAAGCCAGGATTTGGACCTCCTGGATTTGGGCCAAGGCCCGGGTATTGGCCTAGGCCAGGATTTGAACCGTACGATGATTGCATATTGGGCTATGTTTGT
CCAGCAAATGAAGCTAGGGAATGCAGCAAATTTGAGTATGGAACTTGCCATTCTTATAACTTTCATCCATTGACGGCTTCTACGGACCTACACGAAGTTGACATC
AATTGGGCCAGAAGCAAGCCTTTTGCAACGGCCCAAAATGGTGGATCTGGACCAGTTATTCAAATCGACTCAGCCCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTCTCAAATATTTCCTTCTCTGTCCCTTTGTTTTCCTCTGTGTAAGCTACACCTTTGCCAATGGAGTCTTCAACTCCAATGATGGATCTCATTCTGGT
CCTAGGGCTTGGCCATTACCCGACCCGAGTGCTGGTCCAGGAGTTGATAGAGGGGTAAAGAATGTTGGGGTTGGCCCAAAAGCCGGACCGAGAGCTGGTCCAGGA
GTCGAGGGAGGAGTAACTAATGTCATTGCCGATCCAAGAGCCGAATCAAAAGCTGACTTGAAAGTCGAGGAAGGGGTAACTAATGTCAATGCTGGCCCAAAAGCC
GGACCGAGAGCTGGCCCAGGAGTTGAGGGAGGGGCAAGTAATGTCAATGCTGGTCCAAGAGCTGGAACAAGAGCTGGCCTAGGAGTTGAGAGAGGGGCAAGTAAT
GTCAATGCTGGTCTAAGAGCCGGACCTAAAGCTAGCCTGGGAGTCGAGGGAGGGGTAAGCAATATCGGTGCTGGTCCGAGAGTTGGACCAAAAGCTGGCCTAGGA
GTTGAGGGAGGGGTAAGCAATATTGGTGCTAGTTCGAGAGGTGGACCGAAAGCTGGCCCGAGAGTCGAGGAAGGGGTAAGCAATGTTGGTGCTGGTCCAAGAGTC
GGCCCCAATTCTGGCCCAGAACCTAAGGTAGGGGTAAGTGGTACTAGGGCTGGTCCGAGAGCGGGGCCAAAAGGTGTTAATTCAATTGTTAACGGAGTCGGAGTT
GGGTACAAGCCAGGATTTGGACCTCCTGGATTTGGGCCAAGGCCCGGGTATTGGCCTAGGCCAGGATTTGAACCGTACGATGATTGCATATTGGGCTATGTTTGT
CCAGCAAATGAAGCTAGGGAATGCAGCAAATTTGAGTATGGAACTTGCCATTCTTATAACTTTCATCCATTGACGGCTTCTACGGACCTACACGAAGTTGACATC
AATTGGGCCAGAAGCAAGCCTTTTGCAACGGCCCAAAATGGTGGATCTGGACCAGTTATTCAAATCGACTCAGCCCACTAA
Protein sequenceShow/hide protein sequence
MASLKYFLLCPFVFLCVSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPKAGPRAGPGVEGGVTNVIADPRAESKADLKVEEGVTNVNAGPKA
GPRAGPGVEGGASNVNAGPRAGTRAGLGVERGASNVNAGLRAGPKASLGVEGGVSNIGAGPRVGPKAGLGVEGGVSNIGASSRGGPKAGPRVEEGVSNVGAGPRV
GPNSGPEPKVGVSGTRAGPRAGPKGVNSIVNGVGVGYKPGFGPPGFGPRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLTASTDLHEVDI
NWARSKPFATAQNGGSGPVIQIDSAH