; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G20980 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G20980
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionglycine-rich cell wall structural protein-like
Genome locationctg907:496863..502181
RNA-Seq ExpressionCucsat.G20980
SyntenyCucsat.G20980
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0020037 - heme binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649877.1 hypothetical protein Csa_012908 [Cucumis sativus]1.27e-176100Show/hide
Query:  MVLTNFNGAGVGFGFGVGCGFGIGWGFGGMPLNFLGLGAGGGCGVGLGLGWGFGTAFGIAISWRIDMTRNCLISRGHILFCLFLFNNILDAASTRKLMLG
        MVLTNFNGAGVGFGFGVGCGFGIGWGFGGMPLNFLGLGAGGGCGVGLGLGWGFGTAFGIAISWRIDMTRNCLISRGHILFCLFLFNNILDAASTRKLMLG
Subjt:  MVLTNFNGAGVGFGFGVGCGFGIGWGFGGMPLNFLGLGAGGGCGVGLGLGWGFGTAFGIAISWRIDMTRNCLISRGHILFCLFLFNNILDAASTRKLMLG

Query:  SGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIG
        SGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIG
Subjt:  SGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIG

Query:  GGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGG
        GGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGG
Subjt:  GGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGG

Query:  GYGGNMNSP
        GYGGNMNSP
Subjt:  GYGGNMNSP

KAG7024291.1 hypothetical protein SDJN02_13105, partial [Cucurbita argyrosperma subsp. argyrosperma]1.92e-5264.31Show/hide
Query:  IAISWRIDMTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGT-KGGAFGSG---SGS
        +A SW IDMTRN  +SRG +LFCL L  N+LDAA+ RKL L S +  HEMGNGLP+ +N KKEV+ LE T+GGYGGVS GGG G    G FGSG   SGS
Subjt:  IAISWRIDMTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGT-KGGAFGSG---SGS

Query:  GGTGGGGFGPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDS
        GG GGGG G GIGYGSDGG+GPGIGYGSGSG GG  GG GGS GIGGGGG+ GSGGI   GGG LGGSGGI GG  GG GG+ G GG GLG SGGI G  
Subjt:  GGTGGGGFGPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDS

Query:  GGGVRGSGGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP
        GGGV GSGG            GG  GSGGTGGG  G        GGGYGGNMN+P
Subjt:  GGGVRGSGGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP

XP_011654145.2 glycine-rich protein 5 isoform X1 [Cucumis sativus]5.75e-127100Show/hide
Query:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI
        MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI
Subjt:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI

Query:  GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG
        GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG
Subjt:  GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG

Query:  GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP
        GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP
Subjt:  GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP

XP_016901290.1 PREDICTED: glycine-rich cell wall structural protein-like [Cucumis melo]2.98e-8781.89Show/hide
Query:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI
        MTRN L+   HILFCLFLFNN+ DAASTRKLMLGS VITHEM N LPDYSNFK EVH LE  +GGYGGVSAGGGVGTKG  FGS SGSGGTGGGGFG GI
Subjt:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI

Query:  GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG
        GYGSDGGFGPGIGYGSGSGIGG  GGFGGSVGIGGGGG+SGSGGIVGSGGG LGGSGGI GG+GG  GG+VG+GG GLGNSGGIVGDSGGGV        
Subjt:  GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG

Query:  GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP
          SGSGG GGGYSGSGG+GGGYSGSGGMGDG GGGYGGN NSP
Subjt:  GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP

XP_022937240.1 acanthoscurrin-2-like isoform X1 [Cucurbita moschata]2.39e-4964.68Show/hide
Query:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGT-KGGAFGSG---SGSGGTGGGGF
        MTRN  +SRG +LFCL L  N+LDAA+ RKL L S +  HEMGNGLP+ +N KKEV+ LE T+GGYGGVS GGG G    G FGSG   SGSGG GGGGF
Subjt:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGT-KGGAFGSG---SGSGGTGGGGF

Query:  GPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGV-LGGSGGI----VGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGG
        G GIGYGSDGG+GPGIGYGS SG GG  GG GGS GIGGGG I GSGGI GSGGG  LGGSGGI    +GG GG  GGV G GG GLG SGGI G  GGG
Subjt:  GPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGV-LGGSGGI----VGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGG

Query:  VRGSGGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP
        V GSGG GGG  GSGGTGGG                   G GGGYGGNMN+P
Subjt:  VRGSGGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP

TrEMBL top hitse value%identityAlignment
A0A0A0L2J1 Uncharacterized protein1.11e-11995.88Show/hide
Query:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI
        MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI
Subjt:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI

Query:  GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG
        GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG
Subjt:  GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG

Query:  GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP
        GYSGSGG          TGGGYSGSGGMGDGVGGGYGGNMNSP
Subjt:  GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP

A0A1S4DZX5 glycine-rich cell wall structural protein-like1.44e-8781.89Show/hide
Query:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI
        MTRN L+   HILFCLFLFNN+ DAASTRKLMLGS VITHEM N LPDYSNFK EVH LE  +GGYGGVSAGGGVGTKG  FGS SGSGGTGGGGFG GI
Subjt:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI

Query:  GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG
        GYGSDGGFGPGIGYGSGSGIGG  GGFGGSVGIGGGGG+SGSGGIVGSGGG LGGSGGI GG+GG  GG+VG+GG GLGNSGGIVGDSGGGV        
Subjt:  GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG

Query:  GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP
          SGSGG GGGYSGSGG+GGGYSGSGGMGDG GGGYGGN NSP
Subjt:  GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP

A0A5A7VB20 Glycine-rich cell wall structural protein-like1.44e-8781.89Show/hide
Query:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI
        MTRN L+   HILFCLFLFNN+ DAASTRKLMLGS VITHEM N LPDYSNFK EVH LE  +GGYGGVSAGGGVGTKG  FGS SGSGGTGGGGFG GI
Subjt:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGI

Query:  GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG
        GYGSDGGFGPGIGYGSGSGIGG  GGFGGSVGIGGGGG+SGSGGIVGSGGG LGGSGGI GG+GG  GG+VG+GG GLGNSGGIVGDSGGGV        
Subjt:  GYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGG

Query:  GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP
          SGSGG GGGYSGSGG+GGGYSGSGGMGDG GGGYGGN NSP
Subjt:  GYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP

A0A6J1F9U1 acanthoscurrin-2-like isoform X21.32e-4964.92Show/hide
Query:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGT-KGGAFGSG---SGSGGTGGGGF
        MTRN  +SRG +LFCL L  N+LDAA+ RKL L S +  HEMGNGLP+ +N KKEV+ LE T+GGYGGVS GGG G    G FGSG   SGSGG GGGGF
Subjt:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGT-KGGAFGSG---SGSGGTGGGGF

Query:  GPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGV-LGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGS
        G GIGYGSDGG+GPGIGYGS SG GG  GG GGS GIGGGG I GSGGI GSGGG  LGGSGGI G   GG+GG  G GG GLG SGGI G  GGGV GS
Subjt:  GPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGV-LGGSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGS

Query:  GGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP
        GG GGG  GSGGTGGG                   G GGGYGGNMN+P
Subjt:  GGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP

A0A6J1FG14 acanthoscurrin-2-like isoform X11.16e-4964.68Show/hide
Query:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGT-KGGAFGSG---SGSGGTGGGGF
        MTRN  +SRG +LFCL L  N+LDAA+ RKL L S +  HEMGNGLP+ +N KKEV+ LE T+GGYGGVS GGG G    G FGSG   SGSGG GGGGF
Subjt:  MTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGNGLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGT-KGGAFGSG---SGSGGTGGGGF

Query:  GPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGV-LGGSGGI----VGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGG
        G GIGYGSDGG+GPGIGYGS SG GG  GG GGS GIGGGG I GSGGI GSGGG  LGGSGGI    +GG GG  GGV G GG GLG SGGI G  GGG
Subjt:  GPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGV-LGGSGGI----VGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGG

Query:  VRGSGGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP
        V GSGG GGG  GSGGTGGG                   G GGGYGGNMN+P
Subjt:  VRGSGGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTACTCACCAACTTCAATGGAGCTGGCGTTGGATTCGGTTTTGGTGTCGGTTGTGGGTTCGGCATAGGATGGGGTTTCGGAGGAATGCCTTTGAACTTCTTGGGCCT
TGGCGCGGGTGGTGGGTGTGGCGTTGGATTAGGACTTGGATGGGGATTTGGCACAGCTTTCGGGATTGCAATTAGTTGGCGAATAGACATGACAAGAAATTGTTTGATTT
CACGGGGGCACATCCTTTTCTGTCTTTTCCTCTTCAATAATATTCTTGACGCTGCCAGTACAAGAAAATTGATGCTTGGGAGTGGTGTGATTACTCATGAAATGGGAAAT
GGACTTCCAGACTACTCAAACTTTAAAAAGGAAGTTCATCGTTTAGAGTTCACGATTGGAGGTTACGGGGGAGTATCAGCTGGTGGTGGTGTCGGGACTAAGGGAGGAGC
CTTTGGCTCAGGTTCTGGGTCTGGTGGGACTGGAGGTGGTGGATTTGGTCCGGGCATTGGTTATGGATCGGATGGTGGCTTTGGACCTGGAATTGGGTATGGCTCAGGTT
CAGGCATTGGTGGTGTGAGTGGCGGGTTCGGTGGTTCAGTTGGCATTGGAGGTGGTGGTGGAATAAGTGGGTCAGGTGGTATTGTGGGCAGCGGTGGCGGTGTACTAGGC
GGTTCAGGTGGCATCGTGGGTGGCATTGGCGGTGGAGTAGGTGGTGTTGTGGGCAATGGTGGCAGAGGACTAGGCAATTCAGGTGGTATTGTGGGTGACAGTGGCGGTGG
AGTAAGAGGGTCGGGAGGTACGGGTGGTGGTTATAGTGGTTCGGGAGGCACGGGTGGTGGTTATAGTGGTTCGGGAGGCACAGGCGGTGGTTATAGTGGTTCGGGAGGCA
TGGGTGATGGTGTTGGTGGTGGTTATGGAGGCAACATGAATAGTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTACTCACCAACTTCAATGGAGCTGGCGTTGGATTCGGTTTTGGTGTCGGTTGTGGGTTCGGCATAGGATGGGGTTTCGGAGGAATGCCTTTGAACTTCTTGGGCCT
TGGCGCGGGTGGTGGGTGTGGCGTTGGATTAGGACTTGGATGGGGATTTGGCACAGCTTTCGGGATTGCAATTAGTTGGCGAATAGACATGACAAGAAATTGTTTGATTT
CACGGGGGCACATCCTTTTCTGTCTTTTCCTCTTCAATAATATTCTTGACGCTGCCAGTACAAGAAAATTGATGCTTGGGAGTGGTGTGATTACTCATGAAATGGGAAAT
GGACTTCCAGACTACTCAAACTTTAAAAAGGAAGTTCATCGTTTAGAGTTCACGATTGGAGGTTACGGGGGAGTATCAGCTGGTGGTGGTGTCGGGACTAAGGGAGGAGC
CTTTGGCTCAGGTTCTGGGTCTGGTGGGACTGGAGGTGGTGGATTTGGTCCGGGCATTGGTTATGGATCGGATGGTGGCTTTGGACCTGGAATTGGGTATGGCTCAGGTT
CAGGCATTGGTGGTGTGAGTGGCGGGTTCGGTGGTTCAGTTGGCATTGGAGGTGGTGGTGGAATAAGTGGGTCAGGTGGTATTGTGGGCAGCGGTGGCGGTGTACTAGGC
GGTTCAGGTGGCATCGTGGGTGGCATTGGCGGTGGAGTAGGTGGTGTTGTGGGCAATGGTGGCAGAGGACTAGGCAATTCAGGTGGTATTGTGGGTGACAGTGGCGGTGG
AGTAAGAGGGTCGGGAGGTACGGGTGGTGGTTATAGTGGTTCGGGAGGCACGGGTGGTGGTTATAGTGGTTCGGGAGGCACAGGCGGTGGTTATAGTGGTTCGGGAGGCA
TGGGTGATGGTGTTGGTGGTGGTTATGGAGGCAACATGAATAGTCCTTGA
Protein sequenceShow/hide protein sequence
MVLTNFNGAGVGFGFGVGCGFGIGWGFGGMPLNFLGLGAGGGCGVGLGLGWGFGTAFGIAISWRIDMTRNCLISRGHILFCLFLFNNILDAASTRKLMLGSGVITHEMGN
GLPDYSNFKKEVHRLEFTIGGYGGVSAGGGVGTKGGAFGSGSGSGGTGGGGFGPGIGYGSDGGFGPGIGYGSGSGIGGVSGGFGGSVGIGGGGGISGSGGIVGSGGGVLG
GSGGIVGGIGGGVGGVVGNGGRGLGNSGGIVGDSGGGVRGSGGTGGGYSGSGGTGGGYSGSGGTGGGYSGSGGMGDGVGGGYGGNMNSP