; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018161 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018161
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionsyndetin
Genome locationChr04:1120924..1127642
RNA-Seq ExpressionHG10018161
SyntenyHG10018161
Gene Ontology termsGO:0032456 - endocytic recycling (biological process)
GO:0042147 - retrograde transport, endosome to Golgi (biological process)
GO:1990745 - EARP complex (cellular component)
InterPro domainsIPR040047 - Syndetin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008462106.1 PREDICTED: syndetin [Cucumis melo]4.6e-5055.6Show/hide
Query:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV
        MQPNLFPFGS LGNPLAFDGDLSEGFETSRFLFFVPF LLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPE                         
Subjt:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV

Query:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF
                                                                                    VPARAVAAAAVARALAGLPPHQRF
Subjt:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF

Query:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
Subjt:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

XP_011654226.1 syndetin isoform X1 [Cucumis sativus]1.7e-4955.17Show/hide
Query:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV
        MQPNLFPFGS LGNPLAFDGDLSEGFET RFLFFVPF LLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPE                         
Subjt:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV

Query:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF
                                                                                    VPARAVAAAAVARALAGLPPHQRF
Subjt:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF

Query:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
Subjt:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

XP_022152900.1 syndetin isoform X1 [Momordica charantia]1.1e-4854.74Show/hide
Query:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV
        MQPNLFPFGSVLGNPLAFDGDLSEGF TSRFLFFVPFFLLQGGGMDLS+VGEKILSSVRSARSLGLLP TSDRPE                         
Subjt:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV

Query:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF
                                                                                    VPARAVAAAAVARALAGLPPHQRF
Subjt:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF

Query:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        SLSSSSEELSSIYGSRNHG EVEELEEVFYEE
Subjt:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

XP_022152901.1 syndetin isoform X2 [Momordica charantia]1.1e-4854.74Show/hide
Query:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV
        MQPNLFPFGSVLGNPLAFDGDLSEGF TSRFLFFVPFFLLQGGGMDLS+VGEKILSSVRSARSLGLLP TSDRPE                         
Subjt:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV

Query:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF
                                                                                    VPARAVAAAAVARALAGLPPHQRF
Subjt:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF

Query:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        SLSSSSEELSSIYGSRNHG EVEELEEVFYEE
Subjt:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

XP_038895533.1 syndetin isoform X1 [Benincasa hispida]4.6e-5055.6Show/hide
Query:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV
        MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPF LLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPE                         
Subjt:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV

Query:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF
                                                                                    VPARAVAAAAVAR LAGLPPHQRF
Subjt:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF

Query:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
Subjt:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

TrEMBL top hitse value%identityAlignment
A0A0A0LV98 Vps54_N domain-containing protein8.5e-5055.17Show/hide
Query:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV
        MQPNLFPFGS LGNPLAFDGDLSEGFET RFLFFVPF LLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPE                         
Subjt:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV

Query:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF
                                                                                    VPARAVAAAAVARALAGLPPHQRF
Subjt:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF

Query:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
Subjt:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

A0A1S3CG39 syndetin2.2e-5055.6Show/hide
Query:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV
        MQPNLFPFGS LGNPLAFDGDLSEGFETSRFLFFVPF LLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPE                         
Subjt:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV

Query:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF
                                                                                    VPARAVAAAAVARALAGLPPHQRF
Subjt:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF

Query:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
Subjt:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

A0A6J1DF95 syndetin isoform X15.5e-4954.74Show/hide
Query:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV
        MQPNLFPFGSVLGNPLAFDGDLSEGF TSRFLFFVPFFLLQGGGMDLS+VGEKILSSVRSARSLGLLP TSDRPE                         
Subjt:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV

Query:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF
                                                                                    VPARAVAAAAVARALAGLPPHQRF
Subjt:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF

Query:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        SLSSSSEELSSIYGSRNHG EVEELEEVFYEE
Subjt:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

A0A6J1DJ46 syndetin isoform X25.5e-4954.74Show/hide
Query:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV
        MQPNLFPFGSVLGNPLAFDGDLSEGF TSRFLFFVPFFLLQGGGMDLS+VGEKILSSVRSARSLGLLP TSDRPE                         
Subjt:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV

Query:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF
                                                                                    VPARAVAAAAVARALAGLPPHQRF
Subjt:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF

Query:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        SLSSSSEELSSIYGSRNHG EVEELEEVFYEE
Subjt:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

A0A6J1HWS6 syndetin1.0e-4753.45Show/hide
Query:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV
        MQPNLFPFGSVLGNPL ++GDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPT +DRPE                         
Subjt:  MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKV

Query:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF
                                                                                    VPARAVAAAAVARALAGLPPHQRF
Subjt:  LIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRF

Query:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        SL SSSEELSSIYGSR+HGHEVEELEEVFYEE
Subjt:  SLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27900.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2451, C-terminal (InterPro:IPR019514), Vacuolar protein sorting-associated protein 54 (InterPro:IPR019515); Has 316 Blast hits to 252 proteins in 92 species: Archae - 0; Bacteria - 2; Metazoa - 200; Fungi - 2; Plants - 68; Viruses - 0; Other Eukaryotes - 44 (source: NCBI BLink).2.2e-2639.51Show/hide
Query:  MQPNL-FPFGSVLGNPLAFD--GDLSE-----GFETSRFLFFVPFFLLQGGG-MDLSKVGEKILSSVRSARSLGLL--PTTSDRPEGIQVLHLMLGSWKF
        MQPNL FPFGSVLGNP  F+  GDL+E      FE+SR  F +PF L QG G MDLSKVGEK LSSV+SA SLGLL  P+ SDRPE              
Subjt:  MQPNL-FPFGSVLGNPLAFD--GDLSE-----GFETSRFLFFVPFFLLQGGG-MDLSKVGEKILSSVRSARSLGLL--PTTSDRPEGIQVLHLMLGSWKF

Query:  QVFSLLHLPKVLIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVAR
                                                                                               +PARA AAAAVAR
Subjt:  QVFSLLHLPKVLIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVAR

Query:  ALAGLPPHQRFSLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        ALAGLP  QR S+SS++ EL+SIYG+R    +VEELEE FYEE
Subjt:  ALAGLPPHQRFSLSSSSEELSSIYGSRNHGHEVEELEEVFYEE

AT2G27900.2 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2451, C-terminal (InterPro:IPR019514), Vacuolar protein sorting-associated protein 54 (InterPro:IPR019515); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).2.2e-2639.51Show/hide
Query:  MQPNL-FPFGSVLGNPLAFD--GDLSE-----GFETSRFLFFVPFFLLQGGG-MDLSKVGEKILSSVRSARSLGLL--PTTSDRPEGIQVLHLMLGSWKF
        MQPNL FPFGSVLGNP  F+  GDL+E      FE+SR  F +PF L QG G MDLSKVGEK LSSV+SA SLGLL  P+ SDRPE              
Subjt:  MQPNL-FPFGSVLGNPLAFD--GDLSE-----GFETSRFLFFVPFFLLQGGG-MDLSKVGEKILSSVRSARSLGLL--PTTSDRPEGIQVLHLMLGSWKF

Query:  QVFSLLHLPKVLIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVAR
                                                                                               +PARA AAAAVAR
Subjt:  QVFSLLHLPKVLIKLLEGGLGLLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVAR

Query:  ALAGLPPHQRFSLSSSSEELSSIYGSRNHGHEVEELEEVFYEE
        ALAGLP  QR S+SS++ EL+SIYG+R    +VEELEE FYEE
Subjt:  ALAGLPPHQRFSLSSSSEELSSIYGSRNHGHEVEELEEVFYEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCCAAACCTGTTTCCATTTGGTAGTGTTCTCGGAAACCCTTTGGCGTTCGATGGTGATTTGAGCGAAGGATTCGAGACTTCCAGGTTTCTCTTCTTCGTTCCGTT
TTTTCTGCTGCAAGGAGGTGGAATGGACTTGTCCAAGGTTGGGGAGAAGATTTTGAGCTCCGTGAGGTCAGCTAGATCGCTTGGACTTCTTCCTACCACTTCTGATCGGC
CGGAGGGCATTCAGGTGCTTCATCTTATGCTGGGTTCTTGGAAGTTTCAAGTGTTTTCTTTGTTGCACCTTCCTAAAGTACTGATCAAGTTGCTTGAAGGTGGTCTTGGC
TTGCTTTCTGTGGCTCTTTTCATGGAGTTCTTTAATGGTTTTTTAGTTGTCTTCCTTTTCAACACGGAATCTTTGGTTTTTGTCTCAAGTGTTATTAATTTACCCAAGGT
CCTCAAGAATTCAGTAAATTGGGAGGATAATGAGTCAGATTCAGAATTGAGCCTTAGCCGTGGTGATTTGGATTTTAGTCTTCCAACGGTTCCAGCACGTGCTGTGGCTG
CGGCAGCTGTTGCCCGTGCACTTGCAGGATTGCCTCCTCACCAAAGATTTAGTCTCTCATCTAGCTCGGAAGAACTGAGCTCAATATATGGCAGTAGAAATCATGGCCAC
GAAGTAGAGGAACTAGAAGAAGTTTTCTATGAAGAGGTGCTACTTGCTATTGTTACTATCATTTACTTGGGATCATTCATTTACAGGCATGATTCTTGTAAATACTATGT
TACATTTTTAAAATTATTAACCTCTTCATGTATTTTAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGCCAAACCTGTTTCCATTTGGTAGTGTTCTCGGAAACCCTTTGGCGTTCGATGGTGATTTGAGCGAAGGATTCGAGACTTCCAGGTTTCTCTTCTTCGTTCCGTT
TTTTCTGCTGCAAGGAGGTGGAATGGACTTGTCCAAGGTTGGGGAGAAGATTTTGAGCTCCGTGAGGTCAGCTAGATCGCTTGGACTTCTTCCTACCACTTCTGATCGGC
CGGAGGGCATTCAGGTGCTTCATCTTATGCTGGGTTCTTGGAAGTTTCAAGTGTTTTCTTTGTTGCACCTTCCTAAAGTACTGATCAAGTTGCTTGAAGGTGGTCTTGGC
TTGCTTTCTGTGGCTCTTTTCATGGAGTTCTTTAATGGTTTTTTAGTTGTCTTCCTTTTCAACACGGAATCTTTGGTTTTTGTCTCAAGTGTTATTAATTTACCCAAGGT
CCTCAAGAATTCAGTAAATTGGGAGGATAATGAGTCAGATTCAGAATTGAGCCTTAGCCGTGGTGATTTGGATTTTAGTCTTCCAACGGTTCCAGCACGTGCTGTGGCTG
CGGCAGCTGTTGCCCGTGCACTTGCAGGATTGCCTCCTCACCAAAGATTTAGTCTCTCATCTAGCTCGGAAGAACTGAGCTCAATATATGGCAGTAGAAATCATGGCCAC
GAAGTAGAGGAACTAGAAGAAGTTTTCTATGAAGAGGTGCTACTTGCTATTGTTACTATCATTTACTTGGGATCATTCATTTACAGGCATGATTCTTGTAAATACTATGT
TACATTTTTAAAATTATTAACCTCTTCATGTATTTTAGATTAA
Protein sequenceShow/hide protein sequence
MQPNLFPFGSVLGNPLAFDGDLSEGFETSRFLFFVPFFLLQGGGMDLSKVGEKILSSVRSARSLGLLPTTSDRPEGIQVLHLMLGSWKFQVFSLLHLPKVLIKLLEGGLG
LLSVALFMEFFNGFLVVFLFNTESLVFVSSVINLPKVLKNSVNWEDNESDSELSLSRGDLDFSLPTVPARAVAAAAVARALAGLPPHQRFSLSSSSEELSSIYGSRNHGH
EVEELEEVFYEEVLLAIVTIIYLGSFIYRHDSCKYYVTFLKLLTSSCILD