; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0025998 (gene) of Chayote v1 genome

Gene IDSed0025998
OrganismSechium edule (Chayote v1)
DescriptionTail fiber
Genome locationLG01:10176877..10179521
RNA-Seq ExpressionSed0025998
SyntenySed0025998
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585460.1 hypothetical protein SDJN03_18193, partial [Cucurbita argyrosperma subsp. sororia]5.0e-10089.35Show/hide
Query:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN
        MATSTPSSSS+ASK WIRN+SS+ASRIYFFLIILQIPLF+VPCRSGMCTTPLHVTSSQLI+SEVFP PVVKALLYPGAIVNGL+MNLTVPSWS+LLD+YN
Subjt:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN

Query:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKK-AVRSAPA
        LTN+KEA+A+TDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKP N DP KAVYVYPTMI+A+ICAFSSVKYD+KK A RSAPA
Subjt:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKK-AVRSAPA

Query:  QPIAKPLKSSSKSKLK
        +PIAKPL+SSSKSKLK
Subjt:  QPIAKPLKSSSKSKLK

XP_016900009.1 PREDICTED: uncharacterized protein LOC103488297 isoform X3 [Cucumis melo]5.6e-9986.98Show/hide
Query:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN
        MATSTPSS+S+ASK W+RN+SS+ASR+YF LIILQIPLF++ CRSGMCTTPLHVTSSQLI+SEVFP PVVKALLYPGA+VNGL+MNLTVPSWS+L DIYN
Subjt:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN

Query:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ
        LTN+KEA+AVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKP N DP KAVYVYPTMILA+ICAFSSVKYDVKK VR APA+
Subjt:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ

Query:  PIAKPLKSSSKSKLK
        PIAKPL+SSSKSKLK
Subjt:  PIAKPLKSSSKSKLK

XP_022131291.1 uncharacterized protein LOC111004557 [Momordica charantia]7.8e-10188.84Show/hide
Query:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN
        MA  TPSS+S+ S+GW+RN+SS+ASRI+FFLIILQIPLF++PCRSGMCTTPLHVTSSQLI+SEVFP PVVKALLYPGAIVNGL+MNLTVPSWSNLLDIYN
Subjt:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN

Query:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ
        LTNVKEA+AVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKP N DPTK+VYVYPTMILA+ICAFSSVKYDVKK VRSAPA+
Subjt:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ

Query:  PIAKPLKSSSKSKLK
        PIAKPL+SSSKSKLK
Subjt:  PIAKPLKSSSKSKLK

XP_023002496.1 uncharacterized protein LOC111496320 [Cucurbita maxima]1.5e-9988.37Show/hide
Query:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN
        M TST SSSS+ASK WIRN+SS+ASRIYFFLIILQIPLF+VPCRSGMCTTPLHVTSSQLI+SEVFP PVVKALLYPGAIVNGL+MNLTVPSWS+LLD+YN
Subjt:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN

Query:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ
        LTN+KEA+A+TDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKP N DP KAVYVYPTMI+A+ICAFSSVKYD+KK  RSAPA+
Subjt:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ

Query:  PIAKPLKSSSKSKLK
        PIAKPL+SSSKSKLK
Subjt:  PIAKPLKSSSKSKLK

XP_038883986.1 uncharacterized protein LOC120074948 [Benincasa hispida]5.0e-10088.84Show/hide
Query:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN
        MATST SS+S+ASK WIRN+SS+ASRIYFFLIILQIPLF++PCRSGMCTTPLHVTSSQLI+SEVFP PVVKALLYPGAIVNGL+MNLTVPSW+NL DIYN
Subjt:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN

Query:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ
        LTN+KEA+AVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKP N DP KAVYVYPTMILA+ICAFSSVKYDVKK VR APA+
Subjt:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ

Query:  PIAKPLKSSSKSKLK
        PIAKPL+SSSKSKLK
Subjt:  PIAKPLKSSSKSKLK

TrEMBL top hitse value%identityAlignment
A0A1S3BCZ0 uncharacterized protein LOC103488297 isoform X42.7e-9986.98Show/hide
Query:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN
        MATSTPSS+S+ASK W+RN+SS+ASR+YF LIILQIPLF++ CRSGMCTTPLHVTSSQLI+SEVFP PVVKALLYPGA+VNGL+MNLTVPSWS+L DIYN
Subjt:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN

Query:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ
        LTN+KEA+AVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKP N DP KAVYVYPTMILA+ICAFSSVKYDVKK VR APA+
Subjt:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ

Query:  PIAKPLKSSSKSKLK
        PIAKPL+SSSKSKLK
Subjt:  PIAKPLKSSSKSKLK

A0A1S4DVJ9 uncharacterized protein LOC103488297 isoform X32.7e-9986.98Show/hide
Query:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN
        MATSTPSS+S+ASK W+RN+SS+ASR+YF LIILQIPLF++ CRSGMCTTPLHVTSSQLI+SEVFP PVVKALLYPGA+VNGL+MNLTVPSWS+L DIYN
Subjt:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN

Query:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ
        LTN+KEA+AVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKP N DP KAVYVYPTMILA+ICAFSSVKYDVKK VR APA+
Subjt:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ

Query:  PIAKPLKSSSKSKLK
        PIAKPL+SSSKSKLK
Subjt:  PIAKPLKSSSKSKLK

A0A5A7VF29 Uncharacterized protein2.7e-9986.98Show/hide
Query:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN
        MATSTPSS+S+ASK W+RN+SS+ASR+YF LIILQIPLF++ CRSGMCTTPLHVTSSQLI+SEVFP PVVKALLYPGA+VNGL+MNLTVPSWS+L DIYN
Subjt:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN

Query:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ
        LTN+KEA+AVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKP N DP KAVYVYPTMILA+ICAFSSVKYDVKK VR APA+
Subjt:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ

Query:  PIAKPLKSSSKSKLK
        PIAKPL+SSSKSKLK
Subjt:  PIAKPLKSSSKSKLK

A0A6J1BP64 uncharacterized protein LOC1110045573.8e-10188.84Show/hide
Query:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN
        MA  TPSS+S+ S+GW+RN+SS+ASRI+FFLIILQIPLF++PCRSGMCTTPLHVTSSQLI+SEVFP PVVKALLYPGAIVNGL+MNLTVPSWSNLLDIYN
Subjt:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN

Query:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ
        LTNVKEA+AVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKP N DPTK+VYVYPTMILA+ICAFSSVKYDVKK VRSAPA+
Subjt:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ

Query:  PIAKPLKSSSKSKLK
        PIAKPL+SSSKSKLK
Subjt:  PIAKPLKSSSKSKLK

A0A6J1KLG8 uncharacterized protein LOC1114963207.1e-10088.37Show/hide
Query:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN
        M TST SSSS+ASK WIRN+SS+ASRIYFFLIILQIPLF+VPCRSGMCTTPLHVTSSQLI+SEVFP PVVKALLYPGAIVNGL+MNLTVPSWS+LLD+YN
Subjt:  MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYN

Query:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ
        LTN+KEA+A+TDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKP N DP KAVYVYPTMI+A+ICAFSSVKYD+KK  RSAPA+
Subjt:  LTNVKEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQ

Query:  PIAKPLKSSSKSKLK
        PIAKPL+SSSKSKLK
Subjt:  PIAKPLKSSSKSKLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G80200.1 unknown protein1.7e-2933.19Show/hide
Query:  ASKGWIRNVSSVASRIYFFLIILQIPLFK-----------------------------VPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNG
        + K W   +S +AS ++  LI+ QIPLF+                             V CR+  C TPL V SS+LI++++ P  +VK LLYPGAI   
Subjt:  ASKGWIRNVSSVASRIYFFLIILQIPLFK-----------------------------VPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNG

Query:  LLMNLTVPSWSNLLDIYNLTNV-KEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPEN---KDPTKAVYVYPTMILA
        L     +PS+ +L   Y+   + + ++  TD+  LEV AGS   + GA + L KP R++  G+LL+ WGL+++ +L    +        +V VYPT+ LA
Subjt:  LLMNLTVPSWSNLLDIYNLTNV-KEAAAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPEN---KDPTKAVYVYPTMILA

Query:  IICAFSSVKYDVKKAVRSAPAQPIAKPLK
         + AF S++ DV+K +R   +  ++K  K
Subjt:  IICAFSSVKYDVKKAVRSAPAQPIAKPLK

AT5G11280.1 unknown protein3.5e-9178.82Show/hide
Query:  SKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYNLTNVKEAAAVTD
        SKGWI+  +S+AS +YF LI+ QIPLF+VPCRSGMC++P+HVTSSQLISSE+FP P++KALLYPGA+VNGL +N+T P W N+LDIYNLTNVKEA+AVTD
Subjt:  SKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYNLTNVKEAAAVTD

Query:  LQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQPIAKPLKSSSKS
        LQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLL++WGLVKEGILGKP N DP K VYVYPTM+LA+ICAFS +KYD++KA R+APA+PIAKPL SSSKS
Subjt:  LQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQPIAKPLKSSSKS

Query:  KLK
        KLK
Subjt:  KLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACCTCAACGCCATCGTCGTCATCTTCAGCTTCAAAAGGTTGGATCAGGAATGTTTCATCCGTTGCTTCTCGCATCTACTTCTTCCTTATCATACTTCAGATCCC
TCTCTTCAAGGTCCCATGCAGATCTGGCATGTGTACAACCCCGTTGCACGTGACTTCATCGCAGTTGATTTCAAGTGAAGTCTTTCCTCAACCTGTAGTGAAGGCACTTC
TCTATCCAGGAGCAATTGTGAATGGCCTTCTCATGAACTTGACAGTTCCCAGCTGGAGTAACTTGTTAGACATCTATAACTTGACCAATGTGAAAGAAGCCGCTGCTGTG
ACTGATCTTCAACGCTTAGAGGTTCTTGCCGGAAGCTATTTTTCAGTGGCGGGTGCGTTTGTGGGTCTTTTGAAGCCCGGGAGAATGAGCATGTTTGGAAGCCTGTTGGT
AATTTGGGGTCTTGTTAAGGAAGGAATCCTGGGAAAACCTGAGAACAAAGATCCTACCAAGGCTGTTTATGTCTATCCTACAATGATTCTTGCTATCATCTGTGCTTTCT
CATCGGTTAAGTATGATGTGAAGAAGGCTGTTAGAAGTGCCCCTGCTCAACCGATTGCAAAGCCACTTAAAAGCTCATCCAAATCTAAGCTTAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATTACCAGATCTTTCCTCCATCTTCCTCGTCTGACCTTTCTCAAGTTGTCAACAATGGCAACCTCAACGCCATCGTCGTCATCTTCAGCTTCAAAAGGTTGGATCAGGAA
TGTTTCATCCGTTGCTTCTCGCATCTACTTCTTCCTTATCATACTTCAGATCCCTCTCTTCAAGGTCCCATGCAGATCTGGCATGTGTACAACCCCGTTGCACGTGACTT
CATCGCAGTTGATTTCAAGTGAAGTCTTTCCTCAACCTGTAGTGAAGGCACTTCTCTATCCAGGAGCAATTGTGAATGGCCTTCTCATGAACTTGACAGTTCCCAGCTGG
AGTAACTTGTTAGACATCTATAACTTGACCAATGTGAAAGAAGCCGCTGCTGTGACTGATCTTCAACGCTTAGAGGTTCTTGCCGGAAGCTATTTTTCAGTGGCGGGTGC
GTTTGTGGGTCTTTTGAAGCCCGGGAGAATGAGCATGTTTGGAAGCCTGTTGGTAATTTGGGGTCTTGTTAAGGAAGGAATCCTGGGAAAACCTGAGAACAAAGATCCTA
CCAAGGCTGTTTATGTCTATCCTACAATGATTCTTGCTATCATCTGTGCTTTCTCATCGGTTAAGTATGATGTGAAGAAGGCTGTTAGAAGTGCCCCTGCTCAACCGATT
GCAAAGCCACTTAAAAGCTCATCCAAATCTAAGCTTAAGTGAGTCGAAATGTAATTTCCTTCTCTACCTGTTTTGTTTGATTCTCACTCCTTAAAGAGTCCGTACATTAT
TTATGCATTCCAAGTTTCATTTTCTGGATGTTTTGGCGTGTCCTCGTTATCTGCATCTTTCTTCCCAGACATCCTTTTCAGCACTTCCCCAACTTTTCTTAACTTGAGTT
TGGTAGATCTGGATTTATTGGGTTCGTTTATATGAATTAGAAAATAGGTAGCACAAACACTCCTATTTTGACTATCTGACATTTGAAATTTTGTGTCCTCCT
Protein sequenceShow/hide protein sequence
MATSTPSSSSSASKGWIRNVSSVASRIYFFLIILQIPLFKVPCRSGMCTTPLHVTSSQLISSEVFPQPVVKALLYPGAIVNGLLMNLTVPSWSNLLDIYNLTNVKEAAAV
TDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPENKDPTKAVYVYPTMILAIICAFSSVKYDVKKAVRSAPAQPIAKPLKSSSKSKLK