CuGenDBv2

CsGy7G004880 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy7G004880
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Glycosyltransferase
Genome location: Gy14Chr7:3689208..3689998
RNA-Seq Expression: CsGy7G004880
Synteny: CsGy7G004880
Gene Ontology terms: GO:0080043 - quercetin 3-O-glucosyltransferase activity (molecular function)
GO:0080044 - quercetin 7-O-glucosyltransferase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAE8645796.1 hypothetical protein Csa_017196 [Cucumis sativus] (e value: 9.60e-89; % identity: 99.24)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQH+PSLCYSASR
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT
        NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT
Subjt:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT

XP_008455233.1 PREDICTED: 7-deoxyloganetin glucosyltransferase-like [Cucumis melo] (e value: 1.82e-80; % identity: 92.48)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSL+KVDQ  QQPHAV  PYPSQGHISPMLKLAKLFHHKGFH+TFVNTEYNHRRLLRSRGPNSLDGLPDF FRAIPDGLPPSDGN+TQHVPSLCYS SR
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSG-TVPPVSCIIGDGVMT
        NCLAPLCSLISEINSSG TVPPVSCIIGDGVMT
Subjt:  NCLAPLCSLISEINSSG-TVPPVSCIIGDGVMT

XP_011658777.2 LOW QUALITY PROTEIN: 7-deoxyloganetin glucosyltransferase [Cucumis sativus] (e value: 2.80e-82; % identity: 92.42)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSLTKV+QGK QPHAV  PYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRS GPNSLDGLPDFHFRAIPDGLPPS+GN+TQHVPSLCYS SR
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT
        NCLAP CSLISEINSSGTVPPVSCIIGDG+MT
Subjt:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT

XP_031744662.1 7-deoxyloganetin glucosyltransferase-like [Cucumis sativus] (e value: 3.20e-95; % identity: 100)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMTC
        NCLAPLCSLISEINSSGTVPPVSCIIGDGVMTC
Subjt:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMTC

XP_031744863.1 7-deoxyloganetin glucosyltransferase [Cucumis sativus] (e value: 9.60e-89; % identity: 99.24)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQH+PSLCYSASR
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT
        NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT
Subjt:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K1Z5 Uncharacterized protein (e value: 1.07e-84; % identity: 94.7)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSLTKV+QGK QPHAV  PYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYS SR
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT
        NCLAP CSLISEINSSGTVPPVSCIIGDG+MT
Subjt:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT

A0A0A0K5W9 Uncharacterized protein (e value: 3.30e-86; % identity: 100)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSGTVPPVSC
        NCLAPLCSLISEINSSGTVPPVSC
Subjt:  NCLAPLCSLISEINSSGTVPPVSC

A0A1S3C1P0 7-deoxyloganetin glucosyltransferase-like (e value: 8.82e-81; % identity: 92.48)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSL+KVDQ  QQPHAV  PYPSQGHISPMLKLAKLFHHKGFH+TFVNTEYNHRRLLRSRGPNSLDGLPDF FRAIPDGLPPSDGN+TQHVPSLCYS SR
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSG-TVPPVSCIIGDGVMT
        NCLAPLCSLISEINSSG TVPPVSCIIGDGVMT
Subjt:  NCLAPLCSLISEINSSG-TVPPVSCIIGDGVMT

A0A5D3C9G1 7-deoxyloganetin glucosyltransferase-like (e value: 8.82e-81; % identity: 92.48)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSL+KVDQ  QQPHAV  PYPSQGHISPMLKLAKLFHHKGFH+TFVNTEYNHRRLLRSRGPNSLDGLPDF FRAIPDGLPPSDGN+TQHVPSLCYS SR
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSG-TVPPVSCIIGDGVMT
        NCLAPLCSLISEINSSG TVPPVSCIIGDGVMT
Subjt:  NCLAPLCSLISEINSSG-TVPPVSCIIGDGVMT

A0A6J1DLI8 Glycosyltransferase (e value: 2.28e-64; % identity: 75.56)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSL+K+++    PHA+  PYP+QGHI+PMLKLAKL H KGF+ITFVNTEYNHRRLL++RG NSLDGLPDF F+ IPDGLPPS+GN+TQHVPSLCYS S 
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSS---GTVPPVSCIIGDGVMT
        NCL PLCSLISEINSS   G+ PPVSC+IGDGVMT
Subjt:  NCLAPLCSLISEINSS---GTVPPVSCIIGDGVMT

SwissProt top hits (e value, % identity, alignment)
F8WKW1 7-deoxyloganetin glucosyltransferase (e value: 3.3e-44; % identity: 65.85)
Query:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS
        ++ HAV  PYP+QGHI+PMLKLAK+ HHKGFHITFVNTE+NH+RLL+SRGP++L+GLPDF F+ IPDGLPPSD ++TQ +PSLC S +  CL P  +L++
Subjt:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS

Query:  EIN--SSGTVPPVSCIIGDGVMT
        E+N  SS  VPPVSCI+ DGVM+
Subjt:  EIN--SSGTVPPVSCIIGDGVMT

F8WLS6 7-deoxyloganetin glucosyltransferase (e value: 6.3e-43; % identity: 61.94)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGSL+  D  K +PHAV  PYP+QGHI+PMLKLAKL H+KGFHITFVNTE+NH+RLL+SRG +SL GL  F F+ IPDGLPPSD ++TQ +PSLC S + 
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEIN--SSGTVPPVSCIIGDGVMT
        +CL P   L+ ++N  SS  VPPVSC++ D VM+
Subjt:  NCLAPLCSLISEIN--SSGTVPPVSCIIGDGVMT

G3FIN8 Linamarin synthase 1 (e value: 1.3e-40; % identity: 58.02)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGS++     ++ PHA+L PYP+QGH++P+++L KL H +GF+ITFVNTE+NHRRL+RSRG   +DGLPDF F AIPDGLP +D ++TQHVPSL  S  +
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVM
        +CLAP   LI+++ +S  VPP++CII DGVM
Subjt:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVM

Q9LME8 UDP-glycosyltransferase 85A7 (e value: 3.7e-43; % identity: 63.64)
Query:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS
        Q+PH V  PYP+QGHI+PMLK+AKL + KGFH+TFVNT YNH RLLRSRGPN+LDG P F F +IPDGLP +DG+ TQH P++C S  +NCLAP   ++ 
Subjt:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS

Query:  EINSSGTVPPVSCIIGDGVMT
         IN    VPPVSCI+ DGVM+
Subjt:  EINSSGTVPPVSCIIGDGVMT

Q9LMF0 UDP-glycosyltransferase 85A5 (e value: 5.9e-41; % identity: 61.98)
Query:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS
        Q+PH V  P+P+QGHI+PMLK+AKL + +GFH+TFVNT YNH RL+RSRGPNSLDGLP F F +IPDGLP  + +  Q VP+LC S  +NCLAP   L+ 
Subjt:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS

Query:  EINSSGTVPPVSCIIGDGVMT
         IN++  VPPVSCI+ DGVM+
Subjt:  EINSSGTVPPVSCIIGDGVMT

Arabidopsis top hits (e value, % identity, alignment)
AT1G22340.1 UDP-glucosyl transferase 85A7 (e value: 2.6e-44; % identity: 63.64)
Query:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS
        Q+PH V  PYP+QGHI+PMLK+AKL + KGFH+TFVNT YNH RLLRSRGPN+LDG P F F +IPDGLP +DG+ TQH P++C S  +NCLAP   ++ 
Subjt:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS

Query:  EINSSGTVPPVSCIIGDGVMT
         IN    VPPVSCI+ DGVM+
Subjt:  EINSSGTVPPVSCIIGDGVMT

AT1G22360.1 UDP-glucosyl transferase 85A2 (e value: 3.5e-41; % identity: 61.16)
Query:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS
        Q+ H V  PYP+QGHI+PM+K+AKL + KGFHITFVNT YNH RLLRSRGPN++DGLP F F +IPDGLP +D + TQ +P+LC S  ++CLAP   L+ 
Subjt:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS

Query:  EINSSGTVPPVSCIIGDGVMT
        +IN+   VPPVSCI+ DG M+
Subjt:  EINSSGTVPPVSCIIGDGVMT

AT1G22360.2 UDP-glucosyl transferase 85A2 (e value: 3.5e-41; % identity: 61.16)
Query:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS
        Q+ H V  PYP+QGHI+PM+K+AKL + KGFHITFVNT YNH RLLRSRGPN++DGLP F F +IPDGLP +D + TQ +P+LC S  ++CLAP   L+ 
Subjt:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS

Query:  EINSSGTVPPVSCIIGDGVMT
        +IN+   VPPVSCI+ DG M+
Subjt:  EINSSGTVPPVSCIIGDGVMT

AT1G22370.2 UDP-glucosyl transferase 85A5 (e value: 4.2e-42; % identity: 61.98)
Query:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS
        Q+PH V  P+P+QGHI+PMLK+AKL + +GFH+TFVNT YNH RL+RSRGPNSLDGLP F F +IPDGLP  + +  Q VP+LC S  +NCLAP   L+ 
Subjt:  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLIS

Query:  EINSSGTVPPVSCIIGDGVMT
         IN++  VPPVSCI+ DGVM+
Subjt:  EINSSGTVPPVSCIIGDGVMT

AT1G22400.1 UDP-Glycosyltransferase superfamily protein (e value: 6.6e-40; % identity: 55.3)
Query:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR
        MGS  ++    Q+PH V  PYP+QGHI+PM+++AKL H +GF++TFVNT YNH R LRSRG N+LDGLP F F +I DGLP +D ++TQ + +LC S  +
Subjt:  MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASR

Query:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT
        NCLAP   L+  IN+   VPPVSCI+ DG M+
Subjt:  NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT
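
The % identity values listed for the hits can be reproduced from the middle row of each alignment, where a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The Python sketch below assumes the listed value is the number of identical columns divided by the alignment length (the usual BLAST convention). The function name `percent_identity` is illustrative and not part of CuGenDBv2.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose middle-row character is a letter."""
    if not query:
        raise ValueError("query row must not be empty")
    # Pad the middle row in case trailing blanks were stripped when copying.
    midline = midline.ljust(len(query))[:len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)

# Second alignment block of the XP_011658777.2 hit (query row and middle row).
query = "NCLAPLCSLISEINSSGTVPPVSCIIGDGVMT"
midline = "NCLAP CSLISEINSSGTVPPVSCIIGDG+MT"
print(round(percent_identity(query, midline), 2))
```

For this single block, 30 of 32 columns are identical. To get a whole-hit value, concatenate the query rows and the middle rows of all blocks before calling the function. For the KAE8645796.1 hit this gives 131 identical columns out of 132, which is consistent with the listed 99.24.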


Sequences
CDS sequence
ATGGGTTCTCTTACCAAAGTAGACCAAGGAAAACAGCAACCTCATGCAGTGCTCTTCCCATACCCATCTCAAGGCCACATAAGTCCCATGCTAAAGCTTGCAAAACTCTT
TCACCACAAAGGCTTTCACATAACGTTTGTCAATACAGAATACAACCATAGACGTCTCCTCAGATCCCGAGGTCCCAACTCTTTGGATGGCTTGCCTGACTTTCATTTTA
GAGCCATTCCCGACGGCCTCCCACCGTCCGACGGCAACTCCACCCAACATGTTCCTTCACTCTGCTATTCCGCTTCCCGCAATTGTTTGGCTCCACTCTGTAGTCTCATT
TCGGAGATCAACTCGAGTGGCACTGTTCCACCAGTTTCCTGTATTATTGGCGATGGGGTCATGACTTGTTAG
mRNA sequence
CTTCAAAACAAATCCTTACATACAAACTAAGAGAAAGAAAAAAAATGGGTTCTCTTACCAAAGTAGACCAAGGAAAACAGCAACCTCATGCAGTGCTCTTCCCATACCCA
TCTCAAGGCCACATAAGTCCCATGCTAAAGCTTGCAAAACTCTTTCACCACAAAGGCTTTCACATAACGTTTGTCAATACAGAATACAACCATAGACGTCTCCTCAGATC
CCGAGGTCCCAACTCTTTGGATGGCTTGCCTGACTTTCATTTTAGAGCCATTCCCGACGGCCTCCCACCGTCCGACGGCAACTCCACCCAACATGTTCCTTCACTCTGCT
ATTCCGCTTCCCGCAATTGTTTGGCTCCACTCTGTAGTCTCATTTCGGAGATCAACTCGAGTGGCACTGTTCCACCAGTTTCCTGTATTATTGGCGATGGGGTCATGACT
TGTTAGAAATTAATAAGCTAGATACATTATGGATAACAAATTAATAGATACATTTCTAGTTTCAAGATACATATGCATTTCTAGATACATGTAATTATTCAAGTACATGA
ATGGTATGTATTCATGTACTTCATTTATTTTGTGTCTTTTAGGAAGTGTCTCTTGAATACTATATATATAGAGATCATTTCCTTCATTTGTATACAAGATAAAAGAATCA
AATCAATAGAAGCAAATATTGAGTTTTAGAAAGTAAGTGAGTTTCAAGAGAACAACATTCTTCTCTTGTGTGTTATTGAGTTTTCTAAGAGAAAAAATTTCTTCTCTTAA
AAGTTTGTGAGATTTAGAGTG
Protein sequence
MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSASRNCLAPLCSLI
SEINSSGTVPPVSCIIGDGVMTC
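
The protein sequence corresponds to the CDS read in frame under the standard genetic code. The Python sketch below illustrates that relationship on the first ten codons of the CDS. The `translate` helper and the compact way of building the codon table are illustrative assumptions, not a CuGenDBv2 tool.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence listed above.
print(translate("ATGGGTTCTCTTACCAAAGTAGACCAAGGA"))  # MGSLTKVDQG
```

Applied to the full CDS with the line breaks removed, the same function should return the 133-residue protein sequence, with the final TAG codon read as the stop.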