; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G46880 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G46880
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionprotein LOL1
Genome locationChr3:39977177..39980243
RNA-Seq ExpressionCSPI03G46880
SyntenyCSPI03G46880
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR005735 - Zinc finger, LSD1-type
IPR036280 - Multiheme cytochrome superfamily
IPR040319 - LSD1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148490.1 protein LOL1 [Cucumis sativus]4.6e-7199.26Show/hide
Query:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG+S
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

XP_008465951.1 PREDICTED: protein LOL1 [Cucumis melo]3.9e-7098.53Show/hide
Query:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVP PGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG+S
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

XP_022952054.1 protein LOL1-like [Cucurbita moschata]8.7e-7096.32Show/hide
Query:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPPAPY+QP+NATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTS+G++
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

XP_038887481.1 protein LOL1 isoform X1 [Benincasa hispida]1.1e-6997.78Show/hide
Query:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPP PYTQP+NATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVT VPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGV
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGV
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGV

XP_038887483.1 protein LOL1 isoform X2 [Benincasa hispida]8.7e-7097.06Show/hide
Query:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPP PYTQP+NATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVT VPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG+S
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

TrEMBL top hitse value%identityAlignment
A0A0A0LEE4 Uncharacterized protein2.2e-7199.26Show/hide
Query:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG+S
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

A0A1S3CQ35 protein LOL11.9e-7098.53Show/hide
Query:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVP PGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG+S
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

A0A6J1ET02 protein LOL11.0e-6895.59Show/hide
Query:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPP PYT P+N  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVG+S
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

A0A6J1GKN9 protein LOL1-like4.2e-7096.32Show/hide
Query:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
        MPPVPLAPYPTPPAPY+QP+NATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ
Subjt:  MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQ

Query:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTS+G++
Subjt:  VAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

A0A6J1I7B6 protein LOL1-like1.0e-6895.62Show/hide
Query:  MPPVPLAPYPTPPA-PYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEAN
        MPPVPLAPYPTPPA PY+QP+NATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEAN
Subjt:  MPPVPLAPYPTPPA-PYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEAN

Query:  QVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        QVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTS+G++
Subjt:  QVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

SwissProt top hitse value%identityAlignment
P94077 Protein LSD12.5e-3558.27Show/hide
Query:  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML
        Q QLVC GCRNLL+YP GA++V CA+CN +  VPPP    +MA ++CGGC T+LMY RGA+SV+CSCC T NL    +NQVAH        ++CG+CR  
Subjt:  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML

Query:  LMYQYGARSVKCAVCNFVTSVGVSKKR
        LMY YGA SVKCAVC FVT+V +S  R
Subjt:  LMYQYGARSVKCAVCNFVTSVGVSKKR

Q0J7V9 Protein LSD15.1e-6589.55Show/hide
Query:  PVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVA
        PVPLAPYPTPP P+T P N  QSQLVCSGCRNLL+YP GATSVCCAVC+ VTAVP PGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA+EANQVA
Subjt:  PVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVA

Query:  HVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        HV+CGNCRMLLMYQYGARSVKCAVCNFVTSVG S
Subjt:  HVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

Q2QMB3 Protein LOL22.8e-3961.86Show/hide
Query:  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTA-VPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGARSV
        QSQ+VC GCRN+LLYP GA SVCCAVC+AV++  P PG ++A L+CGGC TLLMY R ATSV+CSCC TVNL    + +AH++CG C+ +LMY YGA SV
Subjt:  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTA-VPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGARSV

Query:  KCAVCNFVTSVGVSKKRN
        KCA+CNF+T+ G++  R+
Subjt:  KCAVCNFVTSVGVSKKRN

Q6ASS2 Protein LOL34.1e-3862.61Show/hide
Query:  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPG--TEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGARS
        QSQ+VC GCR++L YP GA SVCCA+C A+T VPPP    EMA L+CGGC TLLMY R A +V+CSCC TVNL    N +AHVSCG CR  LMY YGA S
Subjt:  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPG--TEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGARS

Query:  VKCAVCNFVTSVGVS
        VKCA+C+++T+ G++
Subjt:  VKCAVCNFVTSVGVS

Q93ZB1 Protein LOL18.7e-6587.41Show/hide
Query:  PVPLAPYPTPPAPYTQPSNAT---------QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
        PVPLAPYPTPPAP   PS  T         QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
Subjt:  PVPLAPYPTPPAPYTQPSNAT---------QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN

Query:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        LALEANQVAHV+CGNC MLLMYQYGARSVKCAVCNFVTSVG S
Subjt:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

Arabidopsis top hitse value%identityAlignment
AT1G32540.1 lsd one like 11.7e-6094.02Show/hide
Query:  SNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGA
        S + QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHV+CGNC MLLMYQYGA
Subjt:  SNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCRMLLMYQYGA

Query:  RSVKCAVCNFVTSVGVS
        RSVKCAVCNFVTSVG S
Subjt:  RSVKCAVCNFVTSVGVS

AT1G32540.2 lsd one like 16.2e-6687.41Show/hide
Query:  PVPLAPYPTPPAPYTQPSNAT---------QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
        PVPLAPYPTPPAP   PS  T         QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
Subjt:  PVPLAPYPTPPAPYTQPSNAT---------QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN

Query:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        LALEANQVAHV+CGNC MLLMYQYGARSVKCAVCNFVTSVG S
Subjt:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

AT1G32540.3 lsd one like 16.2e-6687.41Show/hide
Query:  PVPLAPYPTPPAPYTQPSNAT---------QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
        PVPLAPYPTPPAP   PS  T         QSQLVCSGCRNLL+YPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN
Subjt:  PVPLAPYPTPPAPYTQPSNAT---------QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVN

Query:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS
        LALEANQVAHV+CGNC MLLMYQYGARSVKCAVCNFVTSVG S
Subjt:  LALEANQVAHVSCGNCRMLLMYQYGARSVKCAVCNFVTSVGVS

AT4G20380.1 LSD1 zinc finger family protein1.8e-3658.27Show/hide
Query:  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML
        Q QLVC GCRNLL+YP GA++V CA+CN +  VPPP    +MA ++CGGC T+LMY RGA+SV+CSCC T NL    +NQVAH        ++CG+CR  
Subjt:  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML

Query:  LMYQYGARSVKCAVCNFVTSVGVSKKR
        LMY YGA SVKCAVC FVT+V +S  R
Subjt:  LMYQYGARSVKCAVCNFVTSVGVSKKR

AT4G20380.2 LSD1 zinc finger family protein1.8e-3658.27Show/hide
Query:  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML
        Q QLVC GCRNLL+YP GA++V CA+CN +  VPPP    +MA ++CGGC T+LMY RGA+SV+CSCC T NL    +NQVAH        ++CG+CR  
Subjt:  QSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGT--EMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLA-LEANQVAH--------VSCGNCRML

Query:  LMYQYGARSVKCAVCNFVTSVGVSKKR
        LMY YGA SVKCAVC FVT+V +S  R
Subjt:  LMYQYGARSVKCAVCNFVTSVGVSKKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACCAGTTCCTCTTGCTCCATATCCAACCCCTCCAGCTCCCTATACACAACCTTCTAATGCCACACAGAGCCAACTTGTGTGCTCAGGATGCAGAAACCTTTTACT
TTATCCTGTTGGGGCAACCTCTGTTTGCTGTGCTGTTTGTAATGCAGTTACTGCTGTACCGCCACCCGGCACAGAAATGGCACAATTGGTGTGTGGAGGCTGCCACACTC
TTCTCATGTACATCCGTGGTGCCACGAGTGTCCAATGTTCTTGTTGCCACACTGTCAACTTAGCTTTGGAAGCGAATCAGGTGGCGCACGTTAGCTGCGGGAACTGCAGG
ATGCTACTGATGTATCAATATGGAGCACGATCAGTGAAATGTGCAGTATGCAATTTTGTAACATCAGTTGGGGTAAGTAAGAAAAGAAACTTCTTTTTGTTGAACTCTCT
TTTTCTATACGAGCATTGTTGA
mRNA sequenceShow/hide mRNA sequence
GAACTGGGGATTAAGAGAAGGCCTTGCAAAGGAAAGAAAGAAAAGAAAGAAAAAGCTTTGTTGTTCATCTCCACAAACTTGTTTCCATGGAGGAAGTAGCTGGTTTTGTC
TGAGCTTTTGAGAGCTTGGTTTGAAGATAGAGATAAAGAAAAAGAAAAGTTAGGAAAGATTTTGCTTTATCAAAGCTGCAAAATGCCACCAGTTCCTCTTGCTCCATATC
CAACCCCTCCAGCTCCCTATACACAACCTTCTAATGCCACACAGAGCCAACTTGTGTGCTCAGGATGCAGAAACCTTTTACTTTATCCTGTTGGGGCAACCTCTGTTTGC
TGTGCTGTTTGTAATGCAGTTACTGCTGTACCGCCACCCGGCACAGAAATGGCACAATTGGTGTGTGGAGGCTGCCACACTCTTCTCATGTACATCCGTGGTGCCACGAG
TGTCCAATGTTCTTGTTGCCACACTGTCAACTTAGCTTTGGAAGCGAATCAGGTGGCGCACGTTAGCTGCGGGAACTGCAGGATGCTACTGATGTATCAATATGGAGCAC
GATCAGTGAAATGTGCAGTATGCAATTTTGTAACATCAGTTGGGGTAAGTAAGAAAAGAAACTTCTTTTTGTTGAACTCTCTTTTTCTATACGAGCATTGTTGAACTCTC
TTTGTCCATATGAACATGAACTTGGAATAAAGTCATTCGATCTAAATCATCGGGTTGTTTAAGAATTAACGAAAGCTTA
Protein sequenceShow/hide protein sequence
MPPVPLAPYPTPPAPYTQPSNATQSQLVCSGCRNLLLYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVSCGNCR
MLLMYQYGARSVKCAVCNFVTSVGVSKKRNFFLLNSLFLYEHC