; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G018957 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G018957
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description17 kDa phloem lectin
Genome locationGy14Chr4:24962509..24969023
RNA-Seq ExpressionCsGy4G018957
SyntenyCsGy4G018957
Gene Ontology termsGO:0030246 - carbohydrate binding (molecular function)
InterPro domainsIPR025886 - Phloem protein 2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAM77344.1 17 kDa phloem lectin [Cucumis sativus]3.15e-11299.35Show/hide
Query:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
        MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
Subjt:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ

Query:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
        IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIR LAIQA
Subjt:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

KAE8649724.1 hypothetical protein Csa_012573 [Cucumis sativus]2.13e-11098.05Show/hide
Query:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
        MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSW DCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
Subjt:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ

Query:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
        IVLAGRQPNVW KIPIGKFILRGSLTSGTIRFG YNHEGNWKRGLNIRALAIQA
Subjt:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

XP_031739683.1 lectin-like [Cucumis sativus]3.54e-10996.1Show/hide
Query:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
        MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSW DCRWSMDASDFKQDIWYNASVEVM+T+N SGW+VPLHLEIELPDGSKQESQ
Subjt:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ

Query:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
        IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFG YNHEGNWKRGLNIRALAIQA
Subjt:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

XP_031740318.1 lectin-like [Cucumis sativus]2.13e-11098.05Show/hide
Query:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
        MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGS Q+SQ
Subjt:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ

Query:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
        IVLAGRQPNVWFKIPIGKFIL GSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
Subjt:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

XP_031740319.1 lectin [Cucumis sativus]9.03e-11299.35Show/hide
Query:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
        MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
Subjt:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ

Query:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
        IVLAGRQPNVWFKIPIGKFIL GSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
Subjt:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

TrEMBL top hitse value%identityAlignment
A0A0A0KYP0 Phloem lectin4.37e-11299.35Show/hide
Query:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
        MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
Subjt:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ

Query:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
        IVLAGRQPNVWFKIPIGKFIL GSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
Subjt:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

A0A1S3CE66 uncharacterized protein PHLOEM PROTEIN 2-LIKE A4-like3.56e-9378.57Show/hide
Query:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
        MAGQSTHY+A+PRA++ITW DDTRYWSWA VDFC Y IEEARLLQVSWLDCRW+MD+S FKQD+WYNASV+VM+T+ ASGWN+PL++EI++PDGSKQESQ
Subjt:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ

Query:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
        IVLAG+QPNVWFKIP+GKFI+  S+TSG IRFGFYNH G+WKRGL +RAL IQA
Subjt:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

Q8L5A9 17 kDa phloem lectin5.49e-10493.51Show/hide
Query:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
        MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMD SDFK+DIWYNASVEVMLTSNASGWNVPL+LEIELP GS+Q+SQ
Subjt:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ

Query:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
        IVL GRQPNVWFKI +GKFIL GSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
Subjt:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

Q8LK69 17 kDa phloem lectin1.52e-11299.35Show/hide
Query:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
        MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
Subjt:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ

Query:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
        IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIR LAIQA
Subjt:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

Q8LK70 17 kDa phloem lectin1.03e-11098.05Show/hide
Query:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ
        MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGS Q+SQ
Subjt:  MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQ

Query:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
        IVLAGRQPNVWFKIPIGKFIL GSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
Subjt:  IVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

SwissProt top hitse value%identityAlignment
C0HJV2 Lectin3.6e-2638Show/hide
Query:  QSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIVL
        +S  ++ + R   ITW  D RYW W         +E A L+ V WL+   +++ S     I Y A+ EVMLT++ASGW +P+ +++++PDGS+QESQ+ L
Subjt:  QSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIVL

Query:  AGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQ
          +   VWF I +G F +    T G I F    H+   KRGL ++ L IQ
Subjt:  AGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQ

O81865 Protein PHLOEM PROTEIN 2-LIKE A12.8e-1832.03Show/hide
Query:  STHYLAFPRASTITWGDDTRYWSWAT-VDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSK--QESQI
        S  ++ F +  +ITW DD  YW+W T  +  +  +E   L  V WLD     D  +    I Y    +V L   A GW+ P++L++ LP+G +  QE ++
Subjt:  STHYLAFPRASTITWGDDTRYWSWAT-VDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSK--QESQI

Query:  VLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNH-EGNWKRGLNIRALAIQ
         L       W  + +G+F+   S  +G I F  Y H  G WK+GL+++ +AI+
Subjt:  VLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNH-EGNWKRGLNIRALAIQ

O81866 Protein PHLOEM PROTEIN 2-LIKE A25.2e-0926.72Show/hide
Query:  EEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHE
        E A++ +V+WL+     +      +  Y     V L  +A GW+  ++ ++ LP G  +E +  +   + N W +IP G+F++     SG I F     +
Subjt:  EEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHE

Query:  GN-WKRGLNIRALAIQ
         + WK GL ++ +AI+
Subjt:  GN-WKRGLNIRALAIQ

P0DSP5 Lectin2.1e-5054Show/hide
Query:  STHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIVLA
        STH+L FPRA+T+TW DDTRYWSW  VDFC Y +EEA+L +VSW DCRW+++ +D K ++WYN  ++V + S ASGWN PL+LE+E+P+GSKQ SQ+VL 
Subjt:  STHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIVLA

Query:  GRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA
         R  +VWFK+ +G  ++  S T G +R   YNH+ NWK G  +  LA++A
Subjt:  GRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA

Q9C8U9 Uncharacterized protein PHLOEM PROTEIN 2-LIKE A46.8e-1732.24Show/hide
Query:  YLAFPRASTITWGDDTRYWSWATV--DFCSYAIEEARLLQ-VSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSK--QESQIV
        ++ + R  +I W D   YWSW  +  D  S  + +A +L+ V WLD     D  +   +  Y     V L   ASGWN+P++L++ LPDG K  QE  + 
Subjt:  YLAFPRASTITWGDDTRYWSWATV--DFCSYAIEEARLLQ-VSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSK--QESQIV

Query:  LAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEG-NWKRGLNIRALAIQ
        L       W  I  G+F+      +G I F  Y  +   WKRGL ++ + I+
Subjt:  LAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEG-NWKRGLNIRALAIQ

Arabidopsis top hitse value%identityAlignment
AT1G33920.1 phloem protein 2-A44.8e-1832.24Show/hide
Query:  YLAFPRASTITWGDDTRYWSWATV--DFCSYAIEEARLLQ-VSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSK--QESQIV
        ++ + R  +I W D   YWSW  +  D  S  + +A +L+ V WLD     D  +   +  Y     V L   ASGWN+P++L++ LPDG K  QE  + 
Subjt:  YLAFPRASTITWGDDTRYWSWATV--DFCSYAIEEARLLQ-VSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSK--QESQIV

Query:  LAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEG-NWKRGLNIRALAIQ
        L       W  I  G+F+      +G I F  Y  +   WKRGL ++ + I+
Subjt:  LAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHEG-NWKRGLNIRALAIQ

AT4G19840.1 phloem protein 2-A12.0e-1932.03Show/hide
Query:  STHYLAFPRASTITWGDDTRYWSWAT-VDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSK--QESQI
        S  ++ F +  +ITW DD  YW+W T  +  +  +E   L  V WLD     D  +    I Y    +V L   A GW+ P++L++ LP+G +  QE ++
Subjt:  STHYLAFPRASTITWGDDTRYWSWAT-VDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSK--QESQI

Query:  VLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNH-EGNWKRGLNIRALAIQ
         L       W  + +G+F+   S  +G I F  Y H  G WK+GL+++ +AI+
Subjt:  VLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNH-EGNWKRGLNIRALAIQ

AT4G19850.1 lectin-related3.7e-1026.72Show/hide
Query:  EEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHE
        E A++ +V+WL+     +      +  Y     V L  +A GW+  ++ ++ LP G  +E +  +   + N W +IP G+F++     SG I F     +
Subjt:  EEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIVLAGRQPNVWFKIPIGKFILRGSLTSGTIRFGFYNHE

Query:  GN-WKRGLNIRALAIQ
         + WK GL ++ +AI+
Subjt:  GN-WKRGLNIRALAIQ

AT4G19850.2 lectin-related4.9e-1026.56Show/hide
Query:  YLAFPRASTITWGDD--TRYWSW-ATVDFCSYAI--EEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIV
        ++ + R  +ITW +    +YWSW + +D  S  +  E A++ +V+WL+     +      +  Y     V L  +A GW+  ++ ++ LP G  +E +  
Subjt:  YLAFPRASTITWGDD--TRYWSW-ATVDFCSYAI--EEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIV

Query:  LAGRQPNVWFKIPIGKFILRGSLTSGTI
        +   + N W +IP G+F++     SG I
Subjt:  LAGRQPNVWFKIPIGKFILRGSLTSGTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGGCCAAAGCACACATTATTTGGCATTTCCAAGAGCTTCCACAATAACATGGGGTGATGACACTCGATACTGGAGTTGGGCCACCGTGGATTTTTGCAGCTACGC
AATTGAAGAAGCCCGACTTTTACAAGTATCTTGGCTCGATTGTCGTTGGAGCATGGATGCATCTGATTTCAAACAAGATATTTGGTACAATGCAAGCGTTGAAGTAATGT
TGACAAGCAACGCCTCTGGATGGAATGTTCCACTACACCTTGAAATCGAGTTGCCAGATGGGAGTAAGCAAGAGTCTCAAATAGTATTGGCAGGCAGACAACCAAATGTG
TGGTTCAAGATTCCAATCGGTAAATTCATACTAAGGGGTTCTCTGACTAGCGGAACAATCCGATTCGGCTTCTACAACCATGAAGGGAATTGGAAAAGAGGCTTGAACAT
AAGAGCCCTTGCCATTCAAGCATAA
mRNA sequenceShow/hide mRNA sequence
CTCCATACTTGAAACCAATCAAGGACCTTTATACAACATTTTCTCATTCTCATCCTTACTTTTATATCTCTCAAATACCATTTTAATGGCAGGCCAAAGCACACATTATT
TGGCATTTCCAAGAGCTTCCACAATAACATGGGGTGATGACACTCGATACTGGAGTTGGGCCACCGTGGATTTTTGCAGCTACGCAATTGAAGAAGCCCGACTTTTACAA
GTATCTTGGCTCGATTGTCGTTGGAGCATGGATGCATCTGATTTCAAACAAGATATTTGGTACAATGCAAGCGTTGAAGTAATGTTGACAAGCAACGCCTCTGGATGGAA
TGTTCCACTACACCTTGAAATCGAGTTGCCAGATGGGAGTAAGCAAGAGTCTCAAATAGTATTGGCAGGCAGACAACCAAATGTGTGGTTCAAGATTCCAATCGGTAAAT
TCATACTAAGGGGTTCTCTGACTAGCGGAACAATCCGATTCGGCTTCTACAACCATGAAGGGAATTGGAAAAGAGGCTTGAACATAAGAGCCCTTGCCATTCAAGCATAA
AAATGAGTTTGAAATGATATGTTCAAGAATGAATATAAAGTAATAGAATAAAACCCTAAATAATAAGACATATTGCTATGTTACAAAAACTTTTAAGTTTAAATGTGTTA
TATTT
Protein sequenceShow/hide protein sequence
MAGQSTHYLAFPRASTITWGDDTRYWSWATVDFCSYAIEEARLLQVSWLDCRWSMDASDFKQDIWYNASVEVMLTSNASGWNVPLHLEIELPDGSKQESQIVLAGRQPNV
WFKIPIGKFILRGSLTSGTIRFGFYNHEGNWKRGLNIRALAIQA