; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G1324 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G1324
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description50S ribosomal protein L18
Genome locationctg1:11572182..11575724
RNA-Seq ExpressionCucsat.G1324
SyntenyCucsat.G1324
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0008097 - 5S rRNA binding (molecular function)
InterPro domainsIPR004389 - Ribosomal protein L18, bacterial-type
IPR005484 - Ribosomal protein L18


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597602.1 50S ribosomal protein L18, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]5.34e-9888.24Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
        M S  AP SFL +S L VSYSQN+   PHL+FP TTSSSGRHSLV+EAKATTRREDR ARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA

Query:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        AVSTMQKSISEGLDY++GPTIEVAKK+GEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGL F
Subjt:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

XP_004147640.1 50S ribosomal protein L18, chloroplastic [Cucumis sativus]1.29e-114100Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
        MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA

Query:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
Subjt:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

XP_008439021.1 PREDICTED: 50S ribosomal protein L18, chloroplastic [Cucumis melo]8.35e-11197.06Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
        MASTTAPCSFL TSFL+VSYSQNILNRPHLHFP TTSSSGRHSLVVEAKATTRREDR ARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA

Query:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        AVSTMQKSISEGLDYSAGPTIEVAKK+GEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
Subjt:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

XP_023538937.1 50S ribosomal protein L18, chloroplastic [Cucurbita pepo subsp. pepo]6.51e-9988.82Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
        M+S  AP SFL TS L VSYSQN+   PHL+FP TTSSSGRHSLV+EAKATTRREDR ARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA

Query:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        AVSTMQKSISEGLDY++GPTIEVAKK+GEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGL F
Subjt:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

XP_038885362.1 50S ribosomal protein L18, chloroplastic-like [Benincasa hispida]7.70e-10794.12Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
        MAS TAP SFL TSFL+VSYSQN+ NRPHLHFP TTSSSGRHSLVVEA+ATTRREDR+ARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA

Query:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        AVSTMQKSISEGLDYSAGPTIEVAKK+GEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
Subjt:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

TrEMBL top hitse value%identityAlignment
A0A0A0L9A4 Uncharacterized protein6.23e-115100Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
        MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA

Query:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
Subjt:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

A0A1S3AXQ6 50S ribosomal protein L18, chloroplastic4.04e-11197.06Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
        MASTTAPCSFL TSFL+VSYSQNILNRPHLHFP TTSSSGRHSLVVEAKATTRREDR ARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA

Query:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        AVSTMQKSISEGLDYSAGPTIEVAKK+GEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
Subjt:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

A0A5A7UA63 50S ribosomal protein L184.04e-11197.06Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
        MASTTAPCSFL TSFL+VSYSQNILNRPHLHFP TTSSSGRHSLVVEAKATTRREDR ARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA

Query:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        AVSTMQKSISEGLDYSAGPTIEVAKK+GEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
Subjt:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

A0A6J1C7Y4 50S ribosomal protein L18, chloroplastic4.29e-9785.29Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
        M++  +PCSFL +SFL+VSY Q + NRP  +FP  TSSSGRHSL+VEAKATTRREDR ARHSRIRKKVEGT ERPRLSV+RSNKHLYVQVIDDSKMHTLA
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA

Query:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        A STMQKSISEGLDYS+GPTIEVAKK+GE IAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
Subjt:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

A0A6J1IAC8 50S ribosomal protein L18, chloroplastic3.02e-9787.65Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
        M+S TA  SFL +S L VSYSQN+   PHL+FP +TSSSGRHSLV+EAKATTRREDR ARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLA

Query:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        AVSTMQKSISEGLDY++GPTIEVAKK+GEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGL F
Subjt:  AVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

SwissProt top hitse value%identityAlignment
P09415 50S ribosomal protein L181.3e-3263.56Show/hide
Query:  RREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPY
        R   R  RH+RIRKK+ GTTERPRLSVFRSNKH+Y Q+IDD+K  T+ + ST+ K    GLD  +   IE AKK+GE +AK  LEKGI +V FDRGGY Y
Subjt:  RREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPY

Query:  HGRVEALADAAREHGLQF
        HGRV+ALADAARE GL+F
Subjt:  HGRVEALADAAREHGLQF

P82195 50S ribosomal protein L18, chloroplastic1.4e-5570.35Show/hide
Query:  MASTTAPCSFLRTSFLTVSYSQNILNRP--HLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHT
        MA+ T+   F  T   + S S   L+ P   ++F P T        +++AKA TRREDRTARH RIRKKVEGT ERPRL VFRSNKHLYVQVIDD+KMHT
Subjt:  MASTTAPCSFLRTSFLTVSYSQNILNRP--HLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHT

Query:  LAAVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        LAA STMQK+ISE +DYSAGPT+EVA+KIGE IAKSCLEKGITKVAFDRGGYPYHGRV+ALADAAREHGL F
Subjt:  LAAVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

Q5L3S3 50S ribosomal protein L181.3e-3263.56Show/hide
Query:  RREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPY
        R   R  RH+RIRKK+ GT ERPRLSVFRSNKH+Y Q+IDD+K  T+ + ST+ K    GLD  +   IE AKK+GE +AK  LEKGI KV FDRGGY Y
Subjt:  RREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPY

Query:  HGRVEALADAAREHGLQF
        HGRV+ALADAARE GL+F
Subjt:  HGRVEALADAAREHGLQF

Q8SAY0 50S ribosomal protein L18, chloroplastic1.2e-4668.28Show/hide
Query:  PHLHFPPTTSSSGRHSLVVEA--KATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSISEGLDYSAGPTIEVAK
        P +  P  + +  R SLVV A  K +T + DR ARH R+RKKV GTTERPRLSVFRSNKHLY QVIDD+K  TL + STM KS+S+ L+YSAGPT+EVA+
Subjt:  PHLHFPPTTSSSGRHSLVVEA--KATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSISEGLDYSAGPTIEVAK

Query:  KIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        KIGE IAKSCLEKGITKV FDRGG+ YHGR++ALADAARE+GL F
Subjt:  KIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

Q9SX68 50S ribosomal protein L18, chloroplastic1.2e-5479.71Show/hide
Query:  PPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSISEGLDYSAGPTIEVAKKIGEAIA
        P ++SS    S+VVEAK  T  EDR ARHSRIRKKV GTTERPRL VFRSNKHLYVQVIDD+KMHTLA+ ST QK ISE  DY++GPTIEVAKK+GE IA
Subjt:  PPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSISEGLDYSAGPTIEVAKKIGEAIA

Query:  KSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        KSCLEKGITKVAFDRGGYPYHGR+EALA AAREHGLQF
Subjt:  KSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

Arabidopsis top hitse value%identityAlignment
AT1G14205.1 Ribosomal L18p/L5e family protein1.2e-1436.65Show/hide
Query:  TSFLTVSYS-QNILN---RPHLHFP-PTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQK
        TS  TV +   +IL    +P+  FP   T+ S     V+EA+  TR E    R+ R RKK  GT  +PRLSVF S+K LY  ++DD     L   ST+QK
Subjt:  TSFLTVSYS-QNILN---RPHLHFP-PTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQK

Query:  SISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKV-AFDRGGYPYHGRVEALADAAREHG
        SI      +    IE AK++GE + K+ ++  I ++ ++DR G     R++A   A  +HG
Subjt:  SISEGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKV-AFDRGGYPYHGRVEALADAAREHG

AT1G48350.1 Ribosomal L18p/L5e family protein8.4e-5679.71Show/hide
Query:  PPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSISEGLDYSAGPTIEVAKKIGEAIA
        P ++SS    S+VVEAK  T  EDR ARHSRIRKKV GTTERPRL VFRSNKHLYVQVIDD+KMHTLA+ ST QK ISE  DY++GPTIEVAKK+GE IA
Subjt:  PPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSISEGLDYSAGPTIEVAKKIGEAIA

Query:  KSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF
        KSCLEKGITKVAFDRGGYPYHGR+EALA AAREHGLQF
Subjt:  KSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF

AT3G17626.1 structural constituent of ribosome2.6e-0972.97Show/hide
Query:  GPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHG
        G TIE+AKK+GE I KSC+E GITKVAFDR  Y YHG
Subjt:  GPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAACCACTGCTCCTTGTTCCTTCCTCCGCACTTCCTTTCTAACCGTATCTTATTCACAGAACATTCTCAACCGTCCGCACCTTCATTTTCCGCCTACTACTTC
AAGTTCCGGCCGTCACTCTCTTGTTGTTGAAGCCAAGGCTACGACCAGACGAGAGGACCGCACAGCTCGCCATTCTCGAATTAGGAAGAAGGTTGAAGGAACAACGGAGA
GGCCAAGGTTATCTGTTTTCCGTTCGAATAAGCATTTGTATGTACAGGTTATCGATGACTCCAAGATGCATACGCTTGCTGCAGTTTCTACAATGCAGAAATCAATCTCT
GAGGGGTTAGACTACAGTGCTGGCCCTACCATTGAAGTGGCAAAGAAAATAGGGGAAGCGATTGCAAAATCTTGTTTGGAGAAAGGAATAACAAAAGTTGCATTCGATCG
CGGTGGATATCCATACCATGGACGAGTAGAAGCACTTGCTGATGCGGCTCGTGAACATGGCCTTCAATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAACCACTGCTCCTTGTTCCTTCCTCCGCACTTCCTTTCTAACCGTATCTTATTCACAGAACATTCTCAACCGTCCGCACCTTCATTTTCCGCCTACTACTTC
AAGTTCCGGCCGTCACTCTCTTGTTGTTGAAGCCAAGGCTACGACCAGACGAGAGGACCGCACAGCTCGCCATTCTCGAATTAGGAAGAAGGTTGAAGGAACAACGGAGA
GGCCAAGGTTATCTGTTTTCCGTTCGAATAAGCATTTGTATGTACAGGTTATCGATGACTCCAAGATGCATACGCTTGCTGCAGTTTCTACAATGCAGAAATCAATCTCT
GAGGGGTTAGACTACAGTGCTGGCCCTACCATTGAAGTGGCAAAGAAAATAGGGGAAGCGATTGCAAAATCTTGTTTGGAGAAAGGAATAACAAAAGTTGCATTCGATCG
CGGTGGATATCCATACCATGGACGAGTAGAAGCACTTGCTGATGCGGCTCGTGAACATGGCCTTCAATTCTGA
Protein sequenceShow/hide protein sequence
MASTTAPCSFLRTSFLTVSYSQNILNRPHLHFPPTTSSSGRHSLVVEAKATTRREDRTARHSRIRKKVEGTTERPRLSVFRSNKHLYVQVIDDSKMHTLAAVSTMQKSIS
EGLDYSAGPTIEVAKKIGEAIAKSCLEKGITKVAFDRGGYPYHGRVEALADAAREHGLQF