; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G17390 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G17390
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationChr1:13077154..13078194
RNA-Seq ExpressionCSPI01G17390
SyntenyCSPI01G17390
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]3.9e-5474.13Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        +KGSP +GI L K  D S+ AFADA+WGS  DT RSVTGFCVFLG SLVSWKSKKQQTV+RSSAEAEY+AL   +CEVIWL+S L EL+I+  +P ++FC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVR
        DNQ AIYIANN MFHE+TKHIELDCHFVRDRI+DGSIKLL VR
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVR

PON99369.1 hypothetical protein TorRG33x02_049120 [Trema orientale]3.5e-5560.12Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        +K  P QG+F S +  L +KAF+DADWGSCPDT +SVTGFC+FLG SLVSWK+KKQ T+SRSSAEAEYRALA  + E++ L++ L + Q+  ++P ++FC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL
        DNQ AIYIA+N  FHERTKHIELDCHFVRD++  GS+KLL +RS HQ AD  TKPL++ +L   + KM V ++
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL

XP_022134747.1 uncharacterized protein LOC111006944 [Momordica charantia]1.5e-5864.16Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        +KGSP QG+FLS +    L+AFADADWGSCPDT RS TGFCVF+G SLVSWKSKKQ T+SRSSAEAEYRALA VSCE++WL   L +LQ++   P +VFC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL
        DNQ AI++A N +FHERTKHIELDC FVRDR+ DG ++LL  RS+ Q AD  TKPL+++I    M KM + N+
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL

XP_022899321.1 uncharacterized protein LOC111412620 [Olea europaea var. sylvestris]4.6e-5562.79Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        IK SP QGI  S    L L+AFADADWGSC DT +SV GFCVFLG SL+SWK+KKQ TVSRSSAEAEYRALA+ + E+ WL   L + Q  T  PTV+FC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKN
        DNQVA+++A+N +FHERTKHIE+DCHF+RD++ DGS+KLL VRS+HQ AD   K L A +L   + KM V N
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKN

XP_031745923.1 uncharacterized protein LOC116406346 [Cucumis sativus]2.3e-5459.54Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        +KGSP QG+ +   D   LKAF DADWGSC DT RSVTGFC+FLG S++SWKSKKQ TVSRSSAEAEYRAL +V+ E++W+   L + +I+T  PT VFC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL
        DNQ AI IA+N  FHERTKHIE+DCHFVRD+I++G +K+L + +S Q AD  TK L +  L+ H+ K+G+K++
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL

TrEMBL top hitse value%identityAlignment
A0A2N9J5N4 Uncharacterized protein2.7e-5360.69Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        IKG+PSQG+  + + DL +KAF+D+DW  CPDT RS TG+CVFLG SLVSW+SKKQ TVSRSSAEAEYRA+A   CEVIW+K+ L +LQI  +   ++F 
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL
        D+Q A++IA N +FHERTKHIELDCH VRD+I +G IK L V S HQ AD +TK L   + S  + KMGV NL
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL

A0A2P5FNK4 Uncharacterized protein1.7e-5560.12Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        +K  P QG+F S +  L +KAF+DADWGSCPDT +SVTGFC+FLG SLVSWK+KKQ T+SRSSAEAEYRALA  + E++ L++ L + Q+  ++P ++FC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL
        DNQ AIYIA+N  FHERTKHIELDCHFVRD++  GS+KLL +RS HQ AD  TKPL++ +L   + KM V ++
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL

A0A5A7TTG6 Cysteine-rich RLK (Receptor-like protein kinase) 82.4e-5469.28Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        +KGSP +GI L K  D S+ AFAD DWG C +T  SVTGFCVFLG SLVSWKSKKQQTV+RSS EAEYRALA  +C+VIWL S   ELQI+  +  ++FC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALT
        DNQ  IYI NN MFHERTKHI+LDCHFVRDRI+DGSIKLL +RSS+Q AD  T
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALT

A0A5A7VQN7 Cysteine-rich RLK (Receptor-like protein kinase) 81.9e-5474.13Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        +KGSP +GI L K  D S+ AFADA+WGS  DT RSVTGFCVFLG SLVSWKSKKQQTV+RSSAEAEY+AL   +CEVIWL+S L EL+I+  +P ++FC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVR
        DNQ AIYIANN MFHE+TKHIELDCHFVRDRI+DGSIKLL VR
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVR

A0A6J1C0G9 uncharacterized protein LOC1110069447.4e-5964.16Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        +KGSP QG+FLS +    L+AFADADWGSCPDT RS TGFCVF+G SLVSWKSKKQ T+SRSSAEAEYRALA VSCE++WL   L +LQ++   P +VFC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL
        DNQ AI++A N +FHERTKHIELDC FVRDR+ DG ++LL  RS+ Q AD  TKPL+++I    M KM + N+
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNL

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-2037.68Show/hide
Query:  FADADWGSCPDTHRSVTGFCV-FLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFCDNQVAIYIANNLMFHERTKH
        + D+DW       +S TG+       +L+ W +K+Q +V+ SS EAEY AL     E +WLK  L  + I+   P  ++ DNQ  I IANN   H+R KH
Subjt:  FADADWGSCPDTHRSVTGFCV-FLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFCDNQVAIYIANNLMFHERTKH

Query:  IELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNA
        I++  HF R+++ +  I L  + + +Q AD  TKPL A
Subjt:  IELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-2241.01Show/hide
Query:  DLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFCDNQVAIYIANNLMFH
        D  LK + DAD     D  +S TG+        +SW+SK Q+ V+ S+ EAEY A      E+IWLK FL+EL +      VV+CD+Q AI ++ N M+H
Subjt:  DLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFCDNQVAIYIANNLMFH

Query:  ERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTK
         RTKHI++  H++R+ + D S+K+L + ++   AD LTK
Subjt:  ERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTK

P92519 Uncharacterized mitochondrial protein AtMg008107.2e-1950Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIW
        +KG+   G+++ K   L+++AF D+DW  C  T RS TGFC FLG +++SW +K+Q TVSRSS E EYRALA  + E+ W
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.5e-3744.71Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        + G+P+ GIFL K + LSL A++DADW    D + S  G+ V+LG   +SW SKKQ+ V RSS EAEYR++A  S E+ W+ S L EL I    P V++C
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGV
        DN  A Y+  N +FH R KHI +D HF+R+++  G+++++ V +  Q AD LTKPL+         K+GV
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.9e-3845.29Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        + G+P  GIFL K + LSL A++DADW    D + S  G+ V+LG   +SW SKKQ+ V RSS EAEYR++A  S E+ W+ S L EL I+ + P V++C
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGV
        DN  A Y+  N +FH R KHI LD HF+R+++  G+++++ V +  Q AD LTKPL+ +       K+GV
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.1e-4251.2Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
        IKG+  QG+F S   ++ L+ F+DA + SC DT RS  G+C+FLG SL+SWKSKKQQ VS+SSAEAEYRAL+  + E++WL  F +ELQ+  + PT++FC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC

Query:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMY
        DN  AI+IA N +FHERTKHIE DCH VR+R V     L     ++   D  T+ L+ I+    MY
Subjt:  DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMY

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.1e-1270.73Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFC
        +KG+  QG+F S T DL LKAFAD+DW SCPDT RSVTGFC
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFC

ATMG00810.1 DNA/RNA polymerases superfamily protein5.1e-2050Show/hide
Query:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIW
        +KG+   G+++ K   L+++AF D+DW  C  T RS TGFC FLG +++SW +K+Q TVSRSS E EYRALA  + E+ W
Subjt:  IKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAACCTGACAACTCATATGAGTGCATCAAAGGATCACCAAGCCAAGGAATATTTCTGTCCAAAACAGATGATCTATCCTTAAAAGCATTTGCAGATGCTGATTG
GGGATCATGTCCTGATACCCATCGTTCAGTCACCGGGTTTTGTGTTTTCCTTGGAAAATCACTTGTATCATGGAAGTCAAAGAAACAGCAAACAGTTTCACGGTCATCAG
CTGAAGCAGAGTACCGAGCATTGGCCACTGTTTCTTGTGAAGTTATTTGGCTCAAGAGTTTTCTCAAAGAATTACAGATAGAAACAAATACACCAACTGTAGTGTTTTGT
GATAACCAAGTTGCCATTTACATCGCCAATAATCTCATGTTTCATGAGAGAACGAAACATATAGAGTTGGACTGCCACTTTGTTCGAGATAGAATTGTTGATGGATCCAT
CAAACTACTCCTTGTACGCTCTTCACATCAATTTGCTGATGCCCTCACTAAACCACTTAATGCCATAATTTTGTCTCTTCATATGTACAAGATGGGAGTTAAAAATTTGG
GGCATAATTCCTCTTACACTAGACCAGTATTCTATGGAGCTTTTTTCTCTTGGACAAAGATGGAGGGTGAGGGATGCAAGAAGCATCTTAATCTTTACCCAGCTTTGGAT
TTTCAACAATCGAAACTAAAATCTAGTTGGGTTCACGAGAAGACAAAAGAGTTAAAGTACAACAATTGTAGGGAAGGAGTATCAAACCTTCCACTCGAGAATAGAAAAGA
TTGTTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAACCTGACAACTCATATGAGTGCATCAAAGGATCACCAAGCCAAGGAATATTTCTGTCCAAAACAGATGATCTATCCTTAAAAGCATTTGCAGATGCTGATTG
GGGATCATGTCCTGATACCCATCGTTCAGTCACCGGGTTTTGTGTTTTCCTTGGAAAATCACTTGTATCATGGAAGTCAAAGAAACAGCAAACAGTTTCACGGTCATCAG
CTGAAGCAGAGTACCGAGCATTGGCCACTGTTTCTTGTGAAGTTATTTGGCTCAAGAGTTTTCTCAAAGAATTACAGATAGAAACAAATACACCAACTGTAGTGTTTTGT
GATAACCAAGTTGCCATTTACATCGCCAATAATCTCATGTTTCATGAGAGAACGAAACATATAGAGTTGGACTGCCACTTTGTTCGAGATAGAATTGTTGATGGATCCAT
CAAACTACTCCTTGTACGCTCTTCACATCAATTTGCTGATGCCCTCACTAAACCACTTAATGCCATAATTTTGTCTCTTCATATGTACAAGATGGGAGTTAAAAATTTGG
GGCATAATTCCTCTTACACTAGACCAGTATTCTATGGAGCTTTTTTCTCTTGGACAAAGATGGAGGGTGAGGGATGCAAGAAGCATCTTAATCTTTACCCAGCTTTGGAT
TTTCAACAATCGAAACTAAAATCTAGTTGGGTTCACGAGAAGACAAAAGAGTTAAAGTACAACAATTGTAGGGAAGGAGTATCAAACCTTCCACTCGAGAATAGAAAAGA
TTGTTTATAA
Protein sequenceShow/hide protein sequence
MAKPDNSYECIKGSPSQGIFLSKTDDLSLKAFADADWGSCPDTHRSVTGFCVFLGKSLVSWKSKKQQTVSRSSAEAEYRALATVSCEVIWLKSFLKELQIETNTPTVVFC
DNQVAIYIANNLMFHERTKHIELDCHFVRDRIVDGSIKLLLVRSSHQFADALTKPLNAIILSLHMYKMGVKNLGHNSSYTRPVFYGAFFSWTKMEGEGCKKHLNLYPALD
FQQSKLKSSWVHEKTKELKYNNCREGVSNLPLENRKDCL