; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G08290 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G08290
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionD-ribose-binding periplasmic protein
Genome locationChr1:5230142..5231303
RNA-Seq ExpressionCSPI01G08290
SyntenyCSPI01G08290
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152463.1 uncharacterized protein LOC101220404 [Cucumis sativus]4.2e-92100Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

XP_008438114.1 PREDICTED: uncharacterized protein LOC103483316 [Cucumis melo]1.0e-9098.31Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDN+TQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        TTQDVLQGLKAKQEAKKKRNLL+FEGK GNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

XP_023000947.1 uncharacterized protein LOC111495231 [Cucurbita maxima]1.2e-7082.58Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQAIDTASLIIQHPNGKVDR YWPVNAGEIMK+NPGHYVALLISTK+C S++T+   RRRD D QTN+TNFNSVRLTRIKLLKPTDSLVLGQIYRLV
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        T QDVLQGLKAKQEAK KRN L+FEGK GN EKGSEGE+NQGMK E+NR       VS+AAKSRGWQPSLQSISE GS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

XP_023519579.1 uncharacterized protein LOC111782953 [Cucurbita pepo subsp. pepo]2.0e-7082.58Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQAIDTASLIIQHPNGKVDR YWPVNAGEIMK+NPGHYVALLISTK+C S++T+   RRRD+D QTN+TNFNSVRLTRIKLLKPTDSLVLGQIYRLV
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        T QDVLQGLKAKQEAK KRN L+FEGK GN EKGSEGE+NQGMK E+NR       VS+AAKSRGWQPSLQSISE GS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

XP_038894996.1 uncharacterized protein LOC120083345 [Benincasa hispida]6.1e-7585.96Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTK+C SETT+ HHRRRDN TQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        TTQDVLQGLK KQEAK K        K GN + GSEGEI++GMK ERN VKKC+STVS AAKSRGWQPSLQSISEGGS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

TrEMBL top hitse value%identityAlignment
A0A0A0LU27 Uncharacterized protein2.0e-92100Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

A0A1S3AV95 uncharacterized protein LOC1034833165.0e-9198.31Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDN+TQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        TTQDVLQGLKAKQEAKKKRNLL+FEGK GNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

A0A5D3BH89 Uncharacterized protein5.0e-9198.31Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDN+TQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        TTQDVLQGLKAKQEAKKKRNLL+FEGK GNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

A0A6J1E7K0 uncharacterized protein LOC1114314551.7e-7082.02Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQAIDTASLIIQHPNGKVDR YWPVNAGEIMK+NPGHYVALLISTK+C S++T+   RRRD+D QTN+TNFNSVRLTRIKLLKPTDSLVLGQIYRLV
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        T QDVLQGLKAKQEAK KRN ++FEGK GN EKGSEGE+NQGMK E+NR       VS+AAKSRGWQPSLQSISE GS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

A0A6J1KF34 uncharacterized protein LOC1114952315.8e-7182.58Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQAIDTASLIIQHPNGKVDR YWPVNAGEIMK+NPGHYVALLISTK+C S++T+   RRRD D QTN+TNFNSVRLTRIKLLKPTDSLVLGQIYRLV
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        T QDVLQGLKAKQEAK KRN L+FEGK GN EKGSEGE+NQGMK E+NR       VS+AAKSRGWQPSLQSISE GS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10530.1 unknown protein1.2e-2539.89Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQA++ A L++QHP G +DR Y  V+  E+M   PGHYV+L+I   + + E  +     + +D +       +VR TR++LL+PT++LVLG  YRL+
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        T+Q+V++ L+ K+ AK K++ +E   K   ++K S+ ++ +  + ++ RV + NST  +  KS+ W+PSLQSISE  S
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

AT1G60010.1 unknown protein7.5e-3144.57Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQA+D A+L++QHP+GK+DR Y PV+  EIM+  PGHYV+L+I         T+T        T  + +    VR TR+KLL+PT++LVLG  YRL+
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEF--EGKMGNSEK--GSEGEINQGM--KNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        T+Q+V++ L+AK+ AK K++  E   E K  +SEK    E + NQ +  K+E+ R    N   S +++S+ W+PSLQSISE  S
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEF--EGKMGNSEK--GSEGEINQGM--KNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

AT5G50090.1 unknown protein1.5e-3144.94Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQA+DTA ++IQHPNGK ++L  PV+A  +MK NPGH V+LLIST    S                +S +   +RLTRIKLL+PTD+LVLG +YRL+
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        TT++V++GL AK+ +K K+     E K  + +      IN    +  ++++        +  SR WQPSLQSISEGGS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

AT5G50090.2 unknown protein4.0e-3245.25Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQA+DTA ++IQHPNGK ++L  PV+A  +MK NPGH V+LLIST    S                +S +   +RLTRIKLL+PTD+LVLG +YRL+
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEI-NQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        TT++V++GL AK+ +K K+     + K+   +  +  ++ N+  + ER+R+            SR WQPSLQSISEGGS
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEFEGKMGNSEKGSEGEI-NQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS

AT5G62900.1 unknown protein6.2e-2537.84Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV
        MGNCQA + A+ +IQ P+GK  R Y  VNA E++K++PGH+VALL+S+ V                      +  S+R+TRIKLL+P+D+L+LG +YRL+
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLV

Query:  TTQDVLQGLKAKQEAKKKRNLLEF---EGKMG----NSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS
        ++++V++G++AK+  K K+   EF   E ++      SE  S+ +  + +  ++  +    +T     K R WQPSLQSISE  S
Subjt:  TTQDVLQGLKAKQEAKKKRNLLEF---EGKMG----NSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAATTGCCAAGCCATAGACACAGCTTCTCTAATAATCCAACACCCAAATGGAAAAGTCGACAGACTTTACTGGCCGGTAAACGCCGGAGAGATCATGAAAACAAA
TCCCGGCCATTACGTTGCTCTTCTCATCTCCACAAAAGTTTGCCAATCGGAAACCACATCCACCCACCATCGTCGTCGTGATAACGATACTCAAACCAACAGTACAAATT
TCAACTCGGTTCGGCTGACCCGAATCAAGCTTCTGAAGCCTACCGATTCCCTCGTTCTCGGCCAAATTTACCGACTCGTCACAACCCAAGATGTTTTGCAGGGGTTGAAA
GCGAAACAGGAAGCGAAAAAGAAGAGGAATTTGTTGGAGTTTGAAGGAAAAATGGGGAACTCGGAGAAGGGATCTGAAGGGGAAATTAATCAGGGGATGAAGAATGAGAG
AAACAGAGTGAAGAAATGCAATTCAACAGTATCAACGGCGGCGAAATCGAGAGGGTGGCAGCCATCACTGCAGAGCATTTCTGAAGGGGGAAGTTGA
mRNA sequenceShow/hide mRNA sequence
CCAATTCAATCTTCTTCTTCTTCTTCTTCTTCGACCTCGCCGCCGTAGCTTCCACTTGTTTAGAGAGACAAAAAACTGATTACCAGTAGAGAGAGAAACGAACCTCATGG
GAAATTGCCAAGCCATAGACACAGCTTCTCTAATAATCCAACACCCAAATGGAAAAGTCGACAGACTTTACTGGCCGGTAAACGCCGGAGAGATCATGAAAACAAATCCC
GGCCATTACGTTGCTCTTCTCATCTCCACAAAAGTTTGCCAATCGGAAACCACATCCACCCACCATCGTCGTCGTGATAACGATACTCAAACCAACAGTACAAATTTCAA
CTCGGTTCGGCTGACCCGAATCAAGCTTCTGAAGCCTACCGATTCCCTCGTTCTCGGCCAAATTTACCGACTCGTCACAACCCAAGATGTTTTGCAGGGGTTGAAAGCGA
AACAGGAAGCGAAAAAGAAGAGGAATTTGTTGGAGTTTGAAGGAAAAATGGGGAACTCGGAGAAGGGATCTGAAGGGGAAATTAATCAGGGGATGAAGAATGAGAGAAAC
AGAGTGAAGAAATGCAATTCAACAGTATCAACGGCGGCGAAATCGAGAGGGTGGCAGCCATCACTGCAGAGCATTTCTGAAGGGGGAAGTTGAGGATTTGATATTTGAGA
CTAAGAACATAATAACGATTCAATGAACACAGATTTGTGGTTTTTTTTTTCTTTTGGGGGGAGTTTTTAGAGTGAGATTTGTGCATAAGAAAATGAGTAGTTAATTATAG
AACAGAACAGAGAAAGAAGGAGAGAAAAAAAGAAAAACAAAAGATGGATGAAATATGAAATATTGTAGTTGGAATTTCATATTAAGGAATTGAAATTGGGTTTTGGAAGA
TTCAAACCATTCCTACTTGTTTCCTCTTGTGGTTGATTAGCATTATTTCCTATTTTTATTTTCTTTTATCCTAAAATGATGAAAATAAATATATGGTTC
Protein sequenceShow/hide protein sequence
MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMKTNPGHYVALLISTKVCQSETTSTHHRRRDNDTQTNSTNFNSVRLTRIKLLKPTDSLVLGQIYRLVTTQDVLQGLK
AKQEAKKKRNLLEFEGKMGNSEKGSEGEINQGMKNERNRVKKCNSTVSTAAKSRGWQPSLQSISEGGS