; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006993 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006993
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold10:39345522..39347675
RNA-Seq ExpressionSpg006993
SyntenySpg006993
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]4.7e-2433.85Show/hide
Query:  FVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNPSAKQMKKALKLVANKGVQWKESQT
        F++R+I Q+ W+  C HP   +VPLVREFY+ L + +     V+   V F++  IN ++ ++  ++    D     + +Q++  L  VA +G  W+ S  
Subjt:  FVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNPSAKQMKKALKLVANKGVQWKESQT

Query:  KVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILAC--GRKRAGKLFFGSFITQLCQRVKIVPGQDE
           + +  +LK  + +W HF+    MP+TH  T++ DRV+LLY ++ G+ +N+  I   EI AC   RKR G L+F S ITQL  +  +   +DE
Subjt:  KVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILAC--GRKRAGKLFFGSFITQLCQRVKIVPGQDE

PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]1.9e-2533.03Show/hide
Query:  EKGFSNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRN--PSAKQMKKAL
        E+GF  +  A  E +   + + KW+   + P+  V+PLVREFY+   E      +VRG+ V F SV IN +Y I  P+ L   D   N   +    ++  
Subjt:  EKGFSNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRN--PSAKQMKKAL

Query:  KLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSFITQLCQRV
        + +   G QWK ++ +  S   + L   + +WL FI   ++PT H   ++ DR +LLYC+M G   +VG II D I+         L+F S IT+LC R 
Subjt:  KLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSFITQLCQRV

Query:  KIVPGQDEECHFFKPTID
         +   + EE  F +  ID
Subjt:  KIVPGQDEECHFFKPTID

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]3.0e-3437.62Show/hide
Query:  MKKRDFLNEKGF---SNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNP
        ++ R    EKGF   ++     L F++++I+Q+ W+  C+HP++ +VPLVREFY+ L +   +   VRG  VS+S   IN V+ +  P++   ++ I N 
Subjt:  MKKRDFLNEKGF---SNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNP

Query:  SAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGS
        +   +   L+ VA  G +W  S     + + S L P + VW HF+K+HL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF S
Subjt:  SAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGS

Query:  FITQLCQRVK
         IT+LC+  +
Subjt:  FITQLCQRVK

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]6.6e-3437.14Show/hide
Query:  MKKRDFLNEKGF---SNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNP
        ++ R    EKGF   ++     L F++++I+Q+ W+  C+HP++ +VPLVREFY+ L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N 
Subjt:  MKKRDFLNEKGF---SNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNP

Query:  SAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGS
        + + +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF S
Subjt:  SAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGS

Query:  FITQLCQRVK
         IT+LC+  +
Subjt:  FITQLCQRVK

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]3.7e-2932.76Show/hide
Query:  ASPRNLFPEEFRDVNFQERMEIMKKRDFLNEKGFSNRAGALLE---FVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVD
        AS    F  +  ++ ++E ++    R    EK F       LE   F++ +I Q+ WQ  C+HP++ +VPLVREFY+ +         +RG  V  S   
Subjt:  ASPRNLFPEEFRDVNFQERMEIMKKRDFLNEKGFSNRAGALLE---FVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVD

Query:  INRVYRIKAPLNLRGNDVIRNPSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIKAPLNLRGNDVIRNPSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG

Query:  SIIRDEILACGRKRAGKLFFGSFITQLCQRVK
         +I  EI AC  +++G LFF S IT +C+  +
Subjt:  SIIRDEILACGRKRAGKLFFGSFITQLCQRVK

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein9.3e-2633.03Show/hide
Query:  EKGFSNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRN--PSAKQMKKAL
        E+GF  +  A  E +   + + KW+   + P+  V+PLVREFY+   E      +VRG+ V F SV IN +Y I  P+ L   D   N   +    ++  
Subjt:  EKGFSNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRN--PSAKQMKKAL

Query:  KLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSFITQLCQRV
        + +   G QWK ++ +  S   + L   + +WL FI   ++PT H   ++ DR +LLYC+M G   +VG II D I+         L+F S IT+LC R 
Subjt:  KLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSFITQLCQRV

Query:  KIVPGQDEECHFFKPTID
         +   + EE  F +  ID
Subjt:  KIVPGQDEECHFFKPTID

A0A2P5AGA5 Uncharacterized protein (Fragment)1.4e-3437.62Show/hide
Query:  MKKRDFLNEKGF---SNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNP
        ++ R    EKGF   ++     L F++++I+Q+ W+  C+HP++ +VPLVREFY+ L +   +   VRG  VS+S   IN V+ +  P++   ++ I N 
Subjt:  MKKRDFLNEKGF---SNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNP

Query:  SAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGS
        +   +   L+ VA  G +W  S     + + S L P + VW HF+K+HL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF S
Subjt:  SAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGS

Query:  FITQLCQRVK
         IT+LC+  +
Subjt:  FITQLCQRVK

A0A2P5BCG4 Uncharacterized protein (Fragment)3.2e-3437.14Show/hide
Query:  MKKRDFLNEKGF---SNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNP
        ++ R    EKGF   ++     L F++++I+Q+ W+  C+HP++ +VPLVREFY+ L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N 
Subjt:  MKKRDFLNEKGF---SNRAGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNP

Query:  SAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGS
        + + +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF S
Subjt:  SAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGS

Query:  FITQLCQRVK
         IT+LC+  +
Subjt:  FITQLCQRVK

A0A2P5DAQ2 Uncharacterized protein1.8e-2932.76Show/hide
Query:  ASPRNLFPEEFRDVNFQERMEIMKKRDFLNEKGFSNRAGALLE---FVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVD
        AS    F  +  ++ ++E ++    R    EK F       LE   F++ +I Q+ WQ  C+HP++ +VPLVREFY+ +         +RG  V  S   
Subjt:  ASPRNLFPEEFRDVNFQERMEIMKKRDFLNEKGFSNRAGALLE---FVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVD

Query:  INRVYRIKAPLNLRGNDVIRNPSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIKAPLNLRGNDVIRNPSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG

Query:  SIIRDEILACGRKRAGKLFFGSFITQLCQRVK
         +I  EI AC  +++G LFF S IT +C+  +
Subjt:  SIIRDEILACGRKRAGKLFFGSFITQLCQRVK

W9QTD9 Uncharacterized protein2.3e-2433.85Show/hide
Query:  FVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNPSAKQMKKALKLVANKGVQWKESQT
        F++R+I Q+ W+  C HP   +VPLVREFY+ L + +     V+   V F++  IN ++ ++  ++    D     + +Q++  L  VA +G  W+ S  
Subjt:  FVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNPSAKQMKKALKLVANKGVQWKESQT

Query:  KVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILAC--GRKRAGKLFFGSFITQLCQRVKIVPGQDE
           + +  +LK  + +W HF+    MP+TH  T++ DRV+LLY ++ G+ +N+  I   EI AC   RKR G L+F S ITQL  +  +   +DE
Subjt:  KVKSLVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILAC--GRKRAGKLFFGSFITQLCQRVKIVPGQDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGCACACGAAGGACGAAACCCATGGGATTCTCGCTGGCAGTCATGAACCAAGCGCCCAACGTTCCAACTCCATCCTCTTCGACAATGCCGGCTAGTTCGAGGGA
GATGTCGAGTTCATCTACGGTAAGAAGGTTCACACGCGCCGCCGCCGTCCATCAAACCCAAAAGCCCACCGCTCAAAAGTTCAAGAAACGTTCACGAGAGTGGTTTGCAA
TGATCCGAGAGATGGGAGTCAAGAGACGTGCTGCCCTTGAAGAAGAAGGAAGTAGGCAAGATGAAGAAAAGGCCGCCAAGGCAGCTGAAAGCTCTCCGCAAGGAGAAGCT
TCAATGGGTAAGGTTTCCAAACCTTCAACTAACCCTTCTCTATCTTGCAGAATCAAACCCGTTGTTACTTACAGTGCAAGAAAGAGGAGGCCGAAGAAAAATGTGTCCGA
AAACCCGCTCGAGATTAAGCCCCTCAAAACCGCAAGGATGCCTCCCGATGTATTCGAAGGAATAATCCGCCAAGCTGTGGCAAAGGCCCTTGAGATTGCTGAAGGGTATA
AGGCTGAACAGGATGCTTTGAAAGAGCTCCTTGAGGAAGAGAAAAAGAGAAGAGAAGAAATAAAAGAAAATGAAAAAAGAAGAAAGGAAGCTGAAGACTTCCTTGCAGCC
TTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAAGCACTGCAAGGAAAGGTAGAAGAAAAGGCCCAACAAGGTCCAACAGAAGAAAATTTTGAAAAAGAAAAAGAAAG
AGAAGTGGAGAATGAAGGCCAGAATGTGACAGCATCGGGGCCGCATTCTGAGGAAGGCCTAGCCGAGGCCAATAAAGAGCAACCTGCTGAGGAGGTCTTTGAACCTCTAT
TTACAAATGACCCACCAGCAGCTGATAGCACATCTTCAGGAGAGAAGAGGGAAGAAGAGGAAAAAGAAGACGTGGAGGCCGAGACCTCCAGTGATTCTGACTCTGATACA
GAATTTGACTCAGAGATAAGGGAGTTAGATGGCAACCAAGTCCCTATCTCTGCAGCATTGAGAAGAAAGAGAAAGAGAGAAATTAAGGCTGAGAGGAGGACAAAGAACAA
GAATGACCCGATATTTTCCAAGAGGCCGAGGACAAGGTCCATTGACGTCTCTCTTGCAATTCCTTTGACCGTCTCACCCGCCAAGCAAAAGGGCAAGTCACCCAAGGCTG
CATCTCCCAGAAATCTGTTCCCTGAGGAATTTAGAGATGTTAATTTTCAGGAACGAATGGAGATCATGAAGAAAAGAGATTTCCTCAACGAGAAGGGATTCTCTAACCGA
GCAGGAGCACTGCTAGAGTTTGTGAGCAGGATCATATCTCAATACAAGTGGCAGGACTTATGTTCTCACCCTCAGGAGGCTGTTGTGCCTCTAGTTCGAGAATTTTACTC
TGGCCTAAGGGAGGAGAGCATTAGCATGGCAGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATCAATAGGGTGTACAGGATCAAAGCACCCCTGAATCTGA
GAGGGAATGATGTGATAAGGAACCCTTCAGCCAAGCAGATGAAGAAAGCATTGAAACTTGTGGCCAACAAGGGGGTTCAATGGAAAGAATCGCAGACGAAAGTGAAGTCT
TTAGTGCCAAGCGATTTAAAGCCAGAATCGGCAGTTTGGCTTCACTTCATCAAGAACCACTTGATGCCAACCACCCATGACAGCACAATTTCAGTGGATAGAGTGATGCT
ACTCTATTGCCTTATGAAGGGGTTGGAAATCAATGTAGGGAGCATAATCAGGGACGAGATCTTAGCCTGTGGGAGAAAGCGAGCAGGCAAGCTTTTCTTTGGATCATTCA
TCACCCAACTCTGCCAAAGGGTGAAGATCGTTCCAGGCCAGGACGAGGAGTGTCACTTCTTCAAGCCTACCATCGACCTGTCCTTGATCAGGAAGCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGGCACACGAAGGACGAAACCCATGGGATTCTCGCTGGCAGTCATGAACCAAGCGCCCAACGTTCCAACTCCATCCTCTTCGACAATGCCGGCTAGTTCGAGGGA
GATGTCGAGTTCATCTACGGTAAGAAGGTTCACACGCGCCGCCGCCGTCCATCAAACCCAAAAGCCCACCGCTCAAAAGTTCAAGAAACGTTCACGAGAGTGGTTTGCAA
TGATCCGAGAGATGGGAGTCAAGAGACGTGCTGCCCTTGAAGAAGAAGGAAGTAGGCAAGATGAAGAAAAGGCCGCCAAGGCAGCTGAAAGCTCTCCGCAAGGAGAAGCT
TCAATGGGTAAGGTTTCCAAACCTTCAACTAACCCTTCTCTATCTTGCAGAATCAAACCCGTTGTTACTTACAGTGCAAGAAAGAGGAGGCCGAAGAAAAATGTGTCCGA
AAACCCGCTCGAGATTAAGCCCCTCAAAACCGCAAGGATGCCTCCCGATGTATTCGAAGGAATAATCCGCCAAGCTGTGGCAAAGGCCCTTGAGATTGCTGAAGGGTATA
AGGCTGAACAGGATGCTTTGAAAGAGCTCCTTGAGGAAGAGAAAAAGAGAAGAGAAGAAATAAAAGAAAATGAAAAAAGAAGAAAGGAAGCTGAAGACTTCCTTGCAGCC
TTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAAGCACTGCAAGGAAAGGTAGAAGAAAAGGCCCAACAAGGTCCAACAGAAGAAAATTTTGAAAAAGAAAAAGAAAG
AGAAGTGGAGAATGAAGGCCAGAATGTGACAGCATCGGGGCCGCATTCTGAGGAAGGCCTAGCCGAGGCCAATAAAGAGCAACCTGCTGAGGAGGTCTTTGAACCTCTAT
TTACAAATGACCCACCAGCAGCTGATAGCACATCTTCAGGAGAGAAGAGGGAAGAAGAGGAAAAAGAAGACGTGGAGGCCGAGACCTCCAGTGATTCTGACTCTGATACA
GAATTTGACTCAGAGATAAGGGAGTTAGATGGCAACCAAGTCCCTATCTCTGCAGCATTGAGAAGAAAGAGAAAGAGAGAAATTAAGGCTGAGAGGAGGACAAAGAACAA
GAATGACCCGATATTTTCCAAGAGGCCGAGGACAAGGTCCATTGACGTCTCTCTTGCAATTCCTTTGACCGTCTCACCCGCCAAGCAAAAGGGCAAGTCACCCAAGGCTG
CATCTCCCAGAAATCTGTTCCCTGAGGAATTTAGAGATGTTAATTTTCAGGAACGAATGGAGATCATGAAGAAAAGAGATTTCCTCAACGAGAAGGGATTCTCTAACCGA
GCAGGAGCACTGCTAGAGTTTGTGAGCAGGATCATATCTCAATACAAGTGGCAGGACTTATGTTCTCACCCTCAGGAGGCTGTTGTGCCTCTAGTTCGAGAATTTTACTC
TGGCCTAAGGGAGGAGAGCATTAGCATGGCAGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATCAATAGGGTGTACAGGATCAAAGCACCCCTGAATCTGA
GAGGGAATGATGTGATAAGGAACCCTTCAGCCAAGCAGATGAAGAAAGCATTGAAACTTGTGGCCAACAAGGGGGTTCAATGGAAAGAATCGCAGACGAAAGTGAAGTCT
TTAGTGCCAAGCGATTTAAAGCCAGAATCGGCAGTTTGGCTTCACTTCATCAAGAACCACTTGATGCCAACCACCCATGACAGCACAATTTCAGTGGATAGAGTGATGCT
ACTCTATTGCCTTATGAAGGGGTTGGAAATCAATGTAGGGAGCATAATCAGGGACGAGATCTTAGCCTGTGGGAGAAAGCGAGCAGGCAAGCTTTTCTTTGGATCATTCA
TCACCCAACTCTGCCAAAGGGTGAAGATCGTTCCAGGCCAGGACGAGGAGTGTCACTTCTTCAAGCCTACCATCGACCTGTCCTTGATCAGGAAGCTCTAG
Protein sequenceShow/hide protein sequence
MQGTRRTKPMGFSLAVMNQAPNVPTPSSSTMPASSREMSSSSTVRRFTRAAAVHQTQKPTAQKFKKRSREWFAMIREMGVKRRAALEEEGSRQDEEKAAKAAESSPQGEA
SMGKVSKPSTNPSLSCRIKPVVTYSARKRRPKKNVSENPLEIKPLKTARMPPDVFEGIIRQAVAKALEIAEGYKAEQDALKELLEEEKKRREEIKENEKRRKEAEDFLAA
FEPLHKAQSEAEALQGKVEEKAQQGPTEENFEKEKEREVENEGQNVTASGPHSEEGLAEANKEQPAEEVFEPLFTNDPPAADSTSSGEKREEEEKEDVEAETSSDSDSDT
EFDSEIRELDGNQVPISAALRRKRKREIKAERRTKNKNDPIFSKRPRTRSIDVSLAIPLTVSPAKQKGKSPKAASPRNLFPEEFRDVNFQERMEIMKKRDFLNEKGFSNR
AGALLEFVSRIISQYKWQDLCSHPQEAVVPLVREFYSGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNDVIRNPSAKQMKKALKLVANKGVQWKESQTKVKS
LVPSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSFITQLCQRVKIVPGQDEECHFFKPTIDLSLIRKL