; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G009390 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G009390
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionUnknown protein
Genome locationCG_Chr08:22028020..22030764
RNA-Seq ExpressionClCG08G009390
SyntenyClCG08G009390
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038455.1 uncharacterized protein E6C27_scaffold119G00220 [Cucumis melo var. makuwa]4.2e-11679.93Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRSV+YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFG
         GV E+ LH L PSELSNFAINLEKRSMHL                      ER+PTDEG EYS QPSV SERSFVTLMNNDFG
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFG

XP_008464851.1 PREDICTED: uncharacterized protein LOC103502625 isoform X1 [Cucumis melo]2.7e-10785.71Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRS++YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         GV E+VLH L PSELSNFAINLEKRSMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

XP_008464853.1 PREDICTED: uncharacterized protein LOC103502625 isoform X2 [Cucumis melo]1.9e-10585.29Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRS++YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         GV E+ LH L PSELSNFAINLEKRSMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

XP_022945996.1 uncharacterized protein LOC111450215 [Cucurbita moschata]4.8e-10487.39Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GMGSCETHLSMYKSKQSPIA KKVALRDVQNDNRSV+YNYPETSCSLGGKLVNGSK+SGSKRSNPTCSPSSAIHQSFKGIGVNEH IYA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRA GGSTSCAPAFSSL AASPMAFSPVRSSLPI  EKPGNFLAVAGSNLLGI PGLE L  VDSNGITDEQRTERLFNLQ LLKH DE D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         G  EL LH L PSELS  AINLEK+SMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

XP_038884507.1 uncharacterized protein LOC120075309 [Benincasa hispida]6.0e-10788.66Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GMGSCETHLSMY++KQSPIAQKKVALRDVQNDNRS+MYNYPETSCSLGGKLVNGSKLSGSKRSNPT SPSSAIHQSFKGIGVNEH IYASG
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSSL+AASPMA SPVRSSLPI  EKPGNFLAVAGS+LLGIPPG E LR VDSNGITDEQRTERLFNLQK LKH DE DR
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         GV E  LH L PSELSNFAINLEKRSM+LSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

TrEMBL top hitse value%identityAlignment
A0A0A0KCZ3 Uncharacterized protein1.8e-11273.44Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDS+LNS GM SCETHLSMY+SKQSPIAQKKVALRDVQNDNRSV+YNYPETSC+LGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASG                               EKPGNFLAVAGSNLLGI PGLE LR  DSNGITDEQR+ERLF+LQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFGYEPDERAICHFCHDYA
         GV E+ LH L PSELSNFAINLEKRSMHLSVEE          ANRFMFFRER+PTDEG EYS QPSV SERSFVTLMNNDFG E DERAI HFC +  
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFGYEPDERAICHFCHDYA

Query:  RGRTS
         G+ S
Subjt:  RGRTS

A0A1S3CME9 uncharacterized protein LOC103502625 isoform X11.3e-10785.71Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRS++YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         GV E+VLH L PSELSNFAINLEKRSMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

A0A1S3CMI7 uncharacterized protein LOC103502625 isoform X29.4e-10685.29Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRS++YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         GV E+ LH L PSELSNFAINLEKRSMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

A0A5A7TAR2 Uncharacterized protein2.0e-11679.93Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRSV+YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFG
         GV E+ LH L PSELSNFAINLEKRSMHL                      ER+PTDEG EYS QPSV SERSFVTLMNNDFG
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFG

A0A6J1G2G3 uncharacterized protein LOC1114502152.3e-10487.39Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GMGSCETHLSMYKSKQSPIA KKVALRDVQNDNRSV+YNYPETSCSLGGKLVNGSK+SGSKRSNPTCSPSSAIHQSFKGIGVNEH IYA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRA GGSTSCAPAFSSL AASPMAFSPVRSSLPI  EKPGNFLAVAGSNLLGI PGLE L  VDSNGITDEQRTERLFNLQ LLKH DE D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         G  EL LH L PSELS  AINLEK+SMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGATAGCAAGTTGAACAGCAGTGGAATGGGCAGCTGCGAAACTCATTTGTCTATGTATAAGAGCAAGCAGTCACCAATTGCACAGAAAAAGGTTGCTTTA
AGGGATGTGCAGAATGATAATAGGAGTGTCATGTATAACTATCCTGAAACTTCCTGTTCTTTGGGCGGAAAACTTGTGAATGGGAGTAAGCTTTCAGGAAGTAAG
AGATCCAACCCTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGGATTGGGGTCAATGAGCACGCCATTTATGCCAGCGGAGAAGTCGATGTGAAG
CCTGGCAAAAAAAGAGCATCAGGAGGTAGCACATCTTGTGCACCTGCATTTTCTTCTCTTATTGCAGCCTCTCCGATGGCATTTTCACCTGTTAGGTCTTCACTT
CCCATTTCCGCAGAAAAGCCTGGTAATTTTCTGGCAGTTGCTGGATCCAATCTTCTGGGAATCCCTCCTGGTTTGGAGACTCTTCGCTTTGTTGATTCAAATGGG
ATTACTGATGAGCAGAGAACAGAGCGTTTATTCAATCTGCAGAAGCTCCTAAAACATTATGACGAGTTGGACCGAAGCGGCGTCAATGAGTTAGTGCTCCATAGT
TTACTTCCCTCTGAGCTCAGCAATTTTGCCATTAATCTGGAAAAGAGATCCATGCACCTGTCAGTAGAGGAAGCCAATTTTATCACTTGGATTGAACAAGTAGCT
AACAGGTTTATGTTCTTCAGGGAAAGAAATCCAACGGATGAAGGCTCTGAATATTCTGGGCAACCTTCAGTGACATCTGAAAGAAGTTTTGTTACACTAATGAAT
AACGATTTTGGATACGAACCAGATGAGAGGGCCATTTGCCATTTCTGCCATGATTACGCGAGAGGACGAACTAGTTAA
mRNA sequenceShow/hide mRNA sequence
ACGACCGCTACTTTTAAATGAATTTTCAAAAAACAAAATTGAAAAAAGAAAACCCAACTTGTTCGAGCCAAAGTTTTCCCTTTTGAACTCCCAGTCTCCCTCTCG
TCGGTTCTCAGCTCCCCCCTTCCTCCGTTGGATACCAGAAGCTTTGCCCACCGCCTTCCTTTTCTTCTCATCGATGCGTCCACATAAATAGAATGAATGCAGGAA
ACACTGTTTTAATCCTTCTAACTTTTTGGCTGTGGATGATTTTGATTTCGTTTGATGTTGCTGCAGAAAATTGCAACCGTCTTTACTATGGACTTTTCTAGGAGA
TAACTTGAAATTGTTATTGGTTGAAGCTCTTCAAGTTTTCTAAATGATTGATAGCAAGTTGAACAGCAGTGGAATGGGCAGCTGCGAAACTCATTTGTCTATGTA
TAAGAGCAAGCAGTCACCAATTGCACAGAAAAAGGTTGCTTTAAGGGATGTGCAGAATGATAATAGGAGTGTCATGTATAACTATCCTGAAACTTCCTGTTCTTT
GGGCGGAAAACTTGTGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCAACCCTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGGATTGGGGT
CAATGAGCACGCCATTTATGCCAGCGGAGAAGTCGATGTGAAGCCTGGCAAAAAAAGAGCATCAGGAGGTAGCACATCTTGTGCACCTGCATTTTCTTCTCTTAT
TGCAGCCTCTCCGATGGCATTTTCACCTGTTAGGTCTTCACTTCCCATTTCCGCAGAAAAGCCTGGTAATTTTCTGGCAGTTGCTGGATCCAATCTTCTGGGAAT
CCCTCCTGGTTTGGAGACTCTTCGCTTTGTTGATTCAAATGGGATTACTGATGAGCAGAGAACAGAGCGTTTATTCAATCTGCAGAAGCTCCTAAAACATTATGA
CGAGTTGGACCGAAGCGGCGTCAATGAGTTAGTGCTCCATAGTTTACTTCCCTCTGAGCTCAGCAATTTTGCCATTAATCTGGAAAAGAGATCCATGCACCTGTC
AGTAGAGGAAGCCAATTTTATCACTTGGATTGAACAAGTAGCTAACAGGTTTATGTTCTTCAGGGAAAGAAATCCAACGGATGAAGGCTCTGAATATTCTGGGCA
ACCTTCAGTGACATCTGAAAGAAGTTTTGTTACACTAATGAATAACGATTTTGGATACGAACCAGATGAGAGGGCCATTTGCCATTTCTGCCATGATTACGCGAG
AGGACGAACTAGTTAA
Protein sequenceShow/hide protein sequence
MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASGEVDVK
PGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDRSGVNELVLHS
LLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFGYEPDERAICHFCHDYARGRTS