; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G09660 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G09660
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUnknown protein
Genome locationClcChr08:21020505..21023249
RNA-Seq ExpressionClc08G09660
SyntenyClc08G09660
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038455.1 uncharacterized protein E6C27_scaffold119G00220 [Cucumis melo var. makuwa]4.2e-11679.93Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRSV+YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFG
         GV E+ LH L PSELSNFAINLEKRSMHL                      ER+PTDEG EYS QPSV SERSFVTLMNNDFG
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFG

XP_008464851.1 PREDICTED: uncharacterized protein LOC103502625 isoform X1 [Cucumis melo]2.7e-10785.71Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRS++YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         GV E+VLH L PSELSNFAINLEKRSMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

XP_008464853.1 PREDICTED: uncharacterized protein LOC103502625 isoform X2 [Cucumis melo]1.9e-10585.29Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRS++YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         GV E+ LH L PSELSNFAINLEKRSMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

XP_022945996.1 uncharacterized protein LOC111450215 [Cucurbita moschata]4.8e-10487.39Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GMGSCETHLSMYKSKQSPIA KKVALRDVQNDNRSV+YNYPETSCSLGGKLVNGSK+SGSKRSNPTCSPSSAIHQSFKGIGVNEH IYA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRA GGSTSCAPAFSSL AASPMAFSPVRSSLPI  EKPGNFLAVAGSNLLGI PGLE L  VDSNGITDEQRTERLFNLQ LLKH DE D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         G  EL LH L PSELS  AINLEK+SMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

XP_038884507.1 uncharacterized protein LOC120075309 [Benincasa hispida]6.0e-10788.66Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GMGSCETHLSMY++KQSPIAQKKVALRDVQNDNRS+MYNYPETSCSLGGKLVNGSKLSGSKRSNPT SPSSAIHQSFKGIGVNEH IYASG
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSSL+AASPMA SPVRSSLPI  EKPGNFLAVAGS+LLGIPPG E LR VDSNGITDEQRTERLFNLQK LKH DE DR
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         GV E  LH L PSELSNFAINLEKRSM+LSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

TrEMBL top hitse value%identityAlignment
A0A0A0KCZ3 Uncharacterized protein1.8e-11273.44Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDS+LNS GM SCETHLSMY+SKQSPIAQKKVALRDVQNDNRSV+YNYPETSC+LGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASG                               EKPGNFLAVAGSNLLGI PGLE LR  DSNGITDEQR+ERLF+LQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFGYEPDERAICHFCHDYA
         GV E+ LH L PSELSNFAINLEKRSMHLSVEE          ANRFMFFRER+PTDEG EYS QPSV SERSFVTLMNNDFG E DERAI HFC +  
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFGYEPDERAICHFCHDYA

Query:  RGRTS
         G+ S
Subjt:  RGRTS

A0A1S3CME9 uncharacterized protein LOC103502625 isoform X11.3e-10785.71Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRS++YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         GV E+VLH L PSELSNFAINLEKRSMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

A0A1S3CMI7 uncharacterized protein LOC103502625 isoform X29.4e-10685.29Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRS++YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         GV E+ LH L PSELSNFAINLEKRSMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

A0A5A7TAR2 Uncharacterized protein2.0e-11679.93Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GM SCET+L +Y+SKQSPIAQKKVALRDVQNDNRSV+YNYPETSCSLGGKL+NGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEH +YA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRASGGSTSCAPAFSS +A SPMAFSPVRSS PI  EKPGNFLAV GSNLLGIPPGLE LR  DSNGITDEQRTERLFNLQKLLKH+D+ D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFG
         GV E+ LH L PSELSNFAINLEKRSMHL                      ER+PTDEG EYS QPSV SERSFVTLMNNDFG
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFG

A0A6J1G2G3 uncharacterized protein LOC1114502152.3e-10487.39Show/hide
Query:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG
        MIDSKLNS GMGSCETHLSMYKSKQSPIA KKVALRDVQNDNRSV+YNYPETSCSLGGKLVNGSK+SGSKRSNPTCSPSSAIHQSFKGIGVNEH IYA+G
Subjt:  MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASG

Query:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR
        EVDVKPGKKRA GGSTSCAPAFSSL AASPMAFSPVRSSLPI  EKPGNFLAVAGSNLLGI PGLE L  VDSNGITDEQRTERLFNLQ LLKH DE D+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDR

Query:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI
         G  EL LH L PSELS  AINLEK+SMHLSVEE   I
Subjt:  SGVNELVLHSLLPSELSNFAINLEKRSMHLSVEEANFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGATAGCAAGTTGAACAGCAGTGGAATGGGCAGCTGCGAAACTCATTTGTCTATGTATAAGAGCAAGCAGTCACCAATTGCACAGAAAAAGGTTGCTTTAAGGGA
TGTGCAGAATGATAATAGGAGTGTCATGTATAACTATCCTGAAACTTCCTGTTCTTTGGGCGGAAAACTTGTGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCAACC
CTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGGATTGGGGTCAATGAGCACGCCATTTATGCCAGCGGAGAAGTCGATGTGAAGCCTGGCAAAAAAAGA
GCATCAGGAGGTAGCACATCTTGTGCACCTGCATTTTCTTCTCTTATTGCAGCCTCTCCGATGGCATTTTCACCTGTTAGGTCTTCACTTCCCATTTCCGCAGAAAAGCC
TGGTAATTTTCTGGCAGTTGCTGGATCCAATCTTCTGGGAATCCCTCCTGGTTTGGAGACTCTTCGCTTTGTTGATTCAAATGGGATTACTGATGAGCAGAGAACAGAGC
GTTTATTCAATCTGCAGAAGCTCCTAAAACATTATGACGAGTTGGACCGAAGCGGCGTCAATGAGTTAGTGCTCCATAGTTTACTTCCCTCTGAGCTCAGCAATTTTGCC
ATTAATCTGGAAAAGAGATCCATGCACCTGTCAGTAGAGGAAGCCAATTTTATCACTTGGATTGAACAAGTAGCTAACAGGTTTATGTTCTTCAGGGAAAGAAATCCAAC
GGATGAAGGCTCTGAATATTCTGGGCAACCTTCAGTGACATCTGAAAGAAGTTTTGTTACACTAATGAATAACGATTTTGGATACGAACCAGATGAGAGGGCCATTTGCC
ATTTCTGCCATGATTACGCGAGAGGACGAACTAGTTAA
mRNA sequenceShow/hide mRNA sequence
ACGACCGCTACTTTTAAATGAATTTTCAAAAAACAAAATTGAAAAAAGAAAACCCAACTTGTTCGAGCCAAAGTTTTCCCTTTTGAACTCCCAGTCTCCCTCTCGTCGGT
TCTCAGCTCCCCCCTTCCTCCGTTGGATACCAGAAGCTTTGCCCACCGCCTTCCTTTTCTTCTCATCGATGCGTCCACATAAATAGAATGAATGCAGGAAACACTGTTTT
AATCCTTCTAACTTTTTGGCTGTGGATGATTTTGATTTCGTTTGATGTTGCTGCAGAAAATTGCAACCGTCTTTACTATGGACTTTTCTAGGAGATAACTTGAAATTGTT
ATTGGTTGAAGCTCTTCAAGTTTTCTAAATGATTGATAGCAAGTTGAACAGCAGTGGAATGGGCAGCTGCGAAACTCATTTGTCTATGTATAAGAGCAAGCAGTCACCAA
TTGCACAGAAAAAGGTTGCTTTAAGGGATGTGCAGAATGATAATAGGAGTGTCATGTATAACTATCCTGAAACTTCCTGTTCTTTGGGCGGAAAACTTGTGAATGGGAGT
AAGCTTTCAGGAAGTAAGAGATCCAACCCTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGGATTGGGGTCAATGAGCACGCCATTTATGCCAGCGGAGA
AGTCGATGTGAAGCCTGGCAAAAAAAGAGCATCAGGAGGTAGCACATCTTGTGCACCTGCATTTTCTTCTCTTATTGCAGCCTCTCCGATGGCATTTTCACCTGTTAGGT
CTTCACTTCCCATTTCCGCAGAAAAGCCTGGTAATTTTCTGGCAGTTGCTGGATCCAATCTTCTGGGAATCCCTCCTGGTTTGGAGACTCTTCGCTTTGTTGATTCAAAT
GGGATTACTGATGAGCAGAGAACAGAGCGTTTATTCAATCTGCAGAAGCTCCTAAAACATTATGACGAGTTGGACCGAAGCGGCGTCAATGAGTTAGTGCTCCATAGTTT
ACTTCCCTCTGAGCTCAGCAATTTTGCCATTAATCTGGAAAAGAGATCCATGCACCTGTCAGTAGAGGAAGCCAATTTTATCACTTGGATTGAACAAGTAGCTAACAGGT
TTATGTTCTTCAGGGAAAGAAATCCAACGGATGAAGGCTCTGAATATTCTGGGCAACCTTCAGTGACATCTGAAAGAAGTTTTGTTACACTAATGAATAACGATTTTGGA
TACGAACCAGATGAGAGGGCCATTTGCCATTTCTGCCATGATTACGCGAGAGGACGAACTAGTTAA
Protein sequenceShow/hide protein sequence
MIDSKLNSSGMGSCETHLSMYKSKQSPIAQKKVALRDVQNDNRSVMYNYPETSCSLGGKLVNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHAIYASGEVDVKPGKKR
ASGGSTSCAPAFSSLIAASPMAFSPVRSSLPISAEKPGNFLAVAGSNLLGIPPGLETLRFVDSNGITDEQRTERLFNLQKLLKHYDELDRSGVNELVLHSLLPSELSNFA
INLEKRSMHLSVEEANFITWIEQVANRFMFFRERNPTDEGSEYSGQPSVTSERSFVTLMNNDFGYEPDERAICHFCHDYARGRTS