; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G18610 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G18610
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionNAC domain-containing protein
Genome locationctg3379:2128184..2130049
RNA-Seq ExpressionCucsat.G18610
SyntenyCucsat.G18610
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058167.1 uncharacterized protein E6C27_scaffold274G004860 [Cucumis melo var. makuwa]4.59e-14295.87Show/hide
Query:  DVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIVLFKPVNTSFFQSSPLSVSVDSDIISGFKSEFL
        DVGLPPIIEEEEPPE SVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIVLFKP+NTSFFQSSPLSVSVDSDIISGFKSEFL
Subjt:  DVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIVLFKPVNTSFFQSSPLSVSVDSDIISGFKSEFL

Query:  RENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEED-NNLNNSQQGYGYGGMDGANGIHQWHHQQQH
        RENCYDGRVKFGEDDEDMV ENKNLAVVPWVPRLQVPTSSTM+VPQEE APQLMEAEEVGEATMEIEE+ NNLNNSQQGYGYGGMDGAN IHQWHH QQH
Subjt:  RENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEED-NNLNNSQQGYGYGGMDGANGIHQWHHQQQH

Query:  CMIPQLPQQTSSPITWFR
        CMIPQLPQQTSSPITWFR
Subjt:  CMIPQLPQQTSSPITWFR

XP_008453528.1 PREDICTED: uncharacterized protein LOC103494211 [Cucumis melo]1.12e-16596.4Show/hide
Query:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
        MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPE SVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV

Query:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE
        LFKP+NTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMV ENKNLAVVPWVPRLQVPTSSTM+VPQEE APQLMEAEEVGEATMEIEE
Subjt:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE

Query:  D-NNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR
        + NNLNNSQQGYGYGGMDGAN IHQWHH QQHCMIPQLPQQTSSPITWFR
Subjt:  D-NNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR

XP_022134590.1 uncharacterized protein LOC111006819 [Momordica charantia]3.56e-12978.71Show/hide
Query:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
        MKMKMKRKDLDQIND+FSDFSLSSPARKIRRLDVGLPPIIEEEEPPE +VLS+QPL+PE+F     G+RIEEL DASSVS S  AMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV

Query:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE
        LFKPVNT+F QSSPLSVSVDSDIISGF+SE LREN    R+K GEDD++M  EN+NLAVVPWVPR+QVP  + M VPQEE APQ+MEAEE+  ATMEIE+
Subjt:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE

Query:  DNNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR
        D+   +SQQ YGYGGMDGAN IHQWH  QQHCMIPQLPQQTS+PITWFR
Subjt:  DNNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR

XP_031745091.1 uncharacterized protein LOC101205533 [Cucumis sativus]8.26e-17599.2Show/hide
Query:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
        MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV

Query:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE
        LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVK GEDDEDMVIENKNLAVVPWVPRLQVPTSSTM+VPQEEEAPQLMEAEEVGEATMEIEE
Subjt:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE

Query:  DNNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR
        DNNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR
Subjt:  DNNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR

XP_038880770.1 uncharacterized protein LOC120072359 [Benincasa hispida]8.74e-14285.6Show/hide
Query:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
        MKMKMKRKDLDQIND+FSDFSLSSPARKIRRLDVGLPPIIEEEEPPE +VLSK+PLIPE+F V GN +RIEEL + SSVS S YAMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV

Query:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE
        LFKPVN+ F QSSPLSVSVDSDIISGFKSEFLREN YDGRV+F EDDE+M IENKNLAVVPWVPRLQVP  ++M VPQEE APQLMEAEEVGEATMEIEE
Subjt:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE

Query:  DNNLNNSQQGYGY-GGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR
        DN L++SQQ YGY GGMDGAN IHQWH  QQHCMIPQLPQQTSSPITWFR
Subjt:  DNNLNNSQQGYGY-GGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR

TrEMBL top hitse value%identityAlignment
A0A0A0LUV7 Uncharacterized protein4.00e-17599.2Show/hide
Query:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
        MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV

Query:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE
        LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVK GEDDEDMVIENKNLAVVPWVPRLQVPTSSTM+VPQEEEAPQLMEAEEVGEATMEIEE
Subjt:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE

Query:  DNNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR
        DNNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR
Subjt:  DNNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR

A0A1S3BX94 uncharacterized protein LOC1034942115.41e-16696.4Show/hide
Query:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
        MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPE SVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV

Query:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE
        LFKP+NTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMV ENKNLAVVPWVPRLQVPTSSTM+VPQEE APQLMEAEEVGEATMEIEE
Subjt:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE

Query:  D-NNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR
        + NNLNNSQQGYGYGGMDGAN IHQWHH QQHCMIPQLPQQTSSPITWFR
Subjt:  D-NNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR

A0A5A7UTV3 Uncharacterized protein2.22e-14295.87Show/hide
Query:  DVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIVLFKPVNTSFFQSSPLSVSVDSDIISGFKSEFL
        DVGLPPIIEEEEPPE SVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIVLFKP+NTSFFQSSPLSVSVDSDIISGFKSEFL
Subjt:  DVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIVLFKPVNTSFFQSSPLSVSVDSDIISGFKSEFL

Query:  RENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEED-NNLNNSQQGYGYGGMDGANGIHQWHHQQQH
        RENCYDGRVKFGEDDEDMV ENKNLAVVPWVPRLQVPTSSTM+VPQEE APQLMEAEEVGEATMEIEE+ NNLNNSQQGYGYGGMDGAN IHQWHH QQH
Subjt:  RENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEED-NNLNNSQQGYGYGGMDGANGIHQWHHQQQH

Query:  CMIPQLPQQTSSPITWFR
        CMIPQLPQQTSSPITWFR
Subjt:  CMIPQLPQQTSSPITWFR

A0A6J1BYN8 uncharacterized protein LOC1110068191.72e-12978.71Show/hide
Query:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV
        MKMKMKRKDLDQIND+FSDFSLSSPARKIRRLDVGLPPIIEEEEPPE +VLS+QPL+PE+F     G+RIEEL DASSVS S  AMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIV

Query:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE
        LFKPVNT+F QSSPLSVSVDSDIISGF+SE LREN    R+K GEDD++M  EN+NLAVVPWVPR+QVP  + M VPQEE APQ+MEAEE+  ATMEIE+
Subjt:  LFKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEE

Query:  DNNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR
        D+   +SQQ YGYGGMDGAN IHQWH  QQHCMIPQLPQQTS+PITWFR
Subjt:  DNNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR

A0A6J1F0X3 uncharacterized protein LOC1114411477.87e-12879.44Show/hide
Query:  MKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSP-SVYAMEDRPFCDNQERAIVL
        MKMKRKDLDQ ND+FSDFSLSSPA KIRRLDVGLPPIIEEE PPE S L+++ ++P ++  GGNG+RIEEL DASSV+  S  AMEDRP CDNQERAIVL
Subjt:  MKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSP-SVYAMEDRPFCDNQERAIVL

Query:  FKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEED
        FKPVNTSF QSSPLSVSVDSDIISGFKS+ LREN YDGR+KFGEDD++M  ENKNLAVVPWVPRLQVP S+ M V Q+EEAPQLMEAEEVGEATM+IE+D
Subjt:  FKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEED

Query:  NNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR
        N L++ Q  YGYGGMDGAN IHQWH  QQHCMIPQLPQQTSSPITWFR
Subjt:  NNLNNSQQGYGYGGMDGANGIHQWHHQQQHCMIPQLPQQTSSPITWFR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G35320.1 unknown protein3.3e-3241.64Show/hide
Query:  KMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIVL
        ++ MKRKD+D++NDDFSDFSLSSPARKIRRLDV LPPI+EEEE      L  Q  + E+       + +E ++D                    ERAIVL
Subjt:  KMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIVL

Query:  FKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDD--EDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGE------
        FKP++  + Q S  +V VD  +ISGFK+ FLR+      V   +D+  ED  + NK  AVV W P  Q   S ++   Q+    ++ E +E GE      
Subjt:  FKPVNTSFFQSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDD--EDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGE------

Query:  ATMEIEED---NNLNNSQQG--------YGYGGMDGANGIHQWHHQQQHCMIPQLPQ--QTSSPITWFR
        A+ EIEED    +L+  QQG        YG+       G+H W  Q Q+CMIPQLPQ   T +PITWFR
Subjt:  ATMEIEED---NNLNNSQQG--------YGYGGMDGANGIHQWHHQQQHCMIPQLPQ--QTSSPITWFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATGAAGATGAAAAGAAAAGATCTCGATCAAATCAACGATGACTTCTCCGATTTCTCCCTCTCCTCACCTGCCAGGAAGATTCGCCGTCTGGATGTTGGTTTACC
GCCTATTATAGAAGAAGAAGAACCACCGGAATTTTCTGTTTTAAGTAAACAGCCTTTGATTCCTGAAGATTTCACAGTAGGTGGTAATGGTGTAAGGATTGAGGAATTGT
CGGATGCTTCCTCTGTTTCTCCTTCAGTTTATGCTATGGAGGATCGTCCTTTTTGTGACAATCAAGAGAGGGCTATTGTTCTGTTTAAGCCTGTTAATACGTCTTTCTTC
CAATCTTCTCCTCTCTCGGTATCTGTTGATTCTGACATTATATCTGGTTTCAAGAGTGAATTTCTTCGCGAGAATTGTTATGATGGCCGGGTGAAATTTGGTGAAGATGA
TGAAGATATGGTGATTGAGAACAAGAATCTGGCTGTTGTTCCTTGGGTTCCTCGTTTACAGGTTCCTACTTCCTCAACCATGAGTGTTCCCCAAGAGGAAGAAGCTCCAC
AGTTAATGGAAGCTGAAGAAGTTGGAGAAGCAACAATGGAGATTGAAGAAGACAACAACTTAAACAACAGTCAACAAGGTTATGGGTATGGTGGAATGGATGGAGCTAAT
GGTATACATCAATGGCATCATCAACAACAACACTGCATGATTCCACAGCTGCCACAGCAAACATCTTCACCCATAACTTGGTTTCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGATGAAGATGAAAAGAAAAGATCTCGATCAAATCAACGATGACTTCTCCGATTTCTCCCTCTCCTCACCTGCCAGGAAGATTCGCCGTCTGGATGTTGGTTTACC
GCCTATTATAGAAGAAGAAGAACCACCGGAATTTTCTGTTTTAAGTAAACAGCCTTTGATTCCTGAAGATTTCACAGTAGGTGGTAATGGTGTAAGGATTGAGGAATTGT
CGGATGCTTCCTCTGTTTCTCCTTCAGTTTATGCTATGGAGGATCGTCCTTTTTGTGACAATCAAGAGAGGGCTATTGTTCTGTTTAAGCCTGTTAATACGTCTTTCTTC
CAATCTTCTCCTCTCTCGGTATCTGTTGATTCTGACATTATATCTGGTTTCAAGAGTGAATTTCTTCGCGAGAATTGTTATGATGGCCGGGTGAAATTTGGTGAAGATGA
TGAAGATATGGTGATTGAGAACAAGAATCTGGCTGTTGTTCCTTGGGTTCCTCGTTTACAGGTTCCTACTTCCTCAACCATGAGTGTTCCCCAAGAGGAAGAAGCTCCAC
AGTTAATGGAAGCTGAAGAAGTTGGAGAAGCAACAATGGAGATTGAAGAAGACAACAACTTAAACAACAGTCAACAAGGTTATGGGTATGGTGGAATGGATGGAGCTAAT
GGTATACATCAATGGCATCATCAACAACAACACTGCATGATTCCACAGCTGCCACAGCAAACATCTTCACCCATAACTTGGTTTCGTTGA
Protein sequenceShow/hide protein sequence
MKMKMKRKDLDQINDDFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEFSVLSKQPLIPEDFTVGGNGVRIEELSDASSVSPSVYAMEDRPFCDNQERAIVLFKPVNTSFF
QSSPLSVSVDSDIISGFKSEFLRENCYDGRVKFGEDDEDMVIENKNLAVVPWVPRLQVPTSSTMSVPQEEEAPQLMEAEEVGEATMEIEEDNNLNNSQQGYGYGGMDGAN
GIHQWHHQQQHCMIPQLPQQTSSPITWFR