CuGenDBv2

Cucsat.G5249 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G5249
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Protein of unknown function (DUF1218)
Genome location: ctg1228:269353..274820
RNA-Seq Expression: Cucsat.G5249
Synteny: Cucsat.G5249
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
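
The genome location above uses a `contig:start..end` notation. As a minimal sketch, it could be split into its parts as follows; the `Location` type and `parse_location` helper are hypothetical names introduced here for illustration, and the length calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'ctg1228:269353..274820'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("ctg1228:269353..274820")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 5468 bp on contig ctg1228.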


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004140333.1 uncharacterized protein LOC101221296 [Cucumis sativus] (e-value 5.97e-132; identity 100%)
Query:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
        MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS

Query:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
        SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
Subjt:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA

XP_008465787.1 PREDICTED: uncharacterized protein LOC103503388 [Cucumis melo] (e-value 9.91e-131; identity 98.95%)
Query:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
        MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFG+PLTPGGNRAWTIIYFLS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS

Query:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
        SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQ SHKANRSSSTVGMTGYA
Subjt:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA

XP_022938517.1 uncharacterized protein LOC111444728 isoform X1 [Cucurbita moschata] (e-value 1.45e-123; identity 95.26%)
Query:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
        MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSG+SLLM VTKCMCFG+PLTPGGNRAWTIIYFLS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS

Query:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
        S A+FLVAEACLIAGATKNAYHTKYRGMIYAQNL CETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSS  SHKANRSSSTVGMT YA
Subjt:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA

XP_023544604.1 uncharacterized protein LOC111804138 isoform X1 [Cucurbita pepo subsp. pepo] (e-value 1.77e-124; identity 95.26%)
Query:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
        MEGKGSTLVHL+VVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSG+SLLM VTKCMCFG+PLTPGGNRAWTIIYFLS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS

Query:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
        S A+FLVAEACLIAGATKNAYHTKYRGMIYAQNL CETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSS  SHKANRSSSTVGMTGYA
Subjt:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA

XP_038906605.1 uncharacterized protein LOC120092556 [Benincasa hispida] (e-value 8.15e-130; identity 97.89%)
Query:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
        MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSG++LLMGVTKCMCFG+PLTPGGNRAWTIIYFLS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS

Query:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
        SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQ SHKANRSSSTVGMTGYA
Subjt:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
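
The % identity column can be recomputed from the middle line of each alignment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below assumes that convention and counts identities over the full alignment length; `percent_identity` is a hypothetical helper, and the short strings in the example are a toy illustration rather than a full hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    Assumption: letters in the midline mark identical positions, while
    '+' and spaces mark non-identical positions. The midline is padded
    with spaces in case trailing whitespace was stripped.
    """
    if not query:
        raise ValueError("query must not be empty")
    padded = midline.ljust(len(query))
    if len(padded) != len(query):
        raise ValueError("midline is longer than the query line")
    identities = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identities / len(query)


# Toy example: 8 identical positions out of 10.
print(percent_identity("MEGKGSTLVH", "MEGK STL+H"))
```

For the Cucumis melo hit, the two midline blocks appear to contain one `+` and one space across 190 aligned positions, so 188/190 rounds to 98.95, in line with the value listed for that hit.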

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KQ16 Uncharacterized protein (e-value 2.89e-132; identity 100%)
Query:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
        MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS

Query:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
        SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
Subjt:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA

A0A1S3CPP1 uncharacterized protein LOC103503388 (e-value 4.80e-131; identity 98.95%)
Query:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
        MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFG+PLTPGGNRAWTIIYFLS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS

Query:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
        SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQ SHKANRSSSTVGMTGYA
Subjt:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA

A0A6J1DLS4 uncharacterized protein LOC111022343 (e-value 2.02e-123; identity 93.16%)
Query:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
        MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGT+FEDK +N TYCVY+SDVATGYGVGAFLFLLSG+SLLMGVTKCMCFG+PLTPGGNRAW IIYF S
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS

Query:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
        S ATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATS+Q SHKANRSSSTVGM GYA
Subjt:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA

A0A6J1FE96 uncharacterized protein LOC111444728 isoform X1 (e-value 7.04e-124; identity 95.26%)
Query:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
        MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSG+SLLM VTKCMCFG+PLTPGGNRAWTIIYFLS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS

Query:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
        S A+FLVAEACLIAGATKNAYHTKYRGMIYAQNL CETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSS  SHKANRSSSTVGMT YA
Subjt:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA

A0A6J1IEX4 uncharacterized protein LOC111472088 isoform X1 (e-value 7.04e-124; identity 94.74%)
Query:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS
        MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSG+SLLM VTKCMCFG+PLTPGGNRAWTIIYFLS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLS

Query:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
        S A+ LVAEACLIAGATKNAYHTKYRGMIYAQNL CETLRKGVFIAGAVFVVATMILNVYYY+YFTKATSS+ SHKANRSSSTVGMTGYA
Subjt:  SGATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G13380.1 Protein of unknown function (DUF1218) (e-value 1.4e-75; identity 73.54%)
Query:  EGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSS
        EGK STLV +LVV L LVAFGFSIAAERRRS+G   +D   N T+CVY+SDVATGYGVGAFLFLLS +SLLM VTKCMCFG+PL PG +RAW+IIYF+SS
Subjt:  EGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSS

Query:  GATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
          TFLVAEAC+IAGATKNAYHTKY   + +Q   C +LRKG+FIAGAVF+VATM+LNVYYYMYFTK+ SS  +HKANRSSS +GM GYA
Subjt:  GATFLVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA

AT1G52910.1 Protein of unknown function (DUF1218) (e-value 2.3e-38; identity 47.67%)
Query:  STLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSSGATF
        S LV ++V +L L+A G +IAAE+RRSVG +  D ++   +C Y SD+AT YG GAF+ L   Q ++M  ++C C GK L PGG+RA  I+ FL     F
Subjt:  STLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSSGATF

Query:  LVAEACLIAGATKNAYHTKYRGMIYAQNLP-CETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSS-QTSH
        L+AE CL+AG+ +NAYHT YR M   +N P CE +RKGVF AGA F + T I++ +YY+ +++A    QT H
Subjt:  LVAEACLIAGATKNAYHTKYRGMIYAQNLP-CETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSS-QTSH

AT1G61065.1 Protein of unknown function (DUF1218) (e-value 1.4e-38; identity 46.15%)
Query:  STLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSSGATF
        S L+ LLV V  L+AFG ++AAE+RR+   +  +  R+ +YCVY+ D+ATG GVG+FL LL+ Q L+M  ++C+C G+ LTP G+R+W I  F+++   F
Subjt:  STLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSSGATF

Query:  LVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTS
         +A+ CL+AG+ +NAYHTKYR      +  C +LRKGVF AGA F+V T I++  YY+  ++A   Q S
Subjt:  LVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTS

AT3G15480.1 Protein of unknown function (DUF1218) (e-value 1.6e-39; identity 48.78%)
Query:  STLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSSGATF
        S LV ++V +L L+A G +IAAE+RRSVG +  D+ +   YCVY +D+AT YG GAF+ L   Q L+M  ++C C GK L PGG+RA  II FL     F
Subjt:  STLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSSGATF

Query:  LVAEACLIAGATKNAYHTKYRGMIYAQNLP-CETLRKGVFIAGAVFVVATMILNVYYYMYFTKA
        L+AE CL+A + +NAYHT+YR M   ++ P CE +RKGVF AGA F + T I++ +YY+ +++A
Subjt:  LVAEACLIAGATKNAYHTKYRGMIYAQNLP-CETLRKGVFIAGAVFVVATMILNVYYYMYFTKA

AT4G27435.1 Protein of unknown function (DUF1218) (e-value 1.6e-42; identity 50.3%)
Query:  STLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSSGATF
        S +V  +V V  L+AFG ++AAE+RRS   + +D +    YCVY+SD ATGYGVGAFLF ++ Q L+M V++C C GKPL PGG+RA  +I F+ S   F
Subjt:  STLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSSGATF

Query:  LVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTS
        L+AE CL+AG+ +NAYHTKYR M       C+TLRKGVF AGA FV    I++ +YY ++  A  +  S
Subjt:  LVAEACLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTS


Sequences
CDS sequence
ATGGAAGGCAAAGGCTCCACTCTGGTTCATCTTCTAGTTGTGGTTTTGTGCTTAGTGGCTTTTGGGTTCTCCATTGCCGCCGAGAGACGAAGAAGTGTGGGGACTCTGTT
TGAAGATAAGCAACGGAACGCAACCTACTGTGTTTACGAATCAGATGTTGCAACAGGTTATGGCGTAGGGGCTTTCTTATTTCTTCTCTCTGGTCAATCATTGCTGATGG
GGGTTACAAAGTGCATGTGTTTTGGAAAACCTTTAACCCCAGGAGGAAACCGGGCATGGACCATTATATACTTTCTCTCTTCAGGGGCTACCTTTTTGGTAGCGGAAGCG
TGTTTAATTGCGGGTGCAACAAAAAATGCATACCATACGAAGTATCGAGGAATGATATACGCTCAGAACTTACCATGCGAGACATTGAGGAAAGGAGTGTTCATTGCTGG
GGCAGTGTTTGTGGTTGCTACCATGATTCTTAATGTGTATTATTACATGTACTTCACCAAGGCCACATCTTCTCAAACATCTCACAAAGCAAATCGTTCGAGCTCAACAG
TTGGGATGACTGGATACGCCTAA
mRNA sequence
ATGGAAGGCAAAGGCTCCACTCTGGTTCATCTTCTAGTTGTGGTTTTGTGCTTAGTGGCTTTTGGGTTCTCCATTGCCGCCGAGAGACGAAGAAGTGTGGGGACTCTGTT
TGAAGATAAGCAACGGAACGCAACCTACTGTGTTTACGAATCAGATGTTGCAACAGGTTATGGCGTAGGGGCTTTCTTATTTCTTCTCTCTGGTCAATCATTGCTGATGG
GGGTTACAAAGTGCATGTGTTTTGGAAAACCTTTAACCCCAGGAGGAAACCGGGCATGGACCATTATATACTTTCTCTCTTCAGGGGCTACCTTTTTGGTAGCGGAAGCG
TGTTTAATTGCGGGTGCAACAAAAAATGCATACCATACGAAGTATCGAGGAATGATATACGCTCAGAACTTACCATGCGAGACATTGAGGAAAGGAGTGTTCATTGCTGG
GGCAGTGTTTGTGGTTGCTACCATGATTCTTAATGTGTATTATTACATGTACTTCACCAAGGCCACATCTTCTCAAACATCTCACAAAGCAAATCGTTCGAGCTCAACAG
TTGGGATGACTGGATACGCCTAA
Protein sequence
MEGKGSTLVHLLVVVLCLVAFGFSIAAERRRSVGTLFEDKQRNATYCVYESDVATGYGVGAFLFLLSGQSLLMGVTKCMCFGKPLTPGGNRAWTIIYFLSSGATFLVAEA
CLIAGATKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSSQTSHKANRSSSTVGMTGYA
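
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and an in-frame sequence starting at the first base; `translate` is a hypothetical helper written for this illustration, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Drop whitespace and newlines so a multi-line sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGAAGGCAAAGGCTCCACTCTGGTTCAT"))
print(translate("GGATACGCCTAA"))
```

The two fragments translate to the start (`MEGKGSTLVH`) and end (`GYA`) of the listed protein; the full 573-nt CDS corresponds to 190 residues plus the stop codon.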