; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025996 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025996
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationtig00153017:2621295..2624833
RNA-Seq ExpressionSgr025996
SyntenySgr025996
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155200.1 uncharacterized protein LOC111022343 [Momordica charantia]1.2e-9796.32Show/hide
Query:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS
        MEGKGSTLVHLLVVVLCLVAFGF+IAAERRRSVGTMFEDK KNETYCVY+SDVATGYGVGAFLFLLSGESLLM VTKCMCFGRPLTPGGNRAWAIIYFFS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS

Query:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        SWATFLVAEACLIAGA KNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQ SHKANRSSSTVGM GYA
Subjt:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

XP_022938517.1 uncharacterized protein LOC111444728 isoform X1 [Cucurbita moschata]1.3e-9493.16Show/hide
Query:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS
        MEGKGSTLVHLLVVVLCLVAFGF+IAAERRRSVGT+FEDKQ+N TYCVY SDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAW IIYF S
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS

Query:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        SWA+FLVAEACLIAGA KNAYHTKYRGMIYAQNL CETLRKGVFIAGAVFVVATMILNVYYYMYFTKATS+ ASHKANRSSSTVGMT YA
Subjt:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

XP_022973554.1 uncharacterized protein LOC111472088 isoform X1 [Cucurbita maxima]1.3e-9492.63Show/hide
Query:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS
        MEGKGSTLVHLLVVVLCLVAFGF+IAAERRRSVGT+FEDKQ+N TYCVY SDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAW IIYF S
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS

Query:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        SWA+ LVAEACLIAGA KNAYHTKYRGMIYAQNL CETLRKGVFIAGAVFVVATMILNVYYY+YFTKATS++ASHKANRSSSTVGMTGYA
Subjt:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

XP_023538980.1 uncharacterized protein LOC111799749 [Cucurbita pepo subsp. pepo]9.6e-9593.16Show/hide
Query:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS
        MEGKGSTLVHLLVVVLCLVAFGF+IAAERRRSVGT+FEDK++N TYCVY+SDVATGYGVG FLFLLSGESLLM VTKCMCFGRPLTPGGNRAWAIIYF S
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS

Query:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        SWATF+VAEACLIAGAAKNAYHTKYRGMIYAQNL CETLRKGVFIAGAVFVVATMILNVYYYMYFTKATS+QASHKA RSSSTVGMTGYA
Subjt:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

XP_023544604.1 uncharacterized protein LOC111804138 isoform X1 [Cucurbita pepo subsp. pepo]2.5e-9593.16Show/hide
Query:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS
        MEGKGSTLVHL+VVVLCLVAFGF+IAAERRRSVGT+FEDKQ+N TYCVY SDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAW IIYF S
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS

Query:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        SWA+FLVAEACLIAGA KNAYHTKYRGMIYAQNL CETLRKGVFIAGAVFVVATMILNVYYYMYFTKATS+ ASHKANRSSSTVGMTGYA
Subjt:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

TrEMBL top hitse value%identityAlignment
A0A0A0KQ16 Uncharacterized protein5.1e-9492.63Show/hide
Query:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS
        MEGKGSTLVHLLVVVLCLVAFGF+IAAERRRSVGT+FEDKQ+N TYCVY SDVATGYGVGAFLFLLSG+SLLM VTKCMCFG+PLTPGGNRAW IIYF S
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS

Query:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        S ATFLVAEACLIAGA KNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATS+Q SHKANRSSSTVGMTGYA
Subjt:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

A0A1S3CPP1 uncharacterized protein LOC1035033887.9e-9593.68Show/hide
Query:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS
        MEGKGSTLVHLLVVVLCLVAFGF+IAAERRRSVGT+FEDKQ+N TYCVY SDVATGYGVGAFLFLLSG+SLLM VTKCMCFGRPLTPGGNRAW IIYF S
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS

Query:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        S ATFLVAEACLIAGA KNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATS+QASHKANRSSSTVGMTGYA
Subjt:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

A0A6J1DLS4 uncharacterized protein LOC1110223435.9e-9896.32Show/hide
Query:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS
        MEGKGSTLVHLLVVVLCLVAFGF+IAAERRRSVGTMFEDK KNETYCVY+SDVATGYGVGAFLFLLSGESLLM VTKCMCFGRPLTPGGNRAWAIIYFFS
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS

Query:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        SWATFLVAEACLIAGA KNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQ SHKANRSSSTVGM GYA
Subjt:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

A0A6J1FE96 uncharacterized protein LOC111444728 isoform X16.1e-9593.16Show/hide
Query:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS
        MEGKGSTLVHLLVVVLCLVAFGF+IAAERRRSVGT+FEDKQ+N TYCVY SDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAW IIYF S
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS

Query:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        SWA+FLVAEACLIAGA KNAYHTKYRGMIYAQNL CETLRKGVFIAGAVFVVATMILNVYYYMYFTKATS+ ASHKANRSSSTVGMT YA
Subjt:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

A0A6J1IEX4 uncharacterized protein LOC111472088 isoform X16.1e-9592.63Show/hide
Query:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS
        MEGKGSTLVHLLVVVLCLVAFGF+IAAERRRSVGT+FEDKQ+N TYCVY SDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAW IIYF S
Subjt:  MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFS

Query:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        SWA+ LVAEACLIAGA KNAYHTKYRGMIYAQNL CETLRKGVFIAGAVFVVATMILNVYYY+YFTKATS++ASHKANRSSSTVGMTGYA
Subjt:  SWATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)4.8e-7673.54Show/hide
Query:  EGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSS
        EGK STLV +LVV L LVAFGF+IAAERRRS+G   +D   N T+CVY+SDVATGYGVGAFLFLLS ESLLM VTKCMCFGRPL PG +RAW+IIYF SS
Subjt:  EGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSS

Query:  WATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA
        W TFLVAEAC+IAGA KNAYHTKY   + +Q   C +LRKG+FIAGAVF+VATM+LNVYYYMYFTK+ S+  +HKANRSSS +GM GYA
Subjt:  WATFLVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA

AT1G52910.1 Protein of unknown function (DUF1218)9.4e-4048.17Show/hide
Query:  STLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSSWATF
        S LV ++V +L L+A G AIAAE+RRSVG +  D +K   +C Y SD+AT YG GAF+ L   + ++M  ++C C G+ L PGG+RA  I+ F   W  F
Subjt:  STLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSSWATF

Query:  LVAEACLIAGAAKNAYHTKYRGMIYAQNLP-CETLRKGVFIAGAVFVVATMILNVYYYMYFTKA
        L+AE CL+AG+ +NAYHT YR M   +N P CE +RKGVF AGA F + T I++ +YY+ +++A
Subjt:  LVAEACLIAGAAKNAYHTKYRGMIYAQNLP-CETLRKGVFIAGAVFVVATMILNVYYYMYFTKA

AT1G61065.1 Protein of unknown function (DUF1218)4.2e-4047.34Show/hide
Query:  STLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSSWATF
        S L+ LLV V  L+AFG A+AAE+RR+   +  +  ++ +YCVY+ D+ATG GVG+FL LL+ + L+M  ++C+C GR LTP G+R+WAI  F ++W  F
Subjt:  STLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSSWATF

Query:  LVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQAS
         +A+ CL+AG+ +NAYHTKYR      +  C +LRKGVF AGA F+V T I++  YY+  ++A   Q S
Subjt:  LVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQAS

AT3G15480.1 Protein of unknown function (DUF1218)2.9e-4149.39Show/hide
Query:  STLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSSWATF
        S LV ++V +L L+A G AIAAE+RRSVG +  D+ K   YCVY +D+AT YG GAF+ L   + L+M  ++C C G+ L PGG+RA AII F   W  F
Subjt:  STLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSSWATF

Query:  LVAEACLIAGAAKNAYHTKYRGMIYAQNLP-CETLRKGVFIAGAVFVVATMILNVYYYMYFTKA
        L+AE CL+A + +NAYHT+YR M   ++ P CE +RKGVF AGA F + T I++ +YY+ +++A
Subjt:  LVAEACLIAGAAKNAYHTKYRGMIYAQNLP-CETLRKGVFIAGAVFVVATMILNVYYYMYFTKA

AT4G27435.1 Protein of unknown function (DUF1218)3.7e-4450.89Show/hide
Query:  STLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSSWATF
        S +V  +V V  L+AFG A+AAE+RRS   + +D +    YCVY+SD ATGYGVGAFLF ++ + L+M V++C C G+PL PGG+RA A+I F  SW  F
Subjt:  STLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSSWATF

Query:  LVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQAS
        L+AE CL+AG+ +NAYHTKYR M       C+TLRKGVF AGA FV    I++ +YY ++  A     S
Subjt:  LVAEACLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGAAAAGGGTCGACTCTGGTTCATCTTCTAGTTGTGGTGCTGTGCTTGGTGGCTTTTGGGTTTGCCATTGCTGCCGAGAGACGAAGAAGTGTGGGAACTATGTT
TGAGGATAAGCAAAAGAACGAGACCTATTGCGTCTACAACTCGGATGTTGCAACAGGTTACGGAGTAGGGGCTTTCTTATTTCTCCTCTCAGGTGAATCATTGCTGATGT
GTGTCACGAAGTGCATGTGTTTTGGTAGACCTTTAACCCCGGGAGGAAATCGAGCATGGGCTATTATATACTTTTTCTCCTCATGGGCAACCTTTTTAGTAGCGGAAGCA
TGTCTAATTGCCGGTGCAGCCAAAAATGCATACCATACCAAGTATCGAGGAATGATATACGCTCAGAACTTACCCTGTGAAACATTGAGGAAAGGAGTCTTCATTGCTGG
GGCAGTGTTTGTGGTTGCAACCATGATTCTTAACGTGTATTACTACATGTACTTCACCAAGGCGACATCGACTCAAGCGTCTCACAAAGCAAATCGTTCAAGCTCAACGG
TCGGGATGACCGGGTATGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGAAAAGGGTCGACTCTGGTTCATCTTCTAGTTGTGGTGCTGTGCTTGGTGGCTTTTGGGTTTGCCATTGCTGCCGAGAGACGAAGAAGTGTGGGAACTATGTT
TGAGGATAAGCAAAAGAACGAGACCTATTGCGTCTACAACTCGGATGTTGCAACAGGTTACGGAGTAGGGGCTTTCTTATTTCTCCTCTCAGGTGAATCATTGCTGATGT
GTGTCACGAAGTGCATGTGTTTTGGTAGACCTTTAACCCCGGGAGGAAATCGAGCATGGGCTATTATATACTTTTTCTCCTCATGGGCAACCTTTTTAGTAGCGGAAGCA
TGTCTAATTGCCGGTGCAGCCAAAAATGCATACCATACCAAGTATCGAGGAATGATATACGCTCAGAACTTACCCTGTGAAACATTGAGGAAAGGAGTCTTCATTGCTGG
GGCAGTGTTTGTGGTTGCAACCATGATTCTTAACGTGTATTACTACATGTACTTCACCAAGGCGACATCGACTCAAGCGTCTCACAAAGCAAATCGTTCAAGCTCAACGG
TCGGGATGACCGGGTATGCCTAG
Protein sequenceShow/hide protein sequence
MEGKGSTLVHLLVVVLCLVAFGFAIAAERRRSVGTMFEDKQKNETYCVYNSDVATGYGVGAFLFLLSGESLLMCVTKCMCFGRPLTPGGNRAWAIIYFFSSWATFLVAEA
CLIAGAAKNAYHTKYRGMIYAQNLPCETLRKGVFIAGAVFVVATMILNVYYYMYFTKATSTQASHKANRSSSTVGMTGYA