; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G190670 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G190670
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionProtein of unknown function (DUF1218)
Genome locationCla97Chr10:8920345..8922251
RNA-Seq ExpressionCla97C10G190670
SyntenyCla97C10G190670
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577379.1 hypothetical protein SDJN03_24953, partial [Cucurbita argyrosperma subsp. sororia]1.6e-9192.93Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKLLL +VFVFDVIAFGLAIAAEQRRS AKIVEDT+AKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWA++LFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEIC+LAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIF TSIVSQ YYVCYSRARESFQS+NKDTGIGMS+Y+
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

XP_004133701.1 uncharacterized protein LOC101203051 [Cucumis sativus]3.0e-9395.65Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVED  AK NYCVYDSDISTGLGVGAFLFL+ASQILIMVASRCFCCGKPLSPGGSRAWAVVL ITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEICLLAGSARNAYHTKYRTLLT+TPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSY+KDTGIGMSTYK
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

XP_008452269.1 PREDICTED: uncharacterized protein 2C05 isoform X1 [Cucumis melo]1.1e-9597.28Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDT AKRNYCVYDSDISTGLGVGAFLFL+ASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSY+KDTG+GMSTYK
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

XP_022142726.1 uncharacterized protein LOC111012773 [Momordica charantia]1.5e-9294.02Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIV+DT+AKRNYCVYDSDISTGLGVG FLFLMASQILIMVASRCFCCG+PLSPGGSRA AV+LFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FFFIAEICLLAGS RNAYHTKYRT+ T+TPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKD GIGMSTYK
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

XP_038903364.1 uncharacterized protein LOC120089980 [Benincasa hispida]1.5e-9799.46Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDT+AKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

TrEMBL top hitse value%identityAlignment
A0A0A0L5B8 Uncharacterized protein1.4e-9395.65Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVED  AK NYCVYDSDISTGLGVGAFLFL+ASQILIMVASRCFCCGKPLSPGGSRAWAVVL ITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEICLLAGSARNAYHTKYRTLLT+TPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSY+KDTGIGMSTYK
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

A0A1S3BTF0 uncharacterized protein 2C05 isoform X15.3e-9697.28Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDT AKRNYCVYDSDISTGLGVGAFLFL+ASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSY+KDTG+GMSTYK
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

A0A5D3DJF4 Uncharacterized protein5.3e-9697.28Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDT AKRNYCVYDSDISTGLGVGAFLFL+ASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSY+KDTG+GMSTYK
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

A0A6J1CN21 uncharacterized protein LOC1110127737.2e-9394.02Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIV+DT+AKRNYCVYDSDISTGLGVG FLFLMASQILIMVASRCFCCG+PLSPGGSRA AV+LFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FFFIAEICLLAGS RNAYHTKYRT+ T+TPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKD GIGMSTYK
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

A0A6J1EMF5 uncharacterized protein LOC1114359291.4e-9192.93Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKLLL  VFVFDVIAFGLAIAAEQRRS AKIVEDT+AKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWA++LFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEIC+LAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIF TSIVSQ YYVCYSRARESFQS+NKDTGIGMS+Y+
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)6.1e-4444.32Show/hide
Query:  ASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWVF
        AS L+   V    ++AFG +IAAE+RRSI K ++D      +CVYDSD++TG GVGAFLFL++S+ L+M  ++C C G+PL+PG  RAW+++ FI+ W+ 
Subjt:  ASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWVF

Query:  FFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARES---FQSYNKDTGIGMSTY
        F +AE C++AG+ +NAYHTKY   L+    SC  LR+G+F AGA FI  T +++ +YY+ ++++  S    ++    + IGM+ Y
Subjt:  FFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARES---FQSYNKDTGIGMSTY

AT1G52910.1 Protein of unknown function (DUF1218)6.1e-6062.79Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKL++  VF+ D+IA GLAIAAEQRRS+ K+V D   +  +C Y SDI+T  G GAF+ L  SQ++IMVASRCFCCGK L PGGSRA  ++LF+ CWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLT-DTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQS
        FF IAE+CLLAGS RNAYHT YR +   + PPSC+++R+GVFAAGA+F  FT+IVSQFYY+ YSRAR+ +Q+
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLT-DTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQS

AT1G61065.1 Protein of unknown function (DUF1218)4.2e-6165.38Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MAS LLL  VFVFD+IAFGLA+AAEQRR+  +I  ++    +YCVYD DI+TGLGVG+FL L+ASQ+LIMVASRC CCG+ L+P GSR+WA+ LFIT WV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMST
        FFFIA++CLLAGS RNAYHTKYR    +T PSC+ LR+GVF AGAAFI  T IVS+ YYV  SRA++ FQ  ++D GI MS+
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMST

AT3G15480.1 Protein of unknown function (DUF1218)3.8e-6264.53Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASKL++  VF+ D+IA GLAIAAEQRRS+ K+  D + + +YCVY +DI+T  G GAF+ L  SQ+LIM ASRCFCCGK L+PGGSRA A++LF+ CWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTL-LTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQS
        FF IAE+CLLA S RNAYHT+YR +   + PPSC+++R+GVFAAGAAF  FT+IVSQFYYVCYSRAR+++Q+
Subjt:  FFFIAEICLLAGSARNAYHTKYRTL-LTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQS

AT4G27435.1 Protein of unknown function (DUF1218)2.8e-6570.24Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV
        MASK++   VFVF++IAFGLA+AAEQRRS A++V+DT  + NYCVYDSD +TG GVGAFLF +ASQILIM+ SRCFCCGKPL PGGSRA A++LFI  W+
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWV

Query:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARES
        FF IAEICLLAGS  NAYHTKYRT+  D PP CQ LR+GVFAAGA+F+FF +IVSQFYY  Y  A E+
Subjt:  FFFIAEICLLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCAAAGCTACTACTTTTCACCGTCTTTGTCTTTGATGTCATTGCTTTTGGCTTGGCCATTGCAGCTGAGCAAAGGAGAAGCATTGCAAAGATAGTTGAAGACAC
AAATGCCAAAAGAAACTATTGTGTATATGACTCGGACATTTCAACTGGGCTAGGAGTTGGAGCTTTTCTCTTCCTGATGGCTAGTCAGATCCTTATAATGGTGGCTAGCC
GATGCTTTTGCTGCGGCAAGCCTCTGAGTCCCGGTGGTTCAAGGGCTTGGGCAGTTGTTCTTTTCATAACTTGCTGGGTGTTTTTTTTCATCGCCGAGATTTGCTTGTTA
GCGGGATCGGCTAGGAATGCATACCACACCAAGTACAGAACATTGCTTACTGATACTCCTCCCTCTTGTCAAATGTTGAGAAGAGGAGTGTTTGCTGCAGGAGCAGCTTT
CATTTTCTTCACTTCCATTGTTTCTCAGTTCTACTATGTTTGCTACTCGAGGGCCCGGGAGAGCTTTCAATCATACAATAAAGACACCGGCATTGGCATGAGTACCTACA
AATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCAAAGCTACTACTTTTCACCGTCTTTGTCTTTGATGTCATTGCTTTTGGCTTGGCCATTGCAGCTGAGCAAAGGAGAAGCATTGCAAAGATAGTTGAAGACAC
AAATGCCAAAAGAAACTATTGTGTATATGACTCGGACATTTCAACTGGGCTAGGAGTTGGAGCTTTTCTCTTCCTGATGGCTAGTCAGATCCTTATAATGGTGGCTAGCC
GATGCTTTTGCTGCGGCAAGCCTCTGAGTCCCGGTGGTTCAAGGGCTTGGGCAGTTGTTCTTTTCATAACTTGCTGGGTGTTTTTTTTCATCGCCGAGATTTGCTTGTTA
GCGGGATCGGCTAGGAATGCATACCACACCAAGTACAGAACATTGCTTACTGATACTCCTCCCTCTTGTCAAATGTTGAGAAGAGGAGTGTTTGCTGCAGGAGCAGCTTT
CATTTTCTTCACTTCCATTGTTTCTCAGTTCTACTATGTTTGCTACTCGAGGGCCCGGGAGAGCTTTCAATCATACAATAAAGACACCGGCATTGGCATGAGTACCTACA
AATGA
Protein sequenceShow/hide protein sequence
MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTNAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAVVLFITCWVFFFIAEICLL
AGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK