; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011669 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011669
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationChr01:9067587..9069560
RNA-Seq ExpressionHG10011669
SyntenyHG10011669
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577379.1 hypothetical protein SDJN03_24953, partial [Cucurbita argyrosperma subsp. sororia]1.4e-9091.85Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKLLL +VFVFDVIAFGLAIAAEQRRS AKIVEDTDAK+NYCVYDSDISTGLGVGAFLFLMAS ILIMVASRCFCCGKPLSPGGSRAWA+ILFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEIC+LAGSA+NAYHTKYRT+LTDTPPSCQMLRRGVFAAGAAFIF TSIVSQ YYVCYSRARESFQS+NKDTGIGMS+Y+
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

XP_004133701.1 uncharacterized protein LOC101203051 [Cucumis sativus]9.6e-9293.48Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVED  AK NYCVYDSDISTGLGVGAFLFL+AS ILIMVASRCFCCGKPLSPGGSRAWAV+L ITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEICLLAGSA+NAYHTKYRT+LT+TPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSY+KDTGIGMSTYK
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

XP_008452269.1 PREDICTED: uncharacterized protein 2C05 isoform X1 [Cucumis melo]7.9e-9494.57Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDT AK+NYCVYDSDISTGLGVGAFLFL+AS ILIMVASRCFCCGKPLSPGGSRAWAV+LFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEICLLAGSA+NAYHTKYRT+LTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSY+KDTG+GMSTYK
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

XP_022142726.1 uncharacterized protein LOC111012773 [Momordica charantia]7.4e-9293.48Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIV+DTDAK+NYCVYDSDISTGLGVG FLFLMAS ILIMVASRCFCCG+PLSPGGSRA AVILFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FFFIAEICLLAGS +NAYHTKYRT+ T+TPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKD GIGMSTYK
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

XP_038903364.1 uncharacterized protein LOC120089980 [Benincasa hispida]2.2e-9697.28Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAK+NYCVYDSDISTGLGVGAFLFLMAS ILIMVASRCFCCGKPLSPGGSRAWAV+LFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FFFIAEICLLAGSA+NAYHTKYRT+LTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

TrEMBL top hitse value%identityAlignment
A0A0A0L5B8 Uncharacterized protein4.7e-9293.48Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVED  AK NYCVYDSDISTGLGVGAFLFL+AS ILIMVASRCFCCGKPLSPGGSRAWAV+L ITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEICLLAGSA+NAYHTKYRT+LT+TPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSY+KDTGIGMSTYK
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

A0A1S3BTF0 uncharacterized protein 2C05 isoform X13.8e-9494.57Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDT AK+NYCVYDSDISTGLGVGAFLFL+AS ILIMVASRCFCCGKPLSPGGSRAWAV+LFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEICLLAGSA+NAYHTKYRT+LTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSY+KDTG+GMSTYK
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

A0A5D3DJF4 Uncharacterized protein3.8e-9494.57Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDT AK+NYCVYDSDISTGLGVGAFLFL+AS ILIMVASRCFCCGKPLSPGGSRAWAV+LFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEICLLAGSA+NAYHTKYRT+LTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSY+KDTG+GMSTYK
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

A0A6J1CN21 uncharacterized protein LOC1110127733.6e-9293.48Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIV+DTDAK+NYCVYDSDISTGLGVG FLFLMAS ILIMVASRCFCCG+PLSPGGSRA AVILFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FFFIAEICLLAGS +NAYHTKYRT+ T+TPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKD GIGMSTYK
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

A0A6J1EMF5 uncharacterized protein LOC1114359291.1e-9091.85Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKLLL  VFVFDVIAFGLAIAAEQRRS AKIVEDTDAK+NYCVYDSDISTGLGVGAFLFLMAS ILIMVASRCFCCGKPLSPGGSRAWA+ILFITCWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK
        FF IAEIC+LAGSA+NAYHTKYRT+LTDTPPSCQMLRRGVFAAGAAFIF TSIVSQ YYVCYSRARESFQS+NKDTGIGMS+Y+
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)3.6e-4445.41Show/hide
Query:  ASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWVF
        AS L+   V    ++AFG +IAAE+RRSI K ++D      +CVYDSD++TG GVGAFLFL++S  L+M  ++C C G+PL+PG  RAW++I FI+ W+ 
Subjt:  ASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWVF

Query:  FFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARES---FQSYNKDTGIGMSTY
        F +AE C++AG+ KNAYHTKY   L+    SC  LR+G+F AGA FI  T +++ +YY+ ++++  S    ++    + IGM+ Y
Subjt:  FFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARES---FQSYNKDTGIGMSTY

AT1G52910.1 Protein of unknown function (DUF1218)3.9e-5961.63Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKL++  VF+ D+IA GLAIAAEQRRS+ K+V D + +  +C Y SDI+T  G GAF+ L  S ++IMVASRCFCCGK L PGGSRA  ++LF+ CWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLT-DTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQS
        FF IAE+CLLAGS +NAYHT YR +   + PPSC+++R+GVFAAGA+F  FT+IVSQFYY+ YSRAR+ +Q+
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLT-DTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQS

AT1G61065.1 Protein of unknown function (DUF1218)3.6e-6064.29Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MAS LLL  VFVFD+IAFGLA+AAEQRR+  +I  ++    +YCVYD DI+TGLGVG+FL L+AS +LIMVASRC CCG+ L+P GSR+WA+ LFIT WV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMST
        FFFIA++CLLAGS +NAYHTKYR    +T PSC+ LR+GVF AGAAFI  T IVS+ YYV  SRA++ FQ  ++D GI MS+
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMST

AT3G15480.1 Protein of unknown function (DUF1218)8.5e-6264.53Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASKL++  VF+ D+IA GLAIAAEQRRS+ K+  D D + +YCVY +DI+T  G GAF+ L  S +LIM ASRCFCCGK L+PGGSRA A+ILF+ CWV
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTV-LTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQS
        FF IAE+CLLA S +NAYHT+YR +   + PPSC+++R+GVFAAGAAF  FT+IVSQFYYVCYSRAR+++Q+
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTV-LTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQS

AT4G27435.1 Protein of unknown function (DUF1218)4.8e-6570.24Show/hide
Query:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV
        MASK++   VFVF++IAFGLA+AAEQRRS A++V+DT+ + NYCVYDSD +TG GVGAFLF +AS ILIM+ SRCFCCGKPL PGGSRA A+ILFI  W+
Subjt:  MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWV

Query:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARES
        FF IAEICLLAGS +NAYHTKYRT+  D PP CQ LR+GVFAAGA+F+FF +IVSQFYY  Y  A E+
Subjt:  FFFIAEICLLAGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCAAAGCTACTACTTTTCACTGTCTTTGTCTTTGATGTCATTGCTTTTGGCTTGGCCATTGCAGCTGAGCAAAGGAGAAGCATTGCAAAGATAGTTGAAGACAC
AGATGCAAAAAAAAACTATTGTGTATATGACTCAGACATTTCAACTGGATTAGGAGTTGGAGCATTTCTCTTCCTGATGGCTAGTCACATCCTCATAATGGTGGCTAGCC
GCTGCTTTTGCTGCGGCAAACCTCTGAGTCCTGGTGGTTCGAGGGCTTGGGCAGTCATTCTTTTCATAACTTGCTGGGTGTTTTTCTTCATCGCCGAGATTTGCCTGTTG
GCGGGTTCGGCTAAGAATGCATATCACACCAAGTACAGAACAGTGCTTACTGATACTCCTCCCTCTTGTCAAATGTTGAGACGAGGAGTGTTTGCTGCAGGAGCAGCTTT
CATTTTCTTCACTTCCATTGTTTCTCAGTTCTACTACGTTTGCTACTCGAGGGCCCGGGAGAGCTTTCAATCATACAACAAAGACACCGGCATTGGCATGAGTACCTACA
AATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCAAAGCTACTACTTTTCACTGTCTTTGTCTTTGATGTCATTGCTTTTGGCTTGGCCATTGCAGCTGAGCAAAGGAGAAGCATTGCAAAGATAGTTGAAGACAC
AGATGCAAAAAAAAACTATTGTGTATATGACTCAGACATTTCAACTGGATTAGGAGTTGGAGCATTTCTCTTCCTGATGGCTAGTCACATCCTCATAATGGTGGCTAGCC
GCTGCTTTTGCTGCGGCAAACCTCTGAGTCCTGGTGGTTCGAGGGCTTGGGCAGTCATTCTTTTCATAACTTGCTGGGTGTTTTTCTTCATCGCCGAGATTTGCCTGTTG
GCGGGTTCGGCTAAGAATGCATATCACACCAAGTACAGAACAGTGCTTACTGATACTCCTCCCTCTTGTCAAATGTTGAGACGAGGAGTGTTTGCTGCAGGAGCAGCTTT
CATTTTCTTCACTTCCATTGTTTCTCAGTTCTACTACGTTTGCTACTCGAGGGCCCGGGAGAGCTTTCAATCATACAACAAAGACACCGGCATTGGCATGAGTACCTACA
AATGA
Protein sequenceShow/hide protein sequence
MASKLLLFTVFVFDVIAFGLAIAAEQRRSIAKIVEDTDAKKNYCVYDSDISTGLGVGAFLFLMASHILIMVASRCFCCGKPLSPGGSRAWAVILFITCWVFFFIAEICLL
AGSAKNAYHTKYRTVLTDTPPSCQMLRRGVFAAGAAFIFFTSIVSQFYYVCYSRARESFQSYNKDTGIGMSTYK