CuGenDBv2

Carg21083 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg21083
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Protein of unknown function (DUF1218)
Genome location: Carg_Chr07:8366223..8368304
RNA-Seq Expression: Carg21083
Synteny: Carg21083
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
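
The genome location string can be split into a sequence name and coordinates for downstream use. The following is a minimal sketch, assuming the `seqid:start..end` form shown in this record and assuming the coordinates are 1-based and inclusive (the record itself does not state this). The names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so add 1 to the coordinate difference.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("Carg_Chr07:8366223..8368304")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2082 bp on Carg_Chr07.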


Homology
GenBank top hits | e value | % identity
KAG6577379.1 hypothetical protein SDJN03_24953, partial [Cucurbita argyrosperma subsp. sororia] | 3.4e-97 | 100
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
        FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR

XP_022929307.1 uncharacterized protein LOC111435929 [Cucurbita moschata] | 1.7e-96 | 99.46
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKLLLL VFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
        FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR

XP_022985157.1 uncharacterized protein LOC111483244 [Cucurbita maxima] | 2.1e-94 | 97.83
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAK+NYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSP GSRAWAIILFITCWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
        FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQS N+DTGIGMSSYR
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR

XP_023551921.1 uncharacterized protein LOC111809747 [Cucurbita pepo subsp. pepo] | 4.9e-96 | 98.91
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKLLLL VFVFDVIAFGLAIAAEQRRSTAKIVED+DAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
        FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR

XP_038903364.1 uncharacterized protein LOC120089980 [Benincasa hispida] | 4.3e-92 | 93.48
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKLLL +VFVFDVIAFGLAIAAEQRRS AKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWA++LFITCWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
        FF IAEIC+LAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIF TSIVSQ YYVCYSRARESFQS+NKDTGIGMS+Y+
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
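
The % identity values listed for each hit can be checked against the middle line of the corresponding alignment, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The sketch below assumes the usual BLAST convention that % identity is the number of identical columns divided by the alignment length; the function name `percent_identity` is a hypothetical helper. It uses the two middle-line rows of the Cucurbita maxima hit (XP_022985157.1) copied from this record. That alignment has no gaps, so the alignment length equals the 184-residue query length.

```python
def percent_identity(midline_rows):
    """Return (identical, columns, percent) from BLAST-style middle-line rows.

    A letter in the middle line marks an identical residue; '+' and ' '
    mark non-identical columns.
    """
    midline = "".join(midline_rows)
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(midline), 100.0 * identical / len(midline)


# Middle-line rows of the Cucurbita maxima hit (XP_022985157.1).
MAXIMA_MIDLINE = (
    "MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAK+NYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSP GSRAWAIILFITCWV",
    "FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQS N+DTGIGMSSYR",
)

identical, columns, pct = percent_identity(MAXIMA_MIDLINE)
print(f"{identical}/{columns} = {pct:.2f}%")  # prints "180/184 = 97.83%"
```

The result agrees with the 97.83 reported for this hit.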

TrEMBL top hits | e value | % identity
A0A0A0L5B8 Uncharacterized protein | 1.4e-88 | 90.22
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKLLL +VFVFDVIAFGLAIAAEQRRS AKIVED  AK NYCVYDSDISTGLGVGAFLFL+ASQILIMVASRCFCCGKPLSPGGSRAWA++L ITCWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
        FFLIAEIC+LAGSARNAYHTKYRTLLT+TPPSCQMLRRGVFAAGAAFIF TSIVSQ YYVCYSRARESFQS++KDTGIGMS+Y+
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR

A0A1S3BTF0 uncharacterized protein 2C05 isoform X1 | 5.2e-91 | 91.85
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKLLL +VFVFDVIAFGLAIAAEQRRS AKIVEDT AKRNYCVYDSDISTGLGVGAFLFL+ASQILIMVASRCFCCGKPLSPGGSRAWA++LFITCWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
        FFLIAEIC+LAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIF TSIVSQ YYVCYSRARESFQS++KDTG+GMS+Y+
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR

A0A5D3DJF4 Uncharacterized protein | 5.2e-91 | 91.85
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKLLL +VFVFDVIAFGLAIAAEQRRS AKIVEDT AKRNYCVYDSDISTGLGVGAFLFL+ASQILIMVASRCFCCGKPLSPGGSRAWA++LFITCWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
        FFLIAEIC+LAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIF TSIVSQ YYVCYSRARESFQS++KDTG+GMS+Y+
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR

A0A6J1EMF5 uncharacterized protein LOC111435929 | 8.2e-97 | 99.46
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKLLLL VFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
        FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR

A0A6J1J439 uncharacterized protein LOC111483244 | 1.0e-94 | 97.83
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAK+NYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSP GSRAWAIILFITCWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR
        FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQS N+DTGIGMSSYR
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G13380.1 Protein of unknown function (DUF1218) | 7.2e-45 | 46.49
Query:  ASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWVF
        AS L+ + V    ++AFG +IAAE+RRS  K ++D      +CVYDSD++TG GVGAFLFL++S+ L+M  ++C C G+PL+PG  RAW+II FI+ W+ 
Subjt:  ASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWVF

Query:  FLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSH---NKDTGIGMSSY
        FL+AE CV+AG+ +NAYHTKY   L+    SC  LR+G+F AGA FI  T +++  YY+ ++++  S  +H      + IGM+ Y
Subjt:  FLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSH---NKDTGIGMSSY

AT1G52910.1 Protein of unknown function (DUF1218) | 1.1e-58 | 62.21
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKL+++ VF+ D+IA GLAIAAEQRRS  K+V D + +  +C Y SDI+T  G GAF+ L  SQ++IMVASRCFCCGK L PGGSRA  I+LF+ CWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLT-DTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQS
        FFLIAE+C+LAGS RNAYHT YR +   + PPSC+++R+GVFAAGA+F   T+IVSQ YY+ YSRAR+ +Q+
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLT-DTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQS

AT1G61065.1 Protein of unknown function (DUF1218) | 8.5e-62 | 67.03
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MAS LLLL VFVFD+IAFGLA+AAEQRR+T +I  ++    +YCVYD DI+TGLGVG+FL L+ASQ+LIMVASRC CCG+ L+P GSR+WAI LFIT WV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSS
        FF IA++C+LAGS RNAYHTKYR    +T PSC+ LR+GVF AGAAFI LT IVS+ YYV  SRA++ FQ  ++D GI MSS
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSS

AT3G15480.1 Protein of unknown function (DUF1218) | 2.5e-61 | 65.12
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASKL+++ VF+ D+IA GLAIAAEQRRS  K+  D D + +YCVY +DI+T  G GAF+ L  SQ+LIM ASRCFCCGK L+PGGSRA AIILF+ CWV
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTL-LTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQS
        FFLIAE+C+LA S RNAYHT+YR +   + PPSC+++R+GVFAAGAAF   T+IVSQ YYVCYSRAR+++Q+
Subjt:  FFLIAEICVLAGSARNAYHTKYRTL-LTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQS

AT4G27435.1 Protein of unknown function (DUF1218) | 4.8e-65 | 70.24
Query:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV
        MASK++   VFVF++IAFGLA+AAEQRRSTA++V+DT+ + NYCVYDSD +TG GVGAFLF +ASQILIM+ SRCFCCGKPL PGGSRA A+ILFI  W+
Subjt:  MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWV

Query:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARES
        FFLIAEIC+LAGS  NAYHTKYRT+  D PP CQ LR+GVFAAGA+F+F  +IVSQ YY  Y  A E+
Subjt:  FFLIAEICVLAGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARES


Sequences
CDS sequence
ATGGCGTCAAAGCTACTCCTTCTCTCAGTCTTCGTCTTTGATGTCATCGCTTTTGGGTTAGCCATTGCAGCTGAGCAAAGGAGAAGCACTGCAAAGATAGTTGAAGACAC
TGATGCCAAAAGAAACTATTGTGTATATGACTCAGACATTTCAACTGGCTTAGGAGTTGGAGCATTTCTCTTCCTGATGGCTAGCCAGATCCTCATAATGGTGGCTAGCC
GCTGCTTTTGCTGCGGTAAGCCTCTGAGTCCGGGCGGTTCGAGGGCTTGGGCCATCATTCTCTTCATAACTTGCTGGGTGTTCTTTTTAATCGCCGAGATTTGCGTACTC
GCGGGTTCTGCTAGGAATGCATACCACACCAAGTACAGAACATTGCTTACTGATACTCCTCCCTCTTGTCAAATGCTGAGAAGAGGAGTGTTTGCTGCAGGAGCAGCTTT
CATTTTCTTAACTTCCATTGTTTCGCAGTCCTACTACGTTTGCTACTCGAGGGCCCGGGAGAGCTTCCAATCACACAACAAAGACACTGGCATCGGCATGAGTTCGTACC
GATGA
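
The CDS can be translated to confirm that it encodes the protein sequence given in this record. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and builds the codon table in plain Python rather than relying on any external library. The names `CODON_TABLE` and `translate` are hypothetical helpers, and the sequences are copied from this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


CDS = "".join((
    "ATGGCGTCAAAGCTACTCCTTCTCTCAGTCTTCGTCTTTGATGTCATCGCTTTTGGGTTAGCCATTGCAGCTGAGCAAAGGAGAAGCACTGCAAAGATAGTTGAAGACAC",
    "TGATGCCAAAAGAAACTATTGTGTATATGACTCAGACATTTCAACTGGCTTAGGAGTTGGAGCATTTCTCTTCCTGATGGCTAGCCAGATCCTCATAATGGTGGCTAGCC",
    "GCTGCTTTTGCTGCGGTAAGCCTCTGAGTCCGGGCGGTTCGAGGGCTTGGGCCATCATTCTCTTCATAACTTGCTGGGTGTTCTTTTTAATCGCCGAGATTTGCGTACTC",
    "GCGGGTTCTGCTAGGAATGCATACCACACCAAGTACAGAACATTGCTTACTGATACTCCTCCCTCTTGTCAAATGCTGAGAAGAGGAGTGTTTGCTGCAGGAGCAGCTTT",
    "CATTTTCTTAACTTCCATTGTTTCGCAGTCCTACTACGTTTGCTACTCGAGGGCCCGGGAGAGCTTCCAATCACACAACAAAGACACTGGCATCGGCATGAGTTCGTACC",
    "GATGA",
))

PROTEIN = (
    "MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWVFFLIAEICVL"
    "AGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR"
)

print(len(CDS), len(PROTEIN), translate(CDS) == PROTEIN)
```

The 555-nt CDS corresponds to 184 codons plus a terminal TGA stop codon, which matches the 184-residue protein listed under "Protein sequence".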
mRNA sequence
ATGGCGTCAAAGCTACTCCTTCTCTCAGTCTTCGTCTTTGATGTCATCGCTTTTGGGTTAGCCATTGCAGCTGAGCAAAGGAGAAGCACTGCAAAGATAGTTGAAGACAC
TGATGCCAAAAGAAACTATTGTGTATATGACTCAGACATTTCAACTGGCTTAGGAGTTGGAGCATTTCTCTTCCTGATGGCTAGCCAGATCCTCATAATGGTGGCTAGCC
GCTGCTTTTGCTGCGGTAAGCCTCTGAGTCCGGGCGGTTCGAGGGCTTGGGCCATCATTCTCTTCATAACTTGCTGGGTGTTCTTTTTAATCGCCGAGATTTGCGTACTC
GCGGGTTCTGCTAGGAATGCATACCACACCAAGTACAGAACATTGCTTACTGATACTCCTCCCTCTTGTCAAATGCTGAGAAGAGGAGTGTTTGCTGCAGGAGCAGCTTT
CATTTTCTTAACTTCCATTGTTTCGCAGTCCTACTACGTTTGCTACTCGAGGGCCCGGGAGAGCTTCCAATCACACAACAAAGACACTGGCATCGGCATGAGTTCGTACC
GATGA
Protein sequence
MASKLLLLSVFVFDVIAFGLAIAAEQRRSTAKIVEDTDAKRNYCVYDSDISTGLGVGAFLFLMASQILIMVASRCFCCGKPLSPGGSRAWAIILFITCWVFFLIAEICVL
AGSARNAYHTKYRTLLTDTPPSCQMLRRGVFAAGAAFIFLTSIVSQSYYVCYSRARESFQSHNKDTGIGMSSYR