; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033684 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033684
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionVARLMGL domain-containing protein
Genome locationchr3:1075628..1076428
RNA-Seq ExpressionLag0033684
SyntenyLag0033684
Gene Ontology termsNA
InterPro domainsIPR032795 - DUF3741-associated sequence motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057827.1 hypothetical protein E6C27_scaffold274G001140 [Cucumis melo var. makuwa]1.5e-13089.1Show/hide
Query:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAAKMRKG RTFCNGDGSTSTLNQHKTNQD FPISPD H +Q  PTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLD AE+ GRKSVCCEY LKGR+H+NEFNLTLETALRLPSTIAGE+VVWRKPGVVAKLMGLEA+P P+NA+S KA 
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL
        L SILKR+++RKRAKR E ERRFS+DYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAW+AR+FL
Subjt:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL

KAG6589427.1 hypothetical protein SDJN03_14850, partial [Cucurbita argyrosperma subsp. sororia]1.0e-12686.89Show/hide
Query:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRG-RMSCVNNSDILRS
        MK+LSFFLFKNSLAAK+RKGIRTFCNG+ ST+TLNQHKTNQD FP+SPD H  Q  PTLEEMILQLELEEETARRAK YNYD+++G RMSCVNNSDIL S
Subjt:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRG-RMSCVNNSDILRS

Query:  ARNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKA
        ARNALNQYPRFSLDGKDAMYRSSFRNLDAAE+ GRKSVCCEYALKGR+HENEFNLTLETALRLPSTIAGE+V+WRKPGVVAKLMGLEAMP P+NA+SGKA
Subjt:  ARNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKA

Query:  KLTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL
         L+SI+KR+N+RKRAKRHE ERR SMDYPGSDGTITGRLSSCSSNNGCY+VKPIATESAAWKA QFL
Subjt:  KLTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL

XP_022135499.1 uncharacterized protein LOC111007440 [Momordica charantia]3.3e-13392.48Show/hide
Query:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHK NQDHFPISPD   ++N PTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLDA   AGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEA+P PVNAK GKAK
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL
        L+SILKR N+R+RAKRHE ERRF MDYPGSDGTITGRLSSCSSNNGCY+VKPIATESAAWKARQFL
Subjt:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL

XP_031736616.1 uncharacterized protein LOC116402050 [Cucumis sativus]4.0e-13189.85Show/hide
Query:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAAKMRKG RTFCNGDGSTSTLNQ KTNQD FPISPD H +Q  PTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLDAAE+ GRKSVCCEY LKGR+H+NEFNLTLETALRLPSTIAGE+VVWRKPGVVAKLMGLEAMP P+NA+S KA 
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL
        LTSILKR+++RKRAKR E ERRFS+DYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAW+AR+FL
Subjt:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL

XP_038878557.1 uncharacterized protein LOC120070753 [Benincasa hispida]1.1e-13392.11Show/hide
Query:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAAKMRKG RTFCNGD STSTLNQHKTNQD FPISPD H +Q  PTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEY LKGRIHENEFNLTLETALRLPSTIAGE+VVWRKPGVVAKLMGLEAMP P+NA+S KA 
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL
        LTSILKR+++RKRAKRHE ERRFSMDYPGSDGTITGRLSSCSSNNGCY+VKPIATESAAWKARQ L
Subjt:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL

TrEMBL top hitse value%identityAlignment
A0A0A0LNU8 VARLMGL domain-containing protein1.9e-13189.85Show/hide
Query:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAAKMRKG RTFCNGDGSTSTLNQ KTNQD FPISPD H +Q  PTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLDAAE+ GRKSVCCEY LKGR+H+NEFNLTLETALRLPSTIAGE+VVWRKPGVVAKLMGLEAMP P+NA+S KA 
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL
        LTSILKR+++RKRAKR E ERRFS+DYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAW+AR+FL
Subjt:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL

A0A5D3BIJ6 VARLMGL domain-containing protein7.4e-13189.1Show/hide
Query:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAAKMRKG RTFCNGDGSTSTLNQHKTNQD FPISPD H +Q  PTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLD AE+ GRKSVCCEY LKGR+H+NEFNLTLETALRLPSTIAGE+VVWRKPGVVAKLMGLEA+P P+NA+S KA 
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL
        L SILKR+++RKRAKR E ERRFS+DYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAW+AR+FL
Subjt:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL

A0A6J1C1L3 uncharacterized protein LOC1110074401.6e-13392.48Show/hide
Query:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHK NQDHFPISPD   ++N PTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLDA   AGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEA+P PVNAK GKAK
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL
        L+SILKR N+R+RAKRHE ERRF MDYPGSDGTITGRLSSCSSNNGCY+VKPIATESAAWKARQFL
Subjt:  LTSILKRRNIRKRAKRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL

A0A6J1EVK1 uncharacterized protein LOC1114381863.5e-12588.81Show/hide
Query:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAAKMRKG RTFCNGDGSTSTLNQHKTNQD FP SPD    QNQPTLEEMI+QLELEEETARR KL NY+EMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFR  DAAEK GRKSVCCEYAL+GRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNA+ G AK
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRRNIRKRA--KRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL
        L+SILK++++RKRA  KRHE ERRF MDY GSDGTITGRLSSCSSNNG YIVKPIATESAAWKAR FL
Subjt:  LTSILKRRNIRKRA--KRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL

A0A6J1L4I6 uncharacterized protein LOC1114998227.9e-12588.81Show/hide
Query:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAAKMRKG RTFCNGDGSTSTLNQHKTNQD FP SP+    QNQPTLEEMILQLELEEETARR KL NYDEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        +NALNQYPRFSLDGKDAMYRSSFR  DAAEK GRKSVCCEYAL+GRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNA+ G AK
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRRNIRKRA--KRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL
        L+SILK++++RKRA  KRHE ERRF MDY GSDGTITGRLSSCSSNNG YIVKPIATESAAWKAR FL
Subjt:  LTSILKRRNIRKRA--KRHENERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGAGTTATCCTTCTTTCTCTTCAAGAACTCTCTGGCTGCAAAGATGAGGAAAGGCATCAGAACTTTCTGCAATGGTGATGGTTCTACCTCCACTCTCAACCAACA
CAAAACCAACCAGGATCACTTCCCTATCTCGCCCGACCGGCACTACAAACAAAACCAGCCAACCTTGGAAGAGATGATTCTACAGTTGGAACTGGAGGAGGAGACTGCAA
GAAGGGCAAAACTCTACAACTATGATGAGATGCGAGGCAGGATGTCGTGTGTAAACAACTCGGATATCTTGCGATCTGCACGGAATGCCTTGAATCAGTATCCTCGGTTT
TCTCTTGATGGAAAGGATGCAATGTATCGATCTTCCTTTAGGAACTTGGATGCTGCTGAAAAGGCTGGGAGGAAATCAGTTTGCTGCGAGTATGCTCTCAAAGGGAGAAT
TCATGAGAATGAGTTCAATTTGACACTGGAGACAGCCTTGCGGTTGCCATCAACGATAGCTGGAGAGAGTGTCGTGTGGCGCAAACCAGGAGTGGTAGCCAAGTTGATGG
GTCTCGAGGCAATGCCGACACCAGTGAATGCTAAAAGTGGCAAGGCAAAGTTGACCTCTATATTGAAGAGACGGAATATAAGGAAGAGAGCTAAGAGGCATGAAAATGAG
AGAAGATTTTCCATGGACTATCCTGGTTCTGATGGCACAATAACAGGAAGGTTAAGTTCTTGCAGTTCAAACAATGGATGCTACATCGTGAAGCCAATTGCAACAGAATC
AGCTGCATGGAAAGCTAGGCAATTTCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGAGTTATCCTTCTTTCTCTTCAAGAACTCTCTGGCTGCAAAGATGAGGAAAGGCATCAGAACTTTCTGCAATGGTGATGGTTCTACCTCCACTCTCAACCAACA
CAAAACCAACCAGGATCACTTCCCTATCTCGCCCGACCGGCACTACAAACAAAACCAGCCAACCTTGGAAGAGATGATTCTACAGTTGGAACTGGAGGAGGAGACTGCAA
GAAGGGCAAAACTCTACAACTATGATGAGATGCGAGGCAGGATGTCGTGTGTAAACAACTCGGATATCTTGCGATCTGCACGGAATGCCTTGAATCAGTATCCTCGGTTT
TCTCTTGATGGAAAGGATGCAATGTATCGATCTTCCTTTAGGAACTTGGATGCTGCTGAAAAGGCTGGGAGGAAATCAGTTTGCTGCGAGTATGCTCTCAAAGGGAGAAT
TCATGAGAATGAGTTCAATTTGACACTGGAGACAGCCTTGCGGTTGCCATCAACGATAGCTGGAGAGAGTGTCGTGTGGCGCAAACCAGGAGTGGTAGCCAAGTTGATGG
GTCTCGAGGCAATGCCGACACCAGTGAATGCTAAAAGTGGCAAGGCAAAGTTGACCTCTATATTGAAGAGACGGAATATAAGGAAGAGAGCTAAGAGGCATGAAAATGAG
AGAAGATTTTCCATGGACTATCCTGGTTCTGATGGCACAATAACAGGAAGGTTAAGTTCTTGCAGTTCAAACAATGGATGCTACATCGTGAAGCCAATTGCAACAGAATC
AGCTGCATGGAAAGCTAGGCAATTTCTTTAG
Protein sequenceShow/hide protein sequence
MKELSFFLFKNSLAAKMRKGIRTFCNGDGSTSTLNQHKTNQDHFPISPDRHYKQNQPTLEEMILQLELEEETARRAKLYNYDEMRGRMSCVNNSDILRSARNALNQYPRF
SLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALKGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAKLTSILKRRNIRKRAKRHENE
RRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARQFL