; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033328 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033328
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionNascent polypeptide-associated complex subunit alpha, muscle-specific form
Genome locationchr11:42779843..42786050
RNA-Seq ExpressionLag0033328
SyntenyLag0033328
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571464.1 hypothetical protein SDJN03_28192, partial [Cucurbita argyrosperma subsp. sororia]2.1e-9487.13Show/hide
Query:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN
        V+VVEQHRNQYYGR++PHGPARF S PSRDFRGMNCRSFQSGAG+LPTPLKAC S TKHV+PSSPKTPPTCLSS + NG QLA+V+SAPIPI AKFSNKN
Subjt:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN

Query:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD
        SA HEEFYDR+FSFSELWAGPTYSNSPPPSSLPIPKFSVAKRT S+ELPRSAPEFEMH PSAKSAPPSPTRE   SSR  FHSADSATKTLRRILNLDVD
Subjt:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD

Query:  NE
        NE
Subjt:  NE

KAG6606421.1 hypothetical protein SDJN03_03738, partial [Cucurbita argyrosperma subsp. sororia]3.4e-8178.71Show/hide
Query:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN
        VVV +QHRNQYYG       A FGS PSRDFRG+NCRSFQSGAG+LPTP KA TSET+H +PSSPKTP TCLSS S N    ATV +APIPIK KF N N
Subjt:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN

Query:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD
        S  HEEFYD SFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTS E+PRSAPEF++HHPSAKSAPPSPTR+QNFS R FFH+ DSATKTLRRIL+LDVD
Subjt:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD

Query:  NE
        NE
Subjt:  NE

KAG7011227.1 hypothetical protein SDJN02_26130, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-8186.52Show/hide
Query:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN
        V+VVEQHRNQYYGR++PHGPARF S PSRDFRGMNCRSFQSGAG+LPTPLKAC S TKHV+PSSPKTPPTCLSS + NG QLA+V+SAPIPI AKFSNKN
Subjt:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN

Query:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSR
        SA HEEFYDR+FSFSELWAGPTYSNSPPPSSLPIPKFSVAKRT S+ELPRSAPEFEMH PSAKSAPPSPTRE   SSR
Subjt:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSR

KGN47907.1 hypothetical protein Csa_004001 [Cucumis sativus]1.3e-9387.13Show/hide
Query:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN
        VVV+EQHRNQYY RVKPHGPARFGSL SRDFRGMNCRSFQSGAG+LPTPLKAC SET+H +P SPKTPP CL+S SEN  QLAT+RSAPIPIK K SN++
Subjt:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN

Query:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD
        +AFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTS+EL RSAPEFEMHHPSAKSAPPSPTR+QNFS+R FFHSADSATKTLRRILNLDV 
Subjt:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD

Query:  NE
        NE
Subjt:  NE

XP_016900729.1 PREDICTED: uncharacterized protein LOC107990294 [Cucumis melo]4.8e-9185.64Show/hide
Query:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN
        VVV+EQHRNQYY RVKPHGPARFGSL SRDF GMNCRSFQSGAG+LPTPLKACTSET+H +P SPKTPP  L+S SEN  QLAT RSAPI IK K SN++
Subjt:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN

Query:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD
        + FHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTS+EL RSAPEFEMHHPSAKSAPPSPTR+Q+FS+R FFHSADSATKTLRRILNLDVD
Subjt:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD

Query:  NE
        NE
Subjt:  NE

TrEMBL top hitse value%identityAlignment
A0A0A0KHP6 Uncharacterized protein6.5e-9487.13Show/hide
Query:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN
        VVV+EQHRNQYY RVKPHGPARFGSL SRDFRGMNCRSFQSGAG+LPTPLKAC SET+H +P SPKTPP CL+S SEN  QLAT+RSAPIPIK K SN++
Subjt:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN

Query:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD
        +AFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTS+EL RSAPEFEMHHPSAKSAPPSPTR+QNFS+R FFHSADSATKTLRRILNLDV 
Subjt:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD

Query:  NE
        NE
Subjt:  NE

A0A1S4DXL6 uncharacterized protein LOC1079902942.3e-9185.64Show/hide
Query:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN
        VVV+EQHRNQYY RVKPHGPARFGSL SRDF GMNCRSFQSGAG+LPTPLKACTSET+H +P SPKTPP  L+S SEN  QLAT RSAPI IK K SN++
Subjt:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN

Query:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD
        + FHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTS+EL RSAPEFEMHHPSAKSAPPSPTR+Q+FS+R FFHSADSATKTLRRILNLDVD
Subjt:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVD

Query:  NE
        NE
Subjt:  NE

A0A5A7UUU2 Nascent polypeptide-associated complex subunit alpha, muscle-specific form5.9e-7184.66Show/hide
Query:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN
        VVV+EQHRNQYY RVKPHGPARFGSL SRDF GMNCRSFQSGAG+LPTPLKACTSET+H +P SPKTPP  L+S SEN  QLAT RSAPI IK K SN++
Subjt:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN

Query:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAK
        + FHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTS+EL RSAPEFEMHHPSAK
Subjt:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAK

A0A5D3CPQ8 Nascent polypeptide-associated complex subunit alpha, muscle-specific form5.9e-7184.66Show/hide
Query:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN
        VVV+EQHRNQYY RVKPHGPARFGSL SRDF GMNCRSFQSGAG+LPTPLKACTSET+H +P SPKTPP  L+S SEN  QLAT RSAPI IK K SN++
Subjt:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKN

Query:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAK
        + FHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTS+EL RSAPEFEMHHPSAK
Subjt:  SAFHEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAK

A0A6J1CY32 uncharacterized protein LOC1110152752.3e-7084.43Show/hide
Query:  MGAVSPVPLFLATSL---HFP--STQTLLSNFLASSQVVSFAPNCTKFRRKILVFGKQSGNANESQFLDENGVVNDMDGYLNYLSLEYDSVWDTKPSWCQ
        MGA+S VPLFLATSL   HFP  STQTLL+NF+A+SQVVS  P CTKFRRK +VFGKQS NANESQFLDENGVV+DMDGYLNYLSLEYDSVWDTKPSWCQ
Subjt:  MGAVSPVPLFLATSL---HFP--STQTLLSNFLASSQVVSFAPNCTKFRRKILVFGKQSGNANESQFLDENGVVNDMDGYLNYLSLEYDSVWDTKPSWCQ

Query:  PWTIMLTGLLVIASSWFVIKSIAVTATILSLICLWWYIFLYSYPKAYSDMIAERRKRVTDGVEDTFG
        PWTI LTGLL+ ASSWFVIKSIAVTA ILS+I LWWYIFLYSYPKAYS+MIAERRK+VTDG EDTFG
Subjt:  PWTIMLTGLLVIASSWFVIKSIAVTATILSLICLWWYIFLYSYPKAYSDMIAERRKRVTDGVEDTFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G02715.1 unknown protein4.5e-3146.38Show/hide
Query:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLK-ACTSETKHVFP--SSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFS
        ++V  +HR+QYYG+ K  G  RF S PS+ FR +NCR+FQSG GLLP P + + T  TK       SP++P + L     +   + + R++PIPI     
Subjt:  VVVVEQHRNQYYGRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLK-ACTSETKHVFP--SSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFS

Query:  NKNSAFHEEFYD--RSFSFSELWAGPTYSNSPPPSSLPIPKFSV-AKRTTSMELPRSAPEFEMH-HPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRR
        ++      EF D  RS S+SELWAGPTYSNSPPP+S+PIPKFS+  KRT S+  P  AP+  +     AKSAP SPT     S    F S  SAT TLRR
Subjt:  NKNSAFHEEFYD--RSFSFSELWAGPTYSNSPPPSSLPIPKFSV-AKRTTSMELPRSAPEFEMH-HPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRR

Query:  ILNLDVD
        +LNL+++
Subjt:  ILNLDVD

AT4G02725.1 unknown protein3.0e-4354.36Show/hide
Query:  PSTQTLLSNFLASSQVVSFAPNCTKFRRKILVFGKQSGNANESQFLDENGVVNDMDGYLNYLSLEYDSVWDTKPSWCQPWTIMLTGLLVIASSWFVIKSI
        P T+ LLS     + + SF     +   K+  FG       +S+F+DE GVV+DM+G+L+ LSLEYDSVWDTKPSWCQPWTIMLTGL ++A SW ++ S+
Subjt:  PSTQTLLSNFLASSQVVSFAPNCTKFRRKILVFGKQSGNANESQFLDENGVVNDMDGYLNYLSLEYDSVWDTKPSWCQPWTIMLTGLLVIASSWFVIKSI

Query:  AVTATILSLICLWWYIFLYSYPKAYSDMIAERRKRVTDGVEDTFGEERP
         V++  + +I  WWYIFLYSYPK+YS+MIAERRKRV DG ED +G+ +P
Subjt:  AVTATILSLICLWWYIFLYSYPKAYSDMIAERRKRVTDGVEDTFGEERP

AT4G02725.2 unknown protein7.0e-3252.03Show/hide
Query:  PSTQTLLSNFLASSQVVSFAPNCTKFRRKILVFGKQSGNANESQFLDENGVVNDMDGYLNYLSLEYDSVWDTKPSWCQPWTIMLTGLLVIASSWFVIKSI
        P T+ LLS     + + SF     +   K+  FG       +S+F+DE GVV+DM+G+L+ LSLEYDSVWDTKPSWCQPWTIMLTGL ++A SW ++ S+
Subjt:  PSTQTLLSNFLASSQVVSFAPNCTKFRRKILVFGKQSGNANESQFLDENGVVNDMDGYLNYLSLEYDSVWDTKPSWCQPWTIMLTGLLVIASSWFVIKSI

Query:  AVTATILSLICLWWYIFLYSYPK
         V++  + +I  WWYIFLYSYPK
Subjt:  AVTATILSLICLWWYIFLYSYPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCTGTTTCTCCAGTCCCTCTGTTCCTTGCAACTTCTCTCCATTTTCCTTCAACACAAACCCTTCTTTCTAATTTCTTAGCTTCGTCACAAGTCGTTTCGTTTGC
TCCCAATTGTACAAAGTTCCGCAGGAAAATTCTCGTTTTTGGGAAGCAATCAGGCAACGCAAATGAATCCCAGTTTTTAGATGAAAATGGGGTTGTCAATGATATGGATG
GATACTTGAATTATCTCTCTCTCGAATATGACTCCGTCTGGGATACGAAGCCTTCATGGTGTCAACCATGGACTATAATGCTGACAGGATTACTAGTGATTGCTTCTAGT
TGGTTTGTTATAAAGTCCATAGCAGTGACCGCAACAATACTCTCATTAATATGCTTATGGTGGTACATCTTTCTTTACTCATATCCAAAGGCTTATTCTGACATGATTGC
AGAGCGAAGAAAAAGGGTTACTGATGGAGTAGAAGACACATTTGGTGAGGAAAGGCCACATTGGTTATCACGTCAGTCTAACACACTGATTCGAGTCCGGGCAAAGGTCT
CTGATTCGAGTCCGGGCAAAGCTACGCTGGTTGAAGTGTCGGTTTCCGGAGTCGTCTCTGGTTTAACTTGTCGAGTGGTCGTTGTTGAGCAGCATAGGAACCAATATTAT
GGTCGGGTCAAGCCGCATGGGCCAGCTCGATTTGGATCACTCCCATCCCGAGACTTCAGAGGGATGAACTGTAGGAGTTTCCAATCGGGAGCTGGTTTACTCCCAACTCC
CTTGAAGGCTTGTACCTCTGAAACTAAACACGTTTTCCCTTCTTCTCCCAAAACCCCACCAACTTGTTTAAGTTCCACCTCCGAAAATGGCAACCAACTCGCTACTGTGC
GAAGTGCTCCGATTCCTATCAAAGCCAAATTTTCAAACAAGAACAGTGCTTTCCATGAAGAATTCTATGATCGAAGTTTCTCATTCTCTGAGCTTTGGGCTGGACCCACT
TACTCAAATTCACCGCCTCCAAGTTCATTACCCATTCCCAAATTTTCAGTTGCTAAGAGAACCACGTCAATGGAATTGCCTCGTTCTGCCCCTGAATTTGAAATGCATCA
TCCATCTGCCAAGTCTGCACCACCATCCCCGACTCGAGAGCAAAACTTTTCCTCCAGGATTTTCTTTCATAGTGCTGACTCTGCGACTAAGACTCTGCGTCGCATTCTCA
ATCTTGATGTTGACAATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGCTGTTTCTCCAGTCCCTCTGTTCCTTGCAACTTCTCTCCATTTTCCTTCAACACAAACCCTTCTTTCTAATTTCTTAGCTTCGTCACAAGTCGTTTCGTTTGC
TCCCAATTGTACAAAGTTCCGCAGGAAAATTCTCGTTTTTGGGAAGCAATCAGGCAACGCAAATGAATCCCAGTTTTTAGATGAAAATGGGGTTGTCAATGATATGGATG
GATACTTGAATTATCTCTCTCTCGAATATGACTCCGTCTGGGATACGAAGCCTTCATGGTGTCAACCATGGACTATAATGCTGACAGGATTACTAGTGATTGCTTCTAGT
TGGTTTGTTATAAAGTCCATAGCAGTGACCGCAACAATACTCTCATTAATATGCTTATGGTGGTACATCTTTCTTTACTCATATCCAAAGGCTTATTCTGACATGATTGC
AGAGCGAAGAAAAAGGGTTACTGATGGAGTAGAAGACACATTTGGTGAGGAAAGGCCACATTGGTTATCACGTCAGTCTAACACACTGATTCGAGTCCGGGCAAAGGTCT
CTGATTCGAGTCCGGGCAAAGCTACGCTGGTTGAAGTGTCGGTTTCCGGAGTCGTCTCTGGTTTAACTTGTCGAGTGGTCGTTGTTGAGCAGCATAGGAACCAATATTAT
GGTCGGGTCAAGCCGCATGGGCCAGCTCGATTTGGATCACTCCCATCCCGAGACTTCAGAGGGATGAACTGTAGGAGTTTCCAATCGGGAGCTGGTTTACTCCCAACTCC
CTTGAAGGCTTGTACCTCTGAAACTAAACACGTTTTCCCTTCTTCTCCCAAAACCCCACCAACTTGTTTAAGTTCCACCTCCGAAAATGGCAACCAACTCGCTACTGTGC
GAAGTGCTCCGATTCCTATCAAAGCCAAATTTTCAAACAAGAACAGTGCTTTCCATGAAGAATTCTATGATCGAAGTTTCTCATTCTCTGAGCTTTGGGCTGGACCCACT
TACTCAAATTCACCGCCTCCAAGTTCATTACCCATTCCCAAATTTTCAGTTGCTAAGAGAACCACGTCAATGGAATTGCCTCGTTCTGCCCCTGAATTTGAAATGCATCA
TCCATCTGCCAAGTCTGCACCACCATCCCCGACTCGAGAGCAAAACTTTTCCTCCAGGATTTTCTTTCATAGTGCTGACTCTGCGACTAAGACTCTGCGTCGCATTCTCA
ATCTTGATGTTGACAATGAATGA
Protein sequenceShow/hide protein sequence
MGAVSPVPLFLATSLHFPSTQTLLSNFLASSQVVSFAPNCTKFRRKILVFGKQSGNANESQFLDENGVVNDMDGYLNYLSLEYDSVWDTKPSWCQPWTIMLTGLLVIASS
WFVIKSIAVTATILSLICLWWYIFLYSYPKAYSDMIAERRKRVTDGVEDTFGEERPHWLSRQSNTLIRVRAKVSDSSPGKATLVEVSVSGVVSGLTCRVVVVEQHRNQYY
GRVKPHGPARFGSLPSRDFRGMNCRSFQSGAGLLPTPLKACTSETKHVFPSSPKTPPTCLSSTSENGNQLATVRSAPIPIKAKFSNKNSAFHEEFYDRSFSFSELWAGPT
YSNSPPPSSLPIPKFSVAKRTTSMELPRSAPEFEMHHPSAKSAPPSPTREQNFSSRIFFHSADSATKTLRRILNLDVDNE