; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015678 (gene) of Snake gourd v1 genome

Gene IDTan0015678
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNascent polypeptide-associated complex subunit alpha, muscle-specific form
Genome locationLG07:74408182..74411065
RNA-Seq ExpressionTan0015678
SyntenyTan0015678
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571464.1 hypothetical protein SDJN03_28192, partial [Cucurbita argyrosperma subsp. sororia]2.2e-9788.78Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS
        MEAV+VVEQHRNQYYGR++PH PARF S PSRDFRGMNCRSFQSGAGILPTPLKAC S TK  YPSSPKTPPTCLSSN+GNGKQLA+V+SAPIPI  KFS
Subjt:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS

Query:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNL
        NKNSA HEEFYD +FSFSELWAGPTYSNSPPPSSLPIPKFSVAKRT SLELPRSAPEFEMH PSAKSAPPSPTREL  SSRF FHSADSATKTLRRILNL
Subjt:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNL

Query:  DVDNE
        DVDNE
Subjt:  DVDNE

KAG6606421.1 hypothetical protein SDJN03_03738, partial [Cucurbita argyrosperma subsp. sororia]1.4e-8682.52Show/hide
Query:  MEAVVVVE-QHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKF
        MEAVVVVE QHRNQYYG       A FGS PSRDFRG+NCRSFQSGAGILPTP KA TSET+ FYPSSPKTP TCLSSNSGN K  ATV +APIPIK KF
Subjt:  MEAVVVVE-QHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKF

Query:  SNKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILN
         N NS  HEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTS E+PRSAPEF++HHPSAKSAPPSPTR+ NFS RFFFH+ DSATKTLRRIL+
Subjt:  SNKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILN

Query:  LDVDNE
        LDVDNE
Subjt:  LDVDNE

KAG7011227.1 hypothetical protein SDJN02_26130, partial [Cucurbita argyrosperma subsp. argyrosperma]6.3e-8487.85Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS
        MEAV+VVEQHRNQYYGR++PH PARF S PSRDFRGMNCRSFQSGAGILPTPLKAC S TK  YPSSPKTPPTCLSSN+GNGKQLA+V+SAPIPI  KFS
Subjt:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS

Query:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSR
        NKNSA HEEFYD +FSFSELWAGPTYSNSPPPSSLPIPKFSVAKRT SLELPRSAPEFEMH PSAKSAPPSPTREL  SSR
Subjt:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSR

KGN47907.1 hypothetical protein Csa_004001 [Cucumis sativus]2.7e-9588.29Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS
        MEAVVV+EQHRNQYY RVKPH PARFGSL SRDFRGMNCRSFQSGAGILPTPLKAC SET+ FYP SPKTPP CL+SNS N KQLAT+RSAPIPIK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS

Query:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNL
        N+++AFHEEFYD SFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEMHHPSAKSAPPSPTR+ NFS+RFFFHSADSATKTLRRILNL
Subjt:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNL

Query:  DVDNE
        DV NE
Subjt:  DVDNE

XP_016900729.1 PREDICTED: uncharacterized protein LOC107990294 [Cucumis melo]1.7e-9286.34Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS
        MEAVVV+EQHRNQYY RVKPH PARFGSL SRDF GMNCRSFQSGAGILPTPLKACTSET+ FYP SPKTPP  L+SNS N KQLAT RSAPI IK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS

Query:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNL
        N+++ FHEEFYD SFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEMHHPSAKSAPPSPTR+ +FS+R+FFHSADSATKTLRRILNL
Subjt:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNL

Query:  DVDNE
        DVDNE
Subjt:  DVDNE

TrEMBL top hitse value%identityAlignment
A0A0A0KHP6 Uncharacterized protein1.3e-9588.29Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS
        MEAVVV+EQHRNQYY RVKPH PARFGSL SRDFRGMNCRSFQSGAGILPTPLKAC SET+ FYP SPKTPP CL+SNS N KQLAT+RSAPIPIK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS

Query:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNL
        N+++AFHEEFYD SFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEMHHPSAKSAPPSPTR+ NFS+RFFFHSADSATKTLRRILNL
Subjt:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNL

Query:  DVDNE
        DV NE
Subjt:  DVDNE

A0A1S4DXL6 uncharacterized protein LOC1079902948.0e-9386.34Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS
        MEAVVV+EQHRNQYY RVKPH PARFGSL SRDF GMNCRSFQSGAGILPTPLKACTSET+ FYP SPKTPP  L+SNS N KQLAT RSAPI IK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS

Query:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNL
        N+++ FHEEFYD SFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEMHHPSAKSAPPSPTR+ +FS+R+FFHSADSATKTLRRILNL
Subjt:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNL

Query:  DVDNE
        DVDNE
Subjt:  DVDNE

A0A5A7UUU2 Nascent polypeptide-associated complex subunit alpha, muscle-specific form1.6e-7281.11Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS
        MEAVVV+EQHRNQYY RVKPH PARFGSL SRDF GMNCRSFQSGAGILPTPLKACTSET+ FYP SPKTPP  L+SNS N KQLAT RSAPI IK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS

Query:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSS
        N+++ FHEEFYD SFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEMHHPSAK    +P   L  S+
Subjt:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSS

A0A5D3CPQ8 Nascent polypeptide-associated complex subunit alpha, muscle-specific form1.6e-7281.11Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS
        MEAVVV+EQHRNQYY RVKPH PARFGSL SRDF GMNCRSFQSGAGILPTPLKACTSET+ FYP SPKTPP  L+SNS N KQLAT RSAPI IK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFS

Query:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSS
        N+++ FHEEFYD SFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEMHHPSAK    +P   L  S+
Subjt:  NKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSS

A0A6P4A757 uncharacterized protein LOC1074259501.6e-5664.47Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTP-PTCLSSNSGNGKQLATVRSAPIPIKTKF
        ME +VVV QHRNQYY R KP  PAR+GS PSRDFRG+NCRSFQS AG+LPTP KACTS   +   SSPKTP P+  S      + L   +S PI I    
Subjt:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTP-PTCLSSNSGNGKQLATVRSAPIPIKTKF

Query:  SNKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSV-AKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLR
        S+K   F EE  + + SFSELWAGP YSNSPPPSSLPIPKFSV  KRT SLELP SAPE EM HP AKSAPPSPTRE N S R  F SA+ ATKTL+
Subjt:  SNKNSAFHEEFYDPSFSFSELWAGPTYSNSPPPSSLPIPKFSV-AKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G02715.1 unknown protein3.9e-3144.91Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPS---SPKTPPTCL------SSNSGNGKQLATVRSA
        ME ++V  +HR+QYYG+ K     RF S PS+ FR +NCR+FQSG G+LP P +  ++   +   S   SP++P + L      S +SG        R++
Subjt:  MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPS---SPKTPPTCL------SSNSGNGKQLATVRSA

Query:  PIPIKTKFSNKNSAFHEEFYDP--SFSFSELWAGPTYSNSPPPSSLPIPKFSV-AKRTTSLELPRSAPEFEMH-HPSAKSAPPSPTRELNFSSRFFFHSA
        PIPI     ++      EF D   S S+SELWAGPTYSNSPPP+S+PIPKFS+  KRT SL  P  AP+  +     AKSAP SPT     S    F S 
Subjt:  PIPIKTKFSNKNSAFHEEFYDP--SFSFSELWAGPTYSNSPPPSSLPIPKFSV-AKRTTSLELPRSAPEFEMH-HPSAKSAPPSPTRELNFSSRFFFHSA

Query:  DSATKTLRRILNLDVD
         SAT TLRR+LNL+++
Subjt:  DSATKTLRRILNLDVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCCGTGGTCGTTGTTGAGCAGCATAGGAACCAATATTATGGTCGGGTCAAGCCGCATGAGCCAGCTCGATTTGGATCACTCCCGTCCCGGGACTTCAGAGGGAT
GAACTGTAGGAGTTTTCAATCGGGAGCTGGTATACTCCCAACTCCCTTGAAGGCTTGTACCTCTGAAACTAAACAGTTCTACCCTTCTTCTCCCAAAACACCACCAACTT
GTTTAAGTTCCAACTCCGGAAATGGTAAACAACTTGCTACTGTGCGAAGTGCTCCAATTCCTATCAAAACCAAGTTTTCAAACAAGAACAGTGCTTTCCATGAAGAATTC
TATGATCCAAGTTTCTCATTCTCTGAGCTTTGGGCTGGACCCACTTACTCAAATTCACCGCCCCCAAGTTCATTGCCCATTCCAAAATTTTCAGTTGCTAAGAGAACCAC
GTCACTGGAGTTGCCTCGTTCTGCTCCTGAATTTGAAATGCATCATCCATCTGCCAAGTCTGCACCACCATCCCCAACTCGAGAGCTAAACTTTTCCTCCAGATTTTTCT
TTCATAGTGCTGACTCTGCGACTAAGACTCTACGTCGCATTCTTAATCTTGATGTTGACAATGAATGA
mRNA sequenceShow/hide mRNA sequence
CAATGCTCGGGGCTTGGGCATGATATTTAAATACAACTCTGAAAGCATTTCAGATCCGTGGATTCTGGTTCAAACCCTCGCTGGGGGCTCAATCTTCATCTCCATGGCAG
CACTCTAATCCAAACTGGAGATAATCTTCGAAAAAGAAAAAACATAAACAAAAAAAAAAACAAAAACAAAAACAAAAAAAAGAAGGAGAAAAAAATAGGAAAAAATTAGA
ACCGCTAGAGTTCGTTGACGAAGAATTTCAGTCTATATCTTCGTTACGGTTCATCTTTATCTTCTATAGATCTTAATCCTACTGAAGAACTTCCTCGATAATTCCGCTAA
GCTGGTTGAATCGTCGGTTTGTTCGATTTGGATCTAGCGTGGAAGGTTGTGACTTTTAGTTTAGTTTAGTTTAGTTTAGTTATCAGCCTTGTGTTGAGTTGTAGAGGTTG
GATCTACTGGGAGAAACTGAAATTGTGTCTCTAACTTAAAACTGGATACATACTTTGACTTGGAAGTTGTTTTGCTATGGAAGCCGTGGTCGTTGTTGAGCAGCATAGGA
ACCAATATTATGGTCGGGTCAAGCCGCATGAGCCAGCTCGATTTGGATCACTCCCGTCCCGGGACTTCAGAGGGATGAACTGTAGGAGTTTTCAATCGGGAGCTGGTATA
CTCCCAACTCCCTTGAAGGCTTGTACCTCTGAAACTAAACAGTTCTACCCTTCTTCTCCCAAAACACCACCAACTTGTTTAAGTTCCAACTCCGGAAATGGTAAACAACT
TGCTACTGTGCGAAGTGCTCCAATTCCTATCAAAACCAAGTTTTCAAACAAGAACAGTGCTTTCCATGAAGAATTCTATGATCCAAGTTTCTCATTCTCTGAGCTTTGGG
CTGGACCCACTTACTCAAATTCACCGCCCCCAAGTTCATTGCCCATTCCAAAATTTTCAGTTGCTAAGAGAACCACGTCACTGGAGTTGCCTCGTTCTGCTCCTGAATTT
GAAATGCATCATCCATCTGCCAAGTCTGCACCACCATCCCCAACTCGAGAGCTAAACTTTTCCTCCAGATTTTTCTTTCATAGTGCTGACTCTGCGACTAAGACTCTACG
TCGCATTCTTAATCTTGATGTTGACAATGAATGAACTCCGGGGAGCATCAAGCTCCTGTAAATAGGCTTACTTTGTGAGCTGCTTAGTGTCATGTATATAGAATGAATAA
ATTGGTTAGTGATGACAGTCATGAGTATCCTTAGTTTTCAGTGCTGGTGTGCATCCTCCGATTGCAGCTACGGGCTCTGATACCTTCAACTAATATCCAGATGTTGCACC
TTAATCTGTGCTTAGATGCTGATCTGGATTGGGTTCTGGCAGACAAAGACAAGATAGGGTGTGATAGATACACAGATTAAATCAAGATATTCGAAAGGCAGAAGTAGGCT
GTTGGTGAACTTTAATGTTGTTGTGAATGAAACCTTCATTCTGCACTGCATGAATCTGCTGGCTTTTATCTATCTAGTTAAATATGAACATTGCTAAGAGTAGTCATGGA
TCTGCTGGCTATTATCTCTGTCCCGGCCATCATGTCCTTACCAAATTCTCAGTTTTCATTCCATTCTAGAGTTTAGGTAGTCATGTAATTTGTATCATACTGTAGGATCC
CAAACTAATGCTTCATTTAAGATTTGTAGGCTTCTGTTTCATTCTTTTCATCCATTCCTTACTTACATCCTCGGTTCTAGTTGCTATCCTTCTTACATAGATAGTCCGAT
GACTTATTGGTTAGGTTCTTTGTCCCTGATGAATCGGTTATTCGACAGTTAAATCTCTCCTTTTCTTAATAGTTAATATCCTCAATCCTATACCTCCTCTAATCATCTTA
TAGCTTCACTCCTTTGCCTGCACACTATTGAGGGAGATATTCCTTCTTTTCTGCCTTTTCAGTTTTTTTGAAAGAAAAGGTGGGGTAGGATAGTGATAGTTATGACTTTC
TAGTTTGCTTCTTTTTGGGATTACTCCGTGTAGGATTTTCCTGGTTTCAGTGCCTCCTGGTTTTGTCATCTGTGTACCTGTCTAGTATGGAACTTTCTTACTGGGTTTTT
GCAAGGTTCGTGGTCCTTCTCGGAGATCATTGATCACATGGAGTGGTATATGAGACAACCTAAGGGTGAACAGTTCAGGTGATGGTTTCAGTAGTAGTATAGTTGGTTTG
CTGTGGACGTAGTTCGTCACCAGAGAAAGTCTTCTTCTTAGAAGCCTAGGTCCAGTGTATGTTTGTTTATCCTTGTATTGAAAGCTGTACACTCTCAGAGTCCTACTGTT
GGTCGTTGGAGCTTTTTTCTTTTTCCTTCATATCAGTTTGAATGGTAATTGCCTTCGATTTCTGTTTTGAAGAGCAATGTTTATTGCATCTCCAGAAAGTGAAGCAATAT
TGTAATGTTAATGTTATTTCAGTTTGTTAGCTACTTGGTTATGTAAAATCGATAAGATATCGTAAAAGCTTTGTTATAGATTCGTTTTCTGCTACATGTTGAGAACCACA
TGGGAATGATGAAGAAATCTGGAAAACTAAGGGCCCG
Protein sequenceShow/hide protein sequence
MEAVVVVEQHRNQYYGRVKPHEPARFGSLPSRDFRGMNCRSFQSGAGILPTPLKACTSETKQFYPSSPKTPPTCLSSNSGNGKQLATVRSAPIPIKTKFSNKNSAFHEEF
YDPSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMHHPSAKSAPPSPTRELNFSSRFFFHSADSATKTLRRILNLDVDNE