; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g33770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g33770
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionNascent polypeptide-associated complex subunit alpha, muscle-specific form
Genome locationchr6:25624544..25629059
RNA-Seq ExpressionMoc06g33770
SyntenyMoc06g33770
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571464.1 hypothetical protein SDJN03_28192, partial [Cucurbita argyrosperma subsp. sororia]8.1e-9385.85Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS
        MEAV+VVEQHRNQYYGR++PHGPARF S PSRDF GMNCRSFQSGAGILPTPLKAC S  K  YPSSPKTPPTCLSS +GNGKQLA+++SAPIPI  KFS
Subjt:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS

Query:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMH-PSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNL
        NKN A++EEFYDR+FSFSELWAGPTYSNSPPPSSLPIPKFSVAKRT SLELPRSAPEFEMH PSAKSAPPSPTR+   SSRF FHSADSATKTLRRILNL
Subjt:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEMH-PSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNL

Query:  DVDNE
        DVDNE
Subjt:  DVDNE

KAG6606421.1 hypothetical protein SDJN03_03738, partial [Cucurbita argyrosperma subsp. sororia]1.3e-7979.13Show/hide
Query:  MEAVVVVE-QHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKF
        MEAVVVVE QHRNQYYG       A FGS PSRDF G+NCRSFQSGAGILPTP KA TSE +  YPSSPKTP TCLSS SGN K  AT+ +APIPIK KF
Subjt:  MEAVVVVE-QHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKF

Query:  SNKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILN
         N N  ++EEFYD SFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTS E+PRSAPEF++ HPSAKSAPPSPTRDQNFS RFFFH+ DSATKTLRRIL+
Subjt:  SNKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILN

Query:  LDVDNE
        LDVDNE
Subjt:  LDVDNE

KGN47907.1 hypothetical protein Csa_004001 [Cucumis sativus]2.6e-9187.32Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS
        MEAVVV+EQHRNQYY RVKPHGPARFGSL SRDF GMNCRSFQSGAGILPTPLKAC SE +  YP SPKTPP CL+S S N KQLATMRSAPIPIK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS

Query:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNL
        N++ A +EEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEM HPSAKSAPPSPTRDQNFS+RFFFHSADSATKTLRRILNL
Subjt:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNL

Query:  DVDNE
        DV NE
Subjt:  DVDNE

XP_016900729.1 PREDICTED: uncharacterized protein LOC107990294 [Cucumis melo]7.1e-8985.37Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS
        MEAVVV+EQHRNQYY RVKPHGPARFGSL SRDF GMNCRSFQSGAGILPTPLKACTSE +  YP SPKTPP  L+S S N KQLAT RSAPI IK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS

Query:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNL
        N++   +EEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEM HPSAKSAPPSPTRDQ+FS+R+FFHSADSATKTLRRILNL
Subjt:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNL

Query:  DVDNE
        DVDNE
Subjt:  DVDNE

XP_022145927.1 uncharacterized protein LOC111015275 [Momordica charantia]1.8e-87100Show/hide
Query:  MAMGALSLVPLFLATSLRSHHFPSISTQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSW
        MAMGALSLVPLFLATSLRSHHFPSISTQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSW
Subjt:  MAMGALSLVPLFLATSLRSHHFPSISTQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSW

Query:  CQPWTITLTGLLLTASSWFVIKSIAVTAVILSVIILWWYIFLYSYPKAYSEMIAERRKKVTDGDEDTFG
        CQPWTITLTGLLLTASSWFVIKSIAVTAVILSVIILWWYIFLYSYPKAYSEMIAERRKKVTDGDEDTFG
Subjt:  CQPWTITLTGLLLTASSWFVIKSIAVTAVILSVIILWWYIFLYSYPKAYSEMIAERRKKVTDGDEDTFG

TrEMBL top hitse value%identityAlignment
A0A0A0KHP6 Uncharacterized protein1.3e-9187.32Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS
        MEAVVV+EQHRNQYY RVKPHGPARFGSL SRDF GMNCRSFQSGAGILPTPLKAC SE +  YP SPKTPP CL+S S N KQLATMRSAPIPIK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS

Query:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNL
        N++ A +EEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEM HPSAKSAPPSPTRDQNFS+RFFFHSADSATKTLRRILNL
Subjt:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNL

Query:  DVDNE
        DV NE
Subjt:  DVDNE

A0A1S4DXL6 uncharacterized protein LOC1079902943.4e-8985.37Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS
        MEAVVV+EQHRNQYY RVKPHGPARFGSL SRDF GMNCRSFQSGAGILPTPLKACTSE +  YP SPKTPP  L+S S N KQLAT RSAPI IK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS

Query:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNL
        N++   +EEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEM HPSAKSAPPSPTRDQ+FS+R+FFHSADSATKTLRRILNL
Subjt:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNL

Query:  DVDNE
        DVDNE
Subjt:  DVDNE

A0A5A7UUU2 Nascent polypeptide-associated complex subunit alpha, muscle-specific form7.5e-6883.73Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS
        MEAVVV+EQHRNQYY RVKPHGPARFGSL SRDF GMNCRSFQSGAGILPTPLKACTSE +  YP SPKTPP  L+S S N KQLAT RSAPI IK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS

Query:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAK
        N++   +EEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEM HPSAK
Subjt:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAK

A0A5D3CPQ8 Nascent polypeptide-associated complex subunit alpha, muscle-specific form7.5e-6883.73Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS
        MEAVVV+EQHRNQYY RVKPHGPARFGSL SRDF GMNCRSFQSGAGILPTPLKACTSE +  YP SPKTPP  L+S S N KQLAT RSAPI IK K S
Subjt:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFS

Query:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAK
        N++   +EEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLEL RSAPEFEM HPSAK
Subjt:  NKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPIPKFSVAKRTTSLELPRSAPEFEM-HPSAK

A0A6J1CY32 uncharacterized protein LOC1110152758.5e-88100Show/hide
Query:  MAMGALSLVPLFLATSLRSHHFPSISTQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSW
        MAMGALSLVPLFLATSLRSHHFPSISTQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSW
Subjt:  MAMGALSLVPLFLATSLRSHHFPSISTQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSW

Query:  CQPWTITLTGLLLTASSWFVIKSIAVTAVILSVIILWWYIFLYSYPKAYSEMIAERRKKVTDGDEDTFG
        CQPWTITLTGLLLTASSWFVIKSIAVTAVILSVIILWWYIFLYSYPKAYSEMIAERRKKVTDGDEDTFG
Subjt:  CQPWTITLTGLLLTASSWFVIKSIAVTAVILSVIILWWYIFLYSYPKAYSEMIAERRKKVTDGDEDTFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G02715.1 unknown protein1.4e-3445.19Show/hide
Query:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPS---SPKTPPTCLSSISGNGKQLATMRSAPIPIKV
        ME ++V  +HR+QYYG+ K  G  RF S PS+ F  +NCR+FQSG G+LP P +  ++   +G  S   SP++P + L     +   + + R++PIPI  
Subjt:  MEAVVVVEQHRNQYYGRVKPHGPARFGSLPSRDFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPS---SPKTPPTCLSSISGNGKQLATMRSAPIPIKV

Query:  KFSNKNGAVNEEFYD--RSFSFSELWAGPTYSNSPPPSSLPIPKFSV-AKRTTSLELPRSAPEFEMHPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLR
           ++    + EF D  RS S+SELWAGPTYSNSPPP+S+PIPKFS+  KRT SL  P      ++   AKSAP SPT     S    F S  SAT TLR
Subjt:  KFSNKNGAVNEEFYD--RSFSFSELWAGPTYSNSPPPSSLPIPKFSV-AKRTTSLELPRSAPEFEMHPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLR

Query:  RILNLDVD
        R+LNL+++
Subjt:  RILNLDVD

AT4G02725.1 unknown protein4.2e-3954.55Show/hide
Query:  TQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSWCQPWTITLTGLLLTASSWFVIKSIAV
        T+ LL+     + + S   +  +   K   FG       +S+F+DE GVVDDM+G+L+ LSLEYDSVWDTKPSWCQPWTI LTGL + A SW ++ S+ V
Subjt:  TQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSWCQPWTITLTGLLLTASSWFVIKSIAV

Query:  TAVILSVIILWWYIFLYSYPKAYSEMIAERRKKVTDGDEDTFG
        +++ + VI  WWYIFLYSYPK+YSEMIAERRK+V DG ED +G
Subjt:  TAVILSVIILWWYIFLYSYPKAYSEMIAERRKKVTDGDEDTFG

AT4G02725.2 unknown protein6.8e-2951.24Show/hide
Query:  TQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSWCQPWTITLTGLLLTASSWFVIKSIAV
        T+ LL+     + + S   +  +   K   FG       +S+F+DE GVVDDM+G+L+ LSLEYDSVWDTKPSWCQPWTI LTGL + A SW ++ S+ V
Subjt:  TQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSWCQPWTITLTGLLLTASSWFVIKSIAV

Query:  TAVILSVIILWWYIFLYSYPK
        +++ + VI  WWYIFLYSYPK
Subjt:  TAVILSVIILWWYIFLYSYPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATGGGAGCTCTTTCTCTTGTCCCTCTGTTCCTTGCAACTTCTTTACGTTCCCACCATTTTCCTTCAATTTCAACACAGACTCTTCTCGCGAACTTCGTT
GCTACGTCACAAGTCGTTTCCTTGACTCCCAGATGTACAAAATTCCGCAGGAAAAATGTCGTTTTTGGGAAGCAATCAAACAACGCGAACGAGTCTCAGTTTTTA
GACGAAAATGGCGTCGTCGATGATATGGATGGATATTTAAATTATCTCTCTCTCGAATATGACTCCGTTTGGGATACGAAACCTTCATGGTGTCAACCATGGACG
ATAACACTGACAGGACTATTACTGACTGCCTCTAGCTGGTTTGTTATAAAGTCGATAGCAGTGACTGCAGTAATACTGTCCGTAATAATCTTATGGTGGTACATC
TTTCTTTACTCATATCCGAAGGCTTATTCTGAAATGATTGCGGAGCGAAGAAAAAAGGTTACAGATGGAGACGAAGACACATTTGGTACTTTTTGGCTTGGAAGT
TCTTTTGCTATGGAAGCAGTGGTCGTTGTTGAGCAGCATAGGAACCAATATTATGGTCGGGTCAAGCCACATGGGCCAGCTCGATTTGGGTCACTCCCGTCCCGA
GACTTTAGTGGGATGAACTGTAGGAGTTTCCAATCTGGAGCTGGTATACTCCCAACTCCCTTGAAGGCTTGTACGTCTGAACCTAAACAGGGCTACCCTTCTTCA
CCCAAAACACCACCAACTTGTTTAAGTTCCATCTCCGGAAATGGAAAACAACTTGCTACTATGCGGAGTGCTCCAATTCCTATCAAAGTCAAATTTTCAAACAAG
AACGGTGCTGTAAATGAAGAATTCTATGACCGGAGTTTCTCATTCTCTGAGCTTTGGGCTGGACCCACTTACTCAAACTCCCCGCCTCCAAGTTCATTACCTATT
CCCAAATTTTCAGTTGCGAAAAGAACCACATCACTGGAGTTGCCTCGATCTGCCCCTGAATTTGAAATGCATCCATCTGCTAAGTCTGCACCACCATCCCCAACT
AGGGACCAAAACTTTTCCTCCAGATTTTTCTTTCATAGTGCTGACTCTGCGACCAAGACTCTGCGTCGCATTCTTAACCTTGATGTTGACAATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCATGGGAGCTCTTTCTCTTGTCCCTCTGTTCCTTGCAACTTCTTTACGTTCCCACCATTTTCCTTCAATTTCAACACAGACTCTTCTCGCGAACTTCGTT
GCTACGTCACAAGTCGTTTCCTTGACTCCCAGATGTACAAAATTCCGCAGGAAAAATGTCGTTTTTGGGAAGCAATCAAACAACGCGAACGAGTCTCAGTTTTTA
GACGAAAATGGCGTCGTCGATGATATGGATGGATATTTAAATTATCTCTCTCTCGAATATGACTCCGTTTGGGATACGAAACCTTCATGGTGTCAACCATGGACG
ATAACACTGACAGGACTATTACTGACTGCCTCTAGCTGGTTTGTTATAAAGTCGATAGCAGTGACTGCAGTAATACTGTCCGTAATAATCTTATGGTGGTACATC
TTTCTTTACTCATATCCGAAGGCTTATTCTGAAATGATTGCGGAGCGAAGAAAAAAGGTTACAGATGGAGACGAAGACACATTTGGTACTTTTTGGCTTGGAAGT
TCTTTTGCTATGGAAGCAGTGGTCGTTGTTGAGCAGCATAGGAACCAATATTATGGTCGGGTCAAGCCACATGGGCCAGCTCGATTTGGGTCACTCCCGTCCCGA
GACTTTAGTGGGATGAACTGTAGGAGTTTCCAATCTGGAGCTGGTATACTCCCAACTCCCTTGAAGGCTTGTACGTCTGAACCTAAACAGGGCTACCCTTCTTCA
CCCAAAACACCACCAACTTGTTTAAGTTCCATCTCCGGAAATGGAAAACAACTTGCTACTATGCGGAGTGCTCCAATTCCTATCAAAGTCAAATTTTCAAACAAG
AACGGTGCTGTAAATGAAGAATTCTATGACCGGAGTTTCTCATTCTCTGAGCTTTGGGCTGGACCCACTTACTCAAACTCCCCGCCTCCAAGTTCATTACCTATT
CCCAAATTTTCAGTTGCGAAAAGAACCACATCACTGGAGTTGCCTCGATCTGCCCCTGAATTTGAAATGCATCCATCTGCTAAGTCTGCACCACCATCCCCAACT
AGGGACCAAAACTTTTCCTCCAGATTTTTCTTTCATAGTGCTGACTCTGCGACCAAGACTCTGCGTCGCATTCTTAACCTTGATGTTGACAATGAATGA
Protein sequenceShow/hide protein sequence
MAMGALSLVPLFLATSLRSHHFPSISTQTLLANFVATSQVVSLTPRCTKFRRKNVVFGKQSNNANESQFLDENGVVDDMDGYLNYLSLEYDSVWDTKPSWCQPWT
ITLTGLLLTASSWFVIKSIAVTAVILSVIILWWYIFLYSYPKAYSEMIAERRKKVTDGDEDTFGTFWLGSSFAMEAVVVVEQHRNQYYGRVKPHGPARFGSLPSR
DFSGMNCRSFQSGAGILPTPLKACTSEPKQGYPSSPKTPPTCLSSISGNGKQLATMRSAPIPIKVKFSNKNGAVNEEFYDRSFSFSELWAGPTYSNSPPPSSLPI
PKFSVAKRTTSLELPRSAPEFEMHPSAKSAPPSPTRDQNFSSRFFFHSADSATKTLRRILNLDVDNE