; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G002170 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G002170
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of unknown function (DUF962)
Genome locationCmo_Chr10:956719..958027
RNA-Seq ExpressionCmoCh10G002170
SyntenyCmoCh10G002170
Gene Ontology termsGO:0046521 - sphingoid catabolic process (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009305 - 2-hydroxy-palmitic acid dioxygenase Mpo1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138174.1 uncharacterized endoplasmic reticulum membrane protein YGL010W [Cucumis sativus]2.9e-8378.61Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        MGKTGLFDLE+ FAFYGAYHSNP+NIFIH             YLYFTPSFYTIPKSPCGFD GL  NFGFLFTL+YAA YVVFDK+AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL
        GAS +A +LGYSQTWK+VL AQLFCWTNQ I HGVFE   KRAPALLDNLAQ FLMAPFFV  EVLQSLFKYEPYPGFSASVQAKIKADI+EWKE KEKL
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL

Query:  T
        +
Subjt:  T

XP_022933403.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita moschata]3.1e-8579.6Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        MGKT  FDLERHFAFYGAYHSNPVNIFIH             YLYFTPSFYTIPKSPCGFD GL  NFGFLFTLIYAASYV+FDK+AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL
        GAS ++ RLGYSQTWK+VL AQLFCWTNQ I HGVFE   KRAPALLDNLAQ FLMAPFFVF EVLQ+LFKYEPYPGFSASVQAKI+ADIKEWKE+KEKL
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL

Query:  T
        +
Subjt:  T

XP_022972978.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita maxima]2.0e-8478.61Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        MGKT  FDLERH+AFYGAYHSNPVNIFIH             YLYFTPSFYTIPKSPCGFD GL  NFGFLFTL YAASYV+FDKKAGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL
        GAS ++ RLGYSQTWK+VL AQLFCWTNQ+I HGVFE   KRAPALLDNLAQ FLMAPFFVF EVLQ+LFKYEPYPGFS+SVQAKI+ADIKEWKE KEKL
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL

Query:  T
        +
Subjt:  T

XP_023531593.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita pepo subsp. pepo]7.6e-8479.29Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        MGKT  FDLERHFAFYGAYHSNP+NIFIH             YLYFTPSFYTIPKSPCGFD GL  NFGFLFTLIYAASYV+FDKKAGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKE
        GAS ++ RLGYSQTWK+VL AQLFCWTNQ I HGVFE   KRAPALLDNLAQ FLMAPFFVF EVLQ+LFKYEPYPGFSASVQAKI+ADIKEWKE+K+
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKE

XP_038880038.1 2-hydroxy-palmitic acid dioxygenase MPO1 [Benincasa hispida]8.1e-8681.09Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        MGK+GLFDLERHFAFYGAYHSNPVNIFIH             YLYFTPSFYTIPKSPCGFD GL  NFGFLFTLIYAASYVVFDKKAGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL
        GAS +A RLGYSQTWK+VL AQLFCWTNQ I HGVFE   KRAPALLDNLAQ FLMAPFFV  EVLQSLFKYEPYPGFSASVQAKIKADI+EWKE+K+KL
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL

Query:  T
        +
Subjt:  T

TrEMBL top hitse value%identityAlignment
A0A0A0LP54 Uncharacterized protein1.4e-8378.61Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        MGKTGLFDLE+ FAFYGAYHSNP+NIFIH             YLYFTPSFYTIPKSPCGFD GL  NFGFLFTL+YAA YVVFDK+AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL
        GAS +A +LGYSQTWK+VL AQLFCWTNQ I HGVFE   KRAPALLDNLAQ FLMAPFFV  EVLQSLFKYEPYPGFSASVQAKIKADI+EWKE KEKL
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL

Query:  T
        +
Subjt:  T

A0A1S3BV50 uncharacterized endoplasmic reticulum membrane protein YGL010W3.1e-8378.11Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        MGKTGLFDLE+ FAFYGAYHSNP+NIFIH             YLYFTPSFYTIPKSPCGFD GL  NFGF FTL+YAA YVVFDK+AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL
        GAS +A +LGYSQTWK+VL AQLFCWTNQ I HGVFE   KRAPALLDNLAQ FLMAPFFV  EVLQSLFKYEPYPGFSASVQAKIKADI+EWKE+KEKL
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL

Query:  T
        +
Subjt:  T

A0A5A7UQ13 Putative endoplasmic reticulum membrane protein3.1e-8378.11Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        MGKTGLFDLE+ FAFYGAYHSNP+NIFIH             YLYFTPSFYTIPKSPCGFD GL  NFGF FTL+YAA YVVFDK+AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL
        GAS +A +LGYSQTWK+VL AQLFCWTNQ I HGVFE   KRAPALLDNLAQ FLMAPFFV  EVLQSLFKYEPYPGFSASVQAKIKADI+EWKE+KEKL
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL

Query:  T
        +
Subjt:  T

A0A6J1F4N0 uncharacterized endoplasmic reticulum membrane protein C16E8.021.5e-8579.6Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        MGKT  FDLERHFAFYGAYHSNPVNIFIH             YLYFTPSFYTIPKSPCGFD GL  NFGFLFTLIYAASYV+FDK+AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL
        GAS ++ RLGYSQTWK+VL AQLFCWTNQ I HGVFE   KRAPALLDNLAQ FLMAPFFVF EVLQ+LFKYEPYPGFSASVQAKI+ADIKEWKE+KEKL
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL

Query:  T
        +
Subjt:  T

A0A6J1IBP8 uncharacterized endoplasmic reticulum membrane protein C16E8.029.6e-8578.61Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        MGKT  FDLERH+AFYGAYHSNPVNIFIH             YLYFTPSFYTIPKSPCGFD GL  NFGFLFTL YAASYV+FDKKAGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIH-------------YLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL
        GAS ++ RLGYSQTWK+VL AQLFCWTNQ+I HGVFE   KRAPALLDNLAQ FLMAPFFVF EVLQ+LFKYEPYPGFS+SVQAKI+ADIKEWKE KEKL
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKL

Query:  T
        +
Subjt:  T

SwissProt top hitse value%identityAlignment
O13737 2-hydroxy-palmitic acid dioxygenase mpo11.0e-1435.8Show/hide
Query:  LERHFAFYGAYHSNPVNIFIHYLYFTPSFYTIPKSPCGF-----DRGLAFNFGFLFTLIYAASYVVFDKKAG-SNAALPCLICWVGASILAFRLGYSQTW
        L R ++FY AYHSNPVNI IH +       T       F     +  L  N   L  L Y   YV  D   G   + +  L  ++  S L      S   
Subjt:  LERHFAFYGAYHSNPVNIFIHYLYFTPSFYTIPKSPCGF-----DRGLAFNFGFLFTLIYAASYVVFDKKAG-SNAALPCLICWVGASILAFRLGYSQTW

Query:  KIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIK
        +      + CW  Q I HGVFE   KR PALLDNL Q   +AP F F E         P+ G+  SV +KI+A+IK
Subjt:  KIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIK

P25338 2-hydroxy-palmitic acid dioxygenase MPO13.0e-1129.83Show/hide
Query:  GLFDLERHFAFYGAYHSNPVNIFIHYLYFTPSFYTIPKSPCGFDR-------GLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWVGASILAFRLG
        GL DL     FY  YH NP N+ IH ++     ++     C   R        L      LF++ Y   Y+     AG       L+  +  +++  R+ 
Subjt:  GLFDLERHFAFYGAYHSNPVNIFIHYLYFTPSFYTIPKSPCGFDR-------GLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWVGASILAFRLG

Query:  YSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIK
           T+K  L      W  Q + HGVFE   KR PAL+DNL Q  ++AP+F+ FE L  L       GF   ++A ++ D++
Subjt:  YSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIK

Arabidopsis top hitse value%identityAlignment
AT1G18720.1 Protein of unknown function (DUF962)1.9e-5656.28Show/hide
Query:  GLFDLERHFAFYGAYHSNPVNIFIHYLYFTPSFYTI-----------PKSPCGFDRGLA------FNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV
        GLFDLE+HFAFYGAYHSNP+NI IH ++  P F+++             S  GF + L       FN GF+F LIYA  Y+  DKK+G  AAL C  CWV
Subjt:  GLFDLERHFAFYGAYHSNPVNIFIHYLYFTPSFYTI-----------PKSPCGFDRGLA------FNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWV

Query:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEK
        G+S LA RLG S   K+ L +QL CWT Q + HGVFE   KRAPALLDNL Q FLMAPFFV  EVLQS+F YEPYPGF A V AK+++DIKE++ +K+K
Subjt:  GASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEK

AT1G74440.1 Protein of unknown function (DUF962)3.3e-5352.24Show/hide
Query:  KTGLFDLERHFAFYGAYHSNPVNIFIHYLYFTPSFYT-----------IPKSPCGFDRGLAF------NFGFLFTLIYAASYVVFDKKAGSNAALPCLIC
        + GL DLE+HFAFYGAYHSNP+NI IH L+  P+ +            +  S  GF + L F      + GF  T+ YA  Y+  DKK+G  AAL C  C
Subjt:  KTGLFDLERHFAFYGAYHSNPVNIFIHYLYFTPSFYT-----------IPKSPCGFDRGLAF------NFGFLFTLIYAASYVVFDKKAGSNAALPCLIC

Query:  WVGASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKE
        W+G+S LA RLG+S T K+ + +QL CWT Q + HG+FE   KRAPALLDNL Q FLM PFFV  EVLQS+F YEPYPGF A V +KI++ IKEW+E+K+
Subjt:  WVGASILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKE

Query:  K
        +
Subjt:  K


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGACTGGTTTGTTTGATCTGGAGAGGCATTTCGCCTTCTATGGCGCTTATCACAGCAACCCAGTCAACATTTTCATTCATTATTTGTACTTCACCCCA
TCTTTCTACACTATTCCCAAATCCCCTTGTGGGTTTGACCGCGGCCTGGCCTTCAACTTCGGATTTCTTTTCACCTTAATATATGCTGCATCTTATGTGGTCTTC
GATAAGAAAGCGGGGTCCAATGCCGCTTTGCCTTGCTTGATTTGTTGGGTTGGAGCAAGCATACTCGCCTTTAGACTTGGTTATTCTCAGACCTGGAAGATAGTA
CTGACTGCTCAGTTGTTCTGTTGGACCAATCAGTTAATAGACCATGGAGTATTTGAGGTTAGACAGAAACGAGCACCGGCTTTGTTAGACAATCTTGCTCAACCT
TTTCTAATGGCTCCATTCTTTGTATTTTTTGAGGTTCTTCAAAGTTTGTTCAAATATGAACCATACCCAGGATTTAGTGCGAGTGTGCAAGCGAAGATAAAAGCA
GATATCAAAGAGTGGAAAGAACAGAAGGAAAAGCTGACATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGACTGGTTTGTTTGATCTGGAGAGGCATTTCGCCTTCTATGGCGCTTATCACAGCAACCCAGTCAACATTTTCATTCATTATTTGTACTTCACCCCA
TCTTTCTACACTATTCCCAAATCCCCTTGTGGGTTTGACCGCGGCCTGGCCTTCAACTTCGGATTTCTTTTCACCTTAATATATGCTGCATCTTATGTGGTCTTC
GATAAGAAAGCGGGGTCCAATGCCGCTTTGCCTTGCTTGATTTGTTGGGTTGGAGCAAGCATACTCGCCTTTAGACTTGGTTATTCTCAGACCTGGAAGATAGTA
CTGACTGCTCAGTTGTTCTGTTGGACCAATCAGTTAATAGACCATGGAGTATTTGAGGTTAGACAGAAACGAGCACCGGCTTTGTTAGACAATCTTGCTCAACCT
TTTCTAATGGCTCCATTCTTTGTATTTTTTGAGGTTCTTCAAAGTTTGTTCAAATATGAACCATACCCAGGATTTAGTGCGAGTGTGCAAGCGAAGATAAAAGCA
GATATCAAAGAGTGGAAAGAACAGAAGGAAAAGCTGACATAGGTTGAATTGCCTAAAGTTCAAGACAACTCTTGCCTAGATCTCCCTTTGTAGTAGTACGAAGAG
AAAAACATTTTGAAATCTTTACGTGATATTTGTTAGGAATCACGAACTTTCACAATGATATGATATTGTCTATTTTGAGCATAAACTCTCATGAAGACATGACTC
TGATACCATGTTAGAAATCACGTACCTCCGCACTAGTATGATATTGTCCACTGCTCTCATGGCATAAGCTCCCTCCCAACAATCCTCGACAATATACGATGTTTT
Protein sequenceShow/hide protein sequence
MGKTGLFDLERHFAFYGAYHSNPVNIFIHYLYFTPSFYTIPKSPCGFDRGLAFNFGFLFTLIYAASYVVFDKKAGSNAALPCLICWVGASILAFRLGYSQTWKIV
LTAQLFCWTNQLIDHGVFEVRQKRAPALLDNLAQPFLMAPFFVFFEVLQSLFKYEPYPGFSASVQAKIKADIKEWKEQKEKLT