CuGenDBv2

Spg015362 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg015362
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein of unknown function (DUF1997)
Genome location: scaffold10:15688639..15695970
RNA-Seq Expression: Spg015362
Synteny: Spg015362
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997
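
The genome location field follows a `sequence:start..end` pattern. A minimal sketch of parsing it is shown here, assuming the coordinates are 1-based and inclusive (the page does not state this); `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive interval length in base pairs.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'scaffold10:15688639..15695970'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("scaffold10:15688639..15695970")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 7332 bp on scaffold10, which is considerably longer than the coding sequence listed further down the page.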


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136694.1 uncharacterized protein LOC101213732 isoform X1 [Cucumis sativus]; e value: 1.7e-80; % identity: 62.88
Query:  MGHNLV-VSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQ-------SFMCFAV---NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRAT
        M H++V VS Q PQL++NP+     K SSK Y    K+       +F+CFA+   N+N +  QNPPIFSL+FSSF PLSESP+A FDDY+EDEARLLRAT
Subjt:  MGHNLV-VSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQ-------SFMCFAV---NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRAT

Query:  FSGKSEKLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNL
        FSGKSEK+N+                VSPVADVRL+C+SS +D PIHIP N+SKF+DLQLM WELKG+  DFK  +  I+VKGA+YAERT+SKSVLTNNL
Subjt:  FSGKSEKLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNL

Query:  LLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQANVVPTNY
        LLNL+N A   P+DFFAQDFLQP  EKGLKGMME+ M+EFTENLLLDY KYKKE Q N VP+NY
Subjt:  LLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQANVVPTNY

XP_008443384.1 PREDICTED: uncharacterized protein LOC103486982 [Cucumis melo]; e value: 9.9e-81; % identity: 62.26
Query:  MGHNLV-VSFQFPQLIMNPSGHLRNK---RSSKFYEKKTKQSFMCFAV--------NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRA
        M HNLV VS Q PQLI+NP+  L +K      K +      +F+CFA+        +NN++QNQNPPIFSL+FSSFHPLSESP+A FDDY+EDE RLLRA
Subjt:  MGHNLV-VSFQFPQLIMNPSGHLRNK---RSSKFYEKKTKQSFMCFAV--------NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRA

Query:  TFSGKSEKLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNN
        TF+GKSEK+++                VSPVADVRL+C+S  +D PIHIPHN+SKF+DLQLM WELKG+  DFK  +  I+VKGA+YAERT+SKSVL NN
Subjt:  TFSGKSEKLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNN

Query:  LLLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQA--NVVPT
        LLLNL+N A P P+DFFAQDFLQP AEKGLKGMME+ M+EF ENLLLDY KYKKEKQ    VVP+
Subjt:  LLLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQA--NVVPT

XP_023528168.1 uncharacterized protein LOC111791159 [Cucurbita pepo subsp. pepo]; e value: 1.1e-71; % identity: 59.84
Query:  MGHNL-VVSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQSFMCFAVNNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSEKLN-
        M HNL  VSF FPQLI+N   H            + +  F  FAV NN++ +QNPPIFSLRFS+FHPL ESP A FD+Y+ DE RLLRATFSGKSEKL+ 
Subjt:  MGHNL-VVSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQSFMCFAVNNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSEKLN-

Query:  ---------------EVSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERT--ESKSVLTNNLLLNLHNFA
                       ++SPV DVRL+CRSSA+DYPIHIP +++KFLDLQ+MRWE++GMG DFKPQ F ISVKGA YA RT  ESKSVL N+L+L+LH+F 
Subjt:  ---------------EVSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERT--ESKSVLTNNLLLNLHNFA

Query:  APTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQA
        +P P      DFLQPFAEKGL+GMM+++M++FT+NL+LDY KYKKEKQ+
Subjt:  APTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQA

XP_038905853.1 uncharacterized protein LOC120091799 isoform X1 [Benincasa hispida]; e value: 1.5e-92; % identity: 71.26
Query:  MGHNLV-VSFQFPQLIMNPSGHLRNKRSSKF--YEKKTKQSFMCFAV--NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSE
        M HNLV VSFQ PQLI+N      NKRS  +  ++KK    F+CFAV  NN++H +QNPPIFSL+FSSFHPLSESP+A FDDY+EDEARLLR TFSGKSE
Subjt:  MGHNLV-VSFQFPQLIMNPSGHLRNKRSSKF--YEKKTKQSFMCFAV--NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSE

Query:  KLN----------------EVSPVADVRLNCRS--SAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNL
        K+N                EVS VADVRLNCRS  + QDYPIHIPH++SKF+DLQLMRWELKG+GT+FKPQRFTI+V+GALYAERTESKS+LTNN +LNL
Subjt:  KLN----------------EVSPVADVRLNCRS--SAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNL

Query:  HNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQANVVPTNYG
        HNFAAPTP DFFAQDFLQPFAEKGLKGMME+TM EFTE LLLDY KYKKEKQ N V  N G
Subjt:  HNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQANVVPTNYG

XP_038905855.1 uncharacterized protein LOC120091799 isoform X2 [Benincasa hispida]; e value: 2.0e-73; % identity: 64.61
Query:  MGHNLV-VSFQFPQLIMNPSGHLRNKRSSKF--YEKKTKQSFMCFAV--NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSE
        M HNLV VSFQ PQLI+N      NKRS  +  ++KK    F+CFAV  NN++H +QNPPIFSL+FSSFHPLSESP+A FDDY+EDEARLLR TFSGKSE
Subjt:  MGHNLV-VSFQFPQLIMNPSGHLRNKRSSKF--YEKKTKQSFMCFAV--NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSE

Query:  KLNEVSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFLQ
        K+N+                                  MRWELKG+GT+FKPQRFTI+V+GALYAERTESKS+LTNN +LNLHNFAAPTP DFFAQDFLQ
Subjt:  KLNEVSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFLQ

Query:  PFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQANVVPTNYG
        PFAEKGLKGMME+TM EFTE LLLDY KYKKEKQ N V  N G
Subjt:  PFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQANVVPTNYG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LC26 Uncharacterized protein; e value: 8.2e-81; % identity: 62.88
Query:  MGHNLV-VSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQ-------SFMCFAV---NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRAT
        M H++V VS Q PQL++NP+     K SSK Y    K+       +F+CFA+   N+N +  QNPPIFSL+FSSF PLSESP+A FDDY+EDEARLLRAT
Subjt:  MGHNLV-VSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQ-------SFMCFAV---NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRAT

Query:  FSGKSEKLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNL
        FSGKSEK+N+                VSPVADVRL+C+SS +D PIHIP N+SKF+DLQLM WELKG+  DFK  +  I+VKGA+YAERT+SKSVLTNNL
Subjt:  FSGKSEKLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNL

Query:  LLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQANVVPTNY
        LLNL+N A   P+DFFAQDFLQP  EKGLKGMME+ M+EFTENLLLDY KYKKE Q N VP+NY
Subjt:  LLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQANVVPTNY

A0A1S3B8N8 uncharacterized protein LOC103486982; e value: 4.8e-81; % identity: 62.26
Query:  MGHNLV-VSFQFPQLIMNPSGHLRNK---RSSKFYEKKTKQSFMCFAV--------NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRA
        M HNLV VS Q PQLI+NP+  L +K      K +      +F+CFA+        +NN++QNQNPPIFSL+FSSFHPLSESP+A FDDY+EDE RLLRA
Subjt:  MGHNLV-VSFQFPQLIMNPSGHLRNK---RSSKFYEKKTKQSFMCFAV--------NNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRA

Query:  TFSGKSEKLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNN
        TF+GKSEK+++                VSPVADVRL+C+S  +D PIHIPHN+SKF+DLQLM WELKG+  DFK  +  I+VKGA+YAERT+SKSVL NN
Subjt:  TFSGKSEKLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNN

Query:  LLLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQA--NVVPT
        LLLNL+N A P P+DFFAQDFLQP AEKGLKGMME+ M+EF ENLLLDY KYKKEKQ    VVP+
Subjt:  LLLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQA--NVVPT

A0A6J1CT99 uncharacterized protein LOC111014131; e value: 7.2e-69; % identity: 57.79
Query:  MGHNLVVSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQSFMCFAVNNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSEKLNE-
        MGH L+VS QFP  I +P  HLR +  +  +  + KQ+F+CFA+N     +QNPP+FSL FS  HPL ES +A FD+Y+EDE R+LRATF+GKSE+L + 
Subjt:  MGHNLVVSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQSFMCFAVNNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSEKLNE-

Query:  ----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNLHNFAAP
                        V+PV DVR  CRSSA+DYPIHIP +ISKFL+LQLMRWEL G+G DFK Q F ISVKGALYAER ESKS L   L+LNLH+FAAP
Subjt:  ----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNLHNFAAP

Query:  TPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKE
        TPL F  QD     A+KGLKGMME+ M +F+E LLLDY K+K++
Subjt:  TPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKE

A0A6J1F2K4 uncharacterized protein LOC111441814 isoform X2; e value: 2.6e-71; % identity: 59.27
Query:  MGHNL-VVSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQSFMCFAVNNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSEKLN-
        M HNL  VSF FPQLI+N   H            + +  F  FAV NN++ +QNPPIFSLRFS+FHPL ESP A FD+Y+ DE RLLRATFSGKSEKLN 
Subjt:  MGHNL-VVSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQSFMCFAVNNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSEKLN-

Query:  ---------------EVSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERT--ESKSVLTNNLLLNLHNFA
                       ++SPV DVRL+C+SS +DYPIHIP ++SKFLDLQ+MRWE++GMG DFKPQ F ISVKG +YA RT  ESKS+L N+L+L+LH+F 
Subjt:  ---------------EVSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERT--ESKSVLTNNLLLNLHNFA

Query:  APTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQ
        +P P      DFLQPFAEKGL+GMM+++M++FT+NL+LDY KYKKEKQ
Subjt:  APTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQ

A0A6J1J0I3 uncharacterized protein LOC111482352; e value: 2.2e-70; % identity: 59.68
Query:  MGHNL-VVSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQSFMCFAVNNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSEKLN-
        M HNL  VSF FPQLI++   H            + + SF  FAV NN++ +QNPPIFSLRFS+FHPL ESP A FD+Y+ DE RLLRATFSGKSEKLN 
Subjt:  MGHNL-VVSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQSFMCFAVNNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSEKLN-

Query:  ---------------EVSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERT--ESKSVLTNNLLLNLHNFA
                       ++SP+ DVRL+CRS A+DYPIHIP ++SKFLDLQ+MRWE++GMG DFK Q F ISVKGA YA RT  ESKSVL N+L+L+LH+F 
Subjt:  ---------------EVSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERT--ESKSVLTNNLLLNLHNFA

Query:  APTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQ
        +  P      DFLQPFAEKGLKGMM+++M++FT+NL+LDY KYKKEKQ
Subjt:  APTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G39520.1 Protein of unknown function (DUF1997); e value: 3.7e-25; % identity: 34.03
Query:  FSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSE--KLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWEL
        +S + S+   L ESP+ALFD+YLED++R+  A F  K +  +LNE                  PV  +R+ C+S+ QDYP  +P +I+K L+L + +WEL
Subjt:  FSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSE--KLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMRWEL

Query:  KGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQ
        +G+    +P  FT+ VKGALY +R    + L   L   + +F  P+ L    +D  +  A   L G+++       E+L+ DY K+K E++
Subjt:  KGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQ

AT5G39530.1 Protein of unknown function (DUF1997); e value: 1.0e-30; % identity: 37.63
Query:  PPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGK--SEKLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMR
        P  +S R S+  PL+ESP+ALFD+YLED++R+  A F  K  S +LNE                V PV D+RL C+S+ QDYP  +P +I+K L+L +MR
Subjt:  PPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGK--SEKLNE----------------VSPVADVRLNCRSSAQDYPIHIPHNISKFLDLQLMR

Query:  WELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQ
        W+L+G+    +P  F++ VKGALY +R    + L   L +N+ +F  P  L+   +D  +  A   L G++E    +   +LL DY ++K E++
Subjt:  WELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLDYGKYKKEKQ
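
In the alignment listings above, the unlabelled middle row of each block marks identical residues with a letter and conservative substitutions with `+`. The Subjt rows appear to repeat the Query rows in this listing, so the middle row is the usable record of matches. The sketch below counts identities and positives from one such row. It is an illustration only: `midline_stats` is a hypothetical helper, and the reported % identity is presumably computed over the whole alignment, so a single block will not necessarily reproduce it.

```python
def midline_stats(midline: str, aligned_length: int) -> dict:
    """Summarise one BLAST-style match row.

    midline        -- the row between Query and Subjt (letters, '+', spaces)
    aligned_length -- number of alignment columns in the block; passed
                      explicitly because trailing spaces in the match row
                      may have been stripped
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(ch.isalpha() for ch in midline)  # letter = identical residue
    positives = identities + midline.count("+")       # '+' = conservative change
    return {
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / aligned_length, 2),
        "percent_positive": round(100.0 * positives / aligned_length, 2),
    }


# Toy example (hypothetical 10-column block, not taken from a hit above).
print(midline_stats("M H++V VS", 10))
```

Summing the identity counts and column counts over every block of one hit, then dividing once, would be the natural way to approximate the per-hit figure.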


Sequences
CDS sequence
ATGGGTCATAATTTGGTTGTTTCTTTCCAGTTCCCACAGCTGATTATGAATCCAAGTGGGCATTTGAGGAACAAGAGATCATCAAAATTTTACGAGAAGAAAACGAAGCA
GTCTTTTATGTGCTTCGCAGTGAATAACAATCATCACCAAAATCAAAATCCTCCAATTTTCTCTCTTAGATTCTCCAGCTTCCATCCACTTTCCGAATCTCCTGAGGCTT
TGTTTGATGATTACCTTGAAGATGAAGCCAGATTGTTGAGAGCAACTTTTTCTGGAAAAAGTGAAAAACTAAACGAGGTCAGCCCGGTTGCTGACGTAAGATTAAACTGC
AGAAGTTCTGCCCAAGATTACCCTATTCATATTCCTCACAATATCTCCAAGTTTCTTGACCTTCAACTGATGAGATGGGAGCTGAAGGGAATGGGCACAGATTTCAAACC
ACAAAGGTTCACAATCAGTGTAAAAGGAGCTTTGTATGCTGAAAGAACAGAATCAAAAAGTGTCCTCACAAATAATTTGCTGCTCAATCTTCACAACTTTGCTGCCCCTA
CACCTCTTGACTTCTTTGCACAAGATTTTCTTCAACCCTTTGCAGAAAAGGGATTGAAGGGAATGATGGAGCAAACAATGCAAGAATTTACAGAAAATTTGCTTTTGGAT
TATGGCAAATACAAGAAGGAGAAGCAAGCGAATGTGGTTCCAACCAATTATGGATAA
mRNA sequence
ATGGGTCATAATTTGGTTGTTTCTTTCCAGTTCCCACAGCTGATTATGAATCCAAGTGGGCATTTGAGGAACAAGAGATCATCAAAATTTTACGAGAAGAAAACGAAGCA
GTCTTTTATGTGCTTCGCAGTGAATAACAATCATCACCAAAATCAAAATCCTCCAATTTTCTCTCTTAGATTCTCCAGCTTCCATCCACTTTCCGAATCTCCTGAGGCTT
TGTTTGATGATTACCTTGAAGATGAAGCCAGATTGTTGAGAGCAACTTTTTCTGGAAAAAGTGAAAAACTAAACGAGGTCAGCCCGGTTGCTGACGTAAGATTAAACTGC
AGAAGTTCTGCCCAAGATTACCCTATTCATATTCCTCACAATATCTCCAAGTTTCTTGACCTTCAACTGATGAGATGGGAGCTGAAGGGAATGGGCACAGATTTCAAACC
ACAAAGGTTCACAATCAGTGTAAAAGGAGCTTTGTATGCTGAAAGAACAGAATCAAAAAGTGTCCTCACAAATAATTTGCTGCTCAATCTTCACAACTTTGCTGCCCCTA
CACCTCTTGACTTCTTTGCACAAGATTTTCTTCAACCCTTTGCAGAAAAGGGATTGAAGGGAATGATGGAGCAAACAATGCAAGAATTTACAGAAAATTTGCTTTTGGAT
TATGGCAAATACAAGAAGGAGAAGCAAGCGAATGTGGTTCCAACCAATTATGGATAA
Protein sequence
MGHNLVVSFQFPQLIMNPSGHLRNKRSSKFYEKKTKQSFMCFAVNNNHHQNQNPPIFSLRFSSFHPLSESPEALFDDYLEDEARLLRATFSGKSEKLNEVSPVADVRLNC
RSSAQDYPIHIPHNISKFLDLQLMRWELKGMGTDFKPQRFTISVKGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFLQPFAEKGLKGMMEQTMQEFTENLLLD
YGKYKKEKQANVVPTNYG
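
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code and a CDS that starts in frame; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool. It is applied to the first 30 nucleotides and to the final line of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS, and its last line (which ends with the TAA stop codon).
cds_start = "ATGGGTCATAATTTGGTTGTTTCTTTCCAG"
cds_end = "TATGGCAAATACAAGAAGGAGAAGCAAGCGAATGTGGTTCCAACCAATTATGGATAA"

print(translate(cds_start))  # MGHNLVVSFQ
print(translate(cds_end))    # YGKYKKEKQANVVPTNYG
```

Both fragments agree with the corresponding ends of the protein sequence shown above. Translating the full CDS requires joining its lines first so that codons spanning a line break are read correctly.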