; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G15920 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G15920
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionInositol-1-monophosphatase
Genome locationChr1:11428755..11431036
RNA-Seq ExpressionCSPI01G15920
SyntenyCSPI01G15920
Gene Ontology termsGO:0006021 - inositol biosynthetic process (biological process)
GO:0046854 - phosphatidylinositol phosphorylation (biological process)
GO:0046855 - inositol phosphate dephosphorylation (biological process)
GO:0008934 - inositol monophosphate 1-phosphatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0052832 - inositol monophosphate 3-phosphatase activity (molecular function)
GO:0052833 - inositol monophosphate 4-phosphatase activity (molecular function)
InterPro domainsIPR000760 - Inositol monophosphatase-like
IPR020550 - Inositol monophosphatase, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463623.1 PREDICTED: phosphatase IMPL1, chloroplastic [Cucumis melo]3.3e-7394.44Show/hide
Query:  VNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD
        +N      VERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD
Subjt:  VNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD

Query:  RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTD
        RSVLVSNGVVHDKLLEKIGPATEKLK KGIEFSLWYKPENYQTD
Subjt:  RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTD

XP_011655003.1 phosphatase IMPL1, chloroplastic [Cucumis sativus]3.8e-7495.17Show/hide
Query:  VNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD
        +N      VERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD
Subjt:  VNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD

Query:  RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
        RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
Subjt:  RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL

XP_022927292.1 phosphatase IMPL1, chloroplastic [Cucurbita moschata]1.5e-7089.93Show/hide
Query:  QAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKF
        Q I V+Q     VERSLLVTGFGYEHDDPW+TNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVL+VEEAGGAVTRMDG KF
Subjt:  QAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKF

Query:  SVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
        SVFDRSVLVSNG+VHDKLLEKIG ATEKLK KGI+FSLWYKPENYQTDL
Subjt:  SVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL

XP_038894424.1 phosphatase IMPL1, chloroplastic isoform X1 [Benincasa hispida]1.8e-7192.41Show/hide
Query:  VNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD
        +N      VERSLLVTGFGYEHDDPW TNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD
Subjt:  VNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD

Query:  RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
        RSVLVSNGVVHDKLLEKIG ATEKLKSKGI+FSLWYKPENY TDL
Subjt:  RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL

XP_038894425.1 phosphatase IMPL1, chloroplastic isoform X2 [Benincasa hispida]1.8e-7192.41Show/hide
Query:  VNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD
        +N      VERSLLVTGFGYEHDDPW TNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD
Subjt:  VNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD

Query:  RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
        RSVLVSNGVVHDKLLEKIG ATEKLKSKGI+FSLWYKPENY TDL
Subjt:  RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL

TrEMBL top hitse value%identityAlignment
A0A0A0LYM3 Uncharacterized protein3.5e-97100Show/hide
Query:  MVQDTPHLRGSISKPILEGGHDSFQAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPW
        MVQDTPHLRGSISKPILEGGHDSFQAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPW
Subjt:  MVQDTPHLRGSISKPILEGGHDSFQAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPW

Query:  DMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
        DMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
Subjt:  DMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL

A0A1S3CL94 Inositol-1-monophosphatase1.6e-7394.44Show/hide
Query:  VNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD
        +N      VERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD
Subjt:  VNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD

Query:  RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTD
        RSVLVSNGVVHDKLLEKIGPATEKLK KGIEFSLWYKPENYQTD
Subjt:  RSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTD

A0A6J1CMR5 Inositol-1-monophosphatase1.2e-7089.93Show/hide
Query:  QAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKF
        Q I V+Q     VERSLLVTGFGYEHDDPW TNMDLFKEFTDVSRGVRRLGAAAVDM HV+LGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKF
Subjt:  QAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKF

Query:  SVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
         VFDRSVLVSNGVVHDKLLEKIG ATEKLKSKG++FSLWYKPENYQTDL
Subjt:  SVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL

A0A6J1EKL1 Inositol-1-monophosphatase7.3e-7189.93Show/hide
Query:  QAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKF
        Q I V+Q     VERSLLVTGFGYEHDDPW+TNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVL+VEEAGGAVTRMDG KF
Subjt:  QAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKF

Query:  SVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
        SVFDRSVLVSNG+VHDKLLEKIG ATEKLK KGI+FSLWYKPENYQTDL
Subjt:  SVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL

A0A6J1KJ24 Inositol-1-monophosphatase7.3e-7189.93Show/hide
Query:  QAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKF
        Q I V+Q     VERSLLVTGFGYEHDDPW+TNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVL+VEEAGGAVTRMDG KF
Subjt:  QAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKF

Query:  SVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
        SVFDRSVLVSNG+VHDKLLEKIG ATEKLK KGI+FSLWYKPENYQTDL
Subjt:  SVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL

SwissProt top hitse value%identityAlignment
O33832 Fructose-1,6-bisphosphatase/inositol-1-monophosphatase2.7e-1437.08Show/hide
Query:  SRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNGVVHDKLLEKIGPATEKLKSK
        +R +R LG+AA++ ++V  G V+ +  +R+ PWD+AAG++IV+EAGG VT   G + + F ++ + SNG++HD++++ +    E++  K
Subjt:  SRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNGVVHDKLLEKIGPATEKLKSK

P74158 Inositol-1-monophosphatase9.7e-2040.91Show/hide
Query:  VERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNG
        +++SLLVTGF Y+       N   F   T +++GVRR G+AA+D+  VA G ++ YWE  + PWDMAAG++IV EAGG V+  D     +    +L +NG
Subjt:  VERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNG

Query:  VVHDKLLEKI
         +H +L + +
Subjt:  VVHDKLLEKI

Q94F00 Phosphatase IMPL1, chloroplastic1.6e-6786.86Show/hide
Query:  VERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNG
        VER+LL+TGFGYEHDD W+TNM+LFKEFTDVSRGVRRLGAAAVDM HVALGI E+YWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNG
Subjt:  VERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNG

Query:  VVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
        V+H KLLE+I PATE LKSKGI+FSLW+KPE+Y T+L
Subjt:  VVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL

Q9A3D5 Inositol-1-monophosphatase3.2e-1537.29Show/hide
Query:  HVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSR---GVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD-RSV
        H++ ++L TG  +        +    KE   VS+   GVRR GAA++D++ VA G  +A+WE  L  WD+AAGVL+++E+GG +T +D     V   +S+
Subjt:  HVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSR---GVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD-RSV

Query:  LVSNGVVHDKLLEKIGPA
        L SN  +H ++LE++  A
Subjt:  LVSNGVVHDKLLEKIGPA

Q9HXI4 Nus factor SuhB1.6e-1435.96Show/hide
Query:  VERSLLVTGFGYEHD--DPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVS
        +E +LL TGF +  +  D     +++F+     + G+RR GAA++D+++VA G  +A+WE+ L  WDMAAG L+V+EAGG V+   G    +    ++  
Subjt:  VERSLLVTGFGYEHD--DPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVS

Query:  NGVVHDKLLEKIGP
        N      LL  I P
Subjt:  NGVVHDKLLEKIGP

Arabidopsis top hitse value%identityAlignment
AT1G31190.1 myo-inositol monophosphatase like 11.2e-6886.86Show/hide
Query:  VERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNG
        VER+LL+TGFGYEHDD W+TNM+LFKEFTDVSRGVRRLGAAAVDM HVALGI E+YWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNG
Subjt:  VERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSNG

Query:  VVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL
        V+H KLLE+I PATE LKSKGI+FSLW+KPE+Y T+L
Subjt:  VVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL

AT3G02870.1 Inositol monophosphatase family protein5.1e-0831.53Show/hide
Query:  SLLVTGFGYEHDDPW---TTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSN
        +LLVT  G + D      TTN           R +R  G+ A+D+  VA G V+ ++E     PWD+AAG++IV+EAGG +    G    +  + +  SN
Subjt:  SLLVTGFGYEHDDPW---TTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSN

Query:  GVVHDKLLEKI
          + +   E +
Subjt:  GVVHDKLLEKI

AT3G02870.2 Inositol monophosphatase family protein5.1e-0831.53Show/hide
Query:  SLLVTGFGYEHDDPW---TTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSN
        +LLVT  G + D      TTN           R +R  G+ A+D+  VA G V+ ++E     PWD+AAG++IV+EAGG +    G    +  + +  SN
Subjt:  SLLVTGFGYEHDDPW---TTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSN

Query:  GVVHDKLLEKI
          + +   E +
Subjt:  GVVHDKLLEKI

AT3G02870.3 Inositol monophosphatase family protein5.1e-0831.53Show/hide
Query:  SLLVTGFGYEHDDPW---TTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSN
        +LLVT  G + D      TTN           R +R  G+ A+D+  VA G V+ ++E     PWD+AAG++IV+EAGG +    G    +  + +  SN
Subjt:  SLLVTGFGYEHDDPW---TTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGGKFSVFDRSVLVSN

Query:  GVVHDKLLEKI
          + +   E +
Subjt:  GVVHDKLLEKI

AT4G05090.1 Inositol monophosphatase family protein1.8e-0532.47Show/hide
Query:  IVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD-----------RSVLVSNGVVHDKLLEKIGPATEKL
        ++ A  +  +K WD A G++ V EAGG VT  +G + ++ +             V+VSNG +H+++LE I  A+  L
Subjt:  IVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGGKFSVFD-----------RSVLVSNGVVHDKLLEKIGPATEKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCAGGACACACCTCACCTTCGTGGTAGTATCTCCAAACCCATTTTGGAAGGAGGGCACGATTCTTTTCAAGCAATCTTTGTAAATCAAGGGCACCCTGGGCATGT
GGAACGGTCTCTTCTAGTTACAGGATTTGGATATGAACATGACGATCCATGGACTACAAATATGGATTTATTTAAAGAATTCACAGATGTTAGCAGGGGAGTGAGAAGGC
TAGGTGCAGCAGCAGTCGACATGAGCCATGTAGCTCTAGGAATTGTAGAAGCATATTGGGAGTATCGCCTGAAGCCATGGGATATGGCTGCCGGTGTTTTGATAGTTGAA
GAAGCGGGTGGAGCAGTGACTCGCATGGATGGTGGAAAGTTCAGTGTCTTTGATAGATCTGTTTTGGTATCTAATGGTGTTGTACATGACAAGCTTTTGGAGAAAATTGG
TCCTGCAACAGAAAAACTAAAAAGCAAAGGAATTGAGTTCTCATTGTGGTATAAGCCAGAAAACTACCAGACAGATCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCCAGGACACACCTCACCTTCGTGGTAGTATCTCCAAACCCATTTTGGAAGGAGGGCACGATTCTTTTCAAGCAATCTTTGTAAATCAAGGGCACCCTGGGCATGT
GGAACGGTCTCTTCTAGTTACAGGATTTGGATATGAACATGACGATCCATGGACTACAAATATGGATTTATTTAAAGAATTCACAGATGTTAGCAGGGGAGTGAGAAGGC
TAGGTGCAGCAGCAGTCGACATGAGCCATGTAGCTCTAGGAATTGTAGAAGCATATTGGGAGTATCGCCTGAAGCCATGGGATATGGCTGCCGGTGTTTTGATAGTTGAA
GAAGCGGGTGGAGCAGTGACTCGCATGGATGGTGGAAAGTTCAGTGTCTTTGATAGATCTGTTTTGGTATCTAATGGTGTTGTACATGACAAGCTTTTGGAGAAAATTGG
TCCTGCAACAGAAAAACTAAAAAGCAAAGGAATTGAGTTCTCATTGTGGTATAAGCCAGAAAACTACCAGACAGATCTTTGA
Protein sequenceShow/hide protein sequence
MVQDTPHLRGSISKPILEGGHDSFQAIFVNQGHPGHVERSLLVTGFGYEHDDPWTTNMDLFKEFTDVSRGVRRLGAAAVDMSHVALGIVEAYWEYRLKPWDMAAGVLIVE
EAGGAVTRMDGGKFSVFDRSVLVSNGVVHDKLLEKIGPATEKLKSKGIEFSLWYKPENYQTDL