; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002241 (gene) of Snake gourd v1 genome

Gene IDTan0002241
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionInositol-1-monophosphatase
Genome locationLG05:80571256..80592801
RNA-Seq ExpressionTan0002241
SyntenyTan0002241
Gene Ontology termsGO:0006021 - inositol biosynthetic process (biological process)
GO:0007165 - signal transduction (biological process)
GO:0046854 - phosphatidylinositol phosphorylation (biological process)
GO:0046855 - inositol phosphate dephosphorylation (biological process)
GO:0008934 - inositol monophosphate 1-phosphatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0052832 - inositol monophosphate 3-phosphatase activity (molecular function)
GO:0052833 - inositol monophosphate 4-phosphatase activity (molecular function)
InterPro domainsIPR000760 - Inositol monophosphatase-like
IPR020550 - Inositol monophosphatase, conserved site
IPR020583 - Inositol monophosphatase, metal-binding site
IPR033942 - Inositol monophosphatase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142432.1 phosphatase IMPL1, chloroplastic [Momordica charantia]2.2e-16590.94Show/hide
Query:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV
        RSISSLSPSLA+LFNA R+ N GF  IR LD KL K RAA S+ISYDK+YPKVGAKS+GPIPP+QLIQVVENAAKTGA+VVMDAVNKPRNVEYKGLTDLV
Subjt:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV

Query:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
        T+TDKMSEAAIL+VVRKNFKDHLILGEEGG+IGDSSSDYLWCIDPLDGTTNFAH YPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
Subjt:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF

Query:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK
        CNGQKI VSQTSQVERSLLVTGFGYEHDDPWA N+DLFKEFTDVSRGVRRLGAAAVDMCHV+LGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDG K
Subjt:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK

Query:  FCVFDRSVLVSNGVVHGKVM
        FCVFDRSVLVSNGVVH K++
Subjt:  FCVFDRSVLVSNGVVHGKVM

XP_022927292.1 phosphatase IMPL1, chloroplastic [Cucurbita moschata]6.8e-16791.56Show/hide
Query:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV
        RSISSL+P +AVLFNARRDYN GFPLI  L  KLAKTRAALSEISY K+YPKVGAKS+GPIPPAQLIQVVE AAKTGA+VVMDAVNKPRN+EYKGLTDLV
Subjt:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV

Query:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
        TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRG+PAAAAVVEFVGGPMCWNTRIFTATAGGGAF
Subjt:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF

Query:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK
        CNGQKI VSQTSQVERSLLVTGFGYEHDDPW+ N+DLFKEFTDVSRGVRRLGAAAVDM HVALGIVEAYWEYRLKPWDMAAGVL+VEEAGGAVTRMDG K
Subjt:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK

Query:  FCVFDRSVLVSNGVVHGKVM
        F VFDRSVLVSNG+VH K++
Subjt:  FCVFDRSVLVSNGVVHGKVM

XP_023001591.1 phosphatase IMPL1, chloroplastic [Cucurbita maxima]2.5e-16992.5Show/hide
Query:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV
        RSISSL+P + VLFNARRDYNRGFPLI  LD KLAKTRAALSEISY K+YPKVGAKS+GPIPPAQLIQVVENAAKTGA+VVMDAVNKPRN+EYKGLTDLV
Subjt:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV

Query:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
        TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
Subjt:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF

Query:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK
        CNGQKI VSQTSQVERSLLVTGFGYEHDDPW+ N+DLFKEFTDVSRGVRRLGAAAVDM HVALGIVEAYWEYRLKPWDMAAGVL+VEEAGGAVTRMDG K
Subjt:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK

Query:  FCVFDRSVLVSNGVVHGKVM
        F VFDRSVLVSNG+VH K++
Subjt:  FCVFDRSVLVSNGVVHGKVM

XP_023520115.1 phosphatase IMPL1, chloroplastic [Cucurbita pepo subsp. pepo]1.6e-16892.81Show/hide
Query:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV
        RSISSL+P + VLFNARRDYNRGFPLI  L  KLAKTRAALSEISY K+YPKVGAKSIGPIPPAQLIQVVENAAKTGA+VVMDAVNKPRN+EYKGLTDLV
Subjt:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV

Query:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
        TETDK SEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
Subjt:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF

Query:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK
        CNGQKI VSQTSQVERSLLVTGFGYEHDDPWA N+DLFKEFTDVSRGVRRLGAAAVDM HVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDG K
Subjt:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK

Query:  FCVFDRSVLVSNGVVHGKVM
        F VFDRSVLVSNG+VH K++
Subjt:  FCVFDRSVLVSNGVVHGKVM

XP_038894424.1 phosphatase IMPL1, chloroplastic isoform X1 [Benincasa hispida]4.0e-16792.5Show/hide
Query:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV
        RSISSLSP L V FNARRD NR FP IRPL  KLAKTRAALSEISY K+YPKVGAKSIGPIPPAQL+QVVENAAKTGA+VVMDAVNKPRNV+YKGLTDLV
Subjt:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV

Query:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
        TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRG+PAAAAVVEFVGGPMCWNTR+FTATAGGGAF
Subjt:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF

Query:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK
        CNGQKI VSQTSQVERSLLVTGFGYEHDDPWA N+DLFKEFTDVSRGVRRLGAAAVDM HVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDG K
Subjt:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK

Query:  FCVFDRSVLVSNGVVHGKVM
        F VFDRSVLVSNGVVH K++
Subjt:  FCVFDRSVLVSNGVVHGKVM

TrEMBL top hitse value%identityAlignment
A0A1S3CL94 Inositol-1-monophosphatase5.3e-16591.56Show/hide
Query:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV
        RSISSLSPS  VLFNARRD NRGF  I PLD KL KT+AALSEISY K+YPKVGAKSIGPIPP+ LIQVVENAA TGA+VVMDAVNKPRNVEYKGLTDLV
Subjt:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV

Query:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
        TETDKMSEAAIL+VVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVL+RGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
Subjt:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF

Query:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK
        CNGQKI VSQTSQVERSLLVTGFGYEHDDPW  N+DLFKEFTDVSRGVRRLGAAAVDM HVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDG K
Subjt:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK

Query:  FCVFDRSVLVSNGVVHGKVM
        F VFDRSVLVSNGVVH K++
Subjt:  FCVFDRSVLVSNGVVHGKVM

A0A6J1CMR5 Inositol-1-monophosphatase1.1e-16590.94Show/hide
Query:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV
        RSISSLSPSLA+LFNA R+ N GF  IR LD KL K RAA S+ISYDK+YPKVGAKS+GPIPP+QLIQVVENAAKTGA+VVMDAVNKPRNVEYKGLTDLV
Subjt:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV

Query:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
        T+TDKMSEAAIL+VVRKNFKDHLILGEEGG+IGDSSSDYLWCIDPLDGTTNFAH YPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
Subjt:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF

Query:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK
        CNGQKI VSQTSQVERSLLVTGFGYEHDDPWA N+DLFKEFTDVSRGVRRLGAAAVDMCHV+LGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDG K
Subjt:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK

Query:  FCVFDRSVLVSNGVVHGKVM
        FCVFDRSVLVSNGVVH K++
Subjt:  FCVFDRSVLVSNGVVHGKVM

A0A6J1EKL1 Inositol-1-monophosphatase3.3e-16791.56Show/hide
Query:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV
        RSISSL+P +AVLFNARRDYN GFPLI  L  KLAKTRAALSEISY K+YPKVGAKS+GPIPPAQLIQVVE AAKTGA+VVMDAVNKPRN+EYKGLTDLV
Subjt:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV

Query:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
        TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRG+PAAAAVVEFVGGPMCWNTRIFTATAGGGAF
Subjt:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF

Query:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK
        CNGQKI VSQTSQVERSLLVTGFGYEHDDPW+ N+DLFKEFTDVSRGVRRLGAAAVDM HVALGIVEAYWEYRLKPWDMAAGVL+VEEAGGAVTRMDG K
Subjt:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK

Query:  FCVFDRSVLVSNGVVHGKVM
        F VFDRSVLVSNG+VH K++
Subjt:  FCVFDRSVLVSNGVVHGKVM

A0A6J1KJ24 Inositol-1-monophosphatase1.2e-16992.5Show/hide
Query:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV
        RSISSL+P + VLFNARRDYNRGFPLI  LD KLAKTRAALSEISY K+YPKVGAKS+GPIPPAQLIQVVENAAKTGA+VVMDAVNKPRN+EYKGLTDLV
Subjt:  RSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLV

Query:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
        TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF
Subjt:  TETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAF

Query:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK
        CNGQKI VSQTSQVERSLLVTGFGYEHDDPW+ N+DLFKEFTDVSRGVRRLGAAAVDM HVALGIVEAYWEYRLKPWDMAAGVL+VEEAGGAVTRMDG K
Subjt:  CNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRK

Query:  FCVFDRSVLVSNGVVHGKVM
        F VFDRSVLVSNG+VH K++
Subjt:  FCVFDRSVLVSNGVVHGKVM

A0A7N2RD80 Inositol-1-monophosphatase5.5e-14684.23Show/hide
Query:  GFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDH
        G P +R    +   T+AALSEI   K+YPK+GA+S GPIPP+QLIQVVE AAKTGA+VVM+AVNKPRN+ YKGLTDLVTETDKMSEAAILEVV+KNF DH
Subjt:  GFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDH

Query:  LILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTG
        LILGEEGG+IGD+SSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIF+ATAGGGAFCNGQKI VS+T+ VERSLLVTG
Subjt:  LILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTG

Query:  FGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVM
        FGYEHDDPWA N+DLFK FTDVSRGVRRLGAAAVDMCHVALGI EAYWEYRLKPWDMAAG LIVEEAGG VTRMDG KFCVFDRS+LVSNG +H K++
Subjt:  FGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVM

SwissProt top hitse value%identityAlignment
B4ED80 Putative Nus factor SuhB6.6e-4035.8Show/hide
Query:  LIQVVENAAKTGAQVVMDAVNKPRNVEY--KGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSV
        ++ +   AA+   Q++  A      +E   K   D VTE DK +E AI+E ++  + DH IL EE G   D+ S++ W IDPLDGTTNF HG+P + VS+
Subjt:  LIQVVENAAKTGAQVVMDAVNKPRNVEY--KGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSV

Query:  GVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVAL
         +  +G    A V +    P      +FTAT G GA+ N ++I+V +  ++  +L+ TGF +   D   A   LF E T    G+RR GAAA+D+ +VA 
Subjt:  GVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVAL

Query:  GIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVM
        G ++A++E  +  WDMAAG L++ EAGG V    G    +    ++ +N  ++ +++
Subjt:  GIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVM

O67791 Fructose-1,6-bisphosphatase/inositol-1-monophosphatase5.1e-4037.89Show/hide
Query:  IQVVENAAKTGAQVVMDAVNKPR--NVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVG
        ++V + AA  G QV+ +   K +  N+E KG  D V+  DK SE  I EV+ K F DH ++GEE G  G S S+Y W IDPLDGT N+ +G+P FAVSVG
Subjt:  IQVVENAAKTGAQVVMDAVNKPR--NVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVG

Query:  VLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALG
        ++    P   AV       + +  +++    G GA+ NG++IKV     ++ + +V GF        +  +++FK+       +RR GAAAVD+C VA G
Subjt:  VLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALG

Query:  IVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVM
        I +   E+ +KPWD+ AG++I++EAGG  T + G  F V D  ++  N  +H  ++
Subjt:  IVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVM

P74158 Inositol-1-monophosphatase7.3e-4738.65Show/hide
Query:  IQVVENAAKTGAQVVMDAVNKPRNVEYKGLT-DLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGV
        +++   A       +     K + ++ KG   DLVTE D+ +EA ILE++++   DH IL EE G +G   + + W IDPLDGTTNFAH YP   VS+G+
Subjt:  IQVVENAAKTGAQVVMDAVNKPRNVEYKGLT-DLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGV

Query:  LFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGI
        L +  P    V         +   +F A    GA  N + I+VS T+ +++SLLVTGF Y+       N   F   T +++GVRR G+AA+D+  VA G 
Subjt:  LFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGI

Query:  VEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVH
        ++ YWE  + PWDMAAG++IV EAGG V+  D     +    +L +NG +H
Subjt:  VEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVH

Q94F00 Phosphatase IMPL1, chloroplastic1.1e-13879.46Show/hide
Query:  KTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDS
        +T+A LSE+S   +YP++GAK+ G I PA L++VVE AAKTGA+VVM+AVNKPRN+ YKGL+DLVT+TDK SEAAILEVV+KNF DHLILGEEGG+IGDS
Subjt:  KTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDS

Query:  SSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANI
        SSDYLWCIDPLDGTTNFAHGYPSFAVSVGVL+RGNPAAA+VVEFVGGPMCWNTR F+ATAGGGA CNGQKI VS+T  VER+LL+TGFGYEHDD W+ N+
Subjt:  SSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANI

Query:  DLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVMLWFCARKWNLSS
        +LFKEFTDVSRGVRRLGAAAVDMCHVALGI E+YWEYRLKPWDMAAGVLIVEEAGGAVTRMDG KF VFDRSVLVSNGV+H K++        NL S
Subjt:  DLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVMLWFCARKWNLSS

Q9HXI4 Nus factor SuhB1.4e-4238.91Show/hide
Query:  LIQVVENAAKTGAQVVMDAVNK--PRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDS--SSDYLWCIDPLDGTTNFAHGYPSFAV
        ++ +   AA++  +++  ++ +    +V  K   D VTE D+ +E  I+  +RK +  H I+GEEGG I  S   +DYLW IDPLDGTTNF HG P FAV
Subjt:  LIQVVENAAKTGAQVVMDAVNK--PRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDS--SSDYLWCIDPLDGTTNFAHGYPSFAV

Query:  SVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHD--DPWAANIDLFKEFTDVSRGVRRLGAAAVDMC
        S+   ++G    A V++ V          FTA+ G GA  NG++++VS    +E +LL TGF +  +  D     +++F+     + G+RR GAA++D+ 
Subjt:  SVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHD--DPWAANIDLFKEFTDVSRGVRRLGAAAVDMC

Query:  HVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDG
        +VA G  +A+WE+ L  WDMAAG L+V+EAGG V+   G
Subjt:  HVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDG

Arabidopsis top hitse value%identityAlignment
AT1G31190.1 myo-inositol monophosphatase like 17.5e-14079.46Show/hide
Query:  KTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDS
        +T+A LSE+S   +YP++GAK+ G I PA L++VVE AAKTGA+VVM+AVNKPRN+ YKGL+DLVT+TDK SEAAILEVV+KNF DHLILGEEGG+IGDS
Subjt:  KTRAALSEISYDKKYPKVGAKSIGPIPPAQLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDS

Query:  SSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANI
        SSDYLWCIDPLDGTTNFAHGYPSFAVSVGVL+RGNPAAA+VVEFVGGPMCWNTR F+ATAGGGA CNGQKI VS+T  VER+LL+TGFGYEHDD W+ N+
Subjt:  SSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANI

Query:  DLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVMLWFCARKWNLSS
        +LFKEFTDVSRGVRRLGAAAVDMCHVALGI E+YWEYRLKPWDMAAGVLIVEEAGGAVTRMDG KF VFDRSVLVSNGV+H K++        NL S
Subjt:  DLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVMLWFCARKWNLSS

AT3G02870.1 Inositol monophosphatase family protein1.9e-3735.18Show/hide
Query:  QLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGV---IGDSSSDYLWCIDPLDGTTNFAHGYPSFAV
        Q +    +AAK   Q++     + ++VE+KG  DLVTETDK  E  +   +++ F +H  +GEE      + + + +  W +DPLDGTTNF HG+P   V
Subjt:  QLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGV---IGDSSSDYLWCIDPLDGTTNFAHGYPSFAV

Query:  SVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAAN-IDLFKEFTDVSRGVRRLGAAAVDMCH
        S+G+     P    VV  V  P+     +FT   G GAF NG++IKVS  S++  +LLVT  G + D     +  +         R +R  G+ A+D+C 
Subjt:  SVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAAN-IDLFKEFTDVSRGVRRLGAAAVDMCH

Query:  VALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSN
        VA G V+ ++E     PWD+AAG++IV+EAGG +    G+   +  + +  SN
Subjt:  VALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSN

AT3G02870.2 Inositol monophosphatase family protein3.9e-3534.39Show/hide
Query:  QLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGV---IGDSSSDYLWCIDPLDGTTNFAHGYPSFAV
        Q +    +AAK   Q++     + ++VE+KG  DLVTETDK  E  +   +++ F +H  +GEE      + + + +  W +DPLDGTTNF HG+P   V
Subjt:  QLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGV---IGDSSSDYLWCIDPLDGTTNFAHGYPSFAV

Query:  SVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAAN-IDLFKEFTDVSRGVRRLGAAAVDMCH
        S+G+     P    VV  V  P+     +FT   G GAF NG++IK    S++  +LLVT  G + D     +  +         R +R  G+ A+D+C 
Subjt:  SVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAAN-IDLFKEFTDVSRGVRRLGAAAVDMCH

Query:  VALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSN
        VA G V+ ++E     PWD+AAG++IV+EAGG +    G+   +  + +  SN
Subjt:  VALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSN

AT3G02870.3 Inositol monophosphatase family protein1.9e-3735.18Show/hide
Query:  QLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGV---IGDSSSDYLWCIDPLDGTTNFAHGYPSFAV
        Q +    +AAK   Q++     + ++VE+KG  DLVTETDK  E  +   +++ F +H  +GEE      + + + +  W +DPLDGTTNF HG+P   V
Subjt:  QLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGV---IGDSSSDYLWCIDPLDGTTNFAHGYPSFAV

Query:  SVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAAN-IDLFKEFTDVSRGVRRLGAAAVDMCH
        S+G+     P    VV  V  P+     +FT   G GAF NG++IKVS  S++  +LLVT  G + D     +  +         R +R  G+ A+D+C 
Subjt:  SVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAAN-IDLFKEFTDVSRGVRRLGAAAVDMCH

Query:  VALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSN
        VA G V+ ++E     PWD+AAG++IV+EAGG +    G+   +  + +  SN
Subjt:  VALGIVEAYWEYRL-KPWDMAAGVLIVEEAGGAVTRMDGRKFCVFDRSVLVSN

AT4G39120.1 myo-inositol monophosphatase like 28.6e-1925.89Show/hide
Query:  SISSLSPSLAV---LFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQL--IQVVENA-AKTGAQVVMDAVNKPRNVEYKG
        ++ S +PSL +     N+R  +     +  P    +++ R  L+  S  K+ P +  +S   +   +L     V NA A    +V+     K  ++  K 
Subjt:  SISSLSPSLAV---LFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPAQL--IQVVENA-AKTGAQVVMDAVNKPRNVEYKG

Query:  LTDLVTETDKMSEAAILEVVRKNFKDHLILGEE-GGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTAT
            VT  D+M+E A++ ++ +N   H I GEE G    + S+DY+W +DP+DGT +F  G P F   + +L++G P    ++  +  P+     I    
Subjt:  LTDLVTETDKMSEAAILEVVRKNFKDHLILGEE-GGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAAAVVEFVGGPMCWNTRIFTAT

Query:  AGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVR--RLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGA
         G     NG+ I      ++ ++ L T        P   + +  K ++ V   V+    G        +A G V+   E  LKP+D  A V ++E AGG 
Subjt:  AGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVR--RLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVLIVEEAGGA

Query:  VTRMDGRKF
        +T   G++F
Subjt:  VTRMDGRKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATTTTGAAAAAAAAACTGGAAATGGCGTACACGGTACACGATGTTTGCCTACGTGTACCAACCTCCCTCGCGTTCCCTCGACATTCTGCAGCAAGGTTGACCAC
GTGTAAATCGTGTACCGATACACGATTTAGATCGATTTCCTCTCTCAGTCCAAGCCTCGCTGTTCTTTTCAACGCTAGGAGAGATTACAATCGTGGGTTTCCGCTAATTC
GACCATTGGACAACAAATTGGCGAAAACCAGAGCTGCCTTGTCTGAAATTTCTTACGATAAGAAGTACCCAAAAGTCGGTGCCAAATCTATTGGACCAATTCCACCTGCC
CAGCTAATTCAAGTCGTTGAGAATGCTGCCAAGACTGGAGCTCAGGTGGTGATGGATGCTGTTAATAAGCCTCGAAATGTTGAATATAAAGGATTGACTGACTTAGTAAC
TGAAACAGATAAAATGAGTGAGGCTGCTATTCTGGAAGTCGTTAGAAAGAATTTTAAAGATCACCTCATCCTTGGAGAAGAGGGAGGAGTTATAGGAGATTCATCATCTG
ATTATCTTTGGTGCATTGATCCTTTAGATGGAACAACAAATTTTGCACACGGCTATCCTTCCTTTGCAGTATCTGTCGGAGTTCTGTTTCGGGGAAATCCTGCTGCTGCA
GCCGTGGTAGAGTTTGTTGGAGGTCCTATGTGTTGGAACACTCGTATATTTACTGCAACTGCTGGTGGGGGAGCATTCTGTAACGGCCAAAAGATTAAAGTGAGTCAAAC
TAGCCAGGTTGAACGATCTCTTCTAGTTACAGGATTTGGATATGAACATGACGATCCATGGGCTGCAAATATCGATTTATTTAAAGAGTTCACAGATGTCAGCAGGGGAG
TGAGAAGGCTCGGTGCAGCAGCAGTCGACATGTGCCATGTAGCTCTAGGAATTGTAGAAGCTTATTGGGAATATCGTCTAAAGCCATGGGATATGGCTGCAGGTGTTTTG
ATAGTTGAAGAAGCTGGTGGAGCAGTAACTCGCATGGATGGTCGAAAGTTCTGTGTGTTTGATAGATCTGTTTTGGTTTCTAATGGTGTTGTACATGGCAAGGTAATGTT
ATGGTTCTGTGCTCGAAAATGGAACCTTTCTTCTTCTGTATACGTGCAGATATTGGGTTGTATCATGATCACTTATAAAATACATAGATGTAGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
GACCTAATATATACAATGAGTAAACTAGAATCGGTGTACAGGGTACATATGAGATGTGTATAGAGTAATATATGGTTTTTAAAAATAATTTATTTTGTAGACAAATTTGA
TAAAAAAAATAATAAAAAATAAATGGAAGACACTTTTATGGGATTTTTTTTTAAAAAATAATGTTATTTAAATACTAAATGACAAAAATAGGGTAAGTTTAGAACTATTT
CCAAATCTGGTAGATGACTTGTGGACGGGGTTCACGATTTAGTCAAATTGCAGCTTATGCAGCGATTCATTAAAAAAATCGCGGGTGTAGGTCACGATTTTAAAGAAGTA
GTGTACATTGTACACATTTTATATGCCCAGTTGTCAGTCAGATATGCCACGTGTTATCCAGGTGGTGCCACAGGTGTGGATTCTCAGGAAAAACATTCGGGTTCTGAGCT
TCCGGACATGGCGTCTGCAACCAAAACCCACAGAGGATCGGGTTGTCGGAGAACGCCGTCGGGCCTTGATTCAACAGTGAACCCACCTGCGAAATCTTGTCGGTGAGATT
GTTGTGACAGACATCGAGATTCACAATCAACGGGAGGTTCGGGGCAGATGAGAGGGGATCAGTTAACTCCTCCATTTTTTACACCAGACCAGAGGGAGTTGTGGGAAAGG
TCTAGAACGACCAGATTCGTGGCGTTGTAGAGATGAGAAGAGATCGGGTTAGAGAAATTGTTGAAGGAGAGGCTGAGGCGACGGAGGGAATCGAGGAGGCCAAATTCGAA
AGGGATGTAGCAGGTGAGGTCTTTGTTGGGGAGAGAGAGTTAAGTGACTCGAGAACGTGAGTTGGGTCGGTTTCAATGGCGACCTTGAGGGCGAGAAGGGGGAGACCATC
GGAGTTGAGAGAGGGAGAGAAGGAAATGAAAAACGAGAAGAAGGTGAGGAAGGAAGCAAATGGGTAGAAGGAGAACTTTTTTAGGGTTTTAGATTGGGGTTTAAATGAGG
GGCTTGAATTCTGCCATTGTTGATGAGGCTGGAGTTGCAGGAGGTAGTTAGAACAGGAAGAGAGAATTTGAATTTAGAATTTGGAAGAGATATTACCGTTGTGAGATTGT
CGGAGAAGAGGAACGTGGGTGTATGAGTGAATGAATGAAGATTTTGAAAAAAAAACTGGAAATGGCGTACACGGTACACGATGTTTGCCTACGTGTACCAACCTCCCTCG
CGTTCCCTCGACATTCTGCAGCAAGGTTGACCACGTGTAAATCGTGTACCGATACACGATTTAGATCGATTTCCTCTCTCAGTCCAAGCCTCGCTGTTCTTTTCAACGCT
AGGAGAGATTACAATCGTGGGTTTCCGCTAATTCGACCATTGGACAACAAATTGGCGAAAACCAGAGCTGCCTTGTCTGAAATTTCTTACGATAAGAAGTACCCAAAAGT
CGGTGCCAAATCTATTGGACCAATTCCACCTGCCCAGCTAATTCAAGTCGTTGAGAATGCTGCCAAGACTGGAGCTCAGGTGGTGATGGATGCTGTTAATAAGCCTCGAA
ATGTTGAATATAAAGGATTGACTGACTTAGTAACTGAAACAGATAAAATGAGTGAGGCTGCTATTCTGGAAGTCGTTAGAAAGAATTTTAAAGATCACCTCATCCTTGGA
GAAGAGGGAGGAGTTATAGGAGATTCATCATCTGATTATCTTTGGTGCATTGATCCTTTAGATGGAACAACAAATTTTGCACACGGCTATCCTTCCTTTGCAGTATCTGT
CGGAGTTCTGTTTCGGGGAAATCCTGCTGCTGCAGCCGTGGTAGAGTTTGTTGGAGGTCCTATGTGTTGGAACACTCGTATATTTACTGCAACTGCTGGTGGGGGAGCAT
TCTGTAACGGCCAAAAGATTAAAGTGAGTCAAACTAGCCAGGTTGAACGATCTCTTCTAGTTACAGGATTTGGATATGAACATGACGATCCATGGGCTGCAAATATCGAT
TTATTTAAAGAGTTCACAGATGTCAGCAGGGGAGTGAGAAGGCTCGGTGCAGCAGCAGTCGACATGTGCCATGTAGCTCTAGGAATTGTAGAAGCTTATTGGGAATATCG
TCTAAAGCCATGGGATATGGCTGCAGGTGTTTTGATAGTTGAAGAAGCTGGTGGAGCAGTAACTCGCATGGATGGTCGAAAGTTCTGTGTGTTTGATAGATCTGTTTTGG
TTTCTAATGGTGTTGTACATGGCAAGGTAATGTTATGGTTCTGTGCTCGAAAATGGAACCTTTCTTCTTCTGTATACGTGCAGATATTGGGTTGTATCATGATCACTTAT
AAAATACATAGATGTAGGAAGTGAAAATTTGTCAGGTCCCCAATCAAGATCCAATATGTGAGAAGGGGCAAGAATGGGAGTTTACCTTCTACAAAGTTAGTTACGGTACG
TTAGGGGACCACAGGTGAATGGTATATACGTAGGGGAGGGACGTTAGTCTATCTTTATGCTTTTTGTTATGTTGCTTTAGTGAGCTCATAGAGGGAGGAGAGTTTACCAA
CCCTCTCAAATAGGTTGGGATCATTTTGTATTTCCTTAGGACATTTCTTCCTCTATTCATATCAATACAAGAGAATACATTACATATTGGTATGAGAGCGGTTCACACGG
GGGAAAATAGAGAAAATGGCTCAAAAGAGGTTAGAAGATGGAGATCAATTCGACAAACTTATCGAGGACATTCAATGAAACATTGAAAGCCTTAGTTTCGATTATGGCAG
AAATGGCATATTTGAGGACTCGAGGCTCATCTTTGATAGTTACCGATGGCTCTGGCCAAAAGCAAAAAGGAGTGGAAGAGTTCATAGAGACATCGGACTTGCGTGTGGAG
GAGAAAACACAGGAAACAAGGGAAAGAAACAAAAAGACTAATAGAGGTGATCAAATTAAATTTAAGAAGATGGAGATGCCCATCTTTGGAGGCGATGACCCAAACTCATG
TTATTTCGAGCTGAGGGGTATGCCCTGGTTGTTGTCGATTGTGAAAAAACTCATTGTTACTCATTGTTATTGTTGAGAATTCTGAAGGGGTAGCTCTCAACTGGGAGGAA
TGGAGGAGCAAGAGCCATTCAAAGACTGGAACGAAATTCTGGCTACTGGATCAATTTTGTGTGGCCCAAGAAGGTACGTTTTGCAGGGGATTCCTTGCCGTGAAACAAGA
ATCGATGGTGAAAGAATACTGGAATATGTCTGAAAGATGGATGCCACCGCTGTCCCATTTGCCCAACGAAGTGTTTGACATTCTTGAATGGGTTGGACCAACGATTAGGA
TAGAGGTTCAATGTTGGGAATTGAGTGGGCTGAATATGATTATGAAAAAGGCCCAACTACTAGAAGCGAAGGAACGTGCCCGATTGGAGGCTTCGAATAAGCCCAAGCAT
CCAACCCTAAGTCCAATATTGCGAAAGATGGAAATAAGTCACAGGATAATGCTGCCACGTGCACCATCACTATTCCAAGTGGGCAATTGCAATTAGTGACATGTCACGAG
GAGCCATCAAAATGTTTATCAGAATATGAGTTTATGGCTCAAAGAGAAAAGGATTACGCTTTAGGTGTGACGAGAAATTCACAAAGGGGCATTGTTGTAATCAAAAAGAG
TTTGAACATCATATTGGTTCATGATGAAGAAGATGGGGAAATCGTGTTGTTGGATGTCGATATGGCCATGGAACAAGTAGCAATGCAGGCTATTGAGGTAGGAGATTCTA
TAGATCTGTCACTAAATTTAGTAGTGGGTTTCATAACACTGGGGACAATCAAGATGGTAGGGAAAATAGAAAAACAGGAGGTGGTAGTATTAATTGATTGTGGTGCGACC
CACAACTTTATTGCACAATGGCTAGTTGAATCCCTATTGCCATTGATTGAAACAAAAAATTATGGCACGATAATGGGGATAGGGGTAGCGGTTAAAGGGAAAGAAGTGTG
CAAAACAGTAGTTCTCAACATTGCCGACTCGACGATCATAGAGAATTTCTTACTACTGGAATTGGGACGCGTCGACGTGGTCTTGGGCATGCAGTGGCTTTACACTTTAG
GGGTGACTGAAGTCGATTAGAAGGCACTAACCATGCGAGTTAGAATGGGGTCATCCCAGGTGACACTTAAGGGAGATTCAACACTAACGAAGGTTAGTGTAACACTGAAG
AAGATATGGGATATCCATGAACCCTTTGGTCTAATTTCGAGCTCTGACTGCGGATTTCAATGTTGGATAATGTTGATCGTATCCTAGAGACCACATTGCCTAGCTCAGTT
TTGTACGTATTCAAAGATTATGAAGATGTTTTTCGAGCATCCAAAGGACTACCTACCTCCTCAAAGAGACATTGATCACAGAATTCAATTGATCGAGGGTATCCAACCGG
TTAATGTCAAACCCTACAAGCATGCTAGAGTACAAAAGAGGTAAATTGAAAGGTTAGTGGATGAAATGCTAATAGCCGAAATGTTGGTTATTTTAGGGATGTTTTCCTTA
TTTACCTCAATAGAATATTCGTACTTAATGTAATATATTGCTGTAAATTTATTTCCTTATGTACATGGGTATTTCTTCTATTTAAGAAACCCTAATCCCACTTAATGAAT
ATAGAGAAAATATTATCTCTTCATATTTTGCACACGAAGTTTTAAGACCAAGTACGAGTCCGTTCTTGAGCCCTGTTTTACTTGTCAAAAAGAAGGATGGATGATGGAGA
TTTTGTGTTGACTACCGAGTACTCAACAATGTTACCAACAATTGAGGAAATATTTGATAAACTACATGGCTCCCGGTACTATTCGAAGATTGATCTCAAGTCTGGGTATC
ACCAGATCAGAGTTCACGCGAATGATGTGGCTAAAACAACGTTTCAAACACATGAAGGGGATTATGAGTTTTTAATTATGTCTTTTGGACTCATAAACGTGCCACCAACA
GTCTAGGCATTGATGAATAAAGTCTTTCGACCCTTTCTTCGATGGTTTGTGCTGGTATTTTTTAATGATATTTTGATTTGTAGCAATGATTATGAAACACTTAGATATGG
TGTTGAATATATTGCGAGAAAAGCATTTGTACGCTAACAAGGGAAAGTGTTAGTTCACAAGAGAAAACATTGAATATTTAAGGCATGGGGTCTCAGTTAAGGGAGTTCGA
GCAGATCTAGAGAAGGTTTGGTCAATGGTCGAATGACCACTGCATCAGAATATTAGAGAGTTACGAGGTTTCTTAGGCCTCTCGGGCTACTGTCAACGATTTGTAGCTAA
TTATGGTAGTATCGCACCCCATTGCATCATGTGATAAAAGGAGGGGGGTTTGTGTGTGTTGAACAATCTAGGAAGCATTCAAGCAACTGAAACATGCCATGGTAAACTTG
CCGGTTCTCGCGTTACATGACTTTAACCGACCATTTGTAGTTGAAATAGATGCTTTAGGGACGAGGTTTGGTGGGGTGGTATCTTAGAATAGACAACCAATCGCCTTCTA
TAGTCACACCTTATTAGCTCAAGTGCGCACGAAATCAATCTAGGAAAGAGAGTTAATGGCAGTGGGTTTTGCTGTGCAACGATGGCGTCCCTATCTCTTGGAGAGAAAGT
TCCTAGTCCGAATGGATCAGTGGGCCCTGAAATATTTACTAGAATAGTGAGTAACAACTGGAGTACCAAGGTTGGGTTTCCAAATTACTTGGATAGGAATTTGAAATAAA
TGACAACACGAGATTGGAAAACAAGGCTGCTGATGCCCTTTCTAGATTGCTTTGTCGTGGCTCATTTGGCCAATTTGACAGTTCCAACCGCGCTTGATGTTGATATGATC
AGAAGGGAGGCCGTTGTGAATCCCAAACTTAGCTAGATCATATGACAGCTGAGTGAGGACGAAGATGAGGCCTCAATTTTTTCTTATACTCGGGAAACTTTGACAATACA
AGGGGAGGTTAGTATTATCTAAAAACTGTACTCTAATTCCAACCATTTTACATAGGTATCATGTTTCGGTGCTAGGAGGCCATTCAGGTTTCTTAAGAACCTACAAAAGA
TTGATTGGGGAGCTTTATTGGGAGAAAATGCAAGCTGATACTAAGAAGTATGTTGAAGAATGTATGGTTAGTCAGAGGAACAAATCGCTCGCGGTTTCTCCAGCTGGGTT
GTTACAGCTTGTTATTGTGAGACCACAAATATGAGAGCATTCTTATTCATTCTATACAACAGAGACAAGTATATATAGACATACATGGAAGCCTAAACTAGTAAACGTAA
TATTATAATAAAGGACAAAGGATGAATAAACTCCTAAATCTACATATATACTATGCACTCCCCCTCAAGCTGGAGCATATATGTTAATCATGCCTATCTTGTTACAAAGG
TAATCTATTCGAACTCCATTTAAAATTTTCATAAAAATATCTCCTAATTGTTCTTCAGTCACATATCCAATAGACACCAAAACTTGTTGTATTTTCTCACGAACAAAATA
ACAATCAATTTCAATATGTTTAGTTCGCTCATGAAATGTAGGGTTGAATGCAATATGAAGGGCAGTTTGGTTGTCACACCCCAACTTTGGTTGGAACTGTAGTATCGAAG
CCTAATTCGATTAATAATTGATGTGCCCACATTATTTCACATACTGATTGTGTCATAGTTTTGTACTCCGATTCAACACTCGATTGAGAGGCTGCATTTTGTTTCTTACT
TTTCCATGAGATCAAATCGCCTCCAACAAAAACACAATAGCCAGAAGTCGATCTTCTATCTTCTCCTGATTCTACCCAATCGACGACGGAAAAATACTCATATGACCATG
ATCTTCTCCGCTTACCACCCACAAACAGACGACCAATCAGAGATTGTTAACAAATGTGTTGAAACGTATCTTAGATGCTTCTGCAGTGGAAGACTAAAGAAGTGGAGCCT
GTGGTTATGTTGGGCTGAGTATTGGTATAACACTACTTTCATAGTTCAATTGGGGTTTACCCTTTTCAAGCCGTTTGTGGGCACCTCCCACCTCCATTGATATCTTATGT
AGAGAAAAGAACTACCAATTCATCCTTGGATCAACAACTCAGTGATCGGGATTCAACCTTGAGAATTCTGAAGGAACATTTGATCAGATCACGACTCAAGAGAGAATGAA
GAAATTTGTAAATAAGCAGCAACGAGAAGTAGAGTATGAGGAGGGTGATTGGGTCTTTCTGAAAATTCGACCATATCAACAAGCCTCTTTGGCTAAGCGAAGAAATGAGA
AGTTCACACCTAAATTCTTTGGCCCGTACCAGATCAAGGAATGTATTGGAAAGGTAGCATATTGTTTGAAACTGTCGAATGAGGCGTCAATCCACCTGGTTTTTCATGTA
TCACAATTAAAGAAAGCATTAGGCGACAAACATTTAGTGCAACAAGGTCCTCTGTTACTCATTGATGAATTTGAATGGGCTACTGAGCTAAAGGATATTTTTGGCTACCG
AAAGAACATACAGACAGGAGCCATGGAACTATTGGTTAATGGAAGAATTTGCCTGAACATGAAGCAACTTGGGAACTACTTGATGACTTCCACTGCCAGTTTCCTAACTT
CCACCTTGAGGGTAAGATGTCTTTGGCCCCAAGAGGCGATGTTCGACCCCCAATCACTGACCAATATGCGAGAAGGGGCAAGAATGGGAATTTACCTTCTACAAGTTAGT
GACAAGGAAGGATGGTAGTATGTTTTTATGCTTTTTGTTATGTTGCTTTAGTGAACTCATAGAGAGAGGAGAGTTTACCAGCCCTCTCGAATAGACTGGGATCATTTTGT
ATTGCCTTGGGGCATTTCTTCCTTTGTTTATATCAATATAAGTGAGGGAATACCTTGCAAAATGATATATCCTTCTTTAGTACAAGGATTAGTCTACAAAATTTGAGGGA
TCTTTGTGGGCTTCTGAAGGCATAAAAAATGACCGTCATCTTGTTGATTGGAATGGCTTACCTTCTCCAATAACAAAAATGGTAGTGGGTTTTCCATTGAAAATCTTAAA
AGACAAGAACATAGTTCTTAATACCAAATAGTTTTGATCTGAAGGCATTATGTTCGCAGTATACTCCCACCGGGAAGCTTTGCCCATCCTAAGGTTAGCTCCATAACACA
AACTACTTAGAATTCGACATTACATTAGGTGTAGTCATCCTTGCTCCCTGACTCAAGCCATGGAATAGCTTTTCGTAGGCCATTCCTTAATTTTCTCTGAGAAACCTGGC
TATGGCTATAAAGAAAGCATAGATTCTTCAATGGCACATAATAAAATCTGAGATATAAGTCACTTTAATGCTTCTACTTGGGACTCCTCTCTCTAGGTGACATTTGTCTT
CTCTTGATGCCTTTCTAGGTGTAGGAGTAATCCATACATTTTTACTAGCATCTGAATATGTAAAACTAAGATGTTTTAAAATAACCTAGTATCAAGTTCTTTTGAATTGC
AGCTTTTGGAGAAAATTGGTTCTGCCACAGAAAAACTAAAGAGCAAAGGAATTGATTTCTCATTGTGGTATAAGCCAGAAAACTACCAAACAGATCTTTGAGCTGCGAGC
ACTCGATATCATTTTCTCCAGTCTCTGAGAAAGACGAAGTCGAAAGAAATTGCTGTAAGTTACCCCAGATGCAGTCATCACTCACTCGTTTCAAATAATAGTTTCAATGT
TTAGCTTGGATATGCAAGCTTACATTGTTTTCAGGTTGTGAAAGTTTGTTGAGACAGATACATACATCAAGGTCTTTTTACTTTAGTTAATGTGATGTTAATTCTTATTT
GATTTTGACATCCACTTGCTCTCTTTGCCAATATTTGCTACACTTCAAGGGAGACATGGTTGAGGGTTAGACTCATGTAGAGGGGAATATATAGAGTCAATTCAAGAAGA
GATTCCTTATTTACTGCATTTATCTTTTTTAGTAAAGGCTTACTTATAGAAGCCATCTAAGTGAAAATGTCGTATTTCTTGTAAATATATATATATCCAAATGCTTCAAA
AGAAA
Protein sequenceShow/hide protein sequence
MKILKKKLEMAYTVHDVCLRVPTSLAFPRHSAARLTTCKSCTDTRFRSISSLSPSLAVLFNARRDYNRGFPLIRPLDNKLAKTRAALSEISYDKKYPKVGAKSIGPIPPA
QLIQVVENAAKTGAQVVMDAVNKPRNVEYKGLTDLVTETDKMSEAAILEVVRKNFKDHLILGEEGGVIGDSSSDYLWCIDPLDGTTNFAHGYPSFAVSVGVLFRGNPAAA
AVVEFVGGPMCWNTRIFTATAGGGAFCNGQKIKVSQTSQVERSLLVTGFGYEHDDPWAANIDLFKEFTDVSRGVRRLGAAAVDMCHVALGIVEAYWEYRLKPWDMAAGVL
IVEEAGGAVTRMDGRKFCVFDRSVLVSNGVVHGKVMLWFCARKWNLSSSVYVQILGCIMITYKIHRCRK