CuGenDBv2

ClCG03G007740 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG03G007740
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: DUF676 domain-containing protein
Genome location: CG_Chr03:8853730..8868181
RNA-Seq Expression: ClCG03G007740
Synteny: ClCG03G007740
Gene Ontology terms:
GO:0044255 - cellular lipid metabolic process (biological process)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR006121 - Heavy metal-associated domain, HMA
IPR007751 - Domain of unknown function DUF676, lipase-like
IPR029058 - Alpha/Beta hydrolase fold
IPR036163 - Heavy metal-associated domain superfamily
IPR044294 - Lipase-like


Homology
GenBank top hits | e value | % identity | Alignment
KAA0057035.1 putative lipase isoform X3 [Cucumis melo var. makuwa] | 2.6e-194 | 88.83
Query:  LCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV
        L K+SNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV
Subjt:  LCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV

Query:  LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIV
        LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIV
Subjt:  LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIV

Query:  VGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP
        VGRTGSQLFLTDGKPDKPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYD +V + +   +      KPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP
Subjt:  VGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP

Query:  EAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        EAAQAKEAAQ SP+ +NT DYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  EAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

XP_008443198.1 PREDICTED: uncharacterized protein LOC103486851 isoform X1 [Cucumis melo] | 3.1e-192 | 87.91
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        DSNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVGR
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR

Query:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFI------SALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPH
        TGSQLFLTDGKPDKPPLLLRMASDC+EGKFI      SALGSFRSR+LYANVAYD +V + +   +      KPPRRSLDGYKHVVDVEYYPPVSSAGPH
Subjt:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFI------SALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPH

Query:  FPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        FPPEAAQAKEAAQ SP+ +NT DYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  FPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

XP_008443207.1 PREDICTED: uncharacterized protein LOC103486851 isoform X2 [Cucumis melo] | 3.3e-194 | 89.26
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        DSNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVGR
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR

Query:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA
        TGSQLFLTDGKPDKPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYD +V + +   +      KPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA
Subjt:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA

Query:  QAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        QAKEAAQ SP+ +NT DYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  QAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

XP_011657777.1 putative lipase C4A8.10 isoform X1 [Cucumis sativus] | 4.1e-192 | 88.78
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        D NHSWRLPG G QAMSTSTLGTFSSS+SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSN+FTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL-SSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNS+SL SSS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVG
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL-SSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG

Query:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA
        RTGSQLFLTDGKP KPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYD +V + +   +      KPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA
Subjt:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA

Query:  AQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        AQAKEAAQKSP+T+NT DYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  AQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

XP_023520181.1 putative lipase C4A8.10 [Cucurbita pepo subsp. pepo] | 1.3e-185 | 86.96
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        DSNHSWRLP LGPQAMST T GT SSSSSIGN KN+PDHLLVLVHGIMASPSDW YFEAELKRRLGRN+LIYASSSNTFTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR
        V +TESLKRISFLAHSLGGLFARYAIAVLYNNS+SLSSSIPNDP +SSKKG +AGLEPISFITLATPHLGVR         GVP LEKLA PIAPIVVGR
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR

Query:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA
        TGSQLFLTDGKPDKPPLLLRMAS  E+ KFISALG+FRSRVLYANVAYD +V + +   +      KPPRRSL GYKHVVDVEY PPVSSAGPHFPPEAA
Subjt:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA

Query:  QAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        QAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGW KVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSS APVASL
Subjt:  QAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LX84 DUF676 domain-containing protein | 2.0e-192 | 88.78
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        D NHSWRLPG G QAMSTSTLGTFSSS+SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSN+FTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL-SSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNS+SL SSS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVG
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL-SSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG

Query:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA
        RTGSQLFLTDGKP KPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYD +V + +   +      KPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA
Subjt:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA

Query:  AQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        AQAKEAAQKSP+T+NT DYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  AQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

A0A1S3B7J4 uncharacterized protein LOC103486851 isoform X2 | 1.6e-194 | 89.26
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        DSNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVGR
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR

Query:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA
        TGSQLFLTDGKPDKPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYD +V + +   +      KPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA
Subjt:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA

Query:  QAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        QAKEAAQ SP+ +NT DYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  QAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

A0A1S3B889 uncharacterized protein LOC103486851 isoform X1 | 1.5e-192 | 87.91
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        DSNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVGR
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR

Query:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFI------SALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPH
        TGSQLFLTDGKPDKPPLLLRMASDC+EGKFI      SALGSFRSR+LYANVAYD +V + +   +      KPPRRSLDGYKHVVDVEYYPPVSSAGPH
Subjt:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFI------SALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPH

Query:  FPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        FPPEAAQAKEAAQ SP+ +NT DYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  FPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

A0A5A7UU10 Putative lipase isoform X3 | 1.2e-194 | 88.83
Query:  LCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV
        L K+SNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV
Subjt:  LCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV

Query:  LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIV
        LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIV
Subjt:  LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIV

Query:  VGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP
        VGRTGSQLFLTDGKPDKPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYD +V + +   +      KPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP
Subjt:  VGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP

Query:  EAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        EAAQAKEAAQ SP+ +NT DYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  EAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

A0A6J1EPC9 uncharacterized protein LOC111434365 isoform X1 | 3.1e-185 | 85
Query:  SDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGK
        S   T    DSNH+WRLP LG QAMST T GT SSSSSIGNVKN+PDHLLVLVHGIMASPSDW YFEAELKRRLGRN+LIYASSSNTFTKTF+GIDGAGK
Subjt:  SDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGK

Query:  RLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAA
        RLADEVLQVV +TESLKRISFLAHSLGGLFARYAIAVLYNNS+SLSSSIPNDP +SSKKG +AGLEPISFITLATPHLGVR         GVP LEKLA 
Subjt:  RLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAA

Query:  PIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSA
        PIAPIVVGRTGSQLFLTDGKPDKPPLLLRMAS  E+ KFISALG+FRSRVLYANVAYD +V + +   +      KPPRRSL GYKHVVDVEY PPVSSA
Subjt:  PIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNS---WKPPRRSLDGYKHVVDVEYYPPVSSA

Query:  GPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        GPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGW KVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSS APVASL
Subjt:  GPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

SwissProt top hits | e value | % identity | Alignment
B3H6D0 Heavy metal-associated isoprenylated plant protein 45 | 6.9e-17 | 49.47
Query:  MTVTEMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHY--QHYLNSPQHHHQ
        +++ E+ V MDC+GCEK+VR+A+  L+GVD V ID+  QKVTV G+  ++++LK V+R GRTAE WP+PYN  Y+G  + Y  QH   S Q  +Q
Subjt:  MTVTEMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHY--QHYLNSPQHHHQ

F4IC29 Heavy metal-associated isoprenylated plant protein 28 | 9.6e-19 | 42.66
Query:  EMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITY
        EMRVHMDC GCE +V+ AL+ + GVD V ID+  QKVTV G+A QKK+LK VR+ GR AELW  PYNP + G       Y  +PQ  + P     P+ T 
Subjt:  EMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITY

Query:  NSLSSSSSSHKHKMSPMHEYGSSYNYSRGG---ADYGYYQEPP
                             SSYNY + G    DY  Y+  P
Subjt:  NSLSSSSSSHKHKMSPMHEYGSSYNYSRGG---ADYGYYQEPP

F4IQG4 Heavy metal-associated isoprenylated plant protein 30 | 2.5e-14 | 48.05
Query:  EMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHY
        +++V M C GCE+ V+ A+  L GVD V ++L  ++VTV+G+ ++KK+LKAVRR G+ AE WPYP  P+Y     HY
Subjt:  EMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHY

Q84K70 Heavy metal-associated isoprenylated plant protein 31 | 5.1e-12 | 42.16
Query:  MTVTEMRV-HMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGW-AKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQT
        MTV E+RV ++DC+GC  ++RK L  L+GV++V +++ TQKVT  G+  ++KK+LKAVRR G+ AELWPY     +    + Y  Y+ +  H++    +T
Subjt:  MTVTEMRV-HMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGW-AKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQT

Query:  KP
         P
Subjt:  KP

Q9LP41 Heavy metal-associated isoprenylated plant protein 29 | 1.1e-14 | 38.06
Query:  MRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRR-NGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITY
        M V MDC GCE +VRKALE + GV DV ID+  Q+VTV G A+QKK+LK  R    R   LW YPY+P+ +G+   Y                       
Subjt:  MRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRR-NGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITY

Query:  NSLSSSSSSHKHKMSPMHEYGSSYNYSR---GGADYGYYQEPPFT-TIDEEAGAM
                  +  MS   E  SSYNY +    G ++GYYQE P++  I+  A +M
Subjt:  NSLSSSSSSHKHKMSPMHEYGSSYNYSR---GGADYGYYQEPPFT-TIDEEAGAM
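
The % identity column can be cross-checked against the alignment midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The sketch below assumes that % identity is the number of identical positions divided by the number of aligned columns; the function name `percent_identity` is hypothetical, and the two strings are copied from the F4IQG4 hit above.

```python
# Sketch: recompute % identity from a BLAST-style midline.
# Assumption (not stated on the page): identity = identical columns / aligned columns * 100.

# Query and midline lines of the F4IQG4 alignment, copied from the record.
QUERY = "EMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHY"
MIDLINE = "+++V M C GCE+ V+ A+  L GVD V ++L  ++VTV+G+ ++KK+LKAVRR G+ AE WPYP  P+Y     HY"


def percent_identity(query: str, midline: str) -> float:
    """Return the percentage of aligned columns whose midline shows a letter."""
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same alignment columns")
    # Letters in the midline are identical residues; '+' and ' ' are not.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


if __name__ == "__main__":
    print(round(percent_identity(QUERY, MIDLINE), 2))
```

Under that assumption, the F4IQG4 alignment has 37 identical positions over 77 columns, which agrees with the 48.05 listed for this hit.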

Arabidopsis top hits | e value | % identity | Alignment
AT1G29120.1 Hydrolase-like protein family | 3.3e-147 | 68.17
Query:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG
        I++ D+G      SN SW   G   QAMS++    FS S    + KNEPDHLLVLVHGI+ASPSDW Y EAELKRRLGR +LIYASSSNTFTKTF GIDG
Subjt:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG

Query:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL
        AGKRLA+EV QVV K++SLK+ISFLAHSLGGLF+R+A+AVLY+ + +  S +    S +S   +G IAGLEPI+FITLATPHLGVR         GVP+L
Subjt:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL

Query:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNSW---KPPRRSLDGYKHVVDVEYYP
        EKLAAPIAP  VGRTGSQLFLTDGK DKPPLLLRMASD E+ KF+SALG+FRSR++YANV+YD +V + +   +      KP RRSLDGYKHVVDVEY P
Subjt:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNSW---KPPRRSLDGYKHVVDVEYYP

Query:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSF
        PVSS G HFPPEAA+AKEAAQ SP+  NT++YHEI+EEEMIRGLQ+LGWKKVDVSFHS+FWP+ AHNNIHVK+E LY AGAGV+AHVAD++KQQE S+F
Subjt:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSF

AT1G29120.2 Hydrolase-like protein family | 3.3e-147 | 68.17
Query:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG
        I++ D+G      SN SW   G   QAMS++    FS S    + KNEPDHLLVLVHGI+ASPSDW Y EAELKRRLGR +LIYASSSNTFTKTF GIDG
Subjt:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG

Query:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL
        AGKRLA+EV QVV K++SLK+ISFLAHSLGGLF+R+A+AVLY+ + +  S +    S +S   +G IAGLEPI+FITLATPHLGVR         GVP+L
Subjt:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL

Query:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNSW---KPPRRSLDGYKHVVDVEYYP
        EKLAAPIAP  VGRTGSQLFLTDGK DKPPLLLRMASD E+ KF+SALG+FRSR++YANV+YD +V + +   +      KP RRSLDGYKHVVDVEY P
Subjt:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNSW---KPPRRSLDGYKHVVDVEYYP

Query:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSF
        PVSS G HFPPEAA+AKEAAQ SP+  NT++YHEI+EEEMIRGLQ+LGWKKVDVSFHS+FWP+ AHNNIHVK+E LY AGAGV+AHVAD++KQQE S+F
Subjt:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSF

AT1G29120.3 Hydrolase-like protein family | 2.8e-114 | 67.59
Query:  SNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQVV
        SN SW   G   QAMS++    FS S    + KNEPDHLLVLVHGI+ASPSDW Y EAELKRRLGR +LIYASSSNTFTKTF GIDGAGKRLA+EV QVV
Subjt:  SNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQVV

Query:  HKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG
         K++SLK+ISFLAHSLGGLF+R+A+AVLY+ + +  S +    S +S   +G IAGLEPI+FITLATPHLGVR         GVP+LEKLAAPIAP  VG
Subjt:  HKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG

Query:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNSW---KPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA
        RTGSQLFLTDGK DKPPLLLRMASD E+ KF+SALG+FRSR++YANV+YD +V + +   +      KP RRSLDGYKHVVDVEY PPVSS G HFPPEA
Subjt:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNSW---KPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA

Query:  AQAKEAAQKSPTTHNTVDYHEIME
        A+AKEAAQ SP+  NT++YHEI+E
Subjt:  AQAKEAAQKSPTTHNTVDYHEIME

AT1G29120.4 Hydrolase-like protein family | 1.3e-114 | 65.88
Query:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG
        I++ D+G      SN SW   G   QAMS++    FS S    + KNEPDHLLVLVHGI+ASPSDW Y EAELKRRLGR +LIYASSSNTFTKTF GIDG
Subjt:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG

Query:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL
        AGKRLA+EV QVV K++SLK+ISFLAHSLGGLF+R+A+AVLY+ + +  S +    S +S   +G IAGLEPI+FITLATPHLGVR         GVP+L
Subjt:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL

Query:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNSW---KPPRRSLDGYKHVVDVEYYP
        EKLAAPIAP  VGRTGSQLFLTDGK DKPPLLLRMASD E+ KF+SALG+FRSR++YANV+YD +V + +   +      KP RRSLDGYKHVVDVEY P
Subjt:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-LVHYHSDIFQNSW---KPPRRSLDGYKHVVDVEYYP

Query:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIME
        PVSS G HFPPEAA+AKEAAQ SP+  NT++YHEI+E
Subjt:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIME

AT1G29120.5 Hydrolase-like protein family | 2.5e-147 | 68.25
Query:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG
        I++ D+G      SN SW   G   QAMS++    FS S    + KNEPDHLLVLVHGI+ASPSDW Y EAELKRRLGR +LIYASSSNTFTKTF GIDG
Subjt:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG

Query:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL
        AGKRLA+EV QVV K++SLK+ISFLAHSLGGLF+R+A+AVLY+ + +  S +    S +S   +G IAGLEPI+FITLATPHLGVR         GVP+L
Subjt:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL

Query:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDL--VHYHSDIFQNSW---KPPRRSLDGYKHVVDVEYY
        EKLAAPIAP  VGRTGSQLFLTDGK DKPPLLLRMASD E+ KF+SALG+FRSR++YANV+YDL  V + +   +      KP RRSLDGYKHVVDVEY 
Subjt:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDL--VHYHSDIFQNSW---KPPRRSLDGYKHVVDVEYY

Query:  PPVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSF
        PPVSS G HFPPEAA+AKEAAQ SP+  NT++YHEI+EEEMIRGLQ+LGWKKVDVSFHS+FWP+ AHNNIHVK+E LY AGAGV+AHVAD++KQQE S+F
Subjt:  PPVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSSFWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSF


Sequences
CDS sequence
ATGACGGTAACAGAGATGAGAGTTCATATGGACTGTCAGGGATGTGAAAAGCAAGTAAGAAAAGCTCTCGAAAATCTGGAAGGTGTGGATGATGTGATAATAGATTTGAG
CACACAGAAGGTGACTGTGATGGGATGGGCAAAGCAAAAGAAGATTCTGAAGGCGGTGCGGCGGAATGGGCGGACGGCGGAGCTGTGGCCATACCCTTACAACCCCCAAT
ACCATGGCTTCCTCCACCACTACCAGCATTACCTTAACTCTCCACAGCATCACCATCAGCCTCAGCCTCAGACTAAACCAATCATCACTTACAATTCACTGTCATCTTCT
TCTTCCTCGCACAAGCACAAGATGAGTCCAATGCATGAATATGGTAGTAGCTACAACTACAGCCGCGGCGGTGCTGACTATGGCTATTATCAAGAGCCACCATTTACCAC
TATTGATGAAGAAGCTGGTGCCATGAACGTCGTCGTTTTCTTCTTCCTCTTCCTCCTCCTCTTCCTCGTCTCCCTGTTCTTCCTCTTCTGTTATAGCTGGATATACAAAT
CTTCTAAATTTTATATACTTTATGAATTCGCGAACTCTTCGTTCTGGATTACCAATCTTTTGTATTTAATCGTGGATTCTGATTATGGAACTGTACTTTGCAAAGACTCG
AATCACAGCTGGAGATTACCTGGCCTTGGACCCCAAGCAATGAGTACTTCAACCCTTGGAACATTCTCATCATCAAGTAGCATTGGAAATGTGAAAAACGAGCCTGATCA
TCTTCTTGTCCTTGTTCATGGCATCATGGCTAGCCCAAGTGACTGGACTTACTTTGAAGCAGAGTTAAAAAGGCGTCTTGGAAGAAACTACTTGATATATGCAAGTTCGT
CAAATACTTTTACTAAAACTTTCACGGGAATTGATGGAGCAGGAAAACGATTAGCTGATGAGGTCTTGCAAGTGGTACATAAAACAGAGAGCTTAAAAAGGATATCTTTT
TTGGCTCATTCACTTGGTGGTTTGTTTGCGAGATATGCTATTGCTGTACTTTACAATAACTCAAATTCATTGTCTAGTAGCATCCCAAATGATCCTTCCAATTCTTCGAA
AAAAGGTGCGATTGCTGGGCTAGAGCCAATCAGTTTCATTACCTTGGCAACTCCTCATTTAGGAGTGAGAGGAGTTCCCCTCCTAGAGAAACTGGCTGCACCAATAGCCC
CTATTGTTGTGGGCCGAACGGGTAGTCAGCTATTCCTTACCGATGGAAAACCTGATAAACCACCACTTCTATTAAGAATGGCATCCGATTGTGAAGAAGGGAAATTCATA
TCTGCCCTTGGCTCTTTTCGGTCCCGTGTTCTTTATGCCAATGTAGCTTATGATCTTGTACATTATCATTCTGACATCTTCCAAAATTCTTGGAAGCCCCCTCGCCGATC
ATTGGATGGTTACAAGCATGTCGTAGATGTGGAATATTATCCTCCTGTTTCCTCTGCTGGTCCCCATTTTCCCCCTGAAGCAGCTCAAGCAAAGGAGGCTGCACAAAAAT
CACCAACCACACACAATACAGTGGATTATCATGAAATCATGGAAGAGGAGATGATTCGTGGCTTACAACAGTTGGGATGGAAAAAAGTTGATGTCAGCTTTCATTCCTCA
TTCTGGCCGTTCTTCGCACATAACAACATCCATGTGAAAAACGAATGGCTTTACAATGCTGGTGCTGGCGTGGTGGCTCATGTTGCAGACACCCTCAAGCAACAAGAACC
GTCTTCATTTGCCCCTGTGGCGAGCTTATAG
mRNA sequence
GGAAAAAGGAAGTCTTTTTTTACAAATGATTTTCTTATGCATTTGCAGATGTTGACCATGGGATTTGTTTCAAGTAACTGCCATGCATATTCTACCAATATTTCATAGAG
ACAGCGAGAGGGAAGGCTTACTTATAATTCATATAAATTGACTGTTGCTTCCCTTCATCTCCTTGTAGACTTGTTTTGTTCTCTCATTGACAGGGAAAGCTTCTTCGTGC
CCCTCTTCAATTGGTATACCGACATGACGGTAACAGAGATGAGAGTTCATATGGACTGTCAGGGATGTGAAAAGCAAGTAAGAAAAGCTCTCGAAAATCTGGAAGGTGTG
GATGATGTGATAATAGATTTGAGCACACAGAAGGTGACTGTGATGGGATGGGCAAAGCAAAAGAAGATTCTGAAGGCGGTGCGGCGGAATGGGCGGACGGCGGAGCTGTG
GCCATACCCTTACAACCCCCAATACCATGGCTTCCTCCACCACTACCAGCATTACCTTAACTCTCCACAGCATCACCATCAGCCTCAGCCTCAGACTAAACCAATCATCA
CTTACAATTCACTGTCATCTTCTTCTTCCTCGCACAAGCACAAGATGAGTCCAATGCATGAATATGGTAGTAGCTACAACTACAGCCGCGGCGGTGCTGACTATGGCTAT
TATCAAGAGCCACCATTTACCACTATTGATGAAGAAGCTGGTGCCATGAACGTCGTCGTTTTCTTCTTCCTCTTCCTCCTCCTCTTCCTCGTCTCCCTGTTCTTCCTCTT
CTGTTATAGCTGGATATACAAATCTTCTAAATTTTATATACTTTATGAATTCGCGAACTCTTCGTTCTGGATTACCAATCTTTTGTATTTAATCGTGGATTCTGATTATG
GAACTGTACTTTGCAAAGACTCGAATCACAGCTGGAGATTACCTGGCCTTGGACCCCAAGCAATGAGTACTTCAACCCTTGGAACATTCTCATCATCAAGTAGCATTGGA
AATGTGAAAAACGAGCCTGATCATCTTCTTGTCCTTGTTCATGGCATCATGGCTAGCCCAAGTGACTGGACTTACTTTGAAGCAGAGTTAAAAAGGCGTCTTGGAAGAAA
CTACTTGATATATGCAAGTTCGTCAAATACTTTTACTAAAACTTTCACGGGAATTGATGGAGCAGGAAAACGATTAGCTGATGAGGTCTTGCAAGTGGTACATAAAACAG
AGAGCTTAAAAAGGATATCTTTTTTGGCTCATTCACTTGGTGGTTTGTTTGCGAGATATGCTATTGCTGTACTTTACAATAACTCAAATTCATTGTCTAGTAGCATCCCA
AATGATCCTTCCAATTCTTCGAAAAAAGGTGCGATTGCTGGGCTAGAGCCAATCAGTTTCATTACCTTGGCAACTCCTCATTTAGGAGTGAGAGGAGTTCCCCTCCTAGA
GAAACTGGCTGCACCAATAGCCCCTATTGTTGTGGGCCGAACGGGTAGTCAGCTATTCCTTACCGATGGAAAACCTGATAAACCACCACTTCTATTAAGAATGGCATCCG
ATTGTGAAGAAGGGAAATTCATATCTGCCCTTGGCTCTTTTCGGTCCCGTGTTCTTTATGCCAATGTAGCTTATGATCTTGTACATTATCATTCTGACATCTTCCAAAAT
TCTTGGAAGCCCCCTCGCCGATCATTGGATGGTTACAAGCATGTCGTAGATGTGGAATATTATCCTCCTGTTTCCTCTGCTGGTCCCCATTTTCCCCCTGAAGCAGCTCA
AGCAAAGGAGGCTGCACAAAAATCACCAACCACACACAATACAGTGGATTATCATGAAATCATGGAAGAGGAGATGATTCGTGGCTTACAACAGTTGGGATGGAAAAAAG
TTGATGTCAGCTTTCATTCCTCATTCTGGCCGTTCTTCGCACATAACAACATCCATGTGAAAAACGAATGGCTTTACAATGCTGGTGCTGGCGTGGTGGCTCATGTTGCA
GACACCCTCAAGCAACAAGAACCGTCTTCATTTGCCCCTGTGGCGAGCTTATAGCAGTTAGCTATATCTTAGTGGCAGTAGAATTTAATCATGTTGAAGAATCAGCGGTT
GCAAAATGAGAGAATATCACAGATTGTTTTGTACTGTCATAGTTTAGTATATTGTAGCTTGGATTGGATGTAAGCTTTGCATAGCTCAGCCACATGTAAAAAGCTTTATT
AAGGAGAAATTAAGGATAATTCTACATTTGCTTCTCTACAATAATAATAATGATGCTGAATAAAAAGCAA
Protein sequence
MTVTEMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITYNSLSSS
SSSHKHKMSPMHEYGSSYNYSRGGADYGYYQEPPFTTIDEEAGAMNVVVFFFLFLLLFLVSLFFLFCYSWIYKSSKFYILYEFANSSFWITNLLYLIVDSDYGTVLCKDS
NHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQVVHKTESLKRISF
LAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVRGVPLLEKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFI
SALGSFRSRVLYANVAYDLVHYHSDIFQNSWKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEEEMIRGLQQLGWKKVDVSFHSS
FWPFFAHNNIHVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
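
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch, not the database's own procedure; the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used as sample input.

```python
# Sketch: translate a CDS with the standard genetic code (NCBI translation table 1).
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG x TCAG x TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 13 codons and last 4 codons of the CDS listed in this record.
    print(translate("ATGACGGTAACAGAGATGAGAGTTCATATGGACTGTCAG"))
    print(translate("GCGAGCTTATAG"))
```

The first fragment translates to MTVTEMRVHMDCQ and the last to ASL, which agree with the start and end of the protein sequence listed above.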