CuGenDBv2

Clc03G08230 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc03G08230
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: DUF676 domain-containing protein
Genome location: ClcChr03:8515060..8529209
RNA-Seq Expression: Clc03G08230
Synteny: Clc03G08230
Gene Ontology terms:
GO:0044255 - cellular lipid metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR006121 - Heavy metal-associated domain, HMA
IPR007751 - Domain of unknown function DUF676, lipase-like
IPR029058 - Alpha/Beta hydrolase fold
IPR036163 - Heavy metal-associated domain superfamily
IPR044294 - Lipase-like
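The genome location field above uses a `seqid:start..end` notation. As a minimal sketch, the following parses that string and reports the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `parse_location` and `Location` are hypothetical names introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'ClcChr03:8515060..8529209'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("ClcChr03:8515060..8529209")
print(loc.seqid, loc.start, loc.end, loc.length)  # ClcChr03 8515060 8529209 14150
```

Under the inclusive-coordinate assumption, the gene spans 14,150 bp on ClcChr03.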


Homology
GenBank top hits (e value, % identity, alignment)
KAA0057035.1 putative lipase isoform X3 [Cucumis melo var. makuwa] (e value: 2.1e-181; identity: 85.53%)
Query:  LCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV
        L K+SNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV
Subjt:  LCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV

Query:  LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIV
        LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIV
Subjt:  LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIV

Query:  VGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP
        VGRTGSQLFLTDGKPDKPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP
Subjt:  VGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP

Query:  EAAQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        EAAQAKEAAQ SP+ +NT DYHEIM E M   LQQ                        VKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  EAAQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

XP_008443198.1 PREDICTED: uncharacterized protein LOC103486851 isoform X1 [Cucumis melo] (e value: 2.6e-179; identity: 84.63%)
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        DSNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVGR
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR

Query:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFI------SALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPH
        TGSQLFLTDGKPDKPPLLLRMASDC+EGKFI      SALGSFRSR+LYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPH
Subjt:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFI------SALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPH

Query:  FPPEAAQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        FPPEAAQAKEAAQ SP+ +NT DYHEIM E M   LQQ                        VKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  FPPEAAQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

XP_008443207.1 PREDICTED: uncharacterized protein LOC103486851 isoform X2 [Cucumis melo] (e value: 2.8e-181; identity: 85.93%)
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        DSNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVGR
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR

Query:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA
        TGSQLFLTDGKPDKPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA
Subjt:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA

Query:  QAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        QAKEAAQ SP+ +NT DYHEIM E M   LQQ                        VKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  QAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

XP_011657777.1 putative lipase C4A8.10 isoform X1 [Cucumis sativus] (e value: 3.4e-179; identity: 85.46%)
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        D NHSWRLPG G QAMSTSTLGTFSSS+SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSN+FTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL-SSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNS+SL SSS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVG
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL-SSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG

Query:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA
        RTGSQLFLTDGKP KPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA
Subjt:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA

Query:  AQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        AQAKEAAQKSP+T+NT DYHEIM E M   LQQ                        VKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  AQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

XP_023520181.1 putative lipase C4A8.10 [Cucurbita pepo subsp. pepo] (e value: 1.3e-173; identity: 83.89%)
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        DSNHSWRLP LGPQAMST T GT SSSSSIGN KN+PDHLLVLVHGIMASPSDW YFEAELKRRLGRN+LIYASSSNTFTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR
        V +TESLKRISFLAHSLGGLFARYAIAVLYNNS+SLSSSIPNDP +SSKKG +AGLEPISFITLATPHLGVR         GVP LEKLA PIAPIVVGR
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR

Query:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA
        TGSQLFLTDGKPDKPPLLLRMAS  E+ KFISALG+FRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSL GYKHVVDVEY PPVSSAGPHFPPEAA
Subjt:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA

Query:  QAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        QAKEAAQKSPTTHNTVDYHEIM E M   LQQ                        VKNEWLYNAGAGVVAHVADTLKQQEPSS APVASL
Subjt:  QAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LX84 DUF676 domain-containing protein (e value: 1.7e-179; identity: 85.46%)
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        D NHSWRLPG G QAMSTSTLGTFSSS+SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSN+FTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL-SSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNS+SL SSS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVG
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL-SSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG

Query:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA
        RTGSQLFLTDGKP KPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA
Subjt:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA

Query:  AQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        AQAKEAAQKSP+T+NT DYHEIM E M   LQQ                        VKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  AQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

A0A1S3B7J4 uncharacterized protein LOC103486851 isoform X2 (e value: 1.4e-181; identity: 85.93%)
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        DSNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVGR
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR

Query:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA
        TGSQLFLTDGKPDKPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA
Subjt:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAA

Query:  QAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        QAKEAAQ SP+ +NT DYHEIM E M   LQQ                        VKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  QAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

A0A1S3B889 uncharacterized protein LOC103486851 isoform X1 (e value: 1.3e-179; identity: 84.63%)
Query:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
        DSNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV
Subjt:  DSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQV

Query:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR
        VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIVVGR
Subjt:  VHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVGR

Query:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFI------SALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPH
        TGSQLFLTDGKPDKPPLLLRMASDC+EGKFI      SALGSFRSR+LYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPH
Subjt:  TGSQLFLTDGKPDKPPLLLRMASDCEEGKFI------SALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPH

Query:  FPPEAAQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        FPPEAAQAKEAAQ SP+ +NT DYHEIM E M   LQQ                        VKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  FPPEAAQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

A0A5A7UU10 Putative lipase isoform X3 (e value: 1.0e-181; identity: 85.53%)
Query:  LCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV
        L K+SNHSWRLPG G QAMSTSTLGTFSSS SIGNV+N+PDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV
Subjt:  LCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEV

Query:  LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIV
        LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSL+SS+PNDP NSSKKG IAGLEPISFITLATPHLGVR         GVPLLEKLAAPIAPIV
Subjt:  LQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIV

Query:  VGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP
        VGRTGSQLFLTDGKPDKPPLLLRMASDC+EGKFISALGSFRSR+LYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP
Subjt:  VGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPP

Query:  EAAQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        EAAQAKEAAQ SP+ +NT DYHEIM E M   LQQ                        VKNEWLYNAGAGVVAHVADTLKQQEPSSFAP+ASL
Subjt:  EAAQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

A0A6J1EPC9 uncharacterized protein LOC111434365 isoform X1 (e value: 3.0e-173; identity: 82%)
Query:  SDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGK
        S   T    DSNH+WRLP LG QAMST T GT SSSSSIGNVKN+PDHLLVLVHGIMASPSDW YFEAELKRRLGRN+LIYASSSNTFTKTF+GIDGAGK
Subjt:  SDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGK

Query:  RLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAA
        RLADEVLQVV +TESLKRISFLAHSLGGLFARYAIAVLYNNS+SLSSSIPNDP +SSKKG +AGLEPISFITLATPHLGVR         GVP LEKLA 
Subjt:  RLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAA

Query:  PIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSA
        PIAPIVVGRTGSQLFLTDGKPDKPPLLLRMAS  E+ KFISALG+FRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSL GYKHVVDVEY PPVSSA
Subjt:  PIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSA

Query:  GPHFPPEAAQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL
        GPHFPPEAAQAKEAAQKSPTTHNTVDYHEIM E M   LQQ                        VKNEWLYNAGAGVVAHVADTLKQQEPSS APVASL
Subjt:  GPHFPPEAAQAKEAAQKSPTTHNTVDYHEIM-EGMKPQLQQ-----------------------EVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL

SwissProt top hits (e value, % identity, alignment)
B3H6D0 Heavy metal-associated isoprenylated plant protein 45 (e value: 1.5e-20; identity: 37.72%)
Query:  MTVTEMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKP
        +++ E+ V MDC+GCEK+VR+A+  L+GVD V ID+  QKVTV G+  ++++LK V+R GRTAE WP+PYN  Y+G      +Y    QH  Q   +   
Subjt:  MTVTEMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKP

Query:  IITYNSLSSSSSSHKHKMSPMHEYGSSYNYSRGGADYGYYQEPPFTTIDEEAGAMFSDENPHFCAVM
         I+Y        S K+    + ++ ++ N +  G  Y    +     IDE A  +FSD+N H C +M
Subjt:  IITYNSLSSSSSSHKHKMSPMHEYGSSYNYSRGGADYGYYQEPPFTTIDEEAGAMFSDENPHFCAVM

F4IC29 Heavy metal-associated isoprenylated plant protein 28 (e value: 5.0e-24; identity: 41.67%)
Query:  EMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITY
        EMRVHMDC GCE +V+ AL+ + GVD V ID+  QKVTV G+A QKK+LK VR+ GR AELW  PYNP + G       Y  +PQ  + P     P+ T 
Subjt:  EMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITY

Query:  NSLSSSSSSHKHKMSPMHEYGSSYNYSRGG---ADYGYYQEPPF--TTIDEEAGAMFSDENPHFCAVM
                             SSYNY + G    DY  Y+  P   +    + G+ FSDENP+ C++M
Subjt:  NSLSSSSSSHKHKMSPMHEYGSSYNYSRGG---ADYGYYQEPPF--TTIDEEAGAMFSDENPHFCAVM

F4IQG4 Heavy metal-associated isoprenylated plant protein 30 (e value: 1.6e-14; identity: 32.73%)
Query:  EMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITY
        +++V M C GCE+ V+ A+  L GVD V ++L  ++VTV+G+ ++KK+LKAVRR G+ AE WPYP  P+Y                              
Subjt:  EMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITY

Query:  NSLSSSSSSHKHKMSPMHEYGSSYNYSRGGADYGYYQEPPFTTI--DEEAGAMFSDENPHFCAVM
             +SS H  K     E+  SYNY R G +          T   D++    F+D+N H C++M
Subjt:  NSLSSSSSSHKHKMSPMHEYGSSYNYSRGGADYGYYQEPPFTTI--DEEAGAMFSDENPHFCAVM

Q84K70 Heavy metal-associated isoprenylated plant protein 31 (e value: 1.5e-15; identity: 34.91%)
Query:  MTVTEMRV-HMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGW-AKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQT
        MTV E+RV ++DC+GC  ++RK L  L+GV++V +++ TQKVT  G+  ++KK+LKAVRR G+ AELWPY     +    + Y  Y+ +  H++    +T
Subjt:  MTVTEMRV-HMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGW-AKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQT

Query:  KPIITYNSLSSSSSSHKHKMSPMHEYGSSYNYSRGGADYGYYQEPPFTTIDEEAGAMFSDENPHFCAVM
         P                        G  + +    ADY           DE A +MFSD+NPH C +M
Subjt:  KPIITYNSLSSSSSSHKHKMSPMHEYGSSYNYSRGGADYGYYQEPPFTTIDEEAGAMFSDENPHFCAVM

Q9LP41 Heavy metal-associated isoprenylated plant protein 29 (e value: 1.0e-21; identity: 40.72%)
Query:  MRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRR-NGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITY
        M V MDC GCE +VRKALE + GV DV ID+  Q+VTV G A+QKK+LK  R    R   LW YPY+P+ +G+   Y                       
Subjt:  MRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRR-NGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITY

Query:  NSLSSSSSSHKHKMSPMHEYGSSYNYSR---GGADYGYYQEPPFT-TIDEEAGAMFSDENPHFCAVM
                  +  MS   E  SSYNY +    G ++GYYQE P++  I+  A +MFS+ENPHFC++M
Subjt:  NSLSSSSSSHKHKMSPMHEYGSSYNYSR---GGADYGYYQEPPFT-TIDEEAGAMFSDENPHFCAVM

Arabidopsis top hits (e value, % identity, alignment)
AT1G29120.1 Hydrolase-like protein family (e value: 4.2e-135; identity: 65.66%)
Query:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG
        I++ D+G      SN SW   G   QAMS++    FS S    + KNEPDHLLVLVHGI+ASPSDW Y EAELKRRLGR +LIYASSSNTFTKTF GIDG
Subjt:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG

Query:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL
        AGKRLA+EV QVV K++SLK+ISFLAHSLGGLF+R+A+AVLY+ + +  S +    S +S   +G IAGLEPI+FITLATPHLGVR         GVP+L
Subjt:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL

Query:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYP
        EKLAAPIAP  VGRTGSQLFLTDGK DKPPLLLRMASD E+ KF+SALG+FRSR++YANV+YDHMVGWRTSSIRRE ELIKP RRSLDGYKHVVDVEY P
Subjt:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYP

Query:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIME----------GMK-----------PQLQQ---EVKNEWLYNAGAGVVAHVADTLKQQEPSSF
        PVSS G HFPPEAA+AKEAAQ SP+  NT++YHEI+E          G K           P L      VK+E LY AGAGV+AHVAD++KQQE S+F
Subjt:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIME----------GMK-----------PQLQQ---EVKNEWLYNAGAGVVAHVADTLKQQEPSSF

AT1G29120.2 Hydrolase-like protein family (e value: 4.2e-135; identity: 65.66%)
Query:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG
        I++ D+G      SN SW   G   QAMS++    FS S    + KNEPDHLLVLVHGI+ASPSDW Y EAELKRRLGR +LIYASSSNTFTKTF GIDG
Subjt:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG

Query:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL
        AGKRLA+EV QVV K++SLK+ISFLAHSLGGLF+R+A+AVLY+ + +  S +    S +S   +G IAGLEPI+FITLATPHLGVR         GVP+L
Subjt:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL

Query:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYP
        EKLAAPIAP  VGRTGSQLFLTDGK DKPPLLLRMASD E+ KF+SALG+FRSR++YANV+YDHMVGWRTSSIRRE ELIKP RRSLDGYKHVVDVEY P
Subjt:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYP

Query:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIME----------GMK-----------PQLQQ---EVKNEWLYNAGAGVVAHVADTLKQQEPSSF
        PVSS G HFPPEAA+AKEAAQ SP+  NT++YHEI+E          G K           P L      VK+E LY AGAGV+AHVAD++KQQE S+F
Subjt:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIME----------GMK-----------PQLQQ---EVKNEWLYNAGAGVVAHVADTLKQQEPSSF

AT1G29120.3 Hydrolase-like protein family (e value: 1.9e-127; identity: 71.87%)
Query:  SNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQVV
        SN SW   G   QAMS++    FS S    + KNEPDHLLVLVHGI+ASPSDW Y EAELKRRLGR +LIYASSSNTFTKTF GIDGAGKRLA+EV QVV
Subjt:  SNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQVV

Query:  HKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG
         K++SLK+ISFLAHSLGGLF+R+A+AVLY+ + +  S +    S +S   +G IAGLEPI+FITLATPHLGVR         GVP+LEKLAAPIAP  VG
Subjt:  HKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLLEKLAAPIAPIVVG

Query:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA
        RTGSQLFLTDGK DKPPLLLRMASD E+ KF+SALG+FRSR++YANV+YDHMVGWRTSSIRRE ELIKP RRSLDGYKHVVDVEY PPVSS G HFPPEA
Subjt:  RTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEA

Query:  AQAKEAAQKSPTTHNTVDYHEIMEGMK
        A+AKEAAQ SP+  NT++YHEI+EG++
Subjt:  AQAKEAAQKSPTTHNTVDYHEIMEGMK

AT1G29120.4 Hydrolase-like protein family (e value: 5.0e-128; identity: 70.29%)
Query:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG
        I++ D+G      SN SW   G   QAMS++    FS S    + KNEPDHLLVLVHGI+ASPSDW Y EAELKRRLGR +LIYASSSNTFTKTF GIDG
Subjt:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG

Query:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL
        AGKRLA+EV QVV K++SLK+ISFLAHSLGGLF+R+A+AVLY+ + +  S +    S +S   +G IAGLEPI+FITLATPHLGVR         GVP+L
Subjt:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL

Query:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYP
        EKLAAPIAP  VGRTGSQLFLTDGK DKPPLLLRMASD E+ KF+SALG+FRSR++YANV+YDHMVGWRTSSIRRE ELIKP RRSLDGYKHVVDVEY P
Subjt:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYP

Query:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEGMK
        PVSS G HFPPEAA+AKEAAQ SP+  NT++YHEI+EG K
Subjt:  PVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEGMK

AT1G29120.5 Hydrolase-like protein family (e value: 1.2e-132; identity: 65.25%)
Query:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG
        I++ D+G      SN SW   G   QAMS++    FS S    + KNEPDHLLVLVHGI+ASPSDW Y EAELKRRLGR +LIYASSSNTFTKTF GIDG
Subjt:  IVDSDYGTVLCKDSNHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDG

Query:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL
        AGKRLA+EV QVV K++SLK+ISFLAHSLGGLF+R+A+AVLY+ + +  S +    S +S   +G IAGLEPI+FITLATPHLGVR         GVP+L
Subjt:  AGKRLADEVLQVVHKTESLKRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSK--KGAIAGLEPISFITLATPHLGVR---------GVPLL

Query:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-HMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYY
        EKLAAPIAP  VGRTGSQLFLTDGK DKPPLLLRMASD E+ KF+SALG+FRSR++YANV+YD  MVGWRTSSIRRE ELIKP RRSLDGYKHVVDVEY 
Subjt:  EKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRMASDCEEGKFISALGSFRSRVLYANVAYD-HMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYY

Query:  PPVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIME----------GMK-----------PQLQQ---EVKNEWLYNAGAGVVAHVADTLKQQEPSSF
        PPVSS G HFPPEAA+AKEAAQ SP+  NT++YHEI+E          G K           P L      VK+E LY AGAGV+AHVAD++KQQE S+F
Subjt:  PPVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIME----------GMK-----------PQLQQ---EVKNEWLYNAGAGVVAHVADTLKQQEPSSF
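The unlabeled middle line of each alignment block above can be read column by column. As a rough sketch, the code below tallies one block, assuming the BLAST-style convention in which a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap (the page does not name the tool that produced the alignments). `midline_stats` is a hypothetical helper, and the input is the final block of the AT1G29120.3 hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Summarise one alignment block from its query line and match (middle) line."""
    # Trailing blanks in the match line are often stripped; pad back to query width.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("match line is longer than the query line")
    columns = len(query)
    identical = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    conservative = midline.count("+")                 # '+' = conservative substitution
    return {
        "columns": columns,
        "identical": identical,
        "conservative": conservative,
        "identity_pct": round(100 * identical / columns, 2),
    }


stats = midline_stats(
    "AQAKEAAQKSPTTHNTVDYHEIMEGMK",
    "A+AKEAAQ SP+  NT++YHEI+EG++",
)
print(stats)
```

The percentage computed this way covers a single block only, so it is not expected to match the % identity reported for the whole hit.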


Sequences
CDS sequence
ATGACGGTAACAGAGATGAGAGTTCATATGGACTGTCAGGGATGTGAAAAGCAAGTAAGAAAAGCTCTCGAAAATCTGGAAGGTGTGGATGATGTGATAATAGAT
TTGAGCACACAGAAGGTGACTGTGATGGGATGGGCAAAGCAAAAGAAGATTCTGAAGGCGGTGCGGCGGAATGGGCGGACGGCGGAGCTGTGGCCATACCCTTAC
AACCCCCAATACCATGGCTTCCTCCACCACTACCAGCATTACCTTAACTCTCCACAGCATCACCATCAGCCTCAGCCTCAGACTAAACCAATCATCACTTACAAT
TCACTGTCATCTTCTTCTTCCTCGCACAAGCACAAGATGAGTCCAATGCATGAATATGGTAGTAGCTACAACTACAGCCGCGGCGGTGCTGACTATGGCTATTAT
CAAGAGCCACCATTTACCACTATTGATGAAGAAGCTGGTGCCATGTTCAGCGATGAGAACCCACATTTTTGCGCTGTCATGGCATGGGCTTTTCCCTTGATCATC
GAAATCCAAGAGGATAACGAAAGTTATGGCTTCTTGTTGACTCCACTTGGACAACAACAATCTCACTCCTCTCTTTCTTTTGCCTTGCTTCCTCTGCTAATTCTC
TCTTGTAATATATATATATATATATATATATATGGCATTGGCGCCTTTCATTCACGCGCGCTGTTGTTACAACCCCATTTTCGGTGCTCAAAGAACCGGTTCTTC
TCATGGACCTGTAGGAACGTCGTCGTTTTCTTCTTCCTCTTCCTCCTCCTCTTCCTCGTCTCCCTGTTCTTCCTCTTCTGTTATAGCTGGATATACAAATCTTCT
AAATTTTATATACTTTATGAATTCGCGAACTCTTCGTTCTGGATTACCAATCTTTTGTATTTAATCGTGGATTCTGATTATGGAACTGTACTTTGCAAAGACTCG
AATCACAGCTGGAGATTACCTGGCCTTGGACCCCAAGCAATGAGTACTTCAACCCTTGGAACATTCTCATCATCAAGTAGCATTGGAAATGTGAAAAACGAGCCT
GATCATCTTCTTGTCCTTGTTCATGGCATCATGGCTAGCCCAAGTGACTGGACTTACTTTGAAGCAGAGTTAAAAAGGCGTCTTGGAAGAAACTACTTGATATAT
GCAAGTTCGTCAAATACTTTTACTAAAACTTTCACGGGAATTGATGGAGCAGGAAAACGATTAGCTGATGAGGTCTTGCAAGTGGTACATAAAACAGAGAGCTTA
AAAAGGATATCTTTTTTGGCTCATTCACTTGGTGGTTTGTTTGCGAGATATGCTATTGCTGTACTTTACAATAACTCAAATTCATTGTCTAGTAGCATCCCAAAT
GATCCTTCCAATTCTTCGAAAAAAGGTGCGATTGCTGGGCTAGAGCCAATCAGTTTCATTACCTTGGCAACTCCTCATTTAGGAGTGAGAGGAGTTCCCCTCCTA
GAGAAACTGGCTGCACCAATAGCCCCTATTGTTGTGGGCCGAACGGGTAGTCAGCTATTCCTTACCGATGGAAAACCTGATAAACCACCACTTCTATTAAGAATG
GCATCCGATTGTGAAGAAGGGAAATTCATATCTGCCCTTGGCTCTTTTCGGTCCCGTGTTCTTTATGCCAATGTAGCTTATGATCATATGGTTGGTTGGCGCACT
TCGTCTATAAGGAGGGAAAATGAACTTATCAAGCCCCCTCGCCGATCATTGGATGGTTACAAGCATGTCGTAGATGTGGAATATTATCCTCCTGTTTCCTCTGCT
GGTCCCCATTTTCCCCCTGAAGCAGCTCAAGCAAAGGAGGCTGCACAAAAATCACCAACCACACACAATACAGTGGATTATCATGAAATCATGGAAGGTATGAAG
CCCCAACTCCAGCAGGAGGTGAAAAACGAATGGCTTTACAATGCTGGTGCTGGCGTGGTGGCTCATGTTGCAGACACCCTCAAGCAACAAGAACCGTCTTCATTT
GCCCCTGTGGCGAGCTTATAG
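One way to check the CDS above against the protein sequence listed in this record is to translate it codon by codon. The sketch below assumes the standard genetic code and is demonstrated on the first 27 and last 21 nucleotides of the CDS only; `translate` is a hypothetical helper, not part of any database tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame coding sequence; optionally stop at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks and lower case
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGACGGTAACAGAGATGAGAGTTCAT"))  # MTVTEMRVH
print(translate("GCCCCTGTGGCGAGCTTATAG"))        # APVASL
```

The two outputs agree with the start (`MTVTEMRVH...`) and end (`...APVASL`) of the protein sequence given in this record; passing the full CDS, with its line breaks, works the same way.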
mRNA sequence
TTTCATAGAGACAGCGAGAGGGAAGGCTTACTTATAATTCATATAAATTGACTGTTGCTTCCCTTCATCTCCTTGTAGACTTGTTTTGTTCTCTCATTGACAGGG
AAAGCTTCTTCGTGCCCCTCTTCAATTGGTATACCGACATGACGGTAACAGAGATGAGAGTTCATATGGACTGTCAGGGATGTGAAAAGCAAGTAAGAAAAGCTC
TCGAAAATCTGGAAGGTGTGGATGATGTGATAATAGATTTGAGCACACAGAAGGTGACTGTGATGGGATGGGCAAAGCAAAAGAAGATTCTGAAGGCGGTGCGGC
GGAATGGGCGGACGGCGGAGCTGTGGCCATACCCTTACAACCCCCAATACCATGGCTTCCTCCACCACTACCAGCATTACCTTAACTCTCCACAGCATCACCATC
AGCCTCAGCCTCAGACTAAACCAATCATCACTTACAATTCACTGTCATCTTCTTCTTCCTCGCACAAGCACAAGATGAGTCCAATGCATGAATATGGTAGTAGCT
ACAACTACAGCCGCGGCGGTGCTGACTATGGCTATTATCAAGAGCCACCATTTACCACTATTGATGAAGAAGCTGGTGCCATGTTCAGCGATGAGAACCCACATT
TTTGCGCTGTCATGGCATGGGCTTTTCCCTTGATCATCGAAATCCAAGAGGATAACGAAAGTTATGGCTTCTTGTTGACTCCACTTGGACAACAACAATCTCACT
CCTCTCTTTCTTTTGCCTTGCTTCCTCTGCTAATTCTCTCTTGTAATATATATATATATATATATATATATGGCATTGGCGCCTTTCATTCACGCGCGCTGTTGT
TACAACCCCATTTTCGGTGCTCAAAGAACCGGTTCTTCTCATGGACCTGTAGGAACGTCGTCGTTTTCTTCTTCCTCTTCCTCCTCCTCTTCCTCGTCTCCCTGT
TCTTCCTCTTCTGTTATAGCTGGATATACAAATCTTCTAAATTTTATATACTTTATGAATTCGCGAACTCTTCGTTCTGGATTACCAATCTTTTGTATTTAATCG
TGGATTCTGATTATGGAACTGTACTTTGCAAAGACTCGAATCACAGCTGGAGATTACCTGGCCTTGGACCCCAAGCAATGAGTACTTCAACCCTTGGAACATTCT
CATCATCAAGTAGCATTGGAAATGTGAAAAACGAGCCTGATCATCTTCTTGTCCTTGTTCATGGCATCATGGCTAGCCCAAGTGACTGGACTTACTTTGAAGCAG
AGTTAAAAAGGCGTCTTGGAAGAAACTACTTGATATATGCAAGTTCGTCAAATACTTTTACTAAAACTTTCACGGGAATTGATGGAGCAGGAAAACGATTAGCTG
ATGAGGTCTTGCAAGTGGTACATAAAACAGAGAGCTTAAAAAGGATATCTTTTTTGGCTCATTCACTTGGTGGTTTGTTTGCGAGATATGCTATTGCTGTACTTT
ACAATAACTCAAATTCATTGTCTAGTAGCATCCCAAATGATCCTTCCAATTCTTCGAAAAAAGGTGCGATTGCTGGGCTAGAGCCAATCAGTTTCATTACCTTGG
CAACTCCTCATTTAGGAGTGAGAGGAGTTCCCCTCCTAGAGAAACTGGCTGCACCAATAGCCCCTATTGTTGTGGGCCGAACGGGTAGTCAGCTATTCCTTACCG
ATGGAAAACCTGATAAACCACCACTTCTATTAAGAATGGCATCCGATTGTGAAGAAGGGAAATTCATATCTGCCCTTGGCTCTTTTCGGTCCCGTGTTCTTTATG
CCAATGTAGCTTATGATCATATGGTTGGTTGGCGCACTTCGTCTATAAGGAGGGAAAATGAACTTATCAAGCCCCCTCGCCGATCATTGGATGGTTACAAGCATG
TCGTAGATGTGGAATATTATCCTCCTGTTTCCTCTGCTGGTCCCCATTTTCCCCCTGAAGCAGCTCAAGCAAAGGAGGCTGCACAAAAATCACCAACCACACACA
ATACAGTGGATTATCATGAAATCATGGAAGGTATGAAGCCCCAACTCCAGCAGGAGGTGAAAAACGAATGGCTTTACAATGCTGGTGCTGGCGTGGTGGCTCATG
TTGCAGACACCCTCAAGCAACAAGAACCGTCTTCATTTGCCCCTGTGGCGAGCTTATAGCAGTTAGCTATATCTTAGTGGCAGTAGAATTTAATCATGTTGAAGA
ATCAGCGGTTGCAAAATGAGAGAATATCACAGATTGTTTTGTACTGTCATAGTTTAGTATATTGTAGCTTGGATTGGATGTAAGCTTTGCATAGCTCAGCCACAT
GTAAAAAGCTTTATTAAGGAGAAATTAAGGATAATTCTACATTTGCTTCTCTACAATAATAATAATGATGCTGAATAAAAAGCAACATTGATCAACTAGATGATT
AACTATTAACTAGGATTTGTTATGGCCATTTTTCTTGATGATTGGTCTATTTGTTCTAGCAACGAAGAATTTATTCTTCTTCCA
Protein sequence
MTVTEMRVHMDCQGCEKQVRKALENLEGVDDVIIDLSTQKVTVMGWAKQKKILKAVRRNGRTAELWPYPYNPQYHGFLHHYQHYLNSPQHHHQPQPQTKPIITYN
SLSSSSSSHKHKMSPMHEYGSSYNYSRGGADYGYYQEPPFTTIDEEAGAMFSDENPHFCAVMAWAFPLIIEIQEDNESYGFLLTPLGQQQSHSSLSFALLPLLIL
SCNIYIYIYIYGIGAFHSRALLLQPHFRCSKNRFFSWTCRNVVVFFFLFLLLFLVSLFFLFCYSWIYKSSKFYILYEFANSSFWITNLLYLIVDSDYGTVLCKDS
NHSWRLPGLGPQAMSTSTLGTFSSSSSIGNVKNEPDHLLVLVHGIMASPSDWTYFEAELKRRLGRNYLIYASSSNTFTKTFTGIDGAGKRLADEVLQVVHKTESL
KRISFLAHSLGGLFARYAIAVLYNNSNSLSSSIPNDPSNSSKKGAIAGLEPISFITLATPHLGVRGVPLLEKLAAPIAPIVVGRTGSQLFLTDGKPDKPPLLLRM
ASDCEEGKFISALGSFRSRVLYANVAYDHMVGWRTSSIRRENELIKPPRRSLDGYKHVVDVEYYPPVSSAGPHFPPEAAQAKEAAQKSPTTHNTVDYHEIMEGMK
PQLQQEVKNEWLYNAGAGVVAHVADTLKQQEPSSFAPVASL