CuGenDBv2

Lsi09G006470 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi09G006470
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Heavy metal transport/detoxification superfamily protein
Genome location: chr09:7214938..7222929
RNA-Seq Expression: Lsi09G006470
Synteny: Lsi09G006470
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains:
  IPR006121 - Heavy metal-associated domain, HMA
  IPR021763 - Protein of unknown function DUF3326
  IPR036163 - Heavy metal-associated domain superfamily
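The genome location in this record has the form chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span of the gene model can be derived as follows. The helper name parse_location is illustrative only and is not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr09:7214938..7222929")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 7992 bp on chr09.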


Homology
GenBank top hits (e value, % identity, alignment)
XP_022939862.1 uncharacterized protein LOC111445603 isoform X3 [Cucurbita moschata] (e value: 1.8e-196; % identity: 93.62)
Query:  NAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEEL
        +A CKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPM NVLYVEGYALDRFAEGSWAL+PVHQNRVGLVLDAGMEEEL
Subjt:  NAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEEL

Query:  RIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAI
        RIRHLQVADAARASLGLPVMEYVVT+TPLMVEKWIDPKTGQSTGRIR PASL RAVQ LM RSKVNAVAVVGRFPDDDVEETD+YRQGMGVDTL+GVEAI
Subjt:  RIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAI

Query:  ISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPL
        ISHLVVKEFQIPCAHAPALSPTP+C SLSPKSAAEELG+TFLPCVLSGLS APQYLS +SESLGKDCILANDVDSVIVPI+ACGGDG LAFARSKQYKPL
Subjt:  ISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPL

Query:  IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN
        IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNI+ IS TSPNGHAVSSA Q FN
Subjt:  IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN

XP_022993478.1 uncharacterized protein LOC111489474 isoform X4 [Cucurbita maxima] (e value: 2.7e-197; % identity: 93.62)
Query:  NAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEEL
        +A CKRQYTSVMIVPTGVGAAIGGYAGDALPVARALA VVDCL+THPNVLNAAMLYWPM NVLYVEGYALDRFAEGSWAL+PVHQNRVGLVLDAGMEEEL
Subjt:  NAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEEL

Query:  RIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAI
        RIRHLQVADAARASLGLPVMEYVVT+TPLMVEKWIDPKTGQSTGRIR PASL RAVQ LM RSKVNAVAVVGRFPDDDVEETD+YRQGMGVDTLAGVEAI
Subjt:  RIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAI

Query:  ISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPL
        ISHLVVKEFQIPCAHAPALSPTP+C SLSPKSAAEELG+TFLPCVLSGLS APQY+SK+SESLGKDCILANDVDSVIVPI+ACGGDG LAFARSKQYKPL
Subjt:  ISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPL

Query:  IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN
        IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNI  IS TSPNGHAVSSA QRFN
Subjt:  IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN

XP_023551677.1 uncharacterized protein LOC111809445 [Cucurbita pepo subsp. pepo] (e value: 1.0e-196; % identity: 93.62)
Query:  NAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEEL
        +A CKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPM NVLYVEGYALDRFAEGSWAL+PVHQNRVGLVLDAGMEEEL
Subjt:  NAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEEL

Query:  RIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAI
        RIRHLQVADAARASLGLPVMEYVVT+TPLMVEKWIDPKTGQSTGRIR PASL RAVQ LM RSKVNAVAVVGRFPDDDVEETD+YRQGMGVDTLAGVEAI
Subjt:  RIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAI

Query:  ISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPL
        ISHLVVKEFQIPCAHAPALSPTP+C S+SPKSAAEELG+TFLPCVLSGLS APQYLS +SESLGKDCILANDVDSVIVPI+ACGGDG LAFARSKQ+KPL
Subjt:  ISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPL

Query:  IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN
        IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNI+ IS TSPNGHAVSSA QRFN
Subjt:  IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN

XP_038875667.1 uncharacterized lipoprotein syc1174_c-like isoform X5 [Benincasa hispida] (e value: 5.0e-199; % identity: 94.47)
Query:  AQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNR-----VGLVLDAGM
        A CKRQYTSVMIVPTGVGAAIGGYAGDALP+ARALASVVDCLITHPNVLNAAMLYWPM NVLYVEGYALDRFAEGSWAL+PVHQNR     VGLVLDAGM
Subjt:  AQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNR-----VGLVLDAGM

Query:  EEELRIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAG
        EEELRIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDP TGQSTGRIR PASL RAVQTLMN SKV+AVAVVGRFPDDDVEETD+YRQGMGVDTLAG
Subjt:  EEELRIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAG

Query:  VEAIISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQ
        VEAIISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKN +SLGKDCILANDVDSVIVPINACGGDGTLAFARSK+
Subjt:  VEAIISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQ

Query:  YKPLIIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN
        YKPLIIAVEENETVLSDSP SLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSA Q+FN
Subjt:  YKPLIIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN

XP_038875669.1 uncharacterized lipoprotein syc1174_c-like isoform X6 [Benincasa hispida] (e value: 7.0e-201; % identity: 95.73)
Query:  AQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELR
        A CKRQYTSVMIVPTGVGAAIGGYAGDALP+ARALASVVDCLITHPNVLNAAMLYWPM NVLYVEGYALDRFAEGSWAL+PVHQNRVGLVLDAGMEEELR
Subjt:  AQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELR

Query:  IRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAII
        IRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDP TGQSTGRIR PASL RAVQTLMN SKV+AVAVVGRFPDDDVEETD+YRQGMGVDTLAGVEAII
Subjt:  IRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAII

Query:  SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPLI
        SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKN +SLGKDCILANDVDSVIVPINACGGDGTLAFARSK+YKPLI
Subjt:  SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPLI

Query:  IAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN
        IAVEENETVLSDSP SLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSA Q+FN
Subjt:  IAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KPI8 Uncharacterized protein (e value: 1.9e-196; % identity: 94.13)
Query:  AQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELR
        A CKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPM NVLYVEGYALDRFAEGSWAL+PVHQNRVGLVLDAGME+EL+
Subjt:  AQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELR

Query:  IRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAII
        IRHLQVADAARASLGLPVMEYVVTDTPL+VEKWID  TGQSTGRIR PASL RAVQTLMNRSKVNAVAVVGRFPDDDVEE DSYRQGMGVDTLAGVEAII
Subjt:  IRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAII

Query:  SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPLI
        SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNS+SLGKDC+LANDVDSVIVPINACGGDGTLAFARSKQYKPLI
Subjt:  SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPLI

Query:  IAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN
        IAVEEN+TVLSDSPESLGIEAVKV+NYWEAIGVVAAHKAGIDPYSLRRNRI NINCISSTS NG AVSSA + F+
Subjt:  IAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN

A0A1S3CT16 uncharacterized lipoprotein syc1174_c-like (e value: 1.1e-196; % identity: 93.6)
Query:  AQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELR
        A CKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPM NVLYVEGYALDRFAEGSWAL+PVHQNRVGLVLDAGME+ELR
Subjt:  AQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELR

Query:  IRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAII
        IRHLQVADAARASLGLPVMEYVVTDTPL+VEKWID  TGQSTGRIR PASL RAVQTL+NRSKVNAVAVVGRFPDDDVEE D+YRQGMGVDTLAGVEAII
Subjt:  IRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAII

Query:  SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPLI
        SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKN +SLGKDC+LANDVDSVIVPINACGGDGTLAFARSKQYKPLI
Subjt:  SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPLI

Query:  IAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN
        IAVEEN+TVLSDSPESLGIEAV+V+NYWEAIGVVAAHKAGIDPYSLRRNRI NINCISSTS NG AVSSA Q F+
Subjt:  IAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN

A0A5A7TY81 Putative lipoprotein-like protein (e value: 1.1e-196; % identity: 93.6)
Query:  AQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELR
        A CKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPM NVLYVEGYALDRFAEGSWAL+PVHQNRVGLVLDAGME+ELR
Subjt:  AQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELR

Query:  IRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAII
        IRHLQVADAARASLGLPVMEYVVTDTPL+VEKWID  TGQSTGRIR PASL RAVQTL+NRSKVNAVAVVGRFPDDDVEE D+YRQGMGVDTLAGVEAII
Subjt:  IRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAII

Query:  SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPLI
        SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKN +SLGKDC+LANDVDSVIVPINACGGDGTLAFARSKQYKPLI
Subjt:  SHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPLI

Query:  IAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN
        IAVEEN+TVLSDSPESLGIEAV+V+NYWEAIGVVAAHKAGIDPYSLRRNRI NINCISSTS NG AVSSA Q F+
Subjt:  IAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN

A0A6J1FMR8 uncharacterized protein LOC111445603 isoform X3 (e value: 8.6e-197; % identity: 93.62)
Query:  NAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEEL
        +A CKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPM NVLYVEGYALDRFAEGSWAL+PVHQNRVGLVLDAGMEEEL
Subjt:  NAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEEL

Query:  RIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAI
        RIRHLQVADAARASLGLPVMEYVVT+TPLMVEKWIDPKTGQSTGRIR PASL RAVQ LM RSKVNAVAVVGRFPDDDVEETD+YRQGMGVDTL+GVEAI
Subjt:  RIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAI

Query:  ISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPL
        ISHLVVKEFQIPCAHAPALSPTP+C SLSPKSAAEELG+TFLPCVLSGLS APQYLS +SESLGKDCILANDVDSVIVPI+ACGGDG LAFARSKQYKPL
Subjt:  ISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPL

Query:  IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN
        IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNI+ IS TSPNGHAVSSA Q FN
Subjt:  IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN

A0A6J1K0A7 uncharacterized protein LOC111489474 isoform X4 (e value: 1.3e-197; % identity: 93.62)
Query:  NAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEEL
        +A CKRQYTSVMIVPTGVGAAIGGYAGDALPVARALA VVDCL+THPNVLNAAMLYWPM NVLYVEGYALDRFAEGSWAL+PVHQNRVGLVLDAGMEEEL
Subjt:  NAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEEL

Query:  RIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAI
        RIRHLQVADAARASLGLPVMEYVVT+TPLMVEKWIDPKTGQSTGRIR PASL RAVQ LM RSKVNAVAVVGRFPDDDVEETD+YRQGMGVDTLAGVEAI
Subjt:  RIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDDDVEETDSYRQGMGVDTLAGVEAI

Query:  ISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPL
        ISHLVVKEFQIPCAHAPALSPTP+C SLSPKSAAEELG+TFLPCVLSGLS APQY+SK+SESLGKDCILANDVDSVIVPI+ACGGDG LAFARSKQYKPL
Subjt:  ISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPL

Query:  IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN
        IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNI  IS TSPNGHAVSSA QRFN
Subjt:  IIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN

SwissProt top hits (e value, % identity, alignment)
A2RVM8 Heavy metal-associated isoprenylated plant protein 37 (e value: 4.0e-42; % identity: 36.85)
Query:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKD-DKTPNK
        MTK+EDFKLLK QT  LRVNIHC+GC +KVKKLLQRIEGV  V I AE+QKVTV G+VDSATLINKLV+AGKHAELWS   N +   KPK  D  K  N+
Subjt:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKD-DKTPNK

Query:  ---------------EPKH-LKLTSFNCEDDDIVDCVEEGDDYEAAELQF-RAANLDLLRQRAIEANNAAKGIGISRIPGLAPGNGKMNNNHNNIININN
                       +PK+  K  +F  E+D       +G + E  ++QF + AN    +Q+     NA K  G           G   NN NN +N   
Subjt:  ---------------EPKH-LKLTSFNCEDDDIVDCVEEGDDYEAAELQF-RAANLDLLRQRAIEANNAAKGIGISRIPGLAPGNGKMNNNHNNIININN

Query:  NKPGNGKKIDPNQPMAIKSTPSEIDRKTLAALKMNNAQLFSNGRESINLGEAKRANNNDLNSMMSMAGFNGGNLLNFATPSSIDVNSTNTSQGLHLQQNN
              KK++  Q     S  ++  ++ +AA++M  A   S G E+           N++ ++M +AGFNG        P+ I        Q   L   N
Subjt:  NKPGNGKKIDPNQPMAIKSTPSEIDRKTLAALKMNNAQLFSNGRESINLGEAKRANNNDLNSMMSMAGFNGGNLLNFATPSSIDVNSTNTSQGLHLQQNN

Query:  GYGYGYQPSSTSGFSMATGQYHHQQQQPTFINGYNQYHQQQPLMNMNMVNRQAMNQQPQMMYNKAQLVPPNT-GYYFNYNPSP--VHPSYPYV-------
        G       +S  G  M              +NGYN +H       MNM +RQ M+Q  QMMY ++  VP ++ GYY+NY PSP   +P YPY        
Subjt:  GYGYGYQPSSTSGFSMATGQYHHQQQQPTFINGYNQYHQQQPLMNMNMVNRQAMNQQPQMMYNKAQLVPPNT-GYYFNYNPSP--VHPSYPYV-------

Query:  --HGHNNNSAADMFSDENTSSSCSII
          H H  N +++   D   ++SC+I+
Subjt:  --HGHNNNSAADMFSDENTSSSCSII

F4JZL7 Heavy metal-associated isoprenylated plant protein 33 (e value: 2.5e-20; % identity: 55.1)
Query:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKDDKTPN
        M+KEE    +K QTC L+VNIHCDGC+QKVKK+LQ+IEGVF   I AE  KVTV GNVD + LI KL+++GKHAE+W      S      N +   PN
Subjt:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKDDKTPN

P08452 Uncharacterized lipoprotein syc1174_c (e value: 7.5e-97; % identity: 57.14)
Query:  RQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELRIRHL
        R  TSV+IVPTG+G A+GGYAGDALP+ARA+ASV D LITHPNV+N A LYWP+ NV YVEGYALDRFA G W L+PVH NR+GL+LDA +E ELRIRH 
Subjt:  RQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELRIRHL

Query:  QVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDD-DVEETDSYRQGMGVDTLAGVEAIISHL
        QVA+AA+A+LGL V   V+TD PL V       +G + G I RP SL RA   L+ ++   A+AV+ RFPDD        YRQG GVD LAG EA+ISHL
Subjt:  QVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVGRFPDD-DVEETDSYRQGMGVDTLAGVEAIISHL

Query:  VVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPLIIAV
        +V+EFQ+PCAHAPAL P PL  S+SP+SAAEELG TFLPCVL+GLS AP+Y S  +ES+  + I    VD VI P  A GG G L +A        I+AV
Subjt:  VVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINACGGDGTLAFARSKQYKPLIIAV

Query:  EENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSL
         EN + L   P  LG+    +  + EA+G +AA+KAG+DP +L
Subjt:  EENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSL

Q0WV37 Heavy metal-associated isoprenylated plant protein 34 (e value: 2.9e-16; % identity: 55.88)
Query:  LLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAEL
        ++K QTC L+VN+HC+GC+ KVKK LQ+IEGV+ V    E  +VTV GN+D A L+ KL ++GKHAE+
Subjt:  LLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAEL
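For a short alignment that fits in one block, such as the Q0WV37 hit, the % identity column can be cross-checked from the midline alone. The sketch below assumes the usual BLAST midline convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The function name midline_stats is illustrative, not part of any BLAST tool.

```python
def midline_stats(midline):
    """Return (identities, positives, length) for a BLAST-style midline."""
    # Assumption: letters are identical residues, '+' are conservative changes.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Query and midline copied from the Q0WV37 alignment in this record.
query   = "LLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAEL"
midline = "++K QTC L+VN+HC+GC+ KVKK LQ+IEGV+ V    E  +VTV GN+D A L+ KL ++GKHAE+"
assert len(query) == len(midline)  # the midline must span the same columns

identities, positives, length = midline_stats(midline)
pct_identity = round(100 * identities / length, 2)
print(identities, length, pct_identity)
```

For this block the count comes to 38 identical positions over 68 columns, which is consistent with the 55.88 listed for the hit. The same check does not apply directly to multi-block alignments unless all blocks are concatenated first.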

Q9M8K5 Heavy metal-associated isoprenylated plant protein 32 (e value: 1.1e-20; % identity: 58.62)
Query:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELW-SQKANPSPS
        M+KEE    +K QTC L+VNIHCDGC+QKVKK+LQ+IEGVF   I +E  KVTV G+VD + LI KL ++GKHAE+W + K N +P+
Subjt:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELW-SQKANPSPS

Arabidopsis top hits (e value, % identity, alignment)
AT1G23000.1 Heavy metal transport/detoxification superfamily protein (e value: 2.9e-43; % identity: 36.85)
Query:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKD-DKTPNK
        MTK+EDFKLLK QT  LRVNIHC+GC +KVKKLLQRIEGV  V I AE+QKVTV G+VDSATLINKLV+AGKHAELWS   N +   KPK  D  K  N+
Subjt:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKD-DKTPNK

Query:  ---------------EPKH-LKLTSFNCEDDDIVDCVEEGDDYEAAELQF-RAANLDLLRQRAIEANNAAKGIGISRIPGLAPGNGKMNNNHNNIININN
                       +PK+  K  +F  E+D       +G + E  ++QF + AN    +Q+     NA K  G           G   NN NN +N   
Subjt:  ---------------EPKH-LKLTSFNCEDDDIVDCVEEGDDYEAAELQF-RAANLDLLRQRAIEANNAAKGIGISRIPGLAPGNGKMNNNHNNIININN

Query:  NKPGNGKKIDPNQPMAIKSTPSEIDRKTLAALKMNNAQLFSNGRESINLGEAKRANNNDLNSMMSMAGFNGGNLLNFATPSSIDVNSTNTSQGLHLQQNN
              KK++  Q     S  ++  ++ +AA++M  A   S G E+           N++ ++M +AGFNG        P+ I        Q   L   N
Subjt:  NKPGNGKKIDPNQPMAIKSTPSEIDRKTLAALKMNNAQLFSNGRESINLGEAKRANNNDLNSMMSMAGFNGGNLLNFATPSSIDVNSTNTSQGLHLQQNN

Query:  GYGYGYQPSSTSGFSMATGQYHHQQQQPTFINGYNQYHQQQPLMNMNMVNRQAMNQQPQMMYNKAQLVPPNT-GYYFNYNPSP--VHPSYPYV-------
        G       +S  G  M              +NGYN +H       MNM +RQ M+Q  QMMY ++  VP ++ GYY+NY PSP   +P YPY        
Subjt:  GYGYGYQPSSTSGFSMATGQYHHQQQQPTFINGYNQYHQQQPLMNMNMVNRQAMNQQPQMMYNKAQLVPPNT-GYYFNYNPSP--VHPSYPYV-------

Query:  --HGHNNNSAADMFSDENTSSSCSII
          H H  N +++   D   ++SC+I+
Subjt:  --HGHNNNSAADMFSDENTSSSCSII

AT3G06130.1 Heavy metal transport/detoxification superfamily protein (e value: 8.1e-22; % identity: 58.62)
Query:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELW-SQKANPSPS
        M+KEE    +K QTC L+VNIHCDGC+QKVKK+LQ+IEGVF   I +E  KVTV G+VD + LI KL ++GKHAE+W + K N +P+
Subjt:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELW-SQKANPSPS

AT3G06130.2 Heavy metal transport/detoxification superfamily protein (e value: 8.1e-22; % identity: 58.62)
Query:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELW-SQKANPSPS
        M+KEE    +K QTC L+VNIHCDGC+QKVKK+LQ+IEGVF   I +E  KVTV G+VD + LI KL ++GKHAE+W + K N +P+
Subjt:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELW-SQKANPSPS

AT5G19090.1 Heavy metal transport/detoxification superfamily protein (e value: 1.8e-21; % identity: 55.1)
Query:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKDDKTPN
        M+KEE    +K QTC L+VNIHCDGC+QKVKK+LQ+IEGVF   I AE  KVTV GNVD + LI KL+++GKHAE+W      S      N +   PN
Subjt:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKDDKTPN

AT5G19090.2 Heavy metal transport/detoxification superfamily protein (e value: 1.8e-21; % identity: 55.1)
Query:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKDDKTPN
        M+KEE    +K QTC L+VNIHCDGC+QKVKK+LQ+IEGVF   I AE  KVTV GNVD + LI KL+++GKHAE+W      S      N +   PN
Subjt:  MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKDDKTPN


Sequences
CDS sequence
ATGACCAAAGAGGAGGATTTTAAGCTACTAAAGTTTCAGACTTGTGATCTCAGAGTGAACATTCACTGTGATGGGTGTAGGCAGAAAGTGAAGAAACTTCTTCAGAGGAT
AGAAGGAGTTTTTCAGGTTGCCATTGGTGCAGAAAATCAGAAGGTTACTGTTTTAGGAAATGTGGATTCTGCAACTTTGATCAATAAGCTGGTGAGAGCTGGAAAACATG
CTGAGCTTTGGTCACAGAAAGCAAACCCGAGCCCAAGCCCGAAACCGAAGAACAAAGACGATAAGACTCCGAACAAGGAACCAAAGCATCTCAAACTGACCTCATTCAAC
TGTGAAGATGATGACATTGTTGATTGTGTTGAGGAAGGAGATGATTATGAAGCTGCAGAGCTTCAGTTCAGAGCAGCTAATCTTGATCTCCTTAGGCAGCGGGCAATCGA
AGCAAACAATGCTGCAAAAGGCATTGGGATCAGCAGAATTCCCGGGCTTGCCCCGGGAAATGGCAAGATGAACAACAACCACAACAACATCATCAATATCAACAACAACA
AACCTGGGAATGGAAAGAAAATAGACCCTAATCAGCCAATGGCAATAAAAAGCACCCCATCTGAGATTGACAGAAAAACTTTGGCAGCTCTGAAGATGAACAATGCTCAA
TTGTTCAGTAACGGTCGAGAAAGTATCAATCTTGGGGAAGCGAAAAGAGCGAACAACAACGATCTGAATTCAATGATGAGCATGGCAGGATTCAATGGTGGCAACCTTTT
GAATTTTGCCACTCCGTCTTCCATTGATGTCAATTCAACAAACACCTCTCAAGGACTTCACCTTCAACAAAACAATGGTTATGGCTATGGCTACCAGCCATCATCAACCT
CTGGATTCTCCATGGCAACTGGTCAATATCACCATCAACAACAACAACCAACCTTCATTAATGGCTACAATCAGTACCATCAGCAGCAACCATTGATGAACATGAACATG
GTAAATAGACAAGCAATGAACCAACAACCCCAAATGATGTACAATAAAGCTCAATTGGTTCCACCAAACACAGGATATTACTTCAATTACAATCCAAGCCCTGTTCATCC
AAGTTATCCTTATGTTCATGGCCACAATAATAACTCTGCTGCTGATATGTTCAGTGATGAGAACACAAGCAGCAGTTGCTCAATCATTAAGAAAAATCCGCGGCATTTTC
GCTTTCCATATGGTACCGCCAGAAATGTCCCTCCATCTCAGTTATCCACTCCCGATATTCCCCCGCCGGAATGGATTCCGGACCAAATCTCAGCCATCGGCGGCGAAGTC
GATCGTCTCCTGCTCTGCTATCAAACGCTACACCGCCGGAGTGAGTTCGCCTCTTTAATATTCCTTCCTTCATTTCCACATAGGATACAGTTATTCGAAATTCAAATTAC
TGCGTTTATTCGTTTACGACTGTGTCTATGTATGAATGCGCAGTGCAAAAGGCAGTATACGAGTGTGATGATAGTCCCGACAGGCGTAGGCGCCGCCATTGGTGGATACG
CAGGTGACGCTCTCCCGGTTGCTCGTGCCCTCGCCTCCGTCGTTGATTGCCTTATAACTCACCCTAACGTGCTTAATGCAGCAATGCTTTACTGGCCAATGCACAATGTG
CTTTATGTTGAAGGCTATGCACTAGATCGGTTTGCAGAAGGTTCATGGGCCCTCGAACCTGTTCACCAGAATCGGGTAGGATTGGTTCTTGATGCTGGAATGGAGGAAGA
GCTTCGAATTCGTCACTTGCAAGTGGCTGATGCTGCTAGAGCTTCTCTTGGATTGCCTGTGATGGAATATGTTGTCACAGATACACCTTTAATGGTAGAGAAGTGGATTG
ATCCAAAGACGGGGCAATCAACTGGGAGGATAAGACGTCCTGCCTCACTGCATAGAGCCGTGCAGACACTAATGAACCGGTCAAAGGTAAATGCAGTTGCAGTTGTTGGA
CGATTCCCAGACGATGATGTTGAAGAGACGGATAGTTATCGACAAGGGATGGGAGTTGATACTTTGGCAGGGGTTGAGGCTATTATTAGCCATCTTGTGGTGAAGGAGTT
TCAGATTCCTTGTGCTCATGCTCCTGCTTTGTCACCTACTCCCTTATGCACATCTCTATCTCCAAAATCAGCAGCAGAGGAGTTAGGATTCACATTCTTACCATGTGTAC
TTTCTGGGCTAAGTAATGCGCCTCAATACTTGAGCAAGAACTCCGAATCATTGGGGAAAGACTGCATATTGGCAAATGATGTTGATAGTGTCATTGTACCTATAAATGCA
TGTGGAGGGGATGGCACTCTTGCTTTTGCCAGAAGCAAACAGTACAAGCCACTTATTATTGCAGTAGAGGAAAATGAAACAGTTCTCAGCGATTCTCCAGAGTCACTTGG
GATTGAGGCGGTAAAAGTCTCAAATTATTGGGAAGCCATAGGTGTCGTTGCAGCTCACAAGGCAGGAATTGATCCTTATTCCCTTCGAAGAAATAGAATCAACAACATTA
ATTGCATTTCCAGTACATCTCCTAATGGCCACGCAGTTTCAAGTGCCCATCAACGATTCAACTGA
mRNA sequence
TAACGTTGAATCACATCATCACCCAATTCAAAAAAAACCCTCTTTTTCTCTCTGTTTTCCTCTGTTTCCAAAAATCCCTCCCTTTTCCCTCTCTTCCCCTGCTCCTGACT
TTCCTCACTTCTTCCTTCCCTTCCCCTTTTCTTTCAAAATCAAAATCAAAATCAAAATCTCTCCCTTTCTCTCTCCAAACACTCCCTTCTCTTCCATGTGCACTCTCTTT
TGTTTGCTTTTGAGGAAACTCTTCTCAATAAATTAAACCCCACTTCTCCCGTTGCTGCTTTCCCATTCTTCTTTGCTCTCTCTATCTCTACAGTAACCTCTAAAAACACA
CACACACATTCAACTCAAGAAATGACCAAAGAGGAGGATTTTAAGCTACTAAAGTTTCAGACTTGTGATCTCAGAGTGAACATTCACTGTGATGGGTGTAGGCAGAAAGT
GAAGAAACTTCTTCAGAGGATAGAAGGAGTTTTTCAGGTTGCCATTGGTGCAGAAAATCAGAAGGTTACTGTTTTAGGAAATGTGGATTCTGCAACTTTGATCAATAAGC
TGGTGAGAGCTGGAAAACATGCTGAGCTTTGGTCACAGAAAGCAAACCCGAGCCCAAGCCCGAAACCGAAGAACAAAGACGATAAGACTCCGAACAAGGAACCAAAGCAT
CTCAAACTGACCTCATTCAACTGTGAAGATGATGACATTGTTGATTGTGTTGAGGAAGGAGATGATTATGAAGCTGCAGAGCTTCAGTTCAGAGCAGCTAATCTTGATCT
CCTTAGGCAGCGGGCAATCGAAGCAAACAATGCTGCAAAAGGCATTGGGATCAGCAGAATTCCCGGGCTTGCCCCGGGAAATGGCAAGATGAACAACAACCACAACAACA
TCATCAATATCAACAACAACAAACCTGGGAATGGAAAGAAAATAGACCCTAATCAGCCAATGGCAATAAAAAGCACCCCATCTGAGATTGACAGAAAAACTTTGGCAGCT
CTGAAGATGAACAATGCTCAATTGTTCAGTAACGGTCGAGAAAGTATCAATCTTGGGGAAGCGAAAAGAGCGAACAACAACGATCTGAATTCAATGATGAGCATGGCAGG
ATTCAATGGTGGCAACCTTTTGAATTTTGCCACTCCGTCTTCCATTGATGTCAATTCAACAAACACCTCTCAAGGACTTCACCTTCAACAAAACAATGGTTATGGCTATG
GCTACCAGCCATCATCAACCTCTGGATTCTCCATGGCAACTGGTCAATATCACCATCAACAACAACAACCAACCTTCATTAATGGCTACAATCAGTACCATCAGCAGCAA
CCATTGATGAACATGAACATGGTAAATAGACAAGCAATGAACCAACAACCCCAAATGATGTACAATAAAGCTCAATTGGTTCCACCAAACACAGGATATTACTTCAATTA
CAATCCAAGCCCTGTTCATCCAAGTTATCCTTATGTTCATGGCCACAATAATAACTCTGCTGCTGATATGTTCAGTGATGAGAACACAAGCAGCAGTTGCTCAATCATTA
AGAAAAATCCGCGGCATTTTCGCTTTCCATATGGTACCGCCAGAAATGTCCCTCCATCTCAGTTATCCACTCCCGATATTCCCCCGCCGGAATGGATTCCGGACCAAATC
TCAGCCATCGGCGGCGAAGTCGATCGTCTCCTGCTCTGCTATCAAACGCTACACCGCCGGAGTGAGTTCGCCTCTTTAATATTCCTTCCTTCATTTCCACATAGGATACA
GTTATTCGAAATTCAAATTACTGCGTTTATTCGTTTACGACTGTGTCTATGTATGAATGCGCAGTGCAAAAGGCAGTATACGAGTGTGATGATAGTCCCGACAGGCGTAG
GCGCCGCCATTGGTGGATACGCAGGTGACGCTCTCCCGGTTGCTCGTGCCCTCGCCTCCGTCGTTGATTGCCTTATAACTCACCCTAACGTGCTTAATGCAGCAATGCTT
TACTGGCCAATGCACAATGTGCTTTATGTTGAAGGCTATGCACTAGATCGGTTTGCAGAAGGTTCATGGGCCCTCGAACCTGTTCACCAGAATCGGGTAGGATTGGTTCT
TGATGCTGGAATGGAGGAAGAGCTTCGAATTCGTCACTTGCAAGTGGCTGATGCTGCTAGAGCTTCTCTTGGATTGCCTGTGATGGAATATGTTGTCACAGATACACCTT
TAATGGTAGAGAAGTGGATTGATCCAAAGACGGGGCAATCAACTGGGAGGATAAGACGTCCTGCCTCACTGCATAGAGCCGTGCAGACACTAATGAACCGGTCAAAGGTA
AATGCAGTTGCAGTTGTTGGACGATTCCCAGACGATGATGTTGAAGAGACGGATAGTTATCGACAAGGGATGGGAGTTGATACTTTGGCAGGGGTTGAGGCTATTATTAG
CCATCTTGTGGTGAAGGAGTTTCAGATTCCTTGTGCTCATGCTCCTGCTTTGTCACCTACTCCCTTATGCACATCTCTATCTCCAAAATCAGCAGCAGAGGAGTTAGGAT
TCACATTCTTACCATGTGTACTTTCTGGGCTAAGTAATGCGCCTCAATACTTGAGCAAGAACTCCGAATCATTGGGGAAAGACTGCATATTGGCAAATGATGTTGATAGT
GTCATTGTACCTATAAATGCATGTGGAGGGGATGGCACTCTTGCTTTTGCCAGAAGCAAACAGTACAAGCCACTTATTATTGCAGTAGAGGAAAATGAAACAGTTCTCAG
CGATTCTCCAGAGTCACTTGGGATTGAGGCGGTAAAAGTCTCAAATTATTGGGAAGCCATAGGTGTCGTTGCAGCTCACAAGGCAGGAATTGATCCTTATTCCCTTCGAA
GAAATAGAATCAACAACATTAATTGCATTTCCAGTACATCTCCTAATGGCCACGCAGTTTCAAGTGCCCATCAACGATTCAACTGA
Protein sequence
MTKEEDFKLLKFQTCDLRVNIHCDGCRQKVKKLLQRIEGVFQVAIGAENQKVTVLGNVDSATLINKLVRAGKHAELWSQKANPSPSPKPKNKDDKTPNKEPKHLKLTSFN
CEDDDIVDCVEEGDDYEAAELQFRAANLDLLRQRAIEANNAAKGIGISRIPGLAPGNGKMNNNHNNIININNNKPGNGKKIDPNQPMAIKSTPSEIDRKTLAALKMNNAQ
LFSNGRESINLGEAKRANNNDLNSMMSMAGFNGGNLLNFATPSSIDVNSTNTSQGLHLQQNNGYGYGYQPSSTSGFSMATGQYHHQQQQPTFINGYNQYHQQQPLMNMNM
VNRQAMNQQPQMMYNKAQLVPPNTGYYFNYNPSPVHPSYPYVHGHNNNSAADMFSDENTSSSCSIIKKNPRHFRFPYGTARNVPPSQLSTPDIPPPEWIPDQISAIGGEV
DRLLLCYQTLHRRSEFASLIFLPSFPHRIQLFEIQITAFIRLRLCLCMNAQCKRQYTSVMIVPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNV
LYVEGYALDRFAEGSWALEPVHQNRVGLVLDAGMEEELRIRHLQVADAARASLGLPVMEYVVTDTPLMVEKWIDPKTGQSTGRIRRPASLHRAVQTLMNRSKVNAVAVVG
RFPDDDVEETDSYRQGMGVDTLAGVEAIISHLVVKEFQIPCAHAPALSPTPLCTSLSPKSAAEELGFTFLPCVLSGLSNAPQYLSKNSESLGKDCILANDVDSVIVPINA
CGGDGTLAFARSKQYKPLIIAVEENETVLSDSPESLGIEAVKVSNYWEAIGVVAAHKAGIDPYSLRRNRINNINCISSTSPNGHAVSSAHQRFN