CuGenDBv2

Sgr025937 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr025937
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Heavy metal transport/detoxification superfamily protein
Genome location: tig00153017:1513366..1522031
RNA-Seq Expression: Sgr025937
Synteny: Sgr025937
Gene Ontology terms: GO:0005507 - copper ion binding (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains: IPR006121 - Heavy metal-associated domain, HMA
IPR021763 - Protein of unknown function DUF3326
IPR036163 - Heavy metal-associated domain superfamily
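
The genome location field has the form `contig:start..end`. A minimal Python sketch for splitting it into its parts follows. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and `parse_location` and `Location` are hypothetical names introduced here for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153017:1513366..1522031")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene model spans 8,666 bp on contig tig00153017.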


Homology
GenBank top hits (columns: e value, % identity, alignment)
PWA97078.1 putative lipoprotein [Artemisia annua] (e value: 2.0e-197; % identity: 51.83)
Query:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQK-ANPSPKPKN------KDDK
        M K+EDFKLLKIQTC LRVN+HCDGC+ KVKK+LQRIEGV+QV IDAE QKVTVSGSVD ATLIKKL++AGKHAE+WS K  N S   +N      K+DK
Subjt:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQK-ANPSPKPKN------KDDK

Query:  PANKGPKQPKLTSFNCEEDDIDDCFGEEEGEDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNSSNSNSNNNKSGNGKKVVP
          NK  K+  +      +   DD F        +DE+ +FL+ K   L  LRQ+  EA               A  NG  +  N+ + +   G       
Subjt:  PANKGPKQPKLTSFNCEEDDIDDCFGEEEGEDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNSSNSNSNNNKSGNGKKVVP

Query:  NQQMGIKNIPSGIDQKAMAALRMNNAQHFSSGGGSGGRGGGSIINLGEAKRGNNDLNSMMNMAGFNGGNLVNFATPSSIGLNSTNSSQGHHLQQNNGYGY
                             ++NNA         G +  G+  +L E KR  ++L S+MN  G   GN         IG              NNG G 
Subjt:  NQQMGIKNIPSGIDQKAMAALRMNNAQHFSSGGGSGGRGGGSIINLGEAKRGNNDLNSMMNMAGFNGGNLVNFATPSSIGLNSTNSSQGHHLQQNNGYGY

Query:  QQPSSTSGFHMTGQYQQQQPTSINAYNQ--YHHQPPLMNMNMLTRQAMNQQPQMMYNRAHLVPPNTG-YYYNYNPSPVQPGYPYVEAGYQQGHN------
           +   G +  GQ  Q    S+N   Q  Y++    M MNM  RQA       MY R+ ++ PNTG YYYNYNP+P     P+    Y  G+N      
Subjt:  QQPSSTSGFHMTGQYQQQQPTSINAYNQ--YHHQPPLMNMNMLTRQAMNQQPQMMYNRAHLVPPNTG-YYYNYNPSPVQPGYPYVEAGYQQGHN------

Query:  --SNSAADMFSDENTSSSCSIMTKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIG
            + A+M SDEN +SSCS+  K                               HF   N   +S+  ++      + + KR+Y SV+I+PTGVGA+IG
Subjt:  --SNSAADMFSDENTSSSCSIMTKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIG

Query:  GYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYI
        GYAGDALPV R LASV D +I+HPNVLNAAMLYWPM NVLYVEG+ALDRFAEG WAL+PVHQN++GLVLDAGIEEELR RHLQVADA RASLGLPV EYI
Subjt:  GYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYI

Query:  VTETPLVVEKWIDPKTGQSTGRIRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTP
        VT+TPL VEKW+DPK+GQ+TGRI+HP SLLRAVQTL DRS VNAVAVV RFP+DDV++ D YRQG G+D LAGVEAIISHLVVK F+IPCAHAPAL P P
Subjt:  VTETPLVVEKWIDPKTGQSTGRIRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTP

Query:  LCKSLSPKSAAEELGYTFLPCVLSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAV
        L +SL PKSAAEELGYTFLPCVL+GL+NAPQYL   SES     I+A+DVDSVILP DACGGDG LAFA + + KPLII V+ENETVL+DTP+ L I+ +
Subjt:  LCKSLSPKSAAEELGYTFLPCVLSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAV

Query:  KVSNYWEAIGVIAAHKAGIDPYSLRRNRINNVNCISTTSPNGYAVSS
        KVSNYWEAIGV+AAHKAG+DP SLRR++I N+        NG+  SS
Subjt:  KVSNYWEAIGVIAAHKAGIDPYSLRRNRINNVNCISTTSPNGYAVSS

XP_022152336.1 uncharacterized protein LOC111020080 [Momordica charantia] (e value: 9.4e-203; % identity: 85.35)
Query:  KSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLITH
        KSQPSAA SSVSCSAINRYSAG                                   CKRQYTSVMI+PTGVGAAIGGYAGDALPVARALASVVDCLI H
Subjt:  KSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLITH

Query:  PNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGRI
        PN+LNAAMLYWPMHNVLYVEG+ALDRFAEG WALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEY+VTETPLVVEKWIDPKTG+STGRI
Subjt:  PNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGRI

Query:  RHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVL
        RHPASLL AVQTL++RSKVNAVAVVGRFPDDDVEETDNYRQG GVDALAGVEA+ISHLVVKEFQ+PCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVL
Subjt:  RHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVL

Query:  SGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPYS
        SGLSNAPQYLSKS+ESL KD ILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGI+ VKVSNYWEAIGVIAAHKAGIDPYS
Subjt:  SGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPYS

Query:  LRRNRINNVNCISTTSPNGYAVSSASQLFN
        LRRN INN  C+STT+PNGY +SSA Q FN
Subjt:  LRRNRINNVNCISTTSPNGYAVSSASQLFN

XP_022939862.1 uncharacterized protein LOC111445603 isoform X3 [Cucurbita moschata] (e value: 4.4e-200; % identity: 84.69)
Query:  TKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLIT
        TKSQP AAKS VSCSAINRYSAG                                   CKRQYTSVMI+PTGVGAAIGGYAGDALPVARALASVVDCLIT
Subjt:  TKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLIT

Query:  HPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGR
        HPNVLNAAMLYWPM NVLYVEG+ALDRFAEGSWAL+PVHQNR+GLVLDAG+EEELRIRHLQVADAARASLGLPVMEY+VTETPL+VEKWIDPKTGQSTGR
Subjt:  HPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGR

Query:  IRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCV
        IRHPASLLRAVQ LM RSKVNAVAVVGRFPDDDVEETDNYRQG GVD L+GVEAIISHLVVKEFQIPCAHAPALSPTP+CKSLSPKSAAEELGYTFLPCV
Subjt:  IRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCV

Query:  LSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPY
        LSGLS APQYLS SSESLGKD ILANDVDSVI+PIDACGGDGALAFARS QYKPLIIAVEENETVLSD+P+SLGIEAVKVSNYWEAIGV+AAHKAGIDPY
Subjt:  LSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPY

Query:  SLRRNRINNVNCISTTSPNGYAVSSASQLFN
        SLRRNRINN++ IS TSPNG+AVSSA Q FN
Subjt:  SLRRNRINNVNCISTTSPNGYAVSSASQLFN

XP_022993478.1 uncharacterized protein LOC111489474 isoform X4 [Cucurbita maxima] (e value: 3.4e-200; % identity: 84.45)
Query:  TKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLIT
        TKSQP AAKS VSCSAINRYSAG                                   CKRQYTSVMI+PTGVGAAIGGYAGDALPVARALA VVDCL+T
Subjt:  TKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLIT

Query:  HPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGR
        HPNVLNAAMLYWPM NVLYVEG+ALDRFAEGSWAL+PVHQNR+GLVLDAG+EEELRIRHLQVADAARASLGLPVMEY+VTETPL+VEKWIDPKTGQSTGR
Subjt:  HPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGR

Query:  IRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCV
        IRHPASLLRAVQ LM RSKVNAVAVVGRFPDDDVEETDNYRQG GVD LAGVEAIISHLVVKEFQIPCAHAPALSPTP+CKSLSPKSAAEELGYTFLPCV
Subjt:  IRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCV

Query:  LSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPY
        LSGLS APQY+SKSSESLGKD ILANDVDSVI+PIDACGGDGALAFARS QYKPLIIAVEENETVLSD+P+SLGIEAVKVSNYWEAIGV+AAHKAGIDPY
Subjt:  LSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPY

Query:  SLRRNRINNVNCISTTSPNGYAVSSASQLFN
        SLRRNRINN+  IS TSPNG+AVSSA Q FN
Subjt:  SLRRNRINNVNCISTTSPNGYAVSSASQLFN

XP_023551677.1 uncharacterized protein LOC111809445 [Cucurbita pepo subsp. pepo] (e value: 1.1e-198; % identity: 83.99)
Query:  TKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLIT
        TKSQP AA S V+CSAINRYSAG                                   CKRQYTSVMI+PTGVGAAIGGYAGDALPVARALASVVDCLIT
Subjt:  TKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLIT

Query:  HPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGR
        HPNVLNAAMLYWPM NVLYVEG+ALDRFAEGSWAL+PVHQNR+GLVLDAG+EEELRIRHLQVADAARASLGLPVMEY+VTETPL+VEKWIDPKTGQSTGR
Subjt:  HPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGR

Query:  IRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCV
        IRHPASLLRAVQ LM RSKVNAVAVVGRFPDDDVEETDNYRQG GVD LAGVEAIISHLVVKEFQIPCAHAPALSPTP+CKS+SPKSAAEELGYTFLPCV
Subjt:  IRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCV

Query:  LSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPY
        LSGLS APQYLS SSESLGKD ILANDVDSVI+PIDACGGDGALAFARS Q+KPLIIAVEENETVLSD+P+SLGIEAVKVSNYWEAIGV+AAHKAGIDPY
Subjt:  LSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPY

Query:  SLRRNRINNVNCISTTSPNGYAVSSASQLFN
        SLRRNRINN++ IS TSPNG+AVSSA Q FN
Subjt:  SLRRNRINNVNCISTTSPNGYAVSSASQLFN

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3CT16 uncharacterized lipoprotein syc1174_c-like (e value: 2.0e-190; % identity: 82.97)
Query:  VSLPLQFSFLHFEIANSFR---RSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYV
        +SL L +    F   N FR   +S  ++  +       CKRQYTSVMI+PTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPM NVLYV
Subjt:  VSLPLQFSFLHFEIANSFR---RSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYV

Query:  EGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGRIRHPASLLRAVQTLMDRSKV
        EG+ALDRFAEGSWAL+PVHQNR+GLVLDAG+E+ELRIRHLQVADAARASLGLPVMEY+VT+TPLVVEKWID  TGQSTGRIRHPASLLRAVQTL++RSKV
Subjt:  EGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGRIRHPASLLRAVQTLMDRSKV

Query:  NAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVLSGLSNAPQYLSKSSESLGK
        NAVAVVGRFPDDDVEE DNYRQG GVD LAGVEAIISHLVVKEFQIPCAHAPALSPTPLC SLSPKSAAEELG+TFLPCVLSGLSNAPQYLSK+ +SLGK
Subjt:  NAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVLSGLSNAPQYLSKSSESLGK

Query:  DFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPYSLRRNRINNVNCISTTSPNG
        D +LANDVDSVI+PI+ACGGDG LAFARS QYKPLIIAVEEN+TVLSD+P+SLGIEAV+V+NYWEAIGV+AAHKAGIDPYSLRRNRI N+NCIS+TS NG
Subjt:  DFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPYSLRRNRINNVNCISTTSPNG

Query:  YAVSSASQLFN
         AVSSASQ F+
Subjt:  YAVSSASQLFN

A0A2U1QGG8 Putative lipoprotein (e value: 9.8e-198; % identity: 51.83)
Query:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQK-ANPSPKPKN------KDDK
        M K+EDFKLLKIQTC LRVN+HCDGC+ KVKK+LQRIEGV+QV IDAE QKVTVSGSVD ATLIKKL++AGKHAE+WS K  N S   +N      K+DK
Subjt:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQK-ANPSPKPKN------KDDK

Query:  PANKGPKQPKLTSFNCEEDDIDDCFGEEEGEDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNSSNSNSNNNKSGNGKKVVP
          NK  K+  +      +   DD F        +DE+ +FL+ K   L  LRQ+  EA               A  NG  +  N+ + +   G       
Subjt:  PANKGPKQPKLTSFNCEEDDIDDCFGEEEGEDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNSSNSNSNNNKSGNGKKVVP

Query:  NQQMGIKNIPSGIDQKAMAALRMNNAQHFSSGGGSGGRGGGSIINLGEAKRGNNDLNSMMNMAGFNGGNLVNFATPSSIGLNSTNSSQGHHLQQNNGYGY
                             ++NNA         G +  G+  +L E KR  ++L S+MN  G   GN         IG              NNG G 
Subjt:  NQQMGIKNIPSGIDQKAMAALRMNNAQHFSSGGGSGGRGGGSIINLGEAKRGNNDLNSMMNMAGFNGGNLVNFATPSSIGLNSTNSSQGHHLQQNNGYGY

Query:  QQPSSTSGFHMTGQYQQQQPTSINAYNQ--YHHQPPLMNMNMLTRQAMNQQPQMMYNRAHLVPPNTG-YYYNYNPSPVQPGYPYVEAGYQQGHN------
           +   G +  GQ  Q    S+N   Q  Y++    M MNM  RQA       MY R+ ++ PNTG YYYNYNP+P     P+    Y  G+N      
Subjt:  QQPSSTSGFHMTGQYQQQQPTSINAYNQ--YHHQPPLMNMNMLTRQAMNQQPQMMYNRAHLVPPNTG-YYYNYNPSPVQPGYPYVEAGYQQGHN------

Query:  --SNSAADMFSDENTSSSCSIMTKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIG
            + A+M SDEN +SSCS+  K                               HF   N   +S+  ++      + + KR+Y SV+I+PTGVGA+IG
Subjt:  --SNSAADMFSDENTSSSCSIMTKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIG

Query:  GYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYI
        GYAGDALPV R LASV D +I+HPNVLNAAMLYWPM NVLYVEG+ALDRFAEG WAL+PVHQN++GLVLDAGIEEELR RHLQVADA RASLGLPV EYI
Subjt:  GYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYI

Query:  VTETPLVVEKWIDPKTGQSTGRIRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTP
        VT+TPL VEKW+DPK+GQ+TGRI+HP SLLRAVQTL DRS VNAVAVV RFP+DDV++ D YRQG G+D LAGVEAIISHLVVK F+IPCAHAPAL P P
Subjt:  VTETPLVVEKWIDPKTGQSTGRIRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTP

Query:  LCKSLSPKSAAEELGYTFLPCVLSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAV
        L +SL PKSAAEELGYTFLPCVL+GL+NAPQYL   SES     I+A+DVDSVILP DACGGDG LAFA + + KPLII V+ENETVL+DTP+ L I+ +
Subjt:  LCKSLSPKSAAEELGYTFLPCVLSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAV

Query:  KVSNYWEAIGVIAAHKAGIDPYSLRRNRINNVNCISTTSPNGYAVSS
        KVSNYWEAIGV+AAHKAG+DP SLRR++I N+        NG+  SS
Subjt:  KVSNYWEAIGVIAAHKAGIDPYSLRRNRINNVNCISTTSPNGYAVSS

A0A6J1DHE6 uncharacterized protein LOC111020080 (e value: 4.6e-203; % identity: 85.35)
Query:  KSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLITH
        KSQPSAA SSVSCSAINRYSAG                                   CKRQYTSVMI+PTGVGAAIGGYAGDALPVARALASVVDCLI H
Subjt:  KSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLITH

Query:  PNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGRI
        PN+LNAAMLYWPMHNVLYVEG+ALDRFAEG WALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEY+VTETPLVVEKWIDPKTG+STGRI
Subjt:  PNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGRI

Query:  RHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVL
        RHPASLL AVQTL++RSKVNAVAVVGRFPDDDVEETDNYRQG GVDALAGVEA+ISHLVVKEFQ+PCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVL
Subjt:  RHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVL

Query:  SGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPYS
        SGLSNAPQYLSKS+ESL KD ILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGI+ VKVSNYWEAIGVIAAHKAGIDPYS
Subjt:  SGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPYS

Query:  LRRNRINNVNCISTTSPNGYAVSSASQLFN
        LRRN INN  C+STT+PNGY +SSA Q FN
Subjt:  LRRNRINNVNCISTTSPNGYAVSSASQLFN

A0A6J1FMR8 uncharacterized protein LOC111445603 isoform X3 (e value: 2.1e-200; % identity: 84.69)
Query:  TKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLIT
        TKSQP AAKS VSCSAINRYSAG                                   CKRQYTSVMI+PTGVGAAIGGYAGDALPVARALASVVDCLIT
Subjt:  TKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLIT

Query:  HPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGR
        HPNVLNAAMLYWPM NVLYVEG+ALDRFAEGSWAL+PVHQNR+GLVLDAG+EEELRIRHLQVADAARASLGLPVMEY+VTETPL+VEKWIDPKTGQSTGR
Subjt:  HPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGR

Query:  IRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCV
        IRHPASLLRAVQ LM RSKVNAVAVVGRFPDDDVEETDNYRQG GVD L+GVEAIISHLVVKEFQIPCAHAPALSPTP+CKSLSPKSAAEELGYTFLPCV
Subjt:  IRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCV

Query:  LSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPY
        LSGLS APQYLS SSESLGKD ILANDVDSVI+PIDACGGDGALAFARS QYKPLIIAVEENETVLSD+P+SLGIEAVKVSNYWEAIGV+AAHKAGIDPY
Subjt:  LSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPY

Query:  SLRRNRINNVNCISTTSPNGYAVSSASQLFN
        SLRRNRINN++ IS TSPNG+AVSSA Q FN
Subjt:  SLRRNRINNVNCISTTSPNGYAVSSASQLFN

A0A6J1K0A7 uncharacterized protein LOC111489474 isoform X4 (e value: 1.6e-200; % identity: 84.45)
Query:  TKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLIT
        TKSQP AAKS VSCSAINRYSAG                                   CKRQYTSVMI+PTGVGAAIGGYAGDALPVARALA VVDCL+T
Subjt:  TKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFEIANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLIT

Query:  HPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGR
        HPNVLNAAMLYWPM NVLYVEG+ALDRFAEGSWAL+PVHQNR+GLVLDAG+EEELRIRHLQVADAARASLGLPVMEY+VTETPL+VEKWIDPKTGQSTGR
Subjt:  HPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGR

Query:  IRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCV
        IRHPASLLRAVQ LM RSKVNAVAVVGRFPDDDVEETDNYRQG GVD LAGVEAIISHLVVKEFQIPCAHAPALSPTP+CKSLSPKSAAEELGYTFLPCV
Subjt:  IRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAIISHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCV

Query:  LSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPY
        LSGLS APQY+SKSSESLGKD ILANDVDSVI+PIDACGGDGALAFARS QYKPLIIAVEENETVLSD+P+SLGIEAVKVSNYWEAIGV+AAHKAGIDPY
Subjt:  LSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPY

Query:  SLRRNRINNVNCISTTSPNGYAVSSASQLFN
        SLRRNRINN+  IS TSPNG+AVSSA Q FN
Subjt:  SLRRNRINNVNCISTTSPNGYAVSSASQLFN

SwissProt top hits (columns: e value, % identity, alignment)
A2RVM8 Heavy metal-associated isoprenylated plant protein 37 (e value: 5.8e-46; % identity: 40.14)
Query:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPS--PKPKNKD--DKPAN
        MTK+EDFKLLKIQT  LRVNIHC+GC +KVKKLLQRIEGV  V I+AE+QKVTVSGSVDSATLI KLV+AGKHAELWS   N +   KPK  D       
Subjt:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPS--PKPKNKD--DKPAN

Query:  KGPKQ---------------PKLTSFNCEEDDIDDCFGEEEGEDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNSSNSNSN
        KG KQ               PK  +F  EED         +G + ED + QF K         +QQ     NAKK           +G   MN    N N
Subjt:  KGPKQ---------------PKLTSFNCEEDDIDDCFGEEEGEDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNSSNSNSN

Query:  NNKSGNGKKVVPNQQMGIKNIPSGIDQKAMAALRMNNAQHFSSGGGSGGRGGGSIINLGEAKRGNNDLNSMMNMAGFNGG-NLVNFATPSSI-------G
        N  +   KKV   Q    +N      Q+ MAA+RM  A   S+G  +                  N++ ++M +AGFNG  N VN   P+ I        
Subjt:  NNKSGNGKKVVPNQQMGIKNIPSGIDQKAMAALRMNNAQHFSSGGGSGGRGGGSIINLGEAKRGNNDLNSMMNMAGFNGG-NLVNFATPSSI-------G

Query:  LNSTNSSQGHHLQQNNGYGYQQPSSTSGFHMTGQYQQQQPTSINAYNQYHHQPPLMNMNMLTRQAMNQQPQMMYNRAHLVPPNT-GYYYNYNPSPVQ--P
        LN+ N    H+L  +NG          G  M          ++N YN +H       MNM +RQ M+Q  QMMY R+  VP ++ GYYYNY PSP    P
Subjt:  LNSTNSSQGHHLQQNNGYGYQQPSSTSGFHMTGQYQQQQPTSINAYNQYHHQPPLMNMNMLTRQAMNQQPQMMYNRAHLVPPNT-GYYYNYNPSPVQ--P

Query:  GYPYVEAGYQQGHNSNSAADMFSDEN--TSSSCSIM
         YPY    YQQ  + + A +M S+E+   ++SC+IM
Subjt:  GYPYVEAGYQQGHNSNSAADMFSDEN--TSSSCSIM

F4JZL7 Heavy metal-associated isoprenylated plant protein 33 (e value: 1.2e-22; % identity: 62.22)
Query:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPSPKPKNK
        M+KEE    +KIQTCVL+VNIHCDGC+QKVKK+LQ+IEGVF   IDAE  KVTVSG+VD + LIKKL+++GKHAE+W      S   +N+
Subjt:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPSPKPKNK

P08452 Uncharacterized lipoprotein syc1174_c (e value: 5.5e-97; % identity: 56.56)
Query:  RQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHL
        R  TSV+I+PTG+G A+GGYAGDALP+ARA+ASV D LITHPNV+N A LYWP+ NV YVEG+ALDRFA G W L+PVH NRIGL+LDA IE ELRIRH 
Subjt:  RQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLVLDAGIEEELRIRHL

Query:  QVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGRIRHPASLLRAVQTLMDRSKVNAVAVVGRFPDD-DVEETDNYRQGTGVDALAGVEAIISHL
        QVA+AA+A+LGL V   ++T+ PL V       +G + G I  P SLLRA   L+ ++   A+AV+ RFPDD       +YRQG GVD LAG EA+ISHL
Subjt:  QVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGRIRHPASLLRAVQTLMDRSKVNAVAVVGRFPDD-DVEETDNYRQGTGVDALAGVEAIISHL

Query:  VVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVLSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAV
        +V+EFQ+PCAHAPAL P PL  S+SP+SAAEELG+TFLPCVL+GLS AP+Y S ++ES+ +  I    VD VI P  A GG G L +A        I+AV
Subjt:  VVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVLSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAV

Query:  EENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPYSL
         EN + L   P  LG+    +  + EA+G +AA+KAG+DP +L
Subjt:  EENETVLSDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPYSL

Q0WV37 Heavy metal-associated isoprenylated plant protein 34 (e value: 6.7e-18; % identity: 58.82)
Query:  LLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAEL
        ++K+QTCVL+VN+HC+GC+ KVKK LQ+IEGV+ V  D E  +VTV+G++D A L+KKL ++GKHAE+
Subjt:  LLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAEL
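
The % identity column can be cross-checked against the alignment midline. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and divides by the full alignment length. `percent_identity` is a hypothetical helper, not part of any BLAST tool.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumes the BLAST-style convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Midline of the Q0WV37 alignment, copied without its leading indentation.
midline = "++K+QTCVL+VN+HC+GC+ KVKK LQ+IEGV+ V  D E  +VTV+G++D A L+KKL ++GKHAE+"
print(round(percent_identity(midline), 2))
```

For this hit the midline has 40 letters over 68 columns, which agrees with the listed 58.82.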

Q9M8K5 Heavy metal-associated isoprenylated plant protein 32 (e value: 1.1e-23; % identity: 66.28)
Query:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELW-SQKANPSP
        M+KEE    +KIQTCVL+VNIHCDGC+QKVKK+LQ+IEGVF   ID+E  KVTVSGSVD + LIKKL ++GKHAE+W + K N +P
Subjt:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELW-SQKANPSP

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G23000.1 Heavy metal transport/detoxification superfamily protein (e value: 4.1e-47; % identity: 40.14)
Query:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPS--PKPKNKD--DKPAN
        MTK+EDFKLLKIQT  LRVNIHC+GC +KVKKLLQRIEGV  V I+AE+QKVTVSGSVDSATLI KLV+AGKHAELWS   N +   KPK  D       
Subjt:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPS--PKPKNKD--DKPAN

Query:  KGPKQ---------------PKLTSFNCEEDDIDDCFGEEEGEDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNSSNSNSN
        KG KQ               PK  +F  EED         +G + ED + QF K         +QQ     NAKK           +G   MN    N N
Subjt:  KGPKQ---------------PKLTSFNCEEDDIDDCFGEEEGEDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNSSNSNSN

Query:  NNKSGNGKKVVPNQQMGIKNIPSGIDQKAMAALRMNNAQHFSSGGGSGGRGGGSIINLGEAKRGNNDLNSMMNMAGFNGG-NLVNFATPSSI-------G
        N  +   KKV   Q    +N      Q+ MAA+RM  A   S+G  +                  N++ ++M +AGFNG  N VN   P+ I        
Subjt:  NNKSGNGKKVVPNQQMGIKNIPSGIDQKAMAALRMNNAQHFSSGGGSGGRGGGSIINLGEAKRGNNDLNSMMNMAGFNGG-NLVNFATPSSI-------G

Query:  LNSTNSSQGHHLQQNNGYGYQQPSSTSGFHMTGQYQQQQPTSINAYNQYHHQPPLMNMNMLTRQAMNQQPQMMYNRAHLVPPNT-GYYYNYNPSPVQ--P
        LN+ N    H+L  +NG          G  M          ++N YN +H       MNM +RQ M+Q  QMMY R+  VP ++ GYYYNY PSP    P
Subjt:  LNSTNSSQGHHLQQNNGYGYQQPSSTSGFHMTGQYQQQQPTSINAYNQYHHQPPLMNMNMLTRQAMNQQPQMMYNRAHLVPPNT-GYYYNYNPSPVQ--P

Query:  GYPYVEAGYQQGHNSNSAADMFSDEN--TSSSCSIM
         YPY    YQQ  + + A +M S+E+   ++SC+IM
Subjt:  GYPYVEAGYQQGHNSNSAADMFSDEN--TSSSCSIM

AT3G06130.1 Heavy metal transport/detoxification superfamily protein (e value: 7.6e-25; % identity: 66.28)
Query:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELW-SQKANPSP
        M+KEE    +KIQTCVL+VNIHCDGC+QKVKK+LQ+IEGVF   ID+E  KVTVSGSVD + LIKKL ++GKHAE+W + K N +P
Subjt:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELW-SQKANPSP

AT3G06130.2 Heavy metal transport/detoxification superfamily protein (e value: 7.6e-25; % identity: 66.28)
Query:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELW-SQKANPSP
        M+KEE    +KIQTCVL+VNIHCDGC+QKVKK+LQ+IEGVF   ID+E  KVTVSGSVD + LIKKL ++GKHAE+W + K N +P
Subjt:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELW-SQKANPSP

AT5G19090.1 Heavy metal transport/detoxification superfamily protein (e value: 8.4e-24; % identity: 62.22)
Query:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPSPKPKNK
        M+KEE    +KIQTCVL+VNIHCDGC+QKVKK+LQ+IEGVF   IDAE  KVTVSG+VD + LIKKL+++GKHAE+W      S   +N+
Subjt:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPSPKPKNK

AT5G19090.2 Heavy metal transport/detoxification superfamily protein (e value: 7.1e-15; % identity: 29.03)
Query:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPSPKPKNKD---------
        M+KEE    +KIQTCVL+VNIHCDGC+QKVKK+LQ+IEGVF   IDAE  KVTVSG+VD + LIKKL+++GKHAE+W      S   +N+          
Subjt:  MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPSPKPKNKD---------

Query:  --DKPANKGPKQPKLTSFNCEEDDIDDCFGEEEG--------------EDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNS
          D     G       + N +   I    G   G                   ++ Q L  +        QQ     + K    +   PGP  G+  MN 
Subjt:  --DKPANKGPKQPKLTSFNCEEDDIDDCFGEEEG--------------EDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNS

Query:  SNSNSNNNKSGNGKKVVPNQQMGIKNIPSGIDQK--------------------------------------------------AMAALRMNNAQHFSSG
        +    NN          PNQ+    N+P   D++                                                   M  + M NAQ   + 
Subjt:  SNSNSNNNKSGNGKKVVPNQQMGIKNIPSGIDQK--------------------------------------------------AMAALRMNNAQHFSSG

Query:  GGSGGRGGG---SIINLGEAKRGNNDLNSMMNMAGFNGGNLVNFATPSSI----------GLNSTNSSQGHHLQQNNGYGYQQPSSTSGFHMTGQYQQQQ
          +GG GGG     + +G A  G   + S+  M G  G    N      +          G  S  +  G+   Q +G G     S  G     Q QQQQ
Subjt:  GGSGGRGGG---SIINLGEAKRGNNDLNSMMNMAGFNGGNLVNFATPSSI----------GLNSTNSSQGHHLQQNNGYGYQQPSSTSGFHMTGQYQQQQ

Query:  PTSINAYNQYHHQPPLMNMNMLTRQAMNQQPQ-MMYNRAHLVPPNTGYYYNYNPSPVQP---GYPYVEAGYQQGHNSNSAADMFSDENTSSSCSIM
                Q  +   +MN     R   N++ Q MMY R    PP    Y    P P Q     YPY        HN +  +D F+DENT SSC+IM
Subjt:  PTSINAYNQYHHQPPLMNMNMLTRQAMNQQPQ-MMYNRAHLVPPNTGYYYNYNPSPVQP---GYPYVEAGYQQGHNSNSAADMFSDENTSSSCSIM


Sequences
CDS sequence
ATGACCAAAGAGGAAGATTTTAAGCTACTCAAGATTCAGACTTGTGTTCTCAGAGTGAACATTCACTGTGATGGGTGTAGGCAGAAAGTGAAGAAACTTCTTCAGAGGAT
AGAAGGAGTTTTCCAGGTCGTCATAGATGCAGAGAATCAGAAAGTTACAGTTTCAGGAAGTGTAGATTCTGCAACTTTGATCAAGAAGCTGGTGAGAGCTGGAAAACACG
CAGAGCTTTGGTCACAGAAAGCGAACCCCAGCCCAAAACCGAAGAATAAAGATGATAAGCCTGCGAACAAGGGACCGAAACAGCCAAAACTGACCTCATTCAACTGTGAA
GAAGATGACATTGATGACTGTTTTGGTGAGGAGGAAGGGGAAGATTACGAAGATGAAGAGTTTCAGTTCCTTAAGGAGAAAGCAGCTCATCTTGGTCTCCTTAGGCAGCA
AGCGATCGAAGCAAACAACGCCAAGAAATGCATCGGGATCAGCCAAATTCCCGGGCCAGCCACGGGAAATGGAAAGATGAACAGCAGCAACAGCAACAGCAACAACAACA
AATCTGGGAATGGAAAGAAAGTAGTCCCAAATCAGCAAATGGGCATAAAAAACATCCCATCTGGGATTGACCAGAAAGCCATGGCAGCTCTGAGGATGAACAATGCTCAA
CACTTCAGCAGTGGCGGGGGCAGCGGTGGCAGAGGCGGAGGAAGTATCATCAATCTCGGGGAAGCGAAAAGAGGAAACAACGACCTGAATTCAATGATGAACATGGCAGG
ATTCAACGGCGGAAACCTTGTAAATTTTGCCACTCCATCGTCCATTGGTTTGAATTCAACAAACTCATCTCAAGGACATCACCTTCAACAAAACAATGGCTATGGCTACC
AGCAGCCATCTTCAACCTCTGGCTTCCACATGACTGGTCAATATCAGCAACAACAACCAACCTCCATCAATGCCTACAACCAGTATCATCATCAGCCGCCATTGATGAAC
ATGAACATGCTAACCAGACAAGCAATGAACCAGCAGCCGCAGATGATGTACAACAGGGCTCACTTGGTTCCACCAAACACAGGATATTACTACAATTACAATCCTAGCCC
TGTCCAACCAGGTTATCCCTATGTTGAGGCTGGTTATCAGCAAGGCCATAATAGTAACTCTGCAGCCGATATGTTCAGTGATGAGAACACAAGCAGCAGCTGCTCCATCA
TGACCAAATCTCAGCCATCGGCGGCGAAGTCGAGCGTCTCCTGCTCCGCTATTAACCGCTACTCCGCCGGGGTGAGTTTGCCCCTTCAATTTTCCTTCCTTCATTTCGAA
ATAGCAAACAGTTTCCGGCGTAGTGTGTTAATTCGTTTGCGATTGTGTCTGGAACTGAATACGCAGTGTAAAAGGCAGTATACGAGTGTCATGATAATACCGACGGGCGT
AGGCGCCGCCATTGGTGGATATGCAGGTGACGCTCTGCCGGTTGCTCGCGCTCTCGCCTCCGTCGTCGATTGCCTTATCACTCACCCTAACGTACTTAATGCAGCAATGC
TTTACTGGCCGATGCACAATGTGCTTTATGTTGAAGGCCATGCACTTGATCGGTTTGCAGAAGGTTCATGGGCCCTAAAACCTGTTCACCAGAATCGGATAGGATTGGTT
CTTGATGCTGGAATTGAGGAAGAGCTTCGAATTCGTCACTTGCAAGTGGCTGATGCTGCTAGAGCTTCTCTTGGATTGCCTGTGATGGAATATATTGTCACAGAGACACC
TTTAGTGGTAGAGAAGTGGATTGATCCCAAAACTGGGCAATCAACTGGGAGGATTAGACACCCTGCCTCACTACTAAGAGCTGTGCAGACATTAATGGACCGGTCAAAGG
TAAATGCTGTTGCAGTTGTTGGACGATTCCCAGACGACGATGTTGAAGAGACAGATAACTATCGTCAAGGGACGGGAGTTGATGCTTTGGCAGGGGTTGAGGCTATCATT
AGCCATCTTGTGGTGAAGGAGTTCCAGATTCCATGTGCTCATGCTCCTGCTTTATCACCTACTCCCTTATGCAAATCTCTATCTCCAAAATCTGCTGCTGAGGAGTTAGG
ATACACATTCTTACCATGTGTACTTTCTGGGCTAAGTAATGCGCCTCAATACTTGAGCAAGAGTTCAGAATCATTGGGGAAGGATTTCATATTGGCAAATGATGTTGATA
GTGTCATTCTACCTATAGATGCTTGTGGAGGGGATGGCGCTCTTGCTTTTGCCAGAAGCAACCAGTACAAGCCACTTATTATTGCGGTTGAGGAAAATGAAACAGTTCTC
AGCGATACTCCAGACTCGCTTGGGATTGAGGCGGTAAAAGTCTCAAATTATTGGGAAGCCATAGGTGTTATTGCAGCACACAAGGCAGGAATAGATCCCTATTCCCTGCG
AAGAAATAGAATCAACAACGTTAACTGCATTTCCACTACATCTCCTAACGGTTATGCAGTTTCAAGTGCATCTCAACTATTCAACTGA
mRNA sequence
ATGACCAAAGAGGAAGATTTTAAGCTACTCAAGATTCAGACTTGTGTTCTCAGAGTGAACATTCACTGTGATGGGTGTAGGCAGAAAGTGAAGAAACTTCTTCAGAGGAT
AGAAGGAGTTTTCCAGGTCGTCATAGATGCAGAGAATCAGAAAGTTACAGTTTCAGGAAGTGTAGATTCTGCAACTTTGATCAAGAAGCTGGTGAGAGCTGGAAAACACG
CAGAGCTTTGGTCACAGAAAGCGAACCCCAGCCCAAAACCGAAGAATAAAGATGATAAGCCTGCGAACAAGGGACCGAAACAGCCAAAACTGACCTCATTCAACTGTGAA
GAAGATGACATTGATGACTGTTTTGGTGAGGAGGAAGGGGAAGATTACGAAGATGAAGAGTTTCAGTTCCTTAAGGAGAAAGCAGCTCATCTTGGTCTCCTTAGGCAGCA
AGCGATCGAAGCAAACAACGCCAAGAAATGCATCGGGATCAGCCAAATTCCCGGGCCAGCCACGGGAAATGGAAAGATGAACAGCAGCAACAGCAACAGCAACAACAACA
AATCTGGGAATGGAAAGAAAGTAGTCCCAAATCAGCAAATGGGCATAAAAAACATCCCATCTGGGATTGACCAGAAAGCCATGGCAGCTCTGAGGATGAACAATGCTCAA
CACTTCAGCAGTGGCGGGGGCAGCGGTGGCAGAGGCGGAGGAAGTATCATCAATCTCGGGGAAGCGAAAAGAGGAAACAACGACCTGAATTCAATGATGAACATGGCAGG
ATTCAACGGCGGAAACCTTGTAAATTTTGCCACTCCATCGTCCATTGGTTTGAATTCAACAAACTCATCTCAAGGACATCACCTTCAACAAAACAATGGCTATGGCTACC
AGCAGCCATCTTCAACCTCTGGCTTCCACATGACTGGTCAATATCAGCAACAACAACCAACCTCCATCAATGCCTACAACCAGTATCATCATCAGCCGCCATTGATGAAC
ATGAACATGCTAACCAGACAAGCAATGAACCAGCAGCCGCAGATGATGTACAACAGGGCTCACTTGGTTCCACCAAACACAGGATATTACTACAATTACAATCCTAGCCC
TGTCCAACCAGGTTATCCCTATGTTGAGGCTGGTTATCAGCAAGGCCATAATAGTAACTCTGCAGCCGATATGTTCAGTGATGAGAACACAAGCAGCAGCTGCTCCATCA
TGACCAAATCTCAGCCATCGGCGGCGAAGTCGAGCGTCTCCTGCTCCGCTATTAACCGCTACTCCGCCGGGGTGAGTTTGCCCCTTCAATTTTCCTTCCTTCATTTCGAA
ATAGCAAACAGTTTCCGGCGTAGTGTGTTAATTCGTTTGCGATTGTGTCTGGAACTGAATACGCAGTGTAAAAGGCAGTATACGAGTGTCATGATAATACCGACGGGCGT
AGGCGCCGCCATTGGTGGATATGCAGGTGACGCTCTGCCGGTTGCTCGCGCTCTCGCCTCCGTCGTCGATTGCCTTATCACTCACCCTAACGTACTTAATGCAGCAATGC
TTTACTGGCCGATGCACAATGTGCTTTATGTTGAAGGCCATGCACTTGATCGGTTTGCAGAAGGTTCATGGGCCCTAAAACCTGTTCACCAGAATCGGATAGGATTGGTT
CTTGATGCTGGAATTGAGGAAGAGCTTCGAATTCGTCACTTGCAAGTGGCTGATGCTGCTAGAGCTTCTCTTGGATTGCCTGTGATGGAATATATTGTCACAGAGACACC
TTTAGTGGTAGAGAAGTGGATTGATCCCAAAACTGGGCAATCAACTGGGAGGATTAGACACCCTGCCTCACTACTAAGAGCTGTGCAGACATTAATGGACCGGTCAAAGG
TAAATGCTGTTGCAGTTGTTGGACGATTCCCAGACGACGATGTTGAAGAGACAGATAACTATCGTCAAGGGACGGGAGTTGATGCTTTGGCAGGGGTTGAGGCTATCATT
AGCCATCTTGTGGTGAAGGAGTTCCAGATTCCATGTGCTCATGCTCCTGCTTTATCACCTACTCCCTTATGCAAATCTCTATCTCCAAAATCTGCTGCTGAGGAGTTAGG
ATACACATTCTTACCATGTGTACTTTCTGGGCTAAGTAATGCGCCTCAATACTTGAGCAAGAGTTCAGAATCATTGGGGAAGGATTTCATATTGGCAAATGATGTTGATA
GTGTCATTCTACCTATAGATGCTTGTGGAGGGGATGGCGCTCTTGCTTTTGCCAGAAGCAACCAGTACAAGCCACTTATTATTGCGGTTGAGGAAAATGAAACAGTTCTC
AGCGATACTCCAGACTCGCTTGGGATTGAGGCGGTAAAAGTCTCAAATTATTGGGAAGCCATAGGTGTTATTGCAGCACACAAGGCAGGAATAGATCCCTATTCCCTGCG
AAGAAATAGAATCAACAACGTTAACTGCATTTCCACTACATCTCCTAACGGTTATGCAGTTTCAAGTGCATCTCAACTATTCAACTGA
Protein sequence
MTKEEDFKLLKIQTCVLRVNIHCDGCRQKVKKLLQRIEGVFQVVIDAENQKVTVSGSVDSATLIKKLVRAGKHAELWSQKANPSPKPKNKDDKPANKGPKQPKLTSFNCE
EDDIDDCFGEEEGEDYEDEEFQFLKEKAAHLGLLRQQAIEANNAKKCIGISQIPGPATGNGKMNSSNSNSNNNKSGNGKKVVPNQQMGIKNIPSGIDQKAMAALRMNNAQ
HFSSGGGSGGRGGGSIINLGEAKRGNNDLNSMMNMAGFNGGNLVNFATPSSIGLNSTNSSQGHHLQQNNGYGYQQPSSTSGFHMTGQYQQQQPTSINAYNQYHHQPPLMN
MNMLTRQAMNQQPQMMYNRAHLVPPNTGYYYNYNPSPVQPGYPYVEAGYQQGHNSNSAADMFSDENTSSSCSIMTKSQPSAAKSSVSCSAINRYSAGVSLPLQFSFLHFE
IANSFRRSVLIRLRLCLELNTQCKRQYTSVMIIPTGVGAAIGGYAGDALPVARALASVVDCLITHPNVLNAAMLYWPMHNVLYVEGHALDRFAEGSWALKPVHQNRIGLV
LDAGIEEELRIRHLQVADAARASLGLPVMEYIVTETPLVVEKWIDPKTGQSTGRIRHPASLLRAVQTLMDRSKVNAVAVVGRFPDDDVEETDNYRQGTGVDALAGVEAII
SHLVVKEFQIPCAHAPALSPTPLCKSLSPKSAAEELGYTFLPCVLSGLSNAPQYLSKSSESLGKDFILANDVDSVILPIDACGGDGALAFARSNQYKPLIIAVEENETVL
SDTPDSLGIEAVKVSNYWEAIGVIAAHKAGIDPYSLRRNRINNVNCISTTSPNGYAVSSASQLFN
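
The CDS and protein sequences can be checked against each other by translation. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated in the record. `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons and last five codons of the CDS listed above.
print(translate("ATGACCAAAGAGGAAGATTTTAAG"))
print(translate("CAACTATTCAACTGA"))
```

The two fragments translate to `MTKEEDFK` and `QLFN*`, matching the start and end of the protein sequence, with `*` marking the terminal TGA stop codon.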