; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023246 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023246
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNucleotide-diphospho-sugar transferase
Genome locationtig00000892:1464326..1476835
RNA-Seq ExpressionSgr023246
SyntenySgr023246
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0009055 - electron transfer activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin
IPR029044 - Nucleotide-diphospho-sugar transferases


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605951.1 hypothetical protein SDJN03_03268, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0088.25Show/hide
Query:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD
        MVGHKR FLPLFLLVSVSILFF THFYSS+PPSFPSNPNP+FALT + SDPKT+GFTLIVK+LAYNRLDS+SRCLRSIANADYL+DRVHLHVYIDHF HD
Subjt:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD

Query:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV
        DAHI LNESHRILQFVD+ VWNFGEKL+H+RT NVGLQAQWLEAWWPSSDNEFAFVVEDDLE+SPLYYKFL+SLILNYYYNTSNY PSIYGASLQRPRFV
Subjt:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV

Query:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
        PGKHGNKIRLDD TR+FLYQLVGTWGQLLFPKPWKEFRLWYDEHKA GIKP+LDGMVTTGWYK+MGERIWTPWFIKFIHSR YFNIYTNFLHERALSTSH
Subjt:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH

Query:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV
        RD GVNYGK AGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKH+NIIIVSV GVSES VRNFLCH ERLNIRNY+
Subjt:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV

Query:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI
        LLGHESELLYDLTRRG+PVIYADQFLKT+KTF   AS+ T A+L+KLVLAKFYI+KSCL+LGYNPGLVD NMLFVNGDPFTD SLAD++F GQS E+FFI
Subjt:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI

Query:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG
        KSSSSAQKMWASHIVPEAEAMVESLMSRG+SSR  I FVGI  KLLEQHGAKFKTA++M+F +N+GNNPTN SLGDGK+L+F PADMNVNHVEKRLEE+G
Subjt:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG

Query:  FWFIDGDFSCKAV
        F  IDGDFSCKAV
Subjt:  FWFIDGDFSCKAV

KAG7035898.1 hypothetical protein SDJN02_02697, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0088.09Show/hide
Query:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD
        MVGHKR FLPLFLLVSVSILFF THFYSS+PPSFPSNPNP+FALT + SDPKT+GFTLIVK+LAYNRLDS+SRCLRSIANADYL+DRVHLHVYIDHF HD
Subjt:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD

Query:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV
        DAHI LNESHRILQFVD+ VWNFGEKL+H+RT NVGLQAQWLEAWWPSSDNEFAFVVEDDLE+SPLYYKFL+SLILNYYYNTSNY PSIYGASLQRPRFV
Subjt:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV

Query:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
        PGKHGNKIRLDD TR+FLYQLVGTWGQLLFPKPWKEFRLWYDEHKA GIKP+LDGMVTTGWYK+MGERIWTPWFIKFIHSR YFNIYTNFLHERALSTSH
Subjt:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH

Query:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV
        RD GVNYGK AGPDSHLLDGSSLDFNLLEMKPLTDLK YDFCFREVIPQRIVKSKSELGSVLHSVQKH+NIIIVSV GVSES VRNFLCH ERLNIRNY+
Subjt:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV

Query:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI
        LLGHESELLYDLTRRG+PVIYADQFLKT+KTF   AS+ T A+L+KLVLAKFYI+KSCL+LGYNPGLVD NMLFVNGDPFTD SLAD++F GQS E+FFI
Subjt:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI

Query:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG
        KSSSSAQKMWASHIVPEAEAMVESLMSRG+SSR  I FVGI  KLLEQHGAKFKTA++M+F +N+GNNPTN SLGDGK+L+F PADMNVNHVEKRLEE+G
Subjt:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG

Query:  FWFIDGDFSCKAV
        F  IDGDFSCKAV
Subjt:  FWFIDGDFSCKAV

XP_022153018.1 uncharacterized protein LOC111020622 [Momordica charantia]0.0e+0088.46Show/hide
Query:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD
        MVGHKRH LPLFLL+SVSILFF THFYSS+PPSFPSNPNPDFAL RKTSD KTSGFTLIVK+LAYNRLDS+SRCLRSIANADYLSDRVHLHVYIDHF HD
Subjt:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD

Query:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV
        DAHI LNESHRILQFVD+F W+FGEK+VH+RT N GLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFL++LILNYYYNTSNYAPSIYGASLQRPRFV
Subjt:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV

Query:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
        PGKHGNKIRLDD TRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYK+MGERIWTPWFIKFIHSRGYFNIYTNFL ERALSTSH
Subjt:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH

Query:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV
        RDAGVNYGK AGPDSHLLDGSSL+F+LLEMKPLTDL+WYDFCFREVIPQRIVKSKSELGSV+HSVQKHQNIIIVS+ GVSESM+RNFLCH ERLNIRNYV
Subjt:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV

Query:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVN-GDPFTDFSLADNVFIGQSFELFF
        LLGHESELLYDLTRRG+PVI AD FLKTIKT K  A EGTR++L KLVLAKFYIIKSCL+LGYNPGLVDEN+LFVN GDP  DFSLADN+FIGQSFELFF
Subjt:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVN-GDPFTDFSLADNVFIGQSFELFF

Query:  IKSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEM
        +KSSSSAQKMWA+HIVPEAE MVESLMS+G  S+D   FVGITTKLLEQHG KF+TADEM+FG+N+GN+PTNTSLGDGK+ VF PAD+N+NHVEKRLEE+
Subjt:  IKSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEM

Query:  GFWFIDGDFSCKAVY
        GFW IDGDFSCK VY
Subjt:  GFWFIDGDFSCKAVY

XP_022958533.1 uncharacterized protein LOC111459737 [Cucurbita moschata]0.0e+0087.77Show/hide
Query:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD
        MVGHKR FLPLFLLVSVSILFF THFYSS+PPSFPSNPNP+FALT + SDPKT+GFTLIVK+LAYNRLDS+SRCLRSIANADYL+DRVHLHVYIDHF HD
Subjt:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD

Query:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV
        DAHI LNESHRILQFVD+ VWNFGEKL+H+RT NVGLQAQWLEAWWPSSDNEFAFVVEDDLE+SPLYYKFL+SLILNYYYNTSNY PSIYGASLQRPRFV
Subjt:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV

Query:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
        PGKHGNKIRLDD TR+FLYQLVGTWGQLLFPKPWKEFRLWYDEHKA GIKP+LDGMVTTGWYK+MGERIWTPWFIKFIHSR YFNIYTNFLHERALSTSH
Subjt:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH

Query:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV
        RD GVNYGK AGPDSHLLDGSSLDFNLLEMKPLTDLK YDFCFREVIPQRIVKSKSELGSVLHSVQKH+NIIIVSV GVSES VRNFLCH ERLNIRNY+
Subjt:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV

Query:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI
        LLGHESELLYDLTRRG+PVIYADQFLKT+KTF   AS+ T A+L+KLVLAKFYI+KSCL+LGYNPGLVD NMLFVNGDPFTD  LAD++F GQS E+FF 
Subjt:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI

Query:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG
        KSSSSAQKMWASHIVPEAEAMVES MSRG+SSRD I FVGI  KLLEQHGAKFK A++M+F +N+GNNPTN SLGDGK+L+F PADMNVNHVEKRLEE+G
Subjt:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG

Query:  FWFIDGDFSCKAV
        FW IDGDFSCKAV
Subjt:  FWFIDGDFSCKAV

XP_023532732.1 uncharacterized protein LOC111794813 [Cucurbita pepo subsp. pepo]0.0e+0088.09Show/hide
Query:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD
        MVGHKR FLPLFLLVSVSILFF THFYSS+PPSFPSNPNP+FALT + SDPKTSGFTLIVK+LAYNRLDS+SRCLRSIANADYLSDRVHLHVYIDHF HD
Subjt:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD

Query:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV
        DAHI LNESHRILQFVD+ VWNFGEKL+H+RT NVGLQAQWLEAWWPSSDNEFAFVVEDDLE+SPLYYKFL+SLILNYYYNTSNY PSIYGASLQRPRFV
Subjt:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV

Query:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
        PGKHGNKIRLDD TR+FLYQLVGTWGQLLFPKPWKEFRLWYDEHKA GIKP+LDGMVTTGWYK+MGERIWTPWFIKFIHSR YFNIYTNFLHERALSTSH
Subjt:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH

Query:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV
        RD GVNYGK AGPDSHLLDGSSLDF+LLEMKPL+DLK YDFCFREVIPQRIVKSKSELGSVLHSVQKH+NIIIVSV GVSES VRNFLCH ERLNIRNY+
Subjt:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV

Query:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI
        LLGHESELLYDLTRRG+PVIYADQFL T+KTF   ASEGT A+L+KLVLAKFYI+KSCL+LGYNPGLVD NM+FVNGDPFTD SLAD++F GQS E+FFI
Subjt:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI

Query:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG
        KSSSSAQKMW SHIVPEAEAMVESLMSRG+SSR  I FVGI  KLLEQHGAKFKTA++M+F +N+GNNPTN SLGDGK+L+F PADMNVNHVEKRLEE+G
Subjt:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG

Query:  FWFIDGDFSCKAV
        FW IDGDFSCKAV
Subjt:  FWFIDGDFSCKAV

TrEMBL top hitse value%identityAlignment
A0A0A0KGZ3 Uncharacterized protein0.0e+0084.36Show/hide
Query:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD
        MVGHKRHFLPLFLLVSVSILFF THFYSS+PPSFPSNPNPDFAL+R+ +DPKTSGFTLIVK+LAYNRLDS+SRCLRSIANADYLSDRVHLHVYIDH+ +D
Subjt:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD

Query:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV
        DAHIKLNESHRILQFVDQF WNFGEK+VH+RT NVGLQAQWLEAWWPSSDNEFAFVVEDDLE+SPLYYKFL+SLI+NYYYNTSNY PSIYGASLQRPRFV
Subjt:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV

Query:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
        PGKHGNKI+LD+ TR+FLYQ+VGTWGQLLFP+PWKEFRLWYDEHKAMGIKP+LDGMVTTGWYK++GERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
Subjt:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH

Query:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV
        RDAGVNYGK AGPDSHLLD SSLDFNLLEMKP ++LKWYDFCFREV+PQRIVKSKSEL SVLHSVQKH+NI+IVSV GVSES +RNFLCH ERLNI+NY+
Subjt:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV

Query:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI
        LLGHESELL DLTRRG+PVIYADQFLKT+ T KF   EGT ++L+KLVLAKFY+IKSCL+LGY+PGLVD NMLFVNG+ FTD SLA+++  GQS+ELFFI
Subjt:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI

Query:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG
        KSSSS+QKMWASH+V EAEA++ESLMS+GSSS D I FVGI TKLLE+HG KFKTA+EM+FG+N+GNNP NTSLGDGK+LVF PA+ + N V+KRLEE+G
Subjt:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG

Query:  FWFIDGDFSCKAVY
        FW IDGD SCKAVY
Subjt:  FWFIDGDFSCKAVY

A0A5A7V7P3 Putative transferring glycosyl group transferase0.0e+0084.85Show/hide
Query:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD
        MVGHKRHFLPL LLVSVSILFF THFYSS+PPSFPSNPNPDFAL+ + SDPKTSGFTLIVK+LAYNRLDS+SRCLRSIANADYLSDRVHLHVYIDH+  D
Subjt:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD

Query:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV
        DAHIKLNESHRILQFVDQF WNFGEK+VH+RT NVGLQAQWLEAWWPSSDNEFAFVVEDDLE+SPLYYKFL+SLI+NYYYNTSNY PSIYGASLQRPRFV
Subjt:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV

Query:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
        PGKHGNKI+LD+ TRIFLYQ+VGTWGQLLFP+PWKEFRLWYDEHKAMGIKP+LDGMVTTGWYK++GERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
Subjt:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH

Query:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV
        RDAGVNYGK AGPDSHLLD SSLDFNLLEMKPL++LKWYDFCFREVIPQRIVK KSEL SVLHSVQKH+NIIIVS+ GVSES +RNFLCH ERLNI+NY+
Subjt:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV

Query:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI
        LLGHESELL DLTRRG+PVIYADQFLKT+ T KF   EGT ++L+KLVLAKFYII+SCL+LGY+PGLVD NMLFVNGDPFTD  LAD+V  GQS+ELFFI
Subjt:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI

Query:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG
        KSSS +QKMWASH+V EAEA++ESLMSRGSSS D I FVGI TKLLE+HG KFKTA+E +FGLN+GNNPTNTSLGDGK+LVF PA+ + N ++KRLEE+G
Subjt:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG

Query:  FWFIDGDFSCKAVY
        FW IDGD +CKAVY
Subjt:  FWFIDGDFSCKAVY

A0A6J1DGE7 uncharacterized protein LOC1110206220.0e+0088.46Show/hide
Query:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD
        MVGHKRH LPLFLL+SVSILFF THFYSS+PPSFPSNPNPDFAL RKTSD KTSGFTLIVK+LAYNRLDS+SRCLRSIANADYLSDRVHLHVYIDHF HD
Subjt:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD

Query:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV
        DAHI LNESHRILQFVD+F W+FGEK+VH+RT N GLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFL++LILNYYYNTSNYAPSIYGASLQRPRFV
Subjt:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV

Query:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
        PGKHGNKIRLDD TRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYK+MGERIWTPWFIKFIHSRGYFNIYTNFL ERALSTSH
Subjt:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH

Query:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV
        RDAGVNYGK AGPDSHLLDGSSL+F+LLEMKPLTDL+WYDFCFREVIPQRIVKSKSELGSV+HSVQKHQNIIIVS+ GVSESM+RNFLCH ERLNIRNYV
Subjt:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV

Query:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVN-GDPFTDFSLADNVFIGQSFELFF
        LLGHESELLYDLTRRG+PVI AD FLKTIKT K  A EGTR++L KLVLAKFYIIKSCL+LGYNPGLVDEN+LFVN GDP  DFSLADN+FIGQSFELFF
Subjt:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVN-GDPFTDFSLADNVFIGQSFELFF

Query:  IKSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEM
        +KSSSSAQKMWA+HIVPEAE MVESLMS+G  S+D   FVGITTKLLEQHG KF+TADEM+FG+N+GN+PTNTSLGDGK+ VF PAD+N+NHVEKRLEE+
Subjt:  IKSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEM

Query:  GFWFIDGDFSCKAVY
        GFW IDGDFSCK VY
Subjt:  GFWFIDGDFSCKAVY

A0A6J1H3B9 uncharacterized protein LOC1114597370.0e+0087.77Show/hide
Query:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD
        MVGHKR FLPLFLLVSVSILFF THFYSS+PPSFPSNPNP+FALT + SDPKT+GFTLIVK+LAYNRLDS+SRCLRSIANADYL+DRVHLHVYIDHF HD
Subjt:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD

Query:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV
        DAHI LNESHRILQFVD+ VWNFGEKL+H+RT NVGLQAQWLEAWWPSSDNEFAFVVEDDLE+SPLYYKFL+SLILNYYYNTSNY PSIYGASLQRPRFV
Subjt:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV

Query:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
        PGKHGNKIRLDD TR+FLYQLVGTWGQLLFPKPWKEFRLWYDEHKA GIKP+LDGMVTTGWYK+MGERIWTPWFIKFIHSR YFNIYTNFLHERALSTSH
Subjt:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH

Query:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV
        RD GVNYGK AGPDSHLLDGSSLDFNLLEMKPLTDLK YDFCFREVIPQRIVKSKSELGSVLHSVQKH+NIIIVSV GVSES VRNFLCH ERLNIRNY+
Subjt:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV

Query:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI
        LLGHESELLYDLTRRG+PVIYADQFLKT+KTF   AS+ T A+L+KLVLAKFYI+KSCL+LGYNPGLVD NMLFVNGDPFTD  LAD++F GQS E+FF 
Subjt:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI

Query:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG
        KSSSSAQKMWASHIVPEAEAMVES MSRG+SSRD I FVGI  KLLEQHGAKFK A++M+F +N+GNNPTN SLGDGK+L+F PADMNVNHVEKRLEE+G
Subjt:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG

Query:  FWFIDGDFSCKAV
        FW IDGDFSCKAV
Subjt:  FWFIDGDFSCKAV

A0A6J1JXN1 uncharacterized protein LOC111490716 isoform X10.0e+0086.95Show/hide
Query:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD
        MVGHKR FLPLFLLVSVSILFF THFYSS+PP FPSNPNP+FALT + +DPKTSGFTLIVK+LAYNRLDS+SRCLRSIANADYL+DRVHLHVYIDHF HD
Subjt:  MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHD

Query:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV
        DAHI LNESHRILQFVD+ VWNFGEKL+H+RT NVGLQAQWLEAWWPSSDNEFAF+VEDDLE+SPLYYKFL+SLIL YYYNTSNY PSIYGASLQRPRFV
Subjt:  DAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFV

Query:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH
        PGKHGNKIRLDD TR+FLYQLVGTWGQLLFPKPWKEFRLWYDEHKA GIKP+LDGMVTTGWYK+MGERIWTPWFIKFIHSR YFNIYTNFLHERALSTSH
Subjt:  PGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSH

Query:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV
        RD GVNYGK AGPDSHLLDGSSLDFNLLEMKPLTDLK YDFCFRE+IPQRIVKSKSELGSVLH VQKH+NIIIVSV GVSES VRNFLCH ERLNIRNY+
Subjt:  RDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYV

Query:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI
        LLGHESELLYDL RRG+PVIYADQFLKT+KTF   ASEGT A+L+KLVLAKFYI+KSCL+LGYNPGLVD NMLFVNGDPF D SLAD++F GQSFE FFI
Subjt:  LLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFI

Query:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG
        K SSSA KMW SHIVPEAEAMVESLM+RG+SSRD I FVGI  KLLEQHGAKFK A++M+F +N+GNNPTN SLGDGK+L+F PADMNVNHVEKRLEE+G
Subjt:  KSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMG

Query:  FWFIDGDFSCKAV
        FW IDGDFSCKAV
Subjt:  FWFIDGDFSCKAV

SwissProt top hitse value%identityAlignment
P29602 Cucumber peeling cupredoxin3.6e-0431.03Show/hide
Query:  YTVGDSLGWYDKLEKPAVDYQKWTAGKNFGLGDFLIFN-TDNNHSVVQTYNFTTYKLCDYDDASENDTTEWSAADPSATTPHGVTVPVPLVKEGMNYFFS
        + VGD+ GW   +      Y +W AGK F +GD L FN   N H+V +     ++  C++ + S+ND           T+P    V   L + GM+YF  
Subjt:  YTVGDSLGWYDKLEKPAVDYQKWTAGKNFGLGDFLIFN-TDNNHSVVQTYNFTTYKLCDYDDASENDTTEWSAADPSATTPHGVTVPVPLVKEGMNYFFS

Query:  GHYDGEQCKNGQHFKINVTHGQGL----PKSMRDSSDESPAPISP
            G  C NGQ   INV          P S    S   P P+ P
Subjt:  GHYDGEQCKNGQHFKINVTHGQGL----PKSMRDSSDESPAPISP

Arabidopsis top hitse value%identityAlignment
AT1G08500.1 early nodulin-like protein 187.5e-6658.45Show/hide
Query:  LFSLLFLSLSGRSVVHAYKNYTVGDSLGWYDKLEKPAVDYQKWTAGKNFGLGDFLIFNTDNNHSVVQTYNFTTYKLCDYDDASENDTTEWSAADPSATTP
        +  L+ LSLS      AYKNYTVG+S GW+D  E+P+ +YQKW   K+F LGDFLIFNTD+NHSVVQTY+F TYK CDYD+   NDTTEWSAA+PSAT+P
Subjt:  LFSLLFLSLSGRSVVHAYKNYTVGDSLGWYDKLEKPAVDYQKWTAGKNFGLGDFLIFNTDNNHSVVQTYNFTTYKLCDYDDASENDTTEWSAADPSATTP

Query:  HGVTVPVPLVKEGMNYFFSGHYDGEQCKNGQHFKINVTHGQGLPKSMRDSSDESPAPISPDSGDDTEAAPDTIVPSNFDHPRN-DVADNDKEASRSVSLS
          V++ VPLVKEG NYFFSG+YDGEQCK GQHF INVTHGQGLP S       +P P       D E APDTIVP+NFDHP++ + AD+DKE     S S
Subjt:  HGVTVPVPLVKEGMNYFFSGHYDGEQCKNGQHFKINVTHGQGLPKSMRDSSDESPAPISPDSGDDTEAAPDTIVPSNFDHPRN-DVADNDKEASRSVSLS

Query:  LHFKFNEFFLLLVGIACIF
           K + F  + +G+   F
Subjt:  LHFKFNEFFLLLVGIACIF

AT1G45063.1 copper ion binding;electron carriers3.3e-0528.93Show/hide
Query:  YTVGDSLGWYDKLEKPAVDYQKWTAGKNFGLGDFLIFNTDNN-HSVVQTYNFTTYKLCDYDDASENDTTEWSAADPSATTPHGVTVPVPLVKEGMNYFFS
        Y VGDS GW  K       Y  WT  K   +GD LIF  D+N + V Q      Y+ CD             ++ P A    G  V V   + G  YF +
Subjt:  YTVGDSLGWYDKLEKPAVDYQKWTAGKNFGLGDFLIFNTDNN-HSVVQTYNFTTYKLCDYDDASENDTTEWSAADPSATTPHGVTVPVPLVKEGMNYFFS

Query:  GHYDGEQCKNGQHFKINVTHGQGLPKSMRDSSDESPAPISPDSGDDTEAAPDTIVPSNFDHPRND-----VADND------KEASRSVSLSLHFKFN
         ++   QC +GQ   + V H         D S  SP P+           P  I+PS   +   D     V D+D      KE   +V   L F++N
Subjt:  GHYDGEQCKNGQHFKINVTHGQGLPKSMRDSSDESPAPISPDSGDDTEAAPDTIVPSNFDHPRND-----VADND------KEASRSVSLSLHFKFN

AT1G45063.2 copper ion binding;electron carriers3.3e-0528.93Show/hide
Query:  YTVGDSLGWYDKLEKPAVDYQKWTAGKNFGLGDFLIFNTDNN-HSVVQTYNFTTYKLCDYDDASENDTTEWSAADPSATTPHGVTVPVPLVKEGMNYFFS
        Y VGDS GW  K       Y  WT  K   +GD LIF  D+N + V Q      Y+ CD             ++ P A    G  V V   + G  YF +
Subjt:  YTVGDSLGWYDKLEKPAVDYQKWTAGKNFGLGDFLIFNTDNN-HSVVQTYNFTTYKLCDYDDASENDTTEWSAADPSATTPHGVTVPVPLVKEGMNYFFS

Query:  GHYDGEQCKNGQHFKINVTHGQGLPKSMRDSSDESPAPISPDSGDDTEAAPDTIVPSNFDHPRND-----VADND------KEASRSVSLSLHFKFN
         ++   QC +GQ   + V H         D S  SP P+           P  I+PS   +   D     V D+D      KE   +V   L F++N
Subjt:  GHYDGEQCKNGQHFKINVTHGQGLPKSMRDSSDESPAPISPDSGDDTEAAPDTIVPSNFDHPRND-----VADND------KEASRSVSLSLHFKFN

AT5G12260.1 BEST Arabidopsis thaliana protein match is: glycosyltransferase family protein 2 (TAIR:AT5G60700.1)8.0e-18552.02Show/hide
Query:  KRHFLPLFL-LVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADY----LSDRVHLHVYIDHF--
        K++ LPL L L  +S++         +  +  SN        R  +DP  + FT ++K+L +NRL SLSRCLRS++ ADY       R+HLHVYIDHF  
Subjt:  KRHFLPLFL-LVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADY----LSDRVHLHVYIDHF--

Query:  SHDDAHIK--LNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQ
        + +D  ++  L  +  IL FVD+F W FGEK+VH+RT N GLQAQWLEAWWP SD+EFAFVVEDDLEVSPLYY  L+ LIL YYY+TSN+ PSIYGASLQ
Subjt:  SHDDAHIK--LNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQ

Query:  RPRFVPGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERA
        RPRFVPGKHGNK+ +D +T + LYQLVGTWGQLLFPKPWKEFRLWYDEHK+ G KP LDGMV+ GWYKR+GERIWTPWFIKF+HSRGYFNIYT+F +E A
Subjt:  RPRFVPGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERA

Query:  LSTSHRDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLN
        LS SHRDAGVNYG+ AGPDS LL+ SSL  + L+++PL++LKWYDFCF EV+P R+V++ +ELG++L SVQ+ + II+VS+ G  +  +RN LCH E+LN
Subjt:  LSTSHRDAGVNYGKKAGPDSHLLDGSSLDFNLLEMKPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLN

Query:  IRNYVLLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLAD-NVFIGQS
        I+N++ +G  SEL +DL+RRG+PVI AD FL      K   S+ +  + +K  L   Y++K CL+LGY+      N L V+     D   +D +++IG+S
Subjt:  IRNYVLLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGTRADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLAD-NVFIGQS

Query:  FELFFIKSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEK
          +  ++SSS  QK+W++  +              +S + S+ F+ +  +L+EQ G + KT + M    N   N  N SLGDGK +V+   ++  N +  
Subjt:  FELFFIKSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHGAKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEK

Query:  RLEEMGFWFIDGDFSCKAV
        +LEE+  W ID D SCKAV
Subjt:  RLEEMGFWFIDGDFSCKAV

AT5G60700.1 glycosyltransferase family protein 27.8e-3129.93Show/hide
Query:  RKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHDDAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAW
        R T+ P  +   + V I+  NR  SL R LRS++NA YL D + L   +D         K++E    +  V  F W  G K +  R    GL     E+W
Subjt:  RKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHDDAHIKLNESHRILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAW

Query:  WPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYY----------NTSNYAPSIYGASLQRPRFVPGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWK
        +P+SD++F  ++EDD+EVSP Y+ ++K  +L Y+Y          + S Y P I     +RP++ P     +I        +L+QL  +WG + FPK W+
Subjt:  WPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYY----------NTSNYAPSIYGASLQRPRFVPGKHGNKIRLDDRTRIFLYQLVGTWGQLLFPKPWK

Query:  EFRLWYD---EHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSHRDAGVNYGKKAGPDSH
        EF ++ +      A      +    T GW     +  W  + I  ++ RGY ++Y NF ++ + ST+H + G +   K     H
Subjt:  EFRLWYD---EHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSHRDAGVNYGKKAGPDSH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGGTCACAAGAGACATTTTCTTCCTCTCTTCCTCCTCGTGTCTGTTTCCATCCTCTTCTTCTGCACTCACTTCTACTCCTCCCTTCCCCCCTCCTTCCCTTCAAA
CCCTAACCCCGATTTCGCTCTCACCCGGAAAACCTCTGACCCAAAAACCTCCGGTTTCACTCTCATCGTCAAGATTCTCGCTTACAATCGCCTTGATTCCTTGTCCCGCT
GCCTCCGGTCCATCGCCAATGCGGATTATCTCTCGGACCGCGTCCATCTCCATGTCTACATTGATCATTTTTCTCATGACGATGCCCATATCAAGCTCAACGAGTCGCAT
CGGATCTTGCAATTCGTTGATCAATTCGTTTGGAATTTCGGGGAAAAGCTAGTGCATTTTCGGACGAGGAACGTGGGTTTGCAAGCACAGTGGTTGGAGGCTTGGTGGCC
GAGCTCTGACAATGAGTTTGCGTTCGTTGTGGAAGATGATTTGGAGGTCTCGCCGCTTTATTATAAGTTCCTGAAGAGTTTAATTCTGAATTATTATTATAACACTTCGA
ATTATGCTCCTTCAATATACGGAGCTTCGCTGCAGCGACCAAGGTTTGTTCCAGGTAAACATGGTAACAAAATAAGGTTGGATGACAGAACCAGAATTTTTTTGTACCAG
CTGGTTGGCACTTGGGGTCAGCTTCTCTTTCCAAAGCCTTGGAAGGAGTTCAGATTGTGGTATGATGAGCACAAAGCGATGGGCATCAAGCCAATGCTTGATGGCATGGT
GACGACAGGATGGTACAAGAGGATGGGAGAAAGAATATGGACTCCTTGGTTTATTAAATTTATCCATTCTCGTGGCTATTTCAATATCTACACCAATTTTTTACATGAGA
GAGCACTGAGTACTTCTCACAGGGATGCTGGTGTAAATTATGGAAAAAAAGCTGGTCCTGATTCACATTTACTGGATGGAAGTTCACTTGATTTCAATCTCCTGGAGATG
AAGCCTTTAACCGATTTGAAGTGGTATGATTTCTGCTTTAGAGAAGTGATTCCTCAAAGAATTGTTAAAAGCAAGAGTGAGCTTGGTTCTGTTCTTCACTCTGTTCAGAA
ACACCAAAACATAATCATTGTGAGCGTATCTGGTGTATCAGAGTCCATGGTGAGGAATTTTCTTTGCCACTTGGAGAGGCTAAACATAAGGAACTATGTTTTGTTGGGCC
ATGAATCCGAGTTACTCTATGATCTAACGAGAAGAGGATACCCAGTAATTTATGCAGACCAATTTCTGAAGACAATCAAAACTTTCAAATTTCCGGCTTCTGAAGGCACC
AGGGCAGACCTGATGAAATTAGTTCTAGCAAAGTTCTACATTATTAAGTCATGCTTGCAACTTGGGTACAATCCGGGGTTGGTTGATGAAAACATGCTTTTTGTCAATGG
TGATCCATTCACAGACTTCAGTCTTGCTGACAATGTCTTTATCGGGCAGAGCTTCGAACTTTTCTTCATTAAAAGCTCATCTTCTGCCCAGAAGATGTGGGCTAGTCATA
TTGTTCCCGAGGCTGAAGCCATGGTAGAGTCTTTGATGAGCAGAGGTTCCTCGTCGAGAGACAGCATTGGTTTTGTAGGCATAACAACAAAATTATTGGAACAACATGGT
GCTAAATTTAAAACAGCTGATGAGATGACCTTTGGTCTGAACGTAGGTAATAATCCTACCAACACATCATTAGGGGATGGGAAAAAGTTGGTGTTTTTGCCTGCTGATAT
GAATGTTAATCATGTAGAGAAACGGCTTGAGGAGATGGGTTTTTGGTTCATTGATGGTGACTTTTCTTGCAAGGCTGTTTACTCTAGATTTGCACAAGCTGATATGACAC
CCACAAAGGAAACAACATCAGGCTTCCTACCAAGCAGCCTCATCCCTGAGAACAAATTCAGTGACTCCGAGCAATGGCTAGTTTCGGCATATCCTAGAATTAATATGCAA
CCGCATTTCGCATACATGGCTGTAAGAGCGTTGGAAACAAACAAATCAGATGTTAGTCCTAAACGGATTCCCATGGCATGTATTTCTTTGCCAGGACCAAGGAAACCCAA
ACGTGCACAAGCAGGAAGAACGTTGGTGAAAGTCACTGCATTGGGATTTTCTCCACTCCCTTGCATTAGTATTACAAACCCCGCAATTGCCATAAACCATCAGCAGTGTA
TTGCCAACATAGACATCTGTATCAAAGCCCAGCTTGAACACAACCCCGTGAATCTCCATACCTTTGCGAATATCCAAAGTGTTAGAACACAACTTGAGAACGAAAGGAAA
TGTATGGTCATCGGGTTGAACACCGCTTCGGACCATCCTGTTATACGTCTCGAACCCATCAAGCATCCCGGCTATGGAGTGAGCCCTTATCAGGGTATTCCACAAGAACG
CAGTGCGGCAATTCTGAACAGTTTGTTGGAAGAGAGTACGGAACGATTCTGGGTCTTGGAACGTGGCGTAATTGAGAATAAGGGAAGCACAAAGCGAGACGCTGCGGGGA
AGCAAGCCATTGAGAAGGCAAAGGGCGTGGAGTTGTTTGGTCTGACGAAGGCATTGGGCGTCGGAACACAGAGTGAGGAGGTCGACGTGAATGGAAGATGGCCTGAAAAC
TGCATGGGAATGGTGATGCGAAGTGAAGACCAATGCTCCGAGTACATGAATGTAACATTAAAGGTGGGAATATGGCAGAATCATGAAAACGAGAGAGCTCGCGGCCGTTT
GCTGAAACAAAGCGCGGTGAACGAGATGACTTGCGGGCGGGGCTGGTCGTTACATCTCCACCGCTCTTTGAGACCCGCCAGACATCCAGCAGAGGCAAAATCTTGCAGTG
CTTCGCCTCCAATAATTCACGGATCCCTAGAAGAATTAATGGAACAGAGAAAGTACAGAGCCGATCTTCTTCCATCCAGTTTCCTTTTCTCCCTTCTCTTCCTCTCCCTC
TCAGGCCGATCTGTCGTCCACGCCTACAAGAACTACACCGTCGGAGATTCCCTCGGCTGGTACGACAAATTGGAGAAGCCCGCCGTCGATTACCAGAAGTGGACCGCCGG
CAAGAACTTCGGCCTCGGCGATTTCCTCATTTTTAACACCGACAACAATCACTCGGTCGTGCAGACGTACAATTTTACCACCTACAAGCTCTGTGACTACGACGACGCAT
CGGAAAACGACACGACGGAATGGTCCGCCGCCGATCCGTCGGCGACCACTCCGCACGGCGTCACGGTGCCCGTCCCACTGGTGAAAGAAGGGATGAACTATTTCTTCTCC
GGCCACTACGACGGAGAGCAGTGCAAAAATGGGCAACACTTCAAAATAAACGTCACTCACGGACAGGGCCTGCCAAAGAGCATGAGAGATTCTTCCGACGAATCTCCGGC
TCCAATAAGTCCCGACTCCGGCGACGACACCGAAGCCGCACCGGATACGATCGTCCCTTCCAACTTCGACCATCCTCGAAACGACGTCGCCGATAACGACAAGGAAGCTT
CACGCTCGGTTTCGTTATCGCTTCACTTCAAGTTTAATGAGTTTTTCTTGTTGTTAGTTGGTATTGCCTGCATATTTATTACTACAGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGGTCACAAGAGACATTTTCTTCCTCTCTTCCTCCTCGTGTCTGTTTCCATCCTCTTCTTCTGCACTCACTTCTACTCCTCCCTTCCCCCCTCCTTCCCTTCAAA
CCCTAACCCCGATTTCGCTCTCACCCGGAAAACCTCTGACCCAAAAACCTCCGGTTTCACTCTCATCGTCAAGATTCTCGCTTACAATCGCCTTGATTCCTTGTCCCGCT
GCCTCCGGTCCATCGCCAATGCGGATTATCTCTCGGACCGCGTCCATCTCCATGTCTACATTGATCATTTTTCTCATGACGATGCCCATATCAAGCTCAACGAGTCGCAT
CGGATCTTGCAATTCGTTGATCAATTCGTTTGGAATTTCGGGGAAAAGCTAGTGCATTTTCGGACGAGGAACGTGGGTTTGCAAGCACAGTGGTTGGAGGCTTGGTGGCC
GAGCTCTGACAATGAGTTTGCGTTCGTTGTGGAAGATGATTTGGAGGTCTCGCCGCTTTATTATAAGTTCCTGAAGAGTTTAATTCTGAATTATTATTATAACACTTCGA
ATTATGCTCCTTCAATATACGGAGCTTCGCTGCAGCGACCAAGGTTTGTTCCAGGTAAACATGGTAACAAAATAAGGTTGGATGACAGAACCAGAATTTTTTTGTACCAG
CTGGTTGGCACTTGGGGTCAGCTTCTCTTTCCAAAGCCTTGGAAGGAGTTCAGATTGTGGTATGATGAGCACAAAGCGATGGGCATCAAGCCAATGCTTGATGGCATGGT
GACGACAGGATGGTACAAGAGGATGGGAGAAAGAATATGGACTCCTTGGTTTATTAAATTTATCCATTCTCGTGGCTATTTCAATATCTACACCAATTTTTTACATGAGA
GAGCACTGAGTACTTCTCACAGGGATGCTGGTGTAAATTATGGAAAAAAAGCTGGTCCTGATTCACATTTACTGGATGGAAGTTCACTTGATTTCAATCTCCTGGAGATG
AAGCCTTTAACCGATTTGAAGTGGTATGATTTCTGCTTTAGAGAAGTGATTCCTCAAAGAATTGTTAAAAGCAAGAGTGAGCTTGGTTCTGTTCTTCACTCTGTTCAGAA
ACACCAAAACATAATCATTGTGAGCGTATCTGGTGTATCAGAGTCCATGGTGAGGAATTTTCTTTGCCACTTGGAGAGGCTAAACATAAGGAACTATGTTTTGTTGGGCC
ATGAATCCGAGTTACTCTATGATCTAACGAGAAGAGGATACCCAGTAATTTATGCAGACCAATTTCTGAAGACAATCAAAACTTTCAAATTTCCGGCTTCTGAAGGCACC
AGGGCAGACCTGATGAAATTAGTTCTAGCAAAGTTCTACATTATTAAGTCATGCTTGCAACTTGGGTACAATCCGGGGTTGGTTGATGAAAACATGCTTTTTGTCAATGG
TGATCCATTCACAGACTTCAGTCTTGCTGACAATGTCTTTATCGGGCAGAGCTTCGAACTTTTCTTCATTAAAAGCTCATCTTCTGCCCAGAAGATGTGGGCTAGTCATA
TTGTTCCCGAGGCTGAAGCCATGGTAGAGTCTTTGATGAGCAGAGGTTCCTCGTCGAGAGACAGCATTGGTTTTGTAGGCATAACAACAAAATTATTGGAACAACATGGT
GCTAAATTTAAAACAGCTGATGAGATGACCTTTGGTCTGAACGTAGGTAATAATCCTACCAACACATCATTAGGGGATGGGAAAAAGTTGGTGTTTTTGCCTGCTGATAT
GAATGTTAATCATGTAGAGAAACGGCTTGAGGAGATGGGTTTTTGGTTCATTGATGGTGACTTTTCTTGCAAGGCTGTTTACTCTAGATTTGCACAAGCTGATATGACAC
CCACAAAGGAAACAACATCAGGCTTCCTACCAAGCAGCCTCATCCCTGAGAACAAATTCAGTGACTCCGAGCAATGGCTAGTTTCGGCATATCCTAGAATTAATATGCAA
CCGCATTTCGCATACATGGCTGTAAGAGCGTTGGAAACAAACAAATCAGATGTTAGTCCTAAACGGATTCCCATGGCATGTATTTCTTTGCCAGGACCAAGGAAACCCAA
ACGTGCACAAGCAGGAAGAACGTTGGTGAAAGTCACTGCATTGGGATTTTCTCCACTCCCTTGCATTAGTATTACAAACCCCGCAATTGCCATAAACCATCAGCAGTGTA
TTGCCAACATAGACATCTGTATCAAAGCCCAGCTTGAACACAACCCCGTGAATCTCCATACCTTTGCGAATATCCAAAGTGTTAGAACACAACTTGAGAACGAAAGGAAA
TGTATGGTCATCGGGTTGAACACCGCTTCGGACCATCCTGTTATACGTCTCGAACCCATCAAGCATCCCGGCTATGGAGTGAGCCCTTATCAGGGTATTCCACAAGAACG
CAGTGCGGCAATTCTGAACAGTTTGTTGGAAGAGAGTACGGAACGATTCTGGGTCTTGGAACGTGGCGTAATTGAGAATAAGGGAAGCACAAAGCGAGACGCTGCGGGGA
AGCAAGCCATTGAGAAGGCAAAGGGCGTGGAGTTGTTTGGTCTGACGAAGGCATTGGGCGTCGGAACACAGAGTGAGGAGGTCGACGTGAATGGAAGATGGCCTGAAAAC
TGCATGGGAATGGTGATGCGAAGTGAAGACCAATGCTCCGAGTACATGAATGTAACATTAAAGGTGGGAATATGGCAGAATCATGAAAACGAGAGAGCTCGCGGCCGTTT
GCTGAAACAAAGCGCGGTGAACGAGATGACTTGCGGGCGGGGCTGGTCGTTACATCTCCACCGCTCTTTGAGACCCGCCAGACATCCAGCAGAGGCAAAATCTTGCAGTG
CTTCGCCTCCAATAATTCACGGATCCCTAGAAGAATTAATGGAACAGAGAAAGTACAGAGCCGATCTTCTTCCATCCAGTTTCCTTTTCTCCCTTCTCTTCCTCTCCCTC
TCAGGCCGATCTGTCGTCCACGCCTACAAGAACTACACCGTCGGAGATTCCCTCGGCTGGTACGACAAATTGGAGAAGCCCGCCGTCGATTACCAGAAGTGGACCGCCGG
CAAGAACTTCGGCCTCGGCGATTTCCTCATTTTTAACACCGACAACAATCACTCGGTCGTGCAGACGTACAATTTTACCACCTACAAGCTCTGTGACTACGACGACGCAT
CGGAAAACGACACGACGGAATGGTCCGCCGCCGATCCGTCGGCGACCACTCCGCACGGCGTCACGGTGCCCGTCCCACTGGTGAAAGAAGGGATGAACTATTTCTTCTCC
GGCCACTACGACGGAGAGCAGTGCAAAAATGGGCAACACTTCAAAATAAACGTCACTCACGGACAGGGCCTGCCAAAGAGCATGAGAGATTCTTCCGACGAATCTCCGGC
TCCAATAAGTCCCGACTCCGGCGACGACACCGAAGCCGCACCGGATACGATCGTCCCTTCCAACTTCGACCATCCTCGAAACGACGTCGCCGATAACGACAAGGAAGCTT
CACGCTCGGTTTCGTTATCGCTTCACTTCAAGTTTAATGAGTTTTTCTTGTTGTTAGTTGGTATTGCCTGCATATTTATTACTACAGAGTGA
Protein sequenceShow/hide protein sequence
MVGHKRHFLPLFLLVSVSILFFCTHFYSSLPPSFPSNPNPDFALTRKTSDPKTSGFTLIVKILAYNRLDSLSRCLRSIANADYLSDRVHLHVYIDHFSHDDAHIKLNESH
RILQFVDQFVWNFGEKLVHFRTRNVGLQAQWLEAWWPSSDNEFAFVVEDDLEVSPLYYKFLKSLILNYYYNTSNYAPSIYGASLQRPRFVPGKHGNKIRLDDRTRIFLYQ
LVGTWGQLLFPKPWKEFRLWYDEHKAMGIKPMLDGMVTTGWYKRMGERIWTPWFIKFIHSRGYFNIYTNFLHERALSTSHRDAGVNYGKKAGPDSHLLDGSSLDFNLLEM
KPLTDLKWYDFCFREVIPQRIVKSKSELGSVLHSVQKHQNIIIVSVSGVSESMVRNFLCHLERLNIRNYVLLGHESELLYDLTRRGYPVIYADQFLKTIKTFKFPASEGT
RADLMKLVLAKFYIIKSCLQLGYNPGLVDENMLFVNGDPFTDFSLADNVFIGQSFELFFIKSSSSAQKMWASHIVPEAEAMVESLMSRGSSSRDSIGFVGITTKLLEQHG
AKFKTADEMTFGLNVGNNPTNTSLGDGKKLVFLPADMNVNHVEKRLEEMGFWFIDGDFSCKAVYSRFAQADMTPTKETTSGFLPSSLIPENKFSDSEQWLVSAYPRINMQ
PHFAYMAVRALETNKSDVSPKRIPMACISLPGPRKPKRAQAGRTLVKVTALGFSPLPCISITNPAIAINHQQCIANIDICIKAQLEHNPVNLHTFANIQSVRTQLENERK
CMVIGLNTASDHPVIRLEPIKHPGYGVSPYQGIPQERSAAILNSLLEESTERFWVLERGVIENKGSTKRDAAGKQAIEKAKGVELFGLTKALGVGTQSEEVDVNGRWPEN
CMGMVMRSEDQCSEYMNVTLKVGIWQNHENERARGRLLKQSAVNEMTCGRGWSLHLHRSLRPARHPAEAKSCSASPPIIHGSLEELMEQRKYRADLLPSSFLFSLLFLSL
SGRSVVHAYKNYTVGDSLGWYDKLEKPAVDYQKWTAGKNFGLGDFLIFNTDNNHSVVQTYNFTTYKLCDYDDASENDTTEWSAADPSATTPHGVTVPVPLVKEGMNYFFS
GHYDGEQCKNGQHFKINVTHGQGLPKSMRDSSDESPAPISPDSGDDTEAAPDTIVPSNFDHPRNDVADNDKEASRSVSLSLHFKFNEFFLLLVGIACIFITTE