CuGenDBv2

CmoCh18G009580 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh18G009580
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Glycos_transf_1 domain-containing protein
Genome location: Cmo_Chr18:10668822..10670312
RNA-Seq Expression: CmoCh18G009580
Synteny: CmoCh18G009580
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains: IPR001296 - Glycosyl transferase, family 1
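
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state), the locus span can be derived as follows. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Cmo_Chr18:10668822..10670312")
print(chrom, start, end, span_length(start, end))
```

If the database instead used 0-based, half-open coordinates, the `+ 1` in `span_length` would have to be dropped.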


Homology
GenBank top hits | e value | %identity
KAG6573893.1 hypothetical protein SDJN03_27780, partial [Cucurbita argyrosperma subsp. sororia] | 2.2e-292 | 99.4
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTK DHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQP+DVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSI+KLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

KAG7012958.1 hypothetical protein SDJN02_25712, partial [Cucurbita argyrosperma subsp. argyrosperma] | 8.4e-292 | 99.19
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTK DHFKSQSLKTLFQELI+RLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQP+DVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSI+KLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

XP_022945089.1 uncharacterized protein LOC111449431 [Cucurbita moschata] | 1.5e-293 | 100
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

XP_022968340.1 uncharacterized protein LOC111467605 [Cucurbita maxima] | 1.2e-277 | 94.96
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLH TDRPLP+PK S SLKSRP  SS+IFFYCSSILILLLSISLF FTK DHFKSQSLKTLFQ+LIDRLNASRNPKQTSV NPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLES+DFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQP+DVNFFDPLNYSPFSLESVGTLVLGDKNM EVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGV LFLLTNPYHTDSDFGNKILDFVE+SGIQ+P SGWAPV+VVDTHIAQTDLP+VYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVE+MSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAK KGRRAREDMVRRFSPDVVAEIV  HIQRIF EV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

XP_023542823.1 uncharacterized protein LOC111802622 [Cucurbita pepo subsp. pepo] | 1.8e-286 | 97.59
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLHLTD+PLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTK DHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNM-EEVSLEKGFVFLSIFKWEFRK
        TMFETDRVSQEHVNRCN+MDFVWVPSEFHVSTFVKSGVDPSKVVKIVQP+DVNFFDPLNYSPFSLESVGTLVLGDKNM EEVSLEKGFVFLSIFKWEFRK
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNM-EEVSLEKGFVFLSIFKWEFRK

Query:  GWDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSL
        GWDLLLEAYLKEFSKNDGV LFLLTNPYHTD+DFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSM+L
Subjt:  GWDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSL

Query:  PVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        PVIATNWSGQTEFLTDENSYPL VERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAK KGRRAREDMVRRFSPDVVAEIV RHIQRIF EV
Subjt:  PVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

TrEMBL top hits | e value | %identity
A0A1S3BFB1 uncharacterized protein LOC103489373 | 3.6e-216 | 76.12
Query:  DRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSS
        DRP PNP   H  K   S       + SSILILLL+IS FAF K + +KSQS K     L + L  S  P           +  CVLWMAPFLSGGGYSS
Subjt:  DRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSS

Query:  EAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDR
        EAWSYILAL  H+ NP FRL I QHGDLES+DFWEGLP+SV+NLAIELHRT+CR+NET+V+CHSEPGAWNPPLFET PCPPG Y+ FKSVIGRTMFETDR
Subjt:  EAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDR

Query:  VSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSL--EKGFVFLSIFKWEFRKGWDLLL
        V+QEHVNRCN MD+VWVPSEFHVSTFV+SGVDPSK+VK+VQPVDVNFFDPL Y PFSLESVGTLVLG  N EEV L  +K FVFLSIFKWEFRKGWDLLL
Subjt:  VSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSL--EKGFVFLSIFKWEFRKGWDLLL

Query:  EAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATN
        EAYLKEFSK D V LFLLTNPYHT+SDFGNKILDFVE+S +Q P SGWAPV+VVD HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAM+MSLPVIATN
Subjt:  EAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATN

Query:  WSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE
        WSG TEFLTDENSYPL VERMSEVKE PFKGH+WAEPSISKL+VLMREV  NV+EAK+KGRRAREDM+ RFSPD+VA+IV R I+ IF E
Subjt:  WSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE

A0A5D3CDB1 Group 1 family glycosyltransferase | 3.6e-216 | 76.12
Query:  DRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSS
        DRP PNP   H  K   S       + SSILILLL+IS FAF K + +KSQS K     L + L  S  P           +  CVLWMAPFLSGGGYSS
Subjt:  DRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSS

Query:  EAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDR
        EAWSYILAL  H+ NP FRL I QHGDLES+DFWEGLP+SV+NLAIELHRT+CR+NET+V+CHSEPGAWNPPLFET PCPPG Y+ FKSVIGRTMFETDR
Subjt:  EAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDR

Query:  VSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSL--EKGFVFLSIFKWEFRKGWDLLL
        V+QEHVNRCN MD+VWVPSEFHVSTFV+SGVDPSK+VK+VQPVDVNFFDPL Y PFSLESVGTLVLG  N EEV L  +K FVFLSIFKWEFRKGWDLLL
Subjt:  VSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSL--EKGFVFLSIFKWEFRKGWDLLL

Query:  EAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATN
        EAYLKEFSK D V LFLLTNPYHT+SDFGNKILDFVE+S +Q P SGWAPV+VVD HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAM+MSLPVIATN
Subjt:  EAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATN

Query:  WSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE
        WSG TEFLTDENSYPL VERMSEVKE PFKGH+WAEPSISKL+VLMREV  NV+EAK+KGRRAREDM+ RFSPD+VA+IV R I+ IF E
Subjt:  WSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE

A0A6J1D8R8 uncharacterized protein LOC111018657 | 7.9e-219 | 77.33
Query:  DLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSG
        DLH  +   PNP   HSLK R S+         SILILL++IS F FTK DH K+QSLK L Q+ I  L    NP   S+P P  T   CVLWMAPFLSG
Subjt:  DLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSG

Query:  GGYSSEAWSYILALHDHVRNPN-FRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRT
        GGYSSEAWSYILALH H++ P+ FRLAIEQHGDLESIDFWEGLPDSV++LAI+LH T CR+NET+V+CHSEPGAWNPPLFET PCPPGVYQ FK+VIGRT
Subjt:  GGYSSEAWSYILALHDHVRNPN-FRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRT

Query:  MFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKGW
        MFETDRV+ EHVNRC  MD++WVPSEFHVSTFVKSGVDPSK+VKIVQP+DVNFFDPL Y PFSL S+GTLVLG K+M E+ L+ GFVFLSIFKWEFRKGW
Subjt:  MFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKGW

Query:  DLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPV
        DLLLEAYLKEFSK D V LFLLTNPYH+D DFGNKILDFVE+S IQ+PASGWAPV+V+DTHIAQTDLPR+YKAADAFVLPSRGEGWGRPLVEAM+MSLPV
Subjt:  DLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPV

Query:  IATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE
        IATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSI KL+ LMREV TNVDEAK KGR AR+DMVR+FSPD+VA+IV  HIQ +F +
Subjt:  IATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE

A0A6J1G004 uncharacterized protein LOC111449431 | 7.4e-294 | 100
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

A0A6J1HWY2 uncharacterized protein LOC111467605 | 5.7e-278 | 94.96
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLH TDRPLP+PK S SLKSRP  SS+IFFYCSSILILLLSISLF FTK DHFKSQSLKTLFQ+LIDRLNASRNPKQTSV NPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLES+DFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQP+DVNFFDPLNYSPFSLESVGTLVLGDKNM EVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGV LFLLTNPYHTDSDFGNKILDFVE+SGIQ+P SGWAPV+VVDTHIAQTDLP+VYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVE+MSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAK KGRRAREDMVRRFSPDVVAEIV  HIQRIF EV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

SwissProt top hits | e value | %identity
A7TZT2 Mannosylfructose-phosphate synthase | 7.8e-06 | 27.56
Query:  KGFVFLSIFKWEFRKGWDLLLEAYLKEFSKNDGVALFLLT---NPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLP
        +G V L++ +    KG+DLL++ +     +     L L     N    ++   N++ + V+  G++   +          ++A  DLP +Y+AAD FVL 
Subjt:  KGFVFLSIFKWEFRKGWDLLLEAYLKEFSKNDGVALFLLT---NPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLP

Query:  SRGEGWGRPLVEAMSMSLPVIATNWSG
        SR E +G   +EAM+   P + T   G
Subjt:  SRGEGWGRPLVEAMSMSLPVIATNWSG

Arabidopsis top hits | e value | %identity
AT3G10630.1 UDP-Glycosyltransferase superfamily protein | 4.0e-183 | 65.15
Query:  RPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLF---------QELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSSEAWSYI
        RP   S I  Y SSIL LLLSI L  FT  D +K QSL+  F         Q L+   + +   K  ++ NP S++  CVLWMAPFLS GGYSSEAWSY+
Subjt:  RPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLF---------QELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSSEAWSYI

Query:  LALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDRVSQEHV
        L+L +H+ NP FR+ IE HGDLES++FW GL    K +AIE++R +CR NETIVVCHSEPGAW PPLFET PCPP  Y++F SVIGRTMFETDRV+ EHV
Subjt:  LALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDRVSQEHV

Query:  NRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKGWDLLLEAYLKEFS
         RCN+MD VWVP++FHVS+FV+SGVD SKVVKIVQPVDV FFDP  Y P  L +VG LVLG        ++ GFVFLS+FKWE RKGWD+LL+AYL EFS
Subjt:  NRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKGWDLLLEAYLKEFS

Query:  KNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATNWSGQTEFL
          D VALFLLTN YH+DSDFGNKILDFVE   I+ P +G+  V+V+D HIAQ DLPR+YKAADAFVLP+RGEGWGRP+VEAM+MSLPVI TNWSG TE+L
Subjt:  KNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATNWSGQTEFL

Query:  TDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE
        T+ N YPL VE MSEVKEGPF+GH WAEPS+ KLRVLMR VM+N DEAK KG+R R+DMV+ F+P+VVA++V   I RIFDE
Subjt:  TDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE

AT5G01220.1 sulfoquinovosyldiacylglycerol 2 | 7.5e-04 | 29.52
Query:  DLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGR
        +L + Y + D FV+PS  E  G  ++EAMS  LPV+A    G  + + ++           E K G        E  ++KLR L+ +  T     +  G+
Subjt:  DLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSISKLRVLMREVMTNVDEAKEKGR

Query:  RARED
         ARE+
Subjt:  RARED
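
In the alignments above, the unlabeled middle line of each block follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, the counts for a single block can be read off the midline alone. The function name `midline_stats` is hypothetical, and the example string is the midline of the final block of the SwissProt hit A7TZT2, so the result describes that segment only, not the whole-alignment %identity reported in the table.

```python
def midline_stats(midline):
    """Return (identical, positive, length) for one BLAST-style midline.

    Assumes the midline has the same length as the aligned segment,
    i.e. trailing spaces were not stripped during extraction.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = identical + midline.count("+")  # positives include identities
    return identical, positive, len(midline)


# Midline of the last A7TZT2 alignment block in this record.
segment = "SR E +G   +EAM+   P + T   G"
identical, positive, length = midline_stats(segment)
print(identical, positive, length, round(100 * identical / length, 2))
```

Summing the identical counts over every block of a hit and dividing by the total alignment length would give the per-hit identity, provided the midlines are intact.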


Sequences
CDS sequence
ATGGATGACCTTCACCTCACAGATCGACCACTTCCCAATCCCAAACATTCCCACTCTCTCAAATCCCGACCTTCTTCTTCTTCCATGATCTTCTTCTACTGTTCATCCAT
TCTCATCCTCCTTCTATCAATTTCCCTCTTCGCTTTCACCAAAAAAGATCATTTCAAATCTCAATCTCTTAAAACCCTTTTTCAAGAACTCATCGATCGTCTCAACGCAT
CTCGAAATCCCAAGCAAACCTCTGTTCCTAATCCGTTTTCGACGTCCTCTCAATGTGTTCTTTGGATGGCTCCATTTCTTTCCGGCGGCGGATACAGTTCAGAAGCTTGG
TCCTACATTTTAGCCCTTCACGATCATGTAAGAAACCCTAATTTTCGTTTGGCTATTGAGCAACATGGCGATCTAGAATCCATTGATTTTTGGGAGGGCTTACCAGATTC
TGTGAAGAATTTGGCCATTGAACTTCATAGAACAAAATGTAGAATCAATGAAACTATTGTGGTTTGTCATAGTGAACCTGGTGCTTGGAATCCTCCTTTGTTTGAAACTT
TTCCTTGCCCACCAGGTGTTTACCAAAATTTCAAGTCAGTGATTGGCAGAACAATGTTTGAAACTGATAGAGTAAGTCAAGAACATGTTAATCGTTGTAATGAAATGGAT
TTTGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGAAAAGTGGGGTTGATCCTTCTAAAGTAGTGAAAATTGTTCAACCCGTTGATGTTAATTTCTTTGATCC
ATTGAATTATAGTCCATTTAGTCTTGAATCTGTAGGAACTCTTGTTCTAGGAGACAAAAACATGGAAGAAGTAAGCTTAGAGAAGGGATTTGTGTTCTTGAGTATCTTCA
AATGGGAATTTAGAAAAGGTTGGGATCTGTTATTAGAAGCATATTTGAAAGAATTCTCCAAGAATGATGGAGTTGCGTTGTTCTTATTGACAAATCCTTACCATACTGAT
AGTGATTTTGGGAACAAGATATTGGATTTTGTAGAACACTCAGGCATTCAAAGGCCAGCTTCTGGTTGGGCTCCTGTTCATGTGGTTGATACTCATATAGCTCAAACTGA
TTTGCCTAGAGTTTACAAGGCTGCAGATGCATTTGTTCTGCCGTCCCGAGGAGAGGGGTGGGGGAGACCGCTAGTCGAAGCGATGTCGATGTCGTTGCCGGTGATCGCCA
CCAACTGGTCGGGGCAAACGGAGTTTTTGACCGATGAGAATAGCTATCCATTGGCAGTTGAGAGAATGAGTGAAGTGAAGGAAGGGCCATTCAAAGGGCATCTGTGGGCT
GAGCCATCCATTAGTAAGCTTCGAGTTTTAATGAGGGAGGTAATGACCAACGTCGATGAAGCTAAGGAGAAAGGGCGGAGGGCGAGGGAGGACATGGTCAGGCGATTCTC
GCCCGACGTCGTGGCCGAGATTGTTCGTCGTCATATACAAAGGATTTTTGATGAGGTGTGA
mRNA sequence
ATGGATGACCTTCACCTCACAGATCGACCACTTCCCAATCCCAAACATTCCCACTCTCTCAAATCCCGACCTTCTTCTTCTTCCATGATCTTCTTCTACTGTTCATCCAT
TCTCATCCTCCTTCTATCAATTTCCCTCTTCGCTTTCACCAAAAAAGATCATTTCAAATCTCAATCTCTTAAAACCCTTTTTCAAGAACTCATCGATCGTCTCAACGCAT
CTCGAAATCCCAAGCAAACCTCTGTTCCTAATCCGTTTTCGACGTCCTCTCAATGTGTTCTTTGGATGGCTCCATTTCTTTCCGGCGGCGGATACAGTTCAGAAGCTTGG
TCCTACATTTTAGCCCTTCACGATCATGTAAGAAACCCTAATTTTCGTTTGGCTATTGAGCAACATGGCGATCTAGAATCCATTGATTTTTGGGAGGGCTTACCAGATTC
TGTGAAGAATTTGGCCATTGAACTTCATAGAACAAAATGTAGAATCAATGAAACTATTGTGGTTTGTCATAGTGAACCTGGTGCTTGGAATCCTCCTTTGTTTGAAACTT
TTCCTTGCCCACCAGGTGTTTACCAAAATTTCAAGTCAGTGATTGGCAGAACAATGTTTGAAACTGATAGAGTAAGTCAAGAACATGTTAATCGTTGTAATGAAATGGAT
TTTGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGAAAAGTGGGGTTGATCCTTCTAAAGTAGTGAAAATTGTTCAACCCGTTGATGTTAATTTCTTTGATCC
ATTGAATTATAGTCCATTTAGTCTTGAATCTGTAGGAACTCTTGTTCTAGGAGACAAAAACATGGAAGAAGTAAGCTTAGAGAAGGGATTTGTGTTCTTGAGTATCTTCA
AATGGGAATTTAGAAAAGGTTGGGATCTGTTATTAGAAGCATATTTGAAAGAATTCTCCAAGAATGATGGAGTTGCGTTGTTCTTATTGACAAATCCTTACCATACTGAT
AGTGATTTTGGGAACAAGATATTGGATTTTGTAGAACACTCAGGCATTCAAAGGCCAGCTTCTGGTTGGGCTCCTGTTCATGTGGTTGATACTCATATAGCTCAAACTGA
TTTGCCTAGAGTTTACAAGGCTGCAGATGCATTTGTTCTGCCGTCCCGAGGAGAGGGGTGGGGGAGACCGCTAGTCGAAGCGATGTCGATGTCGTTGCCGGTGATCGCCA
CCAACTGGTCGGGGCAAACGGAGTTTTTGACCGATGAGAATAGCTATCCATTGGCAGTTGAGAGAATGAGTGAAGTGAAGGAAGGGCCATTCAAAGGGCATCTGTGGGCT
GAGCCATCCATTAGTAAGCTTCGAGTTTTAATGAGGGAGGTAATGACCAACGTCGATGAAGCTAAGGAGAAAGGGCGGAGGGCGAGGGAGGACATGGTCAGGCGATTCTC
GCCCGACGTCGTGGCCGAGATTGTTCGTCGTCATATACAAAGGATTTTTGATGAGGTGTGA
Protein sequence
MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKKDHFKSQSLKTLFQELIDRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSSEAW
SYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNEMD
FVWVPSEFHVSTFVKSGVDPSKVVKIVQPVDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKGWDLLLEAYLKEFSKNDGVALFLLTNPYHTD
SDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWA
EPSISKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
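
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (the record does not name a translation table) and uses a hypothetical helper, `translate`. Only the first 30 and the last 12 nucleotides of the CDS above are used as inputs, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS sequence in this record.
print(translate("ATGGATGACCTTCACCTCACAGATCGACCA"))
print(translate("GATGAGGTGTGA"))
```

Passing the full CDS, with its line breaks, to `translate` should reproduce the protein sequence listed above if the standard-code assumption holds.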