; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg19000 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg19000
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionGlycos_transf_1 domain-containing protein
Genome locationCarg_Chr18:9704683..9706173
RNA-Seq ExpressionCarg19000
SyntenyCarg19000
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573893.1 hypothetical protein SDJN03_27780, partial [Cucurbita argyrosperma subsp. sororia]5.8e-29399.8Show/hide
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELI+RLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

KAG7012958.1 hypothetical protein SDJN02_25712, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-293100Show/hide
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

XP_022945089.1 uncharacterized protein LOC111449431 [Cucurbita moschata]1.1e-29199.19Show/hide
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTK DHFKSQSLKTLFQELI+RLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQP+DVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSI+KLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

XP_022968340.1 uncharacterized protein LOC111467605 [Cucurbita maxima]2.0e-27794.96Show/hide
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLH TDRPLP+PK S SLKSRP  SS+IFFYCSSILILLLSISLF FTKTDHFKSQSLKTLFQ+LI+RLNASRNPKQTSV NPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLES+DFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNM EVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGV LFLLTNPYHTDSDFGNKILDFVE+SGIQ+P SGWAPV+VVDTHIAQTDLP+VYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVE+MSEVKEGPFKGHLWAEPSI+KLRVLMREVMTNVDEAK KGRRAREDMVRRFSPDVVAEIV  HIQRIF EV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

XP_023542823.1 uncharacterized protein LOC111802622 [Cucurbita pepo subsp. pepo]3.1e-28697.59Show/hide
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLHLTD+PLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELI+RLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNM-EEVSLEKGFVFLSIFKWEFRK
        TMFETDRVSQEHVNRCN+MDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNM EEVSLEKGFVFLSIFKWEFRK
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNM-EEVSLEKGFVFLSIFKWEFRK

Query:  GWDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSL
        GWDLLLEAYLKEFSKNDGV LFLLTNPYHTD+DFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSM+L
Subjt:  GWDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSL

Query:  PVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        PVIATNWSGQTEFLTDENSYPL VERMSEVKEGPFKGHLWAEPSI+KLRVLMREVMTNVDEAK KGRRAREDMVRRFSPDVVAEIV RHIQRIF EV
Subjt:  PVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

TrEMBL top hitse value%identityAlignment
A0A1S3BFB1 uncharacterized protein LOC1034893735.6e-21776.12Show/hide
Query:  DRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSS
        DRP PNP   H  K   S       + SSILILLL+IS FAF KT+ +KSQS K     L N L  S  P           +  CVLWMAPFLSGGGYSS
Subjt:  DRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSS

Query:  EAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDR
        EAWSYILAL  H+ NP FRL I QHGDLES+DFWEGLP+SV+NLAIELHRT+CR+NET+V+CHSEPGAWNPPLFET PCPPG Y+ FKSVIGRTMFETDR
Subjt:  EAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDR

Query:  VSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSL--EKGFVFLSIFKWEFRKGWDLLL
        V+QEHVNRCN MD+VWVPSEFHVSTFV+SGVDPSK+VK+VQP+DVNFFDPL Y PFSLESVGTLVLG  N EEV L  +K FVFLSIFKWEFRKGWDLLL
Subjt:  VSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSL--EKGFVFLSIFKWEFRKGWDLLL

Query:  EAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATN
        EAYLKEFSK D V LFLLTNPYHT+SDFGNKILDFVE+S +Q P SGWAPV+VVD HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAM+MSLPVIATN
Subjt:  EAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATN

Query:  WSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE
        WSG TEFLTDENSYPL VERMSEVKE PFKGH+WAEPSI+KL+VLMREV  NV+EAK+KGRRAREDM+ RFSPD+VA+IV R I+ IF E
Subjt:  WSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE

A0A5D3CDB1 Group 1 family glycosyltransferase5.6e-21776.12Show/hide
Query:  DRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSS
        DRP PNP   H  K   S       + SSILILLL+IS FAF KT+ +KSQS K     L N L  S  P           +  CVLWMAPFLSGGGYSS
Subjt:  DRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSS

Query:  EAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDR
        EAWSYILAL  H+ NP FRL I QHGDLES+DFWEGLP+SV+NLAIELHRT+CR+NET+V+CHSEPGAWNPPLFET PCPPG Y+ FKSVIGRTMFETDR
Subjt:  EAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDR

Query:  VSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSL--EKGFVFLSIFKWEFRKGWDLLL
        V+QEHVNRCN MD+VWVPSEFHVSTFV+SGVDPSK+VK+VQP+DVNFFDPL Y PFSLESVGTLVLG  N EEV L  +K FVFLSIFKWEFRKGWDLLL
Subjt:  VSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSL--EKGFVFLSIFKWEFRKGWDLLL

Query:  EAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATN
        EAYLKEFSK D V LFLLTNPYHT+SDFGNKILDFVE+S +Q P SGWAPV+VVD HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAM+MSLPVIATN
Subjt:  EAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATN

Query:  WSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE
        WSG TEFLTDENSYPL VERMSEVKE PFKGH+WAEPSI+KL+VLMREV  NV+EAK+KGRRAREDM+ RFSPD+VA+IV R I+ IF E
Subjt:  WSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE

A0A6J1D8R8 uncharacterized protein LOC1110186571.6e-21977.73Show/hide
Query:  DLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSG
        DLH  +   PNP   HSLK R S+         SILILL++IS F FTKTDH K+QSLK L Q+ I  L    NP   S+P P  T   CVLWMAPFLSG
Subjt:  DLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSG

Query:  GGYSSEAWSYILALHDHVRNPN-FRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRT
        GGYSSEAWSYILALH H++ P+ FRLAIEQHGDLESIDFWEGLPDSV++LAI+LH T CR+NET+V+CHSEPGAWNPPLFET PCPPGVYQ FK+VIGRT
Subjt:  GGYSSEAWSYILALHDHVRNPN-FRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRT

Query:  MFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKGW
        MFETDRV+ EHVNRC  MD++WVPSEFHVSTFVKSGVDPSK+VKIVQPIDVNFFDPL Y PFSL S+GTLVLG K+M E+ L+ GFVFLSIFKWEFRKGW
Subjt:  MFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKGW

Query:  DLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPV
        DLLLEAYLKEFSK D V LFLLTNPYH+D DFGNKILDFVE+S IQ+PASGWAPV+V+DTHIAQTDLPR+YKAADAFVLPSRGEGWGRPLVEAM+MSLPV
Subjt:  DLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPV

Query:  IATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE
        IATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSI KL+ LMREV TNVDEAK KGR AR+DMVR+FSPD+VA+IV  HIQ +F +
Subjt:  IATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE

A0A6J1G004 uncharacterized protein LOC1114494315.3e-29299.19Show/hide
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTK DHFKSQSLKTLFQELI+RLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQP+DVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSI+KLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

A0A6J1HWY2 uncharacterized protein LOC1114676059.8e-27894.96Show/hide
Query:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL
        MDDLH TDRPLP+PK S SLKSRP  SS+IFFYCSSILILLLSISLF FTKTDHFKSQSLKTLFQ+LI+RLNASRNPKQTSV NPFSTSSQCVLWMAPFL
Subjt:  MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFL

Query:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
        SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLES+DFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR
Subjt:  SGGGYSSEAWSYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGR

Query:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG
        TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNM EVSLEKGFVFLSIFKWEFRKG
Subjt:  TMFETDRVSQEHVNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKG

Query:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP
        WDLLLEAYLKEFSKNDGV LFLLTNPYHTDSDFGNKILDFVE+SGIQ+P SGWAPV+VVDTHIAQTDLP+VYKAADAFVLPSRGEGWGRPLVEAMSMSLP
Subjt:  WDLLLEAYLKEFSKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLP

Query:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV
        VIATNWSGQTEFLTDENSYPLAVE+MSEVKEGPFKGHLWAEPSI+KLRVLMREVMTNVDEAK KGRRAREDMVRRFSPDVVAEIV  HIQRIF EV
Subjt:  VIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV

SwissProt top hitse value%identityAlignment
A7TZT2 Mannosylfructose-phosphate synthase7.8e-0627.56Show/hide
Query:  KGFVFLSIFKWEFRKGWDLLLEAYLKEFSKNDGVALFLLT---NPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLP
        +G V L++ +    KG+DLL++ +     +     L L     N    ++   N++ + V+  G++   +          ++A  DLP +Y+AAD FVL 
Subjt:  KGFVFLSIFKWEFRKGWDLLLEAYLKEFSKNDGVALFLLT---NPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLP

Query:  SRGEGWGRPLVEAMSMSLPVIATNWSG
        SR E +G   +EAM+   P + T   G
Subjt:  SRGEGWGRPLVEAMSMSLPVIATNWSG

Arabidopsis top hitse value%identityAlignment
AT3G10630.1 UDP-Glycosyltransferase superfamily protein8.1e-18465.42Show/hide
Query:  RPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNA---------SRNPKQTS-VPNPFSTSSQCVLWMAPFLSGGGYSSEAWSY
        RP   S I  Y SSIL LLLSI L  FT TD +K QSL+  F   +NR  +            PK  S   NP S++  CVLWMAPFLS GGYSSEAWSY
Subjt:  RPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNA---------SRNPKQTS-VPNPFSTSSQCVLWMAPFLSGGGYSSEAWSY

Query:  ILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDRVSQEH
        +L+L +H+ NP FR+ IE HGDLES++FW GL    K +AIE++R +CR NETIVVCHSEPGAW PPLFET PCPP  Y++F SVIGRTMFETDRV+ EH
Subjt:  ILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDRVSQEH

Query:  VNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKGWDLLLEAYLKEF
        V RCN+MD VWVP++FHVS+FV+SGVD SKVVKIVQP+DV FFDP  Y P  L +VG LVLG        ++ GFVFLS+FKWE RKGWD+LL+AYL EF
Subjt:  VNRCNEMDFVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKGWDLLLEAYLKEF

Query:  SKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATNWSGQTEF
        S  D VALFLLTN YH+DSDFGNKILDFVE   I+ P +G+  V+V+D HIAQ DLPR+YKAADAFVLP+RGEGWGRP+VEAM+MSLPVI TNWSG TE+
Subjt:  SKNDGVALFLLTNPYHTDSDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATNWSGQTEF

Query:  LTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE
        LT+ N YPL VE MSEVKEGPF+GH WAEPS+ KLRVLMR VM+N DEAK KG+R R+DMV+ F+P+VVA++V   I RIFDE
Subjt:  LTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDE

AT5G01220.1 sulfoquinovosyldiacylglycerol 22.0e-0430.48Show/hide
Query:  DLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGR
        +L + Y + D FV+PS  E  G  ++EAMS  LPV+A    G  + + ++           E K G        E  +TKLR L+ +  T     +  G+
Subjt:  DLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSITKLRVLMREVMTNVDEAKEKGR

Query:  RARED
         ARE+
Subjt:  RARED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGACCTTCACCTCACAGATCGACCACTTCCCAATCCCAAACATTCCCACTCTCTCAAATCCCGACCTTCTTCTTCTTCCATGATCTTCTTCTACTGTTCATCCAT
TCTCATCCTCCTTCTATCAATTTCCCTCTTCGCTTTCACCAAAACAGATCATTTCAAATCTCAATCTCTTAAAACCCTTTTTCAAGAACTCATCAATCGTCTCAACGCAT
CTCGAAATCCCAAGCAAACCTCTGTTCCTAATCCGTTTTCGACGTCCTCTCAATGTGTTCTTTGGATGGCTCCATTTCTTTCCGGCGGCGGATACAGTTCAGAAGCTTGG
TCCTACATTTTAGCCCTTCACGATCATGTAAGAAACCCTAATTTTCGTTTGGCTATTGAGCAACATGGCGATCTAGAATCCATTGATTTTTGGGAGGGCTTACCAGATTC
TGTGAAGAATTTGGCCATTGAACTTCATAGAACAAAATGTAGAATCAATGAAACTATTGTGGTTTGTCATAGTGAACCTGGTGCTTGGAATCCTCCTTTGTTTGAAACTT
TTCCTTGCCCACCAGGTGTTTACCAAAATTTCAAGTCAGTGATTGGCAGAACAATGTTTGAAACTGATAGAGTAAGTCAAGAACATGTTAATCGTTGTAATGAAATGGAT
TTTGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGAAAAGTGGGGTTGATCCTTCTAAAGTAGTGAAAATTGTTCAACCCATTGATGTTAATTTCTTTGATCC
ATTGAATTATAGTCCATTTAGTCTTGAATCTGTAGGAACTCTTGTTCTAGGAGACAAAAACATGGAAGAAGTAAGCTTAGAGAAGGGATTTGTGTTCTTGAGTATCTTCA
AATGGGAATTTAGAAAAGGTTGGGATCTGTTATTGGAAGCATATTTGAAAGAATTCTCCAAGAATGATGGAGTTGCGTTGTTCTTATTGACAAATCCTTACCATACTGAT
AGTGATTTTGGGAACAAGATATTGGATTTTGTAGAACACTCAGGCATTCAAAGGCCAGCTTCTGGTTGGGCTCCTGTTCATGTGGTTGATACTCATATAGCTCAAACTGA
TTTGCCTAGAGTTTACAAGGCTGCAGATGCATTTGTTCTGCCGTCCCGAGGAGAGGGGTGGGGGAGACCGCTCGTCGAAGCGATGTCGATGTCGTTGCCAGTGATCGCCA
CCAACTGGTCGGGGCAAACGGAGTTTTTGACCGATGAGAATAGCTATCCATTGGCAGTTGAGAGAATGAGTGAGGTAAAGGAAGGACCATTCAAGGGGCATTTGTGGGCT
GAACCATCCATTACTAAGCTTCGAGTTTTAATGAGGGAGGTAATGACCAACGTCGATGAAGCTAAGGAGAAAGGGCGGAGGGCGAGGGAGGACATGGTCAGGCGATTCTC
GCCCGACGTCGTGGCCGAGATTGTTCGTCGTCATATACAAAGGATTTTTGATGAGGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGACCTTCACCTCACAGATCGACCACTTCCCAATCCCAAACATTCCCACTCTCTCAAATCCCGACCTTCTTCTTCTTCCATGATCTTCTTCTACTGTTCATCCAT
TCTCATCCTCCTTCTATCAATTTCCCTCTTCGCTTTCACCAAAACAGATCATTTCAAATCTCAATCTCTTAAAACCCTTTTTCAAGAACTCATCAATCGTCTCAACGCAT
CTCGAAATCCCAAGCAAACCTCTGTTCCTAATCCGTTTTCGACGTCCTCTCAATGTGTTCTTTGGATGGCTCCATTTCTTTCCGGCGGCGGATACAGTTCAGAAGCTTGG
TCCTACATTTTAGCCCTTCACGATCATGTAAGAAACCCTAATTTTCGTTTGGCTATTGAGCAACATGGCGATCTAGAATCCATTGATTTTTGGGAGGGCTTACCAGATTC
TGTGAAGAATTTGGCCATTGAACTTCATAGAACAAAATGTAGAATCAATGAAACTATTGTGGTTTGTCATAGTGAACCTGGTGCTTGGAATCCTCCTTTGTTTGAAACTT
TTCCTTGCCCACCAGGTGTTTACCAAAATTTCAAGTCAGTGATTGGCAGAACAATGTTTGAAACTGATAGAGTAAGTCAAGAACATGTTAATCGTTGTAATGAAATGGAT
TTTGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGAAAAGTGGGGTTGATCCTTCTAAAGTAGTGAAAATTGTTCAACCCATTGATGTTAATTTCTTTGATCC
ATTGAATTATAGTCCATTTAGTCTTGAATCTGTAGGAACTCTTGTTCTAGGAGACAAAAACATGGAAGAAGTAAGCTTAGAGAAGGGATTTGTGTTCTTGAGTATCTTCA
AATGGGAATTTAGAAAAGGTTGGGATCTGTTATTGGAAGCATATTTGAAAGAATTCTCCAAGAATGATGGAGTTGCGTTGTTCTTATTGACAAATCCTTACCATACTGAT
AGTGATTTTGGGAACAAGATATTGGATTTTGTAGAACACTCAGGCATTCAAAGGCCAGCTTCTGGTTGGGCTCCTGTTCATGTGGTTGATACTCATATAGCTCAAACTGA
TTTGCCTAGAGTTTACAAGGCTGCAGATGCATTTGTTCTGCCGTCCCGAGGAGAGGGGTGGGGGAGACCGCTCGTCGAAGCGATGTCGATGTCGTTGCCAGTGATCGCCA
CCAACTGGTCGGGGCAAACGGAGTTTTTGACCGATGAGAATAGCTATCCATTGGCAGTTGAGAGAATGAGTGAGGTAAAGGAAGGACCATTCAAGGGGCATTTGTGGGCT
GAACCATCCATTACTAAGCTTCGAGTTTTAATGAGGGAGGTAATGACCAACGTCGATGAAGCTAAGGAGAAAGGGCGGAGGGCGAGGGAGGACATGGTCAGGCGATTCTC
GCCCGACGTCGTGGCCGAGATTGTTCGTCGTCATATACAAAGGATTTTTGATGAGGTGTGA
Protein sequenceShow/hide protein sequence
MDDLHLTDRPLPNPKHSHSLKSRPSSSSMIFFYCSSILILLLSISLFAFTKTDHFKSQSLKTLFQELINRLNASRNPKQTSVPNPFSTSSQCVLWMAPFLSGGGYSSEAW
SYILALHDHVRNPNFRLAIEQHGDLESIDFWEGLPDSVKNLAIELHRTKCRINETIVVCHSEPGAWNPPLFETFPCPPGVYQNFKSVIGRTMFETDRVSQEHVNRCNEMD
FVWVPSEFHVSTFVKSGVDPSKVVKIVQPIDVNFFDPLNYSPFSLESVGTLVLGDKNMEEVSLEKGFVFLSIFKWEFRKGWDLLLEAYLKEFSKNDGVALFLLTNPYHTD
SDFGNKILDFVEHSGIQRPASGWAPVHVVDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMSMSLPVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWA
EPSITKLRVLMREVMTNVDEAKEKGRRAREDMVRRFSPDVVAEIVRRHIQRIFDEV