; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G26180 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G26180
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGlycos_transf_1 domain-containing protein
Genome locationChr5:25141211..25142842
RNA-Seq ExpressionCSPI05G26180
SyntenyCSPI05G26180
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135369.1 uncharacterized protein LOC101204678 [Cucumis sativus]2.1e-28799.59Show/hide
Query:  MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR
        MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR
Subjt:  MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR

Query:  HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN
        HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN
Subjt:  HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN

Query:  VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK
        VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKP SLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK
Subjt:  VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK

Query:  DEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD
        DEVGLFLLTN YHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD
Subjt:  DEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD

Query:  ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR
        ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR
Subjt:  ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR

XP_008446743.1 PREDICTED: uncharacterized protein LOC103489373 [Cucumis melo]3.6e-27194.4Show/hide
Query:  MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR
        MDAD R +DRPFP PNQPHRFK HLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLK SNQPPG NP CVLWMAPFLSGGGYSSEAWSYILALR
Subjt:  MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR

Query:  HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN
        HHITNPGFRLVIR HGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPG Y+KFKSVIGRTMFETDRVT+EHVNRCN
Subjt:  HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN

Query:  VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK
        VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKP SLESVGTLVLGG NFEE   +EKKRFVFLSIFKWEFRKGWD+LLEAYLKEFSKK
Subjt:  VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK

Query:  DEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD
        DEVGLFLLTN YHT+SDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTD
Subjt:  DEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD

Query:  ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR
        ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVT+NV+EAK+KGRRAR+DMI+RFSPDIVADIVHRQIENIFHEKR
Subjt:  ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR

XP_022150540.1 uncharacterized protein LOC111018657 [Momordica charantia]1.3e-21577.16Show/hide
Query:  DLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSK-LTNLLKFSNQPPGFNPL-----CVLWMAPFLSGGGYSSEAWSYIL
        DL H +   P P  PH  KF  S I   SILILL+AISFF F KT+ +K+QS K L   +KF N PP   P+     CVLWMAPFLSGGGYSSEAWSYIL
Subjt:  DLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSK-LTNLLKFSNQPPGFNPL-----CVLWMAPFLSGGGYSSEAWSYIL

Query:  ALRHHITNP-GFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHV
        AL HHI  P  FRL I  HGDLES+DFWEGLP+SVR+LAI+LH T CRMNETVVICHSEPGAWNPPLFETLPCPPG YQKFK+VIGRTMFETDRV  EHV
Subjt:  ALRHHITNP-GFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHV

Query:  NRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKE
        NRC +MDY+WVPSEFHVSTFV+SGVDPSKIVK+VQP+DVNFFDPLKY+P SL S+GTLVLG K+ E  +      FVFLSIFKWEFRKGWD+LLEAYLKE
Subjt:  NRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKE

Query:  FSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
        FSKKD+VGLFLLTN YH+D DFGNKILDFVENSD+Q P SGWAPVYV+D HI QTDLPR+YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
Subjt:  FSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE

Query:  FLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR
        FLTDENSYPL VERMSEVKE PFKGH+WAEPSI KLQ LMREVT NVDEAK KGR AR DM+ +FSPDIVADIV+  I+N+FH+KR
Subjt:  FLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR

XP_023542823.1 uncharacterized protein LOC111802622 [Cucurbita pepo subsp. pepo]8.7e-21776.16Show/hide
Query:  DLRHDDRPFPIPNQPHRFKFHLSP-----IHFSSILILLLAISFFAFPKTNFYKSQSSK-----LTNLLKFSNQPPGF---NPL-----CVLWMAPFLSG
        DL   D+P P P   H  K   S       + SSILILLL+IS FAF KT+ +KSQS K     L + L  S  P      NP      CVLWMAPFLSG
Subjt:  DLRHDDRPFPIPNQPHRFKFHLSP-----IHFSSILILLLAISFFAFPKTNFYKSQSSK-----LTNLLKFSNQPPGF---NPL-----CVLWMAPFLSG

Query:  GGYSSEAWSYILALRHHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTM
        GGYSSEAWSYILAL  H+ NP FRL I  HGDLES+DFWEGLP+SV+NLAIELHRT+CR+NET+V+CHSEPGAWNPPLFET PCPPG YQ FKSVIGRTM
Subjt:  GGYSSEAWSYILALRHHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTM

Query:  FETDRVTREHVNRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKG
        FETDRV++EHVNRCN MD+VWVPSEFHVSTFV+SGVDPSK+VK+VQP+DVNFFDPL Y P SLESVGTLVLG KN EEEV LE K FVFLSIFKWEFRKG
Subjt:  FETDRVTREHVNRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKG

Query:  WDVLLEAYLKEFSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLP
        WD+LLEAYLKEFSK D VGLFLLTN YHTD+DFGNKILDFVE+S +Q P SGWAPV+VVD HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAM+M+LP
Subjt:  WDVLLEAYLKEFSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLP

Query:  VIATNWSGQTEFLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHE
        VIATNWSGQTEFLTDENSYPL VERMSEVKE PFKGH+WAEPSISKL+VLMREV  NVDEAK KGRRAR+DM+ RFSPD+VA+IVHR I+ IFHE
Subjt:  VIATNWSGQTEFLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHE

XP_038891322.1 uncharacterized protein LOC120080769 [Benincasa hispida]2.3e-24686.21Show/hide
Query:  MDADLRHD----DRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYI
        MD DLRH+    DRPFP PN+ H  KF  S IHFSSILILLLAISFF  PKT+FYK+QS KLT+LLK SNQ P  NP CVLWMAPF+SGGGYSSEAWSYI
Subjt:  MDADLRHD----DRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYI

Query:  LALRHHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHV
        LALR HITNPGFRL I+ HGDLES+DFWEGLP+S++NLAIELH T+CRMNETVVICHSEPGAWNPPLFETLPCPPG YQ FKSVIGRTMFETDRV++EHV
Subjt:  LALRHHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHV

Query:  NRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKE
        NRCN MDYVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKP SLESVGTLVLGGKN  EEV  EKK FVFLSIFKWEFRKGWD+LLEAYL+E
Subjt:  NRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKE

Query:  FSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
        F KKDEV  FLLTN YHTDSDFGNKILDFVEN DLQMPLSGWAPVYV+DIHI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
Subjt:  FSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE

Query:  FLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR
        FLTDENSYPLPVERMSEVKE PFKGHMWAEPSISKLQVLMREVT NVDEAK KG+RAR+DM+ RFSP IVADIVHRQI+NIFHEKR
Subjt:  FLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR

TrEMBL top hitse value%identityAlignment
A0A0A0KTD9 Glycos_transf_1 domain-containing protein1.0e-28799.59Show/hide
Query:  MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR
        MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR
Subjt:  MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR

Query:  HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN
        HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN
Subjt:  HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN

Query:  VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK
        VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKP SLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK
Subjt:  VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK

Query:  DEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD
        DEVGLFLLTN YHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD
Subjt:  DEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD

Query:  ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR
        ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR
Subjt:  ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR

A0A1S3BFB1 uncharacterized protein LOC1034893731.7e-27194.4Show/hide
Query:  MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR
        MDAD R +DRPFP PNQPHRFK HLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLK SNQPPG NP CVLWMAPFLSGGGYSSEAWSYILALR
Subjt:  MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR

Query:  HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN
        HHITNPGFRLVIR HGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPG Y+KFKSVIGRTMFETDRVT+EHVNRCN
Subjt:  HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN

Query:  VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK
        VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKP SLESVGTLVLGG NFEE   +EKKRFVFLSIFKWEFRKGWD+LLEAYLKEFSKK
Subjt:  VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK

Query:  DEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD
        DEVGLFLLTN YHT+SDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTD
Subjt:  DEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD

Query:  ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR
        ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVT+NV+EAK+KGRRAR+DMI+RFSPDIVADIVHRQIENIFHEKR
Subjt:  ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR

A0A5D3CDB1 Group 1 family glycosyltransferase1.7e-27194.4Show/hide
Query:  MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR
        MDAD R +DRPFP PNQPHRFK HLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLK SNQPPG NP CVLWMAPFLSGGGYSSEAWSYILALR
Subjt:  MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALR

Query:  HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN
        HHITNPGFRLVIR HGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPG Y+KFKSVIGRTMFETDRVT+EHVNRCN
Subjt:  HHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCN

Query:  VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK
        VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKP SLESVGTLVLGG NFEE   +EKKRFVFLSIFKWEFRKGWD+LLEAYLKEFSKK
Subjt:  VMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKK

Query:  DEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD
        DEVGLFLLTN YHT+SDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTD
Subjt:  DEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD

Query:  ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR
        ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVT+NV+EAK+KGRRAR+DMI+RFSPDIVADIVHRQIENIFHEKR
Subjt:  ENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR

A0A6J1D8R8 uncharacterized protein LOC1110186576.1e-21677.16Show/hide
Query:  DLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSK-LTNLLKFSNQPPGFNPL-----CVLWMAPFLSGGGYSSEAWSYIL
        DL H +   P P  PH  KF  S I   SILILL+AISFF F KT+ +K+QS K L   +KF N PP   P+     CVLWMAPFLSGGGYSSEAWSYIL
Subjt:  DLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSK-LTNLLKFSNQPPGFNPL-----CVLWMAPFLSGGGYSSEAWSYIL

Query:  ALRHHITNP-GFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHV
        AL HHI  P  FRL I  HGDLES+DFWEGLP+SVR+LAI+LH T CRMNETVVICHSEPGAWNPPLFETLPCPPG YQKFK+VIGRTMFETDRV  EHV
Subjt:  ALRHHITNP-GFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHV

Query:  NRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKE
        NRC +MDY+WVPSEFHVSTFV+SGVDPSKIVK+VQP+DVNFFDPLKY+P SL S+GTLVLG K+ E  +      FVFLSIFKWEFRKGWD+LLEAYLKE
Subjt:  NRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKE

Query:  FSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
        FSKKD+VGLFLLTN YH+D DFGNKILDFVENSD+Q P SGWAPVYV+D HI QTDLPR+YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
Subjt:  FSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE

Query:  FLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR
        FLTDENSYPL VERMSEVKE PFKGH+WAEPSI KLQ LMREVT NVDEAK KGR AR DM+ +FSPDIVADIV+  I+N+FH+KR
Subjt:  FLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR

A0A6J1HWY2 uncharacterized protein LOC1114676051.3e-21576.47Show/hide
Query:  DLRHDDRPFPIPNQPHRFKFHLSPIHF---SSILILLLAISFFAFPKTNFYKSQS-----SKLTNLLKFSNQPPG---FNPL-----CVLWMAPFLSGGG
        DL H DRP P P +    K   S + F   SSILILLL+IS F F KT+ +KSQS      KL + L  S  P      NP      CVLWMAPFLSGGG
Subjt:  DLRHDDRPFPIPNQPHRFKFHLSPIHF---SSILILLLAISFFAFPKTNFYKSQS-----SKLTNLLKFSNQPPG---FNPL-----CVLWMAPFLSGGG

Query:  YSSEAWSYILALRHHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFE
        YSSEAWSYILAL  H+ NP FRL I  HGDLESVDFWEGLP+SV+NLAIELHRT+CR+NET+V+CHSEPGAWNPPLFET PCPPG YQ FKSVIGRTMFE
Subjt:  YSSEAWSYILALRHHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFE

Query:  TDRVTREHVNRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWD
        TDRV++EHVNRCN MD+VWVPSEFHVSTFV+SGVDPSK+VK+VQP+DVNFFDPL Y P SLESVGTLVLG KN   EV LE K FVFLSIFKWEFRKGWD
Subjt:  TDRVTREHVNRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWD

Query:  VLLEAYLKEFSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVI
        +LLEAYLKEFSK D VGLFLLTN YHTDSDFGNKILDFVENS +Q P SGWAPVYVVD HI QTDLP+VYKAADAFVLPSRGEGWGRPLVEAM+MSLPVI
Subjt:  VLLEAYLKEFSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVI

Query:  ATNWSGQTEFLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHE
        ATNWSGQTEFLTDENSYPL VE+MSEVKE PFKGH+WAEPSISKL+VLMREV  NVDEAK KGRRAR+DM+ RFSPD+VA+IVH  I+ IF E
Subjt:  ATNWSGQTEFLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHE

SwissProt top hitse value%identityAlignment
A7TZT2 Mannosylfructose-phosphate synthase1.7e-0530.95Show/hide
Query:  VFLSIFKWEFRKGWDVLLEAYLKEFSKKDEVGLFLLTNSYHTD---SDFGNKILDFVENSDLQ--MPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPS
        V L++ +    KG+D+L++ +     ++ E  L L     + D   +   N++ + V++  L+  +  SG    YV D      DLP +Y+AAD FVL S
Subjt:  VFLSIFKWEFRKGWDVLLEAYLKEFSKKDEVGLFLLTNSYHTD---SDFGNKILDFVENSDLQ--MPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPS

Query:  RGEGWGRPLVEAMAMSLPVIATNWSG
        R E +G   +EAMA   P + T   G
Subjt:  RGEGWGRPLVEAMAMSLPVIATNWSG

Q48453 Uncharacterized 41.2 kDa protein in cps region1.4e-0425.13Show/hide
Query:  DYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKKDE
        D +   S +     V++G+  +KI KVV            Y   SL+    L    K  E EV+ EK   +F+   +++ +KG+D LL            
Subjt:  DYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKKDE

Query:  VGLFLLTNSYHTDSDFGNKILDFVENSDLQ-MPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD
          + +   S +T +  G+ + D  E  + + +   GW         +   +LP  +   D  ++PSR E +G   VEA    +PVIA N +   E ++D
Subjt:  VGLFLLTNSYHTDSDFGNKILDFVENSDLQ-MPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD

Q9R9N2 Lipopolysaccharide core biosynthesis mannosyltransferase LpsB4.2e-0444.9Show/hide
Query:  TDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT
        T++P  Y+A D FV P R EG+G   +EAMA  +PV+AT+    +E +T
Subjt:  TDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT

Arabidopsis top hitse value%identityAlignment
AT3G10630.1 UDP-Glycosyltransferase superfamily protein2.5e-17761.54Show/hide
Query:  DRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLT-------NLLKF--------------SNQPPGFNPLCVLWMAPFLSGGG
        D+P     +P +F    + ++ SSIL LLL+I    F  T+ YK QS + T       + L+F              +  P    P CVLWMAPFLS GG
Subjt:  DRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLT-------NLLKF--------------SNQPPGFNPLCVLWMAPFLSGGG

Query:  YSSEAWSYILALRHHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFE
        YSSEAWSY+L+LR+H+TNP FR+ I HHGDLESV+FW GL +  + +AIE++R +CR NET+V+CHSEPGAW PPLFETLPCPP  Y+ F SVIGRTMFE
Subjt:  YSSEAWSYILALRHHITNPGFRLVIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFE

Query:  TDRVTREHVNRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWD
        TDRV  EHV RCN MD+VWVP++FHVS+FV+SGVD SK+VK+VQPVDV FFDP KYKP  L +VG LVLG           K  FVFLS+FKWE RKGWD
Subjt:  TDRVTREHVNRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWD

Query:  VLLEAYLKEFSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVI
        VLL+AYL EFS +D V LFLLTN+YH+DSDFGNKILDFVE  +++ P +G+  VYV+D HI Q DLPR+YKAADAFVLP+RGEGWGRP+VEAMAMSLPVI
Subjt:  VLLEAYLKEFSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVI

Query:  ATNWSGQTEFLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEK
         TNWSG TE+LT+ N YPL VE MSEVKE PF+GH WAEPS+ KL+VLMR V  N DEAK KG+R R DM+  F+P++VA +V  QI  IF EK
Subjt:  ATNWSGQTEFLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCTGATCTCCGCCACGACGATCGCCCATTTCCCATTCCCAACCAACCTCACCGTTTCAAATTCCACCTTTCTCCGATCCACTTCTCATCCATTCTCATTCTCCT
TCTAGCAATTTCCTTCTTCGCTTTCCCCAAAACAAATTTCTACAAATCCCAATCCTCAAAACTCACCAATCTTCTAAAATTTTCTAACCAACCCCCGGGTTTTAATCCAT
TATGTGTTCTTTGGATGGCCCCTTTTCTTTCTGGTGGTGGGTACAGTTCAGAAGCTTGGTCCTACATTTTAGCTCTTCGTCATCATATAACAAACCCTGGATTTCGTTTG
GTCATTCGGCATCATGGTGATCTAGAATCGGTCGACTTTTGGGAAGGCTTACCGGAATCTGTAAGGAATTTGGCTATTGAACTTCATAGAACGAGATGTAGAATGAATGA
AACTGTTGTGATTTGTCACAGTGAACCTGGTGCTTGGAATCCTCCATTGTTTGAAACTTTGCCTTGCCCACCAGGTCCTTATCAAAAGTTCAAGTCAGTGATTGGTAGAA
CAATGTTTGAAACTGATAGGGTAACTCGAGAACATGTGAATCGATGTAATGTAATGGATTATGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGGAAAGTGGG
GTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCTGTTGATGTTAATTTCTTTGATCCATTGAAATACAAACCACCTAGTCTTGAATCTGTAGGAACATTAGTTTTAGG
AGGCAAGAACTTTGAAGAAGAAGTAAAGTTAGAGAAGAAGAGATTTGTGTTTCTGAGTATCTTTAAATGGGAATTTAGGAAAGGTTGGGATGTGTTGTTGGAAGCTTACT
TGAAAGAATTCTCTAAGAAAGATGAAGTGGGGTTGTTTTTATTGACAAATTCTTACCATACTGATAGTGATTTTGGGAATAAGATATTGGATTTTGTAGAAAATTCTGAC
TTACAAATGCCACTTTCTGGTTGGGCTCCTGTTTATGTGGTTGATATTCATATACCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTACTTCCATC
AAGAGGAGAAGGATGGGGAAGGCCACTAGTCGAAGCGATGGCGATGTCGTTGCCAGTGATCGCTACGAACTGGTCAGGACAGACGGAGTTTTTGACAGATGAAAATAGCT
ATCCGTTGCCGGTTGAGAGAATGAGTGAAGTGAAGGAAGAGCCATTCAAAGGGCATATGTGGGCGGAACCATCCATCAGTAAGCTTCAAGTTCTAATGAGGGAAGTGACG
GTTAATGTTGATGAAGCTAAGGAGAAAGGAAGACGGGCGAGGCAGGACATGATCGATCGATTCTCACCCGACATTGTTGCCGATATTGTTCATCGTCAGATAGAAAATAT
ATTTCATGAGAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
CTTTCACTCAATAAAAGGAAGTGAGATTCGTCAAAAATTCCTTTTCGCCCCATCGCCATAATCGAAGAAACCTACTTCGTCTCCTTCTCTCTTTTCTCATTTCATCATCT
TAAATTTCTCTTTACTTTCTCCAAGTTTTCGCTTAATCTGAAAACCACATGGTTCCCCAAATCCACCGCCGCCATGGATGCTGATCTCCGCCACGACGATCGCCCATTTC
CCATTCCCAACCAACCTCACCGTTTCAAATTCCACCTTTCTCCGATCCACTTCTCATCCATTCTCATTCTCCTTCTAGCAATTTCCTTCTTCGCTTTCCCCAAAACAAAT
TTCTACAAATCCCAATCCTCAAAACTCACCAATCTTCTAAAATTTTCTAACCAACCCCCGGGTTTTAATCCATTATGTGTTCTTTGGATGGCCCCTTTTCTTTCTGGTGG
TGGGTACAGTTCAGAAGCTTGGTCCTACATTTTAGCTCTTCGTCATCATATAACAAACCCTGGATTTCGTTTGGTCATTCGGCATCATGGTGATCTAGAATCGGTCGACT
TTTGGGAAGGCTTACCGGAATCTGTAAGGAATTTGGCTATTGAACTTCATAGAACGAGATGTAGAATGAATGAAACTGTTGTGATTTGTCACAGTGAACCTGGTGCTTGG
AATCCTCCATTGTTTGAAACTTTGCCTTGCCCACCAGGTCCTTATCAAAAGTTCAAGTCAGTGATTGGTAGAACAATGTTTGAAACTGATAGGGTAACTCGAGAACATGT
GAATCGATGTAATGTAATGGATTATGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGGAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCTG
TTGATGTTAATTTCTTTGATCCATTGAAATACAAACCACCTAGTCTTGAATCTGTAGGAACATTAGTTTTAGGAGGCAAGAACTTTGAAGAAGAAGTAAAGTTAGAGAAG
AAGAGATTTGTGTTTCTGAGTATCTTTAAATGGGAATTTAGGAAAGGTTGGGATGTGTTGTTGGAAGCTTACTTGAAAGAATTCTCTAAGAAAGATGAAGTGGGGTTGTT
TTTATTGACAAATTCTTACCATACTGATAGTGATTTTGGGAATAAGATATTGGATTTTGTAGAAAATTCTGACTTACAAATGCCACTTTCTGGTTGGGCTCCTGTTTATG
TGGTTGATATTCATATACCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTACTTCCATCAAGAGGAGAAGGATGGGGAAGGCCACTAGTCGAAGCG
ATGGCGATGTCGTTGCCAGTGATCGCTACGAACTGGTCAGGACAGACGGAGTTTTTGACAGATGAAAATAGCTATCCGTTGCCGGTTGAGAGAATGAGTGAAGTGAAGGA
AGAGCCATTCAAAGGGCATATGTGGGCGGAACCATCCATCAGTAAGCTTCAAGTTCTAATGAGGGAAGTGACGGTTAATGTTGATGAAGCTAAGGAGAAAGGAAGACGGG
CGAGGCAGGACATGATCGATCGATTCTCACCCGACATTGTTGCCGATATTGTTCATCGTCAGATAGAAAATATATTTCATGAGAAGAGATGA
Protein sequenceShow/hide protein sequence
MDADLRHDDRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALRHHITNPGFRL
VIRHHGDLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQKFKSVIGRTMFETDRVTREHVNRCNVMDYVWVPSEFHVSTFVESG
VDPSKIVKVVQPVDVNFFDPLKYKPPSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKEFSKKDEVGLFLLTNSYHTDSDFGNKILDFVENSD
LQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEEPFKGHMWAEPSISKLQVLMREVT
VNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIENIFHEKR