; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G001610 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G001610
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionGlycos_transf_1 domain-containing protein
Genome locationchr04:1801312..1802766
RNA-Seq ExpressionLsi04G001610
SyntenyLsi04G001610
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135369.1 uncharacterized protein LOC101204678 [Cucumis sativus]1.2e-24284.57Show/hide
Query:  MDDDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYI
        MD DLRH+    D PF  PN+PH FKF  S IHFSSILILLLAISFF+F KT+F+K+ S KLT+LLK SNQ    NP CVLWMAPF+SGGGYSSEAWSYI
Subjt:  MDDDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYI

Query:  LALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV
        LAL  HITNPGFRL I+ HGDLES++FWEGLP+S+RNLAIELH T+CRMNETVVICHSEPGAWNPPLFETLPCPPG YQ FKSVIGRTMFETDRV++EHV
Subjt:  LALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV

Query:  DRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL--EVNLEKKGFVFLSIFKWEFRKGWDLLLEAYLRE
        +RCN MDYVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKP SLESVGTLVLG +N   EV LEKK FVFLSIFKWEFRKGWD+LLEAYL+E
Subjt:  DRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL--EVNLEKKGFVFLSIFKWEFRKGWDLLLEAYLRE

Query:  FCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
        F KKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
Subjt:  FCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE

Query:  FLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR
        FLTDENSYPLPVE MSEV E PFKGH+WAEPSIS LQVLMREV  NVDEAK KGR+AR+DM+ RFSPDIVA+IVH QI+NIFHEKR
Subjt:  FLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR

XP_008446743.1 PREDICTED: uncharacterized protein LOC103489373 [Cucumis melo]2.3e-24185.18Show/hide
Query:  NQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI
        + +P D PF NPN+PH FK   S IHFSSILILLLAISFF+F KT+F+K+ S KLT+LLK SNQ    NP CVLWMAPF+SGGGYSSEAWSYILAL  HI
Subjt:  NQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI

Query:  TNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVDRCNGMD
        TNPGFRL I+QHGDLES++FWEGLP+S+RNLAIELH T+CRMNETVVICHSEPGAWNPPLFETLPCPPGAY+ FKSVIGRTMFETDRV+QEHV+RCN MD
Subjt:  TNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVDRCNGMD

Query:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNL-EKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEV
        YVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKPFSLESVGTLVLG  N  EV L EKK FVFLSIFKWEFRKGWDLLLEAYL+EF KKDEV
Subjt:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNL-EKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEV

Query:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS
        GLFLLTNPYHT+SDFGNKILDFVENSDLQMPLSGWAPVYVVDIHI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTDENS
Subjt:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS

Query:  YPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR
        YPLPVE MSEV E PFKGH+WAEPSIS LQVLMREV  NV+EAK KGR+AREDM++RFSPDIVA+IVH QI+NIFHEKR
Subjt:  YPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR

XP_022968340.1 uncharacterized protein LOC111467605 [Cucurbita maxima]1.3e-22879.48Show/hide
Query:  DDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHF---SSILILLLAISFFSFTKTDFFKTHSLKL--------THLLKNSNQTQHPNPF-----CVLWMAPF
        DDL H    TD P  +P    S K RPS++ F   SSILILLL+IS F+FTKTD FK+ SLK          +  +N  QT   NPF     CVLWMAPF
Subjt:  DDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHF---SSILILLLAISFFSFTKTDFFKTHSLKL--------THLLKNSNQTQHPNPF-----CVLWMAPF

Query:  VSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIG
        +SGGGYSSEAWSYILALHDH+ NP FRLAI+QHGDLES++FWEGLPDS++NLAIELH TKCR+NET+V+CHSEPGAWNPPLFET PCPPG YQNFKSVIG
Subjt:  VSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIG

Query:  RTMFETDRVSQEHVDRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNLEKKGFVFLSIFKWEFR
        RTMFETDRVSQEHV+RCN MD+VWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG +N+ EV+LE KGFVFLSIFKWEFR
Subjt:  RTMFETDRVSQEHVDRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNLEKKGFVFLSIFKWEFR

Query:  KGWDLLLEAYLREFCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMS
        KGWDLLLEAYL+EF K D VGLFLLTNPYHTDSDFGNKILDFVENS +Q P SGWAPVYVVD HIAQTDLP+VYKAADAFVLPSRGEGWGRPLVEAM+MS
Subjt:  KGWDLLLEAYLREFCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMS

Query:  LPVIATNWSGQTEFLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHE
        LPVIATNWSGQTEFLTDENSYPL VE MSEV EGPFKGHLWAEPSIS L+VLMREV+TNVDEAKAKGR+AREDMV RFSPD+VAEIVHS IQ IF E
Subjt:  LPVIATNWSGQTEFLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHE

XP_023542823.1 uncharacterized protein LOC111802622 [Cucurbita pepo subsp. pepo]4.9e-22879.43Show/hide
Query:  TDPPFLNPNEPHSFKFRPSA-----IHFSSILILLLAISFFSFTKTDFFKTHSLKL--------THLLKNSNQTQHPNPF-----CVLWMAPFVSGGGYS
        TD P  NP   HS K RPS+      + SSILILLL+IS F+FTKTD FK+ SLK          +  +N  QT  PNPF     CVLWMAPF+SGGGYS
Subjt:  TDPPFLNPNEPHSFKFRPSA-----IHFSSILILLLAISFFSFTKTDFFKTHSLKL--------THLLKNSNQTQHPNPF-----CVLWMAPFVSGGGYS

Query:  SEAWSYILALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETD
        SEAWSYILALHDH+ NP FRLAI+QHGDLESI+FWEGLPDS++NLAIELH TKCR+NET+V+CHSEPGAWNPPLFET PCPPG YQNFKSVIGRTMFETD
Subjt:  SEAWSYILALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETD

Query:  RVSQEHVDRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL--EVNLEKKGFVFLSIFKWEFRKGWDLL
        RVSQEHV+RCN MD+VWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG +N+  EV+LE KGFVFLSIFKWEFRKGWDLL
Subjt:  RVSQEHVDRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL--EVNLEKKGFVFLSIFKWEFRKGWDLL

Query:  LEAYLREFCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIAT
        LEAYL+EF K D VGLFLLTNPYHTD+DFGNKILDFVE+S +Q P SGWAPV+VVD HIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAM+M+LPVIAT
Subjt:  LEAYLREFCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIAT

Query:  NWSGQTEFLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHE
        NWSGQTEFLTDENSYPL VE MSEV EGPFKGHLWAEPSIS L+VLMREV+TNVDEAK KGR+AREDMV RFSPD+VAEIVH  IQ IFHE
Subjt:  NWSGQTEFLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHE

XP_038891322.1 uncharacterized protein LOC120080769 [Benincasa hispida]7.0e-26792.58Show/hide
Query:  MDDDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYI
        MDDDLRHNQQPTD PF NPNE HS KFRPSAIHFSSILILLLAISFF+  KTDF+KT SLKLTHLLKNSNQT  PNP CVLWMAPFVSGGGYSSEAWSYI
Subjt:  MDDDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYI

Query:  LALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV
        LAL DHITNPGFRLAIQQHGDLESI+FWEGLPDSM+NLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV
Subjt:  LALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV

Query:  DRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNLEKKGFVFLSIFKWEFRKGWDLLLEAYLREF
        +RCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLG +NL EV+ EKKGFVFLSIFKWEFRKGWDLLLEAYLREF
Subjt:  DRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNLEKKGFVFLSIFKWEFRKGWDLLLEAYLREF

Query:  CKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEF
        CKKDEV  FLLTNPYHTDSDFGNKILDFVEN DLQMPLSGWAPVYV+DIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEF
Subjt:  CKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEF

Query:  LTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR
        LTDENSYPLPVE MSEV EGPFKGH+WAEPSIS LQVLMREV TNVDEAK KG++AREDMVSRFSP IVA+IVH QIQNIFHEKR
Subjt:  LTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR

TrEMBL top hitse value%identityAlignment
A0A0A0KTD9 Glycos_transf_1 domain-containing protein5.8e-24384.57Show/hide
Query:  MDDDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYI
        MD DLRH+    D PF  PN+PH FKF  S IHFSSILILLLAISFF+F KT+F+K+ S KLT+LLK SNQ    NP CVLWMAPF+SGGGYSSEAWSYI
Subjt:  MDDDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYI

Query:  LALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV
        LAL  HITNPGFRL I+ HGDLES++FWEGLP+S+RNLAIELH T+CRMNETVVICHSEPGAWNPPLFETLPCPPG YQ FKSVIGRTMFETDRV++EHV
Subjt:  LALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV

Query:  DRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL--EVNLEKKGFVFLSIFKWEFRKGWDLLLEAYLRE
        +RCN MDYVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKP SLESVGTLVLG +N   EV LEKK FVFLSIFKWEFRKGWD+LLEAYL+E
Subjt:  DRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL--EVNLEKKGFVFLSIFKWEFRKGWDLLLEAYLRE

Query:  FCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
        F KKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
Subjt:  FCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE

Query:  FLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR
        FLTDENSYPLPVE MSEV E PFKGH+WAEPSIS LQVLMREV  NVDEAK KGR+AR+DM+ RFSPDIVA+IVH QI+NIFHEKR
Subjt:  FLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR

A0A1S3BFB1 uncharacterized protein LOC1034893731.1e-24185.18Show/hide
Query:  NQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI
        + +P D PF NPN+PH FK   S IHFSSILILLLAISFF+F KT+F+K+ S KLT+LLK SNQ    NP CVLWMAPF+SGGGYSSEAWSYILAL  HI
Subjt:  NQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI

Query:  TNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVDRCNGMD
        TNPGFRL I+QHGDLES++FWEGLP+S+RNLAIELH T+CRMNETVVICHSEPGAWNPPLFETLPCPPGAY+ FKSVIGRTMFETDRV+QEHV+RCN MD
Subjt:  TNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVDRCNGMD

Query:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNL-EKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEV
        YVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKPFSLESVGTLVLG  N  EV L EKK FVFLSIFKWEFRKGWDLLLEAYL+EF KKDEV
Subjt:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNL-EKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEV

Query:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS
        GLFLLTNPYHT+SDFGNKILDFVENSDLQMPLSGWAPVYVVDIHI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTDENS
Subjt:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS

Query:  YPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR
        YPLPVE MSEV E PFKGH+WAEPSIS LQVLMREV  NV+EAK KGR+AREDM++RFSPDIVA+IVH QI+NIFHEKR
Subjt:  YPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR

A0A5D3CDB1 Group 1 family glycosyltransferase1.1e-24185.18Show/hide
Query:  NQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI
        + +P D PF NPN+PH FK   S IHFSSILILLLAISFF+F KT+F+K+ S KLT+LLK SNQ    NP CVLWMAPF+SGGGYSSEAWSYILAL  HI
Subjt:  NQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI

Query:  TNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVDRCNGMD
        TNPGFRL I+QHGDLES++FWEGLP+S+RNLAIELH T+CRMNETVVICHSEPGAWNPPLFETLPCPPGAY+ FKSVIGRTMFETDRV+QEHV+RCN MD
Subjt:  TNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVDRCNGMD

Query:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNL-EKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEV
        YVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKPFSLESVGTLVLG  N  EV L EKK FVFLSIFKWEFRKGWDLLLEAYL+EF KKDEV
Subjt:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNL-EKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEV

Query:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS
        GLFLLTNPYHT+SDFGNKILDFVENSDLQMPLSGWAPVYVVDIHI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTDENS
Subjt:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS

Query:  YPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR
        YPLPVE MSEV E PFKGH+WAEPSIS LQVLMREV  NV+EAK KGR+AREDM++RFSPDIVA+IVH QI+NIFHEKR
Subjt:  YPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR

A0A6J1G004 uncharacterized protein LOC1114494311.4e-22578.53Show/hide
Query:  TDPPFLNPNEPHSFKFRPSA-----IHFSSILILLLAISFFSFTKTDFFKTHSLKL--------THLLKNSNQTQHPNPF-----CVLWMAPFVSGGGYS
        TD P  NP   HS K RPS+      + SSILILLL+IS F+FTK D FK+ SLK          +  +N  QT  PNPF     CVLWMAPF+SGGGYS
Subjt:  TDPPFLNPNEPHSFKFRPSA-----IHFSSILILLLAISFFSFTKTDFFKTHSLKL--------THLLKNSNQTQHPNPF-----CVLWMAPFVSGGGYS

Query:  SEAWSYILALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETD
        SEAWSYILALHDH+ NP FRLAI+QHGDLESI+FWEGLPDS++NLAIELH TKCR+NET+V+CHSEPGAWNPPLFET PCPPG YQNFKSVIGRTMFETD
Subjt:  SEAWSYILALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETD

Query:  RVSQEHVDRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNLEVNLEKKGFVFLSIFKWEFRKGWDLLLE
        RVSQEHV+RCN MD+VWVPSEFHVSTFVKSGVDPSK+VK+VQP+DVNFFDPL Y PFSLESVGTLVLG +N+E    +KGFVFLSIFKWEFRKGWDLLLE
Subjt:  RVSQEHVDRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNLEVNLEKKGFVFLSIFKWEFRKGWDLLLE

Query:  AYLREFCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNW
        AYL+EF K D V LFLLTNPYHTDSDFGNKILDFVE+S +Q P SGWAPV+VVD HIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAM+MSLPVIATNW
Subjt:  AYLREFCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNW

Query:  SGQTEFLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHE
        SGQTEFLTDENSYPL VE MSEV EGPFKGHLWAEPSIS L+VLMREV+TNVDEAK KGR+AREDMV RFSPD+VAEIV   IQ IF E
Subjt:  SGQTEFLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHE

A0A6J1HWY2 uncharacterized protein LOC1114676056.3e-22979.48Show/hide
Query:  DDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHF---SSILILLLAISFFSFTKTDFFKTHSLKL--------THLLKNSNQTQHPNPF-----CVLWMAPF
        DDL H    TD P  +P    S K RPS++ F   SSILILLL+IS F+FTKTD FK+ SLK          +  +N  QT   NPF     CVLWMAPF
Subjt:  DDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHF---SSILILLLAISFFSFTKTDFFKTHSLKL--------THLLKNSNQTQHPNPF-----CVLWMAPF

Query:  VSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIG
        +SGGGYSSEAWSYILALHDH+ NP FRLAI+QHGDLES++FWEGLPDS++NLAIELH TKCR+NET+V+CHSEPGAWNPPLFET PCPPG YQNFKSVIG
Subjt:  VSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIG

Query:  RTMFETDRVSQEHVDRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNLEKKGFVFLSIFKWEFR
        RTMFETDRVSQEHV+RCN MD+VWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG +N+ EV+LE KGFVFLSIFKWEFR
Subjt:  RTMFETDRVSQEHVDRCNGMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNL-EVNLEKKGFVFLSIFKWEFR

Query:  KGWDLLLEAYLREFCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMS
        KGWDLLLEAYL+EF K D VGLFLLTNPYHTDSDFGNKILDFVENS +Q P SGWAPVYVVD HIAQTDLP+VYKAADAFVLPSRGEGWGRPLVEAM+MS
Subjt:  KGWDLLLEAYLREFCKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMS

Query:  LPVIATNWSGQTEFLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHE
        LPVIATNWSGQTEFLTDENSYPL VE MSEV EGPFKGHLWAEPSIS L+VLMREV+TNVDEAKAKGR+AREDMV RFSPD+VAEIVHS IQ IF E
Subjt:  LPVIATNWSGQTEFLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHE

SwissProt top hitse value%identityAlignment
A7TZT2 Mannosylfructose-phosphate synthase2.6e-0630.23Show/hide
Query:  KGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEVGLFLLT---NPYHTDSDFGNKILDFVENSDLQ--MPLSGWAPVYVVDIHIAQTDLPRVYKAADAFV
        +G V L++ +    KG+DLL++ +     ++ E  L L     N    ++   N++ + V++  L+  +  SG         ++A  DLP +Y+AAD FV
Subjt:  KGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEVGLFLLT---NPYHTDSDFGNKILDFVENSDLQ--MPLSGWAPVYVVDIHIAQTDLPRVYKAADAFV

Query:  LPSRGEGWGRPLVEAMAMSLPVIATNWSG
        L SR E +G   +EAMA   P + T   G
Subjt:  LPSRGEGWGRPLVEAMAMSLPVIATNWSG

Q9R9N2 Lipopolysaccharide core biosynthesis mannosyltransferase LpsB4.2e-0444.9Show/hide
Query:  TDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT
        T++P  Y+A D FV P R EG+G   +EAMA  +PV+AT+    +E +T
Subjt:  TDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT

Arabidopsis top hitse value%identityAlignment
AT1G52420.1 UDP-Glycosyltransferase superfamily protein7.3e-0433.75Show/hide
Query:  KILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ NS        W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  L V+ T+  G  E +
Subjt:  KILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL

AT3G10630.1 UDP-Glycosyltransferase superfamily protein1.0e-17862.16Show/hide
Query:  IHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLL------------------KNSNQTQHP---NPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNP
        ++ SSIL LLL+I    FT TD +K  SL+ T  +                  K+ ++T +P    P CVLWMAPF+S GGYSSEAWSY+L+L +H+TNP
Subjt:  IHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLL------------------KNSNQTQHP---NPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNP

Query:  GFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVDRCNGMDYVW
         FR+ I+ HGDLES+ FW GL    + +AIE++  +CR NET+V+CHSEPGAW PPLFETLPCPP  Y++F SVIGRTMFETDRV+ EHV RCN MD+VW
Subjt:  GFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVDRCNGMDYVW

Query:  VPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNLEVNLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEVGLFLL
        VP++FHVS+FV+SGVD SK+VK+VQP+DV FFDP KYKP  L +VG LVLG+        K GFVFLS+FKWE RKGWD+LL+AYL EF  +D V LFLL
Subjt:  VPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNLEVNLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEVGLFLL

Query:  TNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPV
        TN YH+DSDFGNKILDFVE  +++ P +G+  VYV+D HIAQ DLPR+YKAADAFVLP+RGEGWGRP+VEAMAMSLPVI TNWSG TE+LT+ N YPL V
Subjt:  TNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPV

Query:  ETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEK
        E MSEV EGPF+GH WAEPS+  L+VLMR V++N DEAK KG++ R+DMV  F+P++VA++V  QI  IF EK
Subjt:  ETMSEVNEGPFKGHLWAEPSISTLQVLMREVITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEK

AT3G15940.1 UDP-Glycosyltransferase superfamily protein4.3e-0433.75Show/hide
Query:  KILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ N+        W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  LPV+ T+  G  E +
Subjt:  KILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL

AT3G15940.2 UDP-Glycosyltransferase superfamily protein4.3e-0433.75Show/hide
Query:  KILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ N+        W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  LPV+ T+  G  E +
Subjt:  KILDFVENSDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGACGATCTCCGCCACAACCAACAACCCACAGATCCACCATTTCTCAATCCCAACGAACCTCACTCCTTCAAATTCCGCCCTTCTGCGATCCACTTTTCATCCAT
TCTCATTCTCCTTCTAGCAATTTCCTTCTTCAGTTTCACCAAAACAGATTTCTTCAAAACCCATTCCTTAAAACTCACCCATCTTCTCAAAAATTCTAACCAAACCCAAC
ATCCTAATCCCTTCTGTGTTCTTTGGATGGCCCCATTTGTTTCTGGTGGTGGCTACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCACGATCATATAACAAACCCT
GGATTTCGATTGGCCATTCAGCAACATGGTGATCTTGAATCCATCAACTTTTGGGAAGGCTTACCGGATTCTATGAGGAATTTGGCTATTGAACTTCACAGCACAAAATG
TAGAATGAATGAAACTGTTGTGATTTGTCACAGTGAACCAGGTGCGTGGAATCCTCCATTGTTTGAAACTTTGCCTTGCCCACCAGGTGCTTACCAAAATTTCAAGTCAG
TGATTGGTAGAACAATGTTTGAAACTGATAGGGTAAGTCAAGAACATGTGGATCGTTGTAATGGAATGGATTATGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTT
GTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCTATTGATGTGAATTTCTTTGATCCATTGAAATACAAACCATTTAGTCTTGAATCTGTAGGAAC
ATTAGTTTTAGGAACCAGAAACTTGGAAGTAAACTTAGAGAAGAAAGGATTTGTGTTTCTGAGTATCTTTAAATGGGAATTCAGGAAAGGTTGGGATCTGTTGTTGGAAG
CATATTTGAGAGAATTCTGTAAGAAAGATGAAGTGGGGTTGTTTTTGTTGACAAATCCTTACCATACTGATAGTGATTTTGGGAATAAGATTTTGGATTTTGTAGAAAAT
TCAGACTTACAAATGCCACTTTCTGGTTGGGCTCCTGTTTATGTGGTTGATATTCATATAGCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTACT
CCCATCAAGAGGAGAAGGGTGGGGAAGGCCGCTCGTTGAAGCGATGGCGATGTCATTGCCAGTGATTGCCACCAACTGGTCGGGGCAGACGGAGTTTTTGACCGATGAGA
ATAGCTATCCGTTGCCGGTTGAGACAATGAGTGAAGTTAATGAAGGGCCATTCAAAGGGCATCTGTGGGCTGAACCATCCATCAGTACGCTTCAAGTTCTAATGAGGGAA
GTAATAACTAATGTTGATGAAGCTAAGGCTAAAGGACGACAGGCGAGGGAGGACATGGTAAGCCGATTCTCGCCCGACATCGTTGCCGAGATTGTTCATAGTCAGATACA
AAATATATTTCATGAGAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGACGATCTCCGCCACAACCAACAACCCACAGATCCACCATTTCTCAATCCCAACGAACCTCACTCCTTCAAATTCCGCCCTTCTGCGATCCACTTTTCATCCAT
TCTCATTCTCCTTCTAGCAATTTCCTTCTTCAGTTTCACCAAAACAGATTTCTTCAAAACCCATTCCTTAAAACTCACCCATCTTCTCAAAAATTCTAACCAAACCCAAC
ATCCTAATCCCTTCTGTGTTCTTTGGATGGCCCCATTTGTTTCTGGTGGTGGCTACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCACGATCATATAACAAACCCT
GGATTTCGATTGGCCATTCAGCAACATGGTGATCTTGAATCCATCAACTTTTGGGAAGGCTTACCGGATTCTATGAGGAATTTGGCTATTGAACTTCACAGCACAAAATG
TAGAATGAATGAAACTGTTGTGATTTGTCACAGTGAACCAGGTGCGTGGAATCCTCCATTGTTTGAAACTTTGCCTTGCCCACCAGGTGCTTACCAAAATTTCAAGTCAG
TGATTGGTAGAACAATGTTTGAAACTGATAGGGTAAGTCAAGAACATGTGGATCGTTGTAATGGAATGGATTATGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTT
GTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCTATTGATGTGAATTTCTTTGATCCATTGAAATACAAACCATTTAGTCTTGAATCTGTAGGAAC
ATTAGTTTTAGGAACCAGAAACTTGGAAGTAAACTTAGAGAAGAAAGGATTTGTGTTTCTGAGTATCTTTAAATGGGAATTCAGGAAAGGTTGGGATCTGTTGTTGGAAG
CATATTTGAGAGAATTCTGTAAGAAAGATGAAGTGGGGTTGTTTTTGTTGACAAATCCTTACCATACTGATAGTGATTTTGGGAATAAGATTTTGGATTTTGTAGAAAAT
TCAGACTTACAAATGCCACTTTCTGGTTGGGCTCCTGTTTATGTGGTTGATATTCATATAGCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTACT
CCCATCAAGAGGAGAAGGGTGGGGAAGGCCGCTCGTTGAAGCGATGGCGATGTCATTGCCAGTGATTGCCACCAACTGGTCGGGGCAGACGGAGTTTTTGACCGATGAGA
ATAGCTATCCGTTGCCGGTTGAGACAATGAGTGAAGTTAATGAAGGGCCATTCAAAGGGCATCTGTGGGCTGAACCATCCATCAGTACGCTTCAAGTTCTAATGAGGGAA
GTAATAACTAATGTTGATGAAGCTAAGGCTAAAGGACGACAGGCGAGGGAGGACATGGTAAGCCGATTCTCGCCCGACATCGTTGCCGAGATTGTTCATAGTCAGATACA
AAATATATTTCATGAGAAGAGATGA
Protein sequenceShow/hide protein sequence
MDDDLRHNQQPTDPPFLNPNEPHSFKFRPSAIHFSSILILLLAISFFSFTKTDFFKTHSLKLTHLLKNSNQTQHPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNP
GFRLAIQQHGDLESINFWEGLPDSMRNLAIELHSTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVDRCNGMDYVWVPSEFHVSTF
VKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGTRNLEVNLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDEVGLFLLTNPYHTDSDFGNKILDFVEN
SDLQMPLSGWAPVYVVDIHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVETMSEVNEGPFKGHLWAEPSISTLQVLMRE
VITNVDEAKAKGRQAREDMVSRFSPDIVAEIVHSQIQNIFHEKR