; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G021920 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G021920
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGlycos_transf_1 domain-containing protein
Genome locationCG_Chr05:33929567..33931021
RNA-Seq ExpressionClCG05G021920
SyntenyClCG05G021920
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135369.1 uncharacterized protein LOC101204678 [Cucumis sativus]3.6e-24786.01Show/hide
Query:  MDDDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYI
        MD DL H+    D PFP PNQ H F+F  S IHFSSILILLLAISFF+F KT+F+K+ S KLT LLK SNQ P  NP CVLWMAPF+SGGGYSSEAWSYI
Subjt:  MDDDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYI

Query:  LALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV
        LAL  HITNPGFRL I+ HGDLES+DFWEGLP+S+RNLAIELHRT+CRMNETVVICHSEPGAWNPPLFETLPCPPG YQ FKSVIGRTMFETDRV++EHV
Subjt:  LALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV

Query:  NRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL--EVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLRE
        NRCN MDYVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKP SLESVGTLVLG KN   EV LEKK FVFLSIFKWEFRKGWD+LLEAYL+E
Subjt:  NRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL--EVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLRE

Query:  FCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
        F KKD VGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
Subjt:  FCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE

Query:  FLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR
        FLTDENSYPLPVERMSEVKE PF+GH+WAEPSIS LQVLMREVT NVDEAK KGRRAR+DM+ RFSP+IVADIVH QI+NIFHEKR
Subjt:  FLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR

XP_008446743.1 PREDICTED: uncharacterized protein LOC103489373 [Cucumis melo]3.1e-24686.22Show/hide
Query:  NQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI
        + +P D PFPNPNQ H F+   S IHFSSILILLLAISFF+F KT+F+K+ S KLT LLK SNQ P  NP CVLWMAPF+SGGGYSSEAWSYILAL  HI
Subjt:  NQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI

Query:  TNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMD
        TNPGFRL I+QHGDLES+DFWEGLP+S+RNLAIELHRT+CRMNETVVICHSEPGAWNPPLFETLPCPPGAY+ FKSVIGRTMFETDRV+QEHVNRCN MD
Subjt:  TNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMD

Query:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEV--SLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGV
        YVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKPFSLESVGTLVLG  N E    +EKK FVFLSIFKWEFRKGWDLLLEAYL+EF KKD V
Subjt:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEV--SLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGV

Query:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS
        GLFLLTNPYHT+SDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTDENS
Subjt:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS

Query:  YPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR
        YPLPVERMSEVKE PF+GH+WAEPSIS LQVLMREVT NV+EAK KGRRAREDM++RFSP+IVADIVH QI+NIFHEKR
Subjt:  YPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR

XP_022968340.1 uncharacterized protein LOC111467605 [Cucurbita maxima]3.0e-23380.89Show/hide
Query:  DDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHF---SSILILLLAISFFSFSKTDFFKTHSLK------LTRL--LKNSNQTPFPNPF-----CVLWMAPF
        DDLHH    TD P P+P +S S + RPS++ F   SSILILLL+IS F+F+KTD FK+ SLK      + RL   +N  QT   NPF     CVLWMAPF
Subjt:  DDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHF---SSILILLLAISFFSFSKTDFFKTHSLK------LTRL--LKNSNQTPFPNPF-----CVLWMAPF

Query:  VSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIG
        +SGGGYSSEAWSYILALHDH+ NP FRLAI+QHGDLES+DFWEGLPDS++NLAIELHRTKCR+NET+V+CHSEPGAWNPPLFET PCPPG YQNFKSVIG
Subjt:  VSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIG

Query:  RTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL-EVSLEKKGFVFLSIFKWEFR
        RTMFETDRVSQEHVNRCN MD+VWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG KN+ EVSLE KGFVFLSIFKWEFR
Subjt:  RTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL-EVSLEKKGFVFLSIFKWEFR

Query:  KGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMS
        KGWDLLLEAYL+EF K DGVGLFLLTNPYHTDSDFGNKILDFVENS +Q P SGWAPVYVVD HI QTDLP+VYKAADAFVLPSRGEGWGRPLVEAM+MS
Subjt:  KGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMS

Query:  LPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHE
        LPVIATNWSGQTEFLTDENSYPL VE+MSEVKEGPF+GHLWAEPSIS L+VLMREV TNVDEAKAKGRRAREDMV RFSP++VA+IVHS IQ IF E
Subjt:  LPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHE

XP_023542823.1 uncharacterized protein LOC111802622 [Cucurbita pepo subsp. pepo]1.1e-23280.2Show/hide
Query:  DDLHHNQQPTDPPFPNPNQSHSFRFRPSA-----IHFSSILILLLAISFFSFSKTDFFKTHSLK------LTRL--LKNSNQTPFPNPF-----CVLWMA
        DDLH     TD P PNP  SHS + RPS+      + SSILILLL+IS F+F+KTD FK+ SLK      + RL   +N  QT  PNPF     CVLWMA
Subjt:  DDLHHNQQPTDPPFPNPNQSHSFRFRPSA-----IHFSSILILLLAISFFSFSKTDFFKTHSLK------LTRL--LKNSNQTPFPNPF-----CVLWMA

Query:  PFVSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSV
        PF+SGGGYSSEAWSYILALHDH+ NP FRLAI+QHGDLESIDFWEGLPDS++NLAIELHRTKCR+NET+V+CHSEPGAWNPPLFET PCPPG YQNFKSV
Subjt:  PFVSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSV

Query:  IGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL--EVSLEKKGFVFLSIFKW
        IGRTMFETDRVSQEHVNRCN+MD+VWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG KN+  EVSLE KGFVFLSIFKW
Subjt:  IGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL--EVSLEKKGFVFLSIFKW

Query:  EFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAM
        EFRKGWDLLLEAYL+EF K DGVGLFLLTNPYHTD+DFGNKILDFVE+S +Q P SGWAPV+VVD HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAM
Subjt:  EFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAM

Query:  AMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHE
        +M+LPVIATNWSGQTEFLTDENSYPL VERMSEVKEGPF+GHLWAEPSIS L+VLMREV TNVDEAK KGRRAREDMV RFSP++VA+IVH  IQ IFHE
Subjt:  AMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHE

XP_038891322.1 uncharacterized protein LOC120080769 [Benincasa hispida]2.2e-26892.99Show/hide
Query:  MDDDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYI
        MDDDL HNQQPTD PFPNPN+SHS +FRPSAIHFSSILILLLAISFF+  KTDF+KT SLKLT LLKNSNQTP PNP CVLWMAPFVSGGGYSSEAWSYI
Subjt:  MDDDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYI

Query:  LALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV
        LAL DHITNPGFRLAIQQHGDLESIDFWEGLPDS++NLAIELH TKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV
Subjt:  LALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV

Query:  NRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL-EVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLREF
        NRCN MDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLG KNL EVS EKKGFVFLSIFKWEFRKGWDLLLEAYLREF
Subjt:  NRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL-EVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLREF

Query:  CKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEF
        CKKD V  FLLTNPYHTDSDFGNKILDFVEN DLQMPLSGWAPVYV+DIHI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEF
Subjt:  CKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEF

Query:  LTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR
        LTDENSYPLPVERMSEVKEGPF+GH+WAEPSIS LQVLMREVTTNVDEAK KG+RAREDMVSRFSP IVADIVH QIQNIFHEKR
Subjt:  LTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR

TrEMBL top hitse value%identityAlignment
A0A0A0KTD9 Glycos_transf_1 domain-containing protein1.8e-24786.01Show/hide
Query:  MDDDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYI
        MD DL H+    D PFP PNQ H F+F  S IHFSSILILLLAISFF+F KT+F+K+ S KLT LLK SNQ P  NP CVLWMAPF+SGGGYSSEAWSYI
Subjt:  MDDDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYI

Query:  LALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV
        LAL  HITNPGFRL I+ HGDLES+DFWEGLP+S+RNLAIELHRT+CRMNETVVICHSEPGAWNPPLFETLPCPPG YQ FKSVIGRTMFETDRV++EHV
Subjt:  LALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHV

Query:  NRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL--EVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLRE
        NRCN MDYVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKP SLESVGTLVLG KN   EV LEKK FVFLSIFKWEFRKGWD+LLEAYL+E
Subjt:  NRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL--EVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLRE

Query:  FCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
        F KKD VGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE
Subjt:  FCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTE

Query:  FLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR
        FLTDENSYPLPVERMSEVKE PF+GH+WAEPSIS LQVLMREVT NVDEAK KGRRAR+DM+ RFSP+IVADIVH QI+NIFHEKR
Subjt:  FLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR

A0A1S3BFB1 uncharacterized protein LOC1034893731.5e-24686.22Show/hide
Query:  NQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI
        + +P D PFPNPNQ H F+   S IHFSSILILLLAISFF+F KT+F+K+ S KLT LLK SNQ P  NP CVLWMAPF+SGGGYSSEAWSYILAL  HI
Subjt:  NQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI

Query:  TNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMD
        TNPGFRL I+QHGDLES+DFWEGLP+S+RNLAIELHRT+CRMNETVVICHSEPGAWNPPLFETLPCPPGAY+ FKSVIGRTMFETDRV+QEHVNRCN MD
Subjt:  TNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMD

Query:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEV--SLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGV
        YVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKPFSLESVGTLVLG  N E    +EKK FVFLSIFKWEFRKGWDLLLEAYL+EF KKD V
Subjt:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEV--SLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGV

Query:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS
        GLFLLTNPYHT+SDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTDENS
Subjt:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS

Query:  YPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR
        YPLPVERMSEVKE PF+GH+WAEPSIS LQVLMREVT NV+EAK KGRRAREDM++RFSP+IVADIVH QI+NIFHEKR
Subjt:  YPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR

A0A5D3CDB1 Group 1 family glycosyltransferase1.5e-24686.22Show/hide
Query:  NQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI
        + +P D PFPNPNQ H F+   S IHFSSILILLLAISFF+F KT+F+K+ S KLT LLK SNQ P  NP CVLWMAPF+SGGGYSSEAWSYILAL  HI
Subjt:  NQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHI

Query:  TNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMD
        TNPGFRL I+QHGDLES+DFWEGLP+S+RNLAIELHRT+CRMNETVVICHSEPGAWNPPLFETLPCPPGAY+ FKSVIGRTMFETDRV+QEHVNRCN MD
Subjt:  TNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMD

Query:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEV--SLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGV
        YVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKYKPFSLESVGTLVLG  N E    +EKK FVFLSIFKWEFRKGWDLLLEAYL+EF KKD V
Subjt:  YVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEV--SLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGV

Query:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS
        GLFLLTNPYHT+SDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTDENS
Subjt:  GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS

Query:  YPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR
        YPLPVERMSEVKE PF+GH+WAEPSIS LQVLMREVT NV+EAK KGRRAREDM++RFSP+IVADIVH QI+NIFHEKR
Subjt:  YPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR

A0A6J1G004 uncharacterized protein LOC1114494317.4e-23079.76Show/hide
Query:  DDLHHNQQPTDPPFPNPNQSHSFRFRPSA-----IHFSSILILLLAISFFSFSKTDFFKTHSLK------LTRL--LKNSNQTPFPNPF-----CVLWMA
        DDLH     TD P PNP  SHS + RPS+      + SSILILLL+IS F+F+K D FK+ SLK      + RL   +N  QT  PNPF     CVLWMA
Subjt:  DDLHHNQQPTDPPFPNPNQSHSFRFRPSA-----IHFSSILILLLAISFFSFSKTDFFKTHSLK------LTRL--LKNSNQTPFPNPF-----CVLWMA

Query:  PFVSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSV
        PF+SGGGYSSEAWSYILALHDH+ NP FRLAI+QHGDLESIDFWEGLPDS++NLAIELHRTKCR+NET+V+CHSEPGAWNPPLFET PCPPG YQNFKSV
Subjt:  PFVSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSV

Query:  IGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL-EVSLEKKGFVFLSIFKWE
        IGRTMFETDRVSQEHVNRCN MD+VWVPSEFHVSTFVKSGVDPSK+VK+VQP+DVNFFDPL Y PFSLESVGTLVLG KN+ EVSLE KGFVFLSIFKWE
Subjt:  IGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL-EVSLEKKGFVFLSIFKWE

Query:  FRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMA
        FRKGWDLLLEAYL+EF K DGV LFLLTNPYHTDSDFGNKILDFVE+S +Q P SGWAPV+VVD HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAM+
Subjt:  FRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMA

Query:  MSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHE
        MSLPVIATNWSGQTEFLTDENSYPL VERMSEVKEGPF+GHLWAEPSIS L+VLMREV TNVDEAK KGRRAREDMV RFSP++VA+IV   IQ IF E
Subjt:  MSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHE

A0A6J1HWY2 uncharacterized protein LOC1114676051.4e-23380.89Show/hide
Query:  DDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHF---SSILILLLAISFFSFSKTDFFKTHSLK------LTRL--LKNSNQTPFPNPF-----CVLWMAPF
        DDLHH    TD P P+P +S S + RPS++ F   SSILILLL+IS F+F+KTD FK+ SLK      + RL   +N  QT   NPF     CVLWMAPF
Subjt:  DDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHF---SSILILLLAISFFSFSKTDFFKTHSLK------LTRL--LKNSNQTPFPNPF-----CVLWMAPF

Query:  VSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIG
        +SGGGYSSEAWSYILALHDH+ NP FRLAI+QHGDLES+DFWEGLPDS++NLAIELHRTKCR+NET+V+CHSEPGAWNPPLFET PCPPG YQNFKSVIG
Subjt:  VSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIG

Query:  RTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL-EVSLEKKGFVFLSIFKWEFR
        RTMFETDRVSQEHVNRCN MD+VWVPSEFHVSTFVKSGVDPSK+VK+VQPIDVNFFDPL Y PFSLESVGTLVLG KN+ EVSLE KGFVFLSIFKWEFR
Subjt:  RTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNL-EVSLEKKGFVFLSIFKWEFR

Query:  KGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMS
        KGWDLLLEAYL+EF K DGVGLFLLTNPYHTDSDFGNKILDFVENS +Q P SGWAPVYVVD HI QTDLP+VYKAADAFVLPSRGEGWGRPLVEAM+MS
Subjt:  KGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMS

Query:  LPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHE
        LPVIATNWSGQTEFLTDENSYPL VE+MSEVKEGPF+GHLWAEPSIS L+VLMREV TNVDEAKAKGRRAREDMV RFSP++VA+IVHS IQ IF E
Subjt:  LPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHE

SwissProt top hitse value%identityAlignment
A7TZT2 Mannosylfructose-phosphate synthase3.8e-0531.01Show/hide
Query:  KGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGVGLFLLT---NPYHTDSDFGNKILDFVENSDLQ--MPLSGWAPVYVVDIHIPQTDLPRVYKAADAFV
        +G V L++ +    KG+DLL++ +     ++    L L     N    ++   N++ + V++  L+  +  SG    YV D      DLP +Y+AAD FV
Subjt:  KGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGVGLFLLT---NPYHTDSDFGNKILDFVENSDLQ--MPLSGWAPVYVVDIHIPQTDLPRVYKAADAFV

Query:  LPSRGEGWGRPLVEAMAMSLPVIATNWSG
        L SR E +G   +EAMA   P + T   G
Subjt:  LPSRGEGWGRPLVEAMAMSLPVIATNWSG

D6Z995 D-inositol 3-phosphate glycosyltransferase9.3e-0425.56Show/hide
Query:  DPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKIL
        DP KI  V   +D+  F P                 A   E+ L+    V   + + +  KG D+LL A       K GV L L+             I+
Subjt:  DPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKIL

Query:  DFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS
           E   +   ++ W          P   L +VY+AAD   +PS  E +G   +EA A   PV+A    G    + D+ S
Subjt:  DFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENS

Q9R9N2 Lipopolysaccharide core biosynthesis mannosyltransferase LpsB4.2e-0444.9Show/hide
Query:  TDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT
        T++P  Y+A D FV P R EG+G   +EAMA  +PV+AT+    +E +T
Subjt:  TDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT

Arabidopsis top hitse value%identityAlignment
AT3G10630.1 UDP-Glycosyltransferase superfamily protein3.4e-17962.79Show/hide
Query:  IHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLL------------------KNSNQTPFP---NPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNP
        ++ SSIL LLL+I    F+ TD +K  SL+ T  +                  K+ ++T  P    P CVLWMAPF+S GGYSSEAWSY+L+L +H+TNP
Subjt:  IHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLL------------------KNSNQTPFP---NPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNP

Query:  GFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMDYVW
         FR+ I+ HGDLES++FW GL    + +AIE++R +CR NET+V+CHSEPGAW PPLFETLPCPP  Y++F SVIGRTMFETDRV+ EHV RCN+MD+VW
Subjt:  GFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMDYVW

Query:  VPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGVGLFLL
        VP++FHVS+FV+SGVD SK+VK+VQP+DV FFDP KYKP  L +VG LVLG      S  K GFVFLS+FKWE RKGWD+LL+AYL EF  +D V LFLL
Subjt:  VPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGVGLFLL

Query:  TNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPV
        TN YH+DSDFGNKILDFVE  +++ P +G+  VYV+D HI Q DLPR+YKAADAFVLP+RGEGWGRP+VEAMAMSLPVI TNWSG TE+LT+ N YPL V
Subjt:  TNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPV

Query:  ERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEK
        E MSEVKEGPFEGH WAEPS+  L+VLMR V +N DEAK KG+R R+DMV  F+P +VA +V  QI  IF EK
Subjt:  ERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEK

AT3G15940.1 UDP-Glycosyltransferase superfamily protein9.5e-0432.5Show/hide
Query:  KILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ N+        W P          T +  +Y AAD +V  S+G G  +GR  +EAMA  LPV+ T+  G  E +
Subjt:  KILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL

AT3G15940.2 UDP-Glycosyltransferase superfamily protein9.5e-0432.5Show/hide
Query:  KILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ N+        W P          T +  +Y AAD +V  S+G G  +GR  +EAMA  LPV+ T+  G  E +
Subjt:  KILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGACGATCTCCACCATAACCAACAACCCACAGATCCACCATTTCCCAATCCCAACCAATCTCACTCCTTCAGATTCCGCCCTTCTGCGATTCACTTTTCATCCAT
CCTCATTCTCCTTCTAGCAATTTCCTTCTTCAGTTTCTCCAAAACAGATTTCTTCAAAACCCATTCCTTAAAACTCACCCGTCTTCTCAAAAATTCTAACCAAACCCCAT
TTCCTAATCCCTTCTGTGTTCTTTGGATGGCTCCATTTGTTTCTGGTGGTGGCTACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCATGATCATATAACAAACCCT
GGATTTCGATTGGCCATTCAGCAACATGGCGATTTAGAATCCATCGACTTTTGGGAGGGCTTACCGGATTCTATCAGGAATCTGGCCATTGAACTTCACAGAACAAAATG
TAGAATGAATGAAACCGTTGTGATTTGCCACAGTGAACCGGGTGCGTGGAATCCTCCTTTGTTTGAAACTTTGCCTTGCCCACCAGGTGCTTACCAAAATTTCAAGTCAG
TGATTGGTAGAACAATGTTTGAAACTGATAGGGTAAGTCAAGAACATGTGAATCGATGTAATAGAATGGATTACGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTT
GTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCTATTGATGTGAATTTCTTTGATCCATTGAAATACAAACCATTTAGTCTTGAATCTGTAGGAAC
ATTAGTTTTAGGAGCCAAAAACTTGGAAGTAAGCTTAGAGAAGAAGGGATTTGTGTTTCTGAGTATCTTTAAATGGGAATTCAGGAAAGGTTGGGATTTGTTGTTGGAAG
CATATTTGAGAGAATTTTGTAAGAAAGATGGAGTTGGGTTGTTTTTGTTGACAAATCCTTACCATACTGATAGTGATTTTGGGAATAAGATTTTGGATTTTGTAGAAAAT
TCAGACTTACAAATGCCACTTTCTGGTTGGGCTCCTGTTTATGTGGTTGATATTCATATACCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTACT
TCCATCAAGAGGAGAAGGGTGGGGGAGGCCGCTCGTTGAAGCAATGGCGATGTCGTTGCCAGTGATTGCAACCAACTGGTCGGGGCAAACGGAGTTTTTGACCGATGAGA
ATAGCTATCCGTTGCCCGTTGAGAGAATGAGTGAAGTAAAGGAAGGGCCATTTGAAGGGCATCTGTGGGCTGAACCATCCATCAGTATACTTCAAGTTCTAATGAGGGAA
GTAACAACTAATGTTGATGAAGCTAAGGCTAAAGGACGACGGGCAAGGGAGGACATGGTAAGCCGATTCTCGCCCAACATCGTTGCCGATATCGTTCATAGTCAGATACA
AAATATATTCCATGAGAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGACGATCTCCACCATAACCAACAACCCACAGATCCACCATTTCCCAATCCCAACCAATCTCACTCCTTCAGATTCCGCCCTTCTGCGATTCACTTTTCATCCAT
CCTCATTCTCCTTCTAGCAATTTCCTTCTTCAGTTTCTCCAAAACAGATTTCTTCAAAACCCATTCCTTAAAACTCACCCGTCTTCTCAAAAATTCTAACCAAACCCCAT
TTCCTAATCCCTTCTGTGTTCTTTGGATGGCTCCATTTGTTTCTGGTGGTGGCTACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCATGATCATATAACAAACCCT
GGATTTCGATTGGCCATTCAGCAACATGGCGATTTAGAATCCATCGACTTTTGGGAGGGCTTACCGGATTCTATCAGGAATCTGGCCATTGAACTTCACAGAACAAAATG
TAGAATGAATGAAACCGTTGTGATTTGCCACAGTGAACCGGGTGCGTGGAATCCTCCTTTGTTTGAAACTTTGCCTTGCCCACCAGGTGCTTACCAAAATTTCAAGTCAG
TGATTGGTAGAACAATGTTTGAAACTGATAGGGTAAGTCAAGAACATGTGAATCGATGTAATAGAATGGATTACGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTT
GTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCTATTGATGTGAATTTCTTTGATCCATTGAAATACAAACCATTTAGTCTTGAATCTGTAGGAAC
ATTAGTTTTAGGAGCCAAAAACTTGGAAGTAAGCTTAGAGAAGAAGGGATTTGTGTTTCTGAGTATCTTTAAATGGGAATTCAGGAAAGGTTGGGATTTGTTGTTGGAAG
CATATTTGAGAGAATTTTGTAAGAAAGATGGAGTTGGGTTGTTTTTGTTGACAAATCCTTACCATACTGATAGTGATTTTGGGAATAAGATTTTGGATTTTGTAGAAAAT
TCAGACTTACAAATGCCACTTTCTGGTTGGGCTCCTGTTTATGTGGTTGATATTCATATACCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTACT
TCCATCAAGAGGAGAAGGGTGGGGGAGGCCGCTCGTTGAAGCAATGGCGATGTCGTTGCCAGTGATTGCAACCAACTGGTCGGGGCAAACGGAGTTTTTGACCGATGAGA
ATAGCTATCCGTTGCCCGTTGAGAGAATGAGTGAAGTAAAGGAAGGGCCATTTGAAGGGCATCTGTGGGCTGAACCATCCATCAGTATACTTCAAGTTCTAATGAGGGAA
GTAACAACTAATGTTGATGAAGCTAAGGCTAAAGGACGACGGGCAAGGGAGGACATGGTAAGCCGATTCTCGCCCAACATCGTTGCCGATATCGTTCATAGTCAGATACA
AAATATATTCCATGAGAAGAGATGA
Protein sequenceShow/hide protein sequence
MDDDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNP
GFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTF
VKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVEN
SDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMRE
VTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR