; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1441 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1441
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionGlycos_transf_1 domain-containing protein
Genome locationMC09:20430354..20431766
RNA-Seq ExpressionMC09g1441
SyntenyMC09g1441
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012958.1 hypothetical protein SDJN02_25712, partial [Cucurbita argyrosperma subsp. argyrosperma]5.31e-27578.67Show/hide
Query:  PQPPNPHSLKFRSSAIS--------ILILLIAISFFTFTKTDHHKTQSLK-LLQKFIKFLN----PPPNSIPIPPPTP--CVLWMAPFLSGGGYSSEAWS
        P P + HSLK R S+ S        ILILL++IS F FTKTDH K+QSLK L Q+ I  LN    P   S+P P  T   CVLWMAPFLSGGGYSSEAWS
Subjt:  PQPPNPHSLKFRSSAIS--------ILILLIAISFFTFTKTDHHKTQSLK-LLQKFIKFLN----PPPNSIPIPPPTP--CVLWMAPFLSGGGYSSEAWS

Query:  YILALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNP
        YILALH H++ P+ FRLAIEQHGDLESIDFWEGLPDSV++LAI+LH T CR+NET+V+CHSEPGAWNPPLFET PCPPGVYQ FK+VIGRTMFETDRV+ 
Subjt:  YILALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNP

Query:  EHVNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDME-MGLDNGFVFLSIFKWEFRKGWDLLLEAYLK
        EHVNRC  MD++WVPSEFHVSTFVKSGVDPSK+VKIVQPIDVNFFDPL Y PFSL S+GTLVLG K+ME + L+ GFVFLSIFKWEFRKGWDLLLEAYLK
Subjt:  EHVNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDME-MGLDNGFVFLSIFKWEFRKGWDLLLEAYLK

Query:  EFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQT
        EFSK D V LFLLTNPYH+D DFGNKILDFVE+S IQ+PASGWAPV+V+DTHIAQTDLPR+YKAADAFVLPSRGEGWGRPLVEAM+MSLPVIATNWSGQT
Subjt:  EFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQT

Query:  EFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVF
        EFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSI KL+ LMREV TNVDEAK KGR AR+DMVR+FSPD+VA+IV  HIQ +F
Subjt:  EFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVF

XP_022150540.1 uncharacterized protein LOC111018657 [Momordica charantia]0.0100Show/hide
Query:  PQPPNPHSLKFRSSAISILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAPHEF
        PQPPNPHSLKFRSSAISILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAPHEF
Subjt:  PQPPNPHSLKFRSSAISILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAPHEF

Query:  RLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYIWVP
        RLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYIWVP
Subjt:  RLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYIWVP

Query:  SEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNP
        SEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNP
Subjt:  SEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNP

Query:  YHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLAVERM
        YHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLAVERM
Subjt:  YHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLAVERM

Query:  SEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR
        SEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR
Subjt:  SEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR

XP_022968340.1 uncharacterized protein LOC111467605 [Cucurbita maxima]4.45e-27878.67Show/hide
Query:  PQPPNPHSLKFRSSAI------SILILLIAISFFTFTKTDHHKTQSLK-LLQKFIKFLNPPPN------SIPIPPPTPCVLWMAPFLSGGGYSSEAWSYI
        P P    SLK R S++      SILILL++IS FTFTKTDH K+QSLK L QK I  LN   N      S P    + CVLWMAPFLSGGGYSSEAWSYI
Subjt:  PQPPNPHSLKFRSSAI------SILILLIAISFFTFTKTDHHKTQSLK-LLQKFIKFLNPPPN------SIPIPPPTPCVLWMAPFLSGGGYSSEAWSYI

Query:  LALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEH
        LALH H++ P+ FRLAIEQHGDLES+DFWEGLPDSV++LAI+LH T CR+NET+V+CHSEPGAWNPPLFET PCPPGVYQ FK+VIGRTMFETDRV+ EH
Subjt:  LALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEH

Query:  VNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDM-EMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEF
        VNRC  MD++WVPSEFHVSTFVKSGVDPSK+VKIVQPIDVNFFDPL Y PFSL S+GTLVLG K+M E+ L+ GFVFLSIFKWEFRKGWDLLLEAYLKEF
Subjt:  VNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDM-EMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEF

Query:  SKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEF
        SK D VGLFLLTNPYH+D DFGNKILDFVENS IQKP SGWAPVYV+DTHIAQTDLP++YKAADAFVLPSRGEGWGRPLVEAM+MSLPVIATNWSGQTEF
Subjt:  SKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEF

Query:  LTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHD
        LTDENSYPLAVE+MSEVKEGPFKGHLWAEPSI KL+ LMREV TNVDEAKAKGR AR+DMVR+FSPD+VA+IV+ HIQ +F +
Subjt:  LTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHD

XP_023542823.1 uncharacterized protein LOC111802622 [Cucurbita pepo subsp. pepo]5.79e-27778.19Show/hide
Query:  PQPPNPHSLKFRSSAIS--------ILILLIAISFFTFTKTDHHKTQSLK-LLQKFIKFLN----PPPNSIPIPPPTP--CVLWMAPFLSGGGYSSEAWS
        P P + HSLK R S+ S        ILILL++IS F FTKTDH K+QSLK L Q+ I  LN    P   S+P P  T   CVLWMAPFLSGGGYSSEAWS
Subjt:  PQPPNPHSLKFRSSAIS--------ILILLIAISFFTFTKTDHHKTQSLK-LLQKFIKFLN----PPPNSIPIPPPTP--CVLWMAPFLSGGGYSSEAWS

Query:  YILALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNP
        YILALH H++ P+ FRLAIEQHGDLESIDFWEGLPDSV++LAI+LH T CR+NET+V+CHSEPGAWNPPLFET PCPPGVYQ FK+VIGRTMFETDRV+ 
Subjt:  YILALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNP

Query:  EHVNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDME--MGLDNGFVFLSIFKWEFRKGWDLLLEAYL
        EHVNRC  MD++WVPSEFHVSTFVKSGVDPSK+VKIVQPIDVNFFDPL Y PFSL S+GTLVLG K+ME  + L+ GFVFLSIFKWEFRKGWDLLLEAYL
Subjt:  EHVNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDME--MGLDNGFVFLSIFKWEFRKGWDLLLEAYL

Query:  KEFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQ
        KEFSK D VGLFLLTNPYH+D DFGNKILDFVE+S IQ+PASGWAPV+V+DTHIAQTDLPR+YKAADAFVLPSRGEGWGRPLVEAM+M+LPVIATNWSGQ
Subjt:  KEFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQ

Query:  TEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHD
        TEFLTDENSYPL VERMSEVKEGPFKGHLWAEPSI KL+ LMREV TNVDEAK KGR AR+DMVR+FSPD+VA+IV+ HIQ +FH+
Subjt:  TEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHD

XP_038891322.1 uncharacterized protein LOC120080769 [Benincasa hispida]3.53e-27579.83Show/hide
Query:  PQPPNPHSLKFRSSAI---SILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAP
        P P   HS KFR SAI   SILILL+AISFFT  KTD +KTQSLKL       L    N  P+P P+ CVLWMAPF+SGGGYSSEAWSYILAL  HI  P
Subjt:  PQPPNPHSLKFRSSAI---SILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAP

Query:  HEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYI
          FRLAI+QHGDLESIDFWEGLPDS+++LAI+LH T CRMNETVVICHSEPGAWNPPLFETLPCPPG YQ FK+VIGRTMFETDRV+ EHVNRC  MDY+
Subjt:  HEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYI

Query:  WVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDME-MGLDN-GFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLF
        WVPSEFHVSTFVKSGVDPSKIVK+VQPIDVNFFDPLKY+PFSL S+GTLVLG K++E +  +  GFVFLSIFKWEFRKGWDLLLEAYL+EF KKD+V  F
Subjt:  WVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDME-MGLDN-GFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLF

Query:  LLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPL
        LLTNPYH+D DFGNKILDFVEN D+Q P SGWAPVYVID HIAQTDLPR+YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPL
Subjt:  LLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPL

Query:  AVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR
         VERMSEVKEGPFKGH+WAEPSI KLQ LMREVTTNVDEAK KG+ AR+DMV +FSP IVADIV+  IQN+FH+KR
Subjt:  AVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR

TrEMBL top hitse value%identityAlignment
A0A0A0KTD9 Glycos_transf_1 domain-containing protein2.94e-27378.2Show/hide
Query:  PQPPNPHSLKFRSSAI---SILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAP
        P P  PH  KF  S I   SILILL+AISFF F KT+ +K+QS KL    +KF N PP   P+     CVLWMAPFLSGGGYSSEAWSYILAL HHI  P
Subjt:  PQPPNPHSLKFRSSAI---SILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAP

Query:  HEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYI
          FRL I  HGDLES+DFWEGLP+SVR+LAI+LH T CRMNETVVICHSEPGAWNPPLFETLPCPPG YQKFK+VIGRTMFETDRV  EHVNRC +MDY+
Subjt:  HEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYI

Query:  WVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLD---NGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGL
        WVPSEFHVSTFV+SGVDPSKIVK+VQP+DVNFFDPLKY+P SL S+GTLVLG K+ E  +      FVFLSIFKWEFRKGWD+LLEAYLKEFSKKD+VGL
Subjt:  WVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLD---NGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGL

Query:  FLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYP
        FLLTNPYH+D DFGNKILDFVENSD+Q P SGWAPVYV+D HI QTDLPR+YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYP
Subjt:  FLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYP

Query:  LAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR
        L VERMSEVKE PFKGH+WAEPSI KLQ LMREVT NVDEAK KGR AR DM+ +FSPDIVADIV+  I+N+FH+KR
Subjt:  LAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR

A0A5D3CDB1 Group 1 family glycosyltransferase6.56e-27077.36Show/hide
Query:  PQPPNPHSLKFRSSAI---SILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAP
        P P  PH  K   S I   SILILL+AISFF F KT+ +K+QS KL    +K  N PP   P      CVLWMAPFLSGGGYSSEAWSYILAL HHI  P
Subjt:  PQPPNPHSLKFRSSAI---SILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAP

Query:  HEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYI
          FRL I QHGDLES+DFWEGLP+SVR+LAI+LH T CRMNETVVICHSEPGAWNPPLFETLPCPPG Y+KFK+VIGRTMFETDRV  EHVNRC +MDY+
Subjt:  HEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYI

Query:  WVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDME---MGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGL
        WVPSEFHVSTFV+SGVDPSKIVK+VQP+DVNFFDPLKY+PFSL S+GTLVLG  + E   +     FVFLSIFKWEFRKGWDLLLEAYLKEFSKKD+VGL
Subjt:  WVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDME---MGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGL

Query:  FLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYP
        FLLTNPYH++ DFGNKILDFVENSD+Q P SGWAPVYV+D HI QTDLPR+YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTDENSYP
Subjt:  FLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYP

Query:  LAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR
        L VERMSEVKE PFKGH+WAEPSI KLQ LMREVT NV+EAK KGR AR+DM+ +FSPDIVADIV+  I+N+FH+KR
Subjt:  LAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR

A0A6J1D8R8 uncharacterized protein LOC1110186570.0100Show/hide
Query:  PQPPNPHSLKFRSSAISILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAPHEF
        PQPPNPHSLKFRSSAISILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAPHEF
Subjt:  PQPPNPHSLKFRSSAISILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAPHEF

Query:  RLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYIWVP
        RLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYIWVP
Subjt:  RLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYIWVP

Query:  SEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNP
        SEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNP
Subjt:  SEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNP

Query:  YHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLAVERM
        YHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLAVERM
Subjt:  YHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLAVERM

Query:  SEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR
        SEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR
Subjt:  SEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDKR

A0A6J1G004 uncharacterized protein LOC1114494312.11e-27478.26Show/hide
Query:  PQPPNPHSLKFRSSAIS--------ILILLIAISFFTFTKTDHHKTQSLK-LLQKFIKFLN----PPPNSIPIPPPTP--CVLWMAPFLSGGGYSSEAWS
        P P + HSLK R S+ S        ILILL++IS F FTK DH K+QSLK L Q+ I  LN    P   S+P P  T   CVLWMAPFLSGGGYSSEAWS
Subjt:  PQPPNPHSLKFRSSAIS--------ILILLIAISFFTFTKTDHHKTQSLK-LLQKFIKFLN----PPPNSIPIPPPTP--CVLWMAPFLSGGGYSSEAWS

Query:  YILALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNP
        YILALH H++ P+ FRLAIEQHGDLESIDFWEGLPDSV++LAI+LH T CR+NET+V+CHSEPGAWNPPLFET PCPPGVYQ FK+VIGRTMFETDRV+ 
Subjt:  YILALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNP

Query:  EHVNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDME-MGLDNGFVFLSIFKWEFRKGWDLLLEAYLK
        EHVNRC  MD++WVPSEFHVSTFVKSGVDPSK+VKIVQP+DVNFFDPL Y PFSL S+GTLVLG K+ME + L+ GFVFLSIFKWEFRKGWDLLLEAYLK
Subjt:  EHVNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDME-MGLDNGFVFLSIFKWEFRKGWDLLLEAYLK

Query:  EFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQT
        EFSK D V LFLLTNPYH+D DFGNKILDFVE+S IQ+PASGWAPV+V+DTHIAQTDLPR+YKAADAFVLPSRGEGWGRPLVEAM+MSLPVIATNWSGQT
Subjt:  EFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQT

Query:  EFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVF
        EFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSI KL+ LMREV TNVDEAK KGR AR+DMVR+FSPD+VA+IV  HIQ +F
Subjt:  EFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVF

A0A6J1HWY2 uncharacterized protein LOC1114676052.15e-27878.67Show/hide
Query:  PQPPNPHSLKFRSSAI------SILILLIAISFFTFTKTDHHKTQSLK-LLQKFIKFLNPPPN------SIPIPPPTPCVLWMAPFLSGGGYSSEAWSYI
        P P    SLK R S++      SILILL++IS FTFTKTDH K+QSLK L QK I  LN   N      S P    + CVLWMAPFLSGGGYSSEAWSYI
Subjt:  PQPPNPHSLKFRSSAI------SILILLIAISFFTFTKTDHHKTQSLK-LLQKFIKFLNPPPN------SIPIPPPTPCVLWMAPFLSGGGYSSEAWSYI

Query:  LALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEH
        LALH H++ P+ FRLAIEQHGDLES+DFWEGLPDSV++LAI+LH T CR+NET+V+CHSEPGAWNPPLFET PCPPGVYQ FK+VIGRTMFETDRV+ EH
Subjt:  LALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEH

Query:  VNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDM-EMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEF
        VNRC  MD++WVPSEFHVSTFVKSGVDPSK+VKIVQPIDVNFFDPL Y PFSL S+GTLVLG K+M E+ L+ GFVFLSIFKWEFRKGWDLLLEAYLKEF
Subjt:  VNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDM-EMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEF

Query:  SKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEF
        SK D VGLFLLTNPYH+D DFGNKILDFVENS IQKP SGWAPVYV+DTHIAQTDLP++YKAADAFVLPSRGEGWGRPLVEAM+MSLPVIATNWSGQTEF
Subjt:  SKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEF

Query:  LTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHD
        LTDENSYPLAVE+MSEVKEGPFKGHLWAEPSI KL+ LMREV TNVDEAKAKGR AR+DMVR+FSPD+VA+IV+ HIQ +F +
Subjt:  LTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHD

SwissProt top hitse value%identityAlignment
A7TZT2 Mannosylfructose-phosphate synthase2.5e-0628.57Show/hide
Query:  GFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNPYHSDRD---FGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPS
        G V L++ +    KG+DLL++ +     ++ +  L L     + D       N++ + V++  ++   +          ++A  DLP IY+AAD FVL S
Subjt:  GFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNPYHSDRD---FGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPS

Query:  RGEGWGRPLVEAMAMSLPVIATNWSG
        R E +G   +EAMA   P + T   G
Subjt:  RGEGWGRPLVEAMAMSLPVIATNWSG

Q9R9N2 Lipopolysaccharide core biosynthesis mannosyltransferase LpsB4.1e-0425.93Show/hide
Query:  MGLDNGFVFLSIF-KWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFV
        +GLD    F+  F +   +KG DL +++ +     +   G  +          F +++ + V  + +         +  +  H   T++P  Y+A D FV
Subjt:  MGLDNGFVFLSIF-KWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFV

Query:  LPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT
         P R EG+G   +EAMA  +PV+AT+    +E +T
Subjt:  LPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT

Arabidopsis top hitse value%identityAlignment
AT1G52420.1 UDP-Glycosyltransferase superfamily protein1.9e-0433.75Show/hide
Query:  KILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ NS     +  W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  L V+ T+  G  E +
Subjt:  KILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL

AT3G10630.1 UDP-Glycosyltransferase superfamily protein4.8e-17861.68Show/hide
Query:  QPPNPHSLKFRSSAI----SILILLIAISFFTFTKTDHHKTQSLKL------LQKFIKFL------NPPPNSIPIPP--PTP-CVLWMAPFLSGGGYSSE
        QPP+     ++ S I    SIL LL++I    FT TD +K QSL+          +++FL       P   S  + P   TP CVLWMAPFLS GGYSSE
Subjt:  QPPNPHSLKFRSSAI----SILILLIAISFFTFTKTDHHKTQSLKL------LQKFIKFL------NPPPNSIPIPP--PTP-CVLWMAPFLSGGGYSSE

Query:  AWSYILALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDR
        AWSY+L+L +H+  P  FR+ IE HGDLES++FW GL    + +AI+++   CR NET+V+CHSEPGAW PPLFETLPCPP  Y+ F +VIGRTMFETDR
Subjt:  AWSYILALHHHIKAPHEFRLAIEQHGDLESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDR

Query:  VNPEHVNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLDNGFVFLSIFKWEFRKGWDLLLEAY
        VNPEHV RC  MD++WVP++FHVS+FV+SGVD SK+VKIVQP+DV FFDP KY+P  L ++G LVLGS     G+ NGFVFLS+FKWE RKGWD+LL+AY
Subjt:  VNPEHVNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVKIVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLDNGFVFLSIFKWEFRKGWDLLLEAY

Query:  LKEFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG
        L EFS +D V LFLLTN YHSD DFGNKILDFVE  +I++P +G+  VYVID HIAQ DLPR+YKAADAFVLP+RGEGWGRP+VEAMAMSLPVI TNWSG
Subjt:  LKEFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG

Query:  QTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDK
         TE+LT+ N YPL VE MSEVKEGPF+GH WAEPS+ KL+ LMR V +N DEAK KG+  RDDMV+ F+P++VA +V   I  +F +K
Subjt:  QTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGRWARDDMVRQFSPDIVADIVYHHIQNVFHDK

AT3G15940.1 UDP-Glycosyltransferase superfamily protein1.1e-0433.75Show/hide
Query:  KILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ N+     +  W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  LPV+ T+  G  E +
Subjt:  KILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL

AT3G15940.2 UDP-Glycosyltransferase superfamily protein1.1e-0433.75Show/hide
Query:  KILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL
        ++L F+ N+     +  W P        A T +  +Y AAD +V  S+G G  +GR  +EAMA  LPV+ T+  G  E +
Subjt:  KILDFVENSDIQKPASGWAPVYVIDTHIAQTDLPRIYKAADAFVLPSRGEG--WGRPLVEAMAMSLPVIATNWSGQTEFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CCACAACCTCCGAATCCTCACTCTCTCAAATTCCGATCTTCTGCAATCTCCATTCTCATTCTCCTCATCGCAATTTCCTTCTTCACTTTCACCAAAACAGATCATCACAA
AACCCAATCCCTAAAACTTCTCCAAAAATTCATCAAATTCCTCAATCCACCTCCAAATTCCATTCCAATCCCACCTCCCACTCCATGCGTCCTCTGGATGGCTCCATTTC
TCTCCGGCGGTGGGTACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCACCACCATATCAAAGCCCCACACGAGTTCCGGTTGGCGATTGAGCAACATGGCGATCTA
GAATCCATCGATTTTTGGGAGGGATTGCCAGATTCTGTCAGGAGTTTGGCCATTCAACTTCACGGAACAGATTGTAGAATGAATGAAACTGTTGTGATCTGCCACAGCGA
GCCCGGTGCCTGGAATCCTCCATTGTTTGAAACCCTGCCTTGCCCACCAGGTGTTTACCAAAAGTTCAAGGCAGTGATTGGCAGAACAATGTTTGAAACAGATAGGGTGA
ATCCAGAACATGTGAATCGCTGTAAGTTAATGGATTATATTTGGGTTCCTTCTGAATTTCATGTCTCTACATTCGTGAAAAGTGGGGTTGATCCTTCCAAGATTGTGAAA
ATTGTTCAACCCATTGATGTGAACTTCTTTGATCCTCTGAAATATAGGCCGTTTAGTCTTGCATCTTTAGGAACTCTGGTTTTAGGTTCCAAAGACATGGAAATGGGCTT
AGACAATGGATTTGTGTTTCTGAGTATCTTCAAGTGGGAGTTTAGGAAAGGCTGGGATCTGTTGTTGGAAGCTTATTTGAAAGAATTCTCTAAGAAGGATCAAGTGGGGT
TGTTCTTGTTGACAAATCCTTACCATAGCGATAGAGATTTTGGGAACAAGATTTTGGATTTTGTAGAGAACTCAGACATCCAAAAGCCAGCTTCTGGTTGGGCTCCTGTT
TATGTAATTGATACTCATATAGCTCAAACTGATTTGCCTAGAATTTACAAGGCTGCAGATGCGTTTGTTCTGCCATCGAGAGGCGAAGGGTGGGGGAGGCCGCTCGTTGA
AGCGATGGCCATGTCGTTGCCAGTGATTGCGACCAACTGGTCGGGGCAAACGGAGTTTTTGACGGACGAGAATAGCTATCCACTGGCAGTTGAGAGAATGAGTGAGGTAA
AGGAAGGTCCATTCAAAGGGCATCTGTGGGCCGAACCATCCATAGGGAAGCTTCAAGATTTGATGAGGGAAGTAACGACGAATGTTGACGAAGCCAAGGCCAAAGGACGA
TGGGCGAGGGACGACATGGTCAGGCAATTCTCTCCCGACATCGTAGCAGATATTGTTTATCATCATATACAAAATGTATTTCATGACAAGAGA
mRNA sequenceShow/hide mRNA sequence
CCACAACCTCCGAATCCTCACTCTCTCAAATTCCGATCTTCTGCAATCTCCATTCTCATTCTCCTCATCGCAATTTCCTTCTTCACTTTCACCAAAACAGATCATCACAA
AACCCAATCCCTAAAACTTCTCCAAAAATTCATCAAATTCCTCAATCCACCTCCAAATTCCATTCCAATCCCACCTCCCACTCCATGCGTCCTCTGGATGGCTCCATTTC
TCTCCGGCGGTGGGTACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCACCACCATATCAAAGCCCCACACGAGTTCCGGTTGGCGATTGAGCAACATGGCGATCTA
GAATCCATCGATTTTTGGGAGGGATTGCCAGATTCTGTCAGGAGTTTGGCCATTCAACTTCACGGAACAGATTGTAGAATGAATGAAACTGTTGTGATCTGCCACAGCGA
GCCCGGTGCCTGGAATCCTCCATTGTTTGAAACCCTGCCTTGCCCACCAGGTGTTTACCAAAAGTTCAAGGCAGTGATTGGCAGAACAATGTTTGAAACAGATAGGGTGA
ATCCAGAACATGTGAATCGCTGTAAGTTAATGGATTATATTTGGGTTCCTTCTGAATTTCATGTCTCTACATTCGTGAAAAGTGGGGTTGATCCTTCCAAGATTGTGAAA
ATTGTTCAACCCATTGATGTGAACTTCTTTGATCCTCTGAAATATAGGCCGTTTAGTCTTGCATCTTTAGGAACTCTGGTTTTAGGTTCCAAAGACATGGAAATGGGCTT
AGACAATGGATTTGTGTTTCTGAGTATCTTCAAGTGGGAGTTTAGGAAAGGCTGGGATCTGTTGTTGGAAGCTTATTTGAAAGAATTCTCTAAGAAGGATCAAGTGGGGT
TGTTCTTGTTGACAAATCCTTACCATAGCGATAGAGATTTTGGGAACAAGATTTTGGATTTTGTAGAGAACTCAGACATCCAAAAGCCAGCTTCTGGTTGGGCTCCTGTT
TATGTAATTGATACTCATATAGCTCAAACTGATTTGCCTAGAATTTACAAGGCTGCAGATGCGTTTGTTCTGCCATCGAGAGGCGAAGGGTGGGGGAGGCCGCTCGTTGA
AGCGATGGCCATGTCGTTGCCAGTGATTGCGACCAACTGGTCGGGGCAAACGGAGTTTTTGACGGACGAGAATAGCTATCCACTGGCAGTTGAGAGAATGAGTGAGGTAA
AGGAAGGTCCATTCAAAGGGCATCTGTGGGCCGAACCATCCATAGGGAAGCTTCAAGATTTGATGAGGGAAGTAACGACGAATGTTGACGAAGCCAAGGCCAAAGGACGA
TGGGCGAGGGACGACATGGTCAGGCAATTCTCTCCCGACATCGTAGCAGATATTGTTTATCATCATATACAAAATGTATTTCATGACAAGAGA
Protein sequenceShow/hide protein sequence
PQPPNPHSLKFRSSAISILILLIAISFFTFTKTDHHKTQSLKLLQKFIKFLNPPPNSIPIPPPTPCVLWMAPFLSGGGYSSEAWSYILALHHHIKAPHEFRLAIEQHGDL
ESIDFWEGLPDSVRSLAIQLHGTDCRMNETVVICHSEPGAWNPPLFETLPCPPGVYQKFKAVIGRTMFETDRVNPEHVNRCKLMDYIWVPSEFHVSTFVKSGVDPSKIVK
IVQPIDVNFFDPLKYRPFSLASLGTLVLGSKDMEMGLDNGFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDQVGLFLLTNPYHSDRDFGNKILDFVENSDIQKPASGWAPV
YVIDTHIAQTDLPRIYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLAVERMSEVKEGPFKGHLWAEPSIGKLQDLMREVTTNVDEAKAKGR
WARDDMVRQFSPDIVADIVYHHIQNVFHDKR