CuGenDBv2

Cucsat.G7775 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G7775
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: O-fucosyltransferase family protein
Genome location: ctg1556:843006..847425
RNA-Seq Expression: Cucsat.G7775
Synteny: Cucsat.G7775
Gene Ontology terms:
  GO:0006004 - fucose metabolic process (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0016740 - transferase activity (molecular function)
InterPro domains: NA
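
The genome location field packs a contig name and a coordinate range into one string. The sketch below shows one way to split it and compute the locus span. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page; the function name `parse_location` is hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("ctg1556:843006..847425")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus covers 4420 bp of contig ctg1556.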


Homology
GenBank top hits (e value, % identity, alignment)
XP_004144331.1 uncharacterized protein LOC101219097 [Cucumis sativus] | e value: 8.32e-300 | % identity: 100
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVSR
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVSR
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVSR

Query:  IELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRP
        IELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRP
Subjt:  IELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRP

Query:  EFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPV
        EFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPV
Subjt:  EFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPV

Query:  YTDEERR
        YTDEERR
Subjt:  YTDEERR

XP_008455718.1 PREDICTED: uncharacterized protein LOC103495824 [Cucumis melo] | e value: 6.70e-288 | % identity: 96.57
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPK RSPLIFFFVSL+AIAFLFLFSSLISTNGSSSFPSSNSIQKIFR KNLTQKQRR RHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLST MKLGARAV HVE+VS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        RIELRDSS YSNLLLINRTASPLSWFMECKDRNN SAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIP
        PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVV+NNYQLFMIERLIMAGAKT IRTFKEDDTDLSLTDDPKKNTK WQIP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIP

Query:  VYTDEERR
        VYTDEERR
Subjt:  VYTDEERR

XP_022967807.1 uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima] | e value: 6.35e-266 | % identity: 89.19
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGH
        MA  +TQK KPKPRSP +FFFV+L+ IAFLFLFSSLISTNG SSSFPSSNSI++IFR KNL QKQRRNRH FS NDKFLYWGNRIDCPGKHCESCEGLGH
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGH

Query:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKV
        QESSLRCALEEAMFLQR FVMPSRMCINPIHNKKG+LHQS N+SSEE WE NSCAMDSLYDMDLISDTVPVILDNSK WYQV STGMKLG+R V HV++V
Subjt:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKV

Query:  SRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
        SRIELRD SRYSNLLLINRTASPLSWFMECKDRNN SA++LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Subjt:  SRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT

Query:  RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQI
        RPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVV+NNYQLFMIERLIMAGAKT IRTFKEDDTDLSLTDDPKKNTK WQ 
Subjt:  RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQI

Query:  PVYTDEE
        P+YTD+E
Subjt:  PVYTDEE

XP_023545162.1 uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 1.33e-265 | % identity: 88.73
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNG--SSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLG
        MA  +TQK KPKPRSP +FFFV+L+ IAFLFLFSSLISTNG  SSSFPSSNSI++IFR KNL QKQRRNRH FS NDKFLYWGNRIDCPGKHCESCEGLG
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNG--SSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLG

Query:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEK
        HQESSLRCALEEAMFLQR FVMPSRMCINPIHNKKG+LHQS N+SSEE WE NSCAMDSLYDMDLISDTVPVILDNSK WYQV STGMKLG+R V HV++
Subjt:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEK

Query:  VSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
        VSRIELRD SRYSNLLLINRTASPLSWFMECKDRNN SA++LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Subjt:  VSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD

Query:  TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQ
        TRPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVV+NNYQLFMIERL+MAGAKT IRTFKEDDTDLSLTDDPKKNTK WQ
Subjt:  TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQ

Query:  IPVYTDEE
         P+YTD+E
Subjt:  IPVYTDEE

XP_038881641.1 uncharacterized protein LOC120073097 [Benincasa hispida] | e value: 7.97e-278 | % identity: 92.91
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPKPRSPLIFFFV+L+AIAFLFLFSSL+STNG+SSF SSNSIQKIFR KNLTQKQRRNRH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS
        ESSLRCALEEAMFLQR FVMPSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLSTGMKLGARAV HVE+VS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        R+ELRD+SRYS+LLLINRTASPLSWFMECKDRNN SA++LPYKFLPSMAAEN+RDAAEKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIP
        PEFMLKRIAKWVPAGRTLFIASNER PGFFSPLS RYKLAYS NYS ILDPVV+NNYQLFMIERLIMAGAKT IRTFKEDDTDLSLTDDPKKNTK WQIP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIP

Query:  VYT-DEERR
        VYT DEERR
Subjt:  VYT-DEERR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L0X3 Uncharacterized protein | e value: 4.03e-300 | % identity: 100
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVSR
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVSR
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVSR

Query:  IELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRP
        IELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRP
Subjt:  IELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRP

Query:  EFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPV
        EFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPV
Subjt:  EFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPV

Query:  YTDEERR
        YTDEERR
Subjt:  YTDEERR

A0A1S3C1H9 O-fucosyltransferase family protein | e value: 3.25e-288 | % identity: 96.57
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPK RSPLIFFFVSL+AIAFLFLFSSLISTNGSSSFPSSNSIQKIFR KNLTQKQRR RHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLST MKLGARAV HVE+VS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        RIELRDSS YSNLLLINRTASPLSWFMECKDRNN SAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIP
        PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVV+NNYQLFMIERLIMAGAKT IRTFKEDDTDLSLTDDPKKNTK WQIP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIP

Query:  VYTDEERR
        VYTDEERR
Subjt:  VYTDEERR

A0A5D3DWC1 O-fucosyltransferase family protein | e value: 3.25e-288 | % identity: 96.57
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPK RSPLIFFFVSL+AIAFLFLFSSLISTNGSSSFPSSNSIQKIFR KNLTQKQRR RHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLST MKLGARAV HVE+VS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        RIELRDSS YSNLLLINRTASPLSWFMECKDRNN SAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIP
        PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVV+NNYQLFMIERLIMAGAKT IRTFKEDDTDLSLTDDPKKNTK WQIP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIP

Query:  VYTDEERR
        VYTDEERR
Subjt:  VYTDEERR

A0A6J1DKB9 O-fucosyltransferase family protein | e value: 1.30e-265 | % identity: 88.29
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSS--FPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLG
        MAFPR QK KPKPRSPL FFFV+L+AIAFLFLFSSLISTNG+SS  F SSNSIQKIFR  N+ +K +RNRH FS NDKFLYWGNRIDCPGKHCESCEGLG
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSS--FPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLG

Query:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNS-SSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEK
        HQESSLRCALEEAMFL+R FVMPSRMCINPIHNKKG+LHQSN+ SSEESWEA SCAMDSLYD+DLISDTVPVILDNSK WYQVLSTGMKLGARAV HVE+
Subjt:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNS-SSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEK

Query:  VSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
        VSR EL+D++RYSNLLLINRTASPLSWFMECKDRNN SA++LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Subjt:  VSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD

Query:  TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQ
        TRPEFMLKR+AKWV  GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDP+V+NNYQLFMIERLIMAGAKT IRTFKEDDTDLSLTDDPKKNTK WQ
Subjt:  TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQ

Query:  IPVYTDEERR
         PVYTD+E +
Subjt:  IPVYTDEERR

A0A6J1HRU4 O-fucosyltransferase family protein | e value: 3.08e-266 | % identity: 89.19
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGH
        MA  +TQK KPKPRSP +FFFV+L+ IAFLFLFSSLISTNG SSSFPSSNSI++IFR KNL QKQRRNRH FS NDKFLYWGNRIDCPGKHCESCEGLGH
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGH

Query:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKV
        QESSLRCALEEAMFLQR FVMPSRMCINPIHNKKG+LHQS N+SSEE WE NSCAMDSLYDMDLISDTVPVILDNSK WYQV STGMKLG+R V HV++V
Subjt:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKV

Query:  SRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
        SRIELRD SRYSNLLLINRTASPLSWFMECKDRNN SA++LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Subjt:  SRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT

Query:  RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQI
        RPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVV+NNYQLFMIERLIMAGAKT IRTFKEDDTDLSLTDDPKKNTK WQ 
Subjt:  RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQI

Query:  PVYTDEE
        P+YTD+E
Subjt:  PVYTDEE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G04280.1 unknown protein | e value: 4.4e-18 | % identity: 24.52
Query:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARA
        + C+ + H   S  CAL EA +L RT VM   +C++ I+   G        +EE  +         +D + + +   V LD ++ W Q      K   R 
Subjt:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARA

Query:  VGHVEKVSRIELRDSSRYSNLLLINRTAS--PLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLG-DYDAIHVRRGDKIKTRKDRFGVDR
          H+ +  R+     +   + L++ +  S  P +++    + +  S V  P+  L    +  L +    I   L  DYDA+H+ RG+K +        ++
Subjt:  VGHVEKVSRIELRDSSRYSNLLLINRTAS--PLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLG-DYDAIHVRRGDKIKTRKDRFGVDR

Query:  SLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILD----------------PVVQNNYQLFMIERLIMAGAKTL
         + P+L+ DT P  +L  +   V  GR L+IA+NE    FF+PL  +Y   +  +Y D+ D                PV  + Y    ++  +    K  
Subjt:  SLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILD----------------PVVQNNYQLFMIERLIMAGAKTL

Query:  IRTFKEDDTD
        I TF +   D
Subjt:  IRTFKEDDTD

AT2G41150.1 unknown protein | e value: 5.6e-82 | % identity: 60.08
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        M+ P   K  P  +  ++   V   A+AFL LF+S+IST G  + P   ++   F       +  RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSS-EESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS
        ESSLRCALEEAMFL RTFVMPSRMCINPIHNKKG+L++SN+ + EESWE +SCAM+SLYD+DLIS+ +PVILD+S++W+ +LST MKL  R   HV   +
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSS-EESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGL
        R EL DSS ++NLLLINRTASPL+WF+ECKDR N S VMLPY FL +MAA  LRDAAEK+K L
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGL

AT2G41150.2 unknown protein | e value: 4.5e-148 | % identity: 65.02
Query:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        M+ P   K  P  +  ++   V   A+AFL LF+S+IST G  + P   ++   F       +  RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSS-EESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS
        ESSLRCALEEAMFL RTFVMPSRMCINPIHNKKG+L++SN+ + EESWE +SCAM+SLYD+DLIS+ +PVILD+S++W+ +LST MKL  R   HV   +
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSS-EESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        R EL DSS ++NLLLINRTASPL+WF+ECKDR N S VMLPY FL +MAA  LRDAAEKIK  LGDYDAIHVRRGDK+KTRKDRF V+RS  PHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIP
        PEF++ RI K +P GRTLFI SNER P FFSPL+ RYK+AYSSN+S+ILDP+++NNYQLFM+ERLIM GAKT  +TF+E +TDL+LTDDPKKN K W+IP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIP

Query:  VYTDEE
        VYT +E
Subjt:  VYTDEE

AT3G56750.1 unknown protein | e value: 2.8e-150 | % identity: 66.09
Query:  RTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL
        + Q+ KP   S  +  F  +   +FL LFSS+IST G    P   ++   F    +   + + +H  S+++K+LYWGNRIDCPGK+CE+C GLGHQESSL
Subjt:  RTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL

Query:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVSRIEL
        RCALEEAMFL RTFVMPS MCINPIHNKKG+L++S N ++EE W  +SCAMDSLYD+DLIS+ +PVILD+SK+W+ VLST MKLG R + HV  V+R  L
Subjt:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVSRIEL

Query:  RDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFM
        ++ S YSNLL+INRTASPL+WF+ECKDR+N SAVMLPY FLP+MAA  LR+AAEKIK  LGDYDAIHVRRGDK+KTRKDRFGV+R   PHLDRDTRPEF+
Subjt:  RDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFM

Query:  LKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPVYTD
        L+RI K +P GRTLFI SNER PGFFSPL+ RYKLAYSSN+S+ILDP+++NNYQLFM+ERL+M GAKT  +TFKE +TDL+LTDDPKKN K W+IPVYT 
Subjt:  LKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPVYTD

Query:  EERR
        +ERR
Subjt:  EERR

AT4G12700.1 unknown protein | e value: 8.8e-19 | % identity: 24.84
Query:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARA
        + C+ + H   S  CAL EA +L RT VM   +C++ ++   G   Q+    +  +  +    + L +   + D V    D  K WY+    G+KL    
Subjt:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARA

Query:  VGHVEKVSRIELRDSSRYSNLLLINR--TASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLG-DYDAIHVRRGDKIKTRKDRFGVDR
           V  +  ++++D+      L++ +  T  P +++    +    S V  P+  L    ++ L +    I   L  DYDAIH+ RGDK +        ++
Subjt:  VGHVEKVSRIELRDSSRYSNLLLINR--TASPLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLG-DYDAIHVRRGDKIKTRKDRFGVDR

Query:  SLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILD----------------PVVQNNYQLFMIERLIMAGAKTL
         + P+L++DT P  +L  +   +  GR L+IA+NE    FF+PL  +YK  +   + D+ D                PV  + Y    ++  +    K  
Subjt:  SLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILD----------------PVVQNNYQLFMIERLIMAGAKTL

Query:  IRTFKEDDTD
        I TF +   D
Subjt:  IRTFKEDDTD
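
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single alignment block, using the last block of the AT4G12700.1 hit as input. The function name `midline_stats` is hypothetical. The percentages reported on this page are computed over the whole alignment, so a single block will not reproduce them.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned columns) for one alignment block."""
    # Trailing spaces on the midline are often lost in plain text, so pad it
    # back out to the width of the query line.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    positives = sum(ch != " " for ch in midline)      # letters plus '+' columns
    return identities, positives, len(query)


identities, positives, columns = midline_stats("IRTFKEDDTD", "I TF +   D")
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
print(f"{100 * identities / columns:.1f}% identity in this block")
```

Summing the three counts over every block of a hit and dividing at the end gives a whole-alignment figure that can be compared with the % identity column.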


Sequences
CDS sequence
ATGGCATTTCCCAGAACCCAGAAGCCAAAACCCAAACCCAGATCCCCACTCATCTTCTTCTTTGTTTCCCTTTCCGCCATTGCCTTTCTTTTCCTCTTTTCTTCTCTGAT
TTCTACCAATGGGTCTTCTTCTTTTCCATCCTCGAACTCAATTCAGAAAATCTTCAGATTAAAAAATCTGACCCAGAAACAGAGACGTAATCGGCATTTTTTTAGTGTAA
ATGACAAGTTCTTGTACTGGGGAAACCGAATCGACTGCCCGGGGAAGCATTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAGGAA
GCCATGTTCCTCCAAAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACATAATAAGAAAGGCCTTCTTCATCAGTCCAATTCAAGCTCAGAGGAAAGTTG
GGAAGCAAACTCTTGTGCCATGGACTCTTTGTACGATATGGACCTTATATCTGACACCGTACCAGTGATTTTAGACAACTCAAAATCATGGTATCAGGTACTGTCAACTG
GTATGAAATTAGGAGCTAGAGCAGTTGGCCATGTAGAGAAAGTCAGTCGTATTGAACTGAGAGACAGCAGCCGCTACTCCAATCTTTTGCTAATAAATCGAACTGCCAGC
CCTCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCATAGTGCCGTAATGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGGGATGCAGCTGA
GAAGATTAAAGGGCTACTCGGTGATTATGATGCCATCCATGTTCGTCGCGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATC
TTGACAGGGATACACGGCCGGAGTTTATGCTGAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACTCTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTCTCA
CCCCTCTCTGCTCGGTACAAGTTGGCTTATTCCTCGAACTATAGCGATATTCTGGATCCTGTGGTTCAGAACAACTACCAATTGTTCATGATCGAAAGGCTCATTATGGC
AGGTGCCAAGACATTGATCAGAACATTTAAAGAAGATGATACAGATCTAAGCCTCACCGACGACCCAAAGAAGAACACAAAAGCATGGCAAATACCTGTCTACACTGATG
AAGAAAGAAGGTGA
mRNA sequence
ATGGCATTTCCCAGAACCCAGAAGCCAAAACCCAAACCCAGATCCCCACTCATCTTCTTCTTTGTTTCCCTTTCCGCCATTGCCTTTCTTTTCCTCTTTTCTTCTCTGAT
TTCTACCAATGGGTCTTCTTCTTTTCCATCCTCGAACTCAATTCAGAAAATCTTCAGATTAAAAAATCTGACCCAGAAACAGAGACGTAATCGGCATTTTTTTAGTGTAA
ATGACAAGTTCTTGTACTGGGGAAACCGAATCGACTGCCCGGGGAAGCATTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAGGAA
GCCATGTTCCTCCAAAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACATAATAAGAAAGGCCTTCTTCATCAGTCCAATTCAAGCTCAGAGGAAAGTTG
GGAAGCAAACTCTTGTGCCATGGACTCTTTGTACGATATGGACCTTATATCTGACACCGTACCAGTGATTTTAGACAACTCAAAATCATGGTATCAGGTACTGTCAACTG
GTATGAAATTAGGAGCTAGAGCAGTTGGCCATGTAGAGAAAGTCAGTCGTATTGAACTGAGAGACAGCAGCCGCTACTCCAATCTTTTGCTAATAAATCGAACTGCCAGC
CCTCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCATAGTGCCGTAATGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGGGATGCAGCTGA
GAAGATTAAAGGGCTACTCGGTGATTATGATGCCATCCATGTTCGTCGCGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATC
TTGACAGGGATACACGGCCGGAGTTTATGCTGAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACTCTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTCTCA
CCCCTCTCTGCTCGGTACAAGTTGGCTTATTCCTCGAACTATAGCGATATTCTGGATCCTGTGGTTCAGAACAACTACCAATTGTTCATGATCGAAAGGCTCATTATGGC
AGGTGCCAAGACATTGATCAGAACATTTAAAGAAGATGATACAGATCTAAGCCTCACCGACGACCCAAAGAAGAACACAAAAGCATGGCAAATACCTGTCTACACTGATG
AAGAAAGAAGGTGA
Protein sequence
MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEE
AMFLQRTFVMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQVLSTGMKLGARAVGHVEKVSRIELRDSSRYSNLLLINRTAS
PLSWFMECKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFS
PLSARYKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPVYTDEERR
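
The protein sequence should be the translation of the CDS under the standard genetic code. The sketch below is a minimal consistency check, assuming the standard code applies to this nuclear gene. It builds the codon table from the conventional TCAG ordering and translates two short in-frame fragments copied from the first and last codons of the CDS above; the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGCATTTCCCAGAACCCAGAAG"  # first 8 codons of the CDS
cds_end = "TACACTGATGAAGAAAGAAGGTGA"    # last 8 codons, including the stop
print(translate(cds_start))  # MAFPRTQK
print(translate(cds_end))    # YTDEERR
```

Both fragments agree with the start and end of the protein sequence listed above. Passing the full CDS to `translate` would extend the same check to the whole record.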