; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0000065 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0000065
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionO-fucosyltransferase family protein
Genome locationchr08:6200746..6204500
RNA-Seq ExpressionPI0000065
SyntenyPI0000065
Gene Ontology termsGO:0006004 - fucose metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR019378 - GDP-fucose protein O-fucosyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144331.1 uncharacterized protein LOC101219097 [Cucumis sativus]2.5e-22596.58Show/hide
Query:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPKPRSPLIFFFVSL+AIAFLFLFSSLISTNGSSSFPSSNSIQKIFR KNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLSTGMKLGARAV HVE+VS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        RIELRDSSRYSNLLLINRTASPLSWFMECKDRNN SA+MLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP
        PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLA+SSNYSDILDPVV+NNYQLFMIERLIMAGAKT IRTFKEDDTDLSLTDDPKKNTK WQIP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP

Query:  VYTPDEERR
        VYT DEERR
Subjt:  VYTPDEERR

XP_008455718.1 PREDICTED: uncharacterized protein LOC103495824 [Cucumis melo]8.4e-22998.04Show/hide
Query:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPK RSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRR RHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLST MKLGARAVAHVEQVS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        RIELRDSS YSNLLLINRTASPLSWFMECKDRNNRSA+MLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP
        PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLA+SSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP

Query:  VYTPDEERR
        VYT DEERR
Subjt:  VYTPDEERR

XP_022967807.1 uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima]2.4e-21591.91Show/hide
Query:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGH
        MA  +TQK KPKPRSP +FFFV+LA IAFLFLFSSLISTNG SSSFPSSNSI++IFRFKNL QKQRRNRH FS NDKFLYWGNRIDCPGKHCESCEGLGH
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGH

Query:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQV
        QESSLRCALEEAMFLQR FVMPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLWYQV STGMKLG+R VAHV+QV
Subjt:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQV

Query:  SRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
        SRIELRD SRYSNLLLINRTASPLSWFMECKDRNNRSAI+LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Subjt:  SRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT

Query:  RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQI
        RPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLA+SSNYS ILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQ 
Subjt:  RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQI

Query:  PVYTPDEE
        P+YT DEE
Subjt:  PVYTPDEE

XP_023545162.1 uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo]4.0e-21591.44Show/hide
Query:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNG--SSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLG
        MA  +TQK KPKPRSP +FFFV+LA IAFLFLFSSLISTNG  SSSFPSSNSI++IFRFKNL QKQRRNRH FS NDKFLYWGNRIDCPGKHCESCEGLG
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNG--SSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLG

Query:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQ
        HQESSLRCALEEAMFLQR FVMPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLWYQV STGMKLG+R VAHV+Q
Subjt:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQ

Query:  VSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
        VSRIELRD SRYSNLLLINRTASPLSWFMECKDRNNRSAI+LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Subjt:  VSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD

Query:  TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQ
        TRPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLA+SSNYS ILDPVVKNNYQLFMIERL+MAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQ
Subjt:  TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQ

Query:  IPVYTPDEE
         P+YT DEE
Subjt:  IPVYTPDEE

XP_038881641.1 uncharacterized protein LOC120073097 [Benincasa hispida]1.9e-22595.35Show/hide
Query:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPKPRSPLIFFFV+LAAIAFLFLFSSL+STNG+SSF SSNSIQKIFRFKNLTQKQRRNRH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQR FVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        R+ELRD+SRYS+LLLINRTASPLSWFMECKDRNNRSAI+LPYKFLPSMAAEN+RDAAEKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP
        PEFMLKRIAKWVPAGRTLFIASNER PGFFSPLS RYKLA+S NYS ILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTK+WQIP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP

Query:  VYTPDEERR
        VYT DEERR
Subjt:  VYTPDEERR

TrEMBL top hitse value%identityAlignment
A0A0A0L0X3 Uncharacterized protein1.2e-22596.58Show/hide
Query:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPKPRSPLIFFFVSL+AIAFLFLFSSLISTNGSSSFPSSNSIQKIFR KNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLSTGMKLGARAV HVE+VS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        RIELRDSSRYSNLLLINRTASPLSWFMECKDRNN SA+MLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP
        PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLA+SSNYSDILDPVV+NNYQLFMIERLIMAGAKT IRTFKEDDTDLSLTDDPKKNTK WQIP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP

Query:  VYTPDEERR
        VYT DEERR
Subjt:  VYTPDEERR

A0A1S3C1H9 O-fucosyltransferase family protein4.0e-22998.04Show/hide
Query:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPK RSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRR RHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLST MKLGARAVAHVEQVS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        RIELRDSS YSNLLLINRTASPLSWFMECKDRNNRSA+MLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP
        PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLA+SSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP

Query:  VYTPDEERR
        VYT DEERR
Subjt:  VYTPDEERR

A0A5D3DWC1 O-fucosyltransferase family protein4.0e-22998.04Show/hide
Query:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
        MAFPRTQKPKPK RSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRR RHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQ

Query:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS
        ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQVLST MKLGARAVAHVEQVS
Subjt:  ESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVS

Query:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
        RIELRDSS YSNLLLINRTASPLSWFMECKDRNNRSA+MLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Subjt:  RIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR

Query:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP
        PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLA+SSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP
Subjt:  PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIP

Query:  VYTPDEERR
        VYT DEERR
Subjt:  VYTPDEERR

A0A6J1DKB9 O-fucosyltransferase family protein4.8e-21490.49Show/hide
Query:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNG--SSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLG
        MAFPR QK KPKPRSPL FFFV+LAAIAFLFLFSSLISTNG  SS+F SSNSIQKIFRF N+ +K +RNRH FS NDKFLYWGNRIDCPGKHCESCEGLG
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNG--SSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLG

Query:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQ
        HQESSLRCALEEAMFL+R FVMPSRMCINPIHNKKG+LHQS NASSEESWEA SCAMDSLYD+DLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVE+
Subjt:  HQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQ

Query:  VSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
        VSR EL+D++RYSNLLLINRTASPLSWFMECKDRNNRSAI+LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Subjt:  VSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD

Query:  TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQ
        TRPEFMLKR+AKWV  GRTLFIASNER PGFFSPLSARYKLA+SSNYS ILDP+VKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTK+WQ
Subjt:  TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQ

Query:  IPVYTPDEER
         PVYT DEE+
Subjt:  IPVYTPDEER

A0A6J1HRU4 O-fucosyltransferase family protein1.1e-21591.91Show/hide
Query:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGH
        MA  +TQK KPKPRSP +FFFV+LA IAFLFLFSSLISTNG SSSFPSSNSI++IFRFKNL QKQRRNRH FS NDKFLYWGNRIDCPGKHCESCEGLGH
Subjt:  MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGH

Query:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQV
        QESSLRCALEEAMFLQR FVMPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLWYQV STGMKLG+R VAHV+QV
Subjt:  QESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQV

Query:  SRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
        SRIELRD SRYSNLLLINRTASPLSWFMECKDRNNRSAI+LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Subjt:  SRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT

Query:  RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQI
        RPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLA+SSNYS ILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQ 
Subjt:  RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQI

Query:  PVYTPDEE
        P+YT DEE
Subjt:  PVYTPDEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G04280.1 unknown protein1.2e-1824.44Show/hide
Query:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGAR
        + C+ + H   S  CAL EA +L RT VM   +C++ I+   G         +EE  +         +D + + +   V LD ++ W Q      K   R
Subjt:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGAR

Query:  AVAHVEQVSRIELRDSSRYSNLLLINRTAS--PLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLG-DYDAIHVRRGDKIKTRKDRFGVD
           H+ +  R+     +   + L++ +  S  P +++    + +  S +  P+  L    +  L +    I   L  DYDA+H+ RG+K +        +
Subjt:  AVAHVEQVSRIELRDSSRYSNLLLINRTAS--PLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLG-DYDAIHVRRGDKIKTRKDRFGVD

Query:  RSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILD----------------PVVKNNYQLFMIERLIMAGAKT
        + + P+L+ DT P  +L  +   V  GR L+IA+NE    FF+PL  +Y   F  +Y D+ D                PV  + Y    ++  +    K 
Subjt:  RSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILD----------------PVVKNNYQLFMIERLIMAGAKT

Query:  FIRTFKEDDTD
         I TF +   D
Subjt:  FIRTFKEDDTD

AT2G41150.1 unknown protein7.9e-8461Show/hide
Query:  RTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL
        +  K K  P S  +   + + A+AFL LF+S+IST G  + P   ++   F       +  RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQESSL
Subjt:  RTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL

Query:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRIEL
        RCALEEAMFL RTFVMPSRMCINPIHNKKG+L++S N + EESWE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ +LST MKL  R  AHV   +R EL
Subjt:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRIEL

Query:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGL
         DSS ++NLLLINRTASPL+WF+ECKDR NRS +MLPY FL +MAA  LRDAAEK+K L
Subjt:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGL

AT2G41150.2 unknown protein1.3e-15065.92Show/hide
Query:  RTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL
        +  K K  P S  +   + + A+AFL LF+S+IST G  + P   ++   F       +  RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQESSL
Subjt:  RTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL

Query:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRIEL
        RCALEEAMFL RTFVMPSRMCINPIHNKKG+L++S N + EESWE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ +LST MKL  R  AHV   +R EL
Subjt:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRIEL

Query:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFM
         DSS ++NLLLINRTASPL+WF+ECKDR NRS +MLPY FL +MAA  LRDAAEKIK  LGDYDAIHVRRGDK+KTRKDRF V+RS  PHLDRDTRPEF+
Subjt:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFM

Query:  LKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTP
        + RI K +P GRTLFI SNER P FFSPL+ RYK+A+SSN+S+ILDP+++NNYQLFM+ERLIM GAKTF +TF+E +TDL+LTDDPKKN K W+IPVYT 
Subjt:  LKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTP

Query:  DE
        DE
Subjt:  DE

AT3G56750.1 unknown protein1.4e-15266.09Show/hide
Query:  RTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL
        + Q+ KP   S  +  F  +   +FL LFSS+IST G    P   ++   F +    ++Q       S+++K+LYWGNRIDCPGK+CE+C GLGHQESSL
Subjt:  RTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSL

Query:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRIEL
        RCALEEAMFL RTFVMPS MCINPIHNKKG+L++S N ++EE W  +SCAMDSLYD+DLIS+ +PVILD+SK W+ VLST MKLG R +AHV  V+R  L
Subjt:  RCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRIEL

Query:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFM
        ++ S YSNLL+INRTASPL+WF+ECKDR+NRSA+MLPY FLP+MAA  LR+AAEKIK  LGDYDAIHVRRGDK+KTRKDRFGV+R   PHLDRDTRPEF+
Subjt:  RDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFM

Query:  LKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTP
        L+RI K +P GRTLFI SNER PGFFSPL+ RYKLA+SSN+S+ILDP+++NNYQLFM+ERL+M GAKT+ +TFKE +TDL+LTDDPKKN K W+IPVYT 
Subjt:  LKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTP

Query:  DEER
        DE R
Subjt:  DEER

AT4G12700.1 unknown protein3.0e-1924.76Show/hide
Query:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGAR
        + C+ + H   S  CAL EA +L RT VM   +C++ ++   G   +  +      +E        L +   + D V    D  K WY+    G+KL   
Subjt:  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGAR

Query:  AVAHVEQVSRIELRDSSRYSNLLLINR--TASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLG-DYDAIHVRRGDKIKTRKDRFGVD
            V  +  ++++D+      L++ +  T  P +++    +    S +  P+  L    ++ L +    I   L  DYDAIH+ RGDK +        +
Subjt:  AVAHVEQVSRIELRDSSRYSNLLLINR--TASPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLG-DYDAIHVRRGDKIKTRKDRFGVD

Query:  RSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILD----------------PVVKNNYQLFMIERLIMAGAKT
        + + P+L++DT P  +L  +   +  GR L+IA+NE    FF+PL  +YK  F   + D+ D                PV  + Y    ++  +    K 
Subjt:  RSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAFSSNYSDILD----------------PVVKNNYQLFMIERLIMAGAKT

Query:  FIRTFKEDDTD
         I TF +   D
Subjt:  FIRTFKEDDTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATTTCCCAGAACCCAGAAGCCAAAACCGAAACCCAGATCCCCACTCATCTTCTTCTTCGTCTCCCTTGCCGCCATTGCCTTTCTTTTCCTCTTTTCTTCACTGAT
TTCTACCAATGGCTCTTCTTCTTTTCCATCCTCGAACTCAATTCAGAAAATCTTCAGATTCAAAAATCTGACCCAGAAACAGAGACGTAATCGGCATTTTTTTAGTGTAA
ATGATAAGTTCTTGTACTGGGGCAACCGAATCGACTGCCCGGGGAAGCATTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAGGAA
GCCATGTTCCTTCAAAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACATAATAAGAAAGGCCTTCTTCATCAGTCCACCAATGCAAGCTCAGAGGAAAG
TTGGGAAGCAAACTCTTGTGCCATGGACTCTTTGTACGATATGGACCTTATATCTGACACTGTACCAGTGATTTTAGACAACTCAAAGTTATGGTATCAGGTACTGTCAA
CTGGTATGAAATTAGGCGCTAGAGCAGTTGCCCACGTAGAGCAAGTCAGTCGTATTGAACTGAGAGACAGCAGCCGCTACTCCAATCTTTTGCTAATAAATCGAACTGCC
AGCCCTCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCGTAGTGCCATAATGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGGGATGCAGC
TGAGAAGATTAAAGGGCTACTTGGTGATTATGATGCCATCCACGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGCGTTGATAGAAGCTTACATCCAC
ATCTCGACAGGGATACACGGCCGGAGTTTATGCTGAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACTCTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTC
TCACCTCTCTCTGCTCGGTACAAGTTGGCTTTTTCTTCGAACTATAGCGATATTCTGGATCCTGTGGTTAAGAACAACTACCAATTGTTCATGATCGAAAGGCTCATTAT
GGCAGGTGCCAAGACATTCATCAGAACGTTCAAAGAAGACGATACAGATCTAAGCCTCACCGACGACCCAAAGAAGAACACAAAAGTATGGCAAATACCTGTCTACACAC
CTGATGAAGAAAGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCATTTCCCAGAACCCAGAAGCCAAAACCGAAACCCAGATCCCCACTCATCTTCTTCTTCGTCTCCCTTGCCGCCATTGCCTTTCTTTTCCTCTTTTCTTCACTGAT
TTCTACCAATGGCTCTTCTTCTTTTCCATCCTCGAACTCAATTCAGAAAATCTTCAGATTCAAAAATCTGACCCAGAAACAGAGACGTAATCGGCATTTTTTTAGTGTAA
ATGATAAGTTCTTGTACTGGGGCAACCGAATCGACTGCCCGGGGAAGCATTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAGGAA
GCCATGTTCCTTCAAAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACATAATAAGAAAGGCCTTCTTCATCAGTCCACCAATGCAAGCTCAGAGGAAAG
TTGGGAAGCAAACTCTTGTGCCATGGACTCTTTGTACGATATGGACCTTATATCTGACACTGTACCAGTGATTTTAGACAACTCAAAGTTATGGTATCAGGTACTGTCAA
CTGGTATGAAATTAGGCGCTAGAGCAGTTGCCCACGTAGAGCAAGTCAGTCGTATTGAACTGAGAGACAGCAGCCGCTACTCCAATCTTTTGCTAATAAATCGAACTGCC
AGCCCTCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCGTAGTGCCATAATGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGGGATGCAGC
TGAGAAGATTAAAGGGCTACTTGGTGATTATGATGCCATCCACGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGCGTTGATAGAAGCTTACATCCAC
ATCTCGACAGGGATACACGGCCGGAGTTTATGCTGAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACTCTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTC
TCACCTCTCTCTGCTCGGTACAAGTTGGCTTTTTCTTCGAACTATAGCGATATTCTGGATCCTGTGGTTAAGAACAACTACCAATTGTTCATGATCGAAAGGCTCATTAT
GGCAGGTGCCAAGACATTCATCAGAACGTTCAAAGAAGACGATACAGATCTAAGCCTCACCGACGACCCAAAGAAGAACACAAAAGTATGGCAAATACCTGTCTACACAC
CTGATGAAGAAAGAAGGTGA
Protein sequenceShow/hide protein sequence
MAFPRTQKPKPKPRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEE
AMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRIELRDSSRYSNLLLINRTA
SPLSWFMECKDRNNRSAIMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFF
SPLSARYKLAFSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTPDEERR