; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0027898 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0027898
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationchr02:22651032..22654573
RNA-Seq ExpressionPI0027898
SyntenyPI0027898
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589613.1 Glycosyltransferase BC10, partial [Cucurbita argyrosperma subsp. sororia]9.6e-17480.15Show/hide
Query:  ISSSNP---KLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPS-PPPLSPPSPPSRVGLKDFLKPPPALHDMTEEEL
        +++SNP   KLQ   KS  S LLLFAAGLAAGFTLTLF FPFPF   SSPLSL F+FSQ QL SPS PPP  PP PPSRVGLK FL PPPALHDMTEEEL
Subjt:  ISSSNP---KLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPS-PPPLSPPSPPSRVGLKDFLKPPPALHDMTEEEL

Query:  LWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANAL
        LWRASL P RIPK P   KS+ KIAFLFLTKDGVSLAPLWE FFKG+  LYSIYVHRS S+N+TV S+SVFYGRSIPSKGVKWG PSMMEAERRLLANAL
Subjt:  LWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANAL

Query:  LDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYM
        LDFSNQRFILLSESCIPLFNFSTIY+YLM SK+TF+E+YDL GPVGRGRY P+MRP INLHQWRKGSQWF++DR +ASQVVSDQKYF VF++ CKPSCYM
Subjt:  LDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYM

Query:  DEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH
        DEHYLPT VGI FS TNSNRTLTWVDWS+GGAHPT+F R DV V LLQRLR GSHC YNG  TN+CHLFARKFM NSLNRLL+FAPKLM+FNH
Subjt:  DEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH

KAG7023304.1 hypothetical protein SDJN02_14329, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-17480.41Show/hide
Query:  ISSSNP---KLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPS-PPPLSPPSPPSRVGLKDFLKPPPALHDMTEEEL
        +++SNP   KLQ   KS  S LLLFAAGLAAGFTLTLF FPFPF   SSPLSL F+FSQ QL SPS PPP  PP PPSRVGLK FL PPPALHDMTEEEL
Subjt:  ISSSNP---KLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPS-PPPLSPPSPPSRVGLKDFLKPPPALHDMTEEEL

Query:  LWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANAL
        LWRASL P RIPK P   KS+ KIAFLFLTKDGVSLAPLWE FFKG+  LYSIYVHRS S+N+TV S+SVFYGRSIPSKGVKWG PSMMEAERRLLANAL
Subjt:  LWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANAL

Query:  LDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYM
        LDFSNQRFILLSESCIPLFNFSTIYNYLM SK+TF+E+YDL GPVGRGRY P+MRP INLHQWRKGSQWF++DR +ASQVVSDQKYF VF++ CKPSCYM
Subjt:  LDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYM

Query:  DEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH
        DEHYLPT VGI FS TNSNRTLTWVDWS+GGAHPT+F R DV V LLQRLR GSHC YNG  TN+CHLFARKFM NSLNRLL+FAPKLM+FNH
Subjt:  DEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH

XP_004137544.2 glycosyltransferase BC10 [Cucumis sativus]6.6e-19989.17Show/hide
Query:  MTISSSNPKLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPLS---PPSPPSRVGLKDFLKPPPALHDMTEEE
        MTISSSNPKL IH KSFFSP LLF+AGLAAGFTLTLFIFPFPFFQFSS LSL FTF+Q QL SPSP P S   PP PPSRVGLK+FL PPP LHDMTEEE
Subjt:  MTISSSNPKLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPLS---PPSPPSRVGLKDFLKPPPALHDMTEEE

Query:  LLWRASLAPHRIPKLPTTE--KSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSN--STVDSSSVFYGRSIPSKGVKWGEPSMMEAERRL
        LLWRASL P RIPKLP+TE   S RKIAFLFLTKDGVSLAPLWELFFKGY GLYSIYVHR+PSS+  STVDSSSVFYGRSIPSKGVKWGEPSMMEAERRL
Subjt:  LLWRASLAPHRIPKLPTTE--KSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSN--STVDSSSVFYGRSIPSKGVKWGEPSMMEAERRL

Query:  LANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCK
        LANALLDFSN+RFILLSESCIPLFNFST+YNYLMGSKSTFIEAYDL GPVGRGRY+PKMRPII LHQWRKGSQWFEMDRTIASQV+SDQKYF VFQKFCK
Subjt:  LANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCK

Query:  PSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFN
        PSCYMDEHYLPTFVGIRF KTNSNRTLTWVDWSRGGAHPTRF+RTDVT+ELL+RLRNG HCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLM FN
Subjt:  PSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFN

XP_022988481.1 uncharacterized protein LOC111485710 [Cucurbita maxima]9.6e-17479.95Show/hide
Query:  ISSSNP---KLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPS--PPPLSPPSPPSRVGLKDFLKPPPALHDMTEEE
        +++SNP   KLQ   KS  S LLLFAAGLAAGFTLTLF FPFPF   SSPLSL F+FSQ QL SPS  PPP SPP PPSRVGLK F  PPPALHDMTEEE
Subjt:  ISSSNP---KLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPS--PPPLSPPSPPSRVGLKDFLKPPPALHDMTEEE

Query:  LLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANA
        LLWRASL P RIPK P   KS+ KIAFLFLTKDGVSLAPLWE FFKG+  LYSIYVHRS S+N+TV S+SVFYGRSIPSKGVKWG PSMMEAERRLLANA
Subjt:  LLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANA

Query:  LLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCY
        LLDFSNQRFILLSESCIPLFNFSTIY+YLM SK+TF+E+YDL GPVGRGRY P+MRP INLHQWRKGSQWF++DR +ASQVVSDQKYF VF++ CKPSCY
Subjt:  LLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCY

Query:  MDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH
        MDEHYLPT VGI FS +NSNRTLTWVDWS+GGAHPT+F R DV V LLQRLR GSHC YNGV TN+CHLFARKFM NSLNRLL+FAPKLM+FNH
Subjt:  MDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH

XP_038895081.1 glycosyltransferase BC10-like [Benincasa hispida]1.5e-20391.26Show/hide
Query:  ISSSNPKLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPLSPPSPPSRVGLKDFLKPPPALHDMTEEELLWRA
        +++SNPKLQ+H +S  SPLL FAAGLAAGFTLTLFIFPFPFFQFSSPLSLRF FSQFQL SPSPP   P SPPSRVGLKDFLKPPPALHDMTEEELLWRA
Subjt:  ISSSNPKLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPLSPPSPPSRVGLKDFLKPPPALHDMTEEELLWRA

Query:  SLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFS
        SL P RIP  P+TEKSRRKIAFLFLTKDGV LAPLWELFFKG+ G YSIYVHRSPSSN+TVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFS
Subjt:  SLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFS

Query:  NQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYMDEHY
        NQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDL GPVGRGRY+PKMRP INLHQWRKGSQWFEMDRTIASQVVSD K+F VFQKFCKPSCYMDEHY
Subjt:  NQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYMDEHY

Query:  LPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH
        LPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTV LLQRLRNGSHCEYNGV TNLCHLFARKFM NSLNRLLMFAPKLMQFNH
Subjt:  LPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH

TrEMBL top hitse value%identityAlignment
A0A0A0LUY5 Uncharacterized protein3.2e-19989.17Show/hide
Query:  MTISSSNPKLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPLS---PPSPPSRVGLKDFLKPPPALHDMTEEE
        MTISSSNPKL IH KSFFSP LLF+AGLAAGFTLTLFIFPFPFFQFSS LSL FTF+Q QL SPSP P S   PP PPSRVGLK+FL PPP LHDMTEEE
Subjt:  MTISSSNPKLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPLS---PPSPPSRVGLKDFLKPPPALHDMTEEE

Query:  LLWRASLAPHRIPKLPTTE--KSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSN--STVDSSSVFYGRSIPSKGVKWGEPSMMEAERRL
        LLWRASL P RIPKLP+TE   S RKIAFLFLTKDGVSLAPLWELFFKGY GLYSIYVHR+PSS+  STVDSSSVFYGRSIPSKGVKWGEPSMMEAERRL
Subjt:  LLWRASLAPHRIPKLPTTE--KSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSN--STVDSSSVFYGRSIPSKGVKWGEPSMMEAERRL

Query:  LANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCK
        LANALLDFSN+RFILLSESCIPLFNFST+YNYLMGSKSTFIEAYDL GPVGRGRY+PKMRPII LHQWRKGSQWFEMDRTIASQV+SDQKYF VFQKFCK
Subjt:  LANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCK

Query:  PSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFN
        PSCYMDEHYLPTFVGIRF KTNSNRTLTWVDWSRGGAHPTRF+RTDVT+ELL+RLRNG HCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLM FN
Subjt:  PSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFN

A0A1S3BX15 uncharacterized protein LOC1034941402.3e-16596.31Show/hide
Query:  MTEEELLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERR
        MTEEELLWRASL P RIPKLP+TE SRRKIAFLFLTKDGVSLAPLWELFFKGY GLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERR
Subjt:  MTEEELLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERR

Query:  LLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFC
        LLANALLDFSNQRFILLSESCIPLFNFSTIYNYLM SKSTFIEAYDL GPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYF VFQKFC
Subjt:  LLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFC

Query:  KPSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFN
        KPSCYMDEHYLPTFVGIRFSK+NSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLR+GSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLM FN
Subjt:  KPSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFN

A0A6J1C2T7 uncharacterized protein LOC1110069261.2e-14871.69Show/hide
Query:  LQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPL-SPPSPPSRVGLKDFLKPPPALHDMTEEELLWRASLAPHR
        +   LKS FS LLLFA+GLA GF+L+L  F FP  Q S  L+          SS SPPP   PP PPS + L D   PPP +HDM++EELLWRASLAP  
Subjt:  LQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPL-SPPSPPSRVGLKDFLKPPPALHDMTEEELLWRASLAPHR

Query:  IPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFK-GYGGLYSIYVHRSPSSNSTVDS-SSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNQRF
        +P    +     KIAFLFLTK+GV+LAPLWELFFK  +  L+SIYVH +  SNST+ S SSVF+ R+IPSKGVKWGEPSMMEAERRLLANALLDFSNQRF
Subjt:  IPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFK-GYGGLYSIYVHRSPSSNSTVDS-SSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNQRF

Query:  ILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYMDEHYLPTF
        +LLSESCIPLFNFST+YNYLMGSK+TFIEAYDL GPVGRGRY P+MRP INLHQWRKGSQWF++DR +A++VVSD K+F VF KFC P CYMDEHYLPT 
Subjt:  ILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYMDEHYLPTF

Query:  VGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH
        VGI+FS TNSNRTLTWVDWSRGG HPTRFIR DV VELL+RLR GSHC+YNGV TN+CHLFARKFM NSLNRLLMFAPKLM+F+H
Subjt:  VGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH

A0A6J1JHA6 uncharacterized protein LOC1114857104.6e-17479.95Show/hide
Query:  ISSSNP---KLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPS--PPPLSPPSPPSRVGLKDFLKPPPALHDMTEEE
        +++SNP   KLQ   KS  S LLLFAAGLAAGFTLTLF FPFPF   SSPLSL F+FSQ QL SPS  PPP SPP PPSRVGLK F  PPPALHDMTEEE
Subjt:  ISSSNP---KLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPS--PPPLSPPSPPSRVGLKDFLKPPPALHDMTEEE

Query:  LLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANA
        LLWRASL P RIPK P   KS+ KIAFLFLTKDGVSLAPLWE FFKG+  LYSIYVHRS S+N+TV S+SVFYGRSIPSKGVKWG PSMMEAERRLLANA
Subjt:  LLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANA

Query:  LLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCY
        LLDFSNQRFILLSESCIPLFNFSTIY+YLM SK+TF+E+YDL GPVGRGRY P+MRP INLHQWRKGSQWF++DR +ASQVVSDQKYF VF++ CKPSCY
Subjt:  LLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCY

Query:  MDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH
        MDEHYLPT VGI FS +NSNRTLTWVDWS+GGAHPT+F R DV V LLQRLR GSHC YNGV TN+CHLFARKFM NSLNRLL+FAPKLM+FNH
Subjt:  MDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH

A0A6P4A4J5 uncharacterized protein LOC1074187962.0e-13763.29Show/hide
Query:  IHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPF-FQFSSPLSLRFTFSQFQLSSPSPPPLSP----PSPPS----------RVGLKDFLKPPP-ALHDMTE
        +HL+   S  L+F  GLA G TL+ ++  F F FQF +P      F    L+ P PPP SP    PS PS          R+GL ++LKPP   +HDM +
Subjt:  IHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPF-FQFSSPLSLRFTFSQFQLSSPSPPPLSP----PSPPS----------RVGLKDFLKPPP-ALHDMTE

Query:  EELLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLA
        EELLWRAS AP ++ K+P   K   KIAF+FLT+  V++APLWE+FFKG+ GLYSIYVH +PS N TV  SSVF+GR IPSK V+WG+P+MMEAERRLLA
Subjt:  EELLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLA

Query:  NALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPS
        NALLDFSNQRF+LLSESCIPLFNFSTIYNYL+ S  +F+EAYDL GPVGRGRY+P+MRP I L QWRKGSQWFEMDR +A +VV+D+KYF +F  +CKPS
Subjt:  NALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPS

Query:  CYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFN
        CY DEHYLPTFV ++F   NSNR+LTWVDWS+GGAHP RF+R DVT+E L+RLR+G+ CEYNG+ TN+CHLFARKF+ N+L+RLL FAPK+MQFN
Subjt:  CYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFN

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC102.7e-3834.68Show/hide
Query:  KIAFLFLTKDGVSLAPLWELFFKG-YGGLYSIYVHRSPSSNST--VDSSSVFYGRSI-PSKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSESCIPL
        ++AFLF+ ++ + L  +W+ FF+G   G +SI+VH  P    T     S  FY R +  S  V WGE SM+EAER LLA+AL D  N+RF+ +S+SC+PL
Subjt:  KIAFLFLTKDGVSLAPLWELFFKG-YGGLYSIYVHRSPSSNST--VDSSSVFYGRSI-PSKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSESCIPL

Query:  FNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCK----------------------P
        +NF+  Y+Y+M S ++F++++        GRY+P+M PII +  WRKGSQW  + R  A  VV D++    FQK C+                       
Subjt:  FNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCK----------------------P

Query:  SCYMDEHYLPTFVGIR-FSKTNSNRTLTWVDW--------SRGGAHPTRFIRTDVTVELLQRLRNGSH-----------CEYNGVKTNLCHLFARKF
        +C  DEHY+ T +      +  + R++T   W         R G HP  +  +D T  L++ +++  +           C  NG K   C LFARKF
Subjt:  SCYMDEHYLPTFVGIR-FSKTNSNRTLTWVDW--------SRGGAHPTRFIRTDVTVELLQRLRNGSH-----------CEYNGVKTNLCHLFARKF

Arabidopsis top hitse value%identityAlignment
AT1G10880.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.4e-9050.31Show/hide
Query:  TFSQFQLSSPSPPPLSPPSPPSRVGLKDFLKPPPALHDMTEEELLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVH
        T S   L+SPS   LSPP  PS         P  +  ++ +EEL+WRA++A    P+ P   ++  K+AF+FLT+  + L+PLWE+FFKG+ G YSIYVH
Subjt:  TFSQFQLSSPSPPPLSPPSPPSRVGLKDFLKPPPALHDMTEEELLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVH

Query:  RSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRP
         SP        SSVFY + IPSK V+WG+ SMM+AE+RL+++ALL+ SN RF+LLSE+CIPLFNF+TIY YL  S  +F+ ++D   P+GRGRY+PKM P
Subjt:  RSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRP

Query:  IINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHC
         ++L  WRKG+QWFE+ R +A+++VSD++Y+ VF+  C+P CY+DEHYLPT V     + NSNRT+TWVDWSRGG+HP RF+R D+ V  L R+R GS+C
Subjt:  IINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHC

Query:  EYNGVKTNLCHLFARKFMAN
         Y G    +  +  +K   N
Subjt:  EYNGVKTNLCHLFARKFMAN

AT1G51770.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.1e-9052.53Show/hide
Query:  VGLKDFLKPPPAL-HDMTEEELLWRASLAPHR----IPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGR
        V L  F++PP  + H M + ELLWRAS+ P R      ++P       K+AF+FL K  +  APLWE F KG+ GLYSIYVH  PS  S    SSVFY R
Subjt:  VGLKDFLKPPPAL-HDMTEEELLWRASLAPHR----IPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGR

Query:  SIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDR
         IPS+ V WGE SM EAERRLLANALLD SN+ F+LLSESCIPL  FS IY+Y+  S+ +F+ A D  GP GRGRY  +M P I L QWRKGSQWFE++R
Subjt:  SIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDR

Query:  TIASQVVSDQKYFHVFQKFCKPSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFM
         +A ++V D  Y+  F++FC+P CY+DEHY PT + ++     +NRTLTW DWSRGGAHP  F + DVT   L++L     C YN  ++ +C+LFARKF 
Subjt:  TIASQVVSDQKYFHVFQKFCKPSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFM

Query:  ANSLNRLLMFAPKLMQ
         ++L  LL  APK+++
Subjt:  ANSLNRLLMFAPKLMQ

AT1G68390.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein4.4e-10850.62Show/hide
Query:  TISSSNPKLQIHLKS----FFSPLLLFAAGLAAGFTLTLFIF-PFPFFQFSSPLSLR-----FTFSQFQLSSPSPPPLSPPSPPSRVGLKDFLKPPPAL-
        ++SSS+P L   L +     F  LL ++  L  G  + + +      F  +S LS++     F  S    S P PPP SPPS P + GLK F++PP  L 
Subjt:  TISSSNPKLQIHLKS----FFSPLLLFAAGLAAGFTLTLFIF-PFPFFQFSSPLSLR-----FTFSQFQLSSPSPPPLSPPSPPSRVGLKDFLKPPPAL-

Query:  HDMTEEELLWRASLAP----HRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSM
        HDM +EELLWRAS+AP    +  P+ P       K+AF+F+TK  + LA LWE FF+G+ GL++IYVH  PS N +    SVF GR IPSK V WG  +M
Subjt:  HDMTEEELLWRASLAP----HRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSM

Query:  MEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFH
        +EAE+RLLANALLD SN+RF+LLSESCIPLFNF+T+Y+YL+ S  T +E+YD LG VGRGRYSP M+P + L  WRKGSQW E+DR +A +++SD+ Y+ 
Subjt:  MEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFH

Query:  VFQKFCKPSCYMDEHYLPTFVGIRFS--KTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAP
        +F  +C   CY DEHY+PT + I+ S  + NSNRTLTWVDWS+GG HP RFIR +VT E ++ LR+G  C YNG +TN+C+LFARKF+  +L+RLL  + 
Subjt:  VFQKFCKPSCYMDEHYLPTFVGIRFS--KTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAP

Query:  KLMQF
         ++ F
Subjt:  KLMQF

AT3G21310.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.7e-9152.23Show/hide
Query:  RVGLKDFLKPP-PALHDMTEEELLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIP
        R+ L+  +KPP    H M + ELLWRAS+ P RI   P   K   K+AF+FLTK  +  APLWE FFKG+ G YSIYVH  P+  S   SSSVFY R IP
Subjt:  RVGLKDFLKPP-PALHDMTEEELLWRASLAPHRIPKLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIP

Query:  SKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIA
        S+ V WGE SM +AERRLLANALLD SN+ F+LLSE+CIPL  F+ +Y Y+  S+ +F+ + D  GP GRGRYS  M P ++L++WRKGSQWFE++R +A
Subjt:  SKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIA

Query:  SQVVSDQKYFHVFQKFCKPSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANS
          +V D  Y++ F++FC+P CY+DEHY PT + I +    +NRTLTW DWSRGGAHP  F + D+T + +++L  G  C YN   + +C+LFARKF  ++
Subjt:  SQVVSDQKYFHVFQKFCKPSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANS

Query:  LNRLLMFAPKLMQF
        L  LL  APK++ F
Subjt:  LNRLLMFAPKLMQF

AT5G11730.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein3.7e-9147.88Show/hide
Query:  HLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPLSPPSPPSRVGLKDFLKPPPAL-HDMTEEELLWRASLAPHRIPK
        H K F S LLL   GL   F++T+FI      +++   S+  T +       S  P     P S   L  +++PP  L H+M++EELLWRAS  P R  +
Subjt:  HLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPLSPPSPPSRVGLKDFLKPPPAL-HDMTEEELLWRASLAPHRIPK

Query:  LPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSE
         P   K   K+AF+FLTK  + LA LWE F KG+ GLYS+Y+H  PS  +   +SSVF+ R IPS+  +WG  SM +AE+RLLANALLD SN+ F+L+SE
Subjt:  LPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSE

Query:  SCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYMDEHYLPTFVGIRF
        SCIPL+NF+TIY+YL  SK +F+ A+D  GP GRGRY+  M P + L +WRKGSQWFE++R +A+ +V D  Y+  F++FC+P+CY+DEHY PT + I  
Subjt:  SCIPLFNFSTIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYMDEHYLPTFVGIRF

Query:  SKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQF
            +NR+LTWVDWSRGG HP  F R+D+T     ++ +G +C YNG  T++C+LFARKF  ++L  LL  APK++ F
Subjt:  SKTNSNRTLTWVDWSRGGAHPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGATCTCTTCCTCCAATCCCAAACTCCAAATCCATCTCAAATCCTTCTTCTCTCCCCTCCTCCTCTTCGCCGCCGGCCTCGCTGCCGGCTTCACTCTTACTCTCTT
CATCTTCCCTTTCCCTTTCTTCCAATTTTCCTCCCCTCTCTCCCTCCGTTTCACCTTCAGCCAATTCCAACTCTCTTCTCCCTCCCCTCCCCCGTTGTCTCCACCATCGC
CGCCATCACGGGTTGGCTTGAAGGATTTTTTAAAACCTCCTCCGGCACTCCACGACATGACGGAGGAAGAGCTTTTATGGAGGGCGTCACTTGCTCCGCACCGGATTCCG
AAATTACCGACAACGGAGAAGTCACGGCGGAAGATCGCGTTCTTGTTTCTGACGAAAGACGGAGTCAGTTTGGCTCCGCTTTGGGAACTGTTCTTTAAAGGGTACGGTGG
ACTTTACTCCATTTACGTTCATCGGAGTCCTTCCTCCAATTCCACCGTCGATTCCTCCTCTGTTTTCTACGGCCGCTCAATCCCCAGTAAGGGAGTGAAGTGGGGAGAAC
CAAGCATGATGGAGGCAGAGAGGCGGCTACTAGCAAATGCATTATTAGATTTCTCCAACCAAAGATTCATCCTTCTCTCTGAATCATGCATCCCACTATTCAATTTCTCC
ACCATTTACAATTACTTAATGGGCTCAAAATCCACATTCATCGAGGCCTATGACCTTCTAGGTCCAGTGGGCCGAGGCCGCTACAGCCCAAAAATGCGGCCCATAATCAA
TCTTCATCAATGGCGCAAGGGTTCCCAATGGTTCGAAATGGACCGAACCATCGCCTCCCAAGTCGTCTCCGACCAAAAATATTTCCATGTTTTCCAAAAATTCTGCAAAC
CCTCTTGTTACATGGACGAGCACTACCTTCCCACCTTTGTCGGAATCCGATTCTCTAAGACCAACTCCAACCGGACCTTGACTTGGGTTGACTGGTCCAGAGGCGGTGCT
CACCCGACCCGGTTCATCAGAACCGATGTCACCGTTGAATTGCTTCAGAGGCTGAGGAATGGTAGCCACTGCGAGTATAATGGAGTTAAGACTAATCTCTGTCATTTGTT
TGCTAGGAAATTCATGGCTAATTCTTTGAATAGATTACTTATGTTTGCTCCTAAGCTCATGCAGTTTAATCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGACGATCTCTTCCTCCAATCCCAAACTCCAAATCCATCTCAAATCCTTCTTCTCTCCCCTCCTCCTCTTCGCCGCCGGCCTCGCTGCCGGCTTCACTCTTACTCTCTT
CATCTTCCCTTTCCCTTTCTTCCAATTTTCCTCCCCTCTCTCCCTCCGTTTCACCTTCAGCCAATTCCAACTCTCTTCTCCCTCCCCTCCCCCGTTGTCTCCACCATCGC
CGCCATCACGGGTTGGCTTGAAGGATTTTTTAAAACCTCCTCCGGCACTCCACGACATGACGGAGGAAGAGCTTTTATGGAGGGCGTCACTTGCTCCGCACCGGATTCCG
AAATTACCGACAACGGAGAAGTCACGGCGGAAGATCGCGTTCTTGTTTCTGACGAAAGACGGAGTCAGTTTGGCTCCGCTTTGGGAACTGTTCTTTAAAGGGTACGGTGG
ACTTTACTCCATTTACGTTCATCGGAGTCCTTCCTCCAATTCCACCGTCGATTCCTCCTCTGTTTTCTACGGCCGCTCAATCCCCAGTAAGGGAGTGAAGTGGGGAGAAC
CAAGCATGATGGAGGCAGAGAGGCGGCTACTAGCAAATGCATTATTAGATTTCTCCAACCAAAGATTCATCCTTCTCTCTGAATCATGCATCCCACTATTCAATTTCTCC
ACCATTTACAATTACTTAATGGGCTCAAAATCCACATTCATCGAGGCCTATGACCTTCTAGGTCCAGTGGGCCGAGGCCGCTACAGCCCAAAAATGCGGCCCATAATCAA
TCTTCATCAATGGCGCAAGGGTTCCCAATGGTTCGAAATGGACCGAACCATCGCCTCCCAAGTCGTCTCCGACCAAAAATATTTCCATGTTTTCCAAAAATTCTGCAAAC
CCTCTTGTTACATGGACGAGCACTACCTTCCCACCTTTGTCGGAATCCGATTCTCTAAGACCAACTCCAACCGGACCTTGACTTGGGTTGACTGGTCCAGAGGCGGTGCT
CACCCGACCCGGTTCATCAGAACCGATGTCACCGTTGAATTGCTTCAGAGGCTGAGGAATGGTAGCCACTGCGAGTATAATGGAGTTAAGACTAATCTCTGTCATTTGTT
TGCTAGGAAATTCATGGCTAATTCTTTGAATAGATTACTTATGTTTGCTCCTAAGCTCATGCAGTTTAATCATTAG
Protein sequenceShow/hide protein sequence
MTISSSNPKLQIHLKSFFSPLLLFAAGLAAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLSSPSPPPLSPPSPPSRVGLKDFLKPPPALHDMTEEELLWRASLAPHRIP
KLPTTEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVDSSSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFILLSESCIPLFNFS
TIYNYLMGSKSTFIEAYDLLGPVGRGRYSPKMRPIINLHQWRKGSQWFEMDRTIASQVVSDQKYFHVFQKFCKPSCYMDEHYLPTFVGIRFSKTNSNRTLTWVDWSRGGA
HPTRFIRTDVTVELLQRLRNGSHCEYNGVKTNLCHLFARKFMANSLNRLLMFAPKLMQFNH