; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G15810 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G15810
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationClcChr06:26671566..26676959
RNA-Seq ExpressionClc06G15810
SyntenyClc06G15810
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589613.1 Glycosyltransferase BC10, partial [Cucurbita argyrosperma subsp. sororia]4.5e-18082.19Show/hide
Query:  MTISNP---KLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPS-PPPSSPPPPPSRVGLKEFLKPPPALHDMTEEEL
        MT SNP   KLQ  FKSL+S LLLFAAGL AGFTLTLF FPFPF   SSPLSL F+FSQ QLP PS PPP  PPPPPSRVGLK+FL PPPALHDMTEEEL
Subjt:  MTISNP---KLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPS-PPPSSPPPPPSRVGLKEFLKPPPALHDMTEEEL

Query:  LWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANAL
        LWRASLVPRRIP  P + KS+ KIAFLFLTKDGVSLAPLWE FFKG+  LYSIYVHRS S+N+TVAS SVFYGRSIPSKGVKWG PSMMEAERRLLANAL
Subjt:  LWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANAL

Query:  LDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYM
        LDFSN RFILLSESCIPLFNFSTIY+YLM SK+TF+E+YDLPGPVGRGRY P+MRPTINLHQWRKGSQWF++DR +ASQVVSDQK+FPVF++ CKPSCYM
Subjt:  LDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYM

Query:  DEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH
        DEHYLPT VGI FS  NSNRTLTWVDWS+GGAHPT+F R DV VGLLQRLR GSHC YNG  TN+CHLFARKFMPNSLNRLL+FAPKLM+FNH
Subjt:  DEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH

KAG7023304.1 hypothetical protein SDJN02_14329, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-18082.44Show/hide
Query:  MTISNP---KLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPS-PPPSSPPPPPSRVGLKEFLKPPPALHDMTEEEL
        MT SNP   KLQ  FKSL+S LLLFAAGL AGFTLTLF FPFPF   SSPLSL F+FSQ QLP PS PPP  PPPPPSRVGLK+FL PPPALHDMTEEEL
Subjt:  MTISNP---KLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPS-PPPSSPPPPPSRVGLKEFLKPPPALHDMTEEEL

Query:  LWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANAL
        LWRASLVPRRIP  P + KS+ KIAFLFLTKDGVSLAPLWE FFKG+  LYSIYVHRS S+N+TVAS SVFYGRSIPSKGVKWG PSMMEAERRLLANAL
Subjt:  LWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANAL

Query:  LDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYM
        LDFSN RFILLSESCIPLFNFSTIYNYLM SK+TF+E+YDLPGPVGRGRY P+MRPTINLHQWRKGSQWF++DR +ASQVVSDQK+FPVF++ CKPSCYM
Subjt:  LDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYM

Query:  DEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH
        DEHYLPT VGI FS  NSNRTLTWVDWS+GGAHPT+F R DV VGLLQRLR GSHC YNG  TN+CHLFARKFMPNSLNRLL+FAPKLM+FNH
Subjt:  DEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH

XP_004137544.2 glycosyltransferase BC10 [Cucumis sativus]9.8e-19181.11Show/hide
Query:  ISIVHSSSSSSSSSSLKILLI------------QFNSMTI--SNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPP
        +S + SSSS S  +    LLI             F+SMTI  SNPKL +HFKS  SP LLF+AGL AGFTLTLFIFPFPFFQFSS LSL FTF+Q QL P
Subjt:  ISIVHSSSSSSSSSSLKILLI------------QFNSMTI--SNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPP

Query:  ---PSPPPSSPPPPPSRVGLKEFLKPPPALHDMTEEELLWRASLVPRRIPNFPTME--KSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPS
           PS PP  PPPPPSRVGLKEFL PPP LHDMTEEELLWRASLVPRRIP  P+ E   S RKIAFLFLTKDGVSLAPLWELFFKGY GLYSIYVHR+PS
Subjt:  ---PSPPPSSPPPPPSRVGLKEFLKPPPALHDMTEEELLWRASLVPRRIPNFPTME--KSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPS

Query:  SN--STVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTI
        S+  STV S SVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSN RFILLSESCIPLFNFST+YNYLMGSKSTFIEAYDLPGPVGRGRY+PKMRP I
Subjt:  SN--STVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTI

Query:  NLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDY
         LHQWRKGSQWFEMDRTIASQV+SDQK+F VFQKFCKPSCYMDEHYLPTFVGI+F K NSNRTLTWVDWSRGGAHPTRF+RTDVT+ LL+RLRNG HC+Y
Subjt:  NLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDY

Query:  NGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFN
        NGVKTNLCHLFARKFM NSLNRLLMFAPKLM FN
Subjt:  NGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFN

XP_022988481.1 uncharacterized protein LOC111485710 [Cucurbita maxima]1.2e-18082.23Show/hide
Query:  MTISNP---KLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPS--PPPSSPPPPPSRVGLKEFLKPPPALHDMTEEE
        MT SNP   KLQ  FKSL+S LLLFAAGL AGFTLTLF FPFPF   SSPLSL F+FSQ QLP PS  PPP SPPPPPSRVGLK+F  PPPALHDMTEEE
Subjt:  MTISNP---KLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPS--PPPSSPPPPPSRVGLKEFLKPPPALHDMTEEE

Query:  LLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANA
        LLWRASLVPRRIP  P + KS+ KIAFLFLTKDGVSLAPLWE FFKG+  LYSIYVHRS S+N+TVAS SVFYGRSIPSKGVKWG PSMMEAERRLLANA
Subjt:  LLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANA

Query:  LLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCY
        LLDFSN RFILLSESCIPLFNFSTIY+YLM SK+TF+E+YDLPGPVGRGRY P+MRPTINLHQWRKGSQWF++DR +ASQVVSDQK+FPVF++ CKPSCY
Subjt:  LLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCY

Query:  MDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH
        MDEHYLPT VGI FS +NSNRTLTWVDWS+GGAHPT+F R DV VGLLQRLR GSHC YNGV TN+CHLFARKFMPNSLNRLL+FAPKLM+FNH
Subjt:  MDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH

XP_038895081.1 glycosyltransferase BC10-like [Benincasa hispida]1.6e-20993.32Show/hide
Query:  MTISNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPSPPPSSPPPPPSRVGLKEFLKPPPALHDMTEEELLWRA
        MT SNPKLQMHF+SLLSPLL FAAGL AGFTLTLFIFPFPFFQFSSPLSLRF FSQFQLP PSPPPSS   PPSRVGLK+FLKPPPALHDMTEEELLWRA
Subjt:  MTISNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPSPPPSSPPPPPSRVGLKEFLKPPPALHDMTEEELLWRA

Query:  SLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFS
        SLVPRRIPNFP+ EKSRRKIAFLFLTKDGV LAPLWELFFKG+ G YSIYVHRSPSSN+TV S SVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFS
Subjt:  SLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFS

Query:  NHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYMDEHY
        N RFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRY+PKMRPTINLHQWRKGSQWFEMDRTIASQVVSD KFFPVFQKFCKPSCYMDEHY
Subjt:  NHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYMDEHY

Query:  LPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH
        LPTFVGI+FSK NSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHC+YNGV TNLCHLFARKFMPNSLNRLLMFAPKLMQFNH
Subjt:  LPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH

TrEMBL top hitse value%identityAlignment
A0A0A0LUY5 Uncharacterized protein1.4e-19085.35Show/hide
Query:  SMTISNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPP---PSPPPSSPPPPPSRVGLKEFLKPPPALHDMTEEEL
        +++ SNPKL +HFKS  SP LLF+AGL AGFTLTLFIFPFPFFQFSS LSL FTF+Q QL P   PS PP  PPPPPSRVGLKEFL PPP LHDMTEEEL
Subjt:  SMTISNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPP---PSPPPSSPPPPPSRVGLKEFLKPPPALHDMTEEEL

Query:  LWRASLVPRRIPNFPTME--KSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSN--STVASFSVFYGRSIPSKGVKWGEPSMMEAERRLL
        LWRASLVPRRIP  P+ E   S RKIAFLFLTKDGVSLAPLWELFFKGY GLYSIYVHR+PSS+  STV S SVFYGRSIPSKGVKWGEPSMMEAERRLL
Subjt:  LWRASLVPRRIPNFPTME--KSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSN--STVASFSVFYGRSIPSKGVKWGEPSMMEAERRLL

Query:  ANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKP
        ANALLDFSN RFILLSESCIPLFNFST+YNYLMGSKSTFIEAYDLPGPVGRGRY+PKMRP I LHQWRKGSQWFEMDRTIASQV+SDQK+F VFQKFCKP
Subjt:  ANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKP

Query:  SCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFN
        SCYMDEHYLPTFVGI+F K NSNRTLTWVDWSRGGAHPTRF+RTDVT+ LL+RLRNG HC+YNGVKTNLCHLFARKFM NSLNRLLMFAPKLM FN
Subjt:  SCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFN

A0A1S3BX15 uncharacterized protein LOC1034941404.6e-16293.62Show/hide
Query:  MTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERR
        MTEEELLWRASLVPRRIP  P+ E SRRKIAFLFLTKDGVSLAPLWELFFKGY GLYSIYVHRSPSSNSTV S SVFYGRSIPSKGVKWGEPSMMEAERR
Subjt:  MTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERR

Query:  LLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFC
        LLANALLDFSN RFILLSESCIPLFNFSTIYNYLM SKSTFIEAYDLPGPVGRGRYSPKMRP INLHQWRKGSQWFEMDRTIASQVVSDQK+FPVFQKFC
Subjt:  LLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFC

Query:  KPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFN
        KPSCYMDEHYLPTFVGI+FSK+NSNRTLTWVDWSRGGAHPTRFIRTDVTV LLQRLR+GSHC+YNGVKTNLCHLFARKFM NSLNRLLMFAPKLM FN
Subjt:  KPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFN

A0A6J1C2T7 uncharacterized protein LOC1110069261.8e-15373.45Show/hide
Query:  KLQ-MHF--KSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPSPPPSSPPPPPSRVGLKEFLKPPPALHDMTEEELLWRASLV
        KLQ MHF  KSL S LLLFA+GL  GF+L+L  F FP  Q S  L+   ++S    PPP PP   PPPPPS + L +   PPP +HDM++EELLWRASL 
Subjt:  KLQ-MHF--KSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPSPPPSSPPPPPSRVGLKEFLKPPPALHDMTEEELLWRASLV

Query:  PRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFK-GYGGLYSIYVHRSPSSNSTVAS-FSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSN
        PR +PNF        KIAFLFLTK+GV+LAPLWELFFK  +  L+SIYVH +  SNST+ S  SVF+ R+IPSKGVKWGEPSMMEAERRLLANALLDFSN
Subjt:  PRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFK-GYGGLYSIYVHRSPSSNSTVAS-FSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSN

Query:  HRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYMDEHYL
         RF+LLSESCIPLFNFST+YNYLMGSK+TFIEAYDLPGPVGRGRY P+MRPTINLHQWRKGSQWF++DR +A++VVSD KFFPVF KFC P CYMDEHYL
Subjt:  HRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYMDEHYL

Query:  PTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH
        PT VGIKFS  NSNRTLTWVDWSRGG HPTRFIR DV V LL+RLR GSHCDYNGV TN+CHLFARKFMPNSLNRLLMFAPKLM+F+H
Subjt:  PTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH

A0A6J1JHA6 uncharacterized protein LOC1114857105.8e-18182.23Show/hide
Query:  MTISNP---KLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPS--PPPSSPPPPPSRVGLKEFLKPPPALHDMTEEE
        MT SNP   KLQ  FKSL+S LLLFAAGL AGFTLTLF FPFPF   SSPLSL F+FSQ QLP PS  PPP SPPPPPSRVGLK+F  PPPALHDMTEEE
Subjt:  MTISNP---KLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPS--PPPSSPPPPPSRVGLKEFLKPPPALHDMTEEE

Query:  LLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANA
        LLWRASLVPRRIP  P + KS+ KIAFLFLTKDGVSLAPLWE FFKG+  LYSIYVHRS S+N+TVAS SVFYGRSIPSKGVKWG PSMMEAERRLLANA
Subjt:  LLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANA

Query:  LLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCY
        LLDFSN RFILLSESCIPLFNFSTIY+YLM SK+TF+E+YDLPGPVGRGRY P+MRPTINLHQWRKGSQWF++DR +ASQVVSDQK+FPVF++ CKPSCY
Subjt:  LLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCY

Query:  MDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH
        MDEHYLPT VGI FS +NSNRTLTWVDWS+GGAHPT+F R DV VGLLQRLR GSHC YNGV TN+CHLFARKFMPNSLNRLL+FAPKLM+FNH
Subjt:  MDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQFNH

A0A6P4A4J5 uncharacterized protein LOC1074187965.7e-13661.69Show/hide
Query:  TISNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPF-FQFSSPLSLRFTFSQFQLPPPSPPP----SSPPPP---------PSRVGLKEFLKPPP-
        T+++P   +H + L+S  L+F  GL  G TL+ ++  F F FQF +P       S+   PPP PPP    SSP  P         P R+GL E+LKPP  
Subjt:  TISNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPF-FQFSSPLSLRFTFSQFQLPPPSPPP----SSPPPP---------PSRVGLKEFLKPPP-

Query:  ALHDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMME
         +HDM +EELLWRAS  P ++   P   K   KIAF+FLT+  V++APLWE+FFKG+ GLYSIYVH +PS N TV   SVF+GR IPSK V+WG+P+MME
Subjt:  ALHDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMME

Query:  AERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVF
        AERRLLANALLDFSN RF+LLSESCIPLFNFSTIYNYL+ S  +F+EAYDL GPVGRGRY+P+MRP I L QWRKGSQWFEMDR +A +VV+D+K+FP+F
Subjt:  AERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVF

Query:  QKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQ
          +CKPSCY DEHYLPTFV +KF   NSNR+LTWVDWS+GGAHP RF+R DVT+  L+RLR+G+ C+YNG+ TN+CHLFARKF+PN+L+RLL FAPK+MQ
Subjt:  QKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQ

Query:  FN
        FN
Subjt:  FN

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC101.1e-3834.68Show/hide
Query:  KIAFLFLTKDGVSLAPLWELFFKG-YGGLYSIYVHRSPSSNSTVAS--FSVFYGRSI-PSKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPL
        ++AFLF+ ++ + L  +W+ FF+G   G +SI+VH  P    T A+     FY R +  S  V WGE SM+EAER LLA+AL D  N RF+ +S+SC+PL
Subjt:  KIAFLFLTKDGVSLAPLWELFFKG-YGGLYSIYVHRSPSSNSTVAS--FSVFYGRSI-PSKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPL

Query:  FNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCK----------------------P
        +NF+  Y+Y+M S ++F++++        GRY+P+M P I +  WRKGSQW  + R  A  VV D++  P FQK C+                       
Subjt:  FNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCK----------------------P

Query:  SCYMDEHYLPTFVGIK-FSKNNSNRTLTWVDW--------SRGGAHPTRFIRTDVTVGLLQRLRNGSH-----------CDYNGVKTNLCHLFARKF
        +C  DEHY+ T +      +  + R++T   W         R G HP  +  +D T  L++ +++  +           C  NG K   C LFARKF
Subjt:  SCYMDEHYLPTFVGIK-FSKNNSNRTLTWVDW--------SRGGAHPTRFIRTDVTVGLLQRLRNGSH-----------CDYNGVKTNLCHLFARKF

Arabidopsis top hitse value%identityAlignment
AT1G10280.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.6e-9050.32Show/hide
Query:  FLKPPPALHDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWG
        F++P    H MT++EL WRAS+VP +   +P       K+AF+FLT+  + + PLWE FFKG     S+YVH  P  +  V+  S FY R IPS+ V+WG
Subjt:  FLKPPPALHDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWG

Query:  EPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQ
         P + +AE+RLLANALLDFSN RF+LLSESC+P++NFST+Y YL+ S  +F+++YD P   GRGRYS KM P I LH WRKGSQWFE++R IA  ++SD 
Subjt:  EPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQ

Query:  KFFPVFQKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLR-NGSHCDYNGVKTNLCHLFARKFMPNSLNRLLM
        K++ +F++FC+P+CY DEHY+PTF+ +     N+NR++TWVDWS GG HP  +   ++T G LQ +R N + C YN   T+LC LFARKF P++L  L+ 
Subjt:  KFFPVFQKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLR-NGSHCDYNGVKTNLCHLFARKFMPNSLNRLLM

Query:  FAPKLMQF
         +  ++ F
Subjt:  FAPKLMQF

AT1G51770.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.7e-9253.53Show/hide
Query:  VGLKEFLKPPPAL-HDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPS
        V L  F++PP  + H M + ELLWRAS+ P+R   +P   +   K+AF+FL K  +  APLWE F KG+ GLYSIYVH  PS  S  +  SVFY R IPS
Subjt:  VGLKEFLKPPPAL-HDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPS

Query:  KGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIAS
        + V WGE SM EAERRLLANALLD SN  F+LLSESCIPL  FS IY+Y+  S+ +F+ A D  GP GRGRY  +M P I L QWRKGSQWFE++R +A 
Subjt:  KGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIAS

Query:  QVVSDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSL
        ++V D  ++P F++FC+P CY+DEHY PT + +K     +NRTLTW DWSRGGAHP  F + DVT   L++L     C YN  ++ +C+LFARKF P++L
Subjt:  QVVSDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSL

Query:  NRLLMFAPKLMQ
          LL  APK+++
Subjt:  NRLLMFAPKLMQ

AT1G68390.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.5e-10951.2Show/hide
Query:  HSSSSSSSSSSL--KILLIQFNSMTISNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLR-----FTFSQFQLPPPSPPPSSPPPP
        +SSS SSSS SL  K+L  Q++           HF +LLS  L+   G+  G  L   +     F  +S LS++     F  S     PP PPP SPP  
Subjt:  HSSSSSSSSSSL--KILLIQFNSMTISNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLR-----FTFSQFQLPPPSPPPSSPPPP

Query:  PSRVGLKEFLKPPPAL-HDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRS
        P + GLK F++PP  L HDM +EELLWRAS+ P +I N+P       K+AF+F+TK  + LA LWE FF+G+ GL++IYVH  PS N +    SVF GR 
Subjt:  PSRVGLKEFLKPPPAL-HDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRS

Query:  IPSKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRT
        IPSK V WG  +M+EAE+RLLANALLD SN RF+LLSESCIPLFNF+T+Y+YL+ S  T +E+YD  G VGRGRYSP M+P + L  WRKGSQW E+DR 
Subjt:  IPSKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRT

Query:  IASQVVSDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKFS--KNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKF
        +A +++SD+ ++P+F  +C   CY DEHY+PT + IK S  + NSNRTLTWVDWS+GG HP RFIR +VT   ++ LR+G  C YNG +TN+C+LFARKF
Subjt:  IASQVVSDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKFS--KNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKF

Query:  MPNSLNRLLMFAPKLMQF
        +P +L+RLL  +  ++ F
Subjt:  MPNSLNRLLMFAPKLMQF

AT3G21310.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein6.5e-9251.91Show/hide
Query:  RVGLKEFLKPP-PALHDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIP
        R+ L+  +KPP    H M + ELLWRAS+ P RI ++P   K   K+AF+FLTK  +  APLWE FFKG+ G YSIYVH  P+  S   S SVFY R IP
Subjt:  RVGLKEFLKPP-PALHDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIP

Query:  SKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIA
        S+ V WGE SM +AERRLLANALLD SN  F+LLSE+CIPL  F+ +Y Y+  S+ +F+ + D  GP GRGRYS  M P ++L++WRKGSQWFE++R +A
Subjt:  SKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIA

Query:  SQVVSDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNS
          +V D  ++  F++FC+P CY+DEHY PT + I +    +NRTLTW DWSRGGAHP  F + D+T   +++L  G  C YN   + +C+LFARKF P++
Subjt:  SQVVSDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNS

Query:  LNRLLMFAPKLMQF
        L  LL  APK++ F
Subjt:  LNRLLMFAPKLMQF

AT5G11730.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.3e-9547.88Show/hide
Query:  HFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPSPPPSSPPPPPSRVGLKEFLKPPPAL-HDMTEEELLWRASLVPRRIPN
        H+K   S LLL   GL   F++T+FI      +++   S+  T +   +P     P+S         L ++++PP  L H+M++EELLWRAS  PRR   
Subjt:  HFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQFQLPPPSPPPSSPPPPPSRVGLKEFLKPPPAL-HDMTEEELLWRASLVPRRIPN

Query:  FPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSE
        +P   K   K+AF+FLTK  + LA LWE F KG+ GLYS+Y+H  PS  +   + SVF+ R IPS+  +WG  SM +AE+RLLANALLD SN  F+L+SE
Subjt:  FPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVFYGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSE

Query:  SCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKF
        SCIPL+NF+TIY+YL  SK +F+ A+D PGP GRGRY+  M P + L +WRKGSQWFE++R +A+ +V D  ++P F++FC+P+CY+DEHY PT + I+ 
Subjt:  SCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVVSDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKF

Query:  SKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQF
            +NR+LTWVDWSRGG HP  F R+D+T     ++ +G +C YNG  T++C+LFARKF P++L  LL  APK++ F
Subjt:  SKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAAGAAGGTATCCGTGTTGTGTTGTGCTCTGTCTCTCACCTTATGAATTTCTGCAACAATTGTGTGTTCATAAACCCATAATTTCAATCGTTCATTCTTCTTCTTC
TTCTTCTTCTTCTTCTTCTCTCAAAATTCTCTTAATTCAGTTCAATTCAATGACGATCTCCAATCCCAAACTCCAAATGCATTTCAAATCCCTCCTCTCTCCCCTTCTCC
TCTTCGCCGCCGGCCTCACCGCCGGCTTCACTCTCACTCTCTTCATCTTCCCTTTCCCTTTCTTCCAATTTTCCTCTCCTCTCTCCCTCCGTTTCACCTTCAGCCAATTC
CAACTCCCCCCTCCCTCTCCTCCGCCGTCGTCTCCGCCGCCGCCGCCGTCTCGGGTCGGATTGAAGGAGTTTCTCAAGCCTCCTCCGGCACTCCACGACATGACGGAAGA
GGAGCTTCTGTGGAGGGCCTCCCTGGTTCCTCGCCGGATTCCCAACTTTCCGACGATGGAGAAGTCACGGCGGAAGATCGCGTTCTTGTTTCTGACAAAAGACGGAGTCA
GTTTGGCCCCGCTTTGGGAATTGTTCTTTAAAGGCTACGGTGGGCTTTACTCCATTTACGTCCACCGCAGTCCTTCCTCCAACTCCACCGTCGCTTCCTTCTCTGTTTTC
TATGGCCGCTCAATCCCCAGTAAGGGAGTGAAATGGGGAGAACCAAGCATGATGGAGGCAGAGCGGCGGCTACTAGCAAACGCATTACTAGATTTCTCCAACCACAGATT
CATCCTTCTCTCCGAATCTTGCATCCCACTCTTCAATTTCTCCACCATTTACAATTACTTAATGGGCTCCAAATCCACTTTCATTGAGGCCTACGACCTTCCAGGCCCAG
TGGGCCGCGGCCGTTACAGCCCAAAAATGCGCCCCACAATCAACCTCCATCAATGGCGCAAGGGCTCCCAATGGTTCGAAATGGACCGAACCATCGCCTCCCAAGTCGTC
TCCGACCAGAAATTCTTCCCCGTCTTCCAGAAATTCTGTAAGCCCTCTTGTTACATGGACGAGCACTACCTCCCCACCTTCGTCGGAATCAAATTCTCCAAGAACAACTC
CAACCGGACCCTCACTTGGGTCGACTGGTCCAGAGGCGGCGCGCACCCCACCCGGTTCATCAGAACCGATGTCACCGTTGGATTGCTTCAGAGGCTCAGGAATGGAAGCC
ATTGCGACTACAATGGAGTTAAGACTAATCTCTGTCATTTGTTTGCTAGGAAGTTCATGCCTAATTCTTTGAATAGACTCCTTATGTTTGCTCCTAAGCTCATGCAGTTC
AATCATTAA
mRNA sequenceShow/hide mRNA sequence
AAGGGAATGTTAAGAAGGTATCCGTGTTGTGTTGTGCTCTGTCTCTCACCTTATGAATTTCTGCAACAATTGTGTGTTCATAAACCCATAATTTCAATCGTTCATTCTTC
TTCTTCTTCTTCTTCTTCTTCTTCTCTCAAAATTCTCTTAATTCAGTTCAATTCAATGACGATCTCCAATCCCAAACTCCAAATGCATTTCAAATCCCTCCTCTCTCCCC
TTCTCCTCTTCGCCGCCGGCCTCACCGCCGGCTTCACTCTCACTCTCTTCATCTTCCCTTTCCCTTTCTTCCAATTTTCCTCTCCTCTCTCCCTCCGTTTCACCTTCAGC
CAATTCCAACTCCCCCCTCCCTCTCCTCCGCCGTCGTCTCCGCCGCCGCCGCCGTCTCGGGTCGGATTGAAGGAGTTTCTCAAGCCTCCTCCGGCACTCCACGACATGAC
GGAAGAGGAGCTTCTGTGGAGGGCCTCCCTGGTTCCTCGCCGGATTCCCAACTTTCCGACGATGGAGAAGTCACGGCGGAAGATCGCGTTCTTGTTTCTGACAAAAGACG
GAGTCAGTTTGGCCCCGCTTTGGGAATTGTTCTTTAAAGGCTACGGTGGGCTTTACTCCATTTACGTCCACCGCAGTCCTTCCTCCAACTCCACCGTCGCTTCCTTCTCT
GTTTTCTATGGCCGCTCAATCCCCAGTAAGGGAGTGAAATGGGGAGAACCAAGCATGATGGAGGCAGAGCGGCGGCTACTAGCAAACGCATTACTAGATTTCTCCAACCA
CAGATTCATCCTTCTCTCCGAATCTTGCATCCCACTCTTCAATTTCTCCACCATTTACAATTACTTAATGGGCTCCAAATCCACTTTCATTGAGGCCTACGACCTTCCAG
GCCCAGTGGGCCGCGGCCGTTACAGCCCAAAAATGCGCCCCACAATCAACCTCCATCAATGGCGCAAGGGCTCCCAATGGTTCGAAATGGACCGAACCATCGCCTCCCAA
GTCGTCTCCGACCAGAAATTCTTCCCCGTCTTCCAGAAATTCTGTAAGCCCTCTTGTTACATGGACGAGCACTACCTCCCCACCTTCGTCGGAATCAAATTCTCCAAGAA
CAACTCCAACCGGACCCTCACTTGGGTCGACTGGTCCAGAGGCGGCGCGCACCCCACCCGGTTCATCAGAACCGATGTCACCGTTGGATTGCTTCAGAGGCTCAGGAATG
GAAGCCATTGCGACTACAATGGAGTTAAGACTAATCTCTGTCATTTGTTTGCTAGGAAGTTCATGCCTAATTCTTTGAATAGACTCCTTATGTTTGCTCCTAAGCTCATG
CAGTTCAATCATTAAATGTGCATAAACTCATTCCTTTTGAATTTTGAGCCTATATGATTCATTTATATGTGATTTCATAAGATGGGGTCAAGTAAAAATCTTCCCTGCCC
TTTATGTTCCTTTTCATATAGTGTAGTACTGTAAATGTAAACAGCATAAAGAATGAAGAATCAATTCAATAAACACATCAAAGAAATATATGGTGGCCCATCAGTGTAAT
AGATTGATGAGGAAGATGAAGATGGAAATGGAATAATTAGAAAGATAGTTGACAAATGGTAATGGTGGAGGGAAGAAGAGAGGGCTTTAAGAGCTGATGGTTTCATTAGT
TGCTGTTGAGGTGTTGCTGCTAACCCAAGATACAGAAACGGAACTTGGAGAAGATCCATAGCCATTCTTTAGCAACACAGCATCGTCTGAAATGTCGGCCTCTTGATCTG
TTGTCTTCTCGGCGTTCTGTTTTTGTATGATGTTCAGAGTGTCGAGAACTTCCATCATTGTGGGTCTCTCGTCTTTCATGCTTTGCAGGCATCGAAACGCCAACTCGGCT
ACGCTGGTTATCATGTCTTGGATTTTGTAGTCTGTTTTGAAACCGAGCGATGGGTCGACCAGTTCGTGCAATGTGTGGCTTTGGATCTTGTTGATGGCCATGTTGAATAG
GTTGATCTCTTGTCTATGCCTTGTGATGTCAACAGCAGGCATTGAGGATATGAGTTCAACAAGTACCACACCGAAGCTAAAGACATCGCTTTTGTCAGAGAGTTGGTAGC
ACTGATGGTACTCTGGATCGACATAGCCTGGAGTGCCTTGTGGAGCGGTTGAAACGTGTGTGACATCGAGTGGGAAGAGTCGGGACAATCCGAAATCCGCTACTTTAACG
CAGTAGTTGTTGTCGAGGAGAATGTTGTTGGTTTTGACATCACGGTGGATGATTTCAGAAGCATGGAGATAAACCAAAGCGCTTGCAGTCTCTATTGCAATCTTCATTCT
TGTACACCATGGAAGCTTGCCAGACTTTGCTAGTTTTCCGTGGAGGTGATCAGCCACCGTGCCATTTGGAACGTATTCATACACAAGCAGAAGCTCATGGCTACTGCGTG
AGGTGCAACCATAAAGCGAAACAAGGTTGCGGTGGCGCAAACGAGCAAGGATCTCCACTTCATTCATGAACTGCTCAACTCTCTTAAAATTGCTTTCATACAACCGTTTG
ACAGCAACAGCACGCCCGTCTTTGAGTAAGCCTAAACAGATATATGAACAAAAAGATCAATCAACACAACCACCAACTAATAATCAAGCATGAATAATGATTCAAACTGG
ATTTAGATAGCTGCTCTTACCATAATACACTGTGCCAAAACCTCCATCTCCAAGCTCTTTGTTGGAGTCAAAATGGTTTGTGGCTTCTTCAAGTTCTTTATAGGAGAAGA
GGTGAACGCCCAAGTAGGTACCTCCATTCTCAACTTCTTCCACAGAAGACGGGTTCGGAGGATTCGGTGGATTTGATGGAATGCTTCGTTGCACATACGGCAACGCGTGA
GACCTTCTACGACGAAGCTGGCGCCATCGATAGCAGATGCCAAGAACAAGAAAGGTTAAAAGCAAGGCTCCAAAACCAGTGCACAACCCTGCATGTGAATCAAAGCAACG
GAGTTTTTGATCAATTGTCAGGAAGGCAAACATGTGAACTGAGGGGGAAAAACAGAGTAGGTTCATGTGTTACCAATGATGATCTTTCTTCGGGTCGTGCTTTTTTCTGC
AACATTAAACACAACAATGTCAGCCTAAACATAAAAAAA
Protein sequenceShow/hide protein sequence
MLRRYPCCVVLCLSPYEFLQQLCVHKPIISIVHSSSSSSSSSSLKILLIQFNSMTISNPKLQMHFKSLLSPLLLFAAGLTAGFTLTLFIFPFPFFQFSSPLSLRFTFSQF
QLPPPSPPPSSPPPPPSRVGLKEFLKPPPALHDMTEEELLWRASLVPRRIPNFPTMEKSRRKIAFLFLTKDGVSLAPLWELFFKGYGGLYSIYVHRSPSSNSTVASFSVF
YGRSIPSKGVKWGEPSMMEAERRLLANALLDFSNHRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPKMRPTINLHQWRKGSQWFEMDRTIASQVV
SDQKFFPVFQKFCKPSCYMDEHYLPTFVGIKFSKNNSNRTLTWVDWSRGGAHPTRFIRTDVTVGLLQRLRNGSHCDYNGVKTNLCHLFARKFMPNSLNRLLMFAPKLMQF
NH