; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004599 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004599
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein, putative
Genome locationchr6:5351737..5352927
RNA-Seq ExpressionLag0004599
SyntenyLag0004599
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011649832.1 glycosyltransferase BC10 [Cucumis sativus]8.2e-16578.5Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSP-TTINS-DYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFL
        M  PNP SLI ALLLCL LA+ FT+N+P TTINS DYPFIFPF  SLY  N HR+IT      P+PP P PPEDD LLFPLAA VN TPSPT KLAF+FL
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSP-TTINS-DYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFL

Query:  TNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT
        TNSPLPFAPLWELFF+NIPPDLFNIYIHADPTR YD PFSGVFA+RVIPSKPTQR SPSL+AAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY+T
Subjt:  TNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT

Query:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA
        LIRSK+SFIEVLK+E+GAYDRWAARGPDVMLPVVK AD RIGSQFWVL RRHA IVVRD+ VWSKFDLPCVR     CYPEENYFPTLLSM DRRGLV A
Subjt:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA

Query:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGG--MKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD
        TLTHV+WNGS DGHP TYVASDVGPDLIR  R ARPRYGDGG  MK+ I  R G  GR  S  +Y     R+HPFLFARKFSA +L  LMNI+SD IFKD
Subjt:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGG--MKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD

XP_022132078.1 uncharacterized protein LOC111005037 [Momordica charantia]1.3e-16576.88Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTINSDYPFIFPFHYSLYIP--NTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFL
        MLSPNPLSLICALLLC PLAI+FTL+      SDYP +FPF  SLY P  NTHR+IT+FPL       PPPP+DD  LFPLAARVN TPSPT KLAFMFL
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTINSDYPFIFPFHYSLYIP--NTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFL

Query:  TNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT
        T SPLPFAPLWELFF+N+PP+ FNIYIHADPTR+Y+ PFSGVFAHR+IPSKP+ R+SP+LAAAARRLLAHALLHDSANSMFALLSPSCIPLHSF+FTY T
Subjt:  TNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT

Query:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA
        LIRS +SFIEVLKNEIG YDRWAARGP+VMLPVVK AD RIGSQFW+LTR+HA +VV D RVWSKFDLPCVR     CYPEENYFPTLLSM D  GLVTA
Subjt:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA

Query:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD
        TLTHVDW G  DGHP TY ASDVGPDLIR  RIARPRYGDGGM+I        AG RNSSS+   KS   HPFLFARKFSAD+LQPLMNIS+D+IFKD
Subjt:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD

XP_022951447.1 uncharacterized protein LOC111454260 [Cucurbita moschata]2.3e-17578.47Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTI----NSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFM
        M + +PLSLICALLLCLPLA++FT+NSPT I    NSD+PFIFP   SLY+P THR+ITLF +PS  PPP PPPE+D LLFPLA+RV+PTPSPTRKLAFM
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTI----NSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFM

Query:  FLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY
        FLTNSPLPFAPLWELFFKNIPPDL+N+YIHADPTREYD PFSGVF+HRVIPSKPTQR++PSL AAARRLLAHALLHDS+NSMFALLSPSCIPLHSFNFTY
Subjt:  FLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY

Query:  RTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLV
        +TLI SK+SFIEVLKNEIGAYDRWAARGPD MLPVVK  D+RIGSQFWVLTRRHA  VVRD++VWSKFDLPCVR     CYPEENYFPTLLSM D RGL+
Subjt:  RTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLV

Query:  TATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGD----GGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDV
         ATLTHVDWNGS DGHP TY  SDV P+LIR+ R++R RYGD    GG+++ IR R   +GRR+SSSS A K  RRH FLFARKFSAD LQPLMNISSDV
Subjt:  TATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGD----GGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDV

Query:  IFKD
        IFKD
Subjt:  IFKD

XP_023002372.1 uncharacterized protein LOC111496232 [Cucurbita maxima]2.0e-17477.61Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTI----NSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFM
        M + +PLSLICALLLCLPLA++FT+NSPT I    NSD+PFIFP   SLY+P THR+ITLF +PSP PPPPPPPE+D LLFPLAARV+P PSPTRKLAFM
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTI----NSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFM

Query:  FLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY
        FLTNSPLPFAPLWELFFKNIPPDL+N+YIH DPTREYD PFSGVF+HRVIPSKPTQR++ SL AAARRLLAHALLHDS+NSMFALLSPSCIPLHSFNFTY
Subjt:  FLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY

Query:  RTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLV
        +TLIRSK+SFIEVLKNEIGAYDRWAARGPD MLPVVK  D+RIGSQFW LTRRHA  VV+D++VW+KFDLPCVR     CYPEENYFPTLLSM DR+GL+
Subjt:  RTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLV

Query:  TATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDG--GMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIF
         ATLTHVDWNG  DGHP TY  +DV P+LIR+ R+AR RYGDG  G+++ I  +   +GRR+SSSS A K  RRH FLFARKFSAD LQPLMNISSDVIF
Subjt:  TATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDG--GMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIF

Query:  KD
        KD
Subjt:  KD

XP_023537126.1 uncharacterized protein LOC111798298 [Cucurbita pepo subsp. pepo]3.4e-17979.6Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTI----NSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPP--PPPEDDHLLFPLAARVNPTPSPTRKLA
        M + +PLSLICALLLCLPLA++FT+NSPT I    NSD+PFIFP   SLY+P THR+ITLF +PSP PPPP  PPPE+D LLFPLA+RV+PTPSPTRKLA
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTI----NSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPP--PPPEDDHLLFPLAARVNPTPSPTRKLA

Query:  FMFLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNF
        FMFLTNSPLPFAPLWELFFKNIPPDL+N+YIHADPTREYD PFSGVF+HRVIPSKPTQR++PSL AAARRLLAHALLHDS+NSMFALLSPSCIPLHSFNF
Subjt:  FMFLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNF

Query:  TYRTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRG
        TY+TLI SK+SFIEVLKNEIGAYDRWAARGPD MLPVVK  D+RIGSQFWVLTRRHA  VVRD++VWSKFDLPCVR     CYPEENYFPTLLSM D RG
Subjt:  TYRTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRG

Query:  LVTATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIF
        L+ ATLTHVDWNGS DGHP TY  SDV P+LIR+ R++R RYGDGG+++ IR RN  +GRR+SSSS A K  RRH FLFARKFSAD LQPLMNISSDVIF
Subjt:  LVTATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIF

Query:  KD
        KD
Subjt:  KD

TrEMBL top hitse value%identityAlignment
A0A0A0LQ94 Uncharacterized protein4.0e-16578.5Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSP-TTINS-DYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFL
        M  PNP SLI ALLLCL LA+ FT+N+P TTINS DYPFIFPF  SLY  N HR+IT      P+PP P PPEDD LLFPLAA VN TPSPT KLAF+FL
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSP-TTINS-DYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFL

Query:  TNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT
        TNSPLPFAPLWELFF+NIPPDLFNIYIHADPTR YD PFSGVFA+RVIPSKPTQR SPSL+AAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY+T
Subjt:  TNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT

Query:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA
        LIRSK+SFIEVLK+E+GAYDRWAARGPDVMLPVVK AD RIGSQFWVL RRHA IVVRD+ VWSKFDLPCVR     CYPEENYFPTLLSM DRRGLV A
Subjt:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA

Query:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGG--MKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD
        TLTHV+WNGS DGHP TYVASDVGPDLIR  R ARPRYGDGG  MK+ I  R G  GR  S  +Y     R+HPFLFARKFSA +L  LMNI+SD IFKD
Subjt:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGG--MKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD

A0A5A7VHH3 Putative Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.0e-16477.75Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSP-TTINS-DYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFL
        ML PNPLSLI ALLLCL LAI FT ++P TT+NS DYPFIFPF  SLY  N HR+ITL      +PP PPPPEDD LLFPLAA VN TPSPT KLAF+FL
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSP-TTINS-DYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFL

Query:  TNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT
        TNSPLPFAPLWELFFKNIPPDLFN+YIHADPTR YD PFSGVFA+RVIPSKPTQR+SPSL+ AARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY+T
Subjt:  TNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT

Query:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA
        LI SK+SFIEVLK+E GAYDRWAARGPDVMLP+VK AD RIGSQFWVL RRHA IVV+D+ VWSKFDLPCV  R+  CYPEENYFPTLLSM DRRGLV A
Subjt:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA

Query:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGG--MKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD
        TLTHV+WNGS DGHP TYVASDVGPDLIR  R ARPRYGDGG  MK+ IR R G  GR  S   Y      +HPFLFARKFSAD+L  LMNI++D I KD
Subjt:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGG--MKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD

A0A6J1BV94 uncharacterized protein LOC1110050376.2e-16676.88Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTINSDYPFIFPFHYSLYIP--NTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFL
        MLSPNPLSLICALLLC PLAI+FTL+      SDYP +FPF  SLY P  NTHR+IT+FPL       PPPP+DD  LFPLAARVN TPSPT KLAFMFL
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTINSDYPFIFPFHYSLYIP--NTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFL

Query:  TNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT
        T SPLPFAPLWELFF+N+PP+ FNIYIHADPTR+Y+ PFSGVFAHR+IPSKP+ R+SP+LAAAARRLLAHALLHDSANSMFALLSPSCIPLHSF+FTY T
Subjt:  TNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT

Query:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA
        LIRS +SFIEVLKNEIG YDRWAARGP+VMLPVVK AD RIGSQFW+LTR+HA +VV D RVWSKFDLPCVR     CYPEENYFPTLLSM D  GLVTA
Subjt:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA

Query:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD
        TLTHVDW G  DGHP TY ASDVGPDLIR  RIARPRYGDGGM+I        AG RNSSS+   KS   HPFLFARKFSAD+LQPLMNIS+D+IFKD
Subjt:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD

A0A6J1GHM6 uncharacterized protein LOC1114542601.1e-17578.47Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTI----NSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFM
        M + +PLSLICALLLCLPLA++FT+NSPT I    NSD+PFIFP   SLY+P THR+ITLF +PS  PPP PPPE+D LLFPLA+RV+PTPSPTRKLAFM
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTI----NSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFM

Query:  FLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY
        FLTNSPLPFAPLWELFFKNIPPDL+N+YIHADPTREYD PFSGVF+HRVIPSKPTQR++PSL AAARRLLAHALLHDS+NSMFALLSPSCIPLHSFNFTY
Subjt:  FLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY

Query:  RTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLV
        +TLI SK+SFIEVLKNEIGAYDRWAARGPD MLPVVK  D+RIGSQFWVLTRRHA  VVRD++VWSKFDLPCVR     CYPEENYFPTLLSM D RGL+
Subjt:  RTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLV

Query:  TATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGD----GGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDV
         ATLTHVDWNGS DGHP TY  SDV P+LIR+ R++R RYGD    GG+++ IR R   +GRR+SSSS A K  RRH FLFARKFSAD LQPLMNISSDV
Subjt:  TATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGD----GGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDV

Query:  IFKD
        IFKD
Subjt:  IFKD

A0A6J1KJC2 uncharacterized protein LOC1114962329.5e-17577.61Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTI----NSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFM
        M + +PLSLICALLLCLPLA++FT+NSPT I    NSD+PFIFP   SLY+P THR+ITLF +PSP PPPPPPPE+D LLFPLAARV+P PSPTRKLAFM
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSPTTI----NSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFM

Query:  FLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY
        FLTNSPLPFAPLWELFFKNIPPDL+N+YIH DPTREYD PFSGVF+HRVIPSKPTQR++ SL AAARRLLAHALLHDS+NSMFALLSPSCIPLHSFNFTY
Subjt:  FLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY

Query:  RTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLV
        +TLIRSK+SFIEVLKNEIGAYDRWAARGPD MLPVVK  D+RIGSQFW LTRRHA  VV+D++VW+KFDLPCVR     CYPEENYFPTLLSM DR+GL+
Subjt:  RTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLV

Query:  TATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDG--GMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIF
         ATLTHVDWNG  DGHP TY  +DV P+LIR+ R+AR RYGDG  G+++ I  +   +GRR+SSSS A K  RRH FLFARKFSAD LQPLMNISSDVIF
Subjt:  TATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDG--GMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIF

Query:  KD
        KD
Subjt:  KD

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC102.9e-2730.36Show/hide
Query:  PTPSPTRKLAFMFLTNSPLPFAPLWELFFKNIPPDLFNIYIHADP----TREYDTPFSGVFAHRVIPSKPTQRY-SPSLAAAARRLLAHALLHDSANSMF
        P P    +LAF+F+  + LP   +W+ FF+      F+I++H+ P    TR   T  SG F +R + +     +   S+  A R LLAHA L D  N  F
Subjt:  PTPSPTRKLAFMFLTNSPLPFAPLWELFFKNIPPDLFNIYIHADP----TREYDTPFSGVFAHRVIPSKPTQRY-SPSLAAAARRLLAHALLHDSANSMF

Query:  ALLSPSCIPLHSFNFTYRTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRT--------
          +S SC+PL++FN+TY  ++ S  SF++         D  A R    M P++   + R GSQ+ VLTR+HA +VV D  V  +F   C R         
Subjt:  ALLSPSCIPLHSFNFTYRTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRT--------

Query:  ----------RRHICYPEENYFPTLLSMRD-RRGLVTATLTHVDWNGSTD-------GHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAG
                  + H C P+E+Y  TLL+       L   ++TH  W+ S+         HP TY  SD  P L+++ +     Y +   +      NG   
Subjt:  ----------RRHICYPEENYFPTLLSMRD-RRGLVTATLTHVDWNGSTD-------GHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAG

Query:  RRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNIS
                         FLFARKF+  A   L+++S
Subjt:  RRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNIS

Arabidopsis top hitse value%identityAlignment
AT3G52060.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein9.7e-7145.74Show/hide
Query:  VNPTPSPTRKLAFMFLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPF-SGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFAL
        ++ +P+P  K+AF+FLTNS L F PLWE FF+    DL+N+YIHADPT        S     + IP++ T R SP+L +A RRLLA+A+L D  N  FAL
Subjt:  VNPTPSPTRKLAFMFLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPF-SGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFAL

Query:  LSPSCIPLHSFNFTYRTLI--RSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPE
        +S  CIPLHSF++ +  L     ++SFIE+L +E     R+ ARG D MLP +++ D R+GSQF+VL +RHAL+V+++R++W KF LPC+      CYPE
Subjt:  LSPSCIPLHSFNFTYRTLI--RSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPE

Query:  ENYFPTLLSMRDRRGLVTATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSA
        E+YFPTLLS+ D +G    TLT V+W GS  GHPHTY AS++ P LI + R                       R NSS  Y           FARKF+ 
Subjt:  ENYFPTLLSMRDRRGLVTATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSA

Query:  DALQPLMNISSDVIFKD
        ++LQPLM I+  VIF+D
Subjt:  DALQPLMNISSDVIFKD

AT3G52060.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein9.7e-7145.74Show/hide
Query:  VNPTPSPTRKLAFMFLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPF-SGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFAL
        ++ +P+P  K+AF+FLTNS L F PLWE FF+    DL+N+YIHADPT        S     + IP++ T R SP+L +A RRLLA+A+L D  N  FAL
Subjt:  VNPTPSPTRKLAFMFLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPF-SGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFAL

Query:  LSPSCIPLHSFNFTYRTLI--RSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPE
        +S  CIPLHSF++ +  L     ++SFIE+L +E     R+ ARG D MLP +++ D R+GSQF+VL +RHAL+V+++R++W KF LPC+      CYPE
Subjt:  LSPSCIPLHSFNFTYRTLI--RSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPE

Query:  ENYFPTLLSMRDRRGLVTATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSA
        E+YFPTLLS+ D +G    TLT V+W GS  GHPHTY AS++ P LI + R                       R NSS  Y           FARKF+ 
Subjt:  ENYFPTLLSMRDRRGLVTATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSA

Query:  DALQPLMNISSDVIFKD
        ++LQPLM I+  VIF+D
Subjt:  DALQPLMNISSDVIFKD

AT4G32290.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.6e-10853.25Show/hide
Query:  MLSPNPLSLICALLLCLPLAIIFTLNSPTT--INSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPT--PSPTRKLAFM
        M+SP    L+CAL LCLP+A+IFT+    T  I+ ++ F     +SLY  N     +    P+     P P EDD LL  L++RVNP   P  TRK+AFM
Subjt:  MLSPNPLSLICALLLCLPLAIIFTLNSPTT--INSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPT--PSPTRKLAFM

Query:  FLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY
        +LT SPLPFAPLWE+FF  I  +L+N+Y+HADPTREYD PFSGVF +RVI SKP+ R++P+L AAARRLLAHALL D  N MFA++SPSC+P+ SF+FTY
Subjt:  FLTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTY

Query:  RTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLV
        +TL+ S++SFIE+LK+E   +DRW A G   MLP VK  + RIGSQFWVL RRHA +V RDRR+W KF+  CV  R   CYPEE+YFPTLL+MRD RG V
Subjt:  RTLIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLV

Query:  TATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD
         ATLTHVDW  +  GHP  Y   +V P+L+   R  RPRYG+ G+                + S   K  R  PFLFARKFS  AL+PL+ ++  V+F D
Subjt:  TATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD

AT5G22070.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.7e-6539.19Show/hide
Query:  NPLSLICALLLCLPLAIIFTLNSPTTINSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAA-------RVNPTPSPTRKLAFMF
        N   L  +LLLCLP    F            P +FP                   P  +  P     DD  LF  AA         +  P+P  K+AF+F
Subjt:  NPLSLICALLLCLPLAIIFTLNSPTTINSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAA-------RVNPTPSPTRKLAFMF

Query:  LTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSG-VFAHRVIP-SKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFT
        LTNS L FAP+W+ FF      L+N+Y+HADP      P +G VF +  I  +K T R SP+L +A RRLLA A L D AN+ FA+LS  CIPLHSFN+ 
Subjt:  LTNSPLPFAPLWELFFKNIPPDLFNIYIHADPTREYDTPFSG-VFAHRVIP-SKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFT

Query:  YRTLIRSK--------------------RSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHI
        Y +L  S                     RSF+E++ +E   + R+ ARG   M+P V F   R+GSQF+V+TRRHAL+ ++DR +W KF LPC R+    
Subjt:  YRTLIRSK--------------------RSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHI

Query:  CYPEENYFPTLLSMRDRRGLVTATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFAR
        CYPEE+YFPTLL+M+D  G    TLT V+W G+  GHP+TY   +V P+LI+  R                       R N SSSY           FAR
Subjt:  CYPEENYFPTLLSMRDRRGLVTATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFAR

Query:  KFSADALQPLMNISSDVIFKD
        KF+ D L+PL+ I+  VIF+D
Subjt:  KFSADALQPLMNISSDVIFKD

AT5G25330.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein3.5e-9750.25Show/hide
Query:  LSLICALLLCLPLAIIFTLNSP---TTINSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSP--TRKLAFMFLTNS
        L+L   LL+C+PL +I T+ SP    T+    P +      L I N +  +T     SP      P + D LL   A++ NP PSP   +KLAFMFLT +
Subjt:  LSLICALLLCLPLAIIFTLNSP---TTINSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSP--TRKLAFMFLTNS

Query:  PLPFAPLWELFFKNIP--PDLFNIYIHADPTREYDTPFSGVFAHRVIP-SKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT
         LP APLWELFF        L+N+Y+H DPT+++     G F +R+IP SKP  R++P+L +AARRLLAHALL D +N MF LLSPSCIPLHSFNFTY+T
Subjt:  PLPFAPLWELFFKNIP--PDLFNIYIHADPTREYDTPFSGVFAHRVIP-SKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRT

Query:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA
        L+ S +SFIE+LK+E G Y+RWAARGP  M P V   + RIGSQFW LTR HAL+VV D  +WSKF+  CV  R  ICYPEE+YFPTLL+MRD +G V+A
Subjt:  LIRSKRSFIEVLKNEIGAYDRWAARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTA

Query:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD
        T+THVDW+ +  GHP TY   +V  +LI+  R ARPRYGDG                           R+ PFLFARKFS   +  LMNI+  VIF D
Subjt:  TLTHVDWNGSTDGHPHTYVASDVGPDLIRTFRIARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTCCCCTAACCCACTTTCTCTTATCTGTGCTCTGCTTCTCTGCTTGCCTTTAGCCATTATCTTTACCCTCAACAGCCCCACCACCATTAATTCCGATTACCCTTT
CATTTTCCCCTTCCATTACTCTCTCTACATCCCCAACACCCACCGCAGAATTACCCTCTTCCCACTCCCTTCTCCGACGCCCCCGCCGCCGCCTCCGCCCGAGGATGACC
ACTTGCTTTTCCCTCTTGCCGCCCGTGTCAACCCGACCCCATCGCCAACTCGCAAATTGGCCTTCATGTTTCTCACCAACTCCCCTCTCCCTTTCGCTCCTCTTTGGGAA
CTGTTCTTCAAAAACATCCCGCCGGATCTTTTCAACATCTACATCCATGCCGACCCCACCCGGGAATACGACACGCCTTTCTCCGGCGTCTTCGCCCACCGGGTCATCCC
TTCCAAACCCACTCAGAGATATTCCCCTTCCCTCGCCGCCGCCGCCCGCCGCCTTCTCGCTCACGCGCTGCTGCATGATTCTGCTAATTCCATGTTTGCCCTTCTCTCTC
CCTCTTGCATCCCTCTCCATTCCTTCAATTTCACTTACAGAACGCTGATCCGATCCAAGAGGAGCTTCATCGAGGTTCTGAAAAATGAGATCGGCGCCTACGACAGGTGG
GCGGCGCGTGGACCCGACGTGATGCTTCCGGTGGTTAAATTCGCGGACGTTCGGATAGGGTCGCAGTTTTGGGTGCTGACGCGCCGGCACGCGCTGATTGTGGTGAGAGA
TAGAAGGGTTTGGTCAAAGTTTGACTTGCCTTGCGTGCGGACTCGGAGGCACATATGTTATCCTGAGGAGAATTATTTCCCCACCTTACTCAGCATGCGCGACCGCCGAG
GGCTTGTTACAGCTACACTTACACACGTGGACTGGAATGGGAGCACAGATGGCCACCCTCACACCTACGTGGCATCTGACGTGGGCCCCGATCTCATTCGCACTTTTCGG
ATCGCCCGGCCCAGATACGGCGACGGCGGAATGAAAATAAGTATTAGAAAGAGAAATGGAAACGCCGGCCGCCGGAATTCGTCGTCGTCGTACGCCGTTAAATCTATCCG
TCGGCACCCGTTCTTGTTTGCGAGGAAATTCTCCGCCGATGCACTCCAGCCGTTGATGAACATATCCAGTGACGTCATCTTTAAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGTCCCCTAACCCACTTTCTCTTATCTGTGCTCTGCTTCTCTGCTTGCCTTTAGCCATTATCTTTACCCTCAACAGCCCCACCACCATTAATTCCGATTACCCTTT
CATTTTCCCCTTCCATTACTCTCTCTACATCCCCAACACCCACCGCAGAATTACCCTCTTCCCACTCCCTTCTCCGACGCCCCCGCCGCCGCCTCCGCCCGAGGATGACC
ACTTGCTTTTCCCTCTTGCCGCCCGTGTCAACCCGACCCCATCGCCAACTCGCAAATTGGCCTTCATGTTTCTCACCAACTCCCCTCTCCCTTTCGCTCCTCTTTGGGAA
CTGTTCTTCAAAAACATCCCGCCGGATCTTTTCAACATCTACATCCATGCCGACCCCACCCGGGAATACGACACGCCTTTCTCCGGCGTCTTCGCCCACCGGGTCATCCC
TTCCAAACCCACTCAGAGATATTCCCCTTCCCTCGCCGCCGCCGCCCGCCGCCTTCTCGCTCACGCGCTGCTGCATGATTCTGCTAATTCCATGTTTGCCCTTCTCTCTC
CCTCTTGCATCCCTCTCCATTCCTTCAATTTCACTTACAGAACGCTGATCCGATCCAAGAGGAGCTTCATCGAGGTTCTGAAAAATGAGATCGGCGCCTACGACAGGTGG
GCGGCGCGTGGACCCGACGTGATGCTTCCGGTGGTTAAATTCGCGGACGTTCGGATAGGGTCGCAGTTTTGGGTGCTGACGCGCCGGCACGCGCTGATTGTGGTGAGAGA
TAGAAGGGTTTGGTCAAAGTTTGACTTGCCTTGCGTGCGGACTCGGAGGCACATATGTTATCCTGAGGAGAATTATTTCCCCACCTTACTCAGCATGCGCGACCGCCGAG
GGCTTGTTACAGCTACACTTACACACGTGGACTGGAATGGGAGCACAGATGGCCACCCTCACACCTACGTGGCATCTGACGTGGGCCCCGATCTCATTCGCACTTTTCGG
ATCGCCCGGCCCAGATACGGCGACGGCGGAATGAAAATAAGTATTAGAAAGAGAAATGGAAACGCCGGCCGCCGGAATTCGTCGTCGTCGTACGCCGTTAAATCTATCCG
TCGGCACCCGTTCTTGTTTGCGAGGAAATTCTCCGCCGATGCACTCCAGCCGTTGATGAACATATCCAGTGACGTCATCTTTAAAGATTGA
Protein sequenceShow/hide protein sequence
MLSPNPLSLICALLLCLPLAIIFTLNSPTTINSDYPFIFPFHYSLYIPNTHRRITLFPLPSPTPPPPPPPEDDHLLFPLAARVNPTPSPTRKLAFMFLTNSPLPFAPLWE
LFFKNIPPDLFNIYIHADPTREYDTPFSGVFAHRVIPSKPTQRYSPSLAAAARRLLAHALLHDSANSMFALLSPSCIPLHSFNFTYRTLIRSKRSFIEVLKNEIGAYDRW
AARGPDVMLPVVKFADVRIGSQFWVLTRRHALIVVRDRRVWSKFDLPCVRTRRHICYPEENYFPTLLSMRDRRGLVTATLTHVDWNGSTDGHPHTYVASDVGPDLIRTFR
IARPRYGDGGMKISIRKRNGNAGRRNSSSSYAVKSIRRHPFLFARKFSADALQPLMNISSDVIFKD