; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006912 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006912
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationscaffold10:38786814..38789466
RNA-Seq ExpressionSpg006912
SyntenySpg006912
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579216.1 Glycosyltransferase BC10, partial [Cucurbita argyrosperma subsp. sororia]9.8e-20089.71Show/hide
Query:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSG-RVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP
        MK+QIQ PK+I IHLSFFNVVPYILLFTAGITAGV LTFYLSNFSISLNLTQIP +  SP++G RVGLEEYLKPPEVMHDME+EELLWRASMAAGIR+FP
Subjt:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSG-RVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP

Query:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP
        FRRVPKVAFMFLTRGP+YLAPLWEEFFKGNEGLYSVY+HSDPSYN SFPE+PVFHGRRIPSK+VGWGKVNMIEAERRL+SNAL DISNERFVLLSEACIP
Subjt:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP

Query:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER
        LFNFST+YSFL+NSTMKSFIMSYDEP NVGRGRY  KMFPPISLKQWRKGSQWFEMDRDTA+AVVSDRKYFPVF KYCKGQCYSDEHYLPTLVNVLGW+R
Subjt:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER

Query:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
        N +RSLTWVDWSKGGPHPTRFSRSD++VEL QRLRNQT ECTKSK EGTGVCFLFARKF+PNTL RLMKIAPKA+HFGR
Subjt:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR

XP_008467193.1 PREDICTED: uncharacterized protein LOC103504601 [Cucumis melo]5.2e-20189.45Show/hide
Query:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVS-GRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP
        MKSQ Q PK+I IHL+FFNVVPYILLFT GI+AGV LTFYLSNFSISLNLTQIPS+ F PV+ GRVGLEE+LKPPEVMHDM++EELLWRASM AGI+KFP
Subjt:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVS-GRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP

Query:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP
        F+RVPK+AFMFLT+GPVYLAPLWEEFFKGNE LYSVYVHSDPSYN S PESPVFHGRRIPSKKVGWGKVNMIEAERRL+SNAL DISNERFVLLSE+CIP
Subjt:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP

Query:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER
        LFNFST+YSFLINSTMKSFIMSYDEPGNVGRGRY NKMFPPISLKQWRKGSQWFEMDRDTA+AVVSD+KYFPVFQ YCKGQCYSDEHYLPTLVNVLGW+R
Subjt:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER

Query:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
        N +RSLTWVDWSKGGPHP RFSRSDI+VEL QRLRNQTGEC KSKMEGTGVCFLFARKFAPNTLERLMKIAPKAM+FGR
Subjt:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR

XP_022939468.1 uncharacterized protein LOC111445363 [Cucurbita moschata]5.8e-20089.97Show/hide
Query:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSG-RVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP
        MKSQIQ PK+I IHLSFFNVVPYILLFTAGITAGV LTFYLSNFSI+LNLTQIP +  SPV+G RVGLEEYLKPPEVMHDME++ELLWRASMAAGIR+FP
Subjt:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSG-RVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP

Query:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP
        FRRVPKVAFMFLTRGP+YLAPLWEEFFKGNEGLYSVY+HSDPSYN SFPESPVFHGRRIPSK+VGWGKVNMIEAERRL+SNAL DISNERFVLLSEACIP
Subjt:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP

Query:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER
        LFNFST+YSFL+NSTMKSFIMSYDEP NVGRGRY  KMFPPISLKQWRKGSQWFEMDRDTA+AVVSDRKYFPVF KYCKGQCYSDEHY+PTLVNVLGW+R
Subjt:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER

Query:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
        N +RSLTWVDWSKGGPHPTRFSRSDI+VEL QRLRNQT ECTKSK EGTGVCFLFARKF+PNTL RLMKIAPKA+HFGR
Subjt:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR

XP_023550533.1 uncharacterized protein LOC111808649 [Cucurbita pepo subsp. pepo]2.0e-20090.24Show/hide
Query:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSG-RVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP
        MKSQIQ PK+I IHLSFFNVVPYILLFTAGITAGV LTFYLSNFSISLNLTQIP +  SPV+G RVGLEEYLKPPEVMHDME+EELLWRASMAAGIR+FP
Subjt:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSG-RVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP

Query:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP
        FRRVPKVAFMFL+RGP+YLAPLWEEFFKGNEGLYSVY+HSDPSYN SFPESPVFHGRRIPSK+VGWGKVNMIEAERRL+SNAL DISNERFVLLSEACIP
Subjt:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP

Query:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER
        LFN ST+YSFL+NSTMKSF+MSYDEP NVGRGRY  KMFPPISLKQWRKGSQWFEMDRDTA+AVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGW+R
Subjt:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER

Query:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
        N +RSLTWVDWSKGGPHPTRFSRSDI+VEL QRLRNQT ECTKSK EGTGVCFLFARKF+PNTL RLMKIAPKA+HFGR
Subjt:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR

XP_038906960.1 glycosyltransferase BC10-like [Benincasa hispida]3.0e-20491.82Show/hide
Query:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVS-GRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP
        MKSQIQ PK+IQIHLSFFNVVPY+LLFTAGITAGV  TFYLSNFSISLNLTQIPS+SF PV+ GRVGLEEYLKPPEVMHDME+EELLWRASMAA I+KFP
Subjt:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVS-GRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP

Query:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP
        FRRVPK+AFMFLT+GPVYLAPLWEEFFKGNEGLYSVYVHSDPSYN SFPESP FHGRR+PSKKVGWGKVNMIEAERRLLSNAL DISNERFVLLSEACIP
Subjt:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP

Query:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER
        LFNFST+YSFLINSTMKSFIMSYDEPGNVGRGRY NKMFPPISLKQWRKGSQWFEMDRDTAI VVSD+KYFPVFQ YCKGQCYSDEHYLPTLVNVLGW+R
Subjt:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER

Query:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
        N +RSLTWVDWSKGGPHP  FSRSDI+VELFQRLRNQT ECTK+KMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
Subjt:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR

TrEMBL top hitse value%identityAlignment
A0A0A0KT17 Uncharacterized protein2.7e-19586.81Show/hide
Query:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVS-GRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP
        MKSQ Q PK+I IHL+FF+VVPYILLFT GITAGV LTFYLSNF ISLNLTQI S+ F PV+ GRVGLEE+LKPPEVMHDM++EELLWRASM A I+KFP
Subjt:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVS-GRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP

Query:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP
        F+RVPK+AFMFLT+GPVYLAPLWEEFFKGNEGLYSVYVHSDPSYN S PE P FHGRRIPSKKVGWGKVNMIEAERRL+SNAL DISNERFVLLSE+CIP
Subjt:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP

Query:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER
        LFNFST+YSFLINSTMKSFIMSYDEP NVGRGRY NKMFPPISLKQWRKGSQWFE+DRDTA+AVVSD+KYFPVFQ YCKGQCYSDEHYLPTLVNVLGW+R
Subjt:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER

Query:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
        NG+RSLTWVDWSKGGPHP R+SRSDI+VEL QRLRNQTGEC KSKMEG GVCFLFARKFAPN LERL+ IAPKAM+FGR
Subjt:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR

A0A1S3CU90 uncharacterized protein LOC1035046012.5e-20189.45Show/hide
Query:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVS-GRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP
        MKSQ Q PK+I IHL+FFNVVPYILLFT GI+AGV LTFYLSNFSISLNLTQIPS+ F PV+ GRVGLEE+LKPPEVMHDM++EELLWRASM AGI+KFP
Subjt:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVS-GRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP

Query:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP
        F+RVPK+AFMFLT+GPVYLAPLWEEFFKGNE LYSVYVHSDPSYN S PESPVFHGRRIPSKKVGWGKVNMIEAERRL+SNAL DISNERFVLLSE+CIP
Subjt:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP

Query:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER
        LFNFST+YSFLINSTMKSFIMSYDEPGNVGRGRY NKMFPPISLKQWRKGSQWFEMDRDTA+AVVSD+KYFPVFQ YCKGQCYSDEHYLPTLVNVLGW+R
Subjt:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER

Query:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
        N +RSLTWVDWSKGGPHP RFSRSDI+VEL QRLRNQTGEC KSKMEGTGVCFLFARKFAPNTLERLMKIAPKAM+FGR
Subjt:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR

A0A5D3BMR2 Core-2/I-branching enzyme2.5e-20189.45Show/hide
Query:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVS-GRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP
        MKSQ Q PK+I IHL+FFNVVPYILLFT GI+AGV LTFYLSNFSISLNLTQIPS+ F PV+ GRVGLEE+LKPPEVMHDM++EELLWRASM AGI+KFP
Subjt:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVS-GRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP

Query:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP
        F+RVPK+AFMFLT+GPVYLAPLWEEFFKGNE LYSVYVHSDPSYN S PESPVFHGRRIPSKKVGWGKVNMIEAERRL+SNAL DISNERFVLLSE+CIP
Subjt:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP

Query:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER
        LFNFST+YSFLINSTMKSFIMSYDEPGNVGRGRY NKMFPPISLKQWRKGSQWFEMDRDTA+AVVSD+KYFPVFQ YCKGQCYSDEHYLPTLVNVLGW+R
Subjt:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER

Query:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
        N +RSLTWVDWSKGGPHP RFSRSDI+VEL QRLRNQTGEC KSKMEGTGVCFLFARKFAPNTLERLMKIAPKAM+FGR
Subjt:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR

A0A6J1FLQ9 uncharacterized protein LOC1114453632.8e-20089.97Show/hide
Query:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSG-RVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP
        MKSQIQ PK+I IHLSFFNVVPYILLFTAGITAGV LTFYLSNFSI+LNLTQIP +  SPV+G RVGLEEYLKPPEVMHDME++ELLWRASMAAGIR+FP
Subjt:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSG-RVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP

Query:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP
        FRRVPKVAFMFLTRGP+YLAPLWEEFFKGNEGLYSVY+HSDPSYN SFPESPVFHGRRIPSK+VGWGKVNMIEAERRL+SNAL DISNERFVLLSEACIP
Subjt:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP

Query:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER
        LFNFST+YSFL+NSTMKSFIMSYDEP NVGRGRY  KMFPPISLKQWRKGSQWFEMDRDTA+AVVSDRKYFPVF KYCKGQCYSDEHY+PTLVNVLGW+R
Subjt:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER

Query:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
        N +RSLTWVDWSKGGPHPTRFSRSDI+VEL QRLRNQT ECTKSK EGTGVCFLFARKF+PNTL RLMKIAPKA+HFGR
Subjt:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR

A0A6J1K052 uncharacterized protein LOC1114898412.3e-19488.39Show/hide
Query:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSG-RVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP
        MK+QIQ PK+I IHLSFFNVVPYILLF    TAGV LTFYLSNFSISLNLTQIP +  SPV+G RVGLEEYLKPPEVMHDME+EELLWRASMAAGIR+FP
Subjt:  MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSG-RVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFP

Query:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP
        FRRVPKVAFMFLTRGP+YLAPLW EFFKGNEGLYSVY+HS+PSYN SF ESPVFHGRRIPSK+V WG VNMIEAERRL+SNAL DISNERFVLLSEACIP
Subjt:  FRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIP

Query:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER
        LFNFSTIYSFL+NSTMKSFIMSYDEP NVGRGRY  KMFPPISLKQWRKGSQWFEMDRDTA+A+VSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGW+R
Subjt:  LFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWER

Query:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR
        N +RSLTWVDWSKGGPHPTRFSRSDI+VEL Q+LRNQT ECTKSK EGTGVCFLFARKF+PNTLERLMKIAPKA+HFGR
Subjt:  NGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC101.5e-3834.78Show/hide
Query:  KVAFMFLTRGPVYLAPLWEEFFKGN-EGLYSVYVHSDPSYNL--SFPESPVFHGRRI-PSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIPL
        ++AF+F+ R  + L  +W+ FF+G+ EG +S++VHS P + L  +   S  F+ R++  S +V WG+ +MIEAER LL++AL D  NERFV +S++C+PL
Subjt:  KVAFMFLTRGPVYLAPLWEEFFKGN-EGLYSVYVHSDPSYNL--SFPESPVFHGRRI-PSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIPL

Query:  FNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQ--------------------
        +NF+  Y ++++S+  SF+ S+    +   GRY+ +M P I ++ WRKGSQW  + R  A  VV D +  P FQK+C+ +                    
Subjt:  FNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQ--------------------

Query:  --CYSDEHYLPTLVNVLGWERN-GDRSLTWVDW--------SKGGPHPTRFSRSDINVELFQRLR----------NQTGECTKSKMEGTGVCFLFARKF
          C  DEHY+ TL+   G E     RS+T   W         + G HP  +  SD    L + ++          N+   CT +       CFLFARKF
Subjt:  --CYSDEHYLPTLVNVLGWERN-GDRSLTWVDW--------SKGGPHPTRFSRSDINVELFQRLR----------NQTGECTKSKMEGTGVCFLFARKF

Arabidopsis top hitse value%identityAlignment
AT1G10280.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.2e-9750.97Show/hide
Query:  LEEYLKPPEVMHDMEEEELLWRASMAAGIRKFPFRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWG
        ++ +++P  + H M ++EL WRASM     ++P+ RVPKVAFMFLTRGP+ + PLWE+FFKGNE   SVYVH+ P Y+++      F+ R+IPS++V WG
Subjt:  LEEYLKPPEVMHDMEEEELLWRASMAAGIRKFPFRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWG

Query:  KVNMIEAERRLLSNALFDISNERFVLLSEACIPLFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSD
           + +AE+RLL+NAL D SNERFVLLSE+C+P++NFST+Y++LINS   SF+ SYDEP   GRGRYS KM P I L  WRKGSQWFE++R  AI ++SD
Subjt:  KVNMIEAERRLLSNALFDISNERFVLLSEACIPLFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSD

Query:  RKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWERNGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERL
         KY+ +F+++C+  CY DEHY+PT +N+     N +RS+TWVDWS GGPHP  ++ ++I     Q +R    +C  ++ E T +CFLFARKF+P+ L  L
Subjt:  RKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWERNGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERL

Query:  MKIAPKAMHF
        M ++   + F
Subjt:  MKIAPKAMHF

AT1G68380.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein4.7e-9949.87Show/hide
Query:  FFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLT----------QIPSTSFSPVSGRVGLEEYLKP-PEVMHDMEEEELLWRASMAAGIRKFPFRRVPK
        F N++ Y  +   G+  G+ +   L   S   +LT            P    SP     GL+ +L P   +MHDME+ ELLWRASM   IR +P+ R+PK
Subjt:  FFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLT----------QIPSTSFSPVSGRVGLEEYLKP-PEVMHDMEEEELLWRASMAAGIRKFPFRRVPK

Query:  VAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIPLFNFST
        VAFMFLT GP+ LAPLWE FF+G+EGL+++YVH++ SY+   P+  VF+GRRIPSK+V WG  NM+EAERRLL+NAL DI+NERF+LLSE+CIPLFNFST
Subjt:  VAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIPLFNFST

Query:  IYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNV---LGWERNGD
        +YSFLI+ST+ + + SYD    +GR RY  +M+P I + QWRKGSQWFE+DR  A+ VVSD  Y+P+F+ Y +     DEHY+PTL+N+   LG  RN +
Subjt:  IYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNV---LGWERNGD

Query:  RSLTWVDWSKGGPHPTRFSRSDINVELFQRLR-NQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHF
        R+LTW DWSK   HP  F   ++NVE  + LR    G+C K+      +CFLFARKF+   L+ L+++A   M+F
Subjt:  RSLTWVDWSKGGPHPTRFSRSDINVELFQRLR-NQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHF

AT1G68390.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.4e-11454.33Show/hide
Query:  FFNVVPYILLFTAGITAGVCLTFYLSNFS--ISLNLTQI--------------PSTSFSPVS--GRVGLEEYLKPPE-VMHDMEEEELLWRASMAAGIRK
        F N++ Y L+   GI  G+ L   L NFS   SL++ +I              P    SP S   + GL+ +++PPE +MHDME+EELLWRASMA  I+ 
Subjt:  FFNVVPYILLFTAGITAGVCLTFYLSNFS--ISLNLTQI--------------PSTSFSPVS--GRVGLEEYLKPPE-VMHDMEEEELLWRASMAAGIRK

Query:  FPFRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEAC
        +PF R PKVAFMF+T+G + LA LWE FF+G+EGL+++YVHS PSYN S PE  VF GR IPSK+V WG VNM+EAE+RLL+NAL DISNERFVLLSE+C
Subjt:  FPFRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEAC

Query:  IPLFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLG-
        IPLFNF+T+YS+LINST ++ + SYD+ G VGRGRYS  M P + L+ WRKGSQW E+DR  A+ ++SDR Y+P+F  YC   CY+DEHY+PTL+N+   
Subjt:  IPLFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLG-

Query:  -WERNGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHF
           RN +R+LTWVDWSKGGPHP RF R ++  E  + LR+  GEC  +  E T +C+LFARKF P  L+RL++++   +HF
Subjt:  -WERNGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHF

AT3G21310.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.2e-9651.07Show/hide
Query:  TQIPSTSFSPVSGRVGLEEYLKPP-EVMHDMEEEELLWRASMAAGIRKFPFRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPE
        T  PST  S    R+ LE  +KPP    H M + ELLWRASM   I  +PF+RVPK+AFMFLT+GP+  APLWE FFKG+EG YS+YVH+ P+Y   FP 
Subjt:  TQIPSTSFSPVSGRVGLEEYLKPP-EVMHDMEEEELLWRASMAAGIRKFPFRRVPKVAFMFLTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPE

Query:  SPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIPLFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKG
        S VF+ R+IPS+ V WG+++M +AERRLL+NAL DISNE FVLLSEACIPL  F+ +Y + ++ +  SF+ S DE G  GRGRYS  M P +SL +WRKG
Subjt:  SPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIPLFNFSTIYSFLINSTMKSFIMSYDEPGNVGRGRYSNKMFPPISLKQWRKG

Query:  SQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWERNGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTG
        SQWFE++R  A+ +V D  Y+  F+++C+  CY DEHY PT++++   +   +R+LTW DWS+GG HP  F ++DI  +  ++L    G+      + + 
Subjt:  SQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWERNGDRSLTWVDWSKGGPHPTRFSRSDINVELFQRLRNQTGECTKSKMEGTG

Query:  VCFLFARKFAPNTLERLMKIAPKAMHF
        VC+LFARKFAP+ L+ L+K+APK + F
Subjt:  VCFLFARKFAPNTLERLMKIAPKAMHF

AT5G11730.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.2e-9950.14Show/hide
Query:  GVCLTFYLSNFSISLNLTQ-------IPSTSFSPVSGRVG----LEEYLKPPEV-MHDMEEEELLWRASMAAGIRKFPFRRVPKVAFMFLTRGPVYLAPL
        G+ LTF ++ F IS++  +       + + + S V  R G    L ++++PP V MH+M +EELLWRAS     +++PF+RVPKVAFMFLT+GP+ LA L
Subjt:  GVCLTFYLSNFSISLNLTQ-------IPSTSFSPVSGRVG----LEEYLKPPEV-MHDMEEEELLWRASMAAGIRKFPFRRVPKVAFMFLTRGPVYLAPL

Query:  WEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIPLFNFSTIYSFLINSTMKSFIMS
        WE F KG++GLYSVY+H  PS+   FP S VFH R+IPS+   WG+++M +AE+RLL+NAL D+SNE FVL+SE+CIPL+NF+TIYS+L  S   SF+ +
Subjt:  WEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIPLFNFSTIYSFLINSTMKSFIMS

Query:  YDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWERNGDRSLTWVDWSKGGPHPTRFS
        +D+PG  GRGRY+  M P + L +WRKGSQWFE++RD A  +V D  Y+P F+++C+  CY DEHY PT++ +       +RSLTWVDWS+GGPHP  F 
Subjt:  YDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWERNGDRSLTWVDWSKGGPHPTRFS

Query:  RSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHF
        RSDI    F ++ +  G         T +C+LFARKFAP+ LE L+ IAPK + F
Subjt:  RSDINVELFQRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGCCAAATTCAGATCCCAAAGATCATCCAAATCCACCTGAGCTTCTTCAACGTCGTCCCTTACATCCTCCTCTTCACCGCCGGAATCACCGCCGGAGTTTGCCT
CACTTTCTACCTCTCTAATTTCTCCATCAGCTTAAACCTCACCCAGATCCCCTCCACGAGTTTCTCTCCGGTGAGCGGGCGGGTCGGGTTGGAGGAGTATCTGAAGCCGC
CAGAAGTAATGCACGACATGGAGGAGGAGGAGCTTCTGTGGAGAGCTTCAATGGCGGCCGGAATTCGGAAATTTCCGTTCCGGCGAGTTCCGAAGGTGGCGTTCATGTTC
TTGACTCGAGGGCCGGTGTACTTGGCTCCTCTTTGGGAAGAGTTCTTCAAGGGAAATGAAGGGCTTTACTCTGTTTATGTTCATTCCGATCCTTCTTATAATCTCTCTTT
CCCTGAAAGCCCTGTTTTCCATGGCCGGAGAATTCCCAGCAAGAAAGTAGGATGGGGGAAGGTAAACATGATCGAGGCAGAACGTCGTCTACTATCAAACGCACTTTTCG
ACATTTCAAATGAGCGATTCGTTCTCCTCTCAGAAGCCTGCATTCCTCTCTTCAACTTCTCCACAATCTACTCCTTCCTCATTAACTCCACCATGAAAAGCTTCATCATG
AGCTACGACGAACCGGGCAACGTCGGTCGTGGTCGATACTCGAACAAAATGTTCCCGCCGATCTCCCTAAAACAATGGCGAAAAGGGTCGCAGTGGTTCGAGATGGACCG
AGACACCGCGATCGCCGTCGTCTCGGATCGAAAATACTTCCCCGTCTTCCAGAAGTACTGCAAAGGCCAATGCTATTCCGACGAGCACTACTTGCCGACGTTGGTCAATG
TTTTGGGCTGGGAGAGGAATGGCGATAGAAGTTTGACCTGGGTTGACTGGTCAAAGGGTGGCCCACATCCAACTAGATTTTCGCGATCGGATATCAACGTGGAGCTATTT
CAGAGGCTAAGGAATCAAACCGGCGAGTGTACAAAAAGTAAAATGGAGGGGACAGGTGTCTGCTTCCTATTCGCCAGAAAGTTCGCTCCAAATACTTTGGAGAGGTTAAT
GAAGATTGCACCAAAAGCCATGCACTTTGGGAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGCCAAATTCAGATCCCAAAGATCATCCAAATCCACCTGAGCTTCTTCAACGTCGTCCCTTACATCCTCCTCTTCACCGCCGGAATCACCGCCGGAGTTTGCCT
CACTTTCTACCTCTCTAATTTCTCCATCAGCTTAAACCTCACCCAGATCCCCTCCACGAGTTTCTCTCCGGTGAGCGGGCGGGTCGGGTTGGAGGAGTATCTGAAGCCGC
CAGAAGTAATGCACGACATGGAGGAGGAGGAGCTTCTGTGGAGAGCTTCAATGGCGGCCGGAATTCGGAAATTTCCGTTCCGGCGAGTTCCGAAGGTGGCGTTCATGTTC
TTGACTCGAGGGCCGGTGTACTTGGCTCCTCTTTGGGAAGAGTTCTTCAAGGGAAATGAAGGGCTTTACTCTGTTTATGTTCATTCCGATCCTTCTTATAATCTCTCTTT
CCCTGAAAGCCCTGTTTTCCATGGCCGGAGAATTCCCAGCAAGAAAGTAGGATGGGGGAAGGTAAACATGATCGAGGCAGAACGTCGTCTACTATCAAACGCACTTTTCG
ACATTTCAAATGAGCGATTCGTTCTCCTCTCAGAAGCCTGCATTCCTCTCTTCAACTTCTCCACAATCTACTCCTTCCTCATTAACTCCACCATGAAAAGCTTCATCATG
AGCTACGACGAACCGGGCAACGTCGGTCGTGGTCGATACTCGAACAAAATGTTCCCGCCGATCTCCCTAAAACAATGGCGAAAAGGGTCGCAGTGGTTCGAGATGGACCG
AGACACCGCGATCGCCGTCGTCTCGGATCGAAAATACTTCCCCGTCTTCCAGAAGTACTGCAAAGGCCAATGCTATTCCGACGAGCACTACTTGCCGACGTTGGTCAATG
TTTTGGGCTGGGAGAGGAATGGCGATAGAAGTTTGACCTGGGTTGACTGGTCAAAGGGTGGCCCACATCCAACTAGATTTTCGCGATCGGATATCAACGTGGAGCTATTT
CAGAGGCTAAGGAATCAAACCGGCGAGTGTACAAAAAGTAAAATGGAGGGGACAGGTGTCTGCTTCCTATTCGCCAGAAAGTTCGCTCCAAATACTTTGGAGAGGTTAAT
GAAGATTGCACCAAAAGCCATGCACTTTGGGAGATAG
Protein sequenceShow/hide protein sequence
MKSQIQIPKIIQIHLSFFNVVPYILLFTAGITAGVCLTFYLSNFSISLNLTQIPSTSFSPVSGRVGLEEYLKPPEVMHDMEEEELLWRASMAAGIRKFPFRRVPKVAFMF
LTRGPVYLAPLWEEFFKGNEGLYSVYVHSDPSYNLSFPESPVFHGRRIPSKKVGWGKVNMIEAERRLLSNALFDISNERFVLLSEACIPLFNFSTIYSFLINSTMKSFIM
SYDEPGNVGRGRYSNKMFPPISLKQWRKGSQWFEMDRDTAIAVVSDRKYFPVFQKYCKGQCYSDEHYLPTLVNVLGWERNGDRSLTWVDWSKGGPHPTRFSRSDINVELF
QRLRNQTGECTKSKMEGTGVCFLFARKFAPNTLERLMKIAPKAMHFGR