; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G026870 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G026870
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationchrH02:2653135..2656819
RNA-Seq ExpressionChy2G026870
SyntenyChy2G026870
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK13024.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 2 [Cucumis melo var. makuwa]4.97e-27597.89Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKALLTPPR LFWFSWKLLV FSLALCI AL+SLHSSPSTTDLA ASLSRR RPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
        SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
Subjt:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP

Query:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
        AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVD SKG MNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
Subjt:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN

Query:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
Subjt:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

XP_004134777.1 glycosyltransferase BC10 [Cucumis sativus]9.36e-28098.43Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKALLTPPRSLFWFSWKLLV FSLALCIFALVSLHSSPSTTDLA+ASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
        SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRF+LLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
Subjt:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP

Query:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
        AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
Subjt:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN

Query:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKKTN
        KGWHPITFTYANAGPRQ+KEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDK T+
Subjt:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKKTN

XP_008440033.1 PREDICTED: uncharacterized protein LOC103484630 isoform X1 [Cucumis melo]2.02e-27497.63Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKALLTPPR LFWFSWKLLV FSLALCI AL+SLHSSPSTTDLA ASLSRR RPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
        SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYL+ASPKSFVDSFLDAKEGRYNPKMSP
Subjt:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP

Query:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
        AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVD SKG MNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
Subjt:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN

Query:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
Subjt:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

XP_008440035.1 PREDICTED: uncharacterized protein LOC103484630 isoform X2 [Cucumis melo]2.43e-25191.58Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKALLTPPR LFWFSWKLLV FSLALCI AL+SLHSSPSTTDLA ASLSRR RPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
        SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDS                        FLDAKEGRYNPKMSP
Subjt:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP

Query:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
        AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVD SKG MNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
Subjt:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN

Query:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
Subjt:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

XP_038883828.1 glycosyltransferase BC10-like isoform X2 [Benincasa hispida]2.14e-26293.95Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKA L P RSL WFSWKLL+ FSLA+CI AL+ LHSS S TDLA+ASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
        SAPGFVFDESTTRSHFFFGRQL+NSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASP+SFVDSFLDAKEGRYNPKMSP
Subjt:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP

Query:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
        AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVD SKG MN KLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTT MEN
Subjt:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN

Query:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        KGWHPITF+YANAGPRQIKEIKGI+H+YYETEFRTEWCRNNSTFV CFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
Subjt:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

TrEMBL top hitse value%identityAlignment
A0A0A0KHS2 Uncharacterized protein3.6e-21998.43Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKALLTPPRSLFWFSWKLLV FSLALCIFALVSLHSSPSTTDLA+ASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
        SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRF+LLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
Subjt:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP

Query:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
        AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
Subjt:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN

Query:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKKTN
        KGWHPITFTYANAGPRQ+KEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDK T+
Subjt:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKKTN

A0A1S3AZR5 uncharacterized protein LOC103484630 isoform X28.5e-19791.58Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKALLTPPR LFWFSWKLLV FSLALCI AL+SLHSSPSTTDLA ASLSRR RPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
        SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLS                        DSFLDAKEGRYNPKMSP
Subjt:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP

Query:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
        AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVD SKG MNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
Subjt:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN

Query:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
Subjt:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

A0A1S3B0S7 uncharacterized protein LOC103484630 isoform X15.3e-21597.63Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKALLTPPR LFWFSWKLLV FSLALCI AL+SLHSSPSTTDLA ASLSRR RPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
        SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYL+ASPKSFVDSFLDAKEGRYNPKMSP
Subjt:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP

Query:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
        AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVD SKG MNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
Subjt:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN

Query:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
Subjt:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

A0A5D3CPR8 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 21.8e-21597.89Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKALLTPPR LFWFSWKLLV FSLALCI AL+SLHSSPSTTDLA ASLSRR RPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
        SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
Subjt:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP

Query:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
        AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVD SKG MNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
Subjt:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN

Query:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
Subjt:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

A0A6J1EFL2 uncharacterized protein LOC111433868 isoform X15.2e-19487.96Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKA +T  RSL WFSWKL++ FS+ALCI  L+  HSS S +DLA+ASLSRRLRP  D+F  RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP
        SAPGFVFDESTTRSHFFFGRQL NSIQVAWGKSSMIAAERLLLE ALEDPANQRFVLLSDSCVPLYNF YIYSYLMASP+SFVDSFLD KEGRYNPKMSP
Subjt:  SAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSP

Query:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN
         IPK KWRKGSQWISLIRSHAEV+VDDDIIFPIFGLFCKRRPPVD SKG MN KLQKQHNCIPDEHYVQTLLALN+LE ELE RT+TYTLWN+S TKMEN
Subjt:  AIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN

Query:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKKTN
        KGWHPITF YANAGPRQIKEIKGIDHVYY +E RTEWCRNNSTFVPCFLFARKFSQGAAMRLLS G+VSHFDASALLDKKTN
Subjt:  KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKKTN

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC103.9e-9851.79Show/hide
Query:  LATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLE
        +A A +     PP     G  ++AFLF+ R  LPLD +W +FF       FSI++HS PGFV   +TTRS FF+ RQ+ NS+QV WG++SMI AER+LL 
Subjt:  LATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLE

Query:  AALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPV
         AL+DP N+RFV +SDSCVPLYNF+Y Y Y+M+S  SFVDSF D K GRYNP+M P IP   WRKGSQW  L R HAEVVV+D+ + P F   C+RRP  
Subjt:  AALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPV

Query:  D---ESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTK-MENKGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRN
        +   +    +  +  K HNCIPDEHYVQTLLA + LE EL RR+VT++ W+ S++K  E +GWHP+T+  ++A P  +K IK ID++YYETE R EWC +
Subjt:  D---ESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTK-MENKGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRN

Query:  NSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASAL
        N    PCFLFARKF++ A ++LL   +++   AS +
Subjt:  NSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASAL

Arabidopsis top hitse value%identityAlignment
AT1G11940.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.3e-13061.39Show/hide
Query:  KKKALLTPPRS----LFWFSWKLLVIFSLALCIFAL--VSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANF
        K +  + PP S    + W  WKL++ FS+ALC+ AL  + L  +  TT     S++R   P       RPK+AFLFL RR+LPLDF+W  FF+  D ANF
Subjt:  KKKALLTPPRS----LFWFSWKLLVIFSLALCIFAL--VSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANF

Query:  SIYIHSAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYN
        SIYIHS PGFVF+E TTRS +F+ RQL NSI+V WG+SSMI AERLLL +ALED +NQRFVLLSD C PLY+F YIY YL++SP+SFVDSFL  KE RY+
Subjt:  SIYIHSAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYN

Query:  PKMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQST
         KMSP IP+ KWRKGSQWI+LIRSHAEV+V+D I+FP+F  FCKR PP+  ++  +  K QK+ NCIPDEHYVQTLL +  LE E+ERRTVTYT+WN S 
Subjt:  PKMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQST

Query:  TKMENKGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVV
        TK E K WHP+TFT  N+GP +IKEIK IDHVYYE+E RTEWC+ +S  VPCFLFARKF+  AAMR++SEG++
Subjt:  TKMENKGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVV

AT1G62305.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein8.3e-13665.29Show/hide
Query:  RSLFWFSWKLLVIFSLALCIFALVSL----HSSPSTTDLATASLSRRLRPPSDSFLG-RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGF
        R + WF WK+L+  S ALCI AL  +    +S+ +TT L+++    R R P   + G RPK+AFLFL RR+LPLDFLW  FF++ D  NFSIY+HS PGF
Subjt:  RSLFWFSWKLLVIFSLALCIFALVSL----HSSPSTTDLATASLSRRLRPPSDSFLG-RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGF

Query:  VFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSPAIPKS
        VFDES+TRSHFF+ RQL+NSI+V WG+SSMIAAERLLL +ALEDP+NQRFVLLSDSCVPLY+F YIY YL++SPKSFVDSFLD K+ RY  KM P I K 
Subjt:  VFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSPAIPKS

Query:  KWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMENKGWHP
        KWRKGSQWISLIRSHAEV+V+DD +FP+F  FCKR  P+D  K  +  K +++HNCIPDEHYVQTLL +  LE E+ERRTVTYT WN S  K E K WHP
Subjt:  KWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMENKGWHP

Query:  ITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVV
        +TFT  N GP +I+ IK I+HVYYE+E+RTEWCR NS  VPCFLFARKF++GAAMRLLSEG++
Subjt:  ITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVV

AT1G62305.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.9e-11960.06Show/hide
Query:  RSLFWFSWKLLVIFSLALCIFALVSL----HSSPSTTDLATASLSRRLRPPSDSFLG-RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGF
        R + WF WK+L+  S ALCI AL  +    +S+ +TT L+++    R R P   + G RPK+AFLFL RR+LPLDFLW  FF++ D  NFSIY+HS PGF
Subjt:  RSLFWFSWKLLVIFSLALCIFALVSL----HSSPSTTDLATASLSRRLRPPSDSFLG-RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGF

Query:  VFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSPAIPKS
        VFDES+TRSHFF+ RQL+NSI+V WG+SSMIAAERLLL +ALEDP+NQRFVLLS                        DSFLD K+ RY  KM P I K 
Subjt:  VFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSPAIPKS

Query:  KWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMENKGWHP
        KWRKGSQWISLIRSHAEV+V+DD +FP+F  FCKR  P+D  K  +  K +++HNCIPDEHYVQTLL +  LE E+ERRTVTYT WN S  K E K WHP
Subjt:  KWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMENKGWHP

Query:  ITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVV
        +TFT  N GP +I+ IK I+HVYYE+E+RTEWCR NS  VPCFLFARKF++GAAMRLLSEG++
Subjt:  ITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVV

AT5G14550.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein8.1e-9948.93Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALC----IFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFS
        MKKK  ++  + L+ +  K+      A C    +F      S  +  +  +ASL    +P  D    RP+IAFLF+ R  LPL+F+W +FF+ G+   FS
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALC----IFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFS

Query:  IYIHSAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNP
        IY+HS PGFV +E+TTRS +F  RQL +SIQV WG+S+MI AER+LL  AL D  N RFV LSDSC+PLY+FSY Y+Y+M++P SFVDSF D K+ RYNP
Subjt:  IYIHSAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNP

Query:  KMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRP-PVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQST
        +M+P IP   WRKGSQW+ L R HAE+VV+D  +FP+F   C+R+  P       +  +  K+HNCIPDEHYVQTLL+   ++ EL RR++T++ W+ S+
Subjt:  KMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRP-PVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQST

Query:  TKM-ENKGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVV
        +K  E +GWHP+T+ +++A P  I+ IKGID++ YETE+R EWC +     PCFLFARKF++ AA+RLL E ++
Subjt:  TKM-ENKGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVV

AT5G14550.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein3.8e-8048.01Show/hide
Query:  MKKKALLTPPRSLFWFSWKLLVIFSLALC----IFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFS
        MKKK  ++  + L+ +  K+      A C    +F      S  +  +  +ASL    +P  D    RP+IAFLF+ R  LPL+F+W +FF+ G+   FS
Subjt:  MKKKALLTPPRSLFWFSWKLLVIFSLALC----IFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFS

Query:  IYIHSAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNP
        IY+HS PGFV +E+TTRS +F  RQL +SIQV WG+S+MI AER+LL  AL D  N RFV LSDSC+PLY+FSY Y+Y+M++P SFVDSF D K+ RYNP
Subjt:  IYIHSAPGFVFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNP

Query:  KMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTT
        +M+P IP   WRKGSQW+ L R HAE+VV+D  +FP+F   C  RP           +  K+HNCIPDEHYVQTLL+   ++ EL RR++T++ W+ S++
Subjt:  KMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTT

Query:  KM-ENKGWHPITFTYANAGPRQIKEIK
        K  E +GWHP+T+ +++A P  I+ IK
Subjt:  KM-ENKGWHPITFTYANAGPRQIKEIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGAAAGCTCTTCTAACACCACCCCGCAGCCTGTTCTGGTTCAGCTGGAAGCTTCTCGTCATCTTCTCTCTTGCTCTTTGCATTTTCGCTCTTGTTAGC
CTTCATTCCTCACCTTCTACAACCGACCTCGCCACTGCTTCATTATCTCGTCGATTGCGTCCTCCTTCTGATTCATTTTTAGGACGACCTAAGATCGCTTTCTTG
TTTCTCACTCGTCGGAACCTTCCTCTTGATTTCCTTTGGGGAAGCTTTTTCGAGAATGGCGACGTTGCGAACTTCTCGATTTATATTCATTCCGCACCTGGATTT
GTTTTCGATGAATCGACTACAAGGTCGCATTTCTTTTTTGGACGGCAATTAGAGAATAGCATACAGGTGGCCTGGGGAAAGTCGAGTATGATTGCCGCAGAGAGG
TTATTACTTGAAGCAGCTCTTGAAGATCCAGCAAACCAGAGATTTGTTCTTCTCTCAGACAGTTGCGTTCCACTATACAACTTTAGCTATATATACAGCTATCTC
ATGGCTTCTCCAAAGAGTTTCGTGGACAGTTTTCTTGATGCAAAGGAGGGTCGCTATAATCCAAAAATGTCACCTGCTATACCTAAGAGCAAATGGAGGAAAGGT
TCCCAGTGGATCAGTTTGATTCGTAGTCATGCAGAAGTTGTTGTGGATGATGATATCATATTCCCAATCTTTGGATTATTCTGCAAGCGAAGGCCGCCTGTGGAT
GAAAGCAAAGGAATTATGAACACTAAACTTCAAAAGCAGCACAACTGTATTCCAGATGAACACTATGTCCAGACACTGCTTGCTTTAAATGAACTTGAAGGTGAA
CTTGAACGAAGAACAGTAACTTACACGCTATGGAATCAGTCAACCACCAAAATGGAGAATAAGGGATGGCATCCTATTACATTTACCTATGCTAATGCTGGACCT
CGGCAGATTAAGGAAATAAAGGGAATCGACCATGTTTACTACGAGACTGAATTCAGGACGGAATGGTGTCGAAATAACTCAACTTTTGTTCCTTGTTTTCTATTT
GCTAGAAAATTTTCTCAGGGAGCTGCTATGCGATTATTAAGTGAGGGAGTTGTAAGTCACTTTGATGCCTCAGCATTATTAGACAAAAAGACCAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGAAAGCTCTTCTAACACCACCCCGCAGCCTGTTCTGGTTCAGCTGGAAGCTTCTCGTCATCTTCTCTCTTGCTCTTTGCATTTTCGCTCTTGTTAGC
CTTCATTCCTCACCTTCTACAACCGACCTCGCCACTGCTTCATTATCTCGTCGATTGCGTCCTCCTTCTGATTCATTTTTAGGACGACCTAAGATCGCTTTCTTG
TTTCTCACTCGTCGGAACCTTCCTCTTGATTTCCTTTGGGGAAGCTTTTTCGAGAATGGCGACGTTGCGAACTTCTCGATTTATATTCATTCCGCACCTGGATTT
GTTTTCGATGAATCGACTACAAGGTCGCATTTCTTTTTTGGACGGCAATTAGAGAATAGCATACAGGTGGCCTGGGGAAAGTCGAGTATGATTGCCGCAGAGAGG
TTATTACTTGAAGCAGCTCTTGAAGATCCAGCAAACCAGAGATTTGTTCTTCTCTCAGACAGTTGCGTTCCACTATACAACTTTAGCTATATATACAGCTATCTC
ATGGCTTCTCCAAAGAGTTTCGTGGACAGTTTTCTTGATGCAAAGGAGGGTCGCTATAATCCAAAAATGTCACCTGCTATACCTAAGAGCAAATGGAGGAAAGGT
TCCCAGTGGATCAGTTTGATTCGTAGTCATGCAGAAGTTGTTGTGGATGATGATATCATATTCCCAATCTTTGGATTATTCTGCAAGCGAAGGCCGCCTGTGGAT
GAAAGCAAAGGAATTATGAACACTAAACTTCAAAAGCAGCACAACTGTATTCCAGATGAACACTATGTCCAGACACTGCTTGCTTTAAATGAACTTGAAGGTGAA
CTTGAACGAAGAACAGTAACTTACACGCTATGGAATCAGTCAACCACCAAAATGGAGAATAAGGGATGGCATCCTATTACATTTACCTATGCTAATGCTGGACCT
CGGCAGATTAAGGAAATAAAGGGAATCGACCATGTTTACTACGAGACTGAATTCAGGACGGAATGGTGTCGAAATAACTCAACTTTTGTTCCTTGTTTTCTATTT
GCTAGAAAATTTTCTCAGGGAGCTGCTATGCGATTATTAAGTGAGGGAGTTGTAAGTCACTTTGATGCCTCAGCATTATTAGACAAAAAGACCAACTAA
Protein sequenceShow/hide protein sequence
MKKKALLTPPRSLFWFSWKLLVIFSLALCIFALVSLHSSPSTTDLATASLSRRLRPPSDSFLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGF
VFDESTTRSHFFFGRQLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLMASPKSFVDSFLDAKEGRYNPKMSPAIPKSKWRKG
SQWISLIRSHAEVVVDDDIIFPIFGLFCKRRPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMENKGWHPITFTYANAGP
RQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKKTN