; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022135 (gene) of Snake gourd v1 genome

Gene IDTan0022135
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationLG03:77033444..77038246
RNA-Seq ExpressionTan0022135
SyntenyTan0022135
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022926887.1 uncharacterized protein LOC111433868 isoform X1 [Cucurbita moschata]1.1e-20192.61Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
        MKKKAPVT GRSLIWFSWKLVITFSVALCIL LI+ HSSSRSDLASASLSRRLR  PD+ S RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS

Query:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST
         PGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLE ALED ANQRFVLLSDSCVPLYNF YIYSYLMASPRSFVDSFLD KEGRYNPKMS T
Subjt:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST

Query:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK
        IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVD SKGNMNIKLQKQHNCIPDEHYV TLLALNDLE ELE RTLTYTLWN+S TKMENK
Subjt:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK

Query:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        GWHPITFNYANAGPRQIKEIKGI+HVYY +E RTEWCRNNST VPCFLFARKFSQGAAMRLLS G+VSHFDASALLDKK
Subjt:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

XP_023003330.1 uncharacterized protein LOC111496969 isoform X1 [Cucurbita maxima]2.1e-20292.61Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
        MKKKAPVT GRSLIWFSWKLVITFSVALCIL LI+ HSSSRSDLASASLSRRLR PPD++S RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS

Query:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST
         PGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLE ALED ANQRFVLLSDSCVPLYNF YIYSYLMASPRSFVDSFLD KEGRYNPKMS T
Subjt:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST

Query:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK
        IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVD SKGNMNIKLQKQHNCIPDEHYV TLLALNDLE ELE RTLTYTLWN+S TKMENK
Subjt:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK

Query:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        GWHPITFNYANAGPR+IKEIKGI+HVYY +E RTEWCRNNST VPCFLFARKFSQGAAMRLLS G+VSHFDASALLDKK
Subjt:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

XP_023003332.1 uncharacterized protein LOC111496969 isoform X3 [Cucurbita maxima]2.1e-20292.61Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
        MKKKAPVT GRSLIWFSWKLVITFSVALCIL LI+ HSSSRSDLASASLSRRLR PPD++S RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS

Query:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST
         PGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLE ALED ANQRFVLLSDSCVPLYNF YIYSYLMASPRSFVDSFLD KEGRYNPKMS T
Subjt:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST

Query:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK
        IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVD SKGNMNIKLQKQHNCIPDEHYV TLLALNDLE ELE RTLTYTLWN+S TKMENK
Subjt:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK

Query:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        GWHPITFNYANAGPR+IKEIKGI+HVYY +E RTEWCRNNST VPCFLFARKFSQGAAMRLLS G+VSHFDASALLDKK
Subjt:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

XP_023517032.1 uncharacterized protein LOC111780911 isoform X1 [Cucurbita pepo subsp. pepo]1.4e-20192.35Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
        MKKKAPVT GRSLIWFSWKLVITFSVALCIL LI+ HSSSRSDL SASLSRRLR  PD+ S RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS

Query:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST
         PGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLE ALED ANQRFVLLSDSCVPLYNF YIYSYLMASPRSFVDSFLD KEGRYNPKMS T
Subjt:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST

Query:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK
        IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVD SKGNMNIKLQKQHNCIPDEHYV TLLALNDLEGELE RTLTYTLWN+S TKMENK
Subjt:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK

Query:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        GWHPITFNYA+AGPRQIKEIKGI+HVYY +E RTEWCRNNST VPCFLFARKFSQGAAMRLLS G+VSHFDASALLDKK
Subjt:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

XP_038883828.1 glycosyltransferase BC10-like isoform X2 [Benincasa hispida]3.9e-20492.88Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
        MKKKAP+ P RSLIWFSWKL+ITFS+A+CILALIRLHSSSR+DLASASLSRRLR P DS  GRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS

Query:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST
         PGFVFDESTTRSHFFFGRQL NSIQVAWGKSSMIAAERLLLE+ALED ANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMS  
Subjt:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST

Query:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK
        IPK KWRKGSQWISLIRSHAEV+VDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYV TLLALN+LEGELERRT+TYTLWNQSTT MENK
Subjt:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK

Query:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        GWHPITF+YANAGPRQIKEIKGINH+YYETEFRTEWCRNNST V CFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
Subjt:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

TrEMBL top hitse value%identityAlignment
A0A1S3B0S7 uncharacterized protein LOC103484630 isoform X15.9e-19890.53Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSS-SRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKA +TP R L WFSWKL++ FS+ALCILALI LHSS S +DLA+ASLSRR R P DS  GRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSS-SRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  STPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSS
        S PGFVFDESTTRSHFFFGRQL NSIQVAWGKSSMIAAERLLLE+ALED ANQRFVLLSDSCVPLYNFSYIYSYL+ASP+SFVDSFLDAKEGRYNPKMS 
Subjt:  STPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSS

Query:  TIPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMEN
         IPK KWRKGSQWISLIRSHAEV+VDDDIIFPIFGLFCKRRPPVDASKGNMN KLQKQHNCIPDEHYV TLLALN+LEGELERRT+TYTLWNQSTTKMEN
Subjt:  TIPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMEN

Query:  KGWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        KGWHPITF YANAGPRQIKEIKGI+HVYYETEFRTEWCRNNST VPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
Subjt:  KGWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

A0A5D3CPR8 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 22.0e-19890.79Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSS-SRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
        MKKKA +TP R L WFSWKL++ FS+ALCILALI LHSS S +DLA+ASLSRR R P DS  GRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSS-SRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIH

Query:  STPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSS
        S PGFVFDESTTRSHFFFGRQL NSIQVAWGKSSMIAAERLLLE+ALED ANQRFVLLSDSCVPLYNFSYIYSYLMASP+SFVDSFLDAKEGRYNPKMS 
Subjt:  STPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSS

Query:  TIPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMEN
         IPK KWRKGSQWISLIRSHAEV+VDDDIIFPIFGLFCKRRPPVDASKGNMN KLQKQHNCIPDEHYV TLLALN+LEGELERRT+TYTLWNQSTTKMEN
Subjt:  TIPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMEN

Query:  KGWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        KGWHPITF YANAGPRQIKEIKGI+HVYYETEFRTEWCRNNST VPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
Subjt:  KGWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

A0A6J1EFL2 uncharacterized protein LOC111433868 isoform X15.2e-20292.61Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
        MKKKAPVT GRSLIWFSWKLVITFSVALCIL LI+ HSSSRSDLASASLSRRLR  PD+ S RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS

Query:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST
         PGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLE ALED ANQRFVLLSDSCVPLYNF YIYSYLMASPRSFVDSFLD KEGRYNPKMS T
Subjt:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST

Query:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK
        IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVD SKGNMNIKLQKQHNCIPDEHYV TLLALNDLE ELE RTLTYTLWN+S TKMENK
Subjt:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK

Query:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        GWHPITFNYANAGPRQIKEIKGI+HVYY +E RTEWCRNNST VPCFLFARKFSQGAAMRLLS G+VSHFDASALLDKK
Subjt:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

A0A6J1KRG8 uncharacterized protein LOC111496969 isoform X11.0e-20292.61Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
        MKKKAPVT GRSLIWFSWKLVITFSVALCIL LI+ HSSSRSDLASASLSRRLR PPD++S RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS

Query:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST
         PGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLE ALED ANQRFVLLSDSCVPLYNF YIYSYLMASPRSFVDSFLD KEGRYNPKMS T
Subjt:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST

Query:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK
        IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVD SKGNMNIKLQKQHNCIPDEHYV TLLALNDLE ELE RTLTYTLWN+S TKMENK
Subjt:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK

Query:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        GWHPITFNYANAGPR+IKEIKGI+HVYY +E RTEWCRNNST VPCFLFARKFSQGAAMRLLS G+VSHFDASALLDKK
Subjt:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

A0A6J1KW74 uncharacterized protein LOC111496969 isoform X31.0e-20292.61Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
        MKKKAPVT GRSLIWFSWKLVITFSVALCIL LI+ HSSSRSDLASASLSRRLR PPD++S RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHS

Query:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST
         PGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLE ALED ANQRFVLLSDSCVPLYNF YIYSYLMASPRSFVDSFLD KEGRYNPKMS T
Subjt:  TPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSST

Query:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK
        IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVD SKGNMNIKLQKQHNCIPDEHYV TLLALNDLE ELE RTLTYTLWN+S TKMENK
Subjt:  IPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENK

Query:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK
        GWHPITFNYANAGPR+IKEIKGI+HVYY +E RTEWCRNNST VPCFLFARKFSQGAAMRLLS G+VSHFDASALLDKK
Subjt:  GWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKK

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC105.3e-9551.57Show/hide
Query:  GRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSTPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSC
        G  ++AFLF+ R  LPLD +W +FF       FSI++HS PGFV   +TTRS FF+ RQ+ NS+QV WG++SMI AER+LL  AL+D  N+RFV +SDSC
Subjt:  GRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSTPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSC

Query:  VPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSSTIPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKG---NMNIKLQKQH
        VPLYNF+Y Y Y+M+S  SFVDSF D K GRYNP+M   IP   WRKGSQW  L R HAEV+V+D+ + P F   C+RRP  +  +     +  +  K H
Subjt:  VPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSSTIPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKG---NMNIKLQKQH

Query:  NCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTK-MENKGWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGA
        NCIPDEHYV TLLA + LE EL RR++T++ W+ S++K  E +GWHP+T+  ++A P  +K IK I+++YYETE R EWC +N    PCFLFARKF++ A
Subjt:  NCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTK-MENKGWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGA

Query:  AMRLLSEGVVSHFDASAL
         ++LL   +++   AS +
Subjt:  AMRLLSEGVVSHFDASAL

Arabidopsis top hitse value%identityAlignment
AT1G11940.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.6e-13163.01Show/hide
Query:  PVTPGRSLIWFSWKLVITFSVALCILALIR--LHSSSRSDLASASLSRRLRSPPDSVSG-RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSTP
        P++    ++W  WKLVI FSVALC+LAL+R  L  +S + L+      R ++P    SG RPK+AFLFL RR+LPLDF+W  FF+  D ANFSIYIHS P
Subjt:  PVTPGRSLIWFSWKLVITFSVALCILALIR--LHSSSRSDLASASLSRRLRSPPDSVSG-RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSTP

Query:  GFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSSTIP
        GFVF+E TTRS +F+ RQL NSI+V WG+SSMI AERLLL SALED +NQRFVLLSD C PLY+F YIY YL++SPRSFVDSFL  KE RY+ KMS  IP
Subjt:  GFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSSTIP

Query:  KGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENKGW
        + KWRKGSQWI+LIRSHAEVIV+D I+FP+F  FCKR PP+  ++  + +K QK+ NCIPDEHYV TLL +  LE E+ERRT+TYT+WN S TK E K W
Subjt:  KGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENKGW

Query:  HPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVV
        HP+TF   N+GP +IKEIK I+HVYYE+E RTEWC+ +S  VPCFLFARKF+  AAMR++SEG++
Subjt:  HPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVV

AT1G62305.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein9.2e-13565.29Show/hide
Query:  RSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLS-----RRLRSPPDSVSG-RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSTPGF
        R ++WF WK++IT S ALCILAL  ++  S S   + +LS      R R P    SG RPK+AFLFL RR+LPLDFLW  FF++ D  NFSIY+HS PGF
Subjt:  RSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLS-----RRLRSPPDSVSG-RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSTPGF

Query:  VFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSSTIPKG
        VFDES+TRSHFF+ RQL NSI+V WG+SSMIAAERLLL SALED +NQRFVLLSDSCVPLY+F YIY YL++SP+SFVDSFLD K+ RY  KM   I K 
Subjt:  VFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSSTIPKG

Query:  KWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENKGWHP
        KWRKGSQWISLIRSHAEVIV+DD +FP+F  FCKR  P+D  K  + +K +++HNCIPDEHYV TLL +  LE E+ERRT+TYT WN S  K E K WHP
Subjt:  KWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENKGWHP

Query:  ITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVV
        +TF   N GP +I+ IK INHVYYE+E+RTEWCR NS  VPCFLFARKF++GAAMRLLSEG++
Subjt:  ITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVV

AT1G62305.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein7.0e-11960.33Show/hide
Query:  RSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLS-----RRLRSPPDSVSG-RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSTPGF
        R ++WF WK++IT S ALCILAL  ++  S S   + +LS      R R P    SG RPK+AFLFL RR+LPLDFLW  FF++ D  NFSIY+HS PGF
Subjt:  RSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLS-----RRLRSPPDSVSG-RPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSTPGF

Query:  VFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSSTIPKG
        VFDES+TRSHFF+ RQL NSI+V WG+SSMIAAERLLL SALED +NQRFVLLS                        DSFLD K+ RY  KM   I K 
Subjt:  VFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSSTIPKG

Query:  KWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENKGWHP
        KWRKGSQWISLIRSHAEVIV+DD +FP+F  FCKR  P+D  K  + +K +++HNCIPDEHYV TLL +  LE E+ERRT+TYT WN S  K E K WHP
Subjt:  KWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENKGWHP

Query:  ITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVV
        +TF   N GP +I+ IK INHVYYE+E+RTEWCR NS  VPCFLFARKF++GAAMRLLSEG++
Subjt:  ITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVV

AT5G14550.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein6.9e-9848.93Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHS-----SSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFS
        MKKK  V+  + L  +  K+  T   A C    + + +      +R +  SASL   L+ P   +  RP+IAFLF+ R  LPL+F+W +FF+ G+   FS
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHS-----SSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFS

Query:  IYIHSTPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNP
        IY+HS PGFV +E+TTRS +F  RQL +SIQV WG+S+MI AER+LL  AL DS N RFV LSDSC+PLY+FSY Y+Y+M++P SFVDSF D K+ RYNP
Subjt:  IYIHSTPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNP

Query:  KMSSTIPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRP-PVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQST
        +M+  IP   WRKGSQW+ L R HAE++V+D  +FP+F   C+R+  P       +  +  K+HNCIPDEHYV TLL+   ++ EL RR+LT++ W+ S+
Subjt:  KMSSTIPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRP-PVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQST

Query:  TKM-ENKGWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVV
        +K  E +GWHP+T+ +++A P  I+ IKGI+++ YETE+R EWC +     PCFLFARKF++ AA+RLL E ++
Subjt:  TKM-ENKGWHPITFNYANAGPRQIKEIKGINHVYYETEFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVV

AT5G14550.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein8.4e-8048.01Show/hide
Query:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHS-----SSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFS
        MKKK  V+  + L  +  K+  T   A C    + + +      +R +  SASL   L+ P   +  RP+IAFLF+ R  LPL+F+W +FF+ G+   FS
Subjt:  MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHS-----SSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFS

Query:  IYIHSTPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNP
        IY+HS PGFV +E+TTRS +F  RQL +SIQV WG+S+MI AER+LL  AL DS N RFV LSDSC+PLY+FSY Y+Y+M++P SFVDSF D K+ RYNP
Subjt:  IYIHSTPGFVFDESTTRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNP

Query:  KMSSTIPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTT
        +M+  IP   WRKGSQW+ L R HAE++V+D  +FP+F   C+   P +           K+HNCIPDEHYV TLL+   ++ EL RR+LT++ W+ S++
Subjt:  KMSSTIPKGKWRKGSQWISLIRSHAEVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTT

Query:  KM-ENKGWHPITFNYANAGPRQIKEIK
        K  E +GWHP+T+ +++A P  I+ IK
Subjt:  KM-ENKGWHPITFNYANAGPRQIKEIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGAAAGCTCCTGTAACACCAGGCCGCAGCCTCATCTGGTTCAGCTGGAAGCTTGTCATCACCTTCTCTGTTGCACTTTGCATTCTCGCTCTTATCAGGCTTCA
TTCATCTTCCCGGAGCGACCTCGCCTCTGCCTCATTATCTCGTCGATTGCGTTCTCCTCCTGATTCAGTTTCGGGCCGCCCTAAGATTGCCTTCTTGTTTCTCACTCGTC
GAAACCTCCCTCTTGATTTCCTTTGGGGAAGCTTCTTCGAGAATGGCGACGTTGCGAACTTCTCGATTTATATTCACTCGACACCTGGATTCGTGTTCGATGAATCGACG
ACTAGGTCGCATTTCTTTTTTGGACGGCAATTGACGAATAGCATTCAGGTGGCCTGGGGAAAATCGAGTATGATTGCTGCAGAGAGGTTGTTACTTGAATCAGCGCTTGA
GGATTCTGCAAACCAGAGATTTGTTCTTCTATCCGACAGTTGCGTTCCACTATACAATTTTAGCTATATATACAGCTATCTCATGGCTTCTCCCAGGAGTTTCGTGGACA
GTTTTCTTGATGCAAAGGAGGGTCGCTATAACCCAAAAATGTCATCTACTATACCGAAGGGCAAATGGAGAAAAGGGTCGCAGTGGATCAGTTTGATTCGTAGTCATGCA
GAAGTAATCGTGGATGATGATATTATATTTCCTATCTTCGGCTTATTCTGCAAGCGAAGGCCACCTGTGGATGCCAGCAAGGGAAATATGAATATTAAACTTCAAAAGCA
GCACAACTGTATTCCAGATGAACATTATGTCGCAACATTGCTTGCTTTAAATGATCTTGAAGGTGAACTTGAACGAAGAACATTAACTTACACGCTATGGAATCAGTCAA
CCACCAAAATGGAGAACAAGGGCTGGCATCCTATTACGTTTAACTATGCTAATGCCGGACCTCGGCAGATTAAGGAAATAAAGGGAATCAACCATGTCTACTATGAGACT
GAATTCAGGACGGAATGGTGTCGAAATAATTCAACTTCTGTTCCTTGTTTTCTATTTGCCAGAAAATTTTCTCAGGGAGCTGCTATGCGATTATTAAGTGAGGGAGTTGT
GAGTCACTTTGATGCCTCAGCATTATTAGACAAAAAGCCCAAACTAACTTGA
mRNA sequenceShow/hide mRNA sequence
CCGCAAGGCCTGGAAGCTACGGCCTTCCCGCCATCCGCCCTGTCCTGGCCGCGCCAGATAGTAGCCCAGATGGCCCATTCAACAAACTCAACCGTACGACGACGTATTGA
GGTTATTGTTTATGGAGAAGTCTTCCAAAATACTCCAAATTCCGTTCACATTTTGCTACCGTCATCCCTTCTCTGATGCGTTTTTCTTTATTAATCCTCTCTCCGGCGTA
ACCGCCACCTCTTCCATGAAGAACAATGAAGAAGAAAGCTCCTGTAACACCAGGCCGCAGCCTCATCTGGTTCAGCTGGAAGCTTGTCATCACCTTCTCTGTTGCACTTT
GCATTCTCGCTCTTATCAGGCTTCATTCATCTTCCCGGAGCGACCTCGCCTCTGCCTCATTATCTCGTCGATTGCGTTCTCCTCCTGATTCAGTTTCGGGCCGCCCTAAG
ATTGCCTTCTTGTTTCTCACTCGTCGAAACCTCCCTCTTGATTTCCTTTGGGGAAGCTTCTTCGAGAATGGCGACGTTGCGAACTTCTCGATTTATATTCACTCGACACC
TGGATTCGTGTTCGATGAATCGACGACTAGGTCGCATTTCTTTTTTGGACGGCAATTGACGAATAGCATTCAGGTGGCCTGGGGAAAATCGAGTATGATTGCTGCAGAGA
GGTTGTTACTTGAATCAGCGCTTGAGGATTCTGCAAACCAGAGATTTGTTCTTCTATCCGACAGTTGCGTTCCACTATACAATTTTAGCTATATATACAGCTATCTCATG
GCTTCTCCCAGGAGTTTCGTGGACAGTTTTCTTGATGCAAAGGAGGGTCGCTATAACCCAAAAATGTCATCTACTATACCGAAGGGCAAATGGAGAAAAGGGTCGCAGTG
GATCAGTTTGATTCGTAGTCATGCAGAAGTAATCGTGGATGATGATATTATATTTCCTATCTTCGGCTTATTCTGCAAGCGAAGGCCACCTGTGGATGCCAGCAAGGGAA
ATATGAATATTAAACTTCAAAAGCAGCACAACTGTATTCCAGATGAACATTATGTCGCAACATTGCTTGCTTTAAATGATCTTGAAGGTGAACTTGAACGAAGAACATTA
ACTTACACGCTATGGAATCAGTCAACCACCAAAATGGAGAACAAGGGCTGGCATCCTATTACGTTTAACTATGCTAATGCCGGACCTCGGCAGATTAAGGAAATAAAGGG
AATCAACCATGTCTACTATGAGACTGAATTCAGGACGGAATGGTGTCGAAATAATTCAACTTCTGTTCCTTGTTTTCTATTTGCCAGAAAATTTTCTCAGGGAGCTGCTA
TGCGATTATTAAGTGAGGGAGTTGTGAGTCACTTTGATGCCTCAGCATTATTAGACAAAAAGCCCAAACTAACTTGAACTATCATTCATCACGATAGTCACTTAGTTCAC
CTAGTTATAATATTAAATATTCTATAGTTATTTTTTTTACTATAAATGTTGTAAGGTGAGTATCTCTAGAAATTAAGTTAAAATTAGTCACAGGCGTGTGTCGGTTAGAT
ATTCTTGGTGTTCAAAAGAAAACTAGGCTTGATAGTTCAATCAAATTTCCCGTTACCTACCTTTGCCCATCTAACTTGACTTTGTTGGTTCATGTTTTCATCAACTGTAT
CTTTAAATGAAAAAATGGATCGTACAAGAAAGTCACTCGGTTCATGGTGTGCCTGAATTATCAGCCAATTTAGAAGAGGCTTCCGAAGAACTGGTCAGTACATGTCGAAA
AGATGGTTGTTACCATATTGATACTTTTGATTTTGATGCCTGATGTAATATAGAGTAGTTTTGTACTTGTACACGGCCTATTTATTTTAGTGGCTTCTCTGCTGACATTA
TTGTTGGTGTAAAAAATCATACAACTATCATTTATATAAATTGACGGAC
Protein sequenceShow/hide protein sequence
MKKKAPVTPGRSLIWFSWKLVITFSVALCILALIRLHSSSRSDLASASLSRRLRSPPDSVSGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSTPGFVFDEST
TRSHFFFGRQLTNSIQVAWGKSSMIAAERLLLESALEDSANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSSTIPKGKWRKGSQWISLIRSHA
EVIVDDDIIFPIFGLFCKRRPPVDASKGNMNIKLQKQHNCIPDEHYVATLLALNDLEGELERRTLTYTLWNQSTTKMENKGWHPITFNYANAGPRQIKEIKGINHVYYET
EFRTEWCRNNSTSVPCFLFARKFSQGAAMRLLSEGVVSHFDASALLDKKPKLT