; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009164 (gene) of Snake gourd v1 genome

Gene IDTan0009164
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein, putative
Genome locationLG02:95552796..95554865
RNA-Seq ExpressionTan0009164
SyntenyTan0009164
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132078.1 uncharacterized protein LOC111005037 [Momordica charantia]4.4e-17478.34Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIP--KTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLT
        ML PNPLSLICALLLC PLAILFT++         PSDYPLLFP  SLY P   THRKIT+FP+   PPPP+DDDFLFPLAARVNSTPSPT KLAFMFLT
Subjt:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIP--KTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLT

Query:  NSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTL
         SPLPFAPLWELFF+N+PP+ +N+YIHADPT +Y+PPFSG+FAHRLIPSKPS R +PTL +AARRLLAHALLHDSANSMFALLSPSCIPLHSF FTY TL
Subjt:  NSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTL

Query:  IRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLT
        IRS KSFIEVLKNEIG YDRWAARGP+VMLPVVKLADFRIGSQFW+LTR+HAR+VV D+RVWSKFDLPCVR +TCYPEENYFPTLL+M D  GL+ ATLT
Subjt:  IRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLT

Query:  HVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        HVDW G FDGHPRTY AS+VGP LIR LRIARPRYGDGGM+I        RTA  R+SSS+  AKS+  HPFLFARKFSADSLQPLMNIS+D+IFKD
Subjt:  HVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD

XP_022951447.1 uncharacterized protein LOC111454260 [Cucurbita moschata]2.2e-18681.7Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPP--PPPEDDDFLFPLAARVNSTPSPTRKLAFMFLT
        M   +PLSLICALLLCLPLA+LFT+NSPT IN +  SD+P +FPL SLY+PKTHRKITLF IPSPP  PPPE+DD LFPLA+RV+ TPSPTRKLAFMFLT
Subjt:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPP--PPPEDDDFLFPLAARVNSTPSPTRKLAFMFLT

Query:  NSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTL
        NSPLPFAPLWELFFKNIPPDLYNVYIHADPT EYDPPFSG+F+HR+IPSKP+QR TP+LT+AARRLLAHALLHDS+NSMFALLSPSCIPLHSF+FTY+TL
Subjt:  NSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTL

Query:  IRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLT
        I SKKSFIEVLKNEIGAYDRWAARGPD MLPVVKL D RIGSQFWVLTRRHAR VVRDK+VWSKFDLPCVR +TCYPEENYFPTLL+MWD RGLIPATLT
Subjt:  IRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLT

Query:  HVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRY--GDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        HVDWNGSFDGHPRTY  S+V P LIR+LR++R RY  GDG   I +R+R   +T+ RR SSSSSAAK YR H FLFARKFSAD+LQPLMNISSDVIFKD
Subjt:  HVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRY--GDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD

XP_023002372.1 uncharacterized protein LOC111496232 [Cucurbita maxima]3.6e-18480.2Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPS----PPPPPEDDDFLFPLAARVNSTPSPTRKLAFMF
        M   +PLSLICALLLCLPLA+LFT+NSPT IN +  SD+P +FPL SLY+PKTHRKITLF IPS    PPPPPE+DD LFPLAARV+  PSPTRKLAFMF
Subjt:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPS----PPPPPEDDDFLFPLAARVNSTPSPTRKLAFMF

Query:  LTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYR
        LTNSPLPFAPLWELFFKNIPPDLYNVYIH DPT EYDPPFSG+F+HR+IPSKP+QR T +LT+AARRLLAHALLHDS+NSMFALLSPSCIPLHSF+FTY+
Subjt:  LTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYR

Query:  TLIRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPAT
        TLIRSKKSFIEVLKNEIGAYDRWAARGPD MLPVVKL D RIGSQFW LTRRHAR VV+DK+VW+KFDLPCVR +TCYPEENYFPTLL+MWDR+GLIPAT
Subjt:  TLIRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPAT

Query:  LTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        LTHVDWNG FDGHPRTY  ++V P LIR+LR+AR RYGDG   I +R+    +T+ RR SSSSSAAK YR H FLFARKFSAD+LQPLMNISSDVIFKD
Subjt:  LTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD

XP_023537126.1 uncharacterized protein LOC111798298 [Cucurbita pepo subsp. pepo]3.1e-18881.8Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPP------PPPEDDDFLFPLAARVNSTPSPTRKLAF
        M   +PLSLICALLLCLPLA+LFT+NSPT IN +  SD+P +FPL SLY+PKTHRKITLF IPSPP      PPPE+DD LFPLA+RV+ TPSPTRKLAF
Subjt:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPP------PPPEDDDFLFPLAARVNSTPSPTRKLAF

Query:  MFLTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFT
        MFLTNSPLPFAPLWELFFKNIPPDLYNVYIHADPT EYDPPFSG+F+HR+IPSKP+QR TP+LT+AARRLLAHALLHDS+NSMFALLSPSCIPLHSF+FT
Subjt:  MFLTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFT

Query:  YRTLIRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIP
        Y+TLI SKKSFIEVLKNEIGAYDRWAARGPD MLPVVKL D RIGSQFWVLTRRHAR VVRDK+VWSKFDLPCVR +TCYPEENYFPTLL+MWD RGLIP
Subjt:  YRTLIRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIP

Query:  ATLTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFK
        ATLTHVDWNGSFDGHPRTY  S+V P LIR+LR++R RYGDGG+++ IR R  N+T+ RR SSSSSAAK YR H FLFARKFSAD+LQPLMNISSDVIFK
Subjt:  ATLTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFK

Query:  D
        D
Subjt:  D

XP_038885542.1 glycosyltransferase BC10 [Benincasa hispida]2.2e-17377.97Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLTNS
        ML P+ L LIC  LLCLPLAILFTIN PT I+ ++ S YP +FP  SLY         L P P PPPPPEDDD LFPLAA VNSTPSPT KLAF+FLTNS
Subjt:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLTNS

Query:  PLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTLIR
        PLPFAPLWELFFKNIPPDLYN+YIHADPT +YD PFSG+FAHR+IPSKP+QR +P+L++AARRLLAHALLHDS+NSMFALLSPSCIPLHSF+FTYRTL R
Subjt:  PLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTLIR

Query:  SKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLTHV
        SKKSFIEVLK+EIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVL RRHARIV  D+RVW KF+LPCVRR+TCYPEENYFPTLL+MWD+RGL+PATLTHV
Subjt:  SKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLTHV

Query:  DWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        DWNGSFDGHPRTY+ASEVGPKLIR LR+ARPRYGD G ++ IRMRN             SAAKS+R HPFLFARKFSA SLQ LMNISSD IF+D
Subjt:  DWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD

TrEMBL top hitse value%identityAlignment
A0A1S3BC61 uncharacterized protein LOC1034883179.2e-17076.71Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLTNS
        ML PNPLSLI ALLLCL LAI FT ++PT    ++  DYP +FP  SLY    HRKITL P PS PPPPEDDD LFPLAA VNSTPSPT KLAF+FLTNS
Subjt:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLTNS

Query:  PLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTLIR
        PLPFAPLWELFFKNIPPDL+NVYIHADPT  YDPPFSG+FA+R+IPSKP+QR +P+L+ AARRLLAHALLHDSANSMFALLSPSCIPLHSF+FTY+TLI 
Subjt:  PLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTLIR

Query:  SKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLTHV
        SKKSFIEVLK+E GAYDRWAARGPDVMLP+VKLADFRIGSQFWVL RRHARIVV+DK VWSKFDLPCVR++TCYPEENYFPTLL+MWDRRGL+PATLTHV
Subjt:  SKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLTHV

Query:  DWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        +WNGSFDGHPRTYVAS+VGP LIR LR ARPRYGDGG ++ +R+R       +      S  K    HPFLFARKFSADSL  LMNI++D I KD
Subjt:  DWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD

A0A5A7VHH3 Putative Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein9.2e-17076.71Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLTNS
        ML PNPLSLI ALLLCL LAI FT ++PT    ++  DYP +FP  SLY    HRKITL P PS PPPPEDDD LFPLAA VNSTPSPT KLAF+FLTNS
Subjt:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLTNS

Query:  PLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTLIR
        PLPFAPLWELFFKNIPPDL+NVYIHADPT  YDPPFSG+FA+R+IPSKP+QR +P+L+ AARRLLAHALLHDSANSMFALLSPSCIPLHSF+FTY+TLI 
Subjt:  PLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTLIR

Query:  SKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLTHV
        SKKSFIEVLK+E GAYDRWAARGPDVMLP+VKLADFRIGSQFWVL RRHARIVV+DK VWSKFDLPCVR++TCYPEENYFPTLL+MWDRRGL+PATLTHV
Subjt:  SKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLTHV

Query:  DWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        +WNGSFDGHPRTYVAS+VGP LIR LR ARPRYGDGG ++ +R+R       +      S  K    HPFLFARKFSADSL  LMNI++D I KD
Subjt:  DWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD

A0A6J1BV94 uncharacterized protein LOC1110050372.1e-17478.34Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIP--KTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLT
        ML PNPLSLICALLLC PLAILFT++         PSDYPLLFP  SLY P   THRKIT+FP+   PPPP+DDDFLFPLAARVNSTPSPT KLAFMFLT
Subjt:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIP--KTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLT

Query:  NSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTL
         SPLPFAPLWELFF+N+PP+ +N+YIHADPT +Y+PPFSG+FAHRLIPSKPS R +PTL +AARRLLAHALLHDSANSMFALLSPSCIPLHSF FTY TL
Subjt:  NSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTL

Query:  IRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLT
        IRS KSFIEVLKNEIG YDRWAARGP+VMLPVVKLADFRIGSQFW+LTR+HAR+VV D+RVWSKFDLPCVR +TCYPEENYFPTLL+M D  GL+ ATLT
Subjt:  IRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLT

Query:  HVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        HVDW G FDGHPRTY AS+VGP LIR LRIARPRYGDGGM+I        RTA  R+SSS+  AKS+  HPFLFARKFSADSLQPLMNIS+D+IFKD
Subjt:  HVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD

A0A6J1GHM6 uncharacterized protein LOC1114542601.1e-18681.7Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPP--PPPEDDDFLFPLAARVNSTPSPTRKLAFMFLT
        M   +PLSLICALLLCLPLA+LFT+NSPT IN +  SD+P +FPL SLY+PKTHRKITLF IPSPP  PPPE+DD LFPLA+RV+ TPSPTRKLAFMFLT
Subjt:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPP--PPPEDDDFLFPLAARVNSTPSPTRKLAFMFLT

Query:  NSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTL
        NSPLPFAPLWELFFKNIPPDLYNVYIHADPT EYDPPFSG+F+HR+IPSKP+QR TP+LT+AARRLLAHALLHDS+NSMFALLSPSCIPLHSF+FTY+TL
Subjt:  NSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTL

Query:  IRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLT
        I SKKSFIEVLKNEIGAYDRWAARGPD MLPVVKL D RIGSQFWVLTRRHAR VVRDK+VWSKFDLPCVR +TCYPEENYFPTLL+MWD RGLIPATLT
Subjt:  IRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLT

Query:  HVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRY--GDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        HVDWNGSFDGHPRTY  S+V P LIR+LR++R RY  GDG   I +R+R   +T+ RR SSSSSAAK YR H FLFARKFSAD+LQPLMNISSDVIFKD
Subjt:  HVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRY--GDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD

A0A6J1KJC2 uncharacterized protein LOC1114962321.7e-18480.2Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPS----PPPPPEDDDFLFPLAARVNSTPSPTRKLAFMF
        M   +PLSLICALLLCLPLA+LFT+NSPT IN +  SD+P +FPL SLY+PKTHRKITLF IPS    PPPPPE+DD LFPLAARV+  PSPTRKLAFMF
Subjt:  MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPS----PPPPPEDDDFLFPLAARVNSTPSPTRKLAFMF

Query:  LTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYR
        LTNSPLPFAPLWELFFKNIPPDLYNVYIH DPT EYDPPFSG+F+HR+IPSKP+QR T +LT+AARRLLAHALLHDS+NSMFALLSPSCIPLHSF+FTY+
Subjt:  LTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYR

Query:  TLIRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPAT
        TLIRSKKSFIEVLKNEIGAYDRWAARGPD MLPVVKL D RIGSQFW LTRRHAR VV+DK+VW+KFDLPCVR +TCYPEENYFPTLL+MWDR+GLIPAT
Subjt:  TLIRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPAT

Query:  LTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        LTHVDWNG FDGHPRTY  ++V P LIR+LR+AR RYGDG   I +R+    +T+ RR SSSSSAAK YR H FLFARKFSAD+LQPLMNISSDVIFKD
Subjt:  LTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC101.9e-2628.57Show/hide
Query:  ARVNSTPSP--TRKLAFMFLTNSPLPFAPLWELFFKNIPPDLYNVYIHADP--TAEYDPPFSGIFAHRLIPSKPSQRL-TPTLTSAARRLLAHALLHDSA
        A V   P P    +LAF+F+  + LP   +W+ FF+      +++++H+ P          SG F +R + +         ++  A R LLAHA L D  
Subjt:  ARVNSTPSP--TRKLAFMFLTNSPLPFAPLWELFFKNIPPDLYNVYIHADP--TAEYDPPFSGIFAHRLIPSKPSQRL-TPTLTSAARRLLAHALLHDSA

Query:  NSMFALLSPSCIPLHSFDFTYRTLIRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRR----
        N  F  +S SC+PL++F++TY  ++ S  SF++         D  A R    M P++ + ++R GSQ+ VLTR+HA +VV D+ V  +F   C RR    
Subjt:  NSMFALLSPSCIPLHSFDFTYRTLIRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRR----

Query:  ----------------NTCYPEENYFPTLLNMWD-RRGLIPATLTHVDWNGSFD-------GHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRN
                        + C P+E+Y  TLL        L   ++TH  W+ S          HP TY  S+  P L+++++     Y +           
Subjt:  ----------------NTCYPEENYFPTLLNMWD-RRGLIPATLTHVDWNGSFD-------GHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRN

Query:  SNRTASRRDSSSSSAAKSYRPHP-FLFARKFSADSLQPLMNIS
           T +R++  +S+     +P P FLFARKF+  +   L+++S
Subjt:  SNRTASRRDSSSSSAAKSYRPHP-FLFARKFSADSLQPLMNIS

Arabidopsis top hitse value%identityAlignment
AT3G52060.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.3e-7246.06Show/hide
Query:  VNSTPSPTRKLAFMFLTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPF-SGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFAL
        ++ +P+P  K+AF+FLTNS L F PLWE FF+    DLYNVYIHADPT+   P   S     + IP++ + R +PTL SA RRLLA+A+L D  N  FAL
Subjt:  VNSTPSPTRKLAFMFLTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPF-SGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFAL

Query:  LSPSCIPLHSFDFTYRTLI--RSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEEN
        +S  CIPLHSF + +  L     ++SFIE+L +E     R+ ARG D MLP ++  DFR+GSQF+VL +RHA +V++++++W KF LPC+   +CYPEE+
Subjt:  LSPSCIPLHSFDFTYRTLI--RSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEEN

Query:  YFPTLLNMWDRRGLIPATLTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSA
        YFPTLL++ D +G    TLT V+W GS  GHP TY ASE+ P+LI +L                          RR +SS           + FARKF+ 
Subjt:  YFPTLLNMWDRRGLIPATLTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSA

Query:  DSLQPLMNISSDVIFKD
        +SLQPLM I+  VIF+D
Subjt:  DSLQPLMNISSDVIFKD

AT3G52060.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.3e-7246.06Show/hide
Query:  VNSTPSPTRKLAFMFLTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPF-SGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFAL
        ++ +P+P  K+AF+FLTNS L F PLWE FF+    DLYNVYIHADPT+   P   S     + IP++ + R +PTL SA RRLLA+A+L D  N  FAL
Subjt:  VNSTPSPTRKLAFMFLTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPF-SGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFAL

Query:  LSPSCIPLHSFDFTYRTLI--RSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEEN
        +S  CIPLHSF + +  L     ++SFIE+L +E     R+ ARG D MLP ++  DFR+GSQF+VL +RHA +V++++++W KF LPC+   +CYPEE+
Subjt:  LSPSCIPLHSFDFTYRTLI--RSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEEN

Query:  YFPTLLNMWDRRGLIPATLTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSA
        YFPTLL++ D +G    TLT V+W GS  GHP TY ASE+ P+LI +L                          RR +SS           + FARKF+ 
Subjt:  YFPTLLNMWDRRGLIPATLTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSA

Query:  DSLQPLMNISSDVIFKD
        +SLQPLM I+  VIF+D
Subjt:  DSLQPLMNISSDVIFKD

AT4G32290.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.0e-11654.14Show/hide
Query:  MLPPNPLSLICALLLCLPLAILFTINS--PTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNST--PSPTRKLAFMF
        M+ P    L+CAL LCLP+A++FT+      +I+P        +F L+S  +P +        I    P P++DD L  L++RVN    P  TRK+AFM+
Subjt:  MLPPNPLSLICALLLCLPLAILFTINS--PTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNST--PSPTRKLAFMF

Query:  LTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYR
        LT SPLPFAPLWE+FF  I  +LYNVY+HADPT EYDPPFSG+F +R+I SKPS R TPTLT+AARRLLAHALL D  N MFA++SPSC+P+ SFDFTY+
Subjt:  LTNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYR

Query:  TLIRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPAT
        TL+ S+KSFIE+LK+E   +DRW A G   MLP VKL +FRIGSQFWVL RRHAR+V RD+R+W KF+  CVR ++CYPEE+YFPTLLNM D RG +PAT
Subjt:  TLIRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPAT

Query:  LTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        LTHVDW  +  GHPR Y   EV P+L+  LR  RPRYG+ G+                  + S  +K  R  PFLFARKFS  +L+PL+ ++  V+F D
Subjt:  LTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD

AT5G22070.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.7e-6739.05Show/hide
Query:  NPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNS-------TPSPTRKLAFMFL
        N   L  +LLLCLP    F   +P +  P  P +                   +L PI        DD  LF  AA   S        P+P  K+AF+FL
Subjt:  NPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNS-------TPSPTRKLAFMFL

Query:  TNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSG-IFAHRLIP-SKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTY
        TNS L FAP+W+ FF      LYNVY+HADP      P +G +F +  I  +K + R +PTL SA RRLLA A L D AN+ FA+LS  CIPLHSF++ Y
Subjt:  TNSPLPFAPLWELFFKNIPPDLYNVYIHADPTAEYDPPFSG-IFAHRLIP-SKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTY

Query:  RTLIRSK--------------------KSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYP
         +L  S                     +SF+E++ +E   + R+ ARG   M+P V    FR+GSQF+V+TRRHA + ++D+ +W KF LPC R + CYP
Subjt:  RTLIRSK--------------------KSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYP

Query:  EENYFPTLLNMWDRRGLIPATLTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARK
        EE+YFPTLLNM D  G    TLT V+W G+  GHP TY   EV P+LI+                  R+R SN ++S                 + FARK
Subjt:  EENYFPTLLNMWDRRGLIPATLTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARK

Query:  FSADSLQPLMNISSDVIFKD
        F+ D L+PL+ I+  VIF+D
Subjt:  FSADSLQPLMNISSDVIFKD

AT5G25330.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.2e-10051.64Show/hide
Query:  LSLICALLLCLPLAILFTINSPTI---INPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSP--TRKLAFMFLTNSP
        L+L   LL+C+PL ++ T+ SP +   +    P+   +  P ++L  P+    IT  P+       + D+ L   A++ N  PSP   +KLAFMFLT + 
Subjt:  LSLICALLLCLPLAILFTINSPTI---INPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSP--TRKLAFMFLTNSP

Query:  LPFAPLWELFFKNIP--PDLYNVYIHADPTAEYDPPFSGIFAHRLIP-SKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTL
        LP APLWELFF        LYNVY+H DPT ++ P   G F +R+IP SKP+ R TPTL SAARRLLAHALL D +N MF LLSPSCIPLHSF+FTY+TL
Subjt:  LPFAPLWELFFKNIP--PDLYNVYIHADPTAEYDPPFSGIFAHRLIP-SKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTL

Query:  IRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLT
        + S KSFIE+LK+E G Y+RWAARGP  M P V   +FRIGSQFW LTR HA +VV D  +WSKF+  CVR + CYPEE+YFPTLLNM D +G + AT+T
Subjt:  IRSKKSFIEVLKNEIGAYDRWAARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLT

Query:  HVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD
        HVDW+ +  GHPRTY   EV  +LI+ LR ARPRYGDG           NRT               R  PFLFARKFS   +  LMNI+  VIF D
Subjt:  HVDWNGSFDGHPRTYVASEVGPKLIRTLRIARPRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCCCCCTAACCCACTTTCTCTAATCTGTGCATTGCTTCTTTGCTTACCTTTAGCCATTCTCTTTACCATCAACAGCCCCACCATCATTAATCCTCTCCACCCCTC
CGATTACCCCTTACTTTTCCCCTTGCACTCTCTTTACATCCCCAAGACCCACCGCAAAATTACCCTCTTCCCAATCCCTTCTCCCCCGCCCCCGCCGGAGGATGATGATT
TTTTGTTCCCACTCGCCGCACGAGTCAACTCGACCCCATCGCCAACTCGGAAGTTGGCCTTCATGTTTCTCACGAATTCCCCTCTCCCTTTCGCTCCTCTTTGGGAACTG
TTCTTCAAAAACATCCCTCCCGATCTTTATAATGTCTACATCCACGCCGACCCGACCGCAGAATATGACCCTCCTTTCTCCGGCATATTCGCCCACCGGCTCATCCCCTC
GAAACCGTCTCAGAGATTAACCCCTACTCTCACCTCCGCCGCCCGCCGCCTTCTCGCCCACGCGCTCCTGCATGATTCTGCTAATTCCATGTTTGCCCTTCTCTCCCCTT
CTTGCATCCCTCTCCATTCCTTCGATTTCACCTACAGGACGCTAATTCGATCGAAGAAGAGCTTCATTGAGGTTCTCAAAAATGAGATCGGCGCCTACGACAGGTGGGCG
GCGCGTGGTCCCGATGTGATGCTTCCGGTCGTTAAATTGGCGGATTTTCGTATTGGGTCGCAGTTTTGGGTCCTTACGCGCCGGCACGCGCGGATTGTGGTGAGAGATAA
GAGAGTTTGGTCCAAGTTTGACTTGCCCTGCGTGCGGCGGAACACGTGTTATCCTGAGGAGAATTACTTCCCCACCTTACTCAACATGTGGGACCGCCGAGGGCTTATTC
CAGCAACACTTACACACGTGGATTGGAATGGGAGTTTTGATGGCCACCCTCGCACCTACGTGGCTTCTGAAGTGGGCCCCAAACTCATACGTACTTTGAGAATCGCCCGG
CCCAGATACGGCGACGGCGGAATGAAAATAATGATCAGAATGAGAAATAGTAATAGAACCGCCAGCCGCCGGGATTCGTCGTCTTCCTCCGCCGCTAAATCTTACCGTCC
ACACCCGTTCTTGTTTGCGAGGAAATTCTCCGCCGATTCGCTCCAGCCGTTGATGAACATATCCAGTGACGTCATCTTTAAAGATTGA
mRNA sequenceShow/hide mRNA sequence
CTTCGCCACATCAAAATATATCTATTTTATTATTTTCAAAGTTCAAATCATCTGTTATTATAAGGCGTGAAAATGGATTTATCCGTGCGAAAATACCTTTTTCAATGGAC
CAACCTTTTCCCTTTTCATTTTTTTCTTTGTTTGAATCCTTATCCGACAGCTCCCACAAAATCACTATTTTGATTTGGAAACCCACCCGGCTCTGCCTCAATCCTGGCCG
TCCATTTTGCATATTGGCGGATTTTCACAGATCAACTTCCTTCTCAAAAATCCTCATTTCCATTTCCAACTGCTCCACTTCACCAAATCCTTTTCTTCTTCCTCTCTCAC
TCTCTCTCTCACACACACACAGTATTCTCCTCCAATTTAAGTGTGTCAAGTCTGAAACCTGCAACAAACAAACAAACCATGCTGCCCCCTAACCCACTTTCTCTAATCTG
TGCATTGCTTCTTTGCTTACCTTTAGCCATTCTCTTTACCATCAACAGCCCCACCATCATTAATCCTCTCCACCCCTCCGATTACCCCTTACTTTTCCCCTTGCACTCTC
TTTACATCCCCAAGACCCACCGCAAAATTACCCTCTTCCCAATCCCTTCTCCCCCGCCCCCGCCGGAGGATGATGATTTTTTGTTCCCACTCGCCGCACGAGTCAACTCG
ACCCCATCGCCAACTCGGAAGTTGGCCTTCATGTTTCTCACGAATTCCCCTCTCCCTTTCGCTCCTCTTTGGGAACTGTTCTTCAAAAACATCCCTCCCGATCTTTATAA
TGTCTACATCCACGCCGACCCGACCGCAGAATATGACCCTCCTTTCTCCGGCATATTCGCCCACCGGCTCATCCCCTCGAAACCGTCTCAGAGATTAACCCCTACTCTCA
CCTCCGCCGCCCGCCGCCTTCTCGCCCACGCGCTCCTGCATGATTCTGCTAATTCCATGTTTGCCCTTCTCTCCCCTTCTTGCATCCCTCTCCATTCCTTCGATTTCACC
TACAGGACGCTAATTCGATCGAAGAAGAGCTTCATTGAGGTTCTCAAAAATGAGATCGGCGCCTACGACAGGTGGGCGGCGCGTGGTCCCGATGTGATGCTTCCGGTCGT
TAAATTGGCGGATTTTCGTATTGGGTCGCAGTTTTGGGTCCTTACGCGCCGGCACGCGCGGATTGTGGTGAGAGATAAGAGAGTTTGGTCCAAGTTTGACTTGCCCTGCG
TGCGGCGGAACACGTGTTATCCTGAGGAGAATTACTTCCCCACCTTACTCAACATGTGGGACCGCCGAGGGCTTATTCCAGCAACACTTACACACGTGGATTGGAATGGG
AGTTTTGATGGCCACCCTCGCACCTACGTGGCTTCTGAAGTGGGCCCCAAACTCATACGTACTTTGAGAATCGCCCGGCCCAGATACGGCGACGGCGGAATGAAAATAAT
GATCAGAATGAGAAATAGTAATAGAACCGCCAGCCGCCGGGATTCGTCGTCTTCCTCCGCCGCTAAATCTTACCGTCCACACCCGTTCTTGTTTGCGAGGAAATTCTCCG
CCGATTCGCTCCAGCCGTTGATGAACATATCCAGTGACGTCATCTTTAAAGATTGATCATGAAAATGTAATTTACGTACACAGAACAGAATTATATAATTTGTGTAGTTT
CTTTTTTTTTTTCTTCATAAGATAGACGGGACAAACAAATCCAAGGATAAATAAAAGGGATGAGCACTGCAGTTCGACGAACTCGATTGATGTGAATCTTGAAAGTGTTG
ATTTGGGTCAACTATAAAGAAGTGGTGGATCTCAATCAGGTCTCCTTTTCATCTCGCTCTAGTTTTTTTTTTTGTTCGTTGTTATTGGTTTAAGAATTAAGAAAATAAAA
ATAAAAGTGAGGAGGAAAAGAGTGTGTTGAATTGTCTTGTCTAGAGATATGGATGTATGAAGTGGAGCGGGAACTTGAATAGACGTGGATGAGAAAAGGTGAAGCAAAAA
CAAATCCACAAAGGACGATGGGAATAATGAAGTGGAGTGGACTCGTGGGCCGAGTTAGCGAGTTAACTCAGGTATGTATTTTGTCCCGAC
Protein sequenceShow/hide protein sequence
MLPPNPLSLICALLLCLPLAILFTINSPTIINPLHPSDYPLLFPLHSLYIPKTHRKITLFPIPSPPPPPEDDDFLFPLAARVNSTPSPTRKLAFMFLTNSPLPFAPLWEL
FFKNIPPDLYNVYIHADPTAEYDPPFSGIFAHRLIPSKPSQRLTPTLTSAARRLLAHALLHDSANSMFALLSPSCIPLHSFDFTYRTLIRSKKSFIEVLKNEIGAYDRWA
ARGPDVMLPVVKLADFRIGSQFWVLTRRHARIVVRDKRVWSKFDLPCVRRNTCYPEENYFPTLLNMWDRRGLIPATLTHVDWNGSFDGHPRTYVASEVGPKLIRTLRIAR
PRYGDGGMKIMIRMRNSNRTASRRDSSSSSAAKSYRPHPFLFARKFSADSLQPLMNISSDVIFKD