; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g32100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g32100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationchr3:22800278..22805644
RNA-Seq ExpressionMoc03g32100
SyntenyMoc03g32100
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8542728.1 hypothetical protein F0562_023880 [Nyssa sinensis]8.0e-22853.32Show/hide
Query:  IGFSFGIAISLNIKSFSSFTFHLRNFSLLTQP---PPPPLQLQPP----PPPVDCKTNCFIDPN-----VEPP-LMHTMNDDEVFWRASMVPLIKEFPYE
        +G S GI  SL ++SFS   F+L+  + L  P    PPPL L+PP    PPP        + PN     +E   LMH M+DDE+FWRA+MVP I+EFPY 
Subjt:  IGFSFGIAISLNIKSFSSFTFHLRNFSLLTQP---PPPPLQLQPP----PPPVDCKTNCFIDPN-----VEPP-LMHTMNDDEVFWRASMVPLIKEFPYE

Query:  RVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLDFSNQRFVLLSESCIP
        RVPKVAFMFL KG + LAPLWE FF GH+ L+SIY+H  P YNV   +P  SVFHGRRIPS+ VEWG+ SMI AERRLLANALLDFSN+RFVLLSE+CIP
Subjt:  RVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLDFSNQRFVLLSESCIP

Query:  LYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRS
        L+NFTT Y+YLI+S  +FV+SYDDPRKIGRGRYN++M+P + + DWRKGSQWFE +R  A+++ SD+ YYP+FR++C  PCYTDEHYIPTL+NI++P+++
Subjt:  LYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRS

Query:  SNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQDEGLRDSTHRVSLTAWGIYGQPHMA
        SNRSITWVDWSKNGPHPG+F R+ V++E LN+I F                  + + F  +S      I TSLL                      P  A
Subjt:  SNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQDEGLRDSTHRVSLTAWGIYGQPHMA

Query:  ESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQLNIQLSPPAPPPAPVMG--------------LREFWSPERVAHEMSDQELLWRA
             +  +     T                                     PPA  P P++               L+E+  P    H+M D+ELLWRA
Subjt:  ESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQLNIQLSPPAPPPAPVMG--------------LREFWSPERVAHEMSDQELLWRA

Query:  SVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVHSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQR
        S+VPRI ++P     KVAF+FL+RG LPLA LWE FF G++GLYSIYVHS PSFNGT P TSVF+GR IPSK VEWG+ +M++AERRLLANALLDFSNQR
Subjt:  SVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVHSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQR

Query:  FVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLAT
        FVLLSE+CIP+FNF+T+Y YL GS + FVE +DL G +GR RY   MKP I   QWRKGSQWFEMDR  A +V++D+ YFP+F++HC+ +CI+DEHYL T
Subjt:  FVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLAT

Query:  VTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFF
        + SI+F  RNSNR+LTW DWSK G HPA F    VT+ LL+++R GS C+YNG  + IC LFARKFL ++L  LL LAP VM F
Subjt:  VTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFF

KAF4395376.1 hypothetical protein G4B88_010840 [Cannabis sativa]1.9e-22953.39Show/hide
Query:  LMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHA
        L H+M+D+E+FWRASM+P + EFPY R+PKVAFMFL KG +PLAPLWE FFKGHQ L+SIY+HT P+Y      P SSVF+  RIPSQ V+WG+ +MI A
Subjt:  LMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHA

Query:  ERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFR
        ERRLLANALLDFSN+RF+LLSE+CIP++NFTTIY +LINS  SF+  +DDPR+ GRGRYN +M+P ++++DWRKGSQWFE  R  A++I SD  YYPVF 
Subjt:  ERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFR

Query:  DHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLL
        +HC PPCY DEHY+ TL+N V P  ++N SITWVDWS+ G HP  F +R VS   LN+IR GFNCTYN      S+CFLFARKF P++LQPLL I  +L+
Subjt:  DHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLL

Query:  QDEGLRDSTHRVSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGI---------TFNFYVTRFS-----SAFQLNIQLSPP------
          +      H    ++  I           +L +  Q+H   ++++ ++FG GL  GI         +FN   T+FS       +Q   QLSPP      
Subjt:  QDEGLRDSTHRVSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGI---------TFNFYVTRFS-----SAFQLNIQLSPP------

Query:  --------------------APPPAPVMGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYV
                             P  +  +GL ++ +P  V H+M+D+ELLWRAS+ P+I ++P +   KVAFMFL+RGP+ +AP W++FF G++GLYSIYV
Subjt:  --------------------APPPAPVMGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYV

Query:  HSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMK
        HS+PS+NG+ P  S F+GR IPSKEV WG+ SM++AERRLLANALLD SNQRFVLLSE CIP+F+F TVYNYL+ + +  V ++D PG +GR RY  +M 
Subjt:  HSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMK

Query:  PTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIR--SG
        P I+  QWRKGSQWFEM R  A EVV+DQ YFP+F+K+C  +C +DEHYL T  SI+F   NSNR+LTW DWSK GPHPA F   +VTV LL  +R  + 
Subjt:  PTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIR--SG

Query:  STCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFF
        + C+YN N + +C LFARKFL +A+DRL++  P +M F
Subjt:  STCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFF

KVH97644.1 Glycosyl transferase, family 14 [Cynara cardunculus var. scolymus]4.6e-22353.99Show/hide
Query:  FIDPNVEPPLMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVE
        F D      L H+M+D+E+  +A  VP + E P     K+AFMFL +G LPL P WE FF+GH+ LFSIY+HT P +  ++  P SSVF+ R+IPSQ V+
Subjt:  FIDPNVEPPLMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVE

Query:  WGRPSMIHAERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIAS
        WG+P+MI AE+RLLANALLDFSNQRF+LLSE+CIPL+NFTTIY+YL N+  SF+SS+DDPRKIGRGRYN++M P +TL DWRKGSQWFEADR  A++I S
Subjt:  WGRPSMIHAERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIAS

Query:  DRTYYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQP
        DR YY VF++HC PPCY DEHY+PTL+N V PD  +N ++TW DWS    HP  F R+ +++E LNRIR+  NCTYN A+   S+CFLFARKF P++L+P
Subjt:  DRTYYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQP

Query:  LLKIWTSLLQDEGLRDSTHRVSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGI---------TFNFYVTRFS----------SAFQ
        LLK   SL  +                   Q H +  +   F    LHL  ++S  L F  GL  GI         +FN   T+FS          SAF 
Subjt:  LLKIWTSLLQDEGLRDSTHRVSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGI---------TFNFYVTRFS----------SAFQ

Query:  LNIQLSPPAPPPAPV-MGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVH-SDPSFNGTH
         ++   P +  P+P+ +GL  + +P  + H+M+D+ELLWRAS+VP++T +P     KVAFMFL+RGP+ L+PLW+RFF GN+GLY+IYVH S+ S NGT 
Subjt:  LNIQLSPPAPPPAPV-MGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVH-SDPSFNGTH

Query:  PTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRK
        P  SVF+GR IPSK+VEWG+ +M++AERRLLANALLDFSNQRF+LLS+ CIP+FNF+TVY+YL  S   FVES+DL G +GR RY P M PT+   +WRK
Subjt:  PTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRK

Query:  GSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTCDYNGNKSRI
        GSQWFEMDR  ATEV++D  YF +F  +C   C +DEHYL T  + RF   N+NRTLT+ DW+K GPHP  +   +VT   LE++RS ++C+YNG  ++I
Subjt:  GSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTCDYNGNKSRI

Query:  CHLFARKFLETALDRLLELAPPVMFF
        CHLFARKF   ALDRLL +AP +M F
Subjt:  CHLFARKFLETALDRLLELAPPVMFF

RXH77290.1 hypothetical protein DVH24_023564 [Malus domestica]3.2e-23751.86Show/hide
Query:  ISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPP-----PVDCKTNCFIDPNVEPPLMHTMNDD-EVF
        ++V+  +P SH    FH +F +IG S G+ +S   K+F  FT    + S L  PP    Q  P PP     P+       +  + E  L+H M DD E+F
Subjt:  ISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPP-----PVDCKTNCFIDPNVEPPLMHTMNDD-EVF

Query:  WRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLD
        WRAS VP I +FPY RVP++AFMFL KG +PLAPLWEMFF GH+ L++IY+H HPSY    S P +SVF+GRRIPS+AVEWG P+MI  ERRLLA+ALLD
Subjt:  WRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLD

Query:  FSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDE
        F+N+RFVLLSESCIPL+NFTTIY+YL+N+  S + SYDDPRK+GRGRYN RM+P + + DWRKGSQWFE  R  A++I SD  YYP+F++HC+PPCY DE
Subjt:  FSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDE

Query:  HYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQDEGLRDSTHR
        HYIPTL+NI+ P+ +SNRS+TWVDWSK+GPHPGRFGR  VS E+  R                     FA                       L + TH 
Subjt:  HYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQDEGLRDSTHR

Query:  VSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQL-------------NIQLSPPAPP---------------
                  Q H   S  K F   Q+HL  +VS FL+F  GLALG + + Y+  F   FQL             N+  S P PP               
Subjt:  VSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQL-------------NIQLSPPAPP---------------

Query:  ----------------------PAPVMGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVH
                              P P +GL+E+     + HEM D ELLWRAS+VP     P K T KVAFMFL+RG L +APLWE FF G++GLYSIYVH
Subjt:  ----------------------PAPVMGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVH

Query:  SDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKP
        ++P FN T P  SVFYGR +PSK V WGQP+M+QAERRLLANALLDFSNQRFVLLSE+CIP+FNF  VYNYL+ S Q FVE++DLPG +GR RYR  M+P
Subjt:  SDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKP

Query:  TITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTC
         I   QWRKGSQWFE+DR  AT +V+D+KYFPLF ++C+P C SDEHYL T  SI+F  +NSNRTLTW DWS+ GPHP+ F   +VT+  L+++R GS C
Subjt:  TITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTC

Query:  DYNGNKSRICHLFARKFLETALDRLLELAPPVMFF
        +YNG  + IC LFARKFL  +LDRLL  AP +M F
Subjt:  DYNGNKSRICHLFARKFLETALDRLLELAPPVMFF

XP_022137518.1 uncharacterized protein LOC111008944 [Momordica charantia]4.9e-233100Show/hide
Query:  MKMGNEQNKKHISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPPVDCKTNCFIDPNVEPPLMHTMN
        MKMGNEQNKKHISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPPVDCKTNCFIDPNVEPPLMHTMN
Subjt:  MKMGNEQNKKHISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPPVDCKTNCFIDPNVEPPLMHTMN

Query:  DDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLA
        DDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLA
Subjt:  DDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLA

Query:  NALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPP
        NALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPP
Subjt:  NALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPP

Query:  CYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQ
        CYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQ
Subjt:  CYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQ

TrEMBL top hitse value%identityAlignment
A0A498I2A4 Adaptin_N domain-containing protein1.6e-23751.86Show/hide
Query:  ISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPP-----PVDCKTNCFIDPNVEPPLMHTMNDD-EVF
        ++V+  +P SH    FH +F +IG S G+ +S   K+F  FT    + S L  PP    Q  P PP     P+       +  + E  L+H M DD E+F
Subjt:  ISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPP-----PVDCKTNCFIDPNVEPPLMHTMNDD-EVF

Query:  WRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLD
        WRAS VP I +FPY RVP++AFMFL KG +PLAPLWEMFF GH+ L++IY+H HPSY    S P +SVF+GRRIPS+AVEWG P+MI  ERRLLA+ALLD
Subjt:  WRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLD

Query:  FSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDE
        F+N+RFVLLSESCIPL+NFTTIY+YL+N+  S + SYDDPRK+GRGRYN RM+P + + DWRKGSQWFE  R  A++I SD  YYP+F++HC+PPCY DE
Subjt:  FSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDE

Query:  HYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQDEGLRDSTHR
        HYIPTL+NI+ P+ +SNRS+TWVDWSK+GPHPGRFGR  VS E+  R                     FA                       L + TH 
Subjt:  HYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQDEGLRDSTHR

Query:  VSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQL-------------NIQLSPPAPP---------------
                  Q H   S  K F   Q+HL  +VS FL+F  GLALG + + Y+  F   FQL             N+  S P PP               
Subjt:  VSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQL-------------NIQLSPPAPP---------------

Query:  ----------------------PAPVMGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVH
                              P P +GL+E+     + HEM D ELLWRAS+VP     P K T KVAFMFL+RG L +APLWE FF G++GLYSIYVH
Subjt:  ----------------------PAPVMGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVH

Query:  SDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKP
        ++P FN T P  SVFYGR +PSK V WGQP+M+QAERRLLANALLDFSNQRFVLLSE+CIP+FNF  VYNYL+ S Q FVE++DLPG +GR RYR  M+P
Subjt:  SDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKP

Query:  TITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTC
         I   QWRKGSQWFE+DR  AT +V+D+KYFPLF ++C+P C SDEHYL T  SI+F  +NSNRTLTW DWS+ GPHP+ F   +VT+  L+++R GS C
Subjt:  TITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTC

Query:  DYNGNKSRICHLFARKFLETALDRLLELAPPVMFF
        +YNG  + IC LFARKFL  +LDRLL  AP +M F
Subjt:  DYNGNKSRICHLFARKFLETALDRLLELAPPVMFF

A0A5J5BJE6 Uncharacterized protein3.9e-22853.32Show/hide
Query:  IGFSFGIAISLNIKSFSSFTFHLRNFSLLTQP---PPPPLQLQPP----PPPVDCKTNCFIDPN-----VEPP-LMHTMNDDEVFWRASMVPLIKEFPYE
        +G S GI  SL ++SFS   F+L+  + L  P    PPPL L+PP    PPP        + PN     +E   LMH M+DDE+FWRA+MVP I+EFPY 
Subjt:  IGFSFGIAISLNIKSFSSFTFHLRNFSLLTQP---PPPPLQLQPP----PPPVDCKTNCFIDPN-----VEPP-LMHTMNDDEVFWRASMVPLIKEFPYE

Query:  RVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLDFSNQRFVLLSESCIP
        RVPKVAFMFL KG + LAPLWE FF GH+ L+SIY+H  P YNV   +P  SVFHGRRIPS+ VEWG+ SMI AERRLLANALLDFSN+RFVLLSE+CIP
Subjt:  RVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLDFSNQRFVLLSESCIP

Query:  LYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRS
        L+NFTT Y+YLI+S  +FV+SYDDPRKIGRGRYN++M+P + + DWRKGSQWFE +R  A+++ SD+ YYP+FR++C  PCYTDEHYIPTL+NI++P+++
Subjt:  LYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRS

Query:  SNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQDEGLRDSTHRVSLTAWGIYGQPHMA
        SNRSITWVDWSKNGPHPG+F R+ V++E LN+I F                  + + F  +S      I TSLL                      P  A
Subjt:  SNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQDEGLRDSTHRVSLTAWGIYGQPHMA

Query:  ESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQLNIQLSPPAPPPAPVMG--------------LREFWSPERVAHEMSDQELLWRA
             +  +     T                                     PPA  P P++               L+E+  P    H+M D+ELLWRA
Subjt:  ESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQLNIQLSPPAPPPAPVMG--------------LREFWSPERVAHEMSDQELLWRA

Query:  SVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVHSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQR
        S+VPRI ++P     KVAF+FL+RG LPLA LWE FF G++GLYSIYVHS PSFNGT P TSVF+GR IPSK VEWG+ +M++AERRLLANALLDFSNQR
Subjt:  SVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVHSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQR

Query:  FVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLAT
        FVLLSE+CIP+FNF+T+Y YL GS + FVE +DL G +GR RY   MKP I   QWRKGSQWFEMDR  A +V++D+ YFP+F++HC+ +CI+DEHYL T
Subjt:  FVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLAT

Query:  VTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFF
        + SI+F  RNSNR+LTW DWSK G HPA F    VT+ LL+++R GS C+YNG  + IC LFARKFL ++L  LL LAP VM F
Subjt:  VTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFF

A0A6J1CAJ7 uncharacterized protein LOC1110089442.4e-233100Show/hide
Query:  MKMGNEQNKKHISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPPVDCKTNCFIDPNVEPPLMHTMN
        MKMGNEQNKKHISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPPVDCKTNCFIDPNVEPPLMHTMN
Subjt:  MKMGNEQNKKHISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPPVDCKTNCFIDPNVEPPLMHTMN

Query:  DDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLA
        DDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLA
Subjt:  DDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLA

Query:  NALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPP
        NALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPP
Subjt:  NALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPP

Query:  CYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQ
        CYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQ
Subjt:  CYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQ

A0A6N2N115 Lactamase_B domain-containing protein (Fragment)1.7e-22355.56Show/hide
Query:  LMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHA
        L H+MND E+ WRASMVP I E+PY R PKVAFMFL +G+LPLAPLWEMFFKGH+ L+SIY+H  P +  ++  P S VF  R+IPS+  EWGR +MI A
Subjt:  LMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHA

Query:  ERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFR
        ERRLLANALLDFSN+RFVLLSE+CIP++NF+TIYNYL NS  SF+ S+DDPR  GRGRYN++M PAVTL+DWRKGSQWFE  R  A+++ SD  Y+PVFR
Subjt:  ERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFR

Query:  DHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLL
        DHC PPCY DEHY PTL   + P+ +SNRSITWVDWS  G HP RF R+ VS   LN+IR GF CTYN                          I +   
Subjt:  DHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLL

Query:  QDEGLRDSTHRVSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQLN-IQLSPPAPPPA-----PVMGLREFW
        Q++ L                    + +  KLF+A QL L  V+S F +FG GLA G+  + Y++  S +  ++    S   P PA     P +GL+E+ 
Subjt:  QDEGLRDSTHRVSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQLN-IQLSPPAPPPA-----PVMGLREFW

Query:  SPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVHSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMM
            V H+M ++ELLWRAS+ P I +FP     K+AFMFL++GP+ +APLWE+FF G+ GLYSIYVHS PS+N + P + VF+GR IPSK+V WG  +M+
Subjt:  SPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVHSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMM

Query:  QAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRKGSQWFEMDRGTATEVVADQKYFPL
        +AERRLLANALLD +NQRFVLLSE+CIP+FNF+TVY YL+ S +  VES+ L G +G  RY P M+P I   QWRKGSQWFE+DR  A EVV+D+KYFPL
Subjt:  QAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRKGSQWFEMDRGTATEVVADQKYFPL

Query:  FEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTCDYNGNKSRICHLFARKFLETALDRLLELAPPVM
        F+KHC   C +DEHYL T  +++   RNSNRTLTW DWS+ GPHPA F    VT   LERIRSGS C YNGN +  C LFARKF   ALDRLL  AP +M
Subjt:  FEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTCDYNGNKSRICHLFARKFLETALDRLLELAPPVM

Query:  FF
         F
Subjt:  FF

A0A7J6HJE3 Lactamase_B domain-containing protein9.2e-23053.39Show/hide
Query:  LMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHA
        L H+M+D+E+FWRASM+P + EFPY R+PKVAFMFL KG +PLAPLWE FFKGHQ L+SIY+HT P+Y      P SSVF+  RIPSQ V+WG+ +MI A
Subjt:  LMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHA

Query:  ERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFR
        ERRLLANALLDFSN+RF+LLSE+CIP++NFTTIY +LINS  SF+  +DDPR+ GRGRYN +M+P ++++DWRKGSQWFE  R  A++I SD  YYPVF 
Subjt:  ERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFR

Query:  DHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLL
        +HC PPCY DEHY+ TL+N V P  ++N SITWVDWS+ G HP  F +R VS   LN+IR GFNCTYN      S+CFLFARKF P++LQPLL I  +L+
Subjt:  DHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLL

Query:  QDEGLRDSTHRVSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGI---------TFNFYVTRFS-----SAFQLNIQLSPP------
          +      H    ++  I           +L +  Q+H   ++++ ++FG GL  GI         +FN   T+FS       +Q   QLSPP      
Subjt:  QDEGLRDSTHRVSLTAWGIYGQPHMAESTAKLFHAVQLHLTTVVSHFLVFGIGLALGI---------TFNFYVTRFS-----SAFQLNIQLSPP------

Query:  --------------------APPPAPVMGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYV
                             P  +  +GL ++ +P  V H+M+D+ELLWRAS+ P+I ++P +   KVAFMFL+RGP+ +AP W++FF G++GLYSIYV
Subjt:  --------------------APPPAPVMGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYV

Query:  HSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMK
        HS+PS+NG+ P  S F+GR IPSKEV WG+ SM++AERRLLANALLD SNQRFVLLSE CIP+F+F TVYNYL+ + +  V ++D PG +GR RY  +M 
Subjt:  HSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMK

Query:  PTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIR--SG
        P I+  QWRKGSQWFEM R  A EVV+DQ YFP+F+K+C  +C +DEHYL T  SI+F   NSNR+LTW DWSK GPHPA F   +VTV LL  +R  + 
Subjt:  PTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIR--SG

Query:  STCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFF
        + C+YN N + +C LFARKFL +A+DRL++  P +M F
Subjt:  STCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFF

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC101.0e-3935.47Show/hide
Query:  KVAFMFLVKGALPLAPLWEMFFKG-HQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRI-PSQAVEWGRPSMIHAERRLLANALLDFSNQRFVLLSESCIPL
        ++AF+F+ +  LPL  +W+ FF+G  +  FSI+VH+ P + ++ +   S  F+ R++  S  V+WG  SMI AER LLA+AL D  N+RFV +S+SC+PL
Subjt:  KVAFMFLVKGALPLAPLWEMFFKG-HQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRI-PSQAVEWGRPSMIHAERRLLANALLDFSNQRFVLLSESCIPL

Query:  YNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHC----------------------AP
        YNF   Y+Y+++S  SFV S+ D +    GRYN RM P + + +WRKGSQW    R  A  +  D    P F+ HC                      A 
Subjt:  YNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHC----------------------AP

Query:  PCYTDEHYIPTLL-NIVSPDRSSNRSITWVDW--------SKNGPHPGRFGRRQVSVELLNRIRFGFNCTYN--------DANDTVSLCFLFARKF
         C  DEHY+ TLL      +  + RS+T   W         + G HP  +     +  L+  I+   N  Y          +N   + CFLFARKF
Subjt:  PCYTDEHYIPTLL-NIVSPDRSSNRSITWVDW--------SKNGPHPGRFGRRQVSVELLNRIRFGFNCTYN--------DANDTVSLCFLFARKF

Arabidopsis top hitse value%identityAlignment
AT1G10280.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein4.7e-10148.23Show/hide
Query:  GNEQNKKHISVIPNSPSSHLSQIFHFLFL-LIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPP---------VDCKTN---------
        G E+ +KHI ++       L+Q   FL + + G   G+A S +I  +  F    R FS  T      LQ  P   P          DC  N         
Subjt:  GNEQNKKHISVIPNSPSSHLSQIFHFLFL-LIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPP---------VDCKTN---------

Query:  ----------CF-IDPNVEPP-LMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSS
                  C+ ID  V P  L H M DDE+FWRASMVP+ +E+PY+RVPKVAFMFL +G LP+ PLWE FFKG++   S+YVHT P Y+++ S    S
Subjt:  ----------CF-IDPNVEPP-LMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSS

Query:  VFHGRRIPSQAVEWGRPSMIHAERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQW
         F+ R+IPSQ VEWG P +  AE+RLLANALLDFSN+RFVLLSESC+P+YNF+T+Y YLINS YSFV SYD+P + GRGRY+R+M P + L  WRKGSQW
Subjt:  VFHGRRIPSQAVEWGRPSMIHAERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQW

Query:  FEADRAAALDIASDRTYYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFG-FNCTYNDANDTVSLC
        FE +R  A+ I SD  YY +F+  C P CY DEHYIPT LN+     ++NRS+TWVDWS  GPHP  +    ++   L  IR    +C YN+  +  SLC
Subjt:  FEADRAAALDIASDRTYYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFG-FNCTYNDANDTVSLC

Query:  FLFARKFMPDSLQPLLKIWTSLL
        FLFARKF P +L PL+ + +++L
Subjt:  FLFARKFMPDSLQPLLKIWTSLL

AT1G68390.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein5.0e-11151.67Show/hide
Query:  TAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQLNIQ-------------LSPPAPPPAPVM-----GLREFWS-PERVAHEMSDQELL
        T KL +A   H   ++S+ L+   G+ +GI  +  +  FSS   L+IQ               PP PPP+P       GL+ F   PE++ H+M D+ELL
Subjt:  TAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQLNIQ-------------LSPPAPPPAPVM-----GLREFWS-PERVAHEMSDQELL

Query:  WRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVHSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFS
        WRAS+ P+I  +P   T KVAFMF+++G LPLA LWERFF G++GL++IYVHS PS+N + P  SVF GR IPSK V+WG  +M++AE+RLLANALLD S
Subjt:  WRASVVPRITKFPVKTTAKVAFMFLSRGPLPLAPLWERFFHGNQGLYSIYVHSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFS

Query:  NQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHY
        N+RFVLLSE+CIP+FNFTTVY+YL+ S Q  VES+D  G +GR RY P M+P +    WRKGSQW E+DR  A E+++D+ Y+PLF  +C   C +DEHY
Subjt:  NQRFVLLSETCIPVFNFTTVYNYLVGSAQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHY

Query:  LATVTSIR--FGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFF
        + T+ +I+     RNSNRTLTW DWSK GPHP  F    VT   +E +RSG  C YNG ++ IC+LFARKFL TALDRLL L+  V+ F
Subjt:  LATVTSIR--FGGRNSNRTLTWADWSKQGPHPAGFESGNVTVGLLERIRSGSTCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFF

AT3G21310.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein8.3e-9855.56Show/hide
Query:  VEPPL--MHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGR
        ++PPL   H+MND E+ WRASM P I ++P++RVPK+AFMFL KG LP APLWE FFKGH+  +SIYVHT P+Y   S  P SSVF+ R+IPSQ V WG 
Subjt:  VEPPL--MHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGR

Query:  PSMIHAERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRT
         SM  AERRLLANALLD SN+ FVLLSE+CIPL  F  +Y Y+  S YSF+ S D+    GRGRY+  M P V+L +WRKGSQWFE +RA A+DI  D  
Subjt:  PSMIHAERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRT

Query:  YYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLK
        YY  F++ C PPCY DEHY PT+L+I  PD  +NR++TW DWS+ G HP  FG+  ++ + + ++  G  C YND    V  C+LFARKF P +L+PLLK
Subjt:  YYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLK

Query:  IWTSLL
        +   +L
Subjt:  IWTSLL

AT5G11730.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein9.2e-10558.5Show/hide
Query:  VEPP--LMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGR
        ++PP  LMH M+D+E+ WRAS  P  KE+P++RVPKVAFMFL KG LPLA LWE F KGH+ L+S+Y+H HPS+  ++  P SSVFH R+IPSQ  EWGR
Subjt:  VEPP--LMHTMNDDEVFWRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGR

Query:  PSMIHAERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRT
         SM  AE+RLLANALLD SN+ FVL+SESCIPLYNFTTIY+YL  S++SF+ ++DDP   GRGRYN  M P V L  WRKGSQWFE +R  A  I  D  
Subjt:  PSMIHAERRLLANALLDFSNQRFVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRT

Query:  YYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLK
        YYP F++ C P CY DEHY PT+L I  P   +NRS+TWVDWS+ GPHP  FGR  ++     +I  G NC+YN  N   S+C+LFARKF P +L+PLL 
Subjt:  YYPVFRDHCAPPCYTDEHYIPTLLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLK

Query:  IWTSLL
        I   +L
Subjt:  IWTSLL

AT5G25970.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein7.5e-9949.48Show/hide
Query:  VIPNSPSSHLSQIFHF-LFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPPVDCKTNCFIDPNVEP--PLMHTMNDDEVFWRASM
        ++  S  + L++ F + L LL+GF     + L   S S+  ++  N S++T        +     P   K N  +D  ++P   LMH M+D+E+ W AS 
Subjt:  VIPNSPSSHLSQIFHF-LFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPPVDCKTNCFIDPNVEP--PLMHTMNDDEVFWRASM

Query:  VPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLDFSNQR
        +P  KE+P+ RVPK+AFMFL  G LPLAPLWE   KGH+ L+S+Y+H+  S   S+  P SSVF+ R IPSQ  EWGR +M  AERRLLANALLD SN+ 
Subjt:  VPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLDFSNQR

Query:  FVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDEHYIPT
        FVLLSESCIPL+NFTTIY Y+  SE+SF+ S+DDP   GRGRY+  M P V +  WRKGSQWFE +R  A+ I  D  YYP F++ C P CY DEHY PT
Subjt:  FVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDEHYIPT

Query:  LLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLL
        +L I  P   +NRS+TWVDWS+ G HP  FG + ++ E   RI  G NCTYN      S+C+LFARKF P +L+PL++I   LL
Subjt:  LLNIVSPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGGGCAATGAGCAGAACAAAAAGCATATTTCTGTGATCCCCAATTCCCCATCATCTCACTTGTCCCAAATCTTCCATTTTCTCTTCTTACTCATTGGC
TTCTCCTTCGGAATCGCCATCAGTCTGAACATCAAGAGCTTCTCATCATTCACTTTCCACCTCCGTAACTTCTCCCTCCTCACACAACCGCCACCTCCACCACTT
CAACTTCAACCGCCGCCGCCGCCGGTTGACTGCAAAACAAATTGCTTTATCGACCCGAATGTAGAGCCACCTCTGATGCACACAATGAATGATGATGAAGTGTTC
TGGAGAGCTTCCATGGTTCCGCTGATCAAAGAGTTCCCATATGAACGTGTGCCAAAAGTTGCATTCATGTTTTTGGTGAAGGGGGCGTTGCCTCTGGCTCCTCTG
TGGGAGATGTTCTTCAAAGGCCATCAACCGCTGTTTTCCATTTATGTTCACACTCATCCCTCCTATAATGTTTCTTCTTCGCTGCCTCTGAGCTCTGTCTTCCAC
GGAAGACGGATCCCAAGCCAGGCAGTGGAATGGGGAAGGCCGTCGATGATCCACGCGGAGCGCCGCCTTCTGGCAAACGCCCTTCTCGATTTCTCCAACCAAAGA
TTCGTTTTACTCTCCGAAAGCTGCATCCCTCTCTACAATTTCACCACCATCTACAATTACCTCATCAACTCCGAATACTCCTTCGTCTCTTCCTACGACGATCCC
AGAAAAATCGGCCGCGGCCGTTACAACCGCCGTATGTTCCCGGCCGTCACCCTCGCCGACTGGCGGAAGGGCTCCCAGTGGTTCGAGGCCGACCGGGCGGCGGCG
CTGGACATCGCCTCCGACAGAACCTATTACCCAGTTTTCCGGGACCACTGCGCGCCGCCGTGTTACACGGACGAGCACTACATTCCCACGCTCCTCAATATTGTT
TCGCCGGACCGGAGTTCGAACCGGAGCATTACCTGGGTAGATTGGTCCAAGAATGGCCCGCATCCCGGCAGATTCGGGAGGCGCCAGGTTTCGGTCGAGTTGTTG
AACCGGATCCGGTTTGGTTTTAACTGTACTTATAATGATGCGAACGACACCGTTTCTCTCTGCTTTCTGTTCGCCCGGAAGTTTATGCCGGATTCTCTCCAGCCT
CTGCTCAAGATTTGGACCTCTCTGTTGCAAGATGAGGGTCTAAGGGACTCGACCCATAGGGTCAGCTTGACTGCTTGGGGCATCTATGGTCAGCCGCATATGGCG
GAGTCCACGGCGAAGCTCTTCCACGCCGTTCAACTCCACCTCACCACTGTTGTATCTCACTTTCTCGTCTTCGGCATAGGCTTAGCCCTAGGGATCACCTTTAAT
TTTTACGTCACGCGTTTCTCCTCCGCCTTCCAGCTCAACATCCAGCTGTCGCCACCAGCTCCTCCGCCGGCGCCGGTGATGGGGTTGAGGGAATTTTGGAGCCCG
GAGCGCGTGGCCCACGAGATGAGCGACCAGGAGTTGCTGTGGAGGGCTTCGGTGGTTCCCCGGATCACAAAATTTCCGGTGAAGACGACGGCGAAGGTGGCGTTT
ATGTTTCTGAGCAGAGGGCCGCTGCCGTTGGCTCCTCTTTGGGAGCGATTCTTCCATGGAAATCAGGGATTGTACTCCATTTATGTCCATTCTGATCCTTCCTTC
AATGGAACTCACCCCACAACTTCCGTCTTTTATGGCCGTACAATTCCTAGTAAGGAGGTGGAATGGGGGCAGCCGAGCATGATGCAAGCGGAGCGGCGGCTGTTG
GCGAACGCGCTGCTCGACTTCTCCAACCAACGGTTCGTCCTCCTCTCCGAGACCTGCATCCCCGTCTTCAATTTCACCACAGTCTACAACTACCTTGTGGGCTCG
GCCCAAATCTTCGTCGAGTCCTTCGACTTACCAGGCCGGTTGGGCCGCCGACGCTACAGGCCCAACATGAAGCCCACAATCACAGAGGCCCAGTGGCGGAAAGGG
TCCCAGTGGTTCGAAATGGACCGAGGGACGGCGACGGAGGTGGTGGCGGACCAGAAGTACTTCCCACTCTTCGAAAAGCACTGCCGCCCCAACTGCATATCGGAC
GAGCACTACTTGGCGACGGTAACGAGCATCCGGTTCGGGGGGAGGAACTCGAACCGGACGCTGACTTGGGCCGACTGGTCGAAACAGGGCCCTCATCCGGCCGGG
TTCGAGAGCGGCAACGTCACGGTGGGGCTTTTGGAAAGGATCCGGAGCGGAAGCACGTGCGATTACAATGGAAATAAAAGTAGAATTTGCCATCTGTTTGCGAGG
AAGTTCTTGGAAACTGCTTTGGATCGGCTGCTGGAACTTGCCCCTCCAGTCATGTTCTTTGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATGGGCAATGAGCAGAACAAAAAGCATATTTCTGTGATCCCCAATTCCCCATCATCTCACTTGTCCCAAATCTTCCATTTTCTCTTCTTACTCATTGGC
TTCTCCTTCGGAATCGCCATCAGTCTGAACATCAAGAGCTTCTCATCATTCACTTTCCACCTCCGTAACTTCTCCCTCCTCACACAACCGCCACCTCCACCACTT
CAACTTCAACCGCCGCCGCCGCCGGTTGACTGCAAAACAAATTGCTTTATCGACCCGAATGTAGAGCCACCTCTGATGCACACAATGAATGATGATGAAGTGTTC
TGGAGAGCTTCCATGGTTCCGCTGATCAAAGAGTTCCCATATGAACGTGTGCCAAAAGTTGCATTCATGTTTTTGGTGAAGGGGGCGTTGCCTCTGGCTCCTCTG
TGGGAGATGTTCTTCAAAGGCCATCAACCGCTGTTTTCCATTTATGTTCACACTCATCCCTCCTATAATGTTTCTTCTTCGCTGCCTCTGAGCTCTGTCTTCCAC
GGAAGACGGATCCCAAGCCAGGCAGTGGAATGGGGAAGGCCGTCGATGATCCACGCGGAGCGCCGCCTTCTGGCAAACGCCCTTCTCGATTTCTCCAACCAAAGA
TTCGTTTTACTCTCCGAAAGCTGCATCCCTCTCTACAATTTCACCACCATCTACAATTACCTCATCAACTCCGAATACTCCTTCGTCTCTTCCTACGACGATCCC
AGAAAAATCGGCCGCGGCCGTTACAACCGCCGTATGTTCCCGGCCGTCACCCTCGCCGACTGGCGGAAGGGCTCCCAGTGGTTCGAGGCCGACCGGGCGGCGGCG
CTGGACATCGCCTCCGACAGAACCTATTACCCAGTTTTCCGGGACCACTGCGCGCCGCCGTGTTACACGGACGAGCACTACATTCCCACGCTCCTCAATATTGTT
TCGCCGGACCGGAGTTCGAACCGGAGCATTACCTGGGTAGATTGGTCCAAGAATGGCCCGCATCCCGGCAGATTCGGGAGGCGCCAGGTTTCGGTCGAGTTGTTG
AACCGGATCCGGTTTGGTTTTAACTGTACTTATAATGATGCGAACGACACCGTTTCTCTCTGCTTTCTGTTCGCCCGGAAGTTTATGCCGGATTCTCTCCAGCCT
CTGCTCAAGATTTGGACCTCTCTGTTGCAAGATGAGGGTCTAAGGGACTCGACCCATAGGGTCAGCTTGACTGCTTGGGGCATCTATGGTCAGCCGCATATGGCG
GAGTCCACGGCGAAGCTCTTCCACGCCGTTCAACTCCACCTCACCACTGTTGTATCTCACTTTCTCGTCTTCGGCATAGGCTTAGCCCTAGGGATCACCTTTAAT
TTTTACGTCACGCGTTTCTCCTCCGCCTTCCAGCTCAACATCCAGCTGTCGCCACCAGCTCCTCCGCCGGCGCCGGTGATGGGGTTGAGGGAATTTTGGAGCCCG
GAGCGCGTGGCCCACGAGATGAGCGACCAGGAGTTGCTGTGGAGGGCTTCGGTGGTTCCCCGGATCACAAAATTTCCGGTGAAGACGACGGCGAAGGTGGCGTTT
ATGTTTCTGAGCAGAGGGCCGCTGCCGTTGGCTCCTCTTTGGGAGCGATTCTTCCATGGAAATCAGGGATTGTACTCCATTTATGTCCATTCTGATCCTTCCTTC
AATGGAACTCACCCCACAACTTCCGTCTTTTATGGCCGTACAATTCCTAGTAAGGAGGTGGAATGGGGGCAGCCGAGCATGATGCAAGCGGAGCGGCGGCTGTTG
GCGAACGCGCTGCTCGACTTCTCCAACCAACGGTTCGTCCTCCTCTCCGAGACCTGCATCCCCGTCTTCAATTTCACCACAGTCTACAACTACCTTGTGGGCTCG
GCCCAAATCTTCGTCGAGTCCTTCGACTTACCAGGCCGGTTGGGCCGCCGACGCTACAGGCCCAACATGAAGCCCACAATCACAGAGGCCCAGTGGCGGAAAGGG
TCCCAGTGGTTCGAAATGGACCGAGGGACGGCGACGGAGGTGGTGGCGGACCAGAAGTACTTCCCACTCTTCGAAAAGCACTGCCGCCCCAACTGCATATCGGAC
GAGCACTACTTGGCGACGGTAACGAGCATCCGGTTCGGGGGGAGGAACTCGAACCGGACGCTGACTTGGGCCGACTGGTCGAAACAGGGCCCTCATCCGGCCGGG
TTCGAGAGCGGCAACGTCACGGTGGGGCTTTTGGAAAGGATCCGGAGCGGAAGCACGTGCGATTACAATGGAAATAAAAGTAGAATTTGCCATCTGTTTGCGAGG
AAGTTCTTGGAAACTGCTTTGGATCGGCTGCTGGAACTTGCCCCTCCAGTCATGTTCTTTGCCTGA
Protein sequenceShow/hide protein sequence
MKMGNEQNKKHISVIPNSPSSHLSQIFHFLFLLIGFSFGIAISLNIKSFSSFTFHLRNFSLLTQPPPPPLQLQPPPPPVDCKTNCFIDPNVEPPLMHTMNDDEVF
WRASMVPLIKEFPYERVPKVAFMFLVKGALPLAPLWEMFFKGHQPLFSIYVHTHPSYNVSSSLPLSSVFHGRRIPSQAVEWGRPSMIHAERRLLANALLDFSNQR
FVLLSESCIPLYNFTTIYNYLINSEYSFVSSYDDPRKIGRGRYNRRMFPAVTLADWRKGSQWFEADRAAALDIASDRTYYPVFRDHCAPPCYTDEHYIPTLLNIV
SPDRSSNRSITWVDWSKNGPHPGRFGRRQVSVELLNRIRFGFNCTYNDANDTVSLCFLFARKFMPDSLQPLLKIWTSLLQDEGLRDSTHRVSLTAWGIYGQPHMA
ESTAKLFHAVQLHLTTVVSHFLVFGIGLALGITFNFYVTRFSSAFQLNIQLSPPAPPPAPVMGLREFWSPERVAHEMSDQELLWRASVVPRITKFPVKTTAKVAF
MFLSRGPLPLAPLWERFFHGNQGLYSIYVHSDPSFNGTHPTTSVFYGRTIPSKEVEWGQPSMMQAERRLLANALLDFSNQRFVLLSETCIPVFNFTTVYNYLVGS
AQIFVESFDLPGRLGRRRYRPNMKPTITEAQWRKGSQWFEMDRGTATEVVADQKYFPLFEKHCRPNCISDEHYLATVTSIRFGGRNSNRTLTWADWSKQGPHPAG
FESGNVTVGLLERIRSGSTCDYNGNKSRICHLFARKFLETALDRLLELAPPVMFFA