; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001048 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001048
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionheparan-alpha-glucosaminide N-acetyltransferase
Genome locationscaffold8:39184051..39193162
RNA-Seq ExpressionSpg001048
SyntenySpg001048
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0015267 - channel activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000425 - Major intrinsic protein
IPR022357 - Major intrinsic protein, conserved site
IPR023271 - Aquaporin-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_020425883.1 uncharacterized protein LOC18767296 [Prunus persica]1.2e-26765.83Show/hide
Query:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA
        MAD  PLL +  +       K PR+ SLDVFRGL VF                     +MM VDYGGS  PIIAHSPWNGLHLADFVMP+FLFIAGVSLA
Subjt:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA

Query:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM
        LVYK+V ++  AT  A  + L LFLLGVLLQGGYFHG+TSLT+GVD+ERIRW GILQRI++GY++AALCEIWL+R+  +E    KS+ WHWC IF L ++
Subjt:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM

Query:  YMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGLL
        Y GLLYGLYVPDW+FK  + +S  P + S++Y+V CSVRGDLGPACNSAGMIDR++LG+ HLY KPVYRNLKECN+S+ G+VPE++PSWCHAPF+PEG+L
Subjt:  YMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGLL

Query:  SSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLSTL
        SSLTA V CIIGLQYGHILA+++DHK R + W   S+ I  LG FL F+GIPVNKSLYT+SYMLITSASAGI FCALY+L        +   G   ++++
Subjt:  SSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLSTL

Query:  DMYSGVDGEACFECIGFNRSARGLSDCIRNHSANT---KQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIG
          + G+   + F  +  N +   +     +   N     ++G      +     KDYVDPPPAPL+   ELK WSFYRA+IAEF+ATLLFLYIT++TVIG
Subjt:  DMYSGVDGEACFECIGFNRSARGLSDCIRNHSANT---KQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIG

Query:  NNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISK
        N  Q+GPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAVAY+ AQ LGAI GV LVK+F +H YNS GGGAN VA G SK
Subjt:  NNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISK

Query:  GTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQ
        GTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS G AV+FNN + W   WIFWVGPFVGALAAA YHQ
Subjt:  GTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQ

Query:  YVLRAAAVKALGSFRSN
        Y+LRAAA+KALGSFRSN
Subjt:  YVLRAAAVKALGSFRSN

XP_021810408.1 uncharacterized protein LOC110753759 [Prunus avium]4.7e-26765.79Show/hide
Query:  MADSRPLLK--DQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVS
        MAD  PLL   D    +P    K PR+ SLDVFRGL VF                     +MM VDYGGS  PIIAHSPWNGLHLADFVMP+FLFIAGVS
Subjt:  MADSRPLLK--DQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVS

Query:  LALVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLL
        LALVYK+V ++  AT  A  + L LFLLGVLLQGGYFHG+TSLT+GVD+ERIRW GILQRI++GY++AALCEIWL+R+   E    +S+ WHWC IF L 
Subjt:  LALVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLL

Query:  SMYMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEG
        ++Y GLLYGLYVPDW+FK S+ +S LP N S++Y+V CSVRGDLGPACNSAGMIDR +LG+ HLY KPVYRNLKECN+S+ G VPE++P WCHAPF+PEG
Subjt:  SMYMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEG

Query:  LLSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLS
        +LSSLTA V CIIGLQYGHILA+++DH  R + W   S+ I  LG FL F+GIPVNKSLYT+SYMLITSASAGI FCALY+L        +   G   ++
Subjt:  LLSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLS

Query:  TLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANT---KQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATV
        ++  + G+   + F  +  N +   +     +   N     ++G      +     KDYVDPPPAPL+   ELK WSFYRA+IAEF+ATLLFLYIT++TV
Subjt:  TLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANT---KQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATV

Query:  IGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGI
        IG+ AQ+GPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAVAY+ AQ LGAIAGV LVK+  +H YNS GGGAN V+ G 
Subjt:  IGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGI

Query:  SKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVY
        SKGTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++N+ +VW  HWIFWVGPF+GALAAA Y
Subjt:  SKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVY

Query:  HQYVLRAAAVKALGSFRSN
        HQY+LRAAA+KALGSFRSN
Subjt:  HQYVLRAAAVKALGSFRSN

XP_022146281.1 heparan-alpha-glucosaminide N-acetyltransferase [Momordica charantia]0.0e+0079.92Show/hide
Query:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA
        MADSRPLLK+Q  ELPES+ KAPRVVSLDVFRGLSVF                     +MM VDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA
Subjt:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA

Query:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM
        LVYKEVKSKVAATR AA RGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLT R REE  +TKSF WHWC IFFLLS+
Subjt:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM

Query:  YMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGLL
        YMGL YGLYVPDW+FKIS+T+SSLPPNGSY YMVNCSVRGD GPACNSAGMIDRYVLGI HLYTKPVYRN+KECNISS GQVPET+PSWCHAPFEPEGLL
Subjt:  YMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGLL

Query:  SSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLSTL
        SSLTATVACIIGLQYGHIL+NVQ+HKSRT  WFSLSLK L LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYIL        +   G  RL+ +
Subjt:  SSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLSTL

Query:  DMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNA
          + G      +  +  N    G+         N       +   E  QQ KDYVDPPPAPL+GFSELKLWSF+RAVIAEFMATLLFLYIT+ATVIGNNA
Subjt:  DMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNA

Query:  QTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTA
        + GPC+GVG LGIAWAFG MIFVLVYCTAGISGGHINPAVTFGLLLAR+VS++RAVAYMAAQCLGAI GVALVKSFM+HAYNSHGGGANLVA G S+GTA
Subjt:  QTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTA

Query:  LGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVL
        LGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIG AVFVVHLATIPITGTGINPARSLGAAV++NN R W  HWIFWVGPF+GA AAA+YHQYVL
Subjt:  LGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVL

Query:  RAAAVKALGSFRSNRSV
        RAA VKALGSFRSNRSV
Subjt:  RAAAVKALGSFRSNRSV

XP_024454514.1 heparan-alpha-glucosaminide N-acetyltransferase [Populus trichocarpa]2.3e-26666.48Show/hide
Query:  MAD--SRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVS
        MAD   +PLL    EE P +++K PRV SLDVFRGL VF                     +MM VDYGG+ +PIIAHSPWNGLHLADFVMP+FLF AGVS
Subjt:  MAD--SRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVS

Query:  LALVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLL
        LALVYK V +++ ATR A  R + LFLLGV+LQGGYFHGI  LTYGVDM+RIRWLGILQRIS+GY+ AALCEIWL+ R R +    KS+ WHW   F L 
Subjt:  LALVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLL

Query:  SMYMGLLYGLYVPDWDFKIS-STSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPE
        ++Y GLLYGLYVPDW F++S +TSS  P N SY+YMV CSVRGDLGPACNSAGMIDRYVLGI HLY KPVYRNLKECN+S++GQVPE+APSWCHAPF+PE
Subjt:  SMYMGLLYGLYVPDWDFKIS-STSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPE

Query:  GLLSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYIL----ARHRIKNGLPPCG
        G+LSS+TA VACIIGLQYGH LA++QDHK R   W   SL +L +G+ L  VG PVNKSLYT  YMLIT ASAGI + A+Y+L        +   L   G
Subjt:  GLLSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYIL----ARHRIKNGLPPCG

Query:  RPRLSTLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGCGKKITEPLQ-QAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITI
        +  LS   + +        +   +      L   I      +K+    K+++E  Q   KDYVDPPPAPL+   ELKLWSFYRA+IAEF+ATLLFLY+T+
Subjt:  RPRLSTLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGCGKKITEPLQ-QAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITI

Query:  ATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVA
        ATVIG+ +   PC GVGLLGIAWAFG MIF+LVYCTAGISGGHINPAVTFGL LAR+VSLIRAVAYM AQCLGAI GV LVK+FM+  YNS GGGAN VA
Subjt:  ATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVA

Query:  AGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAA
         G + GTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV+FNN + W  HWIFWVGPFVGALAA
Subjt:  AGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAA

Query:  AVYHQYVLRAAAVKALGSFRSN
        A YHQY+LRAAA+KALGSFRSN
Subjt:  AVYHQYVLRAAAVKALGSFRSN

XP_034226800.1 uncharacterized protein LOC117636424 [Prunus dulcis]7.2e-26865.79Show/hide
Query:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA
        MAD  PLL +  +       K PR+ SLDVFRGL VF                     +MM VDYGGS  PIIAHSPWNGLHLADFVMP+FLFIAGVSLA
Subjt:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA

Query:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM
        LVYK+V ++  AT  A  + L LFLLGVLLQGGYFHG+TSLT+GVD+ERIRW GILQRI++GY++AALCEIWL+R+  +E    +S+ WHWC IF L ++
Subjt:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM

Query:  YMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGLL
        Y GLLYGLYVPDW+FK  + +S  P N S++Y+V CSVRGDLGPACNSAGMIDR +LG+ HLY KPVYRNLKECN+S+ G+VPE++PSWCHAPF+PEG+L
Subjt:  YMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGLL

Query:  SSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLSTL
        SSLTA V CIIGLQYGHILA+++DHK R + W   S+ I  LG FL F+GIPVNKSLYT+SYMLITSASAGI FCALY+L        +   G   ++++
Subjt:  SSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLSTL

Query:  DMYSGVDGEACFECIGFNRSARGLSDCIRNHSANT---KQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIG
          + G+   + F  +  N +   +     +   N     ++G      +     KDYVDPPPAPL+   ELK WSFYRA+IAEF+ATLLFLYIT++TVIG
Subjt:  DMYSGVDGEACFECIGFNRSARGLSDCIRNHSANT---KQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIG

Query:  NNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISK
        N  Q+GPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAVAY+ AQ LGAI GV LVK+F +H YNS GGGAN+VA+G SK
Subjt:  NNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISK

Query:  GTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQ
        GTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS G AV+FNN + W   WIFWVGPFVGALAAA YHQ
Subjt:  GTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQ

Query:  YVLRAAAVKALGSFRSNRS
        Y+LRAAA+KALGSFRSN S
Subjt:  YVLRAAAVKALGSFRSNRS

TrEMBL top hitse value%identityAlignment
A0A1S2Y9C8 uncharacterized protein LOC1014942832.7e-26063.97Show/hide
Query:  MADSRP-LLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSL
        MAD +P LL    + + +  +   RV S+DVFRGLSVF                     +M+ VDYG S  PII+H+PWNGLHLADFVMP+FLF+AG+SL
Subjt:  MADSRP-LLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSL

Query:  ALVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLS
        ALVYK    +  AT  A  R L LF+LG+LLQGGY HG TSLTYGVD+ RIR  G+LQRIS+GY++AALCEIWL     +E    KS+ WHW     LL+
Subjt:  ALVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLS

Query:  MYMGLLYGLYVPDWDFKISSTSSSLPP-NGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEG
        +Y GLLYGLYVPDW F +S+++SSLPP +G  IY VNCSVRGD GPACNSAGMIDRY+LG+ HLY KPVYRNLKECN+SS+GQ+ +++PSWCHAPF+PEG
Subjt:  MYMGLLYGLYVPDWDFKISSTSSSLPP-NGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEG

Query:  LLSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLS
        +LSS+TA V+CIIGLQYGHILA+++DHK R   W S S+   ALG+FL  +GIPVNKSLYTVSYML++SA++G+ F ALY+L        +   G  RL+
Subjt:  LLSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLS

Query:  TLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGN
        ++  + G    + F  +  N +   +         N  +      +       KDYVDPPPAPL+ F+E+KLWSFYRA+IAEF+ATLLFLY+T+ATVIG+
Subjt:  TLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGN

Query:  NAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKG
          QTGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAV YM AQCLGAI GV LVK+ M+  YN+ GGGAN VA+G SKG
Subjt:  NAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKG

Query:  TALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQY
        +ALGAE+IGTF+LVYTVFSATDPKRNARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV+FNN +VW  HWIFWVGPFVGALAAA YHQY
Subjt:  TALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQY

Query:  VLRAAAVKALGSFRSN
        +LRAAA+KALGSFRSN
Subjt:  VLRAAAVKALGSFRSN

A0A1S3U4S1 uncharacterized protein LOC1067619126.1e-26564.95Show/hide
Query:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA
        MAD +PLL +  E       +  R+ SLDVFRGLSVF                     +M+ VDYGGS  PIIAH+PWNG+HLAD VMP+FLFIAG+SLA
Subjt:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA

Query:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM
        LVYK    +  AT  A  R + LF+LGV+LQGGYFHGITSLT+GVD+ERIRWLGILQRIS+GY++AALCEIWL     ++    K++ WH   +  LL++
Subjt:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM

Query:  YMGLLYGLYVPDWDFKISSTSSSLPP-NGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGL
        Y GLLYGLYVPDW F +S+++SSLPP  G  IY VNCSVRGDLGPACNSAGMIDRY+LG+ HLY KPVYRNL+ECN+ S GQV + +PSWCHAPF+PEG+
Subjt:  YMGLLYGLYVPDWDFKISSTSSSLPP-NGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGL

Query:  LSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYIL----ARHRIKNGLPPCGRP
        LSS+TA V+CIIGLQYGH+LA++QDHK R   W   SL +LALG+FL   GIP+NK+LYTVSYML+TSA++G+ F ALY L     + R+   L   G+ 
Subjt:  LSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYIL----ARHRIKNGLPPCGRP

Query:  RLSTLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATV
         LS   + S        +   +++    + + I    A +      K++TE  QQ KDY DPPPAPL+  +E+KLWSFYRA+IAEF+ATLLFLY+T+ATV
Subjt:  RLSTLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATV

Query:  IGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGI
        IG+  QTGPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAV Y+ AQCLGAI+GV LVK+FM+H YNS GGGAN V++G 
Subjt:  IGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGI

Query:  SKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVY
        SKGTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN +VW  HWIFWVGPFVGALAAA Y
Subjt:  SKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVY

Query:  HQYVLRAAAVKALGSFRSN
        HQY+LRAAA+KALGSFRSN
Subjt:  HQYVLRAAAVKALGSFRSN

A0A2I4GCB0 uncharacterized protein LOC1090066085.4e-26164.76Show/hide
Query:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA
        M D  P+   +Q+  P + R+  R+ SLDVFRGL VF                     +MM VDY GS  PII+HSPW+G+HLADFVMP+FLF+AGVS A
Subjt:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA

Query:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM
        LVYK+V ++V AT  A  R + LFLLGV+LQGGY HG+ SLT+GVD+E+IRW+GILQRIS+GY++AALCEIWLT R   E    K + WHWC  F L ++
Subjt:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM

Query:  YMGLLYGLYVPDWDFKISSTSSSLPP-NGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGL
        Y+ L YGLYVP+W F + S+ SSLPP N S +YMV C+VRGDLGPACNSAGMIDRY+LGI HLY KPVYRNLKEC IS++GQV +++P WCHAPF+PEG+
Subjt:  YMGLLYGLYVPDWDFKISSTSSSLPP-NGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGL

Query:  LSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLST
        LSSLTA V CI GLQYGH+LAN+QDHK R   W   S  +  LG+ L F GIP+NKSLYTVSYMLITSASAGI FCALY+L        +   G  RL+ 
Subjt:  LSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLST

Query:  LDMYSGVDGEACFECIGFNR---SARGLSDCIRNHSANTKQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVI
        +  + G    + F  I  N    + +G    + +++    ++G         Q  KDYVDPPPAPL+  +ELKLWSFYRA+IAEF+ATLLFLYIT+ATVI
Subjt:  LDMYSGVDGEACFECIGFNR---SARGLSDCIRNHSANTKQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVI

Query:  GNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGIS
        G   ++GPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAV YM AQCLGAI GV LVK+FM+H YN+ GGG N VA G S
Subjt:  GNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGIS

Query:  KGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYH
        KGTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV+++N +VW  HWIFWVGPFVGALAAA YH
Subjt:  KGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYH

Query:  QYVLRAAAVKALGSFRSN
        QY+LRAAA+KALGSFRSN
Subjt:  QYVLRAAAVKALGSFRSN

A0A6J1CYU3 heparan-alpha-glucosaminide N-acetyltransferase0.0e+0079.92Show/hide
Query:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA
        MADSRPLLK+Q  ELPES+ KAPRVVSLDVFRGLSVF                     +MM VDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA
Subjt:  MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLA

Query:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM
        LVYKEVKSKVAATR AA RGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLT R REE  +TKSF WHWC IFFLLS+
Subjt:  LVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSM

Query:  YMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGLL
        YMGL YGLYVPDW+FKIS+T+SSLPPNGSY YMVNCSVRGD GPACNSAGMIDRYVLGI HLYTKPVYRN+KECNISS GQVPET+PSWCHAPFEPEGLL
Subjt:  YMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGLL

Query:  SSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLSTL
        SSLTATVACIIGLQYGHIL+NVQ+HKSRT  WFSLSLK L LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYIL        +   G  RL+ +
Subjt:  SSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLSTL

Query:  DMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNA
          + G      +  +  N    G+         N       +   E  QQ KDYVDPPPAPL+GFSELKLWSF+RAVIAEFMATLLFLYIT+ATVIGNNA
Subjt:  DMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNA

Query:  QTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTA
        + GPC+GVG LGIAWAFG MIFVLVYCTAGISGGHINPAVTFGLLLAR+VS++RAVAYMAAQCLGAI GVALVKSFM+HAYNSHGGGANLVA G S+GTA
Subjt:  QTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTA

Query:  LGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVL
        LGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIG AVFVVHLATIPITGTGINPARSLGAAV++NN R W  HWIFWVGPF+GA AAA+YHQYVL
Subjt:  LGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVL

Query:  RAAAVKALGSFRSNRSV
        RAA VKALGSFRSNRSV
Subjt:  RAAAVKALGSFRSNRSV

A0A6P5S9C4 uncharacterized protein LOC1107537592.3e-26765.79Show/hide
Query:  MADSRPLLK--DQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVS
        MAD  PLL   D    +P    K PR+ SLDVFRGL VF                     +MM VDYGGS  PIIAHSPWNGLHLADFVMP+FLFIAGVS
Subjt:  MADSRPLLK--DQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVS

Query:  LALVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLL
        LALVYK+V ++  AT  A  + L LFLLGVLLQGGYFHG+TSLT+GVD+ERIRW GILQRI++GY++AALCEIWL+R+   E    +S+ WHWC IF L 
Subjt:  LALVYKEVKSKVAATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLL

Query:  SMYMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEG
        ++Y GLLYGLYVPDW+FK S+ +S LP N S++Y+V CSVRGDLGPACNSAGMIDR +LG+ HLY KPVYRNLKECN+S+ G VPE++P WCHAPF+PEG
Subjt:  SMYMGLLYGLYVPDWDFKISSTSSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEG

Query:  LLSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLS
        +LSSLTA V CIIGLQYGHILA+++DH  R + W   S+ I  LG FL F+GIPVNKSLYT+SYMLITSASAGI FCALY+L        +   G   ++
Subjt:  LLSSLTATVACIIGLQYGHILANVQDHKSRTSGWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLS

Query:  TLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANT---KQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATV
        ++  + G+   + F  +  N +   +     +   N     ++G      +     KDYVDPPPAPL+   ELK WSFYRA+IAEF+ATLLFLYIT++TV
Subjt:  TLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANT---KQKGCGKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATV

Query:  IGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGI
        IG+ AQ+GPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAVAY+ AQ LGAIAGV LVK+  +H YNS GGGAN V+ G 
Subjt:  IGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGI

Query:  SKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVY
        SKGTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++N+ +VW  HWIFWVGPF+GALAAA Y
Subjt:  SKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVY

Query:  HQYVLRAAAVKALGSFRSN
        HQY+LRAAA+KALGSFRSN
Subjt:  HQYVLRAAAVKALGSFRSN

SwissProt top hitse value%identityAlignment
P42767 Aquaporin PIP-type2.9e-11879.32Show/hide
Query:  QQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLAR
        Q  KDYVDPPPAP     ELKLWSF+RA IAEF+ATLLFLYIT+ATVIG   +T PC+ VGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR
Subjt:  QQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLAR

Query:  RVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLA
        +VSL+RA+ YM AQC GAI GV LVK+FM+  YN  GGGAN VA G +KGTA GAE+IGTF+LVYTVFSATDPKR+ARDSHVP+LAPLPIG AVF+VHLA
Subjt:  RVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLA

Query:  TIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN
        TIPITGTGINPARS GAAV++N +RVW  HWIFWVGPFVGALAAA YHQYVLRAAA+KALGSFRSN
Subjt:  TIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN

P93004 Aquaporin PIP2-78.9e-12080.61Show/hide
Query:  KDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVS
        KDYVDPPPAPL+   ELK WSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VS
Subjt:  KDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVS

Query:  LIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIP
        L+RA+ YM AQCLGAI GV  VK+FM+  YN+ GGGAN VA G SKGTALGAEIIGTF+LVYTVFSATDPKR+ARDSH+PVLAPLPIG AVF+VHLATIP
Subjt:  LIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIP

Query:  ITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN
        ITGTGINPARS GAAV++NN + W   WIFWVGPF+GALAAA YHQY+LRA+A+KALGSFRSN
Subjt:  ITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN

Q7XLR1 Probable aquaporin PIP2-62.4e-11778.28Show/hide
Query:  KDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTG--PCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARR
        KDY DPPPAPL    EL+LWSFYRA+IAEF+ATLLFLYIT+ATVIG   Q+    C GVG LGIAWAFG MIF+LVYCTAGISGGHINPAVTFGLLLAR+
Subjt:  KDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTG--PCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARR

Query:  VSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLAT
        VS+IRAV Y+ AQCLG I GV +VK  M+H YN++GGGAN+VA+G S GTALGAEIIGTF+LVYTVFSATDPKRNARDSHVPVLAPLPIG AVF+VHLAT
Subjt:  VSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLAT

Query:  IPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSNRS
        IPITGTGINPARS+GAAV++N ++ W  HWIFW GPF+GALAAA YHQY+LRAAA+KALGSFRSN S
Subjt:  IPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSNRS

Q8H5N9 Probable aquaporin PIP2-12.8e-11376.1Show/hide
Query:  AKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQT--------GPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTF
        AKDY DPPPAPL+  +EL  WS YRAVIAEF+ATLLFLYIT+ATVIG   QT          C GVG+LGIAWAFG MIF+LVYCTAGISGGHINPAVTF
Subjt:  AKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQT--------GPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTF

Query:  GLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAV
        GL LAR+VSL+RA+ Y+ AQCLGAI GV LVK+F    +N +GGGAN +AAG SKGT L AEIIGTF+LVYTVFSATDPKRNARDSHVPVLAPLPIG AV
Subjt:  GLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAV

Query:  FVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN
        F+VHLATIPITGTGINPARS+GAAV+FNN + W +HWIFWVGPFVGA  AA YHQY+LRA A+KALGSFRSN
Subjt:  FVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN

Q9ZVX8 Probable aquaporin PIP2-89.5e-12280.22Show/hide
Query:  KKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVT
        K+++E  +  KDYVDPPPAPL+  +ELKLWSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVT
Subjt:  KKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVT

Query:  FGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLA
        FGL LAR+VSL RAVAYM AQCLGAI GV LVK+FM   Y   GGGAN VA G S GTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG A
Subjt:  FGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLA

Query:  VFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN
        VF+VHLATIPITGTGINPARS GAAV++NN + W  HWIFWVGPFVGALAAA YHQY+LRAAA+KAL SFRSN
Subjt:  VFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN

Arabidopsis top hitse value%identityAlignment
AT2G16850.1 plasma membrane intrinsic protein 2;86.8e-12380.22Show/hide
Query:  KKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVT
        K+++E  +  KDYVDPPPAPL+  +ELKLWSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVT
Subjt:  KKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVT

Query:  FGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLA
        FGL LAR+VSL RAVAYM AQCLGAI GV LVK+FM   Y   GGGAN VA G S GTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG A
Subjt:  FGLLLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLA

Query:  VFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN
        VF+VHLATIPITGTGINPARS GAAV++NN + W  HWIFWVGPFVGALAAA YHQY+LRAAA+KAL SFRSN
Subjt:  VFVVHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN

AT3G54820.1 plasma membrane intrinsic protein 2;53.8e-11073.53Show/hide
Query:  KDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGP------CSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLL
        KDY DPPP PL   +EL  WSFYRA+IAEF+ATLLFLY+TI TVIG  +QT P      C+GVG+LGIAWAFG MIF+LVYCTAGISGGHINPAVTFGLL
Subjt:  KDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGP------CSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLL

Query:  LARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVV
        LAR+V+L+RAV YM AQCLGAI GVALVK+F    +  +GGGAN ++ G S GT + AEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+V
Subjt:  LARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVV

Query:  HLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSNRSV
        HLATIPITGTGINPARSLGAA+++N  + W HHWIFWVGPF GA  AA YHQ+VLRA A+KALGSFRS   V
Subjt:  HLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSNRSV

AT4G35100.1 plasma membrane intrinsic protein 36.3e-12180.61Show/hide
Query:  KDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVS
        KDYVDPPPAPL+   ELK WSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VS
Subjt:  KDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVS

Query:  LIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIP
        L+RA+ YM AQCLGAI GV  VK+FM+  YN+ GGGAN VA G SKGTALGAEIIGTF+LVYTVFSATDPKR+ARDSH+PVLAPLPIG AVF+VHLATIP
Subjt:  LIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIP

Query:  ITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN
        ITGTGINPARS GAAV++NN + W   WIFWVGPF+GALAAA YHQY+LRA+A+KALGSFRSN
Subjt:  ITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN

AT4G35100.2 plasma membrane intrinsic protein 36.3e-12180.61Show/hide
Query:  KDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVS
        KDYVDPPPAPL+   ELK WSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAWAFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VS
Subjt:  KDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVS

Query:  LIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIP
        L+RA+ YM AQCLGAI GV  VK+FM+  YN+ GGGAN VA G SKGTALGAEIIGTF+LVYTVFSATDPKR+ARDSH+PVLAPLPIG AVF+VHLATIP
Subjt:  LIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIP

Query:  ITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN
        ITGTGINPARS GAAV++NN + W   WIFWVGPF+GALAAA YHQY+LRA+A+KALGSFRSN
Subjt:  ITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSN

AT5G60660.1 plasma membrane intrinsic protein 2;45.2e-10772.49Show/hide
Query:  AKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTG------PCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGL
        A+DY DPPPAP     EL+ W  YRAVIAEF+ATLLFLY++I TVIG  AQT        C GVG+LGIAWAFG MIFVLVYCTAGISGGHINPAVT GL
Subjt:  AKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTG------PCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGL

Query:  LLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFV
         LAR+VSL+R V Y+ AQCLGAI G   VK+F    Y  +GGGAN +A G +KGT LGAEIIGTF+LVYTVFSATDPKRNARDSHVPVLAPLPIG AVF+
Subjt:  LLARRVSLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFV

Query:  VHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRS
        VHLATIPITGTGINPARS GAAV++NN + W   WIFWVGP +GA AAA YHQ++LRAAA+KALGSF S
Subjt:  VHLATIPITGTGINPARSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATTCTCGACCGCTGCTGAAGGATCAACAGGAGGAGTTGCCGGAGTCCACTCGCAAAGCGCCGCGCGTTGTCTCACTCGACGTCTTTCGCGGACTCAGCGTCTT
TGTTAGTTCTCTCTTTTCATCTCTGGATCTCGACTCACTAAACACACTGAATACTTCTCTTCATATGATGATGTTCGTGGACTACGGTGGCTCTTTTTTACCAATTATCG
CTCATTCGCCATGGAATGGACTTCATTTGGCTGATTTTGTGATGCCTTGGTTTCTATTTATTGCGGGAGTCTCGCTTGCACTTGTTTATAAAGAAGTAAAAAGTAAAGTG
GCCGCTACAAGGAATGCAGCATGCAGGGGTCTGTACCTTTTTCTCCTGGGTGTTCTTCTTCAAGGTGGTTACTTTCATGGAATAACATCTTTGACTTATGGTGTGGATAT
GGAAAGGATTAGGTGGCTTGGCATTTTGCAGAGAATATCTGTTGGATACTTGATTGCTGCACTATGTGAGATTTGGCTAACTCGTCGCATACGTGAAGAAGCTCAACATA
CTAAGAGTTTCAGCTGGCATTGGTGTACTATATTTTTTCTGTTGTCAATGTATATGGGACTGTTGTATGGTTTATATGTTCCAGATTGGGACTTCAAAATATCAAGCACG
AGCTCTTCACTTCCACCAAATGGAAGCTATATTTACATGGTGAATTGTTCTGTTCGAGGTGACCTGGGACCTGCATGTAATTCTGCTGGGATGATTGATCGTTATGTTCT
TGGTATTCACCATTTGTATACTAAACCTGTCTACAGAAATCTGAAGGAGTGTAACATTTCCTCCAGTGGTCAAGTTCCTGAGACTGCACCTTCGTGGTGTCATGCTCCGT
TTGAACCTGAAGGTCTTTTAAGCTCTTTAACAGCTACAGTGGCATGCATAATAGGACTTCAGTATGGTCACATTCTTGCCAATGTACAGGATCACAAAAGTCGCACCAGT
GGCTGGTTCTCACTCTCGCTTAAGATTTTGGCTCTCGGAATATTCCTTGTCTTTGTAGGTATCCCTGTAAATAAGTCCCTTTACACAGTCAGCTATATGTTGATTACTTC
AGCATCAGCAGGAATAATCTTTTGTGCTCTATACATCTTGGCGCGGCATCGTATAAAGAACGGCTTACCACCGTGTGGACGTCCACGGTTATCGACGCTTGACATGTATT
CTGGAGTGGATGGGGAAGCATGCTTTGAGTGCATTGGATTTAATCGTAGTGCTCGAGGTCTGTCCGATTGTATTCGAAATCACTCTGCAAATACTAAGCAAAAAGGCTGT
GGTAAGAAGATAACAGAGCCATTGCAACAGGCCAAGGACTATGTGGATCCGCCACCAGCTCCATTGATGGGCTTCTCCGAGCTGAAGCTATGGTCTTTTTACAGAGCTGT
AATTGCAGAGTTCATGGCCACTCTGCTTTTCCTGTACATCACCATCGCCACTGTCATTGGAAACAATGCACAAACAGGACCCTGTTCTGGCGTTGGCCTTCTCGGCATAG
CATGGGCCTTCGGCGCCATGATCTTCGTCCTCGTTTACTGCACTGCTGGTATTTCAGGTGGCCATATCAACCCCGCCGTCACGTTCGGGCTGCTGTTGGCCAGAAGGGTG
TCGCTGATCCGCGCGGTGGCTTACATGGCGGCCCAGTGTCTGGGAGCCATTGCCGGCGTTGCCTTGGTTAAATCTTTCATGAGGCATGCCTACAACAGCCACGGCGGCGG
AGCCAACCTCGTCGCCGCCGGCATCAGCAAAGGCACCGCTCTCGGTGCTGAGATCATCGGCACTTTTCTGCTGGTCTACACTGTGTTTTCAGCCACTGACCCTAAAAGAA
ATGCTCGCGACTCCCATGTTCCTGTTTTGGCTCCATTGCCCATTGGACTGGCCGTCTTTGTTGTGCACCTTGCCACCATTCCGATTACCGGCACCGGCATCAACCCTGCT
CGGAGCTTAGGCGCCGCGGTCGTATTCAACAACAGAAGAGTCTGGAGCCACCATTGGATTTTTTGGGTAGGGCCATTTGTTGGGGCGCTGGCAGCGGCGGTGTACCACCA
GTACGTTCTCAGAGCGGCGGCCGTCAAGGCACTTGGATCTTTCCGCAGCAACCGCTCCGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGATTCTCGACCGCTGCTGAAGGATCAACAGGAGGAGTTGCCGGAGTCCACTCGCAAAGCGCCGCGCGTTGTCTCACTCGACGTCTTTCGCGGACTCAGCGTCTT
TGTTAGTTCTCTCTTTTCATCTCTGGATCTCGACTCACTAAACACACTGAATACTTCTCTTCATATGATGATGTTCGTGGACTACGGTGGCTCTTTTTTACCAATTATCG
CTCATTCGCCATGGAATGGACTTCATTTGGCTGATTTTGTGATGCCTTGGTTTCTATTTATTGCGGGAGTCTCGCTTGCACTTGTTTATAAAGAAGTAAAAAGTAAAGTG
GCCGCTACAAGGAATGCAGCATGCAGGGGTCTGTACCTTTTTCTCCTGGGTGTTCTTCTTCAAGGTGGTTACTTTCATGGAATAACATCTTTGACTTATGGTGTGGATAT
GGAAAGGATTAGGTGGCTTGGCATTTTGCAGAGAATATCTGTTGGATACTTGATTGCTGCACTATGTGAGATTTGGCTAACTCGTCGCATACGTGAAGAAGCTCAACATA
CTAAGAGTTTCAGCTGGCATTGGTGTACTATATTTTTTCTGTTGTCAATGTATATGGGACTGTTGTATGGTTTATATGTTCCAGATTGGGACTTCAAAATATCAAGCACG
AGCTCTTCACTTCCACCAAATGGAAGCTATATTTACATGGTGAATTGTTCTGTTCGAGGTGACCTGGGACCTGCATGTAATTCTGCTGGGATGATTGATCGTTATGTTCT
TGGTATTCACCATTTGTATACTAAACCTGTCTACAGAAATCTGAAGGAGTGTAACATTTCCTCCAGTGGTCAAGTTCCTGAGACTGCACCTTCGTGGTGTCATGCTCCGT
TTGAACCTGAAGGTCTTTTAAGCTCTTTAACAGCTACAGTGGCATGCATAATAGGACTTCAGTATGGTCACATTCTTGCCAATGTACAGGATCACAAAAGTCGCACCAGT
GGCTGGTTCTCACTCTCGCTTAAGATTTTGGCTCTCGGAATATTCCTTGTCTTTGTAGGTATCCCTGTAAATAAGTCCCTTTACACAGTCAGCTATATGTTGATTACTTC
AGCATCAGCAGGAATAATCTTTTGTGCTCTATACATCTTGGCGCGGCATCGTATAAAGAACGGCTTACCACCGTGTGGACGTCCACGGTTATCGACGCTTGACATGTATT
CTGGAGTGGATGGGGAAGCATGCTTTGAGTGCATTGGATTTAATCGTAGTGCTCGAGGTCTGTCCGATTGTATTCGAAATCACTCTGCAAATACTAAGCAAAAAGGCTGT
GGTAAGAAGATAACAGAGCCATTGCAACAGGCCAAGGACTATGTGGATCCGCCACCAGCTCCATTGATGGGCTTCTCCGAGCTGAAGCTATGGTCTTTTTACAGAGCTGT
AATTGCAGAGTTCATGGCCACTCTGCTTTTCCTGTACATCACCATCGCCACTGTCATTGGAAACAATGCACAAACAGGACCCTGTTCTGGCGTTGGCCTTCTCGGCATAG
CATGGGCCTTCGGCGCCATGATCTTCGTCCTCGTTTACTGCACTGCTGGTATTTCAGGTGGCCATATCAACCCCGCCGTCACGTTCGGGCTGCTGTTGGCCAGAAGGGTG
TCGCTGATCCGCGCGGTGGCTTACATGGCGGCCCAGTGTCTGGGAGCCATTGCCGGCGTTGCCTTGGTTAAATCTTTCATGAGGCATGCCTACAACAGCCACGGCGGCGG
AGCCAACCTCGTCGCCGCCGGCATCAGCAAAGGCACCGCTCTCGGTGCTGAGATCATCGGCACTTTTCTGCTGGTCTACACTGTGTTTTCAGCCACTGACCCTAAAAGAA
ATGCTCGCGACTCCCATGTTCCTGTTTTGGCTCCATTGCCCATTGGACTGGCCGTCTTTGTTGTGCACCTTGCCACCATTCCGATTACCGGCACCGGCATCAACCCTGCT
CGGAGCTTAGGCGCCGCGGTCGTATTCAACAACAGAAGAGTCTGGAGCCACCATTGGATTTTTTGGGTAGGGCCATTTGTTGGGGCGCTGGCAGCGGCGGTGTACCACCA
GTACGTTCTCAGAGCGGCGGCCGTCAAGGCACTTGGATCTTTCCGCAGCAACCGCTCCGTTTGA
Protein sequenceShow/hide protein sequence
MADSRPLLKDQQEELPESTRKAPRVVSLDVFRGLSVFVSSLFSSLDLDSLNTLNTSLHMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSLALVYKEVKSKV
AATRNAACRGLYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLIAALCEIWLTRRIREEAQHTKSFSWHWCTIFFLLSMYMGLLYGLYVPDWDFKISST
SSSLPPNGSYIYMVNCSVRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQVPETAPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILANVQDHKSRTS
GWFSLSLKILALGIFLVFVGIPVNKSLYTVSYMLITSASAGIIFCALYILARHRIKNGLPPCGRPRLSTLDMYSGVDGEACFECIGFNRSARGLSDCIRNHSANTKQKGC
GKKITEPLQQAKDYVDPPPAPLMGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWAFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRV
SLIRAVAYMAAQCLGAIAGVALVKSFMRHAYNSHGGGANLVAAGISKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPA
RSLGAAVVFNNRRVWSHHWIFWVGPFVGALAAAVYHQYVLRAAAVKALGSFRSNRSV