; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012473 (gene) of Snake gourd v1 genome

Gene IDTan0012473
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionheparan-alpha-glucosaminide N-acetyltransferase
Genome locationLG05:10082063..10091868
RNA-Seq ExpressionTan0012473
SyntenyTan0012473
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0015267 - channel activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000425 - Major intrinsic protein
IPR022357 - Major intrinsic protein, conserved site
IPR023271 - Aquaporin-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_020425883.1 uncharacterized protein LOC18767296 [Prunus persica]4.9e-26666.62Show/hide
Query:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY
        MAD  PLL           K PR+ SLDVFRGL VF+MM VDYGGS  PIIAHS WNGLHLADFVMP+FLFIAG SL LVYK+V ++  AT  A  +AL 
Subjt:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS
        LFLLGVLLQGGYFHG+TSLT+GVD+ERIRW GILQRI++GY++ ALCEIWL+R T +     KS+ WHW +IF L ++Y GL YGLYVPDW+FK    +S
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS

Query:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
          PS+ + VY+V CSVRGDLGPACNS GMIDR++LG+ HLY KPVYRNLKECN+S+ G+VPE+SPSWCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Subjt:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEA
        +DHK R N+W   S+ I  LG FL FIGIPVNKSLYT+SYMLITSASAGI FCALY                    LLI + G   ++++  + G+   +
Subjt:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEA

Query:  CFESV---------------DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGI
         F  V               D   N   + + +     KDYVDPPPAP +   ELK WSFYRA+IAEF+ATLLFLYIT++TVIGN  Q+GPC GVGLLGI
Subjt:  CFESV---------------DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGI

Query:  AWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY
        AW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAVAYI AQ LGAI GV LV++F +H YN+ GGGAN VA G+SKGTALGAEIIGTF+LVY
Subjt:  AWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY

Query:  TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS
        TVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS G AV+FNN + W   WIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFRS
Subjt:  TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS

Query:  N
        N
Subjt:  N

XP_021810408.1 uncharacterized protein LOC110753759 [Prunus avium]9.3e-26566.48Show/hide
Query:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY
        MAD  PLL           K PR+ SLDVFRGL VF+MM VDYGGS  PIIAHS WNGLHLADFVMP+FLFIAG SL LVYK+V ++  AT  A  +AL 
Subjt:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS
        LFLLGVLLQGGYFHG+TSLT+GVD+ERIRW GILQRI++GY++ ALCEIWL+R T       +S+ WHW +IF L ++Y GL YGLYVPDW+FK S  +S
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS

Query:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
         LPSN + VY+V CSVRGDLGPACNS GMIDR +LG+ HLY KPVYRNLKECN+S+ G VPE+SP WCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Subjt:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEA
        +DH  R N+W   S+ I  LG FL FIGIPVNKSLYT+SYMLITSASAGI FCALY                    LLI + G   ++++  + G+   +
Subjt:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEA

Query:  CFESV---------------DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGI
         F  V               D   N   + + +     KDYVDPPPAP +   ELK WSFYRA+IAEF+ATLLFLYIT++TVIG+ AQ+GPC GVGLLGI
Subjt:  CFESV---------------DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGI

Query:  AWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY
        AW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAVAYI AQ LGAIAGV LV++  +H YN+ GGGAN V+ G+SKGTALGAEIIGTF+LVY
Subjt:  AWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY

Query:  TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS
        TVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++N+ +VW  HWIFWVGPF+GA+AAA YHQY+LRAAA+KAL SFRS
Subjt:  TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS

Query:  N
        N
Subjt:  N

XP_022146281.1 heparan-alpha-glucosaminide N-acetyltransferase [Momordica charantia]0.0e+0080.92Show/hide
Query:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY
        MADS+PLLKNQ ELPESS KAPRVVSLDVFRGLSVF+MM VDYGGSFLPIIAHS WNGLHLADFVMPWFLFIAG SL LVYKEV+SKV ATR AA R LY
Subjt:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS
        LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLI ALCEIWLT  TRE   NTKSF WHW  IFFLLSLYMGLSYGLYVPDW+FKIS T+S
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS

Query:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
        SLP NG++ YMVNCSVRGD GPACNS GMIDRYVLGI HLYTKPVYRN+KECNISS GQVPETSPSWCHAPFEPEGLLSSLTA+VACIIGLQYGHIL+NV
Subjt:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT----RHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGV
        Q+HKSRT SWFSLSLK L LGIFLVFIGIPVNKSLYTVSYMLITSASAGI+FCALYIL       R+   L   ++    L I +     +  +    G+
Subjt:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT----RHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGV

Query:  DGEACFESVDRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVL
         G   F       N    ++++ RQQ KDYVDPPPAP +G SELKLWSF+RAVIAEFMATLLFLYIT+ATVIGNNA+ GPC+GVG LGIAW+FG MIFVL
Subjt:  DGEACFESVDRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVL

Query:  VYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRN
        VYCTAGISGGHINPAVTFGLLLAR+VS++RAVAY+AAQCLGAI GVALV+SFM+HAYN++GGGANLVA GFS+GTALGAEIIGTFLLVYTVFSATDPKRN
Subjt:  VYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRN

Query:  ARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSNRS
        ARDSHVPVLAPLPIG AVFVVHLATIPITGTGINPARSLGAAVM+NN R W  HWIFWVGPF+GA AAA+YHQYVLRAA VKAL SFRSNRS
Subjt:  ARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSNRS

XP_034226800.1 uncharacterized protein LOC117636424 [Prunus dulcis]2.9e-26666.57Show/hide
Query:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY
        MAD  PLL           K PR+ SLDVFRGL VF+MM VDYGGS  PIIAHS WNGLHLADFVMP+FLFIAG SL LVYK+V ++  AT  A  +AL 
Subjt:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS
        LFLLGVLLQGGYFHG+TSLT+GVD+ERIRW GILQRI++GY++ ALCEIWL+R T +     +S+ WHW +IF L ++Y GL YGLYVPDW+FK    +S
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS

Query:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
          PSN + VY+V CSVRGDLGPACNS GMIDR +LG+ HLY KPVYRNLKECN+S+ G+VPE+SPSWCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Subjt:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEA
        +DHK R N+W   S+ I  LG FL FIGIPVNKSLYT+SYMLITSASAGI FCALY                    LLI + G   ++++  + G+   +
Subjt:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEA

Query:  CFESV---------------DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGI
         F  V               D   N   + + +     KDYVDPPPAP +   ELK WSFYRA+IAEF+ATLLFLYIT++TVIGN  Q+GPC GVGLLGI
Subjt:  CFESV---------------DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGI

Query:  AWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY
        AW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAVAYI AQ LGAI GV LV++F +H YN+ GGGAN+VA+G+SKGTALGAEIIGTF+LVY
Subjt:  AWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY

Query:  TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS
        TVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS G AV+FNN + W   WIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFRS
Subjt:  TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS

Query:  NRS
        N S
Subjt:  NRS

XP_038694264.1 uncharacterized protein LOC119991845 [Tripterygium wilfordii]5.1e-26367.05Show/hide
Query:  MADSQPL--LKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRA
        MAD QPL  ++ Q +   +S K  RVVS+DVFRGL VF+MM VDYGG+  P+IAHS WNGL LA+FVMP+FLFIAG S  L +K+V +KV AT     RA
Subjt:  MADSQPL--LKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRA

Query:  LYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKIS-A
        + LFLLGVLLQGGYFHG  SLTYGVD+ +IRWLG+LQRIS+GY++ ALCEIWLT  T+   +  +S+ WHW + F + ++Y GL YGLYVPDW  K+S A
Subjt:  LYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKIS-A

Query:  TSSSLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHIL
        T  SLPSN  ++Y V CSVRGDLGPACNS GMIDR VLG+ HLY KPVYRNLKECNIS+ GQVP+TSP+WCHAPF+PEGLLSSLTA+V CIIGLQYGH+L
Subjt:  TSSSLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHIL

Query:  ANVQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGL------LISIGGCPRLSTLD
        A++QDHK R   W   S+ +L LG+FLVFIGIPVNKSLYT++YMLITSASAGI FC  Y+L        L  F LE  G+      ++       +S   
Subjt:  ANVQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGL------LISIGGCPRLSTLD

Query:  MYSGVDGEACFESVDRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGA
         Y  V       S D I           R Q KDYVDPPPAP + ++ELKLWSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAW+FG 
Subjt:  MYSGVDGEACFESVDRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGA

Query:  MIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSAT
        MIFVLVYCTAGISGGHINPAVT GL LAR+VSLIRA+AY+ AQCLGAI GV LV++FM+H YN+ GGGAN VA G+SKGTALGAEIIGTF+LVYTVFSAT
Subjt:  MIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSAT

Query:  DPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN
        DPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN + W  HWIFWVGPF GA+AAA YHQY+LRA A+KAL SFRSN
Subjt:  DPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN

TrEMBL top hitse value%identityAlignment
A0A1S2Y9C8 uncharacterized protein LOC1014942831.8e-25865.05Show/hide
Query:  MADSQPLL--KNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRA
        MAD QPLL   N Q + + ++   RV S+DVFRGLSVF+M+ VDYG S  PII+H+ WNGLHLADFVMP+FLF+AG SL LVYK    +  AT  A  RA
Subjt:  MADSQPLL--KNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRA

Query:  LYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISAT
        L LF+LG+LLQGGY HG TSLTYGVD+ RIR  G+LQRIS+GY++ ALCEIWL     +     KS+ WHW++   LL++Y GL YGLYVPDW F +S +
Subjt:  LYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISAT

Query:  SSSLPS-NGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHIL
        +SSLP  +G ++Y VNCSVRGD GPACNS GMIDRY+LG+ HLY KPVYRNLKECN+SS GQ+ ++SPSWCHAPF+PEG+LSS+TA+V+CIIGLQYGHIL
Subjt:  SSSLPS-NGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHIL

Query:  ANVQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT---RHRI---------KNSLPPFFLESEGLLISIGGCP
        A+++DHK R + W S S+   ALG+FL  IGIPVNKSLYTVSYML++SA++G+ F ALY+L     HR          K+SL  F L S  L +      
Subjt:  ANVQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT---RHRI---------KNSLPPFFLESEGLLISIGGCP

Query:  RLSTLDMYSGVDGEACFESVDRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGI
                  + G    +  + I        Q      KDYVDPPPAP +  +E+KLWSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGI
Subjt:  RLSTLDMYSGVDGEACFESVDRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGI

Query:  AWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY
        AWSFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAV Y+ AQCLGAI GV LV++ M+  YN  GGGAN VA+G+SKG+ALGAE+IGTF+LVY
Subjt:  AWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY

Query:  TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS
        TVFSATDPKRNARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV+FNN +VW  HWIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFRS
Subjt:  TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS

Query:  N
        N
Subjt:  N

A0A1S3U4S1 uncharacterized protein LOC1067619123.0e-26166.29Show/hide
Query:  MADSQPLLKNQQELPESSRKAP-RVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRAL
        MAD QPLL N  E  +     P R+ SLDVFRGLSVF+M+ VDYGGS  PIIAH+ WNG+HLAD VMP+FLFIAG SL LVYK    +  AT  A  RA+
Subjt:  MADSQPLLKNQQELPESSRKAP-RVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRAL

Query:  YLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATS
         LF+LGV+LQGGYFHGITSLT+GVD+ERIRWLGILQRIS+GY++ ALCEIWL     +     K++ WH +++  LL+LY GL YGLYVPDW F +SA++
Subjt:  YLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATS

Query:  SSLPS-NGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA
        SSLP   G  +Y VNCSVRGDLGPACNS GMIDRY+LG+ HLY KPVYRNL+ECN+S  GQV + SPSWCHAPF+PEG+LSS+TA+V+CIIGLQYGH+LA
Subjt:  SSLPS-NGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA

Query:  NVQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT----RHRI--------KNSLPPFFLESEGL-LISIGGCP
        ++QDHK R  +W   SL +LALG+FL   GIP+NK+LYTVSYML+TSA++G+ F ALY L     + R+        K+SL  F L S  + +I+I G  
Subjt:  NVQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT----RHRI--------KNSLPPFFLESEGL-LISIGGCP

Query:  RLSTLDMYSGVDGEACFESVDRIRNHPANIKQ--KGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLL
               +S  +       V R        K+  +  QQ KDY DPPPAP + L+E+KLWSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLL
Subjt:  RLSTLDMYSGVDGEACFESVDRIRNHPANIKQ--KGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLL

Query:  GIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLL
        GIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAV YI AQCLGAI+GV LV++FM+H YN+ GGGAN V++G+SKGTALGAEIIGTF+L
Subjt:  GIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLL

Query:  VYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSF
        VYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN +VW  HWIFWVGPFVGA+AAA YHQY+LRAAA+KAL SF
Subjt:  VYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSF

Query:  RSN
        RSN
Subjt:  RSN

A0A2I4GCB0 uncharacterized protein LOC1090066081.6e-25764.67Show/hide
Query:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY
        M D +P+   +Q+ P + R+  R+ SLDVFRGL VF+MM VDY GS  PII+HS W+G+HLADFVMP+FLF+AG S  LVYK+V ++V AT  A  RA+ 
Subjt:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS
        LFLLGV+LQGGY HG+ SLT+GVD+E+IRW+GILQRIS+GY++ ALCEIWLT  T       K + WHW I F L ++Y+ LSYGLYVP+W F + ++ S
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS

Query:  SL-PSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILAN
        SL PSN ++VYMV C+VRGDLGPACNS GMIDRY+LGI HLY KPVYRNLKEC IS+ GQV ++SP WCHAPF+PEG+LSSLTA+V CI GLQYGH+LAN
Subjt:  SL-PSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILAN

Query:  VQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGE
        +QDHK R  +W   S  +  LG+ L F GIP+NKSLYTVSYMLITSASAGI FCALY                    LL+ + G  RL+ +  + G    
Subjt:  VQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGE

Query:  ACFESVD-------------RIRNHPANIKQKG--RQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLG
        + F  +               + ++   + ++G   Q  KDYVDPPPAP + L+ELKLWSFYRA+IAEF+ATLLFLYIT+ATVIG   ++GPC GVGLLG
Subjt:  ACFESVD-------------RIRNHPANIKQKG--RQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLG

Query:  IAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLV
        IAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAV Y+ AQCLGAI GV LV++FM+H YN  GGG N VA G+SKGTALGAEIIGTF+LV
Subjt:  IAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLV

Query:  YTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFR
        YTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV+++N +VW  HWIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFR
Subjt:  YTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFR

Query:  SN
        SN
Subjt:  SN

A0A6J1CYU3 heparan-alpha-glucosaminide N-acetyltransferase0.0e+0080.92Show/hide
Query:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY
        MADS+PLLKNQ ELPESS KAPRVVSLDVFRGLSVF+MM VDYGGSFLPIIAHS WNGLHLADFVMPWFLFIAG SL LVYKEV+SKV ATR AA R LY
Subjt:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS
        LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLI ALCEIWLT  TRE   NTKSF WHW  IFFLLSLYMGLSYGLYVPDW+FKIS T+S
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS

Query:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
        SLP NG++ YMVNCSVRGD GPACNS GMIDRYVLGI HLYTKPVYRN+KECNISS GQVPETSPSWCHAPFEPEGLLSSLTA+VACIIGLQYGHIL+NV
Subjt:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT----RHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGV
        Q+HKSRT SWFSLSLK L LGIFLVFIGIPVNKSLYTVSYMLITSASAGI+FCALYIL       R+   L   ++    L I +     +  +    G+
Subjt:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT----RHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGV

Query:  DGEACFESVDRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVL
         G   F       N    ++++ RQQ KDYVDPPPAP +G SELKLWSF+RAVIAEFMATLLFLYIT+ATVIGNNA+ GPC+GVG LGIAW+FG MIFVL
Subjt:  DGEACFESVDRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVL

Query:  VYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRN
        VYCTAGISGGHINPAVTFGLLLAR+VS++RAVAY+AAQCLGAI GVALV+SFM+HAYN++GGGANLVA GFS+GTALGAEIIGTFLLVYTVFSATDPKRN
Subjt:  VYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRN

Query:  ARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSNRS
        ARDSHVPVLAPLPIG AVFVVHLATIPITGTGINPARSLGAAVM+NN R W  HWIFWVGPF+GA AAA+YHQYVLRAA VKAL SFRSNRS
Subjt:  ARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSNRS

A0A6P5S9C4 uncharacterized protein LOC1107537594.5e-26566.48Show/hide
Query:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY
        MAD  PLL           K PR+ SLDVFRGL VF+MM VDYGGS  PIIAHS WNGLHLADFVMP+FLFIAG SL LVYK+V ++  AT  A  +AL 
Subjt:  MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS
        LFLLGVLLQGGYFHG+TSLT+GVD+ERIRW GILQRI++GY++ ALCEIWL+R T       +S+ WHW +IF L ++Y GL YGLYVPDW+FK S  +S
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSS

Query:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
         LPSN + VY+V CSVRGDLGPACNS GMIDR +LG+ HLY KPVYRNLKECN+S+ G VPE+SP WCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Subjt:  SLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEA
        +DH  R N+W   S+ I  LG FL FIGIPVNKSLYT+SYMLITSASAGI FCALY                    LLI + G   ++++  + G+   +
Subjt:  QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEA

Query:  CFESV---------------DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGI
         F  V               D   N   + + +     KDYVDPPPAP +   ELK WSFYRA+IAEF+ATLLFLYIT++TVIG+ AQ+GPC GVGLLGI
Subjt:  CFESV---------------DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGI

Query:  AWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY
        AW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLIRAVAYI AQ LGAIAGV LV++  +H YN+ GGGAN V+ G+SKGTALGAEIIGTF+LVY
Subjt:  AWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY

Query:  TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS
        TVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++N+ +VW  HWIFWVGPF+GA+AAA YHQY+LRAAA+KAL SFRS
Subjt:  TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS

Query:  N
        N
Subjt:  N

SwissProt top hitse value%identityAlignment
P42767 Aquaporin PIP-type1.5e-11677.01Show/hide
Query:  IKQKGR--QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAV
        I ++G+  Q  KDYVDPPPAPF  + ELKLWSF+RA IAEF+ATLLFLYIT+ATVIG   +T PC+ VGLLGIAWSFG MIFVLVYCTAGISGGHINPAV
Subjt:  IKQKGR--QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAV

Query:  TFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGL
        TFGL LAR+VSL+RA+ Y+ AQC GAI GV LV++FM+  YN  GGGAN VA G++KGTA GAE+IGTF+LVYTVFSATDPKR+ARDSHVP+LAPLPIG 
Subjt:  TFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGL

Query:  AVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN
        AVF+VHLATIPITGTGINPARS GAAV++N  RVW  HWIFWVGPFVGA+AAA YHQYVLRAAA+KAL SFRSN
Subjt:  AVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN

P93004 Aquaporin PIP2-78.9e-11775.82Show/hide
Query:  IKQKGR-QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVT
        + ++G+    KDYVDPPPAP + + ELK WSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVT
Subjt:  IKQKGR-QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVT

Query:  FGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLA
        FGL LAR+VSL+RA+ Y+ AQCLGAI GV  V++FM+  YN  GGGAN VA G+SKGTALGAEIIGTF+LVYTVFSATDPKR+ARDSH+PVLAPLPIG A
Subjt:  FGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLA

Query:  VFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN
        VF+VHLATIPITGTGINPARS GAAV++NN + W   WIFWVGPF+GA+AAA YHQY+LRA+A+KAL SFRSN
Subjt:  VFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN

Q7XLR1 Probable aquaporin PIP2-69.8e-11677.53Show/hide
Query:  KDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTG--PCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARR
        KDY DPPPAP   + EL+LWSFYRA+IAEF+ATLLFLYIT+ATVIG   Q+    C GVG LGIAW+FG MIF+LVYCTAGISGGHINPAVTFGLLLAR+
Subjt:  KDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTG--PCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARR

Query:  VSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLAT
        VS+IRAV YI AQCLG I GV +V+  M+H YNANGGGAN+VA+G+S GTALGAEIIGTF+LVYTVFSATDPKRNARDSHVPVLAPLPIG AVF+VHLAT
Subjt:  VSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLAT

Query:  IPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSNRS
        IPITGTGINPARS+GAAV++N  + W  HWIFW GPF+GA+AAA YHQY+LRAAA+KAL SFRSN S
Subjt:  IPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSNRS

Q8H5N9 Probable aquaporin PIP2-11.6e-11073.91Show/hide
Query:  GRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQT--------GPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINP
        G   +KDY DPPPAP +  +EL  WS YRAVIAEF+ATLLFLYIT+ATVIG   QT          C GVG+LGIAW+FG MIF+LVYCTAGISGGHINP
Subjt:  GRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQT--------GPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINP

Query:  AVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPI
        AVTFGL LAR+VSL+RA+ YI AQCLGAI GV LV++F    +N  GGGAN +AAG+SKGT L AEIIGTF+LVYTVFSATDPKRNARDSHVPVLAPLPI
Subjt:  AVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPI

Query:  GLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN
        G AVF+VHLATIPITGTGINPARS+GAAV+FNN + W +HWIFWVGPFVGA  AA YHQY+LRA A+KAL SFRSN
Subjt:  GLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN

Q9ZVX8 Probable aquaporin PIP2-84.2e-11978.68Show/hide
Query:  IKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTF
        + ++GR   KDYVDPPPAP + ++ELKLWSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTF
Subjt:  IKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTF

Query:  GLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAV
        GL LAR+VSL RAVAY+ AQCLGAI GV LV++FM   Y   GGGAN VA G+S GTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AV
Subjt:  GLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAV

Query:  FVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN
        F+VHLATIPITGTGINPARS GAAV++NN + W  HWIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFRSN
Subjt:  FVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN

Arabidopsis top hitse value%identityAlignment
AT2G16850.1 plasma membrane intrinsic protein 2;83.0e-12078.68Show/hide
Query:  IKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTF
        + ++GR   KDYVDPPPAP + ++ELKLWSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTF
Subjt:  IKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTF

Query:  GLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAV
        GL LAR+VSL RAVAY+ AQCLGAI GV LV++FM   Y   GGGAN VA G+S GTALGAEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AV
Subjt:  GLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAV

Query:  FVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN
        F+VHLATIPITGTGINPARS GAAV++NN + W  HWIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFRSN
Subjt:  FVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN

AT3G54820.1 plasma membrane intrinsic protein 2;58.5e-10772.39Show/hide
Query:  KDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGP------CSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLL
        KDY DPPP P    +EL  WSFYRA+IAEF+ATLLFLY+TI TVIG  +QT P      C+GVG+LGIAW+FG MIF+LVYCTAGISGGHINPAVTFGLL
Subjt:  KDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGP------CSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLL

Query:  LARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVV
        LAR+V+L+RAV Y+ AQCLGAI GVALV++F    +   GGGAN ++ G+S GT + AEIIGTF+LVYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+V
Subjt:  LARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVV

Query:  HLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS
        HLATIPITGTGINPARSLGAA+++N  + W HHWIFWVGPF GA  AA YHQ+VLRA A+KAL SFRS
Subjt:  HLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS

AT4G35100.1 plasma membrane intrinsic protein 36.3e-11875.82Show/hide
Query:  IKQKGR-QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVT
        + ++G+    KDYVDPPPAP + + ELK WSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVT
Subjt:  IKQKGR-QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVT

Query:  FGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLA
        FGL LAR+VSL+RA+ Y+ AQCLGAI GV  V++FM+  YN  GGGAN VA G+SKGTALGAEIIGTF+LVYTVFSATDPKR+ARDSH+PVLAPLPIG A
Subjt:  FGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLA

Query:  VFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN
        VF+VHLATIPITGTGINPARS GAAV++NN + W   WIFWVGPF+GA+AAA YHQY+LRA+A+KAL SFRSN
Subjt:  VFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN

AT4G35100.2 plasma membrane intrinsic protein 36.3e-11875.82Show/hide
Query:  IKQKGR-QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVT
        + ++G+    KDYVDPPPAP + + ELK WSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVT
Subjt:  IKQKGR-QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVT

Query:  FGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLA
        FGL LAR+VSL+RA+ Y+ AQCLGAI GV  V++FM+  YN  GGGAN VA G+SKGTALGAEIIGTF+LVYTVFSATDPKR+ARDSH+PVLAPLPIG A
Subjt:  FGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLA

Query:  VFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN
        VF+VHLATIPITGTGINPARS GAAV++NN + W   WIFWVGPF+GA+AAA YHQY+LRA+A+KAL SFRSN
Subjt:  VFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN

AT5G60660.1 plasma membrane intrinsic protein 2;43.2e-10669.78Show/hide
Query:  NIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTG------PCSGVGLLGIAWSFGAMIFVLVYCTAGISGGH
        ++ + G   ++DY DPPPAPF  + EL+ W  YRAVIAEF+ATLLFLY++I TVIG  AQT        C GVG+LGIAW+FG MIFVLVYCTAGISGGH
Subjt:  NIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTG------PCSGVGLLGIAWSFGAMIFVLVYCTAGISGGH

Query:  INPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAP
        INPAVT GL LAR+VSL+R V YI AQCLGAI G   V++F    Y   GGGAN +A G++KGT LGAEIIGTF+LVYTVFSATDPKRNARDSHVPVLAP
Subjt:  INPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAP

Query:  LPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS
        LPIG AVF+VHLATIPITGTGINPARS GAAV++NN + W   WIFWVGP +GA AAA YHQ++LRAAA+KAL SF S
Subjt:  LPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATTCTCAACCGCTGCTGAAGAACCAACAGGAGTTGCCGGAGTCCAGTCGCAAAGCTCCGCGAGTTGTCTCACTCGACGTCTTTCGCGGCCTCAGCGTCTTTAT
GATGATGTTCGTGGACTACGGTGGCTCGTTTTTGCCAATTATCGCTCATTCGTCATGGAATGGACTTCATTTGGCTGATTTTGTAATGCCTTGGTTTCTATTTATTGCGG
GAGCCTCGCTTGGACTTGTTTATAAAGAAGTACAAAGTAAAGTGACCGCTACAAGGAATGCAGCATGCAGGGCTTTGTACCTTTTTCTCCTGGGTGTTCTTCTTCAAGGT
GGTTACTTTCATGGAATAACATCTTTGACATATGGTGTTGATATGGAAAGGATTAGGTGGCTTGGCATTTTACAGAGAATATCTGTTGGATACTTGATTACTGCACTATG
TGAGATCTGGCTAACTCGTTGTACACGTGAAAGAGCTCAAAATACTAAGAGTTTCAGCTGGCATTGGTATATTATATTTTTTCTGTTGTCATTGTATATGGGACTGTCGT
ATGGTTTATATGTTCCAGATTGGGACTTTAAAATATCAGCCACGAGCTCTTCACTTCCATCAAATGGAACCCATGTTTACATGGTGAATTGTTCTGTTCGAGGTGATCTG
GGACCTGCTTGTAATTCTGTTGGGATGATTGATCGTTATGTTCTTGGTATTCACCATTTGTATACTAAACCTGTGTACAGAAATCTGAAGGAGTGCAACATTTCCTCCGG
TGGTCAAGTTCCTGAGACTTCACCTTCATGGTGTCATGCTCCTTTTGAACCTGAAGGTCTTTTAAGCTCTTTAACAGCTTCAGTGGCATGCATAATAGGACTTCAGTATG
GTCACATTCTTGCCAATGTACAGGATCACAAAAGTCGCACCAATAGCTGGTTCTCACTCTCGCTTAAGATTTTGGCCCTCGGAATATTCCTCGTCTTTATAGGTATCCCT
GTAAATAAGTCCCTCTACACAGTCAGCTACATGTTAATTACTTCAGCATCAGCAGGAATTATCTTTTGTGCTCTATATATCTTGACGCGGCATCGTATAAAGAACAGCTT
ACCACCTTTTTTCCTCGAGTCTGAAGGTCTCTTAATTTCTATAGGTGGATGTCCACGGCTATCGACGCTTGACATGTATTCTGGAGTGGATGGGGAAGCATGCTTTGAAT
CTGTCGACCGTATTCGAAATCACCCTGCAAATATTAAGCAAAAAGGCCGTCAGCAGTCTAAGGACTATGTGGATCCGCCACCAGCTCCATTCATGGGGTTGTCCGAGCTG
AAGCTATGGTCCTTTTACAGAGCTGTAATTGCAGAGTTCATGGCCACACTCCTTTTCCTGTACATCACCATCGCCACTGTCATTGGGAACAATGCACAAACAGGACCCTG
TTCTGGCGTTGGCCTCCTCGGCATAGCATGGTCCTTCGGCGCCATGATCTTCGTCCTTGTTTACTGTACTGCTGGTATTTCAGGTGGCCATATCAATCCCGCCGTGACGT
TCGGGCTGCTGCTGGCCAGAAGGGTGTCCCTGATCCGCGCGGTGGCTTACATAGCGGCGCAGTGTCTGGGAGCCATTGCCGGCGTTGCATTGGTTGAATCTTTCATGAGG
CACGCCTACAACGCCAACGGCGGCGGAGCCAACCTCGTCGCCGCCGGCTTCAGCAAAGGCACCGCCCTCGGGGCCGAGATCATTGGCACTTTTCTGCTGGTCTATACTGT
GTTCTCTGCCACTGACCCAAAGAGAAACGCCCGCGACTCCCATGTTCCTGTTTTGGCTCCATTGCCCATCGGACTTGCCGTCTTTGTGGTGCATCTGGCCACCATTCCAA
TTACAGGCACCGGCATCAACCCTGCTCGGAGCTTAGGCGCTGCGGTCATGTTCAACAACACAAGAGTCTGGAGCCACCATTGGATTTTCTGGGTTGGACCATTTGTTGGG
GCGATGGCAGCGGCGGTTTACCATCAATACGTTCTTAGAGCGGCGGCCGTGAAGGCACTTGTATCTTTCCGCAGCAACCGATCCACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGATTCTCAACCGCTGCTGAAGAACCAACAGGAGTTGCCGGAGTCCAGTCGCAAAGCTCCGCGAGTTGTCTCACTCGACGTCTTTCGCGGCCTCAGCGTCTTTAT
GATGATGTTCGTGGACTACGGTGGCTCGTTTTTGCCAATTATCGCTCATTCGTCATGGAATGGACTTCATTTGGCTGATTTTGTAATGCCTTGGTTTCTATTTATTGCGG
GAGCCTCGCTTGGACTTGTTTATAAAGAAGTACAAAGTAAAGTGACCGCTACAAGGAATGCAGCATGCAGGGCTTTGTACCTTTTTCTCCTGGGTGTTCTTCTTCAAGGT
GGTTACTTTCATGGAATAACATCTTTGACATATGGTGTTGATATGGAAAGGATTAGGTGGCTTGGCATTTTACAGAGAATATCTGTTGGATACTTGATTACTGCACTATG
TGAGATCTGGCTAACTCGTTGTACACGTGAAAGAGCTCAAAATACTAAGAGTTTCAGCTGGCATTGGTATATTATATTTTTTCTGTTGTCATTGTATATGGGACTGTCGT
ATGGTTTATATGTTCCAGATTGGGACTTTAAAATATCAGCCACGAGCTCTTCACTTCCATCAAATGGAACCCATGTTTACATGGTGAATTGTTCTGTTCGAGGTGATCTG
GGACCTGCTTGTAATTCTGTTGGGATGATTGATCGTTATGTTCTTGGTATTCACCATTTGTATACTAAACCTGTGTACAGAAATCTGAAGGAGTGCAACATTTCCTCCGG
TGGTCAAGTTCCTGAGACTTCACCTTCATGGTGTCATGCTCCTTTTGAACCTGAAGGTCTTTTAAGCTCTTTAACAGCTTCAGTGGCATGCATAATAGGACTTCAGTATG
GTCACATTCTTGCCAATGTACAGGATCACAAAAGTCGCACCAATAGCTGGTTCTCACTCTCGCTTAAGATTTTGGCCCTCGGAATATTCCTCGTCTTTATAGGTATCCCT
GTAAATAAGTCCCTCTACACAGTCAGCTACATGTTAATTACTTCAGCATCAGCAGGAATTATCTTTTGTGCTCTATATATCTTGACGCGGCATCGTATAAAGAACAGCTT
ACCACCTTTTTTCCTCGAGTCTGAAGGTCTCTTAATTTCTATAGGTGGATGTCCACGGCTATCGACGCTTGACATGTATTCTGGAGTGGATGGGGAAGCATGCTTTGAAT
CTGTCGACCGTATTCGAAATCACCCTGCAAATATTAAGCAAAAAGGCCGTCAGCAGTCTAAGGACTATGTGGATCCGCCACCAGCTCCATTCATGGGGTTGTCCGAGCTG
AAGCTATGGTCCTTTTACAGAGCTGTAATTGCAGAGTTCATGGCCACACTCCTTTTCCTGTACATCACCATCGCCACTGTCATTGGGAACAATGCACAAACAGGACCCTG
TTCTGGCGTTGGCCTCCTCGGCATAGCATGGTCCTTCGGCGCCATGATCTTCGTCCTTGTTTACTGTACTGCTGGTATTTCAGGTGGCCATATCAATCCCGCCGTGACGT
TCGGGCTGCTGCTGGCCAGAAGGGTGTCCCTGATCCGCGCGGTGGCTTACATAGCGGCGCAGTGTCTGGGAGCCATTGCCGGCGTTGCATTGGTTGAATCTTTCATGAGG
CACGCCTACAACGCCAACGGCGGCGGAGCCAACCTCGTCGCCGCCGGCTTCAGCAAAGGCACCGCCCTCGGGGCCGAGATCATTGGCACTTTTCTGCTGGTCTATACTGT
GTTCTCTGCCACTGACCCAAAGAGAAACGCCCGCGACTCCCATGTTCCTGTTTTGGCTCCATTGCCCATCGGACTTGCCGTCTTTGTGGTGCATCTGGCCACCATTCCAA
TTACAGGCACCGGCATCAACCCTGCTCGGAGCTTAGGCGCTGCGGTCATGTTCAACAACACAAGAGTCTGGAGCCACCATTGGATTTTCTGGGTTGGACCATTTGTTGGG
GCGATGGCAGCGGCGGTTTACCATCAATACGTTCTTAGAGCGGCGGCCGTGAAGGCACTTGTATCTTTCCGCAGCAACCGATCCACTTGA
Protein sequenceShow/hide protein sequence
MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQG
GYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSSSLPSNGTHVYMVNCSVRGDL
GPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANVQDHKSRTNSWFSLSLKILALGIFLVFIGIP
VNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEACFESVDRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSEL
KLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMR
HAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVG
AMAAAVYHQYVLRAAAVKALVSFRSNRST