; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023780 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023780
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionheparan-alpha-glucosaminide N-acetyltransferase
Genome locationtig00000892:6606028..6614691
RNA-Seq ExpressionSgr023780
SyntenySgr023780
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016020 - membrane (cellular component)
GO:0015267 - channel activity (molecular function)
InterPro domainsIPR000425 - Major intrinsic protein
IPR023271 - Aquaporin-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_020425883.1 uncharacterized protein LOC18767296 [Prunus persica]6.0e-24361.41Show/hide
Query:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY
        MAD  PLL   +       KPPR+ SLD        LMMLVDYGGS FPIIAHSPWNGLHLADFVMP+FLF AGVSLALVYK+V ++  AT  A  + L 
Subjt:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY

Query:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS
        LFLLGVLLQGGYFHG+TSLT+G D+ERIRW GILQRI++GY++AALCEIWL+R T +E    KS+ WHWC+IF LS++Y GL YGLYVPDW+FK  T TS
Subjt:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS

Query:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
          P + S+VY+V CSVRGDLGPACNSAGMIDR++LG+DHLY KPVYRNLKECN+S+ G+VPE+SPSWCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Subjt:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR
        ++HK R N W   S+ I  LG FL FIG+PVNKSLYT+SYMLITSASAGI FCALY+L+DV+GY+ +T +LEWMG H+LSI+VLV SN+ +I IQG YW 
Subjt:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR

Query:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQS-QQAKDYVDPPPAP
         P+NNIV ++                                                                        EGQ+    KDYVDPPPAP
Subjt:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQS-QQAKDYVDPPPAP

Query:  LLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------F
        L+   ELK WSFYRA+IAEF+ATLLFLYIT++TVIGN  Q+GPC+GVGLLGIAWAFG MIFVLVY  A                                
Subjt:  LLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------F

Query:  QCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPAR
        Q LGAI GVGLVKA  KH YN  GGGAN VA G++KGTALGAEIIGTF+LVYT+FSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPAR
Subjt:  QCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPAR

Query:  SLGTAVIFNN
        S G AVIFNN
Subjt:  SLGTAVIFNN

XP_021810408.1 uncharacterized protein LOC110753759 [Prunus avium]9.6e-24160.73Show/hide
Query:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY
        MAD  PLL   +       KPPR+ SLD        LMMLVDYGGS FPIIAHSPWNGLHLADFVMP+FLF AGVSLALVYK+V ++  AT  A  + L 
Subjt:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY

Query:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS
        LFLLGVLLQGGYFHG+TSLT+G D+ERIRW GILQRI++GY++AALCEIWL+R T  E    +S+ WHWC+IF LS++Y GL YGLYVPDW+FK ST TS
Subjt:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS

Query:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
         LP N S+VY+V CSVRGDLGPACNSAGMIDR +LG+DHLY KPVYRNLKECN+S+ G VPE+SP WCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Subjt:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR
        ++H  R N W   S+ I  LG FL FIG+PVNKSLYT+SYMLITSASAGI FCALY+L+DV+GY+ +T +LEWMG H+LSI+VLV SN+ +I IQG YW 
Subjt:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR

Query:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQS-QQAKDYVDPPPAP
         P+NNIV ++                                                                        EGQ+    KDYVDPPPAP
Subjt:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQS-QQAKDYVDPPPAP

Query:  LLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------F
        L+   ELK WSFYRA+IAEF+ATLLFLYIT++TVIG+  Q+GPC+GVGLLGIAWAFG MIFVLVY  A                                
Subjt:  LLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------F

Query:  QCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPAR
        Q LGAIAGVGLVKA+ KH YN  GGGAN V+ G++KGTALGAEIIGTF+LVYT+FSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPAR
Subjt:  QCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPAR

Query:  SLGTAVIFNNRSL
        S G AVI+N+  +
Subjt:  SLGTAVIFNNRSL

XP_022146281.1 heparan-alpha-glucosaminide N-acetyltransferase [Momordica charantia]1.2e-28874.19Show/hide
Query:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY
        MADSRPLLK+Q E P+ SGK PRVVSLD        LMMLVDYGGSF PIIAHSPWNGLHLADFVMPWFLF AGVSLALVYKEVK KVAATR A  RGLY
Subjt:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY

Query:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS
        LFLLGVLLQGGYFHGITSLTYG DMERIRWLGILQRISVGYLIAALCEIWLT  T EE  +TKSF WHWC IF L SLYMGLSYGLYVPDW+FKISTTTS
Subjt:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS

Query:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
        SLPPNGSY YMVNCSVRGD GPACNSAGMIDRYVLGIDHLYTKPVYRN+KECNISS GQVPETSPSWCHAPFEPEGLLSSLTA+VACIIGLQYGHIL+NV
Subjt:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR
        QEHKSRT  WFSLSLK L LGIFLVFIG+PVNKSLYTVSYMLITSASAGIVFCALYILVDVHGY+RLTC+LEWMGKHAL IYVLVISNI VIGIQGFYWR
Subjt:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR

Query:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQAKDYVDPPPAPL
        SPKNNI                                                                         V+ E + QQ KDYVDPPPAPL
Subjt:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQAKDYVDPPPAPL

Query:  LGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPP-------------------------------AFQ
        LGFSELKLWSF+RAVIAEFMATLLFLYIT+ATVIGN+ + GPC GVG LGIAWAFG MIFVLVY                                 A Q
Subjt:  LGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPP-------------------------------AFQ

Query:  CLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARS
        CLGAI GV LVK+ MKHAYN +GGGANLVA GF++GTALGAEIIGTFLLVYT+FSATDPKRNARDSHVPVLAPLPIG AVFVVHLATIPITGTGINPARS
Subjt:  CLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARS

Query:  LGTAVIFNN
        LG AV++NN
Subjt:  LGTAVIFNN

XP_034226800.1 uncharacterized protein LOC117636424 [Prunus dulcis]7.9e-24361.27Show/hide
Query:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY
        MAD  PLL   +       KPPR+ SLD        LMMLVDYGGS FPIIAHSPWNGLHLADFVMP+FLF AGVSLALVYK+V ++  AT  A  + L 
Subjt:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY

Query:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS
        LFLLGVLLQGGYFHG+TSLT+G D+ERIRW GILQRI++GY++AALCEIWL+R T +E    +S+ WHWC+IF LS++Y GL YGLYVPDW+FK  T TS
Subjt:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS

Query:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
          P N S+VY+V CSVRGDLGPACNSAGMIDR +LG+DHLY KPVYRNLKECN+S+ G+VPE+SPSWCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Subjt:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR
        ++HK R N W   S+ I  LG FL FIG+PVNKSLYT+SYMLITSASAGI FCALY+L+DV+GY+ +T +LEWMG H+LSI+VLV SN+ +I IQG YW 
Subjt:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR

Query:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQS-QQAKDYVDPPPAP
         P+NN+V ++                                                                        EGQ+    KDYVDPPPAP
Subjt:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQS-QQAKDYVDPPPAP

Query:  LLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------F
        L+   ELK WSFYRA+IAEF+ATLLFLYIT++TVIGN  Q+GPC+GVGLLGIAWAFG MIFVLVY  A                                
Subjt:  LLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------F

Query:  QCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPAR
        Q LGAI GVGLVKA  KH YN  GGGAN+VA+G++KGTALGAEIIGTF+LVYT+FSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPAR
Subjt:  QCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPAR

Query:  SLGTAVIFNN
        S G AVIFNN
Subjt:  SLGTAVIFNN

XP_038694264.1 uncharacterized protein LOC119991845 [Tripterygium wilfordii]8.2e-24061.85Show/hide
Query:  MADSRPLLKDQEESPQP---SGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCR
        MAD +P L + EE PQP   S KP RVVS+D        LMMLVDYGG+ FP+IAHSPWNGL LA+FVMP+FLF AG+S AL +K+V +KV AT     R
Subjt:  MADSRPLLKDQEESPQP---SGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCR

Query:  GLYLFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKIST
         + LFLLGVLLQGGYFHG  SLTYG D+ +IRWLG+LQRIS+GY++AALCEIWLT  T  E +  +S+CWHWC+ F +S++Y GL YGLYVPDW  K+S 
Subjt:  GLYLFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKIST

Query:  TT-SSLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHI
         T  SLP N + +Y V CSVRGDLGPACNSAGMIDR VLG+DHLY KPVYRNLKECNIS+SGQVP+TSP+WCHAPF+PEGLLSSLTA+V CIIGLQYGH+
Subjt:  TT-SSLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHI

Query:  LANVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQG
        LA++Q+HK R   W   S+ +L LG+FLVFIG+PVNKSLYT++YMLITSASAGI FC  Y+LVDV+GY+ LT ILEWMG H+LSI+VLV SN+ V+ IQG
Subjt:  LANVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQG

Query:  FYWRSPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQAKDYVDPP
        FY   P NNI                                                                   KM SK+VI EG+  Q KDYVDPP
Subjt:  FYWRSPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQAKDYVDPP

Query:  PAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-----------------------------
        PAPL+  +ELKLWSFYRA+IAEF+ATLLFLY+T+ATVIG+ +QTGPC+GVGLLGIAWAFG MIFVLVY  A                             
Subjt:  PAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-----------------------------

Query:  --FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGIN
           QCLGAI GVGLVKA MKH YN  GGGAN VA G++KGTALGAEIIGTF+LVYT+FSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGIN
Subjt:  --FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGIN

Query:  PARSLGTAVIFNN
        PARS G AVI+NN
Subjt:  PARSLGTAVIFNN

TrEMBL top hitse value%identityAlignment
A0A1S2Y9C8 uncharacterized protein LOC1014942833.6e-23359.08Show/hide
Query:  MADSRPLLKDQEESP--QPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRG
        MAD +PLL +    P  Q +  P RV S+D        LM+LVDYG S FPII+H+PWNGLHLADFVMP+FLF AG+SLALVYK    +  AT  A  R 
Subjt:  MADSRPLLKDQEESP--QPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRG

Query:  LYLFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTT
        L LF+LG+LLQGGY HG TSLTYG D+ RIR  G+LQRIS+GY++AALCEIWL     +E    KS+ WHW +  +L ++Y GL YGLYVPDW F +ST+
Subjt:  LYLFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTT

Query:  TSSLPP-NGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHIL
        TSSLPP +G  +Y VNCSVRGD GPACNSAGMIDRY+LG+DHLY KPVYRNLKECN+SS+GQ+ ++SPSWCHAPF+PEG+LSS+TA+V+CIIGLQYGHIL
Subjt:  TSSLPP-NGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHIL

Query:  ANVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGF
        A++++HK R + W S S+    LG+FL  IG+PVNKSLYTVSYML++SA++G+ F ALY+LVDV+G++RLT +LEWMGKH+LSI+VLV SN+ VI IQGF
Subjt:  ANVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGF

Query:  YWRSPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQ----AKDYV
        YW  P+NNI                                                                        EV  EG  Q      KDYV
Subjt:  YWRSPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQ----AKDYV

Query:  DPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA--------------------------
        DPPPAPLL F+E+KLWSFYRA+IAEF+ATLLFLY+T+ATVIG+ +QTGPC+GVGLLGIAW+FG MIFVLVY  A                          
Subjt:  DPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA--------------------------

Query:  -----FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGT
              QCLGAI GVGLVKALMK  YN  GGGAN VA+G++KG+ALGAE+IGTF+LVYT+FSATDPKRNARDSHVPVLAPLPIG AVF+VHLATIPITGT
Subjt:  -----FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGT

Query:  GINPARSLGTAVIFNN
        GINPARS G AVIFNN
Subjt:  GINPARSLGTAVIFNN

A0A1S3U4S1 uncharacterized protein LOC1067619128.5e-23559.49Show/hide
Query:  MADSRP-LLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGL
        MAD +P LL D E +   + +P R+ SLD        LM+LVDYGGS FPIIAH+PWNG+HLAD VMP+FLF AG+SLALVYK    +  AT  A  R +
Subjt:  MADSRP-LLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGL

Query:  YLFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTT
         LF+LGV+LQGGYFHGITSLT+G D+ERIRWLGILQRIS+GY++AALCEIWL     ++    K++ WH  ++ +L +LY GL YGLYVPDW F +S +T
Subjt:  YLFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTT

Query:  SSLPP-NGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA
        SSLPP  G  +Y VNCSVRGDLGPACNSAGMIDRY+LG+ HLY KPVYRNL+ECN+ S GQV + SPSWCHAPF+PEG+LSS+TA+V+CIIGLQYGH+LA
Subjt:  SSLPP-NGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA

Query:  NVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFY
        ++Q+HK R   W   SL +L LG+FL   G+P+NK+LYTVSYML+TSA++G+ F ALY LVDVHG +RLT +LEWMGKH+LSI+VLV SN+ VI IQGFY
Subjt:  NVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFY

Query:  WRSPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQAKDYVDPPPA
        W  P+NNI+                 +W                                        +   +     SKE+  EG  QQ KDY DPPPA
Subjt:  WRSPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQAKDYVDPPPA

Query:  PLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------
        PL+  +E+KLWSFYRA+IAEF+ATLLFLY+T+ATVIG+ +QTGPC+GVGLLGIAWAFG MIFVLVY  A                               
Subjt:  PLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------

Query:  FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPA
         QCLGAI+GVGLVKA MKH YN  GGGAN V++G++KGTALGAEIIGTF+LVYT+FSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPA
Subjt:  FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPA

Query:  RSLGTAVIFNN
        RS G AVI+NN
Subjt:  RSLGTAVIFNN

A0A2I4GCB0 uncharacterized protein LOC1090066089.4e-23460.06Show/hide
Query:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY
        M D  P + D E+ P    +  R+ SLD        LMMLVDY GS FPII+HSPW+G+HLADFVMP+FLF AGVS ALVYK+V ++V AT  A  R + 
Subjt:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY

Query:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS
        LFLLGV+LQGGY HG+ SLT+G D+E+IRW+GILQRIS+GY++AALCEIWLT  T  E    K + WHWCI F LS++Y+ LSYGLYVP+W F + ++ S
Subjt:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS

Query:  SLPP-NGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILAN
        SLPP N S VYMV C+VRGDLGPACNSAGMIDRY+LGIDHLY KPVYRNLKEC IS++GQV ++SP WCHAPF+PEG+LSSLTA+V CI GLQYGH+LAN
Subjt:  SLPP-NGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILAN

Query:  VQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYW
        +Q+HK R   W   S  +  LG+ L F G+P+NKSLYTVSYMLITSASAGI FCALY+LVDV+GY+RLTC+LEWMGKH+LSI+VL+ SN+ +I IQGFYW
Subjt:  VQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYW

Query:  RSPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQS-QQAKDYVDPPPA
          P NNI                                                                        EV  EG S Q  KDYVDPPPA
Subjt:  RSPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQS-QQAKDYVDPPPA

Query:  PLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------
        PL+  +ELKLWSFYRA+IAEF+ATLLFLYIT+ATVIG  +++GPC+GVGLLGIAWAFG MIFVLVY  A                               
Subjt:  PLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------

Query:  FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPA
         QCLGAI GVGLVKA MKH YN  GGG N VA G++KGTALGAEIIGTF+LVYT+FSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPA
Subjt:  FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPA

Query:  RSLGTAVIFNN
        RS G AVI++N
Subjt:  RSLGTAVIFNN

A0A6J1CYU3 heparan-alpha-glucosaminide N-acetyltransferase6.0e-28974.19Show/hide
Query:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY
        MADSRPLLK+Q E P+ SGK PRVVSLD        LMMLVDYGGSF PIIAHSPWNGLHLADFVMPWFLF AGVSLALVYKEVK KVAATR A  RGLY
Subjt:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY

Query:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS
        LFLLGVLLQGGYFHGITSLTYG DMERIRWLGILQRISVGYLIAALCEIWLT  T EE  +TKSF WHWC IF L SLYMGLSYGLYVPDW+FKISTTTS
Subjt:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS

Query:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
        SLPPNGSY YMVNCSVRGD GPACNSAGMIDRYVLGIDHLYTKPVYRN+KECNISS GQVPETSPSWCHAPFEPEGLLSSLTA+VACIIGLQYGHIL+NV
Subjt:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR
        QEHKSRT  WFSLSLK L LGIFLVFIG+PVNKSLYTVSYMLITSASAGIVFCALYILVDVHGY+RLTC+LEWMGKHAL IYVLVISNI VIGIQGFYWR
Subjt:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR

Query:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQAKDYVDPPPAPL
        SPKNNI                                                                         V+ E + QQ KDYVDPPPAPL
Subjt:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQAKDYVDPPPAPL

Query:  LGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPP-------------------------------AFQ
        LGFSELKLWSF+RAVIAEFMATLLFLYIT+ATVIGN+ + GPC GVG LGIAWAFG MIFVLVY                                 A Q
Subjt:  LGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPP-------------------------------AFQ

Query:  CLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARS
        CLGAI GV LVK+ MKHAYN +GGGANLVA GF++GTALGAEIIGTFLLVYT+FSATDPKRNARDSHVPVLAPLPIG AVFVVHLATIPITGTGINPARS
Subjt:  CLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARS

Query:  LGTAVIFNN
        LG AV++NN
Subjt:  LGTAVIFNN

A0A6P5S9C4 uncharacterized protein LOC1107537594.7e-24160.73Show/hide
Query:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY
        MAD  PLL   +       KPPR+ SLD        LMMLVDYGGS FPIIAHSPWNGLHLADFVMP+FLF AGVSLALVYK+V ++  AT  A  + L 
Subjt:  MADSRPLLKDQEESPQPSGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLY

Query:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS
        LFLLGVLLQGGYFHG+TSLT+G D+ERIRW GILQRI++GY++AALCEIWL+R T  E    +S+ WHWC+IF LS++Y GL YGLYVPDW+FK ST TS
Subjt:  LFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS

Query:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV
         LP N S+VY+V CSVRGDLGPACNSAGMIDR +LG+DHLY KPVYRNLKECN+S+ G VPE+SP WCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Subjt:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV

Query:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR
        ++H  R N W   S+ I  LG FL FIG+PVNKSLYT+SYMLITSASAGI FCALY+L+DV+GY+ +T +LEWMG H+LSI+VLV SN+ +I IQG YW 
Subjt:  QEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWR

Query:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQS-QQAKDYVDPPPAP
         P+NNIV ++                                                                        EGQ+    KDYVDPPPAP
Subjt:  SPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAEFEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQS-QQAKDYVDPPPAP

Query:  LLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------F
        L+   ELK WSFYRA+IAEF+ATLLFLYIT++TVIG+  Q+GPC+GVGLLGIAWAFG MIFVLVY  A                                
Subjt:  LLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------------------------------F

Query:  QCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPAR
        Q LGAIAGVGLVKA+ KH YN  GGGAN V+ G++KGTALGAEIIGTF+LVYT+FSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPAR
Subjt:  QCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPAR

Query:  SLGTAVIFNNRSL
        S G AVI+N+  +
Subjt:  SLGTAVIFNNRSL

SwissProt top hitse value%identityAlignment
P42767 Aquaporin PIP-type1.1e-7465.68Show/hide
Query:  SKEVIA-EGQ-SQQAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------
        SKEVI+ EGQ  Q  KDYVDPPPAP     ELKLWSF+RA IAEF+ATLLFLYIT+ATVIG  ++T PC  VGLLGIAW+FG MIFVLVY  A       
Subjt:  SKEVIA-EGQ-SQQAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA-------

Query:  ------------------------FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPL
                                 QC GAI GVGLVKA MK  YN  GGGAN VA G+NKGTA GAE+IGTF+LVYT+FSATDPKR+ARDSHVP+LAPL
Subjt:  ------------------------FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPL

Query:  PIGLAVFVVHLATIPITGTGINPARSLGTAVIFNNR
        PIG AVF+VHLATIPITGTGINPARS G AVI+N +
Subjt:  PIGLAVFVVHLATIPITGTGINPARSLGTAVIFNNR

P93004 Aquaporin PIP2-74.0e-8068.24Show/hide
Query:  SKEVIAEGQSQQAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA---------
        SKEV  EG++   KDYVDPPPAPLL   ELK WSFYRA+IAEF+ATLLFLY+T+ATVIG+ +QTGPC+GVGLLGIAWAFG MIFVLVY  A         
Subjt:  SKEVIAEGQSQQAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA---------

Query:  ----------------------FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPI
                               QCLGAI GVG VKA MK  YN  GGGAN VA G++KGTALGAEIIGTF+LVYT+FSATDPKR+ARDSH+PVLAPLPI
Subjt:  ----------------------FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPI

Query:  GLAVFVVHLATIPITGTGINPARSLGTAVIFNN
        G AVF+VHLATIPITGTGINPARS G AVI+NN
Subjt:  GLAVFVVHLATIPITGTGINPARSLGTAVIFNN

Q7XLR1 Probable aquaporin PIP2-62.3e-7564.41Show/hide
Query:  SKEVIAEGQSQQAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTG--PCNGVGLLGIAWAFGAMIFVLVYPPA-------
        SKEV  E +  + KDY DPPPAPL    EL+LWSFYRA+IAEF+ATLLFLYIT+ATVIG   Q+    C GVG LGIAWAFG MIF+LVY  A       
Subjt:  SKEVIAEGQSQQAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTG--PCNGVGLLGIAWAFGAMIFVLVYPPA-------

Query:  ------------------------FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPL
                                 QCLG I GVG+VK +MKH YN NGGGAN+VA+G++ GTALGAEIIGTF+LVYT+FSATDPKRNARDSHVPVLAPL
Subjt:  ------------------------FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPL

Query:  PIGLAVFVVHLATIPITGTGINPARSLGTAVIFNNR
        PIG AVF+VHLATIPITGTGINPARS+G AVI+N +
Subjt:  PIGLAVFVVHLATIPITGTGINPARSLGTAVIFNNR

Q8H5N9 Probable aquaporin PIP2-14.9e-7061.38Show/hide
Query:  MSSKEVIAEGQSQ---QAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQT--------GPCNGVGLLGIAWAFGAMIFVLV
        M   EV+  G +     AKDY DPPPAPL+  +EL  WS YRAVIAEF+ATLLFLYIT+ATVIG   QT          C GVG+LGIAWAFG MIF+LV
Subjt:  MSSKEVIAEGQSQ---QAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQT--------GPCNGVGLLGIAWAFGAMIFVLV

Query:  YPPA-------------------------------FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNA
        Y  A                                QCLGAI GVGLVKA     +N  GGGAN +AAG++KGT L AEIIGTF+LVYT+FSATDPKRNA
Subjt:  YPPA-------------------------------FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNA

Query:  RDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGTAVIFNN
        RDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS+G AVIFNN
Subjt:  RDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGTAVIFNN

Q9ZVX8 Probable aquaporin PIP2-81.1e-7768.24Show/hide
Query:  SKEVIAEGQSQQAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA---------
        SKEV  EG  +  KDYVDPPPAPLL  +ELKLWSFYRA+IAEF+ATLLFLY+T+ATVIG+  QTGPC GVGLLGIAWAFG MIFVLVY  A         
Subjt:  SKEVIAEGQSQQAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAMIFVLVYPPA---------

Query:  ----------------------FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPI
                               QCLGAI GVGLVKA M   Y   GGGAN VA G++ GTALGAEIIGTF+LVYT+FSATDPKR+ARDSHVPVLAPLPI
Subjt:  ----------------------FQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPI

Query:  GLAVFVVHLATIPITGTGINPARSLGTAVIFNN
        G AVF+VHLATIPITGTGINPARS G AVI+NN
Subjt:  GLAVFVVHLATIPITGTGINPARSLGTAVIFNN

Arabidopsis top hitse value%identityAlignment
AT5G27730.1 Protein of unknown function (DUF1624)5.7e-9042.5Show/hide
Query:  SGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLYLFLLGVLLQGGYFHGIT
        +G  PR+ SLD        LM+LVD  G  +P+IAH+PWNG +LADFVMP+FLF  GVS+AL  K + +K  A +    R   L   G+LLQGG+ H   
Subjt:  SGKPPRVVSLD--------LMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLYLFLLGVLLQGGYFHGIT

Query:  SLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHT------KSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTSSLPPNGSYVYM
         LTYG D+  +R+ GILQRI++ YL+ AL EI+ T+ + EE   T      KS+ WHW +   +  +Y+   YG YVPDW+F +    S L      +  
Subjt:  SLTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHT------KSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTSSLPPNGSYVYM

Query:  VNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISS--SGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANVQEHKSRTNR
        V+C VRG L P CN+ G +DR VLGI+H+Y  P +R  K C   S   G + + +PSWC APFEPEG+LSS++A ++ IIG+ +GHI+ +++ H +R   
Subjt:  VNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNISS--SGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANVQEHKSRTNR

Query:  WFSLSLKILGLGIFLVFIGL-PVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWRSPKNNIVS
        W S  L +L LG+ L F  L P+NK LY+ SY+ +TS +A +VF +LY LVD+  ++ +   L+W+G +A+ +YV+    I+     G+Y+R P N +++
Subjt:  WFSLSLKILGLGIFLVFIGL-PVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWRSPKNNIVS

AT5G47900.1 Protein of unknown function (DUF1624)4.3e-11448.46Show/hide
Query:  KDQEESPQ---PSGKPP---RVVSLDL--------MMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLYLF
        KD E + Q    S  PP   R+VSLD+        M+LVD  G   P I HSPW+G+ LADFVMP+FLF  GVSLA  YK +  +  ATR A  R L L 
Subjt:  KDQEESPQ---PSGKPP---RVVSLDL--------MMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLYLF

Query:  LLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWL--TRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS
        LLG+ LQGG+ HG+ +LTYG D+E+IR +GILQRI++ YL+ ALCEIWL      S E    K + +HW + F+++++Y+ L YGLYVPDW+++I     
Subjt:  LLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWL--TRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS

Query:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNIS--SSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA
                   V C VRG  GP CN+ GM+DR  LGI HLY KPVY   K+C+I+  ++G +P  +PSWC APF+PEGLLSSL A+V C++GL YGHI+ 
Subjt:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNIS--SSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA

Query:  NVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFY
        + ++HK R N+W   S  +L LG+ L   G+ +NK LYT+SYM +TS ++G +  A+Y++VDV+GY+R + +LEWMG HAL IYVL+  N+V + I GFY
Subjt:  NVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFY

Query:  WRSPKNNIVSTKKWSYLLMGTGK
        W++P NN++        L+G GK
Subjt:  WRSPKNNIVSTKKWSYLLMGTGK

AT5G47900.4 Protein of unknown function (DUF1624)5.3e-9643.98Show/hide
Query:  KDQEESPQ---PSGKPP---RVVSLDL--------MMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLYLF
        KD E + Q    S  PP   R+VSLD+        M+LVD  G   P I HSPW+G+ LADFVMP+FLF  GVSLA  YK +  +  ATR A  R L L 
Subjt:  KDQEESPQ---PSGKPP---RVVSLDL--------MMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLYLF

Query:  LLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWL--TRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS
        LLG+ LQGG+ HG+ +LTYG D+E+IR +GILQRI++ YL+ ALCEIWL      S E    K + +HW + F+++++Y+ L YGLYVPDW+++I     
Subjt:  LLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWL--TRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS

Query:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNIS--SSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA
                   V C VRG  GP CN+ GM+DR  LGI HLY KPVY   K+C+I+  ++G +P  +PSWC APF+PEGLLSSL A+V C++GL YGHI+ 
Subjt:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNIS--SSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA

Query:  NVQEHKSRTNRWFSLSLKI---------LGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNI
        + + + S+   +   S+ I         +    FL        + L+ +   +I     G        LVDV+GY+R + +LEWMG HAL IYVL+  N+
Subjt:  NVQEHKSRTNRWFSLSLKI---------LGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNI

Query:  VVIGIQGFYWRSPKNNIVSTKKWSYLLMGTGK
        V + I GFYW++P NN++        L+G GK
Subjt:  VVIGIQGFYWRSPKNNIVSTKKWSYLLMGTGK

AT5G47900.6 Protein of unknown function (DUF1624)6.9e-9648.81Show/hide
Query:  ATRNATCRGLYLFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWL--TRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLY
        ATR A  R L L LLG+ LQGG+ HG+ +LTYG D+E+IR +GILQRI++ YL+ ALCEIWL      S E    K + +HW + F+++++Y+ L YGLY
Subjt:  ATRNATCRGLYLFLLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWL--TRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLY

Query:  VPDWDFKISTTTSSLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNIS--SSGQVPETSPSWCHAPFEPEGLLSSLTASV
        VPDW+++I                V C VRG  GP CN+ GM+DR  LGI HLY KPVY   K+C+I+  ++G +P  +PSWC APF+PEGLLSSL A+V
Subjt:  VPDWDFKISTTTSSLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNIS--SSGQVPETSPSWCHAPFEPEGLLSSLTASV

Query:  ACIIGLQYGHILANVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLV
         C++GL YGHI+ + ++HK R N+W   S  +L LG+ L   G+ +NK LYT+SYM +TS ++G +  A+Y++VDV+GY+R + +LEWMG HAL IYVL+
Subjt:  ACIIGLQYGHILANVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLV

Query:  ISNIVVIGIQGFYWRSPKNNIVSTKKWSYLLMGTGK
          N+V + I GFYW++P NN++        L+G GK
Subjt:  ISNIVVIGIQGFYWRSPKNNIVSTKKWSYLLMGTGK

AT5G47900.7 Protein of unknown function (DUF1624)3.3e-9043.97Show/hide
Query:  KDQEESPQ---PSGKPP---RVVSLDL--------MMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLYLF
        KD E + Q    S  PP   R+VSLD+        M+LVD  G   P I HSPW+G+ LADFVMP+FLF  GVSLA  YK +  +  ATR A  R L L 
Subjt:  KDQEESPQ---PSGKPP---RVVSLDL--------MMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLYLF

Query:  LLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWL--TRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS
        LLG+ LQGG+ HG+ +LTYG D+E+IR +GILQRI++ YL+ ALCEIWL      S E    K + +HW + F+++++Y+ L YGLYVPDW+++I     
Subjt:  LLGVLLQGGYFHGITSLTYGADMERIRWLGILQRISVGYLIAALCEIWL--TRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTS

Query:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNIS--SSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA
                   V C VRG  GP CN+ GM+DR  LGI HLY KPVY   K+C+I+  ++G +P  +PSWC APF+PEGLLSSL A+V C++GL YGHI+ 
Subjt:  SLPPNGSYVYMVNCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKECNIS--SSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA

Query:  N-------------------------------------VQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILV
        +                                     +Q+HK R N+W   S  +L LG+ L   G+ +NK LYT+SYM +TS ++G +  A+Y++V
Subjt:  N-------------------------------------VQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTVSYMLITSASAGIVFCALYILV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATTCTCGACCGCTGCTGAAGGATCAAGAAGAGTCGCCGCAGCCTAGTGGCAAACCTCCGCGCGTTGTATCACTCGACCTGATGATGCTCGTGGACTACGGTGG
CTCGTTTTTTCCAATTATTGCTCATTCGCCCTGGAATGGACTTCATTTGGCTGATTTTGTGATGCCTTGGTTTCTATTTGCTGCCGGAGTCTCGCTTGCACTTGTTTACA
AAGAAGTAAAAGATAAAGTGGCCGCTACAAGAAATGCAACATGCAGGGGTCTGTACCTTTTTCTCCTTGGTGTTCTTCTTCAAGGTGGTTACTTTCATGGAATAACATCT
TTGACATATGGTGCTGATATGGAAAGGATTAGGTGGCTTGGCATTTTGCAGAGAATATCTGTTGGATACTTGATTGCTGCTTTATGTGAGATCTGGCTAACTCGTTACAC
AAGTGAGGAAGCTCGACATACTAAGAGCTTCTGCTGGCATTGGTGCATTATATTTCTTCTGTCGTCATTGTATATGGGACTGTCATATGGTTTATATGTTCCAGATTGGG
ACTTTAAAATATCAACCACAACCTCTTCACTTCCACCAAATGGAAGCTATGTTTACATGGTGAATTGTTCTGTTCGAGGTGATCTTGGACCTGCTTGTAATTCTGCTGGG
ATGATTGATCGTTATGTTCTTGGTATTGACCATTTGTATACTAAACCTGTCTACAGAAATCTGAAGGAGTGCAACATTTCTTCCAGTGGTCAAGTTCCTGAGACTTCACC
TTCATGGTGCCATGCTCCTTTTGAACCTGAAGGTCTTTTAAGCTCTTTAACAGCTTCAGTGGCATGCATAATAGGACTTCAGTATGGTCACATTCTCGCCAATGTACAGG
AACACAAAAGTCGCACCAATAGGTGGTTCTCACTTTCGCTTAAGATTTTGGGTCTCGGAATATTCCTCGTCTTTATAGGTCTCCCTGTAAATAAGTCCCTCTACACAGTC
AGCTATATGCTGATTACTTCTGCATCAGCAGGAATAGTCTTTTGTGCTCTATATATCTTGGTGGATGTCCACGGCTATCAACGCTTGACGTGTATTCTGGAGTGGATGGG
GAAGCATGCTTTGAGTATTTATGTTTTAGTAATCTCTAACATAGTCGTTATTGGGATACAAGGATTCTACTGGAGATCTCCTAAAAATAACATTGTAAGCACTAAAAAAT
GGAGCTATCTTTTGATGGGAACTGGGAAAGTATTTGGAAGTTGGAGCGGCTATCTTACTGGGGTCAATTCACTTATCAGTAGTTGTAGTGCTCGAGACCGCTCGGCCGAG
TTCGAAATTATCCTCCAAAGATTAGGCAAAAAGGCCGGTAAGAAGATAACAGCTATTGCAGAGAGAAGCTTTGGGAAAATGTCTTCTAAGGAAGTGATCGCAGAGGGACA
GAGCCAGCAGGCCAAAGACTATGTCGACCCGCCACCAGCTCCGCTCCTCGGCTTCTCTGAGCTCAAGCTTTGGTCCTTCTACAGAGCTGTCATCGCTGAGTTCATGGCTA
CTCTCCTTTTCCTCTACATCACCATCGCCACCGTCATCGGAAACAGCAGACAAACCGGGCCCTGCAATGGAGTCGGCCTTCTCGGCATAGCCTGGGCCTTCGGCGCCATG
ATCTTCGTCCTCGTATACCCGCCGGCATTTCAGTGTCTGGGAGCCATTGCTGGCGTTGGGTTGGTTAAAGCTCTCATGAAGCACGCCTACAACGGCAATGGCGGCGGAGC
GAACCTCGTTGCCGCCGGGTTCAACAAGGGCACCGCCCTCGGAGCCGAGATAATCGGCACTTTTCTTCTGGTCTACACCTTGTTCTCAGCCACTGACCCGAAGAGAAACG
CCCGTGACTCCCATGTTCCTGTTTTGGCTCCATTGCCAATCGGACTCGCCGTCTTCGTGGTGCATCTGGCCACCATTCCCATTACCGGCACCGGTATCAACCCTGCTCGG
AGCTTGGGCACCGCCGTCATATTCAACAACAGGAGTCTGGACCGACCATTGGATCTTCTGGGTTGGGCCGTTTGTCGGGGCGGTGGCGGCGGCGGTGTACCATCAATACG
TTCTCAGAGCGGCGGCCGTGAAAGCACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGATTCTCGACCGCTGCTGAAGGATCAAGAAGAGTCGCCGCAGCCTAGTGGCAAACCTCCGCGCGTTGTATCACTCGACCTGATGATGCTCGTGGACTACGGTGG
CTCGTTTTTTCCAATTATTGCTCATTCGCCCTGGAATGGACTTCATTTGGCTGATTTTGTGATGCCTTGGTTTCTATTTGCTGCCGGAGTCTCGCTTGCACTTGTTTACA
AAGAAGTAAAAGATAAAGTGGCCGCTACAAGAAATGCAACATGCAGGGGTCTGTACCTTTTTCTCCTTGGTGTTCTTCTTCAAGGTGGTTACTTTCATGGAATAACATCT
TTGACATATGGTGCTGATATGGAAAGGATTAGGTGGCTTGGCATTTTGCAGAGAATATCTGTTGGATACTTGATTGCTGCTTTATGTGAGATCTGGCTAACTCGTTACAC
AAGTGAGGAAGCTCGACATACTAAGAGCTTCTGCTGGCATTGGTGCATTATATTTCTTCTGTCGTCATTGTATATGGGACTGTCATATGGTTTATATGTTCCAGATTGGG
ACTTTAAAATATCAACCACAACCTCTTCACTTCCACCAAATGGAAGCTATGTTTACATGGTGAATTGTTCTGTTCGAGGTGATCTTGGACCTGCTTGTAATTCTGCTGGG
ATGATTGATCGTTATGTTCTTGGTATTGACCATTTGTATACTAAACCTGTCTACAGAAATCTGAAGGAGTGCAACATTTCTTCCAGTGGTCAAGTTCCTGAGACTTCACC
TTCATGGTGCCATGCTCCTTTTGAACCTGAAGGTCTTTTAAGCTCTTTAACAGCTTCAGTGGCATGCATAATAGGACTTCAGTATGGTCACATTCTCGCCAATGTACAGG
AACACAAAAGTCGCACCAATAGGTGGTTCTCACTTTCGCTTAAGATTTTGGGTCTCGGAATATTCCTCGTCTTTATAGGTCTCCCTGTAAATAAGTCCCTCTACACAGTC
AGCTATATGCTGATTACTTCTGCATCAGCAGGAATAGTCTTTTGTGCTCTATATATCTTGGTGGATGTCCACGGCTATCAACGCTTGACGTGTATTCTGGAGTGGATGGG
GAAGCATGCTTTGAGTATTTATGTTTTAGTAATCTCTAACATAGTCGTTATTGGGATACAAGGATTCTACTGGAGATCTCCTAAAAATAACATTGTAAGCACTAAAAAAT
GGAGCTATCTTTTGATGGGAACTGGGAAAGTATTTGGAAGTTGGAGCGGCTATCTTACTGGGGTCAATTCACTTATCAGTAGTTGTAGTGCTCGAGACCGCTCGGCCGAG
TTCGAAATTATCCTCCAAAGATTAGGCAAAAAGGCCGGTAAGAAGATAACAGCTATTGCAGAGAGAAGCTTTGGGAAAATGTCTTCTAAGGAAGTGATCGCAGAGGGACA
GAGCCAGCAGGCCAAAGACTATGTCGACCCGCCACCAGCTCCGCTCCTCGGCTTCTCTGAGCTCAAGCTTTGGTCCTTCTACAGAGCTGTCATCGCTGAGTTCATGGCTA
CTCTCCTTTTCCTCTACATCACCATCGCCACCGTCATCGGAAACAGCAGACAAACCGGGCCCTGCAATGGAGTCGGCCTTCTCGGCATAGCCTGGGCCTTCGGCGCCATG
ATCTTCGTCCTCGTATACCCGCCGGCATTTCAGTGTCTGGGAGCCATTGCTGGCGTTGGGTTGGTTAAAGCTCTCATGAAGCACGCCTACAACGGCAATGGCGGCGGAGC
GAACCTCGTTGCCGCCGGGTTCAACAAGGGCACCGCCCTCGGAGCCGAGATAATCGGCACTTTTCTTCTGGTCTACACCTTGTTCTCAGCCACTGACCCGAAGAGAAACG
CCCGTGACTCCCATGTTCCTGTTTTGGCTCCATTGCCAATCGGACTCGCCGTCTTCGTGGTGCATCTGGCCACCATTCCCATTACCGGCACCGGTATCAACCCTGCTCGG
AGCTTGGGCACCGCCGTCATATTCAACAACAGGAGTCTGGACCGACCATTGGATCTTCTGGGTTGGGCCGTTTGTCGGGGCGGTGGCGGCGGCGGTGTACCATCAATACG
TTCTCAGAGCGGCGGCCGTGAAAGCACTTAG
Protein sequenceShow/hide protein sequence
MADSRPLLKDQEESPQPSGKPPRVVSLDLMMLVDYGGSFFPIIAHSPWNGLHLADFVMPWFLFAAGVSLALVYKEVKDKVAATRNATCRGLYLFLLGVLLQGGYFHGITS
LTYGADMERIRWLGILQRISVGYLIAALCEIWLTRYTSEEARHTKSFCWHWCIIFLLSSLYMGLSYGLYVPDWDFKISTTTSSLPPNGSYVYMVNCSVRGDLGPACNSAG
MIDRYVLGIDHLYTKPVYRNLKECNISSSGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANVQEHKSRTNRWFSLSLKILGLGIFLVFIGLPVNKSLYTV
SYMLITSASAGIVFCALYILVDVHGYQRLTCILEWMGKHALSIYVLVISNIVVIGIQGFYWRSPKNNIVSTKKWSYLLMGTGKVFGSWSGYLTGVNSLISSCSARDRSAE
FEIILQRLGKKAGKKITAIAERSFGKMSSKEVIAEGQSQQAKDYVDPPPAPLLGFSELKLWSFYRAVIAEFMATLLFLYITIATVIGNSRQTGPCNGVGLLGIAWAFGAM
IFVLVYPPAFQCLGAIAGVGLVKALMKHAYNGNGGGANLVAAGFNKGTALGAEIIGTFLLVYTLFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPAR
SLGTAVIFNNRSLDRPLDLLGWAVCRGGGGGGVPSIRSQSGGREST