; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G12950 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G12950
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF3537)
Genome locationClcChr08:24162679..24167833
RNA-Seq ExpressionClc08G12950
SyntenyClc08G12950
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065010.1 uncharacterized protein E6C27_scaffold82G003340 [Cucumis melo var. makuwa]9.0e-20889.51Show/hide
Query:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR
        MDSGESNLPLLLQHSY RSKS+TSDDLRNFRISLKWCALDHSS  GKFFSY  F FLT +VPAVT++A++DP DDPISFNKLVQVPESGLAAV FLSLCR
Subjt:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQ+DSVYVQKGYARELEKWFR VAYILFPSFFVELAHKIIFFSTVKISFPVGFP+NSI+FI VLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGVHKLFETS+GS SD V+IFNEHLRIRKQLS+TSHRYRFFIIVCLLVITVSQFVALLLVLASK+DKNFFNSGDVVVCS VQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGN--GSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGY
        ARITHRAQ IAS+ATRWHMLVTSAA+GSN TKSPPLQP V+ KK+ N   SG D+DDTDSDSSSDVLISV  Q+ SSFQTRQALVSYLQHNNGGITLFGY
Subjt:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGN--GSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGY

Query:  ALDRGLLHTLFAFEFSLVMWILSKVVVLS
        ALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  ALDRGLLHTLFAFEFSLVMWILSKVVVLS

KAG7020327.1 hypothetical protein SDJN02_17011, partial [Cucurbita argyrosperma subsp. argyrosperma]3.3e-19484.31Show/hide
Query:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR
        MDSG++NLPLLLQHSY RSKSLTSD+LRNFRISLKW ALDHSS AGKF SYF F  LT +VP VTIV++RDPADDPIS NKLVQVPESGLAA+GF SLCR
Subjt:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFL+GLQ+DS+YVQ+GYAREL+K FRY+AYILFPSFFVELAHKIIFFSTVKISFP GFP+NSI+F+ VL SWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV+KLFETS+GSGSD +VIFNEHLRIRKQL VTSHRYRFFIIVCL +ITVSQFVALLLVL SK+DKNFFNSGD+VVCS VQLCGFFLC+ GA
Subjt:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGYAL
        ARITHRAQAIASVATRWHMLVTSAAAG N TK P   PP  D         D DDTDSDSSS+VLIS+STQ+PSSFQTRQALVSYLQHNNGGITLFG+AL
Subjt:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGYAL

Query:  DRGLLHTLFAFEFSLVMWILSKVVVLS
        DRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  DRGLLHTLFAFEFSLVMWILSKVVVLS

XP_011649751.1 uncharacterized protein LOC101214952 [Cucumis sativus]3.9e-20389.04Show/hide
Query:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR
        MDS ESNLPLLLQHSY RSKS+TSDDLRN+RISLKWCALDHSS AGKFFSY  F  LT +VPAVTI+A++DP DDPISFNKLVQVPESGLAAVGFLSLCR
Subjt:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQ+DSVYVQKGYARELEKWFR VAYILFPSFFVELAHKIIFFSTVKI FPVGFPVNSI+FILVLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGVHKLFETS+GS SD V+IFNEHLRIRKQLS+TSHRYRFFIIVCLLVITVSQFVALLLVLASK+DKNFFNSGDVVVCS VQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPP-LQPPVDDKKDGN--GSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFG
        ARITHRAQ IASVATRWHMLVTSAA+GSN TK PP LQP V+ KK+ N   SG D+DDTDSDSSSDVLISV  ++ SSFQTRQALVSYLQHNNGGITLFG
Subjt:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPP-LQPPVDDKKDGN--GSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFG

Query:  YALDRGLLHTLFAFEFSLVMWILSKVVVL
        YALDRGLLHTLFAFEFSLVM ILSKVVVL
Subjt:  YALDRGLLHTLFAFEFSLVMWILSKVVVL

XP_023002362.1 uncharacterized protein LOC111496225 [Cucurbita maxima]4.3e-19483.84Show/hide
Query:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR
        MDSG++NLPLLLQHSY RSKSLTSD+LRNFRISLKWCALDHSS AGKF SYF F  LT +VP VTIV++RDPADDPIS NKLVQVPESGLAA+GF SLCR
Subjt:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFL+GLQ+DS+YVQ+GYAREL+K FRY+AY+LFPSFF ELAHKIIFFSTVKISFP GFP+NSI+F+ VL SWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV+KLFETS+GSGSD +VIFNEHLRIRKQL VTSHRYRFFIIVCL +ITVSQFVALLLVL SK++KNFFNSGD+VVCS VQLCGFFLC+ GA
Subjt:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGYAL
        ARITHRAQAIASVATRWHMLVTSAAAG N TK P   PP  D         D DDTDSDSSS+VLIS+STQ+PSSFQTRQALVSYLQHNNGGITLFG+AL
Subjt:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGYAL

Query:  DRGLLHTLFAFEFSLVMWILSKVVVLS
        DRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  DRGLLHTLFAFEFSLVMWILSKVVVLS

XP_038884363.1 uncharacterized protein LOC120075225 [Benincasa hispida]2.5e-21392.13Show/hide
Query:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR
        MDSGESNLPLLLQHSY RSKSLTSDDLRNFRISLKWCALDHSS AGKFFSYFTF FLTF+VP V I+A+RDPADDPISFNKLVQVPESGLAAVGFLSLCR
Subjt:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPV---GFPVNSIVFILVLGSWVYRTGVFLLVCVLFR
        FFRRYGLRQLLFLEGLQ+DSVYVQKGYARELEKWFR VAYILFPSFFVELAHKIIFFSTVKISFP+   GFPVNSI+FI VLGSW+YRTGVFLLVCVLFR
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPV---GFPVNSIVFILVLGSWVYRTGVFLLVCVLFR

Query:  LTCELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCL
        LTCELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASK+DKNFFNSGDVVVCSAVQLCGFFLCL
Subjt:  LTCELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCL

Query:  FGAARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKD--GNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITL
        FGAARITHRAQ IASVATRWHMLVTSAA GSNPTKS PL P VDDKKD   N +  DD+DTDSDSSSDVLISV+TQ+PSSFQ RQALVSYLQHNNGGITL
Subjt:  FGAARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKD--GNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITL

Query:  FGYALDRGLLHTLFAFEFSLVMWILSKVVVLS
        FGYALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  FGYALDRGLLHTLFAFEFSLVMWILSKVVVLS

TrEMBL top hitse value%identityAlignment
A0A0A0LS65 Uncharacterized protein1.9e-20389.04Show/hide
Query:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR
        MDS ESNLPLLLQHSY RSKS+TSDDLRN+RISLKWCALDHSS AGKFFSY  F  LT +VPAVTI+A++DP DDPISFNKLVQVPESGLAAVGFLSLCR
Subjt:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQ+DSVYVQKGYARELEKWFR VAYILFPSFFVELAHKIIFFSTVKI FPVGFPVNSI+FILVLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGVHKLFETS+GS SD V+IFNEHLRIRKQLS+TSHRYRFFIIVCLLVITVSQFVALLLVLASK+DKNFFNSGDVVVCS VQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPP-LQPPVDDKKDGN--GSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFG
        ARITHRAQ IASVATRWHMLVTSAA+GSN TK PP LQP V+ KK+ N   SG D+DDTDSDSSSDVLISV  ++ SSFQTRQALVSYLQHNNGGITLFG
Subjt:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPP-LQPPVDDKKDGN--GSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFG

Query:  YALDRGLLHTLFAFEFSLVMWILSKVVVL
        YALDRGLLHTLFAFEFSLVM ILSKVVVL
Subjt:  YALDRGLLHTLFAFEFSLVMWILSKVVVL

A0A1S3BDR7 uncharacterized protein LOC1034885591.2e-18188.02Show/hide
Query:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR
        MDSGESNLPLLLQHSY RSKS+TSDDLRNFRISLKWCALDHSS  GKFFSY  F FLT +VPAVT++A++DP DDPISFNKLVQVPESGLAAV FLSLCR
Subjt:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQ+DSVYVQKGYARELEKWFR VAYILFPSFFVELAHKIIFFSTVKISFPVGFP+NSI+FI VLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGVHKLFETS+GS SD V+IFNEHLRI KQLS+TSHRYRFFIIVCLLVITVSQFVALLLVLASK+DKNFFNSGDVVVCS VQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGN--GSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQAL
        ARITHRAQ IAS+ATRWHMLVTSAA+GSN TKSPPLQP V+ KK+ N   SG D+DDTDSDSSSDVLISV  Q+ SSFQTRQAL
Subjt:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGN--GSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQAL

A0A5A7VCT9 Uncharacterized protein4.4e-20889.51Show/hide
Query:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR
        MDSGESNLPLLLQHSY RSKS+TSDDLRNFRISLKWCALDHSS  GKFFSY  F FLT +VPAVT++A++DP DDPISFNKLVQVPESGLAAV FLSLCR
Subjt:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQ+DSVYVQKGYARELEKWFR VAYILFPSFFVELAHKIIFFSTVKISFPVGFP+NSI+FI VLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGVHKLFETS+GS SD V+IFNEHLRIRKQLS+TSHRYRFFIIVCLLVITVSQFVALLLVLASK+DKNFFNSGDVVVCS VQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGN--GSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGY
        ARITHRAQ IAS+ATRWHMLVTSAA+GSN TKSPPLQP V+ KK+ N   SG D+DDTDSDSSSDVLISV  Q+ SSFQTRQALVSYLQHNNGGITLFGY
Subjt:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGN--GSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGY

Query:  ALDRGLLHTLFAFEFSLVMWILSKVVVLS
        ALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  ALDRGLLHTLFAFEFSLVMWILSKVVVLS

A0A6J1GHK9 uncharacterized protein LOC1114542474.7e-19484.07Show/hide
Query:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR
        MDSG++NLPLLLQHSY RSKSLTSD+LRNFRISLKW ALDHSS AGKF SYF F  LT +VP VTIV++RDPADDPIS NKLVQVPESGLAA+GF SLCR
Subjt:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFL+GLQ+DS+YVQ+GYAREL+K FRY+AYILFPSFFVELAHKIIFFSTVKISFP GFP+NSI+F+ VL SWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV+KLFETS+GSGSD +VIFNEHLRIRKQL VTSHRYRFFIIVCL +ITVSQFVALLL L SK+DKNFFNSGD+VVCS VQLCGFFLC+ GA
Subjt:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGYAL
        ARITHRAQAIASVATRWHMLVTSAAAG N TK P   PP  D         D DDTDSDSSS+VLIS+STQ+PSSFQTRQALVSYLQHNNGGITLFG+AL
Subjt:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGYAL

Query:  DRGLLHTLFAFEFSLVMWILSKVVVLS
        DRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  DRGLLHTLFAFEFSLVMWILSKVVVLS

A0A6J1KJB4 uncharacterized protein LOC1114962252.1e-19483.84Show/hide
Query:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR
        MDSG++NLPLLLQHSY RSKSLTSD+LRNFRISLKWCALDHSS AGKF SYF F  LT +VP VTIV++RDPADDPIS NKLVQVPESGLAA+GF SLCR
Subjt:  MDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFL+GLQ+DS+YVQ+GYAREL+K FRY+AY+LFPSFF ELAHKIIFFSTVKISFP GFP+NSI+F+ VL SWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA
        ELQILRLQGV+KLFETS+GSGSD +VIFNEHLRIRKQL VTSHRYRFFIIVCL +ITVSQFVALLLVL SK++KNFFNSGD+VVCS VQLCGFFLC+ GA
Subjt:  ELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGA

Query:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGYAL
        ARITHRAQAIASVATRWHMLVTSAAAG N TK P   PP  D         D DDTDSDSSS+VLIS+STQ+PSSFQTRQALVSYLQHNNGGITLFG+AL
Subjt:  ARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGYAL

Query:  DRGLLHTLFAFEFSLVMWILSKVVVLS
        DRGLLHTLFAFEFSLVMWI+SKVVVLS
Subjt:  DRGLLHTLFAFEFSLVMWILSKVVVLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)8.2e-6637.39Show/hide
Query:  YGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVT--IVAIRD-PADDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLF
        + R  S   D+L +FR  L+W  +DHSSP     S+  F   T +VPA++  ++A  D  +     ++ +VQ+  S +A V FL L RF  +YGLR+ LF
Subjt:  YGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVT--IVAIRD-PADDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLF

Query:  LEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFP-VGFPV--NSIVFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQG
         + L D+S  V++ Y  +L      V+Y + P F    A+KI ++++     P +G  V  +++  I+ L SW+YRT V  LVCVLFRL C LQILRLQ 
Subjt:  LEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFP-VGFPV--NSIVFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQG

Query:  VHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQA
          KLF+  S  GS    I +EHLRIR+ L + SHRYR FI+  L+++T SQF +LL+   + ++ N + +G++ +CS   +    + L  A++ITH+AQA
Subjt:  VHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQA

Query:  IASVATRWHMLVT------SAAAGSNPTKSPPL-------QPPVDD-----KKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNG
        +  +A +WH+  T      +  +     ++P L          V D     + D +  G ++DD D++     +I V      SFQ RQALVSY ++N+ 
Subjt:  IASVATRWHMLVT------SAAAGSNPTKSPPL-------QPPVDD-----KKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNG

Query:  GITLFGYALDRGLLHTLFAFEFSLVMWILSKVVVLS
        GIT++G+ LDRG LHT+F  E SLV+W+L K + +S
Subjt:  GITLFGYALDRGLLHTLFAFEFSLVMWILSKVVVLS

AT1G50630.2 Protein of unknown function (DUF3537)2.2e-5035.26Show/hide
Query:  YGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVT--IVAIRD-PADDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLF
        + R  S   D+L +FR  L+W  +DHSSP     S+  F   T +VPA++  ++A  D  +     ++ +VQ+  S +A V FL L RF  +YGLR+ LF
Subjt:  YGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVT--IVAIRD-PADDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLF

Query:  LEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFP-VGFPV--NSIVFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQG
         + L D+S  V++ Y  +L      V+Y + P F    A+KI ++++     P +G  V  +++  I+ L SW+YRT V  LVCVLFRL C LQILRLQ 
Subjt:  LEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFP-VGFPV--NSIVFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQG

Query:  VHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQA
          KLF+  S  GS    I +EHLRIR+ L + SHRYR FI+  L+++T SQF +LL+   + ++ N + +G++ +CS   +    + L  A++ITH+AQA
Subjt:  VHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQA

Query:  IASVATRWHMLVT------SAAAGSNPTKSPPL-------QPPVDD-----KKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQH
        +  +A +WH+  T      +  +     ++P L          V D     + D +  G ++DD D++     +I V      SFQ RQAL   +Q+
Subjt:  IASVATRWHMLVT------SAAAGSNPTKSPPL-------QPPVDD-----KKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQH

AT2G21080.1 unknown protein1.9e-12359.03Show/hide
Query:  NLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPI------SFNKLVQVPESGLAAVGFLSLCR
        +LP L       SKSL SDDLRNFR+ LKWCALDHSS  GK  SY  F   T +VP ++ + I+ P + P       SFN LVQ PESGLA +GFL+L  
Subjt:  NLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPI------SFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPV----GFPVNSIVFILVLGSWVYRTGVFLLVCVLF
        FFR Y L +LLFL    DDS  V+ GY+REL+K  RY+AYIL PSF VEL HK IFF + ++SFP        +N ++F LVL SWVYRTGVFLLVC+LF
Subjt:  FFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPV----GFPVNSIVFILVLGSWVYRTGVFLLVCVLF

Query:  RLTCELQILRLQGVHKLFETSSGSGSDVVV-IFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFL
        RLTCELQILR +G+HKLF+     GSD +  +  EH+RI+KQLS TSHRYRFFII   +VI+ SQFVALLLVLASKS+K+F +SGD+VVCSAVQL GFFL
Subjt:  RLTCELQILRLQGVHKLFETSSGSGSDVVV-IFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFL

Query:  CLFGAARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITL
        CL GAARITHRAQ +  +ATRWHM +T A+   +P                      + DTDS  +  + +S S    S FQ RQALV YL+HNN GITL
Subjt:  CLFGAARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITL

Query:  FGYALDRGLLHTLFAFEFSLVMWILSKVVVLS
        +GYALDRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  FGYALDRGLLHTLFAFEFSLVMWILSKVVVLS

AT3G20300.1 Protein of unknown function (DUF3537)2.2e-6636.6Show/hide
Query:  QHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVT--IVAIRD-PADDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQ
        ++ + RS S   D+L +FR  L+W  +D SSP     S+  F   T +VPA +  ++A  D  +     ++ +VQ+  S  AA+ FL L RF  +YGLR+
Subjt:  QHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVT--IVAIRD-PADDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQ

Query:  LLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFP-VGFPV--NSIVFILVLGSWVYRTGVFLLVCVLFRLTCELQILR
         LF + L D+S  V+ GY  +L +  + ++Y + P F    ++KI ++++     P +G  +  +++  ++ L SW+YRT V  LVCVLFRL C LQILR
Subjt:  LLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFP-VGFPV--NSIVFILVLGSWVYRTGVFLLVCVLFRLTCELQILR

Query:  LQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHR
        LQ   ++F+  S  GS    I +EHLRIR+ L + SHRYR FI++ L+++T SQF +LL+   + ++ N + +G++ +CS   +    + L  A++ITH+
Subjt:  LQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHR

Query:  AQAIASVATRWHMLVTSAAAGSNPTKSPPLQP--------PVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGY
        AQA+  +A +WH+  T  +  +   ++P L          P DD  +G     D  D + D  ++ LI        SFQ RQALV+Y ++N  GIT+FG+
Subjt:  AQAIASVATRWHMLVTSAAAGSNPTKSPPLQP--------PVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGY

Query:  ALDRGLLHTLFAFEFSLVMWILSKVVVLS
         LDR  LHT+F  E SLV+W+L K + +S
Subjt:  ALDRGLLHTLFAFEFSLVMWILSKVVVLS

AT4G22270.1 Protein of unknown function (DUF3537)3.2e-4631.31Show/hide
Query:  DHSSPAGKFFSYFTFAFLTFIVPAVT---IVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFR
        D S+      S+  F  L  IVP ++   +V           ++ +VQ+  S  A + F+SL  + R++G+R+ LFL+ L D S  V+  Y  E+++  +
Subjt:  DHSSPAGKFFSYFTFAFLTFIVPAVT---IVAIRDPADDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFR

Query:  YVAYILFPSFFVELAHKIIFFSTVKISFP-VGFPVNS--IVFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQGVHKLFETSSGSGSDVVVIFNEHLR
         +   + PS  +E  ++I ++ +     P +  P+ S  +   L L SW+YR  +F++VC+L+++TC LQ LRL    + F +     +DV     EH +
Subjt:  YVAYILFPSFFVELAHKIIFFSTVKISFP-VGFPVNS--IVFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQGVHKLFETSSGSGSDVVVIFNEHLR

Query:  IRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQAIASVATRWHMLVT----SAAAGSN
        IR+ L + SHR+R FI++ L+++T +QF+ALL    +    N +  G++ +CS   + G F+CL  A +ITH+AQ++ S+A +W++  T        G  
Subjt:  IRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQAIASVATRWHMLVT----SAAAGSN

Query:  PTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGYALDRGLLHTLFAFEFSLVMWILSKVVV
        PT S  ++  V  + +   +  D++    D   +  I        S+Q RQALV+YL++N  GIT++G+ +DR  L+T+F  E +L++W+L+K +V
Subjt:  PTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGITLFGYALDRGLLHTLFAFEFSLVMWILSKVVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAATAAATCTCACCGCCAACATTATTCACTTTCCAATTTTCAATCAAAAAAATCTCTCCGCGCCGCTTCCTCCGCCATGGACTCCGGCGAATCAAACCTC
CCTCTCCTCCTCCAACACTCATATGGCAGATCCAAATCCCTAACCTCCGACGACCTTCGCAACTTCCGAATCAGCCTCAAATGGTGCGCTCTCGACCACTCCTCC
CCCGCCGGAAAGTTCTTCTCCTACTTTACCTTCGCTTTCCTGACCTTCATCGTCCCCGCCGTCACGATCGTCGCCATCCGAGATCCAGCCGACGATCCGATCTCG
TTCAATAAACTGGTTCAGGTTCCGGAATCCGGACTGGCGGCGGTCGGATTCTTGAGCCTGTGCCGATTCTTCCGGCGGTATGGACTCCGGCAGCTTCTGTTCTTG
GAGGGATTGCAAGATGATTCTGTGTATGTGCAGAAAGGGTATGCGCGTGAACTGGAGAAGTGGTTCCGTTACGTTGCTTACATTCTTTTTCCGTCGTTCTTTGTT
GAGTTGGCCCACAAGATCATATTTTTCTCCACCGTCAAGATTTCATTTCCGGTAGGATTTCCAGTGAACTCGATTGTGTTCATATTGGTGTTGGGATCGTGGGTA
TACCGAACCGGGGTGTTCTTGCTCGTGTGCGTATTGTTTCGGCTCACATGCGAGTTGCAAATTCTTAGGCTACAAGGGGTTCACAAGCTGTTCGAAACAAGCAGC
GGGTCGGGATCGGACGTGGTGGTGATCTTCAACGAACATTTGAGGATCAGAAAACAATTATCGGTTACGAGCCACCGGTACCGGTTCTTCATAATTGTGTGCTTG
CTTGTTATTACGGTTAGCCAGTTCGTTGCGTTGCTCCTGGTTTTGGCCTCTAAAAGTGACAAAAATTTCTTCAACTCCGGCGATGTTGTGGTTTGCTCGGCTGTT
CAATTATGCGGATTCTTCTTGTGCTTGTTTGGAGCTGCGAGAATTACTCACCGAGCACAAGCAATCGCATCGGTTGCAACCCGATGGCACATGCTAGTCACCTCC
GCCGCCGCCGGATCCAACCCGACCAAATCTCCACCGCTACAGCCGCCTGTTGACGACAAGAAAGACGGTAATGGTAGTGGTGCCGATGACGACGACACCGACTCG
GATTCGTCGTCGGATGTTTTAATTTCAGTGAGCACACAACAACCTTCTTCCTTCCAAACCAGACAAGCTTTAGTGTCATATTTGCAACACAATAATGGGGGAATT
ACACTGTTTGGATATGCATTGGATCGAGGATTGCTTCATACACTCTTCGCATTTGAATTCTCTCTCGTCATGTGGATTTTGAGTAAGGTTGTCGTATTATCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTAATAAATCTCACCGCCAACATTATTCACTTTCCAATTTTCAATCAAAAAAATCTCTCCGCGCCGCTTCCTCCGCCATGGACTCCGGCGAATCAAACCTC
CCTCTCCTCCTCCAACACTCATATGGCAGATCCAAATCCCTAACCTCCGACGACCTTCGCAACTTCCGAATCAGCCTCAAATGGTGCGCTCTCGACCACTCCTCC
CCCGCCGGAAAGTTCTTCTCCTACTTTACCTTCGCTTTCCTGACCTTCATCGTCCCCGCCGTCACGATCGTCGCCATCCGAGATCCAGCCGACGATCCGATCTCG
TTCAATAAACTGGTTCAGGTTCCGGAATCCGGACTGGCGGCGGTCGGATTCTTGAGCCTGTGCCGATTCTTCCGGCGGTATGGACTCCGGCAGCTTCTGTTCTTG
GAGGGATTGCAAGATGATTCTGTGTATGTGCAGAAAGGGTATGCGCGTGAACTGGAGAAGTGGTTCCGTTACGTTGCTTACATTCTTTTTCCGTCGTTCTTTGTT
GAGTTGGCCCACAAGATCATATTTTTCTCCACCGTCAAGATTTCATTTCCGGTAGGATTTCCAGTGAACTCGATTGTGTTCATATTGGTGTTGGGATCGTGGGTA
TACCGAACCGGGGTGTTCTTGCTCGTGTGCGTATTGTTTCGGCTCACATGCGAGTTGCAAATTCTTAGGCTACAAGGGGTTCACAAGCTGTTCGAAACAAGCAGC
GGGTCGGGATCGGACGTGGTGGTGATCTTCAACGAACATTTGAGGATCAGAAAACAATTATCGGTTACGAGCCACCGGTACCGGTTCTTCATAATTGTGTGCTTG
CTTGTTATTACGGTTAGCCAGTTCGTTGCGTTGCTCCTGGTTTTGGCCTCTAAAAGTGACAAAAATTTCTTCAACTCCGGCGATGTTGTGGTTTGCTCGGCTGTT
CAATTATGCGGATTCTTCTTGTGCTTGTTTGGAGCTGCGAGAATTACTCACCGAGCACAAGCAATCGCATCGGTTGCAACCCGATGGCACATGCTAGTCACCTCC
GCCGCCGCCGGATCCAACCCGACCAAATCTCCACCGCTACAGCCGCCTGTTGACGACAAGAAAGACGGTAATGGTAGTGGTGCCGATGACGACGACACCGACTCG
GATTCGTCGTCGGATGTTTTAATTTCAGTGAGCACACAACAACCTTCTTCCTTCCAAACCAGACAAGCTTTAGTGTCATATTTGCAACACAATAATGGGGGAATT
ACACTGTTTGGATATGCATTGGATCGAGGATTGCTTCATACACTCTTCGCATTTGAATTCTCTCTCGTCATGTGGATTTTGAGTAAGGTTGTCGTATTATCCTAG
ACTTGCACCAAAAACGTAATTATTATTATTGTTATTATTACTACTTAACAAAGTTTTAAGTTTTTAGGTCAAGTAATCGTATATGAAATATTATGTTTTATTCAA
ATGTATATCCAAGTAGATAAAACATCAATCATTACAGCAAATGTCGATAGTTCAATTTAACATTATGCAATATTGTAGTACGTTTTGTAATTTAAAATTAGATAA
AAATACTATTTTAGTTCTAGTATTTTGAAACTTCTTTATATATATATATATATATATCATCAAATTTTAGTTTGTGTACATTCAATAAAACTTCATAAGTTTCAC
GTTGT
Protein sequenceShow/hide protein sequence
MTNKSHRQHYSLSNFQSKKSLRAASSAMDSGESNLPLLLQHSYGRSKSLTSDDLRNFRISLKWCALDHSSPAGKFFSYFTFAFLTFIVPAVTIVAIRDPADDPIS
FNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLFLEGLQDDSVYVQKGYARELEKWFRYVAYILFPSFFVELAHKIIFFSTVKISFPVGFPVNSIVFILVLGSWV
YRTGVFLLVCVLFRLTCELQILRLQGVHKLFETSSGSGSDVVVIFNEHLRIRKQLSVTSHRYRFFIIVCLLVITVSQFVALLLVLASKSDKNFFNSGDVVVCSAV
QLCGFFLCLFGAARITHRAQAIASVATRWHMLVTSAAAGSNPTKSPPLQPPVDDKKDGNGSGADDDDTDSDSSSDVLISVSTQQPSSFQTRQALVSYLQHNNGGI
TLFGYALDRGLLHTLFAFEFSLVMWILSKVVVLS