; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G5031 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G5031
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionProtein of unknown function (DUF3537)
Genome locationctg1227:2203234..2205366
RNA-Seq ExpressionCucsat.G5031
SyntenyCucsat.G5031
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065010.1 uncharacterized protein E6C27_scaffold82G003340 [Cucumis melo var. makuwa]1.20e-28596.04Show/hide
Query:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
        MDS ESNLPLLLQHSYCRSKSITSDDLRN+RISLKWCALDHSSA GKFFSYL FT LTIVVPAVT+LAVQDPGDDPISFNKLVQVPESGLAAV FLSLCR
Subjt:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKI FPVGFP+NSILFI VLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
        ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASK DKNFFNSGDVVVCSTVQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA

Query:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG
        ARITHRAQGIAS+ATRWHMLVTSAASGSNSTK PPL QP VNHKKNNN+SSSGEDEDDTDSDSSSDVLISVRP+ESSSFQTRQALVSYLQHNNGGITLFG
Subjt:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG

Query:  YALDRGLLHTLFAFEFSLVMWILSKVVVL
        YALDRGLLHTLFAFEFSLVMWILSKVVVL
Subjt:  YALDRGLLHTLFAFEFSLVMWILSKVVVL

KAG7020327.1 hypothetical protein SDJN02_17011, partial [Cucurbita argyrosperma subsp. argyrosperma]2.46e-24181.82Show/hide
Query:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
        MDS ++NLPLLLQHSYCRSKS+TSD+LRN+RISLKW ALDHSSAAGKF SY  F +LT+VVP VTI++V+DP DDPIS NKLVQVPESGLAA+GF SLCR
Subjt:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFL+GLQEDS+YVQ+GYAREL+K FR +AYILFPSFFVELAHKIIFFSTVKI FP GFP+NSILF+ VL SWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
        ELQILRLQGV+KLFETSNGS SD+++IFNEHLRIRKQL +TSHRYRFFIIVCL +ITVSQFVALLLVL SKTDKNFFNSGD+VVCSTVQLCGFFLC+ GA
Subjt:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA

Query:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG
        ARITHRAQ IASVATRWHMLVTSAA+G N TKPP    P V++           D DDTDSDSSS+VLIS+  +E SSFQTRQALVSYLQHNNGGITLFG
Subjt:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG

Query:  YALDRGLLHTLFAFEFSLVMWILSKVVVL
        +ALDRGLLHTLFAFEFSLVMWI+SKVVVL
Subjt:  YALDRGLLHTLFAFEFSLVMWILSKVVVL

XP_008445578.1 PREDICTED: uncharacterized protein LOC103488559 [Cucumis melo]3.88e-25395.58Show/hide
Query:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
        MDS ESNLPLLLQHSYCRSKSITSDDLRN+RISLKWCALDHSSA GKFFSYL FT LTIVVPAVT+LAVQDPGDDPISFNKLVQVPESGLAAV FLSLCR
Subjt:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKI FPVGFP+NSILFI VLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
        ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRI KQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASK DKNFFNSGDVVVCSTVQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA

Query:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQAL
        ARITHRAQGIAS+ATRWHMLVTSAASGSNSTK PPL QPLVNHKKNNN+SSSGEDEDDTDSDSSSDVLISVRP+ESSSFQTRQAL
Subjt:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQAL

XP_011649751.1 uncharacterized protein LOC101214952 [Cucumis sativus]1.82e-29699.77Show/hide
Query:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
        MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
Subjt:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
        ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA

Query:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG
        ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG
Subjt:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG

Query:  YALDRGLLHTLFAFEFSLVMWILSKVVVL
        YALDRGLLHTLFAFEFSLVM ILSKVVVL
Subjt:  YALDRGLLHTLFAFEFSLVMWILSKVVVL

XP_038884363.1 uncharacterized protein LOC120075225 [Benincasa hispida]2.72e-26289.12Show/hide
Query:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
        MDS ESNLPLLLQHSYCRSKS+TSDDLRN+RISLKWCALDHSSAAGKFFSY  FT LT VVP V ILAV+DP DDPISFNKLVQVPESGLAAVGFLSLCR
Subjt:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPV---GFPVNSILFILVLGSWVYRTGVFLLVCVLFR
        FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFR VAYILFPSFFVELAHKIIFFSTVKI FP+   GFPVNSILFI VLGSW+YRTGVFLLVCVLFR
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPV---GFPVNSILFILVLGSWVYRTGVFLLVCVLFR

Query:  LTCELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCL
        LTCELQILRLQGVHKLFETS+GS SD V+IFNEHLRIRKQLS+TSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCS VQLCGFFLCL
Subjt:  LTCELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCL

Query:  FGAARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGIT
        FGAARITHRAQGIASVATRWHMLVTSAA+GSN TK  PL  P V+ KK+NN++S+ +D++DTDSDSSSDVLISV  +E SSFQ RQALVSYLQHNNGGIT
Subjt:  FGAARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGIT

Query:  LFGYALDRGLLHTLFAFEFSLVMWILSKVVVL
        LFGYALDRGLLHTLFAFEFSLVMWILSKVVVL
Subjt:  LFGYALDRGLLHTLFAFEFSLVMWILSKVVVL

TrEMBL top hitse value%identityAlignment
A0A0A0LS65 Uncharacterized protein8.80e-29799.77Show/hide
Query:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
        MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
Subjt:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
        ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA

Query:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG
        ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG
Subjt:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG

Query:  YALDRGLLHTLFAFEFSLVMWILSKVVVL
        YALDRGLLHTLFAFEFSLVM ILSKVVVL
Subjt:  YALDRGLLHTLFAFEFSLVMWILSKVVVL

A0A1S3BDR7 uncharacterized protein LOC1034885591.88e-25395.58Show/hide
Query:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
        MDS ESNLPLLLQHSYCRSKSITSDDLRN+RISLKWCALDHSSA GKFFSYL FT LTIVVPAVT+LAVQDPGDDPISFNKLVQVPESGLAAV FLSLCR
Subjt:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKI FPVGFP+NSILFI VLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
        ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRI KQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASK DKNFFNSGDVVVCSTVQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA

Query:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQAL
        ARITHRAQGIAS+ATRWHMLVTSAASGSNSTK PPL QPLVNHKKNNN+SSSGEDEDDTDSDSSSDVLISVRP+ESSSFQTRQAL
Subjt:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQAL

A0A5A7VCT9 Uncharacterized protein5.80e-28696.04Show/hide
Query:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
        MDS ESNLPLLLQHSYCRSKSITSDDLRN+RISLKWCALDHSSA GKFFSYL FT LTIVVPAVT+LAVQDPGDDPISFNKLVQVPESGLAAV FLSLCR
Subjt:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKI FPVGFP+NSILFI VLGSWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
        ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASK DKNFFNSGDVVVCSTVQLCGFFLCLFGA
Subjt:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA

Query:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG
        ARITHRAQGIAS+ATRWHMLVTSAASGSNSTK PPL QP VNHKKNNN+SSSGEDEDDTDSDSSSDVLISVRP+ESSSFQTRQALVSYLQHNNGGITLFG
Subjt:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG

Query:  YALDRGLLHTLFAFEFSLVMWILSKVVVL
        YALDRGLLHTLFAFEFSLVMWILSKVVVL
Subjt:  YALDRGLLHTLFAFEFSLVMWILSKVVVL

A0A6J1GHK9 uncharacterized protein LOC1114542474.85e-24181.59Show/hide
Query:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
        MDS ++NLPLLLQHSYCRSKS+TSD+LRN+RISLKW ALDHSSAAGKF SY  F +LT+VVP VTI++V+DP DDPIS NKLVQVPESGLAA+GF SLCR
Subjt:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFL+GLQEDS+YVQ+GYAREL+K FR +AYILFPSFFVELAHKIIFFSTVKI FP GFP+NSILF+ VL SWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
        ELQILRLQGV+KLFETSNGS SD+++IFNEHLRIRKQL +TSHRYRFFIIVCL +ITVSQFVALLL L SKTDKNFFNSGD+VVCSTVQLCGFFLC+ GA
Subjt:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA

Query:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG
        ARITHRAQ IASVATRWHMLVTSAA+G N TKPP    P V++           D DDTDSDSSS+VLIS+  +E SSFQTRQALVSYLQHNNGGITLFG
Subjt:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG

Query:  YALDRGLLHTLFAFEFSLVMWILSKVVVL
        +ALDRGLLHTLFAFEFSLVMWI+SKVVVL
Subjt:  YALDRGLLHTLFAFEFSLVMWILSKVVVL

A0A6J1KJB4 uncharacterized protein LOC1114962251.69e-24181.35Show/hide
Query:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR
        MDS ++NLPLLLQHSYCRSKS+TSD+LRN+RISLKWCALDHSSAAGKF SY  F +LT+VVP VTI++V+DP DDPIS NKLVQVPESGLAA+GF SLCR
Subjt:  MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC
        FFRRYGLRQLLFL+GLQEDS+YVQ+GYAREL+K FR +AY+LFPSFF ELAHKIIFFSTVKI FP GFP+NSILF+ VL SWVYRTGVFLLVCVLFRLTC
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTC

Query:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA
        ELQILRLQGV+KLFETSNGS SD+++IFNEHLRIRKQL +TSHRYRFFIIVCL +ITVSQFVALLLVL SKT+KNFFNSGD+VVCSTVQLCGFFLC+ GA
Subjt:  ELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGA

Query:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG
        ARITHRAQ IASVATRWHMLVTSAA+G N TKPP    P V++           D DDTDSDSSS+VLIS+  +E SSFQTRQALVSYLQHNNGGITLFG
Subjt:  ARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFG

Query:  YALDRGLLHTLFAFEFSLVMWILSKVVVL
        +ALDRGLLHTLFAFEFSLVMWI+SKVVVL
Subjt:  YALDRGLLHTLFAFEFSLVMWILSKVVVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)2.4e-6738.05Show/hide
Query:  RSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVT--ILAVQD-PGDDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLFLE
        R  S   D+L ++R  L+W  +DHSS      S+ +F + T+VVPA++  +LA  D        ++ +VQ+  S +A V FL L RF  +YGLR+ LF +
Subjt:  RSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVT--ILAVQD-PGDDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLFLE

Query:  GLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFP-VGFPV--NSILFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQGVH
         L ++S  V++ Y  +L      V+Y + P F    A+KI ++++     P +G  V  +++  I+ L SW+YRT V  LVCVLFRL C LQILRLQ   
Subjt:  GLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFP-VGFPV--NSILFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQGVH

Query:  KLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGAARITHRAQGIA
        KLF+     +SD   I +EHLRIR+ L I SHRYR FI+  L+++T SQF +LL+   + T+ N + +G++ +CS   +    + L  A++ITH+AQ + 
Subjt:  KLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGAARITHRAQGIA

Query:  SVATRWHMLVT-----SAASGSNSTKPPPLLQPLVNHKKNN----------NSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGI
         +A +WH+  T           + T   P L    N+  NN          +S   G++EDD D++     +I V    + SFQ RQALVSY ++N+ GI
Subjt:  SVATRWHMLVT-----SAASGSNSTKPPPLLQPLVNHKKNN----------NSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGI

Query:  TLFGYALDRGLLHTLFAFEFSLVMWILSKVV
        T++G+ LDRG LHT+F  E SLV+W+L K +
Subjt:  TLFGYALDRGLLHTLFAFEFSLVMWILSKVV

AT1G50630.2 Protein of unknown function (DUF3537)4.9e-5235.95Show/hide
Query:  RSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVT--ILAVQD-PGDDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLFLE
        R  S   D+L ++R  L+W  +DHSS      S+ +F + T+VVPA++  +LA  D        ++ +VQ+  S +A V FL L RF  +YGLR+ LF +
Subjt:  RSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVT--ILAVQD-PGDDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLFLE

Query:  GLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFP-VGFPV--NSILFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQGVH
         L ++S  V++ Y  +L      V+Y + P F    A+KI ++++     P +G  V  +++  I+ L SW+YRT V  LVCVLFRL C LQILRLQ   
Subjt:  GLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFP-VGFPV--NSILFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQGVH

Query:  KLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGAARITHRAQGIA
        KLF+     +SD   I +EHLRIR+ L I SHRYR FI+  L+++T SQF +LL+   + T+ N + +G++ +CS   +    + L  A++ITH+AQ + 
Subjt:  KLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGAARITHRAQGIA

Query:  SVATRWHMLVT-----SAASGSNSTKPPPLLQPLVNHKKNN----------NSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQH
         +A +WH+  T           + T   P L    N+  NN          +S   G++EDD D++     +I V    + SFQ RQAL   +Q+
Subjt:  SVATRWHMLVT-----SAASGSNSTKPPPLLQPLVNHKKNN----------NSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQH

AT2G21080.1 unknown protein1.4e-12057.24Show/hide
Query:  NLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPI------SFNKLVQVPESGLAAVGFLSLCR
        +LP L       SKS+ SDDLRN+R+ LKWCALDHSS+ GK  SY++F + T++VP ++ L ++ P + P       SFN LVQ PESGLA +GFL+L  
Subjt:  NLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPI------SFNKLVQVPESGLAAVGFLSLCR

Query:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPV----GFPVNSILFILVLGSWVYRTGVFLLVCVLF
        FFR Y L +LLFL    +DS  V+ GY+REL+K  R +AYIL PSF VEL HK IFF + ++ FP        +N ++F LVL SWVYRTGVFLLVC+LF
Subjt:  FFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPV----GFPVNSILFILVLGSWVYRTGVFLLVCVLF

Query:  RLTCELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLC
        RLTCELQILR +G+HKLF+       + V    EH+RI+KQLS TSHRYRFFII   +VI+ SQFVALLLVLASK++K+F +SGD+VVCS VQL GFFLC
Subjt:  RLTCELQILRLQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLC

Query:  LFGAARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPR--ESSSFQTRQALVSYLQHNNG
        L GAARITHRAQG+  +ATRWHM +T A   S +  P                        ++D+DSS ++ I+V P    SS FQ RQALV YL+HNN 
Subjt:  LFGAARITHRAQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPR--ESSSFQTRQALVSYLQHNNG

Query:  GITLFGYALDRGLLHTLFAFEFSLVMWILSKVVVL
        GITL+GYALDRGLLHTLFAFEFSLVMWILSKVVVL
Subjt:  GITLFGYALDRGLLHTLFAFEFSLVMWILSKVVVL

AT3G20300.1 Protein of unknown function (DUF3537)4.5e-6636.24Show/hide
Query:  QHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVT--ILAVQD-PGDDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQ
        ++ + RS S   D+L ++R  L+W  +D SS      S+ +F + T+VVPA +  +LA  D        ++ +VQ+  S  AA+ FL L RF  +YGLR+
Subjt:  QHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVT--ILAVQD-PGDDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQ

Query:  LLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFP-VGFPV--NSILFILVLGSWVYRTGVFLLVCVLFRLTCELQILR
         LF + L ++S  V+ GY  +L +  + ++Y + P F    ++KI ++++     P +G  +  +++  ++ L SW+YRT V  LVCVLFRL C LQILR
Subjt:  LLFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFP-VGFPV--NSILFILVLGSWVYRTGVFLLVCVLFRLTCELQILR

Query:  LQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGAARITHR
        LQ   ++F+     +SD   I +EHLRIR+ L I SHRYR FI++ L+++T SQF +LL+   +  + N + +G++ +CS   +    + L  A++ITH+
Subjt:  LQGVHKLFETSNGSESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGAARITHR

Query:  AQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNH----KKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFGYA
        AQ +  +A +WH+  T  +  +   + P L+     H      ++N  S  ED  D + D  ++ LI      + SFQ RQALV+Y ++N  GIT+FG+ 
Subjt:  AQGIASVATRWHMLVTSAASGSNSTKPPPLLQPLVNH----KKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFGYA

Query:  LDRGLLHTLFAFEFSLVMWILSKVV
        LDR  LHT+F  E SLV+W+L K +
Subjt:  LDRGLLHTLFAFEFSLVMWILSKVV

AT4G22270.1 Protein of unknown function (DUF3537)1.9e-4831.33Show/hide
Query:  DHSSAAGKFFSYLVFTLLTIVVPAVT--ILAVQD-PGDDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFR
        D S+      S+ VF LL ++VP ++  +L   D        ++ +VQ+  S  A + F+SL  + R++G+R+ LFL+ L + S  V+  Y  E+++  +
Subjt:  DHSSAAGKFFSYLVFTLLTIVVPAVT--ILAVQD-PGDDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQLLFLEGLQEDSVYVQKGYARELEKWFR

Query:  SVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILV-----LGSWVYRTGVFLLVCVLFRLTCELQILRLQGVHKLFETSNGSESDSVLIFNEH
         +   + PS  +E  ++I ++  +  +  + + +N IL  +V     L SW+YR  +F++VC+L+++TC LQ LRL    + F +     +D      EH
Subjt:  SVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILV-----LGSWVYRTGVFLLVCVLFRLTCELQILRLQGVHKLFETSNGSESDSVLIFNEH

Query:  LRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGAARITHRAQGIASVATRWHMLVT--SAASGSN
         +IR+ L I SHR+R FI++ L+++T +QF+ALL    +    N +  G++ +CS   + G F+CL  A +ITH+AQ + S+A +W++  T  S      
Subjt:  LRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGAARITHRAQGIASVATRWHMLVT--SAASGSN

Query:  STKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFGYALDRGLLHTLFAFEFSLVMWILSKVVV
         T    +++  V+ +   N+  + +DE+    D   +  I      + S+Q RQALV+YL++N  GIT++G+ +DR  L+T+F  E +L++W+L+K +V
Subjt:  STKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFGYALDRGLLHTLFAFEFSLVMWILSKVVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCCAGCGAATCAAACCTCCCTCTCCTCCTCCAACATTCTTACTGCAGGTCCAAATCCATAACCTCCGATGATCTACGCAACTATCGAATCAGCTTAAAATGGTG
CGCTCTTGACCACTCCTCCGCCGCCGGAAAGTTCTTCTCCTACCTCGTCTTCACTCTCCTGACCATCGTCGTCCCTGCCGTGACGATCCTCGCCGTCCAAGATCCGGGGG
ACGATCCGATCTCCTTCAATAAACTGGTTCAGGTTCCGGAATCGGGACTGGCGGCGGTGGGATTCTTGAGCCTGTGTCGATTCTTCCGGCGGTACGGACTCCGGCAGCTT
CTGTTCCTGGAAGGATTGCAAGAGGATTCGGTGTACGTACAGAAAGGGTATGCGCGTGAATTGGAGAAATGGTTCCGTTCCGTTGCGTATATTCTTTTCCCGTCGTTCTT
CGTTGAGTTGGCCCACAAAATCATATTCTTCTCCACCGTTAAGATTTATTTTCCGGTAGGATTTCCGGTAAATTCGATCTTGTTCATATTAGTACTGGGGTCGTGGGTGT
ACAGAACTGGAGTGTTTTTGCTCGTGTGTGTATTGTTTCGGCTCACATGTGAGTTACAGATTCTTAGACTTCAAGGGGTTCACAAGCTATTTGAAACGAGCAACGGGTCA
GAATCGGACTCGGTGTTGATCTTCAACGAACATTTGAGGATCAGAAAACAGTTATCAATTACGAGTCACCGGTACCGGTTCTTCATAATTGTGTGTTTGCTCGTTATTAC
GGTTAGCCAGTTCGTCGCATTGCTCTTGGTTTTGGCCTCTAAAACTGACAAAAATTTCTTCAACTCCGGCGATGTTGTGGTTTGCTCAACGGTTCAATTATGTGGATTCT
TCTTGTGTTTGTTTGGAGCTGCAAGAATTACTCACAGAGCCCAAGGAATCGCTTCTGTTGCAACCCGATGGCACATGCTAGTCACCTCCGCTGCCTCTGGATCCAACTCA
ACCAAACCTCCGCCACTACTACAACCGCTCGTTAACCACAAGAAAAACAACAACAGTAGCAGTAGCGGCGAGGATGAAGACGATACCGATTCTGATTCTTCATCGGACGT
TTTAATTTCAGTAAGGCCACGAGAATCTTCTTCCTTCCAAACCAGACAAGCTTTAGTGTCATATTTACAACACAACAATGGAGGAATTACATTGTTTGGATATGCATTAG
ATCGAGGATTGCTTCATACACTCTTTGCATTTGAATTCTCACTCGTCATGTGGATTTTGAGTAAGGTTGTCGTATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTCCAGCGAATCAAACCTCCCTCTCCTCCTCCAACATTCTTACTGCAGGTCCAAATCCATAACCTCCGATGATCTACGCAACTATCGAATCAGCTTAAAATGGTG
CGCTCTTGACCACTCCTCCGCCGCCGGAAAGTTCTTCTCCTACCTCGTCTTCACTCTCCTGACCATCGTCGTCCCTGCCGTGACGATCCTCGCCGTCCAAGATCCGGGGG
ACGATCCGATCTCCTTCAATAAACTGGTTCAGGTTCCGGAATCGGGACTGGCGGCGGTGGGATTCTTGAGCCTGTGTCGATTCTTCCGGCGGTACGGACTCCGGCAGCTT
CTGTTCCTGGAAGGATTGCAAGAGGATTCGGTGTACGTACAGAAAGGGTATGCGCGTGAATTGGAGAAATGGTTCCGTTCCGTTGCGTATATTCTTTTCCCGTCGTTCTT
CGTTGAGTTGGCCCACAAAATCATATTCTTCTCCACCGTTAAGATTTATTTTCCGGTAGGATTTCCGGTAAATTCGATCTTGTTCATATTAGTACTGGGGTCGTGGGTGT
ACAGAACTGGAGTGTTTTTGCTCGTGTGTGTATTGTTTCGGCTCACATGTGAGTTACAGATTCTTAGACTTCAAGGGGTTCACAAGCTATTTGAAACGAGCAACGGGTCA
GAATCGGACTCGGTGTTGATCTTCAACGAACATTTGAGGATCAGAAAACAGTTATCAATTACGAGTCACCGGTACCGGTTCTTCATAATTGTGTGTTTGCTCGTTATTAC
GGTTAGCCAGTTCGTCGCATTGCTCTTGGTTTTGGCCTCTAAAACTGACAAAAATTTCTTCAACTCCGGCGATGTTGTGGTTTGCTCAACGGTTCAATTATGTGGATTCT
TCTTGTGTTTGTTTGGAGCTGCAAGAATTACTCACAGAGCCCAAGGAATCGCTTCTGTTGCAACCCGATGGCACATGCTAGTCACCTCCGCTGCCTCTGGATCCAACTCA
ACCAAACCTCCGCCACTACTACAACCGCTCGTTAACCACAAGAAAAACAACAACAGTAGCAGTAGCGGCGAGGATGAAGACGATACCGATTCTGATTCTTCATCGGACGT
TTTAATTTCAGTAAGGCCACGAGAATCTTCTTCCTTCCAAACCAGACAAGCTTTAGTGTCATATTTACAACACAACAATGGAGGAATTACATTGTTTGGATATGCATTAG
ATCGAGGATTGCTTCATACACTCTTTGCATTTGAATTCTCACTCGTCATGTGGATTTTGAGTAAGGTTGTCGTATTATGA
Protein sequenceShow/hide protein sequence
MDSSESNLPLLLQHSYCRSKSITSDDLRNYRISLKWCALDHSSAAGKFFSYLVFTLLTIVVPAVTILAVQDPGDDPISFNKLVQVPESGLAAVGFLSLCRFFRRYGLRQL
LFLEGLQEDSVYVQKGYARELEKWFRSVAYILFPSFFVELAHKIIFFSTVKIYFPVGFPVNSILFILVLGSWVYRTGVFLLVCVLFRLTCELQILRLQGVHKLFETSNGS
ESDSVLIFNEHLRIRKQLSITSHRYRFFIIVCLLVITVSQFVALLLVLASKTDKNFFNSGDVVVCSTVQLCGFFLCLFGAARITHRAQGIASVATRWHMLVTSAASGSNS
TKPPPLLQPLVNHKKNNNSSSSGEDEDDTDSDSSSDVLISVRPRESSSFQTRQALVSYLQHNNGGITLFGYALDRGLLHTLFAFEFSLVMWILSKVVVL