; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG06G006580 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG06G006580
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionProtein of unknown function (DUF506)
Genome locationCG_Chr06:7570484..7572932
RNA-Seq ExpressionClCG06G006580
SyntenyClCG06G006580
Gene Ontology termsNA
InterPro domainsIPR006502 - Protein of unknown function PDDEXK-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574951.1 hypothetical protein SDJN03_25590, partial [Cucurbita argyrosperma subsp. sororia]1.7e-19790.24Show/hide
Query:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV
        MEIRGKMKIQPIDIDPPTGR AIRADPGKPVLKSRLRKLFDRP PNVLKNS+AEKPIAAGEAAQFIINKDG SEFEPSSICLAKMVQSFIEESNEKQLSV
Subjt:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV

Query:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS
        ATA KNGRNRCNCFNGNNN+SSDDESDDFGGGFGE V IGSS ADVSDLLKSLIL ASVAERNLLADTAKI EKNNKIHK+KDDLRKVVTDGLSSLGYDS
Subjt:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS

Query:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
        SICKSKWDK PSHPAGEYEYIDV+VEGERL+IDIDFRSEFEIARSTGMYKAILQLLPNIF+GK DRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
Subjt:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA

Query:  KWLSPHDRSKPPNPSVKEIEITNMNQNEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVT
        KWLSPH RSKPP PS KEIE    + NEQSPTETDCG+ E+IFGDE TT + P+S SIASS  PPQKG   GEKAAVAVTAWQPPAIKPKSLDRGAKIVT
Subjt:  KWLSPHDRSKPPNPSVKEIEITNMNQNEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVT

Query:  GLASILKENP
        GLASILKENP
Subjt:  GLASILKENP

XP_004150569.1 uncharacterized protein LOC101219203 [Cucumis sativus]6.2e-20891.77Show/hide
Query:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV
        MEIRGKMKIQPIDIDPPTGR AIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIA GEAAQFIINKDG+SEFEPSSICLAKMVQSFIEESNEKQLSV
Subjt:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV

Query:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS
        ATA KNGRNRCNCFNGNNN+SSDDESDDFGGGFGE V IGSS ADV DLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSS+GYD+
Subjt:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS

Query:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
        SICKSKW+KSPSHPAGEYEYIDVMVE ERLVIDIDFRSEFEIARSTGMYK ILQL+PNIFVGKTDRLGQI SIVSEAARQSLKKKGMHFPPWRKAEYMRA
Subjt:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA

Query:  KWLSPHDRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAK
        KWLSPH RSKPPNPSVKE E+ NMN+   NE+SPTETDCGELELIFGDE T  TS +SNSIASS PPPQ+GLYGG+KAAV VTAWQPPAIKPKSLDRGAK
Subjt:  KWLSPHDRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAK

Query:  IVTGLASILKENP
        IVTGLASILKENP
Subjt:  IVTGLASILKENP

XP_008449296.1 PREDICTED: uncharacterized protein LOC103491218 isoform X1 [Cucumis melo]5.0e-21092.74Show/hide
Query:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV
        MEIRGKMK+QPIDIDPPTGR AIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDG SEFEPSSICLAKMVQSFIEESNEKQLSV
Subjt:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV

Query:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS
        ATA KNGRNRCNCFNGNNN+SSDDESDDFGGGFGE VTIGSS ADV DLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS
Subjt:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS

Query:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
        SICKSKW+KSPSHPAGEYEYIDVMVE ERLVIDIDFRSEFEIARSTGMYK ILQL+PNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
Subjt:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA

Query:  KWLSPHDRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAK
        KWLSPH RSKPPNPSVKEIEITNM +   NE+SPTETDCGELELIFGDE T  TS +SNSIASS PPPQ+GLYGG+KAA+ VT WQPPAIKPKSLDRGAK
Subjt:  KWLSPHDRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAK

Query:  IVTGLASILKENP
        IVTGLASILKENP
Subjt:  IVTGLASILKENP

XP_008449297.1 PREDICTED: uncharacterized protein LOC103491218 isoform X2 [Cucumis melo]1.5e-20692.63Show/hide
Query:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSVATAGKN
        MK+QPIDIDPPTGR AIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDG SEFEPSSICLAKMVQSFIEESNEKQLSVATA KN
Subjt:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSVATAGKN

Query:  GRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSK
        GRNRCNCFNGNNN+SSDDESDDFGGGFGE VTIGSS ADV DLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSK
Subjt:  GRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSK

Query:  WDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH
        W+KSPSHPAGEYEYIDVMVE ERLVIDIDFRSEFEIARSTGMYK ILQL+PNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH
Subjt:  WDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH

Query:  DRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLA
         RSKPPNPSVKEIEITNM +   NE+SPTETDCGELELIFGDE T  TS +SNSIASS PPPQ+GLYGG+KAA+ VT WQPPAIKPKSLDRGAKIVTGLA
Subjt:  DRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLA

Query:  SILKENP
        SILKENP
Subjt:  SILKENP

XP_038875399.1 uncharacterized protein LOC120067865 [Benincasa hispida]3.9e-21093.01Show/hide
Query:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV
        MEIRGKMKIQPIDI+PPTGR AIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV
Subjt:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV

Query:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS
        AT  KNGRNRCNCFNGNNN+SSDDESDDFG GFGE VTIGSS ADVSDLLKSLI CASVAERNLLADTAKIVEKN+KIHKRKDDLRKVVT+GLSSLGYDS
Subjt:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS

Query:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
        SICKSKWDKSPSHPAGEYEYIDVM+E ERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
Subjt:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA

Query:  KWLSPHDRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDE--TTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRG
        KWLSPH RSKPPNPSVKEIEITNMNQ   NEQSP ETDCGE ELIFGDE  TTTTTS +SNS  +S PPPQKGLYGGEK AVAV AWQPPAIKPKSLDRG
Subjt:  KWLSPHDRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDE--TTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRG

Query:  AKIVTGLASILKENP
        AKIVTGLASILKENP
Subjt:  AKIVTGLASILKENP

TrEMBL top hitse value%identityAlignment
A0A0A0LKR8 Uncharacterized protein3.0e-20891.77Show/hide
Query:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV
        MEIRGKMKIQPIDIDPPTGR AIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIA GEAAQFIINKDG+SEFEPSSICLAKMVQSFIEESNEKQLSV
Subjt:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV

Query:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS
        ATA KNGRNRCNCFNGNNN+SSDDESDDFGGGFGE V IGSS ADV DLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSS+GYD+
Subjt:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS

Query:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
        SICKSKW+KSPSHPAGEYEYIDVMVE ERLVIDIDFRSEFEIARSTGMYK ILQL+PNIFVGKTDRLGQI SIVSEAARQSLKKKGMHFPPWRKAEYMRA
Subjt:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA

Query:  KWLSPHDRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAK
        KWLSPH RSKPPNPSVKE E+ NMN+   NE+SPTETDCGELELIFGDE T  TS +SNSIASS PPPQ+GLYGG+KAAV VTAWQPPAIKPKSLDRGAK
Subjt:  KWLSPHDRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAK

Query:  IVTGLASILKENP
        IVTGLASILKENP
Subjt:  IVTGLASILKENP

A0A1S3BMC1 uncharacterized protein LOC103491218 isoform X12.4e-21092.74Show/hide
Query:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV
        MEIRGKMK+QPIDIDPPTGR AIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDG SEFEPSSICLAKMVQSFIEESNEKQLSV
Subjt:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV

Query:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS
        ATA KNGRNRCNCFNGNNN+SSDDESDDFGGGFGE VTIGSS ADV DLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS
Subjt:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS

Query:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
        SICKSKW+KSPSHPAGEYEYIDVMVE ERLVIDIDFRSEFEIARSTGMYK ILQL+PNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
Subjt:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA

Query:  KWLSPHDRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAK
        KWLSPH RSKPPNPSVKEIEITNM +   NE+SPTETDCGELELIFGDE T  TS +SNSIASS PPPQ+GLYGG+KAA+ VT WQPPAIKPKSLDRGAK
Subjt:  KWLSPHDRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAK

Query:  IVTGLASILKENP
        IVTGLASILKENP
Subjt:  IVTGLASILKENP

A0A1S3BMM4 uncharacterized protein LOC103491218 isoform X27.4e-20792.63Show/hide
Query:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSVATAGKN
        MK+QPIDIDPPTGR AIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDG SEFEPSSICLAKMVQSFIEESNEKQLSVATA KN
Subjt:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSVATAGKN

Query:  GRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSK
        GRNRCNCFNGNNN+SSDDESDDFGGGFGE VTIGSS ADV DLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSK
Subjt:  GRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSK

Query:  WDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH
        W+KSPSHPAGEYEYIDVMVE ERLVIDIDFRSEFEIARSTGMYK ILQL+PNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH
Subjt:  WDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH

Query:  DRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLA
         RSKPPNPSVKEIEITNM +   NE+SPTETDCGELELIFGDE T  TS +SNSIASS PPPQ+GLYGG+KAA+ VT WQPPAIKPKSLDRGAKIVTGLA
Subjt:  DRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLA

Query:  SILKENP
        SILKENP
Subjt:  SILKENP

A0A5D3E4G1 Uncharacterized protein7.4e-20792.63Show/hide
Query:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSVATAGKN
        MK+QPIDIDPPTGR AIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDG SEFEPSSICLAKMVQSFIEESNEKQLSVATA KN
Subjt:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSVATAGKN

Query:  GRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSK
        GRNRCNCFNGNNN+SSDDESDDFGGGFGE VTIGSS ADV DLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSK
Subjt:  GRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSK

Query:  WDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH
        W+KSPSHPAGEYEYIDVMVE ERLVIDIDFRSEFEIARSTGMYK ILQL+PNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH
Subjt:  WDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH

Query:  DRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLA
         RSKPPNPSVKEIEITNM +   NE+SPTETDCGELELIFGDE T  TS +SNSIASS PPPQ+GLYGG+KAA+ VT WQPPAIKPKSLDRGAKIVTGLA
Subjt:  DRSKPPNPSVKEIEITNMNQ---NEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLA

Query:  SILKENP
        SILKENP
Subjt:  SILKENP

A0A6J1H4C5 uncharacterized protein LOC1114600394.0e-19790Show/hide
Query:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV
        MEIRGKMKIQPIDIDPPTGR AIRADPGKPVLKSRLRKLFDRP PNVLKNS+AEKPIAAGEAAQFIINKDG SEFEPSSICLAKMVQSFIEESNEKQLSV
Subjt:  MEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSV

Query:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS
        ATA KNGRNRCNCFNGNNN+SSDDESDDFGGGFGE V IGSS ADVSDLLKSLIL ASVAERNLLADTAKI EKNNKIHK+KDDLRKVVTDGLSSLGYDS
Subjt:  ATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDS

Query:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
        SICKSKWDK PSHPAGEYEYIDV+VEGERL+IDIDFRSEFEIARSTGMYKAILQLLPNIF+GK DRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA
Subjt:  SICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRA

Query:  KWLSPHDRSKPPNPSVKEIEITNMNQNEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVT
        KWLSPH RSKPP PS KEIE    + NEQSPTETDCG+ E+IFGDE  T + P+S SIASS  PPQKG   GEKAAVAVTAWQPPAIKPKSLDRGAKIVT
Subjt:  KWLSPHDRSKPPNPSVKEIEITNMNQNEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVT

Query:  GLASILKENP
        GLASILKENP
Subjt:  GLASILKENP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38820.1 Protein of unknown function (DUF506)1.6e-6545.4Show/hide
Query:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESN--EKQLSVATAG
        MKIQPID +          +  + + KSRL++LF+R F N    + +EK    G   +  +++    +FEPSS+CLAKMV +F+E++N  EKQ       
Subjt:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESN--EKQLSVATAG

Query:  KNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICK
        + GR+RCNCF+G+  ESSDDE++              SS +  ++LKSL+LC S+  RNLL D  KI E +                      YD+++CK
Subjt:  KNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICK

Query:  SKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLS
        S+W+KSPS PAGEYEY+DV+++GERL+IDIDF+S+FEIAR+T  YK++LQ LP IFVGK DRL +I+ ++ +AA+QSLKKKG+H PPWR+AEY+++KWLS
Subjt:  SKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLS

Query:  PHDR-SKPPNPSVKE
         H R  +  N  VK+
Subjt:  PHDR-SKPPNPSVKE

AT2G38820.2 Protein of unknown function (DUF506)8.9e-7247.62Show/hide
Query:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESN--EKQLSVATAG
        MKIQPID +          +  + + KSRL++LF+R F N    + +EK    G   +  +++    +FEPSS+CLAKMV +F+E++N  EKQ       
Subjt:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESN--EKQLSVATAG

Query:  KNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICK
        + GR+RCNCF+G+  ESSDDE++              SS +  ++LKSL+LC S+  RNLL D  KI E +     +     K V +GL SLGYD+++CK
Subjt:  KNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICK

Query:  SKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLS
        S+W+KSPS PAGEYEY+DV+++GERL+IDIDF+S+FEIAR+T  YK++LQ LP IFVGK DRL +I+ ++ +AA+QSLKKKG+H PPWR+AEY+++KWLS
Subjt:  SKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLS

Query:  PHDR-SKPPNPSVKE
         H R  +  N  VK+
Subjt:  PHDR-SKPPNPSVKE

AT3G22970.1 Protein of unknown function (DUF506)4.1e-10155.45Show/hide
Query:  MKIQPIDID-PPTGRGAIRADPG-KPVLKSRLRKLFDRPFPNVLKNS---TAEKP--IAAGEAAQFIINKDG-VSEFEPSSICLAKMVQSFIEESNEKQL
        MKIQPIDID  PT     RA+ G KPVLKSRL++LFDRPF NVL+NS   T EKP  +  GE     +   G V+EFEPSS+CLAKMVQ+FIEE+NEKQ 
Subjt:  MKIQPIDID-PPTGRGAIRADPG-KPVLKSRLRKLFDRPFPNVLKNS---TAEKP--IAAGEAAQFIINKDG-VSEFEPSSICLAKMVQSFIEESNEKQL

Query:  SVATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGY
              K GRNRCNCFNGNN+ SSDDESD FGG             D SD LKSLI C +V ERNLLAD AKIV+KN  + KRKDD++K+V +GL SL Y
Subjt:  SVATAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGY

Query:  DSSICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYM
        +SSICKSKWDKSPS PAGEYEYIDV++  ERL+ID+DFRSEF+IAR T  YK +LQ LP IFVGK+DRL QIV ++SEAA+QSLKKKGM FPPWRKAEYM
Subjt:  DSSICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYM

Query:  RAKWLSPHDRSKPPNPSVKEIEITNMNQNEQS-PTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAK
        R+KWLS + R+       + + +T++   + +   E D  E+EL+F ++     SP+    +SS P       G +  AV               +R  K
Subjt:  RAKWLSPHDRSKPPNPSVKEIEITNMNQNEQS-PTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAK

Query:  IVTGLASILKENP
         VTGLAS+ KE P
Subjt:  IVTGLASILKENP

AT3G22970.2 Protein of unknown function (DUF506)2.6e-6352.85Show/hide
Query:  LLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGM
        ++ SLI C +V ERNLLAD AKIV+KN  + KRKDD++K+V +GL SL Y+SSICKSKWDKSPS PAGEYEYIDV++  ERL+ID+DFRSEF+IAR T  
Subjt:  LLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGM

Query:  YKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPHDRSKPPNPSVKEIEITNMNQNEQS-PTETDCGELELIFGDET
        YK +LQ LP IFVGK+DRL QIV ++SEAA+QSLKKKGM FPPWRKAEYMR+KWLS + R+       + + +T++   + +   E D  E+EL+F ++ 
Subjt:  YKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPHDRSKPPNPSVKEIEITNMNQNEQS-PTETDCGELELIFGDET

Query:  TTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP
            SP+    +SS P       G +  AV               +R  K VTGLAS+ KE P
Subjt:  TTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP

AT4G14620.1 Protein of unknown function (DUF506)3.4e-8751.96Show/hide
Query:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGV---SEFEPSSICLAKMVQSFIEESNEKQLSVATA
        MKIQPI+ D P  R        KPVLKSRL++L DRPF  +   S +EK          +I+ DGV   +EFEPS   LAKMVQ+++EE+N+KQ      
Subjt:  MKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGV---SEFEPSSICLAKMVQSFIEESNEKQLSVATA

Query:  GKNGRN--RCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSS
         KNGRN  RCNCFNG NN+ SDDE D F                  D  KSLI C S  E++LL +  KI+EKN  + KRKD+LRK+V D LSSLGYDSS
Subjt:  GKNGRN--RCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSS

Query:  ICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAK
        ICKSKWDK+ S PAGEYEYIDV+V GERL+IDIDFRSEFEIAR T  YK +LQ LP IFVGK+DR+ QIVSIVSEA++QSLKKKGMHFPPWRKA+YMRAK
Subjt:  ICKSKWDKSPSHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAK

Query:  WLSPHDRSKPPNPSVKEIEITNMNQNEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTG
        WLS + R    N   K+  +T+  +    P E D  E+ELIF  E      P  + I S          G +   VA           +S+ + AK+VTG
Subjt:  WLSPHDRSKPPNPSVKEIEITNMNQNEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTG

Query:  LASILKEN
        LA + KEN
Subjt:  LASILKEN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTCCACGTCATCAATCCACATTATCAAAAACGAAAATAAGAAATGCCATGTAGTTAATGACATGGCTAAACTGTCCTGGTCAGCAATCTTCAATTGCTATAAAGA
AGTCCAGCCCCAATTTTGCCCACTCCACACTCGAGAACCACATAAATCCCTGCCAACTTCTCTATCTTCATCATCATCATCTTCACCGGCGACGACCATCGGGCCGGGAA
ATGAACTTTCCGGCGACCGATTCTTCCAAATCAGCACTCACTCTTCGAATCTCCGATCATCGCCGCCATGTATTTCACGAAAAGCTTTGATCCGAGTTCCAAATTTCTCG
TTTCACGAAGTTATCCGCGTCAATTCCATGGAAATTCGGGGAAAGATGAAGATTCAACCTATTGATATTGATCCTCCAACTGGAAGAGGTGCCATTCGTGCTGATCCTGG
CAAACCAGTACTGAAATCGCGTCTGAGAAAGCTATTTGATCGGCCGTTTCCGAATGTTCTAAAGAATTCTACTGCGGAAAAGCCAATCGCGGCTGGGGAAGCTGCACAGT
TCATCATCAACAAAGATGGAGTCTCTGAGTTCGAGCCGAGCTCCATTTGTTTGGCTAAAATGGTGCAGAGTTTTATAGAGGAGAGTAACGAGAAACAGTTGTCGGTGGCC
ACTGCTGGGAAGAATGGTCGCAACCGCTGCAATTGCTTCAACGGGAACAATAATGAAAGTTCTGATGATGAGTCCGATGACTTCGGCGGTGGTTTTGGTGAACCTGTAAC
AATCGGATCATCTAGTGCCGATGTTTCTGACTTACTCAAGAGTCTGATACTTTGCGCCAGCGTAGCTGAAAGAAACCTCTTAGCCGACACCGCAAAGATTGTTGAGAAGA
ACAACAAAATTCACAAAAGGAAAGACGATTTGAGAAAAGTTGTCACAGATGGTCTTTCATCTCTCGGTTACGACTCTTCAATCTGTAAATCGAAATGGGACAAATCCCCC
TCGCATCCTGCAGGGGAATATGAATACATCGATGTGATGGTGGAGGGCGAGAGATTGGTGATAGACATAGATTTCAGATCGGAGTTTGAGATCGCTCGTTCTACCGGAAT
GTACAAGGCGATTCTCCAATTACTCCCGAATATCTTCGTCGGCAAAACGGATCGTCTAGGTCAAATCGTATCGATTGTATCAGAGGCTGCGAGACAGAGCTTGAAGAAGA
AGGGGATGCACTTTCCGCCATGGAGGAAAGCTGAATACATGAGAGCTAAATGGCTTTCCCCTCACGACAGATCCAAACCTCCAAATCCATCGGTAAAGGAGATCGAAATC
ACGAACATGAATCAAAACGAGCAATCGCCGACAGAGACGGATTGCGGAGAACTCGAATTGATATTCGGCGACGAAACGACGACAACGACATCACCTAAGAGTAATTCAAT
CGCTTCATCTCCTCCCCCTCCGCAGAAGGGTTTGTATGGCGGCGAGAAGGCGGCGGTGGCAGTGACGGCCTGGCAACCTCCGGCGATCAAACCGAAGAGTCTCGATAGAG
GAGCTAAGATCGTTACGGGATTGGCATCAATCCTGAAGGAGAATCCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCTCCACGTCATCAATCCACATTATCAAAAACGAAAATAAGAAATGCCATGTAGTTAATGACATGGCTAAACTGTCCTGGTCAGCAATCTTCAATTGCTATAAAGA
AGTCCAGCCCCAATTTTGCCCACTCCACACTCGAGAACCACATAAATCCCTGCCAACTTCTCTATCTTCATCATCATCATCTTCACCGGCGACGACCATCGGGCCGGGAA
ATGAACTTTCCGGCGACCGATTCTTCCAAATCAGCACTCACTCTTCGAATCTCCGATCATCGCCGCCATGTATTTCACGAAAAGCTTTGATCCGAGTTCCAAATTTCTCG
TTTCACGAAGTTATCCGCGTCAATTCCATGGAAATTCGGGGAAAGATGAAGATTCAACCTATTGATATTGATCCTCCAACTGGAAGAGGTGCCATTCGTGCTGATCCTGG
CAAACCAGTACTGAAATCGCGTCTGAGAAAGCTATTTGATCGGCCGTTTCCGAATGTTCTAAAGAATTCTACTGCGGAAAAGCCAATCGCGGCTGGGGAAGCTGCACAGT
TCATCATCAACAAAGATGGAGTCTCTGAGTTCGAGCCGAGCTCCATTTGTTTGGCTAAAATGGTGCAGAGTTTTATAGAGGAGAGTAACGAGAAACAGTTGTCGGTGGCC
ACTGCTGGGAAGAATGGTCGCAACCGCTGCAATTGCTTCAACGGGAACAATAATGAAAGTTCTGATGATGAGTCCGATGACTTCGGCGGTGGTTTTGGTGAACCTGTAAC
AATCGGATCATCTAGTGCCGATGTTTCTGACTTACTCAAGAGTCTGATACTTTGCGCCAGCGTAGCTGAAAGAAACCTCTTAGCCGACACCGCAAAGATTGTTGAGAAGA
ACAACAAAATTCACAAAAGGAAAGACGATTTGAGAAAAGTTGTCACAGATGGTCTTTCATCTCTCGGTTACGACTCTTCAATCTGTAAATCGAAATGGGACAAATCCCCC
TCGCATCCTGCAGGGGAATATGAATACATCGATGTGATGGTGGAGGGCGAGAGATTGGTGATAGACATAGATTTCAGATCGGAGTTTGAGATCGCTCGTTCTACCGGAAT
GTACAAGGCGATTCTCCAATTACTCCCGAATATCTTCGTCGGCAAAACGGATCGTCTAGGTCAAATCGTATCGATTGTATCAGAGGCTGCGAGACAGAGCTTGAAGAAGA
AGGGGATGCACTTTCCGCCATGGAGGAAAGCTGAATACATGAGAGCTAAATGGCTTTCCCCTCACGACAGATCCAAACCTCCAAATCCATCGGTAAAGGAGATCGAAATC
ACGAACATGAATCAAAACGAGCAATCGCCGACAGAGACGGATTGCGGAGAACTCGAATTGATATTCGGCGACGAAACGACGACAACGACATCACCTAAGAGTAATTCAAT
CGCTTCATCTCCTCCCCCTCCGCAGAAGGGTTTGTATGGCGGCGAGAAGGCGGCGGTGGCAGTGACGGCCTGGCAACCTCCGGCGATCAAACCGAAGAGTCTCGATAGAG
GAGCTAAGATCGTTACGGGATTGGCATCAATCCTGAAGGAGAATCCGTAAAAATTTTGGTTTTTTCTTTTTATTTTCCT
Protein sequenceShow/hide protein sequence
MVSTSSIHIIKNENKKCHVVNDMAKLSWSAIFNCYKEVQPQFCPLHTREPHKSLPTSLSSSSSSSPATTIGPGNELSGDRFFQISTHSSNLRSSPPCISRKALIRVPNFS
FHEVIRVNSMEIRGKMKIQPIDIDPPTGRGAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFIINKDGVSEFEPSSICLAKMVQSFIEESNEKQLSVA
TAGKNGRNRCNCFNGNNNESSDDESDDFGGGFGEPVTIGSSSADVSDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLRKVVTDGLSSLGYDSSICKSKWDKSP
SHPAGEYEYIDVMVEGERLVIDIDFRSEFEIARSTGMYKAILQLLPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPHDRSKPPNPSVKEIEI
TNMNQNEQSPTETDCGELELIFGDETTTTTSPKSNSIASSPPPPQKGLYGGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP