CuGenDBv2

CSPI05G20000 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G20000
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Protein of unknown function (DUF789)
Genome location: Chr5:21169869..21174580
RNA-Seq Expression: CSPI05G20000
Synteny: CSPI05G20000
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
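The genome location field above can be split into a chromosome name and a coordinate pair. The sketch below is illustrative only: the names `GenomeInterval`, `parse_location`, and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeInterval(NamedTuple):
    """Hypothetical container for a 'Chr:start..end' location string."""
    chrom: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


def parse_location(text: str) -> GenomeInterval:
    """Parse a location written as 'Chr5:21169869..21174580'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return GenomeInterval(chrom, int(start), int(end))


def span_length(interval: GenomeInterval) -> int:
    """Length of the interval in base pairs under the inclusive assumption."""
    return interval.end - interval.start + 1


loc = parse_location("Chr5:21169869..21174580")
print(loc.chrom, loc.start, loc.end, span_length(loc))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.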


Homology
GenBank top hits (e value, % identity, alignment)
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.3e-215; % identity: 90.75)
Query:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGR  GDDRFYNPTKARR HQGRQ DQLRRAQSDVSA Q  V+K + VSSVIRE+E GDGCE+LPKSIA S FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY +KTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHH LSSELSRRM+ +S RDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADK+SDL+F+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLS+P+GGARSVQ PVVTYPS+IDG+P+MSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRER V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa] (e value: 2.4e-228; % identity: 96.35)
Query:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGR  GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVK SAVSSVIRE+ECG+GCEELPKSIA SGFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYL+KTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQLNHH LSSELSRRMDN+SFRDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADKISDLAFQFP+LKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSP GGARSVQCPVVTYPSEIDG+PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_004147115.1 uncharacterized protein LOC101217142 [Cucumis sativus] (e value: 2.4e-236; % identity: 100)
Query:  MLGAGLQFGRGDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVP
        MLGAGLQFGRGDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVP
Subjt:  MLGAGLQFGRGDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVP

Query:  AQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEP
        AQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEP
Subjt:  AQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEP

Query:  ERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSV
        ERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSV
Subjt:  ERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSV

Query:  AWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNH
        AWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNH
Subjt:  AWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNH

Query:  PDFIFFSRR
        PDFIFFSRR
Subjt:  PDFIFFSRR

XP_008467187.1 PREDICTED: uncharacterized protein LOC103504597 [Cucumis melo] (e value: 2.7e-227; % identity: 95.86)
Query:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGR  GD RFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVK SAVSSVIRE+ECG+GCEELPKSIA SGFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYL+KTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQLNHH LSSELSRRMDN+SFRDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADKISDLAFQFP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSP GGARSVQCPVVTYPSEIDG+PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida] (e value: 5.4e-220; % identity: 93.19)
Query:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF R  GDDRFYNPTKARR HQGRQ DQLRRAQSDVSAGQS +VK   VSSVIRE+E GDGCEELPKSIA S FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYL+KTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQLNHH LSSEL RRMD +SFRDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFH+LSSP+GGARSVQ PVVTYPSEIDG+PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAE+WLR+RQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KT13 Uncharacterized protein (e value: 1.2e-236; % identity: 100)
Query:  MLGAGLQFGRGDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVP
        MLGAGLQFGRGDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVP
Subjt:  MLGAGLQFGRGDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVP

Query:  AQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEP
        AQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEP
Subjt:  AQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEP

Query:  ERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSV
        ERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSV
Subjt:  ERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSV

Query:  AWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNH
        AWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNH
Subjt:  AWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNH

Query:  PDFIFFSRR
        PDFIFFSRR
Subjt:  PDFIFFSRR

A0A1S3CT52 uncharacterized protein LOC103504597 (e value: 1.3e-227; % identity: 95.86)
Query:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGR  GD RFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVK SAVSSVIRE+ECG+GCEELPKSIA SGFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYL+KTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQLNHH LSSELSRRMDN+SFRDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADKISDLAFQFP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSP GGARSVQCPVVTYPSEIDG+PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein (e value: 1.2e-228; % identity: 96.35)
Query:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGR  GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVK SAVSSVIRE+ECG+GCEELPKSIA SGFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYL+KTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQLNHH LSSELSRRMDN+SFRDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADKISDLAFQFP+LKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSP GGARSVQCPVVTYPSEIDG+PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC111445107 (e value: 3.1e-213; % identity: 90.27)
Query:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGR  GDDRFYNPTKARR HQGRQ DQLRRAQSDVSA QS V+K + VSSVIRE+E GDGCEELP SIA S FEPVVSSLSNLERFLQSI PS
Subjt:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY +KTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHH LSSELSRR + +S RDQLIGLQEDC SDEAESLNS+GQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLS+P+GGARSVQ PVVTYPS+IDG+P+MSLPVFGLASYKFRGSLWTPNGG+EWQLANSLLQDAEDWLRER V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1JV26 uncharacterized protein LOC111489147 (e value: 1.8e-213; % identity: 89.78)
Query:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS
        M GAGLQFGR  GDDRFYNPTKARR HQGRQ DQLRR QSDVSA +S V+K + VSS+IRE+E GDGCEELPKSIA S FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGR--GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY +KTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERA+KYMG QLNHH LSSELSRRM+ +S RDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLS+P+GGARSVQ PVVTYPS+IDG+P+MSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDA+DWLRER V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) (e value: 2.3e-107; % identity: 58.49)
Query:  QLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVPAQYLAKTTMKGWRTCDMEFQ-PYFVLGDLWE
        QL+RAQ DVS G          SS  ++ E G    +   S A+S         SN+ERFL S+ PSVPA YL+KT ++     D+E Q PYF+LGD+WE
Subjt:  QLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVPAQYLAKTTMKGWRTCDMEFQ-PYFVLGDLWE

Query:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHRLSSELSRRMDNM
        SF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R+ GE+S+SDFRDSSS+GSS SE ER L Y  +Q         +S RMD +
Subjt:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHRLSSELSRRMDNM

Query:  SFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYL
        S R +    QED SSD+ E L+S+G+L+FE+LERDLPY REP ADK+SDLA +FPELKTLRSCDLLPSSWFSVAWYPIY+IPTGPTLKDLDACFLT+H L
Subjt:  SFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYL

Query:  SSPIGGARSVQCPV-VTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
         +P  G       + V  P E   V KM LPVFGLASYK RGS+WT  GG   QLANSL Q A++WLR RQVNHPDFIFF RR
Subjt:  SSPIGGARSVQCPV-VTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789) (e value: 6.0e-116; % identity: 58.27)
Query:  MLGAGLQFGR---GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAP
        MLGAG Q  R   GDD FY   K RR +Q  + DQLRRAQSDVS      V  SA S   ++                   EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFGR---GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYLAKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+L+KT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLAKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDL
        D SSDS+ ER                 +S R+D +S RDQ    QED SSD+ E L S+G+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSCDL
Subjt:  DGSSDSEPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDW
        L SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L +  GG  S Q   +T P E +   KMSLPVFGLASYKFRGSLWTP GG E QL NSL Q A+ W
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDW

Query:  LRERQVNHPDFIFFSRR
        L    V+HPDF+FF RR
Subjt:  LRERQVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789) (e value: 3.7e-89; % identity: 57.48)
Query:  MLGAGLQFGR---GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAP
        MLGAG Q  R   GDD FY   K RR +Q  + DQLRRAQSDVS      V  SA S   ++                   EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFGR---GDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYLAKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+L+KT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLAKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDL
        D SSDS+ ER                 +S R+D +S RDQ    QED SSD+ E L S+G+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSCDL
Subjt:  DGSSDSEPERALKYMGKQLNHHRLSSELSRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGG
        L SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L +  GG
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGG

AT4G16100.1 Protein of unknown function (DUF789) (e value: 7.1e-85; % identity: 45.19)
Query:  RGDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAV----SSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVPAQYLA
        RG++RFYNP   R++ Q R+K +L   + +    ++  +    +      + +  EC      +P  ++++      +S SNL RFL    P V  Q+L 
Subjt:  RGDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAV----SSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVPAQYLA

Query:  KTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERAL
         T+ KGWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++    R+ GE+SD D  RD SSDGS+D       
Subjt:  KTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERAL

Query:  KYMGKQLNHHRLSSELSRRMDNMSFRDQ-LIGLQEDCSSDEAE-SLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAW
                      ELS+ +   S  ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DKIS+L+ QFP L+T RSCDL PSSW SVAW
Subjt:  KYMGKQLNHHRLSSELSRRMDNMSFRDQ-LIGLQEDCSSDEAE-SLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAW

Query:  YPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAEDWLRERQVNHP
        YPIYRIP G +L++LDACFLTFH LS+P  G  + +       S+     K+ LP FGLASYKF+ S W+P     E Q   +LL+ AE+WLR  +V  P
Subjt:  YPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAEDWLRERQVNHP

Query:  DFIFF
        DF  F
Subjt:  DFIFF

AT5G49220.1 Protein of unknown function (DUF789) (e value: 1.7e-78; % identity: 44.05)
Query:  RGDDRFYNPTKARRVHQGRQ-KDQLRRAQSDVSAGQSLVVKQSAVSSVI--RESECGDGCEELPKSIATSGFE-------------PVVSSLSNLERFLQ
        RG++RFYNP   RR+ Q  Q + Q+R  Q      + L+ K+   ++ +  R +  G G  E    +  SG E              V+S  SNL+RFL+
Subjt:  RGDDRFYNPTKARRVHQGRQ-KDQLRRAQSDVSAGQSLVVKQSAVSSVI--RESECGDGCEELPKSIATSGFE-------------PVVSSLSNLERFLQ

Query:  SIAPSVPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFR
           P VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K R P  D+     
Subjt:  SIAPSVPAQYLAKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFR

Query:  DSSSDGSSDSEPERALKYMGKQLNHHRLSSELS-RRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTL
        + SS+GSS               N   L  +LS   ++ +S +DQ   +    SS EAE  N +G+LLFE+LE + P+ REPLA+KISDLA + PEL T 
Subjt:  DSSSDGSSDSEPERALKYMGKQLNHHRLSSELS-RRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTL

Query:  RSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQ
        RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LS+      ++ C      S+     K+ LP FGLASYK + S+W  N   E Q   SLLQ
Subjt:  RSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPIGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQ

Query:  DAEDWLRERQVNHPDFIFFS
         A+ WL+  QV+HPD+ FF+
Subjt:  DAEDWLRERQVNHPDFIFFS


Sequences
CDS sequence
ATGTTAGGTGCAGGATTGCAGTTCGGTCGTGGTGACGATAGGTTTTATAATCCGACGAAAGCTCGTAGGGTGCATCAGGGCCGTCAAAAGGATCAGCTCCGGAGAGCTCA
GAGCGATGTTTCGGCTGGTCAATCCCTTGTCGTTAAACAGAGCGCGGTGTCCTCGGTTATTAGGGAATCCGAATGCGGCGATGGGTGTGAAGAGCTCCCTAAATCTATTG
CGACGTCGGGTTTTGAGCCGGTGGTGTCATCGCTGAGTAATCTCGAGCGGTTCTTGCAGTCTATCGCACCATCTGTTCCTGCACAGTATCTCGCAAAGACAACGATGAAG
GGTTGGAGAACCTGTGATATGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGGGCAGGTGTGCCTCTTGTATTAAA
CGACAGTGACAGCGTTGTCCAATATTATGTACCATATTTATCTGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCAAAGTCGAGGCAACCTGGTGAGGACAGTG
ATAGTGACTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAATACATGGGGAAGCAACTCAATCATCATCGTTTATCATCTGAGCTT
TCTCGTAGAATGGATAATATGTCTTTTCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGATGAGGCCGAATCTCTTAATTCTAGAGGGCAGCTACTGTTCGA
GCATCTTGAACGTGATTTGCCTTATAGTCGTGAACCTTTGGCAGATAAGATATCAGATCTTGCTTTTCAGTTCCCTGAGCTCAAGACATTACGGAGTTGTGATCTATTGC
CTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGCTTCCTCACATTTCACTATTTGTCTTCGCCA
ATTGGAGGAGCACGTAGTGTTCAATGTCCTGTAGTAACGTATCCTAGTGAGATAGATGGTGTCCCTAAGATGTCTCTACCAGTTTTTGGTCTAGCTTCATACAAGTTCAG
AGGGTCTTTATGGACTCCAAATGGTGGATACGAGTGGCAATTGGCAAATTCACTTTTGCAGGATGCTGAGGATTGGTTAAGAGAACGTCAAGTAAATCACCCTGACTTCA
TCTTCTTCAGCAGAAGGTGA
mRNA sequence
AAAAGAATTAGTGGCTATTTTCTTCCTTGAAAAAACCGGCGTACTGTTCTTCTCCTCCATCTCTCTCCATTTTTCTCTTCTTCAAATCGCTTCCTTTTTCTCTTCCCTTA
CCTCTCTCTCTCTCTCTCTCTCTCGACGCCATTGAAACTTTCCTTCTCTGTATTAGATCCAATTCATCTTCTCCCATTCCCTTCACCCATTTCTTCCTTTTCAACGACCC
ACACAGGGCACCGCTCTGTTTTTCCTCTCTCACATTCTGCCACCACCCCCGATTGTTTGTCTAATTCTATTTCGCAATCAATCGGTCTCTCTTCTTCTTCTTCTTCGGGA
TTTTCATTTTAATCCCATTACTCTCAACCAACGGATTCTTGCTTTTCCCATTTTTCGTTGACTTTTATTGAGGATTGATTAGCCGATTCCCATTTTCCAGACCCTTTTTT
TTCTCTGTTCTTTTTTCTTTTTCTCGTTTGGTTTTGGAAAAAAAAATTATAATGTTAGGTGCAGGATTGCAGTTCGGTCGTGGTGACGATAGGTTTTATAATCCGACGAA
AGCTCGTAGGGTGCATCAGGGCCGTCAAAAGGATCAGCTCCGGAGAGCTCAGAGCGATGTTTCGGCTGGTCAATCCCTTGTCGTTAAACAGAGCGCGGTGTCCTCGGTTA
TTAGGGAATCCGAATGCGGCGATGGGTGTGAAGAGCTCCCTAAATCTATTGCGACGTCGGGTTTTGAGCCGGTGGTGTCATCGCTGAGTAATCTCGAGCGGTTCTTGCAG
TCTATCGCACCATCTGTTCCTGCACAGTATCTCGCAAAGACAACGATGAAGGGTTGGAGAACCTGTGATATGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGA
GTCTTTTAAGGAATGGAGTGCTTATGGGGCAGGTGTGCCTCTTGTATTAAACGACAGTGACAGCGTTGTCCAATATTATGTACCATATTTATCTGGTATACAGATATATG
GTGAATCCTTGAAGTCCTCTGCAAAGTCGAGGCAACCTGGTGAGGACAGTGATAGTGACTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCA
CTAAAATACATGGGGAAGCAACTCAATCATCATCGTTTATCATCTGAGCTTTCTCGTAGAATGGATAATATGTCTTTTCGGGACCAGCTAATTGGACTTCAAGAAGACTG
TTCTAGTGATGAGGCCGAATCTCTTAATTCTAGAGGGCAGCTACTGTTCGAGCATCTTGAACGTGATTTGCCTTATAGTCGTGAACCTTTGGCAGATAAGATATCAGATC
TTGCTTTTCAGTTCCCTGAGCTCAAGACATTACGGAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACA
TTAAAGGATCTGGATGCCTGCTTCCTCACATTTCACTATTTGTCTTCGCCAATTGGAGGAGCACGTAGTGTTCAATGTCCTGTAGTAACGTATCCTAGTGAGATAGATGG
TGTCCCTAAGATGTCTCTACCAGTTTTTGGTCTAGCTTCATACAAGTTCAGAGGGTCTTTATGGACTCCAAATGGTGGATACGAGTGGCAATTGGCAAATTCACTTTTGC
AGGATGCTGAGGATTGGTTAAGAGAACGTCAAGTAAATCACCCTGACTTCATCTTCTTCAGCAGAAGGTGAAATCCTTGCAACATCTTCAATGCCTAAACCTAAAGCTGG
TAACAATATCATCGTTCAGTATGATGTCGTAATTTGCTCTTCGTGGCGCTGTTCAACTTTCAGAAAATGGAAAGGAAAAAGGAAAAGAACAAAAAAAGAAAAAAAGAAAA
AGAAAAAAAAAAGGAAAAAGAAGGAAAAGAAAAGTTTGTGATGGGATGGTGGTGAGGTTGGTGGATGGCGACTTGATAAAAAAAAAAAAACAAAGGTCAAAAGGGGAAAA
GGCCATAAAGAAAAAGGTGAAATACAGTTGGGTGAAGATGTAGAAGTTTTAGCAAAAACATTGCCCAAAGTTGGTTCTACTTGCTGCTGCTGCTTGAGCTGCATAGCAGG
TTTTATCCTTTCTTTTTTTCTCTTTAATTACTGAAAGTTATTTTTTTTTCTATAAGTTATTAACATTAGGTGAAATGAAAGGAAAGAAAAAGAAAAAGAAAGAACCCCCA
AAAAAAGAAAAACAGGCCCTTTTTACTGTCCTGATGATGTTTAGAATGTAAACTAATGGAGAGGGGATTTTTGATAGGAAAGGTTATGTTAGGAGTAAGCTATGGATCCC
CCCCCAATGTATTTGAATGTTTGTTTATGCTTCTTAGTTGCTATTGATGAAAACTGTATAATCTTTTATGAAGAGCTTTGTTATTAATGAACCATTATTCAAGCTCTCTA
TC
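Because the mRNA sequence contains the CDS as a contiguous substring, the flanking untranslated regions can be recovered by locating the CDS within the mRNA. The sketch below is a minimal illustration; `clean` and `split_utrs` are hypothetical helper names, and the short sequences in the usage lines are toy strings rather than the full record.

```python
def clean(seq_text: str) -> str:
    """Remove line breaks and spaces from a wrapped sequence and upper-case it."""
    return "".join(seq_text.split()).upper()


def split_utrs(mrna: str, cds: str) -> tuple[str, str, str]:
    """Return (5' UTR, CDS, 3' UTR) by finding the first exact match of the CDS."""
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found as a contiguous substring of the mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]


# Toy strings for illustration only; paste the full mRNA and CDS text instead.
toy_cds = clean("ATGTTAGGT\nTGA")
toy_mrna = clean("AAATTATA" + "ATGTTAGGTTGA" + "AATCC")
five_prime, coding, three_prime = split_utrs(toy_mrna, toy_cds)
print(len(five_prime), len(coding), len(three_prime))
```

An exact substring search is enough here because a spliced mRNA and its CDS share the same strand and contain no introns; it would not work against the genomic sequence.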
Protein sequence
MLGAGLQFGRGDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKQSAVSSVIRESECGDGCEELPKSIATSGFEPVVSSLSNLERFLQSIAPSVPAQYLAKTTMK
GWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHRLSSEL
SRRMDNMSFRDQLIGLQEDCSSDEAESLNSRGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSP
IGGARSVQCPVVTYPSEIDGVPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
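The protein sequence corresponds to the translation of the CDS listed earlier in this record. The sketch below shows one way to check that relationship, assuming the standard genetic code (NCBI translation table 1); the `translate` function is a hypothetical helper, and only the first ten and last five codons of the CDS are used in the usage lines.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS in this record.
print(translate("ATGTTAGGTGCAGGATTGCAGTTCGGTCGT"))
print(translate("TTCAGCAGAAGGTGA"))
```

Running `translate` on the complete CDS text and comparing the result with the protein sequence gives a quick consistency check for the record.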