; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G06230 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G06230
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF789)
Genome locationClcChr01:6056535..6060640
RNA-Seq ExpressionClc01G06230
SyntenyClc01G06230
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma]8.9e-22187.84Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVS+ Q P++KP+ +SSVIRETEYGDGC++LPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ
        EPERALKYMG QLNHHHL SELSRRM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK                         
Subjt:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ

Query:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
        +SDL+F+FPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFH+LS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTP
Subjt:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        NGGYEWQLANSLLQDAEDWLRER VNHPDFIFFSRR
Subjt:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa]1.2e-22289.68Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARR HQGRQ DQLRRAQSDVS+GQS +VKPSA+SSVIRETE G+GC+ELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ
        EPERALKYMGKQLNHHHL SELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK                         
Subjt:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ

Query:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
        ISDLAFQFP+LKTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFH+LSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
Subjt:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        NGGYEWQLANSLL DAEDWLRERQVNHPDFIFFSRR
Subjt:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

XP_008467187.1 PREDICTED: uncharacterized protein LOC103504597 [Cucumis melo]1.4e-22189.22Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGD RFYNPTKARR HQGRQ DQLRRAQSDVS+GQS +VKPSA+SSVIRETE G+GC+ELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ
        EPERALKYMGKQLNHHHL SELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK                         
Subjt:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ

Query:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
        ISDLAFQFP+LKT+RSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFH+LSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
Subjt:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        NGGYEWQLANSLL DAEDWLRERQVNHPDFIFFSRR
Subjt:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo]2.0e-22087.84Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVS+ QSP++KP+ +SSVIRETEYGDGC+ELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ
        EPER LKY G QLNHHHL SELSRRM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK                         
Subjt:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ

Query:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
        +SDLAF+FPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFH+LS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTP
Subjt:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        NGGYEWQLANSLLQDAEDWLRER VNHPDFIFFSRR
Subjt:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida]1.8e-22991.97Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVS+GQSPLVKP  +SSVIRETEYGDGC+ELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ
        EPERALKYMGKQLNHHHL SEL RRMD ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK                         
Subjt:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ

Query:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
        ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
Subjt:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        NGGYEWQLANSLLQDAE+WLR+RQVNHPDFIFFSRR
Subjt:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

TrEMBL top hitse value%identityAlignment
A0A0A0KT13 Uncharacterized protein5.8e-21888.3Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF R  GDDRFYNPTKARR HQGRQ DQLRRAQSDVS+GQS +VK SA+SSVIRE+E GDGC+ELPKSIA S FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYL+KTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ
        EPERALKYMGKQLNHH L SELSRRMDN+SFRDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADK                         
Subjt:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ

Query:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
        ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFH+LSSP+GGARSVQ PVVTYPSEIDG+PKMSLPVFGLASYKFRGSLWTP
Subjt:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
Subjt:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

A0A1S3CT52 uncharacterized protein LOC1035045976.7e-22289.22Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGD RFYNPTKARR HQGRQ DQLRRAQSDVS+GQS +VKPSA+SSVIRETE G+GC+ELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ
        EPERALKYMGKQLNHHHL SELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK                         
Subjt:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ

Query:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
        ISDLAFQFP+LKT+RSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFH+LSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
Subjt:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        NGGYEWQLANSLL DAEDWLRERQVNHPDFIFFSRR
Subjt:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein6.0e-22389.68Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARR HQGRQ DQLRRAQSDVS+GQS +VKPSA+SSVIRETE G+GC+ELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ
        EPERALKYMGKQLNHHHL SELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK                         
Subjt:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ

Query:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
        ISDLAFQFP+LKTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFH+LSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
Subjt:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        NGGYEWQLANSLL DAEDWLRERQVNHPDFIFFSRR
Subjt:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC1114451075.3e-21987.39Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVS+ QSP++KP+ +SSVIRETEYGDGC+ELP SIAMSAFEPVVSSLSNLERFLQSI PS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ
        EPERALKYMG QLNHHHL SELSRR + +S RDQLIGLQEDC SDEAESLNSQGQLLFEHLERDLPYSREPLADK                         
Subjt:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ

Query:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
        +SDLAF+FPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFH+LS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTP
Subjt:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        NGG+EWQLANSLLQDAEDWLRER VNHPDFIFFSRR
Subjt:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

A0A6J1JV26 uncharacterized protein LOC1114891474.0e-21986.7Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        M GAGLQF RGCGDDRFYNPTKARR+HQGRQNDQLRR QSDVS+ +SP++KP+ +SS+IRETEYGDGC+ELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ
        EPERA+KYMG QLNHHHL SELSRRM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK                         
Subjt:  EPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQ

Query:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
        +SDLAF+FPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFH+LS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTP
Subjt:  ISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        NGGYEWQLANSLLQDA+DWLRER VNHPDFIFFSRR
Subjt:  NGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)1.3e-10555.15Show/hide
Query:  QLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKTTMKGWRTCDVEFQ-PYFVLGDLWE
        QL+RAQ DVS G          SS  ++ E G          A+       +S SN+ERFL S+ PSVPA YLSKT ++     DVE Q PYF+LGD+WE
Subjt:  QLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKTTMKGWRTCDVEFQ-PYFVLGDLWE

Query:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHHLFSELSRRMDNI
        SF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R+ GE+S+SDFRDSSS+GSS SE ER L Y  +Q         +S RMD +
Subjt:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHHLFSELSRRMDNI

Query:  SFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAW
        S R +    QED SSD+ E L+SQG+L+FE+LERDLPY REP ADK                         +SDLA +FPELKTLRSCDLLPSSWFSVAW
Subjt:  SFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAW

Query:  YPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPV-VTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHP
        YPIY+IPTGPTL+DLDACFLT+H L +P  G     G + V  P E   + KM LPVFGLASYK RGS+WT  GG   QLANSL Q A++WLR RQVNHP
Subjt:  YPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPV-VTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHP

Query:  DFIFFSRR
        DFIFF RR
Subjt:  DFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789)1.2e-11455.43Show/hide
Query:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + DQLRRAQSDVS+  S    P                            EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLW
        D SSDS+ ER                 +S R+D IS RDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK                   
Subjt:  DGSSDSEPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLW

Query:  FQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
              + DLA QFPEL TLRSCDLL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+H L +  GG  S Q   +T P E +   KMSLPVFGLASYKFR
Subjt:  FQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTP GG E QL NSL Q A+ WL    V+HPDF+FF RR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789)5.6e-8854.1Show/hide
Query:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + DQLRRAQSDVS+  S    P                            EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLW
        D SSDS+ ER                 +S R+D IS RDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK                   
Subjt:  DGSSDSEPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLW

Query:  FQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGG
              + DLA QFPEL TLRSCDLL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+H L +  GG
Subjt:  FQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGG

AT4G16100.1 Protein of unknown function (DUF789)1.5e-8042.52Show/hide
Query:  GDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSI---AMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKT
        G++RFYNP   R+  Q R+  +L   + +    ++  +    +    +E +  + C     S+     S      ++ SNL RFL    P V  Q+L  T
Subjt:  GDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSI---AMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKT

Query:  TMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY
        + KGWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++    R+ GE+SD D  RD SSDGS+D         
Subjt:  TMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY

Query:  MGKQLNHHHLFSELSRRMDNISFRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQISDLAF
                    ELS+ +   S  ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DK                         IS+L+ 
Subjt:  MGKQLNHHHLFSELSRRMDNISFRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQISDLAF

Query:  QFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGY-E
        QFP L+T RSCDL PSSW SVAWYPIYRIP G +L++LDACFLTFH LS+P  G  + +G      S+     K+ LP FGLASYKF+ S W+P     E
Subjt:  QFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGY-E

Query:  WQLANSLLQDAEDWLRERQVNHPDFIFF
         Q   +LL+ AE+WLR  +V  PDF  F
Subjt:  WQLANSLLQDAEDWLRERQVNHPDFIFF

AT5G49220.1 Protein of unknown function (DUF789)1.6e-7441.5Show/hide
Query:  GLQFARGC--GDDRFYNPTKARRAHQGRQ-NDQLRRAQSDVSSGQSPLVKPSAMSSVI--RETEYGDGCDELPKSIAMSAFE-------------PVVSS
        G+  AR    G++RFYNP   RR  Q  Q   Q+R  Q      +  + K    ++ +  R T  G G  E    + +S  E              V+S 
Subjt:  GLQFARGC--GDDRFYNPTKARRAHQGRQ-NDQLRRAQSDVSSGQSPLVKPSAMSSVI--RETEYGDGCDELPKSIAMSAFE-------------PVVSS

Query:  LSNLERFLQSIAPSVPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQP
         SNL+RFL+   P VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K R P
Subjt:  LSNLERFLQSIAPSVPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQP

Query:  GEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKY
          D+     + SS+GSS+S        +G+              ++ IS +DQ   +    SS EAE  N QG+LLFE+LE + P+ REPLA+K      
Subjt:  GEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHHLFSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKY

Query:  LNHFLSFLVTLLWFQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKM
                           ISDLA + PEL T RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LS+     +S  G   + PS      K+
Subjt:  LNHFLSFLVTLLWFQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKM

Query:  SLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFS
         LP FGLASYK + S+W  N   E Q   SLLQ A+ WL+  QV+HPD+ FF+
Subjt:  SLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGTGCAGGCTTGCAGTTTGCTCGTGGTTGTGGTGACGATAGGTTTTACAATCCGACGAAAGCTCGTAGGGCGCATCAGGGTCGTCAAAATGATCAGCTCCGGAG
AGCTCAGAGCGATGTTTCTTCTGGTCAATCCCCTCTCGTTAAACCGAGCGCGATGTCCTCGGTGATTAGAGAAACCGAATATGGCGATGGGTGTGATGAGCTCCCTAAAT
CCATTGCGATGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTTAGTAATCTCGAGCGGTTCTTGCAGTCCATCGCGCCATCTGTACCTGCACAGTACCTATCAAAGACAACG
ATGAAGGGTTGGAGAACCTGTGACGTGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGT
ATTAAACGACAGTGACAGTGTTGTCCAATATTATGTACCATATTTATCTGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTGAGG
ACAGTGATAGTGACTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAAAGAGCACTAAAATACATGGGGAAGCAACTCAATCATCACCATTTATTTTCT
GAGCTTTCTCGTAGAATGGATAATATATCTTTTCGGGACCAGCTAATTGGACTCCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTCTCAATTCTCAAGGCCAGCTACT
ATTTGAGCATCTTGAACGTGATTTGCCTTATAGTCGTGAACCTTTGGCAGATAAGGCAAAAGAGATAAAATACCTAAATCATTTCCTTTCATTCCTCGTAACCTTACTCT
GGTTTCAATATTTTCTCCAGATATCAGATCTTGCTTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTAT
CCAATTTACAGGATTCCAACTGGACCAACATTAAGGGATCTGGATGCCTGCTTCCTCACCTTTCATCATTTGTCTTCGCCAATGGGAGGGGCACGTAGTGTTCAAGGTCC
TGTAGTAACGTATCCTAGTGAGATAGATGGTATCCCTAAGATGTCCCTACCAGTTTTTGGTCTAGCTTCATACAAGTTTAGAGGGTCCTTATGGACTCCAAATGGTGGAT
ACGAGTGGCAATTGGCAAATTCACTTTTGCAGGATGCTGAGGATTGGTTAAGAGAACGTCAAGTAAATCACCCTGACTTCATCTTCTTCAGCCGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
TTTTTTTTTTTTTTTGGTTCCTTTTGACGAGCCACAGAGGGCTCTGTTTTTCGTCTCTCACATTTTGGAAGCCGATTGTTTGTCTAATTCTATCTCGCAATCAATCGGAC
TCTTCTGTGCTTACGGATTTTTATTCTAATCCGATCAGTCTCTAGCGAGAGATTGTGAATTGCTTTTGCATTTTCGTTGACTTTTATTGAGGATTCTACGACTGCCGATT
GCCATTTTCCAGAGCCATTTTTTTTTTTCGTTTGGTTTTGGAAATATAATGTTAGGTGCAGGCTTGCAGTTTGCTCGTGGTTGTGGTGACGATAGGTTTTACAATCCGAC
GAAAGCTCGTAGGGCGCATCAGGGTCGTCAAAATGATCAGCTCCGGAGAGCTCAGAGCGATGTTTCTTCTGGTCAATCCCCTCTCGTTAAACCGAGCGCGATGTCCTCGG
TGATTAGAGAAACCGAATATGGCGATGGGTGTGATGAGCTCCCTAAATCCATTGCGATGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTTAGTAATCTCGAGCGGTTCTTG
CAGTCCATCGCGCCATCTGTACCTGCACAGTACCTATCAAAGACAACGATGAAGGGTTGGAGAACCTGTGACGTGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTG
GGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGTATTAAACGACAGTGACAGTGTTGTCCAATATTATGTACCATATTTATCTGGTATACAGATAT
ATGGTGAATCCTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTGAGGACAGTGATAGTGACTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAAAGA
GCACTAAAATACATGGGGAAGCAACTCAATCATCACCATTTATTTTCTGAGCTTTCTCGTAGAATGGATAATATATCTTTTCGGGACCAGCTAATTGGACTCCAAGAAGA
CTGTTCTAGTGACGAGGCTGAATCTCTCAATTCTCAAGGCCAGCTACTATTTGAGCATCTTGAACGTGATTTGCCTTATAGTCGTGAACCTTTGGCAGATAAGGCAAAAG
AGATAAAATACCTAAATCATTTCCTTTCATTCCTCGTAACCTTACTCTGGTTTCAATATTTTCTCCAGATATCAGATCTTGCTTTTCAGTTCCCTGAGCTCAAGACATTA
CGAAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAGGGATCTGGATGCCTGCTTCCTCACCTT
TCATCATTTGTCTTCGCCAATGGGAGGGGCACGTAGTGTTCAAGGTCCTGTAGTAACGTATCCTAGTGAGATAGATGGTATCCCTAAGATGTCCCTACCAGTTTTTGGTC
TAGCTTCATACAAGTTTAGAGGGTCCTTATGGACTCCAAATGGTGGATACGAGTGGCAATTGGCAAATTCACTTTTGCAGGATGCTGAGGATTGGTTAAGAGAACGTCAA
GTAAATCACCCTGACTTCATCTTCTTCAGCCGAAGGTGAAATGAAATCCTTGCAACATCTACAATGCCTAAACCTAAAGGTGGGAATCAAGATATCATAGTTCAGTAAGA
TGTCGTAATTTGCTCTTTGTGGCGCCGTTCTACTTTCAGAAAATGGAAAGGAAAAAGGAAAAGAACAAAAAAACCAAAAAACCAAAAAACCAAAAAAAAAAAAAAGAACG
AAAAAAAAGGGAAAAAGAAGGAAAAGAAAAGTTTGTGATGGGATGGTGGCGAGGTTGGTGGATGGTGACGTAAACAAAAACAAAGGTCAAAAGGGAAAAAGGCAATAAAG
AAAAAGCTGAAATACAGTCGGATGTAGAAGTTGTTGGCAAAACATTGCCCAAAGTTGGTTTTACTTGCTGCTGCTTGTGCTGCATAGCAAGTTTTATCCTTTCTTTTTTC
TCTTTAATTACTGAGTTTTTTTTCTTTTCTTTTTTTCTATAAGTTATTAACCGTAGGTGAAATGAAAGGAAAGAAAAAGAAAAAGAAAAACAAAGAAAAACAGGCCCTTT
TTACTGTCCTGATGATGTTTAGAATGTAAACTATGGAGAGGGGATTTTTTGGTAGGAAGGTTATGTTAGGAGTAAGCTATGGATTCCCCCCAATGTATTTGAATGTTTAT
TTATGCTCCTTAGTTGCTATCGAAAACTGTATAATTTGTTATGAAGAGC
Protein sequenceShow/hide protein sequence
MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSSGQSPLVKPSAMSSVIRETEYGDGCDELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKTT
MKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHHLFS
ELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKAKEIKYLNHFLSFLVTLLWFQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAWY
PIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR