CuGenDBv2

HG10009496 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10009496
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF789)
Genome location: Chr06:6659291..6662452
RNA-Seq Expression: HG10009496
Synteny: HG10009496
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789


Homology
GenBank top hits (e value, % identity, alignment)
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.2e-221 | % identity: 92.7
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQN+QLRRAQSDVSA Q P++KP+ VSSVIRETEYGDGCE+LPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QL+HHHL SELS RM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++L+F+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRER V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa] | e value: 1.2e-222 | % identity: 94.16
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARR HQGRQ +QLRRAQSDVSAGQS +VKPSAVSSVIRETE G+GCEELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTT KGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQL+HHHL SELS RMD+ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAFQFP+LKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_008467187.1 PREDICTED: uncharacterized protein LOC103504597 [Cucumis melo] | e value: 1.3e-221 | % identity: 93.67
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGD RFYNPTKARR HQGRQ +QLRRAQSDVSAGQS +VKPSAVSSVIRETE G+GCEELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTT KGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQL+HHHL SELS RMD+ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAFQFP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo] | e value: 1.3e-221 | % identity: 92.94
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQN+QLRRAQSDVSA QSP++KP+ VSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF
        EPER LKY G QL+HHHL SELS RM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRER V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida] | e value: 1.1e-228 | % identity: 96.35
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFARGCGDDRFYNPTKARRAHQGRQN+QLRRAQSDVSAGQSPLVKP  VSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTT KGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQL+HHHL SEL  RMD ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAFQFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTLRDLDACFLTFH+LSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAE+WLR+RQV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KT13 Uncharacterized protein | e value: 7.2e-218 | % identity: 92.7
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF R  GDDRFYNPTKARR HQGRQ +QLRRAQSDVSAGQS +VK SAVSSVIRE+E GDGCEELPKSIA S FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYL+KTT KGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQL+HH L SELS RMD++SFRDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADK ++LAFQFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSSP+GGARSVQ PVVTYPSEIDG+PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A1S3CT52 uncharacterized protein LOC103504597 | e value: 6.3e-222 | % identity: 93.67
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGD RFYNPTKARR HQGRQ +QLRRAQSDVSAGQS +VKPSAVSSVIRETE G+GCEELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTT KGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQL+HHHL SELS RMD+ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAFQFP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein | e value: 5.7e-223 | % identity: 94.16
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARR HQGRQ +QLRRAQSDVSAGQS +VKPSAVSSVIRETE G+GCEELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTT KGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGKQL+HHHL SELS RMD+ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAFQFP+LKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC111445107 | e value: 1.3e-219 | % identity: 92.21
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQN+QLRRAQSDVSA QSP++KP+ VSSVIRETEYGDGCEELP SIAMSAFEPVVSSLSNLERFLQSI PS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QL+HHHL SELS R + +S RDQLIGLQEDC SDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNGG+EWQLANSLLQDAEDWLRER V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1JV26 uncharacterized protein LOC111489147 | e value: 3.4e-220 | % identity: 91.73
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        M GAGLQF RGCGDDRFYNPTKARR+HQGRQN+QLRR QSDVSA +SP++KP+ VSS+IRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF
        EPERA+KYMG QL+HHHL SELS RM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDA+DWLRER V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) | e value: 1.1e-106 | % identity: 58.22
Query:  QLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKTTTKGWRTCDVEFQ-PYFVLGDLWE
        QL+RAQ DVS G          SS  ++ E G          A+       +S SN+ERFL S+ PSVPA YLSKT  +     DVE Q PYF+LGD+WE
Subjt:  QLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKTTTKGWRTCDVEFQ-PYFVLGDLWE

Query:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSI
        SF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R+ GE+S+SDFRDSSS+GSS SE ER L Y  +Q         +S RMD +
Subjt:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSI

Query:  SFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYL
        S R +    QED SSD+ E L+SQG+L+FE+LERDLPY REP ADK ++LA +FPELKTLRSCDLLPSSWFSVAWYPIY+IPTGPTL+DLDACFLT+H L
Subjt:  SFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYL

Query:  SSPMGGARSVQGPV-VTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
         +P  G     G + V  P E   + KM LPVFGLASYK RGS+WT  GG   QLANSL Q A++WLR RQVNHPDFIFF RR
Subjt:  SSPMGGARSVQGPV-VTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789) | e value: 1.7e-115 | % identity: 58.27
Query:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + +QLRRAQSDVS   S    P                            EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYLSKTTTKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT  +  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTTKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPELKTLRSCDL
        D SSDS+ ER                 +S R+D IS RDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK  +LA QFPEL TLRSCDL
Subjt:  DGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPELKTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDW
        L SSWFSVAWYPIYRIPTGPTL+DLDACFLT+H L +  GG  S Q   +T P E +   KMSLPVFGLASYKFRGSLWTP GG E QL NSL Q A+ W
Subjt:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDW

Query:  LRERQVNHPDFIFFSRR
        L    V+HPDF+FF RR
Subjt:  LRERQVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789) | e value: 8.1e-89 | % identity: 57.48
Query:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + +QLRRAQSDVS   S    P                            EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYLSKTTTKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT  +  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTTKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPELKTLRSCDL
        D SSDS+ ER                 +S R+D IS RDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK  +LA QFPEL TLRSCDL
Subjt:  DGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPELKTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGG
        L SSWFSVAWYPIYRIPTGPTL+DLDACFLT+H L +  GG
Subjt:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGG

AT4G16100.1 Protein of unknown function (DUF789) | e value: 3.0e-83 | % identity: 44.91
Query:  GDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSI---AMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKT
        G++RFYNP   R+  Q R+ ++L   + +    ++  +    +    +E +  + C     S+     S      ++ SNL RFL    P V  Q+L  T
Subjt:  GDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSI---AMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKT

Query:  TTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY
        ++KGWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++    R+ GE+SD D  RD SSDGS+D         
Subjt:  TTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY

Query:  MGKQLSHHHLFSELSHRMDSISFRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYP
                    ELS  +   S  ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DK +NL+ QFP L+T RSCDL PSSW SVAWYP
Subjt:  MGKQLSHHHLFSELSHRMDSISFRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYP

Query:  IYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAEDWLRERQVNHPDF
        IYRIP G +L++LDACFLTFH LS+P  G  + +G      S+     K+ LP FGLASYKF+ S W+P     E Q   +LL+ AE+WLR  +V  PDF
Subjt:  IYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAEDWLRERQVNHPDF

Query:  IFF
          F
Subjt:  IFF

AT5G49220.1 Protein of unknown function (DUF789) | e value: 1.3e-75 | % identity: 43.22
Query:  GLQFARGC--GDDRFYNPTKARRAHQGRQ-NEQLRRAQSDVSAGQSPLVKPSAVSSVI--RETEYGDGCEELPKSIAMSAFE-------------PVVSS
        G+  AR    G++RFYNP   RR  Q  Q  +Q+R  Q      +  + K    ++ +  R T  G G  E    + +S  E              V+S 
Subjt:  GLQFARGC--GDDRFYNPTKARRAHQGRQ-NEQLRRAQSDVSAGQSPLVKPSAVSSVI--RETEYGDGCEELPKSIAMSAFE-------------PVVSS

Query:  LSNLERFLQSIAPSVPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQP
         SNL+RFL+   P VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K R P
Subjt:  LSNLERFLQSIAPSVPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQP

Query:  GEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAF
          D+     + SS+GSS+S        +G+              ++ IS +DQ   +    SS EAE  N QG+LLFE+LE + P+ REPLA+K ++LA 
Subjt:  GEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAF

Query:  QFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEW
        + PEL T RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LS+     +S  G   + PS      K+ LP FGLASYK + S+W  N   E 
Subjt:  QFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEW

Query:  QLANSLLQDAEDWLRERQVNHPDFIFFS
        Q   SLLQ A+ WL+  QV+HPD+ FF+
Subjt:  QLANSLLQDAEDWLRERQVNHPDFIFFS


Sequences
CDS sequence
ATGTTAGGTGCAGGCTTGCAGTTTGCTCGTGGTTGTGGTGATGATAGGTTTTACAATCCGACGAAAGCTCGTAGGGCGCATCAGGGCCGTCAAAATGAACAGCTCCGGAG
AGCTCAGAGCGACGTTTCTGCTGGTCAATCCCCTCTTGTTAAACCGAGCGCGGTATCCTCGGTGATTAGAGAAACGGAATACGGCGATGGGTGTGAAGAGCTCCCTAAAT
CCATTGCGATGTCGGCTTTTGAGCCAGTGGTGTCGTCGTTGAGTAATCTCGAGCGGTTCTTGCAGTCCATCGCGCCATCTGTACCTGCACAGTACCTCTCAAAGACAACG
ACGAAGGGTTGGAGAACCTGCGACGTGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGT
ATTAAACGACAGTGACAGTGTTGTCCAATATTATGTACCATATTTATCTGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTGAGG
ACAGTGATAGTGACTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAATACATGGGGAAGCAACTCAGCCATCACCATTTATTTTCT
GAGCTTTCTCATAGAATGGATAGTATATCTTTTCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTCTCAATTCTCAAGGCCAGCTACT
ATTTGAGCATCTTGAACGTGATTTGCCTTATAGTCGCGAACCTTTGGCAGATAAGGCAAATCTTGCTTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGTGATCTAT
TGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAGGGATCTGGATGCCTGCTTCCTCACCTTTCATTATTTGTCTTCG
CCAATGGGAGGGGCACGTAGTGTTCAAGGTCCTGTAGTAACGTATCCTAGTGAGATAGATGGTATCCCTAAGATGTCCCTACCAGTTTTTGGTCTAGCTTCATACAAGTT
TAGAGGGTCTTTATGGACTCCAAATGGTGGATACGAGTGGCAATTGGCAAATTCACTTTTACAGGATGCTGAGGATTGGTTAAGAGAACGTCAAGTAAATCACCCTGACT
TCATCTTCTTCAGCCGAAGGTGA
mRNA sequence
ATGTTAGGTGCAGGCTTGCAGTTTGCTCGTGGTTGTGGTGATGATAGGTTTTACAATCCGACGAAAGCTCGTAGGGCGCATCAGGGCCGTCAAAATGAACAGCTCCGGAG
AGCTCAGAGCGACGTTTCTGCTGGTCAATCCCCTCTTGTTAAACCGAGCGCGGTATCCTCGGTGATTAGAGAAACGGAATACGGCGATGGGTGTGAAGAGCTCCCTAAAT
CCATTGCGATGTCGGCTTTTGAGCCAGTGGTGTCGTCGTTGAGTAATCTCGAGCGGTTCTTGCAGTCCATCGCGCCATCTGTACCTGCACAGTACCTCTCAAAGACAACG
ACGAAGGGTTGGAGAACCTGCGACGTGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGT
ATTAAACGACAGTGACAGTGTTGTCCAATATTATGTACCATATTTATCTGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTGAGG
ACAGTGATAGTGACTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAATACATGGGGAAGCAACTCAGCCATCACCATTTATTTTCT
GAGCTTTCTCATAGAATGGATAGTATATCTTTTCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTCTCAATTCTCAAGGCCAGCTACT
ATTTGAGCATCTTGAACGTGATTTGCCTTATAGTCGCGAACCTTTGGCAGATAAGGCAAATCTTGCTTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGTGATCTAT
TGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAGGGATCTGGATGCCTGCTTCCTCACCTTTCATTATTTGTCTTCG
CCAATGGGAGGGGCACGTAGTGTTCAAGGTCCTGTAGTAACGTATCCTAGTGAGATAGATGGTATCCCTAAGATGTCCCTACCAGTTTTTGGTCTAGCTTCATACAAGTT
TAGAGGGTCTTTATGGACTCCAAATGGTGGATACGAGTGGCAATTGGCAAATTCACTTTTACAGGATGCTGAGGATTGGTTAAGAGAACGTCAAGTAAATCACCCTGACT
TCATCTTCTTCAGCCGAAGGTGA
Protein sequence
MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKTT
TKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFS
ELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSS
PMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR