; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G006220 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G006220
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationchr09:6862254..6866567
RNA-Seq ExpressionLsi09G006220
SyntenyLsi09G006220
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma]2.7e-21786.2Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQN+QLRRAQSDVSA Q P++KP+ VSSVIRETEYGDGCE+LPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN
        VPAQY SKTT KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSR                     
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN

Query:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
                  QPGEDSDSDFRDSSSDGSSDSEPERALKYMG QL+HHHL SELS RM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
Subjt:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR

Query:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
        EPLADK ++L+F+FPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFR
Subjt:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTPNGGYEWQLANSLLQDAEDWLRER VNHPDFIFFSRR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa]1.4e-21887.56Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARR HQGRQ +QLRRAQSDVSAGQS +VKPSAVSSVIRETE G+GCEELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN
        VPAQYLSKTT KGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSR                     
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN

Query:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
                  QPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQL+HHHL SELS RMD+ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
Subjt:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR

Query:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
        EPLADK ++LAFQFP+LKTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFR
Subjt:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTPNGGYEWQLANSLL DAEDWLRERQVNHPDFIFFSRR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

XP_008467187.1 PREDICTED: uncharacterized protein LOC103504597 [Cucumis melo]1.6e-21787.1Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGD RFYNPTKARR HQGRQ +QLRRAQSDVSAGQS +VKPSAVSSVIRETE G+GCEELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN
        VPAQYLSKTT KGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSR                     
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN

Query:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
                  QPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQL+HHHL SELS RMD+ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
Subjt:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR

Query:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
        EPLADK ++LAFQFP+LKT+RSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFR
Subjt:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTPNGGYEWQLANSLL DAEDWLRERQVNHPDFIFFSRR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo]1.6e-21786.43Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQN+QLRRAQSDVSA QSP++KP+ VSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN
        VPAQY SKTT KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSR                     
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN

Query:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
                  QPGEDSDSDFRDSSSDGSSDSEPER LKY G QL+HHHL SELS RM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
Subjt:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR

Query:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
        EPLADK ++LAF+FPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFR
Subjt:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTPNGGYEWQLANSLLQDAEDWLRER VNHPDFIFFSRR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida]1.3e-22489.59Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFARGCGDDRFYNPTKARRAHQGRQN+QLRRAQSDVSAGQSPLVKP  VSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN
        VPAQYLSKTT KGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSR                     
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN

Query:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
                  QPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQL+HHHL SEL  RMD ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
Subjt:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR

Query:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
        EPLADK ++LAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFH+LSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
Subjt:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTPNGGYEWQLANSLLQDAE+WLR+RQVNHPDFIFFSRR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

TrEMBL top hitse value%identityAlignment
A0A0A0KT13 Uncharacterized protein8.8e-21486.2Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF R  GDDRFYNPTKARR HQGRQ +QLRRAQSDVSAGQS +VK SAVSSVIRE+E GDGCEELPKSIA S FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN
        VPAQYL+KTT KGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSR                     
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN

Query:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
                  QPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQL+HH L SELS RMD++SFRDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSR
Subjt:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR

Query:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
        EPLADK ++LAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSSP+GGARSVQ PVVTYPSEIDG+PKMSLPVFGLASYKFR
Subjt:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

A0A1S3CT52 uncharacterized protein LOC1035045977.7e-21887.1Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGD RFYNPTKARR HQGRQ +QLRRAQSDVSAGQS +VKPSAVSSVIRETE G+GCEELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN
        VPAQYLSKTT KGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSR                     
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN

Query:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
                  QPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQL+HHHL SELS RMD+ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
Subjt:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR

Query:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
        EPLADK ++LAFQFP+LKT+RSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFR
Subjt:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTPNGGYEWQLANSLL DAEDWLRERQVNHPDFIFFSRR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein7.0e-21987.56Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARR HQGRQ +QLRRAQSDVSAGQS +VKPSAVSSVIRETE G+GCEELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN
        VPAQYLSKTT KGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSR                     
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN

Query:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
                  QPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQL+HHHL SELS RMD+ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
Subjt:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR

Query:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
        EPLADK ++LAFQFP+LKTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFR
Subjt:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTPNGGYEWQLANSLL DAEDWLRERQVNHPDFIFFSRR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC1114451071.6e-21585.75Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQN+QLRRAQSDVSA QSP++KP+ VSSVIRETEYGDGCEELP SIAMSAFEPVVSSLSNLERFLQSI PS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN
        VPAQY SKTT KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSR                     
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN

Query:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
                  QPGEDSDSDFRDSSSDGSSDSEPERALKYMG QL+HHHL SELS R + +S RDQLIGLQEDC SDEAESLNSQGQLLFEHLERDLPYSR
Subjt:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR

Query:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
        EPLADK ++LAF+FPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFR
Subjt:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTPNGG+EWQLANSLLQDAEDWLRER VNHPDFIFFSRR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

A0A6J1JV26 uncharacterized protein LOC1114891474.2e-21685.29Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        M GAGLQF RGCGDDRFYNPTKARR+HQGRQN+QLRR QSDVSA +SP++KP+ VSS+IRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN
        VPAQY SKTT KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSR                     
Subjt:  VPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIAN

Query:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
                  QPGEDSDSDFRDSSSDGSSDSEPERA+KYMG QL+HHHL SELS RM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR
Subjt:  IWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSR

Query:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR
        EPLADK ++LAF+FPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFR
Subjt:  EPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFR

Query:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        GSLWTPNGGYEWQLANSLLQDA+DWLRER VNHPDFIFFSRR
Subjt:  GSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)1.8e-10253.86Show/hide
Query:  QLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKTTTKGWRTCDVEFQ-PYFVLGDLWE
        QL+RAQ DVS G          SS  ++ E G          A+       +S SN+ERFL S+ PSVPA YLSKT  +     DVE Q PYF+LGD+WE
Subjt:  QLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKTTTKGWRTCDVEFQ-PYFVLGDLWE

Query:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRMVIELFKDFSWSKELLELIANIWFSHVVIGKQPGEDSDSDFRDSSSDGSS
        SF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R                               + GE+S+SDFRDSSS+GSS
Subjt:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRMVIELFKDFSWSKELLELIANIWFSHVVIGKQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
         SE ER L Y  +Q         +S RMD +S R +    QED SSD+ E L+SQG+L+FE+LERDLPY REP ADK ++LA +FPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPV-VTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRE
        WFSVAWYPIY+IPTGPTL+DLDACFLT+H L +P  G     G + V  P E   + KM LPVFGLASYK RGS+WT  GG   QLANSL Q A++WLR 
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPV-VTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRE

Query:  RQVNHPDFIFFSRR
        RQVNHPDFIFF RR
Subjt:  RQVNHPDFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789)2.1e-11154.24Show/hide
Query:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + +QLRRAQSDVS   S    P                            EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYLSKTTTKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRMVIELFKDFSWSKEL
        SVPAQ+LSKT  +  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR               
Subjt:  SVPAQYLSKTTTKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRMVIELFKDFSWSKEL

Query:  LELIANIWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLER
                        +PG+ SDSDFRDSSSD SSDS+ ER                 +S R+D IS RDQ    QED SSD+ E L SQG+L+FE+LER
Subjt:  LELIANIWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLER

Query:  DLPYSREPLADKA-NLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGL
        DLPY REP ADK  +LA QFPEL TLRSCDLL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+H L +  GG  S Q   +T P E +   KMSLPVFGL
Subjt:  DLPYSREPLADKA-NLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGL

Query:  ASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR
        ASYKFRGSLWTP GG E QL NSL Q A+ WL    V+HPDF+FF RR
Subjt:  ASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789)1.0e-8452.69Show/hide
Query:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + +QLRRAQSDVS   S    P                            EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYLSKTTTKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRMVIELFKDFSWSKEL
        SVPAQ+LSKT  +  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR               
Subjt:  SVPAQYLSKTTTKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRMVIELFKDFSWSKEL

Query:  LELIANIWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLER
                        +PG+ SDSDFRDSSSD SSDS+ ER                 +S R+D IS RDQ    QED SSD+ E L SQG+L+FE+LER
Subjt:  LELIANIWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLER

Query:  DLPYSREPLADKA-NLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGG
        DLPY REP ADK  +LA QFPEL TLRSCDLL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+H L +  GG
Subjt:  DLPYSREPLADKA-NLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGG

AT4G16100.1 Protein of unknown function (DUF789)1.3e-7941.94Show/hide
Query:  GDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSI---AMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKT
        G++RFYNP   R+  Q R+ ++L   + +    ++  +    +    +E +  + C     S+     S      ++ SNL RFL    P V  Q+L  T
Subjt:  GDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSI---AMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKT

Query:  TTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIANIWFSHVVIG
        ++KGWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++    R V                            
Subjt:  TTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIANIWFSHVVIG

Query:  KQPGEDSDSDF-RDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADK
           GE+SD D  RD SSDGS+D                     ELS  +   S  ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DK
Subjt:  KQPGEDSDSDF-RDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADK

Query:  -ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP
         +NL+ QFP L+T RSCDL PSSW SVAWYPIYRIP G +L++LDACFLTFH LS+P  G  + +G      S+     K+ LP FGLASYKF+ S W+P
Subjt:  -ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTP

Query:  NGGY-EWQLANSLLQDAEDWLRERQVNHPDFIFF
             E Q   +LL+ AE+WLR  +V  PDF  F
Subjt:  NGGY-EWQLANSLLQDAEDWLRERQVNHPDFIFF

AT5G49220.1 Protein of unknown function (DUF789)1.7e-7139.87Show/hide
Query:  GLQFARGC--GDDRFYNPTKARRAHQGRQ-NEQLRRAQSDVSAGQSPLVKPSAVSSVI--RETEYGDGCEELPKSIAMSAFE-------------PVVSS
        G+  AR    G++RFYNP   RR  Q  Q  +Q+R  Q      +  + K    ++ +  R T  G G  E    + +S  E              V+S 
Subjt:  GLQFARGC--GDDRFYNPTKARRAHQGRQ-NEQLRRAQSDVSAGQSPLVKPSAVSSVI--RETEYGDGCEELPKSIAMSAFE-------------PVVSS

Query:  LSNLERFLQSIAPSVPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMV
         SNL+RFL+   P VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK        
Subjt:  LSNLERFLQSIAPSVPAQYLSKTTTKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMV

Query:  IELFKDFSWSKELLELIANIWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESL
                                    + P  D+     + SS+GSS+S        +G+              ++ IS +DQ   +    SS EAE  
Subjt:  IELFKDFSWSKELLELIANIWFSHVVIGKQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESL

Query:  NSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEI
        N QG+LLFE+LE + P+ REPLA+K ++LA + PEL T RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LS+     +S  G   + PS  
Subjt:  NSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEI

Query:  DGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFS
            K+ LP FGLASYK + S+W  N   E Q   SLLQ A+ WL+  QV+HPD+ FF+
Subjt:  DGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGTGCAGGCTTGCAGTTTGCTCGTGGTTGTGGTGATGATAGGTTTTACAATCCGACGAAAGCTCGTAGGGCGCATCAGGGCCGTCAAAATGAACAGCTCCGGAG
AGCTCAGAGCGACGTTTCTGCTGGTCAATCCCCTCTTGTTAAACCGAGCGCGGTATCCTCGGTGATTAGAGAAACGGAATACGGCGATGGGTGTGAAGAGCTCCCTAAAT
CCATTGCGATGTCGGCTTTTGAGCCAGTGGTGTCGTCGTTGAGTAATCTCGAGCGGTTCTTGCAGTCCATCGCGCCATCTGTACCTGCACAGTACCTCTCAAAGACAACG
ACGAAGGGTTGGAGAACCTGCGACGTGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGT
ATTAAACGACAGTGACAGTGTTGTCCAATATTATGTACCATATTTATCTGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCAAAGTCAAGAATGGTTATTGAAC
TGTTCAAGGATTTTTCTTGGTCAAAAGAACTATTGGAGTTGATAGCTAATATTTGGTTCAGCCATGTTGTAATCGGTAAGCAACCAGGTGAGGACAGTGATAGTGACTTC
AGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAATACATGGGGAAGCAACTCAGCCATCACCATTTATTTTCTGAGCTTTCTCATAGAAT
GGATAGTATATCTTTTCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTCTCAATTCTCAAGGCCAGCTACTATTTGAGCATCTTGAAC
GTGATTTGCCTTATAGTCGCGAACCTTTGGCAGATAAGGCAAATCTTGCTTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGTGATCTATTGCCTTCCAGCTGGTTT
TCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAGGGATCTGGATGCCTGCTTCCTCACCTTTCATTATTTGTCTTCGCCAATGGGAGGGGCACG
TAGTGTTCAAGGTCCTGTAGTAACGTATCCTAGTGAGATAGATGGTATCCCTAAGATGTCCCTACCAGTTTTTGGTCTAGCTTCATACAAGTTTAGAGGGTCTTTATGGA
CTCCAAATGGTGGATACGAGTGGCAATTGGCAAATTCACTTTTACAGGATGCTGAGGATTGGTTAAGAGAACGTCAAGTAAATCACCCTGACTTCATCTTCTTCAGCCGA
AGGTGA
mRNA sequenceShow/hide mRNA sequence
CTTCCTTGAAAAAACCGGCGTACTGGTTCTCCTTCATCTCTCTCCTTTTCTCTTCTTCAATCGCTTCGTTTTTCTTCCATTGCTCTCTCTCTCTCTTTCTCTCTCTTTTC
TCTGTTTCGACGCCATTGAATCCCTCCTTCTGTGTATTAGATCCATTGCGTTTCAATCTCTGTAAGATTTTTGCATTCAAATTCATCTTCTCCGATTCGATTCACCCATT
TTTTTGTTCCTTTTGACGAGCCACAGAGGGCTCTGTTTTTCGTCTTTCACATTTTAGAAAGCCGATTGTTTGTTTAATTCTATCTCGCAATCAATCGGACTCTTCTGTGC
TTACGGATTTTCATTCTAATTCGATCAGTCTCTAGCGAGAGATTGTGAATTGCTTTTGCATTTTCGTTGACTTTTATTGAGGATTCTACGACTGCCGATTGCCATTTTCC
AGAGCCATTTTTTTTTCGTTTGGTTTTGGAAATATAATGTTAGGTGCAGGCTTGCAGTTTGCTCGTGGTTGTGGTGATGATAGGTTTTACAATCCGACGAAAGCTCGTAG
GGCGCATCAGGGCCGTCAAAATGAACAGCTCCGGAGAGCTCAGAGCGACGTTTCTGCTGGTCAATCCCCTCTTGTTAAACCGAGCGCGGTATCCTCGGTGATTAGAGAAA
CGGAATACGGCGATGGGTGTGAAGAGCTCCCTAAATCCATTGCGATGTCGGCTTTTGAGCCAGTGGTGTCGTCGTTGAGTAATCTCGAGCGGTTCTTGCAGTCCATCGCG
CCATCTGTACCTGCACAGTACCTCTCAAAGACAACGACGAAGGGTTGGAGAACCTGCGACGTGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAA
GGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGTATTAAACGACAGTGACAGTGTTGTCCAATATTATGTACCATATTTATCTGGTATACAGATATATGGTGAATCCT
TGAAGTCCTCTGCAAAGTCAAGAATGGTTATTGAACTGTTCAAGGATTTTTCTTGGTCAAAAGAACTATTGGAGTTGATAGCTAATATTTGGTTCAGCCATGTTGTAATC
GGTAAGCAACCAGGTGAGGACAGTGATAGTGACTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAATACATGGGGAAGCAACTCAG
CCATCACCATTTATTTTCTGAGCTTTCTCATAGAATGGATAGTATATCTTTTCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTCTCA
ATTCTCAAGGCCAGCTACTATTTGAGCATCTTGAACGTGATTTGCCTTATAGTCGCGAACCTTTGGCAGATAAGGCAAATCTTGCTTTTCAGTTCCCTGAGCTCAAGACA
TTACGAAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAGGGATCTGGATGCCTGCTTCCTCAC
CTTTCATTATTTGTCTTCGCCAATGGGAGGGGCACGTAGTGTTCAAGGTCCTGTAGTAACGTATCCTAGTGAGATAGATGGTATCCCTAAGATGTCCCTACCAGTTTTTG
GTCTAGCTTCATACAAGTTTAGAGGGTCTTTATGGACTCCAAATGGTGGATACGAGTGGCAATTGGCAAATTCACTTTTACAGGATGCTGAGGATTGGTTAAGAGAACGT
CAAGTAAATCACCCTGACTTCATCTTCTTCAGCCGAAGGTGAAATGAAATCCTTGCAACATCTACAATGCTTAAACCTAAAGGTGGGAATCAAGATATCATAGTTCAGTA
CGATGTCGTAATTTGCTCTTTGTGGCGCCGTTCTACTTTCAGAAAATGGAAAGGAAGAAGGAAAAGAACAAAAAAACTAAAAAAAAAAAAGGGAAAAAGAAGGAAAAGAA
AAGTTTGTGATGGGATGGTGGTGAGGTTGGTGGATGGCGACTTAAACAAAAACAAAGGTCAAAAGGGGAAAAGGCAATAAAGAAAAAAGCTGAAATACAGTTGGATGAAG
ATGTAGAAGTAGTAGCAGAAACATTGCCCAAAGTTGGTTCTACTTGCTGCTGCTTGTGCTGCATAGCAGGTTTTATCCTTTCTTTTTTCTCTTTAATTACTGAAAGTTTT
TTTTTTTCTATAAGTTATTAACCGTAGGTGAAATGAAAGGAAAGAAAAAGAAAAAAAAAAAAAAAGAAAGAAAAACAGGCCCTTTTTACTGTCCTGATGATGTTTAGAAT
GTAAACTATGGAGAGGGGATTTTTTTGGTAGGAAGGTTATGTTAGGAGTAAGCTATGGATCCCCCCCAATGTATTTGAATGTTTGTTTATGCTCCTTAGTTGCTATTGAA
AACTGTATAATTTGTTACGAAGAGCTTCGTTATTAATAAACCATTATTCTAGCTCTCT
Protein sequenceShow/hide protein sequence
MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNEQLRRAQSDVSAGQSPLVKPSAVSSVIRETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKTT
TKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRMVIELFKDFSWSKELLELIANIWFSHVVIGKQPGEDSDSDF
RDSSSDGSSDSEPERALKYMGKQLSHHHLFSELSHRMDSISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKANLAFQFPELKTLRSCDLLPSSWF
SVAWYPIYRIPTGPTLRDLDACFLTFHYLSSPMGGARSVQGPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERQVNHPDFIFFSR
R