; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006971 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006971
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationscaffold10:38694739..38699974
RNA-Seq ExpressionSpg006971
SyntenySpg006971
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-22090.35Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSA Q PV+KPT VSSVIRETE GDGCE LPKSIA+SAFEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL
        EPERALKYMG QLNHHHLSSELSRRM+R+SLRDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK              +SDL+F+FPEL
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL

Query:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
        KTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSTP+GGA SVQGPVVTYPS+IDG P+MSLPVFGLASYKFRGSLWTPNGGYEWQLANS
Subjt:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS

Query:  LLQDAEDWLRQRQVNHPDFIFFSRR
        LLQDAEDWLR+R VNHPDFIFFSRR
Subjt:  LLQDAEDWLRQRQVNHPDFIFFSRR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa]2.5e-21990.59Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARR HQGRQ DQLRRAQSDVSAGQS VVKP+ VSSVIRETE G+GCE+LPKSIA+S FEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL
        EPERALKYMG QLNHHHLSSELSRRMD IS RDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK              ISDLAFQFP+L
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL

Query:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
        KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLS+P GGA SVQ PVVTYPSEIDG PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
Subjt:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS

Query:  LLQDAEDWLRQRQVNHPDFIFFSRR
        LL DAEDWLR+RQVNHPDFIFFSRR
Subjt:  LLQDAEDWLRQRQVNHPDFIFFSRR

XP_022992986.1 uncharacterized protein LOC111489147 [Cucurbita maxima]1.2e-21888.94Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        M GAGLQFGRGCGDDRFYNPTKARR+HQGRQNDQLRR QSDVSA +SPV+KPT VSS+IRETE GDGCE+LPKSIA+SAFEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL
        EPERA+KYMG QLNHHHLSSELSRRM+R+SLRDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK              +SDLAF+FPEL
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL

Query:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
        KTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSTP+GGA SVQGPVVTYPS+IDG P+MSLPVFGLASYKFRGSLWTPNGGYEWQLANS
Subjt:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS

Query:  LLQDAEDWLRQRQVNHPDFIFFSRR
        LLQDA+DWLR+R VNHPDFIFFSRR
Subjt:  LLQDAEDWLRQRQVNHPDFIFFSRR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo]2.9e-22090.12Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSA QSPV+KPT VSSVIRETE GDGCE+LPKSIA+SAFEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL
        EPER LKY G QLNHHHLSSELSRRM+R+SLRDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK              +SDLAF+FPEL
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL

Query:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
        KTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSTP+GGA SVQGPVVTYPS+IDG P+MSLPVFGLASYKFRGSLWTPNGGYEWQLANS
Subjt:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS

Query:  LLQDAEDWLRQRQVNHPDFIFFSRR
        LLQDAEDWLR+R VNHPDFIFFSRR
Subjt:  LLQDAEDWLRQRQVNHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida]2.2e-22391.76Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSP+VKP +VSSVIRETE GDGCE+LPKSIA+SAFEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL
        EPERALKYMG QLNHHHLSSEL RRMDRIS RDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK              ISDLAFQFPEL
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL

Query:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
        KTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFH+LS+P+GGA SVQGPVVTYPSEIDG PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
Subjt:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS

Query:  LLQDAEDWLRQRQVNHPDFIFFSRR
        LLQDAE+WLR RQVNHPDFIFFSRR
Subjt:  LLQDAEDWLRQRQVNHPDFIFFSRR

TrEMBL top hitse value%identityAlignment
A0A1S3CT52 uncharacterized protein LOC1035045971.3e-21890.12Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGD RFYNPTKARR HQGRQ DQLRRAQSDVSAGQS VVKP+ VSSVIRETE G+GCE+LPKSIA+S FEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL
        EPERALKYMG QLNHHHLSSELSRRMD IS RDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK              ISDLAFQFP+L
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL

Query:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
        KT+RSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLS+P GGA SVQ PVVTYPSEIDG PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
Subjt:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS

Query:  LLQDAEDWLRQRQVNHPDFIFFSRR
        LL DAEDWLR+RQVNHPDFIFFSRR
Subjt:  LLQDAEDWLRQRQVNHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein1.2e-21990.59Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARR HQGRQ DQLRRAQSDVSAGQS VVKP+ VSSVIRETE G+GCE+LPKSIA+S FEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL
        EPERALKYMG QLNHHHLSSELSRRMD IS RDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK              ISDLAFQFP+L
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL

Query:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
        KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLS+P GGA SVQ PVVTYPSEIDG PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
Subjt:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS

Query:  LLQDAEDWLRQRQVNHPDFIFFSRR
        LL DAEDWLR+RQVNHPDFIFFSRR
Subjt:  LLQDAEDWLRQRQVNHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC1114451077.8e-21989.65Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSA QSPV+KPT VSSVIRETE GDGCE+LP SIA+SAFEPVVSSLSNL+RFL SI PS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL
        EPERALKYMG QLNHHHLSSELSRR +R+SLRDQLI LQEDC SDEAESLNSQGQLLFEHLERDLPYSREPLADK              +SDLAF+FPEL
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL

Query:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
        KTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSTP+GGA SVQGPVVTYPS+IDG P+MSLPVFGLASYKFRGSLWTPNGG+EWQLANS
Subjt:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS

Query:  LLQDAEDWLRQRQVNHPDFIFFSRR
        LLQDAEDWLR+R VNHPDFIFFSRR
Subjt:  LLQDAEDWLRQRQVNHPDFIFFSRR

A0A6J1I465 uncharacterized protein LOC1114704646.1e-21688.94Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYN TKAR+ HQGRQNDQLRRAQSDVSAGQSPVVKPT VSSV RETE+GD C++LPKSI++SAFEPVVSSLSNLQRFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTT+KGWRTCD EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGESLKSSAK RQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL
        E ERALKYM  QLNHH+LSSELSRRMDRISLRDQLI LQEDCSSDEAES N QGQLLFEHLERDLPYSREPLADK              ISDLAFQFPEL
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL

Query:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
        +TLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTP+GG  SVQGPV+TYPSEIDG PKMSLPVFGLASYKFRGSLWTPNG YEWQLA S
Subjt:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS

Query:  LLQDAEDWLRQRQVNHPDFIFFSRR
        LLQDAEDWLRQRQVNHPDF+FF RR
Subjt:  LLQDAEDWLRQRQVNHPDFIFFSRR

A0A6J1JV26 uncharacterized protein LOC1114891475.9e-21988.94Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        M GAGLQFGRGCGDDRFYNPTKARR+HQGRQNDQLRR QSDVSA +SPV+KPT VSS+IRETE GDGCE+LPKSIA+SAFEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL
        EPERA+KYMG QLNHHHLSSELSRRM+R+SLRDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK              +SDLAF+FPEL
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPEL

Query:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS
        KTLRSCDLLPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSTP+GGA SVQGPVVTYPS+IDG P+MSLPVFGLASYKFRGSLWTPNGGYEWQLANS
Subjt:  KTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANS

Query:  LLQDAEDWLRQRQVNHPDFIFFSRR
        LLQDA+DWLR+R VNHPDFIFFSRR
Subjt:  LLQDAEDWLRQRQVNHPDFIFFSRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)5.5e-10857.43Show/hide
Query:  QLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPSVPAQYLSKTTMKGWRTCDVEFQ-PYFVLGDLWE
        QL+RAQ DVS G          SS  ++ ENG          A+       +S SN++RFL S+ PSVPA YLSKT ++     DVE Q PYF+LGD+WE
Subjt:  QLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPSVPAQYLSKTTMKGWRTCDVEFQ-PYFVLGDLWE

Query:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRI
        SF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R+ GE+S+SDFRDSSS+GSS SE ER L Y            ++S RMD++
Subjt:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRI

Query:  SLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPT
        SLR    E QED SSD+ E L+SQG+L+FE+LERDLPY REP ADK              +SDLA +FPELKTLRSCDLLPSSWFSVAWYPIY+IPTGPT
Subjt:  SLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPT

Query:  LKDLDACFLTFHYLSTPIGGASSVQGPV-VTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFSRR
        LKDLDACFLT+H L TP  G     G + V  P E     KM LPVFGLASYK RGS+WT  GG   QLANSL Q A++WLR RQVNHPDFIFF RR
Subjt:  LKDLDACFLTFHYLSTPIGGASSVQGPV-VTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789)1.6e-11557.54Show/hide
Query:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + DQLRRAQSDVS   S    P                            EP   S SNL RFL S+ P
Subjt:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAP

Query:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLA
        D SSDS+ ER                 +S R+D ISLRDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK              + DLA
Subjt:  DGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLA

Query:  FQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYE
         QFPEL TLRSCDLL SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L T  GG  S Q   +T P E +   KMSLPVFGLASYKFRGSLWTP GG E
Subjt:  FQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGYE

Query:  WQLANSLLQDAEDWLRQRQVNHPDFIFFSRR
         QL NSL Q A+ WL    V+HPDF+FF RR
Subjt:  WQLANSLLQDAEDWLRQRQVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789)9.7e-8956.62Show/hide
Query:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + DQLRRAQSDVS   S    P                            EP   S SNL RFL S+ P
Subjt:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAP

Query:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLA
        D SSDS+ ER                 +S R+D ISLRDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK              + DLA
Subjt:  DGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLA

Query:  FQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGG
         QFPEL TLRSCDLL SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L T  GG
Subjt:  FQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGG

AT4G16100.1 Protein of unknown function (DUF789)6.1e-8344.23Show/hide
Query:  GDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSI---AVSAFEPVVSSLSNLQRFLHSIAPSVPAQYLSKT
        G++RFYNP   R+  Q R+  +L   + +    ++  +    +    +E +  + C     S+     S      ++ SNL RFL    P V  Q+L  T
Subjt:  GDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTIVSSVIRETENGDGCEQLPKSI---AVSAFEPVVSSLSNLQRFLHSIAPSVPAQYLSKT

Query:  TMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY
        + KGWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++    R+ GE+SD D  RD SSDGS+D         
Subjt:  TMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY

Query:  MGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPELKTLRSCD
                    ELS+ + R SL ++        SSDE+E S NS G+L+FE+LE  +P+ REPL DK              IS+L+ QFP L+T RSCD
Subjt:  MGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQISDLAFQFPELKTLRSCD

Query:  LLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAE
        L PSSW SVAWYPIYRIP G +L++LDACFLTFH LSTP  G S+ +G      S+     K+ LP FGLASYKF+ S W+P     E Q   +LL+ AE
Subjt:  LLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAE

Query:  DWLRQRQVNHPDFIFF
        +WLR+ +V  PDF  F
Subjt:  DWLRQRQVNHPDFIFF

AT5G49220.1 Protein of unknown function (DUF789)7.2e-7643.88Show/hide
Query:  GDDRFYNPTKARRAHQGRQ-NDQLRRAQSDVSAGQSPVVKPTIVSSVI--RETENGDGCEQLPKSIAVSAFE-------------PVVSSLSNLQRFLHS
        G++RFYNP   RR  Q  Q   Q+R  Q      +  + K    ++ +  R T  G G  +    + VS  E              V+S  SNL RFL  
Subjt:  GDDRFYNPTKARRAHQGRQ-NDQLRRAQSDVSAGQSPVVKPTIVSSVI--RETENGDGCEQLPKSIAVSAFE-------------PVVSSLSNLQRFLHS

Query:  IAPSVPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRD
          P VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K R P  D+     +
Subjt:  IAPSVPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRD

Query:  SSSDGSSDSEPERALKYMGTQLNHHHLSSELS-RRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQI
         SS+GSS+S                 L  +LS   ++RISL+DQ   +    SS EAE  N QG+LLFE+LE + P+ REPLA+K              I
Subjt:  SSSDGSSDSEPERALKYMGTQLNHHHLSSELS-RRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKATLVTLLWFQYFLQI

Query:  SDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPN
        SDLA + PEL T RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LST      S  G   + PS      K+ LP FGLASYK + S+W  N
Subjt:  SDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVFGLASYKFRGSLWTPN

Query:  GGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFS
           E Q   SLLQ A+ WL++ QV+HPD+ FF+
Subjt:  GGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGAAATCTCCGGATCTGGAACGAGAGAGAAAGAAATAGAAAGAAATCGTCGGATCTTCTTGTTGGGATGTTGTCGGGGAAGATGACCGAATCTGCTTATTGGGA
TGTTGTCGGAGAAGGTGGTCTCCGGATTCTAGGACTGCCGATTGCGATTTTCCGGAGCCGTTTTTCGTTTGGTTTTGGAAATATAATGTTAGGTGCAGGCTTGCAGTTTG
GTCGTGGTTGTGGTGATGATAGATTTTACAATCCGACGAAAGCTCGTAGGGCGCATCAGGGCCGTCAAAATGATCAGCTCCGGAGAGCTCAGAGCGACGTTTCTGCAGGC
CAATCCCCTGTGGTTAAACCAACCATAGTGTCCTCCGTGATTAGAGAAACCGAAAACGGCGATGGGTGTGAACAGCTCCCCAAATCCATTGCGGTGTCGGCTTTTGAGCC
AGTGGTGTCGTCGCTGAGTAATCTGCAGCGGTTTTTGCACTCCATCGCGCCATCTGTTCCTGCACAGTACCTCTCAAAGACAACGATGAAGGGTTGGAGAACCTGTGACG
TGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGTATTAAACGACAGTGACAGTGTTGTC
CAGTATTATGTTCCATATTTATCCGGTATACAGATATATGGTGAATCTTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTGAGGACAGTGATAGTGATTTCAGAGATTC
TAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAGTACATGGGGACACAACTCAATCATCACCATTTATCTTCTGAGCTTTCTCGTAGAATGGATAGGA
TATCTTTGCGGGACCAGCTCATTGAACTTCAAGAAGACTGCTCTAGTGACGAGGCTGAATCTCTTAATTCTCAAGGCCAGCTACTATTTGAGCATCTTGAACGTGATTTG
CCTTATAGTCGCGAACCTTTGGCAGATAAGGCAACCCTTGTAACCTTACTCTGGTTTCAATATTTTCTCCAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAGAC
ATTACGAAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGTTTCCTCA
CCTTTCATTATTTGTCTACGCCAATAGGAGGGGCAAGCAGTGTTCAAGGTCCTGTAGTAACGTATCCTAGTGAGATAGATGGTTTCCCTAAGATGTCCCTACCAGTTTTT
GGTCTAGCTTCATACAAGTTTAGAGGGTCTTTATGGACTCCAAATGGCGGATATGAATGGCAATTGGCAAATTCACTTTTGCAGGATGCTGAGGATTGGTTAAGACAGCG
TCAAGTAAATCACCCTGACTTCATCTTCTTCAGCCGACGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGAAATCTCCGGATCTGGAACGAGAGAGAAAGAAATAGAAAGAAATCGTCGGATCTTCTTGTTGGGATGTTGTCGGGGAAGATGACCGAATCTGCTTATTGGGA
TGTTGTCGGAGAAGGTGGTCTCCGGATTCTAGGACTGCCGATTGCGATTTTCCGGAGCCGTTTTTCGTTTGGTTTTGGAAATATAATGTTAGGTGCAGGCTTGCAGTTTG
GTCGTGGTTGTGGTGATGATAGATTTTACAATCCGACGAAAGCTCGTAGGGCGCATCAGGGCCGTCAAAATGATCAGCTCCGGAGAGCTCAGAGCGACGTTTCTGCAGGC
CAATCCCCTGTGGTTAAACCAACCATAGTGTCCTCCGTGATTAGAGAAACCGAAAACGGCGATGGGTGTGAACAGCTCCCCAAATCCATTGCGGTGTCGGCTTTTGAGCC
AGTGGTGTCGTCGCTGAGTAATCTGCAGCGGTTTTTGCACTCCATCGCGCCATCTGTTCCTGCACAGTACCTCTCAAAGACAACGATGAAGGGTTGGAGAACCTGTGACG
TGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGTATTAAACGACAGTGACAGTGTTGTC
CAGTATTATGTTCCATATTTATCCGGTATACAGATATATGGTGAATCTTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTGAGGACAGTGATAGTGATTTCAGAGATTC
TAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAGTACATGGGGACACAACTCAATCATCACCATTTATCTTCTGAGCTTTCTCGTAGAATGGATAGGA
TATCTTTGCGGGACCAGCTCATTGAACTTCAAGAAGACTGCTCTAGTGACGAGGCTGAATCTCTTAATTCTCAAGGCCAGCTACTATTTGAGCATCTTGAACGTGATTTG
CCTTATAGTCGCGAACCTTTGGCAGATAAGGCAACCCTTGTAACCTTACTCTGGTTTCAATATTTTCTCCAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAGAC
ATTACGAAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGTTTCCTCA
CCTTTCATTATTTGTCTACGCCAATAGGAGGGGCAAGCAGTGTTCAAGGTCCTGTAGTAACGTATCCTAGTGAGATAGATGGTTTCCCTAAGATGTCCCTACCAGTTTTT
GGTCTAGCTTCATACAAGTTTAGAGGGTCTTTATGGACTCCAAATGGCGGATATGAATGGCAATTGGCAAATTCACTTTTGCAGGATGCTGAGGATTGGTTAAGACAGCG
TCAAGTAAATCACCCTGACTTCATCTTCTTCAGCCGACGATGA
Protein sequenceShow/hide protein sequence
MERNLRIWNERERNRKKSSDLLVGMLSGKMTESAYWDVVGEGGLRILGLPIAIFRSRFSFGFGNIMLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAG
QSPVVKPTIVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPSVPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVV
QYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDL
PYSREPLADKATLVTLLWFQYFLQISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPIGGASSVQGPVVTYPSEIDGFPKMSLPVF
GLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFSRR