CuGenDBv2

Lag0028872 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028872
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF789)
Genome location: chr8:32163228..32166434
RNA-Seq Expression: Lag0028872
Synteny: Lag0028872
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
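
The genome location in the record above uses a `chromosome:start..end` form. A minimal sketch of splitting it into its parts and computing the span follows. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chr8:32163228..32166434' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


chrom, start, end = parse_location("chr8:32163228..32166434")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene model spans 3,207 bp on chr8.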


Homology
GenBank top hits (e value, % identity, alignment)
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.1e-223; % identity: 93.43)
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSA Q PV+KPT VSSVIRETE GDGCE LPKSIA+SAFEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRRM+R+SLRDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK+SDL+F+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGGA SVQGPVVTYPS+IDG P+M+LPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLR+R V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa] (e value: 7.6e-222; % identity: 93.43)
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARR HQGRQ DQLRRAQSDVSAGQS VVKP+ VSSVIRETE G+GCE+LPKSIA+S FEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRRMD IS RDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKTLRSCDLLPSSWF
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLS+P GGA SVQ PVVTYPSEIDG PKM+LPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLR+RQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_022992986.1 uncharacterized protein LOC111489147 [Cucurbita maxima] (e value: 1.3e-221; % identity: 91.97)
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        M GAGLQFGRGCGDDRFYNPTKARR+HQGRQNDQLRR QSDVSA +SPV+KPT VSS+IRETE GDGCE+LPKSIA+SAFEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERA+KYMG QLNHHHLSSELSRRM+R+SLRDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGGA SVQGPVVTYPS+IDG P+M+LPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDA+DWLR+R V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo] (e value: 3.1e-223; % identity: 93.19)
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSA QSPV+KPT VSSVIRETE GDGCE+LPKSIA+SAFEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPER LKY G QLNHHHLSSELSRRM+R+SLRDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGGA SVQGPVVTYPS+IDG P+M+LPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLR+R V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida] (e value: 3.9e-226; % identity: 94.89)
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSP+VKP +VSSVIRETE GDGCE+LPKSIA+SAFEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSEL RRMDRIS RDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTL+DLDACFLTFH+LS+PMGGA SVQGPVVTYPSEIDG PKM+LPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAE+WLR RQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CT52 uncharacterized protein LOC103504597 (e value: 4.1e-221; % identity: 92.94)
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGD RFYNPTKARR HQGRQ DQLRRAQSDVSAGQS VVKP+ VSSVIRETE G+GCE+LPKSIA+S FEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRRMD IS RDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLS+P GGA SVQ PVVTYPSEIDG PKM+LPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLR+RQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein (e value: 3.7e-222; % identity: 93.43)
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARR HQGRQ DQLRRAQSDVSAGQS VVKP+ VSSVIRETE G+GCE+LPKSIA+S FEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRRMD IS RDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKTLRSCDLLPSSWF
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLS+P GGA SVQ PVVTYPSEIDG PKM+LPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLR+RQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC111445107 (e value: 8.2e-222; % identity: 92.7)
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSA QSPV+KPT VSSVIRETE GDGCE+LP SIA+SAFEPVVSSLSNL+RFL SI PS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRR +R+SLRDQLI LQEDC SDEAESLNSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGGA SVQGPVVTYPS+IDG P+M+LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQDAEDWLR+R V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1I465 uncharacterized protein LOC111470464 (e value: 6.5e-219; % identity: 91.97)
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        MLGAGLQFGRGCGDDRFYN TKAR+ HQGRQNDQLRRAQSDVSAGQSPVVKPT VSSV RETE+GD C++LPKSI++SAFEPVVSSLSNLQRFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTT+KGWRTCD EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGESLKSSAK RQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        E ERALKYM  QLNHH+LSSELSRRMDRISLRDQLI LQEDCSSDEAES N QGQLLFEHLERDLPYSREPLADKISDLAFQFPEL+TLRSCDLLPSSWF
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGG  SVQGPV+TYPSEIDG PKM+LPVFGLASYKFRGSLWTPNG YEWQLA SLLQDAEDWLRQRQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDF+FF RR
Subjt:  NHPDFIFFSRR
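
In these BLAST-style alignments, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A minimal sketch of tallying those columns for a single block follows, using the final block of the A0A6J1I465 alignment above. `midline_stats` is a hypothetical helper name, and the whole-alignment % identity reported for each hit would require summing these counts over every block.

```python
def midline_stats(query, midline):
    """Count identical, conservative (+) and remaining columns in one alignment block."""
    # Pad the midline in case trailing spaces were stripped from the text.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = midline.count("+")
    other = len(query) - identical - positive
    return identical, positive, other


query = "NHPDFIFFSRR"
midline = "NHPDF+FF RR"
identical, positive, other = midline_stats(query, midline)
print(identical, positive, other)
print(f"{100 * identical / len(query):.2f}% identity in this block")
```

For this block the tally is 9 identical columns, 1 conservative substitution, and 1 mismatch out of 11.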

A0A6J1JV26 uncharacterized protein LOC111489147 (e value: 6.3e-222; % identity: 91.97)
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS
        M GAGLQFGRGCGDDRFYNPTKARR+HQGRQNDQLRR QSDVSA +SPV+KPT VSS+IRETE GDGCE+LPKSIA+SAFEPVVSSLSNL+RFL SIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPS

Query:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERA+KYMG QLNHHHLSSELSRRM+R+SLRDQLI LQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGGA SVQGPVVTYPS+IDG P+M+LPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDA+DWLR+R V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) (e value: 1.3e-110; % identity: 59.53)
Query:  QLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPSVPAQYLSKTTMKGWRTCDVEFQ-PYFVLGDLWE
        QL+RAQ DVS G          SS  ++ ENG          A+       +S SN++RFL S+ PSVPA YLSKT ++     DVE Q PYF+LGD+WE
Subjt:  QLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPSVPAQYLSKTTMKGWRTCDVEFQ-PYFVLGDLWE

Query:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRI
        SF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R+ GE+S+SDFRDSSS+GSS SE ER L Y            ++S RMD++
Subjt:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRI

Query:  SLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYL
        SLR    E QED SSD+ E L+SQG+L+FE+LERDLPY REP ADK+SDLA +FPELKTLRSCDLLPSSWFSVAWYPIY+IPTGPTLKDLDACFLT+H L
Subjt:  SLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYL

Query:  STPMGGASSVQGPV-VTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFSRR
         TP  G     G + V  P E     KM LPVFGLASYK RGS+WT  GG   QLANSL Q A++WLR RQVNHPDFIFF RR
Subjt:  STPMGGASSVQGPV-VTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789) (e value: 2.2e-118; % identity: 59.71)
Query:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + DQLRRAQSDVS   S    P                +QL         EP   S SNL RFL S+ P
Subjt:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAP

Query:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDL
        D SSDS+ ER                 +S R+D ISLRDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSCDL
Subjt:  DGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDW
        L SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L T  GG  S Q   +T P E +   KM+LPVFGLASYKFRGSLWTP GG E QL NSL Q A+ W
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDW

Query:  LRQRQVNHPDFIFFSRR
        L    V+HPDF+FF RR
Subjt:  LRQRQVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789) (e value: 6.0e-92; % identity: 59.53)
Query:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + DQLRRAQSDVS   S    P                +QL         EP   S SNL RFL S+ P
Subjt:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAP

Query:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTMKGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDL
        D SSDS+ ER                 +S R+D ISLRDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSCDL
Subjt:  DGSSDSEPERALKYMGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGG
        L SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L T  GG
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGG

AT4G16100.1 Protein of unknown function (DUF789) (e value: 6.5e-86; % identity: 45.77)
Query:  GDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSI---AVSAFEPVVSSLSNLQRFLHSIAPSVPAQYLSKT
        G++RFYNP   R+  Q R+  +L   + +    ++  +    +    +E +  + C     S+     S      ++ SNL RFL    P V  Q+L  T
Subjt:  GDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSI---AVSAFEPVVSSLSNLQRFLHSIAPSVPAQYLSKT

Query:  TMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY
        + KGWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++    R+ GE+SD D  RD SSDGS+D         
Subjt:  TMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY

Query:  MGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPI
                    ELS+ + R SL ++        SSDE+E S NS G+L+FE+LE  +P+ REPL DKIS+L+ QFP L+T RSCDL PSSW SVAWYPI
Subjt:  MGTQLNHHHLSSELSRRMDRISLRDQLIELQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPI

Query:  YRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAEDWLRQRQVNHPDFI
        YRIP G +L++LDACFLTFH LSTP  G S+ +G      S+     K+ LP FGLASYKF+ S W+P     E Q   +LL+ AE+WLR+ +V  PDF 
Subjt:  YRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAEDWLRQRQVNHPDFI

Query:  FF
         F
Subjt:  FF

AT5G49220.1 Protein of unknown function (DUF789) (e value: 7.7e-79; % identity: 45.35)
Query:  GDDRFYNPTKARRAHQGRQ-NDQLRRAQSDVSAGQSPVVKPTMVSSVI--RETENGDGCEQLPKSIAVSAFE-------------PVVSSLSNLQRFLHS
        G++RFYNP   RR  Q  Q   Q+R  Q      +  + K    ++ +  R T  G G  +    + VS  E              V+S  SNL RFL  
Subjt:  GDDRFYNPTKARRAHQGRQ-NDQLRRAQSDVSAGQSPVVKPTMVSSVI--RETENGDGCEQLPKSIAVSAFE-------------PVVSSLSNLQRFLHS

Query:  IAPSVPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRD
          P VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K R P  D+     +
Subjt:  IAPSVPAQYLSKTTMKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRD

Query:  SSSDGSSDSEPERALKYMGTQLNHHHLSSELS-RRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLR
         SS+GSS+S                 L  +LS   ++RISL+DQ   +    SS EAE  N QG+LLFE+LE + P+ REPLA+KISDLA + PEL T R
Subjt:  SSSDGSSDSEPERALKYMGTQLNHHHLSSELS-RRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLR

Query:  SCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQD
        SCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LST      S  G   + PS      K+ LP FGLASYK + S+W  N   E Q   SLLQ 
Subjt:  SCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQD

Query:  AEDWLRQRQVNHPDFIFFS
        A+ WL++ QV+HPD+ FF+
Subjt:  AEDWLRQRQVNHPDFIFFS


Sequences
CDS sequence
ATGTTAGGTGCAGGCTTGCAGTTTGGTCGTGGTTGTGGTGACGATAGATTTTACAATCCGACGAAAGCTCGTAGGGCGCATCAGGGCCGTCAAAATGATCAGCTCCGGAG
AGCTCAGAGCGACGTTTCTGCAGGCCAATCCCCTGTGGTTAAACCAACCATGGTGTCCTCCGTCATTAGAGAAACCGAAAACGGCGATGGGTGTGAACAGCTCCCCAAAT
CCATTGCGGTGTCGGCTTTTGAGCCAGTGGTCTCGTCGCTGAGTAATCTGCAGCGTTTTTTGCACTCCATCGCGCCATCTGTTCCTGCACAGTACCTCTCAAAGACAACG
ATGAAGGGTTGGAGAACCTGTGACGTGGAATTTCAACCATACTTTGTTCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGT
ATTAAACGACAGTGACAGTGTTGTCCAGTATTATGTTCCATATTTATCCGGTATACAGATATATGGTGAATCTTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTGAGG
ACAGTGATAGTGATTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAGTACATGGGGACACAACTCAATCATCACCATTTATCTTCT
GAGCTTTCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTAATTGAACTTCAAGAAGACTGCTCTAGTGACGAGGCTGAATCTCTTAACTCTCAAGGCCAGCTACT
ATTTGAGCATCTTGAACGTGATTTGCCTTATAGTCGCGAACCTTTGGCAGATAAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGTGATC
TATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGTTTCCTCACCTTTCATTATTTGTCT
ACGCCAATGGGAGGAGCAAGCAGTGTTCAAGGTCCTGTAGTAACATATCCTAGTGAGATAGATGGTTTCCCTAAGATGGCCCTACCAGTTTTTGGTCTAGCTTCATACAA
GTTTAGAGGGTCTTTATGGACTCCAAATGGTGGATACGAATGGCAATTGGCAAATTCACTTTTGCAGGATGCTGAGGATTGGTTAAGACAGCGTCAAGTAAATCACCCTG
ACTTCATCTTCTTCAGCCGACGATGA
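
The CDS above can be checked against the protein sequence listed on this page by translating it codon by codon. A minimal sketch follows, assuming the standard genetic code (the page does not name a translation table). `translate` is a hypothetical helper name, and the two inputs are the first 30 and last 12 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line-wrapped input
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGTTAGGTGCAGGCTTGCAGTTTGGTCGT"))  # start of the CDS
print(translate("AGCCGACGATGA"))                    # end of the CDS, including the stop
```

The first call yields `MLGAGLQFGR` and the second `SRR*`, which agree with the beginning and end of the protein sequence on this page.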
mRNA sequence
ATGTTAGGTGCAGGCTTGCAGTTTGGTCGTGGTTGTGGTGACGATAGATTTTACAATCCGACGAAAGCTCGTAGGGCGCATCAGGGCCGTCAAAATGATCAGCTCCGGAG
AGCTCAGAGCGACGTTTCTGCAGGCCAATCCCCTGTGGTTAAACCAACCATGGTGTCCTCCGTCATTAGAGAAACCGAAAACGGCGATGGGTGTGAACAGCTCCCCAAAT
CCATTGCGGTGTCGGCTTTTGAGCCAGTGGTCTCGTCGCTGAGTAATCTGCAGCGTTTTTTGCACTCCATCGCGCCATCTGTTCCTGCACAGTACCTCTCAAAGACAACG
ATGAAGGGTTGGAGAACCTGTGACGTGGAATTTCAACCATACTTTGTTCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGT
ATTAAACGACAGTGACAGTGTTGTCCAGTATTATGTTCCATATTTATCCGGTATACAGATATATGGTGAATCTTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTGAGG
ACAGTGATAGTGATTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAGTACATGGGGACACAACTCAATCATCACCATTTATCTTCT
GAGCTTTCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTAATTGAACTTCAAGAAGACTGCTCTAGTGACGAGGCTGAATCTCTTAACTCTCAAGGCCAGCTACT
ATTTGAGCATCTTGAACGTGATTTGCCTTATAGTCGCGAACCTTTGGCAGATAAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGTGATC
TATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGTTTCCTCACCTTTCATTATTTGTCT
ACGCCAATGGGAGGAGCAAGCAGTGTTCAAGGTCCTGTAGTAACATATCCTAGTGAGATAGATGGTTTCCCTAAGATGGCCCTACCAGTTTTTGGTCTAGCTTCATACAA
GTTTAGAGGGTCTTTATGGACTCCAAATGGTGGATACGAATGGCAATTGGCAAATTCACTTTTGCAGGATGCTGAGGATTGGTTAAGACAGCGTCAAGTAAATCACCCTG
ACTTCATCTTCTTCAGCCGACGATGA
Protein sequence
MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPVVKPTMVSSVIRETENGDGCEQLPKSIAVSAFEPVVSSLSNLQRFLHSIAPSVPAQYLSKTT
MKGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGTQLNHHHLSS
ELSRRMDRISLRDQLIELQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLS
TPMGGASSVQGPVVTYPSEIDGFPKMALPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFSRR