; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg24924 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg24924
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProtein of unknown function (DUF789)
Genome locationCarg_Chr15:5139035..5142582
RNA-Seq ExpressionCarg24924
SyntenyCarg24924
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma]6.2e-240100Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
        EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_022939104.1 uncharacterized protein LOC111445107 [Cucurbita moschata]1.3e-23498.05Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQ PVLKPTTVSSVIRETEYGDGCE+LP SIAMSAFEPVVSSLSNLERFLQSI PS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
        EPERALKYMGNQLNHHHLSSELSRR ERLSLRDQLIGLQEDC SDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDL+FRFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGG+EWQLANSLLQDAEDWLRERCV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_022992986.1 uncharacterized protein LOC111489147 [Cucurbita maxima]1.0e-23497.32Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        M GAGLQFGRGCGDDRFYNPTKARR+HQGRQNDQLRR QSDVSAS+ PVLKPTTVSS+IRETEYGDGCE+LPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYPSKTT+KGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
        EPERA+KYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDL+FRFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDA+DWLRERCV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo]2.4e-23698.54Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQ PVLKPTTVSSVIRETEYGDGCE+LPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYPSKTT+KGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
        EPER LKY GNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDL+FRFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida]1.5e-22593.67Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSA Q P++KP  VSSVIRETEYGDGCE+LPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSEL RRM+R+S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK+SDL+F+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
        SVAWYPIYRIPTGPTLRDLDACFLTFH+LS+PMGGARSVQGPVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAE+WLR+R V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

TrEMBL top hitse value%identityAlignment
A0A1S3CT52 uncharacterized protein LOC1035045971.7e-21991.48Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGD RFYNPTKARR HQGRQ DQLRRAQSDVSA Q  V+KP+ VSSVIRETE G+GCE+LPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRRM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK+SDL+F+FP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLS+P GGARSVQ PVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRER V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein1.5e-22091.97Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARR HQGRQ DQLRRAQSDVSA Q  V+KP+ VSSVIRETE G+GCE+LPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRRM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK+SDL+F+FP+LKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLS+P GGARSVQ PVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRER V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC1114451076.5e-23598.05Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQ PVLKPTTVSSVIRETEYGDGCE+LP SIAMSAFEPVVSSLSNLERFLQSI PS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
        EPERALKYMGNQLNHHHLSSELSRR ERLSLRDQLIGLQEDC SDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDL+FRFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGG+EWQLANSLLQDAEDWLRERCV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1I465 uncharacterized protein LOC1114704641.4e-21689.78Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGDDRFYN TKAR+ HQGRQNDQLRRAQSDVSA Q PV+KPTTVSSV RETE GD C++LPKSI+MSAFEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGESLKSSAK RQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
        E ERALKYM  QLNHH+LSSELSRRM+R+SLRDQLIGLQEDCSSDEAES N QGQLLFEHLERDLPYSREPLADK+SDL+F+FPEL+TLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGG RSVQGPV+TYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNG YEWQLA SLLQDAEDWLR+R V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV

Query:  NHPDFIFFSRR
        NHPDF+FF RR
Subjt:  NHPDFIFFSRR

A0A6J1JV26 uncharacterized protein LOC1114891475.0e-23597.32Show/hide
Query:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS
        M GAGLQFGRGCGDDRFYNPTKARR+HQGRQNDQLRR QSDVSAS+ PVLKPTTVSS+IRETEYGDGCE+LPKSIAMSAFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYPSKTT+KGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF
        EPERA+KYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDL+FRFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDA+DWLRERCV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)1.1e-10657.7Show/hide
Query:  QLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYPSKTTMKGWRTCDAELQ-PYFVLGDLWE
        QL+RAQ DVS            SS  ++ E G          A+       +S SN+ERFL S+ PSVPA Y SKT ++     D E Q PYF+LGD+WE
Subjt:  QLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYPSKTTMKGWRTCDAELQ-PYFVLGDLWE

Query:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGNQLNHHHLSSELSRRMERL
        SF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R+ GE+S+SDFRDSSS+GSS SE ER L Y   Q         +S RM++L
Subjt:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGNQLNHHHLSSELSRRMERL

Query:  SLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYL
        SLR +    QED SSD+ E L+SQG+L+FE+LERDLPY REP ADK+SDL+ RFPELKTLRSCDLLPSSWFSVAWYPIY+IPTGPTL+DLDACFLT+H L
Subjt:  SLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYL

Query:  STPMGGARSVQGPV-VTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCVNHPDFIFFSRR
         TP  G     G + V  P   + + +M LPVFGLASYK RGS+WT  GG   QLANSL Q A++WLR R VNHPDFIFF RR
Subjt:  STPMGGARSVQGPV-VTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCVNHPDFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789)4.6e-11658.03Show/hide
Query:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + DQLRRAQSDVS        P++  S  ++                   EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYPSKTTMKGWRTCD--AELQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+ SKT ++  R  D   +L PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYPSKTTMKGWRTCD--AELQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDL
        D SSDS+ ER                 +S R++ +SLRDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADKV DL+ +FPEL TLRSCDL
Subjt:  DGSSDSEPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDW
        L SSWFSVAWYPIYRIPTGPTL+DLDACFLT+H L T  GG  S Q   +T P + +   +MSLPVFGLASYKFRGSLWTP GG E QL NSL Q A+ W
Subjt:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDW

Query:  LRERCVNHPDFIFFSRR
        L    V+HPDF+FF RR
Subjt:  LRERCVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789)4.3e-9057.77Show/hide
Query:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAP
        MLGAG Q  RG  GDD FY   K RRA+Q  + DQLRRAQSDVS        P++  S  ++                   EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFGRG-CGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYPSKTTMKGWRTCD--AELQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+ SKT ++  R  D   +L PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYPSKTTMKGWRTCD--AELQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDL
        D SSDS+ ER                 +S R++ +SLRDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADKV DL+ +FPEL TLRSCDL
Subjt:  DGSSDSEPERALKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGG
        L SSWFSVAWYPIYRIPTGPTL+DLDACFLT+H L T  GG
Subjt:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGG

AT4G16100.1 Protein of unknown function (DUF789)2.1e-8445.16Show/hide
Query:  GDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSI---AMSAFEPVVSSLSNLERFLQSIAPSVPAQYPSKT
        G++RFYNP   R+  Q R+  +L   + +    +   +    +    +E +  + C     S+     S      ++ SNL RFL    P V  Q+   T
Subjt:  GDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSI---AMSAFEPVVSSLSNLERFLQSIAPSVPAQYPSKT

Query:  TMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY
        + KGWRT + E +PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++    R+ GE+SD D  RD SSDGS+D         
Subjt:  TMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY

Query:  MGNQLNHHHLSSELSRRMERLSLRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWFSVAWYP
                    ELS+ + R SL ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DK+S+LS +FP L+T RSCDL PSSW SVAWYP
Subjt:  MGNQLNHHHLSSELSRRMERLSLRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWFSVAWYP

Query:  IYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAEDWLRERCVNHPDF
        IYRIP G +L++LDACFLTFH LSTP  G  + +G      S      ++ LP FGLASYKF+ S W+P     E Q   +LL+ AE+WLR   V  PDF
Subjt:  IYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAEDWLRERCVNHPDF

Query:  IFF
          F
Subjt:  IFF

AT5G49220.1 Protein of unknown function (DUF789)5.0e-7844.42Show/hide
Query:  GDDRFYNPTKARRAHQGRQ-----NDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFE-------------PVVSSLSNLERFL
        G++RFYNP   RR  Q  Q      ++ RR   D         K  TV+   R T  G G  +    + +S  E              V+S  SNL+RFL
Subjt:  GDDRFYNPTKARRAHQGRQ-----NDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFE-------------PVVSSLSNLERFL

Query:  QSIAPSVPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF
        +   P VPA+     +    +T +++   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K R P  D+    
Subjt:  QSIAPSVPAQYPSKTTMKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF

Query:  RDSSSDGSSDSEPERALKYMGNQLNHHHLSSELS-RRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKT
         + SS+GSS+S                 L  +LS   + R+SL+DQ   +    SS EAE  N QG+LLFE+LE + P+ REPLA+K+SDL+ R PEL T
Subjt:  RDSSSDGSSDSEPERALKYMGNQLNHHHLSSELS-RRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKT

Query:  LRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL
         RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LST     +S  G   + PS      ++ LP FGLASYK + S+W  N   E Q   SLL
Subjt:  LRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL

Query:  QDAEDWLRERCVNHPDFIFFS
        Q A+ WL+   V+HPD+ FF+
Subjt:  QDAEDWLRERCVNHPDFIFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGTGCAGGCTTGCAGTTTGGTCGTGGTTGTGGTGACGATAGGTTTTACAATCCGACAAAAGCTCGTAGGGCGCATCAGGGCCGTCAAAATGATCAGCTCCGGAG
AGCTCAGAGCGACGTTTCTGCAAGCCAATGCCCTGTTCTTAAACCGACCACGGTGTCCTCGGTGATTAGAGAAACCGAATACGGCGATGGGTGTGAAGATCTCCCTAAAT
CCATTGCGATGTCGGCTTTTGAGCCAGTCGTATCGTCGCTGAGTAATCTCGAGCGGTTCTTGCAGTCAATCGCGCCATCTGTACCTGCACAGTACCCTTCAAAGACAACG
ATGAAGGGTTGGAGAACCTGTGACGCGGAATTGCAACCATACTTCGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGT
ATTAAACGACAGTGACAGTGTTGTCCAGTATTATGTACCATATTTATCTGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCGAAGTCGAGGCAACCAGGTGAGG
ACAGTGATAGTGATTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAATACATGGGGAATCAACTCAATCATCACCATTTGTCTTCT
GAGCTTTCTCGTAGAATGGAGAGGTTATCTTTGCGGGATCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGATGAGGCTGAATCTCTTAATTCTCAAGGCCAGTTACT
ATTTGAGCATCTTGAACGTGATTTGCCTTATAGTCGGGAACCTTTGGCAGATAAGGTATCAGATCTTTCCTTTCGGTTCCCTGAGCTCAAGACATTACGAAGTTGTGATC
TATTGCCTTCCAGTTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAGGGATCTAGATGCCTGCTTCCTCACCTTTCATTATTTGTCT
ACGCCAATGGGAGGGGCACGTAGTGTTCAAGGTCCTGTAGTAACGTATCCTAGTGATATAGATGGTATTCCTAGGATGTCCCTACCAGTTTTTGGTCTAGCTTCATACAA
GTTTAGAGGGTCTTTATGGACTCCAAATGGCGGATATGAGTGGCAATTGGCAAATTCACTTTTGCAGGACGCCGAGGATTGGTTAAGAGAACGTTGCGTAAATCACCCTG
ACTTCATCTTCTTCAGCCGACGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAGGTGCAGGCTTGCAGTTTGGTCGTGGTTGTGGTGACGATAGGTTTTACAATCCGACAAAAGCTCGTAGGGCGCATCAGGGCCGTCAAAATGATCAGCTCCGGAG
AGCTCAGAGCGACGTTTCTGCAAGCCAATGCCCTGTTCTTAAACCGACCACGGTGTCCTCGGTGATTAGAGAAACCGAATACGGCGATGGGTGTGAAGATCTCCCTAAAT
CCATTGCGATGTCGGCTTTTGAGCCAGTCGTATCGTCGCTGAGTAATCTCGAGCGGTTCTTGCAGTCAATCGCGCCATCTGTACCTGCACAGTACCCTTCAAAGACAACG
ATGAAGGGTTGGAGAACCTGTGACGCGGAATTGCAACCATACTTCGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGT
ATTAAACGACAGTGACAGTGTTGTCCAGTATTATGTACCATATTTATCTGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCGAAGTCGAGGCAACCAGGTGAGG
ACAGTGATAGTGATTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAATACATGGGGAATCAACTCAATCATCACCATTTGTCTTCT
GAGCTTTCTCGTAGAATGGAGAGGTTATCTTTGCGGGATCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGATGAGGCTGAATCTCTTAATTCTCAAGGCCAGTTACT
ATTTGAGCATCTTGAACGTGATTTGCCTTATAGTCGGGAACCTTTGGCAGATAAGGTATCAGATCTTTCCTTTCGGTTCCCTGAGCTCAAGACATTACGAAGTTGTGATC
TATTGCCTTCCAGTTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAGGGATCTAGATGCCTGCTTCCTCACCTTTCATTATTTGTCT
ACGCCAATGGGAGGGGCACGTAGTGTTCAAGGTCCTGTAGTAACGTATCCTAGTGATATAGATGGTATTCCTAGGATGTCCCTACCAGTTTTTGGTCTAGCTTCATACAA
GTTTAGAGGGTCTTTATGGACTCCAAATGGCGGATATGAGTGGCAATTGGCAAATTCACTTTTGCAGGACGCCGAGGATTGGTTAAGAGAACGTTGCGTAAATCACCCTG
ACTTCATCTTCTTCAGCCGACGGTGA
Protein sequenceShow/hide protein sequence
MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQCPVLKPTTVSSVIRETEYGDGCEDLPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYPSKTT
MKGWRTCDAELQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGNQLNHHHLSS
ELSRRMERLSLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKVSDLSFRFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLS
TPMGGARSVQGPVVTYPSDIDGIPRMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRERCVNHPDFIFFSRR