; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006820 (gene) of Snake gourd v1 genome

Gene IDTan0006820
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF789)
Genome locationLG11:21234135..21238605
RNA-Seq ExpressionTan0006820
SyntenyTan0006820
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-22292.7Show/hide
Query:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQ GRGCGDDRFYNPTKARRA+QGRQNDQLRRAQSDVSA Q PV+K T VSS IRETE GDGCE+LPKSI++SAFEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKS+AKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGNQLNHHHLSSELSRRM+R+SLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPY REPLADK+SDL+F+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGG RS QGPV+TYPS+IDGIPRM LPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLR+R V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_022939104.1 uncharacterized protein LOC111445107 [Cucurbita moschata]4.2e-22092.21Show/hide
Query:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQ GRGCGDDRFYNPTKARRA+QGRQNDQLRRAQSDVSA QSPV+K T VSS IRETE GDGCEELP SI++SAFEPVVSSLSNL+RFLQSI PS
Subjt:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKS+AKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGNQLNHHHLSSELSRR +R+SLRDQLIGLQEDC SDEAESLNSQGQLLFEHLERDLPY REPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGG RS QGPV+TYPS+IDGIPRM LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQDAEDWLR+R V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_022992986.1 uncharacterized protein LOC111489147 [Cucurbita maxima]1.9e-22091.73Show/hide
Query:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS
        M GAGLQ GRGCGDDRFYNPTKARR++QGRQNDQLRR QSDVSA +SPV+K T VSS IRETE GDGCEELPKSI++SAFEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKS+AKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERA+KYMGNQLNHHHLSSELSRRM+R+SLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPY REPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGG RS QGPV+TYPS+IDGIPRM LPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDA+DWLR+R V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo]7.6e-22292.7Show/hide
Query:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQ GRGCGDDRFYNPTKARRA+QGRQNDQLRRAQSDVSA QSPV+K T VSS IRETE GDGCEELPKSI++SAFEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKS+AKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPER LKY GNQLNHHHLSSELSRRM+R+SLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPY REPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGG RS QGPV+TYPS+IDGIPRM LPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLR+R V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida]9.0e-22393.43Show/hide
Query:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQ  RGCGDDRFYNPTKARRA+QGRQNDQLRRAQSDVSAGQSP+VK   VSS IRETE GDGCEELPKSI++SAFEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKS+AKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSEL RRMDRIS RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPY REPLADKISDLAFQFPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLRDLDACFLTFH+LS+PMGG RS QGPV+TYPSEIDGIP+M LPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAE+WLR RQV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

TrEMBL top hitse value%identityAlignment
A0A1S3CT52 uncharacterized protein LOC1035045972.5e-21891.48Show/hide
Query:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQ GRGCGD RFYNPTKARR +QGRQ DQLRRAQSDVSAGQS VVK + VSS IRETE G+GCEELPKSI++S FEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KS+AKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRRMD IS RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPY REPLADKISDLAFQFP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLS+P GG RS Q PV+TYPSEIDGIP+M LPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLR+RQV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein2.2e-21991.97Show/hide
Query:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQ GRGCGDDRFYNPTKARR +QGRQ DQLRRAQSDVSAGQS VVK + VSS IRETE G+GCEELPKSI++S FEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KS+AKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRRMD IS RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPY REPLADKISDLAFQFP+LKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLS+P GG RS Q PV+TYPSEIDGIP+M LPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLR+RQV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC1114451072.0e-22092.21Show/hide
Query:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQ GRGCGDDRFYNPTKARRA+QGRQNDQLRRAQSDVSA QSPV+K T VSS IRETE GDGCEELP SI++SAFEPVVSSLSNL+RFLQSI PS
Subjt:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKS+AKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERALKYMGNQLNHHHLSSELSRR +R+SLRDQLIGLQEDC SDEAESLNSQGQLLFEHLERDLPY REPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGG RS QGPV+TYPS+IDGIPRM LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQDAEDWLR+R V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1HCB0 uncharacterized protein LOC1114622948.5e-21991.97Show/hide
Query:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQ GRGCGDDRFYN TKARR NQGRQNDQLRRAQSDVSAGQSPVVK T VSS  RETE+GD CEELPKSIS+SAFEPVVSSLSNLQRFLQSIAPS
Subjt:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTT+KGWRTCD EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGESLKS+AK RQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        E ERALKYM  QLNHH+LSSE+SRRMDRISLRDQLIGLQEDCSSDEAES N QGQLLFEHLERDLPY REPLADKISDLAFQFPEL+TLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGGGRS QGPVLTYPSEIDGIP+M LPVFGLASYKFRGSLWTPNG +EWQLA SLLQDAEDWLRQRQV
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDF+FF RR
Subjt:  NHPDFIFFSRR

A0A6J1JV26 uncharacterized protein LOC1114891479.1e-22191.73Show/hide
Query:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS
        M GAGLQ GRGCGDDRFYNPTKARR++QGRQNDQLRR QSDVSA +SPV+K T VSS IRETE GDGCEELPKSI++SAFEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKS+AKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF
        EPERA+KYMGNQLNHHHLSSELSRRM+R+SLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPY REPLADK+SDLAF+FPELKTLRSCDLLPSSWF
Subjt:  EPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV
        SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGG RS QGPV+TYPS+IDGIPRM LPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDA+DWLR+R V
Subjt:  SVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)7.1e-10958.49Show/hide
Query:  QLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTMKGWRTCDMEFQ-PYFVLGDLWE
        QL+RAQ DVS G          SS  ++ ENG     L   +S        +S SN++RFL S+ PSVPA YLSKT ++     D+E Q PYF+LGD+WE
Subjt:  QLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTMKGWRTCDMEFQ-PYFVLGDLWE

Query:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSTAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGNQLNHHHLSSELSRRMDRI
        SF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L S+ ++R+ GE+S+SDFRDSSS+GSS SE ER L Y   Q         +S RMD++
Subjt:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSTAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGNQLNHHHLSSELSRRMDRI

Query:  SLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYL
        SLR +    QED SSD+ E L+SQG+L+FE+LERDLPY REP ADK+SDLA +FPELKTLRSCDLLPSSWFSVAWYPIY+IPTGPTL+DLDACFLT+H L
Subjt:  SLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYL

Query:  STPMGGGRSAQGPV-LTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFSRR
         TP  G     G + +  P E   + +M LPVFGLASYK RGS+WT  GG   QLANSL Q A++WLR RQVNHPDFIFF RR
Subjt:  STPMGGGRSAQGPV-LTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789)2.9e-11859.23Show/hide
Query:  MLGAGLQLGRG-CGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAP
        MLGAG QL RG  GDD FY   K RRANQ  + DQLRRAQSDVS   S                          S      EP   S SNL RFL+S+ P
Subjt:  MLGAGLQLGRG-CGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAP

Query:  SVPAQYLSKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSTAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L S+ KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSTAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDL
        D SSDS+ ER                 +S R+D ISLRDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSCDL
Subjt:  DGSSDSEPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDW
        L SSWFSVAWYPIYRIPTGPTL+DLDACFLT+H L T  GG  S Q   LT P E +   +M LPVFGLASYKFRGSLWTP GG E QL NSL Q A+ W
Subjt:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDW

Query:  LRQRQVNHPDFIFFSRR
        L    V+HPDF+FF RR
Subjt:  LRQRQVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789)3.5e-9258.94Show/hide
Query:  MLGAGLQLGRG-CGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAP
        MLGAG QL RG  GDD FY   K RRANQ  + DQLRRAQSDVS   S                          S      EP   S SNL RFL+S+ P
Subjt:  MLGAGLQLGRG-CGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAP

Query:  SVPAQYLSKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSTAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L S+ KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSTAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDL
        D SSDS+ ER                 +S R+D ISLRDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSCDL
Subjt:  DGSSDSEPERALKYMGNQLNHHHLSSELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGG
        L SSWFSVAWYPIYRIPTGPTL+DLDACFLT+H L T  GG
Subjt:  LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGG

AT4G16100.1 Protein of unknown function (DUF789)6.5e-8645.79Show/hide
Query:  GDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCE----ELPKSISVSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSK
        G++RFYNP   R+  Q R+  +L   + +    ++  +   ++    +E +  + C      +P  +S S      ++ SNL RFL    P V  Q+L  
Subjt:  GDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCE----ELPKSISVSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSK

Query:  TTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALK
        T+ KGWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++    R+ GE+SD D  RD SSDGS+D        
Subjt:  TTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALK

Query:  YMGNQLNHHHLSSELSRRMDRISLRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWY
                     ELS+ + R SL ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DKIS+L+ QFP L+T RSCDL PSSW SVAWY
Subjt:  YMGNQLNHHHLSSELSRRMDRISLRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWY

Query:  PIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAEDWLRQRQVNHPD
        PIYRIP G +L++LDACFLTFH LSTP  G  + +G      S+     ++ LP FGLASYKF+ S W+P     E Q   +LL+ AE+WLR+ +V  PD
Subjt:  PIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLQDAEDWLRQRQVNHPD

Query:  FIFF
        F  F
Subjt:  FIFF

AT5G49220.1 Protein of unknown function (DUF789)9.0e-8045.82Show/hide
Query:  GDDRFYNPTKARRANQGRQ-NDQLRRAQSDVSAGQSPVVKQTRVSSAI--RETENGDGCEELPKSISVSAFE-------------PVVSSLSNLQRFLQS
        G++RFYNP   RR  Q  Q   Q+R  Q      +  + K+ R ++ +  R T  G G  E    + VS  E              V+S  SNL RFL+ 
Subjt:  GDDRFYNPTKARRANQGRQ-NDQLRRAQSDVSAGQSPVVKQTRVSSAI--RETENGDGCEELPKSISVSAFE-------------PVVSSLSNLQRFLQS

Query:  IAPSVPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRD
          P VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K R P  D+     +
Subjt:  IAPSVPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRD

Query:  SSSDGSSDSEPERALKYMGNQLNHHHLSSELS-RRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLR
         SS+GSS+S                 L  +LS   ++RISL+DQ   +    SS EAE  N QG+LLFE+LE + P+ REPLA+KISDLA + PEL T R
Subjt:  SSSDGSSDSEPERALKYMGNQLNHHHLSSELS-RRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLR

Query:  SCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQD
        SCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LST     +SA G   + PS      ++ LP FGLASYK + S+W  N   E Q   SLLQ 
Subjt:  SCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQD

Query:  AEDWLRQRQVNHPDFIFFS
        A+ WL++ QV+HPD+ FF+
Subjt:  AEDWLRQRQVNHPDFIFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGTGCAGGCTTGCAGCTTGGTCGTGGTTGTGGTGACGATAGGTTTTACAATCCGACGAAAGCTCGTAGGGCGAATCAGGGCCGTCAAAATGATCAGCTCCGGAG
AGCTCAGAGCGACGTTTCTGCAGGCCAATCCCCTGTCGTTAAACAGACCAGGGTGTCCTCGGCGATTAGAGAAACCGAGAACGGAGATGGGTGTGAAGAGCTCCCCAAAT
CCATTTCGGTGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTGAGTAATCTGCAGCGGTTCTTGCAGTCCATCGCGCCATCGGTACCTGCACAGTACCTCTCAAAGACAACG
ATGAAGGGTTGGAGAACCTGTGATATGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGT
ATTAAACGACAGTGACAGTGTTGTTCAATATTATGTACCATATTTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCACTGCAAAGTCAAGGCAACCAGGTGAGG
ACAGTGATAGTGATTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAATACATGGGGAACCAACTCAATCATCACCATTTATCTTCT
GAGCTTTCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTAATTGGACTTCAAGAGGACTGTTCTAGTGACGAGGCTGAATCTCTTAATTCTCAAGGCCAGCTGCT
ATTTGAGCATCTCGAACGTGATTTGCCTTATTGTCGCGAACCTTTGGCTGATAAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGCGATC
TATTGCCTTCCAGCTGGTTTTCAGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAGGGATCTGGATGCCTGCTTTCTTACCTTTCATTATTTGTCT
ACGCCAATGGGAGGGGGCCGCAGTGCTCAAGGTCCTGTACTAACGTATCCAAGTGAGATAGATGGTATCCCTAGGATGTTCCTACCAGTTTTTGGTCTAGCTTCATACAA
GTTTAGAGGGTCTTTATGGACTCCAAATGGCGGATATGAGTGGCAATTGGCAAATTCACTTTTGCAGGACGCCGAGGATTGGTTAAGACAGCGTCAAGTAAATCACCCTG
ACTTCATCTTCTTCAGCCGACGGTGA
mRNA sequenceShow/hide mRNA sequence
AAAACCAGCGTACTGGTTCTCCTTCATCTCTCTCATTTTCTCTTCTTCAATCGCTTCGTTTTTCTTCCATTGCTCTTTCTCTCTCCTCTCTGTTTCGACGTCATTGAAAC
CCTCGTTCCGTGGGTTAGATCCATTGCGTCTCAATCTGTGGAAGGTTTTCGCATTCAAATTCATCTTCTCCGATTCGATTCGCCCGACTTTTTGGACGACCCACAGACGG
CTCTGTTTTCCTCCCTCTCACATTTTGGCAGCCAATTGTCTGTATAATTCCATCTCGCAATCAATCGTTACTCTTCTGTGCTCAGGGATTTTGATTCTAATCCGATCAGT
CTCTGGCGAGATTGTTAGTTTCTGCATTTCGTTGACTTTTATTGAGGATTCTACGACTGCCGATTGCGATTTTCCGGAGCCGTTTTTCGTTGGTTTTGGAAATATAATGT
TAGGTGCAGGCTTGCAGCTTGGTCGTGGTTGTGGTGACGATAGGTTTTACAATCCGACGAAAGCTCGTAGGGCGAATCAGGGCCGTCAAAATGATCAGCTCCGGAGAGCT
CAGAGCGACGTTTCTGCAGGCCAATCCCCTGTCGTTAAACAGACCAGGGTGTCCTCGGCGATTAGAGAAACCGAGAACGGAGATGGGTGTGAAGAGCTCCCCAAATCCAT
TTCGGTGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTGAGTAATCTGCAGCGGTTCTTGCAGTCCATCGCGCCATCGGTACCTGCACAGTACCTCTCAAAGACAACGATGA
AGGGTTGGAGAACCTGTGATATGGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGTATTA
AACGACAGTGACAGTGTTGTTCAATATTATGTACCATATTTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCACTGCAAAGTCAAGGCAACCAGGTGAGGACAG
TGATAGTGATTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAATACATGGGGAACCAACTCAATCATCACCATTTATCTTCTGAGC
TTTCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTAATTGGACTTCAAGAGGACTGTTCTAGTGACGAGGCTGAATCTCTTAATTCTCAAGGCCAGCTGCTATTT
GAGCATCTCGAACGTGATTTGCCTTATTGTCGCGAACCTTTGGCTGATAAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGCGATCTATT
GCCTTCCAGCTGGTTTTCAGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAGGGATCTGGATGCCTGCTTTCTTACCTTTCATTATTTGTCTACGC
CAATGGGAGGGGGCCGCAGTGCTCAAGGTCCTGTACTAACGTATCCAAGTGAGATAGATGGTATCCCTAGGATGTTCCTACCAGTTTTTGGTCTAGCTTCATACAAGTTT
AGAGGGTCTTTATGGACTCCAAATGGCGGATATGAGTGGCAATTGGCAAATTCACTTTTGCAGGACGCCGAGGATTGGTTAAGACAGCGTCAAGTAAATCACCCTGACTT
CATCTTCTTCAGCCGACGGTGATGTTCTTGAAATGTCTACAAAGCCTAAAGGTGGGAATCAAGATATCATAGTTCAGTACCATGTCGTGTTTTTTTCTCTTTGTGGCACT
GATCTACTTTCAGAAAATGGAAAGGAAAAGGAAAATGGAAAAAAGAAAAGAAAAGAAGGAGAAGAAAAATTTATGATGGAATGGGATGGGGGTGAGGTCGGTGGATGGCG
ACTTAAACAAGGCTCAAACAAAGGTCAAAAGGGGAAGAAGCATTAAAGGAAACGCAAAAAATACAGTTGGATGAAGATGTGATGTAGAAGTTGTAGCAAAAACATTGCCC
AAAGTTTGTCCTACTTGTTGCTTGTGCTGCATAGCAGGTTTATCCTTTCTTTTTTCTCTTTAAGTTACTGAAAGTTTTTTTTATAAGTTATTAACCGTAGATGAAATGCA
AGAAAAGAAAAAAAAGAAAAAGAAAAACAGGCCTTTTTTACTGTCCTGATAATGTTTAGAATGTAAACTATGGAGAGGGGATTTTTGATAGGAAGGTTATGTTAGGAGTA
AGCTATGGATCCCCTCCAAATGTATTTGAATGCTTGCTATGCCCAGTTGCTATCGAAAACTGTATAATTCGTTACGACGAGCTTCGTTATCACGAACCATTATTCAAGCC
CTCATTGTGTTCCTTCTTTGTTTGAATGGGGAGGGGTGAATCTTTGTGCAGGTTTGATATATTGTTTAGGTGTTTCAAACTACGTGTATCAAATGACTGAATGGATCATT
AGCAACAATATCTGGCATCATATATTGGCTGTCCATGGTCTTTGCATTTTGGAATCTTTTTCATTGAAACTACGTATGTTCCGTGTTTCA
Protein sequenceShow/hide protein sequence
MLGAGLQLGRGCGDDRFYNPTKARRANQGRQNDQLRRAQSDVSAGQSPVVKQTRVSSAIRETENGDGCEELPKSISVSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTT
MKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSTAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGNQLNHHHLSS
ELSRRMDRISLRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYCREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLS
TPMGGGRSAQGPVLTYPSEIDGIPRMFLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEDWLRQRQVNHPDFIFFSRR