; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C005342 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C005342
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionProtein of unknown function (DUF789)
Genome locationchr09:19429103..19433875
RNA-Seq ExpressionMELO3C005342
SyntenyMELO3C005342
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-21591Show/hide
Query:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGD RFYNPTKARR HQGRQ DQLRRAQSDVSA Q  V+KP+ VSSVIRETE G+GCE+LPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRRM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++L+F+FP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLS+P GGARSVQ PVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRER V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa]7.8e-23598.78Show/hide
Query:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGD RFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF
        EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAFQFPKLKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_004147115.1 uncharacterized protein LOC101217142 [Cucumis sativus]3.6e-22495.13Show/hide
Query:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGR  GD RFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVK SAVSSVIRE+ECG+GCEELPKSIA SGFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYL+KTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF
        EPERALKYMGKQLNHH LSSELSRRMDN+SFRDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADK ++LAFQFP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSP GGARSVQCPVVTYPSEIDG+PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_008467187.1 PREDICTED: uncharacterized protein LOC103504597 [Cucumis melo]7.1e-23699.27Show/hide
Query:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF
        EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAFQFPKLKTIRSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida]6.4e-22193.43Show/hide
Query:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQF RGCGD RFYNPTKARR HQGRQ DQLRRAQSDVSAGQS +VKP  VSSVIRETE G+GCEELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF
        EPERALKYMGKQLNHHHLSSEL RRMD ISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAFQFP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFH+LSSP GGARSVQ PVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAE+WLR+RQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

TrEMBL top hitse value%identityAlignment
A0A0A0KT13 Uncharacterized protein1.8e-22495.13Show/hide
Query:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGR  GD RFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVK SAVSSVIRE+ECG+GCEELPKSIA SGFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYL+KTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF
        EPERALKYMGKQLNHH LSSELSRRMDN+SFRDQLIGLQEDCSSDEAESLNS+GQLLFEHLERDLPYSREPLADK ++LAFQFP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSP GGARSVQCPVVTYPSEIDG+PKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A1S3CT52 uncharacterized protein LOC1035045973.4e-23699.27Show/hide
Query:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF
        EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAFQFPKLKTIRSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein3.8e-23598.78Show/hide
Query:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGD RFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF
        EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAFQFPKLKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
        SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC1114451076.3e-21490.51Show/hide
Query:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
        MLGAGLQFGRGCGD RFYNPTKARR HQGRQ DQLRRAQSDVSA QS V+KP+ VSSVIRETE G+GCEELP SIAMS FEPVVSSLSNLERFLQSI PS
Subjt:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTTMKGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF
        EPERALKYMG QLNHHHLSSELSRR + +S RDQLIGLQEDC SDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAF+FP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLS+P GGARSVQ PVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNGG+EWQLANSLL DAEDWLRER V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

A0A6J1JV26 uncharacterized protein LOC1114891473.7e-21490.02Show/hide
Query:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS
        M GAGLQFGRGCGD RFYNPTKARR HQGRQ DQLRR QSDVSA +S V+KP+ VSS+IRETE G+GCEELPKSIAMS FEPVVSSLSNLERFLQSIAPS
Subjt:  MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPS

Query:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDS

Query:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF
        EPERA+KYMG QLNHHHLSSELSRRM+ +S RDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK ++LAF+FP+LKT+RSCDLLPSSWF
Subjt:  EPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWF

Query:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV
        SVAWYPIYRIPTGPTL+DLDACFLTFHYLS+P GGARSVQ PVVTYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNGGYEWQLANSLL DA+DWLRER V
Subjt:  SVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQV

Query:  NHPDFIFFSRR
        NHPDFIFFSRR
Subjt:  NHPDFIFFSRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)7.6e-10356.92Show/hide
Query:  QLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPSVPAQYLSKTTMKGWRTCDMEFQ-PYFVLGDLWE
        QL+RAQ DVS G          SS  ++ E G          A+       +S SN+ERFL S+ PSVPA YLSKT ++     D+E Q PYF+LGD+WE
Subjt:  QLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPSVPAQYLSKTTMKGWRTCDMEFQ-PYFVLGDLWE

Query:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESSKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHHLSSELSRRMDNI
        SF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++  SS ++R+ GE+S+SDFRDSSS+GSS SE ER L Y  +Q         +S RMD +
Subjt:  SFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESSKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHHLSSELSRRMDNI

Query:  SFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYL
        S R +    QED SSD+ E L+SQG+L+FE+LERDLPY REP ADK ++LA +FP+LKT+RSCDLLPSSWFSVAWYPIY+IPTGPTLKDLDACFLT+H L
Subjt:  SFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYL

Query:  SSP-TGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQVNHPDFIFFSRR
         +P  G   +     V  P E   + KM LPVFGLASYK RGS+WT  GG   QLANSL   A++WLR RQVNHPDFIFF RR
Subjt:  SSP-TGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQVNHPDFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789)2.4e-11257.55Show/hide
Query:  MLGAGLQFGRG-CGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAP
        MLGAG Q  RG  GD  FY   K RR +Q  + DQLRRAQSDVS        PS+  S  ++                   EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFGRG-CGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYLSKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGESS--KSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S    SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGESS--KSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPKLKTIRSCDL
        D SSDS+ ER                 +S R+D IS RDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK  +LA QFP+L T+RSCDL
Subjt:  DGSSDSEPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPKLKTIRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDW
        L SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L +  GG  S Q   +T P E +   KMSLPVFGLASYKFRGSLWTP GG E QL NSL   A+ W
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDW

Query:  LRERQVNHPDFIFFSRR
        L    V+HPDF+FF RR
Subjt:  LRERQVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789)1.7e-8656.89Show/hide
Query:  MLGAGLQFGRG-CGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAP
        MLGAG Q  RG  GD  FY   K RR +Q  + DQLRRAQSDVS        PS+  S  ++                   EP   S SNL+RFL+S+ P
Subjt:  MLGAGLQFGRG-CGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAP

Query:  SVPAQYLSKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGESS--KSSAKSRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S    SS KSR+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTMKGWRTCD--MEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGESS--KSSAKSRQPGEDSDSDFRDSSS

Query:  DGSSDSEPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPKLKTIRSCDL
        D SSDS+ ER                 +S R+D IS RDQ    QED SSD+ E L SQG+L+FE+LERDLPY REP ADK  +LA QFP+L T+RSCDL
Subjt:  DGSSDSEPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPKLKTIRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGG
        L SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L +  GG
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGG

AT4G16100.1 Protein of unknown function (DUF789)8.7e-8345.41Show/hide
Query:  GDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSI---AMSGFEPVVSSLSNLERFLQSIAPSVPAQYLSKT
        G+ RFYNP   R++ Q R+K +L   + +    ++  +    +    +E +  E C     S+     S      ++ SNL RFL    P V  Q+L  T
Subjt:  GDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSI---AMSGFEPVVSSLSNLERFLQSIAPSVPAQYLSKT

Query:  TMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY
        + KGWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y + S++    R+ GE+SD D  RD SSDGS+D         
Subjt:  TMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY

Query:  MGKQLNHHHLSSELSRRMDNISFRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWFSVAWYP
                    ELS+ +   S  ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DK +NL+ QFP L+T RSCDL PSSW SVAWYP
Subjt:  MGKQLNHHHLSSELSRRMDNISFRDQ-LIGLQEDCSSDEAE-SLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIRSCDLLPSSWFSVAWYP

Query:  IYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLHDAEDWLRERQVNHPDF
        IYRIP G +L++LDACFLTFH LS+P  G  + +       S+     K+ LP FGLASYKF+ S W+P     E Q   +LL  AE+WLR  +V  PDF
Subjt:  IYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGY-EWQLANSLLHDAEDWLRERQVNHPDF

Query:  IFF
          F
Subjt:  IFF

AT5G49220.1 Protein of unknown function (DUF789)1.9e-7443.68Show/hide
Query:  GDGRFYNPTKARRVHQGRQ-KDQLRRAQSDVSAGQSLVVKPSAVSSVI--RETECGEGCEELPKSIAMSGFE-------------PVVSSLSNLERFLQS
        G+ RFYNP   RR+ Q  Q + Q+R  Q      + L+ K    ++ +  R T  G G  E    + +SG E              V+S  SNL+RFL+ 
Subjt:  GDGRFYNPTKARRVHQGRQ-KDQLRRAQSDVSAGQSLVVKPSAVSSVI--RETECGEGCEELPKSIAMSGFE-------------PVVSSLSNLERFLQS

Query:  IAPSVPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRD
          P VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y +  K   K R P  D+     +
Subjt:  IAPSVPAQYLSKTTMKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRD

Query:  SSSDGSSDSEPERALKYMGKQLNHHHLSSELS-RRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIR
         SS+GSS+S                 L  +LS   ++ IS +DQ   +    SS EAE  N QG+LLFE+LE + P+ REPLA+K ++LA + P+L T R
Subjt:  SSSDGSSDSEPERALKYMGKQLNHHHLSSELS-RRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPKLKTIR

Query:  SCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHD
        SCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LS  T   +S      + PS      K+ LP FGLASYK + S+W  N   E Q   SLL  
Subjt:  SCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHD

Query:  AEDWLRERQVNHPDFIFFS
        A+ WL+  QV+HPD+ FF+
Subjt:  AEDWLRERQVNHPDFIFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGTGCAGGATTGCAGTTTGGACGTGGTTGTGGTGACGGTAGGTTTTATAATCCGACGAAAGCTCGTAGGGTGCATCAGGGCCGTCAAAAGGATCAGCTCCGGAG
AGCTCAGAGCGATGTTTCGGCTGGTCAATCCCTTGTCGTTAAACCGAGCGCGGTGTCCTCGGTTATTAGAGAAACCGAATGCGGGGAAGGGTGTGAAGAGCTCCCTAAAT
CTATTGCGATGTCGGGTTTTGAGCCGGTGGTGTCGTCGCTGAGTAATCTCGAGCGGTTCTTGCAGTCTATCGCACCATCTGTACCTGCACAGTATCTCTCAAAGACAACG
ATGAAGGGTTGGAGAACCTGTGATATGGAATTTCAACCGTACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGT
ATTAAACGACAGTGACAGCGTTGTCCAATATTATGTACCATATTTATCTGGTATACAGATATATGGTGAATCCTCGAAGTCCTCTGCAAAGTCAAGGCAACCTGGTGAGG
ACAGTGATAGTGACTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCACTAAAATACATGGGGAAGCAACTCAATCATCACCATTTATCATCT
GAGCTTTCTCGTAGAATGGATAATATATCTTTTCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGATGAGGCCGAATCTCTTAATTCTCAAGGGCAGCTACT
ATTCGAGCATCTTGAACGTGATTTGCCTTATAGTCGCGAACCTTTGGCAGATAAGGCAAATCTTGCTTTTCAGTTCCCTAAGCTCAAGACGATACGAAGTTGTGATCTAT
TGCCTTCAAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGCTTCCTCACCTTTCACTATTTGTCTTCG
CCAACTGGAGGAGCACGTAGTGTTCAATGTCCTGTAGTAACGTATCCTAGTGAGATAGATGGTATCCCTAAGATGTCTCTACCAGTTTTTGGTCTAGCTTCATACAAGTT
CAGAGGGTCTTTATGGACTCCAAATGGTGGATACGAGTGGCAATTGGCAAATTCACTTTTGCACGATGCTGAGGATTGGTTAAGAGAACGTCAAGTAAATCACCCTGACT
TCATCTTCTTCAGCCGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTTCGACGCCATTGAAACCCTCCTTCTCTGTATTAGATCCAATTCATCTTCTCCCATTTTTTCCTTTCTTCCTTTT
CAACGAGCCCGCACAGGACACCGCTCTGTTTTTCCTCTCTCACATTCTGCCACACCCCCGATTGTTTGTCTAATTCTATTTCGCAATCAATCGGTCTCTCTTCTTCTTCT
TCTTCTTCTTCTTCGGGATTTTCATTTTAATCCCATTACTCTCAACCCACAGATTCTTGCTTTTCCCATTTTTCGTTGACTTTTATTCAGGATTGATTAGCCGATTCCCA
TTTTCCAGACCCATCTTTTTTTTTCTTTTTCTTTTTCTCGTTTGGTTTTGGAACAAAAATTATAATGTTAGGTGCAGGATTGCAGTTTGGACGTGGTTGTGGTGACGGTA
GGTTTTATAATCCGACGAAAGCTCGTAGGGTGCATCAGGGCCGTCAAAAGGATCAGCTCCGGAGAGCTCAGAGCGATGTTTCGGCTGGTCAATCCCTTGTCGTTAAACCG
AGCGCGGTGTCCTCGGTTATTAGAGAAACCGAATGCGGGGAAGGGTGTGAAGAGCTCCCTAAATCTATTGCGATGTCGGGTTTTGAGCCGGTGGTGTCGTCGCTGAGTAA
TCTCGAGCGGTTCTTGCAGTCTATCGCACCATCTGTACCTGCACAGTATCTCTCAAAGACAACGATGAAGGGTTGGAGAACCTGTGATATGGAATTTCAACCGTACTTTG
TCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGCGCAGGTGTGCCTCTTGTATTAAACGACAGTGACAGCGTTGTCCAATATTATGTACCATATTTA
TCTGGTATACAGATATATGGTGAATCCTCGAAGTCCTCTGCAAAGTCAAGGCAACCTGGTGAGGACAGTGATAGTGACTTCAGAGATTCTAGTAGTGATGGTAGTAGTGA
TTCAGAACCTGAACGAGCACTAAAATACATGGGGAAGCAACTCAATCATCACCATTTATCATCTGAGCTTTCTCGTAGAATGGATAATATATCTTTTCGGGACCAGCTAA
TTGGACTTCAAGAAGACTGTTCTAGTGATGAGGCCGAATCTCTTAATTCTCAAGGGCAGCTACTATTCGAGCATCTTGAACGTGATTTGCCTTATAGTCGCGAACCTTTG
GCAGATAAGGCAAATCTTGCTTTTCAGTTCCCTAAGCTCAAGACGATACGAAGTTGTGATCTATTGCCTTCAAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGAT
TCCAACTGGACCAACATTAAAGGATCTGGATGCCTGCTTCCTCACCTTTCACTATTTGTCTTCGCCAACTGGAGGAGCACGTAGTGTTCAATGTCCTGTAGTAACGTATC
CTAGTGAGATAGATGGTATCCCTAAGATGTCTCTACCAGTTTTTGGTCTAGCTTCATACAAGTTCAGAGGGTCTTTATGGACTCCAAATGGTGGATACGAGTGGCAATTG
GCAAATTCACTTTTGCACGATGCTGAGGATTGGTTAAGAGAACGTCAAGTAAATCACCCTGACTTCATCTTCTTCAGCCGAAGGTGAAATCCTTGCAACATCTTCAATGC
CTAAACCTAAAGCTGGAAACAATATCATCGTTCAGTATGATGTCGTAATTTGCTCTTTGTGCCGCTGTTCAACTTTCAGAAAATGGAAAGGAAAAAGGAAAAGAACAAAA
AAAGAAAAAAAGAAAAAGGAAAAAAAAAAAGGAAAAAGAAGGAAAAGAAAAGTTTGTGATGGGGATGGTGGTGAGGTTGGTGGATGGCGACTTGATCAAAAACAAAGGTC
AAAAGGGGTGGCCATAAAGAAAAAGCTGAAATACAGTTGGGTGAAGATGTAGAAGTTTTAGCAAAAACATTGCCCAAAGTTGGTTTACTTGCTGCTGCTTGAGCTGCATA
CAGCAGGTCCTTTCTTTTTTCTCTTTAATTACTGAAAGTTTTTTTTTTCCTATAAGTTATTAACATTAGGTGAAATGAAAGGAAAGAAAAAGAAAAAGCAAAAACCCCAA
AAGAAAGAAAAACAGGCCCTTTTTACTGTCCTGATGATGTTTAGAATGTAAACTAATGGAGAGGGGATTTTTGATAGGAAGGTTATGTTAGGAGTAAGCTATGGATCCCC
CCCCCCCCCCCCCCCCCCCCCCCCCCCAATGTATTTGAATGTTTATGCTTCTTAGTTGCTTTTGATGAAAACTGTATAATTTGTTATGAAGAGCTTTGTTATTAATGAAC
CATCATTCAAGCTCTCTATCATTGTGTTCCCTCTTTGTTTGAATGGGGAGGGCTGAATCTCTATGTAGCTTTCATTGATACATTGTTTAGAGATCTGCATCATATATTGG
CTCTCCTTGCTATTTATTTTGCATTTTGAAACACATTTCATAAAGTTTCTTGAATCATAAGGTTGTACCATAGGAACTATTCTCACATTATAC
Protein sequenceShow/hide protein sequence
MLGAGLQFGRGCGDGRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIRETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPSVPAQYLSKTT
MKGWRTCDMEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHHLSS
ELSRRMDNISFRDQLIGLQEDCSSDEAESLNSQGQLLFEHLERDLPYSREPLADKANLAFQFPKLKTIRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSS
PTGGARSVQCPVVTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQVNHPDFIFFSRR