; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G021720 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G021720
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationCmo_Chr04:15821372..15825745
RNA-Seq ExpressionCmoCh04G021720
SyntenyCmoCh04G021720
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032526.1 hypothetical protein SDJN02_06575, partial [Cucurbita argyrosperma subsp. argyrosperma]3.9e-23492.41Show/hide
Query:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS

Query:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKA------------------------
        EAERALKYMEKQLNHHNLSSE+SRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKA                        
Subjt:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKA------------------------

Query:  -------LYFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGL
               +YFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGL
Subjt:  -------LYFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGL

Query:  ASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR
        ASYKFRGSLWTPNGR+EWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR
Subjt:  ASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR

XP_022961613.1 uncharacterized protein LOC111462294 [Cucurbita moschata]1.8e-23498.56Show/hide
Query:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS

Query:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL
        EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK      ISDLAFQFPELRTLRSCDL
Subjt:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
        LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW

Query:  LRQRQVNHPDFLFFRRR
        LRQRQVNHPDFLFFRRR
Subjt:  LRQRQVNHPDFLFFRRR

XP_022971786.1 uncharacterized protein LOC111470464 [Cucurbita maxima]1.3e-23297.36Show/hide
Query:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQFGRGCGDDRFYNSTKAR+V+QGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDAC+ELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS

Query:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL
        EAERALKYMEKQLNHHNLSSE+SRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK      ISDLAFQFPELRTLRSCDL
Subjt:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
        LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGR+EWQLAKSLLQDAEDW
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW

Query:  LRQRQVNHPDFLFFRRR
        LRQRQVNHPDFLFFRRR
Subjt:  LRQRQVNHPDFLFFRRR

XP_023554659.1 uncharacterized protein LOC111811852 [Cucurbita pepo subsp. pepo]6.7e-23498.08Show/hide
Query:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS

Query:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL
        EAERALKYMEKQLNHHNLSSE+SRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK      ISDLAFQFPELRTLRSCDL
Subjt:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
        LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGR+EWQLAKSLLQDAEDW
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW

Query:  LRQRQVNHPDFLFFRRR
        LRQRQVNHPDFLFFRRR
Subjt:  LRQRQVNHPDFLFFRRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida]1.0e-21389.69Show/hide
Query:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQF RGCGDDRFYN TKARR +QGRQNDQLRRAQSDVSAGQSP+VKP  VSSV RETE GD CEELPKSI+MSAFEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTT+KGWRTCD EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGESLKSSAK RQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS

Query:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL
        E ERALKYM KQLNHH+LSSE+ RRMDRIS RDQLIGLQEDCSSDEAES N QGQLLFEHLERDLPYSREPLADK      ISDLAFQFPEL+TLRSCDL
Subjt:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
        LPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFH+LS+PMGG RSVQGPV+TYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNG +EWQLA SLLQDAE+W
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW

Query:  LRQRQVNHPDFLFFRRR
        LR RQVNHPDF+FF RR
Subjt:  LRQRQVNHPDFLFFRRR

TrEMBL top hitse value%identityAlignment
A0A5D3BPU4 DUF789 domain-containing protein1.7e-21188.97Show/hide
Query:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQFGRGCGDDRFYN TKARRV+QGRQ DQLRRAQSDVSAGQS VVKP+ VSSV RETE G+ CEELPKSI+MS FEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTT+KGWRTCD EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGES KSSAK RQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS

Query:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL
        E ERALKYM KQLNHH+LSSE+SRRMD IS RDQLIGLQEDCSSDEAES N QGQLLFEHLERDLPYSREPLADK      ISDLAFQFP+L+TLRSCDL
Subjt:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
        LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLS+P GG RSVQ PV+TYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNG +EWQLA SLL DAEDW
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW

Query:  LRQRQVNHPDFLFFRRR
        LR+RQVNHPDF+FF RR
Subjt:  LRQRQVNHPDFLFFRRR

A0A6J1FG46 uncharacterized protein LOC1114451071.9e-21088.01Show/hide
Query:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQFGRGCGDDRFYN TKARR +QGRQNDQLRRAQSDVSA QSPV+KPTTVSSV RETE GD CEELP SI+MSAFEPVVSSLSNL+RFLQSI PS
Subjt:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGESLKSSAK RQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS

Query:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL
        E ERALKYM  QLNHH+LSSE+SRR +R+SLRDQLIGLQEDC SDEAES N QGQLLFEHLERDLPYSREPLADK      +SDLAF+FPEL+TLRSCDL
Subjt:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
        LPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGG RSVQGPV+TYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNG  EWQLA SLLQDAEDW
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW

Query:  LRQRQVNHPDFLFFRRR
        LR+R VNHPDF+FF RR
Subjt:  LRQRQVNHPDFLFFRRR

A0A6J1HCB0 uncharacterized protein LOC1114622948.6e-23598.56Show/hide
Query:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS

Query:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL
        EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK      ISDLAFQFPELRTLRSCDL
Subjt:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
        LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW

Query:  LRQRQVNHPDFLFFRRR
        LRQRQVNHPDFLFFRRR
Subjt:  LRQRQVNHPDFLFFRRR

A0A6J1I465 uncharacterized protein LOC1114704646.2e-23397.36Show/hide
Query:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
        MLGAGLQFGRGCGDDRFYNSTKAR+V+QGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDAC+ELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
        VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS

Query:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL
        EAERALKYMEKQLNHHNLSSE+SRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK      ISDLAFQFPELRTLRSCDL
Subjt:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
        LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGR+EWQLAKSLLQDAEDW
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW

Query:  LRQRQVNHPDFLFFRRR
        LRQRQVNHPDFLFFRRR
Subjt:  LRQRQVNHPDFLFFRRR

A0A6J1JV26 uncharacterized protein LOC1114891473.0e-21187.53Show/hide
Query:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS
        M GAGLQFGRGCGDDRFYN TKARR +QGRQNDQLRR QSDVSA +SPV+KPTTVSS+ RETE GD CEELPKSI+MSAFEPVVSSLSNL+RFLQSIAPS
Subjt:  MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPS

Query:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS
        VPAQY SKTT+KGWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGESLKSSAK RQPGEDSDSDFRDSSSDGSSDS
Subjt:  VPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDS

Query:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL
        E ERA+KYM  QLNHH+LSSE+SRRM+R+SLRDQLIGLQEDCSSDEAES N QGQLLFEHLERDLPYSREPLADK      +SDLAF+FPEL+TLRSCDL
Subjt:  EAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL

Query:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW
        LPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGG RSVQGPV+TYPS+IDGIP+MSLPVFGLASYKFRGSLWTPNG +EWQLA SLLQDA+DW
Subjt:  LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDW

Query:  LRQRQVNHPDFLFFRRR
        LR+R VNHPDF+FF RR
Subjt:  LRQRQVNHPDFLFFRRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)4.8e-10557.58Show/hide
Query:  QLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGEFQ-PYFVLGDLWE
        QL+RAQ DVS G          SS T++ E+G A   L   +S        +S SN++RFL S+ PSVPA YLSKT V+     D E Q PYF+LGD+WE
Subjt:  QLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGEFQ-PYFVLGDLWE

Query:  SFKEWSAYGAGVPLVLNDS-DSVIQYYVPYLSGIQIYG--ESLKSSAKLRQPGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRI
        SF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS + R+ GE+S+SDFRDSSS+GSS SE+ER L Y ++Q         IS RMD++
Subjt:  SFKEWSAYGAGVPLVLNDS-DSVIQYYVPYLSGIQIYG--ESLKSSAKLRQPGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRI

Query:  SLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACF
        SLR +    QED SSD+ E  + QG+L+FE+LERDLPY REP ADK      +SDLA +FPEL+TLRSCDLLPSSWFSVAWYPIY+IPTGPTLKDLDACF
Subjt:  SLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACF

Query:  LTFHYLSTPMGGGRSVQGPV-LTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR
        LT+H L TP  G     G + +  P E   + KM LPVFGLASYK RGS+WT  G    QLA SL Q A++WLR RQVNHPDF+FF RR
Subjt:  LTFHYLSTPMGGGRSVQGPV-LTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR

AT2G01260.1 Protein of unknown function (DUF789)3.9e-11558.63Show/hide
Query:  MLGAGLQFGRG-CGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAP
        MLGAG Q  RG  GDD FY S K RR NQ  + DQLRRAQSDVS   S    P                            EP   S SNL RFL+S+ P
Subjt:  MLGAGLQFGRG-CGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAP

Query:  SVPAQYLSKTTVKGWRTCD--GEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVIQYYVPYLSGIQIYGES--LKSSAKLRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D VIQYYVP LS IQIY  S  L SS K R+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTVKGWRTCD--GEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVIQYYVPYLSGIQIYGES--LKSSAKLRQPGEDSDSDFRDSSS

Query:  DGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRT
        D SSDS++ER                 +S R+D ISLRDQ    QED SSD+ E    QG+L+FE+LERDLPY REP ADK L      DLA QFPEL T
Subjt:  DGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRT

Query:  LRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLL
        LRSCDLL SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L T  GG  S Q   LT P E +   KMSLPVFGLASYKFRGSLWTP G  E QL  SL 
Subjt:  LRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLL

Query:  QDAEDWLRQRQVNHPDFLFFRRR
        Q A+ WL    V+HPDFLFF RR
Subjt:  QDAEDWLRQRQVNHPDFLFFRRR

AT2G01260.2 Protein of unknown function (DUF789)4.4e-9057.93Show/hide
Query:  MLGAGLQFGRG-CGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAP
        MLGAG Q  RG  GDD FY S K RR NQ  + DQLRRAQSDVS   S    P                            EP   S SNL RFL+S+ P
Subjt:  MLGAGLQFGRG-CGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAP

Query:  SVPAQYLSKTTVKGWRTCD--GEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVIQYYVPYLSGIQIYGES--LKSSAKLRQPGEDSDSDFRDSSS
        SVPAQ+LSKT ++  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D VIQYYVP LS IQIY  S  L SS K R+PG+ SDSDFRDSSS
Subjt:  SVPAQYLSKTTVKGWRTCD--GEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVIQYYVPYLSGIQIYGES--LKSSAKLRQPGEDSDSDFRDSSS

Query:  DGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRT
        D SSDS++ER                 +S R+D ISLRDQ    QED SSD+ E    QG+L+FE+LERDLPY REP ADK L      DLA QFPEL T
Subjt:  DGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRT

Query:  LRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGG
        LRSCDLL SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L T  GG
Subjt:  LRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGG

AT4G16100.1 Protein of unknown function (DUF789)8.9e-8344.63Show/hide
Query:  GDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACE----ELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSK
        G++RFYN    R++ Q R+  +L   + +    ++  +    +    +E +  + C      +P  +S +      +S SNL RFL    P V  Q+L  
Subjt:  GDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACE----ELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSK

Query:  TTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDF-RDSSSDGSSDSEAERALK
        T+ KGWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSV+QYYVPYLSGIQ+Y +  ++    R+ GE+SD D  RD SSDGS+D        
Subjt:  TTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDF-RDSSSDGSSDSEAERALK

Query:  YMEKQLNHHNLSSEISRRMDRISLRDQ-LIGLQEDCSSDEAE-SPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDLLPSSW
                     E+S+ + R SL ++  IG     SSDE+E S N  G+L+FE+LE  +P+ REPL DK      IS+L+ QFP LRT RSCDL PSSW
Subjt:  YMEKQLNHHNLSSEISRRMDRISLRDQ-LIGLQEDCSSDEAE-SPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRF-EWQLAKSLLQDAEDWLRQR
         SVAWYPIYRIP G +L++LDACFLTFH LSTP  G  + +G      S+     K+ LP FGLASYKF+ S W+P     E Q   +LL+ AE+WLR+ 
Subjt:  FSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRF-EWQLAKSLLQDAEDWLRQR

Query:  QVNHPDFLFF
        +V  PDF  F
Subjt:  QVNHPDFLFF

AT5G49220.1 Protein of unknown function (DUF789)6.2e-7645.07Show/hide
Query:  GDDRFYNSTKARRVNQGRQ-NDQLRRAQ---------SDVSAGQSPVVKPTTV-------SSVTRETESG-DACEELPKSISMSAFEPVVSSLSNLQRFL
        G++RFYN    RR+ Q  Q   Q+R  Q          D    ++  V P T         S +R   SG + C     S S S    V+S  SNL RFL
Subjt:  GDDRFYNSTKARRVNQGRQ-NDQLRRAQ---------SDVSAGQSPVVKPTTV-------SSVTRETESG-DACEELPKSISMSAFEPVVSSLSNLQRFL

Query:  QSIAPSVPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDF
        +   P VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS +QYYVPYLSGIQ+Y + LK   K R P  D+    
Subjt:  QSIAPSVPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDF

Query:  RDSSSDGSSDSEAERALKYMEKQLNHHNLSSEIS-RRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQ
         + SS+GSS               N   L  ++S   ++RISL+DQ   +    SS EAE  NPQG+LLFE+LE + P+ REPLA+K      ISDLA +
Subjt:  RDSSSDGSSDSEAERALKYMEKQLNHHNLSSEIS-RRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQ

Query:  FPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQ
         PEL T RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LST     +S  G   + PS      K+ LP FGLASYK + S+W  N   E Q
Subjt:  FPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQ

Query:  LAKSLLQDAEDWLRQRQVNHPDFLFF
           SLLQ A+ WL++ QV+HPD+ FF
Subjt:  LAKSLLQDAEDWLRQRQVNHPDFLFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGTGCTGGATTGCAGTTTGGTCGTGGTTGTGGTGATGATAGGTTTTACAATTCGACGAAAGCTCGTAGGGTGAATCAGGGACGTCAAAATGATCAGCTCCGTAG
AGCTCAGAGCGACGTTTCTGCAGGTCAATCTCCTGTGGTTAAACCGACCACGGTGTCCTCGGTGACTAGAGAAACCGAGAGCGGAGATGCGTGTGAAGAGCTCCCCAAAT
CTATTTCGATGTCGGCCTTTGAGCCAGTGGTATCGTCGCTGAGTAATCTGCAGCGGTTTTTGCAGTCTATCGCGCCATCTGTACCTGCACAGTACCTCTCAAAGACAACG
GTAAAGGGTTGGAGAACCTGTGACGGAGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGT
ACTAAACGACAGTGACAGTGTTATCCAGTATTATGTACCATATTTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCAAAGTTAAGGCAACCAGGTGAGG
ACAGTGATAGTGATTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAAGCTGAACGAGCTCTAAAATACATGGAGAAACAACTCAATCATCACAACTTATCTTCT
GAGATTTCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTCCTAATCCTCAAGGCCAGCTACT
ATTTGAGCATCTTGAACGGGATTTGCCTTATAGTCGTGAACCTTTAGCAGATAAGGCATTATATTTTCTCCAGATATCAGATCTTGCCTTCCAGTTCCCTGAGCTCAGGA
CATTACGAAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGCTTCCTC
ACCTTTCATTATTTGTCTACGCCAATGGGAGGGGGACGCAGTGTACAAGGTCCTGTACTAACGTATCCCAGTGAGATAGATGGTATCCCTAAGATGTCACTGCCAGTTTT
TGGTCTAGCTTCATACAAGTTTAGAGGGTCTTTATGGACTCCAAATGGCAGATTCGAGTGGCAATTGGCAAAGTCACTTTTGCAGGATGCTGAGGATTGGTTGAGACAAC
GCCAAGTAAACCACCCCGACTTCCTCTTCTTCAGGCGACGGTGA
mRNA sequenceShow/hide mRNA sequence
CCTTGGAAAAACCAGCGTACTGCTTCGCCTCCATCTCTTTCCTTTTTTCTTCCATTGCTCTCTTTTCTTCTCTCTGTTTCGACGACGCCATTGAAACCCTCGTTCTGTGT
TTTAGATCCATTGCGTCTCAATCTGTTTAAGGTTTTTGCATTCGAACTTATCTTCTTCGATTCGATTTGCCTGGATTTTTTGGCGAACCACAGGAGGGCTCTGTTTTCGT
CTCTCTCACATTTTGGAACCCGATTGCCTGTTTGATTCTATCTCGCAATCAATCGTTACTCTTCTGTGCTTCGGGATTTTGGTTCCAATCTGATTGGTTTTTAGCGAGAT
TGTTACTTTGTGCATTTCCTTGACTTTTATTGTGGATTTTAGGACTGCGGATTGCGATTTTCCGGAGCTGTTTTTCGTTTGGTTTTCGAAATATAATGTTAGGTGCTGGA
TTGCAGTTTGGTCGTGGTTGTGGTGATGATAGGTTTTACAATTCGACGAAAGCTCGTAGGGTGAATCAGGGACGTCAAAATGATCAGCTCCGTAGAGCTCAGAGCGACGT
TTCTGCAGGTCAATCTCCTGTGGTTAAACCGACCACGGTGTCCTCGGTGACTAGAGAAACCGAGAGCGGAGATGCGTGTGAAGAGCTCCCCAAATCTATTTCGATGTCGG
CCTTTGAGCCAGTGGTATCGTCGCTGAGTAATCTGCAGCGGTTTTTGCAGTCTATCGCGCCATCTGTACCTGCACAGTACCTCTCAAAGACAACGGTAAAGGGTTGGAGA
ACCTGTGACGGAGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGTACTAAACGACAGTGA
CAGTGTTATCCAGTATTATGTACCATATTTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCAAAGTTAAGGCAACCAGGTGAGGACAGTGATAGTGATT
TCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAAGCTGAACGAGCTCTAAAATACATGGAGAAACAACTCAATCATCACAACTTATCTTCTGAGATTTCTCGTAGA
ATGGATAGGATATCTTTGCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTCCTAATCCTCAAGGCCAGCTACTATTTGAGCATCTTGA
ACGGGATTTGCCTTATAGTCGTGAACCTTTAGCAGATAAGGCATTATATTTTCTCCAGATATCAGATCTTGCCTTCCAGTTCCCTGAGCTCAGGACATTACGAAGTTGTG
ATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGCTTCCTCACCTTTCATTATTTG
TCTACGCCAATGGGAGGGGGACGCAGTGTACAAGGTCCTGTACTAACGTATCCCAGTGAGATAGATGGTATCCCTAAGATGTCACTGCCAGTTTTTGGTCTAGCTTCATA
CAAGTTTAGAGGGTCTTTATGGACTCCAAATGGCAGATTCGAGTGGCAATTGGCAAAGTCACTTTTGCAGGATGCTGAGGATTGGTTGAGACAACGCCAAGTAAACCACC
CCGACTTCCTCTTCTTCAGGCGACGGTGAAGTTCTGGTAACTTCTACGAGATCTAAAGGTGGAAATCAAGATATCATAGTATTCAGTACCATGTCGTGATTTGTGGGCAC
TGTTCTAATTTTGGAAAATGGGAAGGAAAAAGGAAAAGGAAAAAAAAAAAAGAAAAAGGAAAAAGGAAAAAGGAAAAAGGAAAA
Protein sequenceShow/hide protein sequence
MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTT
VKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSS
EISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFL
TFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR