; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C01G014150 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C01G014150
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionSET domain-containing protein
Genome locationCla97Chr01:27914349..27923431
RNA-Seq ExpressionCla97C01G014150
SyntenyCla97C01G014150
Gene Ontology termsGO:0018026 - peptidyl-lysine monomethylation (biological process)
GO:0005515 - protein binding (molecular function)
GO:0016279 - protein-lysine N-methyltransferase activity (molecular function)
InterPro domainsIPR001214 - SET domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK07464.1 SET domain-containing protein [Cucumis melo var. makuwa]1.6e-27775.62Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE

Query:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS
                                                    GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEE EVDDRFLMILFLMVE LRENSS
Subjt:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
        WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS REVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDS-FKETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL
        EV SDS  +ET EVSTS V +    V L  T++  GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFSMYLL
Subjt:  EVRSDS-FKETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL

Query:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP
        S       A ILDISGKA SRSSG+EEVSISYGNKGNEELLYLYGFV+E+NPDDYLMVHYPLEAIQNASFSDSKLQLLE+QKAEMRCLLPR+LLDHGFHP
Subjt:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP

Query:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ
        P TSNIKENV CSNR+CNYSWSGQRK+PSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQ
Subjt:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ

Query:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        KK+MDLEEGTGTLDSDT+LLKE QV ES N NG  QDSA R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

XP_008462641.1 PREDICTED: uncharacterized protein LOC103500952 isoform X2 [Cucumis melo]7.8e-28075.76Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE

Query:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS
                                                    GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEE EVDDRFLMILFLMVE LRENSS
Subjt:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
        WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS REVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDS-FKETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL
        EV SDS  +ET EVSTS V +    V L  T++  GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFSMYLL
Subjt:  EVRSDS-FKETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL

Query:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP
        S       A ILDISGKA SRSSG+EEVSISYGNKGNEELLYLYGFV+E+NPDDYLMVHYPLEAIQNASFSDSKLQLLE+QKAEMRCLLPR+LLDHGFHP
Subjt:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP

Query:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ
        P TSNIKENV CSNR+CNYSWSGQRK+PSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQ
Subjt:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ

Query:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        KK+MDLEEGTGTLDSDT+LLKE QV ES N NG CQDSA R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

XP_016902907.1 PREDICTED: uncharacterized protein LOC103500952 isoform X1 [Cucumis melo]1.5e-27875.47Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE

Query:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS
                                                    GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEE EVDDRFLMILFLMVE LRENSS
Subjt:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
        WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS REVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDS-FKETIEVSTSTVSE-KLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL
        EV SDS  +ET EVSTS V + +L          S CED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFSMYLL
Subjt:  EVRSDS-FKETIEVSTSTVSE-KLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL

Query:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP
        S       A ILDISGKA SRSSG+EEVSISYGNKGNEELLYLYGFV+E+NPDDYLMVHYPLEAIQNASFSDSKLQLLE+QKAEMRCLLPR+LLDHGFHP
Subjt:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP

Query:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ
        P TSNIKENV CSNR+CNYSWSGQRK+PSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQ
Subjt:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ

Query:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        KK+MDLEEGTGTLDSDT+LLKE QV ES N NG CQDSA R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

XP_038881849.1 protein-lysine N-methyltransferase EFM1 isoform X1 [Benincasa hispida]1.1e-28176.2Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE
        MANSDEAKLELFLQWLQVNGADLRGC IK+SDLSKGCGLFSANDA D                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE

Query:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS
                                                    GVLLVVPLDLAITPMRVLQD LYGPECRAMYEE EVDDRFLMILFLMVE LRENSS
Subjt:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
        WKPYLDVLPT FGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVN+LLT+EGFSVREVSFEDFLWANSIFWARALNIPMPH YVFPKIQE
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDSF-KETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL
        EV SDS  +ET EVSTSTVSE    V L  T++  GCEDHRMID+TA GKT GS KQETVWVEGLVPGVDFCNHDLKA ATWEVD +GSTTGVPFSMYLL
Subjt:  EVRSDSF-KETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL

Query:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP
        S                ANSRSSG+EEVSISYGNKGNEELLYLYGF +ENNPDDYLMVHYPLEAIQNA FSD KLQLLEVQKAEMRCLLPRKLLDHGFHP
Subjt:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP

Query:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ
        PNTSNIKENVVCSNR+CNYSWSGQRK+PSYLDKLIFPEKF+TALRTISMQEDELMQVSSLLAEIVGPEED+QPTDIDVQAAVWE CGDSGALQLLVDLLQ
Subjt:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ

Query:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        KK+MDLEEGTGTLDSDT+LLKEVQVIESTN NGSCQ+SASRESDDRKPQ L+SRNQWSSIVYRHGQKQLTSLFLKEAE ALQLSLSE+N
Subjt:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

XP_038881850.1 protein-lysine N-methyltransferase EFM1 isoform X2 [Benincasa hispida]7.8e-28076.05Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE
        MANSDEAKLELFLQWLQVNGADLRGC IK+SDLSKGCGLFSANDA D                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE

Query:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS
                                                    GVLLVVPLDLAITPMRVLQD LYGPECRAMYEE EVDDRFLMILFLMVE LRENSS
Subjt:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
        WKPYLDVLPT FGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVN+LLT+EGFSVREVSFEDFLWANSIFWARALNIPMPH YVFPKIQE
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDSF-KETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL
        EV SDS  +ET EVSTSTVSE    V L  T++  GCEDHRMID+TA GKT GS KQETVWVEGLVPGVDFCNHDLKA ATWEVD +GSTTGVPFSMYLL
Subjt:  EVRSDSF-KETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL

Query:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP
        S                ANSRSSG+EEVSISYGNKGNEELLYLYGF +ENNPDDYLMVHYPLEAIQNA FSD KLQLLEVQKAEMRCLLPRKLLDHGFHP
Subjt:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP

Query:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ
        PNTSNIKENVVCSNR+CNYSWSGQRK+PSYLDKLIFPEKF+TALRTISMQEDELMQVSSLLAEIVGPEED+QPTDIDVQAAVWE CGDSGALQLLVDLLQ
Subjt:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ

Query:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        KK+MDLEEGTGTLDSDT+LLKEVQVIESTN NGSCQ+SA RESDDRKPQ L+SRNQWSSIVYRHGQKQLTSLFLKEAE ALQLSLSE+N
Subjt:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

TrEMBL top hitse value%identityAlignment
A0A1S3CHD9 uncharacterized protein LOC103500952 isoform X23.8e-28075.76Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE

Query:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS
                                                    GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEE EVDDRFLMILFLMVE LRENSS
Subjt:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
        WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS REVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDS-FKETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL
        EV SDS  +ET EVSTS V +    V L  T++  GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFSMYLL
Subjt:  EVRSDS-FKETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL

Query:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP
        S       A ILDISGKA SRSSG+EEVSISYGNKGNEELLYLYGFV+E+NPDDYLMVHYPLEAIQNASFSDSKLQLLE+QKAEMRCLLPR+LLDHGFHP
Subjt:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP

Query:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ
        P TSNIKENV CSNR+CNYSWSGQRK+PSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQ
Subjt:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ

Query:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        KK+MDLEEGTGTLDSDT+LLKE QV ES N NG CQDSA R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

A0A1S4E3V3 uncharacterized protein LOC103500952 isoform X31.2e-27374.31Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE

Query:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS
                                                    GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEE EVDDRFLMILFLMVE LRENSS
Subjt:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
        WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS REVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDS-FKETIEVSTSTVSE-KLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL
        EV SDS  +ET EVSTS V + +L          S CED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFSMYLL
Subjt:  EVRSDS-FKETIEVSTSTVSE-KLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL

Query:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP
        S                A SRSSG+EEVSISYGNKGNEELLYLYGFV+E+NPDDYLMVHYPLEAIQNASFSDSKLQLLE+QKAEMRCLLPR+LLDHGFHP
Subjt:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP

Query:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ
        P TSNIKENV CSNR+CNYSWSGQRK+PSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQ
Subjt:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ

Query:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        KK+MDLEEGTGTLDSDT+LLKE QV ES N NG CQDSA R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

A0A1S4E3V9 uncharacterized protein LOC103500952 isoform X17.2e-27975.47Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE

Query:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS
                                                    GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEE EVDDRFLMILFLMVE LRENSS
Subjt:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
        WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS REVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDS-FKETIEVSTSTVSE-KLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL
        EV SDS  +ET EVSTS V + +L          S CED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFSMYLL
Subjt:  EVRSDS-FKETIEVSTSTVSE-KLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL

Query:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP
        S       A ILDISGKA SRSSG+EEVSISYGNKGNEELLYLYGFV+E+NPDDYLMVHYPLEAIQNASFSDSKLQLLE+QKAEMRCLLPR+LLDHGFHP
Subjt:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP

Query:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ
        P TSNIKENV CSNR+CNYSWSGQRK+PSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQ
Subjt:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ

Query:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        KK+MDLEEGTGTLDSDT+LLKE QV ES N NG CQDSA R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

A0A5A7SJ36 SET domain-containing protein2.4e-27474.42Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE

Query:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS
                                                    GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEE EVDDRFLMILFLMVE LRENSS
Subjt:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
        WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS R +  + F  ANSIFWARALNIPMPHDYVFPKIQE
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDS-FKETIEVSTSTVSEKLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLLS
        EV SDS  +ET EVSTS V +    +  T     GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFSMYLLS
Subjt:  EVRSDS-FKETIEVSTSTVSEKLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLLS

Query:  VLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHPP
               A ILDISGKA SRSSG+EEVSISYGNKGNEELLYLYGFV+E+NPDDYLMVHYPLEAIQNASFSDSKLQLLE+QKAEMRCLLPR+LLDHGFHPP
Subjt:  VLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHPP

Query:  NTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQK
         TSNIKENV CSNR+CNYSWSGQRK+PSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQK
Subjt:  NTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQK

Query:  KLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        K+MDLEEGTGTLDSDT+LLKE QV ES N NG CQDSA R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  KLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

A0A5D3C897 SET domain-containing protein7.9e-27875.62Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNE

Query:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS
                                                    GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEE EVDDRFLMILFLMVE LRENSS
Subjt:  LEFSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
        WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS REVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDS-FKETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL
        EV SDS  +ET EVSTS V +    V L  T++  GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFSMYLL
Subjt:  EVRSDS-FKETIEVSTSTVSEKLFYVNLT-TRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLL

Query:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP
        S       A ILDISGKA SRSSG+EEVSISYGNKGNEELLYLYGFV+E+NPDDYLMVHYPLEAIQNASFSDSKLQLLE+QKAEMRCLLPR+LLDHGFHP
Subjt:  SVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHP

Query:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ
        P TSNIKENV CSNR+CNYSWSGQRK+PSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQ
Subjt:  PNTSNIKENVVCSNRSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQ

Query:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        KK+MDLEEGTGTLDSDT+LLKE QV ES N NG  QDSA R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  KKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

SwissProt top hitse value%identityAlignment
P94026 Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic4.4e-0731.31Show/hide
Query:  LFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALN
        LFL+ E  R++S WK Y+DVLP    + +++++ EL E++GT L   T   K+ +Q+ ++   ++++ R   +  F    ++ +DF WA  I  +RA +
Subjt:  LFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALN

Q08961 Ribosomal lysine N-methyltransferase 12.8e-0623.89Show/hide
Query:  MVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWAN------SIFWARA
        +V+ +R N  +KPYLD LP+R  +PL +  +EL  L  T +        NS+   +E   K+    + + + F +  V+ +   + N         + + 
Subjt:  MVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWAN------SIFWARA

Query:  LNIPMPHDYVFPKIQEEVRSDSFKETIEVSTSTVSEKLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDG
        L I    +   P I     +  +   I +S      + F   +  R+   C D+ ++                     L+P VD  NHD ++   W  + 
Subjt:  LNIPMPHDYVFPKIQEEVRSDSFKETIEVSTSTVSEKLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDG

Query:  VGSTTGVPFSMYLLSVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDD--YLMVHYPLEAIQNASFSDSKLQL
                               F  +  G A    S   E+S +YG KGNEELL  YGFV+E+N  D   L V  PL+ +     ++  L+L
Subjt:  VGSTTGVPFSMYLLSVLVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDD--YLMVHYPLEAIQNASFSDSKLQL

Q43088 Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic2.8e-0630.99Show/hide
Query:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSL
        V+L VP  L I P  V    + G  C       E+     +ILFL+ E  RE+S WK Y  +LP    + +++++ EL EL+G+ L + T     S++  
Subjt:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSL

Query:  YENKVKKLVNR-LLTIEGFSVREVSFEDFLWANSIFWARALN
         +N+  KL    +L  +      V+ +DF WA  I  +RA +
Subjt:  YENKVKKLVNR-LLTIEGFSVREVSFEDFLWANSIFWARALN

Q9XI84 [Fructose-bisphosphate aldolase]-lysine N-methyltransferase, chloroplastic6.8e-0833.57Show/hide
Query:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSL
        V+L +P  L I P  V    + GP C  +     V       LFL+ E   E SSW+ YLD+LP    + +++++ EL ELKGT L   T      ++  
Subjt:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSL

Query:  YENKVKKLVNRLL--TIEGFSVREVSFEDFLWANSIFWARALN
         EN+  KL   +L    + FS R ++ +DF+WA  I  +RA +
Subjt:  YENKVKKLVNRLL--TIEGFSVREVSFEDFLWANSIFWARALN

Arabidopsis top hitse value%identityAlignment
AT1G01920.1 SET domain-containing protein1.4e-17348.73Show/hide
Query:  NSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNELE
        + +EAKLE FL WLQVNG +LRGC IKYSD  KG G+F++                    + + DE                                  
Subjt:  NSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNELE

Query:  FSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWK
                                                   VLLVVPLDLAITPMRVLQDPL GPEC+ M+E+ +VDDRFLMILFL +E LR NSSWK
Subjt:  FSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWK

Query:  PYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQEEV
        PYLD+LPTRFGNPLWF+D+++LELKGT LY ATELQK  L SLY +KV+ LV +LL ++G S  +VSFE FLWANS+FW+RALNIP+PH +VFP+ Q+  
Subjt:  PYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQEEV

Query:  RSDSFKETIEVSTSTVSEKLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLLSVLV
              +T E ++++ S +   VN      S  E  + + S     + GS   +T+WVEGLVPG+DFCNHDLK  ATWEVDG+GS + VPFSMYLLSV  
Subjt:  RSDSFKETIEVSTSTVSEKLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLLSVLV

Query:  DFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLM----------------------VHYPLEAIQNASFSDSKLQLLEVQK
                        R    +E+SISYGNKGNEELLYLYGFV++NNPDDYLM                      VHYP+EAI +  FSDSK QLLE Q 
Subjt:  DFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLM----------------------VHYPLEAIQNASFSDSKLQLLEVQK

Query:  AEMRCLLPRKLLDHGFHPPNTSNIKENVVCSN-RSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAA
        A++RCLLP+ +L+HGF P  TS I+E+      RSCN+SWSG+RK+P+Y++KL+FPE F+T LRTI+MQE+E+ +VS++L E+V   +  QP++ +V+ A
Subjt:  AEMRCLLPRKLLDHGFHPPNTSNIKENVVCSN-RSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAA

Query:  VWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQAL
        VWEACGDSGALQLLVDLL  K+M LEE +GT + D  LL+E  V+ES           SR+ D R+    +SRN+WSS+VYR GQKQLT L LKEAE AL
Subjt:  VWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQAL

Query:  QLSLSEEN
         L+LS ++
Subjt:  QLSLSEEN

AT1G01920.2 SET domain-containing protein1.7e-17650Show/hide
Query:  NSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNELE
        + +EAKLE FL WLQVNG +LRGC IKYSD  KG G+F++                    + + DE                                  
Subjt:  NSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNELE

Query:  FSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWK
                                                   VLLVVPLDLAITPMRVLQDPL GPEC+ M+E+ +VDDRFLMILFL +E LR NSSWK
Subjt:  FSRVDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWK

Query:  PYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQEEV
        PYLD+LPTRFGNPLWF+D+++LELKGT LY ATELQK  L SLY +KV+ LV +LL ++G S  +VSFE FLWANS+FW+RALNIP+PH +VFP+ Q+  
Subjt:  PYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQEEV

Query:  RSDSFKETIEVSTSTVSEKLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLLSVLV
              +T E ++++ S +       T  ++  E+  +    A    SG    +T+WVEGLVPG+DFCNHDLK  ATWEVDG+GS + VPFSMYLLSV  
Subjt:  RSDSFKETIEVSTSTVSEKLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLLSVLV

Query:  DFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTS
                        R    +E+SISYGNKGNEELLYLYGFV++NNPDDYLMVHYP+EAI +  FSDSK QLLE Q A++RCLLP+ +L+HGF P  TS
Subjt:  DFLTAFILDISGKANSRSSGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTS

Query:  NIKENVVCSN-RSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKL
         I+E+      RSCN+SWSG+RK+P+Y++KL+FPE F+T LRTI+MQE+E+ +VS++L E+V   +  QP++ +V+ AVWEACGDSGALQLLVDLL  K+
Subjt:  NIKENVVCSN-RSCNYSWSGQRKIPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKL

Query:  MDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        M LEE +GT + D  LL+E  V+ES           SR+ D R+    +SRN+WSS+VYR GQKQLT L LKEAE AL L+LS ++
Subjt:  MDLEEGTGTLDSDTELLKEVQVIESTNVNGSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

AT1G14030.1 Rubisco methyltransferase family protein4.8e-0933.57Show/hide
Query:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSL
        V+L +P  L I P  V    + GP C  +     V       LFL+ E   E SSW+ YLD+LP    + +++++ EL ELKGT L   T      ++  
Subjt:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSL

Query:  YENKVKKLVNRLL--TIEGFSVREVSFEDFLWANSIFWARALN
         EN+  KL   +L    + FS R ++ +DF+WA  I  +RA +
Subjt:  YENKVKKLVNRLL--TIEGFSVREVSFEDFLWANSIFWARALN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAACTCCGATGAAGCAAAGCTCGAACTGTTCCTCCAATGGTTACAGGTTAACGGAGCAGACCTTCGAGGTTGCATGATCAAGTACAGCGATTTAAGCAAA
GGATGTGGACTTTTTTCTGCCAACGATGCCTCTGATGTAAATCTGGTTTTTCCATTGGGGGTGGTGATTGAGTTCCTGCTGAGTAGCTCTGGTGATGAAGTCAAT
GGAATTGACATAATACTCTTAAACACATCAGTCGCTGGTAATGTGATGAGGATGTGTTGGGTCCTTAATCCTAGAAGGGGTCTTAATGAGCTAGAATTCTCCCGA
GTGGATCCTTGCTATGAAACTCCAAGGGTTCAAAGGCAAGGGGAACGTGCTTTAGAACATCATGGTCAAGAGCTTTTTGTGTTTCTCAGGGAAATAGATAATTTT
CTTACTTGTGCAGGGGTTCTATTAGTCGTTCCCCTAGATTTAGCAATCACTCCAATGAGAGTTTTGCAAGATCCTCTTTATGGACCAGAGTGTAGAGCAATGTAT
GAAGAAGATGAAGTAGATGATAGATTTCTGATGATTTTATTCCTCATGGTTGAGTGGCTGCGTGAAAATTCTTCATGGAAACCGTACCTTGATGTGCTTCCTACC
AGATTTGGGAATCCACTTTGGTTTACTGATAACGAGCTTTTGGAACTGAAGGGTACCACACTCTATCGAGCAACTGAACTGCAGAAAAATTCATTGCAGTCATTG
TATGAAAATAAAGTGAAGAAATTGGTTAATAGATTGTTGACTATTGAAGGGTTTTCAGTAAGGGAGGTGAGCTTTGAAGATTTCCTATGGGCAAATTCCATTTTC
TGGGCACGTGCCTTGAACATCCCAATGCCACATGATTATGTATTTCCTAAAATCCAAGAAGAGGTTCGAAGTGACTCCTTCAAAGAAACTATTGAGGTTTCAACT
TCTACTGTCTCCGAGAAATTATTTTACGTTAACTTAACAACTAGGTCAATCTCAGGATGTGAAGATCATCGGATGATTGATAGCACAGCTAGTGGAAAAACATCT
GGGTCGTCAAAGCAAGAAACTGTGTGGGTGGAGGGTCTTGTTCCTGGTGTTGACTTCTGCAATCATGATCTGAAAGCAACAGCAACATGGGAAGTTGATGGAGTA
GGATCCACTACTGGAGTTCCTTTCTCAATGTACCTCCTCTCTGTGTTAGTGGATTTTCTAACCGCTTTTATCCTTGATATATCCGGAAAAGCCAATTCAAGGTCT
TCTGGAATGGAGGAGGTCTCAATCAGTTACGGTAACAAGGGGAATGAGGAGCTCCTTTATCTTTACGGATTTGTCGTTGAAAATAATCCAGATGATTATCTAATG
GTACACTATCCTTTAGAAGCAATCCAGAATGCTTCCTTTTCTGATTCGAAGTTACAGCTCCTTGAAGTACAGAAGGCTGAAATGCGATGTCTTTTACCAAGAAAA
TTGCTGGATCATGGATTTCACCCTCCAAACACCTCAAATATCAAAGAAAATGTTGTCTGCAGCAACCGGTCCTGCAATTACAGCTGGAGTGGTCAGCGCAAGATA
CCTTCTTACTTGGACAAGCTGATATTCCCTGAGAAATTTTTAACTGCGTTGAGAACTATATCTATGCAGGAGGACGAGCTTATGCAGGTTTCATCTTTACTGGCA
GAGATTGTTGGACCTGAAGAGGATAGGCAGCCCACCGACATTGATGTCCAAGCAGCAGTCTGGGAGGCTTGTGGTGACTCTGGAGCCTTGCAGTTGCTTGTTGAT
CTTCTTCAAAAGAAGTTGATGGATCTTGAAGAAGGCACCGGAACTCTGGACAGCGACACTGAGCTGCTGAAAGAGGTCCAAGTAATTGAAAGCACGAATGTAAAT
GGCTCGTGTCAGGATTCTGCAAGCAGAGAGTCAGATGACAGGAAGCCACAAAAGTTGGTGAGCAGGAACCAATGGTCTAGCATTGTTTATCGCCATGGTCAGAAG
CAGCTAACCAGTCTATTTCTGAAGGAGGCAGAACAGGCTTTGCAATTATCATTAAGTGAGGAAAACTGA
mRNA sequenceShow/hide mRNA sequence
GAAAAAAGAAAAAAGAAAAAGAATGCCCCTTGGTTCTTTCTTCTGTTCCCAATTTGTAATTGTGCCTAAAACGCCACAGTAAAAGTCAGTCAGCAGAAAATGGCG
AACTCCGATGAAGCAAAGCTCGAACTGTTCCTCCAATGGTTACAGGTTAACGGAGCAGACCTTCGAGGTTGCATGATCAAGTACAGCGATTTAAGCAAAGGATGT
GGACTTTTTTCTGCCAACGATGCCTCTGATGTAAATCTGGTTTTTCCATTGGGGGTGGTGATTGAGTTCCTGCTGAGTAGCTCTGGTGATGAAGTCAATGGAATT
GACATAATACTCTTAAACACATCAGTCGCTGGTAATGTGATGAGGATGTGTTGGGTCCTTAATCCTAGAAGGGGTCTTAATGAGCTAGAATTCTCCCGAGTGGAT
CCTTGCTATGAAACTCCAAGGGTTCAAAGGCAAGGGGAACGTGCTTTAGAACATCATGGTCAAGAGCTTTTTGTGTTTCTCAGGGAAATAGATAATTTTCTTACT
TGTGCAGGGGTTCTATTAGTCGTTCCCCTAGATTTAGCAATCACTCCAATGAGAGTTTTGCAAGATCCTCTTTATGGACCAGAGTGTAGAGCAATGTATGAAGAA
GATGAAGTAGATGATAGATTTCTGATGATTTTATTCCTCATGGTTGAGTGGCTGCGTGAAAATTCTTCATGGAAACCGTACCTTGATGTGCTTCCTACCAGATTT
GGGAATCCACTTTGGTTTACTGATAACGAGCTTTTGGAACTGAAGGGTACCACACTCTATCGAGCAACTGAACTGCAGAAAAATTCATTGCAGTCATTGTATGAA
AATAAAGTGAAGAAATTGGTTAATAGATTGTTGACTATTGAAGGGTTTTCAGTAAGGGAGGTGAGCTTTGAAGATTTCCTATGGGCAAATTCCATTTTCTGGGCA
CGTGCCTTGAACATCCCAATGCCACATGATTATGTATTTCCTAAAATCCAAGAAGAGGTTCGAAGTGACTCCTTCAAAGAAACTATTGAGGTTTCAACTTCTACT
GTCTCCGAGAAATTATTTTACGTTAACTTAACAACTAGGTCAATCTCAGGATGTGAAGATCATCGGATGATTGATAGCACAGCTAGTGGAAAAACATCTGGGTCG
TCAAAGCAAGAAACTGTGTGGGTGGAGGGTCTTGTTCCTGGTGTTGACTTCTGCAATCATGATCTGAAAGCAACAGCAACATGGGAAGTTGATGGAGTAGGATCC
ACTACTGGAGTTCCTTTCTCAATGTACCTCCTCTCTGTGTTAGTGGATTTTCTAACCGCTTTTATCCTTGATATATCCGGAAAAGCCAATTCAAGGTCTTCTGGA
ATGGAGGAGGTCTCAATCAGTTACGGTAACAAGGGGAATGAGGAGCTCCTTTATCTTTACGGATTTGTCGTTGAAAATAATCCAGATGATTATCTAATGGTACAC
TATCCTTTAGAAGCAATCCAGAATGCTTCCTTTTCTGATTCGAAGTTACAGCTCCTTGAAGTACAGAAGGCTGAAATGCGATGTCTTTTACCAAGAAAATTGCTG
GATCATGGATTTCACCCTCCAAACACCTCAAATATCAAAGAAAATGTTGTCTGCAGCAACCGGTCCTGCAATTACAGCTGGAGTGGTCAGCGCAAGATACCTTCT
TACTTGGACAAGCTGATATTCCCTGAGAAATTTTTAACTGCGTTGAGAACTATATCTATGCAGGAGGACGAGCTTATGCAGGTTTCATCTTTACTGGCAGAGATT
GTTGGACCTGAAGAGGATAGGCAGCCCACCGACATTGATGTCCAAGCAGCAGTCTGGGAGGCTTGTGGTGACTCTGGAGCCTTGCAGTTGCTTGTTGATCTTCTT
CAAAAGAAGTTGATGGATCTTGAAGAAGGCACCGGAACTCTGGACAGCGACACTGAGCTGCTGAAAGAGGTCCAAGTAATTGAAAGCACGAATGTAAATGGCTCG
TGTCAGGATTCTGCAAGCAGAGAGTCAGATGACAGGAAGCCACAAAAGTTGGTGAGCAGGAACCAATGGTCTAGCATTGTTTATCGCCATGGTCAGAAGCAGCTA
ACCAGTCTATTTCTGAAGGAGGCAGAACAGGCTTTGCAATTATCATTAAGTGAGGAAAACTGAACATCTCTTTAACATCTCTCTCTGTCTCTCCAGCCCAGTCTT
ACATGATAGTTAGGTTTCATCTGTTTCTTTGTAATTTAAATGCATGAGAATATCCTTTTTTTTTTTCCCTCCTGATCTTGCTTAGTAACTTATACTGTGAAAATT
GTTTTCATGCATTTCAAAATAAATAATAAAAAAACCCTTCAAAATTACTGGCC
Protein sequenceShow/hide protein sequence
MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLNTSVAGNVMRMCWVLNPRRGLNELEFSR
VDPCYETPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEDEVDDRFLMILFLMVEWLRENSSWKPYLDVLPT
RFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVREVSFEDFLWANSIFWARALNIPMPHDYVFPKIQEEVRSDSFKETIEVST
STVSEKLFYVNLTTRSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLLSVLVDFLTAFILDISGKANSRS
SGMEEVSISYGNKGNEELLYLYGFVVENNPDDYLMVHYPLEAIQNASFSDSKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRSCNYSWSGQRKI
PSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSDTELLKEVQVIESTNVN
GSCQDSASRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN