; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC01G014540 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC01G014540
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionSET domain-containing protein
Genome locationCiama_Chr01:27569780..27579110
RNA-Seq ExpressionCaUC01G014540
SyntenyCaUC01G014540
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025202.1 SET domain-containing protein [Cucumis melo var. makuwa]4.3e-26666.16Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA

Query:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN
                                                      GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVE LREN
Subjt:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN

Query:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFS---------VRANSIFWARALNIPMPHDYVFPKI
        SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS          RANSIFWARALNIPMPHDYVFPKI
Subjt:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFS---------VRANSIFWARALNIPMPHDYVFPKI

Query:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS
        QEEV SDS  +ET EVSTSAV +VQLACTKNEDG     GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFS
Subjt:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS

Query:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE
        MYLLS     A ILDISGKA SRSSG+EEVSISYGNKGNE                                                 ELLYLYGFV+E
Subjt:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE

Query:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS
        +NPDDYLM                                 VHYPLEAIQNASFS+SKLQLLE+QKAEMRCLLPR+LLDHGFHPP TSNIKENV CSNR+
Subjt:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS

Query:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD
        CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQKK+MDLEEGTGTLDSD
Subjt:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD

Query:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        T+LLKE QVTES N NG CQDSA                    R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

TYK07464.1 SET domain-containing protein [Cucumis melo var. makuwa]1.0e-26265.91Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA

Query:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN
                                                      GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVE LREN
Subjt:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN

Query:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI
        SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS R         ANSIFWARALNIPMPHDYVFPKI
Subjt:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI

Query:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS
        QEEV SDS  +ET EVSTSAV +VQLACTKNED      GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFS
Subjt:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS

Query:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE
        MYLLS     A ILDISGKA SRSSG+EEVSISYGNKGNE                                                 ELLYLYGFV+E
Subjt:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE

Query:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS
        +NPDDYLM                                 VHYPLEAIQNASFS+SKLQLLE+QKAEMRCLLPR+LLDHGFHPP TSNIKENV CSNR+
Subjt:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS

Query:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD
        CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQKK+MDLEEGTGTLDSD
Subjt:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD

Query:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        T+LLKE QVTES N NG  QDSA                    R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

XP_008462641.1 PREDICTED: uncharacterized protein LOC103500952 isoform X2 [Cucumis melo]4.8e-26566.04Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA

Query:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN
                                                      GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVE LREN
Subjt:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN

Query:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI
        SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS R         ANSIFWARALNIPMPHDYVFPKI
Subjt:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI

Query:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS
        QEEV SDS  +ET EVSTSAV +VQLACTKNED      GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFS
Subjt:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS

Query:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE
        MYLLS     A ILDISGKA SRSSG+EEVSISYGNKGNE                                                 ELLYLYGFV+E
Subjt:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE

Query:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS
        +NPDDYLM                                 VHYPLEAIQNASFS+SKLQLLE+QKAEMRCLLPR+LLDHGFHPP TSNIKENV CSNR+
Subjt:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS

Query:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD
        CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQKK+MDLEEGTGTLDSD
Subjt:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD

Query:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        T+LLKE QVTES N NG CQDSA                    R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

XP_016902907.1 PREDICTED: uncharacterized protein LOC103500952 isoform X1 [Cucumis melo]1.3e-26566.16Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA

Query:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN
                                                      GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVE LREN
Subjt:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN

Query:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI
        SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS R         ANSIFWARALNIPMPHDYVFPKI
Subjt:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI

Query:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS
        QEEV SDS  +ET EVSTSAV +VQLACTKNEDG    S CED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFS
Subjt:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS

Query:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE
        MYLLS     A ILDISGKA SRSSG+EEVSISYGNKGNE                                                 ELLYLYGFV+E
Subjt:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE

Query:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS
        +NPDDYLM                                 VHYPLEAIQNASFS+SKLQLLE+QKAEMRCLLPR+LLDHGFHPP TSNIKENV CSNR+
Subjt:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS

Query:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD
        CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQKK+MDLEEGTGTLDSD
Subjt:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD

Query:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        T+LLKE QVTES N NG CQDSA                    R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

XP_038881849.1 protein-lysine N-methyltransferase EFM1 isoform X1 [Benincasa hispida]2.6e-26365.91Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA
        MANSDEAKLELFLQWLQVNGADLRGC IK+SDLSKGCGLFSANDA D                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA

Query:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN
                                                      GVLLVVPLDLAITPMRVLQD LYGPECRAMYEEGEVDDRFLMILFLMVE LREN
Subjt:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN

Query:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI
        SSWKPYLDVLPT FGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVN+LLT+EGFSVR         ANSIFWARALNIPMPH YVFPKI
Subjt:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI

Query:  QEEVRSDSF-KETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS
        QEEV SDS  +ET EVSTS VSEVQLACTKNED      GCEDHRMID+TA GKT GS KQETVWVEGLVPGVDFCNHDLKA ATWEVD +GSTTGVPFS
Subjt:  QEEVRSDSF-KETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS

Query:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE
        MYLLS              ANSRSSG+EEVSISYGNKGNE                                                 ELLYLYGF +E
Subjt:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE

Query:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS
        NNPDDYLM                                 VHYPLEAIQNA FS+ KLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNR+
Subjt:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS

Query:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD
        CNYSWSGQRKLPSYLDKLIFPEKF+TALRTISMQEDELMQVSSLLAEIVGPEED+QPTDIDVQAAVWE CGDSGALQLLVDLLQKK+MDLEEGTGTLDSD
Subjt:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD

Query:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        T+LLKEVQV ESTN NGSCQ+SA                   SRESDDRKPQ L+SRNQWSSIVYRHGQKQLTSLFLKEAE ALQLSLSE+N
Subjt:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

TrEMBL top hitse value%identityAlignment
A0A1S3CHD9 uncharacterized protein LOC103500952 isoform X22.3e-26566.04Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA

Query:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN
                                                      GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVE LREN
Subjt:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN

Query:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI
        SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS R         ANSIFWARALNIPMPHDYVFPKI
Subjt:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI

Query:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS
        QEEV SDS  +ET EVSTSAV +VQLACTKNED      GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFS
Subjt:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS

Query:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE
        MYLLS     A ILDISGKA SRSSG+EEVSISYGNKGNE                                                 ELLYLYGFV+E
Subjt:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE

Query:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS
        +NPDDYLM                                 VHYPLEAIQNASFS+SKLQLLE+QKAEMRCLLPR+LLDHGFHPP TSNIKENV CSNR+
Subjt:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS

Query:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD
        CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQKK+MDLEEGTGTLDSD
Subjt:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD

Query:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        T+LLKE QVTES N NG CQDSA                    R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

A0A1S4E3V3 uncharacterized protein LOC103500952 isoform X31.0e-26065.15Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA

Query:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN
                                                      GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVE LREN
Subjt:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN

Query:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI
        SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS R         ANSIFWARALNIPMPHDYVFPKI
Subjt:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI

Query:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS
        QEEV SDS  +ET EVSTSAV +VQLACTKNEDG    S CED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFS
Subjt:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS

Query:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE
        MYLLS              A SRSSG+EEVSISYGNKGNE                                                 ELLYLYGFV+E
Subjt:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE

Query:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS
        +NPDDYLM                                 VHYPLEAIQNASFS+SKLQLLE+QKAEMRCLLPR+LLDHGFHPP TSNIKENV CSNR+
Subjt:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS

Query:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD
        CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQKK+MDLEEGTGTLDSD
Subjt:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD

Query:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        T+LLKE QVTES N NG CQDSA                    R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

A0A1S4E3V9 uncharacterized protein LOC103500952 isoform X16.1e-26666.16Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA

Query:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN
                                                      GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVE LREN
Subjt:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN

Query:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI
        SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS R         ANSIFWARALNIPMPHDYVFPKI
Subjt:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI

Query:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS
        QEEV SDS  +ET EVSTSAV +VQLACTKNEDG    S CED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFS
Subjt:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS

Query:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE
        MYLLS     A ILDISGKA SRSSG+EEVSISYGNKGNE                                                 ELLYLYGFV+E
Subjt:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE

Query:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS
        +NPDDYLM                                 VHYPLEAIQNASFS+SKLQLLE+QKAEMRCLLPR+LLDHGFHPP TSNIKENV CSNR+
Subjt:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS

Query:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD
        CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQKK+MDLEEGTGTLDSD
Subjt:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD

Query:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        T+LLKE QVTES N NG CQDSA                    R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

A0A5A7SJ36 SET domain-containing protein2.1e-26666.16Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA

Query:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN
                                                      GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVE LREN
Subjt:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN

Query:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFS---------VRANSIFWARALNIPMPHDYVFPKI
        SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS          RANSIFWARALNIPMPHDYVFPKI
Subjt:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFS---------VRANSIFWARALNIPMPHDYVFPKI

Query:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS
        QEEV SDS  +ET EVSTSAV +VQLACTKNEDG     GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFS
Subjt:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS

Query:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE
        MYLLS     A ILDISGKA SRSSG+EEVSISYGNKGNE                                                 ELLYLYGFV+E
Subjt:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE

Query:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS
        +NPDDYLM                                 VHYPLEAIQNASFS+SKLQLLE+QKAEMRCLLPR+LLDHGFHPP TSNIKENV CSNR+
Subjt:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS

Query:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD
        CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQKK+MDLEEGTGTLDSD
Subjt:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD

Query:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        T+LLKE QVTES N NG CQDSA                    R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

A0A5D3C897 SET domain-containing protein4.8e-26365.91Show/hide
Query:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA
        MANSDEA LELFLQWLQVNGADLRGC IKYSDLSKGCGLFSANDASD                                                     
Subjt:  MANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIA

Query:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN
                                                      GVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVE LREN
Subjt:  TYARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLREN

Query:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI
        SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLV+RLLT+EGFS R         ANSIFWARALNIPMPHDYVFPKI
Subjt:  SSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVR---------ANSIFWARALNIPMPHDYVFPKI

Query:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS
        QEEV SDS  +ET EVSTSAV +VQLACTKNED      GCED RMIDS A+G+T GS KQETVWVEGLVPGVDFCNHDLKATATWEVDG+GSTTGVPFS
Subjt:  QEEVRSDS-FKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFS

Query:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE
        MYLLS     A ILDISGKA SRSSG+EEVSISYGNKGNE                                                 ELLYLYGFV+E
Subjt:  MYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVE

Query:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS
        +NPDDYLM                                 VHYPLEAIQNASFS+SKLQLLE+QKAEMRCLLPR+LLDHGFHPP TSNIKENV CSNR+
Subjt:  NNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRS

Query:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD
        CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISM+EDELMQVSSLLAEIVGPEEDRQPTD DVQAAVWEACGDSGALQLLVDLLQKK+MDLEEGTGTLDSD
Subjt:  CNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLDSD

Query:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
        T+LLKE QVTES N NG  QDSA                    R  DD+KPQ L+SRNQW SIVYRHGQK+LTSLFLKEAE AL LSLSEEN
Subjt:  TELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

SwissProt top hitse value%identityAlignment
P94026 Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic2.5e-0631.4Show/hide
Query:  LFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVRANSIFWA
        LFL+ E  R++S WK Y+DVLP    + +++++ EL E++GT L   T   K+ +Q+ ++   ++++ R   +  F +  +  FWA
Subjt:  LFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVRANSIFWA

Q43088 Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic5.2e-0431.11Show/hide
Query:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRAT----ELQKNS
        V+L VP  L I P  V    + G  C       E+     +ILFL+ E  RE+S WK Y  +LP    + +++++ EL EL+G+ L + T    E  KN 
Subjt:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRAT----ELQKNS

Query:  LQSLYENKV---KKLVNRLLTIEGFSVRANSIFWA
           L +  +   K+L    +T++ F       FWA
Subjt:  LQSLYENKV---KKLVNRLLTIEGFSVRANSIFWA

Q9XI84 [Fructose-bisphosphate aldolase]-lysine N-methyltransferase, chloroplastic6.1e-0533.63Show/hide
Query:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSL
        V+L +P  L I P  V    + GP C      G +     + LFL+ E   E SSW+ YLD+LP    + +++++ EL ELKGT L   T      ++  
Subjt:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSL

Query:  YENKVKKLVNRLL
         EN+  KL   +L
Subjt:  YENKVKKLVNRLL

Arabidopsis top hitse value%identityAlignment
AT1G01920.1 SET domain-containing protein6.4e-15942.71Show/hide
Query:  NSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIATY
        + +EAKLE FL WLQVNG +LRGC IKYSD  KG G+F++                    + + DE                                  
Subjt:  NSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIATY

Query:  ARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSS
                                                     VLLVVPLDLAITPMRVLQDPL GPEC+ M+E+G+VDDRFLMILFL +E LR NSS
Subjt:  ARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFS---------VRANSIFWARALNIPMPHDYVFPKIQE
        WKPYLD+LPTRFGNPLWF+D+++LELKGT LY ATELQK  L SLY +KV+ LV +LL ++G S         + ANS+FW+RALNIP+PH +VFP+ Q+
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFS---------VRANSIFWARALNIPMPHDYVFPKIQE

Query:  EVRSDSFKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYL
                +T E ++++ S        NE+  +S++  +      S  SG        +T+WVEGLVPG+DFCNHDLK  ATWEVDG+GS + VPFSMYL
Subjt:  EVRSDSFKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYL

Query:  LSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVENNP
        LSV                R    +E+SISYGNKGNE                                                 ELLYLYGFV++NNP
Subjt:  LSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVENNP

Query:  DDYLMV------ILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCS
        DDYLM+       +  +V   +NG++                  VHYP+EAI +  FS+SK QLLE Q A++RCLLP+ +L+HGF P  TS I+E+    
Subjt:  DDYLMV------ILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCS

Query:  N-RSCNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGT
          RSCN+SWSG+RK+P+Y++KL+FPE F+T LRTI+MQE+E+ +VS++L E+V   +  QP++ +V+ AVWEACGDSGALQLLVDLL  K+M LEE +GT
Subjt:  N-RSCNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGT

Query:  LDSDTELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
         + D  LL+E  V ES                              SR+ D R+    +SRN+WSS+VYR GQKQLT L LKEAE AL L+LS ++
Subjt:  LDSDTELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

AT1G01920.2 SET domain-containing protein3.3e-15542.32Show/hide
Query:  NSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIATY
        + +EAKLE FL WLQVNG +LRGC IKYSD  KG G+F++                    + + DE                                  
Subjt:  NSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIATY

Query:  ARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSS
                                                     VLLVVPLDLAITPMRVLQDPL GPEC+ M+E+G+VDDRFLMILFL +E LR NSS
Subjt:  ARNIGKPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSS

Query:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFS---------VRANSIFWARALNIPMPHDYVFPKIQE
        WKPYLD+LPTRFGNPLWF+D+++LELKGT LY ATELQK  L SLY +KV+ LV +LL ++G S         + ANS+FW+RALNIP+PH +VFP+ Q+
Subjt:  WKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFS---------VRANSIFWARALNIPMPHDYVFPKIQE

Query:  E----VRSDSFKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPF
        +      +    ET  V+++   E+Q                   +   S  SG        +T+WVEGLVPG+DFCNHDLK  ATWEVDG+GS + VPF
Subjt:  E----VRSDSFKETIEVSTSAVSEVQLACTKNEDGARSISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPF

Query:  SMYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVV
        SMYLLSV                R    +E+SISYGNKGNE                                                 ELLYLYGFV+
Subjt:  SMYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRKHRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVV

Query:  ENNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSN-
        +NNPDDYLM                                 VHYP+EAI +  FS+SK QLLE Q A++RCLLP+ +L+HGF P  TS I+E+      
Subjt:  ENNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSESKLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSN-

Query:  RSCNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLD
        RSCN+SWSG+RK+P+Y++KL+FPE F+T LRTI+MQE+E+ +VS++L E+V   +  QP++ +V+ AVWEACGDSGALQLLVDLL  K+M LEE +GT +
Subjt:  RSCNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVWEACGDSGALQLLVDLLQKKLMDLEEGTGTLD

Query:  SDTELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN
         D  LL+E  V ES                              SR+ D R+    +SRN+WSS+VYR GQKQLT L LKEAE AL L+LS ++
Subjt:  SDTELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFLKEAEQALQLSLSEEN

AT1G14030.1 Rubisco methyltransferase family protein4.3e-0633.63Show/hide
Query:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSL
        V+L +P  L I P  V    + GP C      G +     + LFL+ E   E SSW+ YLD+LP    + +++++ EL ELKGT L   T      ++  
Subjt:  VLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNPLWFTDNELLELKGTTLYRATELQKNSLQSL

Query:  YENKVKKLVNRLL
         EN+  KL   +L
Subjt:  YENKVKKLVNRLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAAAATGGCGAACTCCGATGAAGCAAAGCTCGAACTGTTCCTCCAATGGTTACAGGTTAACGGAGCAGACCTTCGAGGTTGCATGATCAAGTACAGCGATTTAAG
CAAAGGATGTGGACTTTTTTCTGCCAACGATGCCTCTGATGTAAATCTGGTTTTTCCATTGGGGGTGGTGATTGAGTTCCTGCTGAGTAGCTCTGGTGATGAAGTCAATG
GAATTGACATAATACTCTTAAAGTATCAACATTTAGCATACGAGATAGTTTCCAGACCAAACACCATCAAATGTCCCTCTCGTATTGCTACTTATGCAAGGAACATTGGG
AAACCTCAGACGTATCTTCTGCCAAGGGTTCAAAGGCAAGGGGAACGTGCTTTAGAACATCATGGTCAAGAGCTTTTTGTGTTTCTCAGGGAAATAGATAATTTCCTTAC
TTGTGCAGGGGTTCTATTAGTCGTTCCCCTAGATTTAGCAATCACTCCAATGAGAGTTTTGCAAGATCCTCTTTATGGACCAGAGTGTAGAGCAATGTATGAAGAAGGTG
AAGTAGATGATAGATTTCTGATGATTTTATTCCTCATGGTTGAGTGGCTGCGTGAAAATTCTTCATGGAAACCGTACCTTGATGTGCTTCCTACAAGATTTGGGAATCCA
CTTTGGTTTACCGATAACGAGCTTTTGGAACTGAAGGGTACCACACTCTATCGAGCAACTGAACTGCAGAAAAATTCATTGCAGTCATTGTATGAAAATAAAGTGAAGAA
ATTGGTTAATAGATTGTTGACTATTGAAGGGTTTTCAGTAAGGGCAAATTCCATTTTCTGGGCACGTGCCTTGAACATCCCAATGCCACATGATTATGTATTTCCTAAAA
TCCAAGAAGAGGTTCGAAGTGACTCCTTCAAAGAAACTATTGAGGTTTCAACTTCTGCTGTCTCCGAGGTGCAGTTGGCTTGCACTAAGAATGAAGATGGTGCTCGGTCA
ATCTCAGGATGTGAAGATCATCGGATGATTGATAGCACAGCTAGTGGAAAAACATCTGGGTCGTCAAAGCAAGAAACTGTGTGGGTGGAGGGTCTTGTTCCTGGTGTTGA
CTTCTGCAATCATGATCTGAAAGCAACAGCAACATGGGAAGTTGATGGAGTAGGATCCACTACTGGAGTTCCTTTCTCAATGTACCTCCTCTCTGTGGATTTTCTAACCG
CTTTTATCCTTGATATATCCGGAAAAGCCAATTCAAGGTCTTCTGGAATGGAGGAGGTCTCAATCAGTTACGGTAACAAGGGGAATGAGGCATACAATGAAAGAAGGAAA
CATCGTGCTTACTCTAGATTAAATTGTTGCATCTGCAGTTTTTTATCAAAAAAAAAAAGTTGCATCTGCAGTTTTTATCAAAAAAAAAAGTTGCATCTGCAGTTAGGTTT
TGCCCTTCTTTGGCAGGAGCTCCTTTATCTTTACGGATTTGTCGTTGAAAATAATCCAGATGATTATCTAATGGTAATTCTATCTAACACAGTAGGAATACTGGACAATG
GGTATGTAAGTGGGATGGTTTGGGTCCGTCTTTGGTCAATGCCGACTCCAAACCTACTGACGGTACACTATCCTTTAGAAGCAATCCAGAATGCTTCCTTTTCTGAATCG
AAGTTACAGCTCCTTGAAGTACAGAAGGCTGAAATGCGATGTCTTTTACCAAGAAAATTGCTGGATCATGGATTTCACCCTCCAAACACCTCAAATATCAAAGAAAATGT
TGTCTGCAGCAACCGGTCCTGCAATTACAGCTGGAGTGGTCAGCGCAAGCTACCTTCTTACTTGGACAAGCTGATATTCCCTGAGAAATTTTTAACTGCGTTGAGAACTA
TATCTATGCAGGAGGACGAGCTTATGCAGGTTTCATCTTTACTGGCAGAGATTGTTGGACCTGAAGAGGATAGGCAGCCCACCGACATTGATGTCCAAGCAGCAGTCTGG
GAGGCTTGTGGTGACTCTGGAGCCTTGCAGTTGCTTGTTGATCTTCTTCAAAAGAAGTTGATGGATCTTGAAGAAGGCACCGGAACTCTGGACAGCGACACTGAGCTGCT
GAAAGAGGTCCAAGTAACTGAAAGCACGAATGTAAATGGCTCGTGTCAGGATTCTGCAAGTCCCATCAGTTCTTACAAACGTGTCTCTTTCTTCATTCCCTTTCTCCTTG
GCTGCAGCAGAGAGTCAGATGACAGGAAGCCACAAAAGTTGGTGAGCAGGAACCAATGGTCTAGCATTGTTTATCGCCATGGTCAGAAGCAGCTAACCAGTCTATTTCTG
AAGGAGGCAGAACAGGCTTTGCAATTATCATTAAGTGAGGAAAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAAAAATGGCGAACTCCGATGAAGCAAAGCTCGAACTGTTCCTCCAATGGTTACAGGTTAACGGAGCAGACCTTCGAGGTTGCATGATCAAGTACAGCGATTTAAG
CAAAGGATGTGGACTTTTTTCTGCCAACGATGCCTCTGATGTAAATCTGGTTTTTCCATTGGGGGTGGTGATTGAGTTCCTGCTGAGTAGCTCTGGTGATGAAGTCAATG
GAATTGACATAATACTCTTAAAGTATCAACATTTAGCATACGAGATAGTTTCCAGACCAAACACCATCAAATGTCCCTCTCGTATTGCTACTTATGCAAGGAACATTGGG
AAACCTCAGACGTATCTTCTGCCAAGGGTTCAAAGGCAAGGGGAACGTGCTTTAGAACATCATGGTCAAGAGCTTTTTGTGTTTCTCAGGGAAATAGATAATTTCCTTAC
TTGTGCAGGGGTTCTATTAGTCGTTCCCCTAGATTTAGCAATCACTCCAATGAGAGTTTTGCAAGATCCTCTTTATGGACCAGAGTGTAGAGCAATGTATGAAGAAGGTG
AAGTAGATGATAGATTTCTGATGATTTTATTCCTCATGGTTGAGTGGCTGCGTGAAAATTCTTCATGGAAACCGTACCTTGATGTGCTTCCTACAAGATTTGGGAATCCA
CTTTGGTTTACCGATAACGAGCTTTTGGAACTGAAGGGTACCACACTCTATCGAGCAACTGAACTGCAGAAAAATTCATTGCAGTCATTGTATGAAAATAAAGTGAAGAA
ATTGGTTAATAGATTGTTGACTATTGAAGGGTTTTCAGTAAGGGCAAATTCCATTTTCTGGGCACGTGCCTTGAACATCCCAATGCCACATGATTATGTATTTCCTAAAA
TCCAAGAAGAGGTTCGAAGTGACTCCTTCAAAGAAACTATTGAGGTTTCAACTTCTGCTGTCTCCGAGGTGCAGTTGGCTTGCACTAAGAATGAAGATGGTGCTCGGTCA
ATCTCAGGATGTGAAGATCATCGGATGATTGATAGCACAGCTAGTGGAAAAACATCTGGGTCGTCAAAGCAAGAAACTGTGTGGGTGGAGGGTCTTGTTCCTGGTGTTGA
CTTCTGCAATCATGATCTGAAAGCAACAGCAACATGGGAAGTTGATGGAGTAGGATCCACTACTGGAGTTCCTTTCTCAATGTACCTCCTCTCTGTGGATTTTCTAACCG
CTTTTATCCTTGATATATCCGGAAAAGCCAATTCAAGGTCTTCTGGAATGGAGGAGGTCTCAATCAGTTACGGTAACAAGGGGAATGAGGCATACAATGAAAGAAGGAAA
CATCGTGCTTACTCTAGATTAAATTGTTGCATCTGCAGTTTTTTATCAAAAAAAAAAAGTTGCATCTGCAGTTTTTATCAAAAAAAAAAGTTGCATCTGCAGTTAGGTTT
TGCCCTTCTTTGGCAGGAGCTCCTTTATCTTTACGGATTTGTCGTTGAAAATAATCCAGATGATTATCTAATGGTAATTCTATCTAACACAGTAGGAATACTGGACAATG
GGTATGTAAGTGGGATGGTTTGGGTCCGTCTTTGGTCAATGCCGACTCCAAACCTACTGACGGTACACTATCCTTTAGAAGCAATCCAGAATGCTTCCTTTTCTGAATCG
AAGTTACAGCTCCTTGAAGTACAGAAGGCTGAAATGCGATGTCTTTTACCAAGAAAATTGCTGGATCATGGATTTCACCCTCCAAACACCTCAAATATCAAAGAAAATGT
TGTCTGCAGCAACCGGTCCTGCAATTACAGCTGGAGTGGTCAGCGCAAGCTACCTTCTTACTTGGACAAGCTGATATTCCCTGAGAAATTTTTAACTGCGTTGAGAACTA
TATCTATGCAGGAGGACGAGCTTATGCAGGTTTCATCTTTACTGGCAGAGATTGTTGGACCTGAAGAGGATAGGCAGCCCACCGACATTGATGTCCAAGCAGCAGTCTGG
GAGGCTTGTGGTGACTCTGGAGCCTTGCAGTTGCTTGTTGATCTTCTTCAAAAGAAGTTGATGGATCTTGAAGAAGGCACCGGAACTCTGGACAGCGACACTGAGCTGCT
GAAAGAGGTCCAAGTAACTGAAAGCACGAATGTAAATGGCTCGTGTCAGGATTCTGCAAGTCCCATCAGTTCTTACAAACGTGTCTCTTTCTTCATTCCCTTTCTCCTTG
GCTGCAGCAGAGAGTCAGATGACAGGAAGCCACAAAAGTTGGTGAGCAGGAACCAATGGTCTAGCATTGTTTATCGCCATGGTCAGAAGCAGCTAACCAGTCTATTTCTG
AAGGAGGCAGAACAGGCTTTGCAATTATCATTAAGTGAGGAAAACTGA
Protein sequenceShow/hide protein sequence
MQKMANSDEAKLELFLQWLQVNGADLRGCMIKYSDLSKGCGLFSANDASDVNLVFPLGVVIEFLLSSSGDEVNGIDIILLKYQHLAYEIVSRPNTIKCPSRIATYARNIG
KPQTYLLPRVQRQGERALEHHGQELFVFLREIDNFLTCAGVLLVVPLDLAITPMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVEWLRENSSWKPYLDVLPTRFGNP
LWFTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVNRLLTIEGFSVRANSIFWARALNIPMPHDYVFPKIQEEVRSDSFKETIEVSTSAVSEVQLACTKNEDGARS
ISGCEDHRMIDSTASGKTSGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGVGSTTGVPFSMYLLSVDFLTAFILDISGKANSRSSGMEEVSISYGNKGNEAYNERRK
HRAYSRLNCCICSFLSKKKSCICSFYQKKKLHLQLGFALLWQELLYLYGFVVENNPDDYLMVILSNTVGILDNGYVSGMVWVRLWSMPTPNLLTVHYPLEAIQNASFSES
KLQLLEVQKAEMRCLLPRKLLDHGFHPPNTSNIKENVVCSNRSCNYSWSGQRKLPSYLDKLIFPEKFLTALRTISMQEDELMQVSSLLAEIVGPEEDRQPTDIDVQAAVW
EACGDSGALQLLVDLLQKKLMDLEEGTGTLDSDTELLKEVQVTESTNVNGSCQDSASPISSYKRVSFFIPFLLGCSRESDDRKPQKLVSRNQWSSIVYRHGQKQLTSLFL
KEAEQALQLSLSEEN