; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019785 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019785
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionhistone-lysine N-methyltransferase SUVR3-like
Genome locationChr04:25468406..25470333
RNA-Seq ExpressionHG10019785
SyntenyHG10019785
Gene Ontology termsGO:0006325 - chromatin organization (biological process)
GO:0034968 - histone lysine methylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0018024 - histone-lysine N-methyltransferase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001214 - SET domain
IPR003616 - Post-SET domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136662.1 histone-lysine N-methyltransferase SUVR3 isoform X1 [Cucumis sativus]3.3e-17888.95Show/hide
Query:  MQRAVSKKCLKTSETEEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPN
        M+ AVS KCLKTSE E  EE LN GLLHCAHLVLPWLT LEL+ ISLSCKSLNATSKSITLRR LDASRSLEKIPIPFHN IDDR YAFF+YTPT II N
Subjt:  MQRAVSKKCLKTSETEEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPN

Query:  HHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY
         HF+RQCWGSISD QS H ESES+NLVD+WV GVFGCDCENCG+FE QCPC S DGL DVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY
Subjt:  HHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY

Query:  ADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCF
        ADELIQEGAFICEYAGELLTT EARRRQKIYDARAK G+FASSLLVVREHLPSGNACLR+NIDATWIGNVARFINHSCDGGNLVTRLVR TGVMLPRLCF
Subjt:  ADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCF

Query:  YASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        YAS+SIS +EELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  YASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

XP_008443304.1 PREDICTED: histone-lysine N-methyltransferase SUVR3 isoform X1 [Cucumis melo]6.0e-18088.76Show/hide
Query:  MQRAVSKKCLKTSETEE---GEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI
        M+  VS KCLKTSE EE    EEHLN GLLHCAHLVLPWLT LEL+ ISLSCKSLNA SKSITLRR LDASRSLEKIPIPFHNPIDDR YAFF+YTPT I
Subjt:  MQRAVSKKCLKTSETEE---GEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI

Query:  IPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW
        I N HF+RQCWGSISDSQSGH ES+S+NLVD+WV GVFGCDCENCGEF+ QCPC S DGL DVASECGPRCSCG ECENRLTQRGISVRLKILRDEKKGW
Subjt:  IPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW

Query:  GLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR
        GLYADELIQEGAFICEYAGELLTT EARRRQKIYDARAK G+FASSLLVVREHLPSGNACLR+NIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR
Subjt:  GLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR

Query:  LCFYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        LCFYAS+SIS +EELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  LCFYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

XP_008443306.1 PREDICTED: histone-lysine N-methyltransferase SUVR3 isoform X2 [Cucumis melo]1.2e-17587.61Show/hide
Query:  MQRAVSKKCLKTSETEE---GEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI
        M+  VS KCLKTSE EE    EEHLN GLLHCAHLVLPWLT LEL+ ISLSCKSLNA SKSITLRR LDASRSLEKIPIPFHNPIDDR YAFF+YTPT I
Subjt:  MQRAVSKKCLKTSETEE---GEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI

Query:  IPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW
        I N HF+RQCWGSISDSQSGH ES+S+NLVD+WV GVFGCDCENCGEF+ QCPC S DGL DVASECGPRCSCG ECENRLTQRGISVRLKILRDEKKGW
Subjt:  IPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW

Query:  GLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR
        GLYADELIQEGAFIC    ELLTT EARRRQKIYDARAK G+FASSLLVVREHLPSGNACLR+NIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR
Subjt:  GLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR

Query:  LCFYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        LCFYAS+SIS +EELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  LCFYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

XP_011652203.1 histone-lysine N-methyltransferase SUVR3 isoform X2 [Cucumis sativus]6.5e-17487.79Show/hide
Query:  MQRAVSKKCLKTSETEEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPN
        M+ AVS KCLKTSE E  EE LN GLLHCAHLVLPWLT LEL+ ISLSCKSLNATSKSITLRR LDASRSLEKIPIPFHN IDDR YAFF+YTPT II N
Subjt:  MQRAVSKKCLKTSETEEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPN

Query:  HHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY
         HF+RQCWGSISD QS H ESES+NLVD+WV GVFGCDCENCG+FE QCPC S DGL DVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY
Subjt:  HHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY

Query:  ADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCF
        ADELIQEGAFIC    ELLTT EARRRQKIYDARAK G+FASSLLVVREHLPSGNACLR+NIDATWIGNVARFINHSCDGGNLVTRLVR TGVMLPRLCF
Subjt:  ADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCF

Query:  YASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        YAS+SIS +EELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  YASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

XP_038906247.1 histone-lysine N-methyltransferase SUVR3 [Benincasa hispida]3.9e-18792.17Show/hide
Query:  MQRAVSKKCLKTSET-EEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIP
        M+RAVSKKC KTSE  EE EEHLNSGLLHCAHLVLPWLT LEL++ISLSCK LNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPT II 
Subjt:  MQRAVSKKCLKTSET-EEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIP

Query:  NHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGL
        NHHFERQCWGSISD QS HVESESMNLVDDWV GVFGCDCENCG+FEFQC CSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGL
Subjt:  NHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGL

Query:  YADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLC
        +ADELIQEG F+CEYAGELLTT EAR+RQKIYDA AK G+FASSLLVVREHLPSGNACLR+NIDATWIGNVARFINHSCDGGNL TRLVRSTGVMLPRLC
Subjt:  YADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLC

Query:  FYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        FYASRSIS DEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  FYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

TrEMBL top hitse value%identityAlignment
A0A0A0LF10 Uncharacterized protein1.6e-17888.95Show/hide
Query:  MQRAVSKKCLKTSETEEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPN
        M+ AVS KCLKTSE E  EE LN GLLHCAHLVLPWLT LEL+ ISLSCKSLNATSKSITLRR LDASRSLEKIPIPFHN IDDR YAFF+YTPT II N
Subjt:  MQRAVSKKCLKTSETEEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPN

Query:  HHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY
         HF+RQCWGSISD QS H ESES+NLVD+WV GVFGCDCENCG+FE QCPC S DGL DVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY
Subjt:  HHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY

Query:  ADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCF
        ADELIQEGAFICEYAGELLTT EARRRQKIYDARAK G+FASSLLVVREHLPSGNACLR+NIDATWIGNVARFINHSCDGGNLVTRLVR TGVMLPRLCF
Subjt:  ADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCF

Query:  YASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        YAS+SIS +EELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  YASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

A0A1S3B7R7 histone-lysine N-methyltransferase SUVR3 isoform X12.9e-18088.76Show/hide
Query:  MQRAVSKKCLKTSETEE---GEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI
        M+  VS KCLKTSE EE    EEHLN GLLHCAHLVLPWLT LEL+ ISLSCKSLNA SKSITLRR LDASRSLEKIPIPFHNPIDDR YAFF+YTPT I
Subjt:  MQRAVSKKCLKTSETEE---GEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI

Query:  IPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW
        I N HF+RQCWGSISDSQSGH ES+S+NLVD+WV GVFGCDCENCGEF+ QCPC S DGL DVASECGPRCSCG ECENRLTQRGISVRLKILRDEKKGW
Subjt:  IPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW

Query:  GLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR
        GLYADELIQEGAFICEYAGELLTT EARRRQKIYDARAK G+FASSLLVVREHLPSGNACLR+NIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR
Subjt:  GLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR

Query:  LCFYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        LCFYAS+SIS +EELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  LCFYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

A0A1S3B8F3 histone-lysine N-methyltransferase SUVR3 isoform X25.7e-17687.61Show/hide
Query:  MQRAVSKKCLKTSETEE---GEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI
        M+  VS KCLKTSE EE    EEHLN GLLHCAHLVLPWLT LEL+ ISLSCKSLNA SKSITLRR LDASRSLEKIPIPFHNPIDDR YAFF+YTPT I
Subjt:  MQRAVSKKCLKTSETEE---GEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI

Query:  IPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW
        I N HF+RQCWGSISDSQSGH ES+S+NLVD+WV GVFGCDCENCGEF+ QCPC S DGL DVASECGPRCSCG ECENRLTQRGISVRLKILRDEKKGW
Subjt:  IPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW

Query:  GLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR
        GLYADELIQEGAFIC    ELLTT EARRRQKIYDARAK G+FASSLLVVREHLPSGNACLR+NIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR
Subjt:  GLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR

Query:  LCFYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        LCFYAS+SIS +EELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  LCFYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

A0A5D3DQF9 Histone-lysine N-methyltransferase SUVR3 isoform X12.9e-18088.76Show/hide
Query:  MQRAVSKKCLKTSETEE---GEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI
        M+  VS KCLKTSE EE    EEHLN GLLHCAHLVLPWLT LEL+ ISLSCKSLNA SKSITLRR LDASRSLEKIPIPFHNPIDDR YAFF+YTPT I
Subjt:  MQRAVSKKCLKTSETEE---GEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI

Query:  IPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW
        I N HF+RQCWGSISDSQSGH ES+S+NLVD+WV GVFGCDCENCGEF+ QCPC S DGL DVASECGPRCSCG ECENRLTQRGISVRLKILRDEKKGW
Subjt:  IPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW

Query:  GLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR
        GLYADELIQEGAFICEYAGELLTT EARRRQKIYDARAK G+FASSLLVVREHLPSGNACLR+NIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR
Subjt:  GLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPR

Query:  LCFYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        LCFYAS+SIS +EELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  LCFYASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

A0A6J1J706 histone-lysine N-methyltransferase SUVR3 isoform X11.1e-17185.76Show/hide
Query:  MQRAVSKKCLKTSETEEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPN
        M +A SKKCLKTSE E  EE L SGLLHCAHLVLPWLT LEL++ISLSCK LNATSKSITLRRILDASRS+E IPIPFH  I+D PYAFF+YTPT+IIP+
Subjt:  MQRAVSKKCLKTSETEEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPN

Query:  HHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY
        H  ERQCWGSISDSQS HV SESM+LVDD  G V GCDCENCGE+EFQCPCSSLDGL DVA+ECGPRCSCGLECENRLTQRGI VRLKI RDEKKGWGLY
Subjt:  HHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLY

Query:  ADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCF
        ADELI++G FICEYAGELLTTGE+RRRQKIYDARAK G+F SSLLVVREHLPSGNACLR NIDATWIGNV RF+NHSCDGGNLVTRLVRSTGVMLPRLCF
Subjt:  ADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCF

Query:  YASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        YASRSIS DEELTFSYGDIR+  EGLKCFCGSSCCLGTLPSENT
Subjt:  YASRSISNDEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

SwissProt top hitse value%identityAlignment
Q53H47 Histone-lysine N-methyltransferase SETMAR1.0e-2835.38Show/hide
Query:  ECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINI
        EC   C C   C NR+ Q+G+    ++ +  KKGWGL   E I +G F+CEYAGE+L   E +RR  I+     D  +   ++ +REH+ +G   +   +
Subjt:  ECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINI

Query:  DATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEELTFSYG-----------DIRLKHEGLK--CFCGSSCCLGTLPSENT
        D T+IGN+ RF+NHSC+  NL+   VR    M+P+L  +A++ I  +EEL++ Y              RL H  L+  C+CG+  C   LP +++
Subjt:  DATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEELTFSYG-----------DIRLKHEGLK--CFCGSSCCLGTLPSENT

Q80UJ9 Histone-lysine N-methyltransferase SETMAR7.0e-3036.92Show/hide
Query:  ECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINI
        EC   C CG+ C NR+ Q G+   L++ + EKKGWGL   E I +G F+CEYAGE+L   E +RR  I+   + D  +   ++ VREH+ SG   +   +
Subjt:  ECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINI

Query:  DATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEELTFSYGDIRLKHEGLK-------------CFCGSSCCLGTLPSENT
        D T+IGN+ RF+NHSC+  NL+   VR    M+P+L  +A++ I   EEL++ Y    L     K             C+CG+  C   LP +++
Subjt:  DATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEELTFSYGDIRLKHEGLK-------------CFCGSSCCLGTLPSENT

Q96KQ7 Histone-lysine N-methyltransferase EHMT27.0e-3030.46Show/hide
Query:  LNATSKSITLRRIL--DASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCEN--CGEFEF
        L   +++I   +I+  D +R  E +PIP  N +D  P               +    C  S  +        +    VDD       C   N  CG+   
Subjt:  LNATSKSITLRRIL--DASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCEN--CGEFEF

Query:  QCPCSSLDGLVD--------VASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRRQK---IYDARAK
        +C       L+         +  EC   CSC   C+NR+ Q GI VRL++ R  K GWG+ A + I +G FICEY GEL++  EA  R+    ++D   K
Subjt:  QCPCSSLDGLVD--------VASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRRQK---IYDARAK

Query:  DGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRL-VRSTGVMLPRLCFYASRSISNDEELTFSYGD--IRLKHEGLKCFCGSS
        DG+                      IDA + GN++RFINH CD   +  R+ +    +  PR+ F++SR I   EEL F YGD    +K +   C CGS 
Subjt:  DGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRL-VRSTGVMLPRLCFYASRSISNDEELTFSYGD--IRLKHEGLKCFCGSS

Query:  CC
         C
Subjt:  CC

Q9SRV2 Histone-lysine N-methyltransferase SUVR32.4e-10256.85Show/hide
Query:  LHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI-IPNHHFERQCWGSISD------------
        L CA+L+LPWL   EL+ ++ +CK+L+  SKS+T+ R LDA+RSLE I IPFHN ID + YA+F+YTP  I   +    RQ WG+ ++            
Subjt:  LHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI-IPNHHFERQCWGSISD------------

Query:  -SQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFIC
         S+SG      ++LVD+      GC+CE C   E  C C +  G+ ++A+ECG  C CG +C NR+TQ+G+SV LKI+RDEKKGW LYAD+LI++G FIC
Subjt:  -SQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFIC

Query:  EYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEEL
        EYAGELLTT EARRRQ IYD       FAS+LLVVREHLPSG ACLRINIDAT IGNVARFINHSCDGGNL T L+RS+G +LPRLCF+A++ I  +EEL
Subjt:  EYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEEL

Query:  TFSYGDIRLKHEG----LKCFCGSSCCLGTLPSENT
        +FSYGD+ +  E     L C CGSSCCLGTLP ENT
Subjt:  TFSYGDIRLKHEG----LKCFCGSSCCLGTLPSENT

Q9Z148 Histone-lysine N-methyltransferase EHMT29.1e-3030.13Show/hide
Query:  LNATSKSITLRRIL--DASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCEN--CGEFEF
        L   ++++   +I+  D +R  E +PIP  N +D  P               +    C  S  +        +    VDD       C   N  CG+   
Subjt:  LNATSKSITLRRIL--DASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCEN--CGEFEF

Query:  QCPCSSLDGLVD--------VASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRRQK---IYDARAK
        +C       L+         +  EC   CSC   C+NR+ Q GI VRL++ R  K GWG+ A + I +G FICEY GEL++  EA  R+    ++D   K
Subjt:  QCPCSSLDGLVD--------VASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRRQK---IYDARAK

Query:  DGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRL-VRSTGVMLPRLCFYASRSISNDEELTFSYGD--IRLKHEGLKCFCGSS
        DG+                      IDA + GN++RFINH CD   +  R+ +    +  PR+ F++SR I   EEL F YGD    +K +   C CGS 
Subjt:  DGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRL-VRSTGVMLPRLCFYASRSISNDEELTFSYGD--IRLKHEGLKCFCGSS

Query:  CC
         C
Subjt:  CC

Arabidopsis top hitse value%identityAlignment
AT1G73100.1 SU(VAR)3-9 homolog 31.7e-2628.92Show/hide
Query:  KSLNATSKSITLRRIL---DASRSLEKIPIPFHNPID-DRPYAFFLYTPTAIIPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCE-NCGE
        KS+    + +T R  L   D +   E  P+   N +D D+  A+F YT                      S    SE+  L       V GC C  +C  
Subjt:  KSLNATSKSITLRRIL---DASRSLEKIPIPFHNPID-DRPYAFFLYTPTAIIPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCE-NCGE

Query:  FEFQCPC--------SSLDGLV-----DVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRRQK--
            C C          L+G++      V  ECGP C C   C+NR+ Q G+  RL++ +   +GWGL + + ++ G+FICEYAGE+   G  R  Q+  
Subjt:  FEFQCPC--------SSLDGLV-----DVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRRQK--

Query:  --IYDARAKDGQF------------ASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVR-STGVMLPRLCFYASRSISNDEELTF
          ++D       F             S+ +    +LPS      + I A   GNVARF+NHSC        ++R   G  +  + F+A R I    ELT+
Subjt:  --IYDARAKDGQF------------ASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVR-STGVMLPRLCFYASRSISNDEELTF

Query:  SYG--------DIRLKHEGLKCFCGSSCCLGT
         YG        D  L H    C CGS  C G+
Subjt:  SYG--------DIRLKHEGLKCFCGSSCCLGT

AT2G22740.1 SU(VAR)3-9 homolog 68.2e-2631.62Show/hide
Query:  LDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASEC
        LD S   E+ PI   N IDD     F YT   I P+  + R          +   E+E+          V  C  +N GE  +     ++ G      EC
Subjt:  LDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASEC

Query:  GPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRR----QKIYD-ARAKDGQFA---SSLLV---VREHLPS
        GP C C   C  R+TQ GI + L+I + + +GWG+   + I  G+FICEY GELL   EA RR    + ++D     D   A   S L++       +  
Subjt:  GPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRR----QKIYD-ARAKDGQFA---SSLLV---VREHLPS

Query:  GNACLRINIDATWIGNVARFINHSCDGGNLVTR--LVRSTGVMLPRLCFYASRSISNDEELTFSYG----DIRLKHEGLK---CFCGSSCC
        G+      IDA   GNV RFINHSC   NL  +  L       +P + F+A  +I   +EL + Y      +R     +K   CFCG++ C
Subjt:  GNACLRINIDATWIGNVARFINHSCDGGNLVTR--LVRSTGVMLPRLCFYASRSISNDEELTFSYG----DIRLKHEGLK---CFCGSSCC

AT2G22740.2 SU(VAR)3-9 homolog 68.2e-2631.62Show/hide
Query:  LDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASEC
        LD S   E+ PI   N IDD     F YT   I P+  + R          +   E+E+          V  C  +N GE  +     ++ G      EC
Subjt:  LDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPNHHFERQCWGSISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASEC

Query:  GPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRR----QKIYD-ARAKDGQFA---SSLLV---VREHLPS
        GP C C   C  R+TQ GI + L+I + + +GWG+   + I  G+FICEY GELL   EA RR    + ++D     D   A   S L++       +  
Subjt:  GPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTGEARRR----QKIYD-ARAKDGQFA---SSLLV---VREHLPS

Query:  GNACLRINIDATWIGNVARFINHSCDGGNLVTR--LVRSTGVMLPRLCFYASRSISNDEELTFSYG----DIRLKHEGLK---CFCGSSCC
        G+      IDA   GNV RFINHSC   NL  +  L       +P + F+A  +I   +EL + Y      +R     +K   CFCG++ C
Subjt:  GNACLRINIDATWIGNVARFINHSCDGGNLVTR--LVRSTGVMLPRLCFYASRSISNDEELTFSYG----DIRLKHEGLK---CFCGSSCC

AT3G03750.1 SET domain protein 201.1e-9152.68Show/hide
Query:  LHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI-IPNHHFERQCWGSISD------------
        L CA+L+LPWL   EL+ ++ +CK+L+  SKS+T+ R LDA+RSLE I IPFHN ID + YA+F+YTP  I   +    RQ WG+ ++            
Subjt:  LHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI-IPNHHFERQCWGSISD------------

Query:  -SQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFIC
         S+SG      ++LVD+      GC+CE C   E  C C +  G+ ++A+ECG  C CG +C NR+TQ+G+SV LKI+RDEKKGW LYAD+LI+      
Subjt:  -SQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFIC

Query:  EYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEEL
                  +ARRRQ IYD       FAS+LLVVREHLPSG ACLRINIDAT IGNVARFINHSCDGGNL T L+RS+G +LPRLCF+A++ I  +EEL
Subjt:  EYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEEL

Query:  TFSYGDIRLKHEG----LKCFCGSSCCLGTLPSENT
        +FSYGD+ +  E     L C CGSSCCLGTLP ENT
Subjt:  TFSYGDIRLKHEG----LKCFCGSSCCLGTLPSENT

AT3G03750.2 SET domain protein 201.7e-10356.85Show/hide
Query:  LHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI-IPNHHFERQCWGSISD------------
        L CA+L+LPWL   EL+ ++ +CK+L+  SKS+T+ R LDA+RSLE I IPFHN ID + YA+F+YTP  I   +    RQ WG+ ++            
Subjt:  LHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAI-IPNHHFERQCWGSISD------------

Query:  -SQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFIC
         S+SG      ++LVD+      GC+CE C   E  C C +  G+ ++A+ECG  C CG +C NR+TQ+G+SV LKI+RDEKKGW LYAD+LI++G FIC
Subjt:  -SQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFIC

Query:  EYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEEL
        EYAGELLTT EARRRQ IYD       FAS+LLVVREHLPSG ACLRINIDAT IGNVARFINHSCDGGNL T L+RS+G +LPRLCF+A++ I  +EEL
Subjt:  EYAGELLTTGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEEL

Query:  TFSYGDIRLKHEG----LKCFCGSSCCLGTLPSENT
        +FSYGD+ +  E     L C CGSSCCLGTLP ENT
Subjt:  TFSYGDIRLKHEG----LKCFCGSSCCLGTLPSENT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCGGGCCGTATCAAAAAAATGCTTGAAAACCAGTGAAACCGAAGAAGGAGAAGAACATTTAAATTCTGGTCTCCTTCACTGTGCTCACCTCGTCCTTCCATGGCT
GACCTTCCTCGAGCTTTCAAACATCTCTCTCTCCTGCAAATCCCTCAATGCCACATCCAAATCCATCACTCTTCGCCGGATTCTCGACGCTTCCAGATCCCTCGAGAAAA
TTCCCATCCCATTCCATAATCCAATCGACGATCGCCCCTACGCCTTCTTCCTCTACACCCCCACGGCCATTATCCCCAACCACCATTTCGAGCGCCAATGCTGGGGCTCA
ATTTCCGATTCCCAATCGGGCCATGTCGAGAGCGAATCGATGAATTTGGTAGATGATTGGGTCGGCGGTGTATTTGGATGCGATTGTGAAAATTGCGGGGAGTTTGAGTT
CCAATGCCCCTGTTCGAGCTTGGATGGGTTGGTGGATGTTGCTAGCGAGTGCGGACCGCGATGTTCTTGTGGGCTTGAGTGTGAAAATCGATTGACCCAGAGAGGAATTT
CTGTTCGACTGAAGATTTTGAGAGACGAGAAGAAAGGATGGGGTTTGTACGCGGATGAGTTGATTCAAGAAGGGGCGTTTATTTGTGAGTATGCAGGTGAACTTTTGACC
ACCGGAGAAGCAAGAAGGCGGCAGAAAATATATGATGCACGTGCTAAAGACGGGCAGTTTGCTTCATCTCTTCTCGTTGTGAGAGAGCATCTTCCATCTGGAAATGCATG
TTTGCGAATTAACATTGATGCGACCTGGATTGGGAATGTGGCACGGTTCATAAATCACTCTTGTGATGGAGGTAATCTAGTAACGAGACTGGTGAGAAGCACAGGTGTTA
TGTTGCCTCGCCTTTGTTTCTATGCTTCAAGAAGCATATCAAACGATGAAGAGCTTACCTTTAGTTATGGTGATATCAGATTAAAGCATGAAGGTTTGAAATGCTTCTGT
GGTAGCTCCTGCTGTTTGGGAACTTTGCCTTCAGAAAATACATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGCGGGCCGTATCAAAAAAATGCTTGAAAACCAGTGAAACCGAAGAAGGAGAAGAACATTTAAATTCTGGTCTCCTTCACTGTGCTCACCTCGTCCTTCCATGGCT
GACCTTCCTCGAGCTTTCAAACATCTCTCTCTCCTGCAAATCCCTCAATGCCACATCCAAATCCATCACTCTTCGCCGGATTCTCGACGCTTCCAGATCCCTCGAGAAAA
TTCCCATCCCATTCCATAATCCAATCGACGATCGCCCCTACGCCTTCTTCCTCTACACCCCCACGGCCATTATCCCCAACCACCATTTCGAGCGCCAATGCTGGGGCTCA
ATTTCCGATTCCCAATCGGGCCATGTCGAGAGCGAATCGATGAATTTGGTAGATGATTGGGTCGGCGGTGTATTTGGATGCGATTGTGAAAATTGCGGGGAGTTTGAGTT
CCAATGCCCCTGTTCGAGCTTGGATGGGTTGGTGGATGTTGCTAGCGAGTGCGGACCGCGATGTTCTTGTGGGCTTGAGTGTGAAAATCGATTGACCCAGAGAGGAATTT
CTGTTCGACTGAAGATTTTGAGAGACGAGAAGAAAGGATGGGGTTTGTACGCGGATGAGTTGATTCAAGAAGGGGCGTTTATTTGTGAGTATGCAGGTGAACTTTTGACC
ACCGGAGAAGCAAGAAGGCGGCAGAAAATATATGATGCACGTGCTAAAGACGGGCAGTTTGCTTCATCTCTTCTCGTTGTGAGAGAGCATCTTCCATCTGGAAATGCATG
TTTGCGAATTAACATTGATGCGACCTGGATTGGGAATGTGGCACGGTTCATAAATCACTCTTGTGATGGAGGTAATCTAGTAACGAGACTGGTGAGAAGCACAGGTGTTA
TGTTGCCTCGCCTTTGTTTCTATGCTTCAAGAAGCATATCAAACGATGAAGAGCTTACCTTTAGTTATGGTGATATCAGATTAAAGCATGAAGGTTTGAAATGCTTCTGT
GGTAGCTCCTGCTGTTTGGGAACTTTGCCTTCAGAAAATACATAA
Protein sequenceShow/hide protein sequence
MQRAVSKKCLKTSETEEGEEHLNSGLLHCAHLVLPWLTFLELSNISLSCKSLNATSKSITLRRILDASRSLEKIPIPFHNPIDDRPYAFFLYTPTAIIPNHHFERQCWGS
ISDSQSGHVESESMNLVDDWVGGVFGCDCENCGEFEFQCPCSSLDGLVDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLT
TGEARRRQKIYDARAKDGQFASSLLVVREHLPSGNACLRINIDATWIGNVARFINHSCDGGNLVTRLVRSTGVMLPRLCFYASRSISNDEELTFSYGDIRLKHEGLKCFC
GSSCCLGTLPSENT