CuGenDBv2

Cucsat.G12699 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G12699
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: histone-lysine N-methyltransferase SUVR3-like
Genome location: ctg1838:2641868..2645103
RNA-Seq Expression: Cucsat.G12699
Synteny: Cucsat.G12699
Gene Ontology terms:
GO:0006325 - chromatin organization (biological process)
GO:0034968 - histone lysine methylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0018024 - histone-lysine N-methyltransferase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001214 - SET domain
IPR003616 - Post-SET domain
IPR006560 - AWS domain
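
The genome location field above follows a `contig:start..end` pattern. A minimal sketch of how such a string could be split into its parts is given below; the function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

def parse_location(loc):
    """Split a 'contig:start..end' string into (contig, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The record itself does not confirm this.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return contig, start, end, end - start + 1

contig, start, end, span = parse_location("ctg1838:2641868..2645103")
print(contig, start, end, span)
```

Under the inclusive-coordinate assumption, the gene region on ctg1838 spans 3236 positions.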


Homology
GenBank top hits | e value | % identity
XP_004136662.1 histone-lysine N-methyltransferase SUVR3 isoform X1 [Cucumis sativus] | 2.37e-257 | 100
Query:  MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH
        MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH
Subjt:  MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH

Query:  FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD
        FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD
Subjt:  FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD

Query:  ELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA
        ELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA
Subjt:  ELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA

Query:  SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

XP_008443304.1 PREDICTED: histone-lysine N-methyltransferase SUVR3 isoform X1 [Cucumis melo] | 6.39e-245 | 94.81
Query:  MEPAVSNKCLKTSEAEEE-----QLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI
        M+P VSNKCLKTSEAEEE      LNCGLLHCAHLVLPWLTSLELATISLSCKSLNA SKSITLRRTLDASRSLEKIPIPFHN IDDRLYAFFIYTPTVI
Subjt:  MEPAVSNKCLKTSEAEEE-----QLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI

Query:  ISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW
        ISN+HFQRQCWGSISD QS HDES+SINLVDNWVDGVFGCDCENCG+F+LQCPCLSFDGLEDVASECGPRCSCG ECENRLTQRGISVRLKILRDEKKGW
Subjt:  ISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW

Query:  GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPR
        GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVR TGVMLPR
Subjt:  GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPR

Query:  LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

XP_008443306.1 PREDICTED: histone-lysine N-methyltransferase SUVR3 isoform X2 [Cucumis melo] | 2.40e-239 | 93.66
Query:  MEPAVSNKCLKTSEAEEE-----QLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI
        M+P VSNKCLKTSEAEEE      LNCGLLHCAHLVLPWLTSLELATISLSCKSLNA SKSITLRRTLDASRSLEKIPIPFHN IDDRLYAFFIYTPTVI
Subjt:  MEPAVSNKCLKTSEAEEE-----QLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI

Query:  ISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW
        ISN+HFQRQCWGSISD QS HDES+SINLVDNWVDGVFGCDCENCG+F+LQCPCLSFDGLEDVASECGPRCSCG ECENRLTQRGISVRLKILRDEKKGW
Subjt:  ISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW

Query:  GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPR
        GLYADELIQEGAFICE    LLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVR TGVMLPR
Subjt:  GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPR

Query:  LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

XP_011652203.1 histone-lysine N-methyltransferase SUVR3 isoform X2 [Cucumis sativus] | 8.93e-252 | 98.83
Query:  MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH
        MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH
Subjt:  MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH

Query:  FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD
        FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD
Subjt:  FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD

Query:  ELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA
        ELIQEGAFICE    LLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA
Subjt:  ELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA

Query:  SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

XP_038906247.1 histone-lysine N-methyltransferase SUVR3 [Benincasa hispida] | 8.13e-227 | 88.99
Query:  MEPAVSNKCLKTSEAEEEQ---LNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIIS
        M+ AVS KC KTSEAEEE+   LN GLLHCAHLVLPWLTSLELA+ISLSCK LNATSKSITLRR LDASRSLEKIPIPFHN IDDR YAFF+YTPTVIIS
Subjt:  MEPAVSNKCLKTSEAEEEQ---LNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIIS

Query:  NQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGL
        N HF+RQCWGSISD QS H ESES+NLVD+WV GVFGCDCENCGDFE QC C S DGL DVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGL
Subjt:  NQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGL

Query:  YADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLC
        +ADELIQEG F+CEYAGELLTT EAR+RQKIYDA AKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNL TRLVR TGVMLPRLC
Subjt:  YADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLC

Query:  FYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        FYAS+SISK+EELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  FYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

TrEMBL top hits | e value | % identity
A0A0A0LF10 Uncharacterized protein | 1.15e-257 | 100
Query:  MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH
        MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH
Subjt:  MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH

Query:  FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD
        FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD
Subjt:  FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD

Query:  ELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA
        ELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA
Subjt:  ELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA

Query:  SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

A0A1S3B7R7 histone-lysine N-methyltransferase SUVR3 isoform X1 | 3.10e-245 | 94.81
Query:  MEPAVSNKCLKTSEAEEE-----QLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI
        M+P VSNKCLKTSEAEEE      LNCGLLHCAHLVLPWLTSLELATISLSCKSLNA SKSITLRRTLDASRSLEKIPIPFHN IDDRLYAFFIYTPTVI
Subjt:  MEPAVSNKCLKTSEAEEE-----QLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI

Query:  ISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW
        ISN+HFQRQCWGSISD QS HDES+SINLVDNWVDGVFGCDCENCG+F+LQCPCLSFDGLEDVASECGPRCSCG ECENRLTQRGISVRLKILRDEKKGW
Subjt:  ISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW

Query:  GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPR
        GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVR TGVMLPR
Subjt:  GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPR

Query:  LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

A0A1S3B8F3 histone-lysine N-methyltransferase SUVR3 isoform X2 | 1.16e-239 | 93.66
Query:  MEPAVSNKCLKTSEAEEE-----QLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI
        M+P VSNKCLKTSEAEEE      LNCGLLHCAHLVLPWLTSLELATISLSCKSLNA SKSITLRRTLDASRSLEKIPIPFHN IDDRLYAFFIYTPTVI
Subjt:  MEPAVSNKCLKTSEAEEE-----QLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI

Query:  ISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW
        ISN+HFQRQCWGSISD QS HDES+SINLVDNWVDGVFGCDCENCG+F+LQCPCLSFDGLEDVASECGPRCSCG ECENRLTQRGISVRLKILRDEKKGW
Subjt:  ISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW

Query:  GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPR
        GLYADELIQEGAFICE    LLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVR TGVMLPR
Subjt:  GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPR

Query:  LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

A0A5D3DQF9 Histone-lysine N-methyltransferase SUVR3 isoform X1 | 3.10e-245 | 94.81
Query:  MEPAVSNKCLKTSEAEEE-----QLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI
        M+P VSNKCLKTSEAEEE      LNCGLLHCAHLVLPWLTSLELATISLSCKSLNA SKSITLRRTLDASRSLEKIPIPFHN IDDRLYAFFIYTPTVI
Subjt:  MEPAVSNKCLKTSEAEEE-----QLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI

Query:  ISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW
        ISN+HFQRQCWGSISD QS HDES+SINLVDNWVDGVFGCDCENCG+F+LQCPCLSFDGLEDVASECGPRCSCG ECENRLTQRGISVRLKILRDEKKGW
Subjt:  ISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGW

Query:  GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPR
        GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVR TGVMLPR
Subjt:  GLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPR

Query:  LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
Subjt:  LCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

A0A6J1J706 histone-lysine N-methyltransferase SUVR3 isoform X1 | 7.88e-208 | 83.04
Query:  MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH
        M  A S KCLKTSEAEEE L  GLLHCAHLVLPWLTSLELA+ISLSCK LNATSKSITLRR LDASRS+E IPIPFH SI+D  YAFFIYTPT II +  
Subjt:  MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQH

Query:  FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD
         +RQCWGSISD QS H  SES++LVD+    V GCDCENCG++E QCPC S DGLEDVA+ECGPRCSCGLECENRLTQRGI VRLKI RDEKKGWGLYAD
Subjt:  FQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYAD

Query:  ELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA
        ELI++G FICEYAGELLTT E+RRRQKIYDARAKGGRF SSLLVVREHLPSGNACLR NIDATWIGNV RF+NHSCDGGNLVTRLVR TGVMLPRLCFYA
Subjt:  ELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYA

Query:  SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT
        S+SISK+EELTFSYGDIR+  EGLKCFCGSSCCLGTLPSENT
Subjt:  SQSISKEEELTFSYGDIRLKHEGLKCFCGSSCCLGTLPSENT

SwissProt top hits | e value | % identity
A8XI75 Probable histone-lysine N-methyltransferase set-23 | 5.0e-28 | 35.68
Query:  DNWVDGVFGCDCENCGDFELQCPCL-------SFDG---LEDVASECGPRCSCGL---ECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEY
        D+W D   GCDCE     E QC C+       S DG      +  EC   C+C L    C N++ Q GI  +LKI    +KG G+ A+E IQ   F+CEY
Subjt:  DNWVDGVFGCDCENCGDFELQCPCL-------SFDG---LEDVASECGPRCSCGL---ECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEY

Query:  AGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTF
        AGE +  +E +RR +++          +  L ++EH   G   ++  ID    GN+ RF+NHSCD  N    +VR  G M+P    +A + IS  EEL++
Subjt:  AGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTF

Query:  SYGDIRLKHEGLK-CFCGSSCCLGTLP
         YG   +  +  K C C S  C   LP
Subjt:  SYGDIRLKHEGLK-CFCGSSCCLGTLP

Q53H47 Histone-lysine N-methyltransferase SETMAR | 1.0e-28 | 33.19
Query:  GCDCENCGDFELQCPCL----SFDG---LEDVAS---------ECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELL
        GC C         C CL    ++D    L D+ S         EC   C C   C NR+ Q+G+    ++ +  KKGWGL   E I +G F+CEYAGE+L
Subjt:  GCDCENCGDFELQCPCL----SFDG---LEDVAS---------ECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELL

Query:  TTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFSYG--
           E +RR  +   + K    ++ ++ +REH+ +G   +   +D T+IGN+ RF+NHSC+  NL+   VR    M+P+L  +A++ I  EEEL++ Y   
Subjt:  TTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFSYG--

Query:  ---------DIRLKHEGLK--CFCGSSCCLGTLPSENT
                   RL H  L+  C+CG+  C   LP +++
Subjt:  ---------DIRLKHEGLK--CFCGSSCCLGTLPSENT

Q5I0M0 Histone-lysine N-methyltransferase SETMAR | 2.2e-28 | 33.19
Query:  GCDCENCGDFELQCPCLSFDG-------LEDVAS---------ECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELL
        GC C         C CL  +        L DV S         EC   C CG  C NR+ Q G+   L++ + EKKGWGL   E I +G F+CEYAGE+L
Subjt:  GCDCENCGDFELQCPCLSFDG-------LEDVAS---------ECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELL

Query:  TTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFSYGDI
           E +RR  +  A        + ++ +REH  +G   +   +D T+IGN+ RF+NHSC+  NL+   VR    M+P+L  +A++ I   EEL++ Y   
Subjt:  TTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFSYGDI

Query:  RLKHEGLK-------------CFCGSSCCLGTLPSENT
         L     K             C+CG+  C   LP +++
Subjt:  RLKHEGLK-------------CFCGSSCCLGTLPSENT

Q80UJ9 Histone-lysine N-methyltransferase SETMAR | 5.3e-30 | 33.61
Query:  GCDCENCGDFELQCPCLSFDG-------LEDVAS---------ECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELL
        GC C         C CL  +        L DV S         EC   C CG+ C NR+ Q G+   L++ + EKKGWGL   E I +G F+CEYAGE+L
Subjt:  GCDCENCGDFELQCPCLSFDG-------LEDVAS---------ECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELL

Query:  TTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFSYGDI
           E +RR  +  +       ++ ++ VREH+ SG   +   +D T+IGN+ RF+NHSC+  NL+   VR    M+P+L  +A++ I   EEL++ Y   
Subjt:  TTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFSYGDI

Query:  RLKHEGLK-------------CFCGSSCCLGTLPSENT
         L     K             C+CG+  C   LP +++
Subjt:  RLKHEGLK-------------CFCGSSCCLGTLPSENT

Q9SRV2 Histone-lysine N-methyltransferase SUVR3 | 3.5e-106 | 59.76
Query:  LHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI-ISNQHFQRQCWGSI-----SDPQSVHDE
        L CA+L+LPWL   ELA ++ +CK+L+  SKS+T+ R+LDA+RSLE I IPFHNSID + YA+FIYTP  I  S+    RQ WG+      S+ +   D 
Subjt:  LHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI-ISNQHFQRQCWGSI-----SDPQSVHDE

Query:  -SES----INLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYA
         SES    ++LVD       GC+CE C   E  C CL+F G+E++A+ECG  C CG +C NR+TQ+G+SV LKI+RDEKKGW LYAD+LI++G FICEYA
Subjt:  -SES----INLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYA

Query:  GELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFS
        GELLTT+EARRRQ IYD       FAS+LLVVREHLPSG ACLR+NIDAT IGNVARFINHSCDGGNL T L+R +G +LPRLCF+A++ I  EEEL+FS
Subjt:  GELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFS

Query:  YGDIRLKHEG----LKCFCGSSCCLGTLPSENT
        YGD+ +  E     L C CGSSCCLGTLP ENT
Subjt:  YGDIRLKHEG----LKCFCGSSCCLGTLPSENT

Arabidopsis top hits | e value | % identity
AT2G22740.1 SU(VAR)3-9 homolog 6 | 8.2e-26 | 30.84
Query:  LDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQ-CPCL---------SF
        LD S   E+ PI   N IDD     F YT  +I  +       W     P+S                    C    C + E + C C+         +F
Subjt:  LDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQ-CPCL---------SF

Query:  D----GLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTEEARRR----QKIYDARAKGGRFASSLLV
        D    G +    ECGP C C   C  R+TQ GI + L+I + + +GWG+   + I  G+FICEY GELL   EA RR    + ++D    G R+ +SL  
Subjt:  D----GLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTEEARRR----QKIYDARAKGGRFASSLLV

Query:  VREHLPSGNACLR----------MNIDATWIGNVARFINHSCDGGNLVTR--LVRGTGVMLPRLCFYASQSISKEEELTFSYG----DIRLKHEGLK---
            L  G    R            IDA   GNV RFINHSC   NL  +  L       +P + F+A  +I   +EL + Y      +R     +K   
Subjt:  VREHLPSGNACLR----------MNIDATWIGNVARFINHSCDGGNLVTR--LVRGTGVMLPRLCFYASQSISKEEELTFSYG----DIRLKHEGLK---

Query:  CFCGSSCC
        CFCG++ C
Subjt:  CFCGSSCC

AT2G22740.2 SU(VAR)3-9 homolog 6 | 8.2e-26 | 30.84
Query:  LDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQ-CPCL---------SF
        LD S   E+ PI   N IDD     F YT  +I  +       W     P+S                    C    C + E + C C+         +F
Subjt:  LDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQ-CPCL---------SF

Query:  D----GLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTEEARRR----QKIYDARAKGGRFASSLLV
        D    G +    ECGP C C   C  R+TQ GI + L+I + + +GWG+   + I  G+FICEY GELL   EA RR    + ++D    G R+ +SL  
Subjt:  D----GLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTEEARRR----QKIYDARAKGGRFASSLLV

Query:  VREHLPSGNACLR----------MNIDATWIGNVARFINHSCDGGNLVTR--LVRGTGVMLPRLCFYASQSISKEEELTFSYG----DIRLKHEGLK---
            L  G    R            IDA   GNV RFINHSC   NL  +  L       +P + F+A  +I   +EL + Y      +R     +K   
Subjt:  VREHLPSGNACLR----------MNIDATWIGNVARFINHSCDGGNLVTR--LVRGTGVMLPRLCFYASQSISKEEELTFSYG----DIRLKHEGLK---

Query:  CFCGSSCC
        CFCG++ C
Subjt:  CFCGSSCC

AT2G35160.1 SU(VAR)3-9 homolog 5 | 1.4e-25 | 29.57
Query:  LDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCL--------SFDG
        +D +   E +PI   N++DD     FIYT  +I  +       W     P+S                    C C N       C C+         +DG
Subjt:  LDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQHFQRQCWGSISDPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCL--------SFDG

Query:  ----LEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHL-
            ++ +  ECGP C C   C  R++Q GI ++L+I + E +GWG+ + E I  G+FICEYAGELL  ++                 A SL    E+L 
Subjt:  ----LEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTEEARRRQKIYDARAKGGRFASSLLVVREHL-

Query:  PSGNACLRMNIDATWIGNVARFINHSCDGGNLVTR--LVRGTGVMLPRLCFYASQSISKEEELTFSY-----------GDIRLKHEGLKCFCGSSCCLGT
          G+      I+A   GN+ RFINHSC   NL  +  L     + +P + F+A  +I   +EL++ Y           G+I+ K     C+CGS+ C G 
Subjt:  PSGNACLRMNIDATWIGNVARFINHSCDGGNLVTR--LVRGTGVMLPRLCFYASQSISKEEELTFSY-----------GDIRLKHEGLKCFCGSSCCLGT

Query:  L
        L
Subjt:  L

AT3G03750.1 SET domain protein 20 | 3.7e-95 | 55.56
Query:  LHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI-ISNQHFQRQCWGSI-----SDPQSVHDE
        L CA+L+LPWL   ELA ++ +CK+L+  SKS+T+ R+LDA+RSLE I IPFHNSID + YA+FIYTP  I  S+    RQ WG+      S+ +   D 
Subjt:  LHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI-ISNQHFQRQCWGSI-----SDPQSVHDE

Query:  -SES----INLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYA
         SES    ++LVD       GC+CE C   E  C CL+F G+E++A+ECG  C CG +C NR+TQ+G+SV LKI+RDEKKGW LYAD+LI          
Subjt:  -SES----INLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYA

Query:  GELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFS
              ++ARRRQ IYD       FAS+LLVVREHLPSG ACLR+NIDAT IGNVARFINHSCDGGNL T L+R +G +LPRLCF+A++ I  EEEL+FS
Subjt:  GELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFS

Query:  YGDIRLKHEG----LKCFCGSSCCLGTLPSENT
        YGD+ +  E     L C CGSSCCLGTLP ENT
Subjt:  YGDIRLKHEG----LKCFCGSSCCLGTLPSENT

AT3G03750.2 SET domain protein 20 | 2.5e-107 | 59.76
Query:  LHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI-ISNQHFQRQCWGSI-----SDPQSVHDE
        L CA+L+LPWL   ELA ++ +CK+L+  SKS+T+ R+LDA+RSLE I IPFHNSID + YA+FIYTP  I  S+    RQ WG+      S+ +   D 
Subjt:  LHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVI-ISNQHFQRQCWGSI-----SDPQSVHDE

Query:  -SES----INLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYA
         SES    ++LVD       GC+CE C   E  C CL+F G+E++A+ECG  C CG +C NR+TQ+G+SV LKI+RDEKKGW LYAD+LI++G FICEYA
Subjt:  -SES----INLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYA

Query:  GELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFS
        GELLTT+EARRRQ IYD       FAS+LLVVREHLPSG ACLR+NIDAT IGNVARFINHSCDGGNL T L+R +G +LPRLCF+A++ I  EEEL+FS
Subjt:  GELLTTEEARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFS

Query:  YGDIRLKHEG----LKCFCGSSCCLGTLPSENT
        YGD+ +  E     L C CGSSCCLGTLP ENT
Subjt:  YGDIRLKHEG----LKCFCGSSCCLGTLPSENT


Sequences
CDS sequence
ATGGAGCCAGCCGTATCTAACAAATGCTTGAAGACAAGTGAAGCCGAAGAAGAACAGTTAAACTGTGGTCTCCTTCACTGCGCTCACCTCGTACTTCCATGGCTGACCTC
TCTCGAGCTCGCAACCATATCTCTCTCCTGCAAATCCCTCAATGCCACTTCCAAATCTATCACTCTCCGCCGGACTCTCGACGCTTCCAGATCCCTCGAGAAAATTCCCA
TCCCATTCCATAATTCAATCGACGATCGCCTCTACGCCTTCTTTATCTACACCCCCACGGTTATTATCTCCAATCAGCATTTCCAGCGCCAATGCTGGGGCTCAATTTCA
GATCCCCAGTCGGTCCATGACGAGAGCGAGTCGATTAATTTGGTTGATAATTGGGTTGATGGTGTATTTGGATGCGATTGTGAAAATTGCGGGGATTTTGAGTTACAATG
CCCCTGTTTGAGCTTTGATGGGTTGGAGGATGTTGCTAGCGAGTGTGGACCGCGATGTTCTTGTGGGCTGGAGTGTGAAAATCGGTTGACCCAGAGAGGAATTTCTGTTC
GATTGAAGATTTTGAGAGATGAGAAGAAAGGATGGGGTTTGTATGCGGACGAGTTGATTCAAGAAGGGGCGTTCATTTGTGAGTATGCAGGTGAACTTTTGACCACTGAA
GAAGCAAGAAGGCGGCAGAAAATATATGATGCACGTGCCAAAGGTGGGCGGTTTGCTTCATCTCTTCTCGTTGTGAGAGAGCATCTTCCATCTGGAAATGCATGTTTGCG
AATGAACATCGACGCGACCTGGATTGGGAATGTCGCACGATTCATAAATCACTCTTGTGACGGAGGTAATCTAGTAACAAGACTGGTGAGAGGTACAGGTGTTATGTTGC
CTCGCCTTTGTTTCTATGCTTCTCAAAGCATATCAAAAGAGGAAGAGCTTACCTTTAGTTATGGTGATATCAGATTAAAGCATGAAGGCTTGAAATGCTTCTGTGGTAGT
TCTTGCTGTTTGGGAACTTTGCCTTCAGAAAATACATAA
mRNA sequence
ATGGAGCCAGCCGTATCTAACAAATGCTTGAAGACAAGTGAAGCCGAAGAAGAACAGTTAAACTGTGGTCTCCTTCACTGCGCTCACCTCGTACTTCCATGGCTGACCTC
TCTCGAGCTCGCAACCATATCTCTCTCCTGCAAATCCCTCAATGCCACTTCCAAATCTATCACTCTCCGCCGGACTCTCGACGCTTCCAGATCCCTCGAGAAAATTCCCA
TCCCATTCCATAATTCAATCGACGATCGCCTCTACGCCTTCTTTATCTACACCCCCACGGTTATTATCTCCAATCAGCATTTCCAGCGCCAATGCTGGGGCTCAATTTCA
GATCCCCAGTCGGTCCATGACGAGAGCGAGTCGATTAATTTGGTTGATAATTGGGTTGATGGTGTATTTGGATGCGATTGTGAAAATTGCGGGGATTTTGAGTTACAATG
CCCCTGTTTGAGCTTTGATGGGTTGGAGGATGTTGCTAGCGAGTGTGGACCGCGATGTTCTTGTGGGCTGGAGTGTGAAAATCGGTTGACCCAGAGAGGAATTTCTGTTC
GATTGAAGATTTTGAGAGATGAGAAGAAAGGATGGGGTTTGTATGCGGACGAGTTGATTCAAGAAGGGGCGTTCATTTGTGAGTATGCAGGTGAACTTTTGACCACTGAA
GAAGCAAGAAGGCGGCAGAAAATATATGATGCACGTGCCAAAGGTGGGCGGTTTGCTTCATCTCTTCTCGTTGTGAGAGAGCATCTTCCATCTGGAAATGCATGTTTGCG
AATGAACATCGACGCGACCTGGATTGGGAATGTCGCACGATTCATAAATCACTCTTGTGACGGAGGTAATCTAGTAACAAGACTGGTGAGAGGTACAGGTGTTATGTTGC
CTCGCCTTTGTTTCTATGCTTCTCAAAGCATATCAAAAGAGGAAGAGCTTACCTTTAGTTATGGTGATATCAGATTAAAGCATGAAGGCTTGAAATGCTTCTGTGGTAGT
TCTTGCTGTTTGGGAACTTTGCCTTCAGAAAATACATAA
Protein sequence
MEPAVSNKCLKTSEAEEEQLNCGLLHCAHLVLPWLTSLELATISLSCKSLNATSKSITLRRTLDASRSLEKIPIPFHNSIDDRLYAFFIYTPTVIISNQHFQRQCWGSIS
DPQSVHDESESINLVDNWVDGVFGCDCENCGDFELQCPCLSFDGLEDVASECGPRCSCGLECENRLTQRGISVRLKILRDEKKGWGLYADELIQEGAFICEYAGELLTTE
EARRRQKIYDARAKGGRFASSLLVVREHLPSGNACLRMNIDATWIGNVARFINHSCDGGNLVTRLVRGTGVMLPRLCFYASQSISKEEELTFSYGDIRLKHEGLKCFCGS
SCCLGTLPSENT
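
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch using the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first 27 nucleotides of the CDS are used in the demonstration to keep the example short.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

# First nine codons of the CDS listed on this page.
print(translate("ATGGAGCCAGCCGTATCTAACAAATGC"))  # prints MEPAVSNKC
```

The printed peptide matches the first nine residues of the protein sequence above. Passing the full CDS with its line breaks joined should give the complete protein, with the terminal TAA stop codon dropped.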