; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg03145 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg03145
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionhistone-lysine N-methyltransferase ATXR2
Genome locationCarg_Chr20:2870725..2880146
RNA-Seq ExpressionCarg03145
SyntenyCarg03145
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001214 - SET domain
IPR002893 - Zinc finger, MYND-type
IPR044237 - Histone-lysine N-methyltransferase ATXR2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8075750.1 hypothetical protein FH972_014439 [Carpinus fangiana]2.0e-29465.09Show/hide
Query:  MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNL---------------------------------------------DSRSVGAIAGFAVAIVFTW
        MAE SSK++L+QLIKRFGAYLT+KIS+ LPISL NL                                             +SRS GAIAG AVAIVFTW
Subjt:  MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNL---------------------------------------------DSRSVGAIAGFAVAIVFTW

Query:  RLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSSEDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQA
        RLLRSP+  QRRQ KRQ   PSS  S V   SNA L+  G   S ED RAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRL+G+ILEE+SPE++QKQA
Subjt:  RLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSSEDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQA

Query:  TVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFTSGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA-------------
        TVRSSVLEVLLEITK+CDLYLME VLDDESE++VL ALEDAGVFTSGGLVKDKVLFCS ENGRTSFVRQLEPDWHIDSNPEIV+QLA             
Subjt:  TVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFTSGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA-------------

Query:  --------------------------------------YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQM
                                              + +EISALL+PPSPL  QEYFD L+  RQ RG++VKQ+G  GKGV+AD  FKEG+LVLKDQM
Subjt:  --------------------------------------YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQM

Query:  LVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSSPMSEDCYEAES-DDGEEIALENNESMGGCSSSNSK-GVALPNGLVES
        L G+QH SNK+DCLVCSFCFRF+GSIELQIGR+LY QD+GVS N   DME    +S DCY  +S D+     L+N +++G C+SS+SK   ++P  +VES
Subjt:  LVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSSPMSEDCYEAES-DDGEEIALENNESMGGCSSSNSK-GVALPNGLVES

Query:  LMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEE
        LMNG L LP+SKEFS+P A+PC GGCGEA+YCSKSCAEADW+ FHSLLCTG ++E   REAL+KFIQ+AN+TNDIFLLA K ISSTILRY+KLK    +E
Subjt:  LMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEE

Query:  QR-----KYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVV
        Q+        + +D+S+LLEAWKPISMGHKRRWWDC+ALP+DV+ S+E AFRM+IRE+AF SLQLLK AI D+ CEPLFSLEIYG IIGMFELNNLDLVV
Subjt:  QR-----KYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVV

Query:  ASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFE
        ASPVEDYFLY+D+LP P K++A++ITRP LD LGD YS+CCQGTAFFPLQSCMNHSC+PNAKAFKREEDRDGQA II+++PI  GEEVTISY+DEDLPFE
Subjt:  ASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFE

Query:  QRRALLADYGFECRCPKCLQQQ
        +R+ALLADYGF+C+CPKCL ++
Subjt:  QRRALLADYGFECRCPKCLQQQ

KAG6570873.1 Histone-lysine N-methyltransferase ATXR2, partial [Cucurbita argyrosperma subsp. sororia]4.2e-284100Show/hide
Query:  YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD
        YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD
Subjt:  YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD

Query:  LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD
        LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD
Subjt:  LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD

Query:  WEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP
        WEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP
Subjt:  WEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP

Query:  SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA
        SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA
Subjt:  SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA

Query:  FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP
        FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP
Subjt:  FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP

KAG7010720.1 Histone-lysine N-methyltransferase ATXR2 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSS
        MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSS
Subjt:  MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSS

Query:  EDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFT
        EDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFT
Subjt:  EDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFT

Query:  SGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLAYPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVL
        SGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLAYPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVL
Subjt:  SGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLAYPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVL

Query:  KDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLV
        KDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLV
Subjt:  KDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLV

Query:  ESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACS
        ESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACS
Subjt:  ESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACS

Query:  EEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASP
        EEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASP
Subjt:  EEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASP

Query:  VEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRR
        VEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRR
Subjt:  VEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRR

Query:  ALLADYGFECRCPKCLQQQHP
        ALLADYGFECRCPKCLQQQHP
Subjt:  ALLADYGFECRCPKCLQQQHP

RXH89948.1 hypothetical protein DVH24_032305 [Malus domestica]1.2e-29468.46Show/hide
Query:  MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSS
        MAE SSK++LLQLIKRFGAYLT+K+S+  PISL+NL+SRS+GAIAGFAVAIVFTWRLLR P+G QRRQ KRQ  AP+SS+S +  NSNA L + G  SSS
Subjt:  MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSS

Query:  EDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFT
        ED R QNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE++PE+LQKQ TVR SVLEVLLEITK+CDLYLME VLDDESEK+VL ALEDAG+FT
Subjt:  EDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFT

Query:  SGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA-----------------------------------------YPNEISALLSPPSPLQV
        SGGLVKDKVLFCS ENGRTSFVRQLEPDWHID+NP+I+ QL+                                            EISALL+PPSP  V
Subjt:  SGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA-----------------------------------------YPNEISALLSPPSPLQV

Query:  QEYFDQLVW--MRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSS
        +EY ++L+    RQC G+ VKQ+G LGKGV+AD+  KEG L+LKDQMLVG QH+SNK+DCLVCSFCFRFVGS+ELQIGR+LY Q+LGVS    C      
Subjt:  QEYFDQLVW--MRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSS

Query:  PMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTE
           E  Y AE +D ++         G  SS   + V LP G+VESLMNGGL LP+S +FS+PP +PC GGC EA+YCSK CAE+DW+  HSLLCTG ++E
Subjt:  PMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTE

Query:  PSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIV---DLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIRE
           REAL++FIQ+ANDTNDIFLLAAKA+SSTIL+Y+KLK+A SE++ K  N+     LS+LL+AWKPIS+GHKRRWWDCIALP DVEPS+E AFRMQIRE
Subjt:  PSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIV---DLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIRE

Query:  MAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSC
        +AFTSL+LLK AI DE C+P+FSLEIYG II MFELNNLDLVVASPVEDYFLY+D+LP+P K++ E ITRP LD LGD YS+CCQGTAF+PLQSCMNHSC
Subjt:  MAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSC

Query:  YPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQ
         PNAKAFKREEDRDGQATIIA++PI  GEEVTISY+DEDLP+E+RRALLADYGF+CRCPKCL++
Subjt:  YPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQ

XP_022943576.1 histone-lysine N-methyltransferase ATXR2 isoform X1 [Cucurbita moschata]1.4e-28299.37Show/hide
Query:  YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD
        YPNEIS LLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD
Subjt:  YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD

Query:  LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD
        LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD
Subjt:  LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD

Query:  WEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP
        WEAFHSLLCTG KTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP
Subjt:  WEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP

Query:  SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA
        SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA
Subjt:  SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA

Query:  FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP
        FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVT+SYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP
Subjt:  FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP

TrEMBL top hitse value%identityAlignment
A0A1R3GQ23 SET domain-containing protein8.0e-28167.96Show/hide
Query:  ASSKDDLLQLIKRFGAYLTLKISNFLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSSEDL
        +SSKD+L+QLIKR G +L LK+SN   ISL  LD RSVGAIAG AVAI+FT+RL+RSP    RRQ KRQ  AP++STS+    S++ L+ SG CSSSED 
Subjt:  ASSKDDLLQLIKRFGAYLTLKISNFLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSSEDL

Query:  RAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFTSGG
        RAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE+SPE+LQ +ATV+SSVL+VLLEITK+CDLYLME V+DDESEK VL ALE+AG+FTSGG
Subjt:  RAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFTSGG

Query:  LVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLAYPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQ
        LVKDKVLFCS ENGRTSFVRQLEPDWHID+NPEIVSQLA                  EYF+QL+  RQC G+ VKQ+G  GKGVFA+  F+E  L+LKDQ
Subjt:  LVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLAYPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQ

Query:  MLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESL
        ML+G+QH+SNK+DCLVCS+CF+F+GSIE QIGRKLY + LGVS        P S    D  + + D+ E     ++ S  G SSS+   V LP   VESL
Subjt:  MLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESL

Query:  MNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQ
        MNG +SLP+S++F +P  + C G C E FYCSKSCAEADWE+FHSLLC G KT+   REALLKFIQ+AN+TNDIFLLAAKAIS TILRY+KLK +   E 
Subjt:  MNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQ

Query:  RKYG-----NIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVA
         K          +LS+LL+AWKPIS+GHKRRWWDC++LP+D++ S+E AFRMQ+RE+AFTSLQLLKEAI D+ CEPLFSLEIYG IIGMFELNNLDLVVA
Subjt:  RKYG-----NIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVA

Query:  SPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQ
        SPVEDYF+Y+D+LP+P K++AE ITRP LD LG+ YS+CC+GTAF+PLQSCMNHSC PNAKAFKREEDRDGQATIIAVRP+   EE+ ISYIDEDLPFE+
Subjt:  SPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQ

Query:  RRALLADYGFECRCPKCLQQQ
        R+ALLADYGF CRCP+CL+++
Subjt:  RRALLADYGFECRCPKCLQQQ

A0A498J311 SET domain-containing protein5.7e-29568.46Show/hide
Query:  MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSS
        MAE SSK++LLQLIKRFGAYLT+K+S+  PISL+NL+SRS+GAIAGFAVAIVFTWRLLR P+G QRRQ KRQ  AP+SS+S +  NSNA L + G  SSS
Subjt:  MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSS

Query:  EDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFT
        ED R QNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE++PE+LQKQ TVR SVLEVLLEITK+CDLYLME VLDDESEK+VL ALEDAG+FT
Subjt:  EDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFT

Query:  SGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA-----------------------------------------YPNEISALLSPPSPLQV
        SGGLVKDKVLFCS ENGRTSFVRQLEPDWHID+NP+I+ QL+                                            EISALL+PPSP  V
Subjt:  SGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA-----------------------------------------YPNEISALLSPPSPLQV

Query:  QEYFDQLVW--MRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSS
        +EY ++L+    RQC G+ VKQ+G LGKGV+AD+  KEG L+LKDQMLVG QH+SNK+DCLVCSFCFRFVGS+ELQIGR+LY Q+LGVS    C      
Subjt:  QEYFDQLVW--MRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSS

Query:  PMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTE
           E  Y AE +D ++         G  SS   + V LP G+VESLMNGGL LP+S +FS+PP +PC GGC EA+YCSK CAE+DW+  HSLLCTG ++E
Subjt:  PMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTE

Query:  PSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIV---DLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIRE
           REAL++FIQ+ANDTNDIFLLAAKA+SSTIL+Y+KLK+A SE++ K  N+     LS+LL+AWKPIS+GHKRRWWDCIALP DVEPS+E AFRMQIRE
Subjt:  PSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIV---DLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIRE

Query:  MAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSC
        +AFTSL+LLK AI DE C+P+FSLEIYG II MFELNNLDLVVASPVEDYFLY+D+LP+P K++ E ITRP LD LGD YS+CCQGTAF+PLQSCMNHSC
Subjt:  MAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSC

Query:  YPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQ
         PNAKAFKREEDRDGQATIIA++PI  GEEVTISY+DEDLP+E+RRALLADYGF+CRCPKCL++
Subjt:  YPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQ

A0A5N6RD75 SET domain-containing protein9.8e-29565.09Show/hide
Query:  MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNL---------------------------------------------DSRSVGAIAGFAVAIVFTW
        MAE SSK++L+QLIKRFGAYLT+KIS+ LPISL NL                                             +SRS GAIAG AVAIVFTW
Subjt:  MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNL---------------------------------------------DSRSVGAIAGFAVAIVFTW

Query:  RLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSSEDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQA
        RLLRSP+  QRRQ KRQ   PSS  S V   SNA L+  G   S ED RAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRL+G+ILEE+SPE++QKQA
Subjt:  RLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSSEDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQA

Query:  TVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFTSGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA-------------
        TVRSSVLEVLLEITK+CDLYLME VLDDESE++VL ALEDAGVFTSGGLVKDKVLFCS ENGRTSFVRQLEPDWHIDSNPEIV+QLA             
Subjt:  TVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFTSGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA-------------

Query:  --------------------------------------YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQM
                                              + +EISALL+PPSPL  QEYFD L+  RQ RG++VKQ+G  GKGV+AD  FKEG+LVLKDQM
Subjt:  --------------------------------------YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQM

Query:  LVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSSPMSEDCYEAES-DDGEEIALENNESMGGCSSSNSK-GVALPNGLVES
        L G+QH SNK+DCLVCSFCFRF+GSIELQIGR+LY QD+GVS N   DME    +S DCY  +S D+     L+N +++G C+SS+SK   ++P  +VES
Subjt:  LVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGVSTNHQCDMEPSSPMSEDCYEAES-DDGEEIALENNESMGGCSSSNSK-GVALPNGLVES

Query:  LMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEE
        LMNG L LP+SKEFS+P A+PC GGCGEA+YCSKSCAEADW+ FHSLLCTG ++E   REAL+KFIQ+AN+TNDIFLLA K ISSTILRY+KLK    +E
Subjt:  LMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEE

Query:  QR-----KYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVV
        Q+        + +D+S+LLEAWKPISMGHKRRWWDC+ALP+DV+ S+E AFRM+IRE+AF SLQLLK AI D+ CEPLFSLEIYG IIGMFELNNLDLVV
Subjt:  QR-----KYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVV

Query:  ASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFE
        ASPVEDYFLY+D+LP P K++A++ITRP LD LGD YS+CCQGTAFFPLQSCMNHSC+PNAKAFKREEDRDGQA II+++PI  GEEVTISY+DEDLPFE
Subjt:  ASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFE

Query:  QRRALLADYGFECRCPKCLQQQ
        +R+ALLADYGF+C+CPKCL ++
Subjt:  QRRALLADYGFECRCPKCLQQQ

A0A6J1FS34 histone-lysine N-methyltransferase ATXR2 isoform X16.6e-28399.37Show/hide
Query:  YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD
        YPNEIS LLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD
Subjt:  YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD

Query:  LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD
        LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD
Subjt:  LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD

Query:  WEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP
        WEAFHSLLCTG KTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP
Subjt:  WEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP

Query:  SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA
        SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA
Subjt:  SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA

Query:  FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP
        FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVT+SYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP
Subjt:  FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP

A0A6J1JG48 histone-lysine N-methyltransferase ATXR2 isoform X14.7e-28198.75Show/hide
Query:  YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD
        YPNEISALLS PSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD
Subjt:  YPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQD

Query:  LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD
        LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALP GLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD
Subjt:  LGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEAD

Query:  WEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP
        WEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKP+SMGHKRRWWDCIALPEDVEP
Subjt:  WEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEP

Query:  SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA
        SNEVAFRMQIREMAFTSLQLLKEAIL+EGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLG+SYSICCQGTA
Subjt:  SNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTA

Query:  FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP
        FFPLQSCMNHSCYPNAKAFKREEDRDG+ATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP
Subjt:  FFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP

SwissProt top hitse value%identityAlignment
Q3TYX3 SET and MYND domain-containing protein 51.2e-1034.08Show/hide
Query:  LQLLKEAILD---------EGCEPLFSL-EIYGQIIGMFELNN----LDLVVASPV--EDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAF
        L L KEA+ +         EG   LF+L    GQ IG   L+      D +  +P   E    ++D+L   YK+  E  T   L+         C+G+  
Subjt:  LQLLKEAILD---------EGCEPLFSL-EIYGQIIGMFELNN----LDLVVASPV--EDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAF

Query:  FPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID----EDLPFEQRRALLADYGFECRCPKCLQQ
        F LQSC NHSC PNA+    E +     T  A+  I PGEE+ ISY+D    E     + + L  +Y F C CPKCL +
Subjt:  FPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID----EDLPFEQRRALLADYGFECRCPKCLQQ

Q5PP37 Histone-lysine N-methyltransferase ATXR23.1e-17363.56Show/hide
Query:  EISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGV
        +++ALL+P    Q+QEYF++L+  R+C G+ VK +G +GKGV+A++ F E +L+LKD++LVG QH+SNK+DCLVCSFCFRF+GSIE QIGRKLYF++LGV
Subjt:  EISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGV

Query:  STNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEA
        S    CD + S    ++C +            N E  GG SSS++    LP G+V SLMNG ++LPH+ +F +P  + C GGC EAFYCS+SCA ADWE+
Subjt:  STNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEA

Query:  FHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNE
         HSLLCTG ++E   REAL +FI++ANDTNDIFLLAAKAI+ TILRY+KLK    +++ K       S+LLEAWKP+S+G+KRRWWDCIALP+DV+P++E
Subjt:  FHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNE

Query:  VAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFP
         AFRMQI+ +A TSL+LLK AI D+ CE LFSLEIYG IIGMFELNNLDLVVASPVEDYFLY+D+LP   KE+ EEITRP LD LGD YS CCQGTAFFP
Subjt:  VAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFP

Query:  LQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQ
        LQSCMNHSC PNAKAFKREEDRDGQA IIA+R I   EEVTISYIDE+LP+++R+ALLADYGF C+C KCL+
Subjt:  LQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQ

Q5ZIZ2 SET and MYND domain-containing protein 51.2e-1225.07Show/hide
Query:  LMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREAL--LKFIQNA-------NDTNDIFLLAAKAISSTILRYK
        L    L LPH ++ S+   +       +  YCS  C +A  E +H +LC G    PSR +    L  +Q A        +T+ I L+A            
Subjt:  LMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEAFHSLLCTGGKTEPSRREAL--LKFIQNA-------NDTNDIFLLAAKAISSTILRYK

Query:  KLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWW------DCIALPEDVEPSNEVAFRMQIREMAFTSLQLLK----EAILDEGCEPLFSLEIYGQ
          ++  + +Q K                      + WW       C     + E   E+A ++ + +     L+LL+    EA+ DE     F+ E +  
Subjt:  KLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWW------DCIALPEDVEPSNEVAFRMQIREMAFTSLQLLK----EAILDEGCEPLFSLEIYGQ

Query:  IIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAE---EITRPLLDVLGDSYS-ICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPI
        +  +   N   +  +S +  +    D L  P  ++ E    I +   D+  +S   + C+G+  + LQSC NHSC PNA+      D +    + A+  I
Subjt:  IIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAE---EITRPLLDVLGDSYS-ICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPI

Query:  HPGEEVTISYID----EDLPFEQRRALLADYGFECRCPKCLQQ
          GEE+ ISY+D    E     + + L  +Y F C CPKCL Q
Subjt:  HPGEEVTISYID----EDLPFEQRRALLADYGFECRCPKCLQQ

Q6GPQ4 SET and MYND domain-containing protein 53.1e-1137.61Show/hide
Query:  ELPSPYKEK----AEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID---EDLPFEQRRAL
        ELP   +EK     +++ + +  V G+  +  C+G+  + LQSC NHSC PNA+A     D +    + A+  I PGEE+ ISY+D    D     R+ +
Subjt:  ELPSPYKEK----AEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID---EDLPFEQRRAL

Query:  LAD-YGFECRCPKCLQQ
        L + Y F C CPKCL Q
Subjt:  LAD-YGFECRCPKCLQQ

Q9LSX7 Peroxisome biogenesis protein 223.3e-8268.27Show/hide
Query:  MAEASS---KDDLLQLIKRFGAYLTLKISN-FLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSP-NGHQRRQSKRQM-PAPSSSTSSVGLNSN-AQLIS
        MAE+SS    +++++LIKR  AY+  K+S+ F   S+ NLDSRS+GAIAG A+A++FTWR +R+P    QRRQ KR++  A +SS ++    SN A  ++
Subjt:  MAEASS---KDDLLQLIKRFGAYLTLKISN-FLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSP-NGHQRRQSKRQM-PAPSSSTSSVGLNSN-AQLIS

Query:  SGDCSSSEDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSAL
            S  ED   Q+VVD+FFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE SPE+LQKQATVRSSVLEVLLEITKY DLYLME VLDDESE +VL AL
Subjt:  SGDCSSSEDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSAL

Query:  EDAGVFTSGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA
        E+AGVFTSGGLVKDKVLFCS E GRTSFVRQLEPDWHID+NPEI +QLA
Subjt:  EDAGVFTSGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA

Arabidopsis top hitse value%identityAlignment
AT1G26760.1 SET domain protein 352.8e-0736.36Show/hide
Query:  GTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATII-AVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKC
        G   + L S +NHSC PNA+         G   I+ A R I  GEE++ +Y D   P E+R+ +   +GF C C +C
Subjt:  GTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATII-AVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKC

AT2G17900.1 SET domain group 371.5e-0825.11Show/hide
Query:  KYGNIVDLSILLEAWKPISMGHKRRWW----DCIALPE-DVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVAS
        K  N+   S     W   S   K  W     +C AL   + E    V   +++    +    L  E +L       +SL +   +  M E++   +++ +
Subjt:  KYGNIVDLSILLEAWKPISMGHKRRWW----DCIALPE-DVEPSNEVAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVAS

Query:  PVEDYFLYVDELPS-PYKEKAEEITRPLLDVLGDSYSIC-----CQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID-E
         + +    + + PS   +E AE  ++       +++SIC      QG   FPL S +NHSC PNA     E+     A + A+  I    E+TISYI+  
Subjt:  PVEDYFLYVDELPS-PYKEKAEEITRPLLDVLGDSYSIC-----CQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID-E

Query:  DLPFEQRRALLADYGFECRCPKC
             ++++L   Y F C+C +C
Subjt:  DLPFEQRRALLADYGFECRCPKC

AT3G21820.1 histone-lysine N-methyltransferase ATXR22.2e-17463.56Show/hide
Query:  EISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGV
        +++ALL+P    Q+QEYF++L+  R+C G+ VK +G +GKGV+A++ F E +L+LKD++LVG QH+SNK+DCLVCSFCFRF+GSIE QIGRKLYF++LGV
Subjt:  EISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSIELQIGRKLYFQDLGV

Query:  STNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEA
        S    CD + S    ++C +            N E  GG SSS++    LP G+V SLMNG ++LPH+ +F +P  + C GGC EAFYCS+SCA ADWE+
Subjt:  STNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAEADWEA

Query:  FHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNE
         HSLLCTG ++E   REAL +FI++ANDTNDIFLLAAKAI+ TILRY+KLK    +++ K       S+LLEAWKP+S+G+KRRWWDCIALP+DV+P++E
Subjt:  FHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNE

Query:  VAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFP
         AFRMQI+ +A TSL+LLK AI D+ CE LFSLEIYG IIGMFELNNLDLVVASPVEDYFLY+D+LP   KE+ EEITRP LD LGD YS CCQGTAFFP
Subjt:  VAFRMQIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFP

Query:  LQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQ
        LQSCMNHSC PNAKAFKREEDRDGQA IIA+R I   EEVTISYIDE+LP+++R+ALLADYGF C+C KCL+
Subjt:  LQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQ

AT3G21865.1 peroxin 222.4e-8368.27Show/hide
Query:  MAEASS---KDDLLQLIKRFGAYLTLKISN-FLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSP-NGHQRRQSKRQM-PAPSSSTSSVGLNSN-AQLIS
        MAE+SS    +++++LIKR  AY+  K+S+ F   S+ NLDSRS+GAIAG A+A++FTWR +R+P    QRRQ KR++  A +SS ++    SN A  ++
Subjt:  MAEASS---KDDLLQLIKRFGAYLTLKISN-FLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSP-NGHQRRQSKRQM-PAPSSSTSSVGLNSN-AQLIS

Query:  SGDCSSSEDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSAL
            S  ED   Q+VVD+FFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE SPE+LQKQATVRSSVLEVLLEITKY DLYLME VLDDESE +VL AL
Subjt:  SGDCSSSEDLRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSAL

Query:  EDAGVFTSGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA
        E+AGVFTSGGLVKDKVLFCS E GRTSFVRQLEPDWHID+NPEI +QLA
Subjt:  EDAGVFTSGGLVKDKVLFCSKENGRTSFVRQLEPDWHIDSNPEIVSQLA

AT5G06620.1 SET domain protein 381.4e-0635.06Show/hide
Query:  GTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLAD-YGFECRCPKC
        G A + L S  NH C PNA         +  A +  +R +  GEE+ I YID  + +E R+ +L+  +GF C C +C
Subjt:  GTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLAD-YGFECRCPKC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAAGCTTCTTCCAAAGACGACCTACTCCAGCTGATCAAGCGCTTCGGGGCTTATCTCACTCTCAAGATCTCTAATTTCCTCCCTATCTCTCTCTACAATCTGGA
TTCACGATCTGTTGGGGCTATTGCTGGATTTGCTGTTGCAATAGTTTTTACATGGAGGCTGTTGAGATCACCTAATGGACATCAGAGACGGCAATCTAAAAGGCAAATGC
CTGCACCTAGCAGCTCTACTTCTAGTGTTGGGTTAAATTCAAATGCACAATTAATATCTTCTGGAGATTGTTCATCTTCAGAGGATTTACGAGCGCAAAATGTTGTTGAT
GAATTCTTCCAACCAGTCAAGCCAACTCTGGGCCAAATAGTGAGGCAAAAGTTGAGTGAAGGAAGAAAGGTTACATGTCGTTTGCTTGGAATAATTCTTGAGGAAAACAG
TCCAGAGGATCTGCAGAAACAAGCAACTGTGAGGTCCTCGGTATTGGAAGTACTATTGGAGATAACAAAATATTGTGATCTTTATCTCATGGAAACGGTGCTGGATGATG
AGAGTGAAAAAAGAGTTCTTTCAGCACTTGAAGATGCCGGGGTTTTCACATCTGGTGGTCTGGTAAAAGACAAGGTTCTCTTCTGTAGCAAGGAGAATGGACGAACATCT
TTCGTGCGTCAACTGGAACCCGATTGGCATATAGATTCCAATCCCGAAATCGTCTCGCAGTTGGCTTACCCCAATGAAATCTCTGCCCTCCTTTCACCTCCTTCACCTCT
CCAAGTTCAGGAATATTTTGATCAGCTTGTATGGATGAGGCAGTGTCGTGGTCTCAGAGTGAAGCAGGATGGCGCTCTTGGAAAGGGTGTCTTTGCTGATGCTGCTTTCA
AAGAAGGGGACCTTGTCTTGAAGGACCAAATGCTTGTGGGATCACAGCATACGTCAAATAAGATGGACTGTCTAGTGTGCAGCTTTTGTTTTCGCTTTGTTGGGTCTATA
GAACTTCAAATTGGAAGGAAATTGTACTTTCAAGATCTTGGCGTTTCGACTAATCATCAATGTGATATGGAACCGTCATCACCCATGTCAGAAGATTGCTATGAAGCGGA
ATCAGATGATGGCGAGGAGATTGCGTTAGAAAATAATGAAAGCATGGGAGGATGTTCTTCCAGCAATTCTAAAGGTGTAGCTTTACCCAACGGGCTTGTGGAATCATTGA
TGAATGGTGGCCTATCATTGCCTCATTCCAAAGAGTTTTCCATGCCTCCAGCAATTCCTTGTTCTGGGGGATGTGGTGAGGCCTTCTACTGCAGTAAATCGTGTGCAGAA
GCTGATTGGGAAGCATTCCATTCCTTACTTTGTACTGGGGGAAAAACTGAACCATCACGCAGGGAAGCACTGCTAAAATTTATACAAAATGCTAATGACACGAATGACAT
ATTCCTCCTTGCTGCAAAGGCAATTTCTTCTACTATTTTAAGGTATAAGAAGTTAAAGCTGGCTTGTTCTGAAGAACAAAGGAAATATGGGAACATTGTTGATCTATCCA
TTCTTTTGGAGGCATGGAAGCCAATCTCAATGGGACATAAGAGAAGGTGGTGGGATTGCATTGCATTGCCGGAAGATGTCGAACCTTCTAACGAAGTTGCATTCCGAATG
CAAATAAGAGAGATGGCTTTCACGTCACTGCAGCTCCTCAAGGAAGCAATTTTGGACGAAGGATGTGAACCATTATTCTCCCTTGAAATATATGGCCAGATAATTGGCAT
GTTTGAGCTAAATAATCTTGATTTGGTTGTAGCATCACCGGTAGAGGATTACTTCTTGTATGTTGACGAACTTCCATCTCCTTATAAGGAGAAGGCAGAGGAAATTACCC
GACCCCTTTTGGATGTTCTCGGCGATAGCTATTCAATCTGTTGTCAAGGTACAGCGTTTTTCCCTTTACAGAGTTGTATGAATCATTCCTGCTATCCTAATGCAAAAGCG
TTCAAAAGAGAGGAGGATAGAGATGGACAAGCGACCATAATTGCAGTGAGGCCAATCCATCCAGGAGAGGAGGTCACAATTTCATATATAGATGAAGATCTTCCATTTGA
ACAGAGACGAGCATTACTTGCAGATTATGGGTTCGAATGCAGGTGCCCCAAGTGCTTACAACAGCAGCATCCATAG
mRNA sequenceShow/hide mRNA sequence
TGTGTGGGCGACAGGAAAAAATAGGGAACCTCATTAACATTTCAACATCAGTATCAGCCATTGACGCCATAGCTCGCTTCCTTCTTCTGCTTCCAAATCCCTCTACTGCT
CAACTAACTTTGACGCCTCAATAATTCCATTTCGATTCTTCTATCCGCTTCCTTCTTCTGCTGGGCGCGCCAATTTTTCAGAAGCTCAATTCGCGAGGTTTTTGTGTTCC
CCATGGCCGAAGCTTCTTCCAAAGACGACCTACTCCAGCTGATCAAGCGCTTCGGGGCTTATCTCACTCTCAAGATCTCTAATTTCCTCCCTATCTCTCTCTACAATCTG
GATTCACGATCTGTTGGGGCTATTGCTGGATTTGCTGTTGCAATAGTTTTTACATGGAGGCTGTTGAGATCACCTAATGGACATCAGAGACGGCAATCTAAAAGGCAAAT
GCCTGCACCTAGCAGCTCTACTTCTAGTGTTGGGTTAAATTCAAATGCACAATTAATATCTTCTGGAGATTGTTCATCTTCAGAGGATTTACGAGCGCAAAATGTTGTTG
ATGAATTCTTCCAACCAGTCAAGCCAACTCTGGGCCAAATAGTGAGGCAAAAGTTGAGTGAAGGAAGAAAGGTTACATGTCGTTTGCTTGGAATAATTCTTGAGGAAAAC
AGTCCAGAGGATCTGCAGAAACAAGCAACTGTGAGGTCCTCGGTATTGGAAGTACTATTGGAGATAACAAAATATTGTGATCTTTATCTCATGGAAACGGTGCTGGATGA
TGAGAGTGAAAAAAGAGTTCTTTCAGCACTTGAAGATGCCGGGGTTTTCACATCTGGTGGTCTGGTAAAAGACAAGGTTCTCTTCTGTAGCAAGGAGAATGGACGAACAT
CTTTCGTGCGTCAACTGGAACCCGATTGGCATATAGATTCCAATCCCGAAATCGTCTCGCAGTTGGCTTACCCCAATGAAATCTCTGCCCTCCTTTCACCTCCTTCACCT
CTCCAAGTTCAGGAATATTTTGATCAGCTTGTATGGATGAGGCAGTGTCGTGGTCTCAGAGTGAAGCAGGATGGCGCTCTTGGAAAGGGTGTCTTTGCTGATGCTGCTTT
CAAAGAAGGGGACCTTGTCTTGAAGGACCAAATGCTTGTGGGATCACAGCATACGTCAAATAAGATGGACTGTCTAGTGTGCAGCTTTTGTTTTCGCTTTGTTGGGTCTA
TAGAACTTCAAATTGGAAGGAAATTGTACTTTCAAGATCTTGGCGTTTCGACTAATCATCAATGTGATATGGAACCGTCATCACCCATGTCAGAAGATTGCTATGAAGCG
GAATCAGATGATGGCGAGGAGATTGCGTTAGAAAATAATGAAAGCATGGGAGGATGTTCTTCCAGCAATTCTAAAGGTGTAGCTTTACCCAACGGGCTTGTGGAATCATT
GATGAATGGTGGCCTATCATTGCCTCATTCCAAAGAGTTTTCCATGCCTCCAGCAATTCCTTGTTCTGGGGGATGTGGTGAGGCCTTCTACTGCAGTAAATCGTGTGCAG
AAGCTGATTGGGAAGCATTCCATTCCTTACTTTGTACTGGGGGAAAAACTGAACCATCACGCAGGGAAGCACTGCTAAAATTTATACAAAATGCTAATGACACGAATGAC
ATATTCCTCCTTGCTGCAAAGGCAATTTCTTCTACTATTTTAAGGTATAAGAAGTTAAAGCTGGCTTGTTCTGAAGAACAAAGGAAATATGGGAACATTGTTGATCTATC
CATTCTTTTGGAGGCATGGAAGCCAATCTCAATGGGACATAAGAGAAGGTGGTGGGATTGCATTGCATTGCCGGAAGATGTCGAACCTTCTAACGAAGTTGCATTCCGAA
TGCAAATAAGAGAGATGGCTTTCACGTCACTGCAGCTCCTCAAGGAAGCAATTTTGGACGAAGGATGTGAACCATTATTCTCCCTTGAAATATATGGCCAGATAATTGGC
ATGTTTGAGCTAAATAATCTTGATTTGGTTGTAGCATCACCGGTAGAGGATTACTTCTTGTATGTTGACGAACTTCCATCTCCTTATAAGGAGAAGGCAGAGGAAATTAC
CCGACCCCTTTTGGATGTTCTCGGCGATAGCTATTCAATCTGTTGTCAAGGTACAGCGTTTTTCCCTTTACAGAGTTGTATGAATCATTCCTGCTATCCTAATGCAAAAG
CGTTCAAAAGAGAGGAGGATAGAGATGGACAAGCGACCATAATTGCAGTGAGGCCAATCCATCCAGGAGAGGAGGTCACAATTTCATATATAGATGAAGATCTTCCATTT
GAACAGAGACGAGCATTACTTGCAGATTATGGGTTCGAATGCAGGTGCCCCAAGTGCTTACAACAGCAGCATCCATAGGCAAGCCGCTCCCTGTGGGATTTCCTCCCAAC
GAGATGAGCAAACAAAAACAACTTTCACTGTTATGAACATCCACAGCAGTTCTTAATGTTTCAACAACTTTATTCCAATATGATTACTAGTTTTGTAGGATTAGACAAAT
AGTAGAATGAGCAAATTTGATTTTGTTATGCCAAATTTGTGTTTA
Protein sequenceShow/hide protein sequence
MAEASSKDDLLQLIKRFGAYLTLKISNFLPISLYNLDSRSVGAIAGFAVAIVFTWRLLRSPNGHQRRQSKRQMPAPSSSTSSVGLNSNAQLISSGDCSSSEDLRAQNVVD
EFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLSALEDAGVFTSGGLVKDKVLFCSKENGRTS
FVRQLEPDWHIDSNPEIVSQLAYPNEISALLSPPSPLQVQEYFDQLVWMRQCRGLRVKQDGALGKGVFADAAFKEGDLVLKDQMLVGSQHTSNKMDCLVCSFCFRFVGSI
ELQIGRKLYFQDLGVSTNHQCDMEPSSPMSEDCYEAESDDGEEIALENNESMGGCSSSNSKGVALPNGLVESLMNGGLSLPHSKEFSMPPAIPCSGGCGEAFYCSKSCAE
ADWEAFHSLLCTGGKTEPSRREALLKFIQNANDTNDIFLLAAKAISSTILRYKKLKLACSEEQRKYGNIVDLSILLEAWKPISMGHKRRWWDCIALPEDVEPSNEVAFRM
QIREMAFTSLQLLKEAILDEGCEPLFSLEIYGQIIGMFELNNLDLVVASPVEDYFLYVDELPSPYKEKAEEITRPLLDVLGDSYSICCQGTAFFPLQSCMNHSCYPNAKA
FKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEQRRALLADYGFECRCPKCLQQQHP