CuGenDBv2

Sed0009104 (gene) of Chayote v1 genome

Gene ID: Sed0009104
Organism: Sechium edule (Chayote v1)
Description: Protein SET DOMAIN GROUP 41
Genome location: LG10:33385600..33389084
RNA-Seq Expression: Sed0009104
Synteny: Sed0009104
Gene Ontology terms: GO:0016020 - membrane (cellular component)
                     GO:0005515 - protein binding (molecular function)
InterPro domains: IPR001214 - SET domain
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
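The genome location above uses a `seqid:start..end` string. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'LG10:33385600..33389084'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG10:33385600..33389084")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3485 bp on LG10.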


Homology
GenBank top hits | e value | % identity | Alignment
XP_008463080.1 PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo] | 3.7e-223 | 65.11
Query:  MEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSL--SDPLAAAV----NLP--SSATADLRAALRLL-----
        MEM A+EDIEMAEDITPPL PLT+ALH  FL THCS+CF+ LPNPP  +S  L YCS KCSL  SDPL AA      LP  SS T+DLRA+LRLL     
Subjt:  MEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSL--SDPLAAAV----NLP--SSATADLRAALRLL-----

Query:  --LANPSLSPSSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINH
            +PSLSP    RI GLLTNR KLM  +  +E+ L +R+ A A+AA R  +  D   G ALEEAV+CLV+TNAV+V DS G++IGIAVY P+FSWINH
Subjt:  --LANPSLSPSSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINH

Query:  SCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRS
        SCSPNACYRFET SD   +R RI+P  CTD    +G+    Q+G V  SN+LDF+R DFQG GPRV+VRSIK I+K E VTIAYCDLLQPKA RQSEL S
Subjt:  SCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRS

Query:  RYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHN
        RYQFVC+CQRC A PLT Y DH LQEISAV  +L + S  ISNFD++  VRRI++YVDNAI EYLS+GSP SCCEKL+NLLT GFRDE+ ED EGK+  +
Subjt:  RYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHN

Query:  LRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLW
        LRLHP H+L LNAYTAL SAYKV SCDLLA SS+    + ++H  +A TM KTSAAY+LFLAGA HHLFL EPSLIASAANCWVVAGESLLILAR SSLW
Subjt:  LRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLW

Query:  A--SNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIG
        A  +N S+  FP+ K MC NC WV+E N SRIHGR ++ DF EFS GISNCI +ISR+CWSFLTHGCPYLK FTDPFDF WP+T     N+ D     I 
Subjt:  A--SNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIG

Query:  SSFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILD
         S A  KT+D+   C+P   S++ER+SI  LGIHCL+YGGYLASICYG+HSHLASQIQNIL+
Subjt:  SSFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILD

XP_011656459.1 protein SET DOMAIN GROUP 41 [Cucumis sativus] | 3.6e-226 | 65.41
Query:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSL--SDPLAAAVNL------PSSATADLRAALRLL---
        MEMEMIA+EDIEMAEDI+PPL PLT+ALH  FL THCS+CF+ LPNPP  +S PL YCS KCSL  SDPL  A          SS T+DLRA+LRLL   
Subjt:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSL--SDPLAAAVNL------PSSATADLRAALRLL---

Query:  --LANPSLSPSSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINH
            +PSLSP   DRI GLLTNR KLM  + D+E+ L +R+GA A+AA R  +  D P G ALEEAV+CLV+TNAV+V DS G++IGIAVY  +FSWINH
Subjt:  --LANPSLSPSSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINH

Query:  SCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRS
        SCSPNACYRFET SDS+ +R RI+P  CTD    +GS    Q+G V  SN+LDF+R DFQG GPRV+VRSIK I+K E VTIAYCDLLQPKA RQSEL S
Subjt:  SCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRS

Query:  RYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHN
        RYQFVC+CQRC A PLT Y DH LQEIS+V  +L + ST ISNFD++  VRRI++YVDNAI EYLS  SP SCCEKL+NLLT GF DE+ ED EGK+  +
Subjt:  RYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHN

Query:  LRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLW
        LRLHP H+L LNAYTAL SAYKV SCDL+A SS+    + ++H  +A TM KTSAAY+LFLAGA H LFL EPSL+ASAANCWVVAGESLLILAR SSLW
Subjt:  LRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLW

Query:  A--SNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIG
        A  +N SN  FP+ K MC+NC WV+E NASRIHG+ ++ DF EFS GISNCI +IS++CWS LTHGCPYLK FT PFDF WP+T     N +D   R I 
Subjt:  A--SNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIG

Query:  SSFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILD
         S A  KT+DV   CKP   S++ER+SI  LGIHCL+YGGYLASICYGHHSHLASQIQNIL+
Subjt:  SSFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILD

XP_022974027.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima] | 6.4e-223 | 64.69
Query:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSLSDPLAAAV----NLPSSATADLRAALR---LLLANP
        MEME+ AMEDIEMAEDITPPLPPLTAALH  FL THCS+CF+PLPN P  +S+ LRYCSP CS SD L AAV    +   S T+DLRA+LR   LLL++ 
Subjt:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSLSDPLAAAV----NLPSSATADLRAALR---LLLANP

Query:  SLSPSS-SDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPN
        S   S+  +RI GLLTNR+KLMLA+ D+E+   IR+GA A+A  R  +  D    NALEEA+MCLV+TNAVEV DS G++IGIAVY P+F WINHSCSPN
Subjt:  SLSPSS-SDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPN

Query:  ACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFV
        ACYRFET SDSI++R+RISP  CTD+   +GS   +Q+ TV   N   F+  DFQGYGPRV+VRSIK+IRK E VTIAYCDLLQPKAMRQSELRSRY+FV
Subjt:  ACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFV

Query:  CTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHP
        C+CQRC AKP T Y DH LQEI AV  +  + STSISNFD +  + RI+DYV+NAI EYLS+GSP SCCEKL+NLLTLGF DE+ +D +GK+L NLRLHP
Subjt:  CTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHP

Query:  FHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNIS
         H+L LN YTALASAYKV S             ++++++ + STM KTSAAYSLFLAGA HHLFL EPSLIASAANCWVVAGESLL L R SSLW SN S
Subjt:  FHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNIS

Query:  NCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGK
          S P+ +I C NC WV++ N SRIHGR+++VDF EFS GISNCI NIS + WSFLTH CPYLK FTDPFDF WP+T  T SN RDR          Y K
Subjt:  NCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGK

Query:  TEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILDYM
         +DV         SD++RQSIFELGIHCLFYGGYLASICYGH SHL+SQIQ IL  M
Subjt:  TEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILDYM

XP_023520942.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo] | 2.6e-224 | 65.45
Query:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSLSDPLAAAV----NLPSSATADLRAALR---LLLANP
        MEMEM AMEDIEMAEDITPPLPPLTAALH  FL THCS+CF+PLPN    +S+ LRYCSP CS SD L AAV      P S T+DLRA+LR   LLL++P
Subjt:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSLSDPLAAAV----NLPSSATADLRAALR---LLLANP

Query:  SLSPSS-SDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPN
        S   S+  +RI GLLTNR+KLMLA+ D+E+ + IR+G+ AMAA R  +  D    NALEEA++CLV+TNAVEV DS GR+IGIAVY P+F WINHSCSPN
Subjt:  SLSPSS-SDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPN

Query:  ACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFV
        ACYRFET SDSI++R+RISP  CTD+   +GS   +Q+ TV   N   F+  DFQGYGPRV+VRSIK+IR  E VTIAYCDLLQPKAMRQSELRSRY+FV
Subjt:  ACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFV

Query:  CTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHP
        C+CQRC AKP T Y DH LQEISAV  +L + STSISNFD +  + RI+DYV+NAI EYLS+GS  SCCEKL+NLLTLGF DE+ ED +GK+L NLRLHP
Subjt:  CTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHP

Query:  FHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNIS
         H+L LNAYTALASAYKV          ++ NGD ++     +TM KTSAAYSLFLAGA HHLFL+EPSLIASAANCWVVAGESLLIL + SSLW SN S
Subjt:  FHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNIS

Query:  NCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGK
          S P+ +I C NC WV++ N SRIHGR+++ DF EFS GISNCI NIS++ WSFL H C YLK FTDPFDF WP+T  T SN RDR       S    K
Subjt:  NCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGK

Query:  TEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILDYM
         +DV         SD++RQSIFELGIHCLFYGGYLASICYGHHSHLASQIQ IL  M
Subjt:  TEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILDYM

XP_038886411.1 protein SET DOMAIN GROUP 41 [Benincasa hispida] | 5.6e-227 | 66.21
Query:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSL--SDPLAAA------VNLPSSATADLRAALR---LL
        MEMEMIAMEDIEMAEDITPPL PLT+ALH  FL THCS+CF+ LPNPP  +S+ LRYCSPKCSL  SDPL AA         P S T+DLRA+LR   LL
Subjt:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSL--SDPLAAA------VNLPSSATADLRAALR---LL

Query:  LANP--SLSPSSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINH
        L++P  SLSP   +RI GLLTNR KLM  + DAE+   +R+G  A+AA  SA   D P G+ L EA +CLV TNAV+VHDSTGR+IGIAVY P+F WINH
Subjt:  LANP--SLSPSSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINH

Query:  SCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRS
        SCSPNACYRFET S S  +R RI+P  CTDL   +GS   +Q+GTV  SN+ DF+  DFQG GPRV+VRSIK+IR+ E VTIAYCDLLQPKAMRQSEL S
Subjt:  SCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRS

Query:  RYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHN
        RYQFVC+CQRC AKPLT Y DH LQE+SA   +L   STSISNFD++  VRRI+DYV++AI EYLS+GSP SCCEKL NLLTLGF DE+ ED E K+  N
Subjt:  RYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHN

Query:  LRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLW
        LRLHP H+LSLN YTALASAYKV SCDLLA SS+    + D  + +ASTM K SAAYSLFLAGA HHLFL+EPSLI SA+ CWV+AGESLL LAR S LW
Subjt:  LRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLW

Query:  A-SNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGS
        A +N S   FPV K MC  C WV++ NASRIHG+ ++ DF EFS GISNCI N+SR+ WSFLTHGCPYLK FTDPF+F WP+    YS++RD RA  I  
Subjt:  A-SNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGS

Query:  SFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNIL
          A   ++DV  +C+P  HS++ER+SI  LGIHCLFYGGYLASICYGHHSHLASQIQNIL
Subjt:  SFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNIL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KAK3 SET domain-containing protein | 6.9e-223 | 64.76
Query:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSL--SDPLAAAVNL------PSSATADLRAALRLL---
        MEMEMIA+EDIEMAEDI+PPL PLT+ALH  FL THCS+CF+ LPNPP  +S PL YCS KCSL  SDPL  A          SS T+DLRA+LRLL   
Subjt:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSL--SDPLAAAVNL------PSSATADLRAALRLL---

Query:  --LANPSLSPSSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINH
            +PSLSP   DRI GLLTNR KLM  + D+E+ L +R+GA A+AA R  +  D P G ALEEAV+CLV+TNAV+V DS G++IGIAVY  +FSWINH
Subjt:  --LANPSLSPSSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINH

Query:  SCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTD--FQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSEL
        SCSPNACYRFET SDS+ +R RI+P  CTD    +GS    Q+G V  SN+LDF+R      G GPRV+VRSIK I+K E VTIAYCDLLQPKA RQSEL
Subjt:  SCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTD--FQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSEL

Query:  RSRYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKL
         SRYQFVC+CQRC A PLT Y DH LQEIS+V  +L + ST ISNFD++  VRRI++YVDNAI EYLS  SP SCCEKL+NLLT GF DE+ ED EGK+ 
Subjt:  RSRYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKL

Query:  HNLRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSS
         +LRLHP H+L LNAYTAL SAYKV SCDL+A SS+    + ++H  +A TM KTSAAY+LFLAGA H LFL EPSL+ASAANCWVVAGESLLILAR SS
Subjt:  HNLRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSS

Query:  LWA--SNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARI
        LWA  +N SN  FP+ K MC+NC WV+E NASRIHG+ ++ DF EFS GISNCI +IS++CWS LTHGCPYLK FT PFDF WP+T     N +D   R 
Subjt:  LWA--SNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARI

Query:  IGSSFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILD
        I  S A  KT+DV   CKP   S++ER+SI  LGIHCL+YGGYLASICYGHHSHLASQIQNIL+
Subjt:  IGSSFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILD

A0A1S3CIT0 protein SET DOMAIN GROUP 41 isoform X1 | 1.8e-223 | 65.11
Query:  MEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSL--SDPLAAAV----NLP--SSATADLRAALRLL-----
        MEM A+EDIEMAEDITPPL PLT+ALH  FL THCS+CF+ LPNPP  +S  L YCS KCSL  SDPL AA      LP  SS T+DLRA+LRLL     
Subjt:  MEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSL--SDPLAAAV----NLP--SSATADLRAALRLL-----

Query:  --LANPSLSPSSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINH
            +PSLSP    RI GLLTNR KLM  +  +E+ L +R+ A A+AA R  +  D   G ALEEAV+CLV+TNAV+V DS G++IGIAVY P+FSWINH
Subjt:  --LANPSLSPSSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINH

Query:  SCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRS
        SCSPNACYRFET SD   +R RI+P  CTD    +G+    Q+G V  SN+LDF+R DFQG GPRV+VRSIK I+K E VTIAYCDLLQPKA RQSEL S
Subjt:  SCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRS

Query:  RYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHN
        RYQFVC+CQRC A PLT Y DH LQEISAV  +L + S  ISNFD++  VRRI++YVDNAI EYLS+GSP SCCEKL+NLLT GFRDE+ ED EGK+  +
Subjt:  RYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHN

Query:  LRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLW
        LRLHP H+L LNAYTAL SAYKV SCDLLA SS+    + ++H  +A TM KTSAAY+LFLAGA HHLFL EPSLIASAANCWVVAGESLLILAR SSLW
Subjt:  LRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLW

Query:  A--SNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIG
        A  +N S+  FP+ K MC NC WV+E N SRIHGR ++ DF EFS GISNCI +ISR+CWSFLTHGCPYLK FTDPFDF WP+T     N+ D     I 
Subjt:  A--SNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIG

Query:  SSFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILD
         S A  KT+D+   C+P   S++ER+SI  LGIHCL+YGGYLASICYG+HSHLASQIQNIL+
Subjt:  SSFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILD

A0A5A7T0X4 Protein SET DOMAIN GROUP 41 isoform X1 | 2.2e-189 | 65.67
Query:  MLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPK
        M  +  +E+ L +R+ A A+AA R  +  D   G ALEEAV+CLV+TNAV+V DS G++IGIAVY P+FSWINHSCSPNACYRFET SD   +R RI+P 
Subjt:  MLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPK

Query:  CCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRCCAKPLTYYADHVLQE
         CTD    +G+    Q+G V  SN+LDF+R DFQG GPRV+VRSIK I+K E VTIAYCDLLQPKA RQSEL SRYQFVC+CQRC A PLT Y DH LQE
Subjt:  CCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRCCAKPLTYYADHVLQE

Query:  ISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHPFHYLSLNAYTALASAYKVCSC
        ISAV  +L + S  ISNFD++  VRRI++YVDNAI EYLS+GSP SCCEKL+NLLT GFRDE+ ED EGK+  +LRLHP H+L LNAYTAL SAYKV SC
Subjt:  ISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHPFHYLSLNAYTALASAYKVCSC

Query:  DLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWA--SNISNCSFPVEKIMCFNCFWVNE
        DLLA SS+    + ++H  +A TM KTSAAY+LFLAGA HHLFL EPSLIASAANCWVVAGESLLILAR SSLWA  +N S+  FP+ K MC NC WV+E
Subjt:  DLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWA--SNISNCSFPVEKIMCFNCFWVNE

Query:  LNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGKTEDVISRCKPLVHSDKERQ
         N SRIHGR ++ DF EFS GISNCI +ISR+CWSFLTHGCPYLK FTDPFDF WP+T     N+ D     I  S A  KT+D+   C+P   S++ER+
Subjt:  LNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGKTEDVISRCKPLVHSDKERQ

Query:  SIFELGIHCLFYGGYLASICYGHHSHLASQIQNILD
        SI  LGIHCL+YGGYLASICYG+HSHLASQIQNIL+
Subjt:  SIFELGIHCLFYGGYLASICYGHHSHLASQIQNILD

A0A6J1EY39 protein SET DOMAIN GROUP 41 isoform X1 | 2.3e-218 | 63.88
Query:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSLSDPLAAAV----NLPSSATADLRAALRLLLANPSLS
        MEMEM AMEDIEMAEDITPPLPPLTAALH  F  THCS+CF+PLPN    +S+ LRYCSP CS SD L AAV    + P S T+DLRA+LRLL  +  LS
Subjt:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSLSDPLAAAV----NLPSSATADLRAALRLLLANPSLS

Query:  PSSS------DRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCS
         SS+      +RI GLLTNR+KLMLAE D+E+ + IR+GA AMAA R  +  D    NALEEA++CLV+TNAVEV DS G++IGIAVY P+F WINHSCS
Subjt:  PSSS------DRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCS

Query:  PNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQ
        PNACYRFET SDSI +R+RISP  CTD+   +GS   NQ+ TV   N   F+  DFQGYGPRV+VRSIK++RK E VTIAYCDLLQPKA+RQSEL SRY+
Subjt:  PNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQ

Query:  FVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRL
        FVC+CQRC AKP T Y DH LQEISA   +L + STSISNFD +  +RRI+DYV+NAI EYLS+GSP SCCEKL+NLLTLGF DE+ ED +GK+L NLRL
Subjt:  FVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRL

Query:  HPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASN
        HP H+L LN YTALASAYKV S              ND   +  +TM KTSAAYSLFLAGA HHLFL EPSLIASAANCWVVAGESLLIL + SSLW SN
Subjt:  HPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASN

Query:  ISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAY
         S  S P+ +I C NC WV++ N +RIHGR+++ DF EFS GISNCI +IS + WSFL H C YLK FTDPFDF WP+T  T  N         G S   
Subjt:  ISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAY

Query:  GKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILDYM
         K +DV         S+++RQSIFELGIHCLFYGGYLASICYGH SHLASQI+ IL  M
Subjt:  GKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILDYM

A0A6J1I954 protein SET DOMAIN GROUP 41 isoform X1 | 3.1e-223 | 64.69
Query:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSLSDPLAAAV----NLPSSATADLRAALR---LLLANP
        MEME+ AMEDIEMAEDITPPLPPLTAALH  FL THCS+CF+PLPN P  +S+ LRYCSP CS SD L AAV    +   S T+DLRA+LR   LLL++ 
Subjt:  MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSLSDPLAAAV----NLPSSATADLRAALR---LLLANP

Query:  SLSPSS-SDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPN
        S   S+  +RI GLLTNR+KLMLA+ D+E+   IR+GA A+A  R  +  D    NALEEA+MCLV+TNAVEV DS G++IGIAVY P+F WINHSCSPN
Subjt:  SLSPSS-SDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPN

Query:  ACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFV
        ACYRFET SDSI++R+RISP  CTD+   +GS   +Q+ TV   N   F+  DFQGYGPRV+VRSIK+IRK E VTIAYCDLLQPKAMRQSELRSRY+FV
Subjt:  ACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFV

Query:  CTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHP
        C+CQRC AKP T Y DH LQEI AV  +  + STSISNFD +  + RI+DYV+NAI EYLS+GSP SCCEKL+NLLTLGF DE+ +D +GK+L NLRLHP
Subjt:  CTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHP

Query:  FHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNIS
         H+L LN YTALASAYKV S             ++++++ + STM KTSAAYSLFLAGA HHLFL EPSLIASAANCWVVAGESLL L R SSLW SN S
Subjt:  FHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNIS

Query:  NCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGK
          S P+ +I C NC WV++ N SRIHGR+++VDF EFS GISNCI NIS + WSFLTH CPYLK FTDPFDF WP+T  T SN RDR          Y K
Subjt:  NCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGK

Query:  TEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILDYM
         +DV         SD++RQSIFELGIHCLFYGGYLASICYGH SHL+SQIQ IL  M
Subjt:  TEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILDYM

SwissProt top hits | e value | % identity | Alignment
Q3ECY6 Protein SET DOMAIN GROUP 41 | 1.1e-97 | 37.63
Query:  MEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPL-PNPPFPNSDPLRYCSPKCSLSDPLAAAVNLPSSAT----ADLRAALRLLLANPSLSP
        ME+ A EDIE+  D+ PPL PL ++L+  FL +HCS+CF+ L P+PP P      YCS  CSL+D    +   P   T    +D+R +L LL +    + 
Subjt:  MEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPL-PNPPFPNSDPLRYCSPKCSLSDPLAAAVNLPSSAT----ADLRAALRLLLANPSLSP

Query:  SSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRF
        SS  R+  LLTN   LM    D  I + I   A  +A    ++  +T     LEEA +C V+TNAVEVHDS G ++GIA+Y+ SFSWINHSCSPN+CYRF
Subjt:  SSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRF

Query:  ETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQR
             S       + +  ++LE+          GT   S           G GP++IVRSIK I+  E +T++Y DLLQP  +RQS+L S+Y+F+C C R
Subjt:  ETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQR

Query:  CCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFD----NNNVVRRINDYVDNAIDEYLSVG-SPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHP
        C A P   Y D +L+ +  +  +     T++ +FD     +  V ++NDY+  AID++LS    P +CCE +E++L  G      + +E  + H LRLH 
Subjt:  CCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFD----NNNVVRRINDYVDNAIDEYLSVG-SPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHP

Query:  FHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNIS
         HY++LNAY  LA+AY++ S D     S+TG             M + SAAYSLFLAG +HHLF AE S   SAA  W  AGE L  LA +  +  S  S
Subjt:  FHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNIS

Query:  NCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGK
        +       + C  C  +   N+ R        D  E S  I +C+ +IS+  WSFLT GCPYL+ F  P DF     S+T +N               G+
Subjt:  NCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGK

Query:  TEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQ
         E+          S  +  ++  L  HCL Y   L  +CYG  SHL S+ +
Subjt:  TEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQ

Q7XJS0 Histone-lysine N-methyltransferase ASHR1 | 9.5e-04 | 26.81
Query:  NAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKN
        NA  + DS  R  GI ++ P  S INHSCSPNA   FE +                                                     +VR++ N
Subjt:  NAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKN

Query:  IRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRC
        I K+  +TI+Y +       RQ  L+ +Y F C C RC
Subjt:  IRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRC

Q9CWR2 Histone-lysine N-methyltransferase SMYD3 | 3.3e-04 | 21.91
Query:  RYCSPKCSLS-------DPLAAAVNLPSSATADLRAALRLLLANPSLSPSSSDRILGLLTNRDKL-MLAEPDAEILLTIRQGATAMAAFRSADPDDTPTG
        +YCS KC          +        P      +R   R+++      PS S+++         +  L E   E L  +             D    P  
Subjt:  RYCSPKCSLS-------DPLAAAVNLPSSATADLRAALRLLLANPSLSPSSSDRILGLLTNRDKL-MLAEPDAEILLTIRQGATAMAAFRSADPDDTPTG

Query:  NALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQ
          L EA    VI N+  + ++  + +G+ +Y PS S +NHSC PN                            C   +N                     
Subjt:  NALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQ

Query:  GYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRC
          GP +++R+++ I   E +TI Y D+L     R+ +LR +Y F C C RC
Subjt:  GYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRC

Q9H7B4 Histone-lysine N-methyltransferase SMYD3 | 1.0e-05 | 25.16
Query:  DPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVL
        D    P    L EA    VI N+  + ++  + +G+ +Y PS S +NHSC PN                            C   +N             
Subjt:  DPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVL

Query:  DFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRC
                  GP +++R++++I   E +TI Y D+L     R+ +LR +Y F C C RC
Subjt:  DFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRC

Arabidopsis top hits | e value | % identity | Alignment
AT1G43245.1 SET domain-containing protein | 8.0e-99 | 37.63
Query:  MEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPL-PNPPFPNSDPLRYCSPKCSLSDPLAAAVNLPSSAT----ADLRAALRLLLANPSLSP
        ME+ A EDIE+  D+ PPL PL ++L+  FL +HCS+CF+ L P+PP P      YCS  CSL+D    +   P   T    +D+R +L LL +    + 
Subjt:  MEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPL-PNPPFPNSDPLRYCSPKCSLSDPLAAAVNLPSSAT----ADLRAALRLLLANPSLSP

Query:  SSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRF
        SS  R+  LLTN   LM    D  I + I   A  +A    ++  +T     LEEA +C V+TNAVEVHDS G ++GIA+Y+ SFSWINHSCSPN+CYRF
Subjt:  SSSDRILGLLTNRDKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRF

Query:  ETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQR
             S       + +  ++LE+          GT   S           G GP++IVRSIK I+  E +T++Y DLLQP  +RQS+L S+Y+F+C C R
Subjt:  ETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQR

Query:  CCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFD----NNNVVRRINDYVDNAIDEYLSVG-SPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHP
        C A P   Y D +L+ +  +  +     T++ +FD     +  V ++NDY+  AID++LS    P +CCE +E++L  G      + +E  + H LRLH 
Subjt:  CCAKPLTYYADHVLQEISAVGDKLFVGSTSISNFD----NNNVVRRINDYVDNAIDEYLSVG-SPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHP

Query:  FHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNIS
         HY++LNAY  LA+AY++ S D     S+TG             M + SAAYSLFLAG +HHLF AE S   SAA  W  AGE L  LA +  +  S  S
Subjt:  FHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKTSAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNIS

Query:  NCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGK
        +       + C  C  +   N+ R        D  E S  I +C+ +IS+  WSFLT GCPYL+ F  P DF     S+T +N               G+
Subjt:  NCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTHGCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGK

Query:  TEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQ
         E+          S  +  ++  L  HCL Y   L  +CYG  SHL S+ +
Subjt:  TEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQ

AT2G17900.1 SET domain group 37 | 6.8e-05 | 26.81
Query:  NAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKN
        NA  + DS  R  GI ++ P  S INHSCSPNA   FE +                                                     +VR++ N
Subjt:  NAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPKCCTDLEICKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKN

Query:  IRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRC
        I K+  +TI+Y +       RQ  L+ +Y F C C RC
Subjt:  IRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRC


Sequences
CDS sequence
ATGGAGATGGAAATGATAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCGTTGCCGCCGCTCACCGCCGCCCTCCACAGTCCCTTCCTCCACACCCACTG
CTCCACCTGCTTCACCCCTCTCCCAAATCCCCCATTCCCCAACTCCGATCCCCTCCGCTACTGCTCCCCCAAATGCTCCCTCTCCGATCCCCTCGCCGCCGCCGTCAACC
TCCCCTCCTCCGCCACCGCCGACCTCCGCGCCGCCCTCCGCCTCCTCCTCGCCAACCCTTCCCTCTCGCCCTCTTCTTCCGACCGCATCCTTGGTCTTCTCACAAACCGC
GACAAATTGATGCTCGCCGAACCCGACGCCGAGATTCTCCTCACGATCCGGCAAGGCGCCACCGCCATGGCCGCGTTCAGATCGGCGGATCCCGACGACACTCCCACCGG
AAACGCATTGGAAGAGGCGGTTATGTGCCTTGTGATTACCAACGCTGTGGAGGTTCACGATTCGACCGGACGCAGTATTGGAATCGCTGTGTATGATCCTTCCTTCTCCT
GGATCAACCACAGTTGTTCTCCCAATGCTTGTTACAGATTTGAAACTCGGTCCGATTCAATCGAGTCTAGAATGCGGATTTCCCCCAAATGCTGCACCGATCTGGAGATT
TGTAAAGGAAGTTATAATAATAATCAGATTGGTACTGTTTCTGGTAGCAACGTTTTAGATTTCGTAAGAACAGATTTTCAGGGTTATGGTCCAAGAGTTATTGTCAGGAG
TATAAAGAATATAAGGAAAAATGAGCCTGTAACAATTGCATATTGTGACTTGTTGCAACCTAAGGCAATGAGGCAGTCAGAGTTGCGGTCAAGATATCAATTTGTGTGTA
CTTGCCAGCGATGTTGTGCCAAGCCCCTAACTTATTATGCAGACCATGTTTTGCAAGAAATTTCTGCTGTCGGAGATAAATTATTTGTTGGTTCGACTTCCATCAGCAAC
TTTGATAACAACAATGTAGTGAGAAGAATAAATGACTACGTCGACAATGCAATCGACGAGTACCTATCTGTTGGTTCTCCTGTGTCGTGTTGTGAAAAGCTTGAAAACTT
GCTCACTCTAGGTTTCCGTGATGAGAAAGAGGAAGATGAAGAAGGAAAGAAGCTGCACAATTTGAGGCTGCATCCCTTTCACTATCTGTCGCTTAATGCATACACAGCGC
TTGCATCGGCTTATAAAGTCTGTTCGTGTGATTTATTAGCTTCCAGTTCCAAAACAGGCAACGGTGACAACGATAAACATCGAAAGGATGCATCTACCATGATCAAAACA
AGTGCAGCATACTCCTTGTTTCTTGCAGGTGCCAACCACCATCTTTTTCTTGCTGAACCATCTTTGATTGCATCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCCTT
GCTTATTCTTGCCAGAAGGAGCTCATTATGGGCCTCTAACATTTCAAATTGCAGTTTCCCTGTGGAGAAAATAATGTGCTTTAATTGCTTTTGGGTCAATGAGCTCAACG
CGAGTAGAATCCACGGTCGAACTTTGAAAGTCGATTTTTTTGAGTTTTCAAGTGGTATTTCAAATTGCATCGTTAATATCTCACGTCAATGTTGGAGCTTTCTGACTCAT
GGCTGCCCATATTTGAAAACTTTCACTGACCCCTTTGATTTTAGATGGCCAGAAACAAGCATAACGTATTCGAATAACCGAGATAGACGCGCTCGTATAATTGGCAGCTC
GTTTGCTTATGGTAAGACTGAAGATGTTATATCTCGATGTAAACCACTGGTCCATTCTGATAAAGAGAGGCAATCAATCTTTGAGCTTGGCATCCATTGCTTATTCTATG
GGGGATATTTAGCAAGTATTTGTTATGGCCACCATTCACATTTGGCATCTCAAATTCAAAATATATTAGATTACATGTAG
mRNA sequence
CTCACGTAAATTATTCTATTAAAAATAGTAAATTGTCATATCCTAATTTTTTATTTCATTTATTTTCATCCTGATTTTGTCTAAATAAACTGGGATTAATTCAAACAAAA
CTCCAAAATAAACCATAACCACTGATGCAGAAGAAATGGAGATGGAAATGATAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCGTTGCCGCCGCTCACC
GCCGCCCTCCACAGTCCCTTCCTCCACACCCACTGCTCCACCTGCTTCACCCCTCTCCCAAATCCCCCATTCCCCAACTCCGATCCCCTCCGCTACTGCTCCCCCAAATG
CTCCCTCTCCGATCCCCTCGCCGCCGCCGTCAACCTCCCCTCCTCCGCCACCGCCGACCTCCGCGCCGCCCTCCGCCTCCTCCTCGCCAACCCTTCCCTCTCGCCCTCTT
CTTCCGACCGCATCCTTGGTCTTCTCACAAACCGCGACAAATTGATGCTCGCCGAACCCGACGCCGAGATTCTCCTCACGATCCGGCAAGGCGCCACCGCCATGGCCGCG
TTCAGATCGGCGGATCCCGACGACACTCCCACCGGAAACGCATTGGAAGAGGCGGTTATGTGCCTTGTGATTACCAACGCTGTGGAGGTTCACGATTCGACCGGACGCAG
TATTGGAATCGCTGTGTATGATCCTTCCTTCTCCTGGATCAACCACAGTTGTTCTCCCAATGCTTGTTACAGATTTGAAACTCGGTCCGATTCAATCGAGTCTAGAATGC
GGATTTCCCCCAAATGCTGCACCGATCTGGAGATTTGTAAAGGAAGTTATAATAATAATCAGATTGGTACTGTTTCTGGTAGCAACGTTTTAGATTTCGTAAGAACAGAT
TTTCAGGGTTATGGTCCAAGAGTTATTGTCAGGAGTATAAAGAATATAAGGAAAAATGAGCCTGTAACAATTGCATATTGTGACTTGTTGCAACCTAAGGCAATGAGGCA
GTCAGAGTTGCGGTCAAGATATCAATTTGTGTGTACTTGCCAGCGATGTTGTGCCAAGCCCCTAACTTATTATGCAGACCATGTTTTGCAAGAAATTTCTGCTGTCGGAG
ATAAATTATTTGTTGGTTCGACTTCCATCAGCAACTTTGATAACAACAATGTAGTGAGAAGAATAAATGACTACGTCGACAATGCAATCGACGAGTACCTATCTGTTGGT
TCTCCTGTGTCGTGTTGTGAAAAGCTTGAAAACTTGCTCACTCTAGGTTTCCGTGATGAGAAAGAGGAAGATGAAGAAGGAAAGAAGCTGCACAATTTGAGGCTGCATCC
CTTTCACTATCTGTCGCTTAATGCATACACAGCGCTTGCATCGGCTTATAAAGTCTGTTCGTGTGATTTATTAGCTTCCAGTTCCAAAACAGGCAACGGTGACAACGATA
AACATCGAAAGGATGCATCTACCATGATCAAAACAAGTGCAGCATACTCCTTGTTTCTTGCAGGTGCCAACCACCATCTTTTTCTTGCTGAACCATCTTTGATTGCATCT
GCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCCTTGCTTATTCTTGCCAGAAGGAGCTCATTATGGGCCTCTAACATTTCAAATTGCAGTTTCCCTGTGGAGAAAATAAT
GTGCTTTAATTGCTTTTGGGTCAATGAGCTCAACGCGAGTAGAATCCACGGTCGAACTTTGAAAGTCGATTTTTTTGAGTTTTCAAGTGGTATTTCAAATTGCATCGTTA
ATATCTCACGTCAATGTTGGAGCTTTCTGACTCATGGCTGCCCATATTTGAAAACTTTCACTGACCCCTTTGATTTTAGATGGCCAGAAACAAGCATAACGTATTCGAAT
AACCGAGATAGACGCGCTCGTATAATTGGCAGCTCGTTTGCTTATGGTAAGACTGAAGATGTTATATCTCGATGTAAACCACTGGTCCATTCTGATAAAGAGAGGCAATC
AATCTTTGAGCTTGGCATCCATTGCTTATTCTATGGGGGATATTTAGCAAGTATTTGTTATGGCCACCATTCACATTTGGCATCTCAAATTCAAAATATATTAGATTACA
TGTAGTGACAGTTATTCAATAGAAATGTAAATGGTTGTGAGATTGAAATTTCTTTGGGCTGCCTC
Protein sequence
MEMEMIAMEDIEMAEDITPPLPPLTAALHSPFLHTHCSTCFTPLPNPPFPNSDPLRYCSPKCSLSDPLAAAVNLPSSATADLRAALRLLLANPSLSPSSSDRILGLLTNR
DKLMLAEPDAEILLTIRQGATAMAAFRSADPDDTPTGNALEEAVMCLVITNAVEVHDSTGRSIGIAVYDPSFSWINHSCSPNACYRFETRSDSIESRMRISPKCCTDLEI
CKGSYNNNQIGTVSGSNVLDFVRTDFQGYGPRVIVRSIKNIRKNEPVTIAYCDLLQPKAMRQSELRSRYQFVCTCQRCCAKPLTYYADHVLQEISAVGDKLFVGSTSISN
FDNNNVVRRINDYVDNAIDEYLSVGSPVSCCEKLENLLTLGFRDEKEEDEEGKKLHNLRLHPFHYLSLNAYTALASAYKVCSCDLLASSSKTGNGDNDKHRKDASTMIKT
SAAYSLFLAGANHHLFLAEPSLIASAANCWVVAGESLLILARRSSLWASNISNCSFPVEKIMCFNCFWVNELNASRIHGRTLKVDFFEFSSGISNCIVNISRQCWSFLTH
GCPYLKTFTDPFDFRWPETSITYSNNRDRRARIIGSSFAYGKTEDVISRCKPLVHSDKERQSIFELGIHCLFYGGYLASICYGHHSHLASQIQNILDYM
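
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the snippet below translates a CDS fragment and stops at the first stop codon. The function name `translate` and the short test fragments are illustrative choices; the fragments are the first 30 and last 12 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS, then its last four codons (ending in TAG).
print(translate("ATGGAGATGGAAATGATAGCAATGGAAGAC"))
print(translate("GATTACATGTAG"))
```

The two fragments translate to `MEMEMIAMED` and `DYM`, matching the start and end of the protein sequence listed above.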