; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS011141 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS011141
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein SET DOMAIN GROUP 41
Genome locationscaffold325:372844..375687
RNA-Seq ExpressionMS011141
SyntenyMS011141
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001214 - SET domain
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463080.1 PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo]3.2e-22765.51Show/hide
Query:  MEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRL--LRLA
        MEMRA EDIEM EDITPPL PLTSALH SFL THCSSCFS     PIS S  L +CS KCS SHSD +             +S  +DLRAS RL  L L 
Subjt:  MEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRL--LRLA

Query:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPS
        LS+PS   S PP RIFGLLTNR KLM PQ+  +      + +++R  A A+AA RR   AD      ALEEA+LCLVLTNAVDVQ+S G+TIGIAVY P+
Subjt:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPS

Query:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFD-SDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR
        F WINHSCSPNACYRF    ET SD   +R RIAPSCT   +  G+C Q+ ++   + D   EDFQG+GPRVVVRSIKRI+KGEAVTIAYCDLLQPKA R
Subjt:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFD-SDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR

Query:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK
        QSELWSRYQF C CQRCS  P TYVD ALQEISA KV L DS  ISNF  D AVRRI +YVD+AI++YLSIGSPESCCEKL+N+LT GF ++Q +  EGK
Subjt:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK

Query:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS
          ++LRLHP H+L LNAYTAL SAYKVRS DLLAL S++D D+EN  NA TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGES+LIL R SS
Subjt:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS

Query:  FWA--ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG
         WA   + S W FP+ KRMCS C+WVD FN SRIHGR +  DF   SIG  +CIA+IS++CWSFLTHGCPYLKAFTDPFDFSWPKT         I+RS 
Subjt:  FWA--ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG

Query:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        AC KTKDI  +CE Q  SN+ER+ I  LG+HCL+YG YLAS+CYG+HSHLASQIQNIL+++
Subjt:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

XP_011656459.1 protein SET DOMAIN GROUP 41 [Cucumis sativus]1.5e-22464.3Show/hide
Query:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRLA
        MEMEM A EDIEM EDI+PPL PLTSALH SFL THCSSCFS     PIS S  L +CS KCS SHSD +             +S  +DLRAS RLL L 
Subjt:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRLA

Query:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPS
        LS+PS   S PP+RI+GLLTNR KLM PQ+D +      + +++R GA A+AA RR   AD      ALEEA+LCLVLTNAVDVQ+S G+TIGIAVY  +
Subjt:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPS

Query:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFD-SDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR
        F WINHSCSPNACYRF    ET SDSV +R RIAPSCT   +  GSC Q+ ++   + D   EDFQG+GPRVVVRSIKRI+KGEAVTIAYCDLLQPKA R
Subjt:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFD-SDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR

Query:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK
        QSELWSRYQF C CQRCS  P TYVD ALQEIS+ KV L DST ISNF  D AVRRI +YVD+AI++YLS  SPESCCEKL+N+LT GF ++Q +  EGK
Subjt:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK

Query:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS
          ++LRLHPLH+L LNAYTAL SAYKVRS DL+AL S++D D+ N  NA TM +TSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGES+LIL R SS
Subjt:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS

Query:  FWA--ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG
         WA   + S W FP+ KRMC  C+WVD FN+SRIHG+ V  DF   SIG  +CIA+ISQ+CWS LTHGCPYLKAFT PFDFSWPKT         I+ S 
Subjt:  FWA--ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG

Query:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        AC KT+D+  +C+ Q  SN+ER+ I  LG+HCL+YG YLAS+CYGHHSHLASQIQNIL+++
Subjt:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

XP_022932824.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata]1.8e-22265.14Show/hide
Query:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP
        MEMEMRA EDIEM EDITPPLPPLT+ALH +F LTHCSSCFSP+ +S       LR+CS  CS S S + A         S  +DLRAS RLL L LS+ 
Subjt:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP

Query:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWI
        S W S PPERIFGLLTNREKLML +DD +      + ++IR GA+AMAASRRT SAD    +NALEEA+LCLVLTNAV+VQ+S G+TIGIAVY P+FCWI
Subjt:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWI

Query:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDS--DEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSE
        NHSCSPNACYRF    ET SDS+++RLRI+P CT   TG GSCNQ +S V   F     +DFQG GPRV+VRSIK +RKGEAVTIAYCDLLQPKA+RQSE
Subjt:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDS--DEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSE

Query:  LWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLL
        L SRY+F C CQRCS KP TYVD ALQEISAF V L DSTSISNF  D A+RRI DYV++AI++YLSIGSPESCCEKL+N+LT GF ++QA+  +GK LL
Subjt:  LWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLL

Query:  NLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWA
        NLRLHP+H+L LN YTALASAYKVRS           +DDEN  NA TMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGES+LIL + SS W 
Subjt:  NLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWA

Query:  ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKT-TPSHSSINRSGACRKTKD
        ++ SK S PM +  C  C+WVD FN++RIHGR +  DF   SIG  +CIA+IS + WSFL H C YLKAFTDPFDFSWPKT T   +   RS  C K +D
Subjt:  ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKT-TPSHSSINRSGACRKTKD

Query:  IICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        +         S ++RQ IFELG+HCLFYG YLAS+CYGH SHLASQI+ IL +M
Subjt:  IICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

XP_023520942.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo]2.3e-22566.06Show/hide
Query:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP
        MEMEMRA EDIEM EDITPPLPPLT+ALH +FLLTHCSSCFSP+ +S       LR+CS  CSHS S + A         S  +DLRAS RLL L LS+P
Subjt:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP

Query:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWI
        S W S PPERIFGLLTNREKLML  DD +      + ++IR G++AMAASRRT SAD    +NALEEA+LCLVLTNAV+VQ+S GRTIGIAVY P+FCWI
Subjt:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWI

Query:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDS--DEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSE
        NHSCSPNACYRF    ET SDS+ +RLRI+P CT   TG GSC+Q +S V   F     +DFQG GPRV+VRSIK IR GEAVTIAYCDLLQPKAMRQSE
Subjt:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDS--DEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSE

Query:  LWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLL
        L SRY+F C CQRCS KP TYVD ALQEISA  V L DSTSISNF  D A+ RI DYV++AI++YLSIGS ESCCEKL+N+LT GF ++QA+  +GK LL
Subjt:  LWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLL

Query:  NLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWA
        NLRLHP+H+L LNAYTALASAYKVRS           + DEN  NA TMS+TSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGES+LIL + SS W 
Subjt:  NLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWA

Query:  ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSI-NRSGACRKTKD
        ++ SK S PM +  C  C+WVD FN+SRIHGR +  DF   SIG  +CIANISQ+ WSFL H C YLKAFTDPFDFSWPKT  + S+  +RS  C K +D
Subjt:  ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSI-NRSGACRKTKD

Query:  IICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        +         S+++RQ IFELG+HCLFYG YLAS+CYGHHSHLASQIQ IL +M
Subjt:  IICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

XP_038886411.1 protein SET DOMAIN GROUP 41 [Benincasa hispida]2.2e-22865.71Show/hide
Query:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSIA-------------SGAADLRASFRLLRLA
        MEMEM A EDIEM EDITPPL PLTSALH SFL THCSSCFS     PIS S  LR+CS KCS SHSD +              S  +DLRAS RLL L 
Subjt:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSIA-------------SGAADLRASFRLLRLA

Query:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPS
        LS+P    S PPERIFGLLTNR KLM PQ D +      L  ++R G +A+AA     SAD  H  + L EA LCLV TNAVDV +S GRTIGIAVY P+
Subjt:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPS

Query:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFD-SDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR
        FCWINHSCSPNACYRF    ET+S S  +R RIAPSCT   TG GSC+Q+ ++   L D   EDFQG+GPRV+VRSIK IR+GEAVTIAYCDLLQPKAMR
Subjt:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFD-SDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR

Query:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK
        QSELWSRYQF C CQRCS KP TYVD ALQE+SA KV L DSTSISNF  D AVRRI DYV+ AI++YLSIGSPESCCEKL N+LT GF ++QA+  E K
Subjt:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK

Query:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS
          +NLRLHPLH+LSLN YTALASAYKVRS DLLAL S++D D+E+  NASTM + SAAYSLFLAGATHHLFLSEPSLI SA+ CWV+AGES+L L R S 
Subjt:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS

Query:  FWA-ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGR--DVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHS--------SI
         WA  + SKW FP+ KRMCS C+WVD FN+SRIHG+  + DF   SIG  +CIAN+S++ WSFLTHGCPYLKAFTDPF+FSWPK  P +S        SI
Subjt:  FWA-ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGR--DVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHS--------SI

Query:  NRSGACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        +R  AC  +KD+  QCE Q HSN+ER+ I  LG+HCLFYG YLAS+CYGHHSHLASQIQNIL ++
Subjt:  NRSGACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

TrEMBL top hitse value%identityAlignment
A0A0A0KAK3 SET domain-containing protein3.3e-22263.5Show/hide
Query:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRLA
        MEMEM A EDIEM EDI+PPL PLTSALH SFL THCSSCFS     PIS S  L +CS KCS SHSD +             +S  +DLRAS RLL L 
Subjt:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRLA

Query:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPS
        LS+PS   S PP+RI+GLLTNR KLM PQ+D +      + +++R GA A+AA RR   AD      ALEEA+LCLVLTNAVDVQ+S G+TIGIAVY  +
Subjt:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPS

Query:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDED---FQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKA
        F WINHSCSPNACYRF    ET SDSV +R RIAPSCT   +  GSC Q+ ++   + D   +     G+GPRVVVRSIKRI+KGEAVTIAYCDLLQPKA
Subjt:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDED---FQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKA

Query:  MRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEE
         RQSELWSRYQF C CQRCS  P TYVD ALQEIS+ KV L DST ISNF  D AVRRI +YVD+AI++YLS  SPESCCEKL+N+LT GF ++Q +  E
Subjt:  MRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEE

Query:  GKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRS
        GK  ++LRLHPLH+L LNAYTAL SAYKVRS DL+AL S++D D+ N  NA TM +TSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGES+LIL R 
Subjt:  GKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRS

Query:  SSFWA--ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINR
        SS WA   + S W FP+ KRMC  C+WVD FN+SRIHG+ V  DF   SIG  +CIA+ISQ+CWS LTHGCPYLKAFT PFDFSWPKT         I+ 
Subjt:  SSFWA--ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINR

Query:  SGACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        S AC KT+D+  +C+ Q  SN+ER+ I  LG+HCL+YG YLAS+CYGHHSHLASQIQNIL+++
Subjt:  SGACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

A0A1S3CIT0 protein SET DOMAIN GROUP 41 isoform X11.5e-22765.51Show/hide
Query:  MEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRL--LRLA
        MEMRA EDIEM EDITPPL PLTSALH SFL THCSSCFS     PIS S  L +CS KCS SHSD +             +S  +DLRAS RL  L L 
Subjt:  MEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRL--LRLA

Query:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPS
        LS+PS   S PP RIFGLLTNR KLM PQ+  +      + +++R  A A+AA RR   AD      ALEEA+LCLVLTNAVDVQ+S G+TIGIAVY P+
Subjt:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPS

Query:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFD-SDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR
        F WINHSCSPNACYRF    ET SD   +R RIAPSCT   +  G+C Q+ ++   + D   EDFQG+GPRVVVRSIKRI+KGEAVTIAYCDLLQPKA R
Subjt:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFD-SDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR

Query:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK
        QSELWSRYQF C CQRCS  P TYVD ALQEISA KV L DS  ISNF  D AVRRI +YVD+AI++YLSIGSPESCCEKL+N+LT GF ++Q +  EGK
Subjt:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK

Query:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS
          ++LRLHP H+L LNAYTAL SAYKVRS DLLAL S++D D+EN  NA TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGES+LIL R SS
Subjt:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS

Query:  FWA--ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG
         WA   + S W FP+ KRMCS C+WVD FN SRIHGR +  DF   SIG  +CIA+IS++CWSFLTHGCPYLKAFTDPFDFSWPKT         I+RS 
Subjt:  FWA--ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG

Query:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        AC KTKDI  +CE Q  SN+ER+ I  LG+HCL+YG YLAS+CYG+HSHLASQIQNIL+++
Subjt:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

A0A6J1DFD6 protein SET DOMAIN GROUP 41 isoform X11.6e-20899.72Show/hide
Query:  MRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEE
        MRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEE
Subjt:  MRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEE

Query:  GKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRS
        GKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRS
Subjt:  GKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRS

Query:  SSFWAADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKT
        SSFWAADISKWSFPMDKRMCSKCTWV+SFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKT
Subjt:  SSFWAADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKT

Query:  KDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEMK
        KDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEMK
Subjt:  KDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEMK

A0A6J1EY39 protein SET DOMAIN GROUP 41 isoform X18.8e-22365.14Show/hide
Query:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP
        MEMEMRA EDIEM EDITPPLPPLT+ALH +F LTHCSSCFSP+ +S       LR+CS  CS S S + A         S  +DLRAS RLL L LS+ 
Subjt:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP

Query:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWI
        S W S PPERIFGLLTNREKLML +DD +      + ++IR GA+AMAASRRT SAD    +NALEEA+LCLVLTNAV+VQ+S G+TIGIAVY P+FCWI
Subjt:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWI

Query:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDS--DEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSE
        NHSCSPNACYRF    ET SDS+++RLRI+P CT   TG GSCNQ +S V   F     +DFQG GPRV+VRSIK +RKGEAVTIAYCDLLQPKA+RQSE
Subjt:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDS--DEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSE

Query:  LWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLL
        L SRY+F C CQRCS KP TYVD ALQEISAF V L DSTSISNF  D A+RRI DYV++AI++YLSIGSPESCCEKL+N+LT GF ++QA+  +GK LL
Subjt:  LWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLL

Query:  NLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWA
        NLRLHP+H+L LN YTALASAYKVRS           +DDEN  NA TMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGES+LIL + SS W 
Subjt:  NLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWA

Query:  ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKT-TPSHSSINRSGACRKTKD
        ++ SK S PM +  C  C+WVD FN++RIHGR +  DF   SIG  +CIA+IS + WSFL H C YLKAFTDPFDFSWPKT T   +   RS  C K +D
Subjt:  ADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKT-TPSHSSINRSGACRKTKD

Query:  IICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        +         S ++RQ IFELG+HCLFYG YLAS+CYGH SHLASQI+ IL +M
Subjt:  IICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

A0A6J1I954 protein SET DOMAIN GROUP 41 isoform X18.2e-22164.73Show/hide
Query:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP
        MEME+RA EDIEM EDITPPLPPLT+ALH SFLLTHCSSCFSP+ +S       LR+CS  CS+S S + A         S  +DLRAS RLL L LS+ 
Subjt:  MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP

Query:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWI
        S W S PPERIFGLLTNREKLML  DD +      +  +IR GA+A+A SRRT SAD    +NALEEA++CLVLTNAV+VQ+S G+TIGIAVY P+FCWI
Subjt:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWI

Query:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDS--DEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSE
        NHSCSPNACYRF    ET SDS+ +RLRI+P CT   TG GSC+Q +S V   F     +DFQG GPRV+VRSIK IRKGEAVTIAYCDLLQPKAMRQSE
Subjt:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDS--DEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSE

Query:  LWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKV-SLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPL
        L SRY+F C CQRCS KP TYVD ALQEI A  V  L DSTSISNF  D A+ RI DYV++AI++YLSIGSPESCCEKL+N+LT GF ++QA   +GK L
Subjt:  LWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKV-SLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPL

Query:  LNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFW
        LNLRLHP+H+L LN YTALASAYKVRS           +D+EN  N STMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGES+L L R SS W
Subjt:  LNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFW

Query:  AADISKWSFPMDKRMCSKCTWVDSFNSSRIHGR--DVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTKD
         ++ SK S PM +  C  C+WVD FN+SRIHGR  +VDF   SIG  +CIANIS + WSFLTH CPYLKAFTDPFDFSWPKT  +         C   +D
Subjt:  AADISKWSFPMDKRMCSKCTWVDSFNSSRIHGR--DVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTKD

Query:  IICQ-CETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
         +C   + Q  S+++RQ IFELG+HCLFYG YLAS+CYGH SHL+SQIQ IL +M
Subjt:  IICQ-CETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

SwissProt top hitse value%identityAlignment
O94256 SET domain and MYND-type zinc finger protein 61.1e-0425.68Show/hide
Query:  LEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDEDFQGSG
        L + L C +  NA+++  S   ++G+ + D   C +NHSC PN                                             +FD        G
Subjt:  LEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDEDFQGSG

Query:  PRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC
          V + S + I+K E + I+Y D+  PK++RQ +L  +Y FSC C RC
Subjt:  PRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC

Q3ECY6 Protein SET DOMAIN GROUP 411.0e-9836.88Show/hide
Query:  MEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS---PISDSLRHCSAKCSHSHSDSIASG---------AADLRASFRLLRLALSNPSVWHSD
        ME+RA EDIE+  D+ PPL PL S+L+ SFL +HCSSCFS   P      +CSA CS + S + +            +D+R S  L    L++ +V  S 
Subjt:  MEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS---PISDSLRHCSAKCSHSHSDSIASG---------AADLRASFRLLRLALSNPSVWHSD

Query:  PPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSP
         P R+  LLTN   LM          +  + + I + A  +A   R+       +N  LEEA +C VLTNAV+V +S+G  +GIA+Y+ SF WINHSCSP
Subjt:  PPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSP

Query:  NACYRFLLESETNSD------SVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSELW
        N+CYRF+    +  D         S L +     G     G+                   G+GP+++VRSIKRI+ GE +T++Y DLLQP  +RQS+LW
Subjt:  NACYRFLLESETNSD------SVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSELW

Query:  SRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNF----YDDNAVRRITDYVDDAISDYLSIG-SPESCCEKLENVLTSGFSEDQAKYEEGK
        S+Y+F C C RC+  P  YVD  L+ +   +    + T++ +F      D AV ++ DY+ +AI D+LS    P++CCE +E+VL  G      +++E  
Subjt:  SRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNF----YDDNAVRRITDYVDDAISDYLSIG-SPESCCEKLENVLTSGFSEDQAKYEEGK

Query:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS
            LRLH  HY++LNAY  LA+AY++RS             D        MSR SAAYSLFLAG +HHLF +E S   SAA  W  AGE +  L     
Subjt:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS

Query:  FWAADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTKD
             + + S   D + C+KC  +++ NS R      D    S    SC+ +ISQ  WSFLT GCPYL+ F  P DFS  +T                  
Subjt:  FWAADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTKD

Query:  IICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQ
             E +  S ++   +  L  HCL Y   L  LCYG  SHL S+ +
Subjt:  IICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQ

Q9CWR2 Histone-lysine N-methyltransferase SMYD31.7e-0523.57Show/hide
Query:  VLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDEDFQGSGPRVVVRSI
        V+ N+  + N++ + +G+ +Y PS   +NHSC PN    F                                                  +GP +++R++
Subjt:  VLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDEDFQGSGPRVVVRSI

Query:  KRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC
        + I  GE +TI Y D+L     R+ +L  +Y F C C RC
Subjt:  KRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC

Q9H7B4 Histone-lysine N-methyltransferase SMYD33.8e-0523.57Show/hide
Query:  VLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDEDFQGSGPRVVVRSI
        V+ N+  + N++ + +G+ +Y PS   +NHSC PN    F                                                  +GP +++R++
Subjt:  VLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDEDFQGSGPRVVVRSI

Query:  KRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC
        + I  GE +TI Y D+L     R+ +L  +Y F C C RC
Subjt:  KRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC

Arabidopsis top hitse value%identityAlignment
AT1G43245.1 SET domain-containing protein7.1e-10036.88Show/hide
Query:  MEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS---PISDSLRHCSAKCSHSHSDSIASG---------AADLRASFRLLRLALSNPSVWHSD
        ME+RA EDIE+  D+ PPL PL S+L+ SFL +HCSSCFS   P      +CSA CS + S + +            +D+R S  L    L++ +V  S 
Subjt:  MEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFS---PISDSLRHCSAKCSHSHSDSIASG---------AADLRASFRLLRLALSNPSVWHSD

Query:  PPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSP
         P R+  LLTN   LM          +  + + I + A  +A   R+       +N  LEEA +C VLTNAV+V +S+G  +GIA+Y+ SF WINHSCSP
Subjt:  PPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSP

Query:  NACYRFLLESETNSD------SVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSELW
        N+CYRF+    +  D         S L +     G     G+                   G+GP+++VRSIKRI+ GE +T++Y DLLQP  +RQS+LW
Subjt:  NACYRFLLESETNSD------SVDSRLRIAPSCTGPETGGGSCNQVLSLVVYLFDSDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSELW

Query:  SRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNF----YDDNAVRRITDYVDDAISDYLSIG-SPESCCEKLENVLTSGFSEDQAKYEEGK
        S+Y+F C C RC+  P  YVD  L+ +   +    + T++ +F      D AV ++ DY+ +AI D+LS    P++CCE +E+VL  G      +++E  
Subjt:  SRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNF----YDDNAVRRITDYVDDAISDYLSIG-SPESCCEKLENVLTSGFSEDQAKYEEGK

Query:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS
            LRLH  HY++LNAY  LA+AY++RS             D        MSR SAAYSLFLAG +HHLF +E S   SAA  W  AGE +  L     
Subjt:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS

Query:  FWAADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTKD
             + + S   D + C+KC  +++ NS R      D    S    SC+ +ISQ  WSFLT GCPYL+ F  P DFS  +T                  
Subjt:  FWAADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTKD

Query:  IICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQ
             E +  S ++   +  L  HCL Y   L  LCYG  SHL S+ +
Subjt:  IICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATGGAAATGAGGGCAACAGAAGACATTGAAATGGGGGAAGATATTACTCCGCCATTGCCGCCCCTTACCTCAGCTCTCCACCATTCCTTCCTCCTCACTCACTG
CTCCTCCTGCTTCTCCCCAATCTCCGATTCCCTCCGCCACTGCTCCGCCAAATGCTCCCATTCCCATTCCGATTCCATCGCTTCCGGCGCCGCCGACCTCCGCGCCTCCT
TCCGCCTCCTCCGCCTCGCCCTCTCTAATCCCTCTGTTTGGCACTCCGATCCTCCCGAGCGCATCTTCGGCCTTCTCACCAATCGCGAGAAACTGATGCTGCCCCAAGAC
GACGACGACGGCGACGACGAGGGTTTACTGAGAATTCGAATTCGGAACGGCGCCGAGGCCATGGCCGCTTCCAGAAGGACGGGCTCTGCCGATGGATGCCATGAGAACAA
CGCCTTGGAGGAGGCCCTCCTCTGCCTCGTCTTGACCAACGCCGTCGATGTTCAGAATTCCGACGGCCGCACCATCGGAATCGCTGTGTACGATCCTTCCTTCTGCTGGA
TCAATCACAGTTGCTCTCCCAATGCTTGTTACAGATTTCTACTTGAATCCGAAACTAATTCGGATTCTGTCGATTCGAGGCTGCGGATTGCTCCCAGCTGCACTGGCCCC
GAGACTGGTGGAGGAAGTTGTAATCAAGTGTTGTCTCTTGTTGTATATTTATTTGATTCTGATGAAGATTTTCAGGGTTCTGGTCCAAGAGTCGTGGTTAGGAGTATAAA
GAGAATAAGGAAAGGTGAGGCAGTCACAATTGCATACTGTGACTTGTTGCAACCTAAGGCGATGAGACAGTCAGAGTTGTGGTCAAGGTATCAATTTTCCTGCTGTTGCC
AGCGATGTAGTATGAAGCCCCAAACTTATGTGGACCTTGCTTTGCAAGAAATTTCTGCCTTTAAAGTTAGTTTGTTTGATTCAACTTCCATTAGCAACTTCTATGACGAC
AATGCAGTGAGAAGAATAACTGATTATGTCGATGATGCAATTTCGGATTACCTATCCATTGGTTCTCCTGAATCTTGTTGCGAGAAGCTGGAAAACGTGCTTACTTCAGG
TTTCAGTGAAGATCAAGCAAAATATGAGGAAGGAAAACCGCTGCTTAATTTGAGGCTGCATCCTTTGCACTACTTGTCACTGAATGCATACACTGCTCTGGCCTCGGCTT
ATAAAGTGCGATCGAGTGATTTATTGGCTTTAGATTCCAAAATAGACGATGACGATGAAAATCTACGTAACGCATCGACCATGAGCAGAACAAGTGCAGCATACTCCTTG
TTCCTTGCAGGTGCTACTCACCATCTTTTTCTTTCTGAACCATCTTTGATTGCGTCTGCTGCTAATTGTTGGGTTGTTGCTGGAGAGTCTATGCTTATTCTTTGTAGAAG
CAGCTCATTTTGGGCTGCTGACATTTCAAAGTGGAGTTTCCCTATGGATAAAAGAATGTGTTCTAAATGCACATGGGTCGATAGCTTCAATTCTAGTCGAATTCACGGTC
GAGATGTCGATTTTCACGGCATCTCAATCGGTACTTTTAGTTGCATTGCTAATATTTCTCAAAGATGTTGGAGCTTTCTGACTCATGGTTGCCCATATTTGAAGGCTTTC
ACAGACCCTTTTGATTTCAGCTGGCCTAAGACGACCCCATCGCATTCGAGCATCAATCGCTCAGGTGCGTGTCGGAAAACTAAAGATATTATTTGTCAGTGTGAAACTCA
GGTGCATTCTAACGAAGAGAGGCAATGGATCTTTGAGCTTGGAATGCATTGCTTATTCTATGGGGCCTATTTAGCAAGTTTGTGTTATGGACACCATTCCCATTTGGCAT
CTCAGATTCAGAATATTTTGGACGAGATGAAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATGGAAATGAGGGCAACAGAAGACATTGAAATGGGGGAAGATATTACTCCGCCATTGCCGCCCCTTACCTCAGCTCTCCACCATTCCTTCCTCCTCACTCACTG
CTCCTCCTGCTTCTCCCCAATCTCCGATTCCCTCCGCCACTGCTCCGCCAAATGCTCCCATTCCCATTCCGATTCCATCGCTTCCGGCGCCGCCGACCTCCGCGCCTCCT
TCCGCCTCCTCCGCCTCGCCCTCTCTAATCCCTCTGTTTGGCACTCCGATCCTCCCGAGCGCATCTTCGGCCTTCTCACCAATCGCGAGAAACTGATGCTGCCCCAAGAC
GACGACGACGGCGACGACGAGGGTTTACTGAGAATTCGAATTCGGAACGGCGCCGAGGCCATGGCCGCTTCCAGAAGGACGGGCTCTGCCGATGGATGCCATGAGAACAA
CGCCTTGGAGGAGGCCCTCCTCTGCCTCGTCTTGACCAACGCCGTCGATGTTCAGAATTCCGACGGCCGCACCATCGGAATCGCTGTGTACGATCCTTCCTTCTGCTGGA
TCAATCACAGTTGCTCTCCCAATGCTTGTTACAGATTTCTACTTGAATCCGAAACTAATTCGGATTCTGTCGATTCGAGGCTGCGGATTGCTCCCAGCTGCACTGGCCCC
GAGACTGGTGGAGGAAGTTGTAATCAAGTGTTGTCTCTTGTTGTATATTTATTTGATTCTGATGAAGATTTTCAGGGTTCTGGTCCAAGAGTCGTGGTTAGGAGTATAAA
GAGAATAAGGAAAGGTGAGGCAGTCACAATTGCATACTGTGACTTGTTGCAACCTAAGGCGATGAGACAGTCAGAGTTGTGGTCAAGGTATCAATTTTCCTGCTGTTGCC
AGCGATGTAGTATGAAGCCCCAAACTTATGTGGACCTTGCTTTGCAAGAAATTTCTGCCTTTAAAGTTAGTTTGTTTGATTCAACTTCCATTAGCAACTTCTATGACGAC
AATGCAGTGAGAAGAATAACTGATTATGTCGATGATGCAATTTCGGATTACCTATCCATTGGTTCTCCTGAATCTTGTTGCGAGAAGCTGGAAAACGTGCTTACTTCAGG
TTTCAGTGAAGATCAAGCAAAATATGAGGAAGGAAAACCGCTGCTTAATTTGAGGCTGCATCCTTTGCACTACTTGTCACTGAATGCATACACTGCTCTGGCCTCGGCTT
ATAAAGTGCGATCGAGTGATTTATTGGCTTTAGATTCCAAAATAGACGATGACGATGAAAATCTACGTAACGCATCGACCATGAGCAGAACAAGTGCAGCATACTCCTTG
TTCCTTGCAGGTGCTACTCACCATCTTTTTCTTTCTGAACCATCTTTGATTGCGTCTGCTGCTAATTGTTGGGTTGTTGCTGGAGAGTCTATGCTTATTCTTTGTAGAAG
CAGCTCATTTTGGGCTGCTGACATTTCAAAGTGGAGTTTCCCTATGGATAAAAGAATGTGTTCTAAATGCACATGGGTCGATAGCTTCAATTCTAGTCGAATTCACGGTC
GAGATGTCGATTTTCACGGCATCTCAATCGGTACTTTTAGTTGCATTGCTAATATTTCTCAAAGATGTTGGAGCTTTCTGACTCATGGTTGCCCATATTTGAAGGCTTTC
ACAGACCCTTTTGATTTCAGCTGGCCTAAGACGACCCCATCGCATTCGAGCATCAATCGCTCAGGTGCGTGTCGGAAAACTAAAGATATTATTTGTCAGTGTGAAACTCA
GGTGCATTCTAACGAAGAGAGGCAATGGATCTTTGAGCTTGGAATGCATTGCTTATTCTATGGGGCCTATTTAGCAAGTTTGTGTTATGGACACCATTCCCATTTGGCAT
CTCAGATTCAGAATATTTTGGACGAGATGAAA
Protein sequenceShow/hide protein sequence
MEMEMRATEDIEMGEDITPPLPPLTSALHHSFLLTHCSSCFSPISDSLRHCSAKCSHSHSDSIASGAADLRASFRLLRLALSNPSVWHSDPPERIFGLLTNREKLMLPQD
DDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSDGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGP
ETGGGSCNQVLSLVVYLFDSDEDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDD
NAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSL
FLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWAADISKWSFPMDKRMCSKCTWVDSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAF
TDPFDFSWPKTTPSHSSINRSGACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEMK