; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g1270 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g1270
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein SET DOMAIN GROUP 41 isoform X1
Genome locationMC11:14381085..14384110
RNA-Seq ExpressionMC11g1270
SyntenyMC11g1270
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001214 - SET domain
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463080.1 PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo]1.57e-28665.81Show/hide
Query:  MEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRL--A
        MEMRA EDIEM EDITPPL P+TSALH SFL THCSSCFS     PIS S  L +CS KCS SHSD +             +S  +DLRAS RLL L   
Subjt:  MEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRL--A

Query:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPS
        LS+PS   S PP RIFGLLTNR KLM PQ+  +      + +++R  A A+AA RR   AD      ALEEA+LCLVLTNAVDVQ+S G+TIGIAVY P+
Subjt:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPS

Query:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR
        F WINHSCSPNACYRF    ET SD   +R RIAPSCT   +  G+C Q+G V SN  DF+ +DFQG+GPRVVVRSIKRI+KGEAVTIAYCDLLQPKA R
Subjt:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR

Query:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK
        QSELWSRYQF C CQRCS  P TYVD ALQEISA KV L DS  ISNF  D AVRRI +YVD+AI++YLSIGSPESCCEKL+N+LT GF ++Q +  EGK
Subjt:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK

Query:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS
          ++LRLHP H+L LNAYTAL SAYKVRS DLLAL S++D D+EN  NA TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGES+LIL R SS
Subjt:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS

Query:  FWAA--DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG
         WA   + S W FP+ KRMCS C+WV+ FN SRIHGR +  DF   SIG  +CIA+IS++CWSFLTHGCPYLKAFTDPFDFSWPKT         I+RS 
Subjt:  FWAA--DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG

Query:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        AC KTKDI  +CE Q  SN+ER+ I  LG+HCL+YG YLAS+CYG+HSHLASQIQNIL+++
Subjt:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

XP_011656459.1 protein SET DOMAIN GROUP 41 [Cucumis sativus]4.89e-28364.6Show/hide
Query:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRLA
        +EMEM A EDIEM EDI+PPL P+TSALH SFL THCSSCFS     PIS S  L +CS KCS SHSD +             +S  +DLRAS RLL L 
Subjt:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRLA

Query:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPS
        LS+PS   S PP+RI+GLLTNR KLM PQ+D +      + +++R GA A+AA RR   AD      ALEEA+LCLVLTNAVDVQ+S G+TIGIAVY  +
Subjt:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPS

Query:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR
        F WINHSCSPNACYRF    ET SDSV +R RIAPSCT   +  GSC Q+G V SN  DFI +DFQG+GPRVVVRSIKRI+KGEAVTIAYCDLLQPKA R
Subjt:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR

Query:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK
        QSELWSRYQF C CQRCS  P TYVD ALQEIS+ KV L DST ISNF  D AVRRI +YVD+AI++YLS  SPESCCEKL+N+LT GF ++Q +  EGK
Subjt:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK

Query:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS
          ++LRLHPLH+L LNAYTAL SAYKVRS DL+AL S++D D+ N  NA TM +TSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGES+LIL R SS
Subjt:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS

Query:  FWAA--DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG
         WA   + S W FP+ KRMC  C+WV+ FN+SRIHG+ V  DF   SIG  +CIA+ISQ+CWS LTHGCPYLKAFT PFDFSWPKT         I+ S 
Subjt:  FWAA--DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG

Query:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        AC KT+D+  +C+ Q  SN+ER+ I  LG+HCL+YG YLAS+CYGHHSHLASQIQNIL+++
Subjt:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

XP_022932824.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata]3.02e-28265.54Show/hide
Query:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP
        +EMEMRA EDIEM EDITPPLPP+T+ALH +F LTHCSSCFSP+ +S       LR+CS  CS S S + A         S  +DLRAS RLL L LS+ 
Subjt:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP

Query:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWI
        S W S PPERIFGLLTNREKLML +DD +      + ++IR GA+AMAASRRT SAD    +NALEEA+LCLVLTNAV+VQ+S G+TIGIAVY P+FCWI
Subjt:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWI

Query:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL
        NHSCSPNACYRF    ET SDS+++RLRI+P CT   TG GSCNQ+ TV  N S FITKDFQG GPRV+VRSIK +RKGEAVTIAYCDLLQPKA+RQSEL
Subjt:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL

Query:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLLN
         SRY+F C CQRCS KP TYVD ALQEISAF V L DSTSISNF  D A+RRI DYV++AI++YLSIGSPESCCEKL+N+LT GF ++QA+  +GK LLN
Subjt:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLLN

Query:  LRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWAA
        LRLHP+H+L LN YTALASAYKVRS +          DDEN  NA TMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGES+LIL + SS W +
Subjt:  LRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWAA

Query:  DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKT-TPSHSSINRSGACRKTKDI
        + SK S PM +  C  C+WV+ FN++RIHGR +  DF   SIG  +CIA+IS + WSFL H C YLKAFTDPFDFSWPKT T   +   RS  C K +D+
Subjt:  DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKT-TPSHSSINRSGACRKTKDI

Query:  ICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
                 S ++RQ IFELG+HCLFYG YLAS+CYGH SHLASQI+ IL +M
Subjt:  ICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

XP_023520942.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo]4.82e-28666.46Show/hide
Query:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP
        +EMEMRA EDIEM EDITPPLPP+T+ALH +FLLTHCSSCFSP+ +S       LR+CS  CSHS S + A         S  +DLRAS RLL L LS+P
Subjt:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP

Query:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWI
        S W S PPERIFGLLTNREKLML  DD +      + ++IR G++AMAASRRT SAD    +NALEEA+LCLVLTNAV+VQ+S GRTIGIAVY P+FCWI
Subjt:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWI

Query:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL
        NHSCSPNACYRF    ET SDS+ +RLRI+P CT   TG GSC+Q+ TV  N S FITKDFQG GPRV+VRSIK IR GEAVTIAYCDLLQPKAMRQSEL
Subjt:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL

Query:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLLN
         SRY+F C CQRCS KP TYVD ALQEISA  V L DSTSISNF  D A+ RI DYV++AI++YLSIGS ESCCEKL+N+LT GF ++QA+  +GK LLN
Subjt:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLLN

Query:  LRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWAA
        LRLHP+H+L LNAYTALASAYKVRS +           DEN  NA TMS+TSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGES+LIL + SS W +
Subjt:  LRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWAA

Query:  DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSI-NRSGACRKTKDI
        + SK S PM +  C  C+WV+ FN+SRIHGR +  DF   SIG  +CIANISQ+ WSFL H C YLKAFTDPFDFSWPKT  + S+  +RS  C K +D+
Subjt:  DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSI-NRSGACRKTKDI

Query:  ICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
                 S+++RQ IFELG+HCLFYG YLAS+CYGHHSHLASQIQ IL +M
Subjt:  ICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

XP_038886411.1 protein SET DOMAIN GROUP 41 [Benincasa hispida]1.34e-29066.17Show/hide
Query:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSIASG-------------AADLRASFRLLRLA
        +EMEM A EDIEM EDITPPL P+TSALH SFL THCSSCFS     PIS S  LR+CS KCS SHSD + +               +DLRAS RLL L 
Subjt:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSIASG-------------AADLRASFRLLRLA

Query:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPS
        LS+P    S PPERIFGLLTNR KLM PQ D +      L  ++R G +A+AA     SAD  H  + L EA LCLV TNAVDV +S GRTIGIAVY P+
Subjt:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPS

Query:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR
        FCWINHSCSPNACYRF    ET+S S  +R RIAPSCT   TG GSC+Q+GTV SN SDFIT+DFQG+GPRV+VRSIK IR+GEAVTIAYCDLLQPKAMR
Subjt:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR

Query:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK
        QSELWSRYQF C CQRCS KP TYVD ALQE+SA KV L DSTSISNF  D AVRRI DYV+ AI++YLSIGSPESCCEKL N+LT GF ++QA+  E K
Subjt:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK

Query:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS
          +NLRLHPLH+LSLN YTALASAYKVRS DLLAL S++D D+E+  NASTM + SAAYSLFLAGATHHLFLSEPSLI SA+ CWV+AGES+L L R S 
Subjt:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS

Query:  FWAA-DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSS--------I
         WA  + SKW FP+ KRMCS C+WV+ FN+SRIHG+ +  DF   SIG  +CIAN+S++ WSFLTHGCPYLKAFTDPF+FSWPK  P +SS        I
Subjt:  FWAA-DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSS--------I

Query:  NRSGACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        +R  AC  +KD+  QCE Q HSN+ER+ I  LG+HCLFYG YLAS+CYGHHSHLASQIQNIL ++
Subjt:  NRSGACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

TrEMBL top hitse value%identityAlignment
A0A0A0KAK3 SET domain-containing protein1.30e-27863.95Show/hide
Query:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRLA
        +EMEM A EDIEM EDI+PPL P+TSALH SFL THCSSCFS     PIS S  L +CS KCS SHSD +             +S  +DLRAS RLL L 
Subjt:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRLA

Query:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPS
        LS+PS   S PP+RI+GLLTNR KLM PQ+D +      + +++R GA A+AA RR   AD      ALEEA+LCLVLTNAVDVQ+S G+TIGIAVY  +
Subjt:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPS

Query:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKD--FQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKA
        F WINHSCSPNACYRF    ET SDSV +R RIAPSCT   +  GSC Q+G V SN  DFI +     G+GPRVVVRSIKRI+KGEAVTIAYCDLLQPKA
Subjt:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKD--FQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKA

Query:  MRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEE
         RQSELWSRYQF C CQRCS  P TYVD ALQEIS+ KV L DST ISNF  D AVRRI +YVD+AI++YLS  SPESCCEKL+N+LT GF ++Q +  E
Subjt:  MRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEE

Query:  GKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRS
        GK  ++LRLHPLH+L LNAYTAL SAYKVRS DL+AL S++D D+ N  NA TM +TSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGES+LIL R 
Subjt:  GKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRS

Query:  SSFWAA--DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINR
        SS WA   + S W FP+ KRMC  C+WV+ FN+SRIHG+ V  DF   SIG  +CIA+ISQ+CWS LTHGCPYLKAFT PFDFSWPKT         I+ 
Subjt:  SSFWAA--DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINR

Query:  SGACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        S AC KT+D+  +C+ Q  SN+ER+ I  LG+HCL+YG YLAS+CYGHHSHLASQIQNIL+++
Subjt:  SGACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

A0A1S3CIT0 protein SET DOMAIN GROUP 41 isoform X17.60e-28765.81Show/hide
Query:  MEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRL--A
        MEMRA EDIEM EDITPPL P+TSALH SFL THCSSCFS     PIS S  L +CS KCS SHSD +             +S  +DLRAS RLL L   
Subjt:  MEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS-----PISDS--LRHCSAKCSHSHSDSI-------------ASGAADLRASFRLLRL--A

Query:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPS
        LS+PS   S PP RIFGLLTNR KLM PQ+  +      + +++R  A A+AA RR   AD      ALEEA+LCLVLTNAVDVQ+S G+TIGIAVY P+
Subjt:  LSNPSVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPS

Query:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR
        F WINHSCSPNACYRF    ET SD   +R RIAPSCT   +  G+C Q+G V SN  DF+ +DFQG+GPRVVVRSIKRI+KGEAVTIAYCDLLQPKA R
Subjt:  FCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMR

Query:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK
        QSELWSRYQF C CQRCS  P TYVD ALQEISA KV L DS  ISNF  D AVRRI +YVD+AI++YLSIGSPESCCEKL+N+LT GF ++Q +  EGK
Subjt:  QSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGK

Query:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS
          ++LRLHP H+L LNAYTAL SAYKVRS DLLAL S++D D+EN  NA TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGES+LIL R SS
Subjt:  PLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSS

Query:  FWAA--DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG
         WA   + S W FP+ KRMCS C+WV+ FN SRIHGR +  DF   SIG  +CIA+IS++CWSFLTHGCPYLKAFTDPFDFSWPKT         I+RS 
Subjt:  FWAA--DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSH---SSINRSG

Query:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        AC KTKDI  +CE Q  SN+ER+ I  LG+HCL+YG YLAS+CYG+HSHLASQIQNIL+++
Subjt:  ACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

A0A6J1DFD6 protein SET DOMAIN GROUP 41 isoform X14.78e-264100Show/hide
Query:  MRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEE
        MRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEE
Subjt:  MRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEE

Query:  GKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRS
        GKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRS
Subjt:  GKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRS

Query:  SSFWAADISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKT
        SSFWAADISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKT
Subjt:  SSFWAADISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKT

Query:  KDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEMK
        KDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEMK
Subjt:  KDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEMK

A0A6J1EY39 protein SET DOMAIN GROUP 41 isoform X11.46e-28265.54Show/hide
Query:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP
        +EMEMRA EDIEM EDITPPLPP+T+ALH +F LTHCSSCFSP+ +S       LR+CS  CS S S + A         S  +DLRAS RLL L LS+ 
Subjt:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP

Query:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWI
        S W S PPERIFGLLTNREKLML +DD +      + ++IR GA+AMAASRRT SAD    +NALEEA+LCLVLTNAV+VQ+S G+TIGIAVY P+FCWI
Subjt:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWI

Query:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL
        NHSCSPNACYRF    ET SDS+++RLRI+P CT   TG GSCNQ+ TV  N S FITKDFQG GPRV+VRSIK +RKGEAVTIAYCDLLQPKA+RQSEL
Subjt:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL

Query:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLLN
         SRY+F C CQRCS KP TYVD ALQEISAF V L DSTSISNF  D A+RRI DYV++AI++YLSIGSPESCCEKL+N+LT GF ++QA+  +GK LLN
Subjt:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLLN

Query:  LRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWAA
        LRLHP+H+L LN YTALASAYKVRS +          DDEN  NA TMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGES+LIL + SS W +
Subjt:  LRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWAA

Query:  DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKT-TPSHSSINRSGACRKTKDI
        + SK S PM +  C  C+WV+ FN++RIHGR +  DF   SIG  +CIA+IS + WSFL H C YLKAFTDPFDFSWPKT T   +   RS  C K +D+
Subjt:  DISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDV--DFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKT-TPSHSSINRSGACRKTKDI

Query:  ICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
                 S ++RQ IFELG+HCLFYG YLAS+CYGH SHLASQI+ IL +M
Subjt:  ICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

A0A6J1I954 protein SET DOMAIN GROUP 41 isoform X16.01e-28065.14Show/hide
Query:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP
        +EME+RA EDIEM EDITPPLPP+T+ALH SFLLTHCSSCFSP+ +S       LR+CS  CS+S S + A         S  +DLRAS RLL L LS+ 
Subjt:  IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFSPISDS-------LRHCSAKCSHSHSDSIA---------SGAADLRASFRLLRLALSNP

Query:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWI
        S W S PPERIFGLLTNREKLML  DD +      +  +IR GA+A+A SRRT SAD    +NALEEA++CLVLTNAV+VQ+S G+TIGIAVY P+FCWI
Subjt:  SVWHSDPPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWI

Query:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL
        NHSCSPNACYRF    ET SDS+ +RLRI+P CT   TG GSC+Q+ TV  N S FITKDFQG GPRV+VRSIK IRKGEAVTIAYCDLLQPKAMRQSEL
Subjt:  NHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL

Query:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVS-LFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLL
         SRY+F C CQRCS KP TYVD ALQEI A  V  L DSTSISNF  D A+ RI DYV++AI++YLSIGSPESCCEKL+N+LT GF ++QA   +GK LL
Subjt:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVS-LFDSTSISNFYDDNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLL

Query:  NLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWA
        NLRLHP+H+L LN YTALASAYKVRS +          D+EN  N STMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGES+L L R SS W 
Subjt:  NLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWA

Query:  ADISKWSFPMDKRMCSKCTWVNSFNSSRIHGR--DVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTKDI
        ++ SK S PM +  C  C+WV+ FN+SRIHGR  +VDF   SIG  +CIANIS + WSFLTH CPYLKAFTDPFDFSWPKT  +         C   +D 
Subjt:  ADISKWSFPMDKRMCSKCTWVNSFNSSRIHGR--DVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTKDI

Query:  ICQ-CETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM
        +C   + Q  S+++RQ IFELG+HCLFYG YLAS+CYGH SHL+SQIQ IL +M
Subjt:  ICQ-CETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEM

SwissProt top hitse value%identityAlignment
O94256 SET domain and MYND-type zinc finger protein 62.5e-0424.83Show/hide
Query:  LEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGS
        L + L C +  NA+++  S+  ++G+ + D   C +NHSC PN    F                                                    
Subjt:  LEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGS

Query:  GPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC
        G  V + S + I+K E + I+Y D+  PK++RQ +L  +Y FSC C RC
Subjt:  GPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC

Q3ECY6 Protein SET DOMAIN GROUP 415.0e-9836.83Show/hide
Query:  MEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS---PISDSLRHCSAKCSHSHSDSIASG---------AADLRASFRLLRLALSNPSVWHSD
        ME+RA EDIE+  D+ PPL P+ S+L+ SFL +HCSSCFS   P      +CSA CS + S + +            +D+R S  L    L++ +V  S 
Subjt:  MEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS---PISDSLRHCSAKCSHSHSDSIASG---------AADLRASFRLLRLALSNPSVWHSD

Query:  PPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSP
         P R+  LLTN   LM          +  + + I + A  +A   R+       +N  LEEA +C VLTNAV+V +SNG  +GIA+Y+ SF WINHSCSP
Subjt:  PPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSP

Query:  NACYRFLLESETNSD------SVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL
        N+CYRF+    +  D         S L +     G     G+                    G+GP+++VRSIKRI+ GE +T++Y DLLQP  +RQS+L
Subjt:  NACYRFLLESETNSD------SVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL

Query:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNF----YDDNAVRRITDYVDDAISDYLSIG-SPESCCEKLENVLTSGFSEDQAKYEEG
        WS+Y+F C C RC+  P  YVD  L+ +   +    + T++ +F      D AV ++ DY+ +AI D+LS    P++CCE +E+VL  G      +++E 
Subjt:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNF----YDDNAVRRITDYVDDAISDYLSIG-SPESCCEKLENVLTSGFSEDQAKYEEG

Query:  KPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSS
             LRLH  HY++LNAY  LA+AY++RS             D        MSR SAAYSLFLAG +HHLF +E S   SAA  W  AGE +  L    
Subjt:  KPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSS

Query:  SFWAADISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTK
              + + S   D + C+KC  + + NS R      D    S    SC+ +ISQ  WSFLT GCPYL+ F  P DFS  +T                 
Subjt:  SFWAADISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTK

Query:  DIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQ
              E +  S ++   +  L  HCL Y   L  LCYG  SHL S+ +
Subjt:  DIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQ

Q9CWR2 Histone-lysine N-methyltransferase SMYD33.8e-0523.4Show/hide
Query:  VLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRS
        V+ N+  + N+  + +G+ +Y PS   +NHSC PN    F                                                   +GP +++R+
Subjt:  VLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRS

Query:  IKRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC
        ++ I  GE +TI Y D+L     R+ +L  +Y F C C RC
Subjt:  IKRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC

Q9H7B4 Histone-lysine N-methyltransferase SMYD38.4e-0523.4Show/hide
Query:  VLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRS
        V+ N+  + N+  + +G+ +Y PS   +NHSC PN    F                                                   +GP +++R+
Subjt:  VLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRS

Query:  IKRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC
        ++ I  GE +TI Y D+L     R+ +L  +Y F C C RC
Subjt:  IKRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRC

Arabidopsis top hitse value%identityAlignment
AT1G43245.1 SET domain-containing protein3.5e-9936.83Show/hide
Query:  MEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS---PISDSLRHCSAKCSHSHSDSIASG---------AADLRASFRLLRLALSNPSVWHSD
        ME+RA EDIE+  D+ PPL P+ S+L+ SFL +HCSSCFS   P      +CSA CS + S + +            +D+R S  L    L++ +V  S 
Subjt:  MEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFS---PISDSLRHCSAKCSHSHSDSIASG---------AADLRASFRLLRLALSNPSVWHSD

Query:  PPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSP
         P R+  LLTN   LM          +  + + I + A  +A   R+       +N  LEEA +C VLTNAV+V +SNG  +GIA+Y+ SF WINHSCSP
Subjt:  PPERIFGLLTNREKLMLPQDDDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSP

Query:  NACYRFLLESETNSD------SVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL
        N+CYRF+    +  D         S L +     G     G+                    G+GP+++VRSIKRI+ GE +T++Y DLLQP  +RQS+L
Subjt:  NACYRFLLESETNSD------SVDSRLRIAPSCTGPETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSEL

Query:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNF----YDDNAVRRITDYVDDAISDYLSIG-SPESCCEKLENVLTSGFSEDQAKYEEG
        WS+Y+F C C RC+  P  YVD  L+ +   +    + T++ +F      D AV ++ DY+ +AI D+LS    P++CCE +E+VL  G      +++E 
Subjt:  WSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNF----YDDNAVRRITDYVDDAISDYLSIG-SPESCCEKLENVLTSGFSEDQAKYEEG

Query:  KPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSS
             LRLH  HY++LNAY  LA+AY++RS             D        MSR SAAYSLFLAG +HHLF +E S   SAA  W  AGE +  L    
Subjt:  KPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSS

Query:  SFWAADISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTK
              + + S   D + C+KC  + + NS R      D    S    SC+ +ISQ  WSFLT GCPYL+ F  P DFS  +T                 
Subjt:  SFWAADISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKAFTDPFDFSWPKTTPSHSSINRSGACRKTK

Query:  DIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQ
              E +  S ++   +  L  HCL Y   L  LCYG  SHL S+ +
Subjt:  DIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATAGAGATGGAAATGAGGGCAACAGAAGACATTGAAATGGGGGAAGATATTACTCCGCCATTGCCGCCCGTTACCTCAGCTCTCCACCCTTCCTTCCTCCTCACTCACTG
CTCCTCCTGCTTCTCCCCAATCTCCGATTCCCTCCGCCACTGCTCCGCCAAATGCTCCCATTCCCATTCCGATTCCATCGCTTCCGGCGCCGCCGACCTCCGCGCCTCCT
TCCGCCTCCTCCGCCTCGCCCTCTCTAATCCCTCTGTTTGGCACTCCGATCCTCCCGAGCGCATCTTCGGCCTTCTCACCAATCGCGAGAAACTGATGCTGCCCCAAGAC
GACGACGACGGCGACGACGAGGGTTTACTGAGAATTCGAATTCGGAACGGCGCCGAGGCCATGGCCGCTTCCAGAAGGACGGGCTCTGCCGATGGATGCCATGAGAACAA
CGCCTTGGAGGAGGCCCTCCTCTGCCTCGTCTTGACCAACGCCGTCGATGTTCAGAATTCCAACGGCCGCACCATCGGAATCGCTGTGTACGATCCTTCCTTCTGCTGGA
TCAATCACAGTTGCTCTCCCAACGCTTGTTACAGATTTCTACTTGAATCCGAAACTAATTCGGATTCTGTCGATTCGAGGCTGCGGATTGCTCCCAGCTGCACTGGCCCC
GAGACTGGTGGAGGAAGTTGTAATCAAATTGGTACTGTTCATAGCAATCCGTCGGATTTCATAACAAAAGATTTTCAGGGTTCTGGTCCAAGAGTCGTGGTTAGGAGTAT
AAAGAGAATAAGGAAAGGTGAGGCAGTCACAATTGCATACTGTGACTTGTTGCAACCTAAGGCGATGAGACAGTCAGAGTTGTGGTCAAGGTATCAATTTTCCTGCTGTT
GCCAGCGATGTAGTATGAAGCCCCAAACTTATGTGGACCTTGCTTTGCAAGAAATTTCTGCCTTTAAAGTTAGTTTGTTTGATTCAACTTCCATTAGCAACTTCTATGAC
GACAATGCAGTGAGAAGAATAACTGATTATGTCGATGATGCAATTTCGGATTACCTTTCCATTGGTTCTCCTGAATCTTGTTGCGAGAAGCTGGAAAACGTGCTTACTTC
AGGTTTCAGTGAAGATCAAGCAAAATATGAGGAAGGAAAACCGCTGCTTAATTTGAGGCTGCATCCTTTGCACTACTTGTCACTGAATGCATACACTGCTCTGGCCTCGG
CTTATAAAGTGCGATCGAGTGATTTATTGGCTTTAGATTCCAAAATAGACGATGACGATGAAAATCTACGTAATGCATCGACCATGAGCAGAACAAGTGCAGCATACTCC
TTGTTCCTTGCAGGTGCTACTCACCATCTTTTTCTTTCTGAACCATCTTTGATTGCGTCTGCTGCTAATTGTTGGGTTGTTGCTGGAGAGTCTATGCTTATTCTTTGTAG
AAGCAGCTCATTTTGGGCTGCTGACATTTCAAAGTGGAGTTTCCCTATGGATAAAAGAATGTGTTCTAAATGCACATGGGTCAATAGCTTCAATTCTAGTCGAATTCACG
GTCGAGATGTCGATTTTCACGGCATCTCAATCGGTACTTTTAGTTGCATTGCTAATATTTCTCAAAGATGTTGGAGCTTTCTGACTCATGGTTGCCCATATTTGAAGGCT
TTCACAGACCCTTTTGATTTCAGCTGGCCTAAGACGACCCCATCGCATTCGAGCATCAATCGCTCAGGTGCGTGTCGGAAAACTAAAGATATTATTTGTCAGTGTGAAAC
TCAGGTGCATTCTAACGAAGAGAGGCAATGGATCTTTGAGCTTGGAATGCATTGCTTATTCTATGGGGCCTATTTAGCAAGTTTGTGTTATGGACACCATTCCCATTTGG
CATCTCAGATTCAGAATATTTTGGACGAGATGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATAGAGATGGAAATGAGGGCAACAGAAGACATTGAAATGGGGGAAGATATTACTCCGCCATTGCCGCCCGTTACCTCAGCTCTCCACCCTTCCTTCCTCCTCACTCACTG
CTCCTCCTGCTTCTCCCCAATCTCCGATTCCCTCCGCCACTGCTCCGCCAAATGCTCCCATTCCCATTCCGATTCCATCGCTTCCGGCGCCGCCGACCTCCGCGCCTCCT
TCCGCCTCCTCCGCCTCGCCCTCTCTAATCCCTCTGTTTGGCACTCCGATCCTCCCGAGCGCATCTTCGGCCTTCTCACCAATCGCGAGAAACTGATGCTGCCCCAAGAC
GACGACGACGGCGACGACGAGGGTTTACTGAGAATTCGAATTCGGAACGGCGCCGAGGCCATGGCCGCTTCCAGAAGGACGGGCTCTGCCGATGGATGCCATGAGAACAA
CGCCTTGGAGGAGGCCCTCCTCTGCCTCGTCTTGACCAACGCCGTCGATGTTCAGAATTCCAACGGCCGCACCATCGGAATCGCTGTGTACGATCCTTCCTTCTGCTGGA
TCAATCACAGTTGCTCTCCCAACGCTTGTTACAGATTTCTACTTGAATCCGAAACTAATTCGGATTCTGTCGATTCGAGGCTGCGGATTGCTCCCAGCTGCACTGGCCCC
GAGACTGGTGGAGGAAGTTGTAATCAAATTGGTACTGTTCATAGCAATCCGTCGGATTTCATAACAAAAGATTTTCAGGGTTCTGGTCCAAGAGTCGTGGTTAGGAGTAT
AAAGAGAATAAGGAAAGGTGAGGCAGTCACAATTGCATACTGTGACTTGTTGCAACCTAAGGCGATGAGACAGTCAGAGTTGTGGTCAAGGTATCAATTTTCCTGCTGTT
GCCAGCGATGTAGTATGAAGCCCCAAACTTATGTGGACCTTGCTTTGCAAGAAATTTCTGCCTTTAAAGTTAGTTTGTTTGATTCAACTTCCATTAGCAACTTCTATGAC
GACAATGCAGTGAGAAGAATAACTGATTATGTCGATGATGCAATTTCGGATTACCTTTCCATTGGTTCTCCTGAATCTTGTTGCGAGAAGCTGGAAAACGTGCTTACTTC
AGGTTTCAGTGAAGATCAAGCAAAATATGAGGAAGGAAAACCGCTGCTTAATTTGAGGCTGCATCCTTTGCACTACTTGTCACTGAATGCATACACTGCTCTGGCCTCGG
CTTATAAAGTGCGATCGAGTGATTTATTGGCTTTAGATTCCAAAATAGACGATGACGATGAAAATCTACGTAATGCATCGACCATGAGCAGAACAAGTGCAGCATACTCC
TTGTTCCTTGCAGGTGCTACTCACCATCTTTTTCTTTCTGAACCATCTTTGATTGCGTCTGCTGCTAATTGTTGGGTTGTTGCTGGAGAGTCTATGCTTATTCTTTGTAG
AAGCAGCTCATTTTGGGCTGCTGACATTTCAAAGTGGAGTTTCCCTATGGATAAAAGAATGTGTTCTAAATGCACATGGGTCAATAGCTTCAATTCTAGTCGAATTCACG
GTCGAGATGTCGATTTTCACGGCATCTCAATCGGTACTTTTAGTTGCATTGCTAATATTTCTCAAAGATGTTGGAGCTTTCTGACTCATGGTTGCCCATATTTGAAGGCT
TTCACAGACCCTTTTGATTTCAGCTGGCCTAAGACGACCCCATCGCATTCGAGCATCAATCGCTCAGGTGCGTGTCGGAAAACTAAAGATATTATTTGTCAGTGTGAAAC
TCAGGTGCATTCTAACGAAGAGAGGCAATGGATCTTTGAGCTTGGAATGCATTGCTTATTCTATGGGGCCTATTTAGCAAGTTTGTGTTATGGACACCATTCCCATTTGG
CATCTCAGATTCAGAATATTTTGGACGAGATGAAATGATTTAATGCTAATCCAATAAGATTGTAAATTGTTATGAGATTGAAACTTTGTATAGACTCCTGCAGTTGTATG
AAATCAGGATTTTAGTTTCTGTATTCGAACATGGTTTTGTAGGACCAATGAGGAAGAATTAAATGGATTTAATTTTCGACTTGTTTTTGG
Protein sequenceShow/hide protein sequence
IEMEMRATEDIEMGEDITPPLPPVTSALHPSFLLTHCSSCFSPISDSLRHCSAKCSHSHSDSIASGAADLRASFRLLRLALSNPSVWHSDPPERIFGLLTNREKLMLPQD
DDDGDDEGLLRIRIRNGAEAMAASRRTGSADGCHENNALEEALLCLVLTNAVDVQNSNGRTIGIAVYDPSFCWINHSCSPNACYRFLLESETNSDSVDSRLRIAPSCTGP
ETGGGSCNQIGTVHSNPSDFITKDFQGSGPRVVVRSIKRIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCCCQRCSMKPQTYVDLALQEISAFKVSLFDSTSISNFYD
DNAVRRITDYVDDAISDYLSIGSPESCCEKLENVLTSGFSEDQAKYEEGKPLLNLRLHPLHYLSLNAYTALASAYKVRSSDLLALDSKIDDDDENLRNASTMSRTSAAYS
LFLAGATHHLFLSEPSLIASAANCWVVAGESMLILCRSSSFWAADISKWSFPMDKRMCSKCTWVNSFNSSRIHGRDVDFHGISIGTFSCIANISQRCWSFLTHGCPYLKA
FTDPFDFSWPKTTPSHSSINRSGACRKTKDIICQCETQVHSNEERQWIFELGMHCLFYGAYLASLCYGHHSHLASQIQNILDEMK