; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018860 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018860
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein SET DOMAIN GROUP 41
Genome locationtig00153226:431861..435135
RNA-Seq ExpressionSgr018860
SyntenySgr018860
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463080.1 PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo]8.4e-24768.22Show/hide
Query:  METRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKC--SDFDSVTPRFFSFHHLP--FPDAADLRASFRL--LHLV
        ME RA+ED EM EDITPPL PLTSALHD FL THCSSCFS LP+ P SHS L  YCS KC  S  D +T  FFS H LP    D +DLRAS RL  LHL+
Subjt:  METRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKC--SDFDSVTPRFFSFHHLP--FPDAADLRASFRL--LHLV

Query:  LSNPSAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHS
        LS+PS   S PP RIFGLLTNR KLM  ++  EVF+ +R+ A A+A  R    +D+    ALEEAVLCLVLTNAV+VQ++ G+TIGIAVY PTF WINHS
Subjt:  LSNPSAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHS

Query:  CSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSE
        CSPNACYRF        ET S    T  RIAPSCTD  S EG+C Q+G V  N+ DF+ +DFQG GP+VVVRSIK ++KGEAVTI YCDLLQPKA RQSE
Subjt:  CSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSE

Query:  LWSRYQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLL
        LWSRYQFVC CQRCSA PL YVDHALQEISAVKVEL+DS  ISNFDH+ AVRRI++YVDNAITEYLSIGSPESCCEKL+NLL  GF +EQV+D EGKQ +
Subjt:  LWSRYQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLL

Query:  NLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKM--DDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWP
        +L LHP H+L LNAYTAL SAYK RSCD LAL+S+M  D+E+ H+A TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGESLLIL R  SLW 
Subjt:  NLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKM--DDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWP

Query:  TCHGTN--LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDC
        T   T+   + L K MCSNCSWVD+FN SRIHGR I A+F+EFSIGI NCIA+IS+K WSFL HGCPYLKAF DPFDFSWPKT     N+ D+   GID 
Subjt:  TCHGTN--LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDC

Query:  SGAYCRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN
        S A  +TKDI  +C+ Q  SNQER+S+  LGIHCL++GGYLAS+CYG+HSHLAS+IQNIL+++N
Subjt:  SGAYCRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN

XP_022932824.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata]4.9e-25569.97Show/hide
Query:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP
        MEME RAMED EM EDITPPLPPLT+ALHD F LTHCSSCFSPLP+S  SHSNL RYCS  CS  DS+T   FS  H PF D +DLRAS RLLHL+LS+ 
Subjt:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP

Query:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN
        SAW S PPERIFGLLTNREKLMLAED+ EVFV IRKGA+AMA SR T S+D+ ++NALEEA+LCLVLTNAVEVQ++ G+TIGIAVY PTFCWINHSCSPN
Subjt:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR
        ACYRF        ET S SI T LRI+P CTD  + EGSC Q+ TV  N S FITKDFQGYGP+V+VRSIKS+RKGEAVTI YCDLLQPKA+RQSEL SR
Subjt:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR

Query:  YQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLSL
        Y+FVC CQRCSAKP  YVDHALQEISA  VEL+DSTSISNFD++ A+RRI+DYV+NAI EYLSIGSPESCCEKL+NLL LGF++EQ +D +GKQLLNL L
Subjt:  YQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLSL

Query:  HPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGTN
        HP+H+L LN YTALASAYK RS +        DDE+  +A TMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGESLLIL +  SLW +    +
Subjt:  HPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGTN

Query:  LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRTK
           + +I C NCSWVDKFN +RIHGR+I A+F+EFSIGI NCIA+IS K WSFLAH C YLKAF DPFDFSWPKT  T  N      +  DCS    + +
Subjt:  LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRTK

Query:  DIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN
        D+         S Q+RQS+FELGIHCLF+GGYLAS+CYGH SHLAS+I+ IL +MN
Subjt:  DIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN

XP_022974027.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima]2.5e-25469.56Show/hide
Query:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP
        MEME RAMED EM EDITPPLPPLT+ALHD FLLTHCSSCFSPLP+SP SHSNL RYCS  CS  DS+T   FS  H  F D +DLRAS RLLHL+LS+ 
Subjt:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP

Query:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN
        SAW S PPERIFGLLTNREKLMLA+D+ EVF  IRKGA+A+A SR T S+D+ ++NALEEA++CLVLTNAVEVQ++ G+TIGIAVY PTFCWINHSCSPN
Subjt:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR
        ACYRF        ET S SI+T LRI+P CTD  + EGSC Q+ TV  N S FITKDFQGYGP+V+VRSIKS+RKGEAVTI YCDLLQPKA+RQSEL SR
Subjt:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR

Query:  YQFVCCCQRCSAKPLAYVDHALQEISAVKV-ELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLS
        Y+FVC CQRCSAKP  YVDHALQEI AV V EL+DSTSISNFD++ A+ RI+DYV+NAI EYLSIGSPESCCEKL+NLL LGF++EQ  D +GKQLLNL 
Subjt:  YQFVCCCQRCSAKPLAYVDHALQEISAVKV-ELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLS

Query:  LHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGT
        LHP+H+L LN YTALASAYK RS +        D+E+  + STMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGESLL L R  SLW +    
Subjt:  LHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGT

Query:  NLYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRT
        +   + +I C NCSWVDKFN SRIHGR+I  +FQEFSIGI NCIANIS K WSFL H CPYLKAF DPFDFSWPKT  T SN RD           Y + 
Subjt:  NLYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRT

Query:  KDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN
        +D+         S+Q+RQS+FELGIHCLF+GGYLAS+CYGH SHL+S+IQ IL +MN
Subjt:  KDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN

XP_023520942.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo]1.6e-25870.88Show/hide
Query:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP
        MEME RAMED EM EDITPPLPPLT+ALHD FLLTHCSSCFSPLP+S  SHSNL RYCS  CS  DS+T   FS    PF D +DLRAS RLLHL+LS+P
Subjt:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP

Query:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN
        SAW S PPERIFGLLTNREKLMLA+D+ EVFV IR+G++AMA SR T S+D+ ++NALEEA+LCLVLTNAVEVQ++ GRTIGIAVY PTFCWINHSCSPN
Subjt:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR
        ACYRF        ET S SI+T LRI+P CTD  + EGSC Q+ TV  N S FITKDFQGYGP+V+VRSIKS+R GEAVTI YCDLLQPKA+RQSEL SR
Subjt:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR

Query:  YQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLSL
        Y+FVC CQRCSAKP  YVDHALQEISAV VEL+DSTSISNFD++ A+ RI+DYV+NAI EYLSIGS ESCCEKL+NLL LGF++EQ +D +GKQLLNL L
Subjt:  YQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLSL

Query:  HPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGTN
        HP+H+L LNAYTALASAYK RS +         DE+  +A TMS+TSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLIL +  SLW +    +
Subjt:  HPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGTN

Query:  LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRTK
           + +I C NCSWVDKFN SRIHGR+I A+F+EFSIGI NCIANISQK WSFLAH C YLKAF DPFDFSWPKT  T SN RD   +  DCS    + +
Subjt:  LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRTK

Query:  DIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN
        D+         S+Q+RQS+FELGIHCLF+GGYLAS+CYGHHSHLAS+IQ IL +MN
Subjt:  DIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN

XP_038886411.1 protein SET DOMAIN GROUP 41 [Benincasa hispida]1.2e-25368.78Show/hide
Query:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKC--SDFDSVTPRFFSFHHLPFPDA--ADLRASFRLLHLV
        MEME  AMED EM EDITPPL PLTSALHD FL THCSSCFS LP+ P SHSNL RYCS KC  S  D +T  FFS H  P P +  +DLRAS RLLHL+
Subjt:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKC--SDFDSVTPRFFSFHHLPFPDA--ADLRASFRLLHLV

Query:  LSNPSAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHS
        LS+P A  S PPERIFGLLTNR KLM  + + E+F  +R+G +A+A      S+D+ H + L EA LCLV TNAV+V ++ GRTIGIAVY PTFCWINHS
Subjt:  LSNPSAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHS

Query:  CSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSE
        CSPNACYRF        ET+S S  T  RIAPSCTD  + +GSC Q+GTV  NLSDFIT+DFQG GP+V+VRSIKS+R+GEAVTI YCDLLQPKA+RQSE
Subjt:  CSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSE

Query:  LWSRYQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLL
        LWSRYQFVC CQRCSAKPL YVDHALQE+SA KVEL DSTSISNFDH+ AVRRI+DYV++AITEYLSIGSPESCCEKL NLL LGF++EQ +D E KQ +
Subjt:  LWSRYQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLL

Query:  NLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKM--DDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWP
        NL LHPLH+LSLN YTALASAYK RSCD LAL+S+M  D+E   +ASTM + SAAYSLFLAGATHHLFLSEPSLI SA+ CWV+AGESLL L R   LW 
Subjt:  NLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKM--DDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWP

Query:  TCHGTNL-YLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCS
        T + +   + + K MCS CSWVDKFNASRIHG+ I A+F+EFSIGI NCIAN+S+KSWSFL HGCPYLKAF DPF+FSWPK  P YS++RD++A  ID  
Subjt:  TCHGTNL-YLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCS

Query:  GAYCRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN
         A   +KD+  QC+ Q HSNQER+S+  LGIHCLF+GGYLAS+CYGHHSHLAS+IQNIL ++N
Subjt:  GAYCRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN

TrEMBL top hitse value%identityAlignment
A0A0A0KAK3 SET domain-containing protein8.8e-24266.32Show/hide
Query:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKC--SDFDSVTPRFFSFHHLPFPDA----ADLRASFRLLH
        MEME  A+ED EM EDI+PPL PLTSALHD FL THCSSCFS LP+ P SHS    YCS KC  S  D +T  FFS H  PFPDA    +DLRAS RLLH
Subjt:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKC--SDFDSVTPRFFSFHHLPFPDA----ADLRASFRLLH

Query:  LVLSNPSAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWIN
        L+LS+PS   S PP+RI+GLLTNR KLM  +++ EVF+ +R+GA A+A  R    +D+    ALEEAVLCLVLTNAV+VQ++ G+TIGIAVY  TF WIN
Subjt:  LVLSNPSAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWIN

Query:  HSCSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKD--FQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKAL
        HSCSPNACYRF        ET S S+ T  RIAPSCTD  S EGSC Q+G V  N+ DFI +     G GP+VVVRSIK ++KGEAVTI YCDLLQPKA 
Subjt:  HSCSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKD--FQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKAL

Query:  RQSELWSRYQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEG
        RQSELWSRYQFVC CQRCSA PL YVDHALQEIS+VKVEL+DST ISNFDH+ AVRRI++YVDNAITEYLS  SPESCCEKL+NLL  GF +EQV+D EG
Subjt:  RQSELWSRYQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEG

Query:  KQLLNLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDE--HLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSL
        KQ ++L LHPLH+L LNAYTAL SAYK RSCD +AL+S+MD +  + H+A TM +TSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGESLLIL R  
Subjt:  KQLLNLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDE--HLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSL

Query:  SLWPTCHGTN--LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQ
        SLW T   T+  ++ L K MC NCSWVD+FNASRIHG+ + A+F+EFSIGI NCIA+ISQK WS L HGCPYLKAF  PFDFSWPKT     N +D+  +
Subjt:  SLWPTCHGTN--LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQ

Query:  GIDCSGAYCRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN
        GID S A  +T+D+  +C  Q  SNQER+S+  LGIHCL++GGYLAS+CYGHHSHLAS+IQNIL+++N
Subjt:  GIDCSGAYCRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN

A0A1S3CIT0 protein SET DOMAIN GROUP 41 isoform X14.1e-24768.22Show/hide
Query:  METRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKC--SDFDSVTPRFFSFHHLP--FPDAADLRASFRL--LHLV
        ME RA+ED EM EDITPPL PLTSALHD FL THCSSCFS LP+ P SHS L  YCS KC  S  D +T  FFS H LP    D +DLRAS RL  LHL+
Subjt:  METRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKC--SDFDSVTPRFFSFHHLP--FPDAADLRASFRL--LHLV

Query:  LSNPSAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHS
        LS+PS   S PP RIFGLLTNR KLM  ++  EVF+ +R+ A A+A  R    +D+    ALEEAVLCLVLTNAV+VQ++ G+TIGIAVY PTF WINHS
Subjt:  LSNPSAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHS

Query:  CSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSE
        CSPNACYRF        ET S    T  RIAPSCTD  S EG+C Q+G V  N+ DF+ +DFQG GP+VVVRSIK ++KGEAVTI YCDLLQPKA RQSE
Subjt:  CSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSE

Query:  LWSRYQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLL
        LWSRYQFVC CQRCSA PL YVDHALQEISAVKVEL+DS  ISNFDH+ AVRRI++YVDNAITEYLSIGSPESCCEKL+NLL  GF +EQV+D EGKQ +
Subjt:  LWSRYQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLL

Query:  NLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKM--DDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWP
        +L LHP H+L LNAYTAL SAYK RSCD LAL+S+M  D+E+ H+A TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGESLLIL R  SLW 
Subjt:  NLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKM--DDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWP

Query:  TCHGTN--LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDC
        T   T+   + L K MCSNCSWVD+FN SRIHGR I A+F+EFSIGI NCIA+IS+K WSFL HGCPYLKAF DPFDFSWPKT     N+ D+   GID 
Subjt:  TCHGTN--LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDC

Query:  SGAYCRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN
        S A  +TKDI  +C+ Q  SNQER+S+  LGIHCL++GGYLAS+CYG+HSHLAS+IQNIL+++N
Subjt:  SGAYCRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN

A0A6J1EY39 protein SET DOMAIN GROUP 41 isoform X12.4e-25569.97Show/hide
Query:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP
        MEME RAMED EM EDITPPLPPLT+ALHD F LTHCSSCFSPLP+S  SHSNL RYCS  CS  DS+T   FS  H PF D +DLRAS RLLHL+LS+ 
Subjt:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP

Query:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN
        SAW S PPERIFGLLTNREKLMLAED+ EVFV IRKGA+AMA SR T S+D+ ++NALEEA+LCLVLTNAVEVQ++ G+TIGIAVY PTFCWINHSCSPN
Subjt:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR
        ACYRF        ET S SI T LRI+P CTD  + EGSC Q+ TV  N S FITKDFQGYGP+V+VRSIKS+RKGEAVTI YCDLLQPKA+RQSEL SR
Subjt:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR

Query:  YQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLSL
        Y+FVC CQRCSAKP  YVDHALQEISA  VEL+DSTSISNFD++ A+RRI+DYV+NAI EYLSIGSPESCCEKL+NLL LGF++EQ +D +GKQLLNL L
Subjt:  YQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLSL

Query:  HPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGTN
        HP+H+L LN YTALASAYK RS +        DDE+  +A TMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGESLLIL +  SLW +    +
Subjt:  HPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGTN

Query:  LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRTK
           + +I C NCSWVDKFN +RIHGR+I A+F+EFSIGI NCIA+IS K WSFLAH C YLKAF DPFDFSWPKT  T  N      +  DCS    + +
Subjt:  LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRTK

Query:  DIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN
        D+         S Q+RQS+FELGIHCLF+GGYLAS+CYGH SHLAS+I+ IL +MN
Subjt:  DIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN

A0A6J1F365 protein SET DOMAIN GROUP 41 isoform X22.7e-21472.25Show/hide
Query:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP
        MEME RAMED EM EDITPPLPPLT+ALHD F LTHCSSCFSPLP+S  SHSNL RYCS  CS  DS+T   FS  H PF D +DLRAS RLLHL+LS+ 
Subjt:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP

Query:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN
        SAW S PPERIFGLLTNREKLMLAED+ EVFV IRKGA+AMA SR T S+D+ ++NALEEA+LCLVLTNAVEVQ++ G+TIGIAVY PTFCWINHSCSPN
Subjt:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR
        ACYRF        ET S SI T LRI+P CTD  + EGSC Q+ TV  N S FITKDFQGYGP+V+VRSIKS+RKGEAVTI YCDLLQPKA+RQSEL SR
Subjt:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR

Query:  YQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLSL
        Y+FVC CQRCSAKP  YVDHALQEISA  VEL+DSTSISNFD++ A+RRI+DYV+NAI EYLSIGSPESCCEKL+NLL LGF++EQ +D +GKQLLNL L
Subjt:  YQFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLSL

Query:  HPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGTN
        HP+H+L LN YTALASAYK RS +        DDE+  +A TMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGESLLIL +  SLW +    +
Subjt:  HPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGTN

Query:  LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSI
           + +I C NCSWVDKFN +RIHGR+I A+F+EFSI
Subjt:  LYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSI

A0A6J1I954 protein SET DOMAIN GROUP 41 isoform X11.2e-25469.56Show/hide
Query:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP
        MEME RAMED EM EDITPPLPPLT+ALHD FLLTHCSSCFSPLP+SP SHSNL RYCS  CS  DS+T   FS  H  F D +DLRAS RLLHL+LS+ 
Subjt:  MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNP

Query:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN
        SAW S PPERIFGLLTNREKLMLA+D+ EVF  IRKGA+A+A SR T S+D+ ++NALEEA++CLVLTNAVEVQ++ G+TIGIAVY PTFCWINHSCSPN
Subjt:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR
        ACYRF        ET S SI+T LRI+P CTD  + EGSC Q+ TV  N S FITKDFQGYGP+V+VRSIKS+RKGEAVTI YCDLLQPKA+RQSEL SR
Subjt:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSC-QIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSR

Query:  YQFVCCCQRCSAKPLAYVDHALQEISAVKV-ELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLS
        Y+FVC CQRCSAKP  YVDHALQEI AV V EL+DSTSISNFD++ A+ RI+DYV+NAI EYLSIGSPESCCEKL+NLL LGF++EQ  D +GKQLLNL 
Subjt:  YQFVCCCQRCSAKPLAYVDHALQEISAVKV-ELVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLS

Query:  LHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGT
        LHP+H+L LN YTALASAYK RS +        D+E+  + STMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGESLL L R  SLW +    
Subjt:  LHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGT

Query:  NLYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRT
        +   + +I C NCSWVDKFN SRIHGR+I  +FQEFSIGI NCIANIS K WSFL H CPYLKAF DPFDFSWPKT  T SN RD           Y + 
Subjt:  NLYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRT

Query:  KDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN
        +D+         S+Q+RQS+FELGIHCLF+GGYLAS+CYGH SHL+S+IQ IL +MN
Subjt:  KDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMN

SwissProt top hitse value%identityAlignment
Q3ECY6 Protein SET DOMAIN GROUP 415.4e-10338.44Show/hide
Query:  METRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVT--PRFFSFHHLPFPDAADLRASFRLLHLVLSNP
        ME RA ED E+  D+ PPL PL S+L+D FL +HCSSCFS LP SP        YCSA CS  DS T  P+F        P  +D+R S   LHL L++ 
Subjt:  METRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVT--PRFFSFHHLPFPDAADLRASFRLLHLVLSNP

Query:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN
        +   S  P R+  LLTN   LM    +  + V I   A  +A    +   +      LEEA +C VLTNAVEV +++G  +GIA+Y+ +F WINHSCSPN
Subjt:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSCQIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSRY
        +CYRF+    +  +    + ET+  +       E  E  C  GT L            G GPK++VRSIK ++ GE +T++Y DLLQP  LRQS+LWS+Y
Subjt:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSCQIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSRY

Query:  QFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFD----HENAVRRINDYVDNAITEYLSIG-SPESCCEKLENLLALGFFEEQVKDKEGKQLL
        +F+C C RC+A P AYVD  L+ +  ++ E    T++ +FD     + AV ++NDY+  AI ++LS    P++CCE +E++L  G     ++ KE  Q  
Subjt:  QFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFD----HENAVRRINDYVDNAITEYLSIG-SPESCCEKLENLLALGFFEEQVKDKEGKQLL

Query:  NLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTC
         L LH  HY++LNAY  LA+AY+ RS DS                 MSR SAAYSLFLAG +HHLF +E S   SAA  W  AGE L  L   L +    
Subjt:  NLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTC

Query:  HGTNLYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAY
            L +   + C+ C  ++  N+ R        + +E S  I +C+ +ISQ +WSFL  GCPYL+ FR P DFS  +T                     
Subjt:  HGTNLYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAY

Query:  CRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQ
                  + +  S  +  +V  L  HCL +   L  +CYG  SHL SR +
Subjt:  CRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQ

Q9CWR2 Histone-lysine N-methyltransferase SMYD32.6e-0421.01Show/hide
Query:  SHSNLFRYCSAK------------CSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNPSAWHSGPPERIFGLLTNREKL-MLAEDEDEVFVGIRK
        S   + +YCSAK            CS   S  PR+        PD      S RLL  V+           E+++        +  L ED+ E    +  
Subjt:  SHSNLFRYCSAK------------CSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNPSAWHSGPPERIFGLLTNREKL-MLAEDEDEVFVGIRK

Query:  GAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPESI
          +         +S +     L EA    V+ N+  + N + + +G+ +Y P+   +NHSC PN    F                               
Subjt:  GAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPESI

Query:  EGSCQIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSRYQFVCCCQRCSAK
                                GP +++R+++ +  GE +TI Y D+L     R+ +L  +Y F C C RC  +
Subjt:  EGSCQIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSRYQFVCCCQRCSAK

Q9H7B4 Histone-lysine N-methyltransferase SMYD32.6e-0421.66Show/hide
Query:  SHSNLFRYCSAKCSD------------FDSVTPRFFSFHHLPFPDAADL--RASFRLLHLVLSNPSAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIR
        S   + +YCSAKC                S  PR+        PD+  L  R  F+L+    S     +S      + L +N  K  L ED+ E    + 
Subjt:  SHSNLFRYCSAKCSD------------FDSVTPRFFSFHHLPFPDAADL--RASFRLLHLVLSNPSAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIR

Query:  KGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPES
           +         +S +     L EA    V+ N+  + N + + +G+ +Y P+   +NHSC PN    F                              
Subjt:  KGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPNACYRFLLQPLTETETTSGSIETALRIAPSCTDPES

Query:  IEGSCQIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSRYQFVCCCQRCSAK
                                 GP +++R+++ +  GE +TI Y D+L     R+ +L  +Y F C C RC  +
Subjt:  IEGSCQIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSRYQFVCCCQRCSAK

Arabidopsis top hitse value%identityAlignment
AT1G43245.1 SET domain-containing protein3.8e-10438.44Show/hide
Query:  METRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVT--PRFFSFHHLPFPDAADLRASFRLLHLVLSNP
        ME RA ED E+  D+ PPL PL S+L+D FL +HCSSCFS LP SP        YCSA CS  DS T  P+F        P  +D+R S   LHL L++ 
Subjt:  METRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVT--PRFFSFHHLPFPDAADLRASFRLLHLVLSNP

Query:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN
        +   S  P R+  LLTN   LM    +  + V I   A  +A    +   +      LEEA +C VLTNAVEV +++G  +GIA+Y+ +F WINHSCSPN
Subjt:  SAWHSGPPERIFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSCQIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSRY
        +CYRF+    +  +    + ET+  +       E  E  C  GT L            G GPK++VRSIK ++ GE +T++Y DLLQP  LRQS+LWS+Y
Subjt:  ACYRFLLQPLTETETTSGSIETALRIAPSCTDPESIEGSCQIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSRY

Query:  QFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFD----HENAVRRINDYVDNAITEYLSIG-SPESCCEKLENLLALGFFEEQVKDKEGKQLL
        +F+C C RC+A P AYVD  L+ +  ++ E    T++ +FD     + AV ++NDY+  AI ++LS    P++CCE +E++L  G     ++ KE  Q  
Subjt:  QFVCCCQRCSAKPLAYVDHALQEISAVKVELVDSTSISNFD----HENAVRRINDYVDNAITEYLSIG-SPESCCEKLENLLALGFFEEQVKDKEGKQLL

Query:  NLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTC
         L LH  HY++LNAY  LA+AY+ RS DS                 MSR SAAYSLFLAG +HHLF +E S   SAA  W  AGE L  L   L +    
Subjt:  NLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTC

Query:  HGTNLYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAY
            L +   + C+ C  ++  N+ R        + +E S  I +C+ +ISQ +WSFL  GCPYL+ FR P DFS  +T                     
Subjt:  HGTNLYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSWSFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAY

Query:  CRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQ
                  + +  S  +  +V  L  HCL +   L  +CYG  SHL SR +
Subjt:  CRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATGGAAACAAGGGCAATGGAAGACACAGAAATGGGGGAAGACATAACTCCGCCATTGCCTCCCCTTACCTCCGCTCTCCACGATTGTTTCCTCCTCACTCACTG
TTCCTCCTGCTTCTCCCCTCTCCCCGATTCTCCAACTTCGCACTCCAATCTCTTCCGCTACTGCTCCGCCAAATGCTCCGACTTCGATTCCGTCACTCCCCGATTCTTCT
CCTTCCATCATCTTCCCTTCCCCGACGCCGCCGACCTTCGCGCCTCCTTCCGCCTCCTCCACTTGGTCCTCTCCAACCCATCTGCTTGGCACTCTGGTCCTCCTGAGCGC
ATCTTTGGCCTTCTTACCAACCGCGAGAAATTGATGTTAGCCGAAGACGAGGACGAGGTTTTCGTGGGAATTCGGAAAGGGGCCGAGGCCATGGCCGTTTCCAGAACGAC
GGGCTCTTCCGATATGTGCCATGAGAACGCGTTGGAAGAGGCCGTCCTGTGCCTGGTGTTGACCAACGCCGTGGAGGTACAGAATACCGACGGGCGCACCATAGGAATCG
CTGTGTACGATCCTACCTTCTGCTGGATCAATCACAGCTGTTCTCCCAACGCTTGTTACAGATTTTTACTGCAGCCACTGACTGAAACTGAAACTACGTCGGGTTCCATC
GAGACGGCGCTGCGGATTGCCCCCAGCTGCACTGATCCTGAGTCTATTGAAGGAAGTTGCCAAATAGGTACTGTTCTTTGCAATCTGTCGGATTTCATAACAAAAGATTT
TCAGGGTTATGGTCCAAAAGTTGTGGTTAGGAGTATAAAGAGTGTAAGGAAAGGTGAGGCAGTCACAATCACGTACTGTGACTTGTTGCAACCTAAGGCATTGAGGCAGT
CAGAATTGTGGTCAAGGTATCAGTTTGTCTGTTGTTGCCAGCGATGTAGCGCCAAGCCCCTAGCTTATGTGGACCATGCTTTGCAAGAAATTTCTGCTGTGAAAGTGGAA
TTGGTTGATTCAACTTCCATTAGCAACTTTGATCATGAAAACGCAGTGAGAAGAATAAATGATTATGTCGACAATGCAATCACCGAGTACCTATCTATTGGTTCTCCTGA
ATCTTGTTGTGAGAAACTTGAAAATTTACTTGCTTTAGGTTTCTTCGAAGAGCAAGTAAAAGACAAGGAAGGAAAACAGCTGCTTAATCTGAGTCTGCATCCCTTGCACT
ACCTGTCCCTGAATGCGTACACAGCTCTAGCATCGGCTTATAAATTCCGTTCGTGTGATTCATTGGCTTTGACTTCCAAAATGGACGATGAACATCTACACGACGCATCT
ACCATGAGTAGAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACGCACCATCTTTTTCTTTCTGAACCATCTTTGATTGCATCTGCTGCAAATTGTTGGGTTGT
TGCTGGAGAGTCCTTGCTTATTCTTGGTAGAAGCTTATCATTATGGCCTACCTGCCATGGTACAAACCTTTATCTTCTGGAGAAAATAATGTGCTCTAATTGCTCATGGG
TCGACAAGTTCAATGCGAGTAGAATTCACGGCCGAGCTATAAATGCTAATTTTCAGGAGTTTTCAATTGGTATTTTTAATTGCATTGCTAATATTTCACAAAAAAGTTGG
AGCTTTCTTGCTCATGGCTGCCCGTATTTGAAGGCTTTCAGAGACCCCTTTGATTTCAGCTGGCCCAAGACAACCCCGACATATTCAAATAACCGAGATGTACAGGCCCA
AGGCATTGATTGTTCGGGTGCTTATTGTAGAACTAAAGATATTGTTTCTCAGTGTGACACTCAGGTGCATTCTAACCAAGAGAGGCAATCTGTGTTTGAGCTTGGTATCC
ATTGCTTATTCTTTGGTGGCTATTTAGCAAGTATGTGTTATGGCCACCATTCACATTTGGCATCTCGGATTCAAAATATTTTAGATGAGATGAATCGATACAGTTATCTA
GTAGAATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATGGAAACAAGGGCAATGGAAGACACAGAAATGGGGGAAGACATAACTCCGCCATTGCCTCCCCTTACCTCCGCTCTCCACGATTGTTTCCTCCTCACTCACTG
TTCCTCCTGCTTCTCCCCTCTCCCCGATTCTCCAACTTCGCACTCCAATCTCTTCCGCTACTGCTCCGCCAAATGCTCCGACTTCGATTCCGTCACTCCCCGATTCTTCT
CCTTCCATCATCTTCCCTTCCCCGACGCCGCCGACCTTCGCGCCTCCTTCCGCCTCCTCCACTTGGTCCTCTCCAACCCATCTGCTTGGCACTCTGGTCCTCCTGAGCGC
ATCTTTGGCCTTCTTACCAACCGCGAGAAATTGATGTTAGCCGAAGACGAGGACGAGGTTTTCGTGGGAATTCGGAAAGGGGCCGAGGCCATGGCCGTTTCCAGAACGAC
GGGCTCTTCCGATATGTGCCATGAGAACGCGTTGGAAGAGGCCGTCCTGTGCCTGGTGTTGACCAACGCCGTGGAGGTACAGAATACCGACGGGCGCACCATAGGAATCG
CTGTGTACGATCCTACCTTCTGCTGGATCAATCACAGCTGTTCTCCCAACGCTTGTTACAGATTTTTACTGCAGCCACTGACTGAAACTGAAACTACGTCGGGTTCCATC
GAGACGGCGCTGCGGATTGCCCCCAGCTGCACTGATCCTGAGTCTATTGAAGGAAGTTGCCAAATAGGTACTGTTCTTTGCAATCTGTCGGATTTCATAACAAAAGATTT
TCAGGGTTATGGTCCAAAAGTTGTGGTTAGGAGTATAAAGAGTGTAAGGAAAGGTGAGGCAGTCACAATCACGTACTGTGACTTGTTGCAACCTAAGGCATTGAGGCAGT
CAGAATTGTGGTCAAGGTATCAGTTTGTCTGTTGTTGCCAGCGATGTAGCGCCAAGCCCCTAGCTTATGTGGACCATGCTTTGCAAGAAATTTCTGCTGTGAAAGTGGAA
TTGGTTGATTCAACTTCCATTAGCAACTTTGATCATGAAAACGCAGTGAGAAGAATAAATGATTATGTCGACAATGCAATCACCGAGTACCTATCTATTGGTTCTCCTGA
ATCTTGTTGTGAGAAACTTGAAAATTTACTTGCTTTAGGTTTCTTCGAAGAGCAAGTAAAAGACAAGGAAGGAAAACAGCTGCTTAATCTGAGTCTGCATCCCTTGCACT
ACCTGTCCCTGAATGCGTACACAGCTCTAGCATCGGCTTATAAATTCCGTTCGTGTGATTCATTGGCTTTGACTTCCAAAATGGACGATGAACATCTACACGACGCATCT
ACCATGAGTAGAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACGCACCATCTTTTTCTTTCTGAACCATCTTTGATTGCATCTGCTGCAAATTGTTGGGTTGT
TGCTGGAGAGTCCTTGCTTATTCTTGGTAGAAGCTTATCATTATGGCCTACCTGCCATGGTACAAACCTTTATCTTCTGGAGAAAATAATGTGCTCTAATTGCTCATGGG
TCGACAAGTTCAATGCGAGTAGAATTCACGGCCGAGCTATAAATGCTAATTTTCAGGAGTTTTCAATTGGTATTTTTAATTGCATTGCTAATATTTCACAAAAAAGTTGG
AGCTTTCTTGCTCATGGCTGCCCGTATTTGAAGGCTTTCAGAGACCCCTTTGATTTCAGCTGGCCCAAGACAACCCCGACATATTCAAATAACCGAGATGTACAGGCCCA
AGGCATTGATTGTTCGGGTGCTTATTGTAGAACTAAAGATATTGTTTCTCAGTGTGACACTCAGGTGCATTCTAACCAAGAGAGGCAATCTGTGTTTGAGCTTGGTATCC
ATTGCTTATTCTTTGGTGGCTATTTAGCAAGTATGTGTTATGGCCACCATTCACATTTGGCATCTCGGATTCAAAATATTTTAGATGAGATGAATCGATACAGTTATCTA
GTAGAATTGTAA
Protein sequenceShow/hide protein sequence
MEMETRAMEDTEMGEDITPPLPPLTSALHDCFLLTHCSSCFSPLPDSPTSHSNLFRYCSAKCSDFDSVTPRFFSFHHLPFPDAADLRASFRLLHLVLSNPSAWHSGPPER
IFGLLTNREKLMLAEDEDEVFVGIRKGAEAMAVSRTTGSSDMCHENALEEAVLCLVLTNAVEVQNTDGRTIGIAVYDPTFCWINHSCSPNACYRFLLQPLTETETTSGSI
ETALRIAPSCTDPESIEGSCQIGTVLCNLSDFITKDFQGYGPKVVVRSIKSVRKGEAVTITYCDLLQPKALRQSELWSRYQFVCCCQRCSAKPLAYVDHALQEISAVKVE
LVDSTSISNFDHENAVRRINDYVDNAITEYLSIGSPESCCEKLENLLALGFFEEQVKDKEGKQLLNLSLHPLHYLSLNAYTALASAYKFRSCDSLALTSKMDDEHLHDAS
TMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLILGRSLSLWPTCHGTNLYLLEKIMCSNCSWVDKFNASRIHGRAINANFQEFSIGIFNCIANISQKSW
SFLAHGCPYLKAFRDPFDFSWPKTTPTYSNNRDVQAQGIDCSGAYCRTKDIVSQCDTQVHSNQERQSVFELGIHCLFFGGYLASMCYGHHSHLASRIQNILDEMNRYSYL
VEL