CuGenDBv2

Tan0001786 (gene) of Snake gourd v1 genome

Gene ID: Tan0001786
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein SET DOMAIN GROUP 41
Genome location: LG10:54087597..54090003
RNA-Seq Expression: Tan0001786
Synteny: Tan0001786
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR001214 - SET domain


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008463080.1 PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo] (e-value: 4.5e-253; identity: 71.93%)
Query:  MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC--SDSDSATAAFFSANHL--SFSDTADLRASLRLLH---L
        MEMRA+EDIEMAEDITPP  PLTSALHDSFL THCSSCF  LP  PISHS LL YCS KC  S SD  TAAFFS + L  + SDT+DLRASLRLLH   L
Subjt:  MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC--SDSDSATAAFFSANHL--SFSDTADLRASLRLLH---L

Query:  LSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHS
        LS PS   S PP RIFGLLTNR KLM  ++  EV +++R+ A+A+AA RR N ADI  G ALEEAVLCLV+TNAV+VQDS G+TIGIAVY PTF WINHS
Subjt:  LSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHS

Query:  CSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFI
        CSPNACYRFET SD   TR +I+P CTD  + EG+CRQMG V SN+ DF+R+DFQG GPR+VVRSIK I+KGEAVTIAYCDLLQPK  RQ+EL SRYQF+
Subjt:  CSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFI

Query:  CSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPL
        CSC RCSA P TYVDHALQEISAVK +L + S  ISNFD+D AVRRI +YVDNAI EYLSIGSPESCCEKL+NLLT GF DEQVE+ EGKQ  +LRLHP 
Subjt:  CSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPL

Query:  NYLSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSS--SCATNT
        ++L LNAYTAL S+YKVRSCDLLA  S+M   N  E+R NA T +KTSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGESLLILAR SS  +  TNT
Subjt:  NYLSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSS--SCATNT

Query:  SKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYS
        S   FPL KRMCSNCSWVD+FN SRIHGR  +A+F EFS GISNCIA+IS+K WSFLT GCPYLKAFTDPFDFSWPK     TN+ DI  H ID S A S
Subjt:  SKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYS

Query:  ETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN
        +TKDI  +CEPQ  S+QE +SI  LGIHCL +GGYLASICYG+HS LASQIQNIL+ +N
Subjt:  ETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN

XP_022932824.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata] (e-value: 8.3e-263; identity: 73.51%)
Query:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP
        MEMEMRAMEDIEMAEDITPP PPLT+ALHD+F LTHCSSCF PLP S ISHSNLLRYCSP CS SDS TAA FS +H  FSDT+DLRASLRLLH LLSD 
Subjt:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP

Query:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN
        SAWRSAPPERIFGLLTNREKLML EDD EV V+IRKGADA+AASRRTNSADI + NALEEA+LCLV+TNAVEVQDS G+TIGIAVY PTFCWINHSCSPN
Subjt:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH
        ACYRFET SDS+ TR++ISP CTD+GTGEGSC QM TV  N S FI KDFQGYGPR++VRSIKS+RKGEAVTIAYCDLLQPK +RQ+EL SRY+F+CSC 
Subjt:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH

Query:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS
        RCSAKPPTYVDHALQEISA   +L + STSISNFD D A+RRI DYV+NAI EYLSIGSPESCCEKL+NLLTLGF DEQ E+ +GKQL NLRLHP+++L 
Subjt:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS

Query:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP
        LN YTALAS+YKVRS             N DE++ NA T +KTSAAYSLFLAGATHHLFL EPSLIASAANCWVVAGESLLIL + SS   +NTSK S P
Subjt:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP

Query:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV
        + +  C NCSWVDKFN +RIHGRS +A+F EFS GISNCIA+IS K WSFL   C YLKAFTDPFDFSWPKT T   N           S   S+ +D+ 
Subjt:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV

Query:  PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN
                S+Q+ QSIFELGIHCL +GGYLASICYGH S LASQI+ IL  MN
Subjt:  PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN

XP_022974027.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima] (e-value: 5.5e-267; identity: 73.66%)
Query:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP
        MEME+RAMEDIEMAEDITPP PPLT+ALHDSFLLTHCSSCF PLP SPISHSNLLRYCSP CS SDS TAA FS +H  FSDT+DLRASLRLLH LLSD 
Subjt:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP

Query:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN
        SAWRS PPERIFGLLTNREKLML +DD EV  +IRKGADA+A SRRTNSADI + NALEEA++CLV+TNAVEVQDS G+TIGIAVY PTFCWINHSCSPN
Subjt:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH
        ACYRFET SDS+KTR++ISP CTD+GTGEGSC QM TV  N S FI KDFQGYGPR++VRSIKSIRKGEAVTIAYCDLLQPK MRQ+EL SRY+F+CSC 
Subjt:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH

Query:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS
        RCSAKPPTYVDHALQEI AV  +  + STSISNFD D A+ RI DYV+NAI EYLSIGSPESCCEKL+NLLTLGF DEQ ++ +GKQL NLRLHP+++L 
Subjt:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS

Query:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP
        LN YTALAS+YKVRS             N +E++ N ST +KTSAAYSLFLAGATHHLFL EPSLIASAANCWVVAGESLL L R SS   +NTSK S P
Subjt:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP

Query:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV
        + +  C NCSWVDKFN SRIHGRS + +F EFS GISNCIANIS K WSFLT  CPYLKAFTDPFDFSWPKT T  +N +       D    YS+ +D+ 
Subjt:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV

Query:  PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN
                SDQ+ QSIFELGIHCL +GGYLASICYGH S L+SQIQ IL  MN
Subjt:  PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN

XP_023520942.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo] (e-value: 2.2e-268; identity: 74.89%)
Query:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP
        MEMEMRAMEDIEMAEDITPP PPLT+ALHD+FLLTHCSSCF PLP S ISHSNLLRYCSP CS SDS TAA FS     FSDT+DLRASLRLLH LLSDP
Subjt:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP

Query:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN
        SAWRSAPPERIFGLLTNREKLML +DD EV V+IR+G+DA+AASRRTNSADI + NALEEA+LCLV+TNAVEVQDS GRTIGIAVY PTFCWINHSCSPN
Subjt:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH
        ACYRFET SDS+KTR++ISP CTD+GTGEGSC QM TV  N S FI KDFQGYGPR++VRSIKSIR GEAVTIAYCDLLQPK MRQ+EL SRY+F+CSC 
Subjt:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH

Query:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS
        RCSAKPPTYVDHALQEISAV  +L + STSISNFD D A+ RI DYV+NAI EYLSIGS ESCCEKL+NLLTLGF DEQ E+ +GKQL NLRLHP+++L 
Subjt:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS

Query:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP
        LNAYTALAS+YKVRS             NGDE++ NA T +KTSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGESLLIL + SS   +NTSK S P
Subjt:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP

Query:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV
        + +  C NCSWVDKFN SRIHGRS +A+F EFS GISNCIANISQK WSFL   C YLKAFTDPFDFSWPKT T  +N +       D S   S+ +D+ 
Subjt:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV

Query:  PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN
                SDQ+ QSIFELGIHCL +GGYLASICYGHHS LASQIQ IL  MN
Subjt:  PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN

XP_038886411.1 protein SET DOMAIN GROUP 41 [Benincasa hispida] (e-value: 1.1e-259; identity: 73.4%)
Query:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC--SDSDSATAAFFSANHL--SFSDTADLRASLRLLH-L
        MEMEM AMEDIEMAEDITPP  PLTSALHDSFL THCSSCF  LP  PISHSNLLRYCSPKC  S SD  TAAFFS +     FS T+DLRASLRLLH L
Subjt:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC--SDSDSATAAFFSANHL--SFSDTADLRASLRLLH-L

Query:  LSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHS
        LS P A  S PPERIFGLLTNR KLM  + D E+  ++R+G DA+AA     SADI HG+ L EA LCLV TNAV+V DS GRTIGIAVY PTFCWINHS
Subjt:  LSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHS

Query:  CSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFI
        CSPNACYRFETSS S  TR +I+P CTDL TG+GSC QMGTV SNLSDFI +DFQG GPR++VRSIKSIR+GEAVTIAYCDLLQPK MRQ+EL SRYQF+
Subjt:  CSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFI

Query:  CSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPL
        CSC RCSAKP TYVDHALQE+SA K +L   STSISNFD+D AVRRI DYV++AI EYLSIGSPESCCEKL NLLTLGF DEQ E+ E KQ  NLRLHPL
Subjt:  CSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPL

Query:  NYLSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCA-TNTS
        ++LSLN YTALAS+YKVRSCDLLA  S+M   N D+   NAST  K SAAYSLFLAGATHHLFL+EPSLI SA+ CWV+AGESLL LAR S   A TNTS
Subjt:  NYLSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCA-TNTS

Query:  KCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSE
        K  FP+ KRMCS CSWVDKFNASRIHG+  +A+F EFS GISNCIAN+S+KSWSFLT GCPYLKAFTDPF+FSWPK    Y++++DIRAHSID   A S 
Subjt:  KCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSE

Query:  TKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN
        +KD+  QCEPQ HS+QE +SI  LGIHCL +GGYLASICYGHHS LASQIQNIL  +N
Subjt:  TKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KAK3 SET domain-containing protein (e-value: 5.8e-246; identity: 70.2%)
Query:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC--SDSDSATAAFFSANHL--SFSDTADLRASLRLLH-L
        MEMEM A+EDIEMAEDI+PP  PLTSALHDSFL THCSSCF  LP  PISHS  L YCS KC  S SD  T AFFS +    + SDT+DLRASLRLLH L
Subjt:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC--SDSDSATAAFFSANHL--SFSDTADLRASLRLLH-L

Query:  LSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHS
        LS PS   S PP+RI+GLLTNR KLM  ++D EV +++R+GA+A+AA RR N ADI  G ALEEAVLCLV+TNAV+VQDS G+TIGIAVY  TF WINHS
Subjt:  LSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHS

Query:  CSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKD--FQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQ
        CSPNACYRFET SDS+ TR +I+P CTD  + EGSCRQMG V SN+ DFIR+     G GPR+VVRSIK I+KGEAVTIAYCDLLQPK  RQ+EL SRYQ
Subjt:  CSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKD--FQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQ

Query:  FICSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLH
        F+CSC RCSA P TYVDHALQEIS+VK +L + ST ISNFD+D AVRRI +YVDNAI EYLS  SPESCCEKL+NLLT GF DEQVE+ EGKQ  +LRLH
Subjt:  FICSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLH

Query:  PLNYLSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSS--SCAT
        PL++L LNAYTAL S+YKVRSCDL+A  S+M   NG+ H  NA T  KTSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGESLLILAR SS  +  T
Subjt:  PLNYLSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSS--SCAT

Query:  NTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSA
        NTS   FPL KRMC NCSWVD+FNASRIHG+  +A+F EFS GISNCIA+ISQK WS LT GCPYLKAFT PFDFSWPK     TN QDI    IDHS A
Subjt:  NTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSA

Query:  YSETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN
         S+T+D+  +C+PQ  S+QE +SI  LGIHCL +GGYLASICYGHHS LASQIQNIL+ +N
Subjt:  YSETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN

A0A1S3CIT0 protein SET DOMAIN GROUP 41 isoform X1 (e-value: 2.2e-253; identity: 71.93%)
Query:  MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC--SDSDSATAAFFSANHL--SFSDTADLRASLRLLH---L
        MEMRA+EDIEMAEDITPP  PLTSALHDSFL THCSSCF  LP  PISHS LL YCS KC  S SD  TAAFFS + L  + SDT+DLRASLRLLH   L
Subjt:  MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC--SDSDSATAAFFSANHL--SFSDTADLRASLRLLH---L

Query:  LSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHS
        LS PS   S PP RIFGLLTNR KLM  ++  EV +++R+ A+A+AA RR N ADI  G ALEEAVLCLV+TNAV+VQDS G+TIGIAVY PTF WINHS
Subjt:  LSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHS

Query:  CSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFI
        CSPNACYRFET SD   TR +I+P CTD  + EG+CRQMG V SN+ DF+R+DFQG GPR+VVRSIK I+KGEAVTIAYCDLLQPK  RQ+EL SRYQF+
Subjt:  CSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFI

Query:  CSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPL
        CSC RCSA P TYVDHALQEISAVK +L + S  ISNFD+D AVRRI +YVDNAI EYLSIGSPESCCEKL+NLLT GF DEQVE+ EGKQ  +LRLHP 
Subjt:  CSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPL

Query:  NYLSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSS--SCATNT
        ++L LNAYTAL S+YKVRSCDLLA  S+M   N  E+R NA T +KTSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGESLLILAR SS  +  TNT
Subjt:  NYLSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSS--SCATNT

Query:  SKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYS
        S   FPL KRMCSNCSWVD+FN SRIHGR  +A+F EFS GISNCIA+IS+K WSFLT GCPYLKAFTDPFDFSWPK     TN+ DI  H ID S A S
Subjt:  SKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYS

Query:  ETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN
        +TKDI  +CEPQ  S+QE +SI  LGIHCL +GGYLASICYG+HS LASQIQNIL+ +N
Subjt:  ETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN

A0A6J1EY39 protein SET DOMAIN GROUP 41 isoform X1 (e-value: 4.0e-263; identity: 73.51%)
Query:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP
        MEMEMRAMEDIEMAEDITPP PPLT+ALHD+F LTHCSSCF PLP S ISHSNLLRYCSP CS SDS TAA FS +H  FSDT+DLRASLRLLH LLSD 
Subjt:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP

Query:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN
        SAWRSAPPERIFGLLTNREKLML EDD EV V+IRKGADA+AASRRTNSADI + NALEEA+LCLV+TNAVEVQDS G+TIGIAVY PTFCWINHSCSPN
Subjt:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH
        ACYRFET SDS+ TR++ISP CTD+GTGEGSC QM TV  N S FI KDFQGYGPR++VRSIKS+RKGEAVTIAYCDLLQPK +RQ+EL SRY+F+CSC 
Subjt:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH

Query:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS
        RCSAKPPTYVDHALQEISA   +L + STSISNFD D A+RRI DYV+NAI EYLSIGSPESCCEKL+NLLTLGF DEQ E+ +GKQL NLRLHP+++L 
Subjt:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS

Query:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP
        LN YTALAS+YKVRS             N DE++ NA T +KTSAAYSLFLAGATHHLFL EPSLIASAANCWVVAGESLLIL + SS   +NTSK S P
Subjt:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP

Query:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV
        + +  C NCSWVDKFN +RIHGRS +A+F EFS GISNCIA+IS K WSFL   C YLKAFTDPFDFSWPKT T   N           S   S+ +D+ 
Subjt:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV

Query:  PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN
                S+Q+ QSIFELGIHCL +GGYLASICYGH S LASQI+ IL  MN
Subjt:  PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN

A0A6J1I954 protein SET DOMAIN GROUP 41 isoform X1 (e-value: 2.7e-267; identity: 73.66%)
Query:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP
        MEME+RAMEDIEMAEDITPP PPLT+ALHDSFLLTHCSSCF PLP SPISHSNLLRYCSP CS SDS TAA FS +H  FSDT+DLRASLRLLH LLSD 
Subjt:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP

Query:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN
        SAWRS PPERIFGLLTNREKLML +DD EV  +IRKGADA+A SRRTNSADI + NALEEA++CLV+TNAVEVQDS G+TIGIAVY PTFCWINHSCSPN
Subjt:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH
        ACYRFET SDS+KTR++ISP CTD+GTGEGSC QM TV  N S FI KDFQGYGPR++VRSIKSIRKGEAVTIAYCDLLQPK MRQ+EL SRY+F+CSC 
Subjt:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH

Query:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS
        RCSAKPPTYVDHALQEI AV  +  + STSISNFD D A+ RI DYV+NAI EYLSIGSPESCCEKL+NLLTLGF DEQ ++ +GKQL NLRLHP+++L 
Subjt:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS

Query:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP
        LN YTALAS+YKVRS             N +E++ N ST +KTSAAYSLFLAGATHHLFL EPSLIASAANCWVVAGESLL L R SS   +NTSK S P
Subjt:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP

Query:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV
        + +  C NCSWVDKFN SRIHGRS + +F EFS GISNCIANIS K WSFLT  CPYLKAFTDPFDFSWPKT T  +N +       D    YS+ +D+ 
Subjt:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV

Query:  PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN
                SDQ+ QSIFELGIHCL +GGYLASICYGH S L+SQIQ IL  MN
Subjt:  PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN

A0A6J1IF01 protein SET DOMAIN GROUP 41 isoform X2 (e-value: 3.0e-226; identity: 76.17%)
Query:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP
        MEME+RAMEDIEMAEDITPP PPLT+ALHDSFLLTHCSSCF PLP SPISHSNLLRYCSP CS SDS TAA FS +H  FSDT+DLRASLRLLH LLSD 
Subjt:  MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDP

Query:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN
        SAWRS PPERIFGLLTNREKLML +DD EV  +IRKGADA+A SRRTNSADI + NALEEA++CLV+TNAVEVQDS G+TIGIAVY PTFCWINHSCSPN
Subjt:  SAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPN

Query:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH
        ACYRFET SDS+KTR++ISP CTD+GTGEGSC QM TV  N S FI KDFQGYGPR++VRSIKSIRKGEAVTIAYCDLLQPK MRQ+EL SRY+F+CSC 
Subjt:  ACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH

Query:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS
        RCSAKPPTYVDHALQEI AV  +  + STSISNFD D A+ RI DYV+NAI EYLSIGSPESCCEKL+NLLTLGF DEQ ++ +GKQL NLRLHP+++L 
Subjt:  RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLS

Query:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP
        LN YTALAS+YKVRS             N +E++ N ST +KTSAAYSLFLAGATHHLFL EPSLIASAANCWVVAGESLL L R SS   +NTSK S P
Subjt:  LNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFP

Query:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFS
        + +  C NCSWVDKFN SRIHGRS + +F EFS
Subjt:  LRKRMCSNCSWVDKFNASRIHGRSFKANFCEFS

SwissProt top hits (e-value, % identity, alignment)
Q3ECY6 Protein SET DOMAIN GROUP 41 (e-value: 5.1e-98; identity: 37.35%)
Query:  MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAW
        ME+RA EDIE+  D+ PP  PL S+L+DSFL +HCSSCF  LP SP        YCS  CS +DS T +      ++    +D+R S   LHLL+  +  
Subjt:  MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAW

Query:  RSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACY
         S+ P R+  LLTN   LM    D  + V I   A+ +A   R+N         LEEA +C V+TNAVEV DSNG  +GIA+Y+ +F WINHSCSPN+CY
Subjt:  RSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACY

Query:  RFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCS
        RF  +  S           T+  T      Q    G++L+        G GP+++VRSIK I+ GE +T++Y DLLQP  +RQ++L S+Y+F+C+C RC+
Subjt:  RFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCS

Query:  AKPPTYVDHALQEISAVKEKLFVGSTSISNFD----NDNAVRRIKDYVDNAIYEYLSIG-SPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNY
        A PP YVD  L+ +  ++ +     T++ +FD     D AV ++ DY+  AI ++LS    P++CCE +E++L  G     ++ +E  Q H LRLH  +Y
Subjt:  AKPPTYVDHALQEISAVKEKLFVGSTSISNFD----NDNAVRRIKDYVDNAIYEYLSIG-SPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNY

Query:  LSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCS
        ++LNAY  LA++Y++RS      DS+ G              ++ SAAYSLFLAG +HHLF AE S   SAA  W  AGE L  LA       +  S   
Subjt:  LSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCS

Query:  FPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKD
               C+ C  ++  N+ R        +  E S  I +C+ +ISQ +WSFLT GCPYL+ F  P DFS  +T                          
Subjt:  FPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKD

Query:  IVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQ
             E +  S  +  ++  L  HCL +   L  +CYG  S L S+ +
Subjt:  IVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQ

Q557F7 SET and MYND domain-containing protein DDB_G0273589 (e-value: 3.3e-04; identity: 28.15%)
Query:  NAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKG
        N   +   N + IG+AV  P+  + NHSC PN                     CTD+             GSN++                +S+  I+KG
Subjt:  NAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKG

Query:  EAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCS
        + +TI+Y +L QP + R+ EL   Y F C C RC+
Subjt:  EAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCS

Q9CWR2 Histone-lysine N-methyltransferase SMYD3 (e-value: 7.8e-06; identity: 22.31%)
Query:  SHSNLLRYCSPKCS-----DSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASR
        S   + +YCS KC      D     +   S       D+  L    R++  L D     S      + L +N  K  L ED  E L ++           
Subjt:  SHSNLLRYCSPKCS-----DSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASR

Query:  RTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDF
          +++ +     L EA    VI N+  + ++  + +G+ +Y P+   +NHSC PN    F                                        
Subjt:  RTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDF

Query:  IRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAK
                GP +++R+++ I  GE +TI Y D+L   E R+ +L  +Y F C C RC  +
Subjt:  IRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAK

Q9H7B4 Histone-lysine N-methyltransferase SMYD3 (e-value: 1.3e-05; identity: 22.31%)
Query:  SHSNLLRYCSPKCS-----DSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASR
        S   + +YCS KC      D         S       D+  L    R++  L D +   S      + L +N  K  L ED  E L ++           
Subjt:  SHSNLLRYCSPKCS-----DSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASR

Query:  RTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDF
          +++ +     L EA    VI N+  + ++  + +G+ +Y P+   +NHSC PN    F                                        
Subjt:  RTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDF

Query:  IRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAK
                GP +++R+++ I  GE +TI Y D+L   E R+ +L  +Y F C C RC  +
Subjt:  IRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAK

Q9NRG4 N-lysine methyltransferase SMYD2 (e-value: 1.3e-05; identity: 22.22%)
Query:  HCSSCFFPLP-ISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIR
        HC  CF     +S         YC+ +C   D        +  + F +  +   ++RL   +    A +   PER     T  EKL+ V++ +  L ++ 
Subjt:  HCSSCFFPLP-ISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIR

Query:  KGADALAASRRTNSADIHH---------GNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLG
             L  S   + A +HH          N     +   V  N   ++D     +G A++ P    +NHSC PN    ++                    
Subjt:  KGADALAASRRTNSADIHH---------GNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLG

Query:  TGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAKPPTYVDHALQEISAVKEKLFV
                 GT+                    VR+++ I+ GE V  +Y DLL P E R   L   Y F C C  C+ K               K+K  V
Subjt:  TGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAKPPTYVDHALQEISAVKEKLFV

Query:  GSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLE
            +S+     A+R +  Y  N I E+      +S  E LE
Subjt:  GSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLE

Arabidopsis top hits (e-value, % identity, alignment)
AT1G26760.1 SET domain protein 35 (e-value: 9.8e-04; identity: 33.93%)
Query:  GPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAKPPTY
        G  ++V + + I+ GE ++ AY D+L P E R+ E+   + F C C RC  +   Y
Subjt:  GPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAKPPTY

AT1G43245.1 SET domain-containing protein (e-value: 3.6e-99; identity: 37.35%)
Query:  MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAW
        ME+RA EDIE+  D+ PP  PL S+L+DSFL +HCSSCF  LP SP        YCS  CS +DS T +      ++    +D+R S   LHLL+  +  
Subjt:  MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAW

Query:  RSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACY
         S+ P R+  LLTN   LM    D  + V I   A+ +A   R+N         LEEA +C V+TNAVEV DSNG  +GIA+Y+ +F WINHSCSPN+CY
Subjt:  RSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACY

Query:  RFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCS
        RF  +  S           T+  T      Q    G++L+        G GP+++VRSIK I+ GE +T++Y DLLQP  +RQ++L S+Y+F+C+C RC+
Subjt:  RFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCS

Query:  AKPPTYVDHALQEISAVKEKLFVGSTSISNFD----NDNAVRRIKDYVDNAIYEYLSIG-SPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNY
        A PP YVD  L+ +  ++ +     T++ +FD     D AV ++ DY+  AI ++LS    P++CCE +E++L  G     ++ +E  Q H LRLH  +Y
Subjt:  AKPPTYVDHALQEISAVKEKLFVGSTSISNFD----NDNAVRRIKDYVDNAIYEYLSIG-SPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNY

Query:  LSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCS
        ++LNAY  LA++Y++RS      DS+ G              ++ SAAYSLFLAG +HHLF AE S   SAA  W  AGE L  LA       +  S   
Subjt:  LSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCS

Query:  FPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKD
               C+ C  ++  N+ R        +  E S  I +C+ +ISQ +WSFLT GCPYL+ F  P DFS  +T                          
Subjt:  FPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKD

Query:  IVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQ
             E +  S  +  ++  L  HCL +   L  +CYG  S L S+ +
Subjt:  IVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQ

AT2G17900.1 SET domain group 37 (e-value: 1.2e-04; identity: 28.67%)
Query:  NAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKG
        NA  + DS  R  GI ++ P    INHSCSPNA   FE                           QM                      VVR++ +I K 
Subjt:  NAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKG

Query:  EAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCS--AKPPTYVDHALQE
          +TI+Y +       RQ  L  +Y F C C RCS   KP    + A+ E
Subjt:  EAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCS--AKPPTYVDHALQE


Sequences
CDS sequence
ATGGAGATGGAAATGAGGGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATCGCCTCCCCTCACCTCCGCCCTCCACGATTCCTTCCTCCTTACACACTG
CTCCTCCTGCTTCTTCCCTCTCCCAATTTCCCCAATTTCTCACTCCAATCTCCTCCGCTATTGCTCCCCCAAATGCTCCGATTCCGATTCCGCCACCGCCGCCTTCTTCT
CCGCCAATCATCTCTCCTTCTCCGACACCGCGGACCTCCGCGCCTCGCTTCGCCTCCTCCATCTCCTCTCCGATCCCTCGGCTTGGCGCTCTGCTCCTCCCGAGCGCATC
TTTGGCCTTCTCACCAACCGCGAGAAATTGATGCTCGTCGAAGACGACGACGAGGTTCTCGTCAGGATTCGGAAAGGAGCCGACGCCTTGGCCGCTTCCAGAAGGACGAA
CTCTGCCGATATTCACCATGGAAACGCCTTGGAAGAGGCCGTCCTATGCCTCGTTATTACCAACGCTGTGGAGGTTCAGGATTCGAACGGCCGTACCATTGGAATCGCTG
TGTATGATCCTACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTTCGTCGGATTCCTTGAAGACAAGGATGCAGATTTCCCCCAAA
TGCACTGACCTTGGCACTGGTGAAGGAAGTTGTCGTCAAATGGGTACTGTGGGTAGCAACCTTTCGGATTTCATAAGAAAAGATTTTCAGGGTTATGGTCCAAGAATTGT
GGTTAGGAGTATAAAGAGTATAAGGAAAGGTGAAGCAGTAACAATCGCATACTGTGACTTGTTACAACCTAAGGAAATGAGGCAGACAGAGTTGTGTTCTAGATATCAAT
TTATCTGTAGTTGCCACCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGAAATTTCTGCTGTCAAAGAGAAATTGTTTGTTGGTTCAACTTCCATT
AGCAACTTTGATAATGACAATGCAGTGAGAAGAATAAAAGATTATGTTGATAATGCAATCTACGAGTATCTATCTATTGGTTCTCCCGAATCATGTTGTGAGAAGCTTGA
AAACTTGCTTACTTTAGGTTTCTGTGACGAGCAAGTGGAAGAGGAGGAAGGAAAACAGTTGCATAATTTGAGGCTGCATCCCTTGAACTACCTGTCACTGAATGCATACA
CAGCTCTCGCATCGTCTTATAAAGTCCGTTCATGTGATTTATTGGCTTCCGATTCCAAAATGGGCGATGGCAACGGCGATGAACATCGACAAAATGCATCTACCACAACC
AAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACCCACCATCTTTTTCTTGCTGAACCATCTTTGATTGCTTCTGCTGCAAACTGTTGGGTTGTTGCTGGAGA
ATCTTTGCTTATTCTTGCTAGAAGCAGCTCATCATGTGCTACTAACACATCAAAATGTAGTTTCCCTCTGCGCAAAAGAATGTGTTCTAATTGCTCATGGGTCGATAAGT
TCAATGCGAGTAGAATCCATGGTCGATCTTTCAAAGCCAATTTTTGCGAGTTTTCAAGTGGTATTTCAAATTGCATTGCTAATATTTCACAAAAATCTTGGAGCTTTCTG
ACTGATGGCTGCCCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCAAAGACGACCACAGCGTATACGAATAACCAAGATATACGGGCTCATAGCATCGA
TCATTCGAGTGCTTATAGTGAAACTAAAGATATTGTTCCTCAGTGTGAACCTCAGGTGCATTCTGACCAAGAGTGGCAATCTATCTTTGAGCTTGGCATCCATTGCTTAT
GCTTTGGGGGCTATTTAGCAAGTATTTGTTATGGCCACCATTCACTTCTGGCATCTCAGATTCAAAACATTTTAGATAAGATGAACTGA
mRNA sequence
ATGGAGATGGAAATGAGGGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATCGCCTCCCCTCACCTCCGCCCTCCACGATTCCTTCCTCCTTACACACTG
CTCCTCCTGCTTCTTCCCTCTCCCAATTTCCCCAATTTCTCACTCCAATCTCCTCCGCTATTGCTCCCCCAAATGCTCCGATTCCGATTCCGCCACCGCCGCCTTCTTCT
CCGCCAATCATCTCTCCTTCTCCGACACCGCGGACCTCCGCGCCTCGCTTCGCCTCCTCCATCTCCTCTCCGATCCCTCGGCTTGGCGCTCTGCTCCTCCCGAGCGCATC
TTTGGCCTTCTCACCAACCGCGAGAAATTGATGCTCGTCGAAGACGACGACGAGGTTCTCGTCAGGATTCGGAAAGGAGCCGACGCCTTGGCCGCTTCCAGAAGGACGAA
CTCTGCCGATATTCACCATGGAAACGCCTTGGAAGAGGCCGTCCTATGCCTCGTTATTACCAACGCTGTGGAGGTTCAGGATTCGAACGGCCGTACCATTGGAATCGCTG
TGTATGATCCTACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTTCGTCGGATTCCTTGAAGACAAGGATGCAGATTTCCCCCAAA
TGCACTGACCTTGGCACTGGTGAAGGAAGTTGTCGTCAAATGGGTACTGTGGGTAGCAACCTTTCGGATTTCATAAGAAAAGATTTTCAGGGTTATGGTCCAAGAATTGT
GGTTAGGAGTATAAAGAGTATAAGGAAAGGTGAAGCAGTAACAATCGCATACTGTGACTTGTTACAACCTAAGGAAATGAGGCAGACAGAGTTGTGTTCTAGATATCAAT
TTATCTGTAGTTGCCACCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGAAATTTCTGCTGTCAAAGAGAAATTGTTTGTTGGTTCAACTTCCATT
AGCAACTTTGATAATGACAATGCAGTGAGAAGAATAAAAGATTATGTTGATAATGCAATCTACGAGTATCTATCTATTGGTTCTCCCGAATCATGTTGTGAGAAGCTTGA
AAACTTGCTTACTTTAGGTTTCTGTGACGAGCAAGTGGAAGAGGAGGAAGGAAAACAGTTGCATAATTTGAGGCTGCATCCCTTGAACTACCTGTCACTGAATGCATACA
CAGCTCTCGCATCGTCTTATAAAGTCCGTTCATGTGATTTATTGGCTTCCGATTCCAAAATGGGCGATGGCAACGGCGATGAACATCGACAAAATGCATCTACCACAACC
AAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACCCACCATCTTTTTCTTGCTGAACCATCTTTGATTGCTTCTGCTGCAAACTGTTGGGTTGTTGCTGGAGA
ATCTTTGCTTATTCTTGCTAGAAGCAGCTCATCATGTGCTACTAACACATCAAAATGTAGTTTCCCTCTGCGCAAAAGAATGTGTTCTAATTGCTCATGGGTCGATAAGT
TCAATGCGAGTAGAATCCATGGTCGATCTTTCAAAGCCAATTTTTGCGAGTTTTCAAGTGGTATTTCAAATTGCATTGCTAATATTTCACAAAAATCTTGGAGCTTTCTG
ACTGATGGCTGCCCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCAAAGACGACCACAGCGTATACGAATAACCAAGATATACGGGCTCATAGCATCGA
TCATTCGAGTGCTTATAGTGAAACTAAAGATATTGTTCCTCAGTGTGAACCTCAGGTGCATTCTGACCAAGAGTGGCAATCTATCTTTGAGCTTGGCATCCATTGCTTAT
GCTTTGGGGGCTATTTAGCAAGTATTTGTTATGGCCACCATTCACTTCTGGCATCTCAGATTCAAAACATTTTAGATAAGATGAACTGA
Protein sequence
MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRSAPPERI
FGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPK
CTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSI
SNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTT
KTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFL
TDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN