; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009560 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009560
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionhistone-lysine N-methyltransferase ATXR2
Genome locationchr9:40343772..40355054
RNA-Seq ExpressionLag0009560
SyntenyLag0009560
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0016020 - membrane (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001214 - SET domain
IPR002893 - Zinc finger, MYND-type
IPR044237 - Histone-lysine N-methyltransferase ATXR2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8075750.1 hypothetical protein FH972_014439 [Carpinus fangiana]1.1e-29564.07Show/hide
Query:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNL---------------------------------------------DSRSVGAIAGFAVAIIFTW
        MAE SSK++L+QLIKRFGAYLT+K+S+  PISL NL                                             +SRS GAIAG AVAI+FTW
Subjt:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNL---------------------------------------------DSRSVGAIAGFAVAIIFTW

Query:  RLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQA
        RLLRSP+  QRRQPKRQ   P  S+S V   SNA L+P GV  S ED RA NVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRL+G+ILEE+SPE++QKQA
Subjt:  RLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQA

Query:  TVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQL--------------
        TVRSSVLEVLLEITK+CDLYLME VLDDESE++VL ALEDAG+FTSGGLVKDKVLFCS ENGRTSFVRQLEPDWHIDSNPEI++QL              
Subjt:  TVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQL--------------

Query:  ----------APS----------------TVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLV
                  APS                  +   + +    +++   PP+   EYFD L+ TRQ RGI+VKQN  FGKGV AD  FKEG+L LKD+ML 
Subjt:  ----------APS----------------TVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLV

Query:  GSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETES-DDGQEIALENNESMGGCSSGKSKTT-ALPKGLVESLM
        G+QH SNKIDCLVCSFCFRF+GSIELQIGR+LY Q++GVS N   DME    +S DC+ T+S D+     L+N +++G C+S  SK   ++P+ +VESLM
Subjt:  GSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETES-DDGQEIALENNESMGGCSSGKSKTT-ALPKGLVESLM

Query:  NGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQM
        NG L LP+S EF++P A+PC GGCGEA+YCSKSCAEADW++FHSLLCTG ++E   REAL KFIQHAN+TNDIFLLA K ISSTIL+Y+KLK    +EQ 
Subjt:  NGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQM

Query:  KYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVAS
           +  I + +D+S+LLEAW+PISMGHKRRWWDC+ALPDDV+ S+E  FRM+IRE+AF SLQLLK AIF++ CEPLFSLEIYGHIIGMFELNNLDLVVAS
Subjt:  KYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVAS

Query:  PVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEER
        PVEDYFLYID+LP P K++A++ITRP LDALGD YS+CCQGTAFFP+QSCMNHSC+PNAKAFKREEDRDGQA II+++PI  GEEVTISY+DEDLPFEER
Subjt:  PVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEER

Query:  RALLADYGFECRCPKCLQEHP
        +ALLADYGF+C+CPKCL E P
Subjt:  RALLADYGFECRCPKCLQEHP

KAF5741447.1 histone-lysine N-methyltransferase ATXR2-like isoform X1 [Tripterygium wilfordii]4.9e-28066.53Show/hide
Query:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSS
        MAE SSK+DL QLIKRFGAY++++MSNFFPISL N+D RSVGA+ G A+AI+FTWRLLRSP+  QRRQPKRQ  APS+S+S  G+ S+A   P  V   S
Subjt:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSS

Query:  EDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFT
        ED RA NVVDE FQPV PTLGQIVRQ+LSEGRKVTCRL+G+ILEE++PE+LQKQATV+SSVLEVLLEITK+CDLYLME VLDD+SE+ VLLALE+AG+FT
Subjt:  EDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFT

Query:  SGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADA
        SGGLVKDKVLFCS E GRTSFVRQLEPDWHID+NPEI++QLA                           EYF++L+  RQCRGI+VKQ+  FGKGV A+ 
Subjt:  SGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADA

Query:  AFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIAL---ENNESMGGCSSG
         F EG+L LKD+MLVG+QH  NKIDCLVCSFCF+F+GSIELQIGR+L+ Q + VS N +CD +  S +S+DC +T+S DG   +      N  +GG SS 
Subjt:  AFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIAL---ENNESMGGCSSG

Query:  KSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISST
        K +   LP+G+VESLMNG + LP++N F++P  + C GGCGEA+YCSKSCAEADWE+ HSLLCTG +++   R AL KF +HAN+TNDIFLLAAKAIS T
Subjt:  KSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISST

Query:  ILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGH
        IL+Y+KLK    +E+            + S+LLEAW+PIS+G+KRRWWDC+ALPDDV+ S+E  FRMQI+E+AF SLQ LK AI ++ CEPLFSLEIYGH
Subjt:  ILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGH

Query:  IIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGE
        IIGMFELNNLDLVVASPVEDYFLYID+LP   K++AEEIT+P+L+ALG+ YS+ CQGTAFFP+QSCMNHSC+PNAKAFKREEDRDG+ATIIA RPI  GE
Subjt:  IIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGE

Query:  EVTISYIDEDLPFEERRALLADYGFECRCPKCLQE
        EVTISYIDEDL +EER+A LADYGF+CRC +CL+E
Subjt:  EVTISYIDEDLPFEERRALLADYGFECRCPKCLQE

KAG7010720.1 Histone-lysine N-methyltransferase ATXR2 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0088.59Show/hide
Query:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSS
        MAE SSKDDLLQLIKRFGAYLTLK+SNF PISL+NLDSRSVGAIAGFAVAI+FTWRLLRSPNGHQRRQ KRQ PAPS+STSSVGLNSNAQLI SG CSSS
Subjt:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSS

Query:  EDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFT
        EDLRA NVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVL ALEDAG+FT
Subjt:  EDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFT

Query:  SGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLA-PSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVAD
        SGGLVKDKVLFCS ENGRTSFVRQLEPDWHIDSNPEI+SQLA P+ +  L           +P  P    EYFDQLVW RQCRG+RVKQ+ A GKGV AD
Subjt:  SGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLA-PSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVAD

Query:  AAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKS
        AAFKEGDL LKD+MLVGSQHTSNK+DCLVCSFCFRFVGSIELQIGRKLYFQ+LGVSTNHQCDMEPSSPMSEDC+E ESDDG+EIALENNESMGGCSS  S
Subjt:  AAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKS

Query:  KTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTIL
        K  ALP GLVESLMNGGLSLPHS EF+MPPAIPCSGGCGEAFYCSKSCAEADWE FHSLLCTGGKTEPSRREAL KFIQ+ANDTNDIFLLAAKAISSTIL
Subjt:  KTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTIL

Query:  KYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHII
        +YKKLK A SEEQ KYG     N VDLSILLEAW+PISMGHKRRWWDCIALP+DVEPSNEV FRMQIREMAFTSLQLLKEAI +EGCEPLFSLEIYG II
Subjt:  KYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHII

Query:  GMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEV
        GMFELNNLDLVVASPVEDYFLY+DELPSPYKEKAEEITRPLLD LGDSYSICCQGTAFFP+QSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEV
Subjt:  GMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEV

Query:  TISYIDEDLPFEERRALLADYGFECRCPKCL-QEHP
        TISYIDEDLPFE+RRALLADYGFECRCPKCL Q+HP
Subjt:  TISYIDEDLPFEERRALLADYGFECRCPKCL-QEHP

OMO60161.1 hypothetical protein CCACVL1_24353 [Corchorus capsularis]2.0e-28968.67Show/hide
Query:  TSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDL
        +SSKD+L+QLIKR G +L LKMSN F ISL  LD RSVGAIAG AVAIIFT+RL+RSP    RRQPKRQ  AP+TSTS+    S++ L+PSGVCSSSED 
Subjt:  TSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDL

Query:  RAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGG
        RA NVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE+SPE+LQ +ATV+SSVL+VLLEITK+CDLYLME V+DDESEK VLLALE+AGIFTSGG
Subjt:  RAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGG

Query:  LVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFK
        LVKDKVLFCS ENGRTSFVRQLEPDWHID+NPEI+SQLA                           EYF+QL+  RQC GI VKQN  FGKGV A+  F+
Subjt:  LVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFK

Query:  EGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTA
        E  L LKD+ML+G+QH+SNKIDCLVCS+CF+F+GSIE QIGRKLY + LGVS        P S    D  + + D+ +     ++ S  G SS    T  
Subjt:  EGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTA

Query:  LPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKK
        LPK  VESLMNG +SLP+S +F +P  + C G C E FYCSKSCAEADWE FHSLLC G KT+   REAL KFIQHAN+TNDIFLLAAKAIS TIL+Y+K
Subjt:  LPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKK

Query:  LKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFE
        LK +   E  K      L T +LS+LL+AW+PIS+GHKRRWWDC++LPDD++ S+E  FRMQ+RE+AFTSLQLLKEAIF++ CEPLFSLEIYGHIIGMFE
Subjt:  LKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFE

Query:  LNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISY
        LNNLDLVVASPVEDYF+YID+LP+P K++AE ITRP LDALG+ YS+CC+GTAF+P+QSCMNHSC PNAKAFKREEDRDGQATIIAVRP+   EE+ ISY
Subjt:  LNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISY

Query:  IDEDLPFEERRALLADYGFECRCPKCLQEHP
        IDEDLPFEER+ALLADYGF CRCP+CL+E P
Subjt:  IDEDLPFEERRALLADYGFECRCPKCLQEHP

RXH89948.1 hypothetical protein DVH24_032305 [Malus domestica]3.7e-29668.49Show/hide
Query:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSS
        MAE SSK++LLQLIKRFGAYLT+KMS+ FPISLHNL+SRS+GAIAGFAVAI+FTWRLLR P+G QRRQPKRQ  AP++S+S +  NSNA L   GV SSS
Subjt:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSS

Query:  EDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFT
        ED R  NVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE++PE+LQKQ TVR SVLEVLLEITK+CDLYLME VLDDESEK+VL+ALEDAGIFT
Subjt:  EDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFT

Query:  SGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLL------TELRLKQVLNS--------------------------TPAPPTIA
        SGGLVKDKVLFCS ENGRTSFVRQLEPDWHID+NP+II QL+      L      TE       NS                          TP  P   
Subjt:  SGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLL------TELRLKQVLNS--------------------------TPAPPTIA

Query:  SEYFDQLVW--TRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSS
         EY ++L+   +RQC GI VKQN   GKGV AD+  KEG L LKD+MLVG QH+SNKIDCLVCSFCFRFVGS+ELQIGR+LY QELGVS    C     S
Subjt:  SEYFDQLVW--TRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSS

Query:  PMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTE
           ED                 +  G  SS   +   LPKG+VESLMNGGL LP+S++F++PP +PC GGC EA+YCSK CAE+DW++ HSLLCTG ++E
Subjt:  PMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTE

Query:  PSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQI
           REAL +FIQHANDTNDIFLLAAKA+SSTILKY+KLK A SE++ K     +  +  LS+LL+AW+PIS+GHKRRWWDCIALP DVEPS+E  FRMQI
Subjt:  PSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQI

Query:  REMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNH
        RE+AFTSL+LLK AIF+E C+P+FSLEIYGHII MFELNNLDLVVASPVEDYFLYID+LP+P K++ E ITRP LDALGD YS+CCQGTAF+P+QSCMNH
Subjt:  REMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNH

Query:  SCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPKCLQEHP
        SC PNAKAFKREEDRDGQATIIA++PI  GEEVTISY+DEDLP+EERRALLADYGF+CRCPKCL+E P
Subjt:  SCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPKCLQEHP

TrEMBL top hitse value%identityAlignment
A0A1R3GQ23 SET domain-containing protein9.6e-29068.67Show/hide
Query:  TSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDL
        +SSKD+L+QLIKR G +L LKMSN F ISL  LD RSVGAIAG AVAIIFT+RL+RSP    RRQPKRQ  AP+TSTS+    S++ L+PSGVCSSSED 
Subjt:  TSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDL

Query:  RAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGG
        RA NVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE+SPE+LQ +ATV+SSVL+VLLEITK+CDLYLME V+DDESEK VLLALE+AGIFTSGG
Subjt:  RAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGG

Query:  LVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFK
        LVKDKVLFCS ENGRTSFVRQLEPDWHID+NPEI+SQLA                           EYF+QL+  RQC GI VKQN  FGKGV A+  F+
Subjt:  LVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFK

Query:  EGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTA
        E  L LKD+ML+G+QH+SNKIDCLVCS+CF+F+GSIE QIGRKLY + LGVS        P S    D  + + D+ +     ++ S  G SS    T  
Subjt:  EGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTA

Query:  LPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKK
        LPK  VESLMNG +SLP+S +F +P  + C G C E FYCSKSCAEADWE FHSLLC G KT+   REAL KFIQHAN+TNDIFLLAAKAIS TIL+Y+K
Subjt:  LPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKK

Query:  LKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFE
        LK +   E  K      L T +LS+LL+AW+PIS+GHKRRWWDC++LPDD++ S+E  FRMQ+RE+AFTSLQLLKEAIF++ CEPLFSLEIYGHIIGMFE
Subjt:  LKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFE

Query:  LNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISY
        LNNLDLVVASPVEDYF+YID+LP+P K++AE ITRP LDALG+ YS+CC+GTAF+P+QSCMNHSC PNAKAFKREEDRDGQATIIAVRP+   EE+ ISY
Subjt:  LNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISY

Query:  IDEDLPFEERRALLADYGFECRCPKCLQEHP
        IDEDLPFEER+ALLADYGF CRCP+CL+E P
Subjt:  IDEDLPFEERRALLADYGFECRCPKCLQEHP

A0A1R3JYQ9 SET domain-containing protein1.4e-27263.26Show/hide
Query:  TSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDL
        +SSKD+L+QLIKR G +L LKMSN F ISL  LD RSVGAIAG AVAIIFT+RL+RSP    RRQPKRQ  AP+TSTS+    S++ L+PSGVCSSSED 
Subjt:  TSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDL

Query:  RAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGG
        RA NVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE+SPE+LQ +ATV+SSVL+VLLEITK+CDLYLME V+DDESEK VLLALE+AGIFTSGG
Subjt:  RAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGG

Query:  LVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAP--STVLLLTELRLKQVLNSTPA--------------------------------------
        LVKDKVLFCS ENGRTSFVRQLEPDWHID+NPEI+SQ +   S + LL++ +  Q+++S                                         
Subjt:  LVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAP--STVLLLTELRLKQVLNSTPA--------------------------------------

Query:  -----PPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNH
             PP+   EYF+QL+  RQC GI VKQN  FGKGV A+  F+E  L LKD+ML+G+QH SNKIDCLVCS+CF+F+GSIE QIGRKLY + LGVS + 
Subjt:  -----PPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNH

Query:  QCDMEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSL
         C  + S          + D+   +   ++ S  G SS    T  LP+  VESLMNG +SLP+SN+F +P  I C G C EAFYCSKSCAEADWE FHSL
Subjt:  QCDMEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSL

Query:  LCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSN
        LC G KT+   REAL KFIQH N+TNDIFLLAAKAIS TIL+Y+KLK + + EQ K     +L T  L +LL+AW+PIS+GHK+RWWDC++LPDD++ S 
Subjt:  LCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSN

Query:  EVVFRMQIREMAFTSLQLLKEAIFNEGCEPLF---SLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGT
        E  FRMQ+RE+AFTSLQLLKEAIF++ CEP F   SL +   + G       DLVVASPVEDYF+YID+LP P K++AE ITRP LDALG+ YS+CCQGT
Subjt:  EVVFRMQIREMAFTSLQLLKEAIFNEGCEPLF---SLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGT

Query:  AFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPK
        AF+P+QSCMNHSC PNAKAFKREEDRDGQATIIA RPI  GEE+ ISYIDEDLPFEER+ALLADYGF CRCP+
Subjt:  AFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPK

A0A498J311 SET domain-containing protein1.8e-29668.49Show/hide
Query:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSS
        MAE SSK++LLQLIKRFGAYLT+KMS+ FPISLHNL+SRS+GAIAGFAVAI+FTWRLLR P+G QRRQPKRQ  AP++S+S +  NSNA L   GV SSS
Subjt:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSS

Query:  EDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFT
        ED R  NVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE++PE+LQKQ TVR SVLEVLLEITK+CDLYLME VLDDESEK+VL+ALEDAGIFT
Subjt:  EDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFT

Query:  SGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLL------TELRLKQVLNS--------------------------TPAPPTIA
        SGGLVKDKVLFCS ENGRTSFVRQLEPDWHID+NP+II QL+      L      TE       NS                          TP  P   
Subjt:  SGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLL------TELRLKQVLNS--------------------------TPAPPTIA

Query:  SEYFDQLVW--TRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSS
         EY ++L+   +RQC GI VKQN   GKGV AD+  KEG L LKD+MLVG QH+SNKIDCLVCSFCFRFVGS+ELQIGR+LY QELGVS    C     S
Subjt:  SEYFDQLVW--TRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSS

Query:  PMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTE
           ED                 +  G  SS   +   LPKG+VESLMNGGL LP+S++F++PP +PC GGC EA+YCSK CAE+DW++ HSLLCTG ++E
Subjt:  PMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTE

Query:  PSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQI
           REAL +FIQHANDTNDIFLLAAKA+SSTILKY+KLK A SE++ K     +  +  LS+LL+AW+PIS+GHKRRWWDCIALP DVEPS+E  FRMQI
Subjt:  PSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQI

Query:  REMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNH
        RE+AFTSL+LLK AIF+E C+P+FSLEIYGHII MFELNNLDLVVASPVEDYFLYID+LP+P K++ E ITRP LDALGD YS+CCQGTAF+P+QSCMNH
Subjt:  REMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNH

Query:  SCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPKCLQEHP
        SC PNAKAFKREEDRDGQATIIA++PI  GEEVTISY+DEDLP+EERRALLADYGF+CRCPKCL+E P
Subjt:  SCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPKCLQEHP

A0A5N6RD75 SET domain-containing protein5.3e-29664.07Show/hide
Query:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNL---------------------------------------------DSRSVGAIAGFAVAIIFTW
        MAE SSK++L+QLIKRFGAYLT+K+S+  PISL NL                                             +SRS GAIAG AVAI+FTW
Subjt:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNL---------------------------------------------DSRSVGAIAGFAVAIIFTW

Query:  RLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQA
        RLLRSP+  QRRQPKRQ   P  S+S V   SNA L+P GV  S ED RA NVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRL+G+ILEE+SPE++QKQA
Subjt:  RLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQA

Query:  TVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQL--------------
        TVRSSVLEVLLEITK+CDLYLME VLDDESE++VL ALEDAG+FTSGGLVKDKVLFCS ENGRTSFVRQLEPDWHIDSNPEI++QL              
Subjt:  TVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQL--------------

Query:  ----------APS----------------TVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLV
                  APS                  +   + +    +++   PP+   EYFD L+ TRQ RGI+VKQN  FGKGV AD  FKEG+L LKD+ML 
Subjt:  ----------APS----------------TVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLV

Query:  GSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETES-DDGQEIALENNESMGGCSSGKSKTT-ALPKGLVESLM
        G+QH SNKIDCLVCSFCFRF+GSIELQIGR+LY Q++GVS N   DME    +S DC+ T+S D+     L+N +++G C+S  SK   ++P+ +VESLM
Subjt:  GSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETES-DDGQEIALENNESMGGCSSGKSKTT-ALPKGLVESLM

Query:  NGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQM
        NG L LP+S EF++P A+PC GGCGEA+YCSKSCAEADW++FHSLLCTG ++E   REAL KFIQHAN+TNDIFLLA K ISSTIL+Y+KLK    +EQ 
Subjt:  NGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQM

Query:  KYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVAS
           +  I + +D+S+LLEAW+PISMGHKRRWWDC+ALPDDV+ S+E  FRM+IRE+AF SLQLLK AIF++ CEPLFSLEIYGHIIGMFELNNLDLVVAS
Subjt:  KYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVAS

Query:  PVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEER
        PVEDYFLYID+LP P K++A++ITRP LDALGD YS+CCQGTAFFP+QSCMNHSC+PNAKAFKREEDRDGQA II+++PI  GEEVTISY+DEDLPFEER
Subjt:  PVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEER

Query:  RALLADYGFECRCPKCLQEHP
        +ALLADYGF+C+CPKCL E P
Subjt:  RALLADYGFECRCPKCLQEHP

A0A7J7D510 Histone-lysine N-methyltransferase ATXR2-like isoform X12.4e-28066.53Show/hide
Query:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSS
        MAE SSK+DL QLIKRFGAY++++MSNFFPISL N+D RSVGA+ G A+AI+FTWRLLRSP+  QRRQPKRQ  APS+S+S  G+ S+A   P  V   S
Subjt:  MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSS

Query:  EDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFT
        ED RA NVVDE FQPV PTLGQIVRQ+LSEGRKVTCRL+G+ILEE++PE+LQKQATV+SSVLEVLLEITK+CDLYLME VLDD+SE+ VLLALE+AG+FT
Subjt:  EDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFT

Query:  SGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADA
        SGGLVKDKVLFCS E GRTSFVRQLEPDWHID+NPEI++QLA                           EYF++L+  RQCRGI+VKQ+  FGKGV A+ 
Subjt:  SGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADA

Query:  AFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIAL---ENNESMGGCSSG
         F EG+L LKD+MLVG+QH  NKIDCLVCSFCF+F+GSIELQIGR+L+ Q + VS N +CD +  S +S+DC +T+S DG   +      N  +GG SS 
Subjt:  AFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIAL---ENNESMGGCSSG

Query:  KSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISST
        K +   LP+G+VESLMNG + LP++N F++P  + C GGCGEA+YCSKSCAEADWE+ HSLLCTG +++   R AL KF +HAN+TNDIFLLAAKAIS T
Subjt:  KSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISST

Query:  ILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGH
        IL+Y+KLK    +E+            + S+LLEAW+PIS+G+KRRWWDC+ALPDDV+ S+E  FRMQI+E+AF SLQ LK AI ++ CEPLFSLEIYGH
Subjt:  ILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGH

Query:  IIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGE
        IIGMFELNNLDLVVASPVEDYFLYID+LP   K++AEEIT+P+L+ALG+ YS+ CQGTAFFP+QSCMNHSC+PNAKAFKREEDRDG+ATIIA RPI  GE
Subjt:  IIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGE

Query:  EVTISYIDEDLPFEERRALLADYGFECRCPKCLQE
        EVTISYIDEDL +EER+A LADYGF+CRC +CL+E
Subjt:  EVTISYIDEDLPFEERRALLADYGFECRCPKCLQE

SwissProt top hitse value%identityAlignment
Q3TYX3 SET and MYND domain-containing protein 52.3e-1422.9Show/hide
Query:  EAFYCSKSCAEADWEVFHSLLCTGGKTEP----SRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWR
        +  YCS  C  A  E +H +LC G   +P    ++ +   + + +  +T  I L+A    +        +KQA+ ++                       
Subjt:  EAFYCSKSCAEADWEVFHSLLCTGGKTEP----SRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWR

Query:  PISMGHKRRWWD--CIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEK
             H  R ++  C    +  +     + + + ++     L L KEA++ E     F+ E +  +  +   N   +  +S +  +    D L    +++
Subjt:  PISMGHKRRWWD--CIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEK

Query:  ------AEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID----EDLPFEERRALLADYGF
               +++ + +  A G+  +  C+G+  F +QSC NHSC PNA+    E +     T  A+  I PGEE+ ISY+D    E       + L  +Y F
Subjt:  ------AEEITRPLLDALGDSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID----EDLPFEERRALLADYGF

Query:  ECRCPKCLQE
         C CPKCL E
Subjt:  ECRCPKCLQE

Q5PP37 Histone-lysine N-methyltransferase ATXR27.9e-17262.85Show/hide
Query:  PAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCD
        P P     EYF++L+ +R+C GI VK N   GKGV A++ F E +L LKD +LVG QH+SNK+DCLVCSFCFRF+GSIE QIGRKLYF+ LGVS    C 
Subjt:  PAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCD

Query:  MEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCT
                  C +  S++ + +    NE   G SS    T  LP+G+V SLMNG ++LPH+++F +P  + C GGC EAFYCS+SCA ADWE  HSLLCT
Subjt:  MEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCT

Query:  GGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVV
        G ++E   REAL +FI+HANDTNDIFLLAAKAI+ TIL+Y+KLK    +++ K  +         S+LLEAW+P+S+G+KRRWWDCIALPDDV+P++E  
Subjt:  GGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVV

Query:  FRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQ
        FRMQI+ +A TSL+LLK AIF++ CE LFSLEIYG+IIGMFELNNLDLVVASPVEDYFLYID+LP   KE+ EEITRP LDALGD YS CCQGTAFFP+Q
Subjt:  FRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQ

Query:  SCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPKCLQE
        SCMNHSC PNAKAFKREEDRDGQA IIA+R I   EEVTISYIDE+LP++ER+ALLADYGF C+C KCL++
Subjt:  SCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPKCLQE

Q5ZIZ2 SET and MYND domain-containing protein 56.3e-1221.97Show/hide
Query:  LMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTG----GKTEP-SRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQ
        L    L LPH  + ++   +       +  YCS  C +A  E +H +LC G      T P ++ +   + + +  +T+ I L+A    +        +KQ
Subjt:  LMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCTG----GKTEP-SRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQ

Query:  ARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWW------DCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIG
        A+ ++                                WW       C    ++ E     +   + +        L  EA+++E     F+ E +  +  
Subjt:  ARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWW------DCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIG

Query:  MFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSY----------SICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAV
        +   N   +  +S +  +    D L  P  ++ E      LDA  D             + C+G+  + +QSC NHSC PNA+      D +    + A+
Subjt:  MFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSY----------SICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAV

Query:  RPIHPGEEVTISYID----EDLPFEERRALLADYGFECRCPKCLQE
          I  GEE+ ISY+D    E       + L  +Y F C CPKCL +
Subjt:  RPIHPGEEVTISYID----EDLPFEERRALLADYGFECRCPKCLQE

Q6GPQ4 SET and MYND domain-containing protein 51.3e-1223.93Show/hide
Query:  YCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHK
        YCS  C +A  E +H +LC     E SR          A+  N +                       EE  +   YP   T  + ++      I     
Subjt:  YCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHK

Query:  RRWW------DCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYID--ELPSPYKEKA
        + WW       C    ++ E     +   + +       +L  +A++ E     F+ E +  +  +   N   +  +S +  +    D  ELP   +EK 
Subjt:  RRWW------DCIALPDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYID--ELPSPYKEKA

Query:  EEITRPLLDALG--DSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID---EDLPFEERRALLAD-YGFECRCP
        + +   L   +       + C+G+  + +QSC NHSC PNA+A     D +    + A+  I PGEE+ ISY+D    D     R+ +L + Y F C CP
Subjt:  EEITRPLLDALG--DSYSICCQGTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID---EDLPFEERRALLAD-YGFECRCP

Query:  KCLQE
        KCL +
Subjt:  KCLQE

Q9LSX7 Peroxisome biogenesis protein 225.6e-8562.28Show/hide
Query:  MAETSS---KDDLLQLIKRFGAYLTLKMSNFF-PISLHNLDSRSVGAIAGFAVAIIFTWRLLRSP-NGHQRRQPKRQTPAPSTSTSSVGLNSN--AQLIP
        MAE+SS    +++++LIKR  AY+  KMS+ F   S+ NLDSRS+GAIAG A+A+IFTWR +R+P    QRRQPKR+     TS+++   + +  A  + 
Subjt:  MAETSS---KDDLLQLIKRFGAYLTLKMSNFF-PISLHNLDSRSVGAIAGFAVAIIFTWRLLRSP-NGHQRRQPKRQTPAPSTSTSSVGLNSN--AQLIP

Query:  SGVCSSSEDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLAL
          V S  ED    +VVD+FFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE SPE+LQKQATVRSSVLEVLLEITKY DLYLME VLDDESE +VL AL
Subjt:  SGVCSSSEDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLAL

Query:  EDAGIFTSGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQ
        E+AG+FTSGGLVKDKVLFCS E GRTSFVRQLEPDWHID+NPEI +QLA     +  +L +  V     AP    S+  +Q
Subjt:  EDAGIFTSGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQ

Arabidopsis top hitse value%identityAlignment
AT2G17900.1 SET domain group 371.3e-0727.44Show/hide
Query:  LKEAIFNEGCEPLFSLEIYGHIIG----MFELNNLDLVVASPVEDYFLYIDELPS-PYKEKAEEITRPLLDALGDSYSIC-----CQGTAFFPIQSCMNH
        +K  + NE   P+ + + Y  +      M E++   +++ + + +    I + PS   +E AE  ++   +A    +SIC      QG   FP+ S +NH
Subjt:  LKEAIFNEGCEPLFSLEIYGHIIG----MFELNNLDLVVASPVEDYFLYIDELPS-PYKEKAEEITRPLLDALGDSYSIC-----CQGTAFFPIQSCMNH

Query:  SCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID-EDLPFEERRALLADYGFECRCPKC
        SC PNA     E+     A + A+  I    E+TISYI+        +++L   Y F C+C +C
Subjt:  SCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYID-EDLPFEERRALLADYGFECRCPKC

AT3G21820.1 histone-lysine N-methyltransferase ATXR25.6e-17362.85Show/hide
Query:  PAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCD
        P P     EYF++L+ +R+C GI VK N   GKGV A++ F E +L LKD +LVG QH+SNK+DCLVCSFCFRF+GSIE QIGRKLYF+ LGVS    C 
Subjt:  PAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCSFCFRFVGSIELQIGRKLYFQELGVSTNHQCD

Query:  MEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCT
                  C +  S++ + +    NE   G SS    T  LP+G+V SLMNG ++LPH+++F +P  + C GGC EAFYCS+SCA ADWE  HSLLCT
Subjt:  MEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEAFYCSKSCAEADWEVFHSLLCT

Query:  GGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVV
        G ++E   REAL +FI+HANDTNDIFLLAAKAI+ TIL+Y+KLK    +++ K  +         S+LLEAW+P+S+G+KRRWWDCIALPDDV+P++E  
Subjt:  GGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIALPDDVEPSNEVV

Query:  FRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQ
        FRMQI+ +A TSL+LLK AIF++ CE LFSLEIYG+IIGMFELNNLDLVVASPVEDYFLYID+LP   KE+ EEITRP LDALGD YS CCQGTAFFP+Q
Subjt:  FRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPIQ

Query:  SCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPKCLQE
        SCMNHSC PNAKAFKREEDRDGQA IIA+R I   EEVTISYIDE+LP++ER+ALLADYGF C+C KCL++
Subjt:  SCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPKCLQE

AT3G21865.1 peroxin 224.0e-8662.28Show/hide
Query:  MAETSS---KDDLLQLIKRFGAYLTLKMSNFF-PISLHNLDSRSVGAIAGFAVAIIFTWRLLRSP-NGHQRRQPKRQTPAPSTSTSSVGLNSN--AQLIP
        MAE+SS    +++++LIKR  AY+  KMS+ F   S+ NLDSRS+GAIAG A+A+IFTWR +R+P    QRRQPKR+     TS+++   + +  A  + 
Subjt:  MAETSS---KDDLLQLIKRFGAYLTLKMSNFF-PISLHNLDSRSVGAIAGFAVAIIFTWRLLRSP-NGHQRRQPKRQTPAPSTSTSSVGLNSN--AQLIP

Query:  SGVCSSSEDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLAL
          V S  ED    +VVD+FFQPVKPTLGQIVRQKLSEGRKVTCRLLG+ILEE SPE+LQKQATVRSSVLEVLLEITKY DLYLME VLDDESE +VL AL
Subjt:  SGVCSSSEDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLAL

Query:  EDAGIFTSGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQ
        E+AG+FTSGGLVKDKVLFCS E GRTSFVRQLEPDWHID+NPEI +QLA     +  +L +  V     AP    S+  +Q
Subjt:  EDAGIFTSGGLVKDKVLFCSMENGRTSFVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQ

AT5G06620.1 SET domain protein 382.4e-0633.77Show/hide
Query:  GTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLAD-YGFECRCPKC
        G A + + S  NH C PNA         +  A +  +R +  GEE+ I YID  + +E R+ +L+  +GF C C +C
Subjt:  GTAFFPIQSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLAD-YGFECRCPKC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAAACTTCTTCCAAAGACGACCTACTCCAGCTGATCAAGCGCTTCGGGGCTTATCTCACTCTCAAGATGTCTAATTTCTTCCCGATCTCTCTCCACAATCTGGA
TTCACGTTCTGTTGGGGCTATTGCTGGATTTGCTGTTGCAATAATTTTCACATGGAGGCTGTTGAGATCACCTAACGGACATCAGAGACGACAACCAAAAAGGCAAACGC
CTGCACCAAGTACTTCTACTTCTAGTGTTGGGTTAAATTCCAATGCGCAATTGATACCTTCTGGAGTTTGTTCATCTTCAGAGGATTTAAGAGCACACAATGTGGTTGAT
GAATTCTTCCAGCCAGTCAAGCCAACTCTGGGTCAAATAGTGAGGCAAAAATTGAGTGAAGGAAGAAAGGTAACATGTCGTTTGCTTGGAATAATTCTTGAGGAAAACAG
TCCAGAGGATCTGCAGAAACAAGCCACTGTGAGGTCCTCGGTATTGGAAGTACTGTTGGAGATTACGAAATATTGTGATCTTTATCTCATGGAAACAGTGTTGGATGATG
AGAGCGAAAAAAGAGTTCTTTTGGCACTTGAAGATGCAGGAATTTTCACATCTGGTGGTCTGGTCAAAGACAAGGTTCTCTTCTGTAGCATGGAGAATGGGCGAACATCT
TTCGTGCGTCAACTGGAACCCGATTGGCATATAGATTCCAATCCTGAAATCATTTCTCAGTTAGCTCCCAGCACCGTTCTGTTACTCACAGAGCTCCGACTGAAGCAAGT
GCTTAACAGCACACCGGCGCCGCCAACTATCGCCTCTGAATATTTTGATCAGCTTGTATGGACAAGGCAGTGTCGTGGTATTCGAGTGAAGCAGAATGAAGCCTTTGGAA
AGGGTGTCGTTGCCGATGCTGCTTTCAAAGAAGGGGACCTTTTCCTGAAAGACCGAATGCTTGTGGGATCTCAGCATACGTCAAATAAGATAGACTGTCTGGTGTGCAGC
TTCTGTTTTCGTTTTGTTGGATCTATAGAACTTCAAATTGGAAGGAAATTATATTTTCAAGAACTTGGAGTTTCCACTAATCATCAATGTGACATGGAGCCTTCATCACC
CATGTCAGAAGATTGCTGGGAAACTGAATCAGATGATGGCCAGGAGATTGCATTAGAAAATAATGAAAGCATGGGAGGATGTTCTTCCGGAAAATCTAAAACTACAGCTT
TACCCAAAGGGCTTGTTGAATCATTGATGAACGGTGGCTTATCATTGCCGCACTCCAATGAGTTTGCCATGCCTCCGGCCATTCCATGTTCTGGGGGATGTGGTGAGGCC
TTCTACTGCAGTAAATCATGTGCAGAGGCTGATTGGGAAGTTTTCCATTCCTTACTTTGCACTGGGGGAAAAACAGAACCATCACGCAGGGAAGCACTGCAAAAATTTAT
ACAACATGCTAATGACACGAATGACATATTCCTCCTTGCTGCAAAGGCAATTTCTTCCACTATTTTGAAGTATAAGAAATTAAAGCAGGCTCGTTCTGAAGAACAAATGA
AATATGGCAAGTATCCTATTCTCAACACTGTTGATCTATCCATTCTTTTGGAGGCATGGAGGCCAATCTCAATGGGACATAAGAGAAGGTGGTGGGATTGCATTGCATTA
CCAGATGATGTTGAACCTTCTAATGAAGTTGTGTTCCGGATGCAAATAAGAGAGATGGCTTTCACGTCATTGCAGCTCCTCAAGGAAGCAATTTTCAACGAGGGATGTGA
ACCATTATTCTCCCTTGAAATCTATGGGCATATAATTGGCATGTTTGAGCTAAATAATCTTGATTTGGTTGTAGCATCACCAGTTGAGGATTACTTCTTGTATATCGACG
AACTTCCGTCTCCTTACAAGGAGAAAGCTGAGGAAATTACCCGACCCCTTCTAGATGCTCTTGGCGACAGCTATTCAATTTGTTGTCAGGGTACAGCCTTTTTCCCCATA
CAAAGTTGTATGAATCATTCCTGCTATCCTAATGCAAAAGCGTTCAAAAGAGAGGAGGATAGAGATGGACAAGCAACCATAATTGCAGTGAGGCCCATACACCCAGGGGA
GGAGGTGACAATTTCATATATAGACGAAGATCTACCATTTGAAGAAAGACGAGCATTACTTGCAGATTATGGATTTGAATGCAGGTGCCCCAAGTGCTTACAAGAGCATC
CATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAAACTTCTTCCAAAGACGACCTACTCCAGCTGATCAAGCGCTTCGGGGCTTATCTCACTCTCAAGATGTCTAATTTCTTCCCGATCTCTCTCCACAATCTGGA
TTCACGTTCTGTTGGGGCTATTGCTGGATTTGCTGTTGCAATAATTTTCACATGGAGGCTGTTGAGATCACCTAACGGACATCAGAGACGACAACCAAAAAGGCAAACGC
CTGCACCAAGTACTTCTACTTCTAGTGTTGGGTTAAATTCCAATGCGCAATTGATACCTTCTGGAGTTTGTTCATCTTCAGAGGATTTAAGAGCACACAATGTGGTTGAT
GAATTCTTCCAGCCAGTCAAGCCAACTCTGGGTCAAATAGTGAGGCAAAAATTGAGTGAAGGAAGAAAGGTAACATGTCGTTTGCTTGGAATAATTCTTGAGGAAAACAG
TCCAGAGGATCTGCAGAAACAAGCCACTGTGAGGTCCTCGGTATTGGAAGTACTGTTGGAGATTACGAAATATTGTGATCTTTATCTCATGGAAACAGTGTTGGATGATG
AGAGCGAAAAAAGAGTTCTTTTGGCACTTGAAGATGCAGGAATTTTCACATCTGGTGGTCTGGTCAAAGACAAGGTTCTCTTCTGTAGCATGGAGAATGGGCGAACATCT
TTCGTGCGTCAACTGGAACCCGATTGGCATATAGATTCCAATCCTGAAATCATTTCTCAGTTAGCTCCCAGCACCGTTCTGTTACTCACAGAGCTCCGACTGAAGCAAGT
GCTTAACAGCACACCGGCGCCGCCAACTATCGCCTCTGAATATTTTGATCAGCTTGTATGGACAAGGCAGTGTCGTGGTATTCGAGTGAAGCAGAATGAAGCCTTTGGAA
AGGGTGTCGTTGCCGATGCTGCTTTCAAAGAAGGGGACCTTTTCCTGAAAGACCGAATGCTTGTGGGATCTCAGCATACGTCAAATAAGATAGACTGTCTGGTGTGCAGC
TTCTGTTTTCGTTTTGTTGGATCTATAGAACTTCAAATTGGAAGGAAATTATATTTTCAAGAACTTGGAGTTTCCACTAATCATCAATGTGACATGGAGCCTTCATCACC
CATGTCAGAAGATTGCTGGGAAACTGAATCAGATGATGGCCAGGAGATTGCATTAGAAAATAATGAAAGCATGGGAGGATGTTCTTCCGGAAAATCTAAAACTACAGCTT
TACCCAAAGGGCTTGTTGAATCATTGATGAACGGTGGCTTATCATTGCCGCACTCCAATGAGTTTGCCATGCCTCCGGCCATTCCATGTTCTGGGGGATGTGGTGAGGCC
TTCTACTGCAGTAAATCATGTGCAGAGGCTGATTGGGAAGTTTTCCATTCCTTACTTTGCACTGGGGGAAAAACAGAACCATCACGCAGGGAAGCACTGCAAAAATTTAT
ACAACATGCTAATGACACGAATGACATATTCCTCCTTGCTGCAAAGGCAATTTCTTCCACTATTTTGAAGTATAAGAAATTAAAGCAGGCTCGTTCTGAAGAACAAATGA
AATATGGCAAGTATCCTATTCTCAACACTGTTGATCTATCCATTCTTTTGGAGGCATGGAGGCCAATCTCAATGGGACATAAGAGAAGGTGGTGGGATTGCATTGCATTA
CCAGATGATGTTGAACCTTCTAATGAAGTTGTGTTCCGGATGCAAATAAGAGAGATGGCTTTCACGTCATTGCAGCTCCTCAAGGAAGCAATTTTCAACGAGGGATGTGA
ACCATTATTCTCCCTTGAAATCTATGGGCATATAATTGGCATGTTTGAGCTAAATAATCTTGATTTGGTTGTAGCATCACCAGTTGAGGATTACTTCTTGTATATCGACG
AACTTCCGTCTCCTTACAAGGAGAAAGCTGAGGAAATTACCCGACCCCTTCTAGATGCTCTTGGCGACAGCTATTCAATTTGTTGTCAGGGTACAGCCTTTTTCCCCATA
CAAAGTTGTATGAATCATTCCTGCTATCCTAATGCAAAAGCGTTCAAAAGAGAGGAGGATAGAGATGGACAAGCAACCATAATTGCAGTGAGGCCCATACACCCAGGGGA
GGAGGTGACAATTTCATATATAGACGAAGATCTACCATTTGAAGAAAGACGAGCATTACTTGCAGATTATGGATTTGAATGCAGGTGCCCCAAGTGCTTACAAGAGCATC
CATAG
Protein sequenceShow/hide protein sequence
MAETSSKDDLLQLIKRFGAYLTLKMSNFFPISLHNLDSRSVGAIAGFAVAIIFTWRLLRSPNGHQRRQPKRQTPAPSTSTSSVGLNSNAQLIPSGVCSSSEDLRAHNVVD
EFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEENSPEDLQKQATVRSSVLEVLLEITKYCDLYLMETVLDDESEKRVLLALEDAGIFTSGGLVKDKVLFCSMENGRTS
FVRQLEPDWHIDSNPEIISQLAPSTVLLLTELRLKQVLNSTPAPPTIASEYFDQLVWTRQCRGIRVKQNEAFGKGVVADAAFKEGDLFLKDRMLVGSQHTSNKIDCLVCS
FCFRFVGSIELQIGRKLYFQELGVSTNHQCDMEPSSPMSEDCWETESDDGQEIALENNESMGGCSSGKSKTTALPKGLVESLMNGGLSLPHSNEFAMPPAIPCSGGCGEA
FYCSKSCAEADWEVFHSLLCTGGKTEPSRREALQKFIQHANDTNDIFLLAAKAISSTILKYKKLKQARSEEQMKYGKYPILNTVDLSILLEAWRPISMGHKRRWWDCIAL
PDDVEPSNEVVFRMQIREMAFTSLQLLKEAIFNEGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDELPSPYKEKAEEITRPLLDALGDSYSICCQGTAFFPI
QSCMNHSCYPNAKAFKREEDRDGQATIIAVRPIHPGEEVTISYIDEDLPFEERRALLADYGFECRCPKCLQEHP