; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019711 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019711
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPHD-type domain-containing protein
Genome locationChr04:24772360..24774473
RNA-Seq ExpressionHG10019711
SyntenyHG10019711
Gene Ontology termsNA
InterPro domainsIPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR034732 - Extended PHD (ePHD) domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058834.1 Tat-binding-7-like protein [Cucumis melo var. makuwa]1.1e-8445.49Show/hide
Query:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRK------------------------
        L  SGNRSG RL KKHKRLDAICEKEYSRNHG VNENV+ LGT+EAD GLR+S+ VRR     +L+      RK++                        
Subjt:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRK------------------------

Query:  LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR------------------------------
        L  ET GNW    RSRNRNLG RVDKG R SRKRKLFDEII VKVRN GMR+DL EEK KME+ ESMVGR                              
Subjt:  LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR------------------------------

Query:  --EDMLAIN----------------------------NEDEEEEEVEEVEEEEGEEGGGG----------------------------------------
          E+ML I+                             E+EEEEE EE EEEE EE   G                                        
Subjt:  --EDMLAIN----------------------------NEDEEEEEVEEVEEEEGEEGGGG----------------------------------------

Query:  ----------------------GGGGGGQ-----------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTLGKAR
                                 G  Q                 +AA+VSTNEVVGGRSC+EKAVDLGKF EKSRQHG  LNLKKFTDSS G LGKAR
Subjt:  ----------------------GGGGGGQ-----------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTLGKAR

Query:  IQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLKNVRA
        I+EGRRCGLCGGGI+G   +   +       +                          G +LG INDRY IA IW+H+H AVWS EVYFAGLGCLKNVRA
Subjt:  IQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLKNVRA

Query:  ALCRGRALKCTRCGRPGATIG
        ALCRGRALKCTRCGRPGATIG
Subjt:  ALCRGRALKCTRCGRPGATIG

TYK11250.1 Tat-binding-7-like protein [Cucumis melo var. makuwa]3.1e-8445.14Show/hide
Query:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRK------------------------
        L  SGNRSG RL KKHKRLDAICEKEYSRNHG VNENV+ LGT+EAD GLR+S+ VRR     +L+      RK++                        
Subjt:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRK------------------------

Query:  LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR------------------------------
        L  ET GNW    RSRNRNLG RVDKG R SRKRKLFDEII VKVRN GMR+DL EEK KME+ ESMVGR                              
Subjt:  LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR------------------------------

Query:  --EDMLAIN--------------------------------NEDEEEEEVEEVEEEEGEEGGGG------------------------------------
          E+ML I+                                 E+EEEEE EE EEEE EE   G                                    
Subjt:  --EDMLAIN--------------------------------NEDEEEEEVEEVEEEEGEEGGGG------------------------------------

Query:  --------------------------GGGGGGQ-----------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTL
                                     G  Q                 +AA+VSTNEVVGGRSC+EKAVDLGKF EKSRQHG  LNLKKFTDSS G L
Subjt:  --------------------------GGGGGGQ-----------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTL

Query:  GKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLK
        GKARI+EGRRCGLCGGGI+G   +   +       +                          G +LG INDRY IA IW+H+H AVWS EVYFAGLGCLK
Subjt:  GKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLK

Query:  NVRAALCRGRALKCTRCGRPGATIG
        NVRAALCRGRALKCTRCGRPGATIG
Subjt:  NVRAALCRGRALKCTRCGRPGATIG

XP_008456208.1 PREDICTED: uncharacterized protein LOC103496212 [Cucumis melo]3.0e-8743.16Show/hide
Query:  IDLAPPAAQGFVIFCSLLRI-------CLSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRI
        I L P     F+    ++R+        L  SGNRSG RL KKHKRLDAICEKEYSRNHG VNENV+ LGT+EAD GLR+S+ VRR     +L+      
Subjt:  IDLAPPAAQGFVIFCSLLRI-------CLSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRI

Query:  RKRK------------------------LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR--
        RK++                        L  ET GNW    RSRNRNLG RVDKG R SRKRKLFDEII VKVRN GMR+DL EEK KME+ ESMVGR  
Subjt:  RKRK------------------------LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR--

Query:  ------------------------------EDMLAINNEDEEE------EEVEEVEEEEGEEGGGGGGGGGGQ---------------------------
                                      E+ML I+ +DEEE      EE EE EEEE EE   GGGGGGG+                           
Subjt:  ------------------------------EDMLAINNEDEEE------EEVEEVEEEEGEEGGGGGGGGGGQ---------------------------

Query:  -----------------------------------------------------------------------------------------------KAALV
                                                                                                       +AA+V
Subjt:  -----------------------------------------------------------------------------------------------KAALV

Query:  STNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS----
        STNEVVGGRSC+EKAVDLGKF EKSRQHG  LNLKKFTDSS G LGKARI+EGRRCGLCGGGI+G   +   +       +                   
Subjt:  STNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS----

Query:  -------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG
               G +LG INDRY IA IW+H+H AVWS EVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG
Subjt:  -------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG

XP_031739139.1 uncharacterized protein LOC101208571 [Cucumis sativus]2.1e-8544.87Show/hide
Query:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRK---------------------RKLFD
        L+ SGNRSG RL KKHKRLDAICEKEYSRNHG VNENVSGLGT+EAD GLR+S+ VRR           R+ R+                       L D
Subjt:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRK---------------------RKLFD

Query:  ETHGNWRSR----NRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR--------------------------------E
        ET GNWRSR    +RNLG RVDKG R SRKRKLFDEI+ VKVRN GMR+DL EEKG+ME+ ES+VGR                                +
Subjt:  ETHGNWRSR----NRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR--------------------------------E

Query:  DMLAIN------------------------------NEDEEEEEVEEVEEEEGEEGGGG----------GGG----------------------------
        DML I+                               E+EEEEE EE EEEEGEE   G          G G                            
Subjt:  DMLAIN------------------------------NEDEEEEEVEEVEEEEGEEGGGG----------GGG----------------------------

Query:  ----------------------------GGGQ-------------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGT
                                      G+                    AA+VSTNEVVGGRSC+EKAVD+GKF EKSR+HG  LNLKKFTDSS G 
Subjt:  ----------------------------GGGQ-------------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGT

Query:  LGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCL
        LGKARI+EGRRCGLCGGGI+G   +   +       +                          G +LG INDRY IA IW+H+H AVWS EVYFAGLGCL
Subjt:  LGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCL

Query:  KNVRAALCRGRALKCTRCGRPGATIG
        KNVRAALCRGRALKCTRCGRPGATIG
Subjt:  KNVRAALCRGRALKCTRCGRPGATIG

XP_038898386.1 uncharacterized protein LOC120086038 [Benincasa hispida]4.6e-8845.45Show/hide
Query:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKR---------------------KLFD
        L  SGNRSGSRL KKHKRLDAICEKEYSRNHG VNENVS L TVE DLGLR+S+ VRR           R+ R+                       L D
Subjt:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKR---------------------KLFD

Query:  ETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVG---------------------------------R
        ET GNW    RSRNRNLG RV+KGTR SRKRKLFDEII VKVR++GMRM L E KG+MEY ESMVG                                 R
Subjt:  ETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVG---------------------------------R

Query:  EDMLAINNEDEEEEEVEEVEEEEGEEGGGGGGGGGGQ---------------------------------------------------------------
        EDML+INNEDEEEEE  E EEEE EE   G     G+                                                               
Subjt:  EDMLAINNEDEEEEEVEEVEEEEGEEGGGGGGGGGGQ---------------------------------------------------------------

Query:  -----------------------------------------------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSI
                                                             +AA+ STNEVVGGRSC+EKA DLGKFAEKSRQHGG LN KKFTDSS 
Subjt:  -----------------------------------------------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSI

Query:  GTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLG
        G LGKARI+EGRRCGLCGGGI+G   +   +     E +                          G +LG INDRY IA IW+H+H AVWS EVYFAGLG
Subjt:  GTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLG

Query:  CLKNVRAALCRGRALKCTRCGRPGATIG
        CLKNVRAALCRGRALKCTRCGRPGATIG
Subjt:  CLKNVRAALCRGRALKCTRCGRPGATIG

TrEMBL top hitse value%identityAlignment
A0A0A0L9H9 PHD-type domain-containing protein1.3e-8544.78Show/hide
Query:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRK---------------------RKLFD
        L+ SGNRSG RL KKHKRLDAICEKEYSRNHG VNENVSGLGT+EAD GLR+S+ VRR           R+ R+                       L D
Subjt:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRK---------------------RKLFD

Query:  ETHGNWRSR----NRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR--------------------------------E
        ET GNWRSR    +RNLG RVDKG R SRKRKLFDEI+ VKVRN GMR+DL EEKG+ME+ ES+VGR                                +
Subjt:  ETHGNWRSR----NRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR--------------------------------E

Query:  DMLAIN-------------------------------NEDEEEEEVEEVEEEEGEEGGGG----------GGG---------------------------
        DML I+                                E+EEEEE EE EEEEGEE   G          G G                           
Subjt:  DMLAIN-------------------------------NEDEEEEEVEEVEEEEGEEGGGG----------GGG---------------------------

Query:  -----------------------------GGGQ-------------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIG
                                       G+                    AA+VSTNEVVGGRSC+EKAVD+GKF EKSR+HG  LNLKKFTDSS G
Subjt:  -----------------------------GGGQ-------------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIG

Query:  TLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGC
         LGKARI+EGRRCGLCGGGI+G   +   +       +                          G +LG INDRY IA IW+H+H AVWS EVYFAGLGC
Subjt:  TLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGC

Query:  LKNVRAALCRGRALKCTRCGRPGATIG
        LKNVRAALCRGRALKCTRCGRPGATIG
Subjt:  LKNVRAALCRGRALKCTRCGRPGATIG

A0A1S3C2T2 uncharacterized protein LOC1034962121.4e-8743.16Show/hide
Query:  IDLAPPAAQGFVIFCSLLRI-------CLSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRI
        I L P     F+    ++R+        L  SGNRSG RL KKHKRLDAICEKEYSRNHG VNENV+ LGT+EAD GLR+S+ VRR     +L+      
Subjt:  IDLAPPAAQGFVIFCSLLRI-------CLSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRI

Query:  RKRK------------------------LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR--
        RK++                        L  ET GNW    RSRNRNLG RVDKG R SRKRKLFDEII VKVRN GMR+DL EEK KME+ ESMVGR  
Subjt:  RKRK------------------------LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR--

Query:  ------------------------------EDMLAINNEDEEE------EEVEEVEEEEGEEGGGGGGGGGGQ---------------------------
                                      E+ML I+ +DEEE      EE EE EEEE EE   GGGGGGG+                           
Subjt:  ------------------------------EDMLAINNEDEEE------EEVEEVEEEEGEEGGGGGGGGGGQ---------------------------

Query:  -----------------------------------------------------------------------------------------------KAALV
                                                                                                       +AA+V
Subjt:  -----------------------------------------------------------------------------------------------KAALV

Query:  STNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS----
        STNEVVGGRSC+EKAVDLGKF EKSRQHG  LNLKKFTDSS G LGKARI+EGRRCGLCGGGI+G   +   +       +                   
Subjt:  STNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS----

Query:  -------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG
               G +LG INDRY IA IW+H+H AVWS EVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG
Subjt:  -------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG

A0A5A7UUP2 Tat-binding-7-like protein5.1e-8545.49Show/hide
Query:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRK------------------------
        L  SGNRSG RL KKHKRLDAICEKEYSRNHG VNENV+ LGT+EAD GLR+S+ VRR     +L+      RK++                        
Subjt:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRK------------------------

Query:  LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR------------------------------
        L  ET GNW    RSRNRNLG RVDKG R SRKRKLFDEII VKVRN GMR+DL EEK KME+ ESMVGR                              
Subjt:  LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR------------------------------

Query:  --EDMLAIN----------------------------NEDEEEEEVEEVEEEEGEEGGGG----------------------------------------
          E+ML I+                             E+EEEEE EE EEEE EE   G                                        
Subjt:  --EDMLAIN----------------------------NEDEEEEEVEEVEEEEGEEGGGG----------------------------------------

Query:  ----------------------GGGGGGQ-----------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTLGKAR
                                 G  Q                 +AA+VSTNEVVGGRSC+EKAVDLGKF EKSRQHG  LNLKKFTDSS G LGKAR
Subjt:  ----------------------GGGGGGQ-----------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTLGKAR

Query:  IQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLKNVRA
        I+EGRRCGLCGGGI+G   +   +       +                          G +LG INDRY IA IW+H+H AVWS EVYFAGLGCLKNVRA
Subjt:  IQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLKNVRA

Query:  ALCRGRALKCTRCGRPGATIG
        ALCRGRALKCTRCGRPGATIG
Subjt:  ALCRGRALKCTRCGRPGATIG

A0A5D3CIS0 Tat-binding-7-like protein1.5e-8445.14Show/hide
Query:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRK------------------------
        L  SGNRSG RL KKHKRLDAICEKEYSRNHG VNENV+ LGT+EAD GLR+S+ VRR     +L+      RK++                        
Subjt:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRK------------------------

Query:  LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR------------------------------
        L  ET GNW    RSRNRNLG RVDKG R SRKRKLFDEII VKVRN GMR+DL EEK KME+ ESMVGR                              
Subjt:  LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR------------------------------

Query:  --EDMLAIN--------------------------------NEDEEEEEVEEVEEEEGEEGGGG------------------------------------
          E+ML I+                                 E+EEEEE EE EEEE EE   G                                    
Subjt:  --EDMLAIN--------------------------------NEDEEEEEVEEVEEEEGEEGGGG------------------------------------

Query:  --------------------------GGGGGGQ-----------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTL
                                     G  Q                 +AA+VSTNEVVGGRSC+EKAVDLGKF EKSRQHG  LNLKKFTDSS G L
Subjt:  --------------------------GGGGGGQ-----------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTL

Query:  GKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLK
        GKARI+EGRRCGLCGGGI+G   +   +       +                          G +LG INDRY IA IW+H+H AVWS EVYFAGLGCLK
Subjt:  GKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLK

Query:  NVRAALCRGRALKCTRCGRPGATIG
        NVRAALCRGRALKCTRCGRPGATIG
Subjt:  NVRAALCRGRALKCTRCGRPGATIG

A0A6J1CP50 uncharacterized protein LOC111012888 isoform X23.1e-8244.4Show/hide
Query:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKR---------------------KLFD
        L  SGNRSGSRL KKHKRLDAICEKEYSRNHG VNEN SGLGT E D GLR+SN VRR           R+ R++                      L D
Subjt:  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKR---------------------KLFD

Query:  ETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVG---------------------------------R
        E  GNW    R+RN NLG RVDKG R SRKRKLFD I  VKV+++GM+MDL E+KGK+E  ESMVG                                 R
Subjt:  ETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVG---------------------------------R

Query:  EDMLAINNED------EEEEEVEEVEEEEGEEGG-----GGG-----GGGGGQ-----------------------------------------------
        E  LAIN ED      EEEEEVEE EEEE EE G     G G     G G G+                                               
Subjt:  EDMLAINNED------EEEEEVEEVEEEEGEEGG-----GGG-----GGGGGQ-----------------------------------------------

Query:  ----------------------------------------------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIG
                                                            +AA VSTNEVVGGR C EK VDLGKFAEKS QHGG LNLKKF DSS G
Subjt:  ----------------------------------------------------KAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIG

Query:  TLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGC
          GKA I+EGRRCGLCGGGI+G   +   +   + E +                          G +LG INDRY IA IW+H+H AVWS EVYFAGLGC
Subjt:  TLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTS-----------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGC

Query:  LKNVRAALCRGRALKCTRCGRPGATIG
        LKNVRAALCRGRALKCTRCGRPGATIG
Subjt:  LKNVRAALCRGRALKCTRCGRPGATIG

SwissProt top hitse value%identityAlignment
O08550 Histone-lysine N-methyltransferase 2B1.9e-0751.02Show/hide
Query:  WIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC
        W H + A+WS+EV+    G LKNV AA+ RGR ++C  C +PGAT+G C
Subjt:  WIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC

P20659 Histone-lysine N-methyltransferase trithorax7.0e-0748.98Show/hide
Query:  EIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG
        + W+H + A+WS+EV+    G L+NV +A+ RGR +KCT CG  GAT+G
Subjt:  EIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG

Q03164 Histone-lysine N-methyltransferase 2A2.7e-0648.98Show/hide
Query:  WIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC
        W H + A+WS+EV+    G LKNV  A+ RG+ L+C  C +PGAT+G C
Subjt:  WIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC

Q24742 Histone-lysine N-methyltransferase trithorax1.2e-0648.98Show/hide
Query:  EIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG
        + W+H + A+WS+EV+    G L+NV +A+ RGR +KCT CG  GAT+G
Subjt:  EIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG

Q9UMN6 Histone-lysine N-methyltransferase 2B1.9e-0751.02Show/hide
Query:  WIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC
        W H + A+WS+EV+    G LKNV AA+ RGR ++C  C +PGAT+G C
Subjt:  WIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC

Arabidopsis top hitse value%identityAlignment
AT3G15120.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein1.5e-2546.76Show/hide
Query:  KKFTDS---SIGTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKL---VVALQLQRNKIMTS--------GMVLGRINDRYDIAEIWIHRHYAV
        KK  DS   S   LGK   ++ RRCGLCG G +G L +   +     +++      + + Q+  I+          G +LG INDRY I+  W+H++ AV
Subjt:  KKFTDS---SIGTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKL---VVALQLQRNKIMTS--------GMVLGRINDRYDIAEIWIHRHYAV

Query:  WSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG
        WS EVYFAG+GCLKN+RAAL RGR+LKCTRC RPGAT G
Subjt:  WSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG

AT3G15120.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.1e-0227.32Show/hide
Query:  SGNRSGSRLKKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRKLFDETHGNWRSRNRNLGTRVD-----KG
        SG+ SG   KK K+L AICE+EY +NHG   +   G G   AD  LR+S+ VR+  S  +L       +KR+ F+++  +     RN     D     K 
Subjt:  SGNRSGSRLKKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRKLFDETHGNWRSRNRNLGTRVD-----KG

Query:  TRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGREDMLAINNEDEEEEEVE-----------EVEEEEGEEGGG
           SR++K       V  + +G +  + + K K+ +        +    ++ +EE+  ++           +V+E E  E GG
Subjt:  TRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGREDMLAINNEDEEEEEVE-----------EVEEEEGEEGGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCTGGAAACCAACGAAGAAAGAAGTCCTGAAGATCCATCGATCGACCTCGCTCCTCCTGCCGCTCAAGGCTTCGTTATTTTTTGTTCTCTCCTTCGAATCTGCCT
CTCTCTGAGTGGTAATCGGTCTGGGTCTAGGCTTAAGAAGCACAAGAGGCTTGATGCCATATGCGAGAAAGAGTATAGTCGAAACCATGGCTATGTGAATGAGAATGTCA
GTGGGTTGGGGACTGTGGAGGCTGATCTTGGGCTTAGGCAGAGCAACCATGTTCGTCGGCCCCAGTCCTGCTGGATGCTTGTTCTATGTCAAAGAAGAATCAGGAAAAGG
AAACTTTTTGATGAGACGCATGGGAATTGGAGATCAAGGAACAGAAATTTGGGGACTAGAGTGGACAAAGGTACTCGGGAAAGCAGGAAAAGGAAACTTTTTGATGAAAT
TATTGTTGTGAAAGTAAGAAACAATGGAATGAGGATGGATTTGGCTGAGGAAAAAGGGAAAATGGAATACGTGGAATCTATGGTTGGAAGGGAAGACATGTTGGCAATCA
ATAATGAAGATGAGGAGGAGGAAGAAGTAGAAGAAGTAGAAGAAGAAGAAGGGGAAGAAGGAGGAGGAGGAGGTGGTGGTGGTGGAGGGCAAAAAGCTGCCTTAGTTTCA
ACAAATGAAGTGGTAGGTGGAAGGTCTTGCAGTGAGAAAGCTGTTGATTTGGGTAAGTTTGCTGAAAAGTCTAGGCAACATGGTGGCTATTTAAATTTAAAGAAGTTTAC
AGACAGTTCCATAGGTACTTTGGGTAAGGCTCGCATTCAAGAGGGCAGAAGGTGTGGATTGTGTGGAGGAGGAATTAATGGTAACCTCTCAAGAAGTTGGTTCAGGATTC
GGGTGAGAGTGGAAATGAAGCTTGTAGTGGCTCTTCAGCTTCAGAGGAACAAAATTATGACAAGTGGGATGGTTTTAGGTCGAATTAATGATCGTTATGACATTGCTGAA
ATATGGATCCATCGACACTATGCAGTTTGGAGCTCCGAGGTTTATTTTGCTGGATTGGGATGCTTGAAAAATGTAAGGGCTGCTCTTTGCAGGGGAAGAGCATTGAAGTG
CACTCGGTGTGGGAGACCTGGTGCAACCATCGGATCGTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTCTGGAAACCAACGAAGAAAGAAGTCCTGAAGATCCATCGATCGACCTCGCTCCTCCTGCCGCTCAAGGCTTCGTTATTTTTTGTTCTCTCCTTCGAATCTGCCT
CTCTCTGAGTGGTAATCGGTCTGGGTCTAGGCTTAAGAAGCACAAGAGGCTTGATGCCATATGCGAGAAAGAGTATAGTCGAAACCATGGCTATGTGAATGAGAATGTCA
GTGGGTTGGGGACTGTGGAGGCTGATCTTGGGCTTAGGCAGAGCAACCATGTTCGTCGGCCCCAGTCCTGCTGGATGCTTGTTCTATGTCAAAGAAGAATCAGGAAAAGG
AAACTTTTTGATGAGACGCATGGGAATTGGAGATCAAGGAACAGAAATTTGGGGACTAGAGTGGACAAAGGTACTCGGGAAAGCAGGAAAAGGAAACTTTTTGATGAAAT
TATTGTTGTGAAAGTAAGAAACAATGGAATGAGGATGGATTTGGCTGAGGAAAAAGGGAAAATGGAATACGTGGAATCTATGGTTGGAAGGGAAGACATGTTGGCAATCA
ATAATGAAGATGAGGAGGAGGAAGAAGTAGAAGAAGTAGAAGAAGAAGAAGGGGAAGAAGGAGGAGGAGGAGGTGGTGGTGGTGGAGGGCAAAAAGCTGCCTTAGTTTCA
ACAAATGAAGTGGTAGGTGGAAGGTCTTGCAGTGAGAAAGCTGTTGATTTGGGTAAGTTTGCTGAAAAGTCTAGGCAACATGGTGGCTATTTAAATTTAAAGAAGTTTAC
AGACAGTTCCATAGGTACTTTGGGTAAGGCTCGCATTCAAGAGGGCAGAAGGTGTGGATTGTGTGGAGGAGGAATTAATGGTAACCTCTCAAGAAGTTGGTTCAGGATTC
GGGTGAGAGTGGAAATGAAGCTTGTAGTGGCTCTTCAGCTTCAGAGGAACAAAATTATGACAAGTGGGATGGTTTTAGGTCGAATTAATGATCGTTATGACATTGCTGAA
ATATGGATCCATCGACACTATGCAGTTTGGAGCTCCGAGGTTTATTTTGCTGGATTGGGATGCTTGAAAAATGTAAGGGCTGCTCTTTGCAGGGGAAGAGCATTGAAGTG
CACTCGGTGTGGGAGACCTGGTGCAACCATCGGATCGTGTTGA
Protein sequenceShow/hide protein sequence
MPLETNEERSPEDPSIDLAPPAAQGFVIFCSLLRICLSLSGNRSGSRLKKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKR
KLFDETHGNWRSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGREDMLAINNEDEEEEEVEEVEEEEGEEGGGGGGGGGGQKAALVS
TNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTSGMVLGRINDRYDIAE
IWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC