CuGenDBv2

Lsi02G020830 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G020830
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Histone-lysine N-methyltransferase MLL
Genome location: chr02:27029921..27030697
RNA-Seq Expression: Lsi02G020830
Synteny: Lsi02G020830
Gene Ontology terms: GO:0032259 - methylation (biological process)
                     GO:0008168 - methyltransferase activity (molecular function)
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant


Homology
GenBank top hits (e-value, % identity, alignment)
CAA2959695.1 Hypothetical predicted protein [Olea europaea subsp. europaea]; e-value 2.1e-28; identity 35.92%
Query:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGS------------
        MG + R + D+LKL+HPG+++E Y +P++A++++KKYP+ CI RPDVF+FPWI VRP+S+LVPGKVF+L+P RT+++LL++  P                
Subjt:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGS------------

Query:  ------LPSLLSPLSRSSSRPSLPSTTNAGTTPKHLTHLRRRSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKRIVRSSSSHVHDCYQCGNVVSTDV
              L    +P  R  + P+ P  + AG TPKHL H R   +    D  M S  D    +S ++        +    R   +H +  Y+  ++V ++V
Subjt:  ------LPSLLSPLSRSSSRPSLPSTTNAGTTPKHLTHLRRRSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKRIVRSSSSHVHDCYQCGNVVSTDV

Query:  LRESVENGGGGRRGGKRKTTSLRSCMRKPGSAPRLPDLKVRFSIP
         R    NG       +     L SCMRKP S  +L  LKV F+ P
Subjt:  LRESVENGGGGRRGGKRKTTSLRSCMRKPGSAPRLPDLKVRFSIP

KAA0026283.1 Histone-lysine N-methyltransferase MLL [Cucumis melo var. makuwa]; e-value 5.6e-106; identity 81.3%
Query:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLL-SPLSRS
        MGRTTRSEKDVLK+IHPGKHIETYTKPILASEVL+KYPKFCITRPDVFKFPWIVVR DSLLVPGKVF L+PKRTLYRLLK NHPPDGSLPSLL SPLSRS
Subjt:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLL-SPLSRS

Query:  SSRPSLPSTTNAGTTPKHLTHLRR-RSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKR-IVRSSSSHVHDCYQCGNVVSTDVLRESVEN-GGGGRRG
         SRPSLP  +NAGTTPKHLTHLRR + K  GE  G+RSRK ++HVESWLS LPP  VGNKR  VR SSSHVHDCY+CGNV S D+ RE VEN    G R 
Subjt:  SSRPSLPSTTNAGTTPKHLTHLRR-RSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKR-IVRSSSSHVHDCYQCGNVVSTDVLRESVEN-GGGGRRG

Query:  GKRKTTSLRSCMRKPGSAPRLPDLKVRFSIPNEDIVELVTKQRTVIESLSKLATSLIVDVCR
         K   TSLRSCMRKPGSAPRLP+LKVRFSIPNEDIVE V KQRTVIESLSKLATS++VDVCR
Subjt:  GKRKTTSLRSCMRKPGSAPRLPDLKVRFSIPNEDIVELVTKQRTVIESLSKLATSLIVDVCR

KAG6592996.1 hypothetical protein SDJN03_12472, partial [Cucurbita argyrosperma subsp. sororia]; e-value 1.8e-64; identity 61.48%
Query:  MGRTT-RSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLLSPLSRS
        MGRT   SE+ VLKL+HPGK IE YTKPI A+E+L+KYPKFCITRPDVFKFPWIVVR DSLLVPGKVFFL+PKRTLYRLLKAN PPD SL          
Subjt:  MGRTT-RSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLLSPLSRS

Query:  SSRPSLPSTTNAGTTPKHLTHLRRRSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKRIVRSSSSHVHDCYQCGNVVSTDVLRESVENGGGGRRGGKR
           P  P T NAG TP       RR +  GEDGG R RK  S +E      P  VVG+KR VR SSSHVHDCY+CG  VS+  + E   NG      GK 
Subjt:  SSRPSLPSTTNAGTTPKHLTHLRRRSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKRIVRSSSSHVHDCYQCGNVVSTDVLRESVENGGGGRRGGKR

Query:  KTTSLRSCMRKPGSAPRLPDLKVRFSIPNEDIVELVTKQRTVIE
        KTTSLRSCM+KPGSAPRL + +V F IP ED+   VTKQRTV++
Subjt:  KTTSLRSCMRKPGSAPRLPDLKVRFSIPNEDIVELVTKQRTVIE

KGN44593.1 hypothetical protein Csa_016207 [Cucumis sativus]; e-value 3.1e-104; identity 79.69%
Query:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLL-SPLSRS
        MGRTT +EK VLKLIHPGKH+ETYTKPILASEVLKKYPKFCITRPDVFK+PWIVVR DSLLVPGKVF L+PKRTL+RLLK NHPPDGSLPSLL SPLSRS
Subjt:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLL-SPLSRS

Query:  SSRPSLPSTTNAGTTPKHLTHLRR-RSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKR-IVRSSSSHVHDCYQCGNVVSTDVLRESVENGGGGRRGG
         SRPSLP  +NAGTTPKHLTHLRR +SKP GE  G+R+RK ++HVE WLS LPP  VGNKR  VR SSSHVHDCY+CG+V STD+ RE VEN  GG    
Subjt:  SSRPSLPSTTNAGTTPKHLTHLRR-RSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKR-IVRSSSSHVHDCYQCGNVVSTDVLRESVENGGGGRRGG

Query:  KRKTTSLRSCMRKPGSAPRLPDLKVRFSIPNEDIVELVTKQRTVIESLSKLATSLIVDVCR
         + TTSLRSCMRKPGSAPRL +LKVRFSIPNEDIVE V KQRTVIESLSKLA SL+VDVCR
Subjt:  KRKTTSLRSCMRKPGSAPRLPDLKVRFSIPNEDIVELVTKQRTVIESLSKLATSLIVDVCR

OAY34757.1 hypothetical protein MANES_12G044800v8 [Manihot esculenta]; e-value 1.2e-28; identity 36.18%
Query:  TTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLLSPLSRSSSRP
        +T SE  VLKL+ PG+++E Y +P+ A+E+LKKYP+  +TRPDVF++PW+VV+P+S+L  GKVFF++P  TLY L+K++   +      L P ++     
Subjt:  TTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLLSPLSRSSSRP

Query:  SLPSTTNAGTTPKHLT-HLRRRSKPP----------GEDGGMRSRKDDSHVESW-----------LSTLPPPVVGNKRIVRSSSSHVHDCYQCGNVVSTD
          P  + AG+TPKH   H +    PP            D   R+R+ +  V SW           L  L    V N RI+  SS+H  D     N+    
Subjt:  SLPSTTNAGTTPKHLT-HLRRRSKPP----------GEDGGMRSRKDDSHVESW-----------LSTLPPPVVGNKRIVRSSSSHVHDCYQCGNVVSTD

Query:  VLRESVENGGGGRRGGKRKTTSLRSCMRKPGSAPRLPDLKVRFSIP
         L    +  G      K++ T L+SC+RKP SA +   LKV F +P
Subjt:  VLRESVENGGGGRRGGKRKTTSLRSCMRKPGSAPRLPDLKVRFSIP

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K557 Uncharacterized protein; e-value 1.5e-104; identity 79.69%
Query:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLL-SPLSRS
        MGRTT +EK VLKLIHPGKH+ETYTKPILASEVLKKYPKFCITRPDVFK+PWIVVR DSLLVPGKVF L+PKRTL+RLLK NHPPDGSLPSLL SPLSRS
Subjt:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLL-SPLSRS

Query:  SSRPSLPSTTNAGTTPKHLTHLRR-RSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKR-IVRSSSSHVHDCYQCGNVVSTDVLRESVENGGGGRRGG
         SRPSLP  +NAGTTPKHLTHLRR +SKP GE  G+R+RK ++HVE WLS LPP  VGNKR  VR SSSHVHDCY+CG+V STD+ RE VEN  GG    
Subjt:  SSRPSLPSTTNAGTTPKHLTHLRR-RSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKR-IVRSSSSHVHDCYQCGNVVSTDVLRESVENGGGGRRGG

Query:  KRKTTSLRSCMRKPGSAPRLPDLKVRFSIPNEDIVELVTKQRTVIESLSKLATSLIVDVCR
         + TTSLRSCMRKPGSAPRL +LKVRFSIPNEDIVE V KQRTVIESLSKLA SL+VDVCR
Subjt:  KRKTTSLRSCMRKPGSAPRLPDLKVRFSIPNEDIVELVTKQRTVIESLSKLATSLIVDVCR

A0A2C9UV04 Uncharacterized protein; e-value 6.0e-29; identity 36.18%
Query:  TTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLLSPLSRSSSRP
        +T SE  VLKL+ PG+++E Y +P+ A+E+LKKYP+  +TRPDVF++PW+VV+P+S+L  GKVFF++P  TLY L+K++   +      L P ++     
Subjt:  TTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLLSPLSRSSSRP

Query:  SLPSTTNAGTTPKHLT-HLRRRSKPP----------GEDGGMRSRKDDSHVESW-----------LSTLPPPVVGNKRIVRSSSSHVHDCYQCGNVVSTD
          P  + AG+TPKH   H +    PP            D   R+R+ +  V SW           L  L    V N RI+  SS+H  D     N+    
Subjt:  SLPSTTNAGTTPKHLT-HLRRRSKPP----------GEDGGMRSRKDDSHVESW-----------LSTLPPPVVGNKRIVRSSSSHVHDCYQCGNVVSTD

Query:  VLRESVENGGGGRRGGKRKTTSLRSCMRKPGSAPRLPDLKVRFSIP
         L    +  G      K++ T L+SC+RKP SA +   LKV F +P
Subjt:  VLRESVENGGGGRRGGKRKTTSLRSCMRKPGSAPRLPDLKVRFSIP

A0A2P5CID5 Uncharacterized protein; e-value 1.5e-27; identity 36.72%
Query:  RTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKA----NHPPDGSLPSLL---SP
        R   +EK  +KL+HPG+H+E   +P+  +EV+ + P+ CITRPDVFKFPW+V++P+S+L  GKVFFL+P RTLY L+K+    N PP      L     P
Subjt:  RTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKA----NHPPDGSLPSLL---SP

Query:  LSRSSSRPSLPSTTNAGTTPKHLTHLRRRSKPPG-----EDGGMRSRKDDSHVESWLST------------LPPPVVGNKRIVRSSSSHVHDCYQCGNVV
         +  ++    PS   AG TPK+ +H  R + P        D   RS    + V S +++            LP P +  +   R    H+          
Subjt:  LSRSSSRPSLPSTTNAGTTPKHLTHLRRRSKPPG-----EDGGMRSRKDDSHVESWLST------------LPPPVVGNKRIVRSSSSHVHDCYQCGNVV

Query:  STDVLRESVEN--GGGGR----RGGKRKTTS-LRSCMRKPGSAPRLPDLKVRFSIP
        +TD  R  V N  GGG R     GG+R     L+SCMRKPGS  R   L+V F +P
Subjt:  STDVLRESVEN--GGGGR----RGGKRKTTS-LRSCMRKPGSAPRLPDLKVRFSIP

A0A314YMV0 Uncharacterized protein; e-value 1.9e-27; identity 35.25%
Query:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLLSPLSRS-
        MG  TR E+ VLKL+  G+ +E + +PI A EV++KYP+  ITRPD+F+FPWIVV+ +++L PGKVFF++P RT+YRLLKA    D S  S  S   +S 
Subjt:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLLSPLSRS-

Query:  --------SSRPSLPSTTNAGTTPKHLTHLRRRS---------KPPGE----DGGMRSRKDDSHVESW--LSTLPPPVVGNKR-------IVRSSSSHVH
                S + + P    AG TPKH    RR            P  E    D G+++   DS V +W  LS+      G  +        +++     +
Subjt:  --------SSRPSLPSTTNAGTTPKHLTHLRRRS---------KPPGE----DGGMRSRKDDSHVESW--LSTLPPPVVGNKR-------IVRSSSSHVH

Query:  DCYQCGNVVSTDVLRESVENGGGGRRGGKRKTTSLRSCMRKPGSAPRLPDLKVRFSIPNED
        D  +  N+ +     E  ++         ++ T L+SC+RKP SA +   LKV F++PN+D
Subjt:  DCYQCGNVVSTDVLRESVENGGGGRRGGKRKTTSLRSCMRKPGSAPRLPDLKVRFSIPNED

A0A5D3DDU1 Histone-lysine N-methyltransferase MLL; e-value 2.7e-106; identity 81.3%
Query:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLL-SPLSRS
        MGRTTRSEKDVLK+IHPGKHIETYTKPILASEVL+KYPKFCITRPDVFKFPWIVVR DSLLVPGKVF L+PKRTLYRLLK NHPPDGSLPSLL SPLSRS
Subjt:  MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLL-SPLSRS

Query:  SSRPSLPSTTNAGTTPKHLTHLRR-RSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKR-IVRSSSSHVHDCYQCGNVVSTDVLRESVEN-GGGGRRG
         SRPSLP  +NAGTTPKHLTHLRR + K  GE  G+RSRK ++HVESWLS LPP  VGNKR  VR SSSHVHDCY+CGNV S D+ RE VEN    G R 
Subjt:  SSRPSLPSTTNAGTTPKHLTHLRR-RSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKR-IVRSSSSHVHDCYQCGNVVSTDVLRESVEN-GGGGRRG

Query:  GKRKTTSLRSCMRKPGSAPRLPDLKVRFSIPNEDIVELVTKQRTVIESLSKLATSLIVDVCR
         K   TSLRSCMRKPGSAPRLP+LKVRFSIPNEDIVE V KQRTVIESLSKLATS++VDVCR
Subjt:  GKRKTTSLRSCMRKPGSAPRLPDLKVRFSIPNEDIVELVTKQRTVIESLSKLATSLIVDVCR

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G29195.1 unknown protein; e-value 7.8e-05; identity 34.25%
Query:  DVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRP----------DVFKFPWIV-VRPDSLLVPGKVFFLLP
        DV++++H   H+E  +  I ASE++K +PK  + +P          DV     IV V P++ L  GK++FL+P
Subjt:  DVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRP----------DVFKFPWIV-VRPDSLLVPGKVFFLLP


Sequences
CDS sequence
ATGGGAAGAACAACAAGATCAGAAAAAGATGTATTAAAGTTAATTCATCCAGGAAAACACATTGAGACTTATACCAAACCAATACTTGCATCTGAAGTCCTAAAAAAATA
CCCTAAATTTTGCATCACAAGACCTGATGTATTCAAATTTCCATGGATTGTTGTGAGGCCTGATTCCCTTTTGGTCCCTGGAAAAGTGTTCTTTCTTCTCCCTAAACGCA
CCCTCTATCGCCTCTTGAAGGCCAACCATCCACCCGATGGCTCGTTGCCATCGCTGTTGTCGCCTCTATCCCGCTCTTCCTCGAGGCCATCTTTACCTTCAACGACGAAT
GCGGGAACGACTCCAAAGCACCTAACTCATCTCCGCAGGCGGTCGAAGCCGCCGGGGGAAGACGGTGGAATGAGAAGTAGGAAAGACGACTCTCATGTTGAGTCGTGGTT
GTCAACGTTGCCGCCGCCCGTCGTTGGTAATAAGAGAATTGTGAGGTCTTCTTCCTCACATGTGCATGATTGCTACCAATGTGGTAATGTGGTTAGTACCGATGTGTTGA
GGGAGAGCGTTGAGAATGGCGGCGGCGGCAGACGTGGCGGAAAAAGGAAAACGACGTCGTTGAGATCTTGCATGAGAAAGCCTGGGAGTGCTCCTCGTTTGCCTGATTTG
AAAGTGAGATTTTCAATTCCAAATGAGGATATTGTTGAACTTGTAACTAAACAGAGAACGGTGATTGAGTCCCTTTCAAAACTTGCAACCTCATTAATCGTTGACGTATG
TAGATGA
mRNA sequence
ATGGGAAGAACAACAAGATCAGAAAAAGATGTATTAAAGTTAATTCATCCAGGAAAACACATTGAGACTTATACCAAACCAATACTTGCATCTGAAGTCCTAAAAAAATA
CCCTAAATTTTGCATCACAAGACCTGATGTATTCAAATTTCCATGGATTGTTGTGAGGCCTGATTCCCTTTTGGTCCCTGGAAAAGTGTTCTTTCTTCTCCCTAAACGCA
CCCTCTATCGCCTCTTGAAGGCCAACCATCCACCCGATGGCTCGTTGCCATCGCTGTTGTCGCCTCTATCCCGCTCTTCCTCGAGGCCATCTTTACCTTCAACGACGAAT
GCGGGAACGACTCCAAAGCACCTAACTCATCTCCGCAGGCGGTCGAAGCCGCCGGGGGAAGACGGTGGAATGAGAAGTAGGAAAGACGACTCTCATGTTGAGTCGTGGTT
GTCAACGTTGCCGCCGCCCGTCGTTGGTAATAAGAGAATTGTGAGGTCTTCTTCCTCACATGTGCATGATTGCTACCAATGTGGTAATGTGGTTAGTACCGATGTGTTGA
GGGAGAGCGTTGAGAATGGCGGCGGCGGCAGACGTGGCGGAAAAAGGAAAACGACGTCGTTGAGATCTTGCATGAGAAAGCCTGGGAGTGCTCCTCGTTTGCCTGATTTG
AAAGTGAGATTTTCAATTCCAAATGAGGATATTGTTGAACTTGTAACTAAACAGAGAACGGTGATTGAGTCCCTTTCAAAACTTGCAACCTCATTAATCGTTGACGTATG
TAGATGA
Protein sequence
MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLLSPLSRSSSRPSLPSTTN
AGTTPKHLTHLRRRSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKRIVRSSSSHVHDCYQCGNVVSTDVLRESVENGGGGRRGGKRKTTSLRSCMRKPGSAPRLPDL
KVRFSIPNEDIVELVTKQRTVIESLSKLATSLIVDVCR
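
Consistency check (illustrative)
The genome location and the protein sequence in this record can be cross-checked with a short Python sketch. It assumes the coordinates are 1-based and inclusive, and that the gene model is a single uninterrupted coding exon ending in one stop codon. The record shows identical CDS and mRNA sequences, which is consistent with that, but it does not state it. The helper name `span_length` is hypothetical and not part of any CuGenDB tool.

```python
import re

# Protein sequence copied from the "Protein sequence" section of this record.
PROTEIN = (
    "MGRTTRSEKDVLKLIHPGKHIETYTKPILASEVLKKYPKFCITRPDVFKFPWIVVRPDSLLVPGKVFFLLPKRTLYRLLKANHPPDGSLPSLLSPLSRSSSRPSLPSTTN"
    "AGTTPKHLTHLRRRSKPPGEDGGMRSRKDDSHVESWLSTLPPPVVGNKRIVRSSSSHVHDCYQCGNVVSTDVLRESVENGGGGRRGGKRKTTSLRSCMRKPGSAPRLPDL"
    "KVRFSIPNEDIVELVTKQRTVIESLSKLATSLIVDVCR"
)


def span_length(location: str) -> int:
    """Return the length of a 'chrom:start..end' interval.

    Assumption: start and end are 1-based and both inclusive.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1


# Genomic span of the gene, from the "Genome location" field.
gene_len = span_length("chr02:27029921..27030697")

# Expected CDS length: three bases per residue plus one stop codon.
expected_cds_len = 3 * (len(PROTEIN) + 1)

print(gene_len, expected_cds_len)
```

If both assumptions hold, the two printed numbers agree. A mismatch would suggest introns, untranslated regions, or a different coordinate convention.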