; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G01940 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G01940
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLSD1 zinc finger family protein
Genome locationClcChr10:1803918..1809626
RNA-Seq ExpressionClc10G01940
SyntenyClc10G01940
Gene Ontology termsGO:0009793 - embryo development ending in seed dormancy (biological process)
GO:0034051 - negative regulation of plant-type hypersensitive response (biological process)
GO:0045595 - regulation of cell differentiation (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR005513 - Late embryogenesis abundant protein, LEA_1 subgroup
IPR005735 - Zinc finger, LSD1-type
IPR040319 - LSD1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650032.1 hypothetical protein Csa_010851 [Cucumis sativus]2.9e-4587.25Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM
        MPFPPVTC Q+QI+CSGCK+LL+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TLLMY RGA SVQCSCCRT+NAAS+ANQMAHINCGNCRVLLM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM

Query:  YQ
        YQ
Subjt:  YQ

XP_008448702.2 PREDICTED: protein LSD1 isoform X2 [Cucumis melo]8.6e-4267.63Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASK-------------ANQM
        MPFP VTC Q+QI+CSGCK++L+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TLLMY RGATSVQCSCCRT+NAASK             ANQM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASK-------------ANQM

Query:  AHINCGNCRVLLMYQ-SLGSMKMNPKSINTFYWFLMKKE
        AH NCGNCRVLLMYQ    S+K    +  T    L K+E
Subjt:  AHINCGNCRVLLMYQ-SLGSMKMNPKSINTFYWFLMKKE

XP_008448703.1 PREDICTED: protein LOL1 isoform X1 [Cucumis melo]1.4e-4474.6Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM
        MPFP VTC Q+QI+CSGCK++L+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TLLMY RGATSVQCSCCRT+NAASKANQMAH NCGNCRVLLM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM

Query:  YQ-SLGSMKMNPKSINTFYWFLMKKE
        YQ    S+K    +  T    L K+E
Subjt:  YQ-SLGSMKMNPKSINTFYWFLMKKE

XP_031738784.1 protein LSD1 [Cucumis sativus]2.9e-4587.25Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM
        MPFPPVTC Q+QI+CSGCK+LL+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TLLMY RGA SVQCSCCRT+NAAS+ANQMAHINCGNCRVLLM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM

Query:  YQ
        YQ
Subjt:  YQ

XP_038903355.1 protein LSD1-like [Benincasa hispida]4.9e-4588.24Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM
        MPFPPVTC ++ IVCSGCK LL+YPAGAT ICCALCH VSPVPTSGLTMA+LVCSGCHTLLMY RGATSVQCSCCRTVNAAS+ANQMAHINCGNCRV+LM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM

Query:  YQ
        YQ
Subjt:  YQ

TrEMBL top hitse value%identityAlignment
A0A0A9NHW2 Uncharacterized protein3.4e-3672.55Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM
        +PF P   SQSQ+VC+GC++LLMYPAGATS+CCA+C  V+ VP  G  MAQLVC GCHTLLMY RGATSVQCSCC TVN A +ANQ+AH+NCGNCR+LLM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM

Query:  YQ
        YQ
Subjt:  YQ

A0A0E0EI65 Uncharacterized protein4.5e-3672.55Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM
        +PF P   +QSQ+VCSGC++LLMYPAGATS+CCA+C  V+ VP  G  MAQLVC GCHTLLMY RGATSVQCSCC TVN A +ANQ+AH+NCGNCR+LLM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM

Query:  YQ
        YQ
Subjt:  YQ

A0A1S3BJQ2 protein LSD1 isoform X24.2e-4267.63Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASK-------------ANQM
        MPFP VTC Q+QI+CSGCK++L+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TLLMY RGATSVQCSCCRT+NAASK             ANQM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASK-------------ANQM

Query:  AHINCGNCRVLLMYQ-SLGSMKMNPKSINTFYWFLMKKE
        AH NCGNCRVLLMYQ    S+K    +  T    L K+E
Subjt:  AHINCGNCRVLLMYQ-SLGSMKMNPKSINTFYWFLMKKE

A0A1S3BKC0 protein LOL1 isoform X16.9e-4574.6Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM
        MPFP VTC Q+QI+CSGCK++L+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TLLMY RGATSVQCSCCRT+NAASKANQMAH NCGNCRVLLM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM

Query:  YQ-SLGSMKMNPKSINTFYWFLMKKE
        YQ    S+K    +  T    L K+E
Subjt:  YQ-SLGSMKMNPKSINTFYWFLMKKE

A0A6J1CWA3 protein LSD1-like3.9e-4079.41Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM
        MP PPVTC QS++VCSGCK+LL+YP GATSICC LCH VSPVPT GL MA+LVC GCHTLL++ RGATSVQCSCCRTVN+ASKANQ A INC NCR+LLM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM

Query:  YQ
        YQ
Subjt:  YQ

SwissProt top hitse value%identityAlignment
P94077 Protein LSD17.9e-2249.51Show/hide
Query:  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVN---------AASKANQMAHINCGNCRVL
        Q Q+VC GC++LLMYP GA+++ CALC+ ++  P P     MA ++C GC T+LMY RGA+SV+CSCC+T N         A + ++Q+A INCG+CR  
Subjt:  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVN---------AASKANQMAHINCGNCRVL

Query:  LMY
        LMY
Subjt:  LMY

Q0J7V9 Protein LSD11.2e-3872.55Show/hide
Query:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM
        +PF P   +QSQ+VCSGC++LLMYPAGATS+CCA+C  V+ VP  G  MAQLVC GCHTLLMY RGATSVQCSCC TVN A +ANQ+AH+NCGNCR+LLM
Subjt:  MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLM

Query:  YQ
        YQ
Subjt:  YQ

Q2QMB3 Protein LOL28.2e-2758.06Show/hide
Query:  QSQIVCSGCKHLLMYPAGATSICCALCHVV-SPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMY
        QSQIVC GC+++L+YP GA S+CCA+CH V S  P+ G+ +A L+C GC TLLMY R ATSV+CSCC TVN     + +AH+NCG C+ +LMY
Subjt:  QSQIVCSGCKHLLMYPAGATSICCALCHVV-SPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMY

Q6ASS2 Protein LOL31.3e-2455.32Show/hide
Query:  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMY
        QSQIVC GC+ +L YP+GA S+CCALC  ++  P P   + MA L+C GC TLLMY R A +V+CSCC TVN     N +AH++CG CR  LMY
Subjt:  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMY

Q93ZB1 Protein LOL14.3e-3672.92Show/hide
Query:  TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ
        T  QSQ+VCSGC++LLMYP GATS+CCA+C+ V+ VP  G  MAQLVC GCHTLLMY RGATSVQCSCC TVN A +ANQ+AH+NCGNC +LLMYQ
Subjt:  TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ

Arabidopsis top hitse value%identityAlignment
AT1G32540.1 lsd one like 13.1e-3772.92Show/hide
Query:  TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ
        T  QSQ+VCSGC++LLMYP GATS+CCA+C+ V+ VP  G  MAQLVC GCHTLLMY RGATSVQCSCC TVN A +ANQ+AH+NCGNC +LLMYQ
Subjt:  TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ

AT1G32540.2 lsd one like 13.1e-3772.92Show/hide
Query:  TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ
        T  QSQ+VCSGC++LLMYP GATS+CCA+C+ V+ VP  G  MAQLVC GCHTLLMY RGATSVQCSCC TVN A +ANQ+AH+NCGNC +LLMYQ
Subjt:  TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ

AT1G32540.3 lsd one like 13.1e-3772.92Show/hide
Query:  TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ
        T  QSQ+VCSGC++LLMYP GATS+CCA+C+ V+ VP  G  MAQLVC GCHTLLMY RGATSVQCSCC TVN A +ANQ+AH+NCGNC +LLMYQ
Subjt:  TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ

AT4G20380.1 LSD1 zinc finger family protein5.6e-2349.51Show/hide
Query:  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVN---------AASKANQMAHINCGNCRVL
        Q Q+VC GC++LLMYP GA+++ CALC+ ++  P P     MA ++C GC T+LMY RGA+SV+CSCC+T N         A + ++Q+A INCG+CR  
Subjt:  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVN---------AASKANQMAHINCGNCRVL

Query:  LMY
        LMY
Subjt:  LMY

AT4G20380.2 LSD1 zinc finger family protein5.6e-2349.51Show/hide
Query:  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVN---------AASKANQMAHINCGNCRVL
        Q Q+VC GC++LLMYP GA+++ CALC+ ++  P P     MA ++C GC T+LMY RGA+SV+CSCC+T N         A + ++Q+A INCG+CR  
Subjt:  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVN---------AASKANQMAHINCGNCRVL

Query:  LMY
        LMY
Subjt:  LMY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTTCCTCCAGTTACTTGTAGTCAAAGCCAAATAGTGTGCTCTGGATGTAAACATCTGTTGATGTATCCTGCCGGAGCAACCTCCATTTGCTGTGCTCTTTGCCA
TGTTGTATCTCCTGTGCCAACCTCTGGCTTAACGATGGCTCAGCTGGTATGCAGTGGCTGCCACACCCTGCTCATGTACAGGCGTGGGGCGACAAGTGTACAATGTTCTT
GTTGTCGCACTGTAAATGCAGCCTCCAAAGCAAATCAGATGGCTCACATCAACTGTGGGAACTGTAGGGTGCTGCTGATGTACCAATCACTTGGATCAATGAAGATGAAC
CCTAAAAGTATCAACACTTTCTATTGGTTCCTAATGAAAAAGGAATCTCTTTTAATTAGGATCAAGGAACAGGTGGAGAAAGCGTCGGTGAAAACAGCGGAGGAGAGGAA
GATTGTGGAGGAGAGAAGAAAGGCGGCAACGGCGGAGGCAAAGCGGGAGCTACACGAGGCCAAAGCCAGACATGCTGCTCAAAAGCTAAGGAATAGGAAGTCAAAACAAG
TACTTGGCGGCCATTTACACCATCACCACCCTCCGGTCGAGGGTGGCGCCGCCGCCACACATCTCGGCGGAGTAAATGTTCCGGCTTATCCTATAGTCAGCCCGGATGGG
TACTTTCCCGGACATAAAATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCATTTCCTCCAGTTACTTGTAGTCAAAGCCAAATAGTGTGCTCTGGATGTAAACATCTGTTGATGTATCCTGCCGGAGCAACCTCCATTTGCTGTGCTCTTTGCCA
TGTTGTATCTCCTGTGCCAACCTCTGGCTTAACGATGGCTCAGCTGGTATGCAGTGGCTGCCACACCCTGCTCATGTACAGGCGTGGGGCGACAAGTGTACAATGTTCTT
GTTGTCGCACTGTAAATGCAGCCTCCAAAGCAAATCAGATGGCTCACATCAACTGTGGGAACTGTAGGGTGCTGCTGATGTACCAATCACTTGGATCAATGAAGATGAAC
CCTAAAAGTATCAACACTTTCTATTGGTTCCTAATGAAAAAGGAATCTCTTTTAATTAGGATCAAGGAACAGGTGGAGAAAGCGTCGGTGAAAACAGCGGAGGAGAGGAA
GATTGTGGAGGAGAGAAGAAAGGCGGCAACGGCGGAGGCAAAGCGGGAGCTACACGAGGCCAAAGCCAGACATGCTGCTCAAAAGCTAAGGAATAGGAAGTCAAAACAAG
TACTTGGCGGCCATTTACACCATCACCACCCTCCGGTCGAGGGTGGCGCCGCCGCCACACATCTCGGCGGAGTAAATGTTCCGGCTTATCCTATAGTCAGCCCGGATGGG
TACTTTCCCGGACATAAAATTTAA
Protein sequenceShow/hide protein sequence
MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQSLGSMKMN
PKSINTFYWFLMKKESLLIRIKEQVEKASVKTAEERKIVEERRKAATAEAKRELHEAKARHAAQKLRNRKSKQVLGGHLHHHHPPVEGGAAATHLGGVNVPAYPIVSPDG
YFPGHKI