CuGenDBv2

Sgr020208 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020208
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: LOL1 (LSD ONE LIKE 1)
Genome location: tig00153449:1183335..1190933
RNA-Seq Expression: Sgr020208
Synteny: Sgr020208
Gene Ontology terms:
  GO:0009793 - embryo development ending in seed dormancy (biological process)
  GO:0005634 - nucleus (cellular component)
InterPro domains:
  IPR005513 - Late embryogenesis abundant protein, LEA_1 subgroup
  IPR005735 - Zinc finger, LSD1-type
  IPR040319 - LSD1-like
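
The genome location above follows a `contig:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span of the gene locus computed. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state explicitly.

```python
import re

def parse_location(loc):
    """Split 'contig:start..end' into (contig, start, end) with integer coordinates."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

contig, start, end = parse_location("tig00153449:1183335..1190933")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the locus spans 7599 bp on the contig, which is longer than the CDS listed further down and is consistent with a gene model that contains introns.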


Homology
GenBank top hits
KAE8650032.1 hypothetical protein Csa_010851 [Cucumis sativus] (e value 1.4e-45, identity 81.08%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        ++CSGCKNLL+YP GATSICCA+CHAV+PVPT GL MARLVC GC+TLL+YSRGA SVQCSCCRT+N+AS+ANQ AHINCGNCRVLLMYQC A  VKCTL
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVGTWR
        CNFVTSVG  R
Subjt:  CNFVTSVGTWR

XP_008448703.1 PREDICTED: protein LOL1 isoform X1 [Cucumis melo] (e value 6.3e-46, identity 81.08%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        ++CSGCKN+L+YP GATSICCA+CHAV+PVPT GL MARLVC GC+TLL+YSRGATSVQCSCCRT+N+ASKANQ AH NCGNCRVLLMYQC A  VKCTL
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVGTWR
        CNFVTSVG  R
Subjt:  CNFVTSVGTWR

XP_022145337.1 protein LSD1-like [Momordica charantia] (e value 1.0e-48, identity 87.16%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        MVCSGCKNLL+YPVGATSICC +CH+VSPVPTPGLEMARLVC+GCHTLLL+SRGATSVQCSCCRTVNSASKANQTA INC NCR+LLMYQCGA  V+CTL
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVGT
        CNFVT VG+
Subjt:  CNFVTSVGT

XP_031738784.1 protein LSD1 [Cucumis sativus] (e value 1.4e-45, identity 81.08%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        ++CSGCKNLL+YP GATSICCA+CHAV+PVPT GL MARLVC GC+TLL+YSRGA SVQCSCCRT+N+AS+ANQ AHINCGNCRVLLMYQC A  VKCTL
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVGTWR
        CNFVTSVG  R
Subjt:  CNFVTSVGTWR

XP_038903355.1 protein LSD1-like [Benincasa hispida] (e value 1.2e-44, identity 83.33%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        +VCSGCK LL+YP GAT ICCA+CH VSPVPT GL MARLVC GCHTLL+YSRGATSVQCSCCRTVN+AS+ANQ AHINCGNCRV+LMYQ GA  VKCTL
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVG
        CNFVTSVG
Subjt:  CNFVTSVG

TrEMBL top hits
A0A1S3BJQ2 protein LSD1 isoform X2 (e value 1.8e-43, identity 72.58%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASK-------------ANQTAHINCGNCRVLL
        ++CSGCKN+L+YP GATSICCA+CHAV+PVPT GL MARLVC GC+TLL+YSRGATSVQCSCCRT+N+ASK             ANQ AH NCGNCRVLL
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASK-------------ANQTAHINCGNCRVLL

Query:  MYQCGAPCVKCTLCNFVTSVGTWR
        MYQC A  VKCTLCNFVTSVG  R
Subjt:  MYQCGAPCVKCTLCNFVTSVGTWR

A0A1S3BKC0 protein LOL1 isoform X1 (e value 3.0e-46, identity 81.08%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        ++CSGCKN+L+YP GATSICCA+CHAV+PVPT GL MARLVC GC+TLL+YSRGATSVQCSCCRT+N+ASKANQ AH NCGNCRVLLMYQC A  VKCTL
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVGTWR
        CNFVTSVG  R
Subjt:  CNFVTSVGTWR

A0A4Y7K7L2 Uncharacterized protein (e value 9.1e-43, identity 76.85%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        +VCSGC+NLL+YPVGATS+CCA+C+AV+PVP PG EMA+LVC GCHTLL+Y RGATSVQCSCC TVN A +ANQ AH+NCGNCR+LLMYQ GA  VKC +
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVG
        CNFVTSVG
Subjt:  CNFVTSVG

A0A4Y7KPP1 Uncharacterized protein (e value 3.1e-43, identity 76.85%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        +VCSGC+NLL+YPVGATS+CCA+C+AV+PVP PG EMA+LVC GCHTLL+Y RGATSVQCSCC T+N A +ANQ AH+NCGNCR+LLMYQ GA  VKCT+
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVG
        CNFVTSVG
Subjt:  CNFVTSVG

A0A6J1CWA3 protein LSD1-like (e value 5.0e-49, identity 87.16%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        MVCSGCKNLL+YPVGATSICC +CH+VSPVPTPGLEMARLVC+GCHTLLL+SRGATSVQCSCCRTVNSASKANQTA INC NCR+LLMYQCGA  V+CTL
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVGT
        CNFVT VG+
Subjt:  CNFVTSVGT

SwissProt top hits
P94077 Protein LSD1 (e value 1.2e-26, identity 50%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVS--PVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSA-SKANQTAH--------INCGNCRVLLMY
        +VC GC+NLL+YP GA+++ CA+C+ ++  P P P  +MA ++C GC T+L+Y+RGA+SV+CSCC+T N   + +NQ AH        INCG+CR  LMY
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVS--PVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSA-SKANQTAH--------INCGNCRVLLMY

Query:  QCGAPCVKCTLCNFVTSV
          GA  VKC +C FVT+V
Subjt:  QCGAPCVKCTLCNFVTSV

Q0J7V9 Protein LSD1 (e value 3.0e-43, identity 74.07%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        +VCSGC+NLL+YP GATS+CCA+C  V+ VP PG EMA+LVC GCHTLL+Y RGATSVQCSCC TVN A +ANQ AH+NCGNCR+LLMYQ GA  VKC +
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVG
        CNFVTSVG
Subjt:  CNFVTSVG

Q2QMB3 Protein LOL2 (e value 3.3e-34, identity 56.9%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAV-SPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCT
        +VC GC+N+L+YP GA S+CCA+CHAV S  P+PG+++A L+C GC TLL+Y+R ATSV+CSCC TVN     +  AH+NCG C+ +LMY  GAP VKC 
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAV-SPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCT

Query:  LCNFVTSVG--TWRNL
        +CNF+T+ G  T R+L
Subjt:  LCNFVTSVG--TWRNL

Q6ASS2 Protein LOL3 (e value 2.9e-30, identity 52.73%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVS--PVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKC
        +VC GC+++L YP GA S+CCA+C A++  P P P +EMA L+C GC TLL+Y+R A +V+CSCC TVN     N  AH++CG CR  LMY  GAP VKC
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVS--PVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKC

Query:  TLCNFVTSVG
         +C+++T+ G
Subjt:  TLCNFVTSVG

Q93ZB1 Protein LOL1 (e value 1.8e-43, identity 75%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        +VCSGC+NLL+YPVGATS+CCA+C+AV+ VP PG EMA+LVC GCHTLL+Y RGATSVQCSCC TVN A +ANQ AH+NCGNC +LLMYQ GA  VKC +
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVG
        CNFVTSVG
Subjt:  CNFVTSVG

Arabidopsis top hits
AT1G32540.1 lsd one like 1 (e value 1.3e-44, identity 75%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        +VCSGC+NLL+YPVGATS+CCA+C+AV+ VP PG EMA+LVC GCHTLL+Y RGATSVQCSCC TVN A +ANQ AH+NCGNC +LLMYQ GA  VKC +
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVG
        CNFVTSVG
Subjt:  CNFVTSVG

AT1G32540.2 lsd one like 1 (e value 1.3e-44, identity 75%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        +VCSGC+NLL+YPVGATS+CCA+C+AV+ VP PG EMA+LVC GCHTLL+Y RGATSVQCSCC TVN A +ANQ AH+NCGNC +LLMYQ GA  VKC +
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVG
        CNFVTSVG
Subjt:  CNFVTSVG

AT1G32540.3 lsd one like 1 (e value 1.3e-44, identity 75%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL
        +VCSGC+NLL+YPVGATS+CCA+C+AV+ VP PG EMA+LVC GCHTLL+Y RGATSVQCSCC TVN A +ANQ AH+NCGNC +LLMYQ GA  VKC +
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTL

Query:  CNFVTSVG
        CNFVTSVG
Subjt:  CNFVTSVG

AT4G20380.1 LSD1 zinc finger family protein (e value 8.2e-28, identity 50%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVS--PVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSA-SKANQTAH--------INCGNCRVLLMY
        +VC GC+NLL+YP GA+++ CA+C+ ++  P P P  +MA ++C GC T+L+Y+RGA+SV+CSCC+T N   + +NQ AH        INCG+CR  LMY
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVS--PVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSA-SKANQTAH--------INCGNCRVLLMY

Query:  QCGAPCVKCTLCNFVTSV
          GA  VKC +C FVT+V
Subjt:  QCGAPCVKCTLCNFVTSV

AT4G20380.2 LSD1 zinc finger family protein (e value 8.2e-28, identity 50%)
Query:  MVCSGCKNLLVYPVGATSICCAICHAVS--PVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSA-SKANQTAH--------INCGNCRVLLMY
        +VC GC+NLL+YP GA+++ CA+C+ ++  P P P  +MA ++C GC T+L+Y+RGA+SV+CSCC+T N   + +NQ AH        INCG+CR  LMY
Subjt:  MVCSGCKNLLVYPVGATSICCAICHAVS--PVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSA-SKANQTAH--------INCGNCRVLLMY

Query:  QCGAPCVKCTLCNFVTSV
          GA  VKC +C FVT+V
Subjt:  QCGAPCVKCTLCNFVTSV
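
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the per-block counts can be tallied from that middle line. The function name `midline_counts` is hypothetical, and the totals for a single block will not necessarily reproduce the identity percentage reported for the whole hit.

```python
def midline_counts(midline):
    """Return (identical, positive, length) for one BLAST-style midline string.

    identical: positions shown as a letter
    positive:  identical positions plus conservative substitutions ('+')
    length:    number of alignment columns in this block
    """
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return identical, positive, len(midline)

# Midline of the final block of the XP_022145337.1 alignment shown above.
identical, positive, length = midline_counts("CNFVT VG+")
print(identical, positive, length)
```

The midline must be passed with its column alignment intact (no stripping of internal spaces), since spaces are counted as alignment columns.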


Sequences
CDS sequence
ATGGTGTGCTCTGGGTGTAAAAATCTGTTGGTCTATCCAGTTGGAGCAACCTCCATTTGCTGTGCTATATGCCATGCAGTCTCCCCTGTGCCAACACCTGGCTTGGAGAT
GGCTCGGCTGGTGTGTAGAGGCTGCCACACTCTGCTCTTGTACAGTCGTGGGGCGACAAGTGTACAATGTTCGTGTTGTCGCACTGTCAATTCAGCCTCCAAAGCAAATC
AGACGGCTCACATCAACTGTGGGAACTGCAGGGTGCTGCTGATGTACCAATGTGGGGCCCCCTGTGTCAAATGTACACTGTGCAATTTTGTTACCTCAGTTGGAACTTGG
AGAAATCTTGCCTCATTGTACCTTCAAATTTTAAAAGGCTTATCTCAGTGTACCTTTGGTCACCGAGGTACAATGAAGAAGCCATTTGAAATTCAAGTGGAAATGAAAAG
AAAACAATGTTCACCGCCATCGCCAGTGCCACGGTTGCCCAAATGGGCAGCTTCTCTTGCTCGCAAGAACAAGATTAACAGCTTATATGAACAACACGTGGGATATTACA
AGAGAGGTGAGTTGGCATGCATGGTTGACACATGGCAGCTGACGTGTCGTGCTTTTGAGTCTGCAATGGAGAAGCTCAGCAATATGGGCAGTGTTGCTAAAGAGAAAATG
AAAATTTGCAGAGCCAAACTCGTCGAGAAGGTGGAGAAAGCGACGGTGAAGACGGCAGAGGAGAGGAGAATTGTGGAGGAGAGGAGAAAGGCAGCGGAGGCGGGGGCGAA
GCGAGAGCTACACGAGGCCAAAGCCAGGCATGCCGCTCAAAAGCTAAGCAGCAGGAAGTCGCACGTTCACGGCGGCCATGCACACCATCAACTGCCGGTAGAGGGTGGCG
CCGCCACCCATGCCGGCGGAGCAAATGTTGCTGCTGTGGCTCATCCTTTCCCCAGCCAGGGCTACCGTCCTGGGTTCAAATTTTAA
mRNA sequence
ATGGTGTGCTCTGGGTGTAAAAATCTGTTGGTCTATCCAGTTGGAGCAACCTCCATTTGCTGTGCTATATGCCATGCAGTCTCCCCTGTGCCAACACCTGGCTTGGAGAT
GGCTCGGCTGGTGTGTAGAGGCTGCCACACTCTGCTCTTGTACAGTCGTGGGGCGACAAGTGTACAATGTTCGTGTTGTCGCACTGTCAATTCAGCCTCCAAAGCAAATC
AGACGGCTCACATCAACTGTGGGAACTGCAGGGTGCTGCTGATGTACCAATGTGGGGCCCCCTGTGTCAAATGTACACTGTGCAATTTTGTTACCTCAGTTGGAACTTGG
AGAAATCTTGCCTCATTGTACCTTCAAATTTTAAAAGGCTTATCTCAGTGTACCTTTGGTCACCGAGGTACAATGAAGAAGCCATTTGAAATTCAAGTGGAAATGAAAAG
AAAACAATGTTCACCGCCATCGCCAGTGCCACGGTTGCCCAAATGGGCAGCTTCTCTTGCTCGCAAGAACAAGATTAACAGCTTATATGAACAACACGTGGGATATTACA
AGAGAGGTGAGTTGGCATGCATGGTTGACACATGGCAGCTGACGTGTCGTGCTTTTGAGTCTGCAATGGAGAAGCTCAGCAATATGGGCAGTGTTGCTAAAGAGAAAATG
AAAATTTGCAGAGCCAAACTCGTCGAGAAGGTGGAGAAAGCGACGGTGAAGACGGCAGAGGAGAGGAGAATTGTGGAGGAGAGGAGAAAGGCAGCGGAGGCGGGGGCGAA
GCGAGAGCTACACGAGGCCAAAGCCAGGCATGCCGCTCAAAAGCTAAGCAGCAGGAAGTCGCACGTTCACGGCGGCCATGCACACCATCAACTGCCGGTAGAGGGTGGCG
CCGCCACCCATGCCGGCGGAGCAAATGTTGCTGCTGTGGCTCATCCTTTCCCCAGCCAGGGCTACCGTCCTGGGTTCAAATTTTAA
Protein sequence
MVCSGCKNLLVYPVGATSICCAICHAVSPVPTPGLEMARLVCRGCHTLLLYSRGATSVQCSCCRTVNSASKANQTAHINCGNCRVLLMYQCGAPCVKCTLCNFVTSVGTW
RNLASLYLQILKGLSQCTFGHRGTMKKPFEIQVEMKRKQCSPPSPVPRLPKWAASLARKNKINSLYEQHVGYYKRGELACMVDTWQLTCRAFESAMEKLSNMGSVAKEKM
KICRAKLVEKVEKATVKTAEERRIVEERRKAAEAGAKRELHEAKARHAAQKLSSRKSHVHGGHAHHQLPVEGGAATHAGGANVAAVAHPFPSQGYRPGFKF
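
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1), which the page does not state. The names `CODON_TABLE` and `translate` are illustrative, and only the first 60 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 nt of the CDS sequence listed above.
cds_prefix = "ATGGTGTGCTCTGGGTGTAAAAATCTGTTGGTCTATCCAGTTGGAGCAACCTCCATTTGC"
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

The result can be compared against the start of the listed protein sequence. Passing the full CDS, joined into one string without line breaks, allows the same comparison over the entire length.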