; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018578 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018578
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionChromo domain-containing protein
Genome locationtig00153206:450229..450735
RNA-Seq ExpressionSgr018578
SyntenySgr018578
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RZC65112.1 hypothetical protein C5167_008797 [Papaver somniferum]8.7e-3951.74Show/hide
Query:  IQARLEESDGEEDIDLEPDIVMEDQ----LEISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS
        IQA LE  DG++D+++E + + +DQ     EISL AI+  Q P+TM+V G L  KPV +LIDSGSTHNF+  + AEK+GL PI  G+++V+VA RER+ S
Subjt:  IQARLEESDGEEDIDLEPDIVMEDQ----LEISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS

Query:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK
           C    L LQ + I+ D Y+LPLE Y+VV+GTQWLR LG I WDFSK+ M F +  ++V L GL++ ENK
Subjt:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK

XP_038985806.1 uncharacterized protein LOC103721475 isoform X1 [Phoenix dactylifera]9.9e-4355.81Show/hide
Query:  MIQARLEESDGEEDIDLEPDIVMEDQL---EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS
        +IQA LE+SD  ED+++E D   E  +   EISL AIAG +A +TM+V G+LR + V++L+DSGSTHNF+S + A+ VGLQP S+G++ VMVA  ER+ S
Subjt:  MIQARLEESDGEEDIDLEPDIVMEDQL---EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS

Query:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK
        P KC+ +P+KLQ + I  DFY+LPLEGYDVV+G QWL  LG I WDFSKL M F +G ++V+L GL+  ENK
Subjt:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK

XP_038985807.1 uncharacterized protein LOC103721475 isoform X2 [Phoenix dactylifera]9.9e-4355.81Show/hide
Query:  MIQARLEESDGEEDIDLEPDIVMEDQL---EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS
        +IQA LE+SD  ED+++E D   E  +   EISL AIAG +A +TM+V G+LR + V++L+DSGSTHNF+S + A+ VGLQP S+G++ VMVA  ER+ S
Subjt:  MIQARLEESDGEEDIDLEPDIVMEDQL---EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS

Query:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK
        P KC+ +P+KLQ + I  DFY+LPLEGYDVV+G QWL  LG I WDFSKL M F +G ++V+L GL+  ENK
Subjt:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK

XP_038985808.1 uncharacterized protein LOC103721475 isoform X3 [Phoenix dactylifera]9.9e-4355.81Show/hide
Query:  MIQARLEESDGEEDIDLEPDIVMEDQL---EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS
        +IQA LE+SD  ED+++E D   E  +   EISL AIAG +A +TM+V G+LR + V++L+DSGSTHNF+S + A+ VGLQP S+G++ VMVA  ER+ S
Subjt:  MIQARLEESDGEEDIDLEPDIVMEDQL---EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS

Query:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK
        P KC+ +P+KLQ + I  DFY+LPLEGYDVV+G QWL  LG I WDFSKL M F +G ++V+L GL+  ENK
Subjt:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK

XP_038985809.1 uncharacterized protein LOC103721475 isoform X4 [Phoenix dactylifera]9.9e-4355.81Show/hide
Query:  MIQARLEESDGEEDIDLEPDIVMEDQL---EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS
        +IQA LE+SD  ED+++E D   E  +   EISL AIAG +A +TM+V G+LR + V++L+DSGSTHNF+S + A+ VGLQP S+G++ VMVA  ER+ S
Subjt:  MIQARLEESDGEEDIDLEPDIVMEDQL---EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS

Query:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK
        P KC+ +P+KLQ + I  DFY+LPLEGYDVV+G QWL  LG I WDFSKL M F +G ++V+L GL+  ENK
Subjt:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK

TrEMBL top hitse value%identityAlignment
A0A2N9EM04 Uncharacterized protein3.6e-3849.12Show/hide
Query:  MIQARLEESDGEEDIDLEPDIVME--DQLEISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVSP
        +I+A  EE DG+  +D+E    +E  ++  ISL AI+G QAP+TM++ GNL S P  IL+DSGSTHNFIS K A KV L+P +  ++ V VA  ++LVS 
Subjt:  MIQARLEESDGEEDIDLEPDIVME--DQLEISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVSP

Query:  SKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK
         KC  + L+L+   + TDFYI+PL+GYD+V+GTQWL+ LG I WDFSKL M F + ++++ L GL++  N+
Subjt:  SKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK

A0A4Y7JYI1 Uncharacterized protein4.2e-3951.74Show/hide
Query:  IQARLEESDGEEDIDLEPDIVMEDQ----LEISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS
        IQA LE  DG++D+++E + + +DQ     EISL AI+  Q P+TM+V G L  KPV +LIDSGSTHNF+  + AEK+GL PI  G+++V+VA RER+ S
Subjt:  IQARLEESDGEEDIDLEPDIVMEDQ----LEISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS

Query:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK
           C    L LQ + I+ D Y+LPLE Y+VV+GTQWLR LG I WDFSK+ M F +  ++V L GL++ ENK
Subjt:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK

A0A5J4ZIY9 Integrase catalytic domain-containing protein3.9e-3750.61Show/hide
Query:  DGEEDIDLEPDIVMEDQL----EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVSPSKCSSIP
        D + D+ +E + V+ED +    EIS  AI+G  AP+TM+V G++     ++L+DSGSTHNFI+   A+KVGLQPI  GR +V+VA  E+L SP KC+ + 
Subjt:  DGEEDIDLEPDIVMEDQL----EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVSPSKCSSIP

Query:  LKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK
        L LQ   I  DFY+LPLEGYD+V+GTQWLR LG I WDFS+L M F +  + V L GL++ ++K
Subjt:  LKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK

A0A5J5BHC7 Chromo domain-containing protein5.7e-3650.3Show/hide
Query:  MIQARLEESDGEEDIDLEPDIVMEDQL---EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS
        +I+A  EE DG+  +D+E + V ED     EISL AI+G +AP+TM+V G +     ++L+DSGSTHNFI+   A KV LQP   G+ +V+VA  E+L S
Subjt:  MIQARLEESDGEEDIDLEPDIVMEDQL---EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS

Query:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLA
        P KC+++ L LQ I +  DFY+LPLEGYD+V+GTQWL  LG I WDF+KL M F + D++V L G++
Subjt:  PSKCSSIPLKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLA

A0A5J5C5K3 Uncharacterized protein3.9e-3750.61Show/hide
Query:  DGEEDIDLEPDIVMEDQL----EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVSPSKCSSIP
        D + D+ +E + V+ED +    EIS  AI+G  AP+TM+V G++     ++L+DSGSTHNFI+   A+KVGLQPI  GR +V+VA  E+L SP KC+ + 
Subjt:  DGEEDIDLEPDIVMEDQL----EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVSPSKCSSIP

Query:  LKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK
        L LQ   I  DFY+LPLEGYD+V+GTQWLR LG I WDFS+L M F +  + V L GL++ ++K
Subjt:  LKLQKILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein2.2e-0828.39Show/hide
Query:  MIQARLEESDGEEDIDLEPDIVMEDQLEISL---QAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS
        ++QA+L+    ++ +  E + + +D   +     Q +      K M+  G +    VV+ IDSG+T NFI ++ A  + L      +  V++  R+ + S
Subjt:  MIQARLEESDGEEDIDLEPDIVMEDQLEISL---QAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVS

Query:  PSKCSSIPLKLQKILITTDFYILPL--EGYDVVMGTQWLRILG--IIRW---DFS
           C  I L +Q++ IT +F +L L     DV++G +WL  LG  ++ W   DFS
Subjt:  PSKCSSIPLKLQKILITTDFYILPL--EGYDVVMGTQWLRILG--IIRW---DFS

AT3G30770.1 Eukaryotic aspartyl protease family protein4.6e-0629Show/hide
Query:  EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVSPSKCSSIPLKLQKILITTDFYILPLEGYDV
        ++  Q+   +   K M+  G +    VV++IDSG+T+NFIS + A  + L   +  +  V++  R+ + +   C  I L +Q++ I  +F +L L   DV
Subjt:  EISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVSPSKCSSIPLKLQKILITTDFYILPLEGYDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATACAAGCCAGACTTGAAGAGAGTGACGGTGAAGAAGACATAGATCTTGAACCAGACATTGTAATGGAGGATCAACTAGAAATCTCCCTACAAGCCATAGCGGGTTG
GCAAGCTCCCAAAACAATGAAGGTTACCGGCAATCTGCGAAGTAAACCAGTGGTTATACTAATTGATTCCGGGAGCACCCACAACTTCATTAGCTTAAAAACAGCCGAGA
AGGTAGGACTACAACCAATTTCCAATGGAAGAATGGATGTAATGGTAGCATTGAGAGAAAGGTTAGTAAGTCCAAGTAAGTGCTCTTCTATACCACTTAAGCTTCAGAAA
ATTCTCATTACAACTGATTTCTATATTCTGCCCCTTGAGGGATATGATGTTGTAATGGGAACACAGTGGTTAAGAATCTTGGGCATCATACGTTGGGATTTCTCTAAACT
AACCATGAGCTTTGCTTTAGGCGATAGGAAGGTTATGCTGTGGGGATTGGCTAGCAAGGAGAACAAG
mRNA sequenceShow/hide mRNA sequence
ATGATACAAGCCAGACTTGAAGAGAGTGACGGTGAAGAAGACATAGATCTTGAACCAGACATTGTAATGGAGGATCAACTAGAAATCTCCCTACAAGCCATAGCGGGTTG
GCAAGCTCCCAAAACAATGAAGGTTACCGGCAATCTGCGAAGTAAACCAGTGGTTATACTAATTGATTCCGGGAGCACCCACAACTTCATTAGCTTAAAAACAGCCGAGA
AGGTAGGACTACAACCAATTTCCAATGGAAGAATGGATGTAATGGTAGCATTGAGAGAAAGGTTAGTAAGTCCAAGTAAGTGCTCTTCTATACCACTTAAGCTTCAGAAA
ATTCTCATTACAACTGATTTCTATATTCTGCCCCTTGAGGGATATGATGTTGTAATGGGAACACAGTGGTTAAGAATCTTGGGCATCATACGTTGGGATTTCTCTAAACT
AACCATGAGCTTTGCTTTAGGCGATAGGAAGGTTATGCTGTGGGGATTGGCTAGCAAGGAGAACAAG
Protein sequenceShow/hide protein sequence
MIQARLEESDGEEDIDLEPDIVMEDQLEISLQAIAGWQAPKTMKVTGNLRSKPVVILIDSGSTHNFISLKTAEKVGLQPISNGRMDVMVALRERLVSPSKCSSIPLKLQK
ILITTDFYILPLEGYDVVMGTQWLRILGIIRWDFSKLTMSFALGDRKVMLWGLASKENK