CuGenDBv2

Sgr001791 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr001791
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein CREG1
Genome location: tig00001131:24196..26024
RNA-Seq Expression: Sgr001791
Synteny: Sgr001791
Gene Ontology terms: GO:0005576 - extracellular region (cellular component)
                     GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR012349 - FMN-binding split barrel
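
The genome location in the record above follows a `contig:start..end` pattern. As a minimal sketch, the string could be split into its parts as shown here. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return contig, start, end

contig, start, end = parse_location("tig00001131:24196..26024")

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end - start + 1
print(contig, start, end, span)
```

Under that coordinate assumption the gene model covers 1829 positions on contig tig00001131.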


Homology
GenBank top hits (e value, % identity, alignment)
KAG6578621.1 Protein CREG1, partial [Cucurbita argyrosperma subsp. sororia]; e value: 4.9e-48; % identity: 90.48
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        TISSDFGGAPFGNVVSFSDG P KG+GTPYFYLTTLDPTARY+I+DERASFTLSEYPIGTCGK+DPENPTCAKITLIGKL+QVEPN+KEAEFA+ SLFSK
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEMK
        HAEMK
Subjt:  HAEMK

XP_008458719.1 PREDICTED: protein CREG1 isoform X2 [Cucumis melo]; e value: 1.1e-47; % identity: 88.68
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        TISSDFGGAPFGNVVSFSDG PN+GQG PYFYLTTLDPTA+Y+I+DERASFTLSEYPIGTCGK+DPENPTCAKITLIGKLKQVEPN+KE EFA+ SLFSK
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEMKS
        HAEMK+
Subjt:  HAEMKS

XP_022133441.1 protein CREG1 isoform X1 [Momordica charantia]; e value: 3.8e-48; % identity: 92.38
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        TISS+FGGAPFGNVVSFSDG P+KGQGTPYFYLTTLDPTARY+I+DERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQ+E NTKEAEFAR SLFSK
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEMK
        HAEMK
Subjt:  HAEMK

XP_022133442.1 protein CREG1 isoform X2 [Momordica charantia]; e value: 3.8e-48; % identity: 92.38
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        TISS+FGGAPFGNVVSFSDG P+KGQGTPYFYLTTLDPTARY+I+DERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQ+E NTKEAEFAR SLFSK
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEMK
        HAEMK
Subjt:  HAEMK

XP_022939916.1 protein CREG1 [Cucurbita moschata]; e value: 1.1e-47; % identity: 89.52
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        TISSDFGGAPFGNVVSFSDG P KG+GTPYFYLTTLDPTARY+I+DERASFTLSEYPIGTCGK+DPENPTCAKITLIGKL+QV+PN+KEAEFA+ SLFSK
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEMK
        HAEMK
Subjt:  HAEMK

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E2V2 protein CREG1 isoform X2; e value: 5.3e-48; % identity: 88.68
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        TISSDFGGAPFGNVVSFSDG PN+GQG PYFYLTTLDPTA+Y+I+DERASFTLSEYPIGTCGK+DPENPTCAKITLIGKLKQVEPN+KE EFA+ SLFSK
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEMKS
        HAEMK+
Subjt:  HAEMKS

A0A5A7T7X4 Protein CREG1 isoform X1; e value: 5.3e-48; % identity: 88.68
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        TISSDFGGAPFGNVVSFSDG PN+GQG PYFYLTTLDPTA+Y+I+DERASFTLSEYPIGTCGK+DPENPTCAKITLIGKLKQVEPN+KE EFA+ SLFSK
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEMKS
        HAEMK+
Subjt:  HAEMKS

A0A6J1BV44 protein CREG1 isoform X1; e value: 1.8e-48; % identity: 92.38
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        TISS+FGGAPFGNVVSFSDG P+KGQGTPYFYLTTLDPTARY+I+DERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQ+E NTKEAEFAR SLFSK
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEMK
        HAEMK
Subjt:  HAEMK

A0A6J1BWP4 protein CREG1 isoform X2; e value: 1.8e-48; % identity: 92.38
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        TISS+FGGAPFGNVVSFSDG P+KGQGTPYFYLTTLDPTARY+I+DERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQ+E NTKEAEFAR SLFSK
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEMK
        HAEMK
Subjt:  HAEMK

A0A6J1FH47 protein CREG1; e value: 5.3e-48; % identity: 89.52
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        TISSDFGGAPFGNVVSFSDG P KG+GTPYFYLTTLDPTARY+I+DERASFTLSEYPIGTCGK+DPENPTCAKITLIGKL+QV+PN+KEAEFA+ SLFSK
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEMK
        HAEMK
Subjt:  HAEMK

SwissProt top hits (e value, % identity, alignment)
O75629 Protein CREG1; e value: 1.1e-10; % identity: 38.61
Query:  GAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGK--VDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSKHAEMK
        G PF +V+S SDG P  G G PYFYL+ L  +      +  A+ T++      C K   DP++P C  I L G + +V  N  E + A+ SLF +H EMK
Subjt:  GAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGK--VDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSKHAEMK

Query:  S
        +
Subjt:  S

O88668 Protein CREG1; e value: 7.2e-10; % identity: 37
Query:  GAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGK--VDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSKHAEMK
        G PF +++S SDG P +G G PY YL+ L         +  A+ T+S      C     DP++P C  I + G + +V  N  E ++AR SLF +H EMK
Subjt:  GAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGK--VDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSKHAEMK

Q8BGC9 Protein CREG2; e value: 4.1e-13; % identity: 40
Query:  GAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGK--VDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSKHAEMK
        G PFG+ ++ SDG  +   G P+FY+T  DP     + +  AS  L E     C K  VDPE+P CA++TL G++  V P   E EFA+ ++FS+H  M+
Subjt:  GAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGK--VDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSKHAEMK

Q8IUH2 Protein CREG2; e value: 9.1e-13; % identity: 41
Query:  GAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGK--VDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSKHAEMK
        G PFGN +  SDG  N   G P+FY+T  DP     + +  AS  L E     C K  VDPE+P C ++TL G++  V P  +E EFA+ ++FS+H  M+
Subjt:  GAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGK--VDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSKHAEMK

Arabidopsis top hits (e value, % identity, alignment)
AT2G04690.1 Pyridoxamine 5'-phosphate oxidase family protein; e value: 6.2e-33; % identity: 64.42
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        T+S D  GAPFGNVVSFSDGLP KG G PYFYLTTLDPTAR ++ D+RAS  +SE P+GTC + DP NPTC+K+TL GKL  +E  ++EAE A+ +LF+K
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEM
        H EM
Subjt:  HAEM

AT2G04690.2 Pyridoxamine 5'-phosphate oxidase family protein; e value: 6.2e-33; % identity: 64.42
Query:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK
        T+S D  GAPFGNVVSFSDGLP KG G PYFYLTTLDPTAR ++ D+RAS  +SE P+GTC + DP NPTC+K+TL GKL  +E  ++EAE A+ +LF+K
Subjt:  TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK

Query:  HAEM
        H EM
Subjt:  HAEM
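
The % identity figures in the hit lists can be related to the middle line of each alignment. The sketch below assumes the usual BLAST-style convention, which the page does not state explicitly: the middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and a space for any other mismatch. The function name `percent_identity` is hypothetical, and the example data are the query and middle lines of the first GenBank hit (KAG6578621.1).

```python
def percent_identity(blocks):
    """Percent identity over a list of (query, midline) alignment blocks."""
    aligned = 0
    identical = 0
    for query, midline in blocks:
        # Pad in case trailing spaces of the middle line were trimmed.
        midline = midline.ljust(len(query))
        aligned += len(query)
        # A column counts as identical when the middle line repeats the query residue.
        identical += sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return 100.0 * identical / aligned

# Query and middle lines of the KAG6578621.1 alignment, block by block.
blocks = [
    (
        "TISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTTLDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSK",
        "TISSDFGGAPFGNVVSFSDG P KG+GTPYFYLTTLDPTARY+I+DERASFTLSEYPIGTCGK+DPENPTCAKITLIGKL+QVEPN+KEAEFA+ SLFSK",
    ),
    ("HAEMK", "HAEMK"),
]

identity = percent_identity(blocks)
print(f"{identity:.2f}")
```

For this hit, 95 of the 105 aligned columns are identical, which rounds to the 90.48 listed for KAG6578621.1.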


Sequences
CDS sequence
ATGGCCAGTGGCATCTTCAAAGAATTCATGGGAATCTTGAGGTATAATCTGCATCTGGGCATCACCCGCTGCTTGCGATTCTATTTAGTTAATTCTTTCTTGGTATCCAA
TCAGAAGCTTCAGGCTTTTCGCTGTTTAATCGAATTGATCCGAGTTGGTCAGCTCAAACTTCGTTTTTTCGGTTATGAATTTCTACGGACATTTGATCTTCGTCAAAGAA
AGGGTACTATCTCAAGTGATTTTGGCGGAGCGCCTTTTGGAAATGTTGTTTCGTTTAGTGACGGGCTACCTAACAAGGGTCAAGGCACGCCATATTTCTACTTAACTACT
CTTGACCCAACTGCAAGATATTCAATTGCGGATGAGAGGGCTTCATTCACACTCAGTGAGTACCCTATTGGAACTTGTGGCAAGGTAGATCCAGAAAACCCAACTTGCGC
AAAAATTACCTTGATCGGAAAGCTGAAGCAGGTGGAGCCTAATACCAAGGAAGCAGAGTTTGCTAGAATTTCCTTGTTCTCGAAGCATGCAGAGATGAAGAGTTAG
mRNA sequence
ATGGCCAGTGGCATCTTCAAAGAATTCATGGGAATCTTGAGGTATAATCTGCATCTGGGCATCACCCGCTGCTTGCGATTCTATTTAGTTAATTCTTTCTTGGTATCCAA
TCAGAAGCTTCAGGCTTTTCGCTGTTTAATCGAATTGATCCGAGTTGGTCAGCTCAAACTTCGTTTTTTCGGTTATGAATTTCTACGGACATTTGATCTTCGTCAAAGAA
AGGGTACTATCTCAAGTGATTTTGGCGGAGCGCCTTTTGGAAATGTTGTTTCGTTTAGTGACGGGCTACCTAACAAGGGTCAAGGCACGCCATATTTCTACTTAACTACT
CTTGACCCAACTGCAAGATATTCAATTGCGGATGAGAGGGCTTCATTCACACTCAGTGAGTACCCTATTGGAACTTGTGGCAAGGTAGATCCAGAAAACCCAACTTGCGC
AAAAATTACCTTGATCGGAAAGCTGAAGCAGGTGGAGCCTAATACCAAGGAAGCAGAGTTTGCTAGAATTTCCTTGTTCTCGAAGCATGCAGAGATGAAGAGTTAG
Protein sequence
MASGIFKEFMGILRYNLHLGITRCLRFYLVNSFLVSNQKLQAFRCLIELIRVGQLKLRFFGYEFLRTFDLRQRKGTISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTT
LDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSKHAEMKS
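
As a consistency check on the three sequences above, the CDS can be translated and compared with the listed protein. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which the page does not name. The helper `translate` and the constant names are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# CDS and protein sequences copied from the record, with line wraps removed.
CDS = (
    "ATGGCCAGTGGCATCTTCAAAGAATTCATGGGAATCTTGAGGTATAATCTGCATCTGGGCATCACCCGCTGCTTGCGATTCTATTTAGTTAATTCTTTCTTGGTATCCAA"
    "TCAGAAGCTTCAGGCTTTTCGCTGTTTAATCGAATTGATCCGAGTTGGTCAGCTCAAACTTCGTTTTTTCGGTTATGAATTTCTACGGACATTTGATCTTCGTCAAAGAA"
    "AGGGTACTATCTCAAGTGATTTTGGCGGAGCGCCTTTTGGAAATGTTGTTTCGTTTAGTGACGGGCTACCTAACAAGGGTCAAGGCACGCCATATTTCTACTTAACTACT"
    "CTTGACCCAACTGCAAGATATTCAATTGCGGATGAGAGGGCTTCATTCACACTCAGTGAGTACCCTATTGGAACTTGTGGCAAGGTAGATCCAGAAAACCCAACTTGCGC"
    "AAAAATTACCTTGATCGGAAAGCTGAAGCAGGTGGAGCCTAATACCAAGGAAGCAGAGTTTGCTAGAATTTCCTTGTTCTCGAAGCATGCAGAGATGAAGAGTTAG"
)
PROTEIN = (
    "MASGIFKEFMGILRYNLHLGITRCLRFYLVNSFLVSNQKLQAFRCLIELIRVGQLKLRFFGYEFLRTFDLRQRKGTISSDFGGAPFGNVVSFSDGLPNKGQGTPYFYLTT"
    "LDPTARYSIADERASFTLSEYPIGTCGKVDPENPTCAKITLIGKLKQVEPNTKEAEFARISLFSKHAEMKS"
)

print(len(CDS), len(PROTEIN), translate(CDS) == PROTEIN)
```

The CDS is 546 nt long, that is 181 codons plus the terminal TAG stop codon, matching the 181-residue protein. The mRNA sequence listed for this gene is identical to the CDS, so the same check applies to it.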