CuGenDBv2

Sgr023199 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023199
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Glycosyl hydrolase of Uncharacterized protein function (DUF1680), putative isoform 2
Genome location: tig00000892:896091..896600
RNA-Seq Expression: Sgr023199
Synteny: Sgr023199
Gene Ontology terms:
  GO:0046373 - L-arabinose metabolic process (biological process)
  GO:0046556 - alpha-L-arabinofuranosidase activity (molecular function)
InterPro domains: IPR012878 - Beta-L-arabinofuranosidase, GH127
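
The genome location above has the form contig:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state), the span length can be computed as below. The function names parse_location and span_length are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(span_length("tig00000892:896091..896600"))
```

Under that assumption the span is 510 bp, which is the length expected for a 169-residue protein plus a stop codon encoded without introns.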


Homology
GenBank top hits    e value    % identity
KAA0041392.1 DUF1680 domain-containing protein [Cucumis melo var. makuwa]    5.2e-84    86.98
Query:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL
        MW VLV L+ FLLC CDSLKECTNTPTQLGSHTFR+ELLSS N TWK+E+FSHYHLTPTDDFAWS+LLPRKMLKEENE+NW M+YRQMKNKDG Q PGG+
Subjt:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL

Query:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        LKEISLHDVRLDP+SLHG AQ TN+KYLLMLDVD LLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
Subjt:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

XP_008449737.1 PREDICTED: uncharacterized protein LOC103491528 [Cucumis melo]    5.2e-84    86.98
Query:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL
        MW VLV L+ FLLC CDSLKECTNTPTQLGSHTFR+ELLSS N TWK+E+FSHYHLTPTDDFAWS+LLPRKMLKEENE+NW M+YRQMKNKDG Q PGG+
Subjt:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL

Query:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        LKEISLHDVRLDP+SLHG AQ TN+KYLLMLDVD LLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
Subjt:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

XP_011653585.1 uncharacterized protein LOC101207833 [Cucumis sativus]    4.0e-84    86.39
Query:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL
        MW VLV L+ FLLC CDSLKECTNTPTQLGSHTFR+ELLSS N TWK+E+FSHYHLTPTDDFAWS+LLPRKMLKEENE+NW M+YRQMKNKDG + PGG+
Subjt:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL

Query:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        LKEISLHDVRLDPNSLHG AQ TN+KYLLMLDVD LLWSFRKTAGLPTPGEPY+GWEKSDCELRGHFVG
Subjt:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

XP_022148748.1 uncharacterized protein LOC111017340 [Momordica charantia]    3.7e-90    93.49
Query:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL
        MWAVLVTLMVF+LCR DSLKECTNTPTQLGSHTFR+ELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRK+LKEENEFNWAMVYRQMKNKDG Q PGGL
Subjt:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL

Query:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        LKEISLHDVRLDPNS HGRAQ+TN+KYLLMLDVD LLWSFRKTAGLPTPGEPY+GWEKSDCELRGHFVG
Subjt:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

XP_038901175.1 uncharacterized protein LOC120088146 [Benincasa hispida]    2.3e-87    89.35
Query:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL
        MW VL  L+ FLLC CDSLKECTNTPTQLGSHTFR+ELLSSHNGTWKEEMFSHYHLTPTDDFAWS+LLPRKMLKEENEFNW M+YRQMKNKDG Q PGGL
Subjt:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL

Query:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        LKEISLHD+RLDPNSLHG AQ TN+KYLLMLDVD LLWSFRKTAGLPTPGEPY+GWEKSDCELRGHFVG
Subjt:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

TrEMBL top hits    e value    % identity
A0A0A0KX04 Uncharacterized protein    1.9e-84    86.39
Query:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL
        MW VLV L+ FLLC CDSLKECTNTPTQLGSHTFR+ELLSS N TWK+E+FSHYHLTPTDDFAWS+LLPRKMLKEENE+NW M+YRQMKNKDG + PGG+
Subjt:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL

Query:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        LKEISLHDVRLDPNSLHG AQ TN+KYLLMLDVD LLWSFRKTAGLPTPGEPY+GWEKSDCELRGHFVG
Subjt:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

A0A1S3BM44 uncharacterized protein LOC103491528    2.5e-84    86.98
Query:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL
        MW VLV L+ FLLC CDSLKECTNTPTQLGSHTFR+ELLSS N TWK+E+FSHYHLTPTDDFAWS+LLPRKMLKEENE+NW M+YRQMKNKDG Q PGG+
Subjt:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL

Query:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        LKEISLHDVRLDP+SLHG AQ TN+KYLLMLDVD LLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
Subjt:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

A0A5A7TD86 DUF1680 domain-containing protein    2.5e-84    86.98
Query:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL
        MW VLV L+ FLLC CDSLKECTNTPTQLGSHTFR+ELLSS N TWK+E+FSHYHLTPTDDFAWS+LLPRKMLKEENE+NW M+YRQMKNKDG Q PGG+
Subjt:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL

Query:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        LKEISLHDVRLDP+SLHG AQ TN+KYLLMLDVD LLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
Subjt:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

A0A6J1D4Z0 uncharacterized protein LOC111017340    1.8e-90    93.49
Query:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL
        MWAVLVTLMVF+LCR DSLKECTNTPTQLGSHTFR+ELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRK+LKEENEFNWAMVYRQMKNKDG Q PGGL
Subjt:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL

Query:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        LKEISLHDVRLDPNS HGRAQ+TN+KYLLMLDVD LLWSFRKTAGLPTPGEPY+GWEKSDCELRGHFVG
Subjt:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

A0A6J1H2F6 uncharacterized protein LOC111459415    1.1e-76    79.88
Query:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL
        MW V V LM FLLC CD+LKECTN PTQLGSHT R+EL  SHN T K+EMFSHYHLTPTDD AWS+LL R++LKEENEFNW M+YRQMKNKDG Q PGGL
Subjt:  MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGL

Query:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        LKE+ L DVRL+PNS HGRAQ TN+KYLLMLDVD LLWSFR+TAGLPTPG+PY+GWEKSDCELRGHFVG
Subjt:  LKEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

SwissProt top hits    e value    % identity
No hits found
Arabidopsis top hits    e value    % identity
AT5G12950.1 Putative glycosyl hydrolase of unknown function (DUF1680)    6.8e-58    70.2
Query:  KECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEE-NEFNWAMVYRQMKNKDGFQGPGGLLKEISLHDVRLDPNSLHG
        KECTNTPTQL SHTFR ELL S N T K E+FSHYHLTP DD AWSSLLPRKMLKEE +EF W M+YR+ K+ +     G  LK++SLHDVRLDP+S H 
Subjt:  KECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEE-NEFNWAMVYRQMKNKDGFQGPGGLLKEISLHDVRLDPNSLHG

Query:  RAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        RAQ TN++YLLMLDVDGL WSFRK AGL  PG+ Y GWE+ D ELRGHFVG
Subjt:  RAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG

AT5G12960.1 Putative glycosyl hydrolase of unknown function (DUF1680)    6.0e-54    61.9
Query:  AVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEE-NEFNWAMVYRQMKNKDGFQGPGGLL
        A+L+     L+C     KECT+ PT+L SHT R ELL S N   K E FSHYHLTPTDD AWS+LLPRKMLKEE ++F W M+YR+ K+ +     G  L
Subjt:  AVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEE-NEFNWAMVYRQMKNKDGFQGPGGLL

Query:  KEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
        K++SLHDVRLDP+S H RAQ TN++YLLMLDVDGL ++FRK AGL  PG PY GWEK D ELRGHFVG
Subjt:  KEISLHDVRLDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
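
The % identity values in the hit tables can be related to the unlabeled middle line of each alignment. The sketch below assumes the usual BLAST-style convention, which this page does not spell out: a letter on the middle line marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The helper name percent_identity is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a query line and its alignment midline.

    Assumes letters on the midline mark identical residues and that the
    alignment length equals the length of the (possibly gapped) query line.
    For a multi-block alignment, pass the concatenated blocks.
    """
    if not query:
        raise ValueError("query line is empty")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Toy example: 5 identical columns out of 6.
print(round(percent_identity("MWAVLV", "MW VLV"), 2))
```

Under this convention, a reported identity of 86.98 over a 169-column alignment corresponds to 147 identical positions, and 93.49 to 158.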


Sequences
CDS sequence
ATGTGGGCGGTTCTGGTGACCTTAATGGTATTCCTTCTCTGTCGCTGTGATTCTCTAAAGGAGTGTACAAACACTCCTACTCAACTTGGATCACATACTTTTAGACATGA
GCTCTTATCATCACATAATGGAACGTGGAAGGAAGAGATGTTTTCTCACTACCACTTGACACCAACCGACGATTTTGCTTGGTCGAGTTTGCTACCAAGAAAGATGTTGA
AGGAAGAAAACGAATTTAATTGGGCAATGGTGTATAGACAAATGAAGAATAAAGATGGGTTCCAAGGTCCTGGCGGTCTGCTCAAGGAAATTTCTTTACATGACGTACGG
TTGGATCCAAACTCGTTACATGGGAGGGCTCAGTTGACAAATATGAAGTATCTATTGATGTTGGATGTGGACGGGTTGCTCTGGAGCTTTAGGAAGACGGCTGGTTTGCC
TACACCTGGAGAACCGTATATTGGGTGGGAAAAATCAGACTGCGAGCTTCGTGGTCATTTTGTAGGTTAG
mRNA sequence
ATGTGGGCGGTTCTGGTGACCTTAATGGTATTCCTTCTCTGTCGCTGTGATTCTCTAAAGGAGTGTACAAACACTCCTACTCAACTTGGATCACATACTTTTAGACATGA
GCTCTTATCATCACATAATGGAACGTGGAAGGAAGAGATGTTTTCTCACTACCACTTGACACCAACCGACGATTTTGCTTGGTCGAGTTTGCTACCAAGAAAGATGTTGA
AGGAAGAAAACGAATTTAATTGGGCAATGGTGTATAGACAAATGAAGAATAAAGATGGGTTCCAAGGTCCTGGCGGTCTGCTCAAGGAAATTTCTTTACATGACGTACGG
TTGGATCCAAACTCGTTACATGGGAGGGCTCAGTTGACAAATATGAAGTATCTATTGATGTTGGATGTGGACGGGTTGCTCTGGAGCTTTAGGAAGACGGCTGGTTTGCC
TACACCTGGAGAACCGTATATTGGGTGGGAAAAATCAGACTGCGAGCTTCGTGGTCATTTTGTAGGTTAG
Protein sequence
MWAVLVTLMVFLLCRCDSLKECTNTPTQLGSHTFRHELLSSHNGTWKEEMFSHYHLTPTDDFAWSSLLPRKMLKEENEFNWAMVYRQMKNKDGFQGPGGLLKEISLHDVR
LDPNSLHGRAQLTNMKYLLMLDVDGLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVG
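
The CDS and protein sequences above can be cross-checked by translating the CDS. This is a minimal sketch that assumes the standard nuclear genetic code and a CDS that starts in frame; the function name translate is a hypothetical helper, and the input shown is only the first eleven codons of the CDS with a stop codon appended for illustration.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First eleven codons of the CDS above, followed by a TAG stop codon.
print(translate("ATGTGGGCGGTTCTGGTGACCTTAATGGTATTCTAG"))
```

Passing the full CDS with its line breaks joined should give a string that can be compared directly with the protein sequence listed above.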