CuGenDBv2

Sgr023278 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023278
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pollen Ole e 1 allergen and extensin family protein
Genome location: tig00000892:1762980..1764605
RNA-Seq Expression: Sgr023278
Synteny: Sgr023278
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
KAG6605976.1 hypothetical protein SDJN03_03293, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.3e-53; %identity: 67.72)
Query:  MALPIFVTAFLLAVVLAKVEL--STSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQLY
        MAL   VTA LL  + A VE   ST+ +LKGKVSCLDCDAAYDL         S IVVM KC++ GKVVTATTAKDG F  ELPSD+C ARL GG +QLY
Subjt:  MALPIFVTAFLLAVVLAKVEL--STSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQLY

Query:  AARKSMVAGIVRAGDGSA---VYGISPPLGFYTA--CGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        AARK MVA IVR GDGS    VYG S PL F +   C SI  +  K CKAAA  RKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  AARKSMVAGIVRAGDGSA---VYGISPPLGFYTA--CGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

XP_022152985.1 uncharacterized protein LOC111020592 isoform X1 [Momordica charantia] (e value: 8.9e-55; %identity: 69.35)
Query:  MALPIFVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAEL-PSD---DCVARLAGGPHQ
        MAL  FVTA    ++L  VELST+H LKG VSCLDC A+YDL         SG VVM KCD+V KVVTATT KDG FEAEL PS    DC ARLAG P+Q
Subjt:  MALPIFVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAEL-PSD---DCVARLAGGPHQ

Query:  LYAARKSMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        +Y+A  ++VAGIVR GDG  VYGIS PL F TAC SIS EA KYCK  AAGRKFGSSKTF+LPLPPEWGLAPSSYYFPFFPIIG+P
Subjt:  LYAARKSMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

XP_022958347.1 uncharacterized protein LOC111459595 [Cucurbita moschata] (e value: 7.5e-54; %identity: 67.72)
Query:  MALPIFVTAFLLAVVLAKVEL--STSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQLY
        MAL   VTA LL  + A VE   ST+ +LKGKVSCLDCDAAYDL         S IVVM KC++ GKVVTATTAKDG F  ELPSD+C ARL GG +QLY
Subjt:  MALPIFVTAFLLAVVLAKVEL--STSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQLY

Query:  AARKSMVAGIVRAGDGSA---VYGISPPLGFYTA--CGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        AARK MVA IVR GDGS    VYG S PL F +   C SI  +  K CKAAA  RKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  AARKSMVAGIVRAGDGSA---VYGISPPLGFYTA--CGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

XP_022995160.1 uncharacterized protein LOC111490782 [Cucurbita maxima] (e value: 2.2e-53; %identity: 67.2)
Query:  MALPIFVTAFLLAVVLAKVELSTS--HVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQLY
        MAL   VTA LLA + A VE STS   +LKGKVSCLDCDA YDL         S IVVM KC++ G+VVTATTAKDG F  ELPSD+C ARL GG +QLY
Subjt:  MALPIFVTAFLLAVVLAKVELSTS--HVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQLY

Query:  AARKSMVAGIVRAGDGSA---VYGISPPLGFYTA--CGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        AARK MVA IVR GDGS    VYG S PL F +   C SI  +  + CKAAA  RKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  AARKSMVAGIVRAGDGSA---VYGISPPLGFYTA--CGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

XP_038902068.1 uncharacterized protein LOC120088711 [Benincasa hispida] (e value: 1.4e-55; %identity: 71.05)
Query:  MALPIFVT-AFLLAVVL--AKVELS-TSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQ
        MA+   VT AFLL VV+  A+VELS T+HVLKGKV CLDCDAAYDL         SGIVVMAKC++V KVVTATTAKDG FEAELPSDDC ARL GG +Q
Subjt:  MALPIFVT-AFLLAVVL--AKVELS-TSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQ

Query:  LYAARKSMVAGIVRAGDGS-AVYGISPPLGFYTACGSIS---GEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        LYAARK MVAGIVR   GS  VYGI+ PL F ++C   S    EA+KYCK  AAG KFGSSKTF+LPLPPEWG+APSSYYFPFFPIIGIP
Subjt:  LYAARKSMVAGIVRAGDGS-AVYGISPPLGFYTACGSIS---GEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

TrEMBL top hits (e value, %identity, alignment)
A0A0A0KGW8 Uncharacterized protein (e value: 2.9e-51; %identity: 63.13)
Query:  MALPIFV-TAFLLAVVL------AKVELSTS---HVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARL
        MA+P  V  AF+L VV+      A VEL+ +   ++LKGKV CLDC A+YDL         SGIVVMAKC++VGKVVTATTA DG FEAELPSD+C ARL
Subjt:  MALPIFV-TAFLLAVVL------AKVELSTS---HVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARL

Query:  AGGPHQLYAARKSMVAGIVRAGDGS-AVYGISPPLGFYTAC-----GSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        AGG +QLYA+RK +VAGIV+   GS  +YGIS PL F ++C     G+ S EA+KYCKA A   KFGSSKTF+LPLPPEWG+APSSYYFPFFPIIGIP
Subjt:  AGGPHQLYAARKSMVAGIVRAGDGS-AVYGISPPLGFYTAC-----GSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

A0A5D3BC61 Bile acid-inducible operon CD (e value: 6.4e-51; %identity: 61.5)
Query:  MALPIFVTAFLL---------AVVLAKVELSTS---HVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVA
        MA+P  V A L+         A   A VEL+ +   ++LKGKV CLDC A+YDL         +GIVVMAKC++VGKVVTATTAKDG FEAELPSD+C A
Subjt:  MALPIFVTAFLL---------AVVLAKVELSTS---HVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVA

Query:  RLAGGPHQLYAARKSMVAGIVRAGDGS-AVYGISPPLGFYTAC-----GSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        RLAGG +QLYAA K MVAGIV+   GS  +YGIS PL F ++C     G+ S EA+KYCK  A   KFGSSKTF+LPLPPEWG+APSSYYFPFFPIIGIP
Subjt:  RLAGGPHQLYAARKSMVAGIVRAGDGS-AVYGISPPLGFYTAC-----GSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

A0A6J1DFG9 uncharacterized protein LOC111020592 isoform X1 (e value: 4.3e-55; %identity: 69.35)
Query:  MALPIFVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAEL-PSD---DCVARLAGGPHQ
        MAL  FVTA    ++L  VELST+H LKG VSCLDC A+YDL         SG VVM KCD+V KVVTATT KDG FEAEL PS    DC ARLAG P+Q
Subjt:  MALPIFVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAEL-PSD---DCVARLAGGPHQ

Query:  LYAARKSMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        +Y+A  ++VAGIVR GDG  VYGIS PL F TAC SIS EA KYCK  AAGRKFGSSKTF+LPLPPEWGLAPSSYYFPFFPIIG+P
Subjt:  LYAARKSMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

A0A6J1H1L0 uncharacterized protein LOC111459595 (e value: 3.7e-54; %identity: 67.72)
Query:  MALPIFVTAFLLAVVLAKVEL--STSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQLY
        MAL   VTA LL  + A VE   ST+ +LKGKVSCLDCDAAYDL         S IVVM KC++ GKVVTATTAKDG F  ELPSD+C ARL GG +QLY
Subjt:  MALPIFVTAFLLAVVLAKVEL--STSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQLY

Query:  AARKSMVAGIVRAGDGSA---VYGISPPLGFYTA--CGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        AARK MVA IVR GDGS    VYG S PL F +   C SI  +  K CKAAA  RKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  AARKSMVAGIVRAGDGSA---VYGISPPLGFYTA--CGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

A0A6J1K3C6 uncharacterized protein LOC111490782 (e value: 1.1e-53; %identity: 67.2)
Query:  MALPIFVTAFLLAVVLAKVELSTS--HVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQLY
        MAL   VTA LLA + A VE STS   +LKGKVSCLDCDA YDL         S IVVM KC++ G+VVTATTAKDG F  ELPSD+C ARL GG +QLY
Subjt:  MALPIFVTAFLLAVVLAKVELSTS--HVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPHQLY

Query:  AARKSMVAGIVRAGDGSA---VYGISPPLGFYTA--CGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        AARK MVA IVR GDGS    VYG S PL F +   C SI  +  + CKAAA  RKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  AARKSMVAGIVRAGDGSA---VYGISPPLGFYTA--CGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
AT2G27385.1 Pollen Ole e 1 allergen and extensin family protein (e value: 1.2e-25; %identity: 40.56)
Query:  FVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPS---DDCVARLAGGPHQLYAARK
        F+  FL +  L+   + ++ +++GKVSC DC   YD          SGI V   C       T TT K G F +ELPS    +C A L G   QLYA++ 
Subjt:  FVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPS---DDCVARLAGGPHQLYAARK

Query:  SMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        ++ + IV+ G     YG+S  L F  +C    G              F SSKT DLP+PPEWGLAP+SYY PF PIIGIP
Subjt:  SMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

AT2G27385.2 Pollen Ole e 1 allergen and extensin family protein (e value: 1.2e-25; %identity: 40.56)
Query:  FVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPS---DDCVARLAGGPHQLYAARK
        F+  FL +  L+   + ++ +++GKVSC DC   YD          SGI V   C       T TT K G F +ELPS    +C A L G   QLYA++ 
Subjt:  FVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPS---DDCVARLAGGPHQLYAARK

Query:  SMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        ++ + IV+ G     YG+S  L F  +C    G              F SSKT DLP+PPEWGLAP+SYY PF PIIGIP
Subjt:  SMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

AT2G27385.3 Pollen Ole e 1 allergen and extensin family protein (e value: 1.2e-25; %identity: 40.56)
Query:  FVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPS---DDCVARLAGGPHQLYAARK
        F+  FL +  L+   + ++ +++GKVSC DC   YD          SGI V   C       T TT K G F +ELPS    +C A L G   QLYA++ 
Subjt:  FVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPS---DDCVARLAGGPHQLYAARK

Query:  SMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        ++ + IV+ G     YG+S  L F  +C    G              F SSKT DLP+PPEWGLAP+SYY PF PIIGIP
Subjt:  SMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP

AT5G22430.1 Pollen Ole e 1 allergen and extensin family protein (e value: 4.3e-23; %identity: 39.44)
Query:  FLLAV-VLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDD------CVARLAGGPHQLYAARK
        FL A+ V + +ELS S ++ GK+SCLDC   +D          SGI V+ KCD   K +TA  A DG F + LP+ D      C+A+L GGP QLYA + 
Subjt:  FLLAV-VLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDD------CVARLAGGPHQLYAARK

Query:  SMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
        ++V+ +V++   S V   S PL F  +C   S +          G   G SKT + P    +G  P+S +FPF PIIGIP
Subjt:  SMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP


Sequences
CDS sequence
ATGGCAGCAACGAGGAGAAGTGCTGCAAGTTTGGCCGCCATTGATAAGCAAGGAAAGAAAGTTCTTGGTGTTTGTCGTGTATTTGATATCCGTTTTATATATAGG
ACAAAGATCTCTGAGTTGACAAAAATAATCTGGCGCTGCCATTTTATGCCTGCCGCGGGTGCCTCTTGCTTCACGGATTCAGTTTCTGGTTCATTGATGTGGTTT
ACACTGACACCAACTACCCACAAGGCCTTTTGGGACCCCTTCATTAAACACCATATGACTCTTAACTTCCTCTACAAAACCCATTTCCCTCTGGCTTCAAATTGC
AACATCTTCTGTAAATTGCAGGACATTTTCATGGCTCTTCCAATCTTTGTTACAGCGTTTCTCTTGGCAGTCGTTCTTGCTAAAGTTGAGCTCTCAACCAGTCAT
GTTCTGAAAGGGAAGGTCTCTTGCCTTGACTGCGATGCCGCTTATGATCTCTCAGGTTGTAATTCGAGGAAAATGGTCTCAGGGATCGTGGTGATGGCGAAGTGC
GATGAGGTTGGGAAGGTGGTTACAGCAACGACGGCGAAGGATGGGTGTTTCGAGGCGGAGCTGCCTTCAGATGACTGCGTGGCCAGGCTCGCCGGGGGTCCGCAC
CAGCTCTACGCCGCGAGAAAAAGCATGGTCGCCGGAATCGTTAGGGCCGGTGATGGCTCCGCCGTCTACGGCATCTCCCCTCCGCTTGGGTTTTACACTGCGTGC
GGCTCGATCAGCGGTGAAGCAGACAAATATTGCAAAGCAGCCGCCGCCGGAAGGAAGTTTGGGTCGTCCAAGACCTTCGACCTTCCTCTGCCGCCGGAGTGGGGG
TTGGCGCCGTCTAGCTACTATTTTCCCTTCTTCCCTATCATCGGCATCCCTTGA
mRNA sequence
ATGGCAGCAACGAGGAGAAGTGCTGCAAGTTTGGCCGCCATTGATAAGCAAGGAAAGAAAGTTCTTGGTGTTTGTCGTGTATTTGATATCCGTTTTATATATAGG
ACAAAGATCTCTGAGTTGACAAAAATAATCTGGCGCTGCCATTTTATGCCTGCCGCGGGTGCCTCTTGCTTCACGGATTCAGTTTCTGGTTCATTGATGTGGTTT
ACACTGACACCAACTACCCACAAGGCCTTTTGGGACCCCTTCATTAAACACCATATGACTCTTAACTTCCTCTACAAAACCCATTTCCCTCTGGCTTCAAATTGC
AACATCTTCTGTAAATTGCAGGACATTTTCATGGCTCTTCCAATCTTTGTTACAGCGTTTCTCTTGGCAGTCGTTCTTGCTAAAGTTGAGCTCTCAACCAGTCAT
GTTCTGAAAGGGAAGGTCTCTTGCCTTGACTGCGATGCCGCTTATGATCTCTCAGGTTGTAATTCGAGGAAAATGGTCTCAGGGATCGTGGTGATGGCGAAGTGC
GATGAGGTTGGGAAGGTGGTTACAGCAACGACGGCGAAGGATGGGTGTTTCGAGGCGGAGCTGCCTTCAGATGACTGCGTGGCCAGGCTCGCCGGGGGTCCGCAC
CAGCTCTACGCCGCGAGAAAAAGCATGGTCGCCGGAATCGTTAGGGCCGGTGATGGCTCCGCCGTCTACGGCATCTCCCCTCCGCTTGGGTTTTACACTGCGTGC
GGCTCGATCAGCGGTGAAGCAGACAAATATTGCAAAGCAGCCGCCGCCGGAAGGAAGTTTGGGTCGTCCAAGACCTTCGACCTTCCTCTGCCGCCGGAGTGGGGG
TTGGCGCCGTCTAGCTACTATTTTCCCTTCTTCCCTATCATCGGCATCCCTTGA
Protein sequence
MAATRRSAASLAAIDKQGKKVLGVCRVFDIRFIYRTKISELTKIIWRCHFMPAAGASCFTDSVSGSLMWFTLTPTTHKAFWDPFIKHHMTLNFLYKTHFPLASNC
NIFCKLQDIFMALPIFVTAFLLAVVLAKVELSTSHVLKGKVSCLDCDAAYDLSGCNSRKMVSGIVVMAKCDEVGKVVTATTAKDGCFEAELPSDDCVARLAGGPH
QLYAARKSMVAGIVRAGDGSAVYGISPPLGFYTACGSISGEADKYCKAAAAGRKFGSSKTFDLPLPPEWGLAPSSYYFPFFPIIGIP
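
As a quick consistency check on the sequences above, the CDS can be translated with the standard genetic code and compared against the listed protein sequence. The sketch below is only an illustration and is not part of CuGenDB: the function name `translate` and the compact codon-table construction are assumptions introduced here, and only the first 33 and last 15 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base, then second base, then third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0 ('*' = stop codon, 'X' = unrecognised codon)."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Hypothetical variable names; the nucleotides are copied from the CDS above.
cds_start = "ATGGCAGCAACGAGGAGAAGTGCTGCAAGTTTG"
cds_end = "ATCGGCATCCCTTGA"

print(translate(cds_start))  # MAATRRSAASL
print(translate(cds_end))    # IGIP*
```

The first result matches the start of the listed protein sequence (MAATRRSAASL...), and the second matches its last four residues followed by the stop codon, which the protein listing omits.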