; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029514 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029514
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionchaperone protein DnaJ-like
Genome locationtig00153403:1519302..1520131
RNA-Seq ExpressionSgr029514
SyntenySgr029514
Gene Ontology termsNA
InterPro domainsIPR001623 - DnaJ domain
IPR018253 - DnaJ domain, conserved site
IPR036869 - Chaperone J-domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022991261.1 uncharacterized protein LOC111487967 isoform X1 [Cucurbita maxima]5.2e-6281.99Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        MSGAG  S G CCYYSVLGLC QAS DEIRGAYRKLAMKWHPDRWMKDP+MAAESK RFQQIQEAYSVLSNK KRSIYDAGLISFLTDDDD+GFCD M+E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQ-GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI
        MV MMKN TQQ  KKKNSLEDLR SLMEMMG DEQ     LGW     PLNARKR+RVAEI
Subjt:  MVLMMKNVTQQ-GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI

XP_022991262.1 uncharacterized protein LOC111487967 isoform X2 [Cucurbita maxima]6.2e-6382.61Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        MSGAG  S G CCYYSVLGLC QAS DEIRGAYRKLAMKWHPDRWMKDP+MAAESK RFQQIQEAYSVLSNK KRSIYDAGLISFLTDDDD+GFCD M+E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQ-GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI
        MV MMKN TQQ GKKKNSLEDLR SLMEMMG DEQ     LGW     PLNARKR+RVAEI
Subjt:  MVLMMKNVTQQ-GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI

XP_022991263.1 uncharacterized protein LOC111487967 isoform X3 [Cucurbita maxima]1.5e-6181.88Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        MSGAG  S G CCYYSVLGLC QAS DEIRGAYRKLAMKWHPDRWMKDP+MAAESK RFQQIQEAYSVLSNK KRSIYDAGLISFLTDDDD+GFCD M+E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQGKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI
        MV MMKN TQQ  KKNSLEDLR SLMEMMG DEQ     LGW     PLNARKR+RVAEI
Subjt:  MVLMMKNVTQQGKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI

XP_023547778.1 uncharacterized protein LOC111806632 isoform X2 [Cucurbita pepo subsp. pepo]2.6e-6180.86Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        MSGAG  S G CCYYSVLGLCNQAS DEIRGAYRKLAMKWHPDRWMKDP+MAAESK RFQQIQEAYSVLSNK KRSIYDAGLISFLTDDDD+GFCD M+E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQ--GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI
        MV MMKN TQQ   KKKNSLEDLR SLMEMM  DEQ     LGW     P+NARKR+RVAEI
Subjt:  MVLMMKNVTQQ--GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI

XP_023547779.1 uncharacterized protein LOC111806632 isoform X3 [Cucurbita pepo subsp. pepo]2.0e-6181.37Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        MSGAG  S G CCYYSVLGLCNQAS DEIRGAYRKLAMKWHPDRWMKDP+MAAESK RFQQIQEAYSVLSNK KRSIYDAGLISFLTDDDD+GFCD M+E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQ-GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI
        MV MMKN TQQ  KKKNSLEDLR SLMEMM  DEQ     LGW     P+NARKR+RVAEI
Subjt:  MVLMMKNVTQQ-GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI

TrEMBL top hitse value%identityAlignment
A0A6J1GP90 uncharacterized protein LOC111456269 isoform X21.4e-6080.12Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        MSGAG  S G CCYYSVLGLC QAS DEIRGAYRK AMKWHPD+WMKDP+MAAESK RFQQIQEAYSVLSNK KRSIYDAGLISFLTDDDD+GFCD M+E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQG-KKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI
        MV MMKN T QG KKKNSLEDLR SLMEMM  DEQ     LGW     PLNARKR+RVAEI
Subjt:  MVLMMKNVTQQG-KKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI

A0A6J1JLB0 uncharacterized protein LOC111487967 isoform X23.0e-6382.61Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        MSGAG  S G CCYYSVLGLC QAS DEIRGAYRKLAMKWHPDRWMKDP+MAAESK RFQQIQEAYSVLSNK KRSIYDAGLISFLTDDDD+GFCD M+E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQ-GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI
        MV MMKN TQQ GKKKNSLEDLR SLMEMMG DEQ     LGW     PLNARKR+RVAEI
Subjt:  MVLMMKNVTQQ-GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI

A0A6J1JQ92 uncharacterized protein LOC111487967 isoform X12.5e-6281.99Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        MSGAG  S G CCYYSVLGLC QAS DEIRGAYRKLAMKWHPDRWMKDP+MAAESK RFQQIQEAYSVLSNK KRSIYDAGLISFLTDDDD+GFCD M+E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQ-GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI
        MV MMKN TQQ  KKKNSLEDLR SLMEMMG DEQ     LGW     PLNARKR+RVAEI
Subjt:  MVLMMKNVTQQ-GKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI

A0A6J1JUC0 uncharacterized protein LOC111487967 isoform X43.7e-6181.25Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        MSGAG  S G CCYYSVLGLC QAS DEIRGAYRKLAMKWHPDRWMKDP+MAAESK RFQQIQEAYSVLSNK KRSIYDAGLISFLTDDDD+GFCD M+E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQGKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI
        MV MMKN TQQ   KNSLEDLR SLMEMMG DEQ     LGW     PLNARKR+RVAEI
Subjt:  MVLMMKNVTQQGKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI

A0A6J1JVQ6 uncharacterized protein LOC111487967 isoform X37.4e-6281.88Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        MSGAG  S G CCYYSVLGLC QAS DEIRGAYRKLAMKWHPDRWMKDP+MAAESK RFQQIQEAYSVLSNK KRSIYDAGLISFLTDDDD+GFCD M+E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQGKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI
        MV MMKN TQQ  KKNSLEDLR SLMEMMG DEQ     LGW     PLNARKR+RVAEI
Subjt:  MVLMMKNVTQQGKKKNSLEDLRESLMEMMGGDEQAAEVGLGW--GCPPLNARKRTRVAEI

SwissProt top hitse value%identityAlignment
O35723 DnaJ homolog subfamily B member 31.9e-1459.09Show/hide
Query:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYD
        YY VLG+  QASA+ IR AYRKLA+KWHPD   K+PE   E++ RF+Q+ +AY VLS+ RKR +YD
Subjt:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYD

Q0III6 DnaJ homolog subfamily B member 69.4e-1456.06Show/hide
Query:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYD
        YY VLG+   ASA++I+ AYRKLA+KWHPD   K+PE   E++ +F+Q+ EAY VLS+ +KR IYD
Subjt:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYD

Q5F3Z5 DnaJ homolog subfamily B member 69.4e-1456.06Show/hide
Query:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYD
        YY VLG+   ASA++I+ AYRKLA+KWHPD   K+PE   E++ +F+Q+ EAY VLS+ +KR IYD
Subjt:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYD

Q5FWN8 DnaJ homolog subfamily B member 6-A1.6e-1354.55Show/hide
Query:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYD
        YY VLG+   ASAD+I+ AYR+LA+KWHPD   K+P+   E++ RF+++ EAY VLS+ +KR IYD
Subjt:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYD

Q8WWF6 DnaJ homolog subfamily B member 33.6e-1356.06Show/hide
Query:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYD
        YY VL +  QAS++ I+ AYRKLA+KWHPD   K+PE   E++ RF+Q+ EAY VLS+ +KR IYD
Subjt:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYD

Arabidopsis top hitse value%identityAlignment
AT1G56300.1 Chaperone DnaJ-domain superfamily protein4.6e-3248.43Show/hide
Query:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE
        M+  G  S+    YY++LG+   AS  +IR AYRKLAMKWHPDR+ ++P +A E+K RFQQIQEAYSVL+++ KRS+YD GL     DDDD  FCD M E
Subjt:  MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSE

Query:  MVLMMKNVTQQGKKKNSLEDLRESLMEMMGGDEQAAEVGLGWGC---PPLNARKRTRVA
        M+ MM NV   G+   SLEDL+    +M+GGD      G+ + C   P  N R R  ++
Subjt:  MVLMMKNVTQQGKKKNSLEDLRESLMEMMGGDEQAAEVGLGWGC---PPLNARKRTRVA

AT1G71000.1 Chaperone DnaJ-domain superfamily protein1.9e-2550.86Show/hide
Query:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSEMVLMMKNVTQQGK
        YY +LG+   +SA++IR AY KLA  WHPDRW KDP  + E+K RFQQIQEAYSVLS++RKRS YD GL       +D+G+ D + EMV +M   T++ +
Subjt:  YYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSEMVLMMKNVTQQGK

Query:  KKNSLEDLRESLMEMM
        K+ SLE+L+  + +M+
Subjt:  KKNSLEDLRESLMEMM

AT1G72416.1 Chaperone DnaJ-domain superfamily protein1.3e-1840.68Show/hide
Query:  YSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSEMVLMMKNVTQQGKK
        Y+VL L N+ +  ++R +Y+ L +KWHPDR++++ E   E+K +FQ IQ AYSVLS+  KR +YD G  ++ +DDD+ G  D ++EMV +M      G +
Subjt:  YSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSEMVLMMKNVTQQGKK

Query:  KNSLEDLRESLMEMMGGD
          SLE+  E   E++  D
Subjt:  KNSLEDLRESLMEMMGGD

AT1G72416.4 Chaperone DnaJ-domain superfamily protein7.6e-1941.53Show/hide
Query:  YSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSEMVLMMKNVTQQGKK
        Y+VL L N+ +  ++R +Y+ L +KWHPDR++++ E   E+K +FQ IQ AYSVLS+  KR +YD G  ++ +DDD+ G  D ++EMV +M   TQ  + 
Subjt:  YSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSEMVLMMKNVTQQGKK

Query:  KNSLEDLRESLMEMMGGD
          SLE+  E   E++  D
Subjt:  KNSLEDLRESLMEMMGGD

AT3G14200.1 Chaperone DnaJ-domain superfamily protein1.4e-2044.26Show/hide
Query:  YSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSEMVLMM-KNVTQQGK
        Y+VLGL  + S  E+R AY+KLA++WHPDR     E   E+K +FQ IQEAYSVLS+  KR +YD G  +   DDD  G  D ++EM  MM ++      
Subjt:  YSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSEMVLMM-KNVTQQGK

Query:  KKNSLEDLRESLMEMMGGDEQA
          +S E L++   EM  GD  A
Subjt:  KKNSLEDLRESLMEMMGGDEQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCGGCGCCGGAGCCAAATCCAGTGGTGGGTGTTGTTACTACTCCGTGCTTGGCCTGTGCAACCAGGCTTCTGCCGATGAAATCCGAGGAGCTTACCGTAAACTCGC
CATGAAATGGCACCCGGATAGGTGGATGAAAGACCCGGAAATGGCCGCTGAATCGAAGTGGCGGTTTCAGCAAATCCAAGAGGCTTATTCAGTTCTATCAAACAAGAGGA
AAAGAAGCATCTACGACGCCGGATTGATTTCTTTTCTGACAGACGACGACGATAAAGGATTCTGCGATTTGATGAGCGAAATGGTCTTGATGATGAAGAACGTTACCCAG
CAGGGGAAGAAGAAGAACAGTTTGGAGGATCTGAGAGAATCGTTAATGGAGATGATGGGAGGCGATGAGCAGGCTGCGGAGGTCGGGCTTGGCTGGGGATGTCCTCCTCT
TAATGCTAGAAAACGAACTCGAGTTGCAGAGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCGGCGCCGGAGCCAAATCCAGTGGTGGGTGTTGTTACTACTCCGTGCTTGGCCTGTGCAACCAGGCTTCTGCCGATGAAATCCGAGGAGCTTACCGTAAACTCGC
CATGAAATGGCACCCGGATAGGTGGATGAAAGACCCGGAAATGGCCGCTGAATCGAAGTGGCGGTTTCAGCAAATCCAAGAGGCTTATTCAGTTCTATCAAACAAGAGGA
AAAGAAGCATCTACGACGCCGGATTGATTTCTTTTCTGACAGACGACGACGATAAAGGATTCTGCGATTTGATGAGCGAAATGGTCTTGATGATGAAGAACGTTACCCAG
CAGGGGAAGAAGAAGAACAGTTTGGAGGATCTGAGAGAATCGTTAATGGAGATGATGGGAGGCGATGAGCAGGCTGCGGAGGTCGGGCTTGGCTGGGGATGTCCTCCTCT
TAATGCTAGAAAACGAACTCGAGTTGCAGAGATTTGA
Protein sequenceShow/hide protein sequence
MSGAGAKSSGGCCYYSVLGLCNQASADEIRGAYRKLAMKWHPDRWMKDPEMAAESKWRFQQIQEAYSVLSNKRKRSIYDAGLISFLTDDDDKGFCDLMSEMVLMMKNVTQ
QGKKKNSLEDLRESLMEMMGGDEQAAEVGLGWGCPPLNARKRTRVAEI