CuGenDBv2

HG10017497 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10017497
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DnaJ/Hsp40 cysteine-rich domain superfamily protein
Genome location: Chr03:14942148..14945013
RNA-Seq Expression: HG10017497
Synteny: HG10017497
Gene Ontology terms: NA
InterPro domains: IPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_004144653.1 uncharacterized protein LOC101220837 isoform X1 [Cucumis sativus] | e value: 1.4e-75 | % identity: 91.3
Query:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASG-GDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNP
        MDSATLTMSSSVILRNSS KLL IRKIQ  LCF+RNKFSKISA+YPNGSASG GDSSAADVHRRRSSFESLFCYDKA+PEERIETP+GISLAEKMIG+NP
Subjt:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASG-GDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNP

Query:  RCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK
        RCTDCQAKGAVLC+TCSGSGLYVDSILESQGIIVKVRCLGCGG+GN MC ECGGRGHLGSK
Subjt:  RCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK

XP_008442041.1 PREDICTED: uncharacterized protein LOC103486023 isoform X2 [Cucumis melo] | e value: 4.2e-75 | % identity: 89.38
Query:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPR
        MDSATLT+SSSVILRNSS KLL IRKIQ  LCF+RN+FS+ISA+Y NGSASGGDSSAADVHRRRSSFESLFCYDKA+PEERIETP+GISLAEKMIG+NPR
Subjt:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPR

Query:  CTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK
        CTDCQAKGAVLC+TCSGSGLYVDSILESQGIIVKVRCLGCGG+GN MC ECGGRGHLGSK
Subjt:  CTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK

XP_011653948.1 uncharacterized protein LOC101220837 isoform X2 [Cucumis sativus] | e value: 5.8e-77 | % identity: 91.88
Query:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPR
        MDSATLTMSSSVILRNSS KLL IRKIQ  LCF+RNKFSKISA+YPNGSASGGDSSAADVHRRRSSFESLFCYDKA+PEERIETP+GISLAEKMIG+NPR
Subjt:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPR

Query:  CTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK
        CTDCQAKGAVLC+TCSGSGLYVDSILESQGIIVKVRCLGCGG+GN MC ECGGRGHLGSK
Subjt:  CTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK

XP_038881865.1 uncharacterized protein LOC120073222 isoform X1 [Benincasa hispida] | e value: 8.4e-76 | % identity: 91.93
Query:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASG-GDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNP
        MDSATLTMSSSVILRNSSQKLLA RKIQ GLCF RNKFSKISAVYPNGSASG GDSS ADVHRRRSSFESLFCYDKA+PEERIETPVGISLAE+MIGDNP
Subjt:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASG-GDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNP

Query:  RCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK
        RCTDC AKG VLC+TCSGSGLYVDSILESQGIIVKVRCLGCGG+GN MC ECGGRGHLGSK
Subjt:  RCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK

XP_038881866.1 uncharacterized protein LOC120073222 isoform X2 [Benincasa hispida] | e value: 3.4e-77 | % identity: 92.5
Query:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPR
        MDSATLTMSSSVILRNSSQKLLA RKIQ GLCF RNKFSKISAVYPNGSASGGDSS ADVHRRRSSFESLFCYDKA+PEERIETPVGISLAE+MIGDNPR
Subjt:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPR

Query:  CTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK
        CTDC AKG VLC+TCSGSGLYVDSILESQGIIVKVRCLGCGG+GN MC ECGGRGHLGSK
Subjt:  CTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L441 Uncharacterized protein | e value: 2.8e-77 | % identity: 91.88
Query:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPR
        MDSATLTMSSSVILRNSS KLL IRKIQ  LCF+RNKFSKISA+YPNGSASGGDSSAADVHRRRSSFESLFCYDKA+PEERIETP+GISLAEKMIG+NPR
Subjt:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPR

Query:  CTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK
        CTDCQAKGAVLC+TCSGSGLYVDSILESQGIIVKVRCLGCGG+GN MC ECGGRGHLGSK
Subjt:  CTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK

A0A1S3B4T2 uncharacterized protein LOC103486023 isoform X1 | e value: 5.0e-74 | % identity: 88.82
Query:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASG-GDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNP
        MDSATLT+SSSVILRNSS KLL IRKIQ  LCF+RN+FS+ISA+Y NGSASG GDSSAADVHRRRSSFESLFCYDKA+PEERIETP+GISLAEKMIG+NP
Subjt:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASG-GDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNP

Query:  RCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK
        RCTDCQAKGAVLC+TCSGSGLYVDSILESQGIIVKVRCLGCGG+GN MC ECGGRGHLGSK
Subjt:  RCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK

A0A1S3B5F7 uncharacterized protein LOC103486023 isoform X2 | e value: 2.0e-75 | % identity: 89.38
Query:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPR
        MDSATLT+SSSVILRNSS KLL IRKIQ  LCF+RN+FS+ISA+Y NGSASGGDSSAADVHRRRSSFESLFCYDKA+PEERIETP+GISLAEKMIG+NPR
Subjt:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPR

Query:  CTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK
        CTDCQAKGAVLC+TCSGSGLYVDSILESQGIIVKVRCLGCGG+GN MC ECGGRGHLGSK
Subjt:  CTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK

A0A6J1DNW0 uncharacterized protein LOC111021761 isoform X2 | e value: 5.2e-71 | % identity: 84.76
Query:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNG----SASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIG
        MDSATLT+SSS+ILRNSSQKLL  RKIQ GLCF R +FSKI AVYPNG    SA GGDSSAADVHRRRS+FESLFCYDKA+PEERIE PVGISLAEK+IG
Subjt:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNG----SASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIG

Query:  DNPRCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK
        DNPRCTDC AKGAVLC+TCSGSGLYVD+ILESQGIIVKVRCLGCGG+GN MC ECGGRGHL SK
Subjt:  DNPRCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK

A0A6J1GFH6 uncharacterized protein LOC111453701 | e value: 4.2e-73 | % identity: 88.2
Query:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASG-GDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNP
        MDS++LT+SSS I RNSSQKLLAIRKIQDGLCF+RNKFSKI AVYPNGSASG  D SAADVHR+RS+FESLFCYDKA+PEE IE PVGISLAEKMIGDNP
Subjt:  MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASG-GDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNP

Query:  RCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK
        RCTDC AKGAVLC+TCSGSGLYVDSILESQGIIVKVRCLGCGG+GNTMC ECGGRGHLGSK
Subjt:  RCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK

SwissProt top hits (e value, % identity, alignment)
Q6YUA8 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic | e value: 5.2e-04 | % identity: 34.25
Query:  TPVGISLAEKMIGDNPRCTDCQAKGAVLCSTCSGSGLY-VDSILESQGIIVKVRCLGCGGSGNTMCPECGGRG
        TP G  LA  M+     C +C   GAVLC  C G+G +   +   ++ + +   C  C G G  +CP C G G
Subjt:  TPVGISLAEKMIGDNPRCTDCQAKGAVLCSTCSGSGLY-VDSILESQGIIVKVRCLGCGGSGNTMCPECGGRG

Arabidopsis top hits (e value, % identity, alignment)
AT2G34860.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein | e value: 4.1e-04 | % identity: 35.71
Query:  CTDCQAKGAVLCSTCSGSGLY-VDSILESQGIIVKVRCLGCGGSGNTMCPECGGRG
        C +CQ  GAVLC  C G+G +   +   ++ +     C  C G G  +CP C G G
Subjt:  CTDCQAKGAVLCSTCSGSGLY-VDSILESQGIIVKVRCLGCGGSGNTMCPECGGRG

AT2G34860.2 DnaJ/Hsp40 cysteine-rich domain superfamily protein | e value: 4.1e-04 | % identity: 35.71
Query:  CTDCQAKGAVLCSTCSGSGLY-VDSILESQGIIVKVRCLGCGGSGNTMCPECGGRG
        C +CQ  GAVLC  C G+G +   +   ++ +     C  C G G  +CP C G G
Subjt:  CTDCQAKGAVLCSTCSGSGLY-VDSILESQGIIVKVRCLGCGGSGNTMCPECGGRG

AT5G17840.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein | e value: 5.3e-44 | % identity: 67.23
Query:  KISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPRCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLG
        ++ ++ P+ S         DVHR+RSS ES+FCYDK +PEE IE PVG+S++E+ IGDN RCT C+AKGA+LCSTCSG+GLYVDSI+ESQGIIVKVRCLG
Subjt:  KISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPRCTDCQAKGAVLCSTCSGSGLYVDSILESQGIIVKVRCLG

Query:  CGGSGNTMCPECGGRGHLG
        CGGSGN MC  CGGRGH+G
Subjt:  CGGSGNTMCPECGGRGHLG


Sequences
CDS sequence
ATGGATTCAGCTACTTTAACCATGTCATCGTCTGTCATTTTGCGTAATTCATCGCAGAAATTACTTGCCATTAGAAAGATTCAAGATGGGTTATGTTTTGAAAGGAACAA
GTTTTCCAAGATCTCCGCGGTCTATCCCAATGGCTCTGCTTCTGGGGGGGATTCTTCAGCTGCAGATGTTCATAGACGACGAAGTTCTTTTGAATCTTTGTTTTGCTACG
ATAAGGCTGTCCCAGAGGAAAGAATTGAGACACCTGTTGGAATATCTCTGGCAGAGAAAATGATTGGGGACAATCCCCGTTGCACTGATTGTCAAGCCAAAGGTGCTGTC
CTTTGCTCAACTTGCTCCGGTTCGGGCTTATATGTTGACTCGATATTGGAGAGCCAGGGAATCATTGTGAAAGTTCGTTGTCTGGGTTGTGGTGGATCTGGCAATACCAT
GTGTCCAGAATGTGGCGGTCGAGGTCACCTGGGATCTAAATGA
mRNA sequence
ATGGATTCAGCTACTTTAACCATGTCATCGTCTGTCATTTTGCGTAATTCATCGCAGAAATTACTTGCCATTAGAAAGATTCAAGATGGGTTATGTTTTGAAAGGAACAA
GTTTTCCAAGATCTCCGCGGTCTATCCCAATGGCTCTGCTTCTGGGGGGGATTCTTCAGCTGCAGATGTTCATAGACGACGAAGTTCTTTTGAATCTTTGTTTTGCTACG
ATAAGGCTGTCCCAGAGGAAAGAATTGAGACACCTGTTGGAATATCTCTGGCAGAGAAAATGATTGGGGACAATCCCCGTTGCACTGATTGTCAAGCCAAAGGTGCTGTC
CTTTGCTCAACTTGCTCCGGTTCGGGCTTATATGTTGACTCGATATTGGAGAGCCAGGGAATCATTGTGAAAGTTCGTTGTCTGGGTTGTGGTGGATCTGGCAATACCAT
GTGTCCAGAATGTGGCGGTCGAGGTCACCTGGGATCTAAATGA
Protein sequence
MDSATLTMSSSVILRNSSQKLLAIRKIQDGLCFERNKFSKISAVYPNGSASGGDSSAADVHRRRSSFESLFCYDKAVPEERIETPVGISLAEKMIGDNPRCTDCQAKGAV
LCSTCSGSGLYVDSILESQGIIVKVRCLGCGGSGNTMCPECGGRGHLGSK