; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001261 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001261
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionWHy domain-containing protein
Genome locationChr09:15493948..15494499
RNA-Seq ExpressionHG10001261
SyntenyHG10001261
Gene Ontology termsGO:0009269 - response to desiccation (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR013990 - Water stress and hypersensitive response domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061714.1 late embryogenesis abundant hydroxyproline-rich glycoprotein [Cucumis melo var. makuwa]1.1e-9296.7Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS
        MGKKRNWSW SALVGAASA+AATAIISAKPKDP FHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQQ RS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQL LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL

KAE8647360.1 hypothetical protein Csa_003928 [Cucumis sativus]9.6e-9296.15Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS
        MGKKRNWSW SALVGAASAIAATAIISAKPKDP FHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQQ RS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFA+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQL LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL

XP_004140159.2 uncharacterized protein LOC101218134 [Cucumis sativus]9.6e-9296.15Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS
        MGKKRNWSW SALVGAASAIAATAIISAKPKDP FHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQQ RS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFA+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQL LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL

XP_008449575.2 PREDICTED: uncharacterized protein LOC103491417 [Cucumis melo]1.1e-9296.7Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS
        MGKKRNWSW SALVGAASA+AATAIISAKPKDP FHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQQ RS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQL LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL

XP_038900652.1 uncharacterized protein LOC120087813 [Benincasa hispida]1.3e-9396.72Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS
        MGKKRNWSW SALVGAASAIAATAI+SAKPKDP FHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQ RS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFLT
        CQVLRLPARLDGLKLAHHGSRFISDVAKREM+LDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQL LFLT
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFLT

TrEMBL top hitse value%identityAlignment
A0A0A0KGW5 WHy domain-containing protein4.6e-9296.15Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS
        MGKKRNWSW SALVGAASAIAATAIISAKPKDP FHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQQ RS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFA+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQL LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL

A0A1S3BMZ9 uncharacterized protein LOC1034914175.5e-9396.7Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS
        MGKKRNWSW SALVGAASA+AATAIISAKPKDP FHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQQ RS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQL LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL

A0A5D3B864 Late embryogenesis abundant hydroxyproline-rich glycoprotein5.5e-9396.7Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS
        MGKKRNWSW SALVGAASA+AATAIISAKPKDP FHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQQ RS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQL LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFL

A0A6J1FY16 uncharacterized protein LOC1114488714.8e-8993.44Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS
        M KKRNWSWGSALVGAASAIAATAIISAKPKDP FHLISIKFTS K+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGS LGSA V+AGSQQSRS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFLT
        CQVLRLPARLDGLKLAHH SRFISDVAKREMVLDASVDIGG AKVLWW+H+FKVHVDSHLTVDPVFLDVLDQENTSQL LFLT
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFLT

A0A6J1JD65 uncharacterized protein LOC1114839624.8e-8993.44Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS
        M KKRNWSWGSALVGAASAIAATAIISAKPKDP FHLISIKFTS K+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGS LGSA V+AGSQQSRS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFLT
        CQVLRLPARLDGLKLAHH SRFISDVAKREMVLDASVDIGG AKVLWW+H+FKVHVDSHLTVDPVFLDVLDQENTSQL LFLT
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFLT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family9.2e-0827.34Show/hide
Query:  PKDPAFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRSCQVLRLPARLDGLKLAHHGSRFISDV
        P DP   +I +K +   +  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G   +     L   A LDG+ +       I D+
Subjt:  PKDPAFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRSCQVLRLPARLDGLKLAHHGSRFISDV

Query:  AKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPV
        AK  +  D   +  G   VL++    K  V   + VD V
Subjt:  AKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPV

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.0e-0627.34Show/hide
Query:  PKDPAFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRSCQVLRLPARLDGLKLAHHGSRFISDV
        P DP   +I +K +   +  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G   +     L   A LDG+ +       I D+
Subjt:  PKDPAFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRSCQVLRLPARLDGLKLAHHGSRFISDV

Query:  AKREMVLDASVDIGGFAKVLWWSHKFKV
        AK  +  D   +  G   VL++    KV
Subjt:  AKREMVLDASVDIGGFAKVLWWSHKFKV

AT3G44380.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family9.3e-6969.61Show/hide
Query:  KKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRSCQ
        +K  WSW SAL+GAASA AA +++SAKPKDP FHLISI  TS KL  PV+DAEL+LTVHVTNPN+A IHYSST M+I YDG++LGSA+V AGSQ +RSCQ
Subjt:  KKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRSCQ

Query:  VLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFLT
        +LRLPARLDG++LA H  +F SDVA REM L+A + I G AKVLWW H F+VHVDS +TVDPVFLDV+ QEN SQ+ LFLT
Subjt:  VLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAAAGCGTAACTGGAGCTGGGGCTCCGCCCTAGTCGGAGCGGCGTCGGCAATTGCAGCGACGGCGATCATTTCCGCAAAGCCCAAGGACCCCGCCTTC
CACCTGATCTCAATAAAATTCACTTCCTTCAAGCTGAAACCGCCGGTGGTGGATGCCGAGCTCATCCTGACCGTCCACGTCACCAACCCCAACGTGGCTCCCATC
CATTACTCCTCCACCGCCATGTCCATTTTCTACGACGGCTCCCTCCTCGGCTCGGCCCAGGTCGATGCCGGTTCGCAGCAGTCCCGGTCCTGCCAAGTCCTCCGA
CTTCCGGCCCGGCTCGACGGCCTGAAGCTGGCCCACCACGGCAGCCGGTTCATCTCCGACGTGGCCAAGCGAGAGATGGTTCTGGATGCGAGTGTGGACATTGGG
GGTTTTGCTAAAGTGCTGTGGTGGAGTCACAAATTCAAGGTCCACGTGGACAGTCATCTGACCGTTGATCCCGTCTTCCTTGATGTCCTTGATCAGGAAAACACT
TCTCAACTTGCGCTCTTTCTTACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAAAGCGTAACTGGAGCTGGGGCTCCGCCCTAGTCGGAGCGGCGTCGGCAATTGCAGCGACGGCGATCATTTCCGCAAAGCCCAAGGACCCCGCCTTC
CACCTGATCTCAATAAAATTCACTTCCTTCAAGCTGAAACCGCCGGTGGTGGATGCCGAGCTCATCCTGACCGTCCACGTCACCAACCCCAACGTGGCTCCCATC
CATTACTCCTCCACCGCCATGTCCATTTTCTACGACGGCTCCCTCCTCGGCTCGGCCCAGGTCGATGCCGGTTCGCAGCAGTCCCGGTCCTGCCAAGTCCTCCGA
CTTCCGGCCCGGCTCGACGGCCTGAAGCTGGCCCACCACGGCAGCCGGTTCATCTCCGACGTGGCCAAGCGAGAGATGGTTCTGGATGCGAGTGTGGACATTGGG
GGTTTTGCTAAAGTGCTGTGGTGGAGTCACAAATTCAAGGTCCACGTGGACAGTCATCTGACCGTTGATCCCGTCTTCCTTGATGTCCTTGATCAGGAAAACACT
TCTCAACTTGCGCTCTTTCTTACTTAA
Protein sequenceShow/hide protein sequence
MGKKRNWSWGSALVGAASAIAATAIISAKPKDPAFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQQSRSCQVLR
LPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLALFLT