; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C02G048310 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C02G048310
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionWHy domain-containing protein
Genome locationCla97Chr02:35879794..35881678
RNA-Seq ExpressionCla97C02G048310
SyntenyCla97C02G048310
Gene Ontology termsGO:0009269 - response to desiccation (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR013990 - Water stress and hypersensitive response domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061714.1 late embryogenesis abundant hydroxyproline-rich glycoprotein [Cucumis melo var. makuwa]9.7e-9597.8Show/hide
Query:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS
        MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQPRS
Subjt:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDA+VDIGGFAKVLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

KAE8647360.1 hypothetical protein Csa_003928 [Cucumis sativus]1.4e-9394.05Show/hide
Query:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ
        +  MGKKRNWSW+SALVGAASA+AATAIISAKPKDPTFHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ
Subjt:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ

Query:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDA+VDIGGFA+VLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_004140159.2 uncharacterized protein LOC101218134 [Cucumis sativus]1.4e-9394.05Show/hide
Query:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ
        +  MGKKRNWSW+SALVGAASA+AATAIISAKPKDPTFHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ
Subjt:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ

Query:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDA+VDIGGFA+VLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_008449575.2 PREDICTED: uncharacterized protein LOC103491417 [Cucumis melo]3.3e-9594.74Show/hide
Query:  FSLIDRNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVD
        F L     MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VD
Subjt:  FSLIDRNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVD

Query:  AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDA+VDIGGFAKVLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_038900652.1 uncharacterized protein LOC120087813 [Benincasa hispida]2.0e-9596.72Show/hide
Query:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS
        MGKKRNWSWSSALVGAASA+AATAI+SAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSA+VDAGSQQPRS
Subjt:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        CQVLRLPARLDGLKLAHHGSRFISDVAKREM+LDA+VDIGGFAKVLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

TrEMBL top hitse value%identityAlignment
A0A0A0KGW5 WHy domain-containing protein6.8e-9494.05Show/hide
Query:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ
        +  MGKKRNWSW+SALVGAASA+AATAIISAKPKDPTFHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ
Subjt:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ

Query:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDA+VDIGGFA+VLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A1S3BMZ9 uncharacterized protein LOC1034914171.6e-9594.74Show/hide
Query:  FSLIDRNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVD
        F L     MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VD
Subjt:  FSLIDRNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVD

Query:  AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDA+VDIGGFAKVLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A5D3B864 Late embryogenesis abundant hydroxyproline-rich glycoprotein4.7e-9597.8Show/hide
Query:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS
        MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQPRS
Subjt:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDA+VDIGGFAKVLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A6J1FY16 uncharacterized protein LOC1114488713.9e-8992.35Show/hide
Query:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS
        M KKRNWSW SALVGAASA+AATAIISAKPKDPTFHLISIKFTS K+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGS LGSA V+AGSQQ RS
Subjt:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        CQVLRLPARLDGLKLAHH SRFISDVAKREMVLDA+VDIGG AKVLWWNH+FKVHVDSHLTVDPVFLDVLDQENTSQL+LFLT
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

A0A6J1JD65 uncharacterized protein LOC1114839623.9e-8992.35Show/hide
Query:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS
        M KKRNWSW SALVGAASA+AATAIISAKPKDPTFHLISIKFTS K+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGS LGSA V+AGSQQ RS
Subjt:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        CQVLRLPARLDGLKLAHH SRFISDVAKREMVLDA+VDIGG AKVLWWNH+FKVHVDSHLTVDPVFLDVLDQENTSQL+LFLT
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.6e-0827.34Show/hide
Query:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV
        P DP   +I +K +   +  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G         L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV

Query:  AKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPV
        AK  +  D   +  G   VL++    K  V   + VD V
Subjt:  AKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPV

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.2e-0727.34Show/hide
Query:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV
        P DP   +I +K +   +  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G         L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV

Query:  AKREMVLDATVDIGGFAKVLWWNHKFKV
        AK  +  D   +  G   VL++    KV
Subjt:  AKREMVLDATVDIGGFAKVLWWNHKFKV

AT3G44380.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.0e-7070.72Show/hide
Query:  KKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQ
        +K  WSWSSAL+GAASA AA +++SAKPKDPTFHLISI  TS KL  PV+DAEL+LTVHVTNPN+A IHYSST M+I YDG++LGSA V AGSQ  RSCQ
Subjt:  KKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQ

Query:  VLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        +LRLPARLDG++LA H  +F SDVA REM L+A + I G AKVLWW+H F+VHVDS +TVDPVFLDV+ QEN SQ++LFLT
Subjt:  VLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTCTCTTATAGATAGAAACAATATGGGGAAAAAGCGTAATTGGAGCTGGAGCTCCGCCCTAGTCGGAGCGGCGTCGGCAGTTGCAGCGACGGCGATCATTTCCGC
AAAGCCCAAGGACCCCACCTTTCACCTGATCTCAATTAAATTCACTTCCTTCAAGCTCAAGCCGCCGGTGGTGGACGCCGAGCTTATCCTGACCGTCCACGTCACCAACC
CCAACGTGGCTCCCATCCATTACTCCTCCACCGCCATGTCCATTTTCTACGACGGTTCCCTTCTGGGCTCAGCCCGGGTGGATGCCGGTTCGCAGCAGCCCCGGTCCTGC
CAAGTCCTCCGACTTCCAGCCCGGCTTGACGGCCTGAAGTTGGCCCACCACGGGAGCCGGTTCATCTCCGACGTGGCCAAGCGAGAGATGGTTCTGGATGCGACTGTGGA
CATTGGGGGTTTTGCCAAAGTGCTGTGGTGGAATCACAAATTCAAGGTCCACGTGGACAGCCATCTCACCGTTGATCCCGTCTTCCTTGATGTCCTTGATCAGGAAAACA
CTTCTCAACTTGAGCTGTTTCTTACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTTCTCTTATAGATAGAAACAATATGGGGAAAAAGCGTAATTGGAGCTGGAGCTCCGCCCTAGTCGGAGCGGCGTCGGCAGTTGCAGCGACGGCGATCATTTCCGC
AAAGCCCAAGGACCCCACCTTTCACCTGATCTCAATTAAATTCACTTCCTTCAAGCTCAAGCCGCCGGTGGTGGACGCCGAGCTTATCCTGACCGTCCACGTCACCAACC
CCAACGTGGCTCCCATCCATTACTCCTCCACCGCCATGTCCATTTTCTACGACGGTTCCCTTCTGGGCTCAGCCCGGGTGGATGCCGGTTCGCAGCAGCCCCGGTCCTGC
CAAGTCCTCCGACTTCCAGCCCGGCTTGACGGCCTGAAGTTGGCCCACCACGGGAGCCGGTTCATCTCCGACGTGGCCAAGCGAGAGATGGTTCTGGATGCGACTGTGGA
CATTGGGGGTTTTGCCAAAGTGCTGTGGTGGAATCACAAATTCAAGGTCCACGTGGACAGCCATCTCACCGTTGATCCCGTCTTCCTTGATGTCCTTGATCAGGAAAACA
CTTCTCAACTTGAGCTGTTTCTTACTTAA
Protein sequenceShow/hide protein sequence
MFSLIDRNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSC
QVLRLPARLDGLKLAHHGSRFISDVAKREMVLDATVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT