CuGenDBv2

CcUC02G039900 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC02G039900
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: WHy domain-containing protein
Genome location: CicolChr02:35201719..35203541
RNA-Seq Expression: CcUC02G039900
Synteny: CcUC02G039900
Gene Ontology terms:
  GO:0009269 - response to desiccation (biological process)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
  IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
  IPR013990 - Water stress and hypersensitive response domain
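The genome location above is written as chromosome:start..end. The following is a minimal Python sketch for splitting that string and computing the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("CicolChr02:35201719..35203541")
# Span length under the ASSUMED 1-based, inclusive coordinate convention.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention should be confirmed before the number is used.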


Homology
GenBank top hits (e value, % identity, alignment)
KAA0061714.1 late embryogenesis abundant hydroxyproline-rich glycoprotein [Cucumis melo var. makuwa] (e value 5.7e-95, 98.35% identity)
Query:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS
        MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQPRS
Subjt:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

KAE8647360.1 hypothetical protein Csa_003928 [Cucumis sativus] (e value 6.3e-94, 94.59% identity)
Query:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ
        +  MGKKRNWSW+SALVGAASA+AATAIISAKPKDPTFHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ
Subjt:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ

Query:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFA+VLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_004140159.2 uncharacterized protein LOC101218134 [Cucumis sativus] (e value 6.3e-94, 94.59% identity)
Query:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ
        +  MGKKRNWSW+SALVGAASA+AATAIISAKPKDPTFHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ
Subjt:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ

Query:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFA+VLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_008449575.2 PREDICTED: uncharacterized protein LOC103491417 [Cucumis melo] (e value 2.0e-95, 95.26% identity)
Query:  FSLIDRNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVD
        F L     MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VD
Subjt:  FSLIDRNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVD

Query:  AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_038900652.1 uncharacterized protein LOC120087813 [Benincasa hispida] (e value 1.1e-95, 97.27% identity)
Query:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS
        MGKKRNWSWSSALVGAASA+AATAI+SAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSA+VDAGSQQPRS
Subjt:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        CQVLRLPARLDGLKLAHHGSRFISDVAKREM+LDASVDIGGFAKVLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KGW5 WHy domain-containing protein (e value 3.0e-94, 94.59% identity)
Query:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ
        +  MGKKRNWSW+SALVGAASA+AATAIISAKPKDPTFHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ
Subjt:  RNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQ

Query:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFA+VLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  PRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A1S3BMZ9 uncharacterized protein LOC103491417 (e value 9.5e-96, 95.26% identity)
Query:  FSLIDRNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVD
        F L     MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VD
Subjt:  FSLIDRNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVD

Query:  AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  AGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A5D3B864 Late embryogenesis abundant hydroxyproline-rich glycoprotein (e value 2.8e-95, 98.35% identity)
Query:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS
        MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQPRS
Subjt:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWW+HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A6J1FY16 uncharacterized protein LOC111448871 (e value 2.3e-89, 92.9% identity)
Query:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS
        M KKRNWSW SALVGAASA+AATAIISAKPKDPTFHLISIKFTS K+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGS LGSA V+AGSQQ RS
Subjt:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        CQVLRLPARLDGLKLAHH SRFISDVAKREMVLDASVDIGG AKVLWWNH+FKVHVDSHLTVDPVFLDVLDQENTSQL+LFLT
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

A0A6J1JD65 uncharacterized protein LOC111483962 (e value 2.3e-89, 92.9% identity)
Query:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS
        M KKRNWSW SALVGAASA+AATAIISAKPKDPTFHLISIKFTS K+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGS LGSA V+AGSQQ RS
Subjt:  MGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        CQVLRLPARLDGLKLAHH SRFISDVAKREMVLDASVDIGG AKVLWWNH+FKVHVDSHLTVDPVFLDVLDQENTSQL+LFLT
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value 9.6e-08, 27.34% identity)
Query:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV
        P DP   +I +K +   +  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G         L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV

Query:  AKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPV
        AK  +  D   +  G   VL++    K  V   + VD V
Subjt:  AKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPV

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value 1.1e-06, 27.34% identity)
Query:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV
        P DP   +I +K +   +  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G         L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV

Query:  AKREMVLDASVDIGGFAKVLWWNHKFKV
        AK  +  D   +  G   VL++    KV
Subjt:  AKREMVLDASVDIGGFAKVLWWNHKFKV

AT3G44380.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value 2.3e-70, 70.72% identity)
Query:  KKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQ
        +K  WSWSSAL+GAASA AA +++SAKPKDPTFHLISI  TS KL  PV+DAEL+LTVHVTNPN+A IHYSST M+I YDG++LGSA V AGSQ  RSCQ
Subjt:  KKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSCQ

Query:  VLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        +LRLPARLDG++LA H  +F SDVA REM L+A + I G AKVLWW+H F+VHVDS +TVDPVFLDV+ QEN SQ++LFLT
Subjt:  VLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
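In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following Python sketch counts identities and positives from one such midline. `midline_stats` is a hypothetical helper, and the example covers a single segment only, so it is not expected to reproduce the whole-alignment % identity values listed for each hit.

```python
def midline_stats(midline):
    """Return (identities, positives, length) for one BLAST midline segment."""
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Second midline segment of the KAA0061714.1 alignment, copied from this page.
midline = (
    "CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWW+"
    "HKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL"
)
identities, positives, length = midline_stats(midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

For a whole alignment, the segments would need to be concatenated first, and each midline must keep its trailing spaces so that its length matches the aligned query line.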


Sequences
CDS sequence
ATGTTTTCTCTTATAGATAGAAACAATATGGGGAAAAAGCGTAATTGGAGCTGGAGCTCCGCCCTAGTGGGAGCGGCGTCGGCAGTTGCAGCGACGGCGATCATTTCCGC
AAAGCCCAAGGACCCCACCTTTCACCTGATCTCAATTAAATTCACTTCCTTCAAGCTCAAGCCGCCGGTGGTGGACGCCGAGCTTATCCTGACCGTCCACGTCACCAACC
CCAACGTGGCTCCCATCCATTACTCCTCCACCGCCATGTCCATTTTCTACGACGGTTCCCTTCTGGGCTCAGCCCGGGTGGATGCCGGTTCGCAGCAGCCCCGGTCCTGC
CAAGTCCTCCGACTTCCAGCCCGGCTTGACGGCCTGAAGTTAGCCCACCACGGCAGTCGGTTCATCTCCGACGTGGCCAAGCGAGAGATGGTTCTGGATGCGAGTGTGGA
CATTGGGGGTTTTGCCAAAGTGCTGTGGTGGAATCACAAATTCAAGGTCCACGTGGACAGCCATCTCACCGTTGATCCCGTCTTCCTTGATGTCCTTGATCAGGAAAACA
CTTCTCAACTTGAGCTGTTTCTTACTTAA
mRNA sequence
ATGTTTTCTCTTATAGATAGAAACAATATGGGGAAAAAGCGTAATTGGAGCTGGAGCTCCGCCCTAGTGGGAGCGGCGTCGGCAGTTGCAGCGACGGCGATCATTTCCGC
AAAGCCCAAGGACCCCACCTTTCACCTGATCTCAATTAAATTCACTTCCTTCAAGCTCAAGCCGCCGGTGGTGGACGCCGAGCTTATCCTGACCGTCCACGTCACCAACC
CCAACGTGGCTCCCATCCATTACTCCTCCACCGCCATGTCCATTTTCTACGACGGTTCCCTTCTGGGCTCAGCCCGGGTGGATGCCGGTTCGCAGCAGCCCCGGTCCTGC
CAAGTCCTCCGACTTCCAGCCCGGCTTGACGGCCTGAAGTTAGCCCACCACGGCAGTCGGTTCATCTCCGACGTGGCCAAGCGAGAGATGGTTCTGGATGCGAGTGTGGA
CATTGGGGGTTTTGCCAAAGTGCTGTGGTGGAATCACAAATTCAAGGTCCACGTGGACAGCCATCTCACCGTTGATCCCGTCTTCCTTGATGTCCTTGATCAGGAAAACA
CTTCTCAACTTGAGCTGTTTCTTACTTAA
Protein sequence
MFSLIDRNNMGKKRNWSWSSALVGAASAVAATAIISAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQPRSC
QVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGFAKVLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
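The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal Python sketch that assumes the standard genetic code applies to this gene (an assumption, although it is the usual choice for a nuclear plant gene). `translate` is a hypothetical helper, and only the first 30 nucleotides of the CDS are used as the example input.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string, ignoring whitespace and dropping trailing stops."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First 30 nucleotides of the CDS sequence listed on this page.
cds_start = "ATGTTTTCTCTTATAGATAGAAACAATATG"
print(translate(cds_start))
```

Passing the full CDS, with its line breaks left in, should give a string that can be compared directly with the protein sequence after its own line breaks are removed.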