; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0192 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0192
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionWHy domain-containing protein
Genome locationMC02:1828840..1829397
RNA-Seq ExpressionMC02g0192
SyntenyMC02g0192
Gene Ontology termsGO:0009269 - response to desiccation (biological process)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR013990 - Water stress and hypersensitive response domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061714.1 late embryogenesis abundant hydroxyproline-rich glycoprotein [Cucumis melo var. makuwa]7.31e-11994.51Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSWSSALVGAASA+AATA++SAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

KAE8647360.1 hypothetical protein Csa_003928 [Cucumis sativus]1.30e-11692.93Show/hide
Query:  KSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA
        K MG+KRNWSW+SALVGAASAIAATA++SAKPKDPTFHLISIKFTSFK+KPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ 
Subjt:  KSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA

Query:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        RSCQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG A+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_004140159.2 uncharacterized protein LOC101218134 [Cucumis sativus]1.66e-11792.93Show/hide
Query:  KSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA
        K MG+KRNWSW+SALVGAASAIAATA++SAKPKDPTFHLISIKFTSFK+KPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ 
Subjt:  KSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA

Query:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        RSCQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG A+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_008449575.2 PREDICTED: uncharacterized protein LOC103491417 [Cucumis melo]2.81e-12094.09Show/hide
Query:  PAKSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQ
        PAK MG+KRNWSWSSALVGAASA+AATA++SAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQ
Subjt:  PAKSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQ

Query:  QARSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        Q RSCQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  QARSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_038900652.1 uncharacterized protein LOC120087813 [Benincasa hispida]1.26e-11995.6Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSWSSALVGAASAIAATA+VSAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHG RFISDVAKREM+LDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

TrEMBL top hitse value%identityAlignment
A0A0A0KGW5 WHy domain-containing protein8.06e-11892.93Show/hide
Query:  KSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA
        K MG+KRNWSW+SALVGAASAIAATA++SAKPKDPTFHLISIKFTSFK+KPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ 
Subjt:  KSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA

Query:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        RSCQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG A+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A1S3BMZ9 uncharacterized protein LOC1034914171.36e-12094.09Show/hide
Query:  PAKSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQ
        PAK MG+KRNWSWSSALVGAASA+AATA++SAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQ
Subjt:  PAKSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQ

Query:  QARSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        Q RSCQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  QARSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A5D3B864 Late embryogenesis abundant hydroxyproline-rich glycoprotein3.54e-11994.51Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSWSSALVGAASA+AATA++SAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A6J1H295 uncharacterized protein LOC1114594451.42e-11492.93Show/hide
Query:  MGRKRNWSWS--SALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA
        M +KR+WSWS  SALVGAASAIAATA+VSAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSST+MSIFYDGSLLGSA VDAGSQQA
Subjt:  MGRKRNWSWS--SALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA

Query:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        RSCQVLRLPARLDGLKLAH+G RFISDV KREMVLDASVDIGGIA+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A6J1K4X2 uncharacterized protein LOC1114907771.42e-11492.39Show/hide
Query:  MGRKRNWSWS--SALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA
        M +KR+WSWS  SALVGAASAIAATA+VSAKPKDPTFHLISIKFTSFK+KPPV+DAELILTVHVTNPNVAPIHYSST+MSIFYDGSLLGSA+VDAGSQQA
Subjt:  MGRKRNWSWS--SALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA

Query:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        RSCQVLRLPARLDGLKLAH+G RFISDV KREMVLDASVDIGGIA+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.2e-0828.78Show/hide
Query:  PKDPTFHLISIKFTSFKV--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQVLRLPARLDGLKLAHHGGRFISDV
        P DP   +I +K +   V  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G   A     L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKV--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQVLRLPARLDGLKLAHHGGRFISDV

Query:  AKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPV
        AK  +  D   +  G   VL++    K  V   + VD V
Subjt:  AKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPV

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.5e-0728.91Show/hide
Query:  PKDPTFHLISIKFTSFKV--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQVLRLPARLDGLKLAHHGGRFISDV
        P DP   +I +K +   V  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G   A     L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKV--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQVLRLPARLDGLKLAHHGGRFISDV

Query:  AKREMVLDASVDIGGIAKVLWWSHKFKV
        AK  +  D   +  G   VL++    KV
Subjt:  AKREMVLDASVDIGGIAKVLWWSHKFKV

AT3G44380.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.2e-6969.95Show/hide
Query:  SMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQAR
        S  +K  WSWSSAL+GAASA AA +++SAKPKDPTFHLISI  TS K+  PV+DAEL+LTVHVTNPN+A IHYSST M+I YDG++LGSA V AGSQ AR
Subjt:  SMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQAR

Query:  SCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        SCQ+LRLPARLDG++LA H  +F SDVA REM L+A + I G AKVLWW H F+VHVDS +TVDPVFLDV+ QEN SQ++LFL
Subjt:  SCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CCGGCGAAGAGCATGGGGAGAAAGCGTAACTGGAGCTGGAGCTCCGCCCTGGTCGGAGCGGCGTCGGCGATTGCGGCGACGGCGGTCGTTTCCGCGAAACCTAAGGACCC
GACGTTCCACCTGATTTCAATCAAGTTCACTTCCTTCAAAGTGAAGCCGCCGGTGGTGGACGCCGAGCTCATCCTGACCGTCCACGTCACCAACCCCAACGTGGCCCCCA
TCCACTACTCCTCCACCGCCATGTCCATCTTCTACGACGGCTCCCTCCTCGGCTCGGCCCGGGTCGACGCCGGGTCGCAGCAGGCCCGGTCCTGCCAGGTCCTCCGTCTC
CCGGCCCGGCTCGACGGCCTCAAGCTGGCCCACCACGGCGGCCGGTTCATCTCCGACGTCGCGAAACGGGAGATGGTTCTGGACGCGAGCGTGGACATTGGGGGAATCGC
AAAAGTGCTGTGGTGGAGTCACAAGTTCAAGGTCCACGTGGACAGCCATCTCACCGTTGATCCCGTGTTCCTTGATGTACTGGATCAGGAAAATACTTCCCAACTTGAGC
TCTTTCTT
mRNA sequenceShow/hide mRNA sequence
CCGGCGAAGAGCATGGGGAGAAAGCGTAACTGGAGCTGGAGCTCCGCCCTGGTCGGAGCGGCGTCGGCGATTGCGGCGACGGCGGTCGTTTCCGCGAAACCTAAGGACCC
GACGTTCCACCTGATTTCAATCAAGTTCACTTCCTTCAAAGTGAAGCCGCCGGTGGTGGACGCCGAGCTCATCCTGACCGTCCACGTCACCAACCCCAACGTGGCCCCCA
TCCACTACTCCTCCACCGCCATGTCCATCTTCTACGACGGCTCCCTCCTCGGCTCGGCCCGGGTCGACGCCGGGTCGCAGCAGGCCCGGTCCTGCCAGGTCCTCCGTCTC
CCGGCCCGGCTCGACGGCCTCAAGCTGGCCCACCACGGCGGCCGGTTCATCTCCGACGTCGCGAAACGGGAGATGGTTCTGGACGCGAGCGTGGACATTGGGGGAATCGC
AAAAGTGCTGTGGTGGAGTCACAAGTTCAAGGTCCACGTGGACAGCCATCTCACCGTTGATCCCGTGTTCCTTGATGTACTGGATCAGGAAAATACTTCCCAACTTGAGC
TCTTTCTT
Protein sequenceShow/hide protein sequence
PAKSMGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQVLRL
PARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL