; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005174 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005174
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionWHy domain-containing protein
Genome locationscaffold176:3269815..3270363
RNA-Seq ExpressionMS005174
SyntenyMS005174
Gene Ontology termsGO:0009269 - response to desiccation (biological process)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR013990 - Water stress and hypersensitive response domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061714.1 late embryogenesis abundant hydroxyproline-rich glycoprotein [Cucumis melo var. makuwa]9.6e-9294.51Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSWSSALVGAASA+AATA++SAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

KAE8647360.1 hypothetical protein Csa_003928 [Cucumis sativus]1.1e-9093.41Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSW+SALVGAASAIAATA++SAKPKDPTFHLISIKFTSFK+KPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG A+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_004140159.2 uncharacterized protein LOC101218134 [Cucumis sativus]1.1e-9093.41Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSW+SALVGAASAIAATA++SAKPKDPTFHLISIKFTSFK+KPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG A+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_008449575.2 PREDICTED: uncharacterized protein LOC103491417 [Cucumis melo]9.6e-9294.51Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSWSSALVGAASA+AATA++SAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_038900652.1 uncharacterized protein LOC120087813 [Benincasa hispida]5.1e-9395.63Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSWSSALVGAASAIAATA+VSAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        CQVLRLPARLDGLKLAHHG RFISDVAKREM+LDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

TrEMBL top hitse value%identityAlignment
A0A0A0KGW5 WHy domain-containing protein5.1e-9193.41Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSW+SALVGAASAIAATA++SAKPKDPTFHLISIKFTSFK+KPPVVD ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG A+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A1S3BMZ9 uncharacterized protein LOC1034914174.6e-9294.51Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSWSSALVGAASA+AATA++SAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A5D3B864 Late embryogenesis abundant hydroxyproline-rich glycoprotein4.6e-9294.51Show/hide
Query:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS
        MG+KRNWSWSSALVGAASA+AATA++SAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSA+VDAGSQQ RS
Subjt:  MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARS

Query:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHG RFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A6J1H295 uncharacterized protein LOC1114594458.2e-8992.43Show/hide
Query:  MGRKRNWSWS--SALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA
        M +KR+WSWS  SALVGAASAIAATA+VSAKPKDPTFHLISIKFTSFK+KPPVVDAELILTVHVTNPNVAPIHYSST+MSIFYDGSLLGSA VDAGSQQA
Subjt:  MGRKRNWSWS--SALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA

Query:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        RSCQVLRLPARLDGLKLAH+G RFISDV KREMVLDASVDIGGIA+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL+
Subjt:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

A0A6J1K4X2 uncharacterized protein LOC1114907778.2e-8991.89Show/hide
Query:  MGRKRNWSWS--SALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA
        M +KR+WSWS  SALVGAASAIAATA+VSAKPKDPTFHLISIKFTSFK+KPPV+DAELILTVHVTNPNVAPIHYSST+MSIFYDGSLLGSA+VDAGSQQA
Subjt:  MGRKRNWSWS--SALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQA

Query:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        RSCQVLRLPARLDGLKLAH+G RFISDV KREMVLDASVDIGGIA+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL+
Subjt:  RSCQVLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.1e-0828.78Show/hide
Query:  PKDPTFHLISIKFTSFKV--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQVLRLPARLDGLKLAHHGGRFISDV
        P DP   +I +K +   V  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G   A     L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKV--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQVLRLPARLDGLKLAHHGGRFISDV

Query:  AKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPV
        AK  +  D   +  G   VL++    K  V   + VD V
Subjt:  AKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPV

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.5e-0728.91Show/hide
Query:  PKDPTFHLISIKFTSFKV--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQVLRLPARLDGLKLAHHGGRFISDV
        P DP   +I +K +   V  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G   A     L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKV--KP-PVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQVLRLPARLDGLKLAHHGGRFISDV

Query:  AKREMVLDASVDIGGIAKVLWWSHKFKV
        AK  +  D   +  G   VL++    KV
Subjt:  AKREMVLDASVDIGGIAKVLWWSHKFKV

AT3G44380.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.1e-6970.72Show/hide
Query:  RKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQ
        +K  WSWSSAL+GAASA AA +++SAKPKDPTFHLISI  TS K+  PV+DAEL+LTVHVTNPN+A IHYSST M+I YDG++LGSA V AGSQ ARSCQ
Subjt:  RKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQ

Query:  VLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        +LRLPARLDG++LA H  +F SDVA REM L+A + I G AKVLWW H F+VHVDS +TVDPVFLDV+ QEN SQ++LFLT
Subjt:  VLRLPARLDGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGAAAGCGTAACTGGAGCTGGAGCTCCGCCCTGGTCGGAGCGGCGTCGGCGATTGCGGCGACGGCGGTCGTTTCCGCGAAACCTAAGGACCCGACGTTCCACCT
GATTTCAATCAAGTTCACTTCCTTCAAAGTGAAGCCGCCGGTGGTGGACGCCGAGCTCATCCTGACCGTCCACGTCACCAACCCCAACGTGGCCCCCATCCACTACTCCT
CCACCGCCATGTCCATCTTCTACGACGGCTCCCTCCTCGGCTCGGCCCGGGTCGACGCCGGGTCGCAGCAGGCCCGGTCCTGCCAGGTCCTCCGTCTCCCGGCCCGGCTC
GACGGCCTCAAGCTGGCCCACCACGGCGGCCGGTTCATCTCCGACGTCGCGAAACGGGAGATGGTTCTGGACGCGAGCGTGGACATTGGGGGAATCGCAAAAGTGCTGTG
GTGGAGTCACAAGTTCAAGGTCCACGTGGACAGCCATCTCACCGTTGATCCCGTGTTCCTTGATGTACTGGATCAGGAAAATACTTCCCAACTTGAGCTCTTTCTTACT
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGAAAGCGTAACTGGAGCTGGAGCTCCGCCCTGGTCGGAGCGGCGTCGGCGATTGCGGCGACGGCGGTCGTTTCCGCGAAACCTAAGGACCCGACGTTCCACCT
GATTTCAATCAAGTTCACTTCCTTCAAAGTGAAGCCGCCGGTGGTGGACGCCGAGCTCATCCTGACCGTCCACGTCACCAACCCCAACGTGGCCCCCATCCACTACTCCT
CCACCGCCATGTCCATCTTCTACGACGGCTCCCTCCTCGGCTCGGCCCGGGTCGACGCCGGGTCGCAGCAGGCCCGGTCCTGCCAGGTCCTCCGTCTCCCGGCCCGGCTC
GACGGCCTCAAGCTGGCCCACCACGGCGGCCGGTTCATCTCCGACGTCGCGAAACGGGAGATGGTTCTGGACGCGAGCGTGGACATTGGGGGAATCGCAAAAGTGCTGTG
GTGGAGTCACAAGTTCAAGGTCCACGTGGACAGCCATCTCACCGTTGATCCCGTGTTCCTTGATGTACTGGATCAGGAAAATACTTCCCAACTTGAGCTCTTTCTTACT
Protein sequenceShow/hide protein sequence
MGRKRNWSWSSALVGAASAIAATAVVSAKPKDPTFHLISIKFTSFKVKPPVVDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSARVDAGSQQARSCQVLRLPARL
DGLKLAHHGGRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT