; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010200 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010200
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionWHy domain-containing protein
Genome locationchr9:45375231..45375782
RNA-Seq ExpressionLag0010200
SyntenyLag0010200
Gene Ontology termsGO:0009269 - response to desiccation (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR013990 - Water stress and hypersensitive response domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061714.1 late embryogenesis abundant hydroxyproline-rich glycoprotein [Cucumis melo var. makuwa]1.0e-9396.7Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS
        MGKKRNWSW SALVGAASA+AATAI+SAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSST MSIFY+GSLLGSAQVDAGSQQPRS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

KAE8647360.1 hypothetical protein Csa_003928 [Cucumis sativus]8.7e-9396.15Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS
        MGKKRNWSW SALVGAASAIAATAI+SAKPKDPTFHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSST MSIFY+GSLLGSAQVDAGSQQPRS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGG A+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_004140159.2 uncharacterized protein LOC101218134 [Cucumis sativus]8.7e-9396.15Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS
        MGKKRNWSW SALVGAASAIAATAI+SAKPKDPTFHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSST MSIFY+GSLLGSAQVDAGSQQPRS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGG A+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_008449575.2 PREDICTED: uncharacterized protein LOC103491417 [Cucumis melo]1.0e-9396.7Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS
        MGKKRNWSW SALVGAASA+AATAI+SAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSST MSIFY+GSLLGSAQVDAGSQQPRS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

XP_038900652.1 uncharacterized protein LOC120087813 [Benincasa hispida]7.1e-9597.81Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS
        MGKKRNWSW SALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSST MSIFYDGSLLGSAQVDAGSQQPRS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        CQVLRLPARLDGLKLAHHGSRFISDVAKREM+LDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

TrEMBL top hitse value%identityAlignment
A0A0A0KGW5 WHy domain-containing protein4.2e-9396.15Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS
        MGKKRNWSW SALVGAASAIAATAI+SAKPKDPTFHLISIKFTSFKLKPPVVD ELILTVHVTNPNVAPIHYSST MSIFY+GSLLGSAQVDAGSQQPRS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGG A+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A1S3BMZ9 uncharacterized protein LOC1034914175.0e-9496.7Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS
        MGKKRNWSW SALVGAASA+AATAI+SAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSST MSIFY+GSLLGSAQVDAGSQQPRS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A5D3B864 Late embryogenesis abundant hydroxyproline-rich glycoprotein5.0e-9496.7Show/hide
Query:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS
        MGKKRNWSW SALVGAASA+AATAI+SAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSST MSIFY+GSLLGSAQVDAGSQQPRS
Subjt:  MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
        CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGG AKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL

A0A6J1H295 uncharacterized protein LOC1114594455.7e-9094.05Show/hide
Query:  MGKKR--NWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQP
        M KKR  +WSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSST MSIFYDGSLLGSA VDAGSQQ 
Subjt:  MGKKR--NWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQP

Query:  RSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        RSCQVLRLPARLDGLKLAH+GSRFISDV KREMVLDASVDIGGIA+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL+
Subjt:  RSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

A0A6J1K4X2 uncharacterized protein LOC1114907771.9e-9094.05Show/hide
Query:  MGKKR--NWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQP
        M KKR  +WSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPV+DAELILTVHVTNPNVAPIHYSST MSIFYDGSLLGSAQVDAGSQQ 
Subjt:  MGKKR--NWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQP

Query:  RSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        RSCQVLRLPARLDGLKLAH+GSRFISDV KREMVLDASVDIGGIA+VLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFL+
Subjt:  RSCQVLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.1e-0828.06Show/hide
Query:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV
        P DP   +I +K +   +  +P P +D  L++T+ V+N +V    ++  D++I Y G  LG    D G         L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV

Query:  AKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPV
        AK  +  D   +  G   VL++    K  V   + VD V
Subjt:  AKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPV

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.2e-0728.12Show/hide
Query:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV
        P DP   +I +K +   +  +P P +D  L++T+ V+N +V    ++  D++I Y G  LG    D G         L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKL--KP-PVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRSCQVLRLPARLDGLKLAHHGSRFISDV

Query:  AKREMVLDASVDIGGIAKVLWWSHKFKV
        AK  +  D   +  G   VL++    KV
Subjt:  AKREMVLDASVDIGGIAKVLWWSHKFKV

AT3G44380.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.4e-7070.17Show/hide
Query:  KKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRSCQ
        +K  WSW SAL+GAASA AA +++SAKPKDPTFHLISI  TS KL  PV+DAEL+LTVHVTNPN+A IHYSST M+I YDG++LGSA+V AGSQ  RSCQ
Subjt:  KKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRSCQ

Query:  VLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT
        +LRLPARLDG++LA H  +F SDVA REM L+A + I G AKVLWW H F+VHVDS +TVDPVFLDV+ QEN SQ++LFLT
Subjt:  VLRLPARLDGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAAAGCGTAACTGGAGCTGGGGCTCCGCCCTGGTCGGAGCGGCGTCGGCGATTGCAGCGACGGCGATCGTTTCCGCCAAGCCAAAGGACCCAACGTTCCACTT
GATTTCAATAAAGTTCACTTCCTTCAAGCTGAAGCCGCCGGTGGTCGACGCCGAGCTCATCCTCACCGTCCACGTCACCAACCCCAACGTGGCACCCATCCACTACTCCT
CCACCGACATGTCCATTTTCTACGACGGGTCCCTTCTCGGGTCGGCCCAGGTGGACGCCGGTTCGCAGCAGCCCCGGTCCTGCCAGGTCCTCCGACTCCCGGCCCGTCTC
GACGGCCTGAAGCTGGCCCACCACGGCAGCCGGTTCATCTCCGACGTGGCCAAGCGGGAGATGGTTCTAGATGCGAGTGTGGATATTGGCGGAATTGCAAAAGTGCTGTG
GTGGAGTCACAAGTTCAAGGTCCACGTGGACAGCCATCTGACCGTTGATCCCGTCTTCCTCGATGTGCTTGATCAGGAAAACACTTCTCAACTTGAGCTGTTTCTTACTT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAAAGCGTAACTGGAGCTGGGGCTCCGCCCTGGTCGGAGCGGCGTCGGCGATTGCAGCGACGGCGATCGTTTCCGCCAAGCCAAAGGACCCAACGTTCCACTT
GATTTCAATAAAGTTCACTTCCTTCAAGCTGAAGCCGCCGGTGGTCGACGCCGAGCTCATCCTCACCGTCCACGTCACCAACCCCAACGTGGCACCCATCCACTACTCCT
CCACCGACATGTCCATTTTCTACGACGGGTCCCTTCTCGGGTCGGCCCAGGTGGACGCCGGTTCGCAGCAGCCCCGGTCCTGCCAGGTCCTCCGACTCCCGGCCCGTCTC
GACGGCCTGAAGCTGGCCCACCACGGCAGCCGGTTCATCTCCGACGTGGCCAAGCGGGAGATGGTTCTAGATGCGAGTGTGGATATTGGCGGAATTGCAAAAGTGCTGTG
GTGGAGTCACAAGTTCAAGGTCCACGTGGACAGCCATCTGACCGTTGATCCCGTCTTCCTCGATGTGCTTGATCAGGAAAACACTTCTCAACTTGAGCTGTTTCTTACTT
GA
Protein sequenceShow/hide protein sequence
MGKKRNWSWGSALVGAASAIAATAIVSAKPKDPTFHLISIKFTSFKLKPPVVDAELILTVHVTNPNVAPIHYSSTDMSIFYDGSLLGSAQVDAGSQQPRSCQVLRLPARL
DGLKLAHHGSRFISDVAKREMVLDASVDIGGIAKVLWWSHKFKVHVDSHLTVDPVFLDVLDQENTSQLELFLT