CuGenDBv2

Lsi06G010970 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi06G010970
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: glycine-rich cell wall structural protein 2-like
Genome location: chr06:21243622..21244050
RNA-Seq Expression: Lsi06G010970
Synteny: Lsi06G010970
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
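
As a quick consistency check on the genome location above, the following minimal sketch parses the coordinate string and computes the span of the gene. It assumes the coordinates are 1-based and inclusive, which the page does not state; the helper name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr06:21243622..21244050")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, span)
```

Under that assumption the span is a multiple of three and equals 3 × (142 + 1), which fits a 142-residue protein plus a stop codon encoded without introns.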


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589745.1 hypothetical protein SDJN03_15168, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.8e-29; identity: 74.21%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSGGSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSY---------
        MASI  L +V+CFLLSFS ++S+ RVARKDLG+DLGG+GVG+G GIGLG GGSGSGSGSGSGSGSSSSSSSSSYSSSS SGSGAGS+AGSY         
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSGGSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSY---------

Query:  -------AGSYAGSQAGSGG--NRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGG
               AGSYAGS+AGSG   NRNGG GSGSGYG GSG+GSGNGGGEGYGEGHGYG G
Subjt:  -------AGSYAGSQAGSGG--NRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGG

XP_004146660.2 glycine-rich cell wall structural protein 2 [Cucumis sativus] (e value: 3.9e-32; identity: 84.25%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG--GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGS
        MASIK LA+V+C LLSFSA+LSEGRVARKDLGIDLGGVGVGLG GIGLG G  GSGSGSGSGSGSGS SSSSSSSYSSSSSSGSGAGS+AGSYAGSYAGS
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG--GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGS

Query:  QA--GSGGNRNGGLGSGSGYGEGSGKGSG-NGGGEGYGEGHGYGGG
        +A  GSGGNRNGG G GSGYG GSG+G G N  GEGYGEGHGYG G
Subjt:  QA--GSGGNRNGGLGSGSGYGEGSGKGSG-NGGGEGYGEGHGYGGG

XP_008443813.1 PREDICTED: glycine-rich cell wall structural protein 2-like [Cucumis melo] (e value: 1.3e-32; identity: 83.56%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG--GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGS
        MASIK LA+V+C LLSFS  LSEGRVARKDLGIDLGG+GVGLGAGIGLG G  GSGSGSGSGSGSGS SSSSSSSYSSSSSSGSGAGS+AGSYAGSYAGS
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG--GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGS

Query:  QA--GSGGNRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGGN
        +A  GSGGNRNGG G GSGYG GSG+G  NG GEGYGEGHGYG G+
Subjt:  QA--GSGGNRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGGN

XP_022921883.1 putative glycine-rich cell wall structural protein 1 isoform X3 [Cucurbita moschata] (e value: 2.4e-29; identity: 80.69%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG----GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYA
        MASI  L +V+CFLLSFS +LS+ RVARKDLG+DLGG+GVG+G GIGLG G    GSGSGSGSGSGSGSSSSSSSSSYSSSS SGSGAGS+AGSYAGS A
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG----GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYA

Query:  GSQAGSGGNRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGG
        GS  GSG NRNGG GSGSGYG GSG+GSGNGGGEGYGEGHGYG G
Subjt:  GSQAGSGGNRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGG

XP_038879748.1 glycine-rich cell wall structural protein 2-like [Benincasa hispida] (e value: 3.4e-36; identity: 88.03%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSGGSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGSQA
        MASIK LAVV+CFLLSFSA+LSEGRVARKDLGIDLGGVGVGLGAGIGLG GGSGSGSGSGSGS SSSSS SS  SSSSSSGSGAGS+AGSYAGSYAGS+A
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSGGSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGSQA

Query:  GSGGNRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGGN
        GSGGNRNGG GSG+GYG GSG+G GNGGGEGYGEGHGYG G+
Subjt:  GSGGNRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGGN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LTK9 Uncharacterized protein (e value: 1.9e-32; identity: 84.25%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG--GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGS
        MASIK LA+V+C LLSFSA+LSEGRVARKDLGIDLGGVGVGLG GIGLG G  GSGSGSGSGSGSGS SSSSSSSYSSSSSSGSGAGS+AGSYAGSYAGS
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG--GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGS

Query:  QA--GSGGNRNGGLGSGSGYGEGSGKGSG-NGGGEGYGEGHGYGGG
        +A  GSGGNRNGG G GSGYG GSG+G G N  GEGYGEGHGYG G
Subjt:  QA--GSGGNRNGGLGSGSGYGEGSGKGSG-NGGGEGYGEGHGYGGG

A0A1S3B9N0 glycine-rich cell wall structural protein 2-like (e value: 6.5e-33; identity: 83.56%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG--GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGS
        MASIK LA+V+C LLSFS  LSEGRVARKDLGIDLGG+GVGLGAGIGLG G  GSGSGSGSGSGSGS SSSSSSSYSSSSSSGSGAGS+AGSYAGSYAGS
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG--GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGS

Query:  QA--GSGGNRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGGN
        +A  GSGGNRNGG G GSGYG GSG+G  NG GEGYGEGHGYG G+
Subjt:  QA--GSGGNRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGGN

A0A6J1C497 putative glycine-rich cell wall structural protein 1 (e value: 9.0e-27; identity: 77.93%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG-GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGSQ
        MA+I+  AVV   L   +A++SE RVARKDLGIDLGGVGVGLGAGIGLG G GSGSG+G+GSGSGS SSSSSSSYSSSSSSGSGAGS+AGSYAGSYAGS+
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG-GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGSQ

Query:  AGSGGNRNGGLGSGSGYGEGSGKGSGNGGGE--GYGEGHGYGGGN
        AGSG   N G GSGSGYG GSG+GSG G GE  GYGEGHGYGGGN
Subjt:  AGSGGNRNGGLGSGSGYGEGSGKGSGNGGGE--GYGEGHGYGGGN

A0A6J1E2L1 cell wall protein IFF6-like isoform X1 (e value: 2.1e-28; identity: 73.01%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG----GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSY-----
        MASI  L +V+CFLLSFS +LS+ RVARKDLG+DLGG+GVG+G GIGLG G    GSGSGSGSGSGSGSSSSSSSSSYSSSS SGSGAGS+AGSY     
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG----GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSY-----

Query:  -----------AGSYAGSQAGSGG--NRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGG
                   AGSYAGS+AGSG   NRNGG GSGSGYG GSG+GSGNGGGEGYGEGHGYG G
Subjt:  -----------AGSYAGSQAGSGG--NRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGG

A0A6J1E521 putative glycine-rich cell wall structural protein 1 isoform X3 (e value: 1.1e-29; identity: 80.69%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG----GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYA
        MASI  L +V+CFLLSFS +LS+ RVARKDLG+DLGG+GVG+G GIGLG G    GSGSGSGSGSGSGSSSSSSSSSYSSSS SGSGAGS+AGSYAGS A
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSG----GSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYA

Query:  GSQAGSGGNRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGG
        GS  GSG NRNGG GSGSGYG GSG+GSGNGGGEGYGEGHGYG G
Subjt:  GSQAGSGGNRNGGLGSGSGYGEGSGKGSGNGGGEGYGEGHGYGGG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G30450.1 glycine-rich protein (e value: 8.7e-14; identity: 63.46%)
Query:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLG-SGGSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGSQ
        MA  + L ++   ++S   +L+E R+ARKDLGIDLGG+G+GLG G+G+G  GGSGSG+G+GSGSGS S SSSSS SSSSSS SG+G  AGS AGS+AGS+
Subjt:  MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLG-SGGSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGSQ

Query:  AGSG
        AGSG
Subjt:  AGSG

AT4G30460.1 glycine-rich protein (e value: 7.6e-18; identity: 62.33%)
Query:  VVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSGGSGSGSGSGSGS-GSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGSQAGSG-GNR
        +++  +L+ S ++SE RVARKDLG+DLGG+G G+G GIG+G GGSGSG+G+GSGS G  SSSSSSS SSSSSS  G G DAGS AGSYAGS AGSG G R
Subjt:  VVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSGGSGSGSGSGSGS-GSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGSQAGSG-GNR

Query:  NG-----GLGSGSGYGEGSGKGSGNGG------GEGYGEGHGYGGG
        +G     G G G G+G G G G G GG      GEGYGEG GYGGG
Subjt:  NG-----GLGSGSGYGEGSGKGSGNGG------GEGYGEGHGYGGG


Sequences
CDS sequence
ATGGCTTCAATCAAGTTCCTGGCTGTTGTTCTTTGCTTTTTACTCTCTTTTTCCGCCGTTCTCTCTGAAGGTCGAGTGGCGAGAAAGGATCTCGGCATTGACCTCGGAGG
AGTCGGAGTTGGACTTGGAGCTGGAATTGGCTTAGGCTCAGGTGGAAGTGGTTCTGGCTCTGGCTCTGGCTCCGGATCTGGATCGAGTTCGTCTTCATCTTCATCATCAT
ACTCTTCAAGCTCAAGCTCTGGGTCTGGAGCTGGGTCCGACGCTGGCTCATACGCAGGCTCGTATGCAGGGTCTCAGGCAGGCTCAGGTGGAAATAGGAATGGAGGGTTG
GGGTCAGGTTCGGGATATGGCGAAGGTTCGGGCAAAGGAAGTGGCAATGGCGGTGGTGAAGGATATGGTGAAGGTCATGGCTATGGGGGAGGCAACTAA
mRNA sequence
ATGGCTTCAATCAAGTTCCTGGCTGTTGTTCTTTGCTTTTTACTCTCTTTTTCCGCCGTTCTCTCTGAAGGTCGAGTGGCGAGAAAGGATCTCGGCATTGACCTCGGAGG
AGTCGGAGTTGGACTTGGAGCTGGAATTGGCTTAGGCTCAGGTGGAAGTGGTTCTGGCTCTGGCTCTGGCTCCGGATCTGGATCGAGTTCGTCTTCATCTTCATCATCAT
ACTCTTCAAGCTCAAGCTCTGGGTCTGGAGCTGGGTCCGACGCTGGCTCATACGCAGGCTCGTATGCAGGGTCTCAGGCAGGCTCAGGTGGAAATAGGAATGGAGGGTTG
GGGTCAGGTTCGGGATATGGCGAAGGTTCGGGCAAAGGAAGTGGCAATGGCGGTGGTGAAGGATATGGTGAAGGTCATGGCTATGGGGGAGGCAACTAA
Protein sequence
MASIKFLAVVLCFLLSFSAVLSEGRVARKDLGIDLGGVGVGLGAGIGLGSGGSGSGSGSGSGSGSSSSSSSSSYSSSSSSGSGAGSDAGSYAGSYAGSQAGSGGNRNGGL
GSGSGYGEGSGKGSGNGGGEGYGEGHGYGGGN
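
The relationship between the CDS and protein sequences above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly; `translate` and `CDS_TAIL` are illustrative names, and `CDS_TAIL` is simply the last line of the CDS as displayed above, which happens to start on a codon boundary.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped display
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# Final 99 nt of the CDS shown above (ends with the TAA stop codon).
CDS_TAIL = (
    "GGGTCAGGTTCGGGATATGGCGAAGGTTCGGGCAAAGGAAGTGGCAATGGCGGTGGTGAAGGATATGGTG"
    "AAGGTCATGGCTATGGGGGAGGCAACTAA"
)
print(translate(CDS_TAIL))
```

Passing the full CDS (all four displayed lines joined together) to the same function should reproduce the complete protein sequence, provided the standard-code assumption holds.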