CuGenDBv2

CSPI01G18120 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G18120
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: glycine-rich cell wall structural protein 2-like
Genome location: Chr1:13590869..13591776
RNA-Seq Expression: CSPI01G18120
Synteny: CSPI01G18120
Gene Ontology terms: NA
InterPro domains: NA
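
The genome location field can be split into chromosome, start, and end with a short script. This is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Chr1:13590869..13591776")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```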


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_008443814.1 PREDICTED: glycine-rich protein DOT1-like [Cucumis melo] | e value: 2.7e-43 | % identity: 93.38
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGA--GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS
        MASIKAI VLCLVLCMSVIESEAGRVARK+LGLDLGGLGVGLGLGLGLG+GGGSGSG GA  GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGA--GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS

Query:  YAGSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN
        YAGSRAGSGS  RNGASGGE HGYGEGHGYGEGGNN
Subjt:  YAGSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN

XP_011656027.1 glycine-rich protein DOT1 [Cucumis sativus] | e value: 2.8e-48 | % identity: 99.25
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA
        MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSG GAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA

Query:  GSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN
        GSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN
Subjt:  GSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN

XP_022135257.1 glycine-rich cell wall structural protein 2-like [Momordica charantia] | e value: 7.0e-31 | % identity: 73.03
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA
        MASI+AIA +C V+C   IES+A RVARKDLGLDLGGLG+GLG G+GLG+GGGSGSG GAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS A
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA

Query:  GSRAGSGSGSRNGASG-----------------GEGHGYGEGHGY-GEGGNN
        GS +GSG   RNG SG                 GEGHGYGEG GY GEGGNN
Subjt:  GSRAGSGSGSRNGASG-----------------GEGHGYGEGHGY-GEGGNN

XP_022921885.1 glycine-rich protein DOT1-like [Cucurbita moschata] | e value: 4.8e-32 | % identity: 82.73
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSS--SSSHSSSSSYGGSGAGSEAGSYAGS
        M SI+AIAVLC V+CMS IES+ GRVARKDLGLDLGGLGVGLG+GLGLGLGGGSGSG G+GSGSGSGSGS S   SSS SSSSSYGGSGAGSEAGSYAGS
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSS--SSSHSSSSSYGGSGAGSEAGSYAGS

Query:  YAGSRAGSGSGS--RNGASG-GEGHGYGEGHGYGEGGNN
        YAGSR GS SG   RNG SG GEG GYGEG GYGEGGNN
Subjt:  YAGSRAGSGSGS--RNGASG-GEGHGYGEGHGYGEGGNN

XP_038880102.1 glycine-rich cell wall structural protein 2-like [Benincasa hispida] | e value: 9.4e-44 | % identity: 94.03
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA
        MA+IKAIAVLCLVLCMSVIES AGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSG GAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA

Query:  GSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN
        GSRAGSGSG  NG  GGEGHGYGEG GYGEGGNN
Subjt:  GSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LW94 Uncharacterized protein | e value: 1.4e-48 | % identity: 99.25
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA
        MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSG GAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA

Query:  GSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN
        GSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN
Subjt:  GSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN

A0A1S3B9P9 glycine-rich protein DOT1-like | e value: 1.3e-43 | % identity: 93.38
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGA--GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS
        MASIKAI VLCLVLCMSVIESEAGRVARK+LGLDLGGLGVGLGLGLGLG+GGGSGSG GA  GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGA--GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS

Query:  YAGSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN
        YAGSRAGSGS  RNGASGGE HGYGEGHGYGEGGNN
Subjt:  YAGSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN

A0A5A7SV53 Glycine-rich protein DOT1-like | e value: 1.3e-43 | % identity: 93.38
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGA--GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS
        MASIKAI VLCLVLCMSVIESEAGRVARK+LGLDLGGLGVGLGLGLGLG+GGGSGSG GA  GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGA--GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS

Query:  YAGSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN
        YAGSRAGSGS  RNGASGGE HGYGEGHGYGEGGNN
Subjt:  YAGSRAGSGSGSRNGASGGEGHGYGEGHGYGEGGNN

A0A6J1C269 glycine-rich cell wall structural protein 2-like | e value: 3.4e-31 | % identity: 73.03
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA
        MASI+AIA +C V+C   IES+A RVARKDLGLDLGGLG+GLG G+GLG+GGGSGSG GAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS A
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA

Query:  GSRAGSGSGSRNGASG-----------------GEGHGYGEGHGY-GEGGNN
        GS +GSG   RNG SG                 GEGHGYGEG GY GEGGNN
Subjt:  GSRAGSGSGSRNGASG-----------------GEGHGYGEGHGY-GEGGNN

A0A6J1E720 glycine-rich protein DOT1-like | e value: 2.3e-32 | % identity: 82.73
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSS--SSSHSSSSSYGGSGAGSEAGSYAGS
        M SI+AIAVLC V+CMS IES+ GRVARKDLGLDLGGLGVGLG+GLGLGLGGGSGSG G+GSGSGSGSGS S   SSS SSSSSYGGSGAGSEAGSYAGS
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSS--SSSHSSSSSYGGSGAGSEAGSYAGS

Query:  YAGSRAGSGSGS--RNGASG-GEGHGYGEGHGYGEGGNN
        YAGSR GS SG   RNG SG GEG GYGEG GYGEGGNN
Subjt:  YAGSRAGSGSGS--RNGASG-GEGHGYGEGHGYGEGGNN

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G30450.1 glycine-rich protein | e value: 2.5e-18 | % identity: 66.36
Query:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA
        MA  + + +L LV+  S++     R+ARKDLG+DLGG+G+GLG+GLG+GLGGGSGSG GAGSGSGSGS S SSSSS SSSSS   SG+G  AGS AGS+A
Subjt:  MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYA

Query:  GSRAGSGSGS
        GSRAGSGSG+
Subjt:  GSRAGSGSGS

AT4G30460.1 glycine-rich protein | e value: 1.9e-18 | % identity: 64.89
Query:  AIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSG--AGSEAGSYAGSYAGSR
        ++ ++ L+L  SV+ SE+ RVARKDLGLDLGG+G G+G+G+G+G GGGSGSG GAGSGSG G  S SSSSS SSSSS GG G  AGSEAGSYAGS+AGS 
Subjt:  AIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSG--AGSEAGSYAGSYAGSR

Query:  AG--SGSGSRNGASGGEGHGYGEGHGYGEGG
        +G  SGSG   G+ GG GHG G G G G GG
Subjt:  AG--SGSGSRNGASGGEGHGYGEGHGYGEGG


Sequences
CDS sequence
ATGGCCTCTATCAAAGCTATTGCTGTTCTCTGTTTAGTACTATGTATGTCAGTGATTGAATCAGAAGCAGGGCGTGTGGCCAGGAAGGACCTGGGCCTTGATCTTGGTGG
CTTGGGAGTTGGTCTCGGCCTTGGTTTGGGGTTGGGCTTAGGCGGTGGAAGTGGGTCGGGTGTTGGAGCCGGGTCTGGATCCGGATCCGGGTCTGGGTCCTATTCGTCCT
CATCCTCACATTCATCAAGCTCTAGCTATGGAGGGTCCGGTGCAGGCTCTGAGGCCGGTTCATATGCTGGGTCTTACGCAGGGTCACGTGCTGGGTCAGGTTCCGGCAGC
AGAAACGGCGCCAGCGGGGGAGAGGGTCATGGCTATGGCGAGGGACATGGTTATGGTGAAGGAGGTAACAACTGA
mRNA sequence
AAATTCTATAAATACCTGCACCAATTTCGCCAAAATACATTACCAAATTCACAGACCCATTTCCTAAGTTTCGAAGTTTCAATATTATTAGATTTTCGAAATTAAAATGG
CCTCTATCAAAGCTATTGCTGTTCTCTGTTTAGTACTATGTATGTCAGTGATTGAATCAGAAGCAGGGCGTGTGGCCAGGAAGGACCTGGGCCTTGATCTTGGTGGCTTG
GGAGTTGGTCTCGGCCTTGGTTTGGGGTTGGGCTTAGGCGGTGGAAGTGGGTCGGGTGTTGGAGCCGGGTCTGGATCCGGATCCGGGTCTGGGTCCTATTCGTCCTCATC
CTCACATTCATCAAGCTCTAGCTATGGAGGGTCCGGTGCAGGCTCTGAGGCCGGTTCATATGCTGGGTCTTACGCAGGGTCACGTGCTGGGTCAGGTTCCGGCAGCAGAA
ACGGCGCCAGCGGGGGAGAGGGTCATGGCTATGGCGAGGGACATGGTTATGGTGAAGGAGGTAACAACTGAAAGAAAGTAAAGAGAGTGGAAAAGATCAAAAGTAGGATA
TTGAATAAAAACCACAGCTAATGGTGGGTGGTATCCGTGACCAGACTGGAGTAATACTTGAATGCATTTTGAGTCTCACCTCTCTTTTATCATTTCTTCTTTTATCTCTT
AAATATTATTATTATTATTATTACGTGAAAATTAAAATTAAAATTAGATAATTCGGTGGAATTGATAAAATTAGATGATGTTTTAGATGCATGTAAGTATTTCTCTCGTA
ATTATAATTAGTGTAGTACTTTGTATCATTGTGTCAGTAAACTTATGACGATTTTATTTACTTTTAAGTTTTAAAGATGATGTATTTTCCAAAAACTTTGTTAATAGTTT
ACTCTTGATTTATCCACAACTTTAATTT
Protein sequence
MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGVGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYAGSRAGSGSGS
RNGASGGEGHGYGEGHGYGEGGNN
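
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch under stated assumptions, not the method the database used: the function name `translate` is hypothetical, the standard (nuclear) codon table is assumed, and only the first and last few codons of the CDS are used as examples.

```python
# Standard genetic code in the usual compact form: codons are ordered
# by first, second, and third base over T, C, A, G; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGGCCTCTATCAAAGCTATT"))  # MASIKAI

# Last four codons of the CDS, ending in the TGA stop codon.
print(translate("GGTAACAACTGA"))  # GNN
```

To check the full record, join the wrapped CDS lines into one string before calling `translate` and compare the result with the joined protein lines.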