; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G25400 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G25400
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionFibroin heavy chain
Genome locationChr2:21622821..21623481
RNA-Seq ExpressionCSPI02G25400
SyntenyCSPI02G25400
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051071.1 fibroin heavy chain [Cucumis melo var. makuwa]3.2e-5188.52Show/hide
Query:  MKSDSSNSPWNNF-PTIFSFDQDRRNTKELP--ALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQS
        MKSDSS+SPWNNF  TIFSFD DRRNTKELP  ALVLEAGGGAGVGCGLG GFGLVGGIGH GASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYG SF+S
Subjt:  MKSDSSNSPWNNF-PTIFSFDQDRRNTKELP--ALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQS

Query:  VDSYFSHLISNPKPKQPSLIQF
        +DSYFSHL S+PK KQPSLIQF
Subjt:  VDSYFSHLISNPKPKQPSLIQF

KAG6578355.1 hypothetical protein SDJN03_22803, partial [Cucurbita argyrosperma subsp. sororia]4.9e-3667.91Show/hide
Query:  MKSDSSNSPW-------NNFPT-----IFS----FDQDRRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLG
        MKSD SNSPW       NNF T      FS    FD +RRN K+ P LVL+AGGGAGVGCG+G+GFGLVGGIGH GASPWNHLHLVFGLG GCGVGLGLG
Subjt:  MKSDSSNSPW-------NNFPT-----IFS----FDQDRRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLG

Query:  IGQGFGYGVSFQSVDSYFSHLISNPKPKQPSLIQ
        IG+G GYG+SF S+DSYFS    NP+ K P LIQ
Subjt:  IGQGFGYGVSFQSVDSYFSHLISNPKPKQPSLIQ

KGN63146.1 hypothetical protein Csa_022245 [Cucumis sativus]2.8e-6099.16Show/hide
Query:  MKSDSSNSPWNNFPTIFSFDQDRRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDS
        MKSDSSNSPWNNFPTIF FDQDRRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDS
Subjt:  MKSDSSNSPWNNFPTIFSFDQDRRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDS

Query:  YFSHLISNPKPKQPSLIQF
        YFSHLISNPKPKQPSLIQF
Subjt:  YFSHLISNPKPKQPSLIQF

XP_022769621.1 keratin-associated protein 21-1 [Durio zibethinus]9.3e-1957.78Show/hide
Query:  RRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPKPK
        R + +++   VL  G GAG+GCG+G+GFGLVGG+G+ G  PWNHL LVFG+GAGCGVG+G G GQG GYG S +S++SY S   SN   K
Subjt:  RRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPKPK

XP_030955948.1 protein TRIGALACTOSYLDIACYLGLYCEROL 5, chloroplastic [Quercus lobata]1.2e-1856.18Show/hide
Query:  NTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPKPKQ
        N K+   +VL  GGG G+GCG+G+GFGLVGG+G+ G  PWNHL LV G+G GCGVG+G G GQG GYG S +S++SY S   S+   K+
Subjt:  NTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPKPKQ

TrEMBL top hitse value%identityAlignment
A0A0A0LSZ2 Uncharacterized protein1.4e-6099.16Show/hide
Query:  MKSDSSNSPWNNFPTIFSFDQDRRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDS
        MKSDSSNSPWNNFPTIF FDQDRRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDS
Subjt:  MKSDSSNSPWNNFPTIFSFDQDRRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDS

Query:  YFSHLISNPKPKQPSLIQF
        YFSHLISNPKPKQPSLIQF
Subjt:  YFSHLISNPKPKQPSLIQF

A0A5A7UC28 Fibroin heavy chain1.5e-5188.52Show/hide
Query:  MKSDSSNSPWNNF-PTIFSFDQDRRNTKELP--ALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQS
        MKSDSS+SPWNNF  TIFSFD DRRNTKELP  ALVLEAGGGAGVGCGLG GFGLVGGIGH GASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYG SF+S
Subjt:  MKSDSSNSPWNNF-PTIFSFDQDRRNTKELP--ALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQS

Query:  VDSYFSHLISNPKPKQPSLIQF
        +DSYFSHL S+PK KQPSLIQF
Subjt:  VDSYFSHLISNPKPKQPSLIQF

A0A6A1VGE8 Uncharacterized protein1.3e-1856.04Show/hide
Query:  RRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPKPKQ
        + N K+    +L  GGGAG+GCG GIGFGLVGG+G++G  PWNHL L FG+G GCGVG+GLG G G GYG+S+ S+ SY S   S+   K+
Subjt:  RRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPKPKQ

A0A6P6AXV7 keratin-associated protein 21-14.5e-1957.78Show/hide
Query:  RRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPKPK
        R + +++   VL  G GAG+GCG+G+GFGLVGG+G+ G  PWNHL LVFG+GAGCGVG+G G GQG GYG S +S++SY S   SN   K
Subjt:  RRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPKPK

A0A7N2KWH8 Uncharacterized protein5.9e-1956.18Show/hide
Query:  NTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPKPKQ
        N K+   +VL  GGG G+GCG+G+GFGLVGG+G+ G  PWNHL LV G+G GCGVG+G G GQG GYG S +S++SY S   S+   K+
Subjt:  NTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPKPKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27695.1 glycine-rich protein1.1e-0447.54Show/hide
Query:  GGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQS
        G G GVGCG G+G+G        G  P N L +  G G GCGVGLGLG G G  +G  ++S
Subjt:  GGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQS

AT1G27695.2 glycine-rich protein1.1e-0447.54Show/hide
Query:  GGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQS
        G G GVGCG G+G+G        G  P N L +  G G GCGVGLGLG G G  +G  ++S
Subjt:  GGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQS

AT1G66820.1 glycine-rich protein3.8e-1034.29Show/hide
Query:  SSNSPWNNFPTIFSFDQD-------RRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQS
        S+N  W+  P + +   +       R     +    +  G G G+GCG GIG GL GG+G   +   +H ++V G G GCG+G G G G G G G SF  
Subjt:  SSNSPWNNFPTIFSFDQD-------RRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQS

Query:  VDSYF
        +   F
Subjt:  VDSYF

AT4G10330.1 glycine-rich protein1.7e-0746.15Show/hide
Query:  GAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFS
        G GVGCG G G GL+GG+G     P     L FGLG G G G+G+G G G G G ++    SY++
Subjt:  GAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCAGACTCCTCCAATTCCCCATGGAACAACTTCCCCACCATTTTCAGTTTCGACCAGGATCGCCGCAACACAAAGGAACTACCAGCCTTAGTCTTGGAAGCCGG
GGGAGGAGCAGGAGTTGGGTGCGGCCTTGGAATCGGGTTCGGACTCGTGGGAGGGATCGGCCACGCCGGTGCCTCGCCCTGGAATCACCTTCACCTCGTATTTGGCCTGG
GTGCCGGCTGTGGCGTTGGGTTAGGGCTTGGAATTGGGCAAGGCTTTGGATATGGCGTCTCCTTTCAATCTGTTGATTCTTATTTCTCTCATCTCATTTCTAACCCTAAA
CCTAAGCAGCCTTCCCTCATTCAATTTTGA
mRNA sequenceShow/hide mRNA sequence
AGAGTTTAGAATTTAAATTTGGAAGACATCCAACGGGAAGTGAGATTGGGCGATGGGATAAGAAATATGTGAAGAACAACAAACATAGAGTTCAAAATTCACATCTCTCT
CTCTCACTCTCTCTTTTTCATCCTCTCACTATGAAATCAGACTCCTCCAATTCCCCATGGAACAACTTCCCCACCATTTTCAGTTTCGACCAGGATCGCCGCAACACAAA
GGAACTACCAGCCTTAGTCTTGGAAGCCGGGGGAGGAGCAGGAGTTGGGTGCGGCCTTGGAATCGGGTTCGGACTCGTGGGAGGGATCGGCCACGCCGGTGCCTCGCCCT
GGAATCACCTTCACCTCGTATTTGGCCTGGGTGCCGGCTGTGGCGTTGGGTTAGGGCTTGGAATTGGGCAAGGCTTTGGATATGGCGTCTCCTTTCAATCTGTTGATTCT
TATTTCTCTCATCTCATTTCTAACCCTAAACCTAAGCAGCCTTCCCTCATTCAATTTTGAAATTTCCTCTCTTACCCATTTCCGTACCCTTTCATTTTTATCTTCTCCTT
TCAGAATTTTTTTTTTGCATCTTTCAACATCTTTTTCGACTTCTTATTTCTCCATTTAATTTGTATGCATTAATTGCACATTCATTTTATTAATCCACCACTATCAAATT
C
Protein sequenceShow/hide protein sequence
MKSDSSNSPWNNFPTIFSFDQDRRNTKELPALVLEAGGGAGVGCGLGIGFGLVGGIGHAGASPWNHLHLVFGLGAGCGVGLGLGIGQGFGYGVSFQSVDSYFSHLISNPK
PKQPSLIQF