CuGenDBv2

CsGy3G006200 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy3G006200
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: fibroin heavy chain-like
Genome location: Gy14Chr3:5257871..5258581
RNA-Seq Expression: CsGy3G006200
Synteny: CsGy3G006200
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits
KAA0060661.1 hypothetical protein E6C27_scaffold22G005540 [Cucumis melo var. makuwa] (e-value: 6.69e-121; % identity: 76.89)
Query:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIGPKAGPRAGLGVGG-ISNVDDGSDPGPKAGPGVKEEMSNVG
        MASLKYFLLSPFLFLCLSYTFA+GVFNYD GL FGSMSSPTPDPSAGP VD GVSN GIGPKAGPRAGLG+GG IS+VDD  +PGPKAGP      ++ G
Subjt:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIGPKAGPRAGLGVGG-ISNVDDGSDPGPKAGPGVKEEMSNVG

Query:  AGPRVPKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRP
              KLGVS IEAGPRAGPKGV+    G GVGVGV+LPP+FGGPK+G++PGPGGWY PGPIIQEPY NCMLGYVCP NRPWAC K  YGLC+SYNF P
Subjt:  AGPRVPKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRP

Query:  LSASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH
        LSAST+LH+VKINWAKSK   TAQHGESGP  H+DSAH
Subjt:  LSASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH

KAG6577377.1 hypothetical protein SDJN03_24951, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 5.98e-48; % identity: 43.05)
Query:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIGP-------------KAGPRAGLGVGG-ISNVDDG----SD
        M SLKYFLLSPF+FLCLS TFAN V N DDG GF   + P   P+AGP V++GVSN   GP             KAGP+AG G  G +S+V  G      
Subjt:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIGP-------------KAGPRAGLGVGG-ISNVDDG----SD

Query:  PGPKAGPGVKEEMSNVGAGPRV-PKLG------VSSIEAGPRAGPKG---------------------------------------VDPIVTGLGVGVGV
         GPKAGPG +E +S+V AGPR  PK G      VS ++AG RAGPK                                        VDP++ GLG+G+GV
Subjt:  PGPKAGPGVKEEMSNVGAGPRV-PKLG------VSSIEAGPRAGPKG---------------------------------------VDPIVTGLGVGVGV

Query:  NLPPIFGGPKMGIRPGPGG---WYGPGPIIQEP--YNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLSASTELHDVKINWAK-SKSVETAQHG
        ++     G + G R G GG   W+GPG  I      N C LGYVCPT     C K  YG C++Y F PL AS  LH+V++ WAK SK   T Q+G
Subjt:  NLPPIFGGPKMGIRPGPGG---WYGPGPIIQEP--YNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLSASTELHDVKINWAK-SKSVETAQHG

KGN56231.1 hypothetical protein Csa_011503 [Cucumis sativus] (e-value: 1.82e-174; % identity: 100)
Query:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGPVDRGVSNFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEMSNVGAG
        MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGPVDRGVSNFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEMSNVGAG
Subjt:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGPVDRGVSNFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEMSNVGAG

Query:  PRVPKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLS
        PRVPKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLS
Subjt:  PRVPKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLS

Query:  ASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH
        ASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH
Subjt:  ASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH

XP_022929340.1 fibroin heavy chain-like [Cucurbita moschata] (e-value: 5.69e-52; % identity: 48.81)
Query:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIGP-------------KAGPRAGLGVGG-ISNVDDGSDPGPK
        M SLKYFLLSPF+FLCLS TFAN V N DDG GF   + P   P+AGP V++GVSN   GP             KAGP+AG G  G +S+V  G   GPK
Subjt:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIGP-------------KAGPRAGLGVGG-ISNVDDGSDPGPK

Query:  AGPGVKEEMSNVGAGPRV-PKL------GVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGG---WYGPGPIIQEP--YNNCMLGY
        AGPG +  +SNV AGP V P+       GVSS E G R   + VDP++ GLG+G+GV++     G + G R G GG   W+GPG  I      N C LGY
Subjt:  AGPGVKEEMSNVGAGPRV-PKL------GVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGG---WYGPGPIIQEP--YNNCMLGY

Query:  VCPTNRPWACGKVGYGLCESYNFRPLSASTELHDVKINWAK-SKSVETAQHG
        VCPT     C K  YG C++Y F PL AS  LH+V++ WAK SK   T Q+G
Subjt:  VCPTNRPWACGKVGYGLCESYNFRPLSASTELHDVKINWAK-SKSVETAQHG

XP_023551824.1 fibroin heavy chain-like isoform X2 [Cucurbita pepo subsp. pepo] (e-value: 1.64e-36; % identity: 45.41)
Query:  PTPDPSAGP-VDRGVSNFGIGPKA---------GPRAGLGVGG-ISNVDDGSDPGPKAGPGVKEEMSNVGAGPRV-PKL------GVSSIEAGPRAGPKG
        P   P+AGP V++GVSN   GP A         GP+AG G  G +S+V  G   GPKAGPG +  ++NV AGP V P+       GVSS E G R   + 
Subjt:  PTPDPSAGP-VDRGVSNFGIGPKA---------GPRAGLGVGG-ISNVDDGSDPGPKAGPGVKEEMSNVGAGPRV-PKL------GVSSIEAGPRAGPKG

Query:  VDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGG---WYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLSASTELHDVKINWAK-SKS
        VDP++ GLG+G+GV++     G + G R G GG   W+GPG   +   N C LGYVCPT     C K  YG C+SY F PL AS +LH+V++ WAK SK 
Subjt:  VDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGG---WYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLSASTELHDVKINWAK-SKS

Query:  VETAQHG
          T Q+G
Subjt:  VETAQHG

TrEMBL top hits
A0A091VXI0 Uncharacterized protein (Fragment) (e-value: 1.36e-05; % identity: 39.24)
Query:  DGLGFGSMSSPTP------DPSAGPVD-RGVS-NFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEM---SNVGAGPRVPKLGVS---SIEAGP
        DG G  +  S +P       PSAGPV   GVS   GI P AGP  G GVG  + ++ G+ PGP AG  V++     S+   GP     GVS   S+ AG 
Subjt:  DGLGFGSMSSPTP------DPSAGPVD-RGVS-NFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEM---SNVGAGPRVPKLGVS---SIEAGP

Query:  RAGPK-GVDP---IVTGLGV------GVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQ
        R GP  G+DP   + +G GV      G+G  + P  G    G+ PGPG   GP P I 
Subjt:  RAGPK-GVDP---IVTGLGV------GVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQ

A0A0A0L7X7 Uncharacterized protein (e-value: 8.82e-175; % identity: 100)
Query:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGPVDRGVSNFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEMSNVGAG
        MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGPVDRGVSNFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEMSNVGAG
Subjt:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGPVDRGVSNFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEMSNVGAG

Query:  PRVPKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLS
        PRVPKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLS
Subjt:  PRVPKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLS

Query:  ASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH
        ASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH
Subjt:  ASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH

A0A5A7V4J6 Uncharacterized protein (e-value: 3.24e-121; % identity: 76.89)
Query:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIGPKAGPRAGLGVGG-ISNVDDGSDPGPKAGPGVKEEMSNVG
        MASLKYFLLSPFLFLCLSYTFA+GVFNYD GL FGSMSSPTPDPSAGP VD GVSN GIGPKAGPRAGLG+GG IS+VDD  +PGPKAGP      ++ G
Subjt:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIGPKAGPRAGLGVGG-ISNVDDGSDPGPKAGPGVKEEMSNVG

Query:  AGPRVPKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRP
              KLGVS IEAGPRAGPKGV+    G GVGVGV+LPP+FGGPK+G++PGPGGWY PGPIIQEPY NCMLGYVCP NRPWAC K  YGLC+SYNF P
Subjt:  AGPRVPKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRP

Query:  LSASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH
        LSAST+LH+VKINWAKSK   TAQHGESGP  H+DSAH
Subjt:  LSASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH

A0A6I8PB04 Bassoon presynaptic cytomatrix protein (e-value: 1.64e-04; % identity: 41.04)
Query:  GLGFGSMSSPTPDPSAGPVDRGVSNFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEMSNVGAGPRV---PKLGVSSIEAGPRAGPKGVDPIVT
        G G G+   P   P AGP  R     G GP+AGP  G G G  +    G   GP+AGPG        G GPR    P  G  +   GPRAGP G  P   
Subjt:  GLGFGSMSSPTPDPSAGPVDRGVSNFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEMSNVGAGPRV---PKLGVSSIEAGPRAGPKGVDPIVT

Query:  GLGVGVGVNLPPIFG-GPKMGIRPGPGGWYGPGP
        G G G G    P  G GP+ G   GPG   GPGP
Subjt:  GLGVGVGVNLPPIFG-GPKMGIRPGPGGWYGPGP

A0A6J1EU53 fibroin heavy chain-like (e-value: 2.75e-52; % identity: 48.81)
Query:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIGP-------------KAGPRAGLGVGG-ISNVDDGSDPGPK
        M SLKYFLLSPF+FLCLS TFAN V N DDG GF   + P   P+AGP V++GVSN   GP             KAGP+AG G  G +S+V  G   GPK
Subjt:  MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIGP-------------KAGPRAGLGVGG-ISNVDDGSDPGPK

Query:  AGPGVKEEMSNVGAGPRV-PKL------GVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGG---WYGPGPIIQEP--YNNCMLGY
        AGPG +  +SNV AGP V P+       GVSS E G R   + VDP++ GLG+G+GV++     G + G R G GG   W+GPG  I      N C LGY
Subjt:  AGPGVKEEMSNVGAGPRV-PKL------GVSSIEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGG---WYGPGPIIQEP--YNNCMLGY

Query:  VCPTNRPWACGKVGYGLCESYNFRPLSASTELHDVKINWAK-SKSVETAQHG
        VCPT     C K  YG C++Y F PL AS  LH+V++ WAK SK   T Q+G
Subjt:  VCPTNRPWACGKVGYGLCESYNFRPLSASTELHDVKINWAK-SKSVETAQHG

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCTTCCCTCAAATATTTCCTTCTCTCTCCCTTTCTTTTCCTTTGCTTAAGCTACACCTTTGCCAATGGAGTCTTCAATTACGACGATGGACTTGGTTTCGGTTCCAT
GTCTTCGCCAACACCTGATCCGAGTGCTGGCCCAGTCGATAGAGGGGTGAGCAATTTCGGTATCGGTCCGAAAGCCGGACCGAGAGCTGGCCTAGGAGTTGGAGGAATAA
GTAATGTGGATGATGGTTCAGATCCTGGACCGAAAGCTGGCCCAGGAGTCAAAGAAGAGATGAGCAATGTCGGTGCTGGTCCGAGAGTACCTAAGTTAGGGGTAAGTAGT
ATTGAGGCCGGTCCAAGAGCTGGGCCTAAAGGTGTTGATCCAATTGTTACTGGACTCGGTGTCGGAGTCGGAGTCAATTTGCCTCCTATATTTGGAGGTCCAAAAATGGG
GATAAGGCCGGGACCAGGAGGGTGGTATGGGCCTGGGCCAATAATACAAGAACCGTACAATAATTGCATGTTGGGCTATGTTTGTCCAACAAATAGGCCTTGGGCATGCG
GCAAAGTTGGATATGGACTTTGTGAGTCTTATAACTTTCGTCCATTGTCTGCTTCTACGGAATTGCATGATGTTAAAATCAATTGGGCCAAAAGCAAGTCTGTTGAAACC
GCCCAACATGGTGAATCTGGACCAGGTATTCACATTGACTCAGCCCACTAA
mRNA sequence
ATGGCTTCCCTCAAATATTTCCTTCTCTCTCCCTTTCTTTTCCTTTGCTTAAGCTACACCTTTGCCAATGGAGTCTTCAATTACGACGATGGACTTGGTTTCGGTTCCAT
GTCTTCGCCAACACCTGATCCGAGTGCTGGCCCAGTCGATAGAGGGGTGAGCAATTTCGGTATCGGTCCGAAAGCCGGACCGAGAGCTGGCCTAGGAGTTGGAGGAATAA
GTAATGTGGATGATGGTTCAGATCCTGGACCGAAAGCTGGCCCAGGAGTCAAAGAAGAGATGAGCAATGTCGGTGCTGGTCCGAGAGTACCTAAGTTAGGGGTAAGTAGT
ATTGAGGCCGGTCCAAGAGCTGGGCCTAAAGGTGTTGATCCAATTGTTACTGGACTCGGTGTCGGAGTCGGAGTCAATTTGCCTCCTATATTTGGAGGTCCAAAAATGGG
GATAAGGCCGGGACCAGGAGGGTGGTATGGGCCTGGGCCAATAATACAAGAACCGTACAATAATTGCATGTTGGGCTATGTTTGTCCAACAAATAGGCCTTGGGCATGCG
GCAAAGTTGGATATGGACTTTGTGAGTCTTATAACTTTCGTCCATTGTCTGCTTCTACGGAATTGCATGATGTTAAAATCAATTGGGCCAAAAGCAAGTCTGTTGAAACC
GCCCAACATGGTGAATCTGGACCAGGTATTCACATTGACTCAGCCCACTAA
Protein sequence
MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGPVDRGVSNFGIGPKAGPRAGLGVGGISNVDDGSDPGPKAGPGVKEEMSNVGAGPRVPKLGVSS
IEAGPRAGPKGVDPIVTGLGVGVGVNLPPIFGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLSASTELHDVKINWAKSKSVET
AQHGESGPGIHIDSAH