; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018520 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018520
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionFibroin heavy chain
Genome locationChr04:4889522..4889884
RNA-Seq ExpressionHG10018520
SyntenyHG10018520
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051071.1 fibroin heavy chain [Cucumis melo var. makuwa]4.0e-3874.6Show/hide
Query:  MKSD-SNSPWNWNTINFNTNI--FDPNRRNPKAPP--ALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGF
        MKSD S+SPWN    NF+T I  FD +RRN K  P  ALVLEAGGGAGVGCGLG GFGLVGGIGHGG SPWNHLHLVFGLG GCGVGLGLGIG+G GYGF
Subjt:  MKSD-SNSPWNWNTINFNTNI--FDPNRRNPKAPP--ALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGF

Query:  SFHSLNSYFSDHKANPKHK-PPLLQF
        SF SL+SYFS   ++PKHK P L+QF
Subjt:  SFHSLNSYFSDHKANPKHK-PPLLQF

KAF3972995.1 hypothetical protein CMV_003541 [Castanea mollissima]9.0e-2247.73Show/hide
Query:  MKSDSNSPWN-WNTINFNTNIFDPNRR------------NPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGI
        +K+DS+S WN WN  + N    +  ++            NPK    +VL  GGG G+GCG+GLGFG VGGIG+GG  PWNHL LV G+G+GCGVG+G G 
Subjt:  MKSDSNSPWN-WNTINFNTNIFDPNRR------------NPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGI

Query:  GEGIGYGFSFHSLNSYFSDHKANPKHKPPLLQ
        G+GIGYGFS   L SYFS   ++   K  +++
Subjt:  GEGIGYGFSFHSLNSYFSDHKANPKHKPPLLQ

KAG6578355.1 hypothetical protein SDJN03_22803, partial [Cucurbita argyrosperma subsp. sororia]9.5e-4876.52Show/hide
Query:  MKSDSNSPWNWNTI---NFNT----------NIFDPNRRNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGI
        MKSDSNSPW WNT    NF+T          NIFD NRRN K PP LVL+AGGGAGVGCG+GLGFGLVGGIGHGG SPWNHLHLVFGLGLGCGVGLGLGI
Subjt:  MKSDSNSPWNWNTI---NFNT----------NIFDPNRRNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGI

Query:  GEGIGYGFSFHSLNSYFSDHKANPKHKPPLLQ
        G+GIGYG SF SL+SYFSDHK NP+ KPPL+Q
Subjt:  GEGIGYGFSFHSLNSYFSDHKANPKHKPPLLQ

KGN63146.1 hypothetical protein Csa_022245 [Cucumis sativus]9.9e-3773.98Show/hide
Query:  MKSD-SNSPWNWNTINFNTNI-FDPNRRNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFH
        MKSD SNSPWN    NF T   FD +RRN K  PALVLEAGGGAGVGCGLG+GFGLVGGIGH G SPWNHLHLVFGLG GCGVGLGLGIG+G GYG SF 
Subjt:  MKSD-SNSPWNWNTINFNTNI-FDPNRRNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFH

Query:  SLNSYFSDHKANPKHK-PPLLQF
        S++SYFS   +NPK K P L+QF
Subjt:  SLNSYFSDHKANPKHK-PPLLQF

XP_030955948.1 protein TRIGALACTOSYLDIACYLGLYCEROL 5, chloroplastic [Quercus lobata]3.1e-2248.09Show/hide
Query:  MKSDSNSPWN-WNTINFNTNIFDPNRR-----------NPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIG
        +K+DS+S WN WN  + N    +  ++           NPK    +VL  GGG G+GCG+GLGFGLVGG+G+GG  PWNHL LV G+G+GCGVG+G G G
Subjt:  MKSDSNSPWN-WNTINFNTNIFDPNRR-----------NPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIG

Query:  EGIGYGFSFHSLNSYFSDHKANPKHKPPLLQ
        +GIGYGFS  SL SY S   ++   K  +++
Subjt:  EGIGYGFSFHSLNSYFSDHKANPKHKPPLLQ

TrEMBL top hitse value%identityAlignment
A0A0A0LSZ2 Uncharacterized protein4.8e-3773.98Show/hide
Query:  MKSD-SNSPWNWNTINFNTNI-FDPNRRNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFH
        MKSD SNSPWN    NF T   FD +RRN K  PALVLEAGGGAGVGCGLG+GFGLVGGIGH G SPWNHLHLVFGLG GCGVGLGLGIG+G GYG SF 
Subjt:  MKSD-SNSPWNWNTINFNTNI-FDPNRRNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFH

Query:  SLNSYFSDHKANPKHK-PPLLQF
        S++SYFS   +NPK K P L+QF
Subjt:  SLNSYFSDHKANPKHK-PPLLQF

A0A498J4X3 Uncharacterized protein7.4e-2254.63Show/hide
Query:  MKSDSNSPWNWNTINFNTNIFDPNR--------RNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIG
        + +D +S WNW +   + N  D NR         +PKA   +VL  GGGAG+GCG G+GFGLVGG+G+GG  PWN + LVFG+G+GCGVGLG G G+GIG
Subjt:  MKSDSNSPWNWNTINFNTNIFDPNR--------RNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIG

Query:  YGFSFHSL
        YGFS  SL
Subjt:  YGFSFHSL

A0A5A7UC28 Fibroin heavy chain1.9e-3874.6Show/hide
Query:  MKSD-SNSPWNWNTINFNTNI--FDPNRRNPKAPP--ALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGF
        MKSD S+SPWN    NF+T I  FD +RRN K  P  ALVLEAGGGAGVGCGLG GFGLVGGIGHGG SPWNHLHLVFGLG GCGVGLGLGIG+G GYGF
Subjt:  MKSD-SNSPWNWNTINFNTNI--FDPNRRNPKAPP--ALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGF

Query:  SFHSLNSYFSDHKANPKHK-PPLLQF
        SF SL+SYFS   ++PKHK P L+QF
Subjt:  SFHSLNSYFSDHKANPKHK-PPLLQF

A0A5N5GAU9 Keratin5.7e-2254.63Show/hide
Query:  MKSDSNSPWNWNTINFNTNIFDPNR--------RNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIG
        + +D++S WNW +   + N  D NR         +PKA   +VL  GGGAG+GCG G+GFGLVGG+G+GG  PWN + LVFG+G+GCGVGLG G G+GIG
Subjt:  MKSDSNSPWNWNTINFNTNIFDPNR--------RNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIG

Query:  YGFSFHSL
        YGFS  SL
Subjt:  YGFSFHSL

A0A7N2KWH8 Uncharacterized protein1.5e-2248.09Show/hide
Query:  MKSDSNSPWN-WNTINFNTNIFDPNRR-----------NPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIG
        +K+DS+S WN WN  + N    +  ++           NPK    +VL  GGG G+GCG+GLGFGLVGG+G+GG  PWNHL LV G+G+GCGVG+G G G
Subjt:  MKSDSNSPWN-WNTINFNTNIFDPNRR-----------NPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIG

Query:  EGIGYGFSFHSLNSYFSDHKANPKHKPPLLQ
        +GIGYGFS  SL SY S   ++   K  +++
Subjt:  EGIGYGFSFHSLNSYFSDHKANPKHKPPLLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27695.1 glycine-rich protein9.0e-0448.48Show/hide
Query:  LVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFHS
        +VL    G GVG G G+G G   G G GG+ P N L +  G G GCGVGLGLG G G  +G  + S
Subjt:  LVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFHS

AT1G27695.2 glycine-rich protein9.0e-0448.48Show/hide
Query:  LVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFHS
        +VL    G GVG G G+G G   G G GG+ P N L +  G G GCGVGLGLG G G  +G  + S
Subjt:  LVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFHS

AT1G66820.1 glycine-rich protein1.7e-1046.97Show/hide
Query:  GGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFHSLNSYF
        G G G+GCG G+G GL GG+G G     +H ++V G G+GCG+G G G G G+G G+SF  +   F
Subjt:  GGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFHSLNSYF

AT4G10330.1 glycine-rich protein4.6e-0847.69Show/hide
Query:  GAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFHSLNSYFS
        G GVGCG G G GL+GG+G G   P     L FGLG G G G+G+G G G+G G ++    SY++
Subjt:  GAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFHSLNSYFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCAGACTCCAATTCGCCATGGAACTGGAACACAATCAACTTCAACACCAACATTTTCGACCCTAATCGCCGCAACCCAAAGGCACCGCCAGCCCTAGTCTTGGA
GGCCGGCGGAGGAGCTGGAGTTGGCTGCGGCCTTGGACTCGGCTTCGGCCTCGTCGGCGGGATTGGCCACGGCGGCGTCTCACCCTGGAATCACCTTCACCTCGTTTTCG
GTCTCGGTCTCGGCTGTGGCGTTGGCTTAGGGCTTGGAATTGGCGAAGGCATTGGCTATGGCTTCTCCTTCCACTCCCTCAATTCTTATTTCTCTGATCACAAAGCCAAC
CCTAAACATAAGCCTCCTCTGCTTCAGTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATCAGACTCCAATTCGCCATGGAACTGGAACACAATCAACTTCAACACCAACATTTTCGACCCTAATCGCCGCAACCCAAAGGCACCGCCAGCCCTAGTCTTGGA
GGCCGGCGGAGGAGCTGGAGTTGGCTGCGGCCTTGGACTCGGCTTCGGCCTCGTCGGCGGGATTGGCCACGGCGGCGTCTCACCCTGGAATCACCTTCACCTCGTTTTCG
GTCTCGGTCTCGGCTGTGGCGTTGGCTTAGGGCTTGGAATTGGCGAAGGCATTGGCTATGGCTTCTCCTTCCACTCCCTCAATTCTTATTTCTCTGATCACAAAGCCAAC
CCTAAACATAAGCCTCCTCTGCTTCAGTTTTGA
Protein sequenceShow/hide protein sequence
MKSDSNSPWNWNTINFNTNIFDPNRRNPKAPPALVLEAGGGAGVGCGLGLGFGLVGGIGHGGVSPWNHLHLVFGLGLGCGVGLGLGIGEGIGYGFSFHSLNSYFSDHKAN
PKHKPPLLQF