; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011672 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011672
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionfibroin heavy chain-like
Genome locationChr01:9127704..9128549
RNA-Seq ExpressionHG10011672
SyntenyHG10011672
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060661.1 hypothetical protein E6C27_scaffold22G005540 [Cucumis melo var. makuwa]7.4e-5148.6Show/hide
Query:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI
        MA  KYFLL PF+F+CLSYTFA+ VFN + G      + P PDPSAGPGVD GVSN+G GP+A PRA L   GG+S+V   P                  
Subjt:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI

Query:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPP--GFV
             GPK                                      AGPKA  G K+GVSG  AGPR GPKG    VNG  VGVGV   P FG P  G  
Subjt:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPP--GFV

Query:  P-PGFGPRPGYWPVKPYDECTLGYVCPSN--ECSKFEYGPCESYHFRPLTASTDLHEVGINWANSKPVATTQHGGSGPVNHIDSAH
        P PG   RPG    +PY  C LGYVCP+    CSKF YG C+SY+F PL+ASTDLHEV INWA SKP AT QHG SGP  H+DSAH
Subjt:  P-PGFGPRPGYWPVKPYDECTLGYVCPSN--ECSKFEYGPCESYHFRPLTASTDLHEVGINWANSKPVATTQHGGSGPVNHIDSAH

KAG6577377.1 hypothetical protein SDJN03_24951, partial [Cucurbita argyrosperma subsp. sororia]1.9e-5451.88Show/hide
Query:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAE-------------PRAYLRDEGGVSNV----GASPR
        M   KYFLL PFVF+CLS TFAN V NS+DGSG D+ A P   P+AGPGV++GVSNV AGP AE             P+A    EG VS+V     A P+
Subjt:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAE-------------PRAYLRDEGGVSNV----GASPR

Query:  AGPKAGLEGEGGLSNVITGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVG
        AGPKAG   E  +S+V  G R+GPKA PG EG VS+V AGLR G KAGPG E  VSNV AG   GP+A PG + GVS S  G R   + V+ ++NG+ +G
Subjt:  AGPKAGLEGEGGLSNVITGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVG

Query:  VGV--GYRPGF-GPPGFVPPGFGPRPGYWPVKPYDECTLGYVCPS---NECSKFEYGPCESYHFRPLTASTDLHEVGINWA-NSKPVATTQHG
        +GV  GYR GF    G     FGP  G       +ECTLGYVCP+     C KF YG C++Y F PL AS  LHEV + WA  SKP AT Q+G
Subjt:  VGV--GYRPGF-GPPGFVPPGFGPRPGYWPVKPYDECTLGYVCPS---NECSKFEYGPCESYHFRPLTASTDLHEVGINWA-NSKPVATTQHG

KGN56231.1 hypothetical protein Csa_011503 [Cucumis sativus]2.8e-5852.41Show/hide
Query:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI
        MA  KYFLL PF+F+CLSYTFAN VFN +DG G    + P PDPSAGP VDRGVSN G G                     P+AGP+AGL G GG+SNV 
Subjt:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI

Query:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPPGFVPP
         GS  GP                     KAGPGV+E +SNVGAG R        PK+GVS   AGPR GPKGV+ IV G+ VGVGV   P FG P     
Subjt:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPPGFVPP

Query:  GFGPRPGYW----PV--KPYDECTLGYVCPSNE---CSKFEYGPCESYHFRPLTASTDLHEVGINWANSKPVATTQHGGSGPVNHIDSAH
        G  P PG W    P+  +PY+ C LGYVCP+N    C K  YG CESY+FRPL+AST+LH+V INWA SK V T QHG SGP  HIDSAH
Subjt:  GFGPRPGYW----PV--KPYDECTLGYVCPSNE---CSKFEYGPCESYHFRPLTASTDLHEVGINWANSKPVATTQHGGSGPVNHIDSAH

XP_022929340.1 fibroin heavy chain-like [Cucurbita moschata]2.9e-4748.91Show/hide
Query:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI
        M   KYFLL PFVF+CLS TFAN V NS+DGSG D+ A P   P+AGPGV++GVSNV AGP A                             EG +++V 
Subjt:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI

Query:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGV--GYRPGF-GPPGF
         G ++GPKA PG EG VS+V AGLR G KAGPG E  VSNV AG   GP+A PG + GVS S  G R   + V+ ++NG+ +G+GV  GYR GF    G 
Subjt:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGV--GYRPGF-GPPGF

Query:  VPPGFGPRPGYWPVKPYDECTLGYVCPS---NECSKFEYGPCESYHFRPLTASTDLHEVGINWA-NSKPVATTQHG
            FGP  G       +ECTLGYVCP+     C KF YG C++Y F PL AS  LHEV + WA  SKP AT Q+G
Subjt:  VPPGFGPRPGYWPVKPYDECTLGYVCPS---NECSKFEYGPCESYHFRPLTASTDLHEVGINWA-NSKPVATTQHG

XP_023551823.1 fibroin heavy chain-like isoform X1 [Cucurbita pepo subsp. pepo]2.8e-4249.41Show/hide
Query:  WPIPDPSAGPGVDRGVSNVGAGPRAE-------------PRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVITGSRSGPKASPGIEGGVSNVGAGLR
        WP   P+AGPGV++GVSNV AGP AE             P+A    EG VS+V A PRAG KAG   E  +S+V  G R+GPKA PG EG VS+V AG R
Subjt:  WPIPDPSAGPGVDRGVSNVGAGPRAE-------------PRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVITGSRSGPKASPGIEGGVSNVGAGLR

Query:  YGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGV--GYRPGF-----GPPGFVPPGFGPRPGYWPVKPYDECT
         G KAGPG E  V+NV AG   GP+A PG + GVS S  G R   + V+ ++NG+ +G+GV  GYR GF     G   +  PG G R         +ECT
Subjt:  YGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGV--GYRPGF-----GPPGFVPPGFGPRPGYWPVKPYDECT

Query:  LGYVCPS---NECSKFEYGPCESYHFRPLTASTDLHEVGINWA-NSKPVATTQHG
        LGYVCP+     C KF YG C+SY F PL AS  LHEV + WA  SKP AT Q+G
Subjt:  LGYVCPS---NECSKFEYGPCESYHFRPLTASTDLHEVGINWA-NSKPVATTQHG

TrEMBL top hitse value%identityAlignment
A0A0A0L7X7 Uncharacterized protein1.4e-5852.41Show/hide
Query:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI
        MA  KYFLL PF+F+CLSYTFAN VFN +DG G    + P PDPSAGP VDRGVSN G G                     P+AGP+AGL G GG+SNV 
Subjt:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI

Query:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPPGFVPP
         GS  GP                     KAGPGV+E +SNVGAG R        PK+GVS   AGPR GPKGV+ IV G+ VGVGV   P FG P     
Subjt:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPPGFVPP

Query:  GFGPRPGYW----PV--KPYDECTLGYVCPSNE---CSKFEYGPCESYHFRPLTASTDLHEVGINWANSKPVATTQHGGSGPVNHIDSAH
        G  P PG W    P+  +PY+ C LGYVCP+N    C K  YG CESY+FRPL+AST+LH+V INWA SK V T QHG SGP  HIDSAH
Subjt:  GFGPRPGYW----PV--KPYDECTLGYVCPSNE---CSKFEYGPCESYHFRPLTASTDLHEVGINWANSKPVATTQHGGSGPVNHIDSAH

A0A5A7V4J6 Uncharacterized protein3.6e-5148.6Show/hide
Query:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI
        MA  KYFLL PF+F+CLSYTFA+ VFN + G      + P PDPSAGPGVD GVSN+G GP+A PRA L   GG+S+V   P                  
Subjt:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI

Query:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPP--GFV
             GPK                                      AGPKA  G K+GVSG  AGPR GPKG    VNG  VGVGV   P FG P  G  
Subjt:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPP--GFV

Query:  P-PGFGPRPGYWPVKPYDECTLGYVCPSN--ECSKFEYGPCESYHFRPLTASTDLHEVGINWANSKPVATTQHGGSGPVNHIDSAH
        P PG   RPG    +PY  C LGYVCP+    CSKF YG C+SY+F PL+ASTDLHEV INWA SKP AT QHG SGP  H+DSAH
Subjt:  P-PGFGPRPGYWPVKPYDECTLGYVCPSN--ECSKFEYGPCESYHFRPLTASTDLHEVGINWANSKPVATTQHGGSGPVNHIDSAH

A0A6I8PB04 Bassoon presynaptic cytomatrix protein5.4e-0741.98Show/hide
Query:  AGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVITGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGL
        AGP V       G GPRA P A  R   G    GA PRAGP  G  G G  +    G  +GP+A PG  G     G G   G   GPG   G    G G 
Subjt:  AGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVITGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGL

Query:  RAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPPGFVPPGFGPRPG
        RAGP  GPGP+ G  G G GPR GP        G   G G G  PG GP     PG GP  G
Subjt:  RAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPPGFVPPGFGPRPG

A0A6J1EU53 fibroin heavy chain-like1.4e-4748.91Show/hide
Query:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI
        M   KYFLL PFVF+CLS TFAN V NS+DGSG D+ A P   P+AGPGV++GVSNV AGP A                             EG +++V 
Subjt:  MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVI

Query:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGV--GYRPGF-GPPGF
         G ++GPKA PG EG VS+V AGLR G KAGPG E  VSNV AG   GP+A PG + GVS S  G R   + V+ ++NG+ +G+GV  GYR GF    G 
Subjt:  TGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGV--GYRPGF-GPPGF

Query:  VPPGFGPRPGYWPVKPYDECTLGYVCPS---NECSKFEYGPCESYHFRPLTASTDLHEVGINWA-NSKPVATTQHG
            FGP  G       +ECTLGYVCP+     C KF YG C++Y F PL AS  LHEV + WA  SKP AT Q+G
Subjt:  VPPGFGPRPGYWPVKPYDECTLGYVCPS---NECSKFEYGPCESYHFRPLTASTDLHEVGINWA-NSKPVATTQHG

A0A7R9G286 Hypothetical protein7.5e-0939.34Show/hide
Query:  PSAGPGVDRGVS---NVGAGPRAEPRAYLRDEGGVSNVGASP-------RAGPKAGLEGEGGLSNVITGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPG
        P  GPGV  G      VG GP   P       G  S VG  P         GP  G  G GG      G+R GP   PG+ GG    G G+  G   GPG
Subjt:  PSAGPGVDRGVS---NVGAGPRAEPRAYLRDEGGVSNVGASP-------RAGPKAGLEGEGGLSNVITGSRSGPKASPGIEGGVSNVGAGLRYGLKAGPG

Query:  VEEGVSNVGAGLRAGPKAGPGPKVGVS---GSGAGPRVGPKGVNSIVN-GVEVGVGVGYRPG--FGPPGFVPPGFGPRPGYWP
           G    G G+  GP  GPG   G     G G GP  GP GV S+   G  VG G GY PG   G PG+ P G G  PGY P
Subjt:  VEEGVSNVGAGLRAGPKAGPGPKVGVS---GSGAGPRVGPKGVNSIVN-GVEVGVGVGYRPG--FGPPGFVPPGFGPRPGYWP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGCTTCAAATATTTCCTTCTCTGGCCCTTTGTTTTCATCTGCTTAAGCTACACCTTCGCCAATGTAGTCTTCAACTCCAATGATGGATCTGGTTCCGATCTTAG
GGCTTGGCCAATACCCGACCCCAGTGCTGGCCCTGGAGTCGATAGAGGGGTAAGTAATGTCGGGGCTGGTCCAAGAGCCGAGCCGAGAGCTTACCTAAGAGACGAGGGAG
GGGTAAGTAATGTTGGTGCTAGTCCGAGAGCCGGACCAAAAGCTGGCCTAGAAGGTGAGGGAGGGTTAAGCAATGTCATTACTGGTTCGAGATCCGGACCGAAAGCTAGC
CCAGGAATCGAAGGAGGGGTAAGCAATGTTGGTGCTGGTCTGAGATATGGATTGAAAGCTGGTCCAGGAGTCGAGGAAGGGGTAAGCAATGTCGGTGCTGGTCTGAGAGC
TGGACCGAAAGCTGGCCCAGGACCTAAGGTAGGGGTAAGTGGTAGTGGGGCTGGTCCGAGAGTCGGGCCAAAAGGTGTTAATTCAATTGTTAATGGAGTCGAAGTTGGAG
TCGGAGTTGGGTACAGGCCAGGATTTGGTCCTCCAGGATTTGTGCCTCCAGGATTTGGTCCAAGGCCAGGGTATTGGCCTGTTAAACCGTACGATGAATGCACATTGGGC
TATGTTTGTCCATCAAATGAATGCAGCAAATTTGAGTATGGACCTTGCGAATCTTATCACTTTCGTCCATTAACGGCTTCTACGGACCTGCACGAAGTTGGCATCAATTG
GGCCAATAGCAAGCCTGTTGCAACGACCCAACATGGCGGATCTGGACCAGTTAATCACATCGACTCAGCCCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGCTTCAAATATTTCCTTCTCTGGCCCTTTGTTTTCATCTGCTTAAGCTACACCTTCGCCAATGTAGTCTTCAACTCCAATGATGGATCTGGTTCCGATCTTAG
GGCTTGGCCAATACCCGACCCCAGTGCTGGCCCTGGAGTCGATAGAGGGGTAAGTAATGTCGGGGCTGGTCCAAGAGCCGAGCCGAGAGCTTACCTAAGAGACGAGGGAG
GGGTAAGTAATGTTGGTGCTAGTCCGAGAGCCGGACCAAAAGCTGGCCTAGAAGGTGAGGGAGGGTTAAGCAATGTCATTACTGGTTCGAGATCCGGACCGAAAGCTAGC
CCAGGAATCGAAGGAGGGGTAAGCAATGTTGGTGCTGGTCTGAGATATGGATTGAAAGCTGGTCCAGGAGTCGAGGAAGGGGTAAGCAATGTCGGTGCTGGTCTGAGAGC
TGGACCGAAAGCTGGCCCAGGACCTAAGGTAGGGGTAAGTGGTAGTGGGGCTGGTCCGAGAGTCGGGCCAAAAGGTGTTAATTCAATTGTTAATGGAGTCGAAGTTGGAG
TCGGAGTTGGGTACAGGCCAGGATTTGGTCCTCCAGGATTTGTGCCTCCAGGATTTGGTCCAAGGCCAGGGTATTGGCCTGTTAAACCGTACGATGAATGCACATTGGGC
TATGTTTGTCCATCAAATGAATGCAGCAAATTTGAGTATGGACCTTGCGAATCTTATCACTTTCGTCCATTAACGGCTTCTACGGACCTGCACGAAGTTGGCATCAATTG
GGCCAATAGCAAGCCTGTTGCAACGACCCAACATGGCGGATCTGGACCAGTTAATCACATCGACTCAGCCCACTAA
Protein sequenceShow/hide protein sequence
MAGFKYFLLWPFVFICLSYTFANVVFNSNDGSGSDLRAWPIPDPSAGPGVDRGVSNVGAGPRAEPRAYLRDEGGVSNVGASPRAGPKAGLEGEGGLSNVITGSRSGPKAS
PGIEGGVSNVGAGLRYGLKAGPGVEEGVSNVGAGLRAGPKAGPGPKVGVSGSGAGPRVGPKGVNSIVNGVEVGVGVGYRPGFGPPGFVPPGFGPRPGYWPVKPYDECTLG
YVCPSNECSKFEYGPCESYHFRPLTASTDLHEVGINWANSKPVATTQHGGSGPVNHIDSAH