CuGenDBv2

HG10003599 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10003599
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: keratin, type II cytoskeletal 3 isoform X1
Genome location: Chr08:3921793..3924682
RNA-Seq Expression: HG10003599
Synteny: HG10003599
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6592270.1 hypothetical protein SDJN03_14616, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.4e-65; % identity: 89.93)
Query:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL
        MNG+RH+GGEDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLL
Subjt:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL

Query:  NGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR
        NGHQSIFPQD+IG LVDE+ LNTK+LIR T+REIDKWKR
Subjt:  NGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR

XP_016902400.1 PREDICTED: glycine-rich cell wall structural protein 1 [Cucumis melo] (e value: 8.4e-64; % identity: 91.43)
Query:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL
        MNGNR++ GEDE GLLWNLPVLKSSR G LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGD+L
Subjt:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL

Query:  NGHQSIFP-QDDIGTLVDEVVLNTKRLIRATSREIDKWKR
         GHQSIFP QDDIG LVD++VLNTKRLIRATSREIDKWKR
Subjt:  NGHQSIFP-QDDIGTLVDEVVLNTKRLIRATSREIDKWKR

XP_022932820.1 keratin, type II cytoskeletal 2 epidermal isoform X2 [Cucurbita moschata] (e value: 1.1e-63; % identity: 88.49)
Query:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL
        MNG+RH+GGEDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLL
Subjt:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL

Query:  NGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR
        NG QSIFPQD+IG LVDE+ LNTK+LIR T++EIDKWKR
Subjt:  NGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR

XP_023521209.1 fibroin heavy chain isoform X2 [Cucurbita pepo subsp. pepo] (e value: 3.8e-64; % identity: 89.21)
Query:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL
        MNG+RH+G EDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLL
Subjt:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL

Query:  NGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR
        NGHQSIFPQD+IG LVDE+ LNTK+LIR T+REIDKWKR
Subjt:  NGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR

XP_038889093.1 ctenidin-1 isoform X1 [Benincasa hispida] (e value: 4.4e-65; % identity: 94.24)
Query:  NGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLN
        NGNRHQGGEDE GLLWNLPVLKSSR GKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL+
Subjt:  NGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLN

Query:  GHQSIFPQ-DDIGTLVDEVVLNTKRLIRATSREIDKWKR
        GHQSIFPQ DDI  LVDE+VLN+KRLIRATSREIDKWKR
Subjt:  GHQSIFPQ-DDIGTLVDEVVLNTKRLIRATSREIDKWKR

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KSM8 Uncharacterized protein (e value: 2.6e-63; % identity: 90.71)
Query:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL
        MNGNR+Q GEDE GLLWNLPVLKSSR G LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGD+L
Subjt:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL

Query:  NGHQSIFP-QDDIGTLVDEVVLNTKRLIRATSREIDKWKR
         G QSIFP QDDIG LVD++VLNTKRLIRATS+EIDKWKR
Subjt:  NGHQSIFP-QDDIGTLVDEVVLNTKRLIRATSREIDKWKR

A0A1S4E347 glycine-rich cell wall structural protein 1 (e value: 4.1e-64; % identity: 91.43)
Query:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL
        MNGNR++ GEDE GLLWNLPVLKSSR G LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGD+L
Subjt:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL

Query:  NGHQSIFP-QDDIGTLVDEVVLNTKRLIRATSREIDKWKR
         GHQSIFP QDDIG LVD++VLNTKRLIRATSREIDKWKR
Subjt:  NGHQSIFP-QDDIGTLVDEVVLNTKRLIRATSREIDKWKR

A0A6J1EY31 keratin, type II cytoskeletal 2 epidermal isoform X1 (e value: 3.8e-62; % identity: 87.14)
Query:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL
        MNG+RH+GGEDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLL
Subjt:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL

Query:  NGHQSIFP-QDDIGTLVDEVVLNTKRLIRATSREIDKWKR
        NG QSIFP +D+IG LVDE+ LNTK+LIR T++EIDKWKR
Subjt:  NGHQSIFP-QDDIGTLVDEVVLNTKRLIRATSREIDKWKR

A0A6J1F360 keratin, type II cytoskeletal 2 epidermal isoform X2 (e value: 5.3e-64; % identity: 88.49)
Query:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL
        MNG+RH+GGEDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLL
Subjt:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL

Query:  NGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR
        NG QSIFPQD+IG LVDE+ LNTK+LIR T++EIDKWKR
Subjt:  NGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR

A0A6J1IA95 keratin, type II cytoskeletal 3 isoform X2 (e value: 2.2e-62; % identity: 86.33)
Query:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL
        MNG+R +GGEDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLL
Subjt:  MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL

Query:  NGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR
        NG QSIFPQD+IG +VDE+ LNTK+LI+ T++EIDKWKR
Subjt:  NGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G66820.1 glycine-rich protein (e value: 5.3e-08; % identity: 51.43)
Query:  LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ-----LGFGLGAGCGVGLGFGYGVGRGIAQDD-KRRY
        +GP  G G+GCG G GIGL GG G G    GL      LGFG+G G G G G+G+GVG G + DD K R+
Subjt:  LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ-----LGFGLGAGCGVGLGFGYGVGRGIAQDD-KRRY

AT4G10330.1 glycine-rich protein (e value: 6.9e-40; % identity: 58.21)
Query:  NRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGH
        NR + G+D+ GLLW LP ++   IGK+GPAFGLGVGCG GFG GL+GG GFGPG+PGLQ G G GAGCG+G+GFGYGVGRG A D  R Y NVG      
Subjt:  NRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGH

Query:  QSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKW
              +++ +L+DE+V++TK+L++AT+ EIDKW
Subjt:  QSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKW

AT4G14301.1 unknown protein (e value: 6.1e-04; % identity: 55.17)
Query:  IGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRG
        IGK G  FG G+G G GFG G+  G GFG GI G   G G G G G G GFG G+G+G
Subjt:  IGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRG
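The % identity column can be related to the alignment midline (the unlabeled row between Query and Subjt). In BLAST pairwise output, a letter on the midline marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The sketch below is illustrative only and not part of the database: it counts midline letters for the AT4G14301.1 alignment and divides by the alignment length. The function name `percent_identity` is hypothetical, and the calculation assumes the midline is padded to the full alignment width, as it is here.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST midline.

    Letters mark identical residues; '+' (conservative substitution)
    and spaces (mismatch or gap) do not count as identities.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline of the AT4G14301.1 alignment, copied from the record above
# (leading indentation removed, internal spacing preserved).
MIDLINE = "IGK G  FG G+G G GFG G+  G GFG GI G   G G G G G G GFG G+G+G"

print(f"{percent_identity(MIDLINE):.2f}")  # 32 identities over 58 columns
```

For this hit the result should agree with the 55.17 listed in the % identity column.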


Sequences
CDS sequence
ATGAACGGAAACAGACACCAAGGCGGAGAAGATGAGAACGGATTACTCTGGAATCTTCCAGTTCTTAAATCTTCCAGAATCGGAAAGTTAGGTCCCGCCTTCGGTCTGGG
CGTTGGTTGTGGCGTCGGCTTTGGCATCGGCCTTGTCGGAGGTGCTGGATTTGGTCCAGGAATTCCTGGCTTACAACTTGGCTTTGGTCTTGGTGCTGGATGTGGAGTTG
GCTTAGGATTTGGATATGGTGTTGGCAGGGGCATTGCTCAAGATGACAAACGGAGATACTCTAACGTTGGAGATCTATTAAATGGTCATCAAAGTATTTTTCCTCAGGAC
GATATTGGCACGCTTGTTGACGAGGTTGTCCTAAATACAAAGAGGCTTATACGAGCTACTTCAAGGGAGATTGACAAGTGGAAAAGATGA
mRNA sequence
ATGAACGGAAACAGACACCAAGGCGGAGAAGATGAGAACGGATTACTCTGGAATCTTCCAGTTCTTAAATCTTCCAGAATCGGAAAGTTAGGTCCCGCCTTCGGTCTGGG
CGTTGGTTGTGGCGTCGGCTTTGGCATCGGCCTTGTCGGAGGTGCTGGATTTGGTCCAGGAATTCCTGGCTTACAACTTGGCTTTGGTCTTGGTGCTGGATGTGGAGTTG
GCTTAGGATTTGGATATGGTGTTGGCAGGGGCATTGCTCAAGATGACAAACGGAGATACTCTAACGTTGGAGATCTATTAAATGGTCATCAAAGTATTTTTCCTCAGGAC
GATATTGGCACGCTTGTTGACGAGGTTGTCCTAAATACAAAGAGGCTTATACGAGCTACTTCAAGGGAGATTGACAAGTGGAAAAGATGA
Protein sequence
MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFPQD
DIGTLVDEVVLNTKRLIRATSREIDKWKR
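
As a consistency check on the three sequences above, the CDS can be translated with the standard genetic code and compared with the listed protein. This is a minimal sketch rather than anything provided by the database: the names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard (nuclear) code is an assumption, although a reasonable one for a nuclear gene.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, keeping the stop codon as '*'."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))


# CDS and protein copied from the record above, line by line.
CDS = (
    "ATGAACGGAAACAGACACCAAGGCGGAGAAGATGAGAACGGATTACTCTGGAATCTTCCAGTTCTTAAATCTTCCAGAATCGGAAAGTTAGGTCCCGCCTTCGGTCTGGG"
    "CGTTGGTTGTGGCGTCGGCTTTGGCATCGGCCTTGTCGGAGGTGCTGGATTTGGTCCAGGAATTCCTGGCTTACAACTTGGCTTTGGTCTTGGTGCTGGATGTGGAGTTG"
    "GCTTAGGATTTGGATATGGTGTTGGCAGGGGCATTGCTCAAGATGACAAACGGAGATACTCTAACGTTGGAGATCTATTAAATGGTCATCAAAGTATTTTTCCTCAGGAC"
    "GATATTGGCACGCTTGTTGACGAGGTTGTCCTAAATACAAAGAGGCTTATACGAGCTACTTCAAGGGAGATTGACAAGTGGAAAAGATGA"
)
PROTEIN = (
    "MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFPQD"
    "DIGTLVDEVVLNTKRLIRATSREIDKWKR"
)

translated = translate(CDS)
# Compare after dropping the terminal stop codon from the translation.
print(translated.rstrip("*") == PROTEIN)
```

The CDS is 420 nt long, which corresponds to the 139 listed residues plus one stop codon. The mRNA sequence shown on this page is identical to the CDS, so the same check applies to it.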