; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr013375 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr013375
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionkeratin, type II cytoskeletal 3 isoform X2
Genome locationtig00153836:395803..398009
RNA-Seq ExpressionSgr013375
SyntenySgr013375
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022974025.1 keratin, type II cytoskeletal 3 isoform X1 [Cucurbita maxima]8.9e-4887.39Show/hide
Query:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI
        MPR MNG+R+EGGEDE+G+LWKLPVLKS+R+G+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ+GFGLGAGCGVGLGFGYG GRGIAQDDKRRYSN+
Subjt:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI

Query:  GDLSHGHQSIF
        GDL +G QSIF
Subjt:  GDLSHGHQSIF

XP_022974026.1 keratin, type II cytoskeletal 3 isoform X2 [Cucurbita maxima]8.9e-4887.39Show/hide
Query:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI
        MPR MNG+R+EGGEDE+G+LWKLPVLKS+R+G+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ+GFGLGAGCGVGLGFGYG GRGIAQDDKRRYSN+
Subjt:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI

Query:  GDLSHGHQSIF
        GDL +G QSIF
Subjt:  GDLSHGHQSIF

XP_023521209.1 fibroin heavy chain isoform X2 [Cucurbita pepo subsp. pepo]1.2e-4787.39Show/hide
Query:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI
        MPR MNG+R EG EDE+G+LWKLPVLKS+R+G+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ+GFGLGAGCGVGLGFGYG GRGIAQDDKRRYSN+
Subjt:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI

Query:  GDLSHGHQSIF
        GDL +GHQSIF
Subjt:  GDLSHGHQSIF

XP_038889093.1 ctenidin-1 isoform X1 [Benincasa hispida]4.7e-4990.09Show/hide
Query:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI
        MPR  NGNR +GGEDE GLLW LPVLKS+R GKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ+GFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSN+
Subjt:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI

Query:  GDLSHGHQSIF
        GDL HGHQSIF
Subjt:  GDLSHGHQSIF

XP_038889094.1 glycine-rich cell wall structural protein 1 isoform X2 [Benincasa hispida]4.7e-4990.09Show/hide
Query:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI
        MPR  NGNR +GGEDE GLLW LPVLKS+R GKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ+GFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSN+
Subjt:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI

Query:  GDLSHGHQSIF
        GDL HGHQSIF
Subjt:  GDLSHGHQSIF

TrEMBL top hitse value%identityAlignment
A0A6J1DP59 glycine-rich cell wall structural protein 11.3e-4790.91Show/hide
Query:  MNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNIGDLS
        MN NR++GGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQ GFGLGAGCGVGLGFGYGVGRGIAQDD+RRYSN+GDL 
Subjt:  MNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNIGDLS

Query:  H--GHQSIFS
        H  GHQSIFS
Subjt:  H--GHQSIFS

A0A6J1EY31 keratin, type II cytoskeletal 2 epidermal isoform X15.7e-4887.39Show/hide
Query:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI
        MPR MNG+R EGGEDE+G+LWKLPVLKS+R+G+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ+GFGLGAGCGVGLGFGYG GRGIAQDDKRRYSN+
Subjt:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI

Query:  GDLSHGHQSIF
        GDL +G QSIF
Subjt:  GDLSHGHQSIF

A0A6J1F360 keratin, type II cytoskeletal 2 epidermal isoform X25.7e-4887.39Show/hide
Query:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI
        MPR MNG+R EGGEDE+G+LWKLPVLKS+R+G+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ+GFGLGAGCGVGLGFGYG GRGIAQDDKRRYSN+
Subjt:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI

Query:  GDLSHGHQSIF
        GDL +G QSIF
Subjt:  GDLSHGHQSIF

A0A6J1IA95 keratin, type II cytoskeletal 3 isoform X24.3e-4887.39Show/hide
Query:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI
        MPR MNG+R+EGGEDE+G+LWKLPVLKS+R+G+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ+GFGLGAGCGVGLGFGYG GRGIAQDDKRRYSN+
Subjt:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI

Query:  GDLSHGHQSIF
        GDL +G QSIF
Subjt:  GDLSHGHQSIF

A0A6J1ICW2 keratin, type II cytoskeletal 3 isoform X14.3e-4887.39Show/hide
Query:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI
        MPR MNG+R+EGGEDE+G+LWKLPVLKS+R+G+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ+GFGLGAGCGVGLGFGYG GRGIAQDDKRRYSN+
Subjt:  MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNI

Query:  GDLSHGHQSIF
        GDL +G QSIF
Subjt:  GDLSHGHQSIF

SwissProt top hitse value%identityAlignment
Q9JL61 DNA-binding protein Rfx53.7e-0450.67Show/hide
Query:  NGLLWKLPVLKSARLGK---LGPAFGLGVGCGVGFGIGLVGGAGFGPGI-PGLQVGFGLGAGCGVGLGFGYGVGR
        N +L  +P L  A  G    LGP FG G G G G G GL  GAG GPG+ PGL  G G G G G+G G G G GR
Subjt:  NGLLWKLPVLKSARLGK---LGPAFGLGVGCGVGFGIGLVGGAGFGPGI-PGLQVGFGLGAGCGVGLGFGYGVGR

Arabidopsis top hitse value%identityAlignment
AT1G66820.1 glycine-rich protein7.4e-0852.17Show/hide
Query:  LGPAFGLGVGCGVGFGIGLVGGAGFG--PGIPGLQVGFGLGAGCGVGLGFGY--GVGRGIAQDD-KRRY
        +GP  G G+GCG G GIGL GG G G   G+    V  G G GCG+G GFGY  GVG G + DD K R+
Subjt:  LGPAFGLGVGCGVGFGIGLVGGAGFG--PGIPGLQVGFGLGAGCGVGLGFGY--GVGRGIAQDD-KRRY

AT4G10330.1 glycine-rich protein5.6e-3267.02Show/hide
Query:  NRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNIG
        NR+  G+D+ GLLWKLP ++   +GK+GPAFGLGVGCG GFG GL+GG GFGPG+PGLQ G G GAGCG+G+GFGYGVGRG A D  R Y N+G
Subjt:  NRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNIG

AT4G14301.1 unknown protein8.4e-0453.45Show/hide
Query:  LGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRG
        +GK G  FG G+G G GFG G+  G GFG GI G   G G G G G G GFG G+G+G
Subjt:  LGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACGAAAAATGAACGGAAACAGACAGGAGGGCGGAGAAGATGAGAACGGATTGCTATGGAAGCTTCCAGTTCTGAAATCTGCCCGACTCGGAAAGTTGGGTCCCGC
CTTCGGTTTGGGCGTCGGTTGCGGCGTCGGCTTTGGCATCGGCCTCGTCGGAGGTGCTGGATTTGGTCCGGGAATTCCTGGCTTACAAGTTGGCTTTGGTCTTGGTGCTG
GATGTGGCGTTGGCTTAGGATTTGGCTATGGTGTTGGTAGGGGCATTGCCCAAGATGACAAACGGAGATACTCTAACATTGGGGATCTATCACATGGTCATCAAAGTATT
TTTTCTCA
mRNA sequenceShow/hide mRNA sequence
ATGCCACGAAAAATGAACGGAAACAGACAGGAGGGCGGAGAAGATGAGAACGGATTGCTATGGAAGCTTCCAGTTCTGAAATCTGCCCGACTCGGAAAGTTGGGTCCCGC
CTTCGGTTTGGGCGTCGGTTGCGGCGTCGGCTTTGGCATCGGCCTCGTCGGAGGTGCTGGATTTGGTCCGGGAATTCCTGGCTTACAAGTTGGCTTTGGTCTTGGTGCTG
GATGTGGCGTTGGCTTAGGATTTGGCTATGGTGTTGGTAGGGGCATTGCCCAAGATGACAAACGGAGATACTCTAACATTGGGGATCTATCACATGGTCATCAAAGTATT
TTTTCTCA
Protein sequenceShow/hide protein sequence
MPRKMNGNRQEGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQVGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNIGDLSHGHQSI
FSX