; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001232 (gene) of Snake gourd v1 genome

Gene IDTan0001232
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionglycine-rich protein 23-like
Genome locationLG04:5021412..5022080
RNA-Seq ExpressionTan0001232
SyntenyTan0001232
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601160.1 hypothetical protein SDJN03_06393, partial [Cucurbita argyrosperma subsp. sororia]3.4e-3565.09Show/hide
Query:  MPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLD-GGGGGLSV
        MPAFCPPLSP E  PPFSFNA F  GAGGGDE+TSG GAGGEM   G                       GGGGL+VVF+GGGGGL V+LD GGGGGL V
Subjt:  MPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLD-GGGGGLSV

Query:  VLDIGGGGGPLVVLDGSGGGGLTVVFNG-GGGGLLVMLDG-GGGGLSVVLDIGGGGGPLVVLDGSGGGG
        VLD  GGGG  VVLDG+GGGGL VV +G GGGGL V+LDG GGGGL VVLD GGGG P VV +G GGGG
Subjt:  VLDIGGGGGPLVVLDGSGGGGLTVVFNG-GGGGLLVMLDG-GGGGLSVVLDIGGGGGPLVVLDGSGGGG

XP_022957272.1 loricrin-like [Cucurbita moschata]7.4e-3873.2Show/hide
Query:  MQINPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSAL-GGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFN
        MQ+NP NPNKPAP ITP AMPAFCPPLSP E  PPFSFNA F  GAGGGDE+TSG GAGGEM    GGGGL VVFDG GGGGL VVLDG GGGGL VV +
Subjt:  MQINPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSAL-GGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFN

Query:  G-GGGGLLVMLDG-GGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGG
        G GGGGL V+LDG GGGGL VVLD GGGG         GGG  +VVFNGGGGG
Subjt:  G-GGGGLLVMLDG-GGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGG

XP_022990509.1 glycine-rich cell wall structural protein 1.0-like [Cucurbita maxima]1.3e-3469.33Show/hide
Query:  INPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFD-GSGGGGLLVVLDGNGGGGLTVVFNGG
        +NP NPNKPAP ITPK MPAFCPPLSP E  PPFSFNA F VGAGGGDE TSG GAG      GGGG+ VVFD G GGGGL  VLDG GGGGL VV +G 
Subjt:  INPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFD-GSGGGGLLVVLDGNGGGGLTVVFNGG

Query:  GGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGL-TVVFNGGGGG
                 GGGGGL VVLD  GGGG  VVLDG GGGG  +VVFNGGGGG
Subjt:  GGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGL-TVVFNGGGGG

XP_023549817.1 glycine-rich cell wall structural protein-like [Cucurbita pepo subsp. pepo]4.5e-3569.93Show/hide
Query:  MQINPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFN-
        MQ+NP NPNKPAP ITP AMPAFCPPLSP E  PPFSFNA F  GAG GDE+TSG GAGGEM   G           GGGGL VV DG GGGG +VVFN 
Subjt:  MQINPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFN-

Query:  GGGGGLLVMLD-GGGGGLSVVLDIGGGGGPLVVLDGSGGGGL-TVVFNGGGGG
        GGGGGL V+LD  GGGGL VVLD  GGGG  VVLDG GGGG  +VVFNGGGGG
Subjt:  GGGGGLLVMLD-GGGGGLSVVLDIGGGGGPLVVLDGSGGGGL-TVVFNGGGGG

XP_031741791.1 glycine-rich protein 23-like [Cucumis sativus]3.5e-5667.25Show/hide
Query:  MQINPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSAL--GGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVF
        MQINP NPNKP P ITP A+PAFCPPL+P EPPPPFSFNA   VG GGGDE   GGGAGG+++    GGGG LVV D  GGGG+ V LDG+GGGG +VV 
Subjt:  MQINPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSAL--GGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVF

Query:  -NGGGGGLLVMLD-GGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFN-GGGGGLLVMLD-GGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFN-GG
          GGGGG  V LD GGGGG SV LD GGGGG  V LDG GGGG +V  + GGGGG+ V LD GGGGG SV LD GGGGG  V LDG GGGG +V  + GG
Subjt:  -NGGGGGLLVMLD-GGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFN-GGGGGLLVMLD-GGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFN-GG

Query:  GGGLLVMLDGGGGGGLSVVLDIGGGGDAS
        GGG  V LDGGGGGG SV LD GGGG AS
Subjt:  GGGLLVMLDGGGGGGLSVVLDIGGGGDAS

TrEMBL top hitse value%identityAlignment
A0A1S4DW63 loricrin-like9.1e-3467.11Show/hide
Query:  EMTSGGGAGGEMSAL-GGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFN-GGGGGLLVMLD-GGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGG
        E   GGG G E+  L GGGG+ VV DG GGGG+ VV DG GGGG++VVF+ GGGGG+ V+ D GGGGG+SVV D GGGGG LVVLDG GGGG++VVF+GG
Subjt:  EMTSGGGAGGEMSAL-GGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFN-GGGGGLLVMLD-GGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGG

Query:  GGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGG
        GGG+ V+ DGGGGG+SV+LD GGGGG  VVLDG GGGG++V  +GGGGG
Subjt:  GGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGG

A0A6I8T397 Cysteine-rich venom protein, putative1.7e-1647.78Show/hide
Query:  AGGGDEMTSGGGAGG---EMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTV
        A G    + GGG GG   +    GGGG++    G GGGG++V   G GGGG+ V   GGGG       GGGGG+ V    GGGGG +V   G GGGG+ V
Subjt:  AGGGDEMTSGGGAGG---EMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTV

Query:  VFNGGGG-GLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTV--VFNGGGGGLLVMLDGGGGGGLSVVLDIGGGG
          +GGGG G+++   GGGGG  VV   GGGGG +VV    GGGG  V     GGGGG++V   GGGGGG+ V    GGGG
Subjt:  VFNGGGG-GLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTV--VFNGGGGGLLVMLDGGGGGGLSVVLDIGGGG

A0A6I8U367 Uncharacterized protein1.8e-1848.37Show/hide
Query:  GAGGG--DEMTSGGGAGGEMSAL---GGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDI--GGGGGPLVVLDGSGG
        G GGG     + GGG GG +S     GGGG++V   G GGGG++V   G GGGG+    +GGGGG +V+   GGGG  VV+    GGGGG +    G GG
Subjt:  GAGGG--DEMTSGGGAGGEMSAL---GGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDI--GGGGGPLVVLDGSGG

Query:  GGLTVVFNGGGGGLLVMLDGGGGGLSVVLDI--GGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGGGLSVVLDIGGGG
        GG+ V  +GGGGG +V    GGGG  VV+    GGGGG +V   G GGGG+ V  +GGGGG +V+   GGGGG+ V    GGGG
Subjt:  GGLTVVFNGGGGGLLVMLDGGGGGLSVVLDI--GGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGGGLSVVLDIGGGG

A0A6J1GYP3 loricrin-like3.6e-3873.2Show/hide
Query:  MQINPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSAL-GGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFN
        MQ+NP NPNKPAP ITP AMPAFCPPLSP E  PPFSFNA F  GAGGGDE+TSG GAGGEM    GGGGL VVFDG GGGGL VVLDG GGGGL VV +
Subjt:  MQINPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSAL-GGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFN

Query:  G-GGGGLLVMLDG-GGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGG
        G GGGGL V+LDG GGGGL VVLD GGGG         GGG  +VVFNGGGGG
Subjt:  G-GGGGLLVMLDG-GGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGG

A0A6J1JS83 glycine-rich cell wall structural protein 1.0-like6.3e-3569.33Show/hide
Query:  INPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFD-GSGGGGLLVVLDGNGGGGLTVVFNGG
        +NP NPNKPAP ITPK MPAFCPPLSP E  PPFSFNA F VGAGGGDE TSG GAG      GGGG+ VVFD G GGGGL  VLDG GGGGL VV +G 
Subjt:  INPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFD-GSGGGGLLVVLDGNGGGGLTVVFNGG

Query:  GGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGL-TVVFNGGGGG
                 GGGGGL VVLD  GGGG  VVLDG GGGG  +VVFNGGGGG
Subjt:  GGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGL-TVVFNGGGGG

SwissProt top hitse value%identityAlignment
O48848 Glycine-rich protein 232.3e-0546.15Show/hide
Query:  FFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLT
        F  G GGG  +  GGG GG     GGGGL       GGGGL       GGGGL     GGG GL     GGGGGL     +GGGGG    L G GGGGL 
Subjt:  FFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLT

Query:  VVFNGGGGGLLVMLDGG-----GGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGGGLSVVLDIGGG
            GGGGGL     GG     GGGL     IGGGGG      G GGGG    F GG GG      GGGGG       +GGG
Subjt:  VVFNGGGGGLLVMLDGG-----GGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGGGLSVVLDIGGG

Arabidopsis top hitse value%identityAlignment
AT2G32690.1 glycine-rich protein 231.6e-0646.15Show/hide
Query:  FFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLT
        F  G GGG  +  GGG GG     GGGGL       GGGGL       GGGGL     GGG GL     GGGGGL     +GGGGG    L G GGGGL 
Subjt:  FFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLT

Query:  VVFNGGGGGLLVMLDGG-----GGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGGGLSVVLDIGGG
            GGGGGL     GG     GGGL     IGGGGG      G GGGG    F GG GG      GGGGG       +GGG
Subjt:  VVFNGGGGGLLVMLDGG-----GGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGGGLSVVLDIGGG

AT2G32690.2 glycine-rich protein 231.8e-0546.39Show/hide
Query:  FFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLT
        F  G GGG  +  GGG GG     GGGGL       GGGGL       GGGGL     GGG GL     GGGGGL     +GGGGG    L G  GGG  
Subjt:  FFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLT

Query:  VVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGG
          + GG GG L    GGGGG      IGGGGG      G GGGG    F GG GG      GGGGG
Subjt:  VVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGG

AT2G32690.3 glycine-rich protein 233.6e-0646.33Show/hide
Query:  FFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLT
        F  G GGG  +  GGG GG     GGGGL       GGGGL       GGGGL     GGG GL     GGGGGL     +GGGGG    L G GGGGL 
Subjt:  FFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLT

Query:  VVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGGGLSVVLDIGGG
            GGGGG      G GGGL     IGGGGG      G GGGG    F GG GG      GGGGG       +GGG
Subjt:  VVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGGGLSVVLDIGGG

AT2G32690.4 glycine-rich protein 234.4e-0445.1Show/hide
Query:  FFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLT
        F  G GGG  +  GGG GG     GGGGL       GGGGL       GGGGL     GGG GL     GGGGGL     +GGGGG    L G GGGGL 
Subjt:  FFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLT

Query:  VVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGG
             GGGG      GGGGG       GGG G        GGGGL   + GGG
Subjt:  VVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATCAATCCTATAAATCCCAACAAACCTGCTCCTACAATCACTCCAAAGGCAATGCCGGCCTTTTGTCCACCACTCAGTCCTTCTGAACCTCCTCCTCCCTTTTC
ATTCAATGCTAGATTCTTCGTCGGTGCCGGCGGCGGAGATGAAATGACGTCAGGTGGAGGTGCGGGGGGTGAGATGTCCGCATTGGGTGGCGGTGGACTGCTAGTTGTAT
TTGATGGTAGCGGCGGAGGTGGACTGTTAGTTGTGTTGGATGGCAATGGCGGCGGTGGACTGACAGTCGTATTCAACGGTGGCGGAGGTGGACTGTTAGTTATGTTGGAT
GGCGGCGGCGGTGGACTTTCAGTTGTGTTGGATATCGGAGGTGGAGGTGGACCATTAGTTGTGTTAGATGGCAGTGGCGGCGGTGGACTAACAGTCGTATTCAACGGTGG
TGGAGGTGGACTGTTAGTTATGTTGGATGGCGGCGGCGGTGGACTTTCAGTTGTGTTGGATATCGGAGGCGGAGGTGGACCGTTAGTTGTGTTGGATGGCAGTGGCGGCG
GTGGACTGACAGTCGTATTCAACGGTGGCGGAGGTGGACTATTAGTTATGTTGGATGGCGGCGGCGGCGGCGGACTTTCAGTTGTGTTGGATATCGGAGGTGGAGGTGAT
GCATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAATCAATCCTATAAATCCCAACAAACCTGCTCCTACAATCACTCCAAAGGCAATGCCGGCCTTTTGTCCACCACTCAGTCCTTCTGAACCTCCTCCTCCCTTTTC
ATTCAATGCTAGATTCTTCGTCGGTGCCGGCGGCGGAGATGAAATGACGTCAGGTGGAGGTGCGGGGGGTGAGATGTCCGCATTGGGTGGCGGTGGACTGCTAGTTGTAT
TTGATGGTAGCGGCGGAGGTGGACTGTTAGTTGTGTTGGATGGCAATGGCGGCGGTGGACTGACAGTCGTATTCAACGGTGGCGGAGGTGGACTGTTAGTTATGTTGGAT
GGCGGCGGCGGTGGACTTTCAGTTGTGTTGGATATCGGAGGTGGAGGTGGACCATTAGTTGTGTTAGATGGCAGTGGCGGCGGTGGACTAACAGTCGTATTCAACGGTGG
TGGAGGTGGACTGTTAGTTATGTTGGATGGCGGCGGCGGTGGACTTTCAGTTGTGTTGGATATCGGAGGCGGAGGTGGACCGTTAGTTGTGTTGGATGGCAGTGGCGGCG
GTGGACTGACAGTCGTATTCAACGGTGGCGGAGGTGGACTATTAGTTATGTTGGATGGCGGCGGCGGCGGCGGACTTTCAGTTGTGTTGGATATCGGAGGTGGAGGTGAT
GCATCTTGA
Protein sequenceShow/hide protein sequence
MQINPINPNKPAPTITPKAMPAFCPPLSPSEPPPPFSFNARFFVGAGGGDEMTSGGGAGGEMSALGGGGLLVVFDGSGGGGLLVVLDGNGGGGLTVVFNGGGGGLLVMLD
GGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGGLSVVLDIGGGGGPLVVLDGSGGGGLTVVFNGGGGGLLVMLDGGGGGGLSVVLDIGGGGD
AS