; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014777 (gene) of Snake gourd v1 genome

Gene IDTan0014777
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGlycine rich protein
Genome locationLG02:3648264..3649109
RNA-Seq ExpressionTan0014777
SyntenyTan0014777
Gene Ontology termsNA
InterPro domainsIPR010800 - Glycine rich protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KDO58488.1 hypothetical protein CISIN_1g032762mg [Citrus sinensis]7.8e-1752.76Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-------NNHEVATMTNGVEDAKY---------GWREYEGDYP--GRGYYGGYPGRGDYGDNYGWYGRGHC
        M SK F  L LL +IV+ I+ E AARDLA        N EVA  TNGV+DAKY         G R   G YP  GRG YGGYPGRG YG     +G G+C
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-------NNHEVATMTNGVEDAKY---------GWREYEGDYP--GRGYYGGYPGRGDYGDNYGWYGRGHC

Query:  YYGCC--GYRGRCSRCCSYAGEVADAK
         YGCC  GY GR  RCCSYAGE  +A+
Subjt:  YYGCC--GYRGRCSRCCSYAGEVADAK

XP_006447697.1 cold and drought-regulated protein CORA [Citrus clementina]2.7e-1753.54Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-------NNHEVATMTNGVEDAKY---------GWREYEGDYP--GRGYYGGYPGRGDYGDNYGWYGRGHC
        M SK F  L LL +IV+ I+ EVAARDLA        N EVA  TNGV+DAKY         G R   G YP  GRG YGGYPGRG YG     +G G+C
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-------NNHEVATMTNGVEDAKY---------GWREYEGDYP--GRGYYGGYPGRGDYGDNYGWYGRGHC

Query:  YYGCC--GYRGRCSRCCSYAGEVADAK
         YGCC  GY GR  RCCSYAGE  +A+
Subjt:  YYGCC--GYRGRCSRCCSYAGEVADAK

XP_022136254.1 glycine-rich protein 3 short isoform-like [Momordica charantia]4.6e-1748.41Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAK----------------YGWREYEGDYPGRG--YYGGYPGRGDYGDNYGWYGRGHC
        M+SK F  L LLFA+VV ++ +V ARDL    E    T+GVEDAK                 G   Y G YPG G  Y GGYPGRG YG      G  HC
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAK----------------YGWREYEGDYPGRG--YYGGYPGRGDYGDNYGWYGRGHC

Query:  YYGCCGYRGRCSRCCSYAGEVADAKP
        ++GCCGY G C RCCSYAGE  D +P
Subjt:  YYGCCGYRGRCSRCCSYAGEVADAKP

XP_022952571.1 glycine-rich protein-like isoform X2 [Cucurbita moschata]2.7e-1759.83Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-----NNHEVATMTNGVEDAKYGWREYEGDYPGRGYY---GGYPGRGDYGDNYGWYGRGHCYYGCCGYRGR
        MSSK F FL LLFA+V+ I+ EVAARDLA       +E    TNGVEDAKYG   Y+G Y GRG Y   GGY GRG YG   G YGRG C YG CGY   
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-----NNHEVATMTNGVEDAKYGWREYEGDYPGRGYY---GGYPGRGDYGDNYGWYGRGHCYYGCCGYRGR

Query:  CSRCCSYAGEVAD-AKP
          RCCSYAGEV + AKP
Subjt:  CSRCCSYAGEVAD-AKP

XP_030928210.1 cold and drought-regulated protein CORA-like [Quercus lobata]3.5e-1750Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA------NNHEVATMTNGVEDAKYGWREYEGDYPGRGY-------YGGYPGRGDYGDN-------YGWYGRG
        M SK F  L L+ AIV+ I+ +VAAR+LA       N EV+T TN V+DAKYG  E  G   G GY       YGG PG+G YG N        G +G G
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA------NNHEVATMTNGVEDAKYGWREYEGDYPGRGY-------YGGYPGRGDYGDN-------YGWYGRG

Query:  HCYYGCC--GYRGR-CSRCCSYAGEVADAK
        HCYYGCC  GY G  C RCCSYAGE  D +
Subjt:  HCYYGCC--GYRGR-CSRCCSYAGEVADAK

TrEMBL top hitse value%identityAlignment
A0A2H5PLF8 Uncharacterized protein1.3e-1753.54Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-------NNHEVATMTNGVEDAKY---------GWREYEGDYP--GRGYYGGYPGRGDYGDNYGWYGRGHC
        M SK F  L LL +IV+ I+ EVAARDLA        N EVA  TNGV+DAKY         G R   G YP  GRG YGGYPGRG YG     +G G+C
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-------NNHEVATMTNGVEDAKY---------GWREYEGDYP--GRGYYGGYPGRGDYGDNYGWYGRGHC

Query:  YYGCC--GYRGRCSRCCSYAGEVADAK
         YGCC  GY GR  RCCSYAGE  +A+
Subjt:  YYGCC--GYRGRCSRCCSYAGEVADAK

A0A6J1C3T3 glycine-rich protein 3 short isoform-like2.2e-1748.41Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAK----------------YGWREYEGDYPGRG--YYGGYPGRGDYGDNYGWYGRGHC
        M+SK F  L LLFA+VV ++ +V ARDL    E    T+GVEDAK                 G   Y G YPG G  Y GGYPGRG YG      G  HC
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAK----------------YGWREYEGDYPGRG--YYGGYPGRGDYGDNYGWYGRGHC

Query:  YYGCCGYRGRCSRCCSYAGEVADAKP
        ++GCCGY G C RCCSYAGE  D +P
Subjt:  YYGCCGYRGRCSRCCSYAGEVADAKP

A0A6J1GKL6 glycine-rich protein-like isoform X21.3e-1759.83Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-----NNHEVATMTNGVEDAKYGWREYEGDYPGRGYY---GGYPGRGDYGDNYGWYGRGHCYYGCCGYRGR
        MSSK F FL LLFA+V+ I+ EVAARDLA       +E    TNGVEDAKYG   Y+G Y GRG Y   GGY GRG YG   G YGRG C YG CGY   
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-----NNHEVATMTNGVEDAKYGWREYEGDYPGRGYY---GGYPGRGDYGDNYGWYGRGHCYYGCCGYRGR

Query:  CSRCCSYAGEVAD-AKP
          RCCSYAGEV + AKP
Subjt:  CSRCCSYAGEVAD-AKP

A0A7N2MAF5 Uncharacterized protein1.7e-1750Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA------NNHEVATMTNGVEDAKYGWREYEGDYPGRGY-------YGGYPGRGDYGDN-------YGWYGRG
        M SK F  L L+ AIV+ I+ +VAAR+LA       N EV+T TN V+DAKYG  E  G   G GY       YGG PG+G YG N        G +G G
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA------NNHEVATMTNGVEDAKYGWREYEGDYPGRGY-------YGGYPGRGDYGDN-------YGWYGRG

Query:  HCYYGCC--GYRGR-CSRCCSYAGEVADAK
        HCYYGCC  GY G  C RCCSYAGE  D +
Subjt:  HCYYGCC--GYRGR-CSRCCSYAGEVADAK

V4U4R7 Uncharacterized protein1.3e-1753.54Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-------NNHEVATMTNGVEDAKY---------GWREYEGDYP--GRGYYGGYPGRGDYGDNYGWYGRGHC
        M SK F  L LL +IV+ I+ EVAARDLA        N EVA  TNGV+DAKY         G R   G YP  GRG YGGYPGRG YG     +G G+C
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLA-------NNHEVATMTNGVEDAKY---------GWREYEGDYP--GRGYYGGYPGRGDYGDNYGWYGRGHC

Query:  YYGCC--GYRGRCSRCCSYAGEVADAK
         YGCC  GY GR  RCCSYAGE  +A+
Subjt:  YYGCC--GYRGRCSRCCSYAGEVADAK

SwissProt top hitse value%identityAlignment
P23137 Glycine-rich protein1.3e-0945.45Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATM--TNGVE-DAKYGWREYEGDYPGRGYYGGYPGRGDYGDNYGWYGRGHCYYGCC--GYRGRCSR
        M SK F FL L  A    I+ EV A +LA       +   NGV+ D + G+ +  GD    GYYGG  GRG      G Y R  C YGCC  GY G C R
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATM--TNGVE-DAKYGWREYEGDYPGRGYYGGYPGRGDYGDNYGWYGRGHCYYGCC--GYRGRCSR

Query:  CCSYAGEVAD
        CCSYAGE  D
Subjt:  CCSYAGEVAD

Arabidopsis top hitse value%identityAlignment
AT2G05520.5 glycine-rich protein 31.6e-0436.84Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAKYGWREYEGDY-PGRGYYGGYPGR---GDYGDNYGWYGRGHCYYGCC--GYRGRCS
        M+SK    L  LFA+++ ++   AA     N E       V+  + G+ +  G+Y  G G Y G  GR   G      G  G  +C +GCC  GY G CS
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAKYGWREYEGDY-PGRGYYGGYPGR---GDYGDNYGWYGRGHCYYGCC--GYRGRCS

Query:  RCCSYAGEVADAKP
        RCCSYAGE    +P
Subjt:  RCCSYAGEVADAKP

AT2G05520.6 glycine-rich protein 31.2e-0437.5Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAKYGWREYEGDY-PGRGYYGG---YPGRGDY----GDNYGWYGRG--HCYYGCC--G
        M+SK    L  LFA+++ ++   AA     N E       V+  + G+ +  G+Y  G GY GG   Y G G      G   G  G G  +C +GCC  G
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAKYGWREYEGDY-PGRGYYGG---YPGRGDY----GDNYGWYGRG--HCYYGCC--G

Query:  YRGRCSRCCSYAGEVADAKP
        Y G CSRCCSYAGE    +P
Subjt:  YRGRCSRCCSYAGEVADAKP

AT2G05530.1 Glycine-rich protein family3.9e-0638.33Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAKYGWREYEGDYPGRGYY--------GGYPGRGDYGDNYGWYG--RGHCYYGCC--G
        M+SK    L  LFA+++ +    AA     +    T+     D   G     G Y G G Y        GGY G G Y  N G +G   G+C YGCC  G
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAKYGWREYEGDYPGRGYY--------GGYPGRGDYGDNYGWYG--RGHCYYGCC--G

Query:  YRGRCSRCCSYAGEVADAKP
        Y G CSRCCSYAGE    +P
Subjt:  YRGRCSRCCSYAGEVADAKP

AT2G05540.1 Glycine-rich protein family2.0e-1039.85Show/hide
Query:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAKYGWREYEGDYPGRGY-------YGGYPG-------------RGDYGDNYGWYGR-
        M+SK   FL L+  +V+ IA EV ARDLA   + A   N   D +    E  G +PG GY       YGG PG              G YG+  G YG  
Subjt:  MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAKYGWREYEGDYPGRGY-------YGGYPG-------------RGDYGDNYGWYGR-

Query:  --GHCYYGCC--GYRGRCSRCCSYAGEVADAKP
          G+C YGCC  GY G C RCC+YAG+    +P
Subjt:  --GHCYYGCC--GYRGRCSRCCSYAGEVADAKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCCAAAGGTTTTGCATTCCTCTGTCTTCTCTTCGCCATTGTCGTCTTCATCGCACCGGAGGTGGCGGCGAGAGACCTTGCAAACAACCATGAGGTCGCAACGAT
GACGAATGGCGTCGAGGATGCGAAGTATGGTTGGAGAGAATACGAGGGAGACTACCCCGGACGTGGCTACTATGGTGGCTACCCCGGACGTGGTGACTACGGTGATAACT
ACGGTTGGTACGGTAGAGGTCACTGCTATTACGGTTGCTGCGGTTACCGCGGCCGGTGCTCCAGGTGCTGCAGCTATGCCGGAGAAGTTGCAGATGCAAAACCCTAA
mRNA sequenceShow/hide mRNA sequence
CATCTCCTTGCCATCTCCAAAACCTAGTTTCTTAAGAAAATAAAATGAGTTCCAAAGGTTTTGCATTCCTCTGTCTTCTCTTCGCCATTGTCGTCTTCATCGCACCGGAG
GTGGCGGCGAGAGACCTTGCAAACAACCATGAGGTCGCAACGATGACGAATGGCGTCGAGGATGCGAAGTATGGTTGGAGAGAATACGAGGGAGACTACCCCGGACGTGG
CTACTATGGTGGCTACCCCGGACGTGGTGACTACGGTGATAACTACGGTTGGTACGGTAGAGGTCACTGCTATTACGGTTGCTGCGGTTACCGCGGCCGGTGCTCCAGGT
GCTGCAGCTATGCCGGAGAAGTTGCAGATGCAAAACCCTAAGGCAGCTTATGGGGGAGGGCAAGAAAGGATTGGCAGTTTAAAGTTTAGGTTAATTAATTAAGCAAAAAT
ATAAACTAAGTAAGAAATAAATATGAGGGAATGGAAGAATGGCTCAGAGTTAGGTTTTCCTCTCCCTTGTGTTTGTGTTTGGGTGTTGTTTTAGATTTCCCTTTTGCAAA
TTATAAATATAAACTAAGGTGAAATGAAAGCTTAATGTTTACATGTATAACAAAATTATAATAAATTATCATTAAATGTA
Protein sequenceShow/hide protein sequence
MSSKGFAFLCLLFAIVVFIAPEVAARDLANNHEVATMTNGVEDAKYGWREYEGDYPGRGYYGGYPGRGDYGDNYGWYGRGHCYYGCCGYRGRCSRCCSYAGEVADAKP