CuGenDBv2

MS028294 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS028294
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: glycine-rich cell wall structural protein-like
Genome location: scaffold47:2597170..2597915
RNA-Seq Expression: MS028294
Synteny: MS028294
Gene Ontology terms: NA
InterPro domains: IPR010800 - Glycine rich protein


Homology
GenBank top hits (e value, % identity, alignment)
KAG6572253.1 Glycine-rich protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.9e-12; % identity: 55.97)
Query:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR
        MSSKAF FLGLLFA+V++I S   A+ LA TS  ++ NE TAETN  VEDAK+   G + GG+G    G GGYG    RGGYGG G YG   GRGGY GR
Subjt:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR

Query:  GGYDGFRPGGGLCGQNGYCCGFR-GGCD-RCCRY
        GGY G    GG  G  GY  G R G C  RCC Y
Subjt:  GGYDGFRPGGGLCGQNGYCCGFR-GGCD-RCCRY

KAG7011884.1 Glycine-rich protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.9e-12; % identity: 55.97)
Query:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR
        MSSKAF FLGLLFA+V++I S   A+ LA TS  ++ NE TAETN  VEDAK+   G + GG+G    G GGYG    RGGYGG G YG   GRGGY GR
Subjt:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR

Query:  GGYDGFRPGGGLCGQNGYCCGFR-GGCD-RCCRY
        GGY G    GG  G  GY  G R G C  RCC Y
Subjt:  GGYDGFRPGGGLCGQNGYCCGFR-GGCD-RCCRY

KAG8369563.1 hypothetical protein BUALT_Bualt14G0026400 [Buddleja alternifolia] (e value: 8.2e-14; % identity: 50.35)
Query:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGY--GGYPRRGGYGGFGDYGDYPGRGGYP
        M SKA  FLGL  A V++I S   A+ LA TS   D +E   ETN AV DAK+  GG++GGG    YPG GG   GGYP RGGYG           GGYP
Subjt:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGY--GGYPRRGGYGGFGDYGDYPGRGGYP

Query:  GRGGYDGFRPGGGLCGQNGY----CCG---FRGGCDRCCRY
        GRGGY G  PG G  G+ GY    CCG   +RG C RCC +
Subjt:  GRGGYDGFRPGGGLCGQNGY----CCG---FRGGCDRCCRY

XP_022136252.1 glycine-rich cell wall structural protein-like [Momordica charantia] (e value: 6.0e-73; % identity: 99.3)
Query:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR
        MSSKAFAFLGLLFAIVVVICSTAVA SLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR
Subjt:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR

Query:  GGYDGFRPGGGLCGQNGYCCGFRGGCDRCCRYYPGGGLAEARP
        GGYDGFRPGGGLCGQNGYCCGFRGGCDRCCRYYPGGGLAEARP
Subjt:  GGYDGFRPGGGLCGQNGYCCGFRGGCDRCCRYYPGGGLAEARP

XP_027330054.1 glycine-rich cell wall structural protein-like [Abrus precatorius] (e value: 4.1e-13; % identity: 51.09)
Query:  SSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGRG
        S  A   LGLL A+++VI S   A+ LA TS + D  E   ET   V DAK+      GGG+G  YPG GG GGYP RGGYG  G Y  + G GGYPGRG
Subjt:  SSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGRG

Query:  GYDGFRPGGGLCGQNGY----CCG--FRGGCDRCCRY
        GY G  PG G  G  GY    CCG  + GGC RCC Y
Subjt:  GYDGFRPGGGLCGQNGY----CCG--FRGGCDRCCRY

TrEMBL top hits (e value, % identity, alignment)
A0A1R3KSB3 Glycine rich protein (e value: 1.2e-10; % identity: 50.75)
Query:  MSSK-AFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNN-EVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYP
        MS+K +F F  LL A+V++I S   A+ LA TS  E NN EV  ET   VEDAK+   G+ GG  G  Y G GG GGY  RGGYGG+G  G Y GRGGY 
Subjt:  MSSK-AFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNN-EVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYP

Query:  GRGGYDGFRPGGGLCGQNGYCCGFRGGCDRCCRY
        GRGGY G    G  C ++ Y   +  GC RCC Y
Subjt:  GRGGYDGFRPGGGLCGQNGYCCGFRGGCDRCCRY

A0A1R3KSE4 Glycine rich protein (e value: 9.1e-11; % identity: 48.92)
Query:  MSSK-AFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGF---GDYGDYPGRGG
        MSSK +F    LL A+V++I S   AK LA T+   +N EV  E+   VEDAK+   G+ GG  G  Y G GG GGY  RGGYGG+   G YG Y GRGG
Subjt:  MSSK-AFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGF---GDYGDYPGRGG

Query:  YPGRGGYDGFRPGGGLCGQNGYCCG---FRGGCDRCCRY
        Y GRGGY G    GG C     CC    +  GC RCC Y
Subjt:  YPGRGGYDGFRPGGGLCGQNGYCCG---FRGGCDRCCRY

A0A6J1C524 glycine-rich cell wall structural protein-like (e value: 2.9e-73; % identity: 99.3)
Query:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR
        MSSKAFAFLGLLFAIVVVICSTAVA SLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR
Subjt:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR

Query:  GGYDGFRPGGGLCGQNGYCCGFRGGCDRCCRYYPGGGLAEARP
        GGYDGFRPGGGLCGQNGYCCGFRGGCDRCCRYYPGGGLAEARP
Subjt:  GGYDGFRPGGGLCGQNGYCCGFRGGCDRCCRYYPGGGLAEARP

A0A6J1GM55 cold and drought-regulated protein CORA-like isoform X1 (e value: 2.4e-11; % identity: 55.12)
Query:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR
        MSSKAF FLGLLFA+V++I S   A+ LA TS  ++ NE TAETN  VEDAK+   G + GG+G    G GGYG    RGGYGG G YG   GRGGY GR
Subjt:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR

Query:  GGYDGFRPGGGLC--GQNGY-CCGFRG
        GGY G    G  C  G+ GY CC + G
Subjt:  GGYDGFRPGGGLC--GQNGY-CCGFRG

A0A7J8NKC3 Uncharacterized protein (e value: 1.2e-10; % identity: 44.29)
Query:  SKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRA---VEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPG
        SK+F  L LL A V++I S   A+ LA T+ +++N EV  ET  A   VE+AK+ +GG         Y   GGYGGY  RGGYGG+G  G Y   GGY G
Subjt:  SKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRA---VEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPG

Query:  RGGYDGFRPGGGLCGQNGYCC-------GFRGGCDRCCRY
        RGGY G   G G  G  G C         +  GC RCC Y
Subjt:  RGGYDGFRPGGGLCGQNGYCC-------GFRGGCDRCCRY

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G05530.1 Glycine-rich protein family (e value: 8.2e-04; % identity: 35.04)
Query:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR
        M+SKA    G LFA+++V+   A A           +  V +E+   V+  +++              G GG GGY   GGY G G +      GGY G 
Subjt:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGR

Query:  GGYDGFRPGGGLCGQNGYC---CGFRG--GCDRCCRY
        GGY+    GGG  G++GYC   C +RG  GC RCC Y
Subjt:  GGYDGFRPGGGLCGQNGYC---CGFRG--GCDRCCRY

AT2G05540.1 Glycine-rich protein family (e value: 7.2e-08; % identity: 46.67)
Query:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGR-GGYPG
        M+SKA  FL L+  +V++I S  VA+ LA  S ++ NNE      R        +GG  GGG+G  +PG GGYGG P  GGYG  G  G Y  R GGY  
Subjt:  MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGR-GGYPG

Query:  RGGYDGFRPGGGLCGQNGYCC--GFRGGCDRCCRY
        RGG  G R GGG C     CC  G+ GGC RCC Y
Subjt:  RGGYDGFRPGGGLCGQNGYCC--GFRGGCDRCCRY


Sequences
CDS sequence
ATGAGTTCCAAGGCTTTTGCTTTCCTCGGTCTTCTCTTCGCCATTGTCGTCGTCATCTGCTCGACAGCGGTGGCAAAAAGCCTCGCACCGACCTCCATCGACGAGGACAA
CAATGAGGTCACAGCCGAGACCAATAGGGCAGTAGAGGATGCCAAGTTTAGCTGGGGAGGATCATTTGGGGGAGGATTCGGTAGAGACTACCCCGGACCGGGTGGCTACG
GTGGCTACCCCCGACGGGGGGGCTATGGTGGCTTTGGTGATTACGGTGACTACCCTGGGCGCGGTGGCTACCCTGGGCGTGGTGGCTATGACGGATTTAGACCTGGGGGA
GGTCTTTGCGGCCAGAACGGTTATTGCTGCGGTTTCCGTGGCGGGTGCGACCGGTGCTGCAGGTACTACCCCGGTGGAGGGTTAGCAGAGGCAAGACCCTAA
mRNA sequence
ATGAGTTCCAAGGCTTTTGCTTTCCTCGGTCTTCTCTTCGCCATTGTCGTCGTCATCTGCTCGACAGCGGTGGCAAAAAGCCTCGCACCGACCTCCATCGACGAGGACAA
CAATGAGGTCACAGCCGAGACCAATAGGGCAGTAGAGGATGCCAAGTTTAGCTGGGGAGGATCATTTGGGGGAGGATTCGGTAGAGACTACCCCGGACCGGGTGGCTACG
GTGGCTACCCCCGACGGGGGGGCTATGGTGGCTTTGGTGATTACGGTGACTACCCTGGGCGCGGTGGCTACCCTGGGCGTGGTGGCTATGACGGATTTAGACCTGGGGGA
GGTCTTTGCGGCCAGAACGGTTATTGCTGCGGTTTCCGTGGCGGGTGCGACCGGTGCTGCAGGTACTACCCCGGTGGAGGGTTAGCAGAGGCAAGACCCTAA
Protein sequence
MSSKAFAFLGLLFAIVVVICSTAVAKSLAPTSIDEDNNEVTAETNRAVEDAKFSWGGSFGGGFGRDYPGPGGYGGYPRRGGYGGFGDYGDYPGRGGYPGRGGYDGFRPGG
GLCGQNGYCCGFRGGCDRCCRYYPGGGLAEARP
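
The protein sequence listed above should correspond to the CDS read codon by codon. The sketch below is one way to check that relationship, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
# Assumption: this table is appropriate for the gene shown in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Drop line breaks and whitespace so a wrapped sequence can be pasted directly.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS shown in this record.
print(translate("ATGAGTTCCAAGGCTTTTGCT"))  # MSSKAFA
# Last four codons of the CDS, including the TAA stop.
print(translate("GCAAGACCCTAA"))  # ARP
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check on a downloaded record.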