; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036523 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036523
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGlycine-rich protein family
Genome locationchr3:47838699..47842781
RNA-Seq ExpressionLag0036523
SyntenyLag0036523
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050947.1 uncharacterized protein E6C27_scaffold761G00740 [Cucumis melo var. makuwa]3.7e-4878.51Show/hide
Query:  MASPIGFLLILLALAASLRYPSVCA-EEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNK
        MASPI FL ++LAL  SLR PSV   +EI GQ+V E+FKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNL+CNGPCL+ETQL+LNCLD++F NFLFYNK
Subjt:  MASPIGFLLILLALAASLRYPSVCA-EEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNK

Query:  ATSQNVRNALRVGCSYTSQRG
        AT+Q+VRNALR+GCS++SQRG
Subjt:  ATSQNVRNALRVGCSYTSQRG

XP_008450550.1 PREDICTED: uncharacterized protein LOC103492117 [Cucumis melo]3.9e-5379.23Show/hide
Query:  MASPIGFLLILLALAASLRYPSVCA-EEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNK
        MASPI FL ++LAL  SLR PSV   +EI GQ+V E+FKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNL+CNGPCL+ETQL+LNCLD++F NFLFYNK
Subjt:  MASPIGFLLILLALAASLRYPSVCA-EEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNK

Query:  ATSQNVRNALRVGCSYTSQRGNFNPGLFMQ
        AT+Q+VRNALR+GCS++SQRGNFN GLFMQ
Subjt:  ATSQNVRNALRVGCSYTSQRGNFNPGLFMQ

XP_022158829.1 uncharacterized protein LOC111025289 [Momordica charantia]1.1e-5584.5Show/hide
Query:  MASPIGFLLILLALAASLRYPSVCAEEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKA
        MA P  FL ILLALA SL  PSVCAEEIPGQVVTEA +CFDNKFIYNGC+S YRLNPSGSFNVPPEATNL+C+GPCLVET+LVL+CLDD FGNFLFYNKA
Subjt:  MASPIGFLLILLALAASLRYPSVCAEEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKA

Query:  TSQNVRNALRVGCSYTSQRGNFNPGLFMQ
         +QNVRNALR GCSY+SQRGNFNPGLFMQ
Subjt:  TSQNVRNALRVGCSYTSQRGNFNPGLFMQ

XP_022989485.1 uncharacterized protein LOC111486529 [Cucurbita maxima]3.9e-4569.77Show/hide
Query:  MASPIGFLLILLALAASLRYPSVCAEEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKA
        MASP+  LL+L   A +L   SVC EE  GQ VT+ F+CFDN  IYNGCESAYRLNPSG+ NVP +ATNL+CNGPCL+ETQL+LNCLD  F NFLFYNKA
Subjt:  MASPIGFLLILLALAASLRYPSVCAEEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKA

Query:  TSQNVRNALRVGCSYTSQRGNFNPGLFMQ
        T   V+NALR GCSY++QRGNFNPG FMQ
Subjt:  TSQNVRNALRVGCSYTSQRGNFNPGLFMQ

XP_031736077.1 uncharacterized protein LOC101204762 [Cucumis sativus]3.5e-5481.68Show/hide
Query:  MASPIGFLLILLALAASLRYPSVCA--EEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYN
        MASPI FL I LALA  LR PSV +  +EIPGQ+V E FKCFDNKFIYNGCE AYRLNPSGSFNVPPEATNL+CNGPCL+ETQL+LNCLD+TF NFLFYN
Subjt:  MASPIGFLLILLALAASLRYPSVCA--EEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYN

Query:  KATSQNVRNALRVGCSYTSQRGNFNPGLFMQ
        KAT+Q+VRNALRVGCSY+SQRGNFN GLFMQ
Subjt:  KATSQNVRNALRVGCSYTSQRGNFNPGLFMQ

TrEMBL top hitse value%identityAlignment
A0A0A0LYD5 Uncharacterized protein1.2e-5278.26Show/hide
Query:  MASPIGFLLILLALAASLRYPS----VCAE-----EIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTF
        MASPI FL I LALA  LR PS    V  E     EIPGQ+V E FKCFDNKFIYNGCE AYRLNPSGSFNVPPEATNL+CNGPCL+ETQL+LNCLD+TF
Subjt:  MASPIGFLLILLALAASLRYPS----VCAE-----EIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTF

Query:  GNFLFYNKATSQNVRNALRVGCSYTSQRGNFNPGLFMQ
         NFLFYNKAT+Q+VRNALRVGCSY+SQRGNFN GLFMQ
Subjt:  GNFLFYNKATSQNVRNALRVGCSYTSQRGNFNPGLFMQ

A0A1S3BPH6 uncharacterized protein LOC1034921171.9e-5379.23Show/hide
Query:  MASPIGFLLILLALAASLRYPSVCA-EEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNK
        MASPI FL ++LAL  SLR PSV   +EI GQ+V E+FKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNL+CNGPCL+ETQL+LNCLD++F NFLFYNK
Subjt:  MASPIGFLLILLALAASLRYPSVCA-EEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNK

Query:  ATSQNVRNALRVGCSYTSQRGNFNPGLFMQ
        AT+Q+VRNALR+GCS++SQRGNFN GLFMQ
Subjt:  ATSQNVRNALRVGCSYTSQRGNFNPGLFMQ

A0A5A7U972 Uncharacterized protein1.8e-4878.51Show/hide
Query:  MASPIGFLLILLALAASLRYPSVCA-EEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNK
        MASPI FL ++LAL  SLR PSV   +EI GQ+V E+FKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNL+CNGPCL+ETQL+LNCLD++F NFLFYNK
Subjt:  MASPIGFLLILLALAASLRYPSVCA-EEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNK

Query:  ATSQNVRNALRVGCSYTSQRG
        AT+Q+VRNALR+GCS++SQRG
Subjt:  ATSQNVRNALRVGCSYTSQRG

A0A6J1E0J9 uncharacterized protein LOC1110252895.3e-5684.5Show/hide
Query:  MASPIGFLLILLALAASLRYPSVCAEEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKA
        MA P  FL ILLALA SL  PSVCAEEIPGQVVTEA +CFDNKFIYNGC+S YRLNPSGSFNVPPEATNL+C+GPCLVET+LVL+CLDD FGNFLFYNKA
Subjt:  MASPIGFLLILLALAASLRYPSVCAEEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKA

Query:  TSQNVRNALRVGCSYTSQRGNFNPGLFMQ
         +QNVRNALR GCSY+SQRGNFNPGLFMQ
Subjt:  TSQNVRNALRVGCSYTSQRGNFNPGLFMQ

A0A6J1JMH7 uncharacterized protein LOC1114865291.9e-4569.77Show/hide
Query:  MASPIGFLLILLALAASLRYPSVCAEEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKA
        MASP+  LL+L   A +L   SVC EE  GQ VT+ F+CFDN  IYNGCESAYRLNPSG+ NVP +ATNL+CNGPCL+ETQL+LNCLD  F NFLFYNKA
Subjt:  MASPIGFLLILLALAASLRYPSVCAEEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKA

Query:  TSQNVRNALRVGCSYTSQRGNFNPGLFMQ
        T   V+NALR GCSY++QRGNFNPG FMQ
Subjt:  TSQNVRNALRVGCSYTSQRGNFNPGLFMQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G56320.1 BEST Arabidopsis thaliana protein match is: Glycine-rich protein family (TAIR:AT5G49350.2)1.6e-2852.94Show/hide
Query:  GQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKATSQNVRNALRVGCSYTSQRGNFNPGLFMQ
        G +V  A  CF+N  +Y GC  A+RLN  G F VPPE T+ +CNGPC  ET+LVL C++    +F+FYN+AT ++VRNALR GCS +  RGNFN G + Q
Subjt:  GQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKATSQNVRNALRVGCSYTSQRGNFNPGLFMQ

Query:  DL
         L
Subjt:  DL

AT5G49350.1 Glycine-rich protein family9.3e-2140.21Show/hide
Query:  EIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKATSQNVRNALRVGCSYTSQRGNFN
        E P ++V +A +C + K IY  C+ ++RL  +G  N+P   T  +C GPC  ET L LNC+++   ++ F+N+AT  ++R  L+ GCSY  +RG FN
Subjt:  EIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKATSQNVRNALRVGCSYTSQRGNFN

AT5G49350.2 Glycine-rich protein family9.3e-2140.21Show/hide
Query:  EIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKATSQNVRNALRVGCSYTSQRGNFN
        E P ++V +A +C + K IY  C+ ++RL  +G  N+P   T  +C GPC  ET L LNC+++   ++ F+N+AT  ++R  L+ GCSY  +RG FN
Subjt:  EIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKATSQNVRNALRVGCSYTSQRGNFN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCCCAATTGGCTTCCTCCTCATCCTTCTCGCTCTCGCCGCCTCGCTCCGCTACCCCTCAGTTTGTGCTGAGGAGATTCCAGGCCAAGTGGTGACTGAGGCCTT
CAAATGTTTTGACAATAAGTTTATATACAATGGGTGTGAAAGTGCTTATAGATTGAACCCAAGTGGGAGTTTCAATGTCCCTCCTGAGGCTACTAATCTGTATTGCAATG
GACCGTGCTTGGTCGAAACACAACTCGTGCTCAACTGCCTTGACGACACATTTGGAAACTTCTTATTCTACAACAAAGCCACATCGCAAAATGTTCGAAACGCCCTCCGT
GTCGGCTGCAGCTACACAAGCCAGAGAGGGAACTTCAATCCAGGATTGTTCATGCAAGATCTGTCGCCGCCGCCGCCCTCTTTCTCCTTCCATCCCGCCGGCAGCTTCTT
CTTCTTCTTCTTTTTGCTCAGATCTGGTCGGAGGCTTGTTTTTTGGGTGGCTGTAGGCAATGAAGGCGGCGGCGACGGTGACCGGGGAGAAGAGGAGACGGTGGCGGCGG
GCGGTAGCTGCGGCGTGGGCTGTGTGGCGGCGACGGCAATCGGCGGCGAAAGGAAGGAAAGAGAGGGAGAGGGGAAGGAGAAAGAGAAAGAAGGGAAAAGAAAGGAAGGG
GGGTTTCGGGAAGTGGGAAGAAAGAGGAGAGAGAGTGGGATTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCCCAATTGGCTTCCTCCTCATCCTTCTCGCTCTCGCCGCCTCGCTCCGCTACCCCTCAGTTTGTGCTGAGGAGATTCCAGGCCAAGTGGTGACTGAGGCCTT
CAAATGTTTTGACAATAAGTTTATATACAATGGGTGTGAAAGTGCTTATAGATTGAACCCAAGTGGGAGTTTCAATGTCCCTCCTGAGGCTACTAATCTGTATTGCAATG
GACCGTGCTTGGTCGAAACACAACTCGTGCTCAACTGCCTTGACGACACATTTGGAAACTTCTTATTCTACAACAAAGCCACATCGCAAAATGTTCGAAACGCCCTCCGT
GTCGGCTGCAGCTACACAAGCCAGAGAGGGAACTTCAATCCAGGATTGTTCATGCAAGATCTGTCGCCGCCGCCGCCCTCTTTCTCCTTCCATCCCGCCGGCAGCTTCTT
CTTCTTCTTCTTTTTGCTCAGATCTGGTCGGAGGCTTGTTTTTTGGGTGGCTGTAGGCAATGAAGGCGGCGGCGACGGTGACCGGGGAGAAGAGGAGACGGTGGCGGCGG
GCGGTAGCTGCGGCGTGGGCTGTGTGGCGGCGACGGCAATCGGCGGCGAAAGGAAGGAAAGAGAGGGAGAGGGGAAGGAGAAAGAGAAAGAAGGGAAAAGAAAGGAAGGG
GGGTTTCGGGAAGTGGGAAGAAAGAGGAGAGAGAGTGGGATTTTTTAA
Protein sequenceShow/hide protein sequence
MASPIGFLLILLALAASLRYPSVCAEEIPGQVVTEAFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLYCNGPCLVETQLVLNCLDDTFGNFLFYNKATSQNVRNALR
VGCSYTSQRGNFNPGLFMQDLSPPPPSFSFHPAGSFFFFFFLLRSGRRLVFWVAVGNEGGGDGDRGEEETVAAGGSCGVGCVAATAIGGERKEREGEGKEKEKEGKRKEG
GFREVGRKRRESGIF