CuGenDBv2

CsGy4G000800 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy4G000800
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: indole-3-acetic acid-induced protein ARG2-like
Genome location: Gy14Chr4:495385..496528
RNA-Seq Expression: CsGy4G000800
Synteny: CsGy4G000800
Gene Ontology terms: NA
InterPro domains: IPR004926 - Late embryogenesis abundant protein, LEA_3 subgroup
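
The genome location above is written as `sequence:start..end`. The sketch below parses that string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'Gy14Chr4:495385..496528'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("Gy14Chr4:495385..496528")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, this gene spans 1144 bp on Gy14Chr4.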


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAG7018001.1 Indole-3-acetic acid-induced protein ARG2, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value: 4.19e-29; identity: 54.55%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA
        RF SGVK++ G V DG  NA++RRGYTAE  AMAAS+RA              R DG++ + E++ERS EK A W+PDPVTG YRPE+  DE+DAVDLRA
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA

Query:  KLLKPRRNTV
         +L+PRR  +
Subjt:  KLLKPRRNTV

XP_016901458.1 PREDICTED: indole-3-acetic acid-induced protein ARG2-like [Cucumis melo]; e-value: 5.15e-49; identity: 74.14%
Query:  MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEK--AAVWIPDPVTGCYRPESNMDEMDA
        MAPRFSS VK++PG V DGFLNAI+RRGYTAE MAMAASER  TST A GGVLPA  SDGV+ +KEESERST +  AA W+PDPVTG YRPE+ +DEMD 
Subjt:  MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEK--AAVWIPDPVTGCYRPESNMDEMDA

Query:  VDLRAKLLKPRRNTVN
        VDLR KLLK RRNT+N
Subjt:  VDLRAKLLKPRRNTVN

XP_023528769.1 protein SENESCENCE-ASSOCIATED GENE 21, mitochondrial-like [Cucurbita pepo subsp. pepo]; e-value: 7.28e-30; identity: 54.55%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA
        RF SGVK++ G V DG  NA++RRGYTAE  AMAAS+RA              R DG++ ++E++ERS EK A W+PDPVTG YRPE+  DE+DAVDLRA
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA

Query:  KLLKPRRNTV
         +L+PRR  +
Subjt:  KLLKPRRNTV

XP_031740222.1 protein SENESCENCE-ASSOCIATED GENE 21, mitochondrial-like [Cucumis sativus]; e-value: 1.51e-75; identity: 100%
Query:  MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVD
        MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVD
Subjt:  MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVD

Query:  LRAKLLKPRRNTVN
        LRAKLLKPRRNTVN
Subjt:  LRAKLLKPRRNTVN

XP_038903721.1 protein SENESCENCE-ASSOCIATED GENE 21, mitochondrial-like [Benincasa hispida]; e-value: 1.96e-28; identity: 60%
Query:  FSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRAK
        F SG K++   V  G  +A+ RRGYTAE  AMAAS+R  +++ A G  +PA RS+G++ +KEESE STEKAA W+PDPVTG YRPE+  D +DAVDLRAK
Subjt:  FSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRAK

Query:  LLKPRRNTVN
        LLKPRRN +N
Subjt:  LLKPRRNTVN

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0KV78 Uncharacterized protein; e-value: 7.33e-76; identity: 100%
Query:  MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVD
        MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVD
Subjt:  MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVD

Query:  LRAKLLKPRRNTVN
        LRAKLLKPRRNTVN
Subjt:  LRAKLLKPRRNTVN

A0A1S4DZQ1 indole-3-acetic acid-induced protein ARG2-like; e-value: 2.49e-49; identity: 74.14%
Query:  MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEK--AAVWIPDPVTGCYRPESNMDEMDA
        MAPRFSS VK++PG V DGFLNAI+RRGYTAE MAMAASER  TST A GGVLPA  SDGV+ +KEESERST +  AA W+PDPVTG YRPE+ +DEMD 
Subjt:  MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEK--AAVWIPDPVTGCYRPESNMDEMDA

Query:  VDLRAKLLKPRRNTVN
        VDLR KLLK RRNT+N
Subjt:  VDLRAKLLKPRRNTVN

A0A6J1DM52 late embryogenis abundant protein 2-like; e-value: 2.49e-22; identity: 47.83%
Query:  MAPRFSSGVKVLPGFVFDGFLNAI-VRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAV
        MA  FS+ VK++   V DGF NA+ +RRGY A      A++R    +             G   +KEE+ERSTE+   W+PDPVTG YRPE+  DE+D V
Subjt:  MAPRFSSGVKVLPGFVFDGFLNAI-VRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAV

Query:  DLRAKLLKPRRNTVN
        DLRA LLKPRR  VN
Subjt:  DLRAKLLKPRRNTVN

A0A6J1F2C1 protein SENESCENCE-ASSOCIATED GENE 21, mitochondrial-like; e-value: 1.66e-28; identity: 53.64%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA
        RF SGVK++ G V DG   A++RRGYTAE  AMAAS+RA              R DG++ + E++ERS EK A W+PDPVTG YRPE+  DE+DAVDLRA
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA

Query:  KLLKPRRNTV
         +L+PRR  +
Subjt:  KLLKPRRNTV

A0A6J1J9I8 protein SENESCENCE-ASSOCIATED GENE 21, mitochondrial-like; e-value: 1.97e-27; identity: 54.21%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA
        RF SGVK+  G V DG  NA++RRGYTAE  AMAAS+RA              R DG++ ++E++ERS EK A W+PDPVTG YRPE+  DE+DAVDLRA
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA

Query:  KLLKPRR
         ++  RR
Subjt:  KLLKPRR

SwissProt top hits (columns: hit, e-value, % identity, alignment)
P32292 Indole-3-acetic acid-induced protein ARG2; e-value: 2.1e-10; identity: 41.67%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA
        R  + VKVL   V DGF N   R G+ A   A AA++ AT    + GG +       V    EE  R  EK + W+PDPVTG YRPE N +E+D  D+RA
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA

Query:  KLLKPRRN
         +L  + N
Subjt:  KLLKPRRN

P46521 Late embryogenesis abundant protein Lea5-A; e-value: 7.3e-08; identity: 37.74%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA
        R  S  K L   VFD    +I RRGY+  P A      A T++    G +        + E   SE     +A W PDPVTG YRPE+   E+DA DLR 
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA

Query:  KLLKPR
         +L  R
Subjt:  KLLKPR

P46522 Late embryogenesis abundant protein Lea5-D; e-value: 6.6e-09; identity: 39.25%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEK-AAVWIPDPVTGCYRPESNMDEMDAVDLR
        R  S  K+L   +FDG   +I RRGY+  P A       T S    G +    R D +   KE S   T   ++ W PDPVTG YRPE+   E+DA +LR
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEK-AAVWIPDPVTGCYRPESNMDEMDAVDLR

Query:  AKLLKPR
          LL  R
Subjt:  AKLLKPR

Q93WF6 Protein SENESCENCE-ASSOCIATED GENE 21, mitochondrial; e-value: 9.2e-11; identity: 39.25%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA
        R  S VK++  FV     NAI RRGY A      A++ + +S   +G V  A      +++K+  E ST+K + W+PDP TG YRPE+  +E+DA +LRA
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA

Query:  KLLKPRR
         LL  ++
Subjt:  KLLKPRR

Q9SRX6 Late embryogenis abundant protein 2; e-value: 9.0e-06; identity: 33.64%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA
        R  +  K+   F  +   NA+ RRG+            A   T   G V  A       ++K   E S+EKA  W+PDP TG YRPE+  +E+D  +LRA
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA

Query:  KLLKPRR
         LL  ++
Subjt:  KLLKPRR

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT1G02820.1 Late embryogenesis abundant 3 (LEA3) family protein; e-value: 6.4e-07; identity: 33.64%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA
        R  +  K+   F  +   NA+ RRG+            A   T   G V  A       ++K   E S+EKA  W+PDP TG YRPE+  +E+D  +LRA
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA

Query:  KLLKPRR
         LL  ++
Subjt:  KLLKPRR

AT3G53770.2 late embryogenesis abundant 3 (LEA3) family protein; e-value: 6.6e-04; identity: 45.95%
Query:  WIPDPVTGCYRPESNMDEMDAVDLRAKLLKPRRNTVN
        W+PDP TG YRP++   E+DAV+LR+      + T N
Subjt:  WIPDPVTGCYRPESNMDEMDAVDLRAKLLKPRRNTVN

AT4G02380.1 senescence-associated gene 21; e-value: 6.6e-12; identity: 39.25%
Query:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA
        R  S VK++  FV     NAI RRGY A      A++ + +S   +G V  A      +++K+  E ST+K + W+PDP TG YRPE+  +E+DA +LRA
Subjt:  RFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRA

Query:  KLLKPRR
         LL  ++
Subjt:  KLLKPRR

AT4G02380.2 senescence-associated gene 21; e-value: 1.5e-08; identity: 39.74%
Query:  MAASERATTSTPAAGGVLPATRSDGV---IVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRAKLLKPRR
        +++ +R   +T A G V    RS  V   +++K+  E ST+K + W+PDP TG YRPE+  +E+DA +LRA LL  ++
Subjt:  MAASERATTSTPAAGGVLPATRSDGV---IVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRAKLLKPRR

AT4G15910.1 drought-induced 21; e-value: 4.6e-05; identity: 36.47%
Query:  IVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKA-AVWIPDPVTGCYRPESNMDEMDAVDLRAKLLK
        ++RR Y      +A S+  T +  + GG      S  V+V K E     ++A + W PDPVTG YRP +   E+D  +LR  LLK
Subjt:  IVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKA-AVWIPDPVTGCYRPESNMDEMDAVDLRAKLLK


Sequences
CDS sequence
ATGGCACCTCGCTTCTCCTCCGGCGTCAAGGTCCTACCCGGTTTCGTTTTCGATGGCTTTCTTAACGCTATCGTCAGGCGTGGATATACGGCGGAACCAATGGCA
ATGGCAGCGTCAGAAAGAGCGACGACGTCTACACCGGCTGCGGGAGGAGTACTTCCGGCCACACGAAGCGATGGAGTGATCGTGGAGAAAGAAGAATCAGAAAGG
AGTACAGAGAAGGCCGCGGTGTGGATACCCGACCCGGTTACCGGGTGTTATCGACCGGAGAGCAACATGGATGAGATGGACGCCGTGGATCTTCGTGCTAAGCTA
TTGAAGCCGAGAAGAAATACAGTTAATTGA
mRNA sequence
ATTTGATACATTTTTAAAATATTAAAAATTAAAACTATTCATAAAATATATAAAAAAATAAAAAAATAGAGAGAATTAGATAGATTTTGTAACACTTTATAAAAG
TTTTTTCACGTTTGTTATATACATCATTAATTGGGAGAAACGAATAAAAGCGAAATCGATCTTTAAAATTGAAATCGAAACAGATTAGGAAAAGGAATATAATGT
AGAATGAATCGTATATTTAGAAATGTTTTCTTAACTTTATTTTAAATCCAATTTCCACATACCTTTTCCTATTTCCTAATTCTTTCCAGTTTTATATTTCATTTT
ATATAAATACTCTTCGGGTCTTTCCATATTTCCTCACTCTCAAATTCTGTGAAAGATAACTCAATACACCTAAAAACAATACATACGCCTTTCTTTTTTCTCCTT
TTCATTTTTTCCACAAGCAAACCCCAAGTTTTCATCAACGAAACTGATTACTGACGATGGCACCTCGCTTCTCCTCCGGCGTCAAGGTCCTACCCGGTTTCGTTT
TCGATGGCTTTCTTAACGCTATCGTCAGGCGTGGATATACGGCGGAACCAATGGCAATGGCAGCGTCAGAAAGAGCGACGACGTCTACACCGGCTGCGGGAGGAG
TACTTCCGGCCACACGAAGCGATGGAGTGATCGTGGAGAAAGAAGAATCAGAAAGGAGTACAGAGAAGGCCGCGGTGTGGATACCCGACCCGGTTACCGGGTGTT
ATCGACCGGAGAGCAACATGGATGAGATGGACGCCGTGGATCTTCGTGCTAAGCTATTGAAGCCGAGAAGAAATACAGTTAATTGAACGAAGTTTGGATGCGCCA
TTGATGAAGAGAGCCATAGCTTGAGATTTGATCGTAAAGGATTTAAATATATGGATATCTTTTTCAATTCTAGATCAATTAAATATTGAAGATATCAATTAATTT
GAAATATCATACCCTCATTTATCTTATATATCAATATGATCTTGCACATATTGTTTTCATTTTATTGCTTTTAAAGCATTTTCAATGCAACAACATTTTACCCTC
Protein sequence
MAPRFSSGVKVLPGFVFDGFLNAIVRRGYTAEPMAMAASERATTSTPAAGGVLPATRSDGVIVEKEESERSTEKAAVWIPDPVTGCYRPESNMDEMDAVDLRAKL
LKPRRNTVN
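
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` is a hypothetical helper name, not part of any CuGenDB tool.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace and line breaks allowed) up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# The final 30 nt of the CDS above, which end with the TGA stop codon.
print(translate("TTGAAGCCGAGAAGAAATACAGTTAATTGA"))  # LKPRRNTVN
```

Passing the full 345-nt CDS to `translate` and comparing the result with the 114-residue protein sequence is a quick consistency check on the record.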