CuGenDBv2

Cucsat.G3185 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G3185
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Late embryogenesis abundant protein (LEA) family protein
Genome location: ctg1041:1976651..1978615
RNA-Seq Expression: Cucsat.G3185
Synteny: Cucsat.G3185
Gene Ontology terms: NA
InterPro domains: NA
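
The genome location field above follows a contig:start..end pattern. The following is a minimal sketch for splitting it into parts and computing the span of the gene. The function name parse_location is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


contig, start, end = parse_location("ctg1041:1976651..1978615")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```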


Homology
GenBank top hits | e value | % identity
GAV70662.1 hypothetical protein CFOL_v3_14160, partial [Cephalotus follicularis] | 2.36e-34 | 53.73
Query:  SIIVHNPWSSNVFVSLRPRH----GSSQVTAAAADEAAHVVKRTTERA---AEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVK
        ++++ NPWSS VFV +  R     GS Q  ++A D+A    K+    A    E+VKDKA S A++VT++ ++ A KV ETAQD+  KAKQTV+DAWGSVK
Subjt:  SIIVHNPWSSNVFVSLRPRH----GSSQVTAAAADEAAHVVKRTTERA---AEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVK

Query:  DTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKS
        DTT  IK  VVGKAEESKE+IK+TAEN+K+N+K+
Subjt:  DTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKS

XP_008457975.1 PREDICTED: uncharacterized protein At4g13230 [Cucumis melo] | 5.23e-80 | 91.16
Query:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK
        MASFLATSAT+PKLQRAGSIIVHNPWSS VFVSLRPRHGSSQV+AAA D AA  VKRTTE+AAEEVKDKAVSAA+EVTQ+TK+VAGKVSETAQDMAGKAK
Subjt:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK

Query:  QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC
        QTVEDAWGSVKDTTQNIK+KVVGKAEESKEAIKDTAENIK NIKSNC
Subjt:  QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC

XP_011659168.1 uncharacterized protein At4g13230 [Cucumis sativus] | 3.30e-90 | 100
Query:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK
        MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK
Subjt:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK

Query:  QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC
        QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC
Subjt:  QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC

XP_022152517.1 uncharacterized protein At4g13230 [Momordica charantia] | 2.98e-66 | 78.52
Query:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQ--VTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGK
        MAS LA++ATI KL RAGS++V +PWSS VFVS+RPRH SS     AAAAD AA  VKR TE+AAEEVKDKA SAA+EVTQ+TKEVAGKVSETAQD+AGK
Subjt:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQ--VTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGK

Query:  AKQTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC
        AKQTV++AWGSVKDTTQNIK+KVVGKAEESKEAIKDTA+NIK  IKSNC
Subjt:  AKQTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC

XP_038897424.1 uncharacterized protein At4g13230 [Benincasa hispida] | 2.25e-72 | 82.99
Query:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK
        MASFLA SAT+PKL RA SI V  PWSS VFVS+RP+H SS+V AAAAD AA VVKR TE+AAE+VKDKAVSAADEVTQ+TKEVAGKVSETAQD+AGKAK
Subjt:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK

Query:  QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC
        QTV+DAWGSVKDTTQNIK+KVVGKAEESKEAIKDTA+N+KKNIK+NC
Subjt:  QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC

TrEMBL top hits | e value | % identity
A0A0A0K9U4 Uncharacterized protein | 1.60e-90 | 100
Query:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK
        MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK
Subjt:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK

Query:  QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC
        QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC
Subjt:  QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC

A0A1Q3BS03 Uncharacterized protein (Fragment) | 1.14e-34 | 53.73
Query:  SIIVHNPWSSNVFVSLRPRH----GSSQVTAAAADEAAHVVKRTTERA---AEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVK
        ++++ NPWSS VFV +  R     GS Q  ++A D+A    K+    A    E+VKDKA S A++VT++ ++ A KV ETAQD+  KAKQTV+DAWGSVK
Subjt:  SIIVHNPWSSNVFVSLRPRH----GSSQVTAAAADEAAHVVKRTTERA---AEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVK

Query:  DTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKS
        DTT  IK  VVGKAEESKE+IK+TAEN+K+N+K+
Subjt:  DTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKS

A0A1S3C7D4 uncharacterized protein At4g13230 | 2.53e-80 | 91.16
Query:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK
        MASFLATSAT+PKLQRAGSIIVHNPWSS VFVSLRPRHGSSQV+AAA D AA  VKRTTE+AAEEVKDKAVSAA+EVTQ+TK+VAGKVSETAQDMAGKAK
Subjt:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAK

Query:  QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC
        QTVEDAWGSVKDTTQNIK+KVVGKAEESKEAIKDTAENIK NIKSNC
Subjt:  QTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC

A0A5C7I5X2 Uncharacterized protein | 2.30e-34 | 53.38
Query:  MASF-LATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAA-DEAAHVVKRTTERAAE---EVKDKAVSAADEVTQRTKEVAGKVSETAQDM
        MASF LATS  +PK  +  ++++ NPWSS VFV +R    SS+  A+ A D+AA   K+    A E   +VK++A S A+  TQ+TK+VAGKVSETAQ +
Subjt:  MASF-LATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAA-DEAAHVVKRTTERAAE---EVKDKAVSAADEVTQRTKEVAGKVSETAQDM

Query:  AGKAKQTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNI
          KAKQTV+DAWGSVKDTTQ IK  V GKAEESK+++K+ A  +K++I
Subjt:  AGKAKQTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNI

A0A6J1DGG7 uncharacterized protein At4g13230 | 1.44e-66 | 78.52
Query:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQ--VTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGK
        MAS LA++ATI KL RAGS++V +PWSS VFVS+RPRH SS     AAAAD AA  VKR TE+AAEEVKDKA SAA+EVTQ+TKEVAGKVSETAQD+AGK
Subjt:  MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQ--VTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGK

Query:  AKQTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC
        AKQTV++AWGSVKDTTQNIK+KVVGKAEESKEAIKDTA+NIK  IKSNC
Subjt:  AKQTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC

SwissProt top hits | e value | % identity
Q8LFD5 Uncharacterized protein At4g13230 | 1.7e-09 | 39.8
Query:  ADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKS
        A     +V  TT    + V DKA  A  +V ++  E A  +S+ A ++  KAK T E+AW  VKDTT+ IK  V GK EE+KE+IK TA+ +++++ +
Subjt:  ADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKS
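
The % identity value reported for a hit can be checked against its alignment. The sketch below counts the letters in the match row of the Q8LFD5 alignment above (letters mark identical residues, "+" marks similar ones, blanks mark mismatches) and divides by the alignment length. It assumes the alignment has no gaps, so the query row length is used as the alignment length. The function name percent_identity is hypothetical.

```python
# Query row and match row copied from the Q8LFD5 alignment above.
QUERY = (
    "ADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVKD"
    "TTQNIKQKVVGKAEESKEAIKDTAENIKKNIKS"
)
MATCH_ROW = (
    "A     +V  TT    + V DKA  A  +V ++  E A  +S+ A ++  KAK T E+AW  VKD"
    "TT+ IK  V GK EE+KE+IK TA+ +++++ +"
)


def percent_identity(query: str, match_row: str) -> tuple[int, float]:
    """Return (identical positions, percent identity) for an ungapped alignment."""
    # Only letters in the match row are identities; '+' and blanks are not.
    identical = sum(ch.isalpha() for ch in match_row)
    # Assumption: no gaps, so the alignment length equals the query row length.
    return identical, 100.0 * identical / len(query)


identical, pct = percent_identity(QUERY, MATCH_ROW)
print(f"{identical}/{len(QUERY)} identical = {pct:.1f}%")
```

For alignments that contain gap characters ("-"), the denominator would need to be the full alignment length including gap columns.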

Arabidopsis top hits | e value | % identity
AT1G04670.1 unknown protein | 9.9e-05 | 36.27
Query:  VFVSLRPRHGSSQV--------TAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVKDTTQNIKQKV
        V  S RP  G  +          A A  E A  VK TT    E ++D A + A  VT+ TK+V  KV+ET   +  KAK +V    G+ K+ T  IK K+
Subjt:  VFVSLRPRHGSSQV--------TAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVKDTTQNIKQKV

Query:  VG
        +G
Subjt:  VG

AT4G13230.1 Late embryogenesis abundant protein (LEA) family protein | 1.2e-10 | 39.8
Query:  ADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKS
        A     +V  TT    + V DKA  A  +V ++  E A  +S+ A ++  KAK T E+AW  VKDTT+ IK  V GK EE+KE+IK TA+ +++++ +
Subjt:  ADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSVKDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKS


Sequences
CDS sequence
ATGGCAAGCTTTTTGGCAACATCAGCAACCATTCCAAAGCTTCAACGTGCTGGATCTATTATTGTACACAACCCATGGAGTTCTAATGTTTTTGTCTCTCTCAGACCAAG
ACATGGAAGTTCACAGGTTACTGCAGCAGCTGCTGATGAAGCTGCTCATGTCGTGAAGCGAACCACCGAGAGGGCCGCCGAAGAAGTCAAAGATAAGGCCGTCTCTGCCG
CCGATGAGGTGACTCAAAGAACAAAAGAAGTAGCAGGGAAGGTGAGTGAAACAGCACAAGATATGGCTGGGAAGGCAAAGCAAACAGTTGAAGATGCATGGGGATCAGTG
AAGGATACAACTCAAAACATTAAACAGAAAGTGGTTGGCAAAGCTGAAGAATCTAAAGAAGCCATTAAAGACACTGCAGAAAACATCAAAAAAAATATCAAGTCTAACTG
TTGA
mRNA sequence
ATGGCAAGCTTTTTGGCAACATCAGCAACCATTCCAAAGCTTCAACGTGCTGGATCTATTATTGTACACAACCCATGGAGTTCTAATGTTTTTGTCTCTCTCAGACCAAG
ACATGGAAGTTCACAGGTTACTGCAGCAGCTGCTGATGAAGCTGCTCATGTCGTGAAGCGAACCACCGAGAGGGCCGCCGAAGAAGTCAAAGATAAGGCCGTCTCTGCCG
CCGATGAGGTGACTCAAAGAACAAAAGAAGTAGCAGGGAAGGTGAGTGAAACAGCACAAGATATGGCTGGGAAGGCAAAGCAAACAGTTGAAGATGCATGGGGATCAGTG
AAGGATACAACTCAAAACATTAAACAGAAAGTGGTTGGCAAAGCTGAAGAATCTAAAGAAGCCATTAAAGACACTGCAGAAAACATCAAAAAAAATATCAAGTCTAACTG
TTGA
Protein sequence
MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSV
KDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC
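
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch in plain Python that assumes the standard genetic code (NCBI translation table 1). The function name translate is hypothetical, and the sequences are copied from the record above.

```python
# Standard genetic code (NCBI table 1), built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

CDS = (
    "ATGGCAAGCTTTTTGGCAACATCAGCAACCATTCCAAAGCTTCAACGTGCTGGATCTATTATTGTACACAACCCATGGAGTTCTAATGTTTTTGTCTCTCTCAGACCAAG"
    "ACATGGAAGTTCACAGGTTACTGCAGCAGCTGCTGATGAAGCTGCTCATGTCGTGAAGCGAACCACCGAGAGGGCCGCCGAAGAAGTCAAAGATAAGGCCGTCTCTGCCG"
    "CCGATGAGGTGACTCAAAGAACAAAAGAAGTAGCAGGGAAGGTGAGTGAAACAGCACAAGATATGGCTGGGAAGGCAAAGCAAACAGTTGAAGATGCATGGGGATCAGTG"
    "AAGGATACAACTCAAAACATTAAACAGAAAGTGGTTGGCAAAGCTGAAGAATCTAAAGAAGCCATTAAAGACACTGCAGAAAACATCAAAAAAAATATCAAGTCTAACTG"
    "TTGA"
)
PROTEIN = (
    "MASFLATSATIPKLQRAGSIIVHNPWSSNVFVSLRPRHGSSQVTAAAADEAAHVVKRTTERAAEEVKDKAVSAADEVTQRTKEVAGKVSETAQDMAGKAKQTVEDAWGSV"
    "KDTTQNIKQKVVGKAEESKEAIKDTAENIKKNIKSNC"
)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon; stop codons are rendered as '*'."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


translated = translate(CDS)
# The listed protein omits the terminal stop, so strip '*' before comparing.
print(translated.rstrip("*") == PROTEIN)
```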