; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G009775 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G009775
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionLate embryogenesis abundant protein family protein, putative
Genome locationCG_Chr11:16125249..16129453
RNA-Seq ExpressionClCG11G009775
SyntenyClCG11G009775
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598970.1 hypothetical protein SDJN03_08748, partial [Cucurbita argyrosperma subsp. sororia]3.3e-4167.07Show/hide
Query:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA
        M S TLFKSLPKF      AI +RS TSN +L+ ASNPKSIH +     S          EAKDAINP ANEGM PGE+MM+DKAYSTAEHVSEKT DMA
Subjt:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA

Query:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS
        GM+SA+A EVSAKAK+AMEA    AKD+A RAK T+V+T KDSK+FVKANAK+VEK MNTKNRS
Subjt:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS

XP_022933143.1 uncharacterized protein At4g13230 [Cucurbita moschata]3.3e-4167.07Show/hide
Query:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA
        M SSTLFKSLPKF      AI +RS TSN +L+ ASNPK IH +     S          EAKDAINP ANEGM PGE+MM+DKAYSTAEHVSEKT DMA
Subjt:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA

Query:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS
        GM+SA+A EVSAKAK+AMEA    AKD+A RAK T+V+T KDSK+FVKANAK+VEK MNTKNRS
Subjt:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS

XP_022973925.1 uncharacterized protein At4g13230 [Cucurbita maxima]5.2e-3966.46Show/hide
Query:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA
        M SSTLFKSLPKF     A I  RS TSN +L  ASNPK IH +     S          EAKDAINP ANEGM  GE MM+DKAYSTAEHVSEKT DMA
Subjt:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA

Query:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS
        GM+SA+A EVSAKAKQAMEA    AKD+A RAK ++V+T KDSK+FVKANAK+VEK MNTKNRS
Subjt:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS

XP_023546410.1 uncharacterized protein At4g13230 [Cucurbita pepo subsp. pepo]7.3e-4166.46Show/hide
Query:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA
        M SSTLFKSLPKF      AI +RS TSN +L+ ASNPKSIH +                EAKDAI P ANEGM PGE+MM+DKAYSTAEHVSEKT DMA
Subjt:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA

Query:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS
        GM+SA+A EVSAKAK+AMEA    AKD+A RAK T+V+T KDSK+FVKANAK+VEK MNTKNRS
Subjt:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS

XP_038889268.1 uncharacterized protein At4g13230-like [Benincasa hispida]8.9e-5580.37Show/hide
Query:  MASSTLFKSLPK---FVAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAG
        MASSTLFK+LPK   F AAIT+RSTTSN TLILASNPK IH ++    S          EAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKT DMAG
Subjt:  MASSTLFKSLPK---FVAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAG

Query:  MVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS
        MVSAKAH VSAKAKQAMEAAWDSAKDTAQRAK TLVDTA DSK+FVKAN KSVEKSMNTKN S
Subjt:  MVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS

TrEMBL top hitse value%identityAlignment
A0A2N9J2L7 Uncharacterized protein2.0e-2348.75Show/hide
Query:  MASSTLFKSLPKF--VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAGM
        MAS +L   LPKF    A   R  T NP L +ASNP+ I  S     S+ + S+     A DA+  GAN+    G+  +K+KA+STAEHV++KT DMAGM
Subjt:  MASSTLFKSLPKF--VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAGM

Query:  VSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKN
        +SA A +V+ KAKQ  + AW +AKDTAQ+AK T++  A++SKE +K NA++V++SMNTKN
Subjt:  VSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKN

A0A6J1CSM2 uncharacterized protein At4g13230-like3.8e-3562.35Show/hide
Query:  MASSTLFKSLPKFV---AAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAG
        MAS TL K+LPKF    AAI +RST SNP L+L SNPK +H       S  D       EAKDAINPGANE MMPGE+MM + AYSTA+HV EK  DM G
Subjt:  MASSTLFKSLPKFV---AAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAG

Query:  MVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNR
        MV +   E+S KAKQ MEAAWDS    AQRAK T+V+  K+SKEFVKANA+SV+KSMNTKNR
Subjt:  MVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNR

A0A6J1F3W7 uncharacterized protein At4g132301.6e-4167.07Show/hide
Query:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA
        M SSTLFKSLPKF      AI +RS TSN +L+ ASNPK IH +     S          EAKDAINP ANEGM PGE+MM+DKAYSTAEHVSEKT DMA
Subjt:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA

Query:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS
        GM+SA+A EVSAKAK+AMEA    AKD+A RAK T+V+T KDSK+FVKANAK+VEK MNTKNRS
Subjt:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS

A0A6J1ICL4 uncharacterized protein At4g132302.5e-3966.46Show/hide
Query:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA
        M SSTLFKSLPKF     A I  RS TSN +L  ASNPK IH +     S          EAKDAINP ANEGM  GE MM+DKAYSTAEHVSEKT DMA
Subjt:  MASSTLFKSLPKF----VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMA

Query:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS
        GM+SA+A EVSAKAKQAMEA    AKD+A RAK ++V+T KDSK+FVKANAK+VEK MNTKNRS
Subjt:  GMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRS

A0A7N2KQQ7 Uncharacterized protein5.3e-2148.45Show/hide
Query:  MASSTLFKSLPKF---VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAG
        MAS TL  SLPKF    +AI +RST  NP L  A  P+ I  S   + S         + A DA+  GANE    GE  +KDKA STAEHVS+ T DMAG
Subjt:  MASSTLFKSLPKF---VAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAG

Query:  MVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKN
         +SA A +V+ K KQ  + AW SAKDTAQ+AK  ++   ++SKE +K +A++V+ SMNTKN
Subjt:  MVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKN

SwissProt top hitse value%identityAlignment
Q8LFD5 Uncharacterized protein At4g132306.5e-0842.5Show/hide
Query:  DKAYSTAEHVSEKTMDMAGMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKN
        DKA    + V++K  + A  +S  A  +  KAK   E AWD  KDT ++ K T+    +++KE +KA AK+VE+SMNTKN
Subjt:  DKAYSTAEHVSEKTMDMAGMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKN

Arabidopsis top hitse value%identityAlignment
AT4G13230.1 Late embryogenesis abundant protein (LEA) family protein4.6e-0942.5Show/hide
Query:  DKAYSTAEHVSEKTMDMAGMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKN
        DKA    + V++K  + A  +S  A  +  KAK   E AWD  KDT ++ K T+    +++KE +KA AK+VE+SMNTKN
Subjt:  DKAYSTAEHVSEKTMDMAGMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKN

AT5G44310.1 Late embryogenesis abundant protein (LEA) family protein9.0e-0532.99Show/hide
Query:  KTSHLDESTTRWAE-AKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAGMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDS
        K   + E T  +AE  KD +N GA+      E   KDKA   AE   EK  DMA     KA ++  K    ++  W++AK TAQ+    +V + +++
Subjt:  KTSHLDESTTRWAE-AKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAGMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDS

AT5G44310.2 Late embryogenesis abundant protein (LEA) family protein9.0e-0532.99Show/hide
Query:  KTSHLDESTTRWAE-AKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAGMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDS
        K   + E T  +AE  KD +N GA+      E   KDKA   AE   EK  DMA     KA ++  K    ++  W++AK TAQ+    +V + +++
Subjt:  KTSHLDESTTRWAE-AKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAGMVSAKAHEVSAKAKQAMEAAWDSAKDTAQRAKHTLVDTAKDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGCTCCACCCTCTTCAAAAGCCTTCCAAAGTTTGTTGCTGCAATTACAAAAAGATCAACAACCTCCAACCCTACTCTCATTTTGGCTTCTAACCCTAAATCTAT
TCACGTAAGTATTGTCTACAAGACTTCACATTTAGACGAGTCCACCACAAGATGGGCAGAAGCCAAAGATGCCATAAACCCAGGAGCAAATGAAGGAATGATGCCTGGTG
AAAATATGATGAAAGATAAGGCTTACTCCACTGCAGAACATGTGAGTGAAAAGACAATGGATATGGCAGGAATGGTAAGTGCAAAAGCACATGAGGTTTCAGCAAAGGCA
AAGCAAGCAATGGAAGCAGCATGGGACTCAGCAAAGGACACAGCCCAAAGGGCAAAACACACATTGGTTGACACTGCCAAAGACTCCAAGGAATTTGTCAAAGCAAATGC
TAAGTCTGTTGAGAAGAGCATGAACACCAAGAACCGTTCTTATATCCCATTAACCCTCTTCTCAAACTTACACTTTACTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGCTCCACCCTCTTCAAAAGCCTTCCAAAGTTTGTTGCTGCAATTACAAAAAGATCAACAACCTCCAACCCTACTCTCATTTTGGCTTCTAACCCTAAATCTAT
TCACGTAAGTATTGTCTACAAGACTTCACATTTAGACGAGTCCACCACAAGATGGGCAGAAGCCAAAGATGCCATAAACCCAGGAGCAAATGAAGGAATGATGCCTGGTG
AAAATATGATGAAAGATAAGGCTTACTCCACTGCAGAACATGTGAGTGAAAAGACAATGGATATGGCAGGAATGGTAAGTGCAAAAGCACATGAGGTTTCAGCAAAGGCA
AAGCAAGCAATGGAAGCAGCATGGGACTCAGCAAAGGACACAGCCCAAAGGGCAAAACACACATTGGTTGACACTGCCAAAGACTCCAAGGAATTTGTCAAAGCAAATGC
TAAGTCTGTTGAGAAGAGCATGAACACCAAGAACCGTTCTTATATCCCATTAACCCTCTTCTCAAACTTACACTTTACTTTTTAA
Protein sequenceShow/hide protein sequence
MASSTLFKSLPKFVAAITKRSTTSNPTLILASNPKSIHVSIVYKTSHLDESTTRWAEAKDAINPGANEGMMPGENMMKDKAYSTAEHVSEKTMDMAGMVSAKAHEVSAKA
KQAMEAAWDSAKDTAQRAKHTLVDTAKDSKEFVKANAKSVEKSMNTKNRSYIPLTLFSNLHFTF