CuGenDBv2

CmoCh04G003990 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G003990
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: late embryogenesis abundant protein At5g17165-like isoform X1
Genome location: Cmo_Chr04:1964819..1969599
RNA-Seq Expression: CmoCh04G003990
Synteny: CmoCh04G003990
Gene Ontology terms: NA
InterPro domains: IPR039291 - Late embryogenesis abundant protein At5g17165-like


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAG6600216.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 9.3e-62 | % identity: 98.41
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAANSRSAGAIAGLGKRITNQIWT DSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQ TAAAGSRAADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPHGF
        NHAAAEEEKAWFRPTSLEDSEKPHGF
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPHGF

XP_022941802.1 late embryogenesis abundant protein At5g17165-like isoform X1 [Cucurbita moschata] | e-value: 2.9e-63 | % identity: 100
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPHGF
        NHAAAEEEKAWFRPTSLEDSEKPHGF
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPHGF

XP_022941803.1 late embryogenesis abundant protein At5g17165-like isoform X2 [Cucurbita moschata] | e-value: 2.7e-61 | % identity: 99.21
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKF RAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPHGF
        NHAAAEEEKAWFRPTSLEDSEKPHGF
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPHGF

XP_022990289.1 late embryogenesis abundant protein At5g17165-like isoform X1 [Cucurbita maxima] | e-value: 2.5e-59 | % identity: 95.24
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAANSRSAGAIAGLGKRITNQI TSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEA+QPTAA  +RAADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPHGF
        NHAA EEEKAWFRPTSLEDSEKPHGF
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPHGF

XP_023512363.1 late embryogenesis abundant protein At5g17165-like [Cucurbita pepo subsp. pepo] | e-value: 6.7e-60 | % identity: 96.83
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKF RAAHTSAYDKNPDEQVRPSIVPDDVIQPQ AEKYWAPHPHTGVFGPE DQPTA AGSRAADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPHGF
        NHAAAEEEKAWFRPTSLEDSEKPHGF
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPHGF

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A1S3BYE5 uncharacterized protein LOC103494437 | e-value: 1.5e-49 | % identity: 82.44
Query:  MAANSRSAGAIAGLGKRITNQIWTS-----DSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTA-AAGS
        MAANSRSAGAIAGLGKRIT+QIWTS     +S ISSSA KFRRAAHTS YDKNP+EQVRPSIVPDDVIQPQAA+KYWAPHP TGVFGP +D P A AA +
Subjt:  MAANSRSAGAIAGLGKRITNQIWTS-----DSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTA-AAGS

Query:  RAADGGNHAAAEEEKAWFRPTSLEDSEKPHG
        RAAD GN++AAEEEKAWFRPTSLEDSEKPHG
Subjt:  RAADGGNHAAAEEEKAWFRPTSLEDSEKPHG

A0A6J1FNH5 late embryogenesis abundant protein At5g17165-like isoform X1 | e-value: 1.4e-63 | % identity: 100
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPHGF
        NHAAAEEEKAWFRPTSLEDSEKPHGF
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPHGF

A0A6J1FT47 late embryogenesis abundant protein At5g17165-like isoform X2 | e-value: 1.3e-61 | % identity: 99.21
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKF RAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPHGF
        NHAAAEEEKAWFRPTSLEDSEKPHGF
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPHGF

A0A6J1JRN3 late embryogenesis abundant protein At5g17165-like isoform X1 | e-value: 1.2e-59 | % identity: 95.24
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAANSRSAGAIAGLGKRITNQI TSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEA+QPTAA  +RAADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPHGF
        NHAA EEEKAWFRPTSLEDSEKPHGF
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPHGF

A0A6J1JSV5 late embryogenesis abundant protein At5g17165-like isoform X2 | e-value: 1.1e-57 | % identity: 94.44
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAANSRSAGAIAGLGKRITNQI TSDSAISSSALKF RAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEA+QPTAA  +RAADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPHGF
        NHAA EEEKAWFRPTSLEDSEKPHGF
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPHGF

SwissProt top hits (columns: e-value, % identity, alignment)
F4KFM8 Late embryogenesis abundant protein At5g17165 | e-value: 5.9e-19 | % identity: 46.77
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAA S++   I  +G+ I N +    S   +  L   R  HTSAYDKN +E+++PS VPD++I+P  ++KYW+PHP TGVFGP +    A    R   GG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPH
           +  EEKAWFRPTSLED +K H
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPH

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT3G03150.1 unknown protein | e-value: 3.2e-20 | % identity: 46.67
Query:  SRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGGNHAA
        S+S   I  L K + N    S  A +S+    RR+ H+SAYDKN ++++  S VPD+VI+P  ++KYW+PHP TGVFGP   + +A A     D     A
Subjt:  SRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGGNHAA

Query:  AEEEKAWFRPTSLEDSEKPH
          EE AWFRPTSLEDS+K H
Subjt:  AEEEKAWFRPTSLEDSEKPH

AT5G17165.1 unknown protein | e-value: 4.2e-20 | % identity: 46.77
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG
        MAA S++   I  +G+ I N +    S   +  L   R  HTSAYDKN +E+++PS VPD++I+P  ++KYW+PHP TGVFGP +    A    R   GG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGG

Query:  NHAAAEEEKAWFRPTSLEDSEKPH
           +  EEKAWFRPTSLED +K H
Subjt:  NHAAAEEEKAWFRPTSLEDSEKPH
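
The % identity values listed with the hits above can be cross-checked against the alignment midlines. The following is a minimal sketch, assuming identity is defined as the number of identical columns divided by the alignment length, and that the midline follows the usual BLAST convention (a letter for an identical column, `+` for a conservative substitution, a space for a mismatch or gap). The function name `percent_identity` is hypothetical, and the database's exact method is not stated on this page.

```python
def percent_identity(midline: str) -> float:
    """Return identical columns / alignment length * 100, rounded to 2 places.

    Assumes the midline covers the whole alignment and has not been
    stripped of leading or trailing spaces.
    """
    if not midline:
        raise ValueError("empty midline")
    # A letter in the midline marks an identical column; '+' and ' ' do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Midline of the first GenBank hit (KAG6600216.1), both segments joined.
midline = (
    "MAANSRSAGAIAGLGKRITNQIWT DSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQ TAAAGSRAADGG"
    "NHAAAEEEKAWFRPTSLEDSEKPHGF"
)
print(percent_identity(midline))
```

For this hit the midline has 124 identical columns out of 126, which gives 98.41 and agrees with the listed value.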


Sequences
CDS sequence
ATGGCCGCCAATTCGAGGAGCGCAGGAGCGATCGCCGGCTTGGGAAAACGAATCACAAACCAGATCTGGACCAGCGATTCTGCGATCTCCTCCTCTGCTCTGAAGTTCAG
GAGGGCAGCTCACACCTCAGCGTATGACAAAAACCCCGACGAGCAAGTCCGACCAAGCATAGTCCCTGATGATGTGATTCAGCCTCAAGCTGCTGAAAAGTACTGGGCTC
CTCATCCACATACCGGTGTTTTTGGGCCAGAGGCCGATCAGCCTACTGCAGCGGCGGGAAGCCGTGCTGCAGATGGTGGCAACCACGCTGCTGCGGAGGAGGAAAAGGCG
TGGTTCCGACCAACCAGTCTGGAGGATTCCGAGAAACCCCACGGGTTTTAG
mRNA sequence
CTGTACTCGAACAAAAAAAGGAAAAAGAAAAAGAAAAAATGAAACGACCGTGAATAAACGTCTGCCACGTGGTCCAATAGAAAACGAAAGGGGAAATTCCGTACGTGAGA
TGAAACTGAAAGTGGGTGTGGACGGACGGAGAAGGCAATTTGCGTTGAGAGCTTTTGTCGGCTTTTGGCGGGTATGATACGACGCCGTACTCAAGAACTATAAAAGAAGG
ATTTGCTGCTCCAGGATTAGAATCAAGAATCAAAAGCAATTAATCGCCGTTTTTCTTCTTCATTCTCTCTCTGTTTAAGCCGGAATTGTATGGCCGCCAATTCGAGGAGC
GCAGGAGCGATCGCCGGCTTGGGAAAACGAATCACAAACCAGATCTGGACCAGCGATTCTGCGATCTCCTCCTCTGCTCTGAAGTTCAGGAGGGCAGCTCACACCTCAGC
GTATGACAAAAACCCCGACGAGCAAGTCCGACCAAGCATAGTCCCTGATGATGTGATTCAGCCTCAAGCTGCTGAAAAGTACTGGGCTCCTCATCCACATACCGGTGTTT
TTGGGCCAGAGGCCGATCAGCCTACTGCAGCGGCGGGAAGCCGTGCTGCAGATGGTGGCAACCACGCTGCTGCGGAGGAGGAAAAGGCGTGGTTCCGACCAACCAGTCTG
GAGGATTCCGAGAAACCCCACGGGTTTTAGGGCCGGCCATAATAAGGCATTGTTGTTCTTCTGGCCATCCGAAAGTAGACAACAAAGTAGCCAACTAAAAAGGTACTAAA
TAAAGGAGATGGTTGTTTGTGTTGCTGTACTAAGGGTTTGGAGTCTTGAGTTACTTGAGGTTGAGGCATTCTGGCTAGGATGCTTCGCCTCAGCTTGTTATATGTTATTC
TACCAAACTTTATTTTGACTATGATGCTTTAGTGCAGGTTTATAACTATTCCTCATCCAACCTAGAATAATACGCAAT
Protein sequence
MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPDDVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGGNHAAAEEEKA
WFRPTSLEDSEKPHGF
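
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGCCGCCAATTCGAGGAGCGCAGGAGCG"))  # MAANSRSAGA
```

Passing the full CDS (the four wrapped lines joined) instead of the 30-nucleotide prefix allows the result to be compared with the complete protein sequence.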