; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G8489 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G8489
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionCupin_3 domain-containing protein
Genome locationctg1557:3105192..3107186
RNA-Seq ExpressionCucsat.G8489
SyntenyCucsat.G8489
Gene Ontology termsNA
InterPro domainsIPR008579 - (S)-ureidoglycine aminohydrolase, cupin-3 domain
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141517.1 uncharacterized protein LOC101218376 [Cucumis sativus]4.64e-97100Show/hide
Query:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKFE
        AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKFE
Subjt:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKFE

XP_008459495.1 PREDICTED: uncharacterized protein LOC103498612 [Cucumis melo]1.44e-9195.62Show/hide
Query:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MASAVDGFL LNLKP+SLLTKPTNVSLYFGKRA SLQIRADSMATERLGIKVEKNPSESKL+ELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF
        AGS+ESVEIGAGDLVVFPKGMSCTWDVSVAVDKHY F
Subjt:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF

XP_022974049.1 uncharacterized protein LOC111472678 [Cucurbita maxima]1.05e-8489.05Show/hide
Query:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MASAV+GF LLNLKP + L+KP NVSL F KRAPSLQIRADSMATERLGIKVEKNP ESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF
        AGS+E VE+GAGDLVVFP GMSCTWDVS+AVDKHYKF
Subjt:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF

XP_023534884.1 uncharacterized protein LOC111796480 isoform X1 [Cucurbita pepo subsp. pepo]7.40e-8589.05Show/hide
Query:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MASAV+GF LLNLKP + L+KP N+SL F KRAPSLQIRADSMATERLGIKVEKNP ESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF
        AGS+E VEIGAGDLVVFP GMSCTWDVS+AVDKHYKF
Subjt:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF

XP_038889944.1 uncharacterized protein LOC120079700 [Benincasa hispida]2.30e-8892.7Show/hide
Query:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MASAV GF LLNLKP  LL+KPTNVSLYFGKRAPSLQIRADSMATE LGIKVEKNP ESKLTELGVR+WPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF
        AGS ESVEIGAGDLVVFPKGMSCTWDVSVAVDKHY F
Subjt:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF

TrEMBL top hitse value%identityAlignment
A0A0A0KVZ0 Cupin_3 domain-containing protein2.25e-97100Show/hide
Query:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKFE
        AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKFE
Subjt:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKFE

A0A1S3C9T6 uncharacterized protein LOC1034986126.97e-9295.62Show/hide
Query:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MASAVDGFL LNLKP+SLLTKPTNVSLYFGKRA SLQIRADSMATERLGIKVEKNPSESKL+ELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF
        AGS+ESVEIGAGDLVVFPKGMSCTWDVSVAVDKHY F
Subjt:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF

A0A6J1DS18 uncharacterized protein LOC1110230001.34e-8083.94Show/hide
Query:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MASAVDGFLLLNL+P +LL+KP  VSL+ GK A SL +RA+SMATE+LGIK+EKNP ESKLT+LGVRQWPKWGC PSKFPWTYSDKETC+LLEGKVKVTP
Subjt:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF
        AGS ESVEIG+GDLVVFPKGMSCTWDVSVAVDKHY F
Subjt:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF

A0A6J1EQ24 uncharacterized protein LOC1114366488.44e-8487.59Show/hide
Query:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MASA++GF LLNLKP + L+KP N+SL F K APSLQIRADSMATERLGIKVEKNP ESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF
        AGS+E VEIGAGDLVVFP GMSCTWDVS+AVDKHYKF
Subjt:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF

A0A6J1IGD9 uncharacterized protein LOC1114726785.09e-8589.05Show/hide
Query:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MASAV+GF LLNLKP + L+KP NVSL F KRAPSLQIRADSMATERLGIKVEKNP ESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF
        AGS+E VE+GAGDLVVFP GMSCTWDVS+AVDKHYKF
Subjt:  AGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G04300.1 RmlC-like cupins superfamily protein2.6e-3159.34Show/hide
Query:  LGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKFE
        + I +E NPS  +L++LGV  WPKW C P K+   + ++ETCYL++GKVKV P GS+E VE GAGDLV  PKG+SCTWDVS+ +DKHYKF+
Subjt:  LGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSNESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKFE

AT4G10280.1 RmlC-like cupins superfamily protein4.5e-2340Show/hide
Query:  FLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADS-MATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAG---S
        FL L+L   ++    + ++     +   + + +DS + TE  G+K+ +  S++KL +LGV  WPKW   PSKFPW +   ET Y +EGKVKV   G    
Subjt:  FLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADS-MATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAG---S

Query:  NESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKFE
         E+ EIG GD+VVFPK M   W+++ AV K Y  E
Subjt:  NESVEIGAGDLVVFPKGMSCTWDVSVAVDKHYKFE

AT4G10290.1 RmlC-like cupins superfamily protein1.8e-1644.55Show/hide
Query:  IRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAG---SNESVEIGAGDLVVFPKGMSCTWDVSVAVDKH
        I A  + TE  G+KV +  S++KL ELGV  W  W   P KFPW +   ET Y +EGK+KV         E++E  AGDLVVFP+ M+   DV   V K 
Subjt:  IRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAG---SNESVEIGAGDLVVFPKGMSCTWDVSVAVDKH

Query:  Y
        Y
Subjt:  Y

AT4G10300.1 RmlC-like cupins superfamily protein4.1e-4570Show/hide
Query:  LTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSNESVEIGAGDLVVFP
        LT+ +N   Y  +R  S+     + +TE+LGI +EKNP ESKLT+LGVR WPKWGC PSKFPWTYS KETCYLL+GKVKV P GS+E VEI AGD VVFP
Subjt:  LTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSNESVEIGAGDLVVFP

Query:  KGMSCTWDVSVAVDKHYKFE
        KGMSCTWDVSVAVDKHY+FE
Subjt:  KGMSCTWDVSVAVDKHYKFE

AT4G28703.1 RmlC-like cupins superfamily protein8.7e-2752.88Show/hide
Query:  MATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKV----TPAGSNES-----VEIGAGDLVVFPKGMSCTWDVSVAVDK
        MA +   I VE+NPS+++L EL  + WPKWGC P K+   Y  +E CY+L GKVKV     P+ S+++     VE GAGD+V FPKG+SCTWDVS++VDK
Subjt:  MATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKV----TPAGSNES-----VEIGAGDLVVFPKGMSCTWDVSVAVDK

Query:  HYKF
        HY F
Subjt:  HYKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTGCTGTGGACGGCTTCTTATTACTAAATTTAAAGCCACAATCTCTCCTCACCAAACCTACAAACGTTTCTTTGTACTTTGGGAAAAGGGCACCGTCTCTTCA
AATTCGAGCAGATTCCATGGCTACTGAGAGGCTGGGGATCAAAGTTGAGAAGAATCCATCTGAATCTAAGCTTACTGAACTCGGCGTTCGTCAATGGCCTAAGTGGGGTT
GTGGACCAAGCAAATTCCCATGGACATATTCAGACAAAGAGACATGCTATCTGCTAGAAGGAAAGGTAAAAGTTACTCCTGCTGGATCAAATGAATCTGTTGAGATTGGC
GCTGGGGATTTGGTTGTGTTTCCAAAAGGAATGAGCTGCACTTGGGATGTCTCTGTTGCTGTGGATAAGCACTATAAGTTTGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTGCTGTGGACGGCTTCTTATTACTAAATTTAAAGCCACAATCTCTCCTCACCAAACCTACAAACGTTTCTTTGTACTTTGGGAAAAGGGCACCGTCTCTTCA
AATTCGAGCAGATTCCATGGCTACTGAGAGGCTGGGGATCAAAGTTGAGAAGAATCCATCTGAATCTAAGCTTACTGAACTCGGCGTTCGTCAATGGCCTAAGTGGGGTT
GTGGACCAAGCAAATTCCCATGGACATATTCAGACAAAGAGACATGCTATCTGCTAGAAGGAAAGGTAAAAGTTACTCCTGCTGGATCAAATGAATCTGTTGAGATTGGC
GCTGGGGATTTGGTTGTGTTTCCAAAAGGAATGAGCTGCACTTGGGATGTCTCTGTTGCTGTGGATAAGCACTATAAGTTTGAATAG
Protein sequenceShow/hide protein sequence
MASAVDGFLLLNLKPQSLLTKPTNVSLYFGKRAPSLQIRADSMATERLGIKVEKNPSESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSNESVEIG
AGDLVVFPKGMSCTWDVSVAVDKHYKFE