; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G28840 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G28840
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionCupin_5 domain-containing protein
Genome locationChr1:23374782..23377041
RNA-Seq ExpressionCSPI01G28840
SyntenyCSPI01G28840
Gene Ontology termsGO:0006744 - ubiquinone biosynthetic process (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0008289 - lipid binding (molecular function)
InterPro domainsIPR009327 - Cupin domain of unknown function DUF985
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold
IPR039935 - Uncharacterized protein YML079W-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK10013.1 Cupin_5 domain-containing protein [Cucumis melo var. makuwa]5.4e-10697.35Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATAS+IV KLNLKPHPEGGFYSETFRDYSVHLSKS LPPEYKVDREVSTCIYFLMPSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

XP_004144040.1 uncharacterized protein LOC101210771 isoform X1 [Cucumis sativus]4.0e-109100Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

XP_008450945.1 PREDICTED: uncharacterized protein LOC103492382 [Cucumis melo]4.6e-10596.83Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATAS+IV KLNLKPHPEGGFYSETFRDYSVHLSKS LP EYKVDREVSTCIYFLMPSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

XP_022147566.1 uncharacterized protein LOC111016461 [Momordica charantia]3.8e-9989.95Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATASEIV KL+LKPHPEGGFYSETFRD SVHLSK+HLPP+YKV REVSTCIYFL+PSGCVS+LHRIPCAETWHFYLGEPLT+LELN+ DGRVKLTCLG 
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFG+FPTKDF ISADGT+TKAAPRDS+NHYSLVGC+CAPAFQFEDFELAKRSDLVSRFPDSEA VSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

XP_038878514.1 uncharacterized protein LOC120070727 [Benincasa hispida]4.3e-10394.15Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATASEIV KLNLKPHPEGGFY+ETFRD SVHLSKSHLPPEYKVDREVSTCIYFL+PSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPD
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGT+TKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDL SRFP+SEA +SLLTP+
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPD

TrEMBL top hitse value%identityAlignment
A0A0A0M280 Cupin_5 domain-containing protein1.9e-109100Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

A0A1S3BPT5 uncharacterized protein LOC1034923822.2e-10596.83Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATAS+IV KLNLKPHPEGGFYSETFRDYSVHLSKS LP EYKVDREVSTCIYFLMPSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

A0A5A7UIT0 Cupin_5 domain-containing protein2.2e-10596.83Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATAS+IV KLNLKPHPEGGFYSETFRDYSVHLSKS LP EYKVDREVSTCIYFLMPSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

A0A5D3CDS0 Cupin_5 domain-containing protein2.6e-10697.35Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATAS+IV KLNLKPHPEGGFYSETFRDYSVHLSKS LPPEYKVDREVSTCIYFLMPSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

A0A6J1D2R9 uncharacterized protein LOC1110164611.8e-9989.95Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATASEIV KL+LKPHPEGGFYSETFRD SVHLSK+HLPP+YKV REVSTCIYFL+PSGCVS+LHRIPCAETWHFYLGEPLT+LELN+ DGRVKLTCLG 
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFG+FPTKDF ISADGT+TKAAPRDS+NHYSLVGC+CAPAFQFEDFELAKRSDLVSRFPDSEA VSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19130.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF985 (InterPro:IPR009327), RmlC-like jelly roll fold (InterPro:IPR014710); Has 1465 Blast hits to 1465 proteins in 584 species: Archae - 10; Bacteria - 1038; Metazoa - 19; Fungi - 43; Plants - 51; Viruses - 0; Other Eukaryotes - 304 (source: NCBI BLink).1.1e-7771.51Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        M  +SEIVGKLNL+ H EGGF++ETFRD SV LS S LPP +KVDR VST IYFL+PSG VS LHRIP AETWHFYLGEPLTV+EL + DG++K TCLG 
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLT
        DL   +Q PQYTVPPNVWFG+FPTKD + S DG L KA  RDSENH+SLVGC+CAPAFQFEDFELAKRSDL+SRFP  E+ +++L+
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACAGCATCAGAAATTGTAGGGAAATTGAATCTGAAGCCACATCCCGAAGGCGGTTTTTACTCTGAAACCTTCAGGGACTACTCCGTGCATCTCTCCAAATCTCA
CCTCCCACCCGAGTACAAGGTTGATCGAGAGGTCAGTACGTGTATATACTTTTTGATGCCGTCTGGATGTGTTTCTTCTCTTCATCGTATACCATGTGCTGAGACTTGGC
ATTTTTACTTGGGAGAACCACTTACGGTATTAGAGTTGAATGAAAAGGACGGTCGAGTCAAATTGACTTGTCTTGGGTCTGATCTCATTGGAGATAATCAACTACCTCAG
TATACAGTGCCTCCTAATGTTTGGTTTGGTGCTTTTCCAACCAAAGATTTCAATATTTCTGCTGATGGAACTCTGACCAAAGCTGCTCCAAGGGACTCTGAGAATCACTA
CTCGCTTGTGGGCTGCAGCTGTGCACCTGCTTTCCAGTTTGAGGACTTTGAGTTGGCAAAACGCTCTGATCTTGTTTCACGTTTTCCCGATAGTGAAGCATTCGTCTCGT
TGCTGACACCAGATGCCTGA
mRNA sequenceShow/hide mRNA sequence
GTTGTCTATCAAAATATTTTGTTATATTAATATGAAATTATCAGTAATGAGTGATTGCTTTAGGTTGAAGGAGGTTTGGTGAAGAACAAAAGAAGCAAAAAACAATGGCT
ACAGCATCAGAAATTGTAGGGAAATTGAATCTGAAGCCACATCCCGAAGGCGGTTTTTACTCTGAAACCTTCAGGGACTACTCCGTGCATCTCTCCAAATCTCACCTCCC
ACCCGAGTACAAGGTTGATCGAGAGGTCAGTACGTGTATATACTTTTTGATGCCGTCTGGATGTGTTTCTTCTCTTCATCGTATACCATGTGCTGAGACTTGGCATTTTT
ACTTGGGAGAACCACTTACGGTATTAGAGTTGAATGAAAAGGACGGTCGAGTCAAATTGACTTGTCTTGGGTCTGATCTCATTGGAGATAATCAACTACCTCAGTATACA
GTGCCTCCTAATGTTTGGTTTGGTGCTTTTCCAACCAAAGATTTCAATATTTCTGCTGATGGAACTCTGACCAAAGCTGCTCCAAGGGACTCTGAGAATCACTACTCGCT
TGTGGGCTGCAGCTGTGCACCTGCTTTCCAGTTTGAGGACTTTGAGTTGGCAAAACGCTCTGATCTTGTTTCACGTTTTCCCGATAGTGAAGCATTCGTCTCGTTGCTGA
CACCAGATGCCTGATTCAACTGAAGAAGACTGACCCTTTTTCTGGAAATCATAGCTGAGATACTGATTATGAGGAGATCTGGATTGTATGATGTGATTTTGATCTCTGTT
CATTTTCAAATCACGGTGTAGTTAAAGCTTTTAGTCAGTGGATTGTGTTCTAGTGGAACCTCTTATGGCTTACTGAAGATAATGATAATGAATAAGGCTCTCTTTTACAA
CGAATCTTGTAAATCCAATATGTTTTCAGGTTCTGTGGATGCTGCCCCTCACATGAAAAGAGAAATGTACCCACCACAAAAGGAAAAAATCAGTGAGAGAGACCTTACCT
CTGCCTCTCTCTCTCTCTCTCTCTCTCTCTCCATGAGTTTAAGTGTAAAGAATGTGGGCCATCTGTAAAGTTCTTGATTTCATTCTATGATTTAGACATATCCCTAATTT
ATTTTCTTCCATTCTAGAATATTGTCAGAACAATAATTTTGAGAATTTCTGGTGAAAGACATGTGATAAGCAAATTGTGATTTCAACATCTTTTACTTATAGGAAGGAAA
GATTACGAAAGAATTATTCATCACGGC
Protein sequenceShow/hide protein sequence
MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGSDLIGDNQLPQ
YTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA