; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G19820 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G19820
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionSOUL heme-binding family protein isoform 3
Genome locationChr1:15577741..15578674
RNA-Seq ExpressionCSPI01G19820
SyntenyCSPI01G19820
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR018790 - Protein of unknown function DUF2358


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056215.1 SOUL heme-binding family protein isoform 3 [Cucumis melo var. makuwa]4.5e-5184Show/hide
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAADH ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSP
        DPITK+ DIRGYLLNIALLRQFFSP
Subjt:  DPITKYGDIRGYLLNIALLRQFFSP

TYK14496.1 uncharacterized protein E5676_scaffold15G00070 [Cucumis melo var. makuwa]6.5e-5082.4Show/hide
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAAD+ ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDP+AYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSP
        DPITK+ DIRGYLLNIALLRQFFSP
Subjt:  DPITKYGDIRGYLLNIALLRQFFSP

XP_004151455.2 uncharacterized protein LOC101205468 [Cucumis sativus]4.2e-65100Show/hide
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSP
        DPITKYGDIRGYLLNIALLRQFFSP
Subjt:  DPITKYGDIRGYLLNIALLRQFFSP

XP_008464979.1 PREDICTED: uncharacterized protein LOC103502719 [Cucumis melo]4.5e-5184Show/hide
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAADH ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSP
        DPITK+ DIRGYLLNIALLRQFFSP
Subjt:  DPITKYGDIRGYLLNIALLRQFFSP

XP_038879854.1 uncharacterized protein LOC120071584 isoform X2 [Benincasa hispida]1.9e-3365.35Show/hide
Query:  MAPAQVLSIPTASFGFRARTSDGPT-RTI-AAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIE
        MAP Q LSIP   FGF  R S GPT RTI AAA   KPH+HN N    V SK+      H+RP KS VDV++LV+FLY+DLHHVFDEQGID TAYDEE+ 
Subjt:  MAPAQVLSIPTASFGFRARTSDGPT-RTI-AAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIE

Query:  FRDPITKYGDIRGYLLNIALLRQFFSP
        FRDPITKY +I GYLLNIALL  FF P
Subjt:  FRDPITKYGDIRGYLLNIALLRQFFSP

TrEMBL top hitse value%identityAlignment
A0A0A0LU04 Uncharacterized protein2.0e-65100Show/hide
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSP
        DPITKYGDIRGYLLNIALLRQFFSP
Subjt:  DPITKYGDIRGYLLNIALLRQFFSP

A0A1S3CN82 uncharacterized protein LOC1035027192.2e-5184Show/hide
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAADH ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSP
        DPITK+ DIRGYLLNIALLRQFFSP
Subjt:  DPITKYGDIRGYLLNIALLRQFFSP

A0A5A7URG2 SOUL heme-binding family protein isoform 32.2e-5184Show/hide
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAADH ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSP
        DPITK+ DIRGYLLNIALLRQFFSP
Subjt:  DPITKYGDIRGYLLNIALLRQFFSP

A0A5D3CVR4 Uncharacterized protein3.2e-5082.4Show/hide
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAAD+ ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDP+AYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSP
        DPITK+ DIRGYLLNIALLRQFFSP
Subjt:  DPITKYGDIRGYLLNIALLRQFFSP

A0A6J1HKM5 uncharacterized protein LOC1114650221.4e-2659.23Show/hide
Query:  MAPAQV-----LSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDE
        MA AQV     LSIPT  FG R R S GPTR  A + T  P     N    + S L  AD  H++PT   VDVD+LV F+YDDL HVFDEQGID TAYDE
Subjt:  MAPAQV-----LSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDE

Query:  EIEFRDPITKYGDIRGYLLNIALLRQFFSP
        E+ FRDPITKY  I GY+LNIALLR+FF P
Subjt:  EIEFRDPITKYGDIRGYLLNIALLRQFFSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G20140.1 SOUL heme-binding family protein1.4e-1853.09Show/hide
Query:  LVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSP
        L VG ++A+A         S V++++LV FLY+DL H+FD+QGID TAYDE ++FRDPITK+  I GYL NIA L+  F+P
Subjt:  LVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSP

AT5G20140.2 SOUL heme-binding family protein1.4e-1853.09Show/hide
Query:  LVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSP
        L VG ++A+A         S V++++LV FLY+DL H+FD+QGID TAYDE ++FRDPITK+  I GYL NIA L+  F+P
Subjt:  LVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCAGCCCAAGTTCTCTCAATCCCAACCGCTAGCTTTGGTTTCCGGGCAAGGACCTCCGACGGACCAACCAGAACCATAGCCGCTGCCATAACTCAGAAGCCTCA
CAATCACAATCACAACCAAGATTTGGTTGTTGGATCAAAACTAGCAGCAGCAGATCATCCACATAGAAGGCCAACGAAATCGAGGGTGGACGTTGACCAATTGGTGAAAT
TCTTGTACGATGATCTCCACCACGTGTTCGATGAACAAGGAATTGATCCGACGGCTTACGATGAAGAAATAGAATTTCGAGATCCAATTACAAAATACGGTGACATAAGG
GGATATTTGTTAAATATTGCTCTCTTGCGACAATTCTTTAGTCCTCATCATCTTGCATTGGGTTAA
mRNA sequenceShow/hide mRNA sequence
CAATTTTTCTTATTATAAAATAAAAAGAGTAGATATTTATGTTTGGTGCATCGAAATTTGTAATAAACTAGATATAAACATAGGTATTTAATTCAAAGATTCTTCGTTCA
ATTCCAAATTCCAAATTCTAATGGCTCCAGCCCAAGTTCTCTCAATCCCAACCGCTAGCTTTGGTTTCCGGGCAAGGACCTCCGACGGACCAACCAGAACCATAGCCGCT
GCCATAACTCAGAAGCCTCACAATCACAATCACAACCAAGATTTGGTTGTTGGATCAAAACTAGCAGCAGCAGATCATCCACATAGAAGGCCAACGAAATCGAGGGTGGA
CGTTGACCAATTGGTGAAATTCTTGTACGATGATCTCCACCACGTGTTCGATGAACAAGGAATTGATCCGACGGCTTACGATGAAGAAATAGAATTTCGAGATCCAATTA
CAAAATACGGTGACATAAGGGGATATTTGTTAAATATTGCTCTCTTGCGACAATTCTTTAGTCCTCATCATCTTGCATTGGGTTAAAAAGGTTCCTTCCCATTTCCCTTT
TTATTATTATTATTGTTATATGCTTATTAACTATCTCCCTAATTATTACTATATCCATTATTTGTCTGAATTTTGTATGTTACAAGATTTCAATTAGAATAAACTCCATC
CACACCAGATAAATTCTCTCTTCTTCCCTCTTAGCTTTCTCTCCCTCTACATGTGTGTTCGTCGACTTGCAGTAATCATGTTCCTTGCCAATGGGTTGGAACTTGGATTC
ATCTTCGGCATGTGGAAGAGGGTGCTGCAAAACATGTGAGAGGAGAGAAAAAATAGAAAGTAATTTAGAGAAAGGAAGACGAAGATGTTGAAAGTTGACCAATACTTAAT
TTGGTGAAACCTTGCCACATTCATTTTTTTGTACATTATATATATATTTAATTG
Protein sequenceShow/hide protein sequence
MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPHNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFRDPITKYGDIR
GYLLNIALLRQFFSPHHLALG