CuGenDBv2

CSPI01G19830 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G19830
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: SOUL heme-binding family protein
Genome location: Chr1:15579355..15581741
RNA-Seq Expression: CSPI01G19830
Synteny: CSPI01G19830
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR006917 - SOUL haem-binding protein
                  IPR011256 - Regulatory factor, effector binding domain superfamily
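
The genome location field uses a chromosome:start..end notation. As a minimal sketch, the string can be split into its parts and the span of the gene computed. The helper names parse_location and span_length are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the record itself does not state.

```python
import re


def parse_location(location):
    """Split a string such as 'Chr1:15579355..15581741' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive (not confirmed by the record).
    return end - start + 1


chrom, start, end = parse_location("Chr1:15579355..15581741")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2387 bp on Chr1; with a 0-based, half-open convention the figure would be 2386 bp.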


Homology
GenBank top hits (e-value, % identity, alignment)
TYK14496.1 uncharacterized protein E5676_scaffold15G00070 [Cucumis melo var. makuwa]; e-value 4.0e-88; identity 86.08%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
        MKF LLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYE  ELELPKYQTL RT NYEVRKYGPFA AERSGENLF C
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC

Query:  VNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
        VNS+GGWGDCKEDDRIMKLRNK GGIAAVLNFSGKATEE VKNKAKELRHYLKKDGL+SV NNSCLL              RNEVLIWLQDFSI
Subjt:  VNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

XP_004151455.2 uncharacterized protein LOC101205468 [Cucumis sativus]; e-value 1.4e-109; identity 99.48%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
        MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC

Query:  VNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
        VNSIGGWGDCKEDDRIM+LRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
Subjt:  VNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

XP_008464979.1 PREDICTED: uncharacterized protein LOC103502719 [Cucumis melo]; e-value 1.1e-77; identity 92.86%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
        MKF LLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYE  ELELPKYQTL RT NYEVRKYGPFA AERSGENLF C
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC

Query:  VNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLKK
        VNS+GGWGDCKEDDRIMKLRNK GGIAAVLNFSGKATEE VKNKAKELRHYLKK
Subjt:  VNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLKK

XP_038879853.1 uncharacterized protein LOC120071584 isoform X1 [Benincasa hispida]; e-value 1.1e-74; identity 72.59%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
        MKF +LPWKPE V+TGTSIM INP+TGKFC HVDLWDSVQNNDYFSIEGLWDVFKQ RFYETPELE PKYQ LKRT NYEVRKY P   AE SGENL  C
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC

Query:  V----NSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
             + +G W +CKED      +N+GGIAAVL FSGK+T+++V+NKAK+LRH LKKDGLK +NN+S LL RYN+S  TWSFVMRNEVLIWL+DFSI
Subjt:  V----NSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

XP_038879854.1 uncharacterized protein LOC120071584 isoform X2 [Benincasa hispida]; e-value 1.1e-74; identity 72.59%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
        MKF +LPWKPE V+TGTSIM INP+TGKFC HVDLWDSVQNNDYFSIEGLWDVFKQ RFYETPELE PKYQ LKRT NYEVRKY P   AE SGENL  C
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC

Query:  V----NSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
             + +G W +CKED      +N+GGIAAVL FSGK+T+++V+NKAK+LRH LKKDGLK +NN+S LL RYN+S  TWSFVMRNEVLIWL+DFSI
Subjt:  V----NSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LU04 Uncharacterized protein; e-value 6.8e-110; identity 99.48%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
        MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC

Query:  VNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
        VNSIGGWGDCKEDDRIM+LRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
Subjt:  VNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

A0A1S3CN82 uncharacterized protein LOC103502719; e-value 5.3e-78; identity 92.86%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
        MKF LLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYE  ELELPKYQTL RT NYEVRKYGPFA AERSGENLF C
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC

Query:  VNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLKK
        VNS+GGWGDCKEDDRIMKLRNK GGIAAVLNFSGKATEE VKNKAKELRHYLKK
Subjt:  VNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLKK

A0A5D3CVR4 Uncharacterized protein; e-value 1.9e-88; identity 86.08%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
        MKF LLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYE  ELELPKYQTL RT NYEVRKYGPFA AERSGENLF C
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC

Query:  VNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
        VNS+GGWGDCKEDDRIMKLRNK GGIAAVLNFSGKATEE VKNKAKELRHYLKKDGL+SV NNSCLL              RNEVLIWLQDFSI
Subjt:  VNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

A0A6J1EZQ2 uncharacterized protein LOC111440839; e-value 9.0e-70; identity 68.39%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
        MKF LLPWKPE VLTGTSIM INP TGKFC HVDLWDS+QNNDYFS+E LWDVFKQFRFYETPELE PKYQ LKRT NYEVRKY PF   ER+G  +   
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC

Query:  VNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
         N +G   D K++D +     +GGI AVL FSG  TE+  + KAKELR  LKKDGLK +  N CLL RYN S  TWSFVMRNEVLIWL++FSI
Subjt:  VNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

A0A6J1HKM5 uncharacterized protein LOC111465022; e-value 2.4e-70; identity 68.39%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC
        MKF LLPWKPE VLTGTSIM INP TGKFC HVDLWDS+QNNDYFS+E LWDVFKQFRFYETPELE PKYQ LKRT NYEVRKY PF   ER+G  +   
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFEC

Query:  VNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
         N +G + D K++D +     +GGI AVL FSG  TE+  + KAKELR  LKKDGLK +  N CLL RYNDS  TW FVMRNEV+IWLQ+FSI
Subjt:  VNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT5G20140.1 SOUL heme-binding family protein; e-value 2.0e-53; identity 45.61%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENL--F
        MKF  LPWKPE V TG SIM +NP T KFC H+DLWDS++NNDYFS+EGL DVFKQ R Y+TP+LE PKYQ LKRT NYEVR Y PF   E  G+ L   
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENL--F

Query:  ECVNSIGGWGDCK--------------------------------------------EDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKD
           N++ G+   K                                             ++++   + +GG AA + FSGK TE+ V+ K  ELR  L KD
Subjt:  ECVNSIGGWGDCK--------------------------------------------EDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKD

Query:  GLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
        GL++     C+L RYND   TW+F+MRNEV+IWL+DFS+
Subjt:  GLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

AT5G20140.2 SOUL heme-binding family protein; e-value 1.8e-46; identity 43.22%
Query:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENL--F
        MKF  LPWKPE V TG SIM +NP T KFC H+DLWDS++NNDYFS+EGL DVFKQ R Y+TP+LE PKYQ LKRT NYEVR Y PF   E  G+ L   
Subjt:  MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENL--F

Query:  ECVNSIGGWGDCK--------------------------------------------EDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKD
           N++ G+   K                                             ++++   + +GG AA + FSGK TE+ V+ K  ELR  L KD
Subjt:  ECVNSIGGWGDCK--------------------------------------------EDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKD

Query:  GLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQD
        GL++     C+L RYND   TW+F+M ++VL +  D
Subjt:  GLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQD


Sequences
CDS sequence
ATGAAGTTTGCTCTTCTACCATGGAAACCAGAATGTGTTTTGACAGGAACTTCAATCATGACCATTAATCCAAACACTGGCAAGTTTTGTAGACATGTGGATCTTTGGGA
TTCAGTCCAAAATAATGACTACTTTTCTATAGAAGGCCTTTGGGATGTATTTAAACAGTTTCGTTTTTATGAGACTCCAGAATTGGAATTGCCCAAATATCAGACATTGA
AAAGGACTGAAAATTATGAGGTGAGAAAATATGGACCATTTGCTGCGGCAGAAAGAAGTGGAGAGAACTTGTTTGAGTGTGTCAATAGCATCGGCGGTTGGGGAGATTGT
AAAGAAGACGACAGAATCATGAAGTTGAGAAATAAAGGAGGGATTGCTGCAGTGTTGAATTTCAGTGGAAAAGCTACAGAAGAAAAGGTGAAAAACAAAGCCAAAGAATT
AAGACATTATCTCAAAAAAGATGGCCTCAAAAGCGTTAATAATAATAGCTGTTTACTTGTACGTTACAACGATTCCAACCATACATGGAGTTTCGTAATGAGAAATGAGG
TGCTAATATGGCTTCAAGATTTCTCAATTTAG
mRNA sequence
AGACAGGACCGTATGAAATAACAACAAGATGGACTGCAGCAATGAAGTTTGCTCTTCTACCATGGAAACCAGAATGTGTTTTGACAGGAACTTCAATCATGACCATTAAT
CCAAACACTGGCAAGTTTTGTAGACATGTGGATCTTTGGGATTCAGTCCAAAATAATGACTACTTTTCTATAGAAGGCCTTTGGGATGTATTTAAACAGTTTCGTTTTTA
TGAGACTCCAGAATTGGAATTGCCCAAATATCAGACATTGAAAAGGACTGAAAATTATGAGGTGAGAAAATATGGACCATTTGCTGCGGCAGAAAGAAGTGGAGAGAACT
TGTTTGAGTGTGTCAATAGCATCGGCGGTTGGGGAGATTGTAAAGAAGACGACAGAATCATGAAGTTGAGAAATAAAGGAGGGATTGCTGCAGTGTTGAATTTCAGTGGA
AAAGCTACAGAAGAAAAGGTGAAAAACAAAGCCAAAGAATTAAGACATTATCTCAAAAAAGATGGCCTCAAAAGCGTTAATAATAATAGCTGTTTACTTGTACGTTACAA
CGATTCCAACCATACATGGAGTTTCGTAATGAGAAATGAGGTGCTAATATGGCTTCAAGATTTCTCAATTTAGTACAACATGCAACTATGCAAGTCCAAACACGGCTTTC
TGAATAACCCCAATTGTTTTATGTAAGTTATTTGATGTGTAATTAAATATTCGTAAGTCAAGTTGTTTGTTGCCTTATTTGGTTTGTCGATAACTATGAACTATTACTTG
TCCCACTTTGTAGGTCGGCCATACAATTATGACTTTCAACGTCACTGTTTAAAAAATCTAATTTGATATGTGGATTGGTTAAGACTTCTAACGTTGTATTGCTTCTTAAA
TTTATACTCTATTTAAATCATGAAAAGGACGACCTAAATGACTGGAG
Protein sequence
MKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSIGGWGDC
KEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
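
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; the translate function and the table-building constants are illustrative names and are not part of CuGenDB. Line breaks and other whitespace in a pasted sequence are ignored, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed in this record.
print(translate("ATGAAGTTTGCTCTTCTACCATGG"))  # MKFALLPW
```

Passing the full CDS and comparing the result with the listed protein sequence (with its line break removed) is a quick consistency check for the record.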