CuGenDBv2

Cucsat.G18649 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G18649
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: SOUL heme-binding family protein isoform 3
Genome location: ctg3379:2983182..2987071
RNA-Seq Expression: Cucsat.G18649
Synteny: Cucsat.G18649
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains:
  IPR006917 - SOUL haem-binding protein
  IPR011256 - Regulatory factor, effector binding domain superfamily
  IPR018790 - Protein of unknown function DUF2358
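
The genome location field follows a `contig:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field can be split and the span length computed as follows. `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

contig, start, end = parse_location("ctg3379:2983182..2987071")
# Span length under the assumption of 1-based, inclusive coordinates.
span_length = end - start + 1
print(contig, start, end, span_length)
```

Under that assumption the gene spans 3890 bases on contig ctg3379.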


Homology

GenBank top hits (columns: hit | e-value | % identity | alignment)

KAA0056215.1 SOUL heme-binding family protein isoform 3 [Cucumis melo var. makuwa] | e-value: 1.65e-161 | % identity: 87.75
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAADH ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTA MKF LLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISG
        FKQFRFYE  ELELPKYQTL RT NYEVRKYGPFA AERSGENLF CVNS+ G
Subjt:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISG

TYK14496.1 uncharacterized protein E5676_scaffold15G00070 [Cucumis melo var. makuwa] | e-value: 1.03e-205 | % identity: 84.75
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAAD+ ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDP+AYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTA MKF LLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK
        FKQFRFYE  ELELPKYQTL RT NYEVRKYGPFA AERSGENLF CVNS+ GWGDCKEDDRIM+LRNK GGIAAVLNFSGKATEE VKNKAKELRHYLK
Subjt:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK

Query:  KDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
        KDGL+SVNN SCLL              RNEVLIWLQDFSI
Subjt:  KDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

XP_004151455.2 uncharacterized protein LOC101205468 [Cucumis sativus] | e-value: 3.26e-254 | % identity: 99.41
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKP NHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK
        FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSI GWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK
Subjt:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK

Query:  DGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
        DGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
Subjt:  DGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

XP_008464979.1 PREDICTED: uncharacterized protein LOC103502719 [Cucumis melo] | e-value: 3.86e-194 | % identity: 88.7
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAADH ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTA MKF LLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK
        FKQFRFYE  ELELPKYQTL RT NYEVRKYGPFA AERSGENLF CVNS+ GWGDCKEDDRIM+LRNK GGIAAVLNFSGKATEE VKNKAKELRHYLK
Subjt:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK

Query:  K
        K
Subjt:  K

XP_038879853.1 uncharacterized protein LOC120071584 isoform X1 [Benincasa hispida] | e-value: 1.32e-165 | % identity: 70.52
Query:  MAPAQVLSIPTASFGFRARTSDGPT-RTIAAAIT-QKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIE
        MAP Q LSIP   FGF  R S GPT RTIAAA    KP  H+HNQ+  V SK+      H+RP KS VDV++LV+FLY+DLHHVFDEQGID TAYDEE+ 
Subjt:  MAPAQVLSIPTASFGFRARTSDGPT-RTIAAAIT-QKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIE

Query:  FRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW
        FRDPITKY +I GYLLNIALL  FF P++ILHWVKKTGPYEITTRWTA MKF +LPWKPE V+TGTSIM INP+TGKFC HVDLWDSVQNNDYFSIEGLW
Subjt:  FRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW

Query:  DVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECV----NSISGWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKEL
        DVFKQ RFYETPELE PKYQ LKRT NYEVRKY P   AE SGENL  C     + +  W +CKED      +N+GGIAAVL FSGK+T+++V+NKAK+L
Subjt:  DVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECV----NSISGWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKEL

Query:  RHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
        RH LKKDGLK +NN+S LL RYN+S  TWSFVMRNEVLIWL+DFSI
Subjt:  RHYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

TrEMBL top hits (columns: hit | e-value | % identity | alignment)

A0A0A0LU04 Uncharacterized protein | e-value: 1.58e-254 | % identity: 99.41
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKP NHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK
        FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSI GWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK
Subjt:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK

Query:  DGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
        DGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
Subjt:  DGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

A0A1S3CN82 uncharacterized protein LOC103502719 | e-value: 1.87e-194 | % identity: 88.7
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAADH ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTA MKF LLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK
        FKQFRFYE  ELELPKYQTL RT NYEVRKYGPFA AERSGENLF CVNS+ GWGDCKEDDRIM+LRNK GGIAAVLNFSGKATEE VKNKAKELRHYLK
Subjt:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK

Query:  K
        K
Subjt:  K

A0A5A7URG2 SOUL heme-binding family protein isoform 3 | e-value: 7.99e-162 | % identity: 87.75
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAADH ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTA MKF LLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISG
        FKQFRFYE  ELELPKYQTL RT NYEVRKYGPFA AERSGENLF CVNS+ G
Subjt:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISG

A0A5D3CVR4 Uncharacterized protein | e-value: 4.98e-206 | % identity: 84.75
Query:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR
        MAPA VLS+PT S GFR R SDG T+TIAAA TQ+P  HNHNQ+ VVGSKLAAAD+ ++RPTKSRVDVD+LVKFLYDDLHHVFDEQGIDP+AYDEEIEFR
Subjt:  MAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTA MKF LLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK
        FKQFRFYE  ELELPKYQTL RT NYEVRKYGPFA AERSGENLF CVNS+ GWGDCKEDDRIM+LRNK GGIAAVLNFSGKATEE VKNKAKELRHYLK
Subjt:  FKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK

Query:  KDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
        KDGL+SVNN SCLL              RNEVLIWLQDFSI
Subjt:  KDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

A0A6J1HKM5 uncharacterized protein LOC111465022 | e-value: 1.95e-151 | % identity: 66.09
Query:  MAPAQV-----LSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDE
        MA AQV     LSIPT  FG R R S GPTR  A + T  P     N    + S LA  D  H++PT   VDVD+LV F+YDDL HVFDEQGID TAYDE
Subjt:  MAPAQV-----LSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDE

Query:  EIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIE
        E+ FRDPITKY  I GY+LNIALLR+FF P+IILHWVKKTGPYEITTRWTA MKF LLPWKPE VLTGTSIM INP TGKFC HVDLWDS+QNNDYFS+E
Subjt:  EIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIE

Query:  GLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELR
         LWDVFKQFRFYETPELE PKYQ LKRT NYEVRKY PF   ER+G  +    N +  + D K++D +     +GGI AVL FSG  TE+  + KAKELR
Subjt:  GLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELR

Query:  HYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
          LKKDGLK +N   CLL RYNDS  TW FVMRNEV+IWLQ+FSI
Subjt:  HYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

SwissProt top hits (columns: hit | e-value | % identity | alignment)
No hits found

Arabidopsis top hits (columns: hit | e-value | % identity | alignment)

AT2G46100.1 Nuclear transport factor 2 (NTF2) family protein | e-value: 7.4e-04 | % identity: 27.27
Query:  IDPTAYDEEIEFRDPITKYGDIR---------GYLL---NIALLR-QFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTG
        + P  Y+E+ EF DP   +  +          G L+   N+ L++ + F  + I HW           +++  M F   PWKP    TG +    +  +G
Subjt:  IDPTAYDEEIEFRDPITKYGDIR---------GYLL---NIALLR-QFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTG

Query:  KFCRHVDLWD
        K CRHV+ W+
Subjt:  KFCRHVDLWD

AT5G20140.1 SOUL heme-binding family protein | e-value: 2.6e-89 | % identity: 49.12
Query:  LVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRW
        L VG ++A+A         S V++++LV FLY+DL H+FD+QGID TAYDE ++FRDPITK+  I GYL NIA L+  F+PQ  LHW K+TGPYEITTRW
Subjt:  LVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRW

Query:  TAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENL
        T  MKF  LPWKPE V TG SIM +NP T KFC H+DLWDS++NNDYFS+EGL DVFKQ R Y+TP+LE PKYQ LKRT NYEVR Y PF   E  G+ L
Subjt:  TAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENL

Query:  --FECVNSISGWGDCK--------------------------------------------EDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELRHYL
              N+++G+   K                                             ++++   + +GG AA + FSGK TE+ V+ K  ELR  L
Subjt:  --FECVNSISGWGDCK--------------------------------------------EDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELRHYL

Query:  KKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
         KDGL++     C+L RYND   TW+F+MRNEV+IWL+DFS+
Subjt:  KKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI

AT5G20140.2 SOUL heme-binding family protein | e-value: 1.8e-82 | % identity: 47.49
Query:  LVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRW
        L VG ++A+A         S V++++LV FLY+DL H+FD+QGID TAYDE ++FRDPITK+  I GYL NIA L+  F+PQ  LHW K+TGPYEITTRW
Subjt:  LVVGSKLAAADHPHRRPTKSRVDVDQLVKFLYDDLHHVFDEQGIDPTAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRW

Query:  TAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENL
        T  MKF  LPWKPE V TG SIM +NP T KFC H+DLWDS++NNDYFS+EGL DVFKQ R Y+TP+LE PKYQ LKRT NYEVR Y PF   E  G+ L
Subjt:  TAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENL

Query:  --FECVNSISGWGDCK--------------------------------------------EDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELRHYL
              N+++G+   K                                             ++++   + +GG AA + FSGK TE+ V+ K  ELR  L
Subjt:  --FECVNSISGWGDCK--------------------------------------------EDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELRHYL

Query:  KKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQD
         KDGL++     C+L RYND   TW+F+M ++VL +  D
Subjt:  KKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQD
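
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how identities and positives are tallied for a single block, the hypothetical `midline_stats` helper below counts them. It covers one block only, so it does not reproduce the whole-alignment % identity figures listed for each hit.

```python
def midline_stats(query, midline):
    """Count identities and positives in one BLAST-style alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter on the midline that equals the query residue is an identity.
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Last block of the AT2G46100.1 alignment shown above.
identities, positives, length = midline_stats("KFCRHVDLWD", "K CRHV+ W+")
print(f"{identities}/{length} identities, {positives}/{length} positives")
```

For this ten-residue block the count is 6 identities and 8 positives.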


Sequences

CDS sequence
TATAAAATAAAAAGAGTAGATAATTATGTTTGGTGCATCGAAATTTGTAATAAACTAGATATAAACATAGGTATTCGTTCTCTCAATTCCAAATTCCAAATTCTAATGGC
TCCAGCCCAAGTTCTCTCAATCCCAACCGCTAGCTTTGGTTTCCGGGCAAGGACCTCCGACGGACCAACCAGAACCATAGCCGCTGCCATAACTCAGAAGCCTCGCAATC
ACAATCACAACCAAGATTTGGTTGTTGGATCAAAACTAGCAGCAGCAGATCACCCACATAGAAGGCCAACGAAATCGAGGGTGGACGTTGACCAATTGGTGAAATTCTTG
TACGATGATCTCCACCACGTGTTCGATGAACAAGGAATTGATCCGACGGCTTACGATGAAGAAATAGAATTTCGAGATCCAATTACAAAATACGGTGACATAAGGGGATA
TTTGTTAAATATTGCTCTCTTGCGACAATTCTTTAGTCCTCAGATCATCTTGCATTGGGTTAAAAAGACAGGACCGTATGAAATAACAACAAGATGGACTGCAGCAATGA
AGTTTGCTCTTCTACCATGGAAACCAGAATGTGTTTTGACAGGAACTTCAATCATGACCATTAATCCAAACACTGGCAAGTTTTGTAGACATGTGGATCTTTGGGATTCA
GTCCAAAATAATGACTACTTTTCTATAGAAGGCCTTTGGGATGTATTTAAACAGTTTCGTTTTTATGAGACTCCAGAATTGGAATTGCCCAAATATCAGACATTGAAAAG
GACTGAAAATTATGAGGTGAGAAAATATGGACCATTTGCTGCGGCAGAAAGAAGTGGAGAGAACTTGTTTGAGTGTGTCAATAGCATCAGCGGTTGGGGAGATTGTAAAG
AAGACGACAGAATCATGGAGTTGAGAAATAAAGGAGGGATTGCTGCAGTGTTGAATTTCAGTGGAAAAGCTACAGAAGAAAAGGTGAAAAACAAAGCCAAAGAATTAAGA
CATTATCTCAAAAAAGATGGCCTCAAAAGCGTTAATAATAATAGCTGTTTACTTGTACGTTACAACGATTCCAACCATACATGGAGTTTCGTAATGAGAAATGAGGTGCT
AATATGGCTTCAAGATTTCTCAATTTAG

mRNA sequence
TATAAAATAAAAAGAGTAGATAATTATGTTTGGTGCATCGAAATTTGTAATAAACTAGATATAAACATAGGTATTCGTTCTCTCAATTCCAAATTCCAAATTCTAATGGC
TCCAGCCCAAGTTCTCTCAATCCCAACCGCTAGCTTTGGTTTCCGGGCAAGGACCTCCGACGGACCAACCAGAACCATAGCCGCTGCCATAACTCAGAAGCCTCGCAATC
ACAATCACAACCAAGATTTGGTTGTTGGATCAAAACTAGCAGCAGCAGATCACCCACATAGAAGGCCAACGAAATCGAGGGTGGACGTTGACCAATTGGTGAAATTCTTG
TACGATGATCTCCACCACGTGTTCGATGAACAAGGAATTGATCCGACGGCTTACGATGAAGAAATAGAATTTCGAGATCCAATTACAAAATACGGTGACATAAGGGGATA
TTTGTTAAATATTGCTCTCTTGCGACAATTCTTTAGTCCTCAGATCATCTTGCATTGGGTTAAAAAGACAGGACCGTATGAAATAACAACAAGATGGACTGCAGCAATGA
AGTTTGCTCTTCTACCATGGAAACCAGAATGTGTTTTGACAGGAACTTCAATCATGACCATTAATCCAAACACTGGCAAGTTTTGTAGACATGTGGATCTTTGGGATTCA
GTCCAAAATAATGACTACTTTTCTATAGAAGGCCTTTGGGATGTATTTAAACAGTTTCGTTTTTATGAGACTCCAGAATTGGAATTGCCCAAATATCAGACATTGAAAAG
GACTGAAAATTATGAGGTGAGAAAATATGGACCATTTGCTGCGGCAGAAAGAAGTGGAGAGAACTTGTTTGAGTGTGTCAATAGCATCAGCGGTTGGGGAGATTGTAAAG
AAGACGACAGAATCATGGAGTTGAGAAATAAAGGAGGGATTGCTGCAGTGTTGAATTTCAGTGGAAAAGCTACAGAAGAAAAGGTGAAAAACAAAGCCAAAGAATTAAGA
CATTATCTCAAAAAAGATGGCCTCAAAAGCGTTAATAATAATAGCTGTTTACTTGTACGTTACAACGATTCCAACCATACATGGAGTTTCGTAATGAGAAATGAGGTGCT
AATATGGCTTCAAGATTTCTCAATTTAG

Protein sequence
YKIKRVDNYVWCIEICNKLDINIGIRSLNSKFQILMAPAQVLSIPTASFGFRARTSDGPTRTIAAAITQKPRNHNHNQDLVVGSKLAAADHPHRRPTKSRVDVDQLVKFL
YDDLHHVFDEQGIDPTAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGPYEITTRWTAAMKFALLPWKPECVLTGTSIMTINPNTGKFCRHVDLWDS
VQNNDYFSIEGLWDVFKQFRFYETPELELPKYQTLKRTENYEVRKYGPFAAAERSGENLFECVNSISGWGDCKEDDRIMELRNKGGIAAVLNFSGKATEEKVKNKAKELR
HYLKKDGLKSVNNNSCLLVRYNDSNHTWSFVMRNEVLIWLQDFSI
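
The first residues of the protein sequence above (`YKIKRVDNYVW...`) match a translation of the listed CDS read from its first base with the standard genetic code. The sketch below illustrates that check on two short excerpts of the CDS. The `translate` function and the compact codon table are illustrative choices, not the database's own pipeline, and unknown codons are mapped to `X` as an assumption.

```python
from itertools import product

# Standard genetic code in the compact TCAG ordering ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string from its first base; trailing bases are ignored."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for codons with unknown bases
        for i in range(0, len(dna) - 2, 3)
    )

# First 33 bases of the CDS listed above.
print(translate("TATAAAATAAAAAGAGTAGATAATTATGTTTGG"))
# Excerpt starting at the ATG that precedes the MAPAQVL... region.
print(translate("ATGGCTCCAGCCCAAGTTCTC"))
```

The two excerpts translate to `YKIKRVDNYVW` and `MAPAQVL`, consistent with the listed protein sequence and with the query used in the alignments.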