; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G042190 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G042190
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionSOUL heme-binding family protein
Genome locationchrH02:23107470..23110610
RNA-Seq ExpressionChy2G042190
SyntenyChy2G042190
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056215.1 SOUL heme-binding family protein isoform 3 [Cucumis melo var. makuwa]2.44e-15586.17Show/hide
Query:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR
        MAP  VL +PTVS GFR RKSDG T+TIAAA TQ+PHNH  NQN  VGSKLAA DHQYKRPTKSRVDVDRLVKFLYDDL HVFDEQGIDP AYDEEIEFR
Subjt:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTG YEITTRWTA MKF LLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGG
        FKQ       ELE+PKYQTL RTANYEVRKYGPFA  ERSGENLF CVNS+GG
Subjt:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGG

TYK14496.1 uncharacterized protein E5676_scaffold15G00070 [Cucumis melo var. makuwa]8.42e-19983.58Show/hide
Query:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR
        MAP  VL +PTVS GFR RKSDG T+TIAAA TQ+PHNH  NQN  VGSKLAA D+QYKRPTKSRVDVDRLVKFLYDDL HVFDEQGIDP AYDEEIEFR
Subjt:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTG YEITTRWTA MKF LLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK
        FKQ       ELE+PKYQTL RTANYEVRKYGPFA  ERSGENLF CVNS+GGWGDCKEDDRIMKLRNK GGIAAVLNFSGKATEE VKNKAKELRHYLK
Subjt:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK

Query:  KDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM
        KDG +SVNN SCLL              RNEVLIWLQDFS+
Subjt:  KDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM

XP_004151455.2 uncharacterized protein LOC101205468 [Cucumis sativus]1.03e-22890.59Show/hide
Query:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR
        MAP QVL IPT SFGFRAR SDGPTRTIAAAITQKPHNH+HNQ+L VGSKLAA DH ++RPTKSRVDVD+LVKFLYDDL HVFDEQGIDP AYDEEIEFR
Subjt:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTG YEITTRWTAAMKFALLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK
        FKQ       ELE+PKYQTLKRT NYEVRKYGPFA  ERSGENLF+CVNSIGGWGDCKEDDRIM+LRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK
Subjt:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK

Query:  DGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM
        DG KSVNNNSCLLVRYNDSN TWSFVMRNEVLIWLQDFS+
Subjt:  DGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM

XP_008464979.1 PREDICTED: uncharacterized protein LOC103502719 [Cucumis melo]3.89e-18887.71Show/hide
Query:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR
        MAP  VL +PTVS GFR RKSDG T+TIAAA TQ+PHNH  NQN  VGSKLAA DHQYKRPTKSRVDVDRLVKFLYDDL HVFDEQGIDP AYDEEIEFR
Subjt:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTG YEITTRWTA MKF LLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK
        FKQ       ELE+PKYQTL RTANYEVRKYGPFA  ERSGENLF CVNS+GGWGDCKEDDRIMKLRNK GGIAAVLNFSGKATEE VKNKAKELRHYLK
Subjt:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK

Query:  K
        K
Subjt:  K

XP_038879853.1 uncharacterized protein LOC120071584 isoform X1 [Benincasa hispida]5.20e-15769.08Show/hide
Query:  MAPVQVLPIPTVSFGFRARKSDGPT-RTIAAAIT-QKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIE
        MAP Q L IP V FGF  RKS GPT RTIAAA    KPH  SHNQN  V SK+   DHQ  RP KS VDV+RLV+FLY+DL HVFDEQGID  AYDEE+ 
Subjt:  MAPVQVLPIPTVSFGFRARKSDGPT-RTIAAAIT-QKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIE

Query:  FRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLW
        FRDPITKY +I GYLLNIALL  FF P++ILHWVKKTG YEITTRWTA MKF +LPWKPE V+TGTSIM +NP+TGKFC HVDLWDSVQNNDYFSIEGLW
Subjt:  FRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLW

Query:  DVFKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCV----NSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKEL
        DVFKQ       ELE PKYQ LKRTANYEVRKY P    E SGENL  C     + +G W +CKED      +N+GGIAAVL FSGK+T+++V+NKAK+L
Subjt:  DVFKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCV----NSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKEL

Query:  RHYLKKDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM
        RH LKKDG K +NN+S LL RYN+S  TWSFVMRNEVLIWL+DFS+
Subjt:  RHYLKKDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM

TrEMBL top hitse value%identityAlignment
A0A0A0LU04 Uncharacterized protein1.8e-17990.59Show/hide
Query:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR
        MAP QVL IPT SFGFRAR SDGPTRTIAAAITQKPHNH+HNQ+L VGSKLAA DH ++RPTKSRVDVD+LVKFLYDDL HVFDEQGIDP AYDEEIEFR
Subjt:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTG YEITTRWTAAMKFALLPWKPECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK
        FKQ       ELE+PKYQTLKRT NYEVRKYGPFA  ERSGENLF+CVNSIGGWGDCKEDDRIM+LRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK
Subjt:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKK

Query:  DGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM
        DG KSVNNNSCLLVRYNDSN TWSFVMRNEVLIWLQDFS+
Subjt:  DGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM

A0A1S3CN82 uncharacterized protein LOC1035027198.3e-14887.71Show/hide
Query:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR
        MAP  VL +PTVS GFR RKSDG T+TIAAA TQ+PHN  HNQN  VGSKLAA DHQYKRPTKSRVDVDRLVKFLYDDL HVFDEQGIDP AYDEEIEFR
Subjt:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTG YEITTRWTA MKF LLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK
        FKQ       ELE+PKYQTL RTANYEVRKYGPFA  ERSGENLF CVNS+GGWGDCKEDDRIMKLRNK GGIAAVLNFSGKATEE VKNKAKELRHYLK
Subjt:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK

Query:  K
        K
Subjt:  K

A0A5A7URG2 SOUL heme-binding family protein isoform 31.3e-12186.17Show/hide
Query:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR
        MAP  VL +PTVS GFR RKSDG T+TIAAA TQ+PHN  HNQN  VGSKLAA DHQYKRPTKSRVDVDRLVKFLYDDL HVFDEQGIDP AYDEEIEFR
Subjt:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTG YEITTRWTA MKF LLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGG
        FKQ       ELE+PKYQTL RTANYEVRKYGPFA  ERSGENLF CVNS+GG
Subjt:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGG

A0A5D3CVR4 Uncharacterized protein2.9e-15683.58Show/hide
Query:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR
        MAP  VL +PTVS GFR RKSDG T+TIAAA TQ+PHN  HNQN  VGSKLAA D+QYKRPTKSRVDVDRLVKFLYDDL HVFDEQGIDP AYDEEIEFR
Subjt:  MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFR

Query:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
        DPITK+ DIRGYLLNIALLRQFFSPQIILHWVKKTG YEITTRWTA MKF LLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDV

Query:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK
        FKQ       ELE+PKYQTL RTANYEVRKYGPFA  ERSGENLF CVNS+GGWGDCKEDDRIMKLRNK GGIAAVLNFSGKATEE VKNKAKELRHYLK
Subjt:  FKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNK-GGIAAVLNFSGKATEEKVKNKAKELRHYLK

Query:  KDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM
        KDG +SV NNSCLL              RNEVLIWLQDFS+
Subjt:  KDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM

A0A6J1HKM5 uncharacterized protein LOC1114650221.5e-11263.19Show/hide
Query:  MAPVQV-----LPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDE
        MA  QV     L IPTV FG R RKS GPTR  A + T  P     N   ++ S LA  D ++++PT   VDVDRLV F+YDDL+HVFDEQGID  AYDE
Subjt:  MAPVQV-----LPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDE

Query:  EIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIE
        E+ FRDPITKY  I GY+LNIALLR+FF P+IILHWVKKTG YEITTRWTA MKF LLPWKPE VLTGTSIM +NP TGKFC HVDLWDS+QNNDYFS+E
Subjt:  EIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIE

Query:  GLWDVFKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELR
         LWDVFKQ       ELE PKYQ LKRTANYEVRKY PF   ER+G  +    N +G + D K++D +     +GGI AVL FSG  TE+  + KAKELR
Subjt:  GLWDVFKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELR

Query:  HYLKKDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM
          LKKDG K +  N CLL RYNDS +TW FVMRNEV+IWLQ+FS+
Subjt:  HYLKKDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46100.1 Nuclear transport factor 2 (NTF2) family protein6.6e-0425.25Show/hide
Query:  IDPMAYDEEIEFRDPITKYGDIRGYLLNIALLRQFF--SPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWD
        + P  Y+E+ EF DP   +  +  +  N          S   ++ W           +++  M F   PWKP    TG +    +  +GK CRHV+ W+
Subjt:  IDPMAYDEEIEFRDPITKYGDIRGYLLNIALLRQFF--SPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWD

AT5G20140.1 SOUL heme-binding family protein1.6e-8247.23Show/hide
Query:  NLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTR
        +L VG ++A+          S V+++ LV FLY+DL H+FD+QGID  AYDE ++FRDPITK+  I GYL NIA L+  F+PQ  LHW K+TG YEITTR
Subjt:  NLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTR

Query:  WTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGEN
        WT  MKF  LPWKPE V TG SIM VNP T KFC H+DLWDS++NNDYFS+EGL DVFKQ       +LE PKYQ LKRTANYEVR Y PF   E  G+ 
Subjt:  WTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGEN

Query:  L--FQCVNSIGGWGDCK--------------------------------------------EDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHY
        L      N++ G+   K                                             ++++   + +GG AA + FSGK TE+ V+ K  ELR  
Subjt:  L--FQCVNSIGGWGDCK--------------------------------------------EDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHY

Query:  LKKDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM
        L KDG ++     C+L RYND  +TW+F+MRNEV+IWL+DFS+
Subjt:  LKKDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQDFSM

AT5G20140.2 SOUL heme-binding family protein1.1e-7545.59Show/hide
Query:  NLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTR
        +L VG ++A+          S V+++ LV FLY+DL H+FD+QGID  AYDE ++FRDPITK+  I GYL NIA L+  F+PQ  LHW K+TG YEITTR
Subjt:  NLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFRDPITKYGDIRGYLLNIALLRQFFSPQIILHWVKKTGSYEITTR

Query:  WTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGEN
        WT  MKF  LPWKPE V TG SIM VNP T KFC H+DLWDS++NNDYFS+EGL DVFKQ       +LE PKYQ LKRTANYEVR Y PF   E  G+ 
Subjt:  WTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQ-------ELEVPKYQTLKRTANYEVRKYGPFATTERSGEN

Query:  L--FQCVNSIGGWGDCK--------------------------------------------EDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHY
        L      N++ G+   K                                             ++++   + +GG AA + FSGK TE+ V+ K  ELR  
Subjt:  L--FQCVNSIGGWGDCK--------------------------------------------EDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHY

Query:  LKKDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQD
        L KDG ++     C+L RYND  +TW+F+M ++VL +  D
Subjt:  LKKDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCAGTCCAAGTTCTCCCAATCCCAACCGTTAGCTTTGGTTTCCGGGCAAGGAAATCCGACGGACCAACCAGAACCATAGCCGCTGCCATAACTCAGAAGCCTCA
CAATCACAGTCACAACCAAAATTTGGCTGTTGGATCAAAACTAGCAGCAACAGATCACCAATACAAAAGACCAACGAAATCGAGGGTGGACGTTGACCGATTGGTGAAAT
TCTTATACGATGATCTCCAGCACGTGTTCGATGAACAGGGAATTGATCCGATGGCTTACGATGAAGAAATAGAATTTCGAGATCCAATTACAAAATACGGTGACATAAGG
GGATATTTGTTAAATATTGCTCTCTTGCGACAATTCTTTAGCCCTCAGATCATCTTGCATTGGGTTAAAAAGACAGGATCGTATGAGATAACAACAAGATGGACTGCAGC
AATGAAGTTTGCTCTTCTACCATGGAAACCAGAATGTGTTCTGACAGGAACTTCAATCATGACCGTTAATCCAAACACTGGCAAGTTTTGTCGACATGTGGATCTTTGGG
ATTCAGTACAAAATAATGACTACTTTTCTATAGAAGGCCTTTGGGATGTATTTAAACAGGAATTGGAAGTGCCTAAATATCAGACATTGAAAAGGACTGCAAATTATGAG
GTGAGAAAATATGGACCATTTGCTACCACAGAAAGAAGTGGAGAGAACTTGTTTCAGTGTGTCAATAGCATCGGCGGTTGGGGAGATTGTAAAGAAGACGACAGAATCAT
GAAGTTGAGAAATAAAGGAGGAATTGCTGCAGTGTTGAATTTCAGTGGAAAGGCTACAGAAGAAAAGGTGAAAAACAAAGCGAAAGAATTAAGACATTATCTCAAAAAAG
ATGGCCACAAAAGCGTTAATAATAATAGCTGTTTACTTGTACGTTACAACGATTCCAACCAAACATGGAGTTTCGTCATGAGAAATGAGGTGCTAATATGGCTTCAAGAT
TTCTCAATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCAGTCCAAGTTCTCCCAATCCCAACCGTTAGCTTTGGTTTCCGGGCAAGGAAATCCGACGGACCAACCAGAACCATAGCCGCTGCCATAACTCAGAAGCCTCA
CAATCACAGTCACAACCAAAATTTGGCTGTTGGATCAAAACTAGCAGCAACAGATCACCAATACAAAAGACCAACGAAATCGAGGGTGGACGTTGACCGATTGGTGAAAT
TCTTATACGATGATCTCCAGCACGTGTTCGATGAACAGGGAATTGATCCGATGGCTTACGATGAAGAAATAGAATTTCGAGATCCAATTACAAAATACGGTGACATAAGG
GGATATTTGTTAAATATTGCTCTCTTGCGACAATTCTTTAGCCCTCAGATCATCTTGCATTGGGTTAAAAAGACAGGATCGTATGAGATAACAACAAGATGGACTGCAGC
AATGAAGTTTGCTCTTCTACCATGGAAACCAGAATGTGTTCTGACAGGAACTTCAATCATGACCGTTAATCCAAACACTGGCAAGTTTTGTCGACATGTGGATCTTTGGG
ATTCAGTACAAAATAATGACTACTTTTCTATAGAAGGCCTTTGGGATGTATTTAAACAGGAATTGGAAGTGCCTAAATATCAGACATTGAAAAGGACTGCAAATTATGAG
GTGAGAAAATATGGACCATTTGCTACCACAGAAAGAAGTGGAGAGAACTTGTTTCAGTGTGTCAATAGCATCGGCGGTTGGGGAGATTGTAAAGAAGACGACAGAATCAT
GAAGTTGAGAAATAAAGGAGGAATTGCTGCAGTGTTGAATTTCAGTGGAAAGGCTACAGAAGAAAAGGTGAAAAACAAAGCGAAAGAATTAAGACATTATCTCAAAAAAG
ATGGCCACAAAAGCGTTAATAATAATAGCTGTTTACTTGTACGTTACAACGATTCCAACCAAACATGGAGTTTCGTCATGAGAAATGAGGTGCTAATATGGCTTCAAGAT
TTCTCAATGTAG
Protein sequenceShow/hide protein sequence
MAPVQVLPIPTVSFGFRARKSDGPTRTIAAAITQKPHNHSHNQNLAVGSKLAATDHQYKRPTKSRVDVDRLVKFLYDDLQHVFDEQGIDPMAYDEEIEFRDPITKYGDIR
GYLLNIALLRQFFSPQIILHWVKKTGSYEITTRWTAAMKFALLPWKPECVLTGTSIMTVNPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQELEVPKYQTLKRTANYE
VRKYGPFATTERSGENLFQCVNSIGGWGDCKEDDRIMKLRNKGGIAAVLNFSGKATEEKVKNKAKELRHYLKKDGHKSVNNNSCLLVRYNDSNQTWSFVMRNEVLIWLQD
FSM