CuGenDBv2

Cmc02g0050141 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0050141
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: SOUL heme-binding family protein
Genome location: CMiso1.1chr02:16141648..16145458
RNA-Seq Expression: Cmc02g0050141
Synteny: Cmc02g0050141
Gene Ontology terms:
GO:0006979 - response to oxidative stress (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0020037 - heme binding (molecular function)
InterPro domains:
IPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0043396.1 SOUL heme-binding family protein isoform 1 [Cucumis melo var. makuwa] | e value: 3.7e-210 | % identity: 95.87
Query:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQT--------------GPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW
        RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQ                PYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQT--------------GPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP
        DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP
Subjt:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP

Query:  KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIM
        KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIM
Subjt:  KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIM

KGN65404.1 hypothetical protein Csa_019846 [Cucumis sativus] | e value: 1.4e-209 | % identity: 93.54
Query:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
        MA LQLSLQNF STPTL+S+LRPPKSGR+T+L PRLL SRTPA KP+T+NSKWVVR NLVDQ PPKST+DVGRLVDFL+EDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRT KYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SESPKVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE
        DSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEE+SLE
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE

TYK11984.1 SOUL heme-binding family protein isoform 1 [Cucumis melo var. makuwa] | e value: 5.3e-217 | % identity: 95.26
Query:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNFLSTPTLTSVLRPPKSGRLT+LLPRLLQSRTPA KPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQT--------------GPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW
        RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQ                PYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQT--------------GPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP
        DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRT KYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP
Subjt:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP

Query:  KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSL
        KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSL
Subjt:  KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSL

Query:  E
        E
Subjt:  E

XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo] | e value: 1.7e-223 | % identity: 100
Query:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE
        DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE

XP_011648491.1 uncharacterized protein LOC101206063 [Cucumis sativus] | e value: 1.4e-209 | % identity: 93.54
Query:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
        MA LQLSLQNF STPTL+S+LRPPKSGR+T+L PRLL SRTPA KP+T+NSKWVVR NLVDQ PPKST+DVGRLVDFL+EDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRT KYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SESPKVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE
        DSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEE+SLE
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LWP3 Uncharacterized protein | e value: 6.8e-210 | % identity: 93.54
Query:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
        MA LQLSLQNF STPTL+S+LRPPKSGR+T+L PRLL SRTPA KP+T+NSKWVVR NLVDQ PPKST+DVGRLVDFL+EDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRT KYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SESPKVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE
        DSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEE+SLE
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X1 | e value: 8.3e-224 | % identity: 100
Query:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE
        DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE

A0A1S4E461 uncharacterized protein LOC103501513 isoform X2 | e value: 3.1e-194 | % identity: 90.18
Query:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDE  
Subjt:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
                                            QTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE
        DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE

A0A5A7TMX2 SOUL heme-binding family protein isoform 1 | e value: 1.8e-210 | % identity: 95.87
Query:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQT--------------GPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW
        RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQ                PYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQT--------------GPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP
        DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP
Subjt:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP

Query:  KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIM
        KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIM
Subjt:  KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIM

A0A5D3CLD5 SOUL heme-binding family protein isoform 1 | e value: 2.6e-217 | % identity: 95.26
Query:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNFLSTPTLTSVLRPPKSGRLT+LLPRLLQSRTPA KPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQT--------------GPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW
        RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQ                PYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQT--------------GPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP
        DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRT KYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP
Subjt:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP

Query:  KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSL
        KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSL
Subjt:  KVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSL

Query:  E
        E
Subjt:  E

SwissProt top hits (e value, % identity, alignment)
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic | e value: 2.1e-19 | % identity: 35
Query:  FYKTPELESPKYLILKRTPKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVS-------------
        F   P+LE+  + +L RT KYE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S   K+              
Subjt:  FYKTPELESPKYLILKRTPKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVS-------------

Query:  ----IQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD-GLKPRNGCL--LARYNDPGRTWNFIMRNEVLIWLE
            +  V+PS K   +LP P+   + +++V   I AV+ FSG  T+E ++ + +ELR +L  D   + R+G    +A+YN P  T  F+ RNEV + +E
Subjt:  ----IQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD-GLKPRNGCL--LARYNDPGRTWNFIMRNEVLIWLE

Arabidopsis top hits (e value, % identity, alignment)
AT1G17100.1 SOUL heme-binding family protein | e value: 6.2e-06 | % identity: 28.37
Query:  LESPKYLILKRTPKYEVRKYAPFIVVETS-----GDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP----KVSIQIVLPSEKDIDSLPD
        +E P Y ++     YE+R+Y   + V T          A    F  +  YI GKN   +KI MT PV +Q   S+ P      ++   +P +   D  P 
Subjt:  LESPKYLILKRTPKYEVRKYAPFIVVETS-----GDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESP----KVSIQIVLPSEKDIDSLPD

Query:  PEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL
           + + ++K      AV +FSG  +++ + E+A  L SSL
Subjt:  PEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL

AT2G37970.1 SOUL heme-binding family protein | e value: 2.3e-16 | % identity: 33.97
Query:  LESPKYLILKRTPKYEVRKYAPFIVVETSGD----KLAGSAGFNTVAGYI--FGK--NSTKEKIPMTTPVFTQ----------TFDSESPK---------
        +E+PKY + K    YE+R+Y P +  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+              ES K         
Subjt:  LESPKYLILKRTPKYEVRKYAPFIVVETSGD----KLAGSAGFNTVAGYI--FGK--NSTKEKIPMTTPVFTQ----------TFDSESPK---------

Query:  -----------VSIQIVLPS-EKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNF--IM
                   V++Q +LPS  K  +  P P  + + +++  G    V+KFSG  +E +V EK K+L S L KDG K     +LARYN P   W      
Subjt:  -----------VSIQIVLPS-EKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNF--IM

Query:  RNEVLIWLE
         NEV+I +E
Subjt:  RNEVLIWLE

AT3G10130.1 SOUL heme-binding family protein | e value: 1.5e-20 | % identity: 35
Query:  FYKTPELESPKYLILKRTPKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVS-------------
        F   P+LE+  + +L RT KYE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S   K+              
Subjt:  FYKTPELESPKYLILKRTPKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVS-------------

Query:  ----IQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD-GLKPRNGCL--LARYNDPGRTWNFIMRNEVLIWLE
            +  V+PS K   +LP P+   + +++V   I AV+ FSG  T+E ++ + +ELR +L  D   + R+G    +A+YN P  T  F+ RNEV + +E
Subjt:  ----IQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD-GLKPRNGCL--LARYNDPGRTWNFIMRNEVLIWLE

AT5G20140.1 SOUL heme-binding family protein | e value: 2.0e-145 | % identity: 74.84
Query:  STVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTG
        STV++  LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI+ L+ IF P+F LHW KQTGPYEITTRWTM+MKF  LPWKPEL+FTG
Subjt:  STVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTG

Query:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNST
         SIM +NPET KFCSH+DLWDSI+NNDYFS+EGL DVFKQLR YKTP+LE+PKY ILKRT  YEVR Y PFIVVET GDKL+GS+GFN VAGYIFGKNST
Subjt:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNST

Query:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDP
         EKIPMTTPVFTQT D++ S  VS+QIV+PS KD+ SLP P ++ + L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL KDGL+ + GC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDP

Query:  GRTWNFIMRNEVLIWLEEWSLE
        GRTWNFIMRNEV+IWLE++SL+
Subjt:  GRTWNFIMRNEVLIWLEEWSLE

AT5G20140.2 SOUL heme-binding family protein | e value: 6.2e-139 | % identity: 72.73
Query:  STVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTG
        STV++  LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI+ L+ IF P+F LHW KQTGPYEITTRWTM+MKF  LPWKPEL+FTG
Subjt:  STVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTG

Query:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNST
         SIM +NPET KFCSH+DLWDSI+NNDYFS+EGL DVFKQLR YKTP+LE+PKY ILKRT  YEVR Y PFIVVET GDKL+GS+GFN VAGYIFGKNST
Subjt:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNST

Query:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDP
         EKIPMTTPVFTQT D++ S  VS+QIV+PS KD+ SLP P ++ + L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL KDGL+ + GC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDP

Query:  GRTWNFIMRNEVLIWLEEW
        GRTWNFIM   +    + W
Subjt:  GRTWNFIMRNEVLIWLEEW


Sequences
CDS sequence
ATGGCCGCTCTTCAACTTTCCCTCCAAAACTTCCTCTCAACCCCAACACTCACTTCCGTTCTCCGCCCACCGAAATCCGGCAGACTAACCAACCTCCTACCTCGTCTACT
TCAATCCAGAACTCCAGCTGTTAAACCCAATACCCAAAATTCTAAGTGGGTTGTTCGATTCAACTTGGTTGATCAAAGCCCACCAAAATCGACAGTCGATGTAGGCCGAT
TGGTGGATTTCTTGTATGAAGATCTTTCCCATCTTTTCGATGAACAGGGAATTGATCGAACGGCGTACGATGAACAAGTGAGATTTCGAGACCCCATTACTAAGCACGAT
ACGATTAGTGGGTATTTGTTTAATATTTCCCTCTTGCGAGAAATCTTCAGGCCTGAATTCTTCTTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTACAAGATG
GACTATGATAATGAAGTTTGCCCTTCTGCCATGGAAACCAGAATTAATTTTCACAGGAACTTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTCATGTGG
ATCTCTGGGATTCGATACAGAACAACGACTACTTTTCTGTAGAAGGCCTTTGGGATGTTTTCAAGCAGCTTCGGTTTTATAAGACTCCAGAATTGGAATCACCCAAGTAT
CTGATTCTGAAAAGGACTCCAAAGTATGAGGTGAGGAAATATGCTCCATTTATAGTGGTAGAAACAAGTGGAGACAAGCTCGCTGGATCTGCAGGATTCAATACAGTTGC
TGGGTACATATTTGGGAAGAACTCTACGAAGGAGAAGATACCCATGACCACTCCTGTATTCACCCAAACATTTGACTCTGAATCACCCAAAGTCTCCATTCAAATAGTTC
TTCCTTCAGAGAAAGATATAGACAGTTTACCAGATCCTGAACAAGACATAATTGGCTTGAGAAAGGTTGAAGGAGGTATTGCTGCAGTTTTGAAATTCAGTGGGAAACCT
ACTGAAGAGATTGTGCAAGAGAAAGCAAAAGAACTGCGGTCTAGTCTCATAAAGGATGGTCTCAAACCCAGGAACGGCTGTTTGCTTGCTCGGTATAACGACCCTGGAAG
AACATGGAACTTTATAATGAGAAATGAGGTGCTAATATGGCTTGAAGAGTGGTCATTGGAGTAA
mRNA sequence
CATTTCAGCAATGGCTGAGACTTGAACACGTAGATTGTTCAAATGATTTGGATTGTTCTTCACCACAAAAAAATTTGAGGAAGCAAAAACCAGCCAATCACTCCCCGTCC
TTTACCTCCAAATCCCACCTTCCCAATGACAATCCGCCGGCTGTGGCTGAGGCTGAGGCTGAGGCTCAAATGGCCGCTCTTCAACTTTCCCTCCAAAACTTCCTCTCAAC
CCCAACACTCACTTCCGTTCTCCGCCCACCGAAATCCGGCAGACTAACCAACCTCCTACCTCGTCTACTTCAATCCAGAACTCCAGCTGTTAAACCCAATACCCAAAATT
CTAAGTGGGTTGTTCGATTCAACTTGGTTGATCAAAGCCCACCAAAATCGACAGTCGATGTAGGCCGATTGGTGGATTTCTTGTATGAAGATCTTTCCCATCTTTTCGAT
GAACAGGGAATTGATCGAACGGCGTACGATGAACAAGTGAGATTTCGAGACCCCATTACTAAGCACGATACGATTAGTGGGTATTTGTTTAATATTTCCCTCTTGCGAGA
AATCTTCAGGCCTGAATTCTTCTTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTACAAGATGGACTATGATAATGAAGTTTGCCCTTCTGCCATGGAAACCAG
AATTAATTTTCACAGGAACTTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTCATGTGGATCTCTGGGATTCGATACAGAACAACGACTACTTTTCTGTA
GAAGGCCTTTGGGATGTTTTCAAGCAGCTTCGGTTTTATAAGACTCCAGAATTGGAATCACCCAAGTATCTGATTCTGAAAAGGACTCCAAAGTATGAGGTGAGGAAATA
TGCTCCATTTATAGTGGTAGAAACAAGTGGAGACAAGCTCGCTGGATCTGCAGGATTCAATACAGTTGCTGGGTACATATTTGGGAAGAACTCTACGAAGGAGAAGATAC
CCATGACCACTCCTGTATTCACCCAAACATTTGACTCTGAATCACCCAAAGTCTCCATTCAAATAGTTCTTCCTTCAGAGAAAGATATAGACAGTTTACCAGATCCTGAA
CAAGACATAATTGGCTTGAGAAAGGTTGAAGGAGGTATTGCTGCAGTTTTGAAATTCAGTGGGAAACCTACTGAAGAGATTGTGCAAGAGAAAGCAAAAGAACTGCGGTC
TAGTCTCATAAAGGATGGTCTCAAACCCAGGAACGGCTGTTTGCTTGCTCGGTATAACGACCCTGGAAGAACATGGAACTTTATAATGAGAAATGAGGTGCTAATATGGC
TTGAAGAGTGGTCATTGGAGTAATGAAGCAACAAATTCTTGGTTCTTTCATAAAGCCTTTGAGTGATCTTTGTAGCAAAGATTTCAAATGTTCAAGTAATGAAGCAACAA
ATTCTTGGTTCTTTTCAAGATAGCACTCATGGACTCTGTCACGTTCGGTGTCATCAGACTGTAATGTCTTTCTCGACAATGCATACATGCCCACCGTCATAGTCCAACTT
TTTATAAATGCCTTCCTACACCGTTATCAAATGCTAGGAATACACCGTTATCAAATGCTAGGAATACTCTTGAAATTTTGATTCACATTATGCCATTGGTGCATTGTAAA
CAAATTTTTAAATTGTTTCATTATTGAACATTGTACTTAAATTTTGTTTCAAGCGTTTGACACAAAGACCATGGAATGTGGTGGGCAATAAAAAATTATACTATAACAAG
CTTGAATTTCGATCTATCACAAAATCTAGATTTTTAACTTCATCTATTGCACCTTTTAGCTT
Protein sequence
MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLVDQSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQVRFRDPITKHD
TISGYLFNISLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKY
LILKRTPKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSESPKVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKP
TEEIVQEKAKELRSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEWSLE