; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027154 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027154
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSOUL heme-binding family protein
Genome locationtig00153048:1560557..1563259
RNA-Seq ExpressionSgr027154
SyntenySgr027154
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043396.1 SOUL heme-binding family protein isoform 1 [Cucumis melo var. makuwa]3.1e-18979.21Show/hide
Query:  KNSHERLANHHSPSCTFKVHLPDGNSP------VQAQMAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ-
        KN  ++   +HSPS T K HLP+ N P       +AQMA LQLSLQNFLSTPTL    RPP SGRL  L PRLL++RT    P TQNSKWVVR +LVDQ 
Subjt:  KNSHERLANHHSPSCTFKVHLPDGNSP------VQAQMAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ-

Query:  --KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQT--------------GPYEITTRWTM
          KSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNI+LLRE+FRPEFFLHWVKQ                PYEITTRWTM
Subjt:  --KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQT--------------GPYEITTRWTM

Query:  VMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSG
        +MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL DVFKQLRF+KTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKL+G
Subjt:  VMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSG

Query:  SAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKD
        SAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEKDIDSLPDPEQD IGLRKVEGGIAAVLKFSG+PTE++VQEKAKELRSSLIKD
Subjt:  SAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKD

Query:  GLKPSKGCLLARYNDPGRTWSFIMVSPCAFLSY
        GLKP  GCLLARYNDPGRTW+FIMV     L +
Subjt:  GLKPSKGCLLARYNDPGRTWSFIMVSPCAFLSY

XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]4.2e-18687.94Show/hide
Query:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQV
        MA LQLSLQNFLSTPTL    RPP SGRL  L PRLL++RT    P TQNSKWVVR +LVDQ   KSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPITKHDTISGYLFNI+LLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI
         DVFKQLRF+KTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEKDI
Subjt:  LDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI

Query:  DSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQD IGLRKVEGGIAAVLKFSG+PTE++VQEKAKELRSSLIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM

XP_011648491.1 uncharacterized protein LOC101206063 [Cucumis sativus]1.4e-18984.89Show/hide
Query:  HHSPSCTFKVHLPDGNSP--VQAQMAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVD
        +HSPS T K HLP+   P   +AQMA LQLSLQNF STPTL    RPP SGR+  LPPRLL +RT  F P T+NSKWVVR +LVDQ   KST+DV RLVD
Subjt:  HHSPSCTFKVHLPDGNSP--VQAQMAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVD

Query:  FLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPET
        FL+EDL HLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNI+LLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF LLPWKPELVFTG SIMGINPET
Subjt:  FLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPET

Query:  GKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPV
        GKFCSHVDLWDSIQNNDYFS+EGL DVFKQLRF+KTPELESPKY ILKRTA YEVRKYAPFIVVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPV
Subjt:  GKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPV

Query:  FTQTFDSELPKVSIQIVLPSEKDIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM
        FTQ F+SE PKVSIQIVLPSEKDIDSLPDPEQD +GLRKVEGGIAAVLKFSG+P E++VQEKAKELRSSLIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  FTQTFDSELPKVSIQIVLPSEKDIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM

XP_022144956.1 uncharacterized protein LOC111014503 isoform X1 [Momordica charantia]4.6e-19390.93Show/hide
Query:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRL--PGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MA LQLSLQNFLSTPT GF FRP  SG L   GLPPRLLK+RTV F P  +NSKW VRLSLVDQ   KS VDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRL--PGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGY FNI+LLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  QVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLRF+KTPELESPKYEILKRTANYEVRKY PF+VVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DI+SLPDPEQDTIGLRKVEGGIAAVLKFSG+PTEDMVQEKAKELRS LIKDGLKPSKGCLLARYNDPGRTWSFIM
Subjt:  DIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]6.9e-18989.54Show/hide
Query:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQV
        MA  QLSLQNF STPTLGF  RPP SGRL  LPPRL KTRT  F P +QNSKWVVRLSLVDQ   KSTVDV RLVDFLYEDLRHLFDEQGIDRTAYDEQV
Subjt:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPIT HDTISGYLFNI+LLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI
         DVFKQLR++KTP LESPKY ILKRTANYEVRKYA FIVVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE+PKV IQIVLPSEKDI
Subjt:  LDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI

Query:  DSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQD IGLRKVEG IAAVLKFSG+PTE++VQEKAKELRSSLIKDGLKPS GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein4.2e-18487.13Show/hide
Query:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQV
        MA LQLSLQNF STPTL    RPP SGR+  LPPRLL +RT  F P T+NSKWVVR +LVDQ   KST+DV RLVDFL+EDL HLFDEQGIDRTAYDEQV
Subjt:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPITKHDTISGYLFNI+LLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF LLPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI
         DVFKQLRF+KTPELESPKY ILKRTA YEVRKYAPFIVVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQ F+SE PKVSIQIVLPSEKDI
Subjt:  LDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI

Query:  DSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQD +GLRKVEGGIAAVLKFSG+P E++VQEKAKELRSSLIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X12.0e-18687.94Show/hide
Query:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQV
        MA LQLSLQNFLSTPTL    RPP SGRL  L PRLL++RT    P TQNSKWVVR +LVDQ   KSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPITKHDTISGYLFNI+LLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI
         DVFKQLRF+KTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEKDI
Subjt:  LDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI

Query:  DSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQD IGLRKVEGGIAAVLKFSG+PTE++VQEKAKELRSSLIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM

A0A5A7TMX2 SOUL heme-binding family protein isoform 11.5e-18979.21Show/hide
Query:  KNSHERLANHHSPSCTFKVHLPDGNSP------VQAQMAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ-
        KN  ++   +HSPS T K HLP+ N P       +AQMA LQLSLQNFLSTPTL    RPP SGRL  L PRLL++RT    P TQNSKWVVR +LVDQ 
Subjt:  KNSHERLANHHSPSCTFKVHLPDGNSP------VQAQMAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ-

Query:  --KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQT--------------GPYEITTRWTM
          KSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNI+LLRE+FRPEFFLHWVKQ                PYEITTRWTM
Subjt:  --KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQT--------------GPYEITTRWTM

Query:  VMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSG
        +MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL DVFKQLRF+KTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKL+G
Subjt:  VMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSG

Query:  SAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKD
        SAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEKDIDSLPDPEQD IGLRKVEGGIAAVLKFSG+PTE++VQEKAKELRSSLIKD
Subjt:  SAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKD

Query:  GLKPSKGCLLARYNDPGRTWSFIMVSPCAFLSY
        GLKP  GCLLARYNDPGRTW+FIMV     L +
Subjt:  GLKPSKGCLLARYNDPGRTWSFIMVSPCAFLSY

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X12.2e-19390.93Show/hide
Query:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRL--PGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MA LQLSLQNFLSTPT GF FRP  SG L   GLPPRLLK+RTV F P  +NSKW VRLSLVDQ   KS VDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAGLQLSLQNFLSTPTLGFDFRPPNSGRL--PGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGY FNI+LLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  QVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLRF+KTPELESPKYEILKRTANYEVRKY PF+VVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DI+SLPDPEQDTIGLRKVEGGIAAVLKFSG+PTEDMVQEKAKELRS LIKDGLKPSKGCLLARYNDPGRTWSFIM
Subjt:  DIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X15.5e-18486.93Show/hide
Query:  MAGLQLSLQNFL--STPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MA LQ SLQN L  STP+LGF FRPPNSGRL      + ++RTVP  P T+NSKWVVRLSLVDQ   KSTVDVD+LVDFLYEDL HLFDEQGIDRTAYD+
Subjt:  MAGLQLSLQNFL--STPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQ---KSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        QVRFRDPITKHDTI+GYLFNI+LLRELFRPEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+LVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  QVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLRF+KTPELESPKYEILKRT NYEVRKYAPFIVVETSGDKL+GSAGFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM
        D+ SLPDPEQDTIGLRKVEGG AAVLKFSG+PTE++VQEKAKELRSSLIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic5.0e-1731.32Show/hide
Query:  FFKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDI
        F   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  FFKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDI

Query:  D--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKD---GLKPSKGCLLARYNDP
        +              +LP P+  ++ +++V   I AV+ FSG  T++ ++ + +ELR +L  D    ++      +A+YN P
Subjt:  D--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKD---GLKPSKGCLLARYNDP

Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein5.7e-0830.99Show/hide
Query:  LESPKYEILKRTANYEVRKYAPFIVVET------SGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP----KVSIQIVLPSEKDIDSLP
        +E P YE++     YE+R+Y   + V T      S    + +A F   A YI GKN   +KI MT PV +Q   S+ P      ++   +P +   D  P
Subjt:  LESPKYEILKRTANYEVRKYAPFIVVET------SGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP----KVSIQIVLPSEKDIDSLP

Query:  DPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSL
            + + ++K      AV +FSG  ++D + E+A  L SSL
Subjt:  DPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSL

AT2G37970.1 SOUL heme-binding family protein3.4e-1634.21Show/hide
Query:  LESPKYEILKRTANYEVRKYAPFIVVETSGD----KLSGSAGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ------------TFDSELPK-------
        +E+PKY + K    YE+R+Y P +  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+            T +SE  +       
Subjt:  LESPKYEILKRTANYEVRKYAPFIVVETSGD----KLSGSAGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ------------TFDSELPK-------

Query:  -----------VSIQIVLPS-EKDIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDP
                   V++Q +LPS  K  +  P P  + + +++  G    V+KFSG  +E +V EK K+L S L KDG K +   +LARYN P
Subjt:  -----------VSIQIVLPS-EKDIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDP

AT3G10130.1 SOUL heme-binding family protein3.6e-1831.32Show/hide
Query:  FFKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDI
        F   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  FFKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDI

Query:  D--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKD---GLKPSKGCLLARYNDP
        +              +LP P+  ++ +++V   I AV+ FSG  T++ ++ + +ELR +L  D    ++      +A+YN P
Subjt:  D--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKD---GLKPSKGCLLARYNDP

AT5G20140.1 SOUL heme-binding family protein1.1e-14169.01Show/hide
Query:  LGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQKSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNI
        +G D R   + R   +P R + TR  P           V   +    STV+++ LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI
Subjt:  LGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQKSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNI

Query:  ALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPK
        A L+ +F P+F LHW KQTGPYEITTRWTMVMKF+ LPWKPELVFTG SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR +KTP+LE+PK
Subjt:  ALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPK

Query:  YEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDIDSLPDPEQDTIGLRKVEG
        Y+ILKRTANYEVR Y PFIVVET GDKLSGS+GFN VAGYIFGKNS  EKIPMTTPVFTQT D++L   VS+QIV+PS KD+ SLP P ++ + L+K+EG
Subjt:  YEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDIDSLPDPEQDTIGLRKVEG

Query:  GIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM
        G AA +KFSG+PTED+VQ K  ELRSSL KDGL+  KGC+LARYNDPGRTW+FIM
Subjt:  GIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIM

AT5G20140.2 SOUL heme-binding family protein3.9e-14268.04Show/hide
Query:  LGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQKSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNI
        +G D R   + R   +P R + TR  P           V   +    STV+++ LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI
Subjt:  LGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLVDQKSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNI

Query:  ALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPK
        A L+ +F P+F LHW KQTGPYEITTRWTMVMKF+ LPWKPELVFTG SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR +KTP+LE+PK
Subjt:  ALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPK

Query:  YEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDIDSLPDPEQDTIGLRKVEG
        Y+ILKRTANYEVR Y PFIVVET GDKLSGS+GFN VAGYIFGKNS  EKIPMTTPVFTQT D++L   VS+QIV+PS KD+ SLP P ++ + L+K+EG
Subjt:  YEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDIDSLPDPEQDTIGLRKVEG

Query:  GIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIMVSPCAFLS
        G AA +KFSG+PTED+VQ K  ELRSSL KDGL+  KGC+LARYNDPGRTW+FIM    +F S
Subjt:  GIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIMVSPCAFLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAAATTCGACCACGTAGATTGTTAATACGAACTGGCTGGTTCCTCACCACAAAAAATTCTCACGAAAGACTAGCCAATCATCACTCTCCGTCCTGTACTTTCAA
AGTTCACCTGCCCGACGGCAATTCGCCGGTGCAGGCTCAAATGGCGGGTCTTCAACTTTCCCTCCAAAACTTCCTCTCAACCCCAACACTTGGTTTTGATTTCCGGCCGC
CGAACTCCGGCAGACTACCCGGCCTCCCACCCCGTCTACTTAAAACCAGGACTGTGCCTTTTACACCTCCTACCCAAAATTCTAAGTGGGTCGTTAGATTAAGCTTGGTA
GATCAGAAATCCACGGTCGACGTAGACCGATTGGTGGATTTCTTATACGAAGATCTTCGCCATCTCTTCGATGAACAGGGGATTGATCGGACGGCGTATGATGAACAAGT
GAGATTTCGGGACCCCATTACCAAGCATGATACCATTAGCGGGTATTTGTTTAATATTGCCCTCTTGCGAGAACTCTTCAGGCCCGAGTTCTTCTTGCACTGGGTTAAAC
AGACAGGACCTTATGAAATAACTACAAGGTGGACTATGGTAATGAAGTTTGTCCTTCTACCATGGAAACCAGAATTAGTTTTTACGGGTTATTCCATCATGGGTATCAAT
CCAGAGACGGGCAAGTTTTGTAGCCATGTGGATCTCTGGGATTCAATACAAAATAATGACTATTTTTCTTTAGAAGGCCTGTTGGATGTATTTAAGCAGCTCCGGTTTTT
TAAGACCCCAGAATTGGAATCACCCAAATATGAGATACTGAAAAGGACTGCAAATTATGAGGTGAGGAAATATGCGCCATTTATAGTGGTAGAAACAAGTGGAGACAAGC
TCTCTGGGTCTGCTGGATTCAATACAGTTGCTGGGTATATATTTGGGAAGAACTCTGCAAAGGAGAAAATACCCATGACCACCCCTGTATTCACCCAGACATTTGACTCT
GAATTACCCAAAGTATCCATCCAAATAGTTCTTCCTTCAGAGAAAGATATAGACAGTTTACCAGATCCTGAACAAGACACAATTGGCTTGAGAAAGGTTGAAGGAGGTAT
TGCTGCAGTGTTGAAATTCAGTGGAAGGCCTACTGAAGATATGGTGCAAGAGAAGGCGAAAGAATTGCGGTCTAGTCTTATAAAGGATGGTCTTAAACCCAGCAAGGGCT
GTTTGCTCGCTCGTTACAACGACCCCGGCCGAACATGGAGCTTTATAATGGTTAGTCCATGTGCCTTTCTATCATATGCTACTCAAATTATGGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAAATTCGACCACGTAGATTGTTAATACGAACTGGCTGGTTCCTCACCACAAAAAATTCTCACGAAAGACTAGCCAATCATCACTCTCCGTCCTGTACTTTCAA
AGTTCACCTGCCCGACGGCAATTCGCCGGTGCAGGCTCAAATGGCGGGTCTTCAACTTTCCCTCCAAAACTTCCTCTCAACCCCAACACTTGGTTTTGATTTCCGGCCGC
CGAACTCCGGCAGACTACCCGGCCTCCCACCCCGTCTACTTAAAACCAGGACTGTGCCTTTTACACCTCCTACCCAAAATTCTAAGTGGGTCGTTAGATTAAGCTTGGTA
GATCAGAAATCCACGGTCGACGTAGACCGATTGGTGGATTTCTTATACGAAGATCTTCGCCATCTCTTCGATGAACAGGGGATTGATCGGACGGCGTATGATGAACAAGT
GAGATTTCGGGACCCCATTACCAAGCATGATACCATTAGCGGGTATTTGTTTAATATTGCCCTCTTGCGAGAACTCTTCAGGCCCGAGTTCTTCTTGCACTGGGTTAAAC
AGACAGGACCTTATGAAATAACTACAAGGTGGACTATGGTAATGAAGTTTGTCCTTCTACCATGGAAACCAGAATTAGTTTTTACGGGTTATTCCATCATGGGTATCAAT
CCAGAGACGGGCAAGTTTTGTAGCCATGTGGATCTCTGGGATTCAATACAAAATAATGACTATTTTTCTTTAGAAGGCCTGTTGGATGTATTTAAGCAGCTCCGGTTTTT
TAAGACCCCAGAATTGGAATCACCCAAATATGAGATACTGAAAAGGACTGCAAATTATGAGGTGAGGAAATATGCGCCATTTATAGTGGTAGAAACAAGTGGAGACAAGC
TCTCTGGGTCTGCTGGATTCAATACAGTTGCTGGGTATATATTTGGGAAGAACTCTGCAAAGGAGAAAATACCCATGACCACCCCTGTATTCACCCAGACATTTGACTCT
GAATTACCCAAAGTATCCATCCAAATAGTTCTTCCTTCAGAGAAAGATATAGACAGTTTACCAGATCCTGAACAAGACACAATTGGCTTGAGAAAGGTTGAAGGAGGTAT
TGCTGCAGTGTTGAAATTCAGTGGAAGGCCTACTGAAGATATGGTGCAAGAGAAGGCGAAAGAATTGCGGTCTAGTCTTATAAAGGATGGTCTTAAACCCAGCAAGGGCT
GTTTGCTCGCTCGTTACAACGACCCCGGCCGAACATGGAGCTTTATAATGGTTAGTCCATGTGCCTTTCTATCATATGCTACTCAAATTATGGCTTGA
Protein sequenceShow/hide protein sequence
MVQIRPRRLLIRTGWFLTTKNSHERLANHHSPSCTFKVHLPDGNSPVQAQMAGLQLSLQNFLSTPTLGFDFRPPNSGRLPGLPPRLLKTRTVPFTPPTQNSKWVVRLSLV
DQKSTVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNIALLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGIN
PETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFFKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS
ELPKVSIQIVLPSEKDIDSLPDPEQDTIGLRKVEGGIAAVLKFSGRPTEDMVQEKAKELRSSLIKDGLKPSKGCLLARYNDPGRTWSFIMVSPCAFLSYATQIMA