; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034381 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034381
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSOUL heme-binding protein
Genome locationchr3:6829184..6833331
RNA-Seq ExpressionLag0034381
SyntenyLag0034381
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]9.9e-19289.01Show/hide
Query:  MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV
        MAALQ SLQNFLSTPT+    RPP SG L  L PRLL+SRT A KP+T+NSKWVVR +LVDQSPPKSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPITKHDTISGYLFNISLLRE+FRPEF LHWVKQTGPYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI
         DVFKQLR+YKTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKLAGS+GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEKDI
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI

Query:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQD +GLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL+K+GLKP  GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM

XP_022144956.1 uncharacterized protein LOC111014503 isoform X1 [Momordica charantia]7.3e-19590.13Show/hide
Query:  MAALQFSLQNFLSTPTVRFGFRPPNSG--TLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE
        MAALQ SLQNFLSTPT  FGFRP  SG  T+ GL PRLLKSRTV FKP  RNSKW VRLSLVDQSPPKS VDVDRLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQFSLQNFLSTPTVRFGFRPPNSG--TLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGY FNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMKF LLPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GS+GFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPS+K
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM
        DI+SLPDPEQDT+GLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS L+K+GLKPSKGCLLARYNDPGRTWSFIM
Subjt:  DIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM

XP_022930662.1 uncharacterized protein LOC111437064 isoform X1 [Cucurbita moschata]1.2e-18988.53Show/hide
Query:  MAALQFSLQNFL--STPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE
        MAALQFSLQN L  STP++ FGFRPPNSG L+       +SRTV  KPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLYEDL HLFDEQGIDRTAYD+
Subjt:  MAALQFSLQNFL--STPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        QVRFRDPITKHDTI+GYLFNISLLRELFRPEFLLHWVK+TG YEITTRWTMVMKF LLPWKP+LVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRT NYEVRKYAPFIVVETSGDKLAGS+GFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM
        D+ SLPDPEQDT+GLRKVEGG AAVLKFSGKPTEEIVQEKAKELRSSL+K+GLKP  GCLLARYNDPGRTW+FIM
Subjt:  DIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM

XP_023530707.1 uncharacterized protein LOC111793169 isoform X1 [Cucurbita pepo subsp. pepo]2.1e-18988.27Show/hide
Query:  MAALQFSLQN--FLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE
        MAAL+FSLQN   LSTP++ FGFRPPNSG L+       +SRTV  KPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLYEDL HLFD+QGIDRTAYD+
Subjt:  MAALQFSLQN--FLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        +VRFRDPITKHDTI+GYLFNISLLRELFRPEFLLHWVK+TG YEITTRWTMVMKF LLPWKP+LVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGS+GFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM
        D+ SLPDPEQDT+GLRKVEGG AAVLKFSGKPTEEIVQEKAKELRSSL+K+GLKP  GCLLARYNDPGRTW+FIM
Subjt:  DIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]4.5e-19289.54Show/hide
Query:  MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV
        MA  Q SLQNF STPT+ FG RPP SG L  L PRL K+RT AFKPH++NSKWVVRLSLVDQSPPKSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPIT HDTISGYLFNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMKF LLPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI
         DVFKQLRYYKTP LESPKY ILKRTANYEVRKYA FIVVETSGDKLAGS+GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE+PKV IQIVLPSEKDI
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI

Query:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQD +GLRKVEG IAAVLKFSGKPTEEIVQEKAKELRSSL+K+GLKPS GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein5.0e-18988.47Show/hide
Query:  MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV
        MA LQ SLQNF STPT+    RPP SG +  L PRLL SRT AFKPHT+NSKWVVR +LVDQ PPKST+DV RLVDFL+EDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPITKHDTISGYLFNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI
         DVFKQLR+YKTPELESPKY ILKRTA YEVRKYAPFIVVETSGDKLAGS+GFNTVAGYIFGKNS KEKIPMTTPVFTQ F+SE PKVSIQIVLPSEKDI
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI

Query:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQD VGLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSL+K+GLKP  GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X14.8e-19289.01Show/hide
Query:  MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV
        MAALQ SLQNFLSTPT+    RPP SG L  L PRLL+SRT A KP+T+NSKWVVR +LVDQSPPKSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPITKHDTISGYLFNISLLRE+FRPEF LHWVKQTGPYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI
         DVFKQLR+YKTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKLAGS+GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEKDI
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDI

Query:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQD +GLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL+K+GLKP  GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM

A0A5D3CLD5 SOUL heme-binding family protein isoform 18.5e-18985.79Show/hide
Query:  MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV
        MAALQ SLQNFLSTPT+    RPP SG L  L PRLL+SRT AFKP+T+NSKWVVR +LVDQSPPKSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQT--------------GPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLW
        RFRDPITKHDTISGYLFNISLLRE+FRPEF LHWVKQ                PYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQT--------------GPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP
        DSIQNNDYFS+EGL DVFKQLR+YKTPELESPKY ILKRTA YEVRKYAPFIVVETSGDKLAGS+GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE P
Subjt:  DSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP

Query:  KVSIQIVLPSEKDIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM
        KVSIQIVLPSEKDIDSLPDPEQD +GLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL+K+GLKP  GCLLARYNDPGRTW+FIM
Subjt:  KVSIQIVLPSEKDIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X13.6e-19590.13Show/hide
Query:  MAALQFSLQNFLSTPTVRFGFRPPNSG--TLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE
        MAALQ SLQNFLSTPT  FGFRP  SG  T+ GL PRLLKSRTV FKP  RNSKW VRLSLVDQSPPKS VDVDRLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQFSLQNFLSTPTVRFGFRPPNSG--TLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGY FNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMKF LLPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GS+GFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPS+K
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM
        DI+SLPDPEQDT+GLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS L+K+GLKPSKGCLLARYNDPGRTWSFIM
Subjt:  DIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X15.9e-19088.53Show/hide
Query:  MAALQFSLQNFL--STPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE
        MAALQFSLQN L  STP++ FGFRPPNSG L+       +SRTV  KPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLYEDL HLFDEQGIDRTAYD+
Subjt:  MAALQFSLQNFL--STPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        QVRFRDPITKHDTI+GYLFNISLLRELFRPEFLLHWVK+TG YEITTRWTMVMKF LLPWKP+LVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRT NYEVRKYAPFIVVETSGDKLAGS+GFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM
        D+ SLPDPEQDT+GLRKVEGG AAVLKFSGKPTEEIVQEKAKELRSSL+K+GLKP  GCLLARYNDPGRTW+FIM
Subjt:  DIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIM

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic2.9e-1631.87Show/hide
Query:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDI
        +   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDI

Query:  D--------------SLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL---VKEGLKPSKGCLLARYNDP
        +              +LP P+  +V +++V   I AV+ FSG  T+E ++ + +ELR +L    K  ++      +A+YN P
Subjt:  D--------------SLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL---VKEGLKPSKGCLLARYNDP

Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein3.2e-0729.08Show/hide
Query:  LESPKYEILKRTANYEVRKYAPFIVVETS-----GDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP----KVSIQIVLPSEKDIDSLPD
        +E P YE++     YE+R+Y   + V T          A  + F  +  YI GKN   +KI MT PV +Q   S+ P      ++   +P +   D  P 
Subjt:  LESPKYEILKRTANYEVRKYAPFIVVETS-----GDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP----KVSIQIVLPSEKDIDSLPD

Query:  PEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL
           + + ++K      AV +FSG  +++ + E+A  L SSL
Subjt:  PEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL

AT2G37970.1 SOUL heme-binding family protein1.5e-1534.21Show/hide
Query:  LESPKYEILKRTANYEVRKYAPFIVVETSGD----KLAGSSGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ------------TFDSELPK-------
        +E+PKY + K    YE+R+Y P +  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+            T +SE  +       
Subjt:  LESPKYEILKRTANYEVRKYAPFIVVETSGD----KLAGSSGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ------------TFDSELPK-------

Query:  -----------VSIQIVLPS-EKDIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDP
                   V++Q +LPS  K  +  P P  + V +++  G    V+KFSG  +E +V EK K+L S L K+G K +   +LARYN P
Subjt:  -----------VSIQIVLPS-EKDIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDP

AT3G10130.1 SOUL heme-binding family protein2.0e-1731.87Show/hide
Query:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDI
        +   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDI

Query:  D--------------SLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL---VKEGLKPSKGCLLARYNDP
        +              +LP P+  +V +++V   I AV+ FSG  T+E ++ + +ELR +L    K  ++      +A+YN P
Subjt:  D--------------SLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL---VKEGLKPSKGCLLARYNDP

AT5G20140.1 SOUL heme-binding family protein1.1e-14076.3Show/hide
Query:  STVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTG
        STV+++ LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF  LPWKPELVFTG
Subjt:  STVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTG

Query:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y PFIVVET GDKL+GSSGFN VAGYIFGKNS 
Subjt:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS KD+ SLP P ++ V L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL K+GL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDP

Query:  GRTWSFIM
        GRTW+FIM
Subjt:  GRTWSFIM

AT5G20140.2 SOUL heme-binding family protein2.2e-14175.96Show/hide
Query:  STVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTG
        STV+++ LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF  LPWKPELVFTG
Subjt:  STVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTG

Query:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y PFIVVET GDKL+GSSGFN VAGYIFGKNS 
Subjt:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS KD+ SLP P ++ V L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL K+GL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDP

Query:  GRTWSFIMVDVL
        GRTW+FIM  VL
Subjt:  GRTWSFIMVDVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCTTCAATTTTCCCTCCAAAACTTCCTCTCAACCCCGACAGTCCGTTTCGGTTTCCGGCCGCCGAATTCCGGCACACTGGTCGGCCTCGCTCCCCGTCTACT
CAAAAGCAGAACTGTAGCTTTTAAACCCCATACCCGAAATTCCAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTGGACCGAT
TGGTGGATTTCTTGTACGAAGATCTTCTCCATCTCTTCGATGAACAGGGGATTGACCGGACGGCGTACGACGAACAAGTGAGGTTTCGGGACCCCATTACCAAGCACGAT
ACGATTAGCGGGTATTTGTTTAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGAGTTCTTATTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTACAAGATG
GACTATGGTAATGAAGTTTGCCCTTCTACCATGGAAACCAGAATTAGTTTTTACGGGATATTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTCATGTGG
ATCTTTGGGATTCAATACAGAATAACGACTACTTTTCTCTAGAAGGCCTGTTGGATGTATTTAAGCAGCTTCGGTATTATAAGACTCCAGAATTGGAATCACCCAAGTAT
GAGATTTTGAAAAGGACTGCAAATTATGAGGTGAGGAAATATGCACCATTTATAGTGGTAGAAACAAGTGGAGACAAGCTCGCCGGGTCTTCTGGATTCAATACAGTTGC
TGGGTATATATTTGGGAAGAACTCTGCAAAGGAGAAAATACCCATGACCACTCCTGTATTCACTCAGACATTCGACTCTGAATTACCCAAAGTCTCCATCCAAATAGTTC
TTCCTTCAGAGAAAGATATAGACAGTTTACCAGATCCTGAACAAGACACAGTCGGCTTGAGAAAGGTGGAAGGAGGTATTGCAGCAGTGTTGAAATTTAGTGGGAAACCT
ACAGAAGAGATTGTGCAAGAGAAGGCCAAAGAATTGCGGTCTAGTCTCGTAAAAGAGGGTCTCAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTATAATGACCCTGGACG
AACATGGAGCTTTATAATGGTTGATGTTCTGTTCTGTGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTCTTCAATTTTCCCTCCAAAACTTCCTCTCAACCCCGACAGTCCGTTTCGGTTTCCGGCCGCCGAATTCCGGCACACTGGTCGGCCTCGCTCCCCGTCTACT
CAAAAGCAGAACTGTAGCTTTTAAACCCCATACCCGAAATTCCAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTGGACCGAT
TGGTGGATTTCTTGTACGAAGATCTTCTCCATCTCTTCGATGAACAGGGGATTGACCGGACGGCGTACGACGAACAAGTGAGGTTTCGGGACCCCATTACCAAGCACGAT
ACGATTAGCGGGTATTTGTTTAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGAGTTCTTATTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTACAAGATG
GACTATGGTAATGAAGTTTGCCCTTCTACCATGGAAACCAGAATTAGTTTTTACGGGATATTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTCATGTGG
ATCTTTGGGATTCAATACAGAATAACGACTACTTTTCTCTAGAAGGCCTGTTGGATGTATTTAAGCAGCTTCGGTATTATAAGACTCCAGAATTGGAATCACCCAAGTAT
GAGATTTTGAAAAGGACTGCAAATTATGAGGTGAGGAAATATGCACCATTTATAGTGGTAGAAACAAGTGGAGACAAGCTCGCCGGGTCTTCTGGATTCAATACAGTTGC
TGGGTATATATTTGGGAAGAACTCTGCAAAGGAGAAAATACCCATGACCACTCCTGTATTCACTCAGACATTCGACTCTGAATTACCCAAAGTCTCCATCCAAATAGTTC
TTCCTTCAGAGAAAGATATAGACAGTTTACCAGATCCTGAACAAGACACAGTCGGCTTGAGAAAGGTGGAAGGAGGTATTGCAGCAGTGTTGAAATTTAGTGGGAAACCT
ACAGAAGAGATTGTGCAAGAGAAGGCCAAAGAATTGCGGTCTAGTCTCGTAAAAGAGGGTCTCAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTATAATGACCCTGGACG
AACATGGAGCTTTATAATGGTTGATGTTCTGTTCTGTGATTAA
Protein sequenceShow/hide protein sequence
MAALQFSLQNFLSTPTVRFGFRPPNSGTLVGLAPRLLKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHD
TISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKY
EILKRTANYEVRKYAPFIVVETSGDKLAGSSGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKP
TEEIVQEKAKELRSSLVKEGLKPSKGCLLARYNDPGRTWSFIMVDVLFCD