; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G009190 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G009190
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSOUL heme-binding protein
Genome locationchr06:18945513..18959294
RNA-Seq ExpressionLsi06G009190
SyntenyLsi06G009190
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043396.1 SOUL heme-binding family protein isoform 1 [Cucumis melo var. makuwa]9.2e-20587.98Show/hide
Query:  ANHSLSSTSKSHLPNDTSP----AEAEAQMAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDV
        ANHS S TSKSHLPND  P    AEAEAQMAALQLSLQNF STPTL   LRPPKSGRLT+L PRLL+SRT A KP TQNSKWVVR +LVDQSPPKSTVDV
Subjt:  ANHSLSSTSKSHLPNDTSP----AEAEAQMAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDV

Query:  GRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLP
        GRLVDFLYEDL HLFDEQGIDRTAYDEQVRFRDPITK+DTISGYLFNISLLRE+FRPEFFLHWVKQ                PYEITTRWTM+MKFA+LP
Subjt:  GRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLP

Query:  WKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVA
        WKPEL+FTGTSIMGINPETGKFCSHVDLWDSIQNNDYFS EGLWDVFKQLRFYKTP LESPKYLILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVA
Subjt:  WKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVA

Query:  GYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGC
        GYIFGKNSTKEKIPMTTPVFTQTFDSE PKVSIQIVLPS KDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKP NGC
Subjt:  GYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGC

Query:  LLARYNDPGRTWNFIM
        LLARYNDPGRTWNFIM
Subjt:  LLARYNDPGRTWNFIM

KGN65404.1 hypothetical protein Csa_019846 [Cucumis sativus]6.4e-19891.27Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MA LQLSLQNFPSTPTL   LRPPKSGR+T L PRLL SRT AFKP T+NSKWVVR +LVDQ PPKST+DVGRLVDFL+EDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPITK+DTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFA+LPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLRFYKTP LESPKYLILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SE PKVSIQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL
        DSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIM N VL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL

XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]6.2e-20192.59Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL   LRPPKSGRLT+L PRLL+SRT A KP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPITK+DTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKFA+LPWKPEL+FTGTSIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLRFYKTP LESPKYLILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE PKVSIQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL
        DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIM N VL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL

XP_011648491.1 uncharacterized protein LOC101206063 [Cucumis sativus]6.2e-20990.82Show/hide
Query:  ANHSLSSTSKSHLPNDTSPAEAEAQMAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLV
        ANHS S TSKSHLPNDT PA AEAQMA LQLSLQNFPSTPTL   LRPPKSGR+T L PRLL SRT AFKP T+NSKWVVR +LVDQ PPKST+DVGRLV
Subjt:  ANHSLSSTSKSHLPNDTSPAEAEAQMAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLV

Query:  DFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPE
        DFL+EDL HLFDEQGIDRTAYDEQVRFRDPITK+DTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFA+LPWKPELVFTG SIMGINPE
Subjt:  DFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPE

Query:  TGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTP
        TGKFCSHVDLWDSIQNNDYFS EGLWDVFKQLRFYKTP LESPKYLILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTP
Subjt:  TGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTP

Query:  VFTQTFDSELPKVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGN
        VFTQ F+SE PKVSIQIVLPS KDIDSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIM N
Subjt:  VFTQTFDSELPKVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGN

Query:  GVL
         VL
Subjt:  GVL

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]1.9e-20293.12Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MA  QLSLQNFPSTPTLGFGLRPP+SGRLT L PRL K+RT AFKP +QNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPIT +DTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF +LPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLR+YKTPALESPKYLILKRTANYEVRKYA FIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE+PKV IQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL
        DSLPDPEQDIIGLRKVEG IAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIM N VL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein3.1e-19891.27Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MA LQLSLQNFPSTPTL   LRPPKSGR+T L PRLL SRT AFKP T+NSKWVVR +LVDQ PPKST+DVGRLVDFL+EDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPITK+DTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFA+LPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLRFYKTP LESPKYLILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SE PKVSIQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL
        DSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIM N VL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X13.0e-20192.59Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL   LRPPKSGRLT+L PRLL+SRT A KP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPITK+DTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKFA+LPWKPEL+FTGTSIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLRFYKTP LESPKYLILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE PKVSIQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL
        DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIM N VL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL

A0A5A7TMX2 SOUL heme-binding family protein isoform 14.5e-20587.98Show/hide
Query:  ANHSLSSTSKSHLPNDTSP----AEAEAQMAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDV
        ANHS S TSKSHLPND  P    AEAEAQMAALQLSLQNF STPTL   LRPPKSGRLT+L PRLL+SRT A KP TQNSKWVVR +LVDQSPPKSTVDV
Subjt:  ANHSLSSTSKSHLPNDTSP----AEAEAQMAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDV

Query:  GRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLP
        GRLVDFLYEDL HLFDEQGIDRTAYDEQVRFRDPITK+DTISGYLFNISLLRE+FRPEFFLHWVKQ                PYEITTRWTM+MKFA+LP
Subjt:  GRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLP

Query:  WKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVA
        WKPEL+FTGTSIMGINPETGKFCSHVDLWDSIQNNDYFS EGLWDVFKQLRFYKTP LESPKYLILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVA
Subjt:  WKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVA

Query:  GYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGC
        GYIFGKNSTKEKIPMTTPVFTQTFDSE PKVSIQIVLPS KDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKP NGC
Subjt:  GYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGC

Query:  LLARYNDPGRTWNFIM
        LLARYNDPGRTWNFIM
Subjt:  LLARYNDPGRTWNFIM

A0A5D3CLD5 SOUL heme-binding family protein isoform 15.3e-19889.29Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL   LRPPKSGRLT L PRLL+SRT AFKP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLW
        RFRDPITK+DTISGYLFNISLLRE+FRPEFFLHWVKQ                PYEITTRWTM+MKFA+LPWKPEL+FTGTSIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP
        DSIQNNDYFS EGLWDVFKQLRFYKTP LESPKYLILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE P
Subjt:  DSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP

Query:  KVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL
        KVSIQIVLPS KDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIM N VL
Subjt:  KVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X11.2e-18987.89Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLT--DLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDE
        MAALQLSLQNF STPT GFG RP KSG LT   L PRLLKSRT+ FKP  +NSKW VRLSLVDQSPPKS VDV RLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLT--DLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDE

Query:  QVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTE
         VRFRDPITK+DTISGY FNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF +LPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS E
Subjt:  QVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTE

Query:  GLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAK
        GL DVFKQLRFYKTP LESPKY ILKRTANYEVRKY PF+VVETSGDKL+GS GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPS K
Subjt:  GLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAK

Query:  DIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL
        DI+SLPDPEQD IGLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS LIKDGLKPS GCLLARYNDPGRTW+FIM N VL
Subjt:  DIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVL

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic2.0e-1632.42Show/hide
Query:  FYKTPALESPKYLILKRTANYEVRKYAPFIVV------ETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSAKDI
        F   P LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++   AKD 
Subjt:  FYKTPALESPKYLILKRTANYEVRKYAPFIVV------ETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSAKDI

Query:  D--------------SLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD---GLKPSNGCLLARYNDP
        +              +LP P+   + +++V   I AV+ FSG  T+E ++ + +ELR +L  D    ++      +A+YN P
Subjt:  D--------------SLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD---GLKPSNGCLLARYNDP

Arabidopsis top hitse value%identityAlignment
AT2G37970.1 SOUL heme-binding family protein2.6e-1634.55Show/hide
Query:  ALESPKYLILKRTANYEVRKYAPFIVVETSGD----KLAGSTGFNTVAGYI--FGK--NSTKEKIPMTTPVFTQ------------TFDSELPK------
        A+E+PKY + K    YE+R+Y P +  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+            T +SE  +      
Subjt:  ALESPKYLILKRTANYEVRKYAPFIVVETSGD----KLAGSTGFNTVAGYI--FGK--NSTKEKIPMTTPVFTQ------------TFDSELPK------

Query:  ------------VSIQIVLPSA-KDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDP
                    V++Q +LPS  K  +  P P  + + +++  G    V+KFSG  +E +V EK K+L S L KDG K +   +LARYN P
Subjt:  ------------VSIQIVLPSA-KDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDP

AT3G10130.1 SOUL heme-binding family protein1.4e-1732.42Show/hide
Query:  FYKTPALESPKYLILKRTANYEVRKYAPFIVV------ETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSAKDI
        F   P LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++   AKD 
Subjt:  FYKTPALESPKYLILKRTANYEVRKYAPFIVV------ETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSAKDI

Query:  D--------------SLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD---GLKPSNGCLLARYNDP
        +              +LP P+   + +++V   I AV+ FSG  T+E ++ + +ELR +L  D    ++      +A+YN P
Subjt:  D--------------SLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD---GLKPSNGCLLARYNDP

AT5G20140.1 SOUL heme-binding family protein1.1e-13975.08Show/hide
Query:  STVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTG
        STV++  LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITK+DTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF  LPWKPELVFTG
Subjt:  STVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTG

Query:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNST
         SIM +NPET KFCSH+DLWDSI+NNDYFS EGL DVFKQLR YKTP LE+PKY ILKRTANYEVR Y PFIVVET GDKL+GS+GFN VAGYIFGKNST
Subjt:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNST

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS KD+ SLP P ++ + L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL KDGL+   GC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDP

Query:  GRTWNFIMGNGVL
        GRTWNFIM N V+
Subjt:  GRTWNFIMGNGVL

AT5G20140.2 SOUL heme-binding family protein9.4e-13975.65Show/hide
Query:  STVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTG
        STV++  LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITK+DTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF  LPWKPELVFTG
Subjt:  STVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTG

Query:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNST
         SIM +NPET KFCSH+DLWDSI+NNDYFS EGL DVFKQLR YKTP LE+PKY ILKRTANYEVR Y PFIVVET GDKL+GS+GFN VAGYIFGKNST
Subjt:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNST

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS KD+ SLP P ++ + L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL KDGL+   GC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDP

Query:  GRTWNFIM
        GRTWNFIM
Subjt:  GRTWNFIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGAAACATTGGCCAATCACTCCCTGTCCTCTACCTCCAAATCCCACCTCCCCAATGACACTTCGCCGGCTGAAGCTGAGGCTCAAATGGCCGCTCTTCAACTTTC
CCTCCAAAACTTCCCCTCAACCCCAACACTCGGTTTCGGTCTCCGGCCACCGAAATCCGGCAGACTAACCGACCTCGCGCCCCGTCTACTTAAAAGCAGAACTCTAGCTT
TCAAACCCCAAACCCAAAATTCTAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTAGGCCGATTGGTGGATTTCTTGTATGAA
GATCTTTGCCATCTCTTCGATGAACAGGGGATTGATCGAACGGCGTACGACGAACAAGTGCGATTTCGGGACCCCATTACCAAGTACGATACGATTAGTGGGTATTTGTT
TAATATTTCCCTCTTGCGAGAACTTTTCAGGCCTGAATTCTTCTTGCATTGGGTTAAACAGACAGGACCATATGAAATAACTACGAGATGGACTATGGTAATGAAGTTTG
CCGTTCTACCATGGAAACCAGAATTAGTTTTCACGGGAACTTCCATCATGGGTATCAATCCAGAGACCGGCAAGTTCTGTAGTCATGTGGATCTCTGGGATTCAATACAA
AACAACGACTACTTTTCTACAGAAGGCCTTTGGGATGTTTTCAAGCAGCTTCGGTTCTATAAGACTCCAGCATTGGAATCACCCAAGTATCTGATTCTGAAAAGGACTGC
AAATTATGAGGTGAGGAAATATGCACCATTTATAGTGGTGGAAACAAGTGGAGACAAGCTCGCTGGGTCTACAGGATTCAATACAGTTGCTGGGTATATATTTGGGAAGA
ATTCTACAAAGGAGAAGATACCCATGACCACTCCTGTATTCACCCAAACATTTGACTCTGAATTACCTAAAGTCTCCATTCAAATAGTTCTTCCTTCAGCGAAAGATATA
GACAGTTTACCAGATCCTGAACAAGACATAATTGGCTTGAGAAAGGTTGAAGGAGGTATTGCTGCGGTGTTGAAATTCAGTGGGAAACCTACAGAAGAGATTGTGCAAGA
GAAGGCAAAAGAACTGCGGTCTAGTCTCATAAAGGATGGTCTCAAACCCAGTAATGGCTGTTTGCTTGCTCGGTATAACGACCCTGGAAGAACATGGAACTTTATAATGG
GCAATGGTGTATTGAATGACGATGTCGAGAACATTCAATGTGATTTAGATGAAGATGACCCTAGTTATGAGCCCATTCCTCTAGCACCTATTGAGCTCCGTTCTCCAGCA
CCCACCCAGCCCCGTTCTCCATCACCCATCGAGCCTGACGAAACACCCATCGAGCCACTTTTGCCAACACCCACCGAACCTGATGAAACACCCACTGAGCCCCATCCGTC
AACACCCATCAAGCCTCTTCCTCTAGCACCCATTAAGCCCATTATTCCTTCGGCACCCATTGAGCTCATTCTTCCTTCAGCACTCATTGAGCCCTTTCTTCCTCCAACAT
CCACCGAGCACCTTATTCCTCCAACATCCACCAAGCCCCTTATTCCTCCAACACCCATTGAGCCCTTTCTTCCTCCAATACCCACCGAACCCCTACCTCTTCTTTCAGCA
TCCACAAAGCCTTTCGTGATACCACCTCTGCTCCCATCAACCATACAACATCTTGAGCTACTCTCGGTTCCACAATTATTCAAGAGGTTGAAAATTAATGTTAGGGGCTA
G
mRNA sequenceShow/hide mRNA sequence
ATGGTTGAAACATTGGCCAATCACTCCCTGTCCTCTACCTCCAAATCCCACCTCCCCAATGACACTTCGCCGGCTGAAGCTGAGGCTCAAATGGCCGCTCTTCAACTTTC
CCTCCAAAACTTCCCCTCAACCCCAACACTCGGTTTCGGTCTCCGGCCACCGAAATCCGGCAGACTAACCGACCTCGCGCCCCGTCTACTTAAAAGCAGAACTCTAGCTT
TCAAACCCCAAACCCAAAATTCTAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTAGGCCGATTGGTGGATTTCTTGTATGAA
GATCTTTGCCATCTCTTCGATGAACAGGGGATTGATCGAACGGCGTACGACGAACAAGTGCGATTTCGGGACCCCATTACCAAGTACGATACGATTAGTGGGTATTTGTT
TAATATTTCCCTCTTGCGAGAACTTTTCAGGCCTGAATTCTTCTTGCATTGGGTTAAACAGACAGGACCATATGAAATAACTACGAGATGGACTATGGTAATGAAGTTTG
CCGTTCTACCATGGAAACCAGAATTAGTTTTCACGGGAACTTCCATCATGGGTATCAATCCAGAGACCGGCAAGTTCTGTAGTCATGTGGATCTCTGGGATTCAATACAA
AACAACGACTACTTTTCTACAGAAGGCCTTTGGGATGTTTTCAAGCAGCTTCGGTTCTATAAGACTCCAGCATTGGAATCACCCAAGTATCTGATTCTGAAAAGGACTGC
AAATTATGAGGTGAGGAAATATGCACCATTTATAGTGGTGGAAACAAGTGGAGACAAGCTCGCTGGGTCTACAGGATTCAATACAGTTGCTGGGTATATATTTGGGAAGA
ATTCTACAAAGGAGAAGATACCCATGACCACTCCTGTATTCACCCAAACATTTGACTCTGAATTACCTAAAGTCTCCATTCAAATAGTTCTTCCTTCAGCGAAAGATATA
GACAGTTTACCAGATCCTGAACAAGACATAATTGGCTTGAGAAAGGTTGAAGGAGGTATTGCTGCGGTGTTGAAATTCAGTGGGAAACCTACAGAAGAGATTGTGCAAGA
GAAGGCAAAAGAACTGCGGTCTAGTCTCATAAAGGATGGTCTCAAACCCAGTAATGGCTGTTTGCTTGCTCGGTATAACGACCCTGGAAGAACATGGAACTTTATAATGG
GCAATGGTGTATTGAATGACGATGTCGAGAACATTCAATGTGATTTAGATGAAGATGACCCTAGTTATGAGCCCATTCCTCTAGCACCTATTGAGCTCCGTTCTCCAGCA
CCCACCCAGCCCCGTTCTCCATCACCCATCGAGCCTGACGAAACACCCATCGAGCCACTTTTGCCAACACCCACCGAACCTGATGAAACACCCACTGAGCCCCATCCGTC
AACACCCATCAAGCCTCTTCCTCTAGCACCCATTAAGCCCATTATTCCTTCGGCACCCATTGAGCTCATTCTTCCTTCAGCACTCATTGAGCCCTTTCTTCCTCCAACAT
CCACCGAGCACCTTATTCCTCCAACATCCACCAAGCCCCTTATTCCTCCAACACCCATTGAGCCCTTTCTTCCTCCAATACCCACCGAACCCCTACCTCTTCTTTCAGCA
TCCACAAAGCCTTTCGTGATACCACCTCTGCTCCCATCAACCATACAACATCTTGAGCTACTCTCGGTTCCACAATTATTCAAGAGGTTGAAAATTAATGTTAGGGGCTA
GCCTAAGGTCAAAGGAGAGCCTAAGATGAAGGTGAAGGTAGAGCCAGAGGTGAAGAGTGAGCTTAAGGTTGAAATAGAGCAGAAGGTCAAGGTGGAGCCTAATGTTGAGG
AATCTCCCCCCAATGTTAAATGTAAATGGAACCAAGAAGGACGACTAGGAAGAGAAAGGTGATATAACCATACACTCCTACTATAGATTCGAGAAGGTTGAAGTGGTCTA
AGAAGTATACAGATCCAGGTATTTTTCCACCTGAGAAGCACGATCTTG
Protein sequenceShow/hide protein sequence
MVETLANHSLSSTSKSHLPNDTSPAEAEAQMAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYE
DLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQ
NNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMGNGVLNDDVENIQCDLDEDDPSYEPIPLAPIELRSPA
PTQPRSPSPIEPDETPIEPLLPTPTEPDETPTEPHPSTPIKPLPLAPIKPIIPSAPIELILPSALIEPFLPPTSTEHLIPPTSTKPLIPPTPIEPFLPPIPTEPLPLLSA
STKPFVIPPLLPSTIQHLELLSVPQLFKRLKINVRG