CuGenDBv2

Spg022260 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg022260
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: SOUL heme-binding protein
Genome location: scaffold2:6874869..6884294
RNA-Seq Expression: Spg022260
Synteny: Spg022260
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR006917 - SOUL haem-binding protein
                  IPR011256 - Regulatory factor, effector binding domain superfamily
                  IPR018790 - Protein of unknown function DUF2358
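
The genome location field above uses a `scaffold:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the following assumes the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` is a hypothetical helper name, not part of any database tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold2:6874869..6884294")
# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the gene spans 9426 bp on scaffold2. This is longer than the coding sequence listed further down, which would be consistent with introns, although the record shown here does not list the exon structure.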


Homology
GenBank top hits (e value, % identity, alignment)
XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo] (e value: 1.4e-187; identity: 87.4%)
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV
        MAALQ SLQNFLSTPT+    RPP SG L  L PRL++SRT A KP+T+NSKWVVR +LVDQSPPKSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMK-SVLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        R+RDPITKHDTISGYLFNISLLRE+FRPEF LHWVKQTGPYEITTRWTM+MK ++LPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMK-SVLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI
         DVFKQLR+YKTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK+I
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI

Query:  DSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQ+ +GLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM

XP_022144956.1 uncharacterized protein LOC111014503 isoform X1 [Momordica charantia] (e value: 2.3e-190; identity: 88.8%)
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSG--TLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE
        MAALQ SLQNFLSTPT  F FRP  SG  T+ GL PRL+KSRTV FKP  RNSKW VRLSLVDQSPPKS VDVDRLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSG--TLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VR+RDPITKHDTISGY FNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMK V LPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  QVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GS GFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPS+K
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  NIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM
        +I+SLPDPEQ+T+GLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS L+KDGLKPSKGCLLARYNDPGRTWSFIM
Subjt:  NIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM

XP_022930662.1 uncharacterized protein LOC111437064 isoform X1 [Cucurbita moschata] (e value: 1.0e-185; identity: 87.73%)
Query:  MAALQFSLQNFL--STPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE
        MAALQFSLQN L  STP++ F FRPPNSG LI       +SRTV  KPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLYEDL HLFDEQGIDRTAYD+
Subjt:  MAALQFSLQNFL--STPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        QVR+RDPITKHDTI+GYLFNISLLRELFRPEFLLHWVK+TG YEITTRWTMVMK V LPWKP+LVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  QVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRT NYEVRKYAPFIVVETSGDKLAGS GFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  NIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM
        ++ SLPDPEQ+T+GLRKVEGG AAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  NIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM

XP_023530707.1 uncharacterized protein LOC111793169 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.7e-185; identity: 87.47%)
Query:  MAALQFSLQN--FLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE
        MAAL+FSLQN   LSTP++ F FRPPNSG LI       +SRTV  KPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLYEDL HLFD+QGIDRTAYD+
Subjt:  MAALQFSLQN--FLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        +VR+RDPITKHDTI+GYLFNISLLRELFRPEFLLHWVK+TG YEITTRWTMVMK V LPWKP+LVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  QVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  NIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM
        ++ SLPDPEQ+T+GLRKVEGG AAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  NIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida] (e value: 8.2e-188; identity: 88.47%)
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV
        MA  Q SLQNF STPT+ F  RPP SG L  L PRL K+RT AFKPH++NSKWVVRLSLVDQSPPKSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        R+RDPIT HDTISGYLFNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMK V LPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI
         DVFKQLRYYKTP LESPKY ILKRTANYEVRKYA FIVVETSGDKLAGS GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE+PKV IQIVLPSEK+I
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI

Query:  DSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQ+ +GLRKVEG IAAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKPS GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LWP3 Uncharacterized protein (e value: 7.0e-185; identity: 86.86%)
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV
        MA LQ SLQNF STPT+    RPP SG +  L PRL+ SRT AFKPHT+NSKWVVR +LVDQ PPKST+DV RLVDFL+EDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMK-SVLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        R+RDPITKHDTISGYLFNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMK ++LPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMK-SVLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI
         DVFKQLR+YKTPELESPKY ILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNS KEKIPMTTPVFTQ F+SE PKVSIQIVLPSEK+I
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI

Query:  DSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQ+ VGLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X1 (e value: 6.7e-188; identity: 87.4%)
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV
        MAALQ SLQNFLSTPT+    RPP SG L  L PRL++SRT A KP+T+NSKWVVR +LVDQSPPKSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMK-SVLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        R+RDPITKHDTISGYLFNISLLRE+FRPEF LHWVKQTGPYEITTRWTM+MK ++LPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMK-SVLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI
         DVFKQLR+YKTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK+I
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI

Query:  DSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM
        DSLPDPEQ+ +GLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM

A0A5A7TMX2 SOUL heme-binding family protein isoform 1 (e value: 7.0e-185; identity: 82.95%)
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV
        MAALQ SLQNFLSTPT+    RPP SG L  L PRL++SRT A KP+T+NSKWVVR +LVDQSPPKSTVDV RLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQT--------------GPYEITTRWTMVMK-SVLPWKPELVFTGYSIMGINPETGKFCSHVDLW
        R+RDPITKHDTISGYLFNISLLRE+FRPEF LHWVKQ                PYEITTRWTM+MK ++LPWKPEL+FTG SIMGINPETGKFCSHVDLW
Subjt:  RYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQT--------------GPYEITTRWTMVMK-SVLPWKPELVFTGYSIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP
        DSIQNNDYFS+EGL DVFKQLR+YKTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE P
Subjt:  DSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP

Query:  KVSIQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMVPTAAH
        KVSIQIVLPSEK+IDSLPDPEQ+ +GLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIMV +  H
Subjt:  KVSIQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMVPTAAH

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X1 (e value: 1.1e-190; identity: 88.8%)
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSG--TLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE
        MAALQ SLQNFLSTPT  F FRP  SG  T+ GL PRL+KSRTV FKP  RNSKW VRLSLVDQSPPKS VDVDRLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSG--TLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VR+RDPITKHDTISGY FNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMK V LPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  QVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GS GFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPS+K
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  NIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM
        +I+SLPDPEQ+T+GLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS L+KDGLKPSKGCLLARYNDPGRTWSFIM
Subjt:  NIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X1 (e value: 4.8e-186; identity: 87.73%)
Query:  MAALQFSLQNFL--STPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE
        MAALQFSLQN L  STP++ F FRPPNSG LI       +SRTV  KPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLYEDL HLFDEQGIDRTAYD+
Subjt:  MAALQFSLQNFL--STPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        QVR+RDPITKHDTI+GYLFNISLLRELFRPEFLLHWVK+TG YEITTRWTMVMK V LPWKP+LVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  QVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRT NYEVRKYAPFIVVETSGDKLAGS GFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  NIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM
        ++ SLPDPEQ+T+GLRKVEGG AAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  NIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIM

SwissProt top hits (e value, % identity, alignment)
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic (e value: 1.2e-16; identity: 32.79%)
Query:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVS-------------
        +   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S   K+              
Subjt:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVS-------------

Query:  ----IQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKD---GLKPSKGCLLARYNDP
            +  V+PS K   +LP P+  +V +++V   I AV+ FSG  T+E ++ + +ELR +L  D    ++      +A+YN P
Subjt:  ----IQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKD---GLKPSKGCLLARYNDP

Arabidopsis top hits (e value, % identity, alignment)
AT1G17100.1 SOUL heme-binding family protein (e value: 1.8e-07; identity: 29.79%)
Query:  LESPKYEILKRTANYEVRKYAPFIVVETS-----GDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP----KVSIQIVLPSEKNIDSLPD
        +E P YE++     YE+R+Y   + V T          A    F  +  YI GKN   +KI MT PV +Q   S+ P      ++   +P +   D  P 
Subjt:  LESPKYEILKRTANYEVRKYAPFIVVETS-----GDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP----KVSIQIVLPSEKNIDSLPD

Query:  PEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL
           E + ++K      AV +FSG  +++ + E+A  L SSL
Subjt:  PEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL

AT2G37970.1 SOUL heme-binding family protein (e value: 3.6e-16; identity: 35.26%)
Query:  LESPKYEILKRTANYEVRKYAPFIVVETSGD----KLAGSVGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ------------TFDSELPK-------
        +E+PKY + K    YE+R+Y P +  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+            T +SE  +       
Subjt:  LESPKYEILKRTANYEVRKYAPFIVVETSGD----KLAGSVGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ------------TFDSELPK-------

Query:  -----------VSIQIVLPS-EKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDP
                   V++Q +LPS  K  +  P P  E V +++  G    V+KFSG  +E +V EK K+L S L KDG K +   +LARYN P
Subjt:  -----------VSIQIVLPS-EKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDP

AT3G10130.1 SOUL heme-binding family protein (e value: 8.6e-18; identity: 32.79%)
Query:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVS-------------
        +   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S   K+              
Subjt:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVS-------------

Query:  ----IQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKD---GLKPSKGCLLARYNDP
            +  V+PS K   +LP P+  +V +++V   I AV+ FSG  T+E ++ + +ELR +L  D    ++      +A+YN P
Subjt:  ----IQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKD---GLKPSKGCLLARYNDP

AT5G20140.1 SOUL heme-binding family protein (e value: 2.2e-138; identity: 75.65%)
Query:  STVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTG
        STV+++ LV FLYEDL HLFD+QGID+TAYDE+V++RDPITKHDTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMK + LPWKPELVFTG
Subjt:  STVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTG

Query:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y PFIVVET GDKL+GS GFN VAGYIFGKNS 
Subjt:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS K++ SLP P +E V L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL KDGL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDP

Query:  GRTWSFIM
        GRTW+FIM
Subjt:  GRTWSFIM

AT5G20140.2 SOUL heme-binding family protein (e value: 2.2e-138; identity: 75.65%)
Query:  STVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTG
        STV+++ LV FLYEDL HLFD+QGID+TAYDE+V++RDPITKHDTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMK + LPWKPELVFTG
Subjt:  STVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQVRYRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSV-LPWKPELVFTG

Query:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y PFIVVET GDKL+GS GFN VAGYIFGKNS 
Subjt:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS K++ SLP P +E V L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL KDGL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDP

Query:  GRTWSFIM
        GRTW+FIM
Subjt:  GRTWSFIM


Sequences
CDS sequence
ATGGCCGCTCTTCAATTTTCCCTCCAAAACTTCCTCTCAACCCCGACAGTCCGTTTCGATTTCCGGCCGCCGAATTCCGGCACACTGATCGGCCTCGCTCCCCGTCTAAT
CAAAAGCAGAACTGTAGCTTTTAAACCCCATACCCGAAATTCCAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTGGATCGAT
TGGTGGATTTCTTGTACGAAGATCTTCTCCATCTCTTCGATGAACAGGGGATTGACCGCACGGCGTACGACGAACAAGTGAGGTATCGGGACCCCATTACCAAGCACGAT
ACCATTAGCGGGTATTTGTTTAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGAGTTCTTATTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTACAAGATG
GACTATGGTAATGAAGTCTGTCCTACCATGGAAACCAGAATTAGTTTTTACGGGATATTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTCATGTGGATC
TTTGGGATTCAATACAGAATAACGACTACTTTTCTCTAGAAGGTCTGTTGGATGTATTTAAGCAGCTTCGGTATTATAAGACTCCAGAATTGGAATCACCCAAGTATGAG
ATTCTGAAAAGGACTGCAAATTATGAGGTGAGGAAATATGCACCCTTTATAGTGGTAGAAACAAGTGGAGACAAGCTCGCCGGGTCTGTTGGATTCAATACAGTTGCTGG
GTATATATTTGGGAAGAACTCTGCAAAGGAGAAAATACCCATGACCACTCCTGTATTCACTCAGACATTCGACTCTGAATTACCCAAAGTCTCCATCCAAATAGTTCTTC
CATCAGAGAAAAATATAGACAGTTTACCAGATCCTGAACAAGAAACAGTCGGCTTGAGAAAGGTGGAAGGAGGTATTGCAGCAGTGTTGAAATTTAGTGGGAAACCTACA
GAAGAGATTGTGCAAGAGAAGGCCAAAGAATTGCGGTCTAGTCTCGTAAAAGATGGTCTCAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTATAATGACCCTGGACGAAC
ATGGAGCTTTATAATGGTTCCAACCGCAGCTCACGAAGACCCGCTCGCCGTCGCTCGCGAAGACCCGCCAACTCGCCGTCGCTTGAGAAGACTTGTTCGTCGTCGCTCGC
GAAGACCTGCCAGCTCGTCGTCACTCGCGAAGACCCGCCCGCTCGCCGTCGCTCACAAAGATCATGCTCAGATCTGCTCGCCGTCACTCGCGAAGACCCGCCAGCTCGCC
GTCGCTCGCGAAGATCTTGCTCAGATCTCCTCGGAAGAGTGGCTCGTCCTTATGGAGAAAATAGGCATCCCTACAATTTATTAG
mRNA sequence
ATGGCCGCTCTTCAATTTTCCCTCCAAAACTTCCTCTCAACCCCGACAGTCCGTTTCGATTTCCGGCCGCCGAATTCCGGCACACTGATCGGCCTCGCTCCCCGTCTAAT
CAAAAGCAGAACTGTAGCTTTTAAACCCCATACCCGAAATTCCAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTGGATCGAT
TGGTGGATTTCTTGTACGAAGATCTTCTCCATCTCTTCGATGAACAGGGGATTGACCGCACGGCGTACGACGAACAAGTGAGGTATCGGGACCCCATTACCAAGCACGAT
ACCATTAGCGGGTATTTGTTTAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGAGTTCTTATTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTACAAGATG
GACTATGGTAATGAAGTCTGTCCTACCATGGAAACCAGAATTAGTTTTTACGGGATATTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTCATGTGGATC
TTTGGGATTCAATACAGAATAACGACTACTTTTCTCTAGAAGGTCTGTTGGATGTATTTAAGCAGCTTCGGTATTATAAGACTCCAGAATTGGAATCACCCAAGTATGAG
ATTCTGAAAAGGACTGCAAATTATGAGGTGAGGAAATATGCACCCTTTATAGTGGTAGAAACAAGTGGAGACAAGCTCGCCGGGTCTGTTGGATTCAATACAGTTGCTGG
GTATATATTTGGGAAGAACTCTGCAAAGGAGAAAATACCCATGACCACTCCTGTATTCACTCAGACATTCGACTCTGAATTACCCAAAGTCTCCATCCAAATAGTTCTTC
CATCAGAGAAAAATATAGACAGTTTACCAGATCCTGAACAAGAAACAGTCGGCTTGAGAAAGGTGGAAGGAGGTATTGCAGCAGTGTTGAAATTTAGTGGGAAACCTACA
GAAGAGATTGTGCAAGAGAAGGCCAAAGAATTGCGGTCTAGTCTCGTAAAAGATGGTCTCAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTATAATGACCCTGGACGAAC
ATGGAGCTTTATAATGGTTCCAACCGCAGCTCACGAAGACCCGCTCGCCGTCGCTCGCGAAGACCCGCCAACTCGCCGTCGCTTGAGAAGACTTGTTCGTCGTCGCTCGC
GAAGACCTGCCAGCTCGTCGTCACTCGCGAAGACCCGCCCGCTCGCCGTCGCTCACAAAGATCATGCTCAGATCTGCTCGCCGTCACTCGCGAAGACCCGCCAGCTCGCC
GTCGCTCGCGAAGATCTTGCTCAGATCTCCTCGGAAGAGTGGCTCGTCCTTATGGAGAAAATAGGCATCCCTACAATTTATTAG
Protein sequence
MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLIKSRTVAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLLHLFDEQGIDRTAYDEQVRYRDPITKHD
TISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKSVLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYE
ILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNIDSLPDPEQETVGLRKVEGGIAAVLKFSGKPT
EEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMVPTAAHEDPLAVAREDPPTRRRLRRLVRRRSRRPASSSSLAKTRPLAVAHKDHAQICSPSLAKTRQLA
VAREDLAQISSEEWLVLMEKIGIPTIY
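
The CDS and protein sequences above are related by translation. As a rough sketch of how that relationship could be checked, the snippet below builds the standard genetic code (an assumption; the record does not name a translation table) and translates two short fragments copied from the CDS listing: its first 21 nucleotides and its last 9. The names `translate`, `cds_start`, and `cds_end` are illustrative only.

```python
# Standard genetic code in the conventional T, C, A, G ordering (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGCCGCTCTTCAATTTTCC"  # first 21 nt of the CDS listed above
cds_end = "ATTTATTAG"                # last 9 nt of the CDS listed above

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MAALQFS` and `IY`, matching the beginning and the end of the protein sequence shown in the record. Running the same function over the full CDS would be the natural next check.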