; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021876 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021876
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionSOUL heme-binding protein
Genome locationscaffold2:6888094..6892838
RNA-Seq ExpressionSpg021876
SyntenySpg021876
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]1.8e-19888.63Show/hide
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQV
        MAALQ SLQNFLSTPT+    RPP SG L  L PRLL+SRT A KP+T+NSKWVVR +LVDQSPPKSTVDV RLV+FLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPITKHDTISGYLFNISLLRE+FRPEF LHWVKQTGPYEITTRWTM+MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI
         DVFKQLR+YKTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK+I
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI

Query:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE
        DSLPDPEQD +GLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEE+SLE
Subjt:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE

XP_011648491.1 uncharacterized protein LOC101206063 [Cucumis sativus]2.0e-20586.99Show/hide
Query:  EKLANHSPSCTSKSHLPNGNSP--VEAQMAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVD
        +K ANHSPS TSKSHLPN   P   EAQMA LQ SLQNF STPT+    RPP SG +  L PRLL SRT AFKPHT+NSKWVVR +LVDQ PPKST+DV 
Subjt:  EKLANHSPSCTSKSHLPNGNSP--VEAQMAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVD

Query:  RLVNFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGI
        RLV+FL+EDL HLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMKF LLPWKPELVFTG SIMGI
Subjt:  RLVNFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGI

Query:  NPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPM
        NPETGKFCSHVDLWDSIQNNDYFS+EGL DVFKQLR+YKTPELESPKY ILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNS KEKIPM
Subjt:  NPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPM

Query:  TTPVFTQTFDSELPKVSIQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFI
        TTPVFTQ F+SE PKVSIQIVLPSEK+IDSLPDPEQD VGLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FI
Subjt:  TTPVFTQTFDSELPKVSIQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFI

Query:  MRNEVLIWLEEFSLE
        MRNEVLIWLEEFSLE
Subjt:  MRNEVLIWLEEFSLE

XP_022144956.1 uncharacterized protein LOC111014503 isoform X1 [Momordica charantia]2.0e-20089.92Show/hide
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSG--TLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDE
        MAALQ SLQNFLSTPT  F FRP  SG  T+ GL PRLLKSRT  FKP  RNSKW VRLSLVDQSPPKS VDVDRLV+FLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSG--TLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGY FNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMKFVLLPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GS GFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPS+K
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  NIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        +I+SLPDPEQDT+GLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS L+KDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
Subjt:  NIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

XP_022930662.1 uncharacterized protein LOC111437064 isoform X1 [Cucurbita moschata]1.3e-19688.69Show/hide
Query:  MAALQFSLQNFL--STPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDE
        MAALQFSLQN L  STP++ F FRPPNSG LI       +SRT   KPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LV+FLYEDL HLFDEQGIDRTAYD+
Subjt:  MAALQFSLQNFL--STPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        QVRFRDPITKHDTI+GYLFNISLLRELFRPEFLLHWVK+TG YEITTRWTMVMKFVLLPWKP+LVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRT NYEVRKYAPFIVVETSGDKLAGS GFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  NIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE
        ++ SLPDPEQDT+GLRKVEGG AAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFSLE
Subjt:  NIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]1.7e-19989.66Show/hide
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQV
        MA  Q SLQNF STPT+ F  RPP SG L  L PRL K+RT AFKPH++NSKWVVRLSLVDQSPPKSTVDV RLV+FLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPIT HDTISGYLFNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI
         DVFKQLRYYKTP LESPKY ILKRTANYEVRKYA FIVVETSGDKLAGS GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE+PKV IQIVLPSEK+I
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI

Query:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE
        DSLPDPEQD +GLRKVEG IAAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKPS GCLLARYNDPGRTW+FIMRNEVLIWLEEFSLE
Subjt:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein2.4e-19688.37Show/hide
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQV
        MA LQ SLQNF STPT+    RPP SG +  L PRLL SRT AFKPHT+NSKWVVR +LVDQ PPKST+DV RLV+FL+EDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPITKHDTISGYLFNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMKF LLPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI
         DVFKQLR+YKTPELESPKY ILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNS KEKIPMTTPVFTQ F+SE PKVSIQIVLPSEK+I
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI

Query:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE
        DSLPDPEQD VGLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFSLE
Subjt:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X18.9e-19988.63Show/hide
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQV
        MAALQ SLQNFLSTPT+    RPP SG L  L PRLL+SRT A KP+T+NSKWVVR +LVDQSPPKSTVDV RLV+FLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL
        RFRDPITKHDTISGYLFNISLLRE+FRPEF LHWVKQTGPYEITTRWTM+MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGL

Query:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI
         DVFKQLR+YKTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK+I
Subjt:  LDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNI

Query:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE
        DSLPDPEQD +GLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEE+SLE
Subjt:  DSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE

A0A5A7TMX2 SOUL heme-binding family protein isoform 14.2e-19683.05Show/hide
Query:  EKLANHSPSCTSKSHLPNGNSP------VEAQMAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKST
        +K ANHSPS TSKSHLPN N P       EAQMAALQ SLQNFLSTPT+    RPP SG L  L PRLL+SRT A KP+T+NSKWVVR +LVDQSPPKST
Subjt:  EKLANHSPSCTSKSHLPNGNSP------VEAQMAALQFSLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKST

Query:  VDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQT--------------GPYEITTRWTMVMKFV
        VDV RLV+FLYEDL HLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRE+FRPEF LHWVKQ                PYEITTRWTM+MKF 
Subjt:  VDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQT--------------GPYEITTRWTMVMKFV

Query:  LLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFN
        LLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+EGL DVFKQLR+YKTPELESPKY ILKRT  YEVRKYAPFIVVETSGDKLAGS GFN
Subjt:  LLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFN

Query:  TVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPS
        TVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK+IDSLPDPEQD +GLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP 
Subjt:  TVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPS

Query:  KGCLLARYNDPGRTWSFIM
         GCLLARYNDPGRTW+FIM
Subjt:  KGCLLARYNDPGRTWSFIM

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X19.5e-20189.92Show/hide
Query:  MAALQFSLQNFLSTPTVRFDFRPPNSG--TLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDE
        MAALQ SLQNFLSTPT  F FRP  SG  T+ GL PRLLKSRT  FKP  RNSKW VRLSLVDQSPPKS VDVDRLV+FLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQFSLQNFLSTPTVRFDFRPPNSG--TLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGY FNISLLRELFRPEF LHWVKQTGPYEITTRWTMVMKFVLLPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GS GFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPS+K
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  NIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        +I+SLPDPEQDT+GLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS L+KDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
Subjt:  NIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X16.4e-19788.69Show/hide
Query:  MAALQFSLQNFL--STPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDE
        MAALQFSLQN L  STP++ F FRPPNSG LI       +SRT   KPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LV+FLYEDL HLFDEQGIDRTAYD+
Subjt:  MAALQFSLQNFL--STPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        QVRFRDPITKHDTI+GYLFNISLLRELFRPEFLLHWVK+TG YEITTRWTMVMKFVLLPWKP+LVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLR+YKTPELESPKYEILKRT NYEVRKYAPFIVVETSGDKLAGS GFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  NIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE
        ++ SLPDPEQDT+GLRKVEGG AAVLKFSGKPTEEIVQEKAKELRSSL+KDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFSLE
Subjt:  NIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic7.9e-1933.5Show/hide
Query:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVS-------------
        +   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S   K+              
Subjt:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVS-------------

Query:  ----IQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE
            +  V+PS K   +LP P+  +V +++V   I AV+ FSG  T+E ++ + +ELR +L  D    ++      +A+YN P  T  F+ RNEV + +E
Subjt:  ----IQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE

Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein3.2e-0729.08Show/hide
Query:  LESPKYEILKRTANYEVRKYAPFIVVETS-----GDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP----KVSIQIVLPSEKNIDSLPD
        +E P YE++     YE+R+Y   + V T          A    F  +  YI GKN   +KI MT PV +Q   S+ P      ++   +P +   D  P 
Subjt:  LESPKYEILKRTANYEVRKYAPFIVVETS-----GDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELP----KVSIQIVLPSEKNIDSLPD

Query:  PEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL
           + + ++K      AV +FSG  +++ + E+A  L SSL
Subjt:  PEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL

AT2G37970.1 SOUL heme-binding family protein7.6e-1734.45Show/hide
Query:  LESPKYEILKRTANYEVRKYAPFIVVETSGD----KLAGSVGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ------------TFDSELPK-------
        +E+PKY + K    YE+R+Y P +  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+            T +SE  +       
Subjt:  LESPKYEILKRTANYEVRKYAPFIVVETSGD----KLAGSVGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ------------TFDSELPK-------

Query:  -----------VSIQIVLPS-EKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSF--IM
                   V++Q +LPS  K  +  P P  + V +++  G    V+KFSG  +E +V EK K+L S L KDG K +   +LARYN P   W+     
Subjt:  -----------VSIQIVLPS-EKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSF--IM

Query:  RNEVLIWLE
         NEV+I +E
Subjt:  RNEVLIWLE

AT3G10130.1 SOUL heme-binding family protein5.6e-2033.5Show/hide
Query:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVS-------------
        +   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S   K+              
Subjt:  YYKTPELESPKYEILKRTANYEVRKYAPFIVV------ETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVS-------------

Query:  ----IQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE
            +  V+PS K   +LP P+  +V +++V   I AV+ FSG  T+E ++ + +ELR +L  D    ++      +A+YN P  T  F+ RNEV + +E
Subjt:  ----IQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE

AT5G20140.1 SOUL heme-binding family protein5.4e-14876.09Show/hide
Query:  STVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTG
        STV+++ LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF+ LPWKPELVFTG
Subjt:  STVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTG

Query:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y PFIVVET GDKL+GS GFN VAGYIFGKNS 
Subjt:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS K++ SLP P ++ V L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL KDGL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDP

Query:  GRTWSFIMRNEVLIWLEEFSLE
        GRTW+FIMRNEV+IWLE+FSL+
Subjt:  GRTWSFIMRNEVLIWLEEFSLE

AT5G20140.2 SOUL heme-binding family protein1.9e-14075.97Show/hide
Query:  STVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTG
        STV+++ LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF+ LPWKPELVFTG
Subjt:  STVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTG

Query:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y PFIVVET GDKL+GS GFN VAGYIFGKNS 
Subjt:  YSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRTANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS K++ SLP P ++ V L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL KDGL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLVKDGLKPSKGCLLARYNDP

Query:  GRTWSFIM
        GRTW+FIM
Subjt:  GRTWSFIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAAGCCATACCTGTCTCCAAAATCAATTCCCCCTCGAAGTTGAATCAGATGCCTTAGCTGTCGTCAAAGTCTTGATCAGAGATGAGGAGGATTTAGCGAAGCTGAA
GCTATTCGTCAAAGCTATCAATTACATTGCCTCTCGTTTGGAACTAGTCAAATTTCTACACTGTAATCGCCTCTCCAATTCAGTAGCCCACCGGTTGGCAGGGCAAACTT
GTTCTTTTGATTTTGAAAAACTAGCCAATCACTCGCCGTCCTGTACCTCCAAATCCCACCTTCCCAATGGCAATTCGCCGGTTGAAGCTCAAATGGCCGCTCTTCAATTT
TCCCTCCAAAACTTCCTCTCAACCCCGACAGTCCGTTTCGATTTCCGGCCGCCGAATTCCGGCACACTGATCGGCCTCGCTCCCCGTCTACTCAAAAGCAGAACTGAAGC
TTTTAAACCCCATACCCGAAATTCCAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTGGACCGATTGGTGAATTTCTTGTACG
AAGATCTTCTCCATCTCTTCGATGAACAGGGGATTGACCGCACGGCGTACGACGAACAAGTGAGGTTTCGGGACCCCATTACCAAGCACGATACGATTAGCGGGTATTTG
TTTAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGAGTTCTTATTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTACAAGATGGACTATGGTAATGAAGTT
TGTCCTTCTACCATGGAAACCAGAATTAGTTTTTACGGGATATTCCATCATGGGTATCAATCCAGAGACGGGCAAATTCTGTAGTCATGTGGATCTTTGGGATTCAATAC
AGAATAACGACTACTTTTCTCTAGAAGGCCTTTTGGATGTATTTAAGCAGCTTCGGTATTATAAGACTCCAGAATTGGAATCACCCAAGTATGAGATTCTGAAAAGGACT
GCAAATTATGAGGTGAGGAAATATGCACCATTTATAGTGGTAGAAACAAGTGGAGACAAGCTCGCCGGGTCTGTTGGATTCAATACAGTTGCTGGGTATATATTTGGGAA
GAACTCTGCAAAGGAGAAAATACCCATGACCACTCCTGTATTCACTCAGACATTCGACTCTGAATTACCCAAAGTCTCCATCCAAATAGTTCTTCCTTCAGAGAAAAATA
TAGACAGTTTACCAGATCCTGAACAAGACACAGTCGGCTTGAGAAAGGTGGAAGGAGGTATTGCAGCAGTGTTGAAATTTAGTGGGAAACCTACAGAAGAGATTGTGCAA
GAGAAGGCCAAAGAATTGCGGTCTAGTCTCGTAAAAGATGGTCTCAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTATAATGACCCTGGACGAACATGGAGCTTTATAAT
GAGAAATGAGGTGCTAATATGGCTTGAAGAATTCTCATTGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATAAGCCATACCTGTCTCCAAAATCAATTCCCCCTCGAAGTTGAATCAGATGCCTTAGCTGTCGTCAAAGTCTTGATCAGAGATGAGGAGGATTTAGCGAAGCTGAA
GCTATTCGTCAAAGCTATCAATTACATTGCCTCTCGTTTGGAACTAGTCAAATTTCTACACTGTAATCGCCTCTCCAATTCAGTAGCCCACCGGTTGGCAGGGCAAACTT
GTTCTTTTGATTTTGAAAAACTAGCCAATCACTCGCCGTCCTGTACCTCCAAATCCCACCTTCCCAATGGCAATTCGCCGGTTGAAGCTCAAATGGCCGCTCTTCAATTT
TCCCTCCAAAACTTCCTCTCAACCCCGACAGTCCGTTTCGATTTCCGGCCGCCGAATTCCGGCACACTGATCGGCCTCGCTCCCCGTCTACTCAAAAGCAGAACTGAAGC
TTTTAAACCCCATACCCGAAATTCCAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTGGACCGATTGGTGAATTTCTTGTACG
AAGATCTTCTCCATCTCTTCGATGAACAGGGGATTGACCGCACGGCGTACGACGAACAAGTGAGGTTTCGGGACCCCATTACCAAGCACGATACGATTAGCGGGTATTTG
TTTAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGAGTTCTTATTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTACAAGATGGACTATGGTAATGAAGTT
TGTCCTTCTACCATGGAAACCAGAATTAGTTTTTACGGGATATTCCATCATGGGTATCAATCCAGAGACGGGCAAATTCTGTAGTCATGTGGATCTTTGGGATTCAATAC
AGAATAACGACTACTTTTCTCTAGAAGGCCTTTTGGATGTATTTAAGCAGCTTCGGTATTATAAGACTCCAGAATTGGAATCACCCAAGTATGAGATTCTGAAAAGGACT
GCAAATTATGAGGTGAGGAAATATGCACCATTTATAGTGGTAGAAACAAGTGGAGACAAGCTCGCCGGGTCTGTTGGATTCAATACAGTTGCTGGGTATATATTTGGGAA
GAACTCTGCAAAGGAGAAAATACCCATGACCACTCCTGTATTCACTCAGACATTCGACTCTGAATTACCCAAAGTCTCCATCCAAATAGTTCTTCCTTCAGAGAAAAATA
TAGACAGTTTACCAGATCCTGAACAAGACACAGTCGGCTTGAGAAAGGTGGAAGGAGGTATTGCAGCAGTGTTGAAATTTAGTGGGAAACCTACAGAAGAGATTGTGCAA
GAGAAGGCCAAAGAATTGCGGTCTAGTCTCGTAAAAGATGGTCTCAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTATAATGACCCTGGACGAACATGGAGCTTTATAAT
GAGAAATGAGGTGCTAATATGGCTTGAAGAATTCTCATTGGAGTAG
Protein sequenceShow/hide protein sequence
MISHTCLQNQFPLEVESDALAVVKVLIRDEEDLAKLKLFVKAINYIASRLELVKFLHCNRLSNSVAHRLAGQTCSFDFEKLANHSPSCTSKSHLPNGNSPVEAQMAALQF
SLQNFLSTPTVRFDFRPPNSGTLIGLAPRLLKSRTEAFKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVNFLYEDLLHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYL
FNISLLRELFRPEFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGYSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRYYKTPELESPKYEILKRT
ANYEVRKYAPFIVVETSGDKLAGSVGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKNIDSLPDPEQDTVGLRKVEGGIAAVLKFSGKPTEEIVQ
EKAKELRSSLVKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE